Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
SINGULAR POINTS AND LIMIT CYCLES OF PLANAR POLYNOMIAL VECTOR FIELDS ILIA ITENBERG and EUGENI˘I SHUSTIN
Contents 0. 1. 2. 3. 4. 5. 6. 7.
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Gluing theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Planar vector fields with many limit cycles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Singular points of planar vector fields: Formulation of results . . . . . . . . . . . . . . Topological classification of collections of singular points . . . . . . . . . . . . . . . . . Elementary vector fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Thin topological classification: Basic construction . . . . . . . . . . . . . . . . . . . . . . . . Thin topological classification: Other cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1 4 7 10 11 18 21 30
0. Introduction. The subject of the article is real planar vector fields x˙ = P (x, y), y˙ = Q(x, y), where P , Q are polynomials. We consider two questions related to the second part of the Hilbert 16th problem [Hi]. (1) What can be the number and arrangement of limit cycles of a vector field of degree d in R2 ? (2) Given a classification of singular points of planar vector fields, how many singular points of each type can a vector field of degree d in R2 have? Our approach to these problems comes from the Viro method [V1] to [V4] (see also [IV] and [R]) invented in the framework of the first part of the 16th problem, topology of real algebraic varieties. This method, actually, consists in reducing a problem on polynomials with an arbitrary Newton polyhedron to that on polynomials with smaller Newton polyhedra. Various applications and developments related to the topology of real algebraic varieties and their singularities can be found in [GKZ], [I1], [I2], [IV], [S1] to [S3], and [St]. Note also that Newton polyhedra and diagrams have been used since the last century for the local study of singular points of differential systems. (For the modern account, see, e.g., [Br].) In this paper we prove Viro-type “gluing” theorems for planar polynomial vector fields (see Theorems 1.3.1, 1.4.1, and Corollary 1.4.2). Using gluing theorems we construct vector fields with many limit cycles and vector fields with given numbers of singular points of prescribed types. The exact statements (see Theorem 2.1 on limit Received 11 June 1997. Revision received 19 January 1999. 1991 Mathematics Subject Classification. Primary 34C05. Itenberg partially supported by European Community contract CHRX-CT94-0506. Shustin partially supported by grant number 6836-1-9 of the Ministry of Science and Arts, Israel. 1
2
ITENBERG AND SHUSTIN
cycles and Theorems 3.3 and 3.4 on singular points) are presented below. It is known (see [E] and [Il2]) that a polynomial vector field has only finitely many limit cycles, but no general upper bound (depending only on the degree) is found. On the other hand, one can look for examples of fields with a large number of limit cycles. Among the known examples of vector fields with many limit cycles, one can mention quadratic fields with four limit cycles (see [An], [CW], and [Sh]), cubic fields with 11 limit cycles (see [Z]), vector fields of degree d close to Hamiltonian ones and having (d 2 + 5d − 14)/2 limit cycles (see [O]) or d(d + 1)/2 − 1 limit cycles (see [Il1] and [P]), and the vector fields of degrees d = 2k − 1, k ≥ 2, with (1/2)d 2 log2 d + O(d 2 ) limit cycles (see [CL]). The construction of C. J. Christopher and N. G. Lloyd [CL] provides an asymptotic lower bound (1/8)d 2 log2 d for the maximal number of limit cycles of a planar vector field of degree d. We improve this asymptotic lower bound in the following way (see Theorem 2.1): For any integer d ≥ 3 there exists a planar vector field of degree d with at least (1/2)d 2 log2 d − Cd 2 log2 log2 d limit cycles, where C is a positive number that does not depend on d. Let P0 , P1 , . . . , Pn be polynomials in n variables. Under certain “general position” assumptions on Pi ’s, A. Khovanski˘i [Kh1] obtained sharp estimates (in terms of degrees of Pi ’s) for the total index of the vector field (P1 , . . . , Pn ) in Rn and in the regions P0 > 0 and P0 < 0. In particular, for n = 2, he proved that the total index I of singular points of a vector field of degree d, having no singularities at infinity, satisfies |I | ≤ d, and, under the assumption that there are d 2 real singular points (so all the points have index ±1), all the values of I having the same parity as d in this range can be realized by a suitable vector field (see also [Kh2]). We refine Khovanski˘i’s result for planar vector fields in two steps. First, we distinguish between repellers and attractors (having the same index +1). We call the corresponding classification topological and prove the following statement (see Theorem 3.3): For any nonnegative integers d, s0 , σ1 , σ2 , and σ3 satisfying 2s0 + σ1 + σ2 + σ3 = d 2 ,
|σ1 + σ2 − σ3 | ≤ d,
there exists a planar vector field of degree d with 2s0 imaginary singular points, σ1 attractors, σ2 repellers, and σ3 saddles. Second, we distinguish between a source and a source-focus, and between a sink and a sink-focus. The corresponding classification is called thin topological. The following theorem gives an asymptotically complete thin topological classification of collections of singular points of vector fields satisfying some generality conditions (see Subsection 1.4, Section 3, and Theorem 3.4): For any nonnegative integers d, s0 , s1 , s2 , s3 , s4 , and s5 satisfying 2s0 + s1 + s2 + s3 + s4 + s5 = d 2 ,
|s1 + s2 + s3 + s4 − s5 | ≤ d,
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
and
3
1, 1,
if d is odd, s1 + s2 + s3 + s4 − s5 ≥ 3, if d is even, (d − 1)(d − 2) < 2s0 < d 2 , s1 + s2 ≥ s1 + s2 + s3 + s4 − s5 ≥ 0, 2, if d is even, 2s0 ≤ (d − 1)(d − 2), s1 + s2 + s3 + s4 − s5 ≥ 0, there exists a planar vector field of degree d with 2s0 imaginary singular points, s1 sinks, s2 sources, s3 sink-foci, s4 source-foci, and s5 saddles. The last statement is proved by means of the gluing Theorem 1.4.1. To prove the statement on topological classification, we use another gluing theorem (see Theorem 4.1.1, Section 4), which is based on a torus action on (R∗ )2 preserving only topological types of singular points. The reason is that the proof of the result on topological classification becomes much simpler and shorter than possible arguments with general gluing theorems. We point out in this connection that gluing theorems may have various forms in accordance with the problem studied. Remark. An answer to the question on singular points was announced in [IS] with a sketch of the proof. However, that article contained minor flaws in its assertions, and the proofs were incomplete. In the present article we give the corrected statements and the complete proofs. Moreover, we add important details in the approach used. The paper consists of several sections. In Section 1, we formulate and prove the gluing theorems. In the subsequent sections, the gluing theorems are transformed into algorithmic procedures whose outputs are vector fields with many limit cycles (see Section 2) or with a given collection of types of singular points (see Sections 4–7). Such a procedure contains two main steps: (1) analysis of properties of vector fields with small Newton polygons; (2) subdivision of given (large) Newton polygons into appropriate small subpolygons; search for suitable vector fields with these small Newton polygons in order to provide the glued vector field with the required properties. For the problem on limit cycles, both the steps are carried out in Section 2. For the problem on topological classification of collections of singular points, both the steps are performed in Section 4 by means of the methods of [IS]. In Section 5, we carry out the first step for the problem on thin topological classification of collections of singular points. In Section 6, we describe the basic construction and give the thin topological classification for the case when all the singular points are real and the total index is equal to −d. In Section 7, we explain how to complete the proof, but we skip the details, because of the large number (more than ten) of cases, each of them requiring a slight modification of the main algorithm. Acknowledgements. A part of this work was done during the authors’ stay in Mathematisches Forschungsinstitut Oberwolfach under the program “Research in
4
ITENBERG AND SHUSTIN
Paris” supported by Volkswagen-Stiftung, and in Max-Planck-Institut für Mathematik. The authors are grateful to these institutions for the hospitality and excellent work conditions. We are also grateful to L. Gavrilov and the referee for valuable remarks. 1. Gluing theorems 1.1. General scheme of gluing vector fields. By polygon, we mean a convex polygon with integral vertices contained in the positive quadrant R2+ = {(λ, µ) ∈ R2 : λ, µ ≥ 0}. First, we define a class of vector fields adapted for the gluing procedure. For a polygon , define polygons x = conv (i, j − 1) : (i, j ) ∈ ∩ Z2 , j ≥ 1 , y = conv (i − 1, j ) : (i, j ) ∈ ∩ Z2 , i ≥ 1 . A polynomial vector field x˙ = P (x, y), y˙ = Q(x, y) is called admissible if the Newton polygons of the polynomials P , Q coincide with x , y , respectively, for some polygon . In this case, the polygon is called the Newton polygon of the given vector field. It is easy to see that Hamiltonian vector fields are admissible. Assume that we are given the following objects: (1) a nondegenerate (i.e., of dimension 2) polygon , (2) a subdivision = 1 ∪ · · · ∪ N , where 1 , . . . , N are convex polygons, (3) a convex piecewise-linear function ν : → R, which is linear on each polygon 1 , . . . , N and nonlinear on each union k ∪ l , k = l. Suppose also that we have a collection of nonzero real numbers Aij indexed by the integral points of the polygon x and that we have a collection of nonzero real numbers Bij indexed by the integral points of the polygon y . We define the functions ν x , ν y on the polygons x , y by ν x (i, j ) = ν(i, j + 1),
(i, j ) ∈ x ,
ν y (i, j ) = ν(i + 1, j ),
(i, j ) ∈ y .
Consider the vector fields Vk with Newton polygons k (1 ≤ k ≤ N ), defined by Aij x i y j , (1.1.1) y˙ = Bij x i y j . Vk : x˙ = (i,j )∈xk
y
(i,j )∈k
Let us introduce a 1-parametric family V (t), t > 0, of vector fields: x y (1.1.2) Aij x i y j t ν (i,j ) , y˙ = Bij x i y j t ν (i,j ) . V (t) : x˙ = (i,j )∈x
(i,j )∈y
Note that all these vector fields are admissible. The idea of the gluing method is that the properties of the block vector fields (1.1.1), stable with respect to small deformations and invariant with respect to multiplication of vector fields by positive
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
5
constants and linear transformations (x, y) → (αx, βy), α, β > 0, determine the corresponding properties of a glued vector field V (t) for a sufficiently small positive value of t. In the present paper we consider two examples of such properties. 1.2. Limit cycles of vector fields. Let " be a nonsingular limit cycle of a planar vector field V , that is, an isolated nonsingular trajectory of V (for detailed definitions see, e.g., [AIl] and [Ha]). The monodromy map of " is defined as follows. Take a point z ∈ " and the straight line $ passing through z and orthogonal to ". Let w ∈ $ be close to z. Then the trajectory of V , (1.2.1)
x(τ ), y(τ ) ,
τ ≥ 0,
x(0), y(0) = w,
is close to " and returns to a neighborhood U of z after one rotation. We define ϕz (w) ∈ $ as the intersection point of trajectory (1.2.1) with $ ∩ U corresponding to the minimal positive τ . Given a local coordinate s on $ with s(z) = 0, the monodromy map ϕz (s) is analytic and satisfies ϕz (0) = 0. The zero s = 0 of the function ϕz (s)−s is isolated. We call a nonsingular limit cycle " simple if s = 0 is a simple zero of the function ϕz (s) − s. It is clear that the multiplicity of " does not depend on the choice of z ∈ ". The monodromy map and its derivative depend smoothly on small variations of the vector field, and hence a simple limit cycle is stable with respect to such deformations. The limit cycle scheme (V ) of V (by analogy with the real schemes of plane algebraic curves) is the set of simple limit cycles of V lying in (R∗ )2 partially ordered by the relation "1 ≺ "2 if the limit cycle "1 lies inside the limit cycle "2 . 1.3. Gluing vector fields with limit cycles Theorem 1.3.1. In the notation of Subsections 1.1 and 1.2,
for t > 0 small enough, the disjoint union of partially ordered sets (V1 ) · · · (VN ) can be embedded into (V (t)) (as an induced partially ordered subset). Proof. First, we represent the vector fields (1.1.1), (1.1.2) as follows: ˙ = Vk,x = Vk : xy
Ai,j −1 x i y j ,
yx ˙ = Vk,y =
(i,j )∈k
Bi−1,j x i y j ,
(i,j )∈k
k = 1, . . . , N, V (t) : xy ˙ =
(i,j )∈
Ai,j −1 x i y j t ν(i,j ) ,
yx ˙ =
Bi−1,j x i y j t ν(i,j ) .
(i,j )∈
Let lk (i, j ) = αk i + βk j + γk be the linear function equal to ν(i, j ) on k , and let νk = ν − lk , k = 1, . . . , N . The linear transformation Mk (x, y) = (xt αk , yt βk ) takes
6
ITENBERG AND SHUSTIN
the field V (t) into a field Vk (t): xy ˙ = t αk +βk +γk
(i,j )∈
yx ˙ = t αk +βk +γk
Ai,j −1 x i y j t νk (i,j ) = t αk +βk +γk Vk,x + O(t) , Bi−1,j x i y j t νk (i,j ) = t αk +βk +γk Vk,y + O(t) .
(i,j )∈
Let + ⊂ (R∗ )2 be a compact set whose interior contains all the simple limit cycles of V1 , . . . , VN lying in (R∗ )2 . Clearly, for small positive t, the vector fields Vk (t) in + have simple limit cycles close to the limit cycles of Vk , k = 1, . . . , N . The transformations Mk only move limit cycles in (R∗ )2 . Note that |αk − αm | + |βk − βm | > 0,
k = m.
Thus, we can choose t > 0 such that M1−1 (+), . . . , MN−1 (+) are disjoint, and thereby we obtain an embedding of the disjoint union of (V1 ), . . . , (VN ) into the partially ordered set of simple limit cycles of V (t) in M1−1 (+) ∪ · · · ∪ MN−1 (+). 1.4. Gluing vector fields with singular points. A point z∗ = (x∗ , y∗ ) ∈ C2 is called singular for a vector field V : x˙ = P , y˙ = Q, if P (x∗ , y∗ ) = Q(x∗ , y∗ ) = 0. A singular point z∗ is called nondegenerate if
det A = 0,
∂P /∂x A= ∂Q/∂x
∂P /∂y
, ∂Q/∂y z ∗
and, for z∗ ∈ R2 , the linearization matrix A has no eigenvalues with zero real part. We call a vector field of degree d nondegenerate if it has d 2 nondegenerate singular points in (C∗ )2 . Let Ei , i ∈ I, be the types of nondegenerate singular points under some equivalence relation. A type Ei is called stable if any singular point z∗ belonging to Ei keeps its type under small variations of the vector field in the class of polynomial vector fields, under multiplication of the vector field by positive constants, and under linear transformations (x, y) → (αx, βy), α, β > 0. For a vector field V , we denote by c(V ) the distribution of all the singular points of V in (C∗ )2 into the stable types Ei , i ∈ I . Namely, c(V ) = (ci , i ∈ I ), where ci is the number of singular points of V in (C∗ )2 of type Ei . Theorem 1.4.1. In the above notation, for a sufficiently small t > 0, the vector field V (t) given by (1.1.2) satisfies ci (V (t)) ≥ N k=1 ci (Vk ) for all i ∈ I . The proof coincides with the proof of Theorem 1.3.1 in the following sense: we repeat the arguments word for word, replacing limit cycles by singular points of a given type.
7
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
Corollary 1.4.2. In the above notation, suppose that N k=1 i∈I ci (Vk ) is equal to 2 vol(x , y ), where vol(x , y ) denotes the mixed area of x and y . Then N c V (t) = c(Vk ). k=1
Proof. The statement follows from Theorem 1.4.1, since 2 vol(x , y ) is the maximal possible number of singular points in (C∗ )2 of a vector field with Newton polygon (see [Be]). 2. Planar vector fields with many limit cycles. In 1995 C. J. Christopher and N. G. Lloyd [CL] constructed, for any k ≥ 2, a vector field of degree d = 2k − 1 with at least 1 22k−1 (k − 3) + 3 2k − 1 = (d + 1)2 log2 (d + 1) − 3 + 3d 2 limit cycles. This construction holds a record for the number of limit cycles, and, provided the best known asymptotic lower bound for the maximal possible number of limit cycles of planar polynomial vector fields: there exists a family Vd , d = 1, 2, 3, . . . , where Vd is a vector field of degree d with at least (1/8)d 2 log2 d +O(d 2 ) limit cycles. The gluing procedure combined with Christopher and Lloyd’s examples allows us to improve the last statement. Theorem 2.1. There is a positive constant C such that for any integer d ≥ 3 there exists a vector field of degree d with at least 1 2 d log2 d − Cd 2 log2 log2 d 2 limit cycles. To construct the proclaimed vector fields we use the following auxiliary ones. (1)
(2)
Proposition 2.2. For any k ≥ 2 there exist admissible vector fields Vk and Vk with Newton triangles conv{(1, 1), (1 + 2k , 1), (1, 1 + 2k )} and conv{(1 + 2k , 1 + 2k ), (i) (1 + 2k , 1), (1, 1 + 2k )}, respectively, such that each Vk , i = 1, 2, has at least (k − 3)22k−1 + 3(2k − 1) ≥ (k − 3)22k−1 simple limit cycles in (R∗ )2 . Proof. Let k ≥ 2 and x˙ = Pk (x, y), y˙ = Qk (x, y) be a vector field of degree − 1 with at least (k − 3)22k−1 + 3(2k − 1) ≥ (k − 3)22k−1 simple limit cycles in (R∗ )2 . Such a vector field can be obtained from Christopher and Lloyd’s examples by a suitable shift in R2 . The multiplication of a vector field by xy does not affect its limit cycles in (R∗ )2 . Put 2k
k (x, y) = xyPk (x, y) + xpk (x), P
k (x, y) = xyQk (x, y) + yqk (y), Q
where pk , qk are polynomials in one variable of degree 2k . If the absolute values (1) k , Q k ) of the coefficients of pk , qk are sufficiently small, the vector field Vk = (P
8
ITENBERG AND SHUSTIN
d
2k1
..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... . ..... ..... ..... ..... . . . . ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... . ..... ..... ..... ..... . . . ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... . ..... . . ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ... .. . .
2k1
. ......
2k1 + 2k2 . . . d
Figure 1
with Newton triangle conv{(1, 1), (1 + 2k , 1), (1, 1 + 2k )} has at least (k − 3)22k−1 (1) limit cycles in (R∗ )2 . Applying to Vk the transformation (x, y) → (1/x, 1/y) and k (2) multiplying the result by (xy)2 , one obtains the required vector field Vk with Newton triangle conv{(1 + 2k , 1 + 2k ), (1 + 2k , 1), (1, 1 + 2k )}. Proof of Theorem 2.1. Without loss of generality we assume that d ≥ 36. We construct a required vector field of degree d using the following subdivision of the triangle T = conv{(0, 0), (0, d + 1), (d + 1, 0)}. Put d = [d − 6 log2 d], and consider the binary decomposition d = 2k1 + 2k2 + · · · . This decomposition defines a natural tiling of the triangle conv{(0, 0), (d , 0), (0, d )} shown in Figure 1. Remove all the triangles of the tiling with the edges of length 2ki < d / log2 d. Then, enlarge by 3 the horizontal and vertical edges of each remaining triangle, and shift the resulting triangles so that their interiors become disjoint (as it is shown in Figure 2). Observe that the union U of the enlarged triangles is contained in the triangle bounded by the vertical axis, the horizontal line j = −3 log2 d, and the line i + j = d + 3 log2 d. Hence the figure U1 , obtained by shifting U up by [3 log2 d], is contained in the triangle T = conv{(0, 0), (d +1, 0), (d +1, 0)}. Denote by T1 , . . . , Ts the triangles of U1 . For any triangle conv (l, m), l + 3 + 2ki , m , l, m + 3 + 2ki or Tr = k k i i , conv (l, m), l − 3 − 2 , m , l, m − 3 − 2 we take the triangle
9
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
j 2k1 + 2k2 + 6
2k1 + 3
. ....
..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... .......... ..... .. ..... ..... ............ ..... ....... ..... ..... ............. ..... .. ..... ..... ............ ..... ....... ..... ..... ............. ..... .. ..... ..... ............ ..... ....... ..... ..... ............. ..... .. ..... ..... ............ ..... ....... ..... ..... ..... ..... ..... ..... .......... ..... ..... ............ ..... ..... .. ....... ..... ..... . . . . ..... . ............ ..... ..... ....... .. ..... ..... . . ..... ................ ..... ..... .. ....... ..... ..... ..... ................. ..... ..... ....... .. ..... ..... ..... .................. ..... ..... .. ....... ..... ..... ..... ................. ..... ..... ....... .. ..... ..... ..... .................. ..... ..... .. ....... ..... ..... ..... ................. ..... ..... ....... ..... ..... ..... ............. ..... ..... .. ..... ..... ..... . . ..... .......... ..... ..... ....... ..... ..... . ..... ............ ..... ..... .. ..... ..... ..... ..... ............ ..... ..... ....... ..... ..... ..... ............. ..... ..... .. ..... ..... ..... ..... ............ ..... ..... ....... ..... ..... ..... ............. ..... ..... .. ..... ..... ..... ..... ............ ..... ..... ....... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ... .
...... .
i
Figure 2
conv (l + 1, m + 1), l + 1 + 2ki , m + 1 , l + 1, m + 1 + 2ki , τr = conv (l − 1, m − 1), l − 1 − 2ki , m − 1 , l − 1, m − 1 − 2ki
resp.,
(shown by dashes in Figure 2). Then we triangulate the complement T \∪sr=1 τr in an appropriate way in order to obtain a convex triangulation of T . Define block vector fields as follows. For each triangle τr = conv{(l + 1, m + 1), (1) (l +1+2ki , m+1), (l +1, m+1+2ki )} take the vector field Vki from Proposition 2.2 with Newton triangle conv{(1, 1), (1 + 2ki , 1), (1, 1 + 2ki )}, multiplied by x l y m , and for each triangle τr = conv{(l −1, m−1), (l −1−2ki , m−1), (l −1, m−1−2ki )} take (2) the vector field Vki from Proposition 2.2 with Newton triangle conv{(1+2ki , 1+2ki ), k
k
(1 + 2ki , 1), (1, 1 + 2ki )}, multiplied by x l−2−2 i y m−2−2 i . For the remaining integral points in T , take arbitrary generic real numbers to be the coefficients of vector fields corresponding to the elements of the triangulation of T \ ∪sr=1 τr . By Theorem 1.3.1 we can glue all the described block vector fields into a vector field of degree d. To estimate the number of limit cycles of the latter vector field, we note that (1) by Proposition 2.2, the block vector field with Newton triangle τr has at least (ki − 3)22ki −1 simple limit cycles in (R∗ )2 , (2) by construction, all the numbers ki satisfy ki ≥ log2 d − log log2 d − 3, and the sum sr=1 22ki −1 , which is equal to the area of ∪sr=1 τr , satisfies s r=1
22ki −1 ≥
2 2d 1 2d 2 1 d − d− − 6 log2 d . ≥ 2 log2 d 2 log2 d
10
ITENBERG AND SHUSTIN
This gives the following lower bound for the total number of limit cycles:
2 1 2d 1 log2 d − log log2 d − 6 · d− − 6 log2 d ≥ d 2 log2 d − Cd 2 log2 log2 d, 2 log2 d 2
for some C > 0 which does not depend on d. 3. Singular points of planar vector fields: Formulation of results. The space of linearization matrices of real nondegenerate singular points has three connected components. They correspond to three topological types of the local phase portraits (see, e.g., [AIl] and [Ha]). Denote these types as follows: 41 attractors, 42 repellers, 43 saddles. A real nondegenerate singular point is called strongly nondegenerate if the eigenvalues λ1 and λ2 of its linearization matrix are distinct. The space of linearization matrices of strongly nondegenerate singular points has five connected components defining thin topological types: S1 sinks, λ1 < λ2 < 0; S2 sources, λ1 > λ2 > 0; S3 sink-foci, λ1 , λ2 ∈ R, Re λ1 = Re λ2 < 0; S4 source-foci, λ1 , λ2 ∈ R, Re λ1 = Re λ2 > 0; S5 saddle points, λ1 > 0 > λ2 . Clearly, 41 ⊃ S1 ∪ S3 , 42 ⊃ S2 ∪ S4 , 43 = S5 . For a nondegenerate vector field, denote by σi and si the numbers of singular points in (R∗ )2 of types 4i (1 ≤ i ≤ 3) and Si (1 ≤ i ≤ 5), respectively, and denote by 2s0 the number of imaginary singular points in (C∗ )2 . Then, clearly, 2s0 + σ1 + σ2 + σ3 = d 2
(3.1) and (see [Kh1]) (3.2)
|σ1 + σ2 − σ3 | ≤ d.
Theorem 3.3 (Topological classification). For any nonnegative integers d, s0 , σ1 , σ2 , and σ3 satisfying (3.1) and (3.2), there exists a vector field of degree d with 2s0 imaginary singular points and σi real singular points of type 4i , i = 1, 2, 3. This is the complete topological classification of collections of singular points of planar polynomial nondegenerate vector fields. Theorem 3.4 (Thin topological classification). For any nonnegative integers d, s0 , s1 , s2 , s3 , s4 , and s5 satisfying 2s0 + s1 + s2 + s3 + s4 + s5 = d 2 ,
|s1 + s2 + s3 + s4 − s5 | ≤ d,
11
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
and
(3.5)
1, 1,
if d is odd, s1 + s2 + s3 + s4 − s5 ≥ 3, if d is even, (d − 1)(d − 2) < 2s0 < d 2 , s1 + s2 ≥ s1 + s2 + s3 + s4 − s5 ≥ 0, 2, if d is even, 2s0 ≤ (d − 1)(d − 2), s1 + s2 + s3 + s4 − s5 ≥ 0,
there exists a vector field of degree d with 2s0 imaginary singular points and si real singular points of type Si , i = 1, 2, 3, 4, 5. This result is an asymptotically complete thin topological classification of collections of singular points of planar polynomial nondegenerate vector fields whose real singular points are strongly nondegenerate. We conjecture that the restriction (3.5) is not necessary. Proofs of Theorems 3.3 and 3.4 are presented in Sections 4 and 5–7, respectively. 4. Topological classification of collections of singular points 4.1. Another gluing theorem. Theorem 3.3 is, in fact, a corollary of Theorem 3.4. However, here we present an alternative proof based on a gluing theorem different from Theorem 1.4.1. This specific version of gluing is adapted for the topological classification. Suppose that we have the following objects: (1) a nondegenerate polygon , (2) a subdivision = 1 ∪ · · · ∪ N , where 1 , . . . , N are convex polygons, (3) a convex piecewise-linear function ν : → R, which is linear on each polygon 1 , . . . , N and nonlinear on each union k ∪ l , k = l, (4) a collection of real numbers Aij , Bij indexed by (i, j ) ∈ ∩ Z2 , such that Aij , Bij = 0 for all vertices (i, j ) of all the polygons 1 , . . . , N . Introduce the vector fields Vk : x˙ = Aij x i y j , y˙ = Bij x i y j , k = 1, . . . , N (i,j )∈k
and the family of vector fields Aij x i y j t ν(i,j ) , V (t) : x˙ = (i,j )∈
(i,j )∈k
y˙ =
Bij x i y j t ν(i,j ) ,
t > 0.
(i,j )∈
Unlike Section 1, here the components of the vector fields have the same Newton polygon. Theorem 4.1.1. Assume that each vector field Vk , k = 1, . . . , N , has in (C∗ )2 exactly 2 · vol(k ) nondegenerate singular points, assume the coefficient ξ22 of the
12
ITENBERG AND SHUSTIN
linearization matrix (ξrs )r,s=1,2 of Vk at any real nonsaddle singular point differs from zero, and assume the function ν(λ, µ) satisfies (4.1.2)
∂ν ∂ν > ∂λ ∂µ
at any point in the interior of any linearity domain. Then, for t > 0 small enough, there exists a one-to-one correspondence Ꮿ between the set of singular points of the vector field V (t) in (C∗ )2 and the disjoint union of the sets of singular points of the vector fields V1 , . . . , VN in (C∗ )2 having the following properties: (1) an imaginary singular point of Vk , 1 ≤ k ≤ N, corresponds to an imaginary nondegenerate singular point of V (t), (2) a saddle singular point of Vk , 1 ≤ k ≤ N, corresponds to a saddle singular point of V (t), (3) a real nonsaddle singular point of Vk , 1 ≤ k ≤ N, with the linearization matrix (ξrs )r,s=1,2 corresponds to a real nondegenerate singular point of V (t), which is a repeller if ξ22 > 0 and an attractor if ξ22 < 0. Proof. Let lk (i, j ) = αk i +βk j +γk be a linear function equal to ν(i, j ) on k . We apply the linear transformation Mk (x, y) = (xt αk , yt βk ) to the field V (t) and obtain the vector field y˙ = t βk +γk Vk,y + O(t) . Vk (t) : x˙ = t αk +γk Vk,x + O(t) , As in the proof of Theorem 1.3.1, for a sufficiently small t > 0, one gets an inclusion of the disjoint union of the sets of singular points of the vector fields Vk in (C∗ )2 into the set of singular points of V (t) in (C∗ )2 . This inclusion is, in fact, a bijection due to Bernstein’s theorem [Be]. Clearly, imaginary singular points correspond to imaginary ones. A real singular point (x0 , y0 ) of Vk with the linearization matrix ξ11 ξ12 Ꮽ= ξ21 ξ22 corresponds to a singular point (x0 + O(t), y0 + O(t)) of the field Vk (t) with the linearization matrix α α k ξ k ξ + O(t) t + O(t) t 11 12 Ꮽ = t γk (4.1.3) . t βk ξ21 + O(t) t βk ξ22 + O(t) Since det(Ꮽ) · det(Ꮽ ) > 0 for a positive sufficiently small t and since αk > βk by (4.1.2), one easily derives the relations given in the statement. 4.2. Proof of Theorem 3.3 in the case s0 = 0. Consider the triangle T = {(0, 0), (d, 0), (0, d)} in R2 divided into the following triangles δj k = conv (j, k), (j + 1, k), (j, k + 1) , γj k = conv (j + 1, k), (j, k + 1), (j + 1, k + 1) ,
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
13
where j , k are nonnegative integers. These triangles are the linearity domains of the convex piecewise-linear function ν = ν1 + ν2 + ν3 : T → R, where ν1 is a convex piecewise-linear function with the linearity domains {(λ, µ) ∈ T : n ≤ λ ≤ n + 1},
n ∈ Z,
ν2 is a convex piecewise-linear function with the linearity domains {(λ, µ) ∈ T : n ≤ µ ≤ n + 1},
n ∈ Z,
and ν3 is a convex piecewise-linear function with the linearity domains {(λ, µ) ∈ T : n ≤ λ + µ ≤ n + 1},
n ∈ Z.
Adding a linear function Kλ with a large K > 0 to ν(λ, µ), we get the property (4.1.2). Given two collections of nonzero real numbers aj k and bj k indexed by the integral points of T , we associate with each triangle Ti = δj k or γj k in the triangulation of T a vector field Vi = (Pi (x, y), Qi (x, y)) such that the triangle Ti is the Newton polygon of Pi and Qi and such that the coefficients of Pi (resp., Qi ) coincide with the numbers aj k (resp., bj k ) at the vertices of Ti . Such a vector field Vi has one singular point in (R∗ )2 (see Subsection 1.4). Now let us glue the fields Vi into the field aj k x j y k t ν(j,k) , y˙ = bj k x j y k t ν(j,k) , t > 0. V (t) : x˙ = j +k≤d
j +k≤d
Let Vδ and Vγ be the vector fields x˙ = aj k x j y k + aj +1,k x j +1 y k + aj,k+1 x j y k+1 , Vδ : y˙ = bj k x j y k + bj +1,k x j +1 y k + bj,k+1 x j y k+1 , Vγ :
x˙ = aj +1,k+1 x j +1 y k+1 + aj +1,k x j +1 y k + aj,k+1 x j y k+1 , y˙ = bj +1,k+1 x j +1 y k+1 + bj +1,k x j +1 y k + bj,k+1 x j y k+1 .
(The Newton triangles δj k and γj k of the components of Vδ and Vγ , resp., are shown in Figure 3.) Proposition 4.2.1. In the above notation, for t > 0 small enough, exactly one of the singular points of the field V (t), assigned by the correspondence Ꮿ (see Theorem 4.1.1) to the singular points of the vector fields Vδ and Vγ , is a saddle. Proof. The linearization matrix at the singular point (x0 , y0 ) of Vδ is a aj,k+1 j Ꮽ(Vδ ) = x0 y0k j +1,k (4.2.2) , bj +1,k bj,k+1
14
ITENBERG AND SHUSTIN .....
k +1
k
..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... .
γj k
δj k
j
j +1
...... .
Figure 3
and the linearization matrix at the singular point (x1 , y1 ) of Vγ is 2 2 j −1 k−1 aj,k+1 y1 aj +1,k x1 . Ꮽ(Vγ ) = −x1 y1 bj,k+1 y12 bj +1,k x12 Hence 2 2j 2j det Ꮽ(Vδ ) · det Ꮽ(Vγ ) = −x0 y02k x1 y12k aj +1,k bj,k+1 − aj,k+1 bj +1,k < 0, which completes the proof. The following statement shows how to control the topological types of singular points of the field V (t) by means of a suitable choice of numbers aj k , bj k . Proposition 4.2.3. In the above notation, given nonnegative integers j, k, nonzero real numbers aj k , aj +1,k , bj k , bj +1,k , and m = 1, 2, 3, there exist nonzero numbers aj,k+1 , bj,k+1 such that, for t > 0 small enough, the singular point of the vector field V (t), assigned by the correspondence Ꮿ to the singular point of the vector field Vδ , belongs to the class 4m . j
Proof. By (4.2.2) one has det(Ꮽ(Vδ )) = (x0 y0k )2 (aj +1,k bj,k+1 − aj,k+1 bj +1,k ). So, by Theorem 4.1.1, to have a saddle at the corresponding singular point of the field V (t), one should choose aj,k+1 , bj,k+1 so that aj +1,k bj,k+1 − aj,k+1 bj +1,k < 0. To have a repeller at the corresponding singular point of V (t), by Theorem 4.1.1 and (4.2.2), one has to satisfy the conditions (in the notation of Theorem 4.1.1) 2j j j det Ꮽ(Vδ ) = x0 y02k aj +1,k bj,k+1 − aj,k+1 bj +1,k > 0, ξ22 = x0 y0 bj,k+1 > 0.
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
15
Since x0 =
aj,k+1 bj k − aj k bj,k+1 , aj +1,k bj,k+1 − aj,k+1 bj +1,k
y0 =
aj k bj +1,k − aj +1,k bj k , aj +1,k bj,k+1 − aj,k+1 bj +1,k
the previous inequalities are equivalent to the system aj +1,k bj,k+1 − aj,k+1 bj +1,k > 0, j k aj,k+1 bj k − aj k bj,k+1 aj k bj +1,k − aj +1,k bj k bj,k+1 > 0, which always has a solution aj,k+1 = 0, bj,k+1 = 0. Similarly, one proves the statement in the case of an attractor. Now, we note that any nonnegative integers σ1 , σ2 , and σ3 satisfying σ 1 + σ2 + σ 3 = d 2 ,
−d ≤ σ1 + σ2 − σ3 ≤ d,
can be represented as follows: σm = σm + σm , σ1 + σ2 = σ3 =
σm , σm ≥ 0,
d(d − 1) , 2
m = 1, 2, 3,
σ1 + σ2 + σ3 = d.
Consider again the triangle T = conv((0, 0), (d, 0), (0, d)) and its triangulation introduced above. Introduce the following order at the integer points of T : (4.2.4)
(j1 , k1 ) ≺ (j2 , k2 ) ⇐⇒ k1 < k2
or
k1 = k 2 ,
j1 > j 2 .
Associate with any integer point (j, k) of T a pair of numbers aj k , bj k , considering the integer points successively in the order (4.2.4). According to Propositions 4.2.1 and 4.2.3, the collections {aj k } and {bj k } can be chosen in such a way that (1) d vector fields corresponding to the triangles δj k , where j + k = d − 1, have altogether σm points of type 4m , m = 1, 2, 3, (2) d(d −1)/2 vector fields corresponding to the triangles δj k , where j +k < d −1, have altogether σm points of type 4m , m = 1, 2, (3) d(d −1)/2 vector fields corresponding to the triangles γj k , where j +k < d −1, (which are, actually, determined by the previous vector fields according to Proposition 4.2.1) have altogether σ3 = d(d − 1)/2 saddles. Finally, using Theorem 4.1.1, we obtain the desired vector field. 4.3. Proof of Theorem 3.3 in the case 0 < s0 ≤ d(d −1)/2. We define the following subdivision of the triangle T with the vertices (0, 0), (d, 0), (0, d). The triangle T contains d(d − 1)/2 squares sqj k = conv (j, k), (j + 1, k), (j, k + 1), (j + 1, k + 1) .
16
ITENBERG AND SHUSTIN
Ordering these squares by (4.2.4), we choose the first s0 squares from this set and then cover the rest of T by the triangles δj k , γj k , introduced in Subsection 4.2. Clearly, there exists a convex piecewise-linear function ν : T → R, whose linearity domains are these s0 squares and triangles. Proposition 4.3.1. For any generic nonzero real numbers aj k , aj +1,k , aj +1,k+1 , bj k , bj +1,k , bj +1,k+1 , there exist nonzero real numbers aj,k+1 , bj,k+1 such that the vector field
x˙ = P (x, y) = x j y k aj k + aj +1,k x + aj,k+1 y + aj +1,k+1 xy , y˙ = Q(x, y) = x j y k bj k + bj +1,k x + bj,k+1 y + bj +1,k+1 xy
has two imaginary nondegenerate singular points in (C∗ )2 . The term generic nonzero real numbers means that the six numbers mentioned in the assertion are chosen in a Zariski-open subset of R6 . Proof. The system P (x, y) = Q(x, y) = 0 in (C∗ )2 is reduced to the equation aj +1,k bj +1,k+1 − aj +1,k+1 bj +1,k x 2 + aj +1,k bj,k+1 + aj k bj +1,k+1 − aj,k+1 bj +1,k − aj +1,k+1 bj k x + aj k bj,k+1 − aj,k+1 bj k = 0, with discriminant 2 D = aj +1,k bj,k+1 + aj k bj +1,k+1 − aj,k+1 bj +1,k − aj +1,k+1 bj k − 4 aj +1,k bj +1,k+1 − aj +1,k+1 bj +1,k aj k bj,k+1 − aj,k+1 bj k . Given generic aj k , aj +1,k , aj +1,k+1 , bj k , bj +1,k , bj +1,k+1 , the equation D = 0 defines a parabola in the plane aj,k+1 , bj,k+1 . Hence, there exist nonzero aj,k+1 , bj,k+1 such that D < 0, which completes the proof. Now, we use Propositions 4.2.1 and 4.2.3 to choose pairs of numbers aj k , bj k at the integer points (j, k), j + k ≤ d of T (considering the points successively in the order (4.2.4)), in such a way that the vector field corresponding to each of s0 chosen squares sqj k has two imaginary nondegenerate points, and the vector fields corresponding to the triangles δj k , γj k in the given subdivision of T provide the prescribed quantities of saddles, repellers, and attractors of the field V (t) : x˙ =
aj k x j y k t ν(j,k) ,
j +k≤d
as was done in the previous subsection.
y˙ =
j +k≤d
bj k x j y k t ν(j,k) ,
17
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
4.4. Proof of Theorem 3.3 in the case s0 > d(d −1)/2. In this case σ1 +σ2 +σ3 = d 2 − 2s0 = d − 2u, 0 < u ≤ d/2. We introduce the following new element in our construction. Proposition 4.4.1. Let u ≤ d/2 be a positive integer. For any fixed generic polynomials f (x), g(x) of degree d − 2u, there exists a vector field x˙ = P (x, y) = y 2u f (x) + aj k x j y k , k<2u, j ≤d−k y˙ = Q(x, y) = y 2u g(x) + bj k x j y k , k<2u, j ≤d−k
where polynomials P and Q have the Newton quadrangle = conv (0, 0), (0, 2u), (d − 2u, 2u), (d, 0) with 4u(d − u) distinct imaginary nondegenerate singular points in (C∗ )2 . Proof. Take two generic polynomials aj k x j y k , P¯ (x, y) = y u f (x) +
¯ Q(x, y) = y u g(x) +
k
bj k x j y k
k
with Newton quadrangle conv((0, 0), (0, u), (d − u, u), (d, 0)). A transformation y → y +A (where A is an appropriate number) makes all the ordinates of intersection ¯ = 0 negative, preserving the Newton polygons of the polynomials P¯ points P¯ = Q ¯ ¯ and Q. Then P (x, y) = P¯ (x, y 2 ), Q(x, y) = Q(x, y 2 ) satisfy the conditions required. Let us take a triangle
T = conv (0, d), (0, 2u), (d − 2u, 2u)
and divide it into the (d − 2u)(d − 2u − 1)/2 squares sqj k , k ≥ 2u, j + k ≤ d − 2, and d − 2u triangles δj k , j = 0, . . . , d − 2u − 1, j + k = d − 1. Now, using Propositions 4.2.3 and 4.3.1 and Theorem 4.1.1, we construct a vector field V : x˙ = aj k x j y k , y˙ = bj k x j y k (j,k)∈T
(j,k)∈T
having (d − 2u)(d − 2u − 1) imaginary nondegenerate singular points in (C∗ )2 , σ1 repellers, σ2 attractors, and σ3 saddles. Since these types of singular points are stable, one can assume that the polynomials f (x) =
d−2u j =0
are generic.
aj,2u x j ,
g(x) =
d−2u j =0
bj,2u x j
18
ITENBERG AND SHUSTIN
Now, by Theorem 4.1.1, we glue the field V and the field from Proposition 4.4.1 and get a vector field V with (d − 2u)(d − 2u − 1) + 4u(d − 2u) = 2s0 imaginary singular points, σ3 saddles, and σ1 + σ2 repellers and attractors. We have only to show that a repeller of the field V corresponds to a repeller of V , and an attractor of V corresponds to an attractor of V . By Theorem 4.1.1, this depends on the signs of the entries ξ22 of the linearization matrices at repellers and attractors of V . By (4.1.3), the last coefficient of the linearization matrix of V at a real nonsaddle singular β point is equal to t1α t2 ( ξ22 + O(t1 + t2 )), where t1 , t2 > 0 are the parameters in the first and second gluing procedures, α, β are some real numbers, and ξ22 is the entry of the linearization matrix of the corresponding singular point of a field with Newton triangle δj k used in the first gluing. Hence, by Theorem 4.1.1, sign( ξ22 ) determines the same topological type of the corresponding singular points of V and V , and we are done. 5. Elementary vector fields. Here we describe block vector fields used in the proof of Theorem 3.4 in Sections 6 and 7. An admissible vector field is called elementary if its Newton polygon is a triangle δ of area 1/2. We say that an elementary vector field is complete if δ does not have vertices on the coordinate axes, and it is boundary if δ has one or two vertices on the coordinate axes but does not have an edge on the coordinate axes. Note that an elementary vector field does not have singular points in (C∗ )2 if δ has an edge on the coordinate axes. Proposition 5.1. If an elementary vector field x˙ = P (x, y) = A1 ε1 x i1 y j1 −1 + A2 ε2 x i2 y j2 −1 + A3 ε3 x i3 y j3 −1 , (5.2)
y˙ = Q(x, y) = ε1 x i1 −1 y j1 + ε2 x i2 −1 y j2 + ε3 x i3 −1 y j3 , A1 A2 A3 = 0, ε1 , ε2 , ε3 = ±1,
satisfies (5.3)
(A2 − A1 )(A1 − A3 )(A3 − A2 ) = 0,
then it has exactly one singular point in (R∗ )2 . Proof. Under the condition (5.3), we easily derive from the system P (x, y) = Q(x, y) = 0 that (5.4)
x i2 −i1 y j2 −j1 = ε1 ε2
A1 − A 3 , A3 − A 2
x i3 −i1 y j3 −j1 = ε1 ε3
A2 − A 1 . A3 − A 2
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
19
It implies the required statement, because the elementarity of V means that i1 j1 1 det i2 j2 1 = (i2 − i1 )(j3 − j1 ) − (i3 − i1 )(j2 − j1 ) = ±1. i3 j3 1 The determinant and the trace of the linearization matrix at a singular point of a vector field and the discriminant of the characteristic polynomial of the linearization matrix are called the determinant, the trace, and the discriminant of the singular point, respectively. If the vector field has only one singular point in (R∗ )2 , the determinant (resp., the trace and the discriminant) of this singular point is also called the determinant (resp., the trace and the discriminant) of the vector field. An elementary vector field of form (5.2) satisfying (5.3) is called nondegenerate elementary (abbreviated further as NDE). For an NDE vector field V , the linearization matrix at the unique singular point of V in (R∗ )2 , the determinant, the trace, and the discriminant are denoted by Ꮽ(V ), det(V ), tr(V ), and discr(V ), respectively. The types S1 , . . . , S5 differ by signs of the above numbers: (1) S1 = {det(V ) > 0, tr(V ) < 0, discr(V ) > 0}, (2) S2 = {det(V ) > 0, tr(V ) > 0, discr(V ) > 0}, (3) S3 = {det(V ) > 0, tr(V ) < 0, discr(V ) < 0}, (4) S4 = {det(V ) > 0, tr(V ) > 0, discr(V ) < 0}, (5) S5 = {det(V ) < 0}. Straightforward calculations give a description of singular points of NDE vector fields. The exact statements are presented in Propositions 5.5, 5.7, 5.8, 5.12, and 5.13. In these propositions, we suppose that the vertices of Newton triangles are numbered counterclockwise. Proposition 5.5. Let (5.2) be a complete NDE vector field V with the Newton triangle δ = conv((i1 , j1 ), (i2 , j2 ), (i3 , j3 )). Then sign det(V ) = sign (A3 − A1 )(A2 − A3 )(A1 − A2 ) , (5.6) sign discr(V ) = sign @(A1 , A2 , A3 )2 − 4(A3 − A1 )(A1 − A2 )(A2 − A3 ) , where @(A1 , A2 , A3 ) = (i2 −i1 )A2 +j2 −j1 (A3 −A1 )+ (i3 −i1 )A3 +j3 −j1 (A1 −A2 ). To describe the situation with sign(tr(V )), we introduce two types of triangles of area 1/2: a triangle is of type 1 if it has a vertex with both the coordinates odd (clearly, such a vertex is unique), and it is of type 2 otherwise. Proposition 5.7. In the notation of Proposition 5.5, if δ is of type 1 and i1 , j1 are odd, then @(A1 , A2 , A3 ) ; sign tr(V ) = sign(ε1 ) · sign A2 − A 3
20
ITENBERG AND SHUSTIN
if δ is of type 2, then sign tr(V ) = sign(ε1 ε2 ε3 ) · sign A(A1 , A2 , A3 ) , where A is a rational function. Proposition 5.8. Let V be a boundary NDE vector field with the Newton triangle δ = conv((0, j1 ), (i2 , j2 ), (i3 , j3 )), i2 i3 j1 j2 j3 = 0, given by x˙ = A1 ε1 y j1 −1 + A2 ε2 x i2 y j2 −1 + A3 ε3 x i3 y j3 −1 , y˙ = ε2 x i2 −1 y j2 + ε3 x i3 −1 y j3 , A1 > 0, A2 A3 = 0, ε1 , ε2 , ε3 = ±1. Then (5.9)
sign det(V ) = sign(A3 − A2 ), sign discr(V ) = sign (i2 A2 − i3 A3 − j3 + j2 )2 − 4(A3 − A2 ) .
If δ is of type 1, then (5.10)
sign ε2 (i2 A2 − i3 A3 − j3 + j2 ) , sign tr(V ) = sign ε3 (i3 A3 − i2 A2 − j2 + j3 ) ,
if i2 , j2 are odd, if i3 , j3 are odd.
If δ is of type 2, then (5.11) sign tr(V ) = sign(ε1 ε2 ε3 ) · sign A1 (A2 − A3 )(i2 A2 − i3 A3 − j3 + j2 ) . Similar formulae hold when a vertex of the Newton triangle lies on the other axis. Proposition 5.12. Let V be a boundary NDE vector field with the Newton triangle δ = conv((i1 , 0), (0, j2 ), (i3 , j3 )), i1 i3 j2 j3 = 0, given by x˙ = A2 ε2 y j2 −1 + A3 ε3 x i3 y j3 −1 ,
y˙ = ε1 x i1 −1 + ε3 x i3 −1 y j3 ,
A2 > 0, A3 = 0, ε1 , ε2 , ε3 = ±1. Then sign det(V ) = sign(A3 ) · sign(i1 j3 + i3 j2 − i1 j2 ), discr(V ) > 0, sign(ε3 ) · sign(i3 A3 + j3 ), if i3 , j3 are odd, sign tr(V ) = sign(ε1 ε2 ε3 ) · sign A2 A3 (i3 A3 + j3 ) , otherwise. Proposition 5.13. Any elementary vector field V = (P , Q) can be turned into a vector field with coefficients 0, ±1 in the y-component ˙ by means of a suitable linear transformation (x, y) → (xα, yβ), α, β > 0, and multiplication of the vector field by
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
21
a positive number. The parameters A1 , A2 , A3 , ε1 , ε2 , ε3 , defined in Propositions 5.1, 5.5, 5.8, and 5.12 for the corresponding elementary field, are determined uniquely. Moreover, ε1 , ε2 , ε3 are the signs of the coefficients of Q(x, y), and A1 , A2 , A3 are homogeneous functions of order 1 in the coefficients of P (x, y) and of order −1 in the coefficients of Q(x, y). 6. Thin topological classification: Basic construction. Now we can start the proof of Theorem 3.4. The cases d = 0, 1 are obvious. In the sequel, we suppose that d ≥ 2. In this section, we consider the special case s0 = 0,
s1 + s2 + s3 + s4 − s5 = −d.
The construction described here is basically the same for all other cases. In the next section we show what kind of modifications should be done to complete the proof of Theorem 3.4. A vector field of degree d has in general the Newton triangle T = conv (0, 0), (0, d + 1), (d + 1, 0) . We describe a triangulation of T and block vector fields corresponding to this triangulation. Throughout Subsections 6.1–6.5 we assume that d is even. 6.1. Triangulation. We define a piecewise-linear convex function ν1 + ν2 + ν3 : T → R, where ν1 is a convex piecewise-linear function with the linearity domains (λ, µ) ∈ R2 : 2i ≤ µ ≤ 2i + 2 , i ∈ Z, ν2 is a convex piecewise-linear function with the linearity domains (λ, µ) ∈ R2 : 2i ≤ λ ≤ 2i + 2 , i ∈ Z, and ν3 is a convex piecewise-linear function with the linearity domains (λ, µ) ∈ R2 : 2i − 1 ≤ λ + µ ≤ 2i + 1 , i ∈ Z. The intersections of these domains define a subdivision of T into the hexagons Hij , i, j ≥ 0, i + j ≤ (d − 2)/2, with vertices v1 = (2i, 2j + 1), v4 = (2i + 2, 2j + 1),
v2 = (2i + 1, 2j ),
v3 = (2i + 2, 2j ),
v5 = (2i + 1, 2j + 2),
v6 = (2i, 2j + 2),
and triangles covering T \ ∪i,j Hij (see Figure 4). A triangle τ of area 1/2 is called odd if conv((0, 0), τ ) is a nondegenerate quadrangle. It is called even otherwise.
22
ITENBERG AND SHUSTIN .....
d +1
..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ... .. . .
v6 v5 v1 v4 v2 v3
H00
d +1
. ......
Figure 4
For any hexagon Hij , we take a triangulation into six triangles of area 1/2: three odd triangles, one even triangle of type 1, and two even triangles of type 2. Namely, let c = (2i + 1, 2j + 1) be the center of Hij . If j ≥ i, we triangulate Hij by the segments [c, v1 ], [c, v3 ], [c, v5 ], [v1 , v5 ], [v1 , v3 ], [v3 , v5 ]. If i > j , we triangulate Hij by the segments [c, v2 ], [c, v4 ], [c, v6 ], [v2 , v6 ], [v2 , v4 ], [v4 , v6 ] (see Figure 5). Denote by D the described triangulation of T . 6.2. Block vector fields. Suppose that we have certain numbers εkl = ±1, Akl = 0, where (k, l) ∈ T ∩ Z2 . Consider elementary vector fields Vδ with Newton triangles δ ∈ D: x˙ = Akl εkl x k y l−1 , y˙ = εkl x k−1 y l . (k,l)∈δ, l≥1
(k,l)∈δ, k≥1
Proposition 6.2.1. Let Vδ , δ ∈ D, be a complete or boundary elementary vector field close to a Hamiltonian one, that is, any Akl with (k, l) ∈ δ, k > 0 is close to −l/k. Then Vδ has a saddle (resp., a focus) in (R∗ )2 if δ is an odd (resp., even) triangle. Proof. If Vδ is close to a Hamiltonian vector field, the type of the singular point of Vδ in (R∗ )2 is determined by the type of the critical point of the corresponding trinomial with Newton triangle δ. It follows from [S2, Propositions 2.2.1, 2.2.4, and 2.2.5]
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
23
.....
d +1
..... ..... ..... ..... ..... ..... ..... ..... ..... ..... .... ..... ......... ..... ............. . . ... ... ...... ..... . ... ...... . . . . ... ...... ..... . . ... ..... . . ... ..... ... ..... ..... ... ............. ..... ..... .. ............. ..... ..... ..... ..... ........ ..... ..... ..... ..... ........ ..... ... ..... ..... ........ . ....... ........ ..... ..... ..... ....... ..... .. ..... ..... ....... .... .. ..... ..... ....... ........ ..... ..... ............. ..... ..... ....... .......... .... . . . . . .. ..... ........ ..... ............. ..... ............. . . . . . ... .... ... ..... . .... ..... ..... ... ...... ... ..... . . . . . . . . . ... ...... ... ...... .. ..... . ..... . . ... ...... . . . . . . . . . . ... ..... . .. ... ... ..... ........ ... ..... ... ......... ........ ... ..... ............. ..... ..... . ............ ... ..... ............. . ..... ..... ..... ............. ..... ..... ........ ..... ... ..... .. ..... ........ ..... ..... ........ ..... ... ..... ... ..... ........ ..... ..... ........ . ..... ... ....... ..... ....... ........ ..... ..... ..... ....... ..... ....... ..... .. ..... ..... ....... ......... ..... ..... ....... .... .. ..... . ..... . . ....... ........ ..... ....... ........ ..... ..... ............. ..... ............. ..... ..... . . . .......... ......... ..... ..... .............. ..... .............. ............ . . ..... . . . ..... .................. .................. ..... ..... ............ . . . . . . . . . . . . . . . . . . . ..... . . . . . . . . ..... ... ..... ... ..... ........ . ... ...... ......... ... ..... ....... ..... ..... . ... ...... . . . . . . . . . . . . . . . ....... . ... ..... ............. ......... ... ...... ... ....... ....... ......... ....... ..... ..... ..... . . . . . . . . . . . . . . . . . . . ....... .... ... .... ....... ..... . ... ...... ... ...... ........... ..... ........... ..... ... ..... ..... . . . . . ........... . . . . . . . ......... ..... ... ... ..... .... ... ..... ............. ..... ...... ..... ... ... . . . . . ............. ..... . ..... ..... . . . ..... ..... ........ ..... .... ..... ......... ..... ......... .... ..... ..... . . . ..... ........ ..... ... . ..... ... ..... . ..... ..... ... ..... ........ . ..... ... ..... ..... ..... ..... .. . . . . . . ....... ........ ..... ..... . . ..... .. ..... ..... ... ....... ..... .. ..... ..... ..... ..... ... ..... ..... .. . . . . ....... .... ... . . . . ..... ........ ..... ..... ........ ...... ....... ....... ..... ....... ..... ..... ........ ..... ............. ..... .......... ..... ........ ....... .
. ......
d +1 Figure 5
that if δ is an odd triangle, the trinomial has a saddle (thus, Vδ has a saddle); if δ is even, the trinomial has an extremum (thus, Vδ has a focus). Remark. The statement of Proposition 6.2.1 is true for any triangle δ of area 1/2 such that no straight line through the origin contains an edge of δ. Suppose that each number Akl with k > 0 is sufficiently close to −l/k. Then the vector fields Vδ , δ ∈ D, have a total of d(d + 1)/2 saddles and d(d − 1)/2 foci. 6.3. Refinement procedure. In this subsection we describe a procedure (we call it the refinement procedure) that allows us to obtain a vector field with si singular points of type Si (i = 1, . . . , 5) starting from the block vector fields described in the previous subsection. The procedure is quite general. It can be applied (as it is done in Section 7) to different collections of block vector fields. Step 1. Sources and sinks. Consider the families Vδ (α), δ ∈ D, 1 ≤ α < ∞, of vector fields x˙ = α ·
(k, l)∈δ, l≥1
Akl εkl x k y l−1 ,
y˙ =
εkl x k−1 y l .
(k, l)∈δ, k≥1
Clearly, the sign of the determinant of a singular point does not change in such a family; hence saddles remain saddles, and foci turn into sinks or sources and back. We say that an elementary vector field δ is absolutely generic if one of the following conditions holds:
24
ITENBERG AND SHUSTIN
(1) Vδ is a complete NDE vector field, and the sign of the discriminant of Vδ (α) coincides with the sign of a polynomial of degree 4 in α (compare with formula (5.6)); (2) Vδ is a boundary NDE vector field with exactly one vertex on the coordinate axes, and the sign of the discriminant of Vδ (α) coincides with the sign of a quadratic polynomial in α (compare with formula (5.9)); (3) Vδ is an NDE vector field with at least two vertices on the coordinate axes. Proposition 6.3.1. By a small variation of the numbers Akl , one can get the following: (i) all the vector fields Vδ are absolutely generic; (ii) for any α ≥ 1, at most one of the vector fields Vδ (α), δ ∈ D, has a positive determinant and the zero discriminant. Proof. The fact that all the vector fields Vδ can be made absolutely generic by a small variation of the numbers Akl is straightforward: If Vδ is a complete NDE vector field, the coefficient at α 4 in @(A1 , A2 , A3 )2 − 4(A3 − A1 )(A1 − A2 )(A2 − A3 ) (see formula (5.6)) is equal to (i2 − i1 )A2 (A3 − A1 ) + (i3 − i1 )A3 (A1 − A2 ))2 ; if Vδ is a boundary NDE vector field with exactly one vertex on the coordinate y-axis, the coefficient at α 2 in (i2 A2 −i3 A3 −j3 +j2 )2 −4(A3 −A2 ) (see formula (5.9)) is equal to (i2 A2 − i3 A3 )2 . The set of values of α such that at least one of the vector fields Vδ (α) has discriminant zero is finite. Let δ1 , . . . , δr be the triangles in D such that the corresponding vector fields Vδ1 (α0 ), . . . , Vδr (α0 ) have positive determinants and the zero discriminant. We call a vertex of δi interior if it does not belong to a coordinate axis. Consider two triangles δ1 and δ2 . Note that either δ2 has an interior vertex that does not belong to δ1 , or δ1 has an interior vertex that does not belong to δ2 . Varying the coefficient that corresponds to this interior vertex of δ1 or δ2 , we can decrease by 1 the number of vector fields (among Vδ (α0 ), δ ∈ D) having positive determinants and the zero discriminant. Proceeding in this way, we get a collection of absolutely generic vector fields Vδ , δ ∈ D, such that, for any α ≥ 1, there exists at most one vector field Vδ (α) with positive determinant and discriminant zero. Using Proposition 6.3.1, we suppose that all the vector fields Vδ are absolutely generic and that, for any α ≥ 1, at most one of the vector fields Vδ (α), δ ∈ D, has a positive determinant and the discriminant equal to zero. Consider the set of values of α such that one of the vector fields Vδ (α) with positive determinant has discriminant zero. Note that this set does not depend on signs εkl . Moreover, if α0 belongs to this set and the discriminant of Vδ (α0 ) is equal to zero, the type of modification of a singular point of Vδ (α) at α0 is also independent of the choice of the signs εkl . Formulae (5.6) and (5.9) show that, for any δ ∈ D, the discriminant of Vδ (α) is positive if α is sufficiently large. Hence, a focus of Vδ (α) necessarily turns into a source or a sink when α → ∞.
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
25
.....
v
. .. ... ... .... ... ... ... .... ... ..................... . ... ...... .. .. ... ....... ..... ... ... ..... ... ... ..... .. ... ..... . . ..... ... ..... .... ... ..... .. ... ....... . ... ............ . ... . . ... ... .......... ... ..... .. ..... ... ... ..... ... ... ..... ... ..... .. . ..... .... . ... . . ........ . ... . . ... .... ....... .......... . ... .. . ...... . . . . . ... ... . . . ............. . . . . . . . ... ... ......... ......... ............................... .............................. . ....... ....... ....... . . . . ..
v
δ(v)
l
v
...... ...... .......
....... ....... ....... . . . ...
...... ......
k
...... .
Figure 6
Now we choose α0 ≥ 1 such that, for exactly s1 + s2 vector fields Vδ with even δ, the corresponding vector fields Vδ (α0 ) have sources or sinks and, for the remaining s3 + s4 vector fields Vδ with even δ, the vector fields Vδ (α0 ) have foci. Step 2. Sources or sinks. To distinguish between sources and sinks, foci-sources and foci-sinks, we choose suitable numbers εkl , k + l ≤ d + 1, to prescribe the signs of the traces of the vector fields Vδ (α0 ) with even δ. For any point (2i + 1, 2j + 1) ∈ T , there exists only one even triangle of type 1 with a vertex at this point. Hence, by Proposition 5.7, one can prescribe the sign of the trace of the singular point of the corresponding elementary vector field choosing ε2i+1, 2j +1 independently of all other numbers εkl . To establish the other numbers εkl , we apply the inductive procedure from [S2, Lemma 5.3]. Consider even triangles of type 2. Introduce the set K of all the vertices of these triangles. For any even triangle δ of type 2, we denote by v(δ) ∈ K its vertex contained in the interior of the convex hull of δ and the origin. Clearly v(δ ) = v(δ ) if δ = δ , and we write δ = δ(v) for v = v(δ). In any even triangle δ of type 2, let us mark each edge [v , v ] such that v = v(δ), and let us orient this edge from v to v . Consider the oriented graph E, whose vertex set is K and whose arcs are all the marked edges. Let us show that E does not have oriented cycles. Indeed, take in E the shortest cycle oriented clockwise, and let v = (k, l) be a vertex of this cycle having the minimal slope l/k. Then v belongs to two arcs [v, v ], [v , v], and we see that [v , v] must intersect the triangle δ(v) and that [v, v ] ⊂ δ(v) (see Figure 6). Similarly, we prohibit cycles oriented counterclockwise. Thus, the orientation of E defines a partial order in K. Define the order N(v) of v ∈ K to be the maximal path
26
ITENBERG AND SHUSTIN
length from one of the minimal points to v, and put Kr = {v ∈ K | N(v) = r}. Now, we establish the sign distribution by the decreasing induction on the point order. Let us fix the signs of all integral points of T which do not belong to K. Assume that the signs of points of K of order greater than or equal to r are already fixed. According to Proposition 5.7, we provide the prescribed signs of traces of singular points for elementary vector fields with Newton triangles δ(v), v ∈ Kr−1 , by setting suitable εkl at v ∈ Kr−1 . Finally, note that since no v ∈ Kr−1 belongs to triangles δ(w), N (w) ≥ r, the previous operation has no influence on these triangles. Step 3. Gluing. The final stage is the gluing of the elementary vector fields obtained into a vector field of degree d. The only thing we need to verify is that there exists a piecewise-linear convex function ν : T → R whose domains of linearity coincide with the triangles of the triangulation D. This is proved in the following subsection. As a result of the gluing, we obtain a vector field of degree d with si singular points of type Si , i = 1, . . . , 5. 6.4. Convexity of the triangulation D. A triangulation of T is convex if there exists a piecewise-linear convex function ν : T → R whose domains of linearity coincide with the triangles of the triangulation. Proposition 6.4.1. The triangulation D of T is convex. Proof. We define the required function ν inductively. Lemma 6.4.2. Denote by U the square conv((0, 0), (2, 0), (2, 2), (0, 2)). Let a00 , a10 , a20 , a21 , and a22 be real numbers such that a00 + a20 > 2a10 ,
a20 + a22 > 2a21 .
Then, (a) there exists a piecewise-linear convex function ν : U → R such that ν ((0, 0)) = a00 ,
ν ((1, 0)) = a10 ,
ν ((2, 1)) = a21 ,
ν ((2, 0)) = a20 ,
ν ((2, 2)) = a22 ,
whose domains of linearity are the triangles as shown in Figure 7(a); (b) there exists a piecewise-linear convex function ν : U → R such that ν ((0, 0)) = a00 ,
ν ((1, 0)) = a10 ,
ν ((2, 1)) = a21 ,
ν ((2, 0)) = a20 ,
ν ((2, 2)) = a22 ,
whose domains of linearity are the triangles as shown in Figure 7(b).
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS .....
.....
.......... ..... ..... ................ ..... ......... ......... ... ... ..... ......... ... ...... ......... ......... ....... ..... ... ...... ....... .... ... ..... ....... .... ... ...... ............ ..... ........... ... .. ..... ... .... ..... ..... ..... ..... ..... ..... ... ..... . ..... .. . . ..... ... ... . . . . ..... ... . ..... .. ..... ........ ........ .......... .
... ............ ..... ............. ..... ... ...... .... . ... ..... . . . .. ... ...... ..... ... ...... .... . . .. . ... . .... ... ......... . . . . . .......... ..... ... ..... ............. ..... ..... ..... ........ ..... ... ..... ........ . ..... ........ ....... ......... ..... ..... . ....... ..... .. ..... ....... .... ... ..... ....... ....... ..... ............ ..... ..... .
. ......
7(a)
27
. ......
7(b)
.....
.....
........ ... ........ ... ......... ....... ... ....... ... ....... ... ....... ....... ... ....... ... ....... ... ....... ... .. ... .... ... ..... ..... ... . . . . ... ..... . . ... . ... ..... ... ......... ........ .
.. ............. ..... ............ ... ..... ..... ..... ... ...... . . . .. ... ..... ..... ... ...... ..... . ... ...... . . ... ..... . ... . . . .. ... ........... ............. ... ..... ........ ... ..... ........ ... ..... ........ ... ....... ..... . ....... ..... ....... ..... ..... ....... .. ..... ....... .. ..... ....... .
...... .
7(c)
...... .
7(d) Figure 7
Proof. (a) Let us choose a value ν (0, 2) such that the point (0, 2, ν ((0, 2))) in is close from above to the plane containing the points (1, 0, ν ((1, 0))), (2, 0, ν ((2, 0))), and (2, 1, ν ((2, 1))). More explicitly, we take R3
a02 = ν ((0, 2)) = 2a10 + 2a21 − 3a20 + F02 , where F02 is a positive sufficiently small number. The projection of the lower part of the polyhedron conv (0, 0, a00 ), (1, 0, a10 ), (2, 0, a20 ), (2, 1, a21 ), (2, 2, a22 ), (0, 2, a02 ) to the xy-coordinate plane gives the triangulation of U as shown in Figure 7(c). We should now subdivide the triangles of area greater than 1/2. To subdivide the triangle of area 3/2, we choose ν ((1, 1)) in such a way that the point (1, 1, ν ((1, 1))) is very close from below to the plane containing (1, 0, ν ((1, 0))), (2, 1, ν ((2, 1))), and (0, 2, ν ((0, 2))). Namely, we take a11 = ν ((1, 1)) = a10 /3 + a21 /3 + a02 /3 − F11 , where F11 ! F02 is a positive sufficiently small number. Now, to subdivide the triangles of area 1, we can take a01 = ν ((0, 1)) = a00 /2 + a02 /2 − F01 ,
28
ITENBERG AND SHUSTIN
where F01 is a positive sufficiently small number, and a12 = ν ((1, 2)) = a22 /2 + a02 /2 − F12 , where F12 is a positive sufficiently small number. It remains to extend ν to be linear on each triangle of the triangulation of U as shown in Figure 7(a). (b) We can take a01 = ν ((0, 1)) = 2a10 + a21 − 2a20 − F01 , where F01 is a positive sufficiently small number, a12 = ν ((1, 2)) = a10 + 2a21 − 2a20 − F12 , where F12 = F01 , and a sufficiently large a02 = ν ((0, 2)). The projection of the lower part of the polyhedron conv (0, 0, a00 ), (1, 0, a10 ), (2, 0, a20 ), (2, 1, a21 ), (2, 2, a22 ), (1, 2, a12 ), (0, 2, a02 ), (0, 1, a01 ) , to the xy-coordinate plane, gives the triangulation of U as shown in Figure 7(d). We can now subdivide the triangle of area 3/2 taking a11 = ν ((1, 1)) = a20 /3 + a01 /3 + a12 /3 − F11 , where F11 ! F01 is a positive sufficiently small number. Then, it remains to extend ν to be linear on each triangle of the triangulation of U as shown in Figure 7(b). Now, we can define a function ν˜ : T → R with the following properties: (1) piecewise-linear; (2) convex on each square Uij = conv (2i, 2j ), (2i + 2, 2j ), (2i + 2, 2j + 2), (2i, 2j + 2) (where i, j ≥ 0, i + j ≤ d/2 − 2) and on each quadrangle conv (2i, d − 2i − 2), (2i + 2, d − 2i − 2), (2i + 2, d − 2i − 1), (2i, d − 2i + 1) (where 0 ≤ i ≤ d/2 − 1); (3) the domains of linearity of ν˜ coincide with the triangles of the triangulation D. The function ν˜ can be defined using Lemma 6.4.2 for the squares Uij = conv (2i, 2j ), (2i + 2, 2j ), (2i + 2, 2j + 2), (2i, 2j + 2) , considering them in the following order: Uij ≺ Ui j , if i > i or if i = i and j < j . The precise procedure we use is as follows. First, we define ν˜ in the triangle conv (d + 1, 0), (d, 1), (d, 0)
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
29
as an arbitrary linear function, and we extend the restriction of this linear function on the interval [(d, 0), (d + 1, 0)] to a convex piecewise-linear function on [(0, 0), (d + 1, 0)] with the intervals of linearity coinciding with the intervals [(i, 0), (i + 1, 0)] (i = 0, . . . , d). Then, we define ν˜ step-by-step in the strips conv (2i, 0), (2i + 2, 0), (2i + 2, d − 2i − 1), (2i, d − 2i + 1) , i = d/2 − 1, . . . , 0. At the step, where ν˜ is already defined in conv (2i + 2, 0), (d + 1, 0), (2i + 2, d − 2i − 1) , we use Lemma 6.4.2 to define the function ν˜ in the squares Uij (where j = 0, . . . , d/2− i − 2) and, then, in the pentagon conv (2i, d − 2i − 2), (2i + 2, d − 2i − 2), (2i + 2, d − 2i − 1), (2i + 1, d − 2i), (2i, d − 2i) . To define the function ν˜ in the pentagons, we use a version of Lemma 6.4.2 in which we ignore the triangle conv((1, 2), (2, 1), (2, 2)). After fixing the values of ν˜ in the pentagon conv (2i, d − 2i − 2), (2i + 2, d − 2i − 2), (2i + 2, d − 2i − 1), (2i + 1, d − 2i), (2i, d − 2i) , we take ν˜ ((2i, d − 2i + 1)) equal to a sufficiently large number and extend ν˜ linearly to the triangle conv (2i, d − 2i), (2i + 1, d − 2i), (2i, d − 2i + 1) . Now we take ν = ν1 + ν2 + F ν˜ , where ν1 and ν2 are defined in Subsection 6.1 and where F is a positive sufficiently small number. 6.5. Gluing. Let us sum up. We glue the compatible vector fields constructed in Subsection 6.3 (using the refinement procedure applied to the block vector fields as described in Subsection 6.2) into the vector field of degree d: x˙ = α0 Akl εkl x k y l−1 t ν(k, l) , k+l≤d+1, l≥1
y˙ =
εkl x k−1 y l t ν(k, l) ,
k+l≤d+1, k≥1
where ν is the function constructed in Subsection 6.4. By Corollary 1.4.2, for a sufficiently small t > 0, the latter vector field has si singular points of type Si , i = 1, . . . , 5. This completes the proof of Theorem 3.4 in the case s0 = 0, with even d.
s1 + s2 + s3 + s4 − s5 = −d
30
ITENBERG AND SHUSTIN .....
d +1 d
...... ............ ............ ................. ............... ............... ......... ...... ... ....... ...... ... ....... ...... ..... ... ...... ..... ..... ..... .... .......... .............. ..... ... ...... ........ ..... ... .. ... ... ..... ..... .. ... ... ... ..... ..... ... ... ... ... ..... ..... ... ... ... ... ..... ....... ... ... .. ..... ...... ... ... ... ..... ..... .. .. ... ..... ..... ... ... ... ....... ..... ... ... .. ........ ..... ... ... ... .. ..... ..... ... ... ... ... ..... ..... .. ...... ... ..... ........ ...... .. ..... ........ ...... ... ..... ..... ........ ..... ... ..... ..... ...... .. ..... ..... ... ..... . ..... ...... .. ... ...... ..... ..... ... ..... ...... ... .... ........ ..... ............. .. ..... .............. .. ..... ............ .. ..... .............. .. ..... .............. ..... .......... ... .
d
...... .
d +1
Figure 8
6.6. Case of odd d. Assume now that d is odd. We modify the previous reasoning as follows. In the triangle T = conv (0, 0), (d, 0), (0, d) we define all the objects as before. The strip conv (d, 0), (d + 1, 0), (0, d + 1), (0, d) is divided into the triangles (see Figure 8) conv (0, d + 1), (i, d − i), (i + 1, d − i − 1) , conv (d, 0), (i, d + 1 − i), (i + 1, d − i) ,
i = 0, . . . , d − 1, i = 0, . . . , d.
The chosen triangulation of T is convex. This easily follows from the fact that the triangulation D of T is convex. All the new triangles are either odd or even of type 2. Thus, we can naturally extend the construction used in the case of even d. 7. Thin topological classification: Other cases. To cover other cases in Theorem 3.4, we modify the above procedure to get (i) the total index s1 + s2 + s3 + s4 − s5 between −d and 1, (ii) the total index s1 + s2 + s3 + s4 − s5 between zero and d, (iii) the prescribed number of imaginary singular points. We show which elements in the main algorithm should be changed but do not describe in detail all possible particular cases. All the cases can be covered by a combination of the techniques presented in this section.
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
31
The difference between cases (i) and (ii) comes from the fact that a generic polynomial gives a Hamiltonian vector field with a total index less than or equal to 1. So, in the main case treated in Section 6 and in case (i), we basically use vector fields close to Hamiltonian ones. In case (ii), we have to involve non-Hamiltonian vector fields. Remark. The restriction (3.5) appears in our construction, because we cannot always coordinate Hamiltonian and non-Hamiltonian vector fields in the gluing procedure. 7.1. How to get the total index −d < s1 + s2 + s3 + s4 − s5 ≤ 1 for odd d and −d < s1 + s2 + s3 + s4 − s5 < 0 for even d. We have to replace up to (d + 1)/2 saddles by foci in the main construction. (How to obtain sinks or sources from foci is explained in Section 6.) We modify the elementary vector fields corresponding to the triangles adjacent to the side [(d + 1, 0), (0, d + 1)] of the Newton triangle T . If d is odd, we replace the elementary vector fields with Newton triangles σi = conv (d, 0), (d + 1 − 2i, 2i), (d + 2 − 2i, 2i − 1) , σi = conv (d, 0), (d + 2 − 2i, 2i − 1), (d + 3 − 2i, 2i − 2) (giving two saddles) by a vector field close to the Hamiltonian one coming from the polynomial Fi (x, y) = x d + x d+1−2i y 2i + bi x d+2−2i y 2i+1 + x d+3−2i y 2i+2 with the Newton triangle σi ∪ σi . This new vector field gives (for an appropriate choice of bi ) a saddle and a focus due to the following statement. Proposition 7.1.1. If (7.1.2)
4 > bi2 >
16i(i − 1) , (2i − 1)2
then Fi (x, y) has one extremum in (R∗ )2 . The value of Fi,xy (x, y) at that extremum changes its sign when substituting −bi for bi . Proof. It is shown in [S2, Lemma 3.5.2] that, under condition (7.1.2), the polynomial Fi (x, y) has one extremum (R∗ )2 . The change in sign of bi is equivalent to the coordinate transformation (x, y) → (x, −y), which, clearly, moves the extremum to another quadrant and changes the sign of Fi,xy at this point. If d is even, we replace the elementary vector fields with Newton triangles σi = conv (2i − 1, d + 2 − 2i), (2i, d + 1 − 2i), (2i, d − 2i) , σi = conv (2i, d + 1 − 2i), (2i + 1, d − 2i), (2i, d − 2i) (giving a saddle and a focus) by the following vector field.
32
ITENBERG AND SHUSTIN
Proposition 7.1.3. For any integer i ∈ [1, d/4] and any k = 3, 4, there exist a, b ∈ R such that the vector field x˙ = −(d + 2 − 2i)x 2i−1 y d+1−2i − (d − 2i)x 2i+1 y d−1−2i −(d − 2i)x 2i y d−1−2i − ax 2i y d−2i , Vi (a, b) : y˙ = (2i − 1)x 2i−2 y d+2−2i + (2i + 1)x 2i y d−2i +2ix 2i−1 y d−2i + bx 2i−1 y d+1−2i , with the Newton triangle σi ∪ σi , has two foci of type Sk in (R∗ )2 . Proof. The field V (0, 0) is Hamiltonian with the integral H (x, y) = x 2i−1 y d+2−2i + x 2i+1 y d−2i + x 2i y d−2i , which has two extrema (x ∗ , y ∗ ) and (x ∗ , −y ∗ ) in (R∗ )2 according to [S2, Section 3.3]. Hence for a, b close to zero, V (a, b) has, in a neighborhood of (x ∗ , y ∗ ), (x ∗ , −y ∗ ), two foci with the trace (d + 1 − 2i)b − 2ia (x ∗ )2i−1 (y ∗ )d−2i + O(a 2 + b2 ) at each point. 7.2. How to get the total index 3 ≤ s1 + s2 + s3 + s4 − s5 ≤ d for odd d and 0 ≤ s1 +s2 +s3 +s4 −s5 ≤ d for even d. In the case of odd d, we divide the triangle T into the triangle T = conv (0, 0), (d, 0), (0, d) and the band
T = conv (0, d), (d + 1, 0), (0, d + 1), (d, 0) .
Subdivide the band T into the elementary triangles σi = conv (d + 1, 0), (d − i, i), (d − i − 1, i + 1) , i = 0, . . . , d − 1, σi = conv (0, d), (d + 1 − i, i), (d − i, i + 1) , i = 0, . . . , d. Take block vector fields of the main construction in T , vector fields close to Hamiltonian ones (all having foci) with Newton triangles σi , and non-Hamiltonian vector fields with Newton triangles σi constructed inductively using the following statement. Proposition 7.2.1. For any nonzero real numbers a, b1 , b2 such that b1 /b2 > 1, for any positive integer i < d, and for any j = 1, . . . , 5, there exist nonzero real numbers c1 and c2 such that c1 /c2 > 1 and the vector field V : x˙ = b1 x i+1 y d−i−1 + c1 x i y d−i + ay d−1 , y˙ = b2 x i y d−i + c2 x i−1 y d+1−i , with Newton triangle conv((0, d), (i +1, d −i), (i, d +1−i)), has a singular point of type Sj in (R∗ )2 .
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
33
Proof. The proof has straightforward calculations. If i = d in Proposition 7.2.1, a corresponding vector field with Newton triangle conv((0, d), (d + 1, 0), (d, 1)) cannot have a focus. It produces the restriction (3.5) in our construction. Now let d be even. We again divide the triangle T into the triangle T = conv (0, 0), (d, 0), (0, d) and the band T = conv (0, d), (d + 1, 0), (0, d + 1), (d, 0) . The band T is subdivided into the elementary triangles σi = conv (d + 1, 0), (d − i, i), (d − i − 1, i + 1) , i = 0, . . . , d − 1, σi = conv (0, d), (d + 1 − i, i), (d − i, i + 1) , i = 0, . . . , d. Take in T block vector fields of the construction described above in the case of odd d with the total index equal to d. Then, take vector fields compatible with those already taken with Newton triangles σi (these vector fields are non-Hamiltonian and have saddles), and take non-Hamiltonian vector fields with Newton triangles σi constructed inductively using Proposition 7.2.1. In this construction, we have s1 + s2 ≥ 2. 7.3. How to obtain imaginary singular points. The idea is to use the vector fields described in the following four statements. Proposition 7.3.1. For any integers k ≥ 2, 0 ≤ s ≤ k/2, and any real a, b = 0, there exists a nondegenerate vector field j
i+j ≤k−1
Bij x i y j ,
i
with Newton triangle θ = conv((k + 1, 0), (0, k), (0, 0)), having s saddles and s foci of prescribed types in (R∗ )2 and having k(k − 1) − 2s nondegenerate singular points in (C∗ )2 \(R∗ )2 . Proof. Let k ≥ 3 be odd. We divide the triangle θ into the quadrangle θ0 = conv (2, 0), (k + 1, 0), (0, k), (0, 2s + 1) and the triangles
34
ITENBERG AND SHUSTIN .....
...
k 2s + 1
..... .. ..... .. ..... .. ..... ..... .. ..... ..... ..... .. ..... ..... ..... ..... .. ..... ..... ..... ..... .. ..... ..... ..... .. ..... ..... ..... ..... .. ..... ..... ..... ..... .. ..... ..... ..... ..... .. ..... ..... ..... ... .. ..... ... ..... ..... ..... .... .. ..... ..... ..... ..... ...... ..... .. ...... ..... ..... . ..... ..... .. ..... ...... ..... ..... . ..... ... ........ .. ..... ... ...... ..... ..... ... ...... ..... .. ..... ... ...... ..... ..... ... ..... .. ..... ... ..... ..... ... ... ... ... ..... ... ... ... ..... .... ..... ... ... ... ..... ....... ...... . . . ..... ........... .... .... .... ..... ...... ........... ... ... ... ..... . ............. ... ... ... ..... ..... ....... ...... ...... ..... ... ..... ...... ..... ...... ... ..... .... ...... ............ ... ..... ... ... .... ........... ... ..... . . . ... ... ...... ... ..... ..... . ..... ..... . ..... .. ..... ..... ..... .... ..... ..... ..... ..... ... ... ... .. ..... ..... .. ... ... ... ..... .... ..... ... ... .. .. ..... ... ..... .. ... ... ... ..... . ..... ... .. ... .. ..... .... ........ ... .. ... ..... .. ....... ... ... .. ..... .. ...... ... ...... .......... . ....... ..... ... ...... ..... ....... ......... ..... ... .... ....... . ........ . . . ..... ... ..... ....... ..... . ..... .. ..... ....... ......... . . . . ....... ....... ....... ...................... ....... ....... ........... .......... ....... ........... ..... ................ ...... ............ ... ...
k +1
2
. ......
Figure 9
(7.3.2)
θ1 = conv (2, 0), (0, 1), (0, 0) , (1) θ2i = conv (0, 2i − 1), (2, 0), (1, i) , (1) θ2i+1 = conv (2, 0), (0, 2i + 1), (1, i) , (2) θ2i = conv (0, 2i − 1), (1, i), (0, 2i) , (2) θ2i+1 = conv (0, 2i + 1), (1, i), (0, 2i) , k −1 i = 1, . . . , 2
(see Figure 9). We intend to glue Hamiltonian vector fields having these Newton polygons. First, consider the polynomial G(x, y) = x k+1 + y k + y 2s+1 + x 2 + x 2 y + x k−1 y, with the Newton polygon θ0 . The derivative Gy (x, y) = ky k−1 + (2s + 1)y 2s + x 2 + x k−1 does not vanish in (R∗ )2 and has Newton polygon θ0 = conv (0, k − 1), (0, 2s), (2, 0), (k − 1, 0) . If G (x, y) is a generic polynomial with Newton polygon θ0 , then the derivative Gy (x, y) has Newton polygon θ0 . Assuming that G (x, y) is sufficiently close to
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS
35
G(x, y), we get that Gy (x, y) does not vanish in (R∗ )2 . Thus, G (x, y) has only imaginary critical points in (C∗ )2 , and, due to its generality, all these critical points are nondegenerate and their number is k(k − 1) − 2s (the maximal possible value for the Newton polygon θ0 ). Now we introduce polynomials with Newton triangles (see (7.3.2)), taking suitable coefficients from G (x, y) and arbitrary nonzero coefficients (2) for the rest of monomials. Observe that the polynomials with Newton triangles θ1 , θm , ∗ 2 m = 2, . . . , k, do not have critical points in (C ) , each of the polynomials with (1) Newton triangles θ2i , i = 1, . . . , (k − 1)/2, has a saddle in (R∗ )2 , and each of the (1) polynomials with Newton triangles θ2i+1 , i = 1, . . . , (k − 1)/2, has an extremum in (R∗ )2 . Varying the glued vector field with Newton triangle θ, we obtain a vector field with s foci, s saddles, and k(k − 1) − 2s imaginary nondegenerate singular points. Changing signs of coefficients of 1, x, . . . , x (k−1)/2 in the expression for x, ˙ we distinguish between sink-foci and source-foci by (5.10) and (5.11). Let k ≥ 2 be even. Then we repeat the previous procedure for the subdivision into the polygons θ0 = conv (0, 2), (k + 1, 0), (0, k), (2s + 1, 0) , θ1 = conv (0, 2), (1, 0), (0, 0) , (1) θ2i = conv (2i − 1, 0), (0, 2), (i, 1) , (2) θ2i = conv (2i − 1, 0), (i, 1), (2i, 0) ,
(1) θ2i+1 = conv (0, 2), (2i + 1, 0), (i, 1) , (2) θ2i+1 = conv (2i+ 1, 0), (i, 1), (2i, 0) , k i = 1, . . . , . 2 Proposition 7.3.3. For any integers i > 0, j ≥ 0 and for real nonzero numbers a, b, c, there exists q = 0 such that the polynomial F (x, y) = x i y j ax + by + cxy + qy 2 has two imaginary nondegenerate critical points in (C∗ )2 .
Proof. The Newton parallelogram conv((i + 1, j ), (i, j + 1), (i + 1, j + 1), (i, j + 2)) of F is of area 1. Hence F has at most two nondegenerate critical points in (C∗ )2 . From the system Fx = Fy = 0, we easily deduce (i + j + 2)cqy 2 + (j + 1)bc + (2i + j + 2)aq y + (i + j + 1)ab = 0. We have to find q such that this equation has two imaginary roots, or, equivalently, its discriminant D(q) = (j + 1)2 b2 c2 − 2 2i 2 + 2ij + j 2 + 4i + 3j + 2 abcq + (2i + j + 2)2 a 2 q 2 is negative. This is possible because D(q) has two real roots, since its discriminant 4i(i + 1)(i + j + 1)(i + j + 2)a 2 b2 c2 is always positive.
36
ITENBERG AND SHUSTIN
Proposition 7.3.4. For any nonzero numbers a, b and for positive integer i ≤ d/2, the polynomial F (x, y) = ax d + ax d+1 + bx d−2i y 2i+1 + bx d y, with Newton triangle conv((d, 0), (d + 1, 0), (d − 2i, 2i + 1)), has 2i imaginary nondegenerate critical points in (C∗ )2 . Proof. It is straightforward from Fy (x, y) = b (2i + 1)x d−2i y 2i + x d = 0,
x, y ∈ R∗ .
Proposition 7.3.5. For any nonzero real numbers a, b1 , and b2 and for any integer 1 < i < d, there exist nonzero real numbers c1 , c2 , d1 , and d2 such that the vector field x˙ = b1 x i+1 y d−i−1 + c1 x i y d−i + d1 x i−1 y d−i+1 + ay d−1 , y˙ = b2 x i y d−i + c2 x i−1 y d+1−i + d2 x i−2 y d+2−i , with Newton triangle conv((0, d), (i +1, d −i), (i −1, d +2−i)), has two imaginary nondegenerate singular points in (C∗ )2 . Proof. We can take arbitrary nonzero numbers c1 , c2 , d1 , and d2 such that c22 − 4b2 d2 < 0. References [An] [AIl] [Be] [Br] [CW] [CL] [E] [GKZ] [Ha] [Hi] [Il1]
E. A. Andronova, Quadratic systems, close to conservative, with four limit cycles, Selecta Math. Sov. 9 (1990), 365–370. V. I. Arnol’d and Ju. S. Il’yashenko, “Ordinary differential equations” in Dynamical Systems, I, Encyclopaedia Math. Sci. 1, Springer-Verlag, Berlin, 1988, 1–148. D. Bernstein, The number of roots of a system of equations, Funct. Anal. Appl. 9 (1975), 183–185. A. D. Bruno, Local Methods in Nonlinear Differential Equations, Springer Ser. Soviet Math., Springer-Verlag, Berlin, 1989. L. S. Chen and M. S. Wang, The relative position and number of limit cycles of the quadratic differential systems, Acta Math. Sinica 22 (1979), 751–758. C. J. Christopher and N. G. Lloyd, Polynomial systems: A lower bound for the Hilbert numbers, Proc. Roy. Soc. London Ser. A 450 (1995), 219–224. J. Écalle, J. Martinet, R. Moussu, and J.-P. Ramis, Non-accumulation des cycles-limites, I, C. R. Acad. Sci. Paris Sér. I Math. 304 (1987), 375–377. I. M. Gel’fand, M. M. Kapranov, and A. V. Zelevinsky, Discriminants, Resultants, and Multidimensional Determinants, Math. Theory Appl., Birkhäuser, Boston, 1994. Ph. Hartman, Ordinary Differential Equations, 2d ed., Birkhäuser, Boston, 1982. D. Hilbert, Mathematische Probleme, Arch. Math. Phys. 1 (1901), 213–137. Ju. S. Il’yashenko, The appearance of limit cycles under a perturbation of the equation dw/dz = −Rz /Rw , where R(z, w) is a polynomial (in Russian), Mat. Sb. (N.S.) 78 (1969), 360–373.
SINGULAR POINTS AND LIMIT CYCLES OF VECTOR FIELDS [Il2] [I1] [I2]
[IS] [IV] [Kh1] [Kh2] [O] [P] [R] [Sh] [S1] [S2] [S3]
[St] [V1]
[V2]
[V3] [V4] [Z]
37
, Finiteness Theorems for Limit Cycles, Transl. Math. Monogr. 94, Amer. Math. Soc., Providence, 1991. I. Itenberg, Contre-exemples à la conjecture de Ragsdale, C. R. Acad. Sci. Paris Sér. I Math. 317 (1993), 277–282. , “Counter-examples to Ragsdale conjecture and T -curves” in Real Algebraic Geometry and Topology (East Lansing, Mich., 1993), Contemp. Math. 182, Amer. Math. Soc., Providence, 1995, 55–72. I. Itenberg and E. Shustin, Newton polygons and singular points of real polynomial vector fields, C. R. Acad. Sci. Paris Sér. I Math. 319 (1994), 963–968. I. Itenberg and O. Viro, Patchworking algebraic curves disproves the Ragsdale conjecture, Math. Intelligencer 18 (1996), 19–28. A. G. Khovanski˘ı, Index of a polynomial vector field, Funct. Anal. Appl. 13 (1979), 38–45. , Boundary indices of polynomial 1-forms with homogeneous components (in Russian), Algebra i Analiz 10 (1998), 193–222. N. F. Otrokov, On the number of limit cycles of differential equations in a neighbourhood of a singular point (in Russian), Mat. Sb. (N.S.) 34 (1954), 127–144. I. A. Pushkar’, A multidimensional generalization of Il’yashenko’s theorem on abelian integrals, Funct. Anal. Appl. 31 (1997), 100–108. J.-J. Risler, Construction d’hypersurfaces réelles (d’après Viro), Astérisque 216 (1993), 69–86, Séminaire Bourbaki 1992/93, exp. no. 763. S. L. Shi, A concrete example of the existence of four limit cycles for plane quadratic systems, Sci. Sinica 23 (1980), 153–158. E. Shustin, Real plane algebraic curves with prescribed singularities, Topology 32 (1993), 845–856. , Surjectivity of the resultant map: A solution to the inverse Bezout problem, Comm. Algebra 23 (1995), 1145–1163. , “Critical points of real polynomials, subdivisions of Newton polyhedra and topology of real algebraic hypersurfaces” in Topology of Real Algebraic Varieties and Related Topics, Amer. Math. Soc. Transl. Ser. 2 173, Amer. Math. Soc., Providence, 1996, 203–223. B. Sturmfels, Viro’s theorem for complete intersections, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 21 (1994), 377–386. O. Ya. Viro, “Gluing of algebraic hypersurfaces, smoothing of singularities and construction of curves” (in Russian) in Proceedings of the Leningrad International Topological Conference (Leningrad, 1982), Nauka, Leningrad, 1983, 149–197. , “Gluing of plane real algebraic curves and constructions of curves of degrees 6 and 7” in Topology (Leningrad, 1982), Lecture Notes in Math. 1060, Springer-Verlag, Berlin, 1984, 187–200. , Real plane algebraic plane curves: Constructions with controlled topology, Leningrad Math. J. 1 (1990), 1059–1134. , Patchworking real algebraic varieties, U.U.D.M. Report 1994:42, Uppsala University, 1994. H. Zoładek, ¸ Eleven small limit cycles in a cubic vector field, Nonlinearity 8 (1995), 843– 860.
Itenberg: Institut de Recherche Mathématique de Rennes, Centre National de la Recherche Scientifique, Campus de Beaulieu, F-35042 Rennes Cedex, France; ilia.itenberg @univ-rennes1.fr Shustin: School of Mathematical Sciences, Tel Aviv University, Ramat Aviv, Tel Aviv 69978, Israel;
[email protected]
Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES FABIENNE PROSMANS and JEAN-PIERRE SCHNEIDERS
Contents 0. 1. 2. 3. 4. 5. 6. 7.
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The functor IB : ᐀c → Ᏽnd(Ꮾan) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ˆ in Ᏽnd(Ꮾan) . . . . . . . . . . . . . . . . . . . . . . . . Some acyclicity results for L and ⊗ A topological version of Cartan’s Theorem B . . . . . . . . . . . . . . . . . . . . . . . . . . . . A factorization formula for IB(ᏻX×Y ) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Poincaré duality for IB(ᏻX ) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A holomorphic Schwartz kernel theorem. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Reconstruction theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39 45 52 62 67 70 79 82
0. Introduction. In algebraic analysis, one represents systems of analytic linear partial differential equations on a complex analytic manifold X by modules over the ring ᏰX of linear partial differential operators with analytic coefficients. Using this representation, the holomorphic solutions of the homogeneous system associated to the ᏰX -module ᏹ correspond to Ᏼom ᏰX ᏹ, ᏻX , where ᏻX denotes the ᏰX -module of holomorphic functions. If one wants also to take into consideration the compatibility conditions, then one has to study the full solution complex ol(ᏹ) = R Ᏼom ᏰX ᏹ, ᏻX in the derived category D + (CX ) of sheaves of C-vector spaces. In [6] (see also [9]), it was shown that the functor ol induces an equivalence between the derived category formed by the bounded complexes of regular holonomic ᏰX -modules and that formed by the bounded complexes of C-constructible CX -modules. This equivalence is usually called the Riemann-Hilbert correspondence. One of its corollaries is that it is possible to reconstruct a complex of regular holonomic ᏰX -modules from its complex of holomorphic solutions. Our aim in this paper is to extend this reconstruction theorem to perfect complexes of Ᏸ∞ X -modules by taking into account the natural topology of the complex of Received 2 April 1999. Revision received 14 May 1999. 1991 Mathematics Subject Classification. Primary 32C38, 46M20, 35A27; Secondary 32C37, 58G05. 39
40
PROSMANS AND SCHNEIDERS
holomorphic solutions. Informally, the relation we obtain is of the type ᏹ RᏴomtop ol(ᏹ), ᏻX and follows from the fact that
Ᏸ∞ X RᏴomtop ᏻX , ᏻX .
To give a meaning to these formulas, we have to work in the derived category of sheaves with values in the category of ind-objects of the category of Banach spaces using the techniques and results of [18]. For clarification, let us briefly recall the main facts concerning quasi-abelian homological algebra and sheaf theory established in [18]. The central notion is that of a quasi-abelian category (i.e., an additive category with kernels and cokernels such that the pushforward (resp., the pullback) of a kernel (resp., a cokernel) is still a kernel (resp., a cokernel)). Let Ᏹ be such a category. A morphism of Ᏹ is said to be strict if its coimage is canonically isomorphic to its image, and a complex d k−1
dk
· · · −→ X k−1 −−−−→ Xk −−→ X k+1 −→ · · · of Ᏹ is said to be strictly exact in degree k if d k−1 is strict and ker d k = im d k−1 . Localizing the triangulated category K(Ᏹ) of complexes “modulo homotopy” by the null system formed by the complexes that are strictly exact in every degree gives us the derived category D(Ᏹ). This category has two canonical t-structures. Here we only use the left one. Its heart ᏸᏴ(Ᏹ) is formed by the complexes of the form d −1
0 −→ X−1 −−−→ X0 −→ 0, where d −1 is a monomorphism. The cohomology functor LH k : D(Ᏹ) −→ ᏸᏴ(Ᏹ) sends the complex X· to the complex 0 −→ coim d k−1 −→ ker d k −→ 0 with ker d k in degree 0. In [18], it was shown that in most problems of homological algebra and sheaf theory, one may replace the quasi-abelian category Ᏹ by the abelian category ᏸᏴ(Ᏹ) without loosing any information. It was also shown there that if Ᏹ is elementary (i.e., if it has a small strictly generating set formed by tiny projective objects), then the sheaves with values in Ᏹ share most of the usual properties of sheaves of abelian groups (including Poincaré-Verdier duality). Moreover, if Ᏹ has a closed structure given by an internal tensor product and an internal Hom functor satisfying some natural assumptions, then the Künneth theorem holds for sheaves with values in Ᏹ.
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
41
Let us now introduce the quasi-abelian categories that we use in this paper and fix our notation. Following [15], we let Ꮾan (resp., Ᏺr, ᐀c) denote the quasi-abelian category of Banach spaces (resp., Fréchet spaces, arbitrary locally convex topological vector spaces). Let us recall (see, e.g., [12]) that for any set I , the space l 1 (I ) (resp., l ∞ (I )) of summable (resp., bounded) sequences of C indexed by I is projective (resp., injective) in Ꮾan. Using these spaces, we show easily that Ꮾan has enough injective and projective objects. Recall also that the category Ꮾan has a canonical structure of a closed additive category given by a right exact tensor product ˆ : Ꮾan × Ꮾan −→ Ꮾan ⊗ and a left exact internal Hom L : Ꮾanop × Ꮾan −→ Ꮾan. ˆ L denote the left derived functor of ⊗ ˆ and RL the right derived functor of Letting ⊗ L , we have the adjunction formula ˆ L F, G RHom E, RL(F, G) . RHom E ⊗ Let U, V be two universes such that V U. As usual, let ᏮanU denote the category formed by the Banach spaces that belong to U, and consider the category Ᏽnd V ᏮanU of ind-objects of ᏮanU. Recall that the objects of Ᏽnd V(ᏮanU) are functors E : Ᏽ −→ ᏮanU, where Ᏽ is a V-small filtering category, and recall that if E : Ᏽ −→ ᏮanU,
F : −→ ᏮanU
are two such functors, then HomᏵnd V (ᏮanU ) (E, F ) = lim lim HomᏮanU E(i), F (j ) . ← −− → i∈Ᏽ j ∈
For further details on ind-objects, we refer the reader to classical sources (such as [1], [2]) and to [13]. Following the standard usage and to avoid confusion, we let “lim” E(i) − → i∈Ᏽ
denote the functor E : Ᏽ → ᏮanU considered as an object of Ᏽnd V(ᏮanU). Similarly, we let “X” denote the ind-object associated to the U-Banach space X. In other words,
42
PROSMANS AND SCHNEIDERS
we set “X” = “lim” C(i), − → i∈Ᏽ
where Ᏽ is a one-point category and C : Ᏽ → ᏮanU is the constant functor with value X. Note that in the rest of the paper, we do not make the universes U, V explicit in our notation because this is not really necessary for a clear understanding. Using [18], we see that the category Ᏽnd(Ꮾan) of ind-objects of Ꮾan is an elementary closed quasi-abelian category. It follows that sheaves with values in Ᏽnd(Ꮾan) share most of the usual properties of abelian sheaves (including Künneth theorem and Poincaré-Verdier duality). In Ᏽnd(Ꮾan), the internal tensor product ˆ : Ᏽnd(Ꮾan) × Ᏽnd(Ꮾan) −→ Ᏽnd(Ꮾan) ⊗ and the internal Hom functor op L : Ᏽnd(Ꮾan) × Ᏽnd(Ꮾan) −→ Ᏽnd(Ꮾan) are characterized by
ˆ “lim” Fj = lim lim “Ei ⊗F ˆ j” “lim” Ei ⊗ − → → − → − →− i∈I
and
j ∈J
i∈I j ∈J
L “lim” Ei , “lim” Fj = lim lim “L Ei , Fj ”. − → → ← −− − → i∈I
j ∈J
i∈I j ∈J
The internal tensor product (resp., internal Hom functor, external tensor product) for ˆ ), and we use other ˆ (resp., ᏸ , sheaves with values in Ᏽnd(Ꮾan) is denoted by ⊗ notations of sheaf theory in their usual form. In particular, Rf ! (resp., f ! ) denotes the direct (resp., inverse) image with proper support, and ωX (resp., D(·) = Rᏸ (·, ωX )) denotes the Poincaré-Verdier dualizing complex (resp., functor). Let us now describe with some details the content of this paper. In the first section, we study the functor IB : ᐀c → Ᏽnd(Ꮾan) defined by setting , IB(E) = “lim” E − → B B∈ᏮE
where ᏮE is the set of absolutely convex bounded subsets of E and EB is the linear hull of B. We establish the properties of this functor, which we need in the rest of the paper. More precisely, we prove that if E is bornological and F is complete, then HomᏵnd(Ꮾan) IB(E), IB(F ) Hom᐀c (E, F ) and
IB L b (E, F ) L IB(E), IB(F ) .
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
43
Here L b (E, F ) is the vector space Hom᐀c (E, F ) endowed with the system of seminorms {pB : p-continuous seminorm of F, B-bounded subset of E}, where
pB (h) = sup p h(e) . e∈B
Moreover, we show that IB is compatible with projective limits of filtering projective systems of complete spaces. We also show its compatibility with complete inductive limits of injective inductive systems of Fréchet spaces indexed by N. ˆ in The second section is devoted to the proof of some acyclicity results for L and ⊗ Ᏽnd(Ꮾan). First we show that if E is a Dual Fréchet Nuclear (DFN) space and if F is a Fréchet space, then both LH k (RHom(IB(E), IB(F ))) and LH k (RL(IB(E), IB(F ))) are 0 for k = 0. (Note that a related result was obtained for the category ᐀c by Palamodov in [10].) Next we establish that if E and F are objects of Ᏽnd(Ꮾan) with E nuclear, then (∗)
L
ˆ F E ⊗F. ˆ E⊗
We begin Section 3 by proving that if X is a topological space with a countable basis and if F is a presheaf of Fréchet spaces on X that is a sheaf of vector spaces, then U −→ IB F (U ) (U open of X) is a sheaf with values in Ᏽnd(Ꮾan). This shows, in particular, that IB(ᏻX ) is a sheaf with values in Ᏽnd(Ꮾan) for any complex analytic manifold X. We end the section by establishing that R U, IB ᏻX U, IB ᏻX if U is an open subset of X such that H k (U, ᏻX ) 0 (k = 0). This result may be viewed as a topological version of Cartan’s Theorem B. As a corollary, if X is a Stein manifold, we get a similar isomorphism with U replaced by any holomorphically convex compact subset of X. In Section 4, using (∗), we show that L ˆ IB ᏻY IB ᏻX×Y IB ᏻX for any complex analytic manifolds X and Y . This allows us to obtain a topological Künneth theorem for holomorphic cohomology. Section 5 is devoted to the proof that for any complex analytic manifold X of dimension dX , the Poincaré dual of IB(ᏻX ) is isomorphic to IB(!X )[dX ]. Since the problem is of local nature, by a series of reductions using the results established in the previous sections, we find that it is sufficient to show that if P is a closed interval of C and V is an open interval of Cn , then R P ×V C × V , IB ᏻC×V L IB ᏻC (P ) , IB ᏻV (V ) [−1].
44
PROSMANS AND SCHNEIDERS
This isomorphism is obtained by proving that, in this situation, one has a split exact sequence of the form 0 −→ ᏻC×V (C × V ) −→ ᏻC×V (C \ P ) × V −→ L b ᏻC (P ), ᏻV (V ) −→ 0 in ᐀c. We begin Section 6 by giving the general form of the kernels of continuous cohomological correspondences between sheaves of holomorphic differential forms. More precisely, we show that if X, Y are complex analytic manifolds of dimension dX , dY , then −1 r ! (dX −r,s) [dX ] Rᏸ qX IB !X , qY IB !sY . IB !X×Y As a consequence, we find that for any morphism of complex analytic manifolds f : X → Y , we have a canonical isomorphism (0,d ) Rᏸ f −1 IB ᏻY , IB ᏻX δf−1 R)f IB !X×YY [dY ], where )f is the graph of f in X × Y and δf : X → X × Y is the associated graph embedding. In particular, LH k Rᏸ f −1 IB ᏻY , IB ᏻX =0 for k = 0 and R Ᏼom f −1 IB ᏻY , IB ᏻX Ᏸ∞ X→Y .
(∗∗)
This contains the fact that continuous endomorphisms of ᏻX may be identified with partial differential operators of infinite order, as conjectured by Sato and then established by Ishimura in [5]. We begin the last section by proving an abstract reconstruction theorem for perfect complexes of modules over a ring in the closed category hv(X; Ᏽnd(Ꮾan)). Because of the embedding functor I˜ᐂ : hv(X; ᐂ) −→ hv X; Ᏽnd(Ꮾan) (where ᐂ denotes the category of C-vector spaces), we are also able to prove a similar formula for perfect complexes of modules over an ordinary sheaf of rings. Using (∗∗) with f = idX , we get a topological reconstruction theorem for Ᏸ∞ X -modules. More precisely, we prove that the functors Rᏸ I˜ᐂ (Ᏸ∞ ) I˜ᐂ (·), IB ᏻX : D − ᏹod Ᏸ∞ −→ D + hv X; Ᏽnd(Ꮾan) X X
and R Ᏼom ·, IB ᏻX : D − hv X; Ᏽnd(Ꮾan) −→ D + ᏹod Ᏸ∞ X
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
45
are well defined and that R Ᏼom Rᏸ I˜ᐂ (Ᏸ∞ ) I˜ᐂ (ᏹ), IB ᏻX , IB ᏻX ᏹ X
Ᏸ∞ X -modules
for any perfect complex of ᏹ. Note that the image of ᏹ by the first functor above is a kind of topologized version of the holomorphic solution complex of ᏹ, and note that the preceding formula may be viewed as a way to reconstruct a perfect system of analytic partial differential equations of infinite order from its holomorphic solutions. 1. The functor IB : ᐀c → Ᏽnd(Ꮾan). For any object E of ᐀c, we denote by ᏮE the set of absolutely convex bounded subsets of E and by ᏮE the set of closed absolutely convex bounded subsets of E. If B ∈ ᏮE , we let EB denote the seminormed space obtained by endowing the linear hull of B in E with the gauge seminorm pB associated to B. Definition 1.1. To define the functor IB : ᐀c −→ Ᏽnd(Ꮾan), we proceed as follows. For any object E of ᐀c, we set , IB(E) = “lim” E − → B B∈ᏮE
B denotes as usual the completion of EB . Consider a morphism f : E → F where E B → F f (B) . of ᐀c. For any B ∈ ᏮE ,f (B) ∈ ᏮF . Hence f induces a morphism E This morphism being functorial in B, we obtain a morphism −→ “lim” F “lim” E − → B − → f (B) B∈ᏮE
in Ᏽnd(Ꮾan). We define
B∈ᏮE
IB(f ) : IB(E) −→ IB(F )
by composing the preceding morphism with the canonical morphism . −→ “lim” F “lim” F − → B − → f (B) B∈ᏮE
B∈ᏮF
Remark 1.2. If E is a Banach space, then IB(E) “E”. As a matter of fact, because any bounded subset of E is included in a ball b(ρ) centered at the origin, we have IB(E) “lim” Eb(ρ) , − → ρ>0
and the conclusion follows from the isomorphism Eb(ρ) E.
46
PROSMANS AND SCHNEIDERS
Lemma 1.3. Let E and F be two objects of ᐀c. Then lim lim Hom᐀c EB , FB B(E, F ), ← − − → B∈ᏮE B ∈ᏮF
where
B(E, F ) = f : E −→ F : f linear, f (B)-bounded in F if B-bounded in E .
Remark 1.4. If E and F are objects of ᐀c, we have Hom᐀c (E, F ) ⊂ B(E, F ). In general, this inclusion is strict, but, as is well known, it turns into an equality if E is bornological (i.e., if any absolutely convex subset of E that absorbs any bounded subset is a neighborhood of zero). Proposition 1.5. Let E and F be two objects of ᐀c. If E is bornological and F complete, then HomᏵnd(Ꮾan) IB(E), IB(F ) Hom᐀c (E, F ). Proof. Since the inclusion ᏮF ⊂ ᏮF is cofinal, we have HomᏵnd(Ꮾan) IB(E), IB(F ) HomᏵnd(Ꮾan) “lim” EB , “lim” FB − → − → B∈ᏮE
lim ← −
B ∈ᏮF
B . B , F lim HomᏮan E − →
B∈ᏮE B ∈ᏮF
Since F is complete, FB is a Banach space, and B Hom᐀c EB , FB . B , F HomᏮan E It follows that
HomᏵnd(Ꮾan) IB(E), IB(F ) lim ← −
lim Hom᐀c EB , FB − →
B∈ᏮE B ∈ᏮF
B(E, F ) Hom᐀c (E, F ), where the second isomorphism follows from Lemma 1.3 and the last isomorphism from Remark 1.4. Proposition 1.6. Let IL : Ᏽnd(Ꮾan) −→ ᐀c denote the functor defined by
IL “lim” Ei = lim Ei . − → − → i∈Ᏽ
i∈Ᏽ
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
47
Let E be an object of Ᏽnd(Ꮾan), and let F be a complete object of ᐀c. Then HomᏵnd(Ꮾan) E, IB(F ) Hom᐀c IL(E), F . Proof. Assuming E “lim” Ei , we have − → i∈Ᏽ HomᏵnd(Ꮾan) E, IB(F ) lim HomᏵnd(Ꮾan) “Ei ”, IB(F ) ← − i∈Ᏽ lim HomᏵnd(Ꮾan) IB(Ei ), IB(F ) ← − i∈Ᏽ lim Hom᐀c (Ei , F ) Hom᐀c IL(E), F , ← − i∈Ᏽ
where the second isomorphism follows from Remark 1.2 and the third follows from Proposition 1.5. Corollary 1.7. Let Ᏽ be a small category. For any functor X : Ᏽop −→ ᐀c such that X(i) is complete for any i ∈ Ᏽ, we have IB lim X(i) lim IB X(i) . ← − ← − i∈Ᏽ
i∈Ᏽ
Proof. For any object E of Ᏽnd(Ꮾan), we have HomᏵnd(Ꮾan) E, IB lim X(i) Hom᐀c IL(E), lim X(i) ← − ← − i∈Ᏽ i∈Ᏽ lim Hom᐀c IL(E), X(i) ← − i∈Ᏽ lim HomᏵnd(Ꮾan) E, IB X(i) , ← − i∈Ᏽ
where the first and last isomorphisms follow from Proposition 1.6. The conclusion follows from the theory of representable functors. Proposition 1.8. Assume that (Fn , fm,n )n∈N is an inductive system of Fréchet spaces with injective transition morphisms and that lim Fn − →
n∈N
is complete. Then the canonical morphism
lim IB Fn −→ IB lim Fn − → − →
n∈N
is an isomorphism.
n∈N
48
PROSMANS AND SCHNEIDERS
Proof. Applying IB to the canonical morphisms rn : Fn −→ lim Fn − → n∈N
and using the characterization of inductive limits, we get the canonical morphism lim IB Fn −→ IB lim Fn . (∗) − → − → n∈N
n∈N
Let B be a closed absolutely convex bounded subset of lim n∈N Fn . It follows from, − → for example, [8, Chapter IV, §19] that for some n ∈ N, B is the image of a closed absolutely convex bounded subset Bn of Fn by the canonical morphism rn . Since rn is injective, it induces the isomorphism of seminormed spaces ∼ Fn B −−→ lim Fn . n − → B
n∈N
Hence we get the isomorphism of Banach spaces ∼
n . lim Fn −−→ F Bn − → B
n∈N
Composing with the morphism
n ” −→ IB Fn −→ lim IB Fn , “ F Bn − → n∈N
we get a canonical morphism “ lim Fn ” −→ lim IB Fn . − → − → B
n∈N
n∈N
Finally, using the characterization of inductive limits, we obtain a canonical morphism IB lim Fn = lim “ lim Fn ” −→ lim IB Fn . − → − → − → − → n∈N
B∈Ꮾlim Fn − →
n∈N
B
n∈N
A direct computation shows that this morphism is a left and right inverse of (∗). Remark 1.9. Note that, due to [10, Proposition 7.2 and Corollary 7.2], a countable filtering inductive system of Fréchet spaces which is lim-acyclic in ᐀c is essentially − → equivalent to an inductive system that satisfies the assumptions of the preceding proposition. Hence, IB also commutes with the inductive limit functor in such a situation.
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
49
Definition 1.10. Let E and F be two objects of ᐀c. As usual, we denote by L b (E, F ) the vector space Hom᐀c (E, F ) endowed with the system of seminorms pB : p-continuous seminorm of F, B-bounded subset of E , where
pB (f ) = sup p f (e) . e∈B
Lemma 1.11. Let E and F be two objects of ᐀c. Assume E is bornological. Then L b (E, F ) lim L b EB , F ← − B∈ᏮE
in ᐀c. Moreover, assume that F is complete. Then B , F L b (E, F ) lim L b E ← − B∈ᏮE
in ᐀c. Proof. Keeping in mind the properties of bornological spaces, it is clear from the definition of L b (E, F ) that L b (E, F ) lim L b EB , F . ← − B∈ᏮE
B is included in the closure of a semiball of EB , any bounded Since any ball of E subset of EB is included in the closure of a bounded subset of EB . This property and the completeness of F show that B , F . L b EB , F L b E Hence we have the conclusion. Lemma 1.12. If E is a Banach space and F is a complete object of ᐀c, then IB L b (E, F ) L “E”, IB(F ) . Proof. For any B ∈ ᏮF , set Bb = f ∈ Hom᐀c (E, F ) : e ≤ 1 ⇒ f (e) ∈ B . Clearly Bb belongs to ᏮL b (E,F ) . Moreover, if B is closed in F , then Bb is closed in L b (E, F ), and one checks easily that L b (E, F ) B L E, FB b
50
PROSMANS AND SCHNEIDERS
as Banach spaces. Hence, one has successively IB L b (E, F ) = “lim” L b (E, F ) B “lim” L b (E, F ) B − → − → b B∈ᏮL b (E,F )
B ∈ᏮF
“lim” L E, FB L “E”, IB(F ) , − → B ∈ᏮF
where the second isomorphism follows from the fact that the inclusion Bb : B ∈ ᏮF ⊂ ᏮL b (E,F ) is cofinal. Proposition 1.13. Let E and F be two objects of ᐀c. Assume that E is bornological and F is complete. Then IB L b (E, F ) L IB(E), IB(F ) . Proof. We have successively (1) (2) (3)
lim L b EB , F ← − B∈ᏮE B , F lim IB L b E ← − B∈ᏮE B ”, IB(F ) lim L “E ← − B∈ᏮE L IB(E), IB(F ) ,
IB L b (E, F ) IB
where the isomorphism (1) follows from Lemma 1.11, (2) follows from Corollary 1.7, and (3) follows from Lemma 1.12. Remark 1.14. (1) Let E, F , G be three objects of ᐀c. Recall that a bilinear application b : E × F −→ G is continuous if and only if for any continuous seminorm r of G, there are continuous seminorms p and q of E and F , respectively, such that r b(x, y) ≤ p(x)q(y). (2) Let E and F be two objects of ᐀c with P and Q as systems of seminorms. As usual, if p ∈ P and q ∈ Q, we let p ⊗q denote the seminorm on E ⊗F defined by (p ⊗q)(u) = inf p(xi )q(yi ). u=
xi ⊗yi
i
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
51
Recall that E ⊗π F is the object of ᐀c obtained by endowing E ⊗F with the system of seminorms induced by p ⊗q : p ∈ P , q ∈ Q . From this definition, it follows immediately that any continuous bilinear map b : E × F −→ G factors uniquely through a continuous linear map E ⊗π F −→ G. ˆ π q is the ˆ π F denotes the completion of E ⊗π F and that p ⊗ Finally, recall that E ⊗ ˆ seminorm of E ⊗π F induced by p ⊗q. Proposition 1.15. There is a canonical morphism ˆ IB(E) ⊗IB(F ) −→ IB E ⊗π F . Proof. For B ∈ ᏮE and B ∈ ᏮF , let B ⊗B denote the absolutely convex hull of b ⊗b : b ∈ B, b ∈ B . This is clearly a bounded absolutely convex subset of E ⊗F . As a matter of fact, (p ⊗q) b ⊗b ≤ p(b)q(b ) ≤ sup p(b) sup q(b ). b∈B
b ∈B
Moreover, we have a canonical linear map EB ⊗FB −→ E ⊗π F B⊗B . This map is clearly continuous because e ⊗ f ∈ B ⊗ B when e ∈ B and e ∈ B . Applying the completion functor, we get a morphism B ⊗ B −→ E ˆF E ⊗π F B⊗B and, hence, a morphism B ” ⊗“ B ” IB E B ⊗ B −→ IB E ⊗ F . ˆ F ˆF “E π Using the definition of inductive limits, we get a morphism ˆ IB(E) ⊗IB(F ) lim − →
B ” ⊗“ B ” −→ IB E ⊗ F . ˆ F lim “E π − →
B∈ᏮE B ∈ᏮF
52
PROSMANS AND SCHNEIDERS
ˆ in Ᏽnd(Ꮾan). Hereafter, we let c0 2. Some acyclicity results for L and ⊗ 1 (resp., l ) denote as usual the Banach spaces formed by the sequences x = (xn )n∈N of complex numbers that converge to 0 (resp., that are summable); the norm is defined by ∞
xn . xc0 = sup xn resp., xl 1 = n∈N
n=0
For any Banach space X, we also set for short D(X) = L (X, C). Lemma 2.1. Let X, Y be two Banach spaces, and let f : X → Y be a nuclear map. Then there is a continuous linear map p : X → c0 and a nuclear map c : c0 → Y making the diagram 0 ? c ?? ~ ??c p ~~ ?? ~~ ? ~ ~ /Y X f commutative. Proof. Since f : X → Y is nuclear, there is a bounded sequence xn∗ of D(X), a bounded sequence yn of Y , and a summable sequence λn of complex numbers such that +∞ f (x) = λn xn∗ , x yn ∀x ∈ X. n=0
Since λn is summable, one can find a sequence rn of nonzero complex numbers converging to zero such that λn /rn is still summable. It is easy to check that the maps p : X → c0 and c : c0 → Y , defined by
p(x)n = rn xn∗ , x
c(s) =
and
+∞ λn n=0
rn
yn sn ,
have the requested properties. Definition 2.2. A projective system E : I op → Ꮾan, where I is a filtering ordered set, is nuclear if for any i ∈ I , there is j ∈ I , j ≥ i such that the transition morphism ei,j : Ej −→ Ei is nuclear. Lemma 2.3. Let I be an infinite filtering ordered set, and let E : I op → Ꮾan be a nuclear projective system. Then, in ᏼro(Ꮾan), we have “lim” Ei “lim” Xk , ← − ← − i∈I
k∈K
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
53
where X : K op → Ꮾan is a projective system with nuclear transition morphisms such that X k = c0 for any k ∈ K and #K = #I . Proof. Consider the set K = (i, j ) ∈ I × I : j ≥ i, ei,j : Ej −→ Ei nuclear . The relation “≥,” defined by setting (i , j ) ≥ (i, j ) if (i , j ) = (i, j ) or i ≥ j , turns K into a filtering ordered set. By Lemma 2.1, for any k = (i, j ) ∈ K, we may choose a continuous linear map pk : Ej → c0 and a nuclear map ck : c0 → Ei , which make the diagram 0 ? c ?? ~ ?? ck pk ~~ ?? ~~ ?? ~ ~ / Ei Ej e i,j
commutative. For any k ∈ K, we set Xk = c0 and xk,k = idXk . If k = (i , j ) > k = (i, j ), we set xk,k = pk # ej,i # ck : Xk −→ Xk . Because the map ck is nuclear, xk,k is also nuclear. An easy computation shows that if k < k < k , then xk,k # xk ,k = xk,k . Consider the functors 6 : K −→ I
7 : K −→ I
and
defined by 6((i, j )) = i and 7((i, j )) = j . They are clearly cofinal, and if k ≥ k in K, the diagrams Xk
ck
xk,k
Xk
/ E6(k )
ck
E7(k )
e6(k),6(k )
e7(k),7(k )
/ E6(k)
pk
xk,k
E7(k)
/ Xk
pk
/ Xk
are commutative. Hence we get the two morphisms “lim” Xk −→ “lim” E6(k) “lim” Ei , ← − ← − ← − k∈K
k∈K
i∈I
“lim” Ej “lim” E7(k) −→ “lim” Xk . ← − ← − ← − j ∈I
k∈K
k∈K
Since these morphisms are easily checked to be inverse of each other, the proof is complete. Remark 2.4. Hereafter, as usual, we let en denote the element of c0 defined by (en )m = δn,m , and we let en∗ denote the element of D(c0 ) defined by ∗ en , x = x n .
54
PROSMANS AND SCHNEIDERS
Lemma 2.5. For any Banach space Y and any nuclear map u : c0 −→ Y, the sequence u(en )Y is summable, and for any x ∈ c0 , we have u(x) =
+∞ n=0
en∗ , x u(en ).
Proof. Since u is nuclear, we can find a bounded sequence xn∗ of D(c0 ), a bounded sequence yn of Y , and a summable sequence λn of complex numbers such that u(x) =
+∞ n=0
λn xn∗ , x yn
for any x ∈ c0 . Using the isomorphism D(c0 ) l 1 , we see that +∞
∗
x , em = x ∗
(∗)
m=0
n D(c0 ) .
n
Therefore, M
+∞ M
∗
λn
x , em yn Y
u(em ) ≤
m=0
≤ ≤
m=0 n=0 +∞
n
∗
λn x
n=0 +∞ n=0
n D(c0 ) yn Y
λn sup x ∗ 0 sup yn Y , n D(c ) n∈N
n∈N
and the sequence u(en )Y is summable. Moreover, u(x) = = =
+∞ +∞
λn xn∗ , em xm yn
n=0 m=0 +∞ +∞
xm
m=0 +∞ m=0
n=0
λn xn∗ , em yn
∗ , x u(em ), em
where the permutation of sums is justified using (∗).
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
55
Lemma 2.6. Let I be an infinite filtering ordered set, and let X : I op → Ꮾan be a nuclear projective system. Assume that Y is a Fréchet space. Then the morphisms ˆ π Y −→ L b D(Xi ), Y , ϕi : X i ⊗ defined by setting ˆ π y (x ∗ ) = x ∗ , x y ϕi x ⊗
∀x ∈ Xi , y ∈ Y, x ∗ ∈ D(Xi ),
induce an isomorphism ˆ π Y “lim” L b D(Xi ), Y . “lim” Xi ⊗ ← − ← − i∈I
i∈I
In particular, for Y = C, we have
“lim” Xi “lim” D D(Xi ) . ← − ← − i∈I
i∈I
Proof. By Lemma 2.3, we may assume that Xi = c0 for any i ∈ I and that the transition morphisms xi,j : Xj −→ Xi (j > i) are nuclear. It is easy to check that ϕi is a well-defined continuous map. By Lemma 2.5, we know that the sequence (xi,j (en )Xi )n∈N is summable and that xi,j (c) =
+∞ n=0
en∗ , c xi,j (en ) ∀c ∈ Xj .
Therefore, we may define a continuous linear map ˆπY ψi,j : L b D(Xj ), Y −→ Xi ⊗ by setting ψi,j (h) =
+∞ n=0
ˆ π h en∗ . xi,j (en ) ⊗
It is easy to see that the morphisms ϕi and ψi,j induce morphisms of pro-objects ˆ π Y −→ “lim” L b D(Xi ), Y “lim” Xi ⊗ ← − ← − i∈I
and
i∈I
ˆ π Y. “lim” L b D(Xi ), Y −→ “lim” Xi ⊗ ← − ← − i∈I
i∈I
A direct computation shows that these morphisms are inverse of each other.
56
PROSMANS AND SCHNEIDERS
Definition 2.7. We say that a filtering projective system E : I op → ᐀c satisfies condition ML if for any i ∈ I , any seminorm p of Ei , and any ; > 0, there is i ≥ i such that ei,i (Ei ) ⊂ bp (;) + ei,i (Ei ) ∀i ≥ i , where bp (;) denotes as usual the semiball of radius ; and center 0 associated to the seminorm p. Remark 2.8. By [15, Proposition 1.2.9] (which is a direct consequence of [14, Theorem 5.6]), a countable filtering projective system of Fréchet spaces is lim-acyclic ← − in ᐀c if and only if it satisfies condition ML. Lemma 2.9. Let E : I op → ᐀c and F : J op → ᐀c be two filtering projective systems. If E and F satisfy condition ML, then the projective system ˆ π F : (I × J )op −→ ᐀c E⊗ defined by
ˆ π F (i, j ) = Ei ⊗ ˆ π Fj E⊗
satisfies condition ML. ˆ π Fj . It follows ˆ π q be a seminorm of Ei ⊗ Proof. Let (i, j ) ∈ I × J and let p ⊗ from our assumptions that there is i ≥ i and j ≥ j such that (∗)
ei,i (Ei ) ⊂ bp (1) + ei,i (Ei ) ∀i ≥ i
and (∗∗)
fj,j Fj ⊂ bq (1) + fj,j Fj
∀j ≥ j .
Fix (i , j ) ≥ (i , j ). Since the maps ei,i , fj,j and ei,i are continuous, we can find a seminorm p of Ei , a seminorm q of Fj , and a seminorm p of Ei such that p # ei,i ≤ p ,
q # fj,j ≤ q ,
p # ei,i ≤ p .
Consider ; > 0, and let z be an element of Ei ⊗π Fj of the type x ⊗π y , where x ∈ Ei , y ∈ Fj . Using (∗) and (∗∗) above, we obtain x ∈ Ei and y ∈ Fj such that ; p ei,i x − ei,i x ≤ 2 1 + q y and
; . q fj,j y − fj,j y ≤ 2 1 + p x
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
57
For z = x ⊗π y ∈ Ei ⊗π Fj , we get
p ⊗π q
ei,i ⊗π fj,j z − ei,i ⊗π fj,j z = p ⊗π q ei,i x − ei,i x ⊗π fj,j y + ei,i x ⊗π fj,j y − fj,j y ≤ p ei,i x − ei,i x q fj,j y + p ei,i x q fj,j y − fj,j y ≤ ;.
Since any element of Ei ⊗π Fj is a finite sum of elements of the type considered above, we see that for any ; > 0, ei,i ⊗π fj,j Ei ⊗π Fj ⊂ bp⊗ q (;) + ei,i ⊗π fj,j Ei ⊗π Fj . π
ˆ π Fj . The conclusion follows directly since Ei ⊗π Fj is dense in Ei ⊗ Remark 2.10. Let E be an object of ᐀c. Recall that E is of type FN if it is a nuclear Fréchet space and that E is of type DFN if it is isomorphic to the strong dual of a nuclear Fréchet space. Lemma 2.11. Assume X is an FN space. Then there is a projective system Xn , xn,m n∈N of Banach spaces such that (a) there is an isomorphism X lim Xn ; ← − n∈N
(b) for m > n, the transition map xn,m : Xm −→ Xn is nuclear and has a dense range; (c) there is an isomorphism Db (X) lim D(Xn ), − → n∈N
where Db (X) denotes the strong dual of X; (d) for m > n, the transition map D(xn,m ) : D(Xn ) −→ D(Xm ) is nuclear and injective.
58
PROSMANS AND SCHNEIDERS
Proof. Since X is an FN space, there is a cofinal increasing sequence (pn )n∈N of continuous seminorms of X such that the canonical map Xpn+1 −→ Xpn is nuclear. For such a sequence, the canonical map pn+1 −→ X pn X is also nuclear and has a dense range. Moreover, it is well known (see, e.g., [8, Chapter IV, §19]) that . X lim X ← − pn n∈N
Clearly
pn , Di (X) lim D Xpn lim D X − → − → n∈N
n∈N
where Di (X) is the inductive dual of X. Recall that an absolutely convex subset V is a neighborhood of 0 in Di (X) if it absorbs any equicontinuous subset of X . Hence, it is clear that a neighborhood of 0 in Db (X) is a neighborhood of 0 in Di (X). We know that X is reflexive (see, e.g., [11, §5.3.2]). Hence, Db (X) is bornological (see, e.g., [8, Chapter VI, §29]). The space X being itself bornological, the bounded subsets of Db (X) are equicontinuous. So any neighborhood of 0 in Di (X) is a neighborhood of 0 in Db (X), and Di (X) Db (X). Since (d) follows directly from (b), the proof is complete. Proposition 2.12. Assume that E is a DFN space and F is a Fréchet space. Then the canonical morphism Hom IB(E), IB(F ) −→ RHom IB(E), IB(F ) is an isomorphism. Proof. Since E is an DFN space, there is an FN space X such that E Db (X). Let (Xn , xn,m ) be a projective system of the kind considered in Lemma 2.11. We have E Db (X) lim D(Xn ). − → n∈N
Since the transition morphisms D(xn,m ) : D(Xn ) −→ D(Xm )
(m > n)
are injective and E is complete, Proposition 1.8 and Remark 1.2 show that IB(E) lim “D(Xn )”. − → n∈N
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
59
Using Lemma 2.3, we find a nuclear projective system (Yn , yn,m ) with Yn = c0 such that “lim” Xn “lim” Yn . ← − ← − n∈N
n∈N
It follows that IB(E) lim “D(Yn )”. − → n∈N
Hence we have successively (1) (2) (3) (4) (5)
RHom IB(E), IB(F ) RHom L lim “D(Yn )”, IB(F ) − → n∈N R lim RHom “D(Yn )”, IB(F ) ← − n∈N R lim Hom “D(Yn )”, IB(F ) ← − n∈N R lim Hom IB D(Yn ) , IB(F ) ← − n∈N R lim Hom᐀c D(Yn ), F , ← −
n∈N
where (1) follows from the fact that filtering inductive limits are exact in Ᏽnd(Ꮾan), (2) follows from [13, Proposition 3.6.3], (3) follows from the fact that “D(Yn )” “D(c0 )” “l 1 ” is projective in Ᏽnd(Ꮾan), (4) follows from Remark 1.2, and (5) follows from Proposition 1.5. By Lemma 2.6, we have the isomorphism ˆ π F “lim” L b D(Yn ), F . “lim” Yn ⊗ ← − ← − n∈N
n∈N
Forgetting the topologies and applying the derived projective limit functor for proobjects (see [13]), we obtain the isomorphism ˆ π F R lim Hom᐀c D(Yn ), F . R lim Yn ⊗ ← − ← − n∈N
n∈N
Since (Xn , xn,m )n∈N satisfies condition ML, it is lim-acyclic in ᐀c (see Remark 2.8). ← − It follows that (Yn , yn,m )n∈N is also lim-acyclic in ᐀c and hence satisfies condition ← − ˆ π F ) is concentrated in degree 0. ML. Using Lemma 2.9, we see that R lim n∈N (Yn ⊗ ← − It follows that the projective system
Hom᐀c D(Yn ), F n∈N
is lim-acyclic, and the conclusion follows. ← −
60
PROSMANS AND SCHNEIDERS
Theorem 2.13. Assume that E is a DFN space and F is a Fréchet space. Then the canonical morphism L IB(E), IB(F ) −→ RL IB(E), IB(F ) is an isomorphism. Proof. It is sufficient to show that LH k RL IB(E), IB(F ) 0 for k > 0. This is the case if
Hom “l 1 (I )”, RL IB(E), IB(F )
is concentrated in degree 0 for any set I . Let I be an arbitrary set. Since “l 1 (I )” is a projective object of Ᏽnd(Ꮾan) and since F is complete, we have RL “l 1 (I )”, IB(F ) L IB l 1 (I ) , IB(F ) IB L b l 1 (I ), F IB l ∞ (I, F ) , where l ∞ (I, F ) is the Fréchet space formed by the bounded families (xi )i∈I of F (a fundamental system of seminorms given by pI : p-continuous seminorm of F , where pI ((xi )i∈I ) = supi∈I p(xi )). Therefore, we have the chain of isomorphisms Hom “l 1 (I )”, RL IB(E), IB(F ) RHom “l 1 (I )”, RL IB(E), IB(F ) ˆ L IB(E), IB(F ) RHom “l 1 (I )” ⊗ ˆ L “l 1 (I )”, IB(F ) RHom IB(E) ⊗ RHom IB(E), RL “l 1 (I )”, IB(F ) RHom IB(E), IB l ∞ (I, F ) , and the conclusion follows from Proposition 2.12. Lemma 2.14. Let X, Y be two Banach spaces and let f : X → Y be a nuclear map. Then there is a nuclear map p : X → l 1 and a continuous linear map c : l 1 → Y making the diagram 1 ? l ?? ??c p ?? ? /Y X f commutative.
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
61
Proof. The result follows from an argument entirely similar to the one used in the proof of Lemma 2.1. Definition 2.15. An inductive system E : I → Ꮾan, where I is a filtering ordered set, is nuclear if for any i ∈ I , there is j ∈ I , j ≥ i such that the transition morphism ej,i : Ei −→ Ej is nuclear. An object of Ᏽnd(Ꮾan) is nuclear if it corresponds to a nuclear inductive system. Remark 2.16. Working as in the proof of Proposition 2.12, we see easily that IB(E) is nuclear if E is a DFN space. Lemma 2.17. Let I be an infinite filtering ordered set, and let E : I → Ꮾan be a nuclear inductive system. Then “lim” Ei “lim” Xk , − → − → i∈I
k∈K
where X : K → Ꮾan is an inductive system with nuclear transition morphisms such that X k = l 1 for any k ∈ K and #K = #I . Proof. The result is obtained by working as in the proof of Lemma 2.3, provided that Lemma 2.14 is used in place of Lemma 2.1. Lemma 2.18. Let I be a filtering ordered set. For any F ∈ D − (Ᏽnd(Ꮾan)) and any E ∈ D − (Ᏽnd(Ꮾan)I ), we have ˆ L F lim Ei ⊗ ˆ LF . lim Ei ⊗ − → − → i∈I
i∈I
Proof. If P· is a projective resolution of F , we have successively L ˆ F lim Ei ⊗P ˆ · lim Ei ⊗P ˆ · lim Ei ⊗ ˆ LF . lim Ei ⊗ − → − → − → − → i∈I
i∈I
i∈I
i∈I
Proposition 2.19. Let E and F be objects of Ᏽnd(Ꮾan). Assume that E is nuclear. Then ˆ L F E ⊗F. ˆ E⊗ Proof. Using Lemma 2.17, we may assume that E = “lim” Xi , − → i∈I
where X : I → Ꮾan is a filtering inductive system with Xi = l 1 , the transition morphisms xj,i : Xi −→ Xj
62
PROSMANS AND SCHNEIDERS
being nuclear. We may also assume that F = “lim” Yj , − → j ∈J
where Y : J → Ꮾan is a filtering inductive system. Then we have ˆ L F “lim” Xi ⊗ ˆ L “lim” Yj E⊗ − → − → i∈I
j ∈J
L
ˆ “Yj ” lim lim “Xi ” ⊗ → − →−
(1)
i∈I j ∈J
ˆ j” lim lim “Xi ” ⊗“Y − →− → i∈I j ∈J ˆ “lim” Xi ⊗ “lim” Yj − → − →
(2)
i∈I
j ∈J
ˆ E ⊗F, where (1) follows from Lemma 2.18 and (2) follows from the fact that “Xi ” “l 1 ” is projective in Ᏽnd(Ꮾan). 3. A topological version of Cartan’s Theorem B Proposition 3.1. Let X be a topological space with a countable basis. If F is a presheaf of Fréchet spaces on X that is a sheaf of vector spaces, then U −→ IB F (U ) (U open of X) is a sheaf with values in Ᏽnd(Ꮾan). Proof. Let U be an open subset of X and let ᐁ be an open covering of U . Consider the sequence (∗)
α
0 −→ F (U ) −−→
V ∈ᐁ
β
F (V ) −−→
F (V ∩ W ),
V ,W ∈ᐁ
where α and β are the continuous applications defined by pV # α = rV ,U
and
pV ,W # β = rV ∩W,V # pV − rV ∩W,W # pW ,
where pV and pV ,W are the canonical projections and rV ,U is the restriction map. Since F is a sheaf of vector spaces, this sequence is algebraically exact. Let us show that it is strictly exact. (1) If ᐁ is countable, F (U ), V ∈ᐁ F (V ), and V ,W ∈ᐁ F (V ∩ W ) are Fréchet spaces. Then, by the homomorphism theorem, the sequence (∗) is strictly exact.
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
63
(2) Assume that ᐁ is not countable. Since X has a countable basis, there is a countable set A of open subsets of X such that for any open V of X, V =
Uk ,
Uk ∈ A.
k∈N
Then consider the countable set
ᐂ = V ∈ A : ∃ V ∈ ᐁ such that V ⊂ V .
For any U ∈ ᐁ, we may assume that U = k∈N Uk , with Uk ∈ ᐂ. It follows that ᐂ covers any U in ᐁ and therefore is a covering of U . Hence, by (1), the sequence α
0 −→ F (U ) −−→
β F V −−→
V ∈ᐂ
F V ∩W
V ,W ∈ᐂ
is strictly exact. Now consider a map f : ᐂ → ᐁ such that V ⊂ f (V ) for any V ∈ ᐂ. Then consider the commutative diagram 0
/ F (U )
0
/ F (U )
α
/
V ∈ᐁ F (V )
β
/
γ
id
α
/ V ∈ᐂ F V
V ,W ∈ᐁ F (V
∩W)
δ
β
/
F V ∩W , V ,W ∈ᐂ
where γ and δ are, respectively, defined by pV # γ = rV ,f (V ) # pf (V ) and pV ,W # δ = rV ∩W ,f (V )∩f (W ) # pf (V ),f (W ) . To prove that the sequence (∗) is strictly exact, it is sufficient to establish that α is a kernel of β. Let h : X → V ∈ᐁ F (V ) be a morphism of ᐀c such that β #h = 0. Since β # γ # h = δ # β # h = 0 and since α is a kernel of β , there is a unique morphism h : X → F (U ) such that α # h = γ # h. Set h = h − α # h . We clearly have γ # h = 0 and β # h = 0. Fix V ∈ ᐁ. For any V ∈ ᐂ such that V ⊂ V , we have 0 = pV ,f (V ) # β # h = rV ∩f (V ),V # pV # h − rV ∩f (V ),f (V ) # pf (V ) # h .
64
PROSMANS AND SCHNEIDERS
It follows that rV ,V # pV # h = rV , V ∩f (V ) # rV ∩f (V ), V # pV # h = rV , V ∩f (V ) # rV ∩f (V ), f (V ) # pf (V ) # h = rV , f (V ) # pf (V ) # h = pV # γ # h = 0. Since {V ∈ ᐂ : V ⊂ V } is a covering of V and since F is a sheaf of vector spaces, we get pV # h = 0 ∀V ∈ ᐁ. It follows that h = 0 and that h = α # h . Since α is injective, h is the unique morphism of ᐀c such that h = α # h . Therefore, α is a kernel of β, and the sequence (∗) is strictly exact. Finally, since the functor IB preserves projective limits of complete objects of ᐀c (see Corollary 1.7), the sequence IB(β) IB(α) IB F (V ) −−−−→ IB F (V ∩ W ) 0 −→ IB F (U ) −−−−→ V ∈ᐁ
V ,W ∈ᐁ
is strictly exact in Ᏽnd(Ꮾan) . Hence, we reach the conclusion. Definition 3.2. We let IB(F ) denote the sheaf with values in Ᏽnd(Ꮾan) associated to a presheaf F of the kind considered in Proposition 3.1. Hereafter, let X denote a complex analytic manifold of complex dimension dX . We let ᏻX denote the sheaf of holomorphic functions on X. Recall that for any open subset U of X, ᏻX (U ) has a canonical structure of FN space. Moreover, recall that if V is a relatively compact open subset of U , the restriction morphism ᏻX (U ) −→ ᏻX (V )
is nuclear. In particular, if K is a compact subset of X, then ᏻX (K) lim ᏻX (U ),
− →
U ⊃K U open
topologized as an inductive limit, is a DFN space. Proposition 3.3. For any compact subset K of X, we have K, IB(ᏻX ) IB ᏻX (K) .
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
65
Proof. We know that K has a fundamental system (Un )n∈N of relatively compact open neighborhoods such that U n+1 ⊂ Un for any n ∈ N. If necessary, replacing Un by the union of those of its connected components that meet K, we may even assume that any connected component of Un meets K. In this case, it follows from the principle of unique continuation that the restriction ᏻX (Un ) −→ ᏻX Un+1 is injective. Moreover, by cofinality,
ᏻX (K) lim ᏻX Un .
− →
n∈N
Hence, by Proposition 1.8, it follows that IB lim ᏻX Un lim IB ᏻX Un . − → − → n∈N
n∈N
Since K is a taut subspace of X, a cofinality argument shows that K, IB ᏻX lim Un , IB ᏻX , − → n∈N
and the conclusion follows. Hereafter, we let Ꮿ∞,X denote the sheaf of rings formed by functions of class C∞ . (p,q) More generally, we let Ꮿ∞,X denote the sheaf of differential forms of class C∞ and (p,q)
of bitype (p, q). Recall that for any open subset U of X, Ꮿ∞,X (U ) has a canonical (p,q)
structure of FN space. Since the conditions of Proposition 3.1 are satisfied, IB(Ꮿ∞,X ) is a sheaf with values in Ᏽnd(Ꮾan). (p,q)
Proposition 3.4. The sheaf IB(Ꮿ∞,X ) is (U, ·)-acyclic for any open subset U of X. Proof. For any object E of Ᏽnd(Ꮾan), let hE : Ᏽnd(Ꮾan) −→ Ꮽb denote the functor defined by setting hE (F ) = Hom(E, F ). Using the techniques developed in [18], it is easy to show that (p,q) (p,q) Hom P , R U, IB Ꮿ∞,X R U, hP IB Ꮿ∞,X for any projective object P of Ᏽnd(Ꮾan). Therefore, the result is true if the sheaf of (p,q) abelian groups hP (IB(Ꮿ∞,X )) is soft. This follows from the fact that it clearly has a canonical structure of a Ꮿ∞,X -module.
66
PROSMANS AND SCHNEIDERS
Theorem 3.5. If U is an open subset of X such that H k U, ᏻX 0 (k > 0) algebraically, then
R U, IB ᏻX IB ᏻX (U ) . (p,q)
Proof. As is well known, since Ꮿ∞,U is a soft sheaf, the Dolbeault complex (0,0)
∂
(0,1)
∂
(0,n)
0 −→ Ꮿ∞,X −→ Ꮿ∞,X · · · −→ Ꮿ∞,X −→ 0 is a (U, ·)-acyclic resolution of ᏻX . Therefore, R(U, ᏻX ) is given by the complex ∂ ∂ (0,0) (0,1) (0,n) 0 −→ U, Ꮿ∞,X −→ U, Ꮿ∞,X · · · −→ U, Ꮿ∞,X −→ 0. Moreover, since H k (U, ᏻX ) 0 for k > 0, the sequence ∂ ∂ (0,0) (0,1) (0,n) 0 −→ U, ᏻX −→ U, Ꮿ∞,X −→ U, Ꮿ∞,X · · · −→ U, Ꮿ∞,X −→ 0 (p,q)
is algebraically exact. Since ᏻX (U ) and Ꮿ∞,X (U ) are FN spaces, the last sequence is strictly exact in ᐀c. Using [18, Proposition 3.2.26], one sees easily that the sequence (∗)
(0,0) (0,n) 0 −→ U, IB ᏻX −→ U, IB Ꮿ∞,X · · · −→ U, IB Ꮿ∞,X −→ 0
is strictly exact in Ᏽnd(Ꮾan). For any open ball b of X, Cartan’s Theorem B shows that H k b, ᏻX 0 (k > 0). Hence the sequence (0,0) (0,n) 0 −→ b, IB ᏻX −→ b, IB Ꮿ∞,X · · · −→ b, IB Ꮿ∞,X −→ 0 is strictly exact in Ᏽnd(Ꮾan). Filtering inductive limits being exact in Ᏽnd(Ꮾan), we see that (0,0) (0,1) (0,n) 0 −→ IB ᏻX −→ IB Ꮿ∞,X −→ IB Ꮿ∞,X · · · −→ IB Ꮿ∞,X −→ 0 is a strictly exact sequence of sheaves with values in Ᏽnd(Ꮾan). Moreover, since by (p,q) Proposition 3.4, IB(Ꮿ∞,U ) is (U, ·)-acyclic, R(U, IB(ᏻX )) is given by (0,0) (0,1) (0,n) 0 −→ U, IB Ꮿ∞,X −→ U, IB Ꮿ∞,X · · · −→ U, IB Ꮿ∞,X −→ 0. The sequence (∗) being strictly exact, we get R U, IB ᏻX U, IB ᏻX .
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
67
Proposition 3.6. If X is a Stein manifold and K is a holomorphically convex compact subset of X, we have R K, IB ᏻX IB ᏻX (K) . Proof. It is well known that K has a fundamental system ᐂ of Stein open neighborhoods. By tautness, it follows that for k > 0, we have LH k K, IB ᏻX lim LH k V , IB ᏻX 0, − → V ∈ᐂ
where the second isomorphism follows from Theorem 3.5. Hence, using Proposition 3.3, we get R K, IB ᏻX K, IB ᏻX IB ᏻX (K) . Remark 3.7. Note that all the results in this section clearly hold if we replace ᏻX by the sheaf of holomorphic sections of a holomorphic vector bundle. In particular, p they hold for the sheaf !X of holomorphic p-forms. 4. A factorization formula for IB(ᏻX×Y ) Definition 4.1. For any ρ = (ρ1 , . . . , ρp ) ∈ ]0, +∞[p , we set
)ρ = z ∈ Cp : z1 < ρ1 , . . . , zp < ρp , and we denote by Aρ the object of ᐀c defined by endowing
aα ρ α < +∞ Aρ = (aα )α∈Np : α
with the norm
α (aα )α∈Np =
aα ρ . α
p
Lemma 4.2. For any ρ ∈ ]0, +∞[ , we have the isomorphism A ρ l 1 Np . In particular, Aρ is a Banach space. Proof. This follows directly from the fact that the application u : Aρ −→ l 1 Np , defined by u((aα )α∈Np ) = (aα ρ α )α∈Np , is continuous and bijective. Lemma 4.3. For any p ∈ N, lim − →
ρ∈]0,+∞[p
IB ᏻCp )ρ
lim − →
ρ∈]0,+∞[p
IB Aρ .
68
PROSMANS AND SCHNEIDERS
Proof. This follows directly from the fact that the canonical restriction morphism ᏻCp )ρ −→ ᏻCp )ρ may be factored through Aρ for ρ > ρ. Proposition 4.4. Assume that X, Y are complex analytic manifolds. Then there is a canonical isomorphism L ˆ IB ᏻY IB ᏻX×Y . IB ᏻX Proof. Let U , V be open subsets of X and Y . The map uU,V : ᏻX (U ) × ᏻY (V ) −→ ᏻX×Y (U × V ), defined by setting uU,V (f, g)(u, v) = f (u) g(v), is clearly bilinear and continuous. Hence, it induces a morphism ᏻX (U ) ⊗π ᏻY (V ) −→ ᏻX×Y (U × V ),
and by Proposition 1.15, we get a morphism ˆ µU,V : IB ᏻX (U ) ⊗IB ᏻY (V ) −→ IB ᏻX×Y (U × V ) , which is clearly well behaved with respect to the restriction of U or V . Therefore, we get a canonical morphism ˆ IB ᏻY −→ IB ᏻX×Y . µ : IB ᏻX To show that it is an isomorphism, it is sufficient to work at the level of germs and to prove that ˆ µ(x,y) : IB ᏻX x ⊗IB ᏻY y −→ IB ᏻX×Y (x,y)
is an isomorphism. The problem being local, we may assume that X = Cp , Y = Cp , x = 0, y = 0. In this case, Lemma 4.3 shows that IB ᏻX x lim IB Aρ , lim IB Aρ , IB ᏻY y − → p − → ρ∈]0,+∞[
and
IB ᏻX×Y (x,y)
ρ ∈]0,+∞[p
lim − →
(ρ,ρ )∈]0,+∞[p+p
IB A(ρ,ρ ) .
A direct computation shows that through these isomorphisms, µx,y corresponds to the inductive limit of the maps ˆ τρ,ρ : IB Aρ ⊗IB Aρ −→ IB A(ρ,ρ )
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
69
associated to the continuous bilinear maps tρ,ρ : Aρ × Aρ −→ A(ρ,ρ ) defined by
tρ,ρ aα α∈Np , aα α ∈Np = aα aα (α,α )∈Np+p .
Since the diagram
ˆ IB Aρ ⊗IB Aρ τρ,ρ
/o
IB A(ρ,ρ ) /o
/ “Aρ ⊗A ˆ ρ ”
“tρ,ρ ”
/ “A(ρ,ρ ) ”
is clearly commutative, to prove that µ(x,y) is an isomorphism, it is sufficient to prove that tρ,ρ is an isomorphism. Thanks to Lemma 4.2, this fact is an easy consequence of the well-known isomorphism 1 p ˆ N l 1 Np+p . l 1 Np ⊗l By Proposition 3.3, IB ᏻX x {x}, IB ᏻX IB ᏻX ({x}) . Since ᏻX ({x}) is a DFN space, Proposition 2.19 shows that L ˆ IB ᏻY IB ᏻX ⊗IB ˆ IB ᏻX x ⊗ ᏻY y . y x Therefore, L ˆ IB ᏻY IB ᏻX ˆ IB ᏻY IB ᏻX×Y , IB ᏻX as requested. Corollary 4.5. If A, B are subsets of X and Y , then L ˆ R c B; IB ᏻY . R c A × B, IB ᏻX×Y R c A, IB ᏻX ⊗ In particular, if X, Y are Stein manifolds and if K, L are holomorphically convex compact subsets of X and Y , then ˆ IB ᏻX×Y (K × L) IB ᏻX (K) ⊗IB ᏻY (L) . Proof. The first part is a direct consequence of Theorem 4.4 and the Künneth theorem for sheaves with values in Ᏽnd(Ꮾan). The second part follows from the first part, using Proposition 3.6, Proposition 2.19, and the fact that ᏻX (K) is a DFN space.
70
PROSMANS AND SCHNEIDERS
5. Poincaré duality for IB(ᏻX ) Proposition 5.1. Assume that X, Y are complex analytic manifolds of dimension dX and dY . Then there is a canonical integration morphism : RqY ! IB !X×Y dX×Y −→ IB !Y dY . X
Proof. Recall that integration along the fibers of qY (i.e., on X) defines morphisms p+dX ,q+dX p,q −→ Ꮿ∞,Y (p, q ∈ Z), (∗) : qY ! Ꮿ∞,X×Y X
which are compatible with ∂ and ∂. Fix p, q ∈ Z. Let K be a compact subset of X, and let U be an open subset of Y . It is easy to check that the morphism p+dX ,q+dX p,q : K×U X × U ; Ꮿ∞,X×Y −→ U ; Ꮿ∞,Y X
is continuous for the canonical topologies. Applying IB, we get a morphism p+dX ,q+dX p,q K×U X × U ; IB Ꮿ∞,X×Y −→ U ; IB Ꮿ∞,Y . Taking the inductive limit on K, we get a morphism p+dX ,q+dX p,q U ; qY ! IB Ꮿ∞,X×Y −→ U ; IB Ꮿ∞,Y and, hence, a morphism p+dX ,q+dX p,q qY ! IB Ꮿ∞,X×Y −→ IB Ꮿ∞,Y of sheaves with values in Ᏽnd(Ꮾan). Thanks to the compatibility of (∗) with ∂ and ∂, we also get a morphism of complexes dX×Y ,· X ,· qY ! IB Ꮿ∞,X×Y dX×Y −→ IB Ꮿd∞,Y dY . Using the properties of Dolbeault resolutions, we get the requested integration morphism : RqY ! IB !X×Y dX×Y −→ IB !Y dY . X
Remark 5.2. Assume that X, Y , Z are complex analytic manifolds. Then one checks easily that the Fubini theorem gives rise to the commutative diagram:
RqZ ! RqY ×Z ! IB !X×Y ×Z dX×Y ×Z
X
/ RqZ IB !Y ×Z dY ×Z !
O
RqZ ! IB !X×Y ×Z dX×Y ×Z
X×Y
Y
/ IB !Z dZ .
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
71
Moreover, using the linearity of the integral, one gets the commutative diagram:
RqY ! IB !X×Y dX×Y
ˆ ⊗IB ᏻY
ˆ
X ⊗id
/ IB !Y dY ⊗IB ˆ ᏻY
projection
ˆ Y−1 IB ᏻY RqY ! IB !X×Y dX×Y ⊗q
cup product
cup product
RqY ! IB !X×Y dX×Y
X
/ IB !Y dY .
Theorem 5.3. Assume that X is a complex analytic manifold of dimension dX , and let aX : X → {pt} denote the canonical map. Then the morphism p d −p dX −→ D IB !X , IB !XX induced by adjunction from d −p p ˆ # I: aX! IB !XX dX ⊗IB !X −→ IB(C), X
is an isomorphism. Proof. Because the problem is local, it is sufficient to treat the case p = 0 and to show that the morphism R U ; IB !U dU −→ RL R c U ; IB ᏻU , IB(C) , obtained by adjunction from L ˆ R c U ; IB ᏻU −→ IB(C), # I: R U ; IB !U dU ⊗ X
is an isomorphism for any open interval U of CdU . This follows directly from Proposition 5.5 with V reduced to a point. Remark 5.4. As we will show elsewhere, the preceding theorem may be used to simplify the topological duality theory for coherent analytic sheaves. Proposition 5.5. Assume that U is an open interval of CdU and that V is an open interval of CdV . Then the canonical morphism ϕU,V : R U × V , IB !U ×V dU ×V −→ RL R c U ; IB ᏻU , R V ; IB !V dV ,
72
PROSMANS AND SCHNEIDERS
obtained by adjunction from
L ˆ R c U ; IB ᏻU # I: R U × V , IB !U ×V dU ×V ⊗ X −→ R V ; IB !V dV ,
is an isomorphism. Proof. Let W be an open interval of CdW , and assume that ϕU,V ×W and ϕV ,W are isomorphisms. Then we have successively
(1) (2) (3) (4)
R U × V × W ; IB !U ×V ×W dU ×V ×W RL R c U ; IB ᏻU , R V × W, IB !V ×W dV ×W RL R c U ; IB ᏻU , RL R c V ; IB ᏻV , R W ; IB !W dW L ˆ R c V ; IB ᏻV , R W ; IB !W dW RL R c U ; IB ᏻU ⊗ RL R c U × V ; IB ᏻU ×V , R W ; IB !W dW ,
where (1) and (2) follow from our assumptions, (3) is obtained by adjunction, and (4) comes from Corollary 4.5. Using Remark 5.2, we easily check that the composition of the preceding isomorphisms is equal to ϕU ×V ,W . Hence, an induction on dU reduces the problem to the case where dU = 1. This is dealt with in Proposition 5.6. Proposition 5.6. Assume that U is an open interval of C and that V is an open interval of Cn . Then the canonical morphism R U × V , IB !U ×V dU ×V −→ RL R c U ; IB ᏻU , R V ; IB !V dV is an isomorphism. Proof. For P = U , sheaf theory gives us the two distinguished triangles R ∂P ×V C × V , IB !C×V −→ R P ×V C × V , IB !C×V +1 −→ R U × V , IB !U ×V −−→ and +1 R c U, IB ᏻU −→ R P , IB ᏻC −→ R ∂P , IB ᏻC −−→, where ∂P denotes the boundary of P . If we apply the functor RL(·, IB(!V (V ))) to the last triangle, we obtain the morphism of distinguished triangles
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
R ∂P ×V C × V , IB ᏻC×V [1]
α
/ RL R ∂P , IB ᏻC , IB !V (V )
R P ×V C × V , IB ᏻC×V [1]
β
/ RL R P , IB ᏻC , IB !V (V )
R U × V , IB ᏻU ×V [1]
+1
γ
73
/ RL R c U, IB ᏻC , IB !V (V )
+1
where α and β are isomorphisms of the type considered in Proposition 5.7 (∂P is a finite union of closed intervals of C). It follows that γ is an isomorphism. Proposition 5.7. Assume that K is a finite union of closed intervals of C and that V is an open interval of Cn . Then the canonical morphism R K×V C × V ; IB !C×V dC×V −→ RL R K; IB ᏻC , R V ; IB !V dV , obtained by adjunction from
L ˆ R K; IB ᏻC # I: R K×V C × V ; IB !C×V dC×V ⊗ C −→ R V ; IB !V dV ,
is an isomorphism. Proof. Assume first that K is a closed interval of C. Since P × V is closed in C × V , we have the distinguished triangle R P ×V C × V , IB ᏻC×V −→ R C × V , IB ᏻC×V +1 −→ R C \ P × V , IB ᏻC×V −−→ . By Cartan’s Theorem B and Theorem 3.5, we have the isomorphisms R C × V , IB ᏻC×V IB ᏻC×V C × V and
R (C \ P ) × V , IB ᏻC×V IB ᏻC×V (C \ P ) × V .
Hence, the long exact sequence associated to the preceding distinguished triangle ensures that LH kP ×V C × V , IB ᏻC×V = 0 ∀k ≥ 2
74
PROSMANS AND SCHNEIDERS
and that the sequence 0 GF @A
/ LH 0 P ×V C × V , IB ᏻC×V
/ IB ᏻC×V (C × V )
/ IB ᏻC×V (C \ P ) × V
/ LH 1 P ×V C × V , IB ᏻC×V
ED BC /0
is strictly exact. Applying the functor IB to the sequence of Proposition 5.9, we get the split exact sequence 0 −→ IB ᏻC×V (C × V ) −→ IB ᏻC×V (C \ P ) × V (∗) −→ IB L b ᏻC (P ), ᏻV (V ) −→ 0 in Ᏽnd(Ꮾan). Therefore, LH 0P ×V C × V , IB ᏻC×V = 0 and
LH 1P ×V C × V , IB ᏻC×V IB L b ᏻC (P ), ᏻV (V ) .
Combining these results with Proposition 1.13, Theorem 2.13, Theorem 3.5, and Proposition 3.6, we obtain successively R P ×V C × V , IB ᏻC×V L IB ᏻC (P ) , IB ᏻV (V ) [−1] RL R P ; IB ᏻC , R V ; IB ᏻV [−1]. From Proposition 5.8, it follows easily that the canonical morphism R P ×V C × V ; IB !C×V dC×V −→ RL R P ; IB ᏻC , R V ; IB !V dV is an isomorphism. Now assume that the result has been established when K is a union of k < N closed intervals of C, and let us prove it when K=
N
Pi ,
i=1
where Pi (i = 1, . . . , N ) is a closed interval of C. Set L = N−1 i=1 Pi and Q = PN . By the Mayer-Vietoris theorem associated to the decomposition K = L∪Q, we have the
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
75
distinguished triangle R K, IB ᏻC R L, IB ᏻC ⊕ R Q, IB ᏻC R L ∩ Q, IB ᏻC . +1
Applying the functor RL(·, IB(!V (V ))), we obtain the distinguished triangle A = RL R L ∩ Q, IB ᏻC , IB !V (V ) B = RL R L, IB ᏻC ⊕ R Q, IB ᏻC , IB !V (V )
C = RL R K, IB ᏻC , IB !V (V ) . +1
Now consider the Mayer-Vietoris distinguished triangle A = R (L∩Q)×V C × V , IB !C×V B = R L×V C × V , IB !C×V ⊕ R Q×V C × V , IB !C×V
C = R K×V C × V , IB !C×V .
Since L∩Q = morphisms
N−1 i=1
(Pi ∩PN ) is a union of N −1 closed intervals of C, the canonical A [1] −→ A
and
B [1] −→ B
are isomorphisms. The canonical diagram A [1]
/ B [1]
/ C [1]
O A
O /B
/C
+1
+1
/
/
76
PROSMANS AND SCHNEIDERS
being commutative, the canonical morphism C [1] −→ C is also an isomorphism, and the conclusion follows. Proposition 5.8. Let P be a compact interval of C and let V be an open interval of Cn . Then : HP1 ×V C × V , !C×V −→ H 0 V , !V C
sends the class of ω = h(z, v) dz ∧ dv ∈ H 0 C \ P , !C×V
to
∂P
h(z, v) dz dv,
where P is a compact interval of C such that P ◦ ⊃ P . Proof. Let Ᏽ· be an injective resolution of !C×V . Let (v+1,·)
u· : Ꮿ∞,C×V −→ Ᏽ· denote a morphism extending id : !C×V → !C×V . The class c of ω in HP1 ×V C × V , !C×V H 1 P (C × V , Ᏽ· ) is represented by dσ , where σ ∈ (C × V , Ᏽ0 ) extends u0 (ω) ∈ ((C \ P ) × V , Ᏽ0 ). Let ϕ be a function of class C∞ on C equal to 1 on C \ P and equal to 0 on P , where P and P are compact intervals such that P ◦ ⊃ P and P ◦ ⊃ P . Then it is clear that u0 (ϕω) ∈ (C × V , Ᏽ0 ) and that σ − u0 (ϕω) ∈ c×V C × V , Ᏽ0 . Therefore, dσ and du0 (ϕ) give the same class in H 1 ( c×V (C × V , Ᏽ· )). It follows (v+1,·) that c corresponds to the class of ∂(ϕω) in H 1 ( c×V (C × V , Ꮿ∞,C×V )). Since c represents the image of c by the canonical map 1 HP1 ×V C × V , !C×V −→ Hc×V C × V , !C×V , we see that
C
c=
C
∂(ϕω) =
Hence, we have the conclusion.
P \P
∂ϕω =
∂P
ϕω −
∂P
ϕω =
∂P
ω.
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
77
Proposition 5.9. Let P be a closed interval of C, and let V be an open interval of Cn . Then in ᐀c, we have a split exact sequence of the form T r 0 −→ ᏻC×V (C × V ) −→ ᏻC×V (C \ P ) × V −−→ L b ᏻC (P ), ᏻV (V ) −→ 0, where r is the canonical restriction map and T is defined by setting h(z, v) g(z) dz T (h)(ϕ)(v) = ∂P
where g is a holomorphic extension of ϕ ∈ ᏻC (P ) on an open neighborhood U of P and P is a compact interval of C such that P ◦ ⊃ P and P ⊂ U . Proof. Note that the definition of T is meaningful since the right-hand side clearly does not depend on the choices of U , g, and P . It is also clear that the function T (h)(ϕ) is holomorphic on V and that the operator T is linear. Let us show that T is continuous. Let p be a continuous seminorm of Lb (ᏻC (P ), ᏻV (V )). We may assume that there is a bounded subset B of ᏻC (P ) and a compact subset K of V such that
p(τ ) = sup sup τ (ϕ)(v) , τ ∈ L b ᏻC (P ), ᏻV (V ) . ϕ∈B v∈K
For n > 0, set Un = {u ∈ C : d(u, P ) < 1/n}. By cofinality, we have ᏻC (P ) lim ᏻC (Un ).
− →
n>0
Moreover, for any n > 0, ᏻC (Un ) is a Fréchet space, and the restriction ᏻC (Un ) −→ ᏻC Un+1 is injective. Hence, by [8, Chapter IV, §19], there is n ∈ N and a bounded subset Bn of ᏻC (Un ) such that B ⊂ rUn (Bn ). Choosing a compact interval Pn of C such that Pn ◦ ⊃ P and Pn ⊂ Un , we see that
p T (h) ≤ sup sup h(z, v) g(z) dz ,
g∈Bn v∈K ∂Pn and we can find C > 0 such that
p T (h) ≤ C sup sup g(z) g∈Bn z∈∂Pn
sup
(z,v)∈∂Pn ×K
h(z, v) .
Let us consider the linear map S : L b ᏻC (P ), ᏻV (V ) −→ ᏻC×V (C \ P ) × V ,
78
PROSMANS AND SCHNEIDERS
defined by setting S(τ )(z, v) =
1 τ 2iπ
1 (v). z−u
Let us check that S is continuous. Consider a compact subset K of C \ P and a compact subset L of V . Since the set 1 BK = :z∈K z−u is bounded in ᏻC (C \ K), rP ,C\K (BK ) is a bounded subset of ᏻC (P ), and we have
S(τ )(z, v) ≤ 1 2π (z,v)∈K×L sup
sup
sup τ (f )(v) .
f ∈rP ,C\K (BK ) v∈L
For any τ ∈ Lb (ᏻC (P ), ᏻV (V )) and ϕ ∈ ᏻC (P ), there is an open U of C, containing P and g ∈ ᏻC (U ) such that ϕ = rU (g). Let K be a closed interval included in U and such that K ◦ ⊃ P , and let Ꮿ be the oriented boundary of K. For any v ∈ V , using the continuity of τ and Cauchy representation formula, we have 1 1 T S(τ ) (ϕ)(v) = (v)g(z) dz τ 2iπ Ꮿ z−u g(z) 1 dz (v) =τ 2iπ Ꮿ z − u = τ (g)(v) = τ (ϕ)(v). It follows that T # S = id or, in other words, that S is a section of T . Let us consider the continuous linear map R : ᏻC×V (C \ P ) × V −→ ᏻC×V (C × V ), defined as follows. Let h ∈ ᏻC×V ((C \ P ) × V ) and z ∈ C. Consider R > 0 such that z ∈ PR◦ = z : d(z, P ) < R . Then, for any v ∈ V , we set R(h)(z, v) =
1 2iπ
ᏯR
h(u, v) du, u−z
where ᏯR is the oriented boundary of PR . Since, for any f ∈ ᏻC×V (C × V ) and any (z, v) ∈ C × V , we have 1 1 r(f )(u, v) f (u, v) R r(f ) (z, v) = du = du = f (z, v), 2iπ ᏯR u − z 2iπ ᏯR u − z
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
79
we see that R # r = id. The map R is thus a retraction of r. Using a well-known result of homological algebra, the proof is complete if we show that r # R + S # T = id . To this end, consider h ∈ ᏻC×V ((C\P )×V ) and (z, v) ∈ (C\P )×V . Fix R > 0 such that z ∈ PR◦ and let ᏯR denote the oriented boundary of PR . Let Ꮿ be the oriented boundary of a closed interval K ⊂ PR such that z ∈ K and K ◦ ⊃ P . Letting R denote the oriented boundary of PR \ K ◦ and using Cauchy integral formula, we get 1 2iπ ᏯR 1 = 2iπ ᏯR 1 = 2iπ R = h(z, v).
(r # R + S # T )(h)(z, v) =
1 h(ξ, v) 1 dξ + T (h) (v) ξ −z 2iπ z−u 1 h(ξ, v) h(ξ, v) dξ + dξ ξ −z 2iπ Ꮿ z − ξ h(ξ, v) dξ ξ −z
Remark 5.10. The preceding result is a slightly more precise form of a special case of the Köthe-Grothendieck duality theorem (see [7], [3], [4]). 6. A holomorphic Schwartz kernel theorem (r,s)
Definition 6.1. Let X and Y be complex analytic manifolds. We define !X×Y to be the subsheaf of !r+s X×Y whose sections are the holomorphic differential forms that are locally a finite sum of forms of the type ωi,j dxi1 ∧ · · · ∧ dxir ∧ dyi1 ∧ · · · ∧ dyis , where x and y are holomorphic local coordinate systems on X and Y . (r,s)
Remark 6.2. Clearly (W ; !X×Y ) has a canonical structure of FN space for any (r,s)
open subset W of X × Y . Therefore, using Proposition 3.1, we see that IB(!X×Y ) is a sheaf with value in Ᏽnd(Ꮾan). Moreover, using Proposition 4.4, one can check easily that L s (r,s) ˆ IB !Y . IB !X×Y IB !rX Theorem 6.3. Assume that X, Y are complex analytic manifolds of dimension dX , dY . Then we have a canonical isomorphism −1 r ! (dX −r,s) IB !X×Y dX Rᏸ qX IB !X , qY IB !sY .
80
PROSMANS AND SCHNEIDERS
Proof. We have successively (1)
−1 r ! −1 r ! dY −s R ᏸ qX IB !X , qY IB !sY dY Rᏸ qX IB !X , qY D IB !Y −1 r −1 dY −s Rᏸ qX IB !X , D qY IB !Y −1 r IB !X , Rᏸ qY−1 IB !dYY −s , ωX×Y R ᏸ qX L dY −s ˆ IB ! , ωX×Y Rᏸ IB !rX Y (r,d −s) (2) , ωX×Y Rᏸ IB !X×YY (r,d −s) D IB !X×YY (dX −r,s) IB !X×Y (3) dX×Y , where ωX×Y denotes the dualizing complex on X × Y for sheaves with values in
Ᏽnd(Ꮾan). Note that (1) and (3) follow from Theorem 5.3 and that (2) comes from
Remark 6.2. As a consequence, we may now give Propositions 5.6 and 5.7 their full generality. Corollary 6.4. Let X, Y be complex analytic manifolds of dimension dX and dY . Assume that K is a compact subset of X. Then (dX −r,s) R K×Y X × Y ; IB !X×Y dX RL R K; IB !rX ; R Y ; IB !sY . Moreover, if X and Y are Stein manifolds and K is holomorphically convex in X, then these complexes are concentrated in degree 0 and isomorphic to IB L b !rX (K), !sY (Y ) . Proof. Transposing a classical result of the theory of abelian sheaves to sheaves with values in Ᏽnd(Ꮾan), we see that −1 Ᏺ, qY! Ᏻ RL R(K; Ᏺ); R(Y ; Ᏻ) R K×Y X × Y ; Rᏸ qX if Ᏺ and Ᏻ are objects of hv(X; Ᏽnd(Ꮾan)) and hv(Y ; Ᏽnd(Ꮾan)). This formula, combined with Theorem 6.3, gives the first part of the result. The second part follows from Proposition 3.6, Theorem 3.5 (using Remark 3.7), Theorem 2.13, and Proposition 1.13. Corollary 6.5. Let X, Y be complex analytic manifolds of dimension dX and dY . Then (dX −r,s) R X × Y ; IB !X×Y dX RL R c X; IB !rX , R Y ; IB !sY .
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
81
Proof. This follows directly from the general isomorphism −1 R X × Y ; Rᏸ qX Ᏺ, qY! Ᏻ RL R c (X; Ᏺ), R(Y ; Ᏻ) , which holds for any objects Ᏺ and Ᏻ of hv(X; Ᏽnd(Ꮾan)) and hv(Y ; Ᏽnd(Ꮾan)). Lemma 6.6. Let X be a complex analytic manifold of dimension dX , and let Y be a complex analytic submanifold of X of dimension dY . Then 0 LH k R Y IB ᏻX for k = dX − dY . Proof. Since the problem is local, it is sufficient to show that 0 LH k R {0}×V U × V ; IB ᏻU ×V for k = dX − dY if U and V are Stein open neighborhoods of 0 in CdX −dY and CdY . In this situation, {0} is a holomorphically convex compact subset of U , and from Corollary 6.4 we get that R {0}×V U × V ; IB ᏻU ×V dX − dY IB L b ᏻU ({0}), ᏻV (V ) . The conclusion follows directly. Theorem 6.7. For any morphism of complex analytic manifolds f : X → Y , we have a canonical isomorphism (0,d ) Rᏸ f −1 IB ᏻY , IB ᏻX δf−1 R )f IB !X×YY dY , where )f is the graph of f in X × Y and where δf : X → X × Y is the associated graph embedding. In particular, LH k Rᏸ f −1 IB ᏻY , IB ᏻX =0 for k = 0, and
R Ᏼom f −1 IB ᏻY , IB ᏻX Ᏸ∞ X→Y .
Proof. Using Theorem 6.3, we see that ! (0,d ) IB !X×YY dY Rᏸ qY−1 IB ᏻY , qX IB ᏻX . Applying δf! , we get successively ! (0,d ) IB ᏻX δf! IB !X×YY dY δf! Rᏸ qY−1 IB ᏻY , qX ! IB ᏻX Rᏸ δf−1 qY−1 IB ᏻY , δf! qX −1 ! IB ᏻY , qX # δf IB ᏻX R ᏸ qY # δ f Rᏸ f −1 IB ᏻY , IB ᏻX .
82
PROSMANS AND SCHNEIDERS
This gives the first part of the result. To get the second part, it is sufficient to use Lemma 6.6 if we remember that, following [16], we have (0,dY ) −1 Ᏸ∞ X→Y δf R )f !X×Y dY . Corollary 6.8. For any complex analytic manifold X of dimension dX , we have a canonical isomorphism (0,d ) Rᏸ IB ᏻX , IB ᏻX δ −1 R ) IB !X×XX dX , where ) is the diagonal of X × X and where δ : X → X × X is the diagonal embedding. In particular, =0 LH k Rᏸ IB ᏻX , IB ᏻX for k = 0, and
R Ᏼom IB ᏻX , IB ᏻX Ᏸ∞ X.
Remark 6.9. Note that the fact that continuous endomorphisms of ᏻX may be identified with partial differential operators of infinite order was conjectured by Sato and proved in [5]. The vanishing of the topological Ᏹxt k (k > 0) is, to our knowledge, entirely new. 7. Reconstruction theorem. Let be a ring on X with values in Ᏽnd(Ꮾan) (i.e., a ring of the closed category hv(X; Ᏽnd(Ꮾan)) (see [18])). Let ᏹod() denote the quasi-abelian category formed by -modules. If ᏹ, ᏺ are two -modules, it is easy to see that ᏸ (ᏹ, ᏺ) is endowed with both a structure of a right -module and a compatible structure of a left -module. These structures give two maps / ᏸ (ᏹ , ᏺ ) / ᏸ , ᏸ (ᏹ , ᏺ ) . As usual, we denote their equalizer by ᏸ (ᏹ, ᏺ). In this way, we get a functor ᏸ (·, ·) : ᏹod()op × ᏹod() −→ hv X; Ᏽnd(Ꮾan) , which is clearly continuous on each variable and in particular left exact. Using the techniques of [18], it is easy to see that ᏹod() has enough injective objects, and working as in [18, Proposition 2.3.10], one sees that the functor ᏸ (·, ·) has a right derived functor op Rᏸ (·, ·) : D − ᏹod() × D + ᏹod() −→ D + hv X; Ᏽnd(Ꮾan) . Now let Ᏹ be a sheaf on X with values in Ᏽnd(Ꮾan) and let ᏺ be an -module. Since ᏸ (Ᏹ, ᏺ) is canonically endowed with a structure of -module, we get a functor op ᏸ (·, ·) : hv X; Ᏽnd(Ꮾan) × ᏹod() −→ ᏹod().
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
83
We check directly that this functor may be derived on the right by resolving the first argument by a complex of K − (hv(X; Ᏽnd(Ꮾan))) with components of the type Pi U i∈I
i
(where Pi is a projective object of Ᏽnd(Ꮾan) and Ui is an open subset of X) and the second argument by a complex of K + (ᏹod()) with flabby components. This gives us a derived functor op Rᏸ (·, ·) : D − hv X; Ᏽnd(Ꮾan) × D + ᏹod() −→ D + ᏹod() , which reduces to the usual Rᏸ functor if we forget the -module structures. Finally, recall that an object ᏹ of D b (ᏹod()) is perfect if there are integers p ≤ q such that for any x ∈ X, there is a neighborhood U of x with the property that ᏹ|U is isomorphic to a complex of the type 0 −→ ᏼp −→ · · · −→ ᏼq −→ 0, where each ᏼk is a direct summand of a free U -module of finite type. We denote b (ᏹod()) the triangulated subcategory of D b (ᏹod()) formed by perfect by Dpf objects. Proposition 7.1. Let ᏺ be a sheaf on X with values in Ᏽnd(Ꮾan) such that LH k Rᏸ (ᏺ, ᏺ) = 0 (k = 0), and let be the ring ᏸ (ᏺ, ᏺ) of internal endomorphisms of ᏺ. Then ᏺ is an module, and the functor b ᏹod() −→ D b hv X; Ᏽnd(Ꮾan) Rᏸ (·, ᏺ) : Dpf is well defined. Moreover, we have a canonical isomorphism Rᏸ Rᏸ (ᏹ, ᏺ), ᏺ ᏹ b (ᏹod()). In particular, Rᏸ (·, ᏺ) identifies in D(ᏹod()) for any ᏹ ∈ Dpf b b Dpf (ᏹod()) with a full triangulated subcategory of D (hv(X; Ᏽnd(Ꮾan))). b (ᏹod()), it is clear that Proof. For any ᏹ ∈ Dpf
Rᏸ (ᏹ, ᏺ) ∈ D b hv X; Ᏽnd(Ꮾan) since Rᏸ (, ᏺ) ᏺ. The canonical morphism L
ˆ Rᏸ (ᏹ, ᏺ) −→ ᏺ ᏹ⊗
84
PROSMANS AND SCHNEIDERS
induces by adjunction a morphism
ᏹ −→ Rᏸ Rᏸ (ᏹ, ᏺ), ᏺ .
If ᏹ , Rᏸ (ᏹ, ᏺ) ᏺ and Rᏸ Rᏸ (ᏹ, ᏺ), ᏺ Rᏸ (ᏺ, ᏺ) ᏸ (ᏺ, ᏺ) , and the preceding morphism is an isomorphism. It follows that it is also an isomorphism for ᏹ k and, hence, if ᏹ is a direct summand of a free -module of finite type. Thanks to the local structure of perfect complexes, the conclusion follows easily. Let us consider the two functors Iᐂ : ᐂ −→ Ᏽnd(Ꮾan) E −→
“lim” F − →
F ⊂E dim F <+∞
and Lᐂ : Ᏽnd(Ꮾan) −→ ᐂ “lim” Ei −→ lim Ei , − → − → i∈Ᏽ
i∈Ᏽ
where ᐂ denotes the category of C-vector spaces. They are clearly linked by the adjunction formula Hom Iᐂ (E), F Hom E, Lᐂ (F ) and they are both exact. Moreover, Lᐂ # Iᐂ = id . For any sheaf E on X with values in ᐂ, we let I˜ᐂ (E) denote the sheaf associated to the presheaf U −→ Iᐂ E(U ) . Similarly, to any sheaf F on X with values in Ᏽnd(Ꮾan), we let L˜ ᐂ (F ) denote the sheaf U −→ Lᐂ F (U ) . Working at the level of fibers, it is easy to check that L˜ ᐂ # I˜ᐂ = id . Proposition 7.2. Let ᏺ be a sheaf on X with values in Ᏽnd(Ꮾan) such that LH k R Ᏼom (ᏺ, ᏺ) = 0 (k = 0),
A TOPOLOGICAL RECONSTRUCTION THEOREM FOR Ᏸ∞ -MODULES
85
and let ᐂ be the ring Ᏼom (ᏺ, ᏺ) of endomorphisms of ᏺ. Then ᏺ is an I˜ᐂ (ᐂ )module and the functor b Rᏸ I˜ᐂ (ᐂ ) I˜ᐂ (·), ᏺ : Dpf ᏹod(ᐂ ) −→ D b hv X; Ᏽnd(Ꮾan) is well defined. Moreover, we have a canonical isomorphism R Ᏼom Rᏸ I˜ᐂ (ᐂ ) I˜ᐂ (ᏹ), ᏺ , ᏺ ᏹ b (ᏹod( )). in D(ᏹod(ᐂ )) for any ᏹ ∈ Dpf ᐂ b (ᏹod( )) with a full triangu˜ In particular, Rᏸ I˜ᐂ (ᐂ ) (Iᐂ (·), ᏺ) identifies Dpf ᐂ b lated subcategory of D (hv(X; Ᏽnd(Ꮾan))).
Proof. Applying L˜ ᐂ to the morphism I˜ᐂ (ᏹ) −→ Rᏸ Rᏸ I˜ᐂ (ᐂ ) I˜ᐂ (ᏹ), ᏺ , ᏺ , we get a canonical morphism
ᏹ −→ R Ᏼom Rᏸ I˜ᐂ (ᐂ ) I˜ᐂ (ᏹ), ᏺ , ᏺ
since L˜ ᐂ # Rᏸ R Ᏼom . The conclusion follows by working as in the proof of Proposition 7.1. Theorem 7.3. Assume that X is a complex analytic manifold of dimension dX . Then the sheaf IB(ᏻX ) is an I˜ᐂ (Ᏸ∞ X )-module and the functor b ᏹod Ᏸ∞ −→ D b hv X; Ᏽnd(Ꮾan) Rᏸ I˜ᐂ (Ᏸ∞ ) I˜ᐂ (·), IB ᏻX : Dpf X X
is well defined. Moreover, we have a canonical isomorphism R Ᏼom Rᏸ I˜ᐂ (Ᏸ∞ ) I˜ᐂ (ᏹ), IB ᏻX , IB ᏻX ᏹ X
b ∞ in D(ᏹod(Ᏸ∞ X )) for any ᏹ ∈ Dpf (ᏹod(ᏰX )). b (ᏹod(Ᏸ∞ )) with a full triIn particular, Rᏸ I˜ᐂ (Ᏸ∞ ) (I˜ᐂ (·), IB(ᏻX )) identifies Dpf X X
angulated subcategory of D b (hv(X; Ᏽnd(Ꮾan))).
Proof. Thanks to Corollary 6.8, this is an easy consequence of Proposition 7.2.
References [1]
M. Artin, A. Grothendieck, and J.-L. Verdier, eds., Théorie des topos et cohomologie étale des schémas, Tome 1: Théorie des topos, Séminaire de Géométrie Algébrique du Bois-Marie (SGA 4), Lecture Notes in Math. 269, Springer-Verlag, Berlin, 1972.
86 [2] [3] [4] [5]
[6] [7] [8] [9] [10]
[11] [12] [13] [14] [15] [16]
[17] [18]
PROSMANS AND SCHNEIDERS M. Artin and B. Mazur, Etale Homotopy, Lecture Notes in Math. 100, Springer-Verlag, Berlin, 1969. A. Grothendieck, Sur certains espaces de fonctions holomorphes, I, J. Reine Angew. Math. 192 (1953), 35–64. , Sur certains espaces de fonctions holomorphes, II, J. Reine Angew. Math. 192 (1953), 77–95. R. Ishimura, Homomorphismes du faisceau des germes de fonctions holomorphes dans luimême et opérateurs différentiels, Mem. Fac. Sci. Kyushu Univ. Ser. A 32 (1978), 301– 312. M. Kashiwara, The Riemann-Hilbert problem for holonomic systems, Publ. Res. Inst. Math. Sci. 20 (1984), 319–365. G. Köthe, Dualität in der Funktionentheorie, J. Reine Angew. Math. 191 (1953), 30–49. , Topological Vector Spaces, I, Grundlehren Math. Wiss. 159, Springer-Verlag, New York, 1969. Z. Mebkhout, Une autre équivalence de catégories, Compositio Math. 51 (1984), 63–88. V. P. Palamodov, Homological methods in the theory of locally convex spaces (in Russian), Uspekhi Mat. Nauk 26, no. 1, (1971), 3–65; English transl. in Russian Math. Surveys 26 (1971), 1–64. A. Pietsch, Nuclear Locally Convex Spaces, Ergeb. Math. Grenzgeb. (3) 66, Springer-Verlag, New York, 1972. F. Prosmans, Algèbre homologique quasi-abélienne, Mémoire de DEA, 1995, available from http://www-math.univ-paris13.fr/˜prosmans/. , Derived limits in quasi-abelian categories, preprint, 1998, to appear in Bull. Soc. Roy. Sci. Liège, available from http://www-math.univ-paris13.fr/˜prosmans/. , Derived projective limits of topological abelian groups, J. Funct. Anal. 162 (1999), 135–177. , Derived categories for functional analysis, to appear in Publ. Res. Inst. Math. Sci., available from http://www-math.univ-paris13.fr/˜prosmans/. M. Sato, T. Kawai, and M. Kashiwara, “Microfunctions and pseudo-differential equations” in Hyperfunctions and Pseudo-Differential Equations (Katata, 1971), ed. H. Komatsu, Lecture Notes in Math. 287, Springer-Verlag, Berlin, 1973, 265–529. J.-P. Schneiders, Cosheaves homology, Bull. Soc. Math. Belg. Sér. B 39 (1987), 1–31. , Quasi-abelian categories and sheaves, Mém. Soc. Math. France (N.S.) 76 (1999).
Laboratoire Analyse, Géométrie et Applications, Unité Mixte de Recherche 7539, Université Paris 13, Avenue J.-B. Clément, 93430 Villetaneuse, France; prosmans@math. univ-paris13.fr;
[email protected]
Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
A NONREMOVABLE GENERIC 4-BALL IN THE UNIT SPHERE OF C3 B. JÖRICKE and N. SHCHERBINA
1. Introduction. The title of this paper is related to a subject that became quite popular during the last decade, namely, the subject of removable singularities in various situations of multidimensional complex analysis. In this paper we deal with a global problem in the aforementioned field. Global problems are, as a rule, less understood than local ones, and quite often they are related to interesting geometric problems in several variables. Recall the basic definitions of removability. For more background information and motivation, the reader is referred to previous papers on the subject, for example, [ChSt], [Jö1], [Jö2], [Jö3], [St], [Lu]. We apologize for omitting references on a large number of interesting papers in the field. For a smooth real hypersurface in Cn , as usual, we let ∂¯b denote the system of tangential Cauchy-Riemann operators. Definition 1. Let ⊂ Cn , n ≥ 2, be a bounded, strictly pseudoconvex domain with boundary ∂ of class C 2 . A compact subset K of ∂ is called (Lp , ∂¯b )removable if any function f of class Lp on ∂ (with respect to (2n−1)-dimensional surface measure), which satisfies the (weak) equations ∂¯b f = 0 on ∂ \ K, satisfies these equations on the whole of ∂. Here p ∈ [1, ∞]. In classical situations (like the Cauchy-Riemann equation in the complex plane), there do not exist nonempty removable singularities for L1 -functions. The situation changes for tangential Cauchy-Riemann operators. This effect is based on compulsory analytic extension of solutions to the tangential Cauchy-Riemann equations, which motivates the following. Definition 2. Let ⊂ Cn , n ≥ 2, be a bounded, strictly pseudoconvex domain with C 2 boundary. A compact K ⊂ ∂ is called removable if any continuous (but not necessarily bounded) function f on ∂ \ K, which satisfies there the tangential Cauchy-Riemann equations ∂¯b f = 0, extends to an analytic function in . It is difficult to give a geometric description of removable singularities in strictly pseudoconvex boundaries in the general case. Having in mind some analogy to the classical Painlevé problem, we can ask for removability of singularities contained in Received 16 September 1998. 1991 Mathematics Subject Classification. Primary 32D20, 32C25; Secondary 32C16. Shcherbina’s work partially supported by the Swedish Natural Science Council. 87
88
JÖRICKE AND SHCHERBINA
generic codimension 1 submanifolds of strictly pseudoconvex boundaries. As usual, a manifold M imbedded into Cn is called generic (with respect to the complex structure) if for each point p ∈ M the complex tangent space TpC M(= Tp M ∩ iTp M) has minimal possible dimension, namely, n − c, where c is the real codimension of M ⊂ Cn if the latter does not exceed n or c equals n otherwise. In the following, an m-manifold (e.g., an m-ball) always means a real manifold of real dimension m. Smooth means of class C ∞ . A compact subset of a generic m-manifold is called a closed generic m-ball if it is the diffeomorphic image of a closed ball in Rm . For n = 2, the following result was first proved in [Jö1]; see also [FoSt] and [Du] for other proofs and generalizations. Theorem 0. Closed totally real (i.e., generic) C 2 discs in strictly pseudoconvex boundaries in C2 are removable. For n > 2, the corresponding problem was raised by several authors. Problem 1. Is every closed generic (2n − 2)-ball contained in a strictly pseudoconvex boundary in Cn , n > 2, removable? Note that the “simplest” nonremovable singularity (in the sense of Definition 2) is the boundary of a complex variety of codimension 1 contained and relatively closed in with “not too bad behaviour near ∂” (for more detailed information, see [Ch] and the aforementioned references on removable singularities). In well-behaved cases, the boundary of this variety is a closed manifold of codimension 3 in Cn , and this way we get the only nonremovable compact sets contained in codimension 3 submanifolds of Cn (see [ChSt], [Jö2]). Hence the problem is closely related to the following geometric problem. Problem 2. Can the boundary of a codimension 1 analytic subvariety of a strictly pseudoconvex domain be a smooth submanifold of a generic ball of real dimension 2n − 2 contained in ∂? Analytic varieties play an important role in complex analysis and geometry, so the question itself is interesting. The problem becomes much easier if we weaken the assumptions. It is not so hard to see that a generic (2n − 2)-manifold M in Cn with more complicated topology than balls (e.g., a manifold with infinite fundamental group) may contain the boundary of a codimension 1 analytic variety in such a way that the mentioned boundary bounds a domain on M. One such example for the case n = 3 was given in [Do]. Note that, in this example, the manifold M is not contained in the boundary of a strictly pseudoconvex domain. The easy part of our proof (see Step 2 of the proof of Lemma 2) gives an example of such a manifold M contained in the unit sphere of C3 . Note that, in the case n = 2, Theorem 0 implies that Problem 2 has a negative answer. In general, generic 2-balls in C2 may contain boundaries of 1-dimensional
A NONREMOVABLE 4-BALL
89
analytic varieties as was known long before from Wermer’s example [We]. Note also that for n ≥ 3, most of the suggestions were towards a negative answer to Problem 2 (and, moreover, towards a positive answer to Problem 1). For example, it is not difficult to see that (under the conditions imposed in Problem 2) the boundary of the mentioned analytic variety cannot be of class C 2 and diffeomorphic to a (2n−3)sphere. (Otherwise, there would exist a nonsingular vector field on the (2n − 2)-ball, which is transversal to the aforementioned (2n − 3)-sphere by [Fo].) A proof of a similar statement in a slightly more general situation is written, for example, in [Do]. Moreover, for any smooth generic (2n − 2)-ball contained in a strictly pseudoconvex boundary, there exist arbitrarily small (in certain C k -norm) perturbations that are removable. This assertion is true in much more general situations. (A generic CRmanifold that is imbedded into Cn and in general position consists of one CR-orbit.) The authors will give a proof in a forthcoming paper. Nevertheless, for dimensions n ≥ 3, Problem 2 has a positive answer. More precisely, we prove the following theorem. We let Bj denote the unit ball in Cj and we let ∂Bj denote its boundary. Theorem 1. There exist a smooth generic 4-ball Ꮾ4 imbedded into the unit sphere ∂B3 and a smooth (simple) zero-set X of an entire function in C3 which intersects ∂B3 transversally such that X ∩ ∂B3 ⊂ Ꮾ4 . As a corollary, we get the following theorem. Theorem 2. There exists a smooth generic nonremovable 4-ball Ꮾ4 contained in ∂B3 . Moreover, Ꮾ4 is not (L1 , ∂¯b )-removable either. Proof of Theorem 2. Indeed, let f be an entire function in C3 such that f −1 (0) =X and grad f = 0 on X. The function 1/f restricted to ∂B3 \ X is continuous and satisfies the tangential Cauchy-Riemann equations there. Moreover, by assumption, we have |1/f (z)| ≤ const · dist(z, X). Since X intersects ∂B3 transversally, it follows that 1/f is in L1 on ∂B3 with respect to 5-dimensional surface measure. But 1/f does not extend analytically to B3 , which would be the case if Ꮾ4 (and hence also X ∩ ∂B3 ) were removable or (L1 , ∂¯b )-removable, respectively. Acknowledgments. The authors are grateful to several people for useful discussions, especially concerning explanation of the material. Among them are A. Juhl, N. Kruzhilin, and O. Viro. O. Viro pointed out the reference on the isotopy extension theorem (our original proof differed slightly from the present one; it was longer and more technical, but a bit more explicit). N. Kruzhilin read the manuscript and gave useful comments. The present proof of Lemma 4 actually follows his advice. The rest of the paper is devoted to the proof of Theorem 1. 2. Proof of Theorem 1: A reduction. We start by guessing the manifold X. Dimca told us the following example of a family of smooth algebraic hypersurfaces
90
JÖRICKE AND SHCHERBINA
X in C3 whose intersections with the unit sphere are compact maximally complex 3-manifolds with group. For notational convenience, we work √ infinite fundamental √ with the ball 3B3 of radius 3 centered at zero instead of the unit ball and with its boundary def √ S5 = ∂ 3B3 . (1) Put X = (z1 , z2 , z3 ) ∈ C3 : z1 z2 z3 = ,
> 0,
(2)
and A = X ∩ S5 .
(3)
Lemma 1. A is empty for > 1 and is diffeomorphic to (S 1 )3 for 0 < < 1. For the latter choice of , it is a compact maximally complex CR-manifold. For = 1, it degenerates and is diffeomorphic to (S 1 )2 . Here S 1 is a circle (say, the unit circle in the complex plane). Proof. Use coordinates zj = rj ζj
with rj > 0, ζj ∈ S 1 .
(4)
In these coordinates, A is the direct product of the 2-torus ᐀2 = ζ1 , ζ2 , ζ1−1 ζ2−1 : |ζ1 | = |ζ2 | = 1 and the set =
r 1 , r2 , r1 r2
: r1 > 0, r2 > 0,
2 def m(r1 , r2 ) = r12 + r22 + 2 2 r1 r2
(5) =3 .
The smooth function m has a minimum at ( 1/3 , 1/3 ) and m( 1/3 , 1/3 ) = 3 2/3 . All the other points are regular for m. Therefore, is empty if > 1, it is a point if = 1, and it is diffeomorphic to a circle for < 1. √ Note that for = 1, the algebraic hypersurface X1 is tangent to S5 (= ∂( 3B3 )) at the points of A1 = ᐀2 ; hence for p ∈ ᐀2 , we have Tp X1 = Tpc S5 . Our goal is to find a smooth generic real 4-ball Ꮾ4 imbedded into S5 which contains A for certain < 1 close to 1. The following lemma reduces the problem to a simpler one. Lemma 2. Suppose that there is a C ∞ -smooth 4-ball B 4 imbedded into S5 which contains the 2-torus A1 = ᐀2 and has the following property. For each p ∈ ᐀2 ⊂ B 4 , the tangent space Tp B 4 coincides with the complex tangent space Tpc S5 of the sphere:
A NONREMOVABLE 4-BALL
91
Tp B 4 = Tpc S5
(6)
for p ∈ ᐀2 .
Then there is a smooth generic 4-ball Ꮾ4 that contains A for some positive close to 1, < 1. Of course, B 4 is not generic (see (6)). Proof. Step 1. We want to change B 4 slightly near ᐀2 . The new 4-ball coincides near ᐀2 with a perturbation of the following auxiliary 4-manifold ᏹ4 = ᏹδ4 ,
ᏹδ4 = r1 ζ1 , r2 ζ2 , 3 − r12 − r22 ·ζ1−1 ζ2−1 : |ζ1 | = |ζ2 | = 1, |1−r1 | < δ, |1−r2 | < δ . Here δ > 0 is sufficiently small. ᏹδ4 contains A for close to 1; in particular, it contains ᐀2 and Tp ᏹδ4 = Tpc S5
for p ∈ ᐀2 .
In other words, ᏹδ4 and B 4 have first-order contact along ᐀2 . We use this later. It is convenient to write ᏹδ4 as a graph: ᏹδ4 = (z1 , z2 , z3 ) ∈ C3 : (z1 , z2 ) ∈ δ , z3 = Ᏻ(z1 , z2 ) ,
(7)
(8)
where δ =
r1 eiϕ1 , r2 eiϕ2 ∈ C2 : |1 − r1 | < δ, |1 − r2 | < δ
and
Ᏻ r1 eiϕ1 , r2 eiϕ2 =
3 − r12 − r22 · e−i(ϕ1 +ϕ2 ) .
(9)
(10)
The complex tangencies of ᏹδ4 are those points (z1 , z2 , Ᏻ(z1 , z2 )) for which both derivatives ∂z¯ 1 Ᏻ(z1 , z2 ) and ∂z¯ 2 Ᏻ(z1 , z2 ) vanish. (Recall that the complex tangencies of a manifold imbedded into Cn are the points for which the tangent space is complex. In our case these are exactly the nongeneric points of ᏹδ4 .) In polar coordinates this means that ((r1 ∂/∂r1 ) + (i∂/∂ϕ1 ))Ᏻ and ((r2 ∂/∂r2 ) + (i∂/∂ϕ2 ))Ᏻ vanish at those points. It is now easy to compute directly that the set of complex tangencies of ᏹδ4 is exactly ᐀2 . (There is another way to see that ᏹδ4 \᐀2 does not contain complex tangencies. Indeed, if p ∈ A ⊂ ᏹδ4 is a complex tangency, then, since Tp ᏹδ4 contains Tp A , it also contains the complexification of this space, namely, Tp X . This is impossible if < 1, since by a theorem of Forstneriˇc [Fo], Tp A ⊂ Tpc S5 for < 1 and for every p ∈ A .) In the next step, we make a small perturbation of ᏹδ4 , keeping ᏹδ4 \ ᏹδ41 fixed for an arbitrarily small δ1 > 0, δ1 < δ, in order to get a generic manifold. The perturbed ˜ 4 ) still contains a lot of A ’s. Moreover, it is arbitrarily close (say, ˜ 4 (= ᏹ manifold ᏹ δ δ,δ1 in C 1 ) to ᏹδ4 and is contained in S5 . (Note that the existence of such a perturbation
92
JÖRICKE AND SHCHERBINA
can also be obtained using the general h-principle of Gromov (see [Gr]). Instead we give a short explicit construction.) Step 2. Replace on δ1 the function Ᏻ by the following function Ᏻ1 : Ᏻ1 r1 eiϕ1 , r2 eiϕ2 = Ᏻ r1 eiϕ1 , r2 eiϕ2 · eig(r1 ,r2 ) (11) = 3 − r12 − r22 · e−i(ϕ1 +ϕ2 )+ig(r1 ,r2 ) , with a smooth real function g with small C 1 norm, compactly supported in {|1 − ¯ r1 | < δ1 , |1 − r2 | < δ1 }. The ∂-derivatives ∂z¯ 1 Ᏻ1 and ∂z¯ 2 Ᏻ1 of Ᏻ1 , multiplied by 2r1 e−iϕ1 and 2r2 e−iϕ2 , respectively, are equal to the following expressions: ∂ ∂ Ᏻ1 r1 eiϕ1 , r2 eiϕ2 +i r1 ∂r ∂ϕ1 1 −r 2 + 3 − r 2 − r 2 ∂g 1 1 2 = + ir1 3 − r12 − r22 · (r1 , r2 ) e−i(ϕ1 +ϕ2 )+ig(r1 ,r2 ) , (12) ∂r1 3 − r2 − r2 1
2
∂ ∂ r2 +i Ᏻ1 r1 eiϕ1 , r2 eiϕ2 ∂r2 ∂ϕ2 −r 2 + 3 − r 2 − r 2 ∂g 2 1 2 = + ir2 3 − r12 − r22 · (r1 , r2 ) e−i(ϕ1 +ϕ2 )+ig(r1 ,r2 ) . (13) ∂r2 3 − r12 − r22
Since 3 − r12 − r22 = r32 , we see that the first term in (12) vanishes if and only if r1 = r3 and the first term in (13) vanishes if and only if r2 = r3 . Moreover, the first terms in brackets are real and the second ones are imaginary. Now choose g with compact support in {|1 − rj | < δ1 , j = 1, 2} in such a way that at least one of the derivatives (∂/∂r1 )g or (∂/∂r2 )g does not vanish for r1 = r2 = 1. Then at all points ¯ of δ1 , at least one of the ∂-derivatives of Ᏻ1 does not vanish. Let Ᏻ1 = Ᏻ on δ \δ1 ˜ 4 be the graph of Ᏻ1 over δ . Then, as desired, ᏹ ˜ 4 ⊂ S5 , ᏹ ˜ 4 is generic, is and let ᏹ δ δ δ close (in C 1 ) to ᏹδ4 , and contains some A . Step 3. It follows from assumption (6) that for δ small enough, a part of the ball B 4 can be represented as the graph of a function z3 = ᏳB (z1 , z2 ) defined on the set δ . Take δ2 ∈ (δ1 , δ) close to δ, so that the graph of Ᏻ1 over δ2 \ δ1 still contains some A , and take δ3 ∈ (δ2 , δ). For a smooth nonnegative cut-off function χ that is 1 on ¯ δ2 and 0 on \δ3 , put Ᏻ2 = χ Ᏻ1 +(1−χ)ᏳB . Let Bˇ 4 denote the ball B 4 with the graph of ᏳB over δ replaced by the graph of Ᏻ2 over δ . By Thom’s transversality theorem (see [GoGu, Theorem 4.9] or [Hi]), there is a 4-ball B˜ 4 ⊂ S5 arbitrarily close (say, in C 1 ) to Bˇ 4 with only isolated complex tangencies which coincides with Bˇ 4 ¯ δ2 . over the set Shrinking B˜ 4 a little, we may assume that the number of complex tangencies is finite. Moreover, by construction, they are outside the relatively compact domain
A NONREMOVABLE 4-BALL
93
bounded by some A on B˜ 4 . Remove from B˜ 4 a suitable small neighbourhood of a smooth curve which joins one complex tangency with a boundary point of B˜ 4 and does not meet A and the other complex tangencies. This decreases the number of complex tangencies by 1. After finitely many steps, we get the desired 4-ball Ꮾ4 . The following almost obvious lemma gives a further reduction. Lemma 3. To obtain the B 4 of Lemma 2, it is enough to find a C ∞ -smooth 3-ball B 3 ⊃ ᐀2 imbedded into S5 together with a smooth nonsingular normal vector field n on B 3 which is orthogonal to Tpc S5 at the points p ∈ ᐀2 . Orthogonality is always understood with respect to the usual Euclidean scalar product of Cn . Normality of n is meant with respect to the imbedding of B 3 into S5 , that is, n(p) ∈ Tp S5 , and n(p) is orthogonal to Tp B 3 . Proof. For each p ∈ B 3 ⊂ S5 , the set of vectors in Tp S5 which are normal to B 3 is a 2-plane. Choose an orientation on B 3 and S5 , and orient the 2-planes so that the orientations are compatible. For each p, we rotate the vector n(p) in the corresponding normal 2-plane in “positive” direction around the origin by the angle π/2 and denote the thus obtained vector by t(p). In other words, the pair (n(p), t(p)) gives the above chosen orientation on the normal plane, and t is a smooth, nonsingular normal vector field on B 3 orthogonal to n. Using exponential mappings, we may slightly extend B 3 in the direction of t and get a 4-ball B 4 , B 3 ⊂ B 4 ⊂ S5 . The obtained 4-ball has the following property: At points p ∈ B 3 ⊂ B 4 , the tangent space Tp B 4 is spanned by Tp B 3 and t; hence it is orthogonal to n. In other words, B 4 satisfies the conditions of Lemma 2. Lemmas 1, 2, and 3 reduce Theorem 1 to the following slightly easier assertion, which we state explicitly and prove in the following section. Proposition 1. There exists a C ∞ -smooth 3-ball B 3 imbedded into S5 which contains ᐀2 and a C ∞ -smooth nonsingular normal vector field n on B 3 which is orthogonal to Tpc S5 at the points p ∈ ᐀2 . 3. Proof of Proposition 1. First note that for p = eiϕ1 , eiϕ2 , e−i(ϕ1 +ϕ2 ) ∈ ᐀2 ,
(14)
the outer normal to S5 is the vector given by (eiϕ1 , eiϕ2 , e−i(ϕ1 +ϕ2 ) ). The vector def 1 N(p) = √ ieiϕ1 , ieiϕ2 , ie−i(ϕ1 +ϕ2 ) (15) 3 is of length 1 and orthogonal to the previous vector; hence it is tangential to the sphere S5 and orthogonal to the complex tangent space Tpc S5 . In other words, N may serve as the restriction of the desired vector field n to ᐀2 . The first step of the proof is to give an explicit description of a 3-sphere 3 ⊂ S5 that contains ᐀2 and has the property that N is normal to 3 at points of ᐀2 . The part
94
JÖRICKE AND SHCHERBINA
of the desired 3 which is contained in S5 \ {z1 · z2 · z3 = 0} is in coordinates (4) the direct product of ᐀2 with a smooth curve (16) t −→ R(t) = R1 (t), R2 (t), R3 (t) , t ∈ (−1, 1), running in the r-variables. (We choose the functions Rj below.) In other words, let ᐀2 (t), t ∈ (−1, 1), denote the following 2-torus, which is isotopic in S5 to ᐀2 : def
᐀2 (t) =
R1 (t)eiϕ1 , R2 (t)eiϕ2 , R3 (t)e−i(ϕ1 +ϕ2 ) : eiϕ1 ∈ S 1 , eiϕ2 ∈ S 1 .
(17)
Then the part of the desired 3 which is outside {z1 · z2 · z3 = 0} can be described by 3 \ {z1 · z2 · z3 = 0} = ∪t∈(−1,1) ᐀2 (t).
(18)
Now choose a smooth curve R(t), t ∈ (−1, 1), with R12 (t) + R22 (t) + R32 (t) = 3,
(19)
which has the following crucial properties: (a) near −1, R3 (t) = R1 (t) and R1 (t) → 0 for t → −1; (b) near +1, R3 (t) = R2 (t) and R2 (t) → 0 for t → 1; (c) R1 (0) = R2 (0) = R3 (0) = 1, and hence ᐀2 (0) = ᐀2 . For example, R can be obtained in the following way. Define a curve R˜ that satisfies (19) and such that R˜ 3 (t) = R˜ 1 (t) if t ∈ (−1, 0] and R˜ 3 (t) = R˜ 2 (t) if t ∈ [0, 1). Then ˜ The curve R is obtained by making R˜ (a)–(c) are satisfied for R replaced by R. smooth near zero in a suitable way. For simplicity, we also assume that R satisfies the following condition (d). (d) The curve t → R(t), t ∈ (−1, 1), has nondegenerate projection (R1 (t), R2 (t)) onto the (r1 , r2 )-plane; hence the image R((−1, 1)) is the graph of the function R3 over a smooth curve in the (r1 , r2 )-plane. Moreover, near t = 0, the curve t → R(t) has nondegenerate projection onto the r1 -axis as well as onto the r2 -axis. Figure 1 shows an example of a curve R that satisfies all four conditions. Condition (a) means that for t = −1, the 2-torus ᐀2 (t) degenerates: We may put ᐀2 (−1) to be equal to the circle def
def
᐀2 (−1) = *1 =
√ iϕ iϕ 0, 3e 2 , 0 : e 2 ∈ S 1 .
Analogously, for t = +1, we have the degeneration def
def
᐀2 (1) = *2 =
√ iϕ 3e 1 , 0, 0 : eiϕ1 ∈ S 1 .
Joining the circles *1 and *2 to the set (18) described above, we get the desired def
3 = ∪t∈(−1,1) ᐀2 (t) ∪ *1 ∪ *2 = ∪t∈[−1,1] ᐀2 (t).
(20)
95
A NONREMOVABLE 4-BALL
r3
R(0) = (1, 1, 1)
R(t), t ∈ (−1, 1)
r2 R(+1)
R(−1)
r1 Figure 1
Over each point R(t) with t ∈ (−1, 1), we have a 2-torus in ζ -coordinates; over R(1) and R(−1), we have a circle. The degeneration of the 2-torus to a circle for t = +1 and t = −1 is crucial to obtain a 3-sphere. To prove that (20) actually defines a 3sphere, we consider its covering by two √ solid tori Z1 and Z2 which intersect along a “collar” Ꮿ. Here Z1 contains *1 = {(0, 3eiϕ2 , 0) : eiϕ2 ∈ S 1 }, and near *1 , the set Z1 is given by
def Zδ1 = z1 , 3 − 2|z1 |2 · eiϕ2 , z¯ 1 e−iϕ2 : z1 ∈ (1 − δ)D, eiϕ2 ∈ S 1 . (21) Here δ > 0 is small. D is the open unit disc in the complex plane, S 1 is its boundary, and (1 − δ)D is the disc of radius (1 − δ) centered at zero. Globally, Z1 is given by the formula
Z1 = a1 (|z1 |) · z1 , a2 (|z1 |) · 3 − 2|z1 |2 · eiϕ2 ,
a3 (|z1 |) · z¯ 1 · e−iϕ2 : z1 ∈ (1 + δ)D, eiϕ2 ∈ S 1 (22) with smooth positive functions aj on (1+δ)D, depending only on the radius |z1 | and equal to 1 on (1 − δ)D (see (21)). These functions are uniquely determined by the condition that Z1 lies on the sphere 3 . Similarly,
Z2 = b1 (|z2 |) · 3 − 2|z2 |2 · eiϕ1 , b2 (|z2 |) · z2 ,
b3 (|z2 |) · z¯ 2 · e−iϕ1 : z2 ∈ (1 + δ)D, eiϕ1 ∈ S 1 , (23)
96
JÖRICKE AND SHCHERBINA
with smooth positive functions bj on (1+δ)D, depending only on the radius |z2 | and equal to 1 on (1 − δ)D. Then the collar Ꮿ = Z1 ∩ Z2 can be written as Ꮿ = R1 (t)eiϕ1 , R2 (t)eiϕ2 , R3 (t)e−i(ϕ1 +ϕ2 ) : t ∈ (−σ, σ ), eiϕ1 ∈ S 1 , eiϕ2 ∈ S 1 (24) for a suitable σ ∈ (0, 1). Here ᐀2 ⊂ Ꮿ = Z1 ∩ Z2 ; that is, aj (1) = bj (1) = 1 for j = 1, 2, 3. It is now clear that 3 is actually a 3-sphere. Moreover, since Tp ᐀2 ⊂ Tpc S5 for p ∈ ᐀2 , it follows from (19) and (c) that N is orthogonal to 3 at points of ᐀2 . Unfortunately, N does not extend to a nonsingular normal field on 3 (otherwise, removing from 3 a small closed 3-ball, we would be done by Lemma 3). This is easy to understand and is based on the following fact. Consider, for example, Z1 . Let ϕ0
ϕ0
Z1 2 be the set of points in Z1 with fixed argument ϕ2 = ϕ20 . Then Z1 2 is topologically a disc with nondegenerate projection onto the disc (1 + δ)D in the first coordinate; ϕ0
ϕ0
᐀2 ∩ Z1 2 projects onto the unit circle; the set *1 ∩ Z1 2 projects onto the origin. The set of nonvanishing normal vectors for Z1 in S5 retracts to a circle with nondegenerate projection onto the third coordinate. Indeed, a neighbourhood of Z1 on S5 can be
parametrized by
iϕ2 iϕ2 2 2 z1 , e , z3 −→ z1 , 3 − |z1 | − |z3 | · e , z3 ,
(25)
and Z1 can be given by an equation of the form z3 = h1 z1 , eiϕ2 = a3 |z1 | · z¯ 1 · e−iϕ2 .
(26) ϕ0
It is now easy to see that N has “nontrivial winding” along the circle ᐀2 ∩ Z1 2 . This is the obstruction for extending N. On the other hand, we see now that there is a smooth nonsingular normal vector field on 3 . Lemma 4. On every 3-sphere S 3 that is smoothly imbedded into S5 , there exists a smooth nonsingular vector field m that is normal to S 3 (normal with respect to the imbedding of S 3 into S5 ). Proof. Let E be the bundle over S 3 of the unit vectors that are tangent to S5 and normal to S 3 . Our goal is to prove that there exists a smooth section of this bundle. Let B13 and B23 be a covering of S 3 by two smooth closed 3-balls such that B13 ∩ B23 is a smooth 2-sphere. We denote this 2-sphere by S 2 . Since the restriction of E to B13 is trivial, there exists a smooth section s1 of E over B13 . Since the fibers of E are diffeomorphic to the circle S 1 and since the restriction of E to B23 is also trivial, the restriction of s1 to S 2 defines (in the last trivialization) an element of the homotopy group π2 (S 1 ). Then, from the triviality of π2 (S 1 ), it follows that there
97
A NONREMOVABLE 4-BALL
n(ζ )
τ
Figure 2
exists an extension s2 of s1 | S 2 to a smooth section of E over B23 . Define a section s of E over the whole of S 3 to be equal to s1 over B13 and equal to s2 over B23 . After smoothing s, if necessary, we obtain the required section of E, which is the desired smooth nonsingular normal vector field m on S 3 . The plan of the last step of the proof of Proposition 1 is the following. We replace the pair (Ꮿ, m) (the collar (see (24)) together with the restriction of the transversal vector ˜ ,m ˜ ) that is identical to the previous one outside field m to this collar) by a new pair (Ꮿ ˜ and m ˜ | ᐀2 = N (see a smaller tubular neighbourhood of ᐀2 , and moreover, ᐀2 ⊂ Ꮿ (15) for the definition of N). def ˜ ,m ˜ ) is the fact that N and n = m | ᐀2 are The key to the existence of the pair (Ꮿ homotopic as normal vector fields on ᐀2 imbedded into S5 . Indeed, we give below an explicit description of the homotopy N(t, ζ ), ζ ∈ ᐀2 , t ∈ [0, 1], of N and n, and we associate with this homotopy a homotopy of orthogonal transformations of the fibers of the normal bundle of ᐀2 . Using exponential mappings, this gives an isotopy of imbeddings of a small tubular neighbourhood of ᐀2 on S5 into S5 , which fixes ᐀2 . This isotopy describes how to bend a small part of the collar Ꮿ around ᐀2 to get a ˜ . The isotopy extension theorem (see below for more details) allows small part of Ꮿ ˜. us to get the whole collar Ꮿ For psychological reasons we include Figure 2, which is an example of how to ˜ around ᐀2 in dependence on the relation between n(ζ ) and N(ζ ). The figure bend Ꮿ ˜ with a normal to ᐀2 3-manifold through a given point shows the intersection of Ꮿ ζ ∈ ᐀2 . It corresponds to the case n(ζ ) = −N(ζ ). The aforementioned intersection is contained in the plane spanned by n(ζ ) and τ (ζ ) (compare with the notation used below for the description of the homotopy N(t, ζ )). The intersection of the old collar Ꮿ with the 3-manifold is the τ (ζ ) axis.
98
JÖRICKE AND SHCHERBINA
The following lemma realizes the plan of the proof of Proposition 1. Lemma 5. There is a diffeomorphism F of S5 which is identical on ᐀2 and outside a small tubular neighbourhood of ᐀2 such that: (a) F∗ (m) | ᐀2 = N; (b) the vector field N is orthogonal to F (3 ) on ᐀2 . Here, as usual, F∗ (m) denotes the pushforward on vector fields induced by F . Lemma 5 implies Proposition 1. We get the desired 3-ball B 3 by removing from the smooth 3-sphere F (3 ) a small closed 3-ball that does not meet ᐀2 . From the transversal vector field F∗ (m) on F (3 ), we get the required normal vector field n on B 3 . Indeed, for each point p in F (3 ), we take the orthogonal projection of the vector F∗ (m)(p) onto the normal space to F (3 ). Since, by Lemma 5, F∗ (m) | ᐀2 = N is already orthogonal to F (3 ), the normal field n coincides with F∗ (m) = N on ᐀2 . The proof of Proposition 1 (and hence of Theorem 1) is completed with the proof of Lemma 5. Proof of Lemma 5. Recall that the key for the existence of the diffeomorphism def F is the fact that N and n = m | ᐀2 are homotopic as normal vector fields on ᐀2 imbedded into S5 . We start with the explicit description of the homotopy N(t, ζ ), ζ ∈ ᐀2 , t ∈ [0, 1], of n and N. More precisely, we construct a smooth mapping N(t, ζ ), ζ ∈ ᐀2 , t ∈ [0, 1], such that for each ζ and t, the vector N(t, ζ ) is tangent to S5 and normal to ᐀2 and, moreover, N(0, ζ ) = n(ζ ) and N(1, ζ ) = N(ζ ). Let N be the bundle over ᐀2 of the unit vectors that are tangent to S5 and normal to ᐀2 . The fiber of this bundle over a point ζ ∈ ᐀2 is the unit 2-sphere S 2 (ζ ) in the 3-dimensional linear space which is tangent to S5 and normal to ᐀2 (the latter space considered as a real linear subspace of the tangent space Tζ C3 of C3 ). In the following we identify Tζ C3 and C3 . To describe the homotopy of N and n, we use the round metric on S 2 (ζ ) (i.e., the metric induced from the Euclidean metric on C3 ) and geodesics in this metric. Recall that a geodesic on S 2 (ζ ) in the round metric, which joins two points p1 and p2 , is an arc of the “big” circle, which is the intersection of S 2 (ζ ) with the 2-plane through the points 0, p1 , and p2 . We always take the shorter arc (in our case, it will always be a quarter, not three quarters of the big circle) without mentioning this explicitly further. Consider a unit vector field τ on ᐀2 which is tangent to 3 and normal to ᐀2 (there are two such vector fields opposite to each other). Divide [0, 1] into two intervals of equal length, J1 = [0, 1/2] and J2 = [1/2, 1]. For t ∈ J1 = [0, 1/2], ζ ∈ ᐀2 , we let the point N(t, ζ ) move with some velocity s(t) (in contrast to the parametrization by length) along the geodesic from n(ζ ) to τ (ζ ), starting with n(ζ ) for t = 0 and reaching τ (ζ ) for t = 1/2. In other words, N(t, ζ ) is the point on the (shorter) geodesic between n(ζ ) and τ (ζ ) with geodesic t 1/2 distance 0 s(t ) dt from n(ζ ). Moreover, 0 s(t ) dt = π/2, which is the length of the quarter circle, the geodesic joining n(ζ ) and τ (ζ ).
A NONREMOVABLE 4-BALL
99
The velocity s depends only on t ∈ [0, 1/2], not on ζ ∈ ᐀2 ; it is of class C ∞ , is positive on the interior of the interval, and vanishes of infinite order at the endpoints. For t ∈ J2 = [1/2, 1], we let the point N(t, ζ ) move on S 2 (ζ ) with the velocity s(t − (1/2)), t ∈ J2 , (s as defined above), along the geodesic from N(1/2, ζ ) = τ (ζ ) to N(ζ ). (For each ζ ∈ ᐀2 , this geodesic is a quarter of a big circle.) Here ζ is an arbitrary point of ᐀2 . Since the velocity s vanishes of infinite order at 0 and 1/2, we get a C ∞ map N : [0, 1] × ᐀2 → N such that N(t, ζ ) = n(ζ ) for t = 0, and N(t, ζ ) = N(ζ ) for t = 1. Define the homotopy R(t, ζ ), t ∈ [0, 1], of orthogonal transformations of the fibers of N associated with N(t, ζ ) in the following way. R(t, ζ ) = Id (the identity on S 2 (ζ )) for t = 0, ζ ∈ ᐀2 . For t ∈ [0, 1/2], define t R(t, ζ ) to be rotation of S 2 (ζ ) by the angle 0 s(t ) dt around the axis orthogonal to the 2-plane containing the geodesic in the direction from n(ζ ) towards τ (ζ ). R(t, ζ ) is chosen in such a way that N(t, ζ ) = R(t, ζ )n(ζ ). Note that R(1/2, ζ ) is rotation by the angle π/2 around the mentioned axis. t For t ∈ [1/2, 1], define R(t, ζ ) to be R(1/2, ζ ) followed by rotation by the angle 1/2 s(t − 1/2) dt around the new axis, which is now orthogonal to the 2-plane that contains the geodesic joining τ (ζ ) and N(ζ ). Note that R(1, ζ ) n(ζ ) = N(ζ ).
(27)
The exponential mapping applied in the normal directions to ᐀2 defines a diffeomorphism of a small neighbourhood of ᐀2 in the normal bundle of ᐀2 onto a small open subset of S5 which contains ᐀2 (a “tubular neighbourhood of ᐀2 in S5 ”). Composing the homotopy R(t, ζ ) with the aforementioned exponential mappings, we get a smooth isotopy of imbeddings of the mentioned tubular neighbourhood of ᐀2 into S5 . This isotopy fixes ᐀2 and is infinitesimally orthogonal on ᐀2 . The isotopy extension theorem (see [Hi, Theorem 1.4]) allows us to extend this isotopy to a diffeotopy of S5 . Indeed, put in the notation of [Hi] M = S5 , U equal to the mentioned tubular neighbourhood of ᐀2 and A equal to the closure of a smaller tubular neighbourhood of ᐀2 , A ⊂ U . The openness condition required in Theorem 1.4 for the isotopy of U is then satisfied. Let F be the diffeomorphism of the diffeotopy of Theorem 1.4 which is obtained for t = 1. It coincides with the identity on ᐀2 and outside some tubular neighbourhood of ᐀2 . By construction (recall that R(t, ζ ) are orthogonal transformadef tions of the fibers of N and (27) holds with n(ζ ) = m(ζ ), ζ ∈ ᐀2 ), it follows that F∗ (m) = N on ᐀2 and F (3 ) is orthogonal to N on ᐀2 . This proves Lemma 5. References [Ch] [ChSt]
E. M. Chirka, Complex Analytic Sets (in Russian), “Nauka”, Moscow, 1985; English transl. in Math. Appl. (Soviet Ser.) 46, Kluwer Acad. Publ., Dordrecht, 1989. E. M. Chirka and E. L. Stout, “Removable singularities in the boundary” in Contributions to Complex Analysis and Analytic Geometry, Aspects Math. E 26, Vieweg,
100 [Di] [Do]
[Du]
[Fo] [FoSt] [GoGu] [Gr]
[Hi] [Jö1] [Jö2] [Jö3] [Lu] [St]
[We]
JÖRICKE AND SHCHERBINA Braunschweig, 1994, 43–104. A. Dimca, private communication. A. V. Domrin, On the spanning of maximally complex cycles by CR-submanifolds (in Russian), Mat. Zametki 60 (1996), 776–777; English transl. in Math. Notes 60 (1996), 582–583. J. Duval, “Surfaces convexes dans un bord pseudoconvexe” in Colloque d’Analyse Complexe et Géométrie (Marseille, 1992), Astérisque 217, Soc. Math. France, Montrouge, 1993, 6, 103–118. F. Forstneriˇc, Regularity of varieties in strictly pseudoconvex domains, Publ. Mat. 32 (1988), 145–150. F. Forstneriˇc and E. L. Stout, A new class of polynomially convex sets, Ark. Mat. 29 (1991), 51–62. M. Golubitsky and V. Guillemin, Stable Mappings and Their Singularities, Grad. Texts in Math. 14, Springer-Verlag, New York, 1973. M. L. Gromov, Convex integration of differential relations, I (in Russian), Izv. Akad. Nauk SSSR Ser. Mat. 37 (1973), 329–343; English transl. in Math. USSR-Izv. 7 (1973), 329–343. M. W. Hirsch, Differential Topology, Grad. Texts in Math. 33, Springer-Verlag, New York, 1976. B. Jöricke, Removable singularities of CR-functions, Ark. Mat. 26 (1988), 117–143. , Boundaries of singularity sets, removable singularities, and CR-invariant subsets of CR-manifolds, J. Geom. Anal. 9 (1999), 257–300. , Removable singularities of Lp CR-functions on hypersurfaces, J. Geom. Anal. 9 (1999), 429–456. G. Lupacciolu, Characterization of removable sets in strongly pseudoconvex boundaries, Ark. Mat. 32 (1994), 455–473. E. L. Stout, “Removable singularities for the boundary values of holomorphic functions” in Several Complex Variables: Proceedings of the Special Year Held at the MittagLeffler Institute (Stockholm, 1987/1988), Math. Notes 38, Princeton Univ. Press, Princeton, 1993, 600–629. J. Wermer, On a domain equivalent to the bidisk, Math. Ann. 248 (1980), 193–194.
Jöricke: Department of Mathematics, University of Uppsala, S-75106, Uppsala, Sweden;
[email protected] Shcherbina: Department of Mathematics, University of Göteborg, S-412 96 Göteborg, Sweden;
[email protected]
Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3 ALAN T. HUCKLEBERRY, STEFAN KEBEKUS, and THOMAS PETERNELL
Contents 1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Setup and general results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 The case where G is semisimple . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Main methods for the elimination of solvable groups . . . . . . . . . . . . . . . . . . . . . . 106 Elimination of the solvable groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Proof of the C∗ × C∗ -action argument . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Considerations concerning the C∗ -action argument . . . . . . . . . . . . . . . . . . . . . . . 111 The case of a discrete fixed-point set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114 Proof of the C∗ -action argument if F is a curve . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Some further remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
1. Introduction. This note is motivated by the following classical problem: Is there a complex structure on the 6-sphere S 6 , in other words, is S 6 a complex manifold? It has been known since the paper [BS] that all other spheres S 2n , n > 1, do not even admit an almost complex structure. On the other hand S 6 admits many almost complex structures; see the presentation in [Ste, Part III, Sect. 41.17]. It is generally believed that none of them is integrable. Suppose S 6 has a complex structure X. Then by [CDP] every meromorphic function on X is constant. Moreover X is not Kähler, since b2 (X) = 0. Therefore the problem is quite inaccessible by standard methods of complex geometry. In this paper we prove the following theorem. Theorem 1.1. X is not almost homogeneous. In other words, the automorphism group Aut ᏻ (X) does not have an open orbit. This is related as follows to the question of existence of complex structures on the underlying differentiable manifold of P3 (C): As above, assume X = S 6 has the structure of a complex manifold. For p ∈ X, let πp : Xp → X denote the blow-up of X at p with πp−1 (p) =: Ep . Of course Ep = P2 (C). Since sufficiently small neighborhoods of a hyperplane in Pn (C) are differentiably identifiable with neighborhoods Received 5 May 1999. 1991 Mathematics Subject Classification. Primary 53C15; Secondary 32M05, 32M12. 101
102
HUCKLEBERRY, KEBEKUS, AND PETERNELL
of a blown-up point, it follows that Xp is diffeomorphic to P3 (C). Suppose that for given points p, q ∈ X there exists a biholomorphic mapping ψ : Xp → Xq . Note that, since ψ(Ep ) generates the cohomology H 2 (Xq , Z), it follows that ψ(Ep ) ∩ Eq = ∅. If C := ψ(Ep ) ∩ Eq = Eq , then πq | ψ(Ep ) would be a modification that maps the curve C ⊂ ψ(Ep ) ∼ = P2 (C) to a point. Since this is impossible, ψ(Ep ) = Eq , and ψ induces an automorphism gψ : X → X with gψ (p) = q. Let Ᏺ denote the orbit space X/ Autᏻ (X). For ξ, η ∈ Ᏺ with ξ = η and p ∈ ξ, q ∈ η representatives, it follows that Xp and Xq are not biholomorphically equivalent. Of course Ᏺ may very well be non-Hausdorff and, from certain points of view, an unreasonable parameter space, but by abuse of language we nevertheless refer to it as a family of complex structures on P3 (C). Since Aut ᏻ (X) does not have an open orbit, its generic orbit in X is at least 1-codimensional. In particular, for general p, the semiuniversal deformation space of Xp is at least 1-dimensional. In this sense we have the following consequence of Theorem 1.1. Corollary 1.2. If S 6 admits a complex structure, then there is a 1-dimensional family of complex structures on P3 . Corollary 1.3. Let X be a complex structure on S 6 . Then X carries at most two linearly independent holomorphic vector fields: h0 (T X) ≤ 2. Proof. If the generic Aut ᏻ (X)-orbit is 2-dimensional, then there are globally defined holomorphic vector fields V1 and V2 such that U := {p ∈ X : V1 ∧ V2 (p) = 0} is a dense, Zariski-open subset. If V3 is any holomorphic field on X, then, since V1 ∧ V2 ∧ V3 ≡ 0, there exist m1 , m2 ∈ ᏻ(U ) with V3 = m1 V1 + m2 V2 on U . By explicit computation in local coordinates, one verifies that these coefficients are in fact meromorphic on X and the relation between the fields extends to X. On the other hand, it has been proven in [CDP] that X possesses only the constant meromorphic functions. Thus, it follows in this case that dimC (X, T X) = 2. The other cases, that is, those where the generic orbit dimension is smaller, are handled analogously. By semicontinuity we obtain from Corollary 1.3 the following corollary. Corollary 1.4. The inequality h0 (T X ⊗ L) ≤ 2 holds for generic L ∈ Pic0 (X). Our theorem should be seen in a more general context, or rather program, for investigating complex structures on S 6 . Namely, we would like to prove that the tangent bundle T X is stable with respect to a Gauduchon metric. It would then follow that X carries a Hermite-Einstein connection (see [LY]). At this point one could employ powerful analytic tools to investigate the problem further. In order to prove the stability, it would seem necessary to verify the statements: (A) H 0 (X, T X ⊗ L) = 0 for all L ∈ Pic0 (X); (B) H 0 (X, 1X ⊗ L) = 0 for all L ∈ Pic0 (X). Hence our theorem is the first approach to (A). The next step should be to investigate
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
103
group actions in general. We feel that the study of group actions on highly nonalgebraic manifolds is of independent interest, and we hope the methods that we develop in this paper are useful in a broader context. 2. Setup and general results. In this section we gather results that are used throughout the paper and give the general setup. 2.1. Setup and outline of proof. The entire paper is devoted to proving that a complex structure on S 6 cannot be almost homogeneous. We argue by contradiction. More precisely, we make the following assumption. Assumptions 2.1. Throughout the paper, X denotes a complex manifold whose underlying differentiable manifold is S 6 , and G denotes a connected, simply connected complex Lie group that acts holomorphically and almost effectively on X and has an open orbit G · x0 =: . The G-isotropy at x0 is always denoted by . Remark 2.2. Recall that the automorphism group Autᏻ (X) is a complex Lie group that acts holomorphically on X. Of course it may not be simply connected. However, its universal cover G is also a complex Lie group that is acting holomorphically. Thus, since we are willing to accept a discrete ineffectivity, I = {g ∈ G : g(x) = x, ∀x ∈ X}, the assumptions above are equivalent to assuming that Aut ᏻ (X) has an open orbit, that is, that X is almost homogeneous. The structure of the proof can be outlined as follows: Using the fact that X has only finitely many analytic hypersurfaces in connection with topology and Lie theory of the situation at hand, it is shown in Sections 3 that if G exists, then it must be solvable. In Section 4, we give a number of methods that are applied in Section 5 in order to rule out this case, too. Two of the methods involve a lengthy proof that we have preferred to give separately in Section 6 and Sections 7–9. The technical heart of this work lies in Sections 7–9 where we rule out the following situation: we suppose that there exists a subgroup C∗ < Aut ᏻ (X) and that E := X \ is an irreducible, nonnormal, rational surface where C∗ acts as an algebraic transformation group. The nonnormal locus N ⊂ E and its preimage N˜ in the normalization E˜ play a key role in the remainder of the proof. It follows from the Betti number information on X that N and N˜ are connected and have the same first Betti numbers. An analysis of the ˜ + 1 (see Sections 8–9). C∗ -action on E shows, however, that in fact b1 (N) = b1 (N) 2.2. Meromorphic functions, discrete isotropy, and the dimension of E. Since there are no nonconstant meromorphic functions on X by [CDP], we have that dim G = h0 (T X) = 3. For this, observe that if h0 (T X) ≥ 4, then KX has two linearly independent sections. Proposition 2.3. The G-isotropy at a point x0 ∈ is discrete. Furthermore, E := X \ is nonempty and purely 1-codimensional in X.
104
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Proof. Since dim G = 3, it is clear that is discrete. In particular, since χTop (X) > 0, every vector field has zeroes and thus E = ∅. If the vector fields X1 , X2 , X3 form a basis of (X, T X), then E = {X1 ∧ X2 ∧ X3 = 0}. In particular, E is of pure codimension 1 and −KX = ᏻX irreducible components of E and λi > 0.
λi Ei , where Ei are the
Let G = R · S be the Levi-Malcev decomposition, so that R is the radical of G, that is, the maximal connected solvable normal subgroup of G and S is semisimple; moreover R ∩ S is discrete. Since a semisimple complex Lie group has dimension at least 3, we have only two cases; namely, that G is semisimple or solvable. 2.3. Topology of and E. Since the topology of X is well known, the Betti numbers of and E are closely related. Notation 2.4. If Y is a topological space, set bi (Y ) := hi (Y ; Q). Proposition 2.5. For q ∈ {1, . . . , 4}, the following equation of Betti numbers holds: bq (E) = b5−q (). Proof. Recall from algebraic topology that there is an exact cohomology sequence associated to the pair (X, E): · · · −→ H q (X; Q) −→ H q (E; Q) −→ H q+1 (X, E; Q) −→ H q+1 (X, Q) −→ · · · . Since X is homeomorphic to the 6-sphere, bq (X) = bq+1 (X) = 0 for all numbers 1 ≤ q ≤ 4, so that H q (E; Q) ∼ = H q+1 (X, E; Q). An application of the Alexander duality theorem yields H q+1 (X, E; Q) ∼ = H5−q (; Q) and, hence, the claim. 2.4. Fixed points of reductive groups. Many of our arguments involve linearization of group actions at fixed points. We recall the following theorem on faithful linearization. Theorem 2.6. Assume that a reductive complex Lie group H acts holomorphically on a complex manifold M, and assume that x ∈ M is an H -fixed point. For h ∈ H , let T (h) : Tx → Tx be the tangential map. Then there exist neighborhoods U of x and V of 0 ∈ Tx M and an isomorphism φ : U → V such that φ ◦ h = T (h) ◦ φ for all h in a given maximal compact subgroup of H . Furthermore, if W is a neighborhood of the maximal compact subgroup and U ⊂ U open so that W U ⊂ U , then (T (w) ◦ φ)(x) = (φ ◦ w)(x) for all x ∈ U . In this setting we call U a linearizing neighborhood of x. See [Huc] or [HO, p. 11f] for more information about linearization. The fixed-point set of a reductive group also possesses certain topological properties. In our special case this implies the following proposition.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
105
Proposition 2.7. Suppose that Aut(X) contains a subgroup I ∼ = C∗ . Let F be ∼ the fixed-point set of I . Then either F = P1 , or F consists of two disjoint points. Proof. As a first point, note that it follows from the linearization theorem that F is smooth; in particular, its irreducible components are disjoint. Furthermore, χTop (F ) = χTop (X) (see [KP] for a proof in the algebraic setting which carries over immediately to, for example, compact complex spaces.) In our situation, for p ∈ F , the group I stabilizes the complement X \ {p} ∼ = R6 and thus F \{p} is acyclic (see [Bre]). Consequently, F consists of two points or it is irreducible. Thus it remains to show that F is at most 1-dimensional. Suppose F is 2-dimensional. Since H 4 (X, Z) = 0, it follows that c2 (T X|F ) = 0 and, since H 2 (X, Z) = 0, the normal bundle NF,X is topologically trivial. Consequently 0 = c2 (T X|F ) = c2 (F ) + NF,X · KF = χTop (F ), which is contrary to χTop (F ) = 2. 3. The case where G is semisimple. In this section we treat the case that G is semisimple. Since dim G = 3, it follows that G ∼ = SL2 . The most basic property of almost transitive SL2 -actions on 3-folds is found in the following lemma. Lemma 3.1. The G-action on X does not have a fixed point. In particular, if E˜ i is the normalization of Ei , then E˜ i is smooth. Proof. Assume that x ∈ X is a G-fixed point. Linearize the G-action at x and recall from the representation theory of SL2 that no 3-dimensional SL2 -representation space is SL2 -almost homogeneous (see [MU, Lemma 1.12] for a more detailed proof). This is a contradiction. Lemma 3.2. If Ei ⊂ E is an irreducible component, then G acts almost transitively on Ei . In particular, the normalization E˜ i is either rational or a Hopf surface. Proof. If G did not have an open orbit in Ei , then by Lemma 3.1 all G-orbits would be 1-dimensional. Thus the normalization E˜ i would be the product P1 (C)×C, where G operates transitively on the first factor and C is a smooth curve. In particular, it follows that a maximal torus T ∼ = C∗ would have two disjoint copies of C as a fixedpoint set in Ei . Note that, since the normalization map E˜ i → Ei is equivariant with respect to G ∼ = SL2 , the T -fixed-point set in Ei is the disjoint union of two curves, contrary to Proposition 2.7. The “In particular . . . ” clause of the statement results from the classification of the almost homogeneous surfaces; see [HO, p. 92] or [Pot]. Now we exclude both possibilities in the following proposition. Proposition 3.3. The group G is not semisimple.
106
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Proof. It follows from Lemma 3.2 that the normalization E˜ i of a component of E is an almost homogeneous Hirzebruch surface, P2 (C) equipped with either the defining representation of SO3 (C) or the representation of SL2 with a fixed point or a homogeneous Hopf surface (see [HO] or [Pot]). Consequently, a maximal torus T ∼ = C ∗ < G has only isolated fixed points in X and, if Ei were rational, it would already have three or four fixed points in Ei alone. Thus we may assume that every such component is a homogeneous Hopf surface. But this is also not possible, because such a surface is a homogeneous space G/H , where H 0 is unipotent; that is, T has no fixed points. 4. Main methods for the elimination of solvable groups. We begin by presenting several methods that involve the normalizer of subgroups of isotropy groups. Recall that G is a connected, simply connected complex Lie group acting almost transitively on X with an open orbit = G · x0 . By Proposition 3.3 we may assume that G is solvable and, by the remarks in Section 2, that the isotropy := Gx0 is discrete. 4.1. The normalizer arguments. Throughout the paper if H is a subgroup of G, then N(H ) denotes its normalizer in G and N(H )0 its identity component. If H is discrete, it follows that N(H )0 = Z(H )0 , where Z(H ) denotes its centralizer. 4.1.1. The 2-dimensional normalizer argument. Note that as a subgroup of the discrete group , the ineffectivity I is itself discrete and, being normal, is therefore central. Thus, if H < is not normal in G, then it acts nontrivially on . Proposition 4.1 (2-dimensional normalizer argument). If H < is an arbitrary subgroup, then dim N(H ) = 2. Proof. Suppose not and note that the 2-dimensional orbit F := Z(H )0 · x0 is a component of the set of H -fixed points in . Since the full set X H of H -fixed points is closed, F is Zariski open in its closure F , which is 1-codimensional in X. Observe that {gF | g ∈ G} is an infinite set of hypersurfaces and consequently there exists a nonconstant meromorphic function on X (see [Kra, Thm. 1]1 ). However, such a function does not exist (see [CDP]). 4.1.2. The 1-dimensional normalizer argument Proposition 4.2 (1-dimensional normalizer argument). If H < is any subgroup and dim N(H ) = 1, then N(H )0 ∩ is a lattice of rank 2. Proof. The 1-dimensional group Z := Z(H )0 acts transitively on every component of the H -fixed-point set H ⊂ . These components are of course Zariski open in their closures. Set C := Z · x0 and note that if C = C, then C · E > 0, contrary to h2 (X; Z) = 0. Thus the orbit C is a compact 1-dimensional complex torus. 1 See
[FF] for a more general result.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
107
4.2. Arguments involving reductive subgroups of Aut(X). In many cases we are able to rule out the existence of certain subgroups of Aut(X) under additional assumptions on the topology of . For convenience, use the following notation. Notation 4.3. If Y and Z are topological spaces, we say that Y has “Betti-type Z” if all the Betti numbers of Y and Z agree. 4.2.1. The C∗ × C∗ -action argument Proposition 4.4. If E is connected and has at least two irreducible components, then there does not exist an (effective) action of (C∗ )2 on X. The proof is given in Section 6. 4.2.2. The torus-action argument Proposition 4.5 (Torus-action argument). The automorphism group of X does not contain a compact complex torus. In particular, if < is an arbitrary subgroup that is normal in G and contained in a 1-dimensional Abelian subgroup A, then rank( ) = 1. Proof. Assume that T < Aut(X) is a compact complex torus. Since χTop (X) = 2, the Lefschetz fixed-point formula shows that every vector field must have zeros. In particular, T must have a fixed point in X. On the other hand, all representations of T are trivial, a contradiction to the theorem on faithful linearization! 4.2.3. The C∗ -action argument Proposition 4.6 (C∗ -action argument). If the open orbit ⊂ X has the same Betti numbers as the circle S 1 , then the automorphism group of X does not contain a subgroup that is isomorphic to C∗ . The proof of the preceding proposition turns out to be a rather involved matter. We give it in Sections 7–9. 4.3. Topological observations. Recall that G is solvable and simply connected, for example, it is a cell, and = G/ , where is discrete. Thus the topology of is completely determined by . It follows directly from Proposition 2.5 that = {e}. Restriction on the Betti type of Proposition 4.7 (Betti-type argument). The open orbit ⊂ X does not have the Betti type of a S 1 × S 1 . Proof. Assume to the contrary. Then, by Proposition 2.5, we have b4 (E) = 2 and b3 (E) = 1. Similar to the proof of Proposition 2.5, the sequence of the pair (E, ) and Alexander duality give 0 −→ H 0 (X, E; Q) −→ H 0 (X; Q) −→ H 0 (E; Q) −→ H 1 (X, E; Q) −→ · · · =H6 (;Q)=0
=Q
=H5 (;Q)=0
108
HUCKLEBERRY, KEBEKUS, AND PETERNELL
so that b0 (E) = 1. In particular, E is connected and has exactly two irreducible components E1 and E2 . Now E1 |E2 is a Cartier divisor that is effective, nonzero, but homologeously equivalent to zero, so that no desingularization of E2 is Kähler. By the Mayer-Vietoris sequence · · · −→ H 3 (E) −→ H 3 (E1 ) ⊕ H 3 (E2 ) −→ H 3 (E1 ∩ E2 ) −→ · · · , =0
we obtain b3 (E1 ) + b3 (E2 ) ≤ b3 (E) = 1. Hence we may assume that b3 (E1 ) = 0. Let ν : E˜ 1 → E1 be the normalization. Let N ⊂ E be the (set-theoretical) nonnormal locus and N˜ := ν −1 (N). The sequence · · · −→ H q (E1 ) −→ H q E˜ 1 ⊕ H q (N) −→ H q N˜ −→ · · · , q ≥ 1, yields b3 (E1 ) ≥ b3 (E˜ 1 ) so that b3 (E˜ 1 ) = 0. (See [BK, Prop. 3.A.7, p. 98] for an explanation of the sequence.) Now consider the minimal desingularization π : Eˆ 1 → E˜ 1 . Since R q π∗ (Z) is a skyp,q scraper sheaf with support only finitely many points for q > 0, E2 := H p (E˜ 1 , R q p,q ∼ p,q π∗ (Z)) = 0 for all p, q > 0. Thus, E2 = E∞ in the Leray spectral sequence, and since R 3 π∗ (Z) = 0 and H 3 (E˜ 1 , Z) = 0, we obtain b3 (Eˆ 1 ) = 0. Thus b1 (Eˆ 1 ) = 0, and the classification of surfaces yields that Eˆ 1 is Kähler2 . This is a contradiction!
5. Elimination of the solvable groups. Our goal here is to eliminate the possibility that X is almost homogeneous with respect to the action of a solvable group. The proof requires a bit of the special knowledge given by the classification of the simply connected 3-dimensional solvable Lie groups. 5.1. Classification of the relevant Lie groups. We now recall the classification mentioned above (see, e.g., [Jac]). In this case, G is biholomorphic to C3 as a complex manifold and the group structure is isomorphic to one of the following: G0 : This is the well-known Abelian group C3 . G1 : We could also denote this group by G2 (0). The multiplication is given as a1 , b1 , c1 a2 , b2 , c2 = a1 + a2 , ea1 b2 + b1 , c1 + c2 . G2 (τ ): Here τ is any complex number other than zero. The multiplication is a1 , b1 , c1 a2 , b2 , c2 = a1 + a2 , ea1 b2 + b1 , eτ a1 c2 + c1 . G3 : The multiplication is a1 , b1 , c1 a2 , b2 , c2 = a1 + a2 , ea1 b2 + b1 + a1 ea1 c2 , ea1 c2 + c1 . 2 Actually
are Kähler.
it is easy to see that κ(Eˆ 1 ) = −∞ so that we do not have to use the fact that K3-surfaces
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
109
H3 : This is the Heisenberg group, where the multiplication is
a1 , b 1 , c 1 a 2 , b 2 , c 2 = a 1 + a 2 , b 1 + b 2 + a 1 c 2 , c 1 + c 2 .
For a detailed study of the discrete subgroups of such groups, see [ES, Sect. 1]. 5.2. The elimination. Here we eliminate the possibility of an action of a solvable group by utilizing a general strategy along with some knowledge of the Lie groups that occur. Proposition 5.1. If is normal in G and A < G is any closed connected Abelian subgroup, then ⊂ A. In particular, the group G is not isomorphic to G0 ∼ = C3 . Proof. Assume to the contrary, that is, that ⊂ A. Since G/A is acyclic, G/ has the same homotopy type as A/ , that is, the same homotopy type as a real torus. Our argument depends on rank(). rank() = 1: Then has the Betti type of the circle S 1 . Take a 1-dimensional subgroup A < G, A ∼ = C with ⊂ A and observe that A / ∼ = C∗ acts nontrivially on X, contrary to the C∗ -action argument in Proposition 4.6. rank() = 2, 3, 4: Betti number considerations show that the assumptions of Proposition 4.4 are satisfied. However, the assumption on rank() together with the torusaction argument allow us to construct a (C∗ )2 -action coming from the group A. rank() = 5: Here G = A is Abelian and has two ends so that Proposition 4.4 may not be applied. Let E0 ⊂ E be an irreducible component and x0 ∈ E0 a generic point so that the dimension of the G-orbit through x0 is maximal. We consider the isotropy group Gx0 , which is positive-dimensional. Let H < Gx0 be a closed 1dimensional connected subgroup and note that, since rank() = 5, H ∩ = {e}: otherwise the image of would be a lattice of rank 5 in G/H ∼ = C2 . Thus, we obtain ∗ ∼ an action of H /( ∩ H ) = C on X which fixes the closure of the G-orbit G · x0 pointwise. If G · x0 is a point, then G, and thus H , fix E0 pointwise, contrary to Proposition 2.7. If G · x0 is 1-dimensional and closed, then it is an elliptic curve. In this case Fix(H ) also does not have the required properties of Proposition 2.7. Finally, if G.x0 is 1-dimensional and is not closed, then G0x0 ∩ has rank 4. In this case, after moding out by ineffectivity, the 2-dimensional compact complex torus T := G0x0 /(G0x0 ∩ ) acts holomorphically on X with a nonempty fixed-point set. Thus, contrary to almost ineffectivity of the G-action, it acts trivially; see the torusaction argument in Proposition 4.5. This finishes the proof. A weaker statement holds if is not necessarily normal. Lemma 5.2. For all groups A < G with A ∼ = C, we have ⊂ A. Proof. Assume to the contrary. Again the Betti types of G/ and of A/ agree. We may assume that is not normal.
110
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Since A is contained in the normalizer of , the 2-dimensional normalizer argument implies that A = NG ()0 . In this setting the 1-dimensional normalizer argument yields that ∼ = Z2 , contrary to the Betti-type argument. We need the following technical lemma on centralizers of elements in G. ∼ G1 , G2 (τ ), G3 , or H3 and g ∈ G, then dim ZG (g) ≥ 1 and Lemma 5.3. If G = one of the following holds: (1) dim ZG (g) ≥ 2, or (2) ZG (g)0 ⊂ (0, C, C), or (3) ZG (g)0 = ZG (g), that is, ZG (g) is connected. Here ZG (g) denotes the set of elements commuting with g. Proof. The statement that dim ZG (g) ≥ 1 can be checked directly. To show that one of (1), (2), or (3) holds, for any number a ∈ C define the set Za := (b, c) ∈ C2 | (a, b, c) ∈ G commutes with g . Note that it is sufficient to show that, for all a, either dim Za > 0 or #Za ∈ {0, 1} holds. The last statement follows by an elementary calculation of commutators. Now we start with the the following proposition. Proposition 5.4. The group G is not solvable. Proof. By Proposition 5.1, we may assume that G ∼ = G1 , G2 (τ ), G3 , or H3 . If G∼ = G1 or H3 , then for all g ∈ G we have that dim ZG (g) ≥ 2. Thus, if ZG ⊂ G denotes the centralizer of G, then, by the 2-dimensional normalizer argument, < ZG . A direct calculation shows that ZG is contained in a connected Abelian subgroup, contrary to Proposition 5.1. Now assume that G ∼ = G2 (τ ) or G3 . Note that G ∼ = A G where G = (0, C, C) is the commutator group of G, and A as well as G are connected and Abelian. Furthermore, dim ZG (g ) = 2 for all g ∈ G . Thus, by the 2-dimensional normalizer argument, ∩ G = {e}, and the projection π1 : G → A is an injective group morphism, if restricted to . This shows that is Abelian, which in turn implies that < ZG (γ ) for all γ ∈ . If dim ZG (γ ) ≥ 2 for all γ ∈ , then is again central and contained in a connected Abelian subgroup. Thus the same argument as above applies. Thus we may assume that there exists a γ ∈ with dim ZG (γ ) = 1. If ZG (γ ) is connected, then ⊂ ZG (γ ) and we obtain a contradiction to Lemma 5.2. If ZG (γ ) is not connected, then Lemma 5.3 asserts that ZG (γ )0 ⊂ G . But this is also not possible: use the 1-dimensional normalizer argument to see that there exists an element γ ∈ ∩ G . However, we have already ruled out this possibility in the previous paragraph. This finishes the proof of the main Theorem 1.1 up to the proof of Propositions 4.4 and 4.6.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
111
6. Proof of the C∗ × C∗ -action argument. In this section we carry out the proof of the C∗ × C∗ -action argument in Proposition 4.4. Proof of Proposition 4.4. For ease of notation, let T ∼ = = C∗ ×C∗ act on X and C∗ ∼ H < T be a subgroup. By Proposition 2.7 there exists an H -fixed point y0 ∈ X. Since its G-isotropy is positive-dimensional, it follows directly from the characterization of Proposition 2.3 that y0 ∈ , that is, y0 ∈ E. Let E0 ⊂ E be any irreducible component containing y0 , let η : E˜ 0 → E0 be the normalization, and let ρ : Eˆ 0 → E˜ 0 be a T -equivariant desingularization. Let E1 be a component of E with E0 ∩ E1 = ∅. It follows that the surface Eˆ 0 contains an effective divisor that is numerically equivalent to zero, namely, (η ◦ ρ)∗ (E1 ). This implies that the algebraic dimension a(Eˆ 0 ) is at most 1. Observe that if T has a fixed point x0 ∈ Eˆ 0 , then linearization at x0 shows either that the T -action on Eˆ 0 is almost transitive or that the T -action on Eˆ 0 contains a positive-dimensional ineffectivity C∗ ∼ = I < T . Notice that the latter possibility is ruled out by Proposition 2.7. Assume that Eˆ 0 is T -almost homogeneous but contains no T -fixed points. In this case, for x ∈ (η ◦ ρ)−1 {y0 } the orbit T · x is closed and 1-dimensional, that is, an elliptic curve that is blown down by ρ to an isolated boundary point of the open T -orbit in E˜ 0 . It follows that E˜ 0 is a homogeneous cone over P1 (C) (see [HO]). This is again contrary to a(Eˆ 0 ) ≤ 1. Thus we may assume that all T -orbits are 1-dimensional. It follows that, for every x ∈ Eˆ 0 , the analytic set Fix(Tx ) = x ∈ Eˆ 0 | tx = x for all t ∈ the isotropy group Tx is 1-dimensional and a finite union of T -orbits. In particular every T -orbit is closed, that is, an elliptic curve. Since there are infinitely many such orbits, a(Eˆ 0 ) = 1. Let h : Eˆ 0 → C denote the algebraic reduction. Since its fibers are connected, h is T -equivariant. Furthermore, all curves are contained in h-fibers. Thus the T -orbits, which are now known to be 1-dimensional and closed, are components of h-fibers; that is, the h-fibers are exactly the T -orbits. In particular, no T -orbit can be blown down by ρ and Eˆ 0 = E˜ 0 . Now H ∼ = C∗ has ˜ a fixed point z ∈ E0 . But this implies that FixE˜ 0 (H ) contains the elliptic curve T · z, contrary to Proposition 2.7. Thus the final case has been eliminated and the proof is complete. 7. Considerations concerning the C∗ -action argument. The remainder of the paper is devoted to proving Proposition 4.6. As was indicated at the beginning of Section 2, this completes the proof of our main theorem, that is, that X is not almost homogeneous. Accepting the situation presented to us by Proposition 4.6, we operate here under
112
HUCKLEBERRY, KEBEKUS, AND PETERNELL
the assumptions that has the Betti type of S 1 and that there exists an effective C∗ action on X. We begin by deriving some topological consequences of the assumption on the Betti type of . 7.1. Topological constraints. At this place, it seems advisable to fix some notation that is used in the sequel. Notation 7.1. Let ν : E˜ → E be the normalization and π : Eˆ → E˜ be the minimal desingularization. Write δ := π ◦ν. Let N ⊂ E be the nonnormal locus equipped with the complex conductor ideal structure and N˜ := ν −1 (N) the analytic preimage. ˜ and E are Cohen-Macaulay space; see [Mor, p. 166, 3.34(2)] for Notice that N , N, a proof. It follows from Serre’s criterion that the nonnormal locus N ⊂ E must be of ˜ pure dimension 1. The same holds for N˜ ⊂ E. Proposition 7.2. The divisor E is irreducible and not normal. Its normalization E˜ has only rational singularities and the minimal desingularization Eˆ is rational. In particular, E˜ is Q-factorial. Proof. Since has Betti-type S 1 , it follows from Proposition 2.5 that b4 (E) = 1; that is, E is irreducible. Suppose that E is normal. Since b3 (E) = 0, we have ˆ = 0 (see the proof of Proposition 4.7); hence b1 (E) ˆ = 0, and Eˆ is Kähler. b3 (E) ∗ ∗ ˆ ≤ 0, Since KEˆ ⊂ π (KE ) and π (KE ) is linearly equivalent to zero, we have κ(E) ˆ ˆ and by b3 (E) = 0, E is either rational or birational to a K3-surface or an Enriques surface. But because Eˆ possesses a C∗ -action, the latter cases are excluded. So Eˆ (and hence E) are rational surfaces. As a consequence, note that R 1 π∗ (Q) = 0 by the Leray spectral sequence and ˆ Q = H 2 (E, Q) = 0. H 1 E, ˆ Q) = H 0 (E, R 2 π∗ (Q)) and we conclude that H 2 (E, ˆ Q) is generated Thus H 2 (E, by the π -exceptional curves. ˆ Then we find m ∈ N, λi ∈ Z, and π-exceptional Now take an ample divisor A on E. ˆ curves Ci ⊂ E such that
c1 ᏻEˆ (mA) = c1 ᏻEˆ λi Ci . Since Eˆ is rational, we have the linear equivalence mA = λi Ci , which is absurd. Consequence: E is not normal. Using the formula ωE˜ = ν ∗ (ωE ) − N˜ and the fact that N˜ is an effective Weil divisor supported on the preimage of the nonnormal locus (observe that E is a Gorenstein space!), the “old” arguments still apply and give the rationality of E. In order to show that E˜ has only rational singularities, we check R 1 π∗ ᏻEˆ = 0.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
113
Since H 1 (ᏻEˆ ) = 0, the Leray spectral sequence yields an embedding H 0 R 1 π∗ ᏻEˆ −→ H 2 ᏻE˜ .
Now E˜ is a Cohen-Macaulay space and therefore H 2 (ᏻE˜ ) ∼ = H 0 (ωE˜ ). Since ωE˜ ⊂ ∗ ∗ 2 ν (ωE ) and ωE˜ = ν (ωE ) = ᏻE˜ , we have H (ᏻE˜ ) = 0. Therefore R 1 π∗ (ᏻEˆ ) = 0. ˜ and Proposition 7.3. The following Betti numbers are equal: b0 (N) = b0 (N) ˜ b1 (N ) = b1 (N). Proof. We use the following Mayer-Vietoris sequence for reduced cohomology: ˜ Q) ⊕ H˜ q (N; Q) · · · −→ H˜ q (E; Q) −→ H˜ q (E; ˜ Q) −→ H˜ q+1 (E; Q) −→ · · · −→ H˜ q (N; (see [BK, Prop. 3.A.7, p. 98] for information about this sequence). ˜ Q) = h0 (N; Q), since H 1 (E; Q) = 0 by Proposition 2.5. Furthermore So h0 (N; ˜ ˜ − b1 (N ), since H 2 (E; Q) = 0. b1 (E) = b1 (N) Lemma 7.4. The space N is connected. Proof. Using the fact that E is a Cohen-Macaulay space and actually a Gorenstein space, by [Mor, p. 166, 3.34(2)] there exists an exact sequence −1 0 −→ ᏻE −→ ν∗ ᏻE˜ −→ ωE ⊗ ωN −→ 0. Since c1 (ν ∗ (ωE )) = 0 so that ν ∗ (ωE ) = ᏻE˜ , 0 −→ ωE −→ ωE ⊗ ν∗ ᏻE˜ −→ ωN −→ 0 =ν∗ ν ∗ (ωE )=ν∗ (ᏻE˜ ) is also exact. Since N is a Cohen-Macaulay space, Serre duality holds. Thus, the associated long cohomology sequence gives ˜ ᏻ ˜ −→ H 1 (N, ωN ) −→ H 2 (E, ωE ) −→ · · · · · · −→ H 1 E, E =0
∼ =H 0 (N,ᏻN )
∼ =H 0 (E,ᏻE )∼ =C
so that h0 (N, ᏻN ) ≤ 1, and the reduced subspace Nred is connected. 7.2. Orbits of the C∗ -action. For the sake of completeness we outline some known facts of C∗ -actions on rational surfaces: The orbits are always constructible. As a consequence we see that E necessarily contains attractive and repulsive fixed points, a fact that is crucial in the sequel. More information can be found in the works of Białynicki-Birula and of Sommese (see, e.g., [BBS]).
114
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Lemma 7.5. If H is a linear algebraic group and ι : C∗ → H is a holomorphic map that is a (set-theoretical) group morphism, then ι is algebraic and, in particular, the image ι(C∗ ) is a closed algebraic subgroup of H . Proof. Let J < H be the Zariski closure of ι(C∗ ) in H . It is immediate that J is connected and Abelian and, since it is affine, J ∼ = Cn × (C∗ )m . Now use the fact that there is no nonconstant holomorphic group morphism from C∗ to C and that any holomorphic group morphism from C∗ to C∗ is given by z → zk . Lemma 7.6. Let S be a (possibly singular) irreducible compact surface with a holomorphic action of C∗ . Suppose additionally that S is rational. Then: (1) All C∗ -orbits are constructible, and their closures are rational. In particular, for all x ∈ S, the limits limλ∈C∗ , λ→0 λ · x and limλ∈C∗ , λ→∞ λ · x exist. (2) If F ⊂ S is the set of the C∗ -fixed points, then there are two irreducible subspaces F0 , F∞ ⊂ F (not necessarily distinct) and a Zariski-open set U ⊂ S such that for all x ∈ U : lim
λ∈C∗ , λ→0
λ · x ∈ F0 ,
lim
λ∈C∗ , λ→∞
λ · x ∈ F∞ .
Proof. Suppose for the moment that S is smooth. Then Aut(S) is linear algebraic and acts algebraically; this is a consequence of the fact that, since b1 (S) = 0, S can be equivariantly embedded into some Pn —see [Bla] for a proof. By Lemma 7.5, any closed subgroup C∗ < Aut(S) is linear algebraic and acts algebraically. In particular, C∗ -orbits are constructible, and all C∗ -stable curves in S contain fixed points. This already proves (1). In order to prove assertion (2), embed C∗ → P1 in the usual way. Then there exists a rational morphism φ : P1 × S S. Since the set of fundamental points of φ −1 is of codimension ≥ 2, there exists an open set U ⊂ S such that φ|P1 ×U is regular. Consequence: for all x ∈ U the limits limλ∈C∗ , λ→0 λ · x and limλ∈C∗ , λ→∞ λ · x exist. Set F0 = φ(0 × U ) and F∞ := φ(∞ × U ). If S is singular, then let δ : Sˆ → S be an equivariant resolution and find Fˆ0 , ˆ F∞ , and Uˆ ⊂ Sˆ as above. It is sufficient to set F0 := δ(Fˆ0 ), F∞ := δ(Fˆ∞ ), and U := δ(U ) ∩ Sreg , where Sreg is the (open) set of regular points in S. 8. The case of a discrete fixed-point set. Our goal here is to prove the C∗ -action argument under the assumption that the set F of C∗ -fixed points is discrete. We have seen in Proposition 2.7 that F necessarily consists of two distinct points, which we denote by F0 and F∞ . Since its G-isotropy contains C∗ , it follows again from Proposition 2.3 that F0 , F∞ ∈ E. It is shown that the discreteness assumption, which ˜ (see Proposition 8.21). is made throughout this section, is contrary to b1 (N) = b1 (N)
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
115
Notation 8.1. In the sequel, U0 and U∞ are disjoint linearizing neighborhoods of the points F0 and F∞ in X. Since E ∩ U0 and E ∩ U∞ are closed in U0 and U∞ , respectively, after shrinking U0 and U∞ we can assume that E ∩ U0 and E ∩ U∞ are connected and F0 (resp., F∞ ) is contained in every component. 8.1. Symmetry lemmas. Here we prove several lemmas which show that the situations at F0 and at F∞ are very similar. First, we investigate the following. 8.1.1. Weights of the C∗ -actions on TF0 X and TF∞ X. The main result of this section is the following proposition. Proposition 8.2. The locally closed spaces E ∩ U0 and E ∩ U∞ are reducible. One of the components of each space is smooth, and the C∗ -action has a totally attractive (resp., repulsive) fixed point there. Since the proof is somewhat lengthy, and we use a number of partial results later, we subdivide the proof into a sequence of lemmas and corollaries. Lemma 8.3. After swapping F0 and F∞ and, if necessary, replacing the C∗ -action by (C∗ )−1 , the weights of the C∗ -action on the tangent space TF0 X have the following signs: (+ + −). Proof. By symmetry, we only have to exclude the following distribution of signs of the weights of the C∗ -action on TF0 X and TF∞ X: (+ + +), (+ + +): This contradicts Lemma 7.6. (+ + +), (− − −): Let x ∈ E be a generic point. Choose a sequence (xn )n∈N ⊂ such that limn→∞ xn = x. Then there exist numbers λ, λ ∈ C∗ such that λ · x ∈ U0 and λ · x ∈ U∞ and there exists an n ∈ N such that λ · xn ∈ U0 , and λ xn ∈ U∞ . Linearization shows that limλ→0 λxn = F0 and limλ→∞ λxn = F∞ so that C := C∗ · xn is a closed curve in X and C ∩E = F0 ∪F∞ . But then E ·C ≥ 2, a contradiction to H 2 (X, Z) = 0. Assume from now on that the C∗ -action on TF0 X has weights of type (+ + −). Let x, y, and z be coordinates for the associated weight spaces. By the linearization Theorem 2.6, we can view x, y, and z as giving local coordinates on U0 . After performing a C∗ -equivariant change of coordinates on TF0 X, we may assume that the unit ball in TF0 X is contained in the image of U0 . Corollary 8.4. After swapping F0 and F∞ and, if necessary, replacing the C∗ action by (C∗ )−1 , the weights of the C∗ -action on the tangent space TF0 X have signs (+ + −) and the locally closed subspace E ∩ U0 is reducible. One of the components of E ∩ U0 is smooth, and the C∗ -action has a totally attracting fixed point there. Proof. It follows directly from Lemma 7.6 that E ∩ U0 contains infinitely many C∗ -stable curves containing F0 . Since it is closed in U0 , we have that {z = 0} ∩ U0 ⊂ E. Since {z = 0} is smooth, in order to see that E ∩U0 is reducible, it suffices to show E ∩ U0 is not normal. Since E is a hypersurface in X, it is a Cohen-Macaulay space,
116
HUCKLEBERRY, KEBEKUS, AND PETERNELL
and it follows from Serre’s criterion that the nonnormal locus N ⊂ E must be of codimension 1. By Lemma 7.6, N contains C∗ -fixed points. If N contains F0 , then we can stop here. Otherwise, if N contains F∞ but not F0 , then the signs of the weights at F∞ are necessarily (− − +); we show this by ruling out all other possibilities: (+ + +): does not occur, or else we obtain a contradiction to Lemma 7.6; (+ + −): is similar; (− − −): then limλ→0 λx does not exist for generic x ∈ N . Since every 1-dimensional component of N contains a C∗ -fixed point, it is rational. Thus, the limit exists on the normalization of N, which in turn implies that the limit exists on N. In this case swap F0 and F∞ and start anew. For the remainder of this section, fix F0 and F∞ so that we are in the situation of the above corollary. Notation 8.5. Let E0,i denote the irreducible components of E ∩ U0 , and let E0,0 be the smooth component in the preceding corollary. Lemma 8.6. Choose numbers a, b, and c ∈ N+ and let the group C∗ act on C3 by λ : (x, y, z) → (λa x, λb y, λ−c z). Let S ⊂ C3 be an irreducible C∗ -stable divisor with S = {z = 0}, but S ∩ {z = 0} = ∅. Then {x = y = 0} ⊂ S. Proof. Choose a point s ∈ {z = 0} ∩ S. Let sn = (xn , yn , zn ) be a sequence in S with zn = 0 and lim sn = s. Choose λn ∈ (zn )1/c . Note that lim λn = 0. Then λn · sn ∈ E and lim λn sn = lim(λan xn , λbn yn , 1) = (0, 0, 1). Corollary 8.7. Every component E0,i (i = 0) contains {x = y = 0}. Proof. In view of the preceding Lemma 8.6, we must show that E0,i ∩{z = 0} = ∅. This, however, is clear since all components E0,i contain F0 . Lemma 8.8. If the signs of the weights of the C∗ -action on TF∞ X are all negative, then there exists a curve C − ⊂ E satisfying (1) C − ∩ E0,0 is a curve (i.e., dim C − ∩ E0,0 = 1), and (2) F∞ ⊂ C − . Proof. By Corollary 8.4, E ∩U0 is reducible, and by Corollary 8.7, {x = y = 0} ⊂ E ∩U0 . The z-axis is the weight space to the negative weight, so that limλ→∞ (0, 0, 1) = F0 . But the limit limλ→0 (0, 0, 1) exists; this is because Eˆ is rational, and the limit exists there. Due to the negative weights, it is impossible that limλ→0 (0, 0, 1) = F∞ . Thus limλ→0 (0, 0, 1) = F0 and, for all λ sufficiently small, λ · (0, 0, 1) ∈ E0,0 . Lemma 8.9. The weights of the C∗ -action on TF∞ X have signs (− − +). Proof. It is clear that at least two of the signs must be negative—this is because for generic x ∈ E, limλ→∞ λ·x = F0 . Now suppose the weights were (a, b, c) which
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
117
were all negative. The weights of the C∗ -action on TF0 E0,0 are denoted by d and e. We consider the weighted projective spaces P(−a,−b,−c) (TF∞ X) and P(d,e) (TF0 E0,0 ). These are parameter spaces for C∗ -stable curves in X and E0,0 passing through F∞ or F0 , respectively. The analytic subspace E ∩ U∞ gives a closed subspace in the weighted projective space E ⊂ P(−a,−b,−c) (TF∞ X) parameterizing curves in E ∩ U∞ . We now construct a map from E to P(d,e) (TF0 E0,0 ). First we fix some notation. Let A0 : E0,0 → TF0 E0,0 and A∞ : U∞ → TF∞ X be the linearizing maps, and let π0 : TF0 E0,0 \ {0} → P(d,e) (TF0 E0,0 ) be the canonical projection. Given an arbitrary point x ∈ E, there exists a neighborhood U (x) ⊂ E and a section σ : U (x) → A∞ (U∞ ) ⊂ TF∞ X. After shrinking U (x), if necessary, there is a λ ∈ C∗ such that (λ ◦ σ )(U (x)) ⊂ E0,0 \ {F0 }: simply choose a λ such that (λ ◦ σ )(v) ∈ E∞ and set U (x) := λ−1 ((λ ◦ σ )(U (x)) ∩ E0,0 ); this is the shrinkage that might be unavoidable. This way we obtain a map ιx := (π0 ◦ λ ◦ σ ) : U (x) → P(d,e) (TF0 E). In order to obtain a global map ι : E → P(d,e) (TF0 E), it suffices to show that ιx does not depend on the choice of σ and λ. Indeed, choosing different σ and λ for all y ∈ U (x), there is a unique number λy ∈ C∗ such that (λ◦σ )(y) = λy (λ ◦σ )(y). This already shows that (π0 ◦ λ ◦ σ )(y) = (π0 ◦ λ ◦ σ )(y), and the existence of the global map ι is shown. It is obvious that ι is injective. This is how we make use of ι : by Lemma 8.8, π(C − ∩E0,0 )\{F0 } is not contained in the image of ι, so that the image must be contained in P(d,e) (TF0 E)\(π(C − ∩E0,0 \ {F0 })) ∼ = C. By the maximum principle, the image must be a point, a contradiction to ι being injective. Corollary 8.10. E ∩ U∞ is reducible. There exists a smooth component E∞,0 with totally repulsive fixed point. Proof. The existence of E∞,0 follows exactly as in Corollary 8.4. Similarly, it follows from the argumentation of corollary 8.4 that E ∩ U∞ is reducible if we show that the nonnormal locus N ⊂ E intersects E∞,0 . Suppose this is not the case. If x ∈ E∞,0 , then limλ→0 λx = F0 and the argumentation used in the proof of Lemma 8.9 yields a contradiction. Notation 8.11. In analogy to the notation introduced above, let E∞,i denote the irreducible components of E ∩ U∞ , and let E∞,0 be the smooth component whose existence is asserted by Corollary 8.10. 8.1.2. Loops. Now turn to the nonnormal locus of N ⊂ E . The following is an important notion. Notation 8.12. If S is a compact complex space with a holomorphic action of C∗ , call a C∗ -stable curve C ⊂ S a “loop” if C is the closure of an orbit and contains exactly one fixed point.
118
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Lemma 8.13. There is at most one loop containing F0 , and at most one containing F∞ . The number of loops containing F0 is equal to the number of loops containing F∞ . Proof. There is only one curve in E ∩U0 containing a point x such that limλ→∞ λ· x = F0 (namely, the z-axis). The situation at F∞ is similar. Argue as in the proof of Lemma 8.9, using a map P(a,b) TF0 E0,0 −→ P(c,d) TF∞ E∞,0 to exclude the possibility that there is no loop at F0 and one at F∞ or vice versa. Lemma 8.14. The number of irreducible components of N ∩E0,0 equals the number of irreducible components of N ∩ E∞,0 .
Proof. Decompose N = NL ∪ i Ni , where the NL are loops and the Ni are other components. By Lemma 8.13, the claim is true if one considers loops only. If Ni is one of the other components and x ∈ Ni a generic point, then limλ→0 λ · x = F0 and limλ→∞ λ·x = F∞ so that Ni ∩E0,0 and Ni ∩E∞,0 are both irreducible components of N ∩ E0,0 and N ∩ E∞,0 , respectively. 8.2. Preparations: C∗ -action on normal surfaces. Let S be a smooth connected algebraic surface equipped with an algebraic C∗ -action with an attractive (resp., repulsive) fixed point F∞ (resp., F0 ). Let U ⊂ S be as in Lemma 7.6. Notation 8.15. A curve C = C+ ∪ C1 ∪ · · · ∪ Ck ∪ C− , k ≥ 0, as in the diagram F0 •
> C+
•p1
> C1
•p2
> C2
···
> C−
•F∞ ,
which is invariant under the C∗ -action, is called an “external chain.” A C∗ -fixed point different from F0 and F∞ is an “external fixed point.” Lemma 8.16. The complement of U in S is a union of external chains without common components. Proof. At every external fixed point the weights of the linearization must be of type (+−). Now allow S to have normal singularities at external fixed points. Applying the above argument to the desingularization Sˆ and blowing back down, we have the same result. Lemma 8.17. If S is a normal compact rational surface equipped with a C∗ -action having a smooth point F0 as a source and a smooth point F∞ as a sink, then the complement S \ U is a union of external chains. Corollary 8.18. In the setting of the preceding lemma there are no loops in S.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
119
8.3. Computation of b1 of curves Lemma 8.19. Let C be the union of c rational curves having s singular points. Let ν : C˜ → C be the normalization of C, and let s˜ be the number of points in ν −1 (CSing ). For convenience, let δ = s˜ − s. Then b1 (C) = 1 − c + δ. Proof. Apply the Mayer-Vietoris sequence (see the proof of Proposition 7.3), where E := C, N := CSing , and so forth: 0 = h˜ 0 (E) − h˜ 0 E˜ + h˜ 0 (N) + h˜ 0 N˜ − h1 (E) + h1 E˜ + h1 (N) . 0
c−1
s−1
s˜ −1
b1
0
8.4. The proof in the case where F is discrete Notation 8.20. An “outer orbit” is an orbit that flows in the opposite direction from the generic orbit, that is, with source F∞ and sink F0 . An irreducible C∗ -invariant curve C ⊂ E containing F0 as a source and F∞ as a sink is called a “crossing curve” if E is not locally irreducible at the generic point of C. Proposition 8.21. The case where F is discrete does not occur. Proof. Assume to the contrary and use the notation introduced above. Let n be the number of components in N ∩ E0,0 . Then, by Lemma 8.19, n − 1 if N consists of crossing curves only, b1 (N ) = n if N contains an outer orbit, n + 1 if N contains loops; on the other hand, since the number of external chains in E˜ plus the number of curves in U˜ ∩ N˜ is also n (note that there are n components of N˜ containing F˜0 ), we have b1 N˜ = n − 1. It remains to show that the case where N consists only of crossing curves does not occur. In that case, since all components E0,i , i = 0, contain {x = y = 0} and N does not contain an outer orbit, there is only one such component E0,1 . Furthermore, E0,0 ∩ E0,1 consists of closures of C∗ -orbits that flow from F0 to F∞ . ˜ If p ∈ N˜ is an Observe that in this situation there cannot be external chains in E: external fixed point, then it is the intersection point of a curve flowing into p with a ˜ flow out of curve flowing out of p. But if ν(p) = F0 , then all curves in N = ν(N) ν(p), and we would have a contradiction. The analogue holds if ν(p) = F∞ .
120
HUCKLEBERRY, KEBEKUS, AND PETERNELL
9. Proof of the C∗ -action argument if F is a curve. If F is a curve, then F ∼ = P1 by Proposition 2.7. It is very easy to calculate b1 (N). However, we first have to show that F ⊂ N. Lemma 9.1. Suppose dim F = 1. If x ∈ F is an arbitrary point and U is a linearizing neighborhood of C∗ about x, then the weights of the C∗ -action on Tx X have signs (0+−), and if x, y, and z are coordinates associated to the weight spaces, then {yz = 0} ⊂ E ∩ U . Recall from the theorem on linearization (Theorem 2.6) that x, y, and z can be viewed to give coordinates on U . Proof. We have to exclude the possibility that the signs of the nonzero weights are equal. Suppose they were both positive. Then for a point y ∈ (E \ F ) ∩ U , the limit limλ→∞ λy would not exist in F. This contradicts the fact that F is the full C∗ -fixed-point set. Now it is a direct consequence of Lemma 7.6 that {zy = 0} ⊂ E ∩ U . Note that F 0 = F∞ = F . Lemma 9.2. If F ∼ = P1 , then b1 (N) = (# of irreducible components of N) − 1.
Proof. Decompose N = F ∪ i Ni . We show by induction that b1(F ∪ i=1,...,k Ni ) = k. Start: k = 0. It is clear that b1 (F ) = b1 (P1 ) = 0. Step: Since for all x ∈ Ni , the limits limλ→0 λx and limλ→∞ λx exist and are C∗ -fixed, that is, contained in F , there are only two possibilities: (1) limλ→0 λx = limλ→∞ λx for all x ∈ Nk , that is, Nk is a loop (see the notation 8.1.2). Then b1 (Nk ) = 1 and Nk ∩ F is a single point. (2) limλ→0 λx = limλ→∞ λx. Then the normalization N˜ k → Nk is injective and thus b1 (Ni ) = 0. Furthermore Nk ∩ F are two points. sequence associated to the decomposition F ∪
In any case the Mayer-Vietoris
N = (F ∪ N ) i=1,...,k i i=1,...,k−1 i ∪ Nk shows directly that b1 F ∪
i=1,...,k
Ni = b 1 F ∪
Ni + 1 = k.
i=1,...,k−1
˜ and finally Again we use a description of the rational surface E˜ to calculate b1 (N) to derive a contradiction. Lemma 9.3. There exists a surjective morphism with connected fibers φ : E˜ → P1 such that the induced C∗ -action on P1 is trivial. ˜ Since the Proof. Let C and C be two generic irreducible C∗ -stable curves in E. ∗ ˜ C -fixed-point set in E does not contain a totally attractive fixed point (this is because ˜ there is none in E), C and C are disjoint and do not intersect the singular locus of E.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
121
Since E˜ is rational, C and C are linearly equivalent as divisors and yield the desired map. Lemma 9.4. The situation that F ∼ = P1 does not occur. This finishes the proof of the C∗ -action argument in Proposition 4.6. Proof. We show that b1 (N˜ ) = (# of irreducible components of N) − 2, contradicting Lemma 9.2. In accordance with the notation and the results of Lemma 7.6, let F0 and F∞ ⊂ E˜ denote the C∗ -fixed-point curves. These are sections for φ : E˜ → P1 , ˜ and they are contained in the preimage ν −1 (F ). In particular, F0 ∪ F∞ ⊂ N.
−1 ˜ Again decompose N = F ∪ i Ni . Set Ni := ν (Ni ). First, we show that N˜ i ∩ F0 and N˜ i ∩ F∞ are both single points. Realize from the description of Lemma 9.1 that ν|F0 : F0 → N is injective, and note that if y ∈ Ni is a generic point, then
N˜ i ∩ F0 ⊂ ν −1 lim λy ∩ F0 . λ→0
The expression on the right-hand side denotes a single point. A similar argumentation holds for F∞ . Second, we claim that b1 (N˜ i ) = 0. Recall that N˜ i are C∗ -stable and do not contain F0 or F∞ as an irreducible component. Thus, the irreducible components of N˜ i are irreducible components of φ-fibers. This is sufficient information to apply the Mayer-Vietoris sequence. For brevity, let k := (# of irreducible components of N). The beginning of the cohomological Mayer˜ Vietoris sequence associated to the decomposition N˜ = (F0 ∪ F∞ ) ∪ i Ni yields: h0 (F0 ∪ F∞ ) + h0 N˜ i h0 N˜ − i =2 =1 ≥h0 (
In other words,
i Ni )=k−1
0 +h (F0 ∪ F∞ ) ∩ N˜ i −h1 N˜ = 0. i =2(k−1)
˜ ≤ k − 2. h1 (N)
10. Some further remarks. We end with some general remarks. Again, let X denote a hypothetical complex structure on S 6 , this time not necessarily almost homogeneous. Proposition 10.1. There is no nonzero 3-form on X; that is, H 0 (KX ) = 0. Proof. Let σ ∈ H 0 (3X ). Then dσ = 0. Since H 3 (X; Q) = 0, the de Rham theorem yields a 2-form η with σ = dη. Thus σ ∧σ = dη ∧ σ = d(η ∧ σ ) − η ∧ d σ = 0. X
Hence σ = 0.
X
X
X
122
HUCKLEBERRY, KEBEKUS, AND PETERNELL
Corollary 10.2. We have the following: (1) H 3 (X, ᏻX ) = 0; (2) H 0 (KX ⊗ L) = 0 for generic L ∈ Pic(X); (3) h1 (X, ᏻX ) = h2 (X, ᏻX ) + 1; (4) Pic(X) = Pic0 (X) # H 1 (X, ᏻX ) is a positive-dimensional complex vector space. Proof. Items (1) and (2) are clear, (3) follows from the Riemann-Roch theorem, and (4) from H 1 (X, Z) = H 3 (X, Z) = 0. Observe that it is not at all clear whether κ(X) = −∞. Proving this would seem to be as complicated as to show that there are no divisors on X at all. (Notice that KX cannot be torsion by the above results!) See Proposition 10.4. We now prove a result that is the analogue of our main theorem for the cotangent bundle. The following version of this statement was pointed out to us by M. Toma. It improved our original bound by 1. Proposition 10.3. We have h0 (1X ) ≤ 1. Proof. By Proposition 10.1, H 0 1X ⊗ H 0 2X
∧
/ H 0 3 = 0. X
Now assume that ω1 and ω2 are two linearly independent 1-forms. Then, since a(X) = 0, ω1 (x) and ω2 (x) are linearly independent for a generic point x ∈ X. Consequently, if µ ∈ H 0 (2X ), then ω1 (x) ∧ ω2 (x) = c(x)µ(x). Thus any two 2-forms must be linearly dependent at all points of X. Again using that a(X) = 0, h0 (2X ) = 1. 1 (X) = 0 and therefore d : H 0 (1 ) → H 0 (2 ) is injective. On the other hand, HdR X X Thus there do not exist such 1-forms. Note that by semicontinuity of dimension it follows as above that h0 (1X ⊗ L) ≤ 2 for L a generic line bundle. Finally, we observe that statement (B) of the introduction is valid under the assumption that there are no curves in X. Proposition 10.4. (1) Assume that X has no compact curves. Then H 0 (1X ⊗ L) = 0 for all line bundles L, and h0 (T X ⊗ L) ≤ 1. (2) Assume that X has no divisors. Then h0 (1X ⊗ L) ≤ 2 for all line bundles L. Proof. (1) Let ω ∈ H 0 (1X ⊗ L). Suppose ω = 0. Then there is a divisor D (possibly empty) such that the section ω ∈ H 0 (1X ⊗ L ⊗ ᏻX (−D)) has no zeroes in codimension 1; hence it has only finitely many zeroes by our assumption. Thus c3 1X ⊗ L ⊗ ᏻX (−D) = c3 1X ≥ 0. But c3 (X) = −c3 (1X ) = 2, which is a contradiction. The inequality h0 (T X ⊗L) ≤ 1 is an immediate consequence from H 0 (1X ⊗ L) = 0.
GROUP ACTIONS ON S 6 AND COMPLEX STRUCTURES ON P3
123
(2) Assume h0 (1X ⊗L) = 3, and take a basis (ωi ). Then ω1 ∧ω2 ∧ω3 is a nonzero section of KX ⊗(3L). Since there are no divisors on X, the ωi are linearly independent everywhere; hence 1X ⊗ L # ᏻ⊕3 X . This again contradicts c3 (X) = 2. Acknowledgement. We thank the referee for a number of suggestions that led to improvement of the exposition as well as to simplifications of several mathematical points. References [BK]
[BBS] [Bla] [BS] [Bre] [CDP]
[ES] [FF] [Huc]
[HO] [Jac] [KP] [Kra]
[LY]
[Mor] [MU]
[Pot]
G. Barthel and L. Kaup, “Sur la topologie des surfaces complexes compactes” in Topologie des surfaces complexes compactes singulières, Sem. Math. Sup. 80, Presses Univ. Montréal, Montréal, 1982, 61–297. A. Białynicki-Birula and A. Sommese, Quotients by C∗ ×C∗ actions, Trans. Amer. Math. Soc. 289 (1985), 519–543. A. Blanchard, Sur les variétés analytiques complexes, Ann. Sci. École. Norm. Sup. (3) 73 (1956), 157–202. A. Borel and J.-P. Serre, Groupes de Lie et puissances réduites de Steenrod, Amer. J. Math. 75 (1953), 409–448. G. Bredon, Introduction to Compact Transformation Groups, Pure Appl. Math. 46, Academic Press, New York, 1972. F. Campana, J.-P. Demailly, and T. Peternell, The algebraic dimension of compact complex threefolds with vanishing second Betti number, Compositio Math. 112 (1998), 77–91. J. Erdman-Snow, “On the classification of solv-manifolds in dimension 2 and 3” in Journées Complexes 85 (Nancy, 1985), Inst. Élie Cartan 10, Univ. Nancy, Nancy, 1986, 57–103. G. Fischer and O. Forster, Ein Endlichkeitssatz für Hyperflächen auf kompakten komplexen Räumen, J. Reine Angew. Math. 306 (1979), 88–93. A. Huckleberry, “Actions of groups of holomorphic transformations” in Several Complex Variables, VI: Complex Manifolds, Encyclopaedia Math. Sci. 69, Springer-Verlag, Berlin, 1990, 143–196. A. Huckleberry and E. Oeljeklaus, Classification Theorems for Almost Homogeneous Spaces, Inst. Élie Cartan 9, Univ. Nancy, Nancy, 1984. N. Jacobson, Lie Algebras, Interscience Tracts in Pure Appl. Math. 10, Interscience Publishers, New York, 1962. H. Kraft and V. L. Popov, Semisimple group actions on the three-dimensional affine space are linear, Comment. Math. Helv. 60 (1985), 466–479. V. A. Krasnov, Compact complex manifolds without meromorphic functions (in Russian), Mat. Zametki 17 (1975), 119–122; English translation in Math. Notes 17 (1975), 69–71. J. Li and S.-T. Yau, “Hermitian-Yang-Mills connection on non-Kähler manifolds ” in Mathematical Aspects of String Theory (San Diego, Calif., 1986), Adv. Ser. Math. Phys. 1, World Scientific, Singapore, 1987, 560–573. S. Mori, Threefolds whose canonical bundles are not numerically effective, Ann. of Math. (2) 116 (1982), 133–176. S. Mukai and H. Umemura, “Minimal rational threefolds” in Algebraic Geometry (Tokyo and Kyoto, 1982), Lecture Notes in Math. 1016, Springer-Verlag, Berlin, 1983, 490– 518. J. Potters, On almost homogeneous compact complex analytic surfaces, Invent. Math. 8 (1969), 224–266.
124 [Ste]
HUCKLEBERRY, KEBEKUS, AND PETERNELL N. Steenrod, The Topology of Fibre Bundles, Princeton Math. Ser. 14, Princeton Univ. Press, Princeton, 1951.
Huckleberry: Fakultät und Institut für Mathematik, Ruhr-Universität Bochum, D44780 Bochum, Germany;
[email protected] Kebekus: Lehrstuhl Mathematik VIII, Universität Bayreuth, D-95440 Bayreuth, Germany;
[email protected] Peternell: Lehrstuhl Mathematik I, Universität Bayreuth, D-95440 Bayreuth, Germany,
[email protected]
Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
SOLUTIONS, SPECTRUM, AND DYNAMICS FOR SCHRÖDINGER OPERATORS ON INFINITE DOMAINS A. KISELEV and Y. LAST
1. Introduction and main results. In this paper we investigate the relations between the rate of decay of solutions of Schrödinger equations, continuity properties of spectral measures of the corresponding operators, and dynamical properties of the corresponding quantum systems. The first main result of this paper shows that, in great generality, certain upper bounds on the rate of growth of L2 norms of generalized eigenfunctions over expanding balls imply certain minimal singularity of the spectral measures. Consider an operator HV defined by the differential expression HV = − + V (x) on some connected infinite domain with a smooth boundary and with Dirichlet boundary conditions on ∂. The case of = Rd is not excluded; no boundary conditions are needed in this case. To every vector φ ∈ L2 () we associate a spectral measure µφ in the usual way (namely, µφ is the unique Borel measure on R obeying f (E) dµφ (E) = (f (HV )φ, φ) for any Borel function f ). For any measure µ, we define the upper α-derivative D α µ(E) in the standard way: D α µ(E) = lim sup δ→0
µ(E − δ, E + δ) . δα
We denote by BR the ball of radius R centered at the origin, and we use the notation f BR for the L2 norm of the function f restricted to BR . We denote by Wml the usual Sobolev spaces of functions f such that D l f exists in the distributional sense l and (|u|m +|D l u|m )dx < ∞. We say that f (x) ∈ Wm,loc () if f (x) ∈ Wml (∩BR ) for every R < ∞. One of the main theorems that we prove here is the following. Theorem 1.1. Assume that the potential V (x) belongs to L∞ loc and is bounded from below, and that is a domain with piecewise smooth boundary. Suppose that there exists a distributional solution u(x, E) of the generalized eigenfunction equation HV − E u(x, E) = 0 (1) Received 25 August 1998. Revision received 21 May 1999. 1991 Mathematics Subject Classification. Primary 35J10, 81Q10; Secondary 35P05. Kiselev partially supported by National Science Foundation grant number DMS-9801530. Last partially supported by National Science Foundation grant number DMS-9801474. 125
126
KISELEV AND LAST
satisfying the boundary conditions and such that for some α, 0 ≤ α ≤ 1, we have lim inf R −α (2) |u(x, E)|2 dx < ∞. R→∞
BR ∩
Fix some compactly supported φ(x) ∈ L2 () such that φ(x)u(x, E) dx = 0.
Then we have D α µφ (E) > 0. Remarks. (1) Notice that under our assumptions on the potential, we have u ∈ 2 by standard results on Sobolev estimates for elliptic operators (see, e.g., [12]), W2,loc and the boundary values for u are well defined. (2) We choose not to formulate Theorem 1.1 for more general classes of potentials, domains, and boundary conditions in order to be able to give a transparent proof. Certainly, we can extend this theorem to wider classes of potentials and boundary conditions. The nature of the limitations is clear from the proof and the Stark operators example in Appendix B. For instance, when = Rd , we only ask that the negative part of the potential, V− , belong to the Kato class K d (see, e.g., [3], [39] for the definition of Kato classes). (3) If we replace < ∞ in equation (2) by = 0, we obtain that D α µφ (E) = ∞. Theorem 1.1 provides information on the pointwise behavior of spectral measures from rather simple and natural assumptions about the behavior of generalized eigenfunctions. From this theorem follow new criteria for the existence of absolutely continuous spectrum or singular continuous spectrum of given dimensional characteristics (see Section 2, and, in particular, Theorem 2.5 for more details). This contrasts the well-known result (see [5], [38], [39]) that existence of a polynomially bounded (but not L2 ) solution of (1) implies that the energy E belongs to the essential spectrum of HV but gives no further information on the structure of the essential spectrum. To the best of our knowledge, Theorem 1.1 is the first rigorous result providing a relation between the behavior of solutions and pointwise properties of the spectral measures for multidimensional Schrödinger operators. A result analogous to Theorem 1.1 also holds for discrete Schrödinger operators defined on some ⊂ Zd by (hv u)(n) = u(m) + v(n)u(n). |m−n|=1, m∈
We discuss this extension in Section 3. In Appendix A, we also indicate that results similar to Theorem 1.1 hold for more general elliptic and higher-order operators.
127
SOLUTIONS, SPECTRUM, AND DYNAMICS
The motivation for seeking relations between the pointwise in energy behavior of solutions and properties of spectral measures comes from the fact that in many problems the solutions are among the objects we can hope to investigate. When we are interested in the fine structure of the spectrum of Schrödinger operators for which the methods of scattering theory are not applicable, there are very limited tools in higher dimensions that may be effectively used for spectral analysis. On the other hand, for one-dimensional Schrödinger operators, the subordinacy theory created by Gilbert and Pearson [15], [14] and further extended by Jitomirskaya and Last [19], [20], [21] provides a powerful method for spectral analysis. The main results of these papers give a necessary and sufficient link between the behavior of solutions and the singularity of the spectral measure. Subordinacy theory played an important role in many recent results in one-dimensional spectral theory (see [7], [9], [18], [19], [20], [21], [24], [26], [30], [35]). In this paper, we derive only a sufficient-type relation between the solutions and the spectrum, but in much greater generality. However, in contrast to subordinacy theory, which requires comparison of different solutions, we need information about only one solution—the one obeying the appropriate boundary conditions. We remark that for one-dimensional Schrödinger operators, the result of Theorem 1.1 can be derived from subordinacy theory [20], [21]. Our second major result in this paper establishes a fundamental relation between spectral properties, generalized eigenfunctions, and quantum dynamics, and in particular, provides new bounds for the transport properties of quantum systems. We study the behavior of the time-averaged moments of the position operator X under the Schrödinger evolution. Pick some initial state ψ and consider
m
|X|
T
1 = T
T 0
m |X| exp − iH t ψ, exp − iH t ψ dt. V
V
Recall that a measure µ is called α-continuous if it gives zero weight to any set of zero α-dimensional Hausdorff measure (we recall the definition of these measures in Section 2). Let us denote by Pαc the spectral projector on the α-continuous spectral subspace, the set of all vectors ξ such that µξ is α-continuous (see [29]). In particular, if µψ has an α-continuous component (i.e., Pαc ψ = 0), then the following lower bound holds [8], [16], [17], [29]: m |X| T ≥ Cm T mα/d (here d is the space dimension, and Cm is a constant depending on µψ and m). Recall that for a wide class of Schrödinger operators, one has a generalized eigenfunction expansion theorem (see, e.g., [5], [30], [39]). In particular, for every ψ, there is a unique unitary map Uψ from the cyclic subspace Ᏼψ , generated by the vector ψ and the operator HV , to L2 (R, dµψ (E)). This map sends ψ to a function equal to 1 everywhere and realizes a unitary equivalence Uψ HV |Ᏼψ Uψ−1 = E, where E stands
128
KISELEV AND LAST
for the operator of multiplication by E. The operator Uψ is an integral operator with kernel u(x, E), where, for each fixed E, the u(x, E)’s solve (1) and are called generalized eigenfunctions. We say that the u(x, E)’s correspond to ψ if they constitute the kernel of the unitary map Uψ described above. Note that they are only defined a.e. with respect to µψ . We prove the following theorem, which holds in both the discrete and continuous settings. Theorem 1.2. Let ψ be a vector for which there exists a Borel set S ⊂ R of positive µψ measure, such that the restriction of µψ to S is α-continuous and, in addition, the generalized eigenfunctions u(x, E) for all E ∈ S satisfy (3)
lim sup R −γ u(x, E)2BR < ∞ R→∞
for some γ such that 0 < γ < d. Then for any m > 0, there exists a constant Cm such that m |X| T ≥ Cm T mα/γ (4) for all T > 0. Remarks. (1) Theorem 1.2 is somewhat related to (although it does not coincide with) some recent heuristic results in [23]. (2) It may be seen from Theorem 1.1 that we cannot have γ < α, since it would follow that the upper γ -derivative of the spectral measure is positive on too large a set (see Corollary 2.6). The physical reason is that when V is bounded from below, the velocity is bounded, and the propagation rate is at most ballistic. However, the range of applicability of Theorem 1.2 is wider than that of Theorem 1.1. In particular, it is applicable to operators with strongly negative potentials, such as Stark operators, which exhibit faster-than-ballistic transport. See Appendix B. The somewhat striking aspect of Theorem 1.2 is that for a fixed nonzero spectral dimension, faster decay of u(x, E) leads to faster transport. Theorem 1.2 shows that the behavior of the generalized eigenfunctions plays an important role in determining dynamical properties of quantum systems. We apply Theorem 1.2 to investigate the dynamics in the random decaying potentials model studied in [26]. When weakly coupled, these systems have (almost surely) some singular continuous spectrum with local dimensions that depend on the energy, but we show that the dynamical spreading of wavepackets, for any energy region where the spectrum is continuous, is almost ballistic with probability 1. More precisely, we show that for almost every realization, we have, for every - > 0, a bound of the form m |X| T ≥ Cm,- T m(1−-) . The paper is organized as follows. In Section 2, we prove Theorem 1.1 and its corollaries, rendering new spectral criteria. In Section 3, we sketch the argument for
SOLUTIONS, SPECTRUM, AND DYNAMICS
129
similar results in the discrete setting. In Section 4, we consider some simple examples, in particular, showing that the result of Theorem 1.1 provides only a sufficient but not necessary criterion for positivity of the derivative of the spectral measure. It is, however, an optimal result in the sense that one cannot, in general, say more by looking only at the rate of growth of the L2 norm (see Section 5). It remains an interesting open question to find additional properties of solutions that determine the spectrum (or other important characteristics of the operator, such as transport properties) completely. In Section 5, we study the relationship between solutions, spectral dimension, and quantum dynamics, in particular, proving Theorem 1.2. In the appendices, we indicate further possible generalizations for elliptic and higher-order operators and consider dynamics for strongly perturbed one-dimensional Stark operators. The example of Stark operators provides another illustration of the relationship between the behavior of solutions and transport properties. Acknowledgements. We thank Y. Avron, I. Guarneri, R. Ketzmerick, B. Simon and S. Tcheremchantsev for stimulating discussions. We are grateful to the referees for useful sugestions and corrections. 2. Solutions and spectrum: Continuous case. We begin the proof of Theorem 1.1 with the following simple observation. Lemma 2.1. Let A be a selfadjoint operator acting on a Hilbert space Ᏼ and fix a vector φ ∈ Ᏼ. Let z ∈ C \ R. Then 2 = Im (A − z)−1 φ, φ . Im z(A − z)−1 φᏴ Proof. Consider the spectral representation associated with a vector φ and perform a straightforward computation:
dµφ dµφ Im = Im z . 2 R t −z R |t − z| The first idea in the proof of Theorem 1.1 is to estimate from below Im((HV − E − i-)−1 φ, φ) as - → 0. Such an estimate is equivalent to an estimate on the upper α-derivative of the spectral measure by the following lemma. β
Lemma 2.2. Let Qµ (E) denote Qβµ (E) = lim sup - β Im -→0
dµ(t) . t − E − i-
Then 1−α D α µ(E) ≤ C1 Qµ (E) ≤ C2 D α µ(E),
where C1 , C2 are positive constants depending only on α.
130
KISELEV AND LAST
Proof. The proof is a direct computation. For details, we refer to [11, Lemmas 3.2 and 3.3]. To derive an estimate on the Borel transform, we use Lemma 2.1, namely, estimates from below on the norm of the function −1 θ (x, E + i-) = HV − E − i- φ(x) over balls of radius of order 1/- as - goes to zero over some properly chosen sequence. The last technical lemmas that we need for the proof concern estimation of the W21 norms of u(x, E) and θ (x, z) in terms of their L2 norms. Lemma 2.3. Let ⊂ Rd be a domain with piecewise smooth boundary. Suppose that the potential V belongs to L∞ loc and is bounded from below, and let HV denote an operator with Dirichlet boundary conditions on ∂. Suppose that the function g(x, z) satisfies Dirichlet boundary conditions and HV − z g(x, z) = φ(x), where φ ∈ L2 () is compactly supported and real valued, and z is in general complex. Then (5) gW 1 (BR ∩) ≤ C(z, V− ) gL2 (BR+1 ∩) + φL2 () . 2
The constant in (5) depends only on the lower bound on V and on z, and may be chosen uniformly for z in any compact set. Proof. The proof is standard, and we provide it for the sake of completeness. See, for example, [3], [39] for detailed exposition of similar results and further references. Throughout the proof, we assume that the function g is sufficiently smooth to justify integration by parts (local W22 is sufficient). Clearly this is the case under our assumptions on V (see, e.g., [12]). To prove the bound (5) with the constant independent of R, let g(x, z) = g1 (x, z) + ig2 (x, z), where g1 , g2 are real valued. For any ψ ∈ C ∞ () such that 1 ≥ ψ(x) ≥ 0, ψ(x) = 1 when x ∈ BR ∩ , ψ(x) = 0 when x ∈ / BR+1 ∩ , we have (∇g1 )2 dx ≤ ψ(∇g1 )2 dx BR ∩ B ∩ R+1 ∂g1 g1 dσ − ψ (∇ψ)(∇g1 )g1 dx = (6) ∂n ∂(∩BR+1 ) BR+1 ∩ ψg1 g1 dx, − BR+1 ∩
131
SOLUTIONS, SPECTRUM, AND DYNAMICS
where dσ is the surface measure on ∂( ∩ BR+1 ) induced from Rd . The first term vanishes because g1 vanishes on ∂ and ψ vanishes on (∂BR ) ∩ . Furthermore, by Green’s formula, ∂ψ 2 (g1 ) dσ − (7) 2 (∇ψ)(∇g1 )g1 dx = ψ(g1 )2 dx. BR+1 ∩ ∂(BR+1 ∩) ∂n BR+1 ∩ The boundary term in this equality is also equal to zero. Substituting (7) into (6), we find 1 2 (∇g1 ) dx ≤ ψ(g1 )2 dx 2 BR ∩ B ∩ R+1 ψg1 (Re z − V )g1 + φ − (Im z)g2 dx. + BR+1 ∩
Therefore,
g1 2W 1 (B ) ≤ Cψ φ2L2 + 2(1 + |z|)+V− L∞ g1 2L2 (B 2
R
R+1
+| Im z|g2 2L2 (B )
R+1 )
.
A similar estimate holds for g2 . Combining these two estimates, we obtain the result of the lemma. Remarks. (1) We have not tried to determine the most general classes of potentials and boundary conditions for which Lemma 2.3 holds. With slightly more technical effort, we can treat some other boundary conditions, such as Neumann, for instance. (2) For the case of the whole space, the lemma is true under the assumption that V− ∈ K d , the Kato class, which allows singularities in the negative part of the potential (see [39] for the definition and properties of potentials from these classes). This result follows from the technique developed in [3], [39], which uses Brownian motion to derive subsolution estimates implying bounds like in Lemma 2.3. Although [3], [39] consider only real z (and homogeneous equation), it is not hard to see that their arguments extend to give results like (5). We now introduce an important object in our consideration. Suppose S is a do2 (S). We denote by main with piecewise smooth boundary and f , g belong to W2,loc W∂S [f, g] the following expression:
∂f ∂g (8) f (t) (t) − (t)g(t) dσ (t), W∂S [f, g] = ∂n ∂n ∂S where σ is the surface measure induced from Rd , and ∂/∂n is the derivative in the 2 outer normal direction. The definition makes sense for W2,loc functions by Sobolev trace theorems (see, e.g., [13]). The notation W stresses the fact that in one dimension, the corresponding expression is related to the Wronskian of two functions (precisely, it is the difference of the Wronskians taken at the endpoints of the interval S). We abuse verbal notation and call the expression (8) the Wronskian of f and g over ∂S for the rest of this paper. The final lemma we need is the following.
132
KISELEV AND LAST
Lemma 2.4. Suppose that two functions f , g are locally W22 and satisfy Dirichlet boundary condition on ∂. Then for every R,
R
0
W∂(B
r ∩)
[f, g] dr ≤ f W 1 (BR ∩) gW 1 (BR ∩) . 2
2
Proof. We have W∂∩BR [f, g] = 0 since f and g satisfy the boundary conditions. Next note that R W(∂B ∩) [f, g] dr ≤ (|f ||∇g| + |∇f ||g|) dx r BR ∩
0
≤ f W 1 (BR ∩) gW 1 (BR ∩) . 2
2
We used the Cauchy-Schwartz inequality in the last step. Now we are ready to prove Theorem 1.1. Proof of Theorem 1.1. An interplay of the scales in space and in the spectral parameter plays an important role in the analysis. Let us assume that φ(x)u(x, E) dx = c = 0.
Take sufficiently large R0 , such that supp φ ⊂ BR0 . By Green’s formula, we have E θ (x, E + i-)u(x, E) dx BR0 ∩ = W∂(BR0 ∩) [θ, u] + HV θ(x, E + i-)u(x, E) dx BR0 ∩
= W∂(BR0 ∩) [θ, u] + (E + i-) +
BR0 ∩
BR0 ∩
θ(x, E + i-)u(x, E) dx
φ(x)u(x, E) dx.
In the above computation, we used the definition of θ(x, z) and the fact that the function u satisfies (HV − E)u = 0. Hence we obtain (9) θ(x, E + i-)u(x, E) dx. W∂(BR0 ∩) [θ, u] = −c − iBR0 ∩
Let us integrate (9) from R0 to some larger value of R: R R W∂(B )∩ [θ, u] dr ≥ |c|(R − R0 ) − dr r R0
R0
Br ∩
θ(x, E + i-)u(x, E) dx .
133
SOLUTIONS, SPECTRUM, AND DYNAMICS
Using Lemmas 2.3 and 2.4, we see that (10) C 2 θ (x, E + i-)L2 (BR+1 ∩) + φL2 uL2 (BR+1 ∩) R ≥ |c|(R − R0 ) − drθ(x, E + i-)L2 (Br ∩) uL2 (Br ∩) . 0
According to assumption (2) of the theorem, there exists a sequence Rn → ∞, such that (11)
α/2
uL2 (BRn ∩) ≤ C1 Rn .
Let us set -n = C2 /Rn , and pick R + 1 = Rn and - = -n in (10). We obtain 2 C + C2 θ (x, E + i-n )L2 (BRn ∩) + φL2 uL2 (BRn ∩) ≥ |c|(Rn − R0 − 1). Substituting (11) into the last inequality, we find that there exists some constant C3 such that for n large enough, we have (12)
1−(α/2)
θ (x, E + i-n )L2 (BRn ∩) ≥ C3 Rn
− φL2 .
Now it remains to invoke Lemma 2.1 and note that
−1 2 Im HV − E − i-n φ, φ ≥ -n θ x, E + i-n L
2 (BRn )
for every n. Using the estimate (12) and the relation between Rn and -n , we find −1 Im HV − E − i-n φ, φ ≥ C4 -nα−1 for sufficiently small -n. The application of Lemma 2.2 now completes the proof. Remarks. (1) Theorem 1.1 also holds for wider classes of potentials and boundary conditions. The restrictions of the classes come from Lemma 2.3, the necessary estimate on the energy norms. With the help of smooth mollifiers to justify integration by parts, Theorem 1.1 can be extended to the classes to which one can extend Lemma 2.3. (2) We also note that the same argument as in the proof implies that D α µφ (E) = ∞ if, instead of (2) in the assumption of Theorem 1.2, we suppose that lim inf R −α u(x, E)2BR = 0. R→∞
We use this fact in the proof of Corollary 2.6. The next question that we would like to discuss is a sufficient condition for the existence of the various components of the spectrum. Let us recall the definition of
134
KISELEV AND LAST
Hausdorff measures and dimension. For α ∈ [0, 1] and any S ⊂ R, the α-dimensional Hausdorff measure of S is defined by hα (S) = lim
inf
δ→0 δ-covers
∞
|Iγ |α ,
γ =1
where Iγ are the intervals constituting the cover. The Hausdorff dimension of a set S is the infimum of all values of α such that hα (S) = 0. First, we prove the following theorem. Theorem 2.5. Let HV be a Schrödinger operator, with V and satisfying the same conditions as in Theorem 1.1. Suppose that for a measurable set S of positive hα measure, for each E ∈ S, there exists a nontrivial solution u(x, E) of the generalized eigenfunction equation (1) satisfying the boundary conditions such that lim inf R −α u(x, E)2BR < ∞. R→∞
Then there exists a vector ϕ ∈ L2 (R n ) such that µϕ (S1 ) > 0 for any S1 ⊂ S of positive hα measure. In particular, if α = 1, we have an absolutely continuous spectrum filling the set S. Remark. In many applications, particularly in one dimension, one applies a reasoning different from that suggested by Theorem 2.5 to derive the existence of various dimensional spectral components from results like Theorem 1.1. One proves the existence of solutions as in (2) for a.e. E, and then uses rank-one perturbation arguments (see, e.g., [20], [26]). Proof of Theorem 2.5. Recall that for every self-adjoint operator, there is an associated spectral measure of maximal type, µ, such that for every ψ and any measurable set S, µψ (S) > 0 implies µ(S) > 0. A vector χ is of the maximal type if for any measurable set S, µχ (S) > 0 given that µ(S) > 0. We show that for any S1 ⊂ S of positive α-dimensional Hausdorff measure, there exists a vector ψ with µψ (S1 ) > 0. By the standard argument for the existence of vectors of maximal type (see, e.g., [6]), this would imply existence of the vector ϕ as in the theorem. Pick some ball BR0 such that u(x, E)L2 (BR ∩ ) = 0 for energies E in a subset S2 of S1 of positive hα 0 measure (it is easy to see that such a ball exists, because of the σ -additivity of hα ). We remark that for a wide class of operators HV , an arbitrary ball will do because of the unique continuation (solutions u(x, E) cannot vanish identically on any ball), but there is no need to invoke these results. Pick a basis {ψn (x)}∞ n=1 in the Hilbert space L2 (BR0 ∩ ). Since {ψn } forms a basis, for every E ∈ S2 , there exists an n such that ψn (x)u(x, E) dx = 0. BR0 ∩
Consider the functions D α µψn on the set S2 . By Theorem 1.1, for every E ∈ S2 , there exists an n such that D α µψn (E) > 0. In particular, by σ -additivity of hα , there
SOLUTIONS, SPECTRUM, AND DYNAMICS
135
exists an n0 such that D α µψn0 (E) > 0 for every E in a set Sn0 ⊂ S2 of positive hα measure. By the results of Rogers-Taylor theory (see [36, Theorem 63]), it follows that the measure µψn0 gives positive weight to the set Sn0 , and hence to the set S1 . The case of the absolutely continuous spectrum corresponds to α = 1; in this case, the application of Rogers-Taylor theory may be replaced by the well-known fact that a measure gives positive weight to a set of positive Lebesgue measure when its derivative is positive a.e. in this set. From Theorem 2.5 (or, essentially, from its proof and the remark after the proof of Theorem 1.1), we have the following corollary. Corollary 2.6. For any α, the set S of energies E, for which there exists a solution u(x, E) satisfying lim inf R −α u(x, E)2BR = 0,
(13) has zero
R→∞
hα
measure.
Remark. The fact that there may be only countably many values of E (counting multiplicities) for which equation (1) has L2 solutions satisfying the boundary conditions, is an obvious consequence of the separability of the Hilbert space L2 (). This corollary may be viewed as a less trivial generalization for slower rates of decay. Proof of Corollary 2.6. Suppose that S has positive hα measure. By the remark α φ after the proof of Theorem 1.1, (13) implies that D µ (E) = ∞ for every E ∈ S and finitely supported φ such that u(x, E)φ(x) = 0. Proceeding as in the proof of Theorem 2.5, we can find a vector ϕ such that D α µϕ (E) = ∞ for any E in some set of positive hα measure. This is not possible by Rogers-Taylor theory (see [36, Theorem 67]) and therefore gives a contradiction. We remark that for α = 1, this argument reduces to the well-known statement that a finite Borel measure µψ cannot have an infinite derivative on a set of positive Lebesgue measure. We would like to end this section by drawing a link with the well-known results of Rellich [34] and Kato [22], who showed, respectively, that for the free Laplacian and the Laplacian with a short-range perturbation (i.e., a potential that satisfies |V (x)| ≤ C(1 + |x|)−1−- ), there are no solutions satisfying (13) with α = 1 for any energy. Corollary 2.6 shows that for a much larger class of potentials, such solutions are still in some sense “exceptional” and can only occur on a set of energies of zero Lebesgue measure. 3. Solutions and spectrum: Discrete case. In this section, we consider discrete Schrödinger operators. All the results of the previous section extend to the discrete setting. In fact, the proofs are easier due to the absence of the Sobolev estimates issue, and there are no restrictions on potential. Let be some connected infinite domain in Zd . We define the Schrödinger operator hv on L2 () with Dirichlet boundary conditions by
136
KISELEV AND LAST
h v f (n) =
f (m) + v(n)f (n).
|n−m|=1, m∈
It is easy to check that the operator defined in this way is self-adjoint. We need an analog of Green’s formula in the discrete setting. For any domain S ⊂ Z d , let us denote by ∂S the set of points outside S that have a point of S within a unit distance. We have, for any two functions f , g, f (m) h g(l) − g(m) f (l) , v f (n)g(n) − f (n)hv g(n) = n∈S
m∈∂S
l∈NS (m)
l∈NS (m)
where NS (m) denotes the set of neighbors of the point m ∈ ∂S lying in S (so that |m − n| = 1 for any n ∈ NS (m)). Therefore, we say that the analog of the Wronskian over ∂S of two functions is, in the discrete setting, f (m) g(l) − g(m) f (l) . w∂S [f, g] = m∈∂S
l∈NS (m)
l∈NS (m)
For convenience, in all considerations for the discrete case, we replace the balls BR with cubes CR . The point n = (n1 , . . . , nd ) of the lattice belongs to CR if and only if |ni | ≤ R for all i = 1, . . . , d. We now formulate and prove an analog of Theorem 1.1 in the discrete case. Theorem 3.1. Suppose that there exists a solution u(n, E) of the generalized eigenfunction equation (14) hv − E u(n, E) = 0 satisfying the Dirichlet boundary conditions on ∂. Suppose that for some α, 0 ≤ α ≤ 1, we have (15) |u(n, E)|2 dx < ∞. lim inf R −α R→∞
n∈CR ∩
Fix some vector φ of compact support such that u(n, E)φ(n) = 0. n
Then we have D α µφ (E) > 0. In particular, if u(n0 , E) = 0, then D α µδn0 (E) > 0 (here δn0 is a function equal to 1 at n0 and 0 otherwise).
SOLUTIONS, SPECTRUM, AND DYNAMICS
137
Proof. The argument repeats the proof of Theorem 1.1, except that we do not need Lemma 2.3. The analog of Lemma 2.4 is proven directly by the observation that w∂(∩Cr ) [f, g] = w∂Cr \∂ [f, g] and R w∂C
r \∂
r=1
[f, g] ≤ df L2 (CR+1 ∩) gL2 (CR+1 ∩) .
Remark. As in the continuous case, the result is also true for more general boundary conditions. The following theorem is an analog of Theorem 2.5. Theorem 3.2. Suppose that for each energy E in some measurable set S of positive hα measure, there exists a nontrivial solution u(n, E) of (14) satisfying Dirichlet boundary conditions and having the property (15). Then there exists a vector φ ∈ L2 (Zd ), such that µφ (S1 ) > 0 for any set S1 ⊂ S of positive hα measure. In particular, if α = 1, the set S is an essential support of the absolutely continuous part of the measure µφ restricted to S. The proof of this theorem is the same as the proof of Theorem 2.5. 4. Examples and discussion. The purpose of this section is purely illustrative—to show where the solutions we are studying are known to occur. However, these observations also partly lead us to the issue that is the topic of the next section: the relationships between generalized eigenfunctions, spectrum, and dynamics. In addition, we show that the criteria given by Theorems 1.1 and 3.1 are sufficient but not, in general, necessary for the positivity of the derivatives of spectral measures. We give an explicit example to confirm this statement. Our first remark is that solutions u(x, E) satisfying lim inf R −1 u(x, E)2BR < ∞ R→∞
exist for every energy E = 0 in the spectrum in the case of the free Laplacian operator in Rd or in the cylinder with Dirichlet boundary conditions. In the cylinder case, we may take u(x, E) = exp i E − El x1 Zl (x2 , . . . , xd ), where x1 is the coordinate along the rotation axis, El is any eigenvalue (less than E) of the Laplace operator with Dirichlet boundary conditions on the d−one-dimensional ball, and Zl (x2 , . . . , xd ) is any eigenfunction corresponding to this eigenvalue. In the free case, we can take any function √ u(x, E) = r −(d/2)+1 Jν Er Yl (θ),
138
KISELEV AND LAST
where Yl is any of the spherical harmonics corresponding to the eigenvalue El = l(l + d − 2) of the Laplace-Beltrami operator on the d-dimensional sphere, and Jν is a Bessel function (without singularity at the origin) with ν defined by ν 2 = l(l + d − 2) + ((d/2) − 1)2 . Note that for large r,
√ √ 2πν − π −1/2 Jν Er ∼ Cr 1 + o(1) . cos Er − 4 See, for example, [4], [44] for more information on spherical harmonics and Bessel functions. Using the results of Agmon theory and related estimates on the Fourier transform (see [1] or [33], and [2]), it is straightforward to show that the existence for every E ∈ (0, ∞) of solutions with the rate of growth of the L2 norm as in (2) with α = 1 extends to perturbations of the free Laplacian by short-range potentials, |V (x)| ≤ C(1+|x|)−1−- , if C is sufficiently small. In one dimension, it was recently shown in [7], [35] that such solutions exist for a.e. E ∈ (0, ∞) for any potential V satisfying |V (x)| ≤ C(1+|x|)−(1/2)−- . This implies that the absolutely continuous spectrum of the free operator in one dimension is stable under all perturbations decaying at this rate. This result is optimal: there are potentials that satisfy |V (x)| ≤ C(1 + |x|)−1/2 and lead to purely singular spectrum in (0, ∞). The corresponding question about the borderline decay for the stability of the absolutely continuous spectrum is open in higher dimensions, with any power in [1, 1/2] a possible candidate, in principle. We make the following conjecture. Conjecture 4.1. Suppose that HV is a Schrödinger operator in Rd for which |V (x)| ≤ C(1 + |x|)−(1/2)−- , - > 0. Then the absolutely continuous spectrum of the operator HV fills the whole positive semiaxis. This conjecture would, in particular, be clear from the following. Conjecture 4.2. Under the conditions of the previous conjecture, for a.e. E ∈ (0, ∞), there exists a solution u(x, E) of the generalized eigenfunction equation satisfying (2) with α = 1. Our next example concerns Schrödinger operators with periodic potentials. Let V (x) be a smooth periodic potential of period 1 in all variables x1 , . . . , xd . Given E in the spectrum of HV , consider the boundary value problem
(16)
(HV − E)b(x, E) = 0, ∂ j b = exp(iθl ) j , l = 1, . . . , d, j = 0, 1. ∂xl xl =1 ∂xl xl =0 j
∂j b
The set of all values of θ ∈ [0, 2π)d for which there exist solutions of the boundary value problem (16) is called the real (physical) Fermi surface FE . From well-known results on spectral properties of periodic differential operators (see [28]), it follows
139
SOLUTIONS, SPECTRUM, AND DYNAMICS
that for all but a countable set of energies in the spectrum (exceptional points corresponding to band edges), we can find solutions u(x, E) of the generalized eigenfunction equation (1) of the following type: u(x, E) = b(x, θ, E)γ (θ) dσ, (17) S
where S ⊂ FE is a piece of an analytic (d −1)-dimensional surface, γ (θ) is a C0∞ (S)function, and b(x, θ, E) are Bloch functions satisfying (16): b(x, θ, E) = exp(iθx)f (x, θ, E), where f (x, θ, E) is periodic with period 1 in all directions in x, continuous in x, and analytic (as an L2 ([0, 1)d ) vector) in θ ∈ S. We claim that u(x, E) satisfies lim inf R −1 u(x, E)2BR ≤ ∞. R→∞
This can be shown in a way similar to the proof of this property in the case of Fourier transforms of measures supported on (d − 1)-dimensional smooth surfaces (see [2]). Represent the equation of the surface S as θd = s(θ1 , . . . , θd−1 ) (we can assume that S is small enough and θd is chosen so that this is possible). Then we can rewrite (17) as exp iθ x + is θ xd f x, θ , E γ θ dθ , u(x, E) = S
where the integration is now over the projection S of S on the hyperplane θd = 0, θ denotes first d − 1 coordinates, and γ includes the Jacobian from the change of variables. Fix the value of xd and integrate over the cube CR in the other coordinates x = x1 , . . . , xd−1 : |u(x, E)|2 dx = γ θ γ θ˜ exp i s θ − s θ˜ xd CR
S S
×
CR
exp i θ − θ˜ x f x , θ , E f x , θ˜ , E dx dθ d θ˜ .
Without loss of generality, take R to be an integer. Then we obtain d−1 sin R + (1/2) θj − θ˜j |u(x, E)|2 dx = dθ d θ˜ (18) ψ θ , θ˜ , sin(1/2) θj − θ˜ C S S j =1
R
j
where ψ θ , θ˜ = γ θ γ θ˜ exp i s θ − s θ˜ xd f x , θ , E f x , θ˜ , E exp i θ − θ˜ x dx . × C1
140
KISELEV AND LAST
Figure 1. The spiral domain
Due to the properties of f and γ , the function ψ is smooth, and hence the right-hand side in (18) converges as R → ∞ to the constant 2 2 f x , θ , E dx . C= dθ ψ θ , θ = dθ γ θ S
S
C1
Therefore, integrating in xd from −R to R, we obtain |u(x, E)|2 dx ≤ CR, BR
as claimed. Our last example in this section shows that the criteria for the positivity of derivatives of spectral measures, given by Theorems 1.1 and 3.1, provide a sufficient, but, in general, not necessary condition. The example is especially simple and transparent in the discrete setting. Let us consider the discrete plane Z2 and let be an infinite “spiral” in this plane (see Figure 1; we marked by × the points that do not belong to the domain). Consider h 0 defined on the spiral with Dirichlet boundary conditions. 2 By inspection, we see that h 0 acts on l () as a free one-dimensional Jacobi matrix. Hence the spectrum is absolutely continuous in [−2, 2], and for every E in this interval, there exists an explicitly computable unique solution u(n, E) of the generalized eigenfunction equation satisfying the boundary conditions:
E u(n, E) = sin cos−1 n . 2 This is a standard discrete plane wave. If we measure the linear distance N along the spiral, the square of the l 2 -norm of this solution grows as N. However, in Z2 , we have u(x, E)2BR ∩ ∼ R 2 .
SOLUTIONS, SPECTRUM, AND DYNAMICS
141
Hence in this case, we cannot find solutions as in Theorem 1.1. We remark that [40] contains an example of a bounded spiral jelly roll domain on which the Laplace operator with Neumann boundary conditions has absolutely continuous spectrum. In this case, for a.e. E in the spectrum, the norm of solutions becomes infinite for finite R. 5. Solutions and dynamics. In this section, we prove Theorem 1.2 and apply it to study quantum dynamics in the random decaying potentials model studied in [10] and more recently in [26], [27]. The previous section provided us with several examples of operators with absolutely continuous spectrum and solutions satisfying the condition (2) in Theorem 1.1 for α = 1, and one example of an operator with absolutely continuous spectrum, but without such solutions. For the former three, the transport is ballistic for every vector (i.e., |X|m T ∼ T m ); for the latter, it is easy to see that the transport is not ballistic (it is diffusive in Z2 ). Theorem 1.2 indicates that this is not a coincidence. The proof of Theorem 1.2 is an extension of the proof of Theorem 6.1 in [29], and it is essentially the same in both the discrete and continuous settings. We use a discrete notation that formally only covers the discrete case, but the continuous case follows from it in a totally straightforward manner (which essentially amounts to replacing n by x and some summations by integrals). We note that in [29, Theorem 6.2] the continuous case gets an independent treatment, based on semigroup kernel inequalities. This is not needed here, since we assume the existence of eigenfunction expansions with suitable properties. This allows our Theorem 1.2 to cover some cases, such as Stark operators, that are excluded from [29, Theorem 6.2]. Recall that a measure µ is called uniformly α-Hölder continuous (denoted UαH) if there exists a constant C such that for every interval I with |I | ≤ 1, we have (19)
µ(I ) ≤ C|I |α .
α-continuous measures (recall that this means measures giving zero weight to all sets of zero hα measure) can be approximated by UαH measures in the following sense. Theorem (Rogers-Taylor [37]). A finite Borel measure µ on R is α-continuous if and only if, for every - > 0, there exist two mutually singular Borel measures µ-1 and µ-2 , such that µ = µ-1 + µ-2 , where µ-1 is UαH and µ-2 (R) < -. For UαH measures, we can study dynamics with the aid of the following Strichartz estimate. Theorem (Strichartz [43]). Let µ be a finite UαH measure, and for each f ∈ L2 (R, dµ), denote f µ(t) = exp(−ixt)f (x) dµ(x). Then there exists a constant C1 , depending only on µ (more precisely, only on C in (19)), such that for any f ∈ L2 (R, dµ) and T > 0,
142
KISELEV AND LAST
|f µ|2
< C1 f 2 T −α ,
T
where f is the L2 norm of f . We now prove Theorem 1.2. Proof of Theorem 1.2. Without loss of generality, assume ψ = 1. We first establish the existence of a Borel set S˜ ⊂ S, for which the following three properties are true: (i) PS˜ ψ > 0. (ii) The restriction of the spectral measure µψ to S˜ is UαH. (iii) There exists a constant C2 , such that for each E ∈ S˜ and R > 0, the corresponding generalized eigenfunction u(n, E) satisfies |u(n, E)|2 < C2 R γ . |n|
We establish (i)–(iii) in two stages. First, note that the function |u(n, E)|2 f (E) ≡ sup R −γ R>0
|n|
is a measurable function of E, which, by (3), is finite everywhere on S. Thus, since S= ∞ {E ∈ S | f (E) < k}, there is clearly a Borel subset S1 ⊂ S of positive k=1 ψ µ measure and a constant C2 , such that f (E) < C2 for any E ∈ S1 . That is, property (iii) holds for S1 . Next, since the restriction of µψ to S1 is α-continuous, the aforementioned Rogers-Taylor theorem implies that there is a Borel subset S˜ ⊂ S1 of positive µψ measure (so that property (i) holds) such that the restriction of µψ to S˜ is UαH (so that property (ii) holds). Let us now denote ψ1 = PS˜ ψ, ψ2 = PR\S˜ ψ, where P denotes the spectral projection over the corresponding set. Then ψ = ψ1 + ψ2 , and ψ1 , ψ2 are mutually orthogonal so that 1 = ψ2 = ψ1 2 + ψ2 2 . Let PRT be the projector on the set of sites n with |n| ≤ RT . RT is a function of the time parameter T to be chosen later. Given any vector ϕ, we routinely use the notation ϕ(t) = exp(−ih v t)ϕ. By the Strichartz theorem, we have 2 1 T exp(−iEt)u(n, E) dµψ1 (E) dt PRT ψ1 (t)2 T = T 0 |n|≤RT −α ≤ C1 T |u(n, E)|2 dµψ1 (E) |n|
≤ C1 ψ1 2 sup
E∈S˜ |n|
|u(n, E)|2 T −α ,
143
SOLUTIONS, SPECTRUM, AND DYNAMICS
and so
PRT ψ1 (t)2
(20)
T
γ
≤ C2 C1 ψ1 2 RT T −α .
For each T > 0, we now define RT =
ψ1 2 T α 64 C2 C1
1/γ ,
such that we have
PRT ψ1 (t)2
T
<
ψ1 4 , 64
and thus
PRT ψ(t)2
T
2 2 ≤ PRT ψ1 (t) + PRT ψ2 (t) ≤ PRT ψ1 (t) + ψ2 T T
2
2 2 ψ1 ≤ PRT ψ1 (t)2 T + ψ2 < + ψ2 8 ψ1 4 1 1 + ψ2 2 + ψ2 ψ1 2 < ψ2 2 + ψ1 2 64 4 2 1 = 1 − ψ1 2 . 2 =
Since
PR ψ(t) 2 + 1 − PR ψ(t) 2 = 1, T T T T
we obtain
1 − PR ψ(t) 2 > 1 ψ1 2 , T T 2
which implies
|X|m
T
1 ψ1 2 > ψ1 2 RTm = 2 2
ψ1 2 64 C2 C1
m/γ
T αm/γ ,
proving (4). Note that the above proof does not attempt to provide optimal estimates. We could (by allowing various constants to grow) choose ψ1 to have a norm that is arbitrarily close to that of PS ψ, and RT ∼ T α/γ so that (1 − PRT )ψ(t)2 T is larger than something arbitrarily close to PS ψ2 . This means that there is a component of the wave packet of size corresponding to PS ψ that is spreading on average at a rate of at least T α/γ .
144
KISELEV AND LAST
We now apply Theorem 1.2 to investigate dynamics for the following model. Let vω (n) be independent random variables such that (21) E vω (n) = 0,
1/2 E vω (n)2 = λn−1/2 ,
and
sup |vω (n)| ≤ Cn−(1/3)−δ , ω
δ > 0.
For example, if we take idependent, √ √identically distributed random variables aω (n) with uniform distribution in [− 3, 3], then vω (n) = λn−1/2 aω (n) satisfy all the conditions. The half-line random Schrödinger operators hω with, say, Dirichlet boundary conditions at zero and potential vω exhibit very rich spectral structure. Such operators where studied by Delyon, Simon, and Souillard [10], and more recently by Kotani and Ushiroya [27] and Kiselev, Last, and Simon [26]. Our study here is based mainly on the results of the last paper. In particular, the following was proven in [26]. Theorem (KLS [26]). For all ω, the essential spectrum of hω is [−2, 2]. If |λ| < 2, then for a.e. ω, hω has purely singular continuous spectrum in {E | |E| < (4−λ2 )1/2 } and only dense pure point spectrum in {E | (4 − λ2 )1/2 < |E| < 2}. For a.e. ω and E ∈ (−2, 2), (22)
λ2 log TE (n, 0) = , n→∞ log n 8 − 2E 2 lim
and there exists an initial condition θ(ω) at zero such that (23)
λ2 log TE (n, 0)uθ(ω) =− , n→∞ log n 8 − 2E 2 lim
where uθ (ω) is the 2-vector corresponding to the boundary condition θ(ω) at 0, and TE (n, 0) is the transfer matrix from 0 to n at energy E. √ √ This theorem implies that for a.e. ω and E ∈ (− 4 − λ2 , 4 − λ2 ), the spectral measure µ (corresponding to the vector δ1 ) has local Hausdorff dimension (24)
α(E, λ) =
4 − E 2 − λ2 4 − E2
at energy E, in the sense that for any - > 0, there is a δ so that µ(A) = 0 if A is a subset of (E − δ, E + δ) of Hausdorff dimension less than α(E, λ) − -, and there is a subset B of Hausdorff dimension less than α(E, λ) + - such that µ((E − δ, E + δ) \ B) = 0. These properties of the spectral measure follow from (22), (23) by subordinacy theory [19], [20]. See [26] for details. Remark. The KLS theorem also provides an example indicating that the criterion of Theorem 1.1 is optimal in the sense that one cannot, in general, say more by looking at the rate of growth of the L2 -norm. Indeed, by (22), for a.e. ω, all solutions
SOLUTIONS, SPECTRUM, AND DYNAMICS
145
u(n, ˜ E) for every energy E in the continuous spectrum satisfy R −ρ u(n, ˜ E)2BR ≤ C, for any λ2 4 − E2 and all R. In particular, for every ρ > 1, we can take λ sufficiently small to ensure the existence of an interval Iρ around E = 0 such that for a.e. ω, all solutions (and in particular the one obeying the boundary condition) satisfy ρ > 1+
˜ E)2BR < C R −ρ u(n, for E ∈ Iρ . Yet for a.e. E ∈ Iρ , we have Dµ(E) = 0 since the measure is purely singular. This shows that no condition of type (2) with α > 1 leads, in general, to pointwise estimates on the derivatives of spectral measures. This remark sounds trivial in one dimension, but it is straightforward (using the analysis of [26] for the continuous analog Vω (x) of the family of random potentials we study) to give a similar example that works in any dimension (in the continuous case). Set HVω = −+λVω (r) with spherically symmetric potential. Using spherical symmetry, one shows that the spectrum of HVω is purely singular with probability 1. However, for every ρ > 1, there are solutions for a.e. ω and all energies E sufficiently large such that (2) holds with α = ρ. The following theorem shows that as long as our operators hω have some continuous spectrum (which may be of arbitrarily small dimension), their transport properties are arbitrarily close to ballistic. Theorem 5.1. Consider the family hω of random Schrödinger operators defined on Z+ with potential λvω (n), where λ < 2 and the potential satisfies (21). Then for a.e. ω, for every ψ such that Pc (ω)ψ = 0 (where Pc (ω) is the projector on the continuous spectrum of the operator hω ), we have that for every - > 0 and m > 0, there is a positive constant C-,m,ω such that for any T > 0, m |X| ψ(t), ψ(t) T ≥ C-,m,ω T m(1−-) . (25) Proof. By the results of the Gilbert-Pearson theory, the spectral measure µ is supported on the set of the energies E for which the decaying solution (23) satisfies the boundary condition (namely, θ(ω) coincides with the Dirichlet boundary condition). Moreover, these decaying solutions, which we denote by u(n, E), are exactly the generalized eigenfunctions in the sense of Theorem 1.2, if we normalize them by setting u(1, E) = 1. Fix ω such that the results of the KLS theorem hold. (23) implies that the generalized eigenfunctions u(n, E) of the operator hω satisfy (26)
lim sup R −γ u(n, E)2BR ≤ ∞ R→∞
146
KISELEV AND LAST
for√ every γ √ > α(E, λ) given by (22). Pick an open energy interval I = (E1 , E2 ) ⊂ (− 4 − λ2 , 4 − λ2 ), such that 0 ∈ / I , and µψ (I ) > 0. Let α1 = α(E1 , λ), α2 = α(E2 , λ). α(E, λ) is monotone on I . Assume, without loss, that α1 < α2 . The restriction of µψ to I is α1 -continuous, and by (26), lim supR→∞ R −α2 u(n, E)2BR < ∞ for any generalized eigenfunction u(n, E) with E ∈ I . Thus, by Theorem 1.2, for each m > 0, there is a constant Cm,I,ω such that for all T > 0, m |X| ψ(t), ψ(t) T ≥ Cm,I,ω T mα1 /α2 . Since Pc (ω)ψ = 0, we can clearly choose such an interval I with α1 /α2 > 1 − - and µψ (I ) > 0. Thus, Theorem 5.1 follows. Remark. By using an extension of the proof of Theorem 1.2, we can show that there is actually a component of the wave packet of size corresponding to Pc (ω)ψ that is spreading on average at a rate that is arbitrarily close to ballistic. More explicitly, we can show that for a.e. ω, for every - > 0 and ρ > 0, there exists a constant Cω,ρ,such that if RT = Cω,ρ,- T 1−- , then (27) PRT ψ(t)2 T ≤ ψ − Pc (ω)ψ2 + ρ. This easily yields Theorem 5.1 and is thus a stronger statement. Appendices A. Generalizations. The whole proof of Theorem 1.1 readily extends to more general settings. Namely, we can replace the operator HV with a general, uniformly elliptic self-adjoint operator J such that J = ∂l − iAl (x) alk (x) ∂k − iAk (x) + V (x), provided that alk , Al , and V are “nice enough” (for example, bounded and sufficiently smooth). The proof for this case is very similar. Green’s formula leads us to consider the following modified Wronskian: W∂S [f, g] = cos n, xl alk ∂k − iAk u v − u cos n, xk ∂l − iAl alk v dσ. ∂S
It is clear that under our assumptions, the analog of Lemma 2.4 holds. The estimate of Lemma 2.3 also holds with the constant independent of R by the standard Sobolev estimates for bounded sufficiently smooth coefficients (see, e.g., [13], [31]). The rest of the proof does not change. A similar remark applies to some higher-order operators and systems. In particular, in one dimension, a self-adjoint half-line differential operator of order 2n is given by the expression (n) (n−1) (Lf )(x) = (−1)n p0 f (n) + (−1)n−1 p1 f (n−1) + · · · + pn f
SOLUTIONS, SPECTRUM, AND DYNAMICS
147
and a set of self-adjoint boundary conditions at zero. The analog of the Wronskian in this case is determined by integration by parts: (28) WL,x [f, g] =
j n
(m−1) (j −m) (m−1) (−1)m pj f (j ) g − f (j −m) pj g (j ) ,
j =1 m=1
where all values are taken at the point x. The analog of the Sobolev estimates of Lemma 2.3 is now the claim that for a solution of (L − E)u = φ, uW2m (BR ) ≤ CuL2 (BR+1 ) holds for m ≤ 2n−1. Such estimates (in fact, for m ≤ 2n) are well known to hold for operators with bounded sufficiently smooth coefficients (see, e.g., [31]). The analog of Lemma 2.4 follows directly from (28); the rest of the proof of Theorem 1.1 does not change. In particular, we have the following theorem. Theorem A.1. Let L denote the self-adjoint differential operator of order 2n with bounded sufficiently smooth (say, infinitely differentiable) coefficients. Suppose that for every E in a set S of positive Lebesgue measure, there exists a bounded solution u(x, E) of the generalized eigenfunction equation (L − E)u = 0 satisfying the boundary conditions. Suppose that for a compactly supported function φ ∈ L2 , we have u(x, E)φ(x) dx = 0 for a.e E ∈ S. Then the absolutely continuous part of the spectral measure µφ fills S (so that µφ (S1 ) > 0 for any S1 ⊂ S of positive Lebesgue measure). Remark. Of course, we can also allow for φ that are not L2 , but from the Sobolev space H−2 (HV ), such as the δ function and its derivatives up to 2n−1, which are often used in the setting of one-dimensional differential operators. The spectral measure is not finite in this case, but nothing else changes. Theorem A.1 follows from the above discussion and proof of Theorem 2.5. This result may be viewed as a sort of analog of [41], [42] for the higher-order case. It is typical, though, that our condition involves only one solution ([41], [42] require all solutions to be bounded) because the possible multiplicity of the spectrum makes it unreasonable to demand all solutions to be bounded (in higher-order cases) to get absolutely continuous spectrum. On the other hand, our result does not guarantee pure absolute continuity.
148
KISELEV AND LAST
B. One-dimensional perturbed Stark operators. In this appendix, we make a remark concerning dynamical properties of a certain class of perturbed Stark operators. We denote by HV ,S the operator defined on the whole axis by the differential expression −
d2 − x + V (x). dx 2
Our results are based on the following theorem, proved in [25]. Theorem [25]. Suppose that |V (x)| ≤ C(1 + |x|)−(1/3)−- , or that V is bounded and has a derivative V that is bounded and Hölder continuous. Then the whole axis (−∞, ∞) is an essential support of the absolutely continuous part of the spectral measure µ. Moreover, for a.e. E ∈ R, there exist two linearly independent solutions u± (x, E), such that u± (x, E) = x
−1/4
2 3/2 exp ± i x + f± (x, E) 1 + o(1) 3
as x → +∞, where |f± (x, E)| ≤ C(1 + x)−1/2 . Stark operators do not fit into the framework provided by Theorem 1.1 because of the strong negative part of the potential (and resulting failure of Lemma 2.3). Indeed, for a.e. energy E, we have a solution u(x, E) that satisfies R −1/2 u(x, E)BR ≤ C(E), which, if Theorem 1.1 were true, would imply D 1/2 (E) > 0 a.e. E. It should be possible to prove an analog of Theorem 1.1 for some perturbed Stark operators, taking into account that instead of the Sobolev estimates of Lemma 2.3, we have ∇u2BR ≤ CRu2BR . However, the criterion of Theorem 1.2 applies, immediately giving the following theorem. Theorem B.1. Under the conditions of the previous theorem, for every vector ψ with nonzero projection on the absolutely continuous subspace, we have m |X| ψ(t), ψ(t) T ≥ CT 2m . We note that there are examples (see [32]) of potentials V satisfying |V (x)| ≤ C(x)(1 + |x|)−1/3 , where C(x) tends to infinity as x → ∞, but arbitrarily slowly, such that for a corresponding Stark operator, there is a dense set of eigenvalues embedded in the absolutely
SOLUTIONS, SPECTRUM, AND DYNAMICS
149
continuous spectrum. Theorem B.1 shows that such potentials, nevertheless, do not slow down dynamics corresponding to the absolutely continuous component. References [1] [2] [3] [4] [5] [6]
[7]
[8]
[9] [10] [11]
[12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22]
S. Agmon, Spectral properties of Schrödinger operators and scattering theory, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 2 (1979), 151–218. S. Agmon and L. Hörmander, Asymptotic properties of solutions of differential equations with simple characteristics, J. Anal. Math. 30 (1976), 1–38. M. Aizenman and B. Simon, Brownian motion and Harnack inequality for Schrödinger operators, Comm. Pure Appl. Math. 35 (1982), 209–273. Bateman Manuscript Project, Higher Transcendental Functions, 2, McGraw-Hill, New York, 1953–1955. J. M. Berezanskii, Expansions in Eigenfunctions of Selfadjoint Operators, Transl. Math. Monogr. 17, Amer. Math. Soc., Providence, 1968. M. Birman and M. Solomyak, Spectral Theory of Selfadjoint Operators in Hilbert Space, Trans. S. Khrushchëv and V. Peller, Math. Appl. (Soviet Ser.), D. Rédel Publishing Co., Dordrecht, 1987. M. Christ and A. Kiselev, Absolutely continuous spectrum for one-dimensional Schrödinger operators with slowly decaying potentials: Some optimal results, J. Amer. Math. Soc. 11 (1998), 771–797. J. M. Combes, “Connections between quantum dynamics and spectral properties of timeevolution operators” in Differential Equations with Applications to Mathematical Physics, Math. Sci. Engrg. 192, Academic Press, Boston, 1993, 59–68. D. Damanik, α-continuity properties of one-dimensional quasicrystals, Comm. Math. Phys. 192 (1998), 169–182. F. Delyon, B. Simon, and B. Souillard, From power pure point to continuous spectrum in disordered systems, Ann. Inst. H. Poincaré Phys. Théor. 42 (1985), 283–309. R. del Rio, S. Jitomirskaya, Y. Last, and B. Simon, Operators with singular continuous spectrum, IV. Hausdorff dimensions, rank one perturbations, and localization, J. Anal. Math. 69 (1996), 153–200. L. C. Evans, Partial Differential Equations, Grad. Stud. Math. 19, Amer. Math. Soc., Providence, 1998. D. Gilbarg and N. Trudinger, Elliptic Partial Differential Equations of Second Order, Grundlehren Math. Wiss. 224, Springer-Verlag, Berlin, 1977. D. Gilbert, On subordinacy and analysis of the spectrum of Schrödinger operators with two singular endpoints, Proc. Roy. Soc. Edinburgh Sect. A 112 (1989), 213–229. D. Gilbert and D. Pearson, On subordinacy and analysis of the spectrum of one-dimensional Schrödinger operators, J. Math. Anal. Appl. 128 (1987), 30–56. I. Guarneri, Spectral properties of quantum diffusion on discrete lattices, Europhys. Lett. 10 (1989), 95–100. , On an estimate concerning quantum diffusion in the presence of a fractal spectrum, Europhys. Lett. 21 (1993), 729–733. S. Jitomirskaya, in preparation. S. Jitomirskaya and Y. Last, Dimensional Hausdorff properties of singular continuous spectra, Phys. Rev. Lett. 76 (1996), 1765–1769. , Power law subordinacy and singular spectra. I. Half-line operators, preprint. , Power law subordinacy and singular spectra. II. Line operators, in preparation. T. Kato, Growth properties of solutions of the reduced wave equation with a variable coefficient, Comm. Pure Appl. Math. 12 (1959), 403–425.
150 [23] [24]
[25] [26]
[27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44]
KISELEV AND LAST R. Ketzmerick, K. Kruse, S. Kraut, and T. Geisel, What determines the spreading of a wave-packet?, Phys. Rev. Lett. 79 (1997), 1959–1963. A. Kiselev, Absolutely continuous spectrum of one-dimensional Schrödinger operators and Jacobi matrices with slowly decreasing potentials, Comm. Math. Phys. 179 (1996), 377–400. , Absolutely continuous spectrum of perturbed Stark operators, to appear in Trans. Amer. Math. Soc. A. Kiselev, Y. Last, and B. Simon, Modified Prüfer and EFGP transforms and the spectral analysis of one-dimensional Schrödinger operators, Comm. Math. Phys. 194 (1998), 1–45. S. Kotani and N. Ushiroya, One-dimensional Schrödinger operators with random decaying potentials, Comm. Math. Phys. 115 (1988), 247–266. P. Kuchment, Bloch solutions of periodic partial differential equations, Funktsional Anal. i Prilozhen 14 (1980), 65–66. Y. Last, Quantum dynamics and decompositions of singular continuous spectra, J. Funct. Anal. 142 (1996), 406–445. Y. Last and B. Simon, Eigenfunctions, transfer matrices, and absolutely continuous spectrum of one-dimensional Schrödinger operators, Invent. Math. 135 (1999), 329–367. S. Mizohata, The Theory of Partial Differential Equations, Trans. K. Miyahara, Cambridge University Press, New York, 1973. S. N. Naboko and A. B. Pushnitskii, Point spectrum on a continuous spectrum for weakly perturbed Stark type operators, Funct. Anal. Appl. 29 (1995), 248–257. M. Reed and B. Simon, Methods of Modern Mathematical Physics, IV. Analysis of Operators, Academic Press, New York, 1978. F. Rellich, Jber das asymptotische Verhalten der Lösungen von u + λu = 0 in unendlichen Gebieten, Über. Deutsch. Math. Verein 53 (1943), 57–65. C. Remling, The absolutely continuous spectrum of one-dimensional Schrödinger operators with decaying potentials, Comm. Math. Phys. 193 (1998), 151–170. C. A. Rogers, Hausdorff Measures, Cambridge University Press, London, 1970. C. A. Rogers and S. J. Taylor, Additive set functions in Euclidean space. II, Acta Math. 109 (1963), 207–240. I. Sch’nol, On the behavior of the Schrödinger equation, Mat. Sb. 42 (1957), 273–286. B. Simon, Schrödinger semigroups, Bull. Amer. Math. Soc. (N.S.) 7 (1982), 447–526. , The Neumann Laplacian of a jelly roll, Proc. Amer. Math. Soc. 114 (1992), 783–785. , Bounded eigenfunctions and absolutely continuous spectra for one-dimensional Schrödinger operators, Proc. Amer. Math. Soc. 124 (1996), 3361–3369. G. Stolz, Bounded solutions and absolute continuity of Sturm-Liouville operators, J. Math. Anal. Appl. 169 (1992), 210–228. R. S. Strichartz, Fourier asymptotics of fractal measures, J. Funct. Anal. 89 (1990), 154–187. N. Vilenkin, Special functions and the theory of group representations (in Russian), “Nauka,” Moscow, 1991.
Kiselev: Department of Mathematics, University of Chicago, Chicago, Illinois 60637, USA;
[email protected] Last: Department of Mathematics, California Institute of Technology, Pasadena, California 91125, USA;
[email protected]
Vol. 102, No. 1
DUKE MATHEMATICAL JOURNAL
© 2000
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD IS NOT FINITELY GENERATED CLAIRE VOISIN 1. Introduction. If X is a Kähler variety, the intermediate Jacobian J 2k−1 (X) is defined as the complex torus J 2k−1 (X) = H 2k−1 (X, C)/F k H 2k−1 (X) ⊕ H 2k−1 (X, Z), k 2k−1(X), where F k H 2k−1(X) is the set of classes representable by a closed form in F A that is, which is locally of the form I,J αI,J dzI ∧ dzJ , with |I | + |J | = 2k − 1 and |I | ≥ k. Griffiths [9] has defined the Abel-Jacobi map
kX : ᐆkhom (X) −→ J 2k−1 (X), where ᐆkhom (X) is the group of codimension k algebraic cycles homologous to zero on X. Using the identification n−k+1 2n−2k+1 ∗ F H (X) 2k−1 J , n = dim X (X) = H2n−2k+1 (X, Z) given by Poincaré duality, kX associates to the cycle Z = ∂, where is a real chain of dimension 2n − 2k + 1 well defined up to a 2n − 2k + 1-cycle, the element ∗ ∈ F n−k+1 H 2n−2k+1 (X) /H2n−2k+1 (X, Z),
which is well defined using the isomorphism F n−k+1 A2n−2k+1 (X)c F n−k+1 H 2n−2k+1 (X) ∼ . = dF n−k+1 A2n−2k (X) If (Zt )t∈C is a flat family of codimension k algebraic cycles on X parametrized by a smooth irreducible curve C, the map t → kX (Zt − Z0 ) factors through a homomorphism from the Jacobian J (C) to J 2k−1 (X), and one can show that the image of this morphism is a complex subtorus of J 2k−1 (X) whose tangent space is contained in H k−1,k (X) ⊂ H 2k−1 (X, C)/F k H 2k−1 (X). Defining the subgroup Received 9 March 1999. 1991 Mathematics Subject Classification. Primary 14C25, 14C30, 14M99, 32J25. Author’s work partially supported by the project Algebraic Geometry in Europe. 151
152
CLAIRE VOISIN
ᐆkalg (X) ⊂ ᐆkhom (X) of cycles algebraically equivalent to zero as the subgroup gen-
erated by the cycles Zt − Z0 for any family as above and defining the Griffiths group Griff k (X) as the quotient ᐆkhom (X)/ᐆkalg (X), it follows that the Abel-Jacobi map induces a morphism kX : Griff k (X) −→ J 2k−1 (X)tr ,
where J 2k−1 (X)tr is the quotient of J 2k−1 (X) by its maximal subtorus having its tangent space contained in H k−1,k (X). In this paper, we are mainly interested in the case where n = 3, k = 2. We use then the notation J (X), X . In [10], Griffiths proved the following theorem. Theorem 1. If X is a general quintic threefold and Z is the difference of two distinct lines in X, X (Z) is not a torsion point in J (X). Furthermore, J (X)tr = J (X). From this it follows that Griff(X) contains nontorsion elements. In [3] Clemens, using the countably many isolated rational curves in X, proved the following theorem. Theorem 2. If X is a general quintic threefold, Im X ⊗ Q is not a finitedimensional Q–vector space. In particular, Griff(X) ⊗ Q is not a finite-dimensional Q–vector space. Clemens’s theorem has been extended to complete intersections by Paranjape [15] and to Abelian threefolds by Nori [14]. (In the last case, J (X)tr is different from J (X), and one considers the Abel-Jacobi map with value in J (X)tr .) Notice that it is conjectured (see [13]) that for codimension-two cycles, the Abel-Jacobi map 2X : Griff(X) → J (X)tr is injective, so both statements should be equivalent. More recently, Nori [13] proved that there may exist nontorsion cycles in Griff k (X) for any k ≥ 3 (so X has to be of dimension at least 4), which are annihilated by the Abel-Jacobi map. Combining Nori’s ideas and the study of the Abel-Jacobi map for the general cubic sevenfold in P8 , Albano and Collino [1] even proved that for k ≥ 3 the kernel of the Abel-Jacobi map kX : Griff k (X) → J 2k−1 (X)tr may be nonfinitely generated. In this paper, we consider another kind of generalization of the Clemens theorem: Instead of a quintic threefold, we consider a Calabi-Yau threefold X; that is, X is a Kähler threefold with trivial canonical bundle such that H 2 (ᏻX ) = 0 (so, in particular, X is projective). For such X it is well known that the local moduli space of X is smooth of dimension dim H 1 (TX ) = dim H 1,2 (X). In [17] we proved the following. Theorem 3. Let X be a Calabi-Yau threefold. If h1 (TX ) = 0, the general deformation Xt of X satisfies that the Abel-Jacobi map Xt : ᐆ2 (Xt ) −→ J 2 (Xt ) of Xt is nontrivial, even modulo torsion.
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
153
It is easy to check that J (Xt )tr = J (Xt ) for a general point t, so the theorem implies that Griff(Xt ) contains nontorsion elements. We prove in this paper the following result. Theorem 4. Let X be a Calabi-Yau threefold. If h1 (TX ) = 0, the general deformation Xt of X has the property that the Abel-Jacobi map Xt : ᐆ2 (Xt ) −→ J (Xt ) is such that Im Xt ⊗ Q is an infinite-dimensional Q–vector space. In particular, Griff(Xt ) ⊗ Q is an infinite-dimensional Q–vector space. The one-cycles in Xt we use to prove this result are the same as in [18]. Namely, we consider for |Lt | a sufficiently ample linear system on Xt , the surfaces S ∈ |Lt |, jS S → Xt having a class λ ∈ Ker(jS ∗ : H 2 (S, Z) → H 4 (Xt , Z)), which is in F 1 H 2 (S); that is, λ is algebraic, λ = c1 (Dλ ) for some divisor Dλ on S, by the Lefschetz theorem on (1, 1)-classes. It was proved in [17] that there are countably many isolated such surfaces in Xt , and the countably many corresponding one-cycles Zλ = jS ∗ (Dλ ) homologous to zero in Xt were proved in [18] to generate a nontorsion subgroup of J (Xt ) by the Abel-Jacobi map. We were unable to show, however, that this subgroup is nonfinitely generated. The method we use is in some sense related to a suggestion of Clemens in [4]. He suggested that a proof of the nonfinite generation of the Griffiths group of the general quintic threefold could be obtained by studying the ramification loci of the various generically finite coverings πd : ᏹd → ᏹ, where ᏹ is the moduli space for the quintic threefold and ᏹd parametrizes a quintic threefold X and a degree d isolated rational curve C in it. Along the ramification divisor of πd , the curve C ⊂ X has an infinitesimal deformation η in X, and there is a corresponding element X∗ (η) ∈ H 1,2 (X), which is the differential of X applied to the deformation η of the corresponding cycle in X. However, another important ingredient is the complexified Abel-Jacobi map; we use the complexified infinitesimal Abel-Jacobi map to prove Theorem 4. The “complexjS
ified” objects we study are the following: If S → X, and λ ∈ Ker(jS ∗ : H 2 (S, C) → H 4 (Xt , C)), we define Uλ as the set of deformations (Xt , St ) of the pair (X, S) such that the fixed class λt ∈ H 2 (St , C) ∼ = H 2 (S, C) belongs to F 1 H 2 (St ). It turns out that when X is a Calabi-Yau threefold and ᏹ is its local moduli space, most varieties Uλ are generically finite covers of ᏹ (by the map (Xt , St ) → Xt ). A point (Xt , St ) of ramification of this map then corresponds to a surface St ⊂ Xt that admits an infinitesimal deformation η such that λt ∈ F 1 H 2 (St ) remains (infinitesimally) in η F 1 H 2 (St ). There is then an associated complexified infinitesimal Abel-Jacobi invariant Xt ∗ (η) ∈ H 1,2 (Xt ). Notice that if λ is integral, it is the class of a divisor in St and we get the same invariant as above.
154
CLAIRE VOISIN
In Section 2, we introduce various Hodge theoretic objects and study the varieties Uλ defined above. We also define the complexified Abel-Jacobi map and “compute” its differential. In Section 3, we give a very simple infinitesimal criterion, which implies that the infinitesimal invariants above are nonzero and that if the image of the Abel-Jacobi map of Xt were finitely generated, these infinitesimal invariants would vanish. It follows that if this criterion is satisfied, then Theorem 4 is true. This infinitesimal criterion concerns the infinitesimal variation of Hodge structure of a generic sufficiently ample surface S ⊂ X. In Section 4, we check this criterion, which reduces (see [7]) to the study of Jacobian rings, that is, quotients of rings of functions by Jacobian ideals, generated by the derivatives of the defining equation of the surface along vector fields. 2. Noether-Lefschetz loci and infinitesimal Abel-Jacobi map. Part of the material in this section works in the general situation of a family of smooth surfaces → B contained in a family of smooth threefolds ᐄ → B; however, we restrict π → B is the local universal family of the discussion to the following situation: ᐄ − deformations of a Calabi-Yau threefold X. B is a smooth ball, which can be assumed to be as small as we want. We have dim B = dim H 1 (TX ). Now let L be an ample line bundle on X; since H 1 (ᏻX ) = H 2 (ᏻX ) = 0 and H i (L) = 0, i > 0, by Kodaira vanishing and KX trivial, L extends uniquely to a line bundle ᏸ on ᐄ, and p → B is smooth dim H 0 (Xt , Lt ) = dim H 0 (X, L) for any t ∈ B. Then P(R 0 π∗ ᏸ) − over B, and we denote by U ⊂ P(R 0 π∗ ᏸ) the open set parametrizing smooth surπS πX faces. Let then −→ U be the universal family, ᐄU −→ U be the pullback to U π → B, and j : → ᐄU be the natural inclusion. First we have the of the family ᐄ − following lemma. Lemma 1. For sufficiently ample L, the tangent space TU,t at a point t identifies to H 1 (TSt ) by the Kodaira-Spencer map. It is also isomorphic to H 1 (TXStt ) by the Kodaira-Spencer map for pairs, where TXStt is the kernel of the natural map TXStt −→ NSt /Xt . Proof. We have the exact sequence 0 −→ TXt (−Lt ) −→ TXStt −→ TSt −→ 0, which induces the natural map H 1 TXStt −→ H 1 TSt , from the deformations of the pair to the deformations of the surface. So by Serre vanishing, the map above is an isomorphism for sufficiently ample L.
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
155
Next we have the exact sequence p∗ 0 −→ H 0 Lt |St −→ TU,t −−→ TB,p(t) −→ 0 and the exact sequence defining TXStt , 0 −→ TXStt −→ TXt → Lt |St −→ 0, which induces the exact sequence 0 −→ H 0 Lt |St −→ H 1 TXStt −→ H 1 (TXt ) −→ 0, since H 0 (TXt ) = 0 and H 1 (Lt |St ) = 0. Finally, the Kodaira-Spencer map TU,t → H 1 (TXStt ) fits into the commutative diagram 0
/ H 0 Lt |S t
/ TU,t
0
/ H 0 Lt |S t
/ H 1 T St Xt
p∗
/ TB,p(t)
/0
/ H 1 (TX ) t
/ 0,
where the first and last vertical maps are the identity. It follows immediately that the middle map is an isomorphism. We have on U the primitive variation of Hodge structure of the family of surfaces
: namely, let
j∗ HZ2 := Ker R 2 πS ∗ Z −−→ R 4 πX∗ Z
be the local system whose fiber at t is jt ∗ H 2 (St , Z)0 := Ker H 2 (St , Z) −−→ H 4 (Xt , Z) . Let Ᏼ2 := HZ2 ⊗ ᏻU , with its Gauss-Manin connection ∇ S : Ᏼ2 → Ᏼ2 ⊗ *U , whose local system of flat sections is HC2 = HZ2 ⊗ C. Let F i Ᏼ2 , 0 ≤ i ≤ 2 be the Hodge bundles, with fiber F i Ᏼt2 = F i H 2 (St ) ∩ Ker jt ∗ ,
F i H 2 (St ) = ⊕p≥i H p,2−p (St )
and associated quotients Ᏼi,2−i = F i Ᏼ2 /F i+1 Ᏼ2 . By transversality, we have ∇ S F i Ᏼ2 ⊂ F i−1 Ᏼ2 ⊗ *U . We denote by
S
∇ : Ᏼi, 2−i −→ Ᏼi−1, 3−i ⊗ *U
156
CLAIRE VOISIN S
the ᏻU -linear map deduced from ∇ S by transversality, so that ∇ fits into the commutative diagram / F i Ᏼ2 ⊗ * ∇ S : F i+1 Ᏼ2 U ∇ S : F i Ᏼ2 S
∇ :
Ᏼi, 2−i
/ F i−1 Ᏼ2 ⊗ * U / Ᏼi−1, 3−i ⊗ * . U
S
For λ ∈ H i,j (Sv )0 we then have ∇ (λ) ∈ Hom(TU,v , H i−1,j +1 (Sv )0 ). For η ∈ TU,v , S we denote by ∇ η the induced map H i,j (Sv )0 → H i−1,j +1 (Sv )0 . Let V be a simply connected open subset of U . Then the local system HC2 is trivial on V , so that if v0 ∈ V and λ ∈ H 2 (Sv0 , C)0 , we can view λ as a section of HC2 on V . We then define the component of the Noether-Lefschetz locus determined by λ as Vλ = t ∈ V , λt ∈ F 1 H 2 (St )0 . Vλ is an analytic subvariety of V , defined by the vanishing of the projection in Ᏼ0,2 of the flat, hence holomorphic, section λ ∈ Ᏼ2 . If t ∈ Vλ , λt ∈ F 1 H 2 (St )0 and hence has a projection λ1,1 ∈ H 1,1 (St )0 = Ᏼt1,1 . Then the next lemma follows from the t S definition of ∇ . S
Lemma 2. The Zariski tangent space to Vλ at t is equal to Ker ∇ (λ1,1 t ), where 1,1 0,2 ∇ (λt ) ∈ Hom(TV ,t , H (St )). S
Note that usually the terminology of the Noether-Lefschetz locus is reserved to the case where λ is rational. In this case, by the Lefschetz theorem on (1, 1)-classes, Vλ is the set of points v ∈ V where the class λv is algebraic; that is, any multiple mλ λv that is an integral class is the class [Dλ,v ] of a divisor on Sv . Then since jv ∗ λv = 0, jv ∗ (Dλ,v ) is a one-cycle homologous to zero in Xv . We have the following convenient interpretation of Vλ : Let v0 be any point of V ; then HC2 ∼ = V × H 2 (Sv0 , C)0 . Viewing F 1 Ᏼ2 , Ᏼ2 as vector bundles, we have a map (2.0) φ : F 1 Ᏼ2 −→ H 2 Sv0 , C 0 ∼ obtained as the composition of the inclusion F 1 Ᏼ2 ⊂ Ᏼ2 , the isomorphism Ᏼ2 = H 2 (Sv0 , C)0 × V given by the trivialization of HC2 , and the first projection. Then we have that Vλ is naturally isomorphic to φ −1 (λv0 ). Indeed, by definition, Vλ × λv0 ⊂ V × H 2 (Sv0 , C)0 ∼ = Ᏼ2 is the scheme-theoretic intersection of Vλ × λv0 and F 1 Ᏼ2 in 2 Ᏼ ; but this is also the definition of the fiber φ −1 (λv0 ). 2 , gives the In other words, the flat section λ restricted to Vλ , which is in F 1 Ᏼ|V λ −1 reverse isomorphism Vλ → φ (λv0 ). We abuse notation in Section 3 and view, by 2 . this isomorphism, Vλ as a subvariety of F 1 Ᏼ|V
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
157
We denote by 1,1 λ1,1 ∈ Ᏼ|V λ
(2.1)
2 . Now if v ∈ V and λ1,1 ∈ H 1,1 (S ) , let the projection of the section λ ∈ F 1 Ᏼ|V v 0 1 2 1,1 λ1 , λ2 ∈ F H (Sv )0 be two liftings of λ , so that λ1 = λ2 + η, for some η ∈ H 2,0 (Sv ). By Lemma 2 the tangent spaces to Vλi at v coincide, and the two sections / λ1,1 i , which are defined on the first-order neighbourhood Vλ of v in Vλi (where i = 1 or 2) are equal at v. However, their derivatives do not coincide. In fact, we have the next lemma. 1,1 1,1 Lemma 3. The derivative at v of the section λ1,1 1 − λ2 of Ᏼ|Vλ/ (which vanishes 1,1 at v), is equal to −∇(η)|TVλ ,v : TVλ ,v → H (Sv )0 .
Proof. Let h ∈ TVλ/ ,v and let Zh be the scheme of length two supported on v with tangent vector h. Then the section λh1 = λ1|Zh of F 1 Ᏼ2 is the flat section that extends λ1 ∈ F 1 H 2 (Sv )0 and that remains in F 1 Ᏼ2 . Now, η being given above, let µh2 := λh1 + /∇hS (η) ˜ − η, ˜ where η˜ is a section of F 2 Ᏼ2 on Zh extending η. Then clearly µh2 is flat and its value at v is equal to λ2 . Furthermore, µh2 is a section of F 1 Ᏼ2 on Zh by transversality. It follows that λh2 = µh2 . Hence, ˜ + η, ˜ λh1 − λh2 = −/∇h (η) S
so that by projecting to Ᏼ1,1 and using the definition of ∇ , we get
λh1
1,1
1,1 S − λh2 = −/∇ (η)(h),
which proves the lemma. We now turn to the generalized Abel-Jacobi map and its infinitesimal version. For u ∈ U , let Yu = Xu − Su . We have an exact sequence Res
0 −→ H 3 (Xu ) −→ H 3 (Yu ) −−→ H 2 (Su )0 −→ 0 of cohomology groups with integral coefficients, and H 3 (Yu , C) carries a mixed Hodge structure compatible with the Hodge structures on H 3 (Xu ) and H 2 (Su )0 . Namely, we have a decreasing filtration F i H 3 (Yu ), 0 ≤ i ≤ 3, such that F i H 3 (Yu ) ∩ H 3 (Xu ) = F i H 3 (Xu ), Res F i H 3 (Yu ) = F i−1 H 2 (Su )0 , where F i H 3 (Xu ) = ⊕p≥i H p,3−p (Xu ) is the Hodge filtration of Xu .
158
CLAIRE VOISIN
Working in families, we get the local system 3 HY,Z = R 3 πY ∗ Z,
where πY = πX|ᐅ , ᐅ = ᐄU − . We then define the associated Hodge bundles ᏴY3 by tensorizing the local system with ᏻU . We denote by ∇ Y the Gauss-Manin connection on ᏴY3 . This bundle is equipped with the Hodge filtration by holomorphic subbundles F i ᏴY3 , which satisfy Griffiths transversality ∇ Y F i ᏴY3 ⊂ F i−1 ᏴY3 ⊗ *U . We denote by HZ3 , Ᏼ3 , F i Ᏼ3 , and ∇ X the analogous objects on B that describe the variation of Hodge structure of the family π : ᐄ → B; that is, HZ3 = R 3 π∗ Z, F i Ᏼ3 ⊂ Ᏼ3 with Ᏼ3 = HZ3 ⊗ ᏻB , and ∇ X : Ᏼ3 → Ᏼ3 ⊗ *B with ∇ X F i Ᏼ3 ⊂ F i−1 Ᏼ3 ⊗ *B . We then have an exact sequence of variation of mixed Hodge structures 3 0 −→ p∗ HZ3 −→ HY,Z −→ HZ2 −→ 0.
(2.2)
3 of (2.2). Denoting by On our open set V , let us choose a splitting rZ : HZ2 → HY,Z 1 2 1 2 P : F Ᏼ → B the composite of the bundle map F Ᏼ → U and the map p : U → B, the section rZ allows us to construct a section
s∈
P ∗ Ᏼ3 F 2 Ᏼ3
(2.3)
2 as follows: If (v, λ) ∈ F 1 Ᏼ2 , that is, λ ∈ F 1 H 2 (S ) , let λ be a lifting over F 1 Ᏼ|V v 0 F 2 3 of λ in F H (Yv ). Then we define
s(v, λ) = λF − rZ (λ) mod .F 2 H 3 (Xv ). This is a well-defined element of H 3 (Xv , C)/F 2 H 3 (Xv ), since clearly λF − rZ (λ) belongs to H 3 (Xv , C) and λF (v) is defined up to F 2 H 3 (Xv ). In fact, we are mainly interested with the restriction of s to the subvarieties 3 as the comφ −1 (λ0 ) ∼ = Vλ . We may then consider these sections of p∗ Ᏼ3 /F 2 Ᏼ|V λ plexified Abel-Jacobi map, as we explain now. Suppose that λ ∈ H 2 (Sv , Z)0 ∩ F 1 H 2 (Sv ). Then λ = [Dλ ] for some divisor Dλ on Sv , and jv ∗ (Dλ ) is a one-cycle homologous to zero on Xv . It is then well known that the element Xv jv ∗ (Dλ ) ∈ J (Xv ) = H 3 (Xv , C)/F 2 H 3 (Xv ) ⊕ H 3 (Xv , Z)
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
159
is equal to λF − rZ (λ) mod .F 2 H 3 (Xv ) ⊕ H 3 (Xv , Z). (The fact that we consider it modulo H 3 (Xv , Z) makes it independent of the retraction rZ .) In other words, for integral λ we find that s|Vλ mod .HZ3 is equal to the section νλ of the pullback to Vλ of the family of intermediate Jacobians J (Xb )b∈B given by νλ (v) = Xv jv ∗ (Dλ ) ∈ J (Xv ), v ∈ Vλ . We now want to study the infinitesimal properties of the map φ defined in (2.0) or equivalently of the varieties Vλ . Recall that for v ∈ U, λ ∈ H 1,1 (Sv )0 we have the map S ∇ (λ) : H 1 TSu = TU,u −→ H 2 ᏻSu , which induces
S ∇ (λ) : H 0 Lu |Su = Ker p∗ ⊂ TU,u −→ H 2 ᏻSu .
Note that by Serre duality and because KXv is trivial, both spaces have the same dimension. We have the following lemma. Lemma 4. The following are equivalent: S (i) ∇ (λ) : H 0 (Lu |Su ) → H 2 (ᏻSu ) is an isomorphism; (ii) for any λ˜ ∈ F 1 H 2 (Sv )0 projecting to λ modulo F 2 H 2 (Sv ), the map (P , φ) : F 1 Ᏼ02 −→ B × H 2 Sv0 , C 0 ˜ is étale at λ. Proof. We may clearly assume that u = v0 since the change of base point simply composes φ with the natural isomorphism H 2 (Sv0 )0 ∼ = H 2 (Su )0 . Consider (P , φ)∗ : 1 2 TF 1 Ᏼ2 ,λ˜ → TB,p(u) ×TH 2 (Su )0 ,φ(λ) ˜ . Since on F H (Su )0 ⊂ TF 1 Ᏼ2 ,λ˜ this map is simply the inclusion F 1 H 2 (Su )0 ⊂ H 2 (Su )0 = TH 2 (Su )0 ,φ(λ) ˜ , this map induces
2 (P , φ)0,2 ∗ : TU,u −→ TB,p(u) × H ᏻSu .
S S It is then immediate, using the definition of ∇ , to show that (P , φ)0,2 ∗ = p∗ , ∇ . But (P , φ)∗ is an isomorphism if and only if (P , φ)0,2 ∗ is an isomorphism. Since p∗ is sur2 jective, this is also equivalent to (P , φ)0,2 ∗ |Ker p∗ being an isomorphism onto H (ᏻSu ), S
that is, to ∇ (λ) : H 0 (Lu |Su ) → H 2 (ᏻSu ) being an isomorphism. So Lemma 4 is proved. In fact, the proof shows the following lemma. Lemma 5. The kernel of (P , φ)∗ at λ˜ identifies naturally via the projection to TU,u to S Ker ∇ (λ) : H 0 Lu |Su −→ H 2 ᏻSu ,
160
CLAIRE VOISIN p(u) λ
that is, to the vertical part TV p(u) of TVλ˜ , where V ˜ P(H 0 (L
p−1 (p(u)).
p(u) ))
λ˜
is the intersection of Vλ˜ with
= The reverse isomorphism is given by the differential of p(u) 1 2 ˜ the natural section λ of F Ᏼ on V ˜ . λ
We now study the infinitesimal variation of mixed Hodge structure of the family πY
ᐅ → U . It is described as above, by transversality, by a series of maps Y
∇ : F i /F i+1 ᏴY3 −→ F i−1 /F i ᏴY3 ⊗ *U , which fit into the commutative diagram i i+1 3 X ∗ ∇ ◦ p ∗ : p F /F Ᏼ
/ p ∗ F i−1 /F i Ᏼ3 ⊗ *U
i /F i+1 Ᏼ3 F ∇ : Y
/ F i−1 /F i Ᏼ3 ⊗ *U Y
i−1 /F i Ᏼ2 F ∇ :
/ F i−2 /F i−1 Ᏼ2 ⊗ * , U
Y
S
(2.4)
where the first vertical maps are injective and the last ones are surjective. Composing Y ∇ with the restriction map *U,u → H 0 (Lu |Su )∗ then gives a map F 2 /F 3 H 3 (Yu ) −→ Hom H 0 Lu |Su , F 1 /F 2 H 3 (Yu ) , which obviously factors through F 1 /F 2 H 2 (Su )0 since the composition of p∗ with the restriction to H 0 (Lu |Su ) = Ker p∗ is zero. (This simply means that there is no variation of Hodge structure for X in the fibers of p.) So we have constructed a map µ0 : H 1 *Su 0 −→ Hom H 0 Lu |Su , F 1 /F 2 H 3 (Yu ) , which induces
µ1 : H 0 Lu |Su −→ Hom H 1 *Su 0 , F 1 /F 2 H 3 (Yu ) .
We then have the following. Lemma 6. There is a natural isomorphism (depending on the choice of a trivialization of KXu ) ∗ F 1 /F 2 H 3 (Yu ) ∼ = (TU,u )∗ = H 1 TSu such that for any η ∈ H 0 (Lu |S ) ∼ = H 0 (KS ), the map u
u
µ1 (η) : TU,u −→ H 1 *Su 0
t
S
identifies to ∇ (η).
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
161
Notice that in the identification H 0 (Lu |Su ) ∼ = H 0 (KSu ), we use the same trivialization of KXu . Proof. Recall the isomorphisms TU,u = H 1 (TSu ) = H 1 TXSuu of Lemma 1. Now TXSuu is dual to *Xu (log Su ), so that, choosing a trivialization of KXu , we get an isomorphism H 1 (TXSuu ) ∼ = (H 2 (*Xu (log Su )))∗ , and taking into account the natural isomorphism (see [5]) H 2 (*Xu (log Su )) = F 1 /F 2 H 3 (Yu ), we get the first assertion. Next it is known (this is an easy generalization of [9]) that the map Y
∇ : F 2 /F 3 ᏴY3 −→ F 1 /F 2 ᏴY3 ⊗ *U identifies to the map given by the interior product H 1 *2Xu (log Su ) −→ Hom H 1 TXSuu , H 2 *Xu (log Su ) .
(2.5)
The image of the map (2.5) is contained in the set of symmetric homomorphisms from H 1 (TXSuu ) to its dual; indeed the dual of (2.5) is equal to the (symmetric) cup product
2 Su Su Su 1 1 2 TXu , H TXu ⊗ H TXu −→ H taking into account the isomorphism 2 TXSuu = (*2Xu (log Su ))∗ , the triviality of KXu , and Serre duality. It follows that for λ ∈ H 1 (*2Xu (log Su )), η, χ ∈ H 1 (TXSuu ), we have
Y Y ∇ (λ)(η), χ = ∇ (λ)(χ), η .
(2.6)
Now note that the inclusion H 0 (Lu |Su ) → H 1 TXSuu is dual to the residue map Res : H 2 (*Xu (log Su )) → H 2 (ᏻSu ), so that for η ∈ H 0 (Lu |Su ), (2.6) gives
Y Y ∇ (λ)(η), χ = Res ∇ (λ)(χ) , η ,
(2.7)
where the second pairing in (2.7) is the duality between H 0 (Lu |Su ) and H 2 (ᏻSu ). (We always use the same trivialization of KXu to compute the pairings.) But by Y S diagram (2.4), we have Res(∇ (λ)(χ)) = ∇ (Res(λ))(χ)) and by definition of µ1 , Y we have ∇ (λ)(η) = µ1 (η)(Res(λ)). So we have proved for any λ ∈ H 1 (*Su )0 , for any η ∈ H 0 (Lu |Su ) and χ ∈ H 1 TXSuu , the equality
S µ1 (η)(λ), χ = ∇ (λ)(χ), η ,
(2.8)
where the first pairing is the duality above between H 1 TXSuu and H 2 (*Xu (log Su )), while the second one is the duality between H 0 (Lu |Su ) and H 2 (ᏻSu ). Finally, for
162
CLAIRE VOISIN
S η ∈ H 0 (Lu |Su ) ∼ = H 0 (KSu ), λ ∈ H 1 (*Su )0 , χ ∈ H 1 TXuu , we have the equalities
t
S (2.8) S S ∇ (η) (λ), χ = λ, ∇ (η)(χ) = ∇ (λ)(χ), η = µ1 (η)(λ), χ ,
where the second equality is standard and follows from the fact that the intersection pairing on H 2 (Su )0 is flat with respect to the Gauss-Manin connection and that, for this S pairing, H 2,0 (Su ) is perpendicular to F 1 H 2 (Su )0 . This proves that t (∇ (η)) = µ1 (η), as we wanted. We conclude this section with the “complexified infinitesimal Abel-Jacobi map.” Recall that from Lemma 5 we get, for λ ∈ H 1 (*Su )0 with lifting λ˜ ∈ F 1 H 2 (Su )0 , S a natural identification between Ker ∇ (λ)|H 0 (Lu|S ) and Ker P∗ : TV ˜ ×λ˜ ,λ˜ → TB,p(u) , u λ where by definition of V ˜ , V ˜ × λ˜ ⊂ U × H 2 (Su0 , C)0 is in fact contained in F 1 Ᏼ2 . λ
λ
Now consider the section s of the bundle P ∗ Ᏼ3 /F 2 Ᏼ3 constructed in (2.3). Since this bundle is naturally trivial on the fibers of P , it makes sense to differentiate s|V ˜ ×λ˜ λ in the direction contained in Ker P∗ . It follows that we have a map S
ds : Ker P∗ = Ker ∇ (λ)|H 0 (Lu|S ) −→ H 3 (Xu )/F 2 H 3 (Xu ). u
On the other hand, the map µ0 : H 1 *Su 0 −→ Hom H 0 Lu |Su , F 1 /F 2 H 3 (Yu ) S
satisfies Res ◦µ0 (λ) = ∇ (λ)|H 0 (Lu|S
u)
and hence induces a map
S
µ2 : Ker ∇ (λ)|H 0 (Lu|S ) −→ H 3 (Xu )/F 2 H 3 (Xu ). u
Now we have the following. Lemma 7. We have the equality ds = µ2 . Proof. Indeed, recall that s|V ˜ ×λ˜ is equal to λ˜ F − r Z (λ˜ ) mod . F 2 Ᏼ3 , where λ˜ F is λ 2 in F 2 Ᏼ3 ˜ any lifting of λ˜ ∈ F 1 Ᏼ|V Y |V . Since λ is flat, we have ˜ λ˜
λ
∇ X λ˜ F − rZ λ˜ = ∇ Y λ˜ F ; Y
since λ˜ F is a section of F 2 ᏴY3 |V , we have, by definition of ∇ , λ˜
Y ∇ Y λ˜ F mod . F 2 ᏴY3 = ∇ λF ,
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
163
where λF is the projection of λ˜ F in F 2 ᏴY3 /F 3 ᏴY3 . But then for h ∈ TVλ˜ ,u ∩ Ker p∗ = S
Ker ∇ (λ)|H 0 (Lu|S ) , we have u
Y ds(h) = ∇hX λ˜ F − r Z λ˜ mod . F 2 H 3 (Xu ) = ∇ λF (h), and by definition of µ2 the right-hand side is equal to µ2 (h). The reason we call ds or µ2 the complexified infinitesimal Abel-Jacobi map is, again, that if λ˜ is an integral class, we have shown that s|V ˜ ×λ˜ is a lifting of the λ normal function νλ˜ (u) = Xu jSu ∗ Dλ˜ ,u ∈ J (Xu ) to a section of p ∗ (Ᏼ3 /F 2 Ᏼ3 ) on Vλ˜ . Then if h ∈ Ker p∗ , ds(h) is simply the differential of νλ˜ in the direction h, which makes sense since Xu , and hence J (Xu ) remains constant in the direction h. 3. An infinitesimal criterion for the nonfinite generation of the image of the Abel-Jacobi map. With the notation of Section 2, we now assume that dim B > 0. Recall that we have defined for Su ⊂ Xu and for λ ∈ H 1 (*Su )0 , η ∈ H 0 (KSu ) the maps S ∇ (η) : H 1 TSu −→ H 1 *Su 0 , S ∇ (λ) : H 1 TSu −→ H 2 ᏻSu . We prove in this section the following infinitesimal criterion for the infinite generation of the Griffiths group of the general fiber Xb . Proposition 1. Assume that Lu is sufficiently ample and that for generic u ∈ U and generic η ∈ H 0 (KSu ), we have that S (i) the map ∇ (η) : H 1 (TSu ) → H 1 (*Su )0 is injective; S (ii) for generic λ ∈ H 1 (*Su )0 such that ∇ (λ)(η) = 0, we have that S Ker ∇ (λ) : H 0 Lu |Su −→ H 2 ᏻSu is generated by η. Then for the general point t ∈ B, the Abel-Jacobi map φXt of Xt satisfies that Im Xt ⊗ Q is an infinite-dimensional Q–vector space. In assumption (ii), η is viewed as an element of H 0 (Lu |Su ) ⊂ H 1 (TSu ). To start the proof, we first note the following lemma. Lemma 8. Assumption (ii) implies that for generic u and generic λ ∈ H 1 (*Su )0 , the map S ∇ (λ) : H 0 Lu |Su −→ H 2 ᏻSu is an isomorphism.
164
CLAIRE VOISIN
Proof. Identifying H 0 (Lu |Su ) with H 0 (KSu ) by a trivialization of KXu , the maps ∇ (λ) : H 0 (Lu |Su ) → H 2 (ᏻSu ) are symmetric, with respect to Serre duality. Hence S ∇ (λ) determines a quadric qλ on P(H 0 (Lu |Su )). If λ is as in assumption (ii), the quadric qλ has η for only singular point, and since η is generic, it is not in the base locus of the system of quadrics qλ . Hence the tangent space at λ to the discriminant hypersurface, parametrizing singular quadrics qλ being equal to the set of qλ vanishing at η, is a proper subspace of H 1 (*Su )0 , so the generic qλ is smooth. S
S
Now note that the condition that ∇ (λ) : H 0 (Lu |Su ) → H 2 (ᏻSu ) is an isomorphism is Zariski open on H 1 (*Su )0 , which is the complexification of HR1,1 (Su )0 := H 1,1 (Su ) ∩ H 2 (Su , R)0 . So if it is satisfied at some point, it will be satisfied at some real point λ ∈ HR1,1 (Su )0 , which obviously has a natural (real) lifting λ in F 1 H 2 (Su )0 . From Lemma 4 we know that at such a λ ∈ F 1 Ᏼ2 the map (P , φ) : F 1 Ᏼ2 −→ B × H 2 (Su0 , C)0 is étale, so it is a local isomorphism for the usual topology. Hence there are open connected neighbourhoods B ⊂ B of p(u), V ⊂ H 2 (Su0 , C)0 of φ(λ), and W ⊂ (P ,φ) F 1 Ᏼ2 of λ, with W ∼ = B × V . Finally, note that since φ(λ) is real, the rational 2 points in V ∩H (Su0 , Q)0 are Zariski dense in V . For any such rational point λ ∈ V , the fiber φ −1 (λ) ∩ W is then naturally isomorphic to B by P , and it parametrizes jt then the pairs St → Xt such that λt is algebraic on St . For each such λ, we choose an integer mλ such that mλ λ is integral, and then mλ λ = c1 (Dλ,t ) on St . Hence we get a normal function νλ on B , that is, a section of the sheaf = Ᏼ3 /F 2 Ᏼ3 ⊕ HZ3
defined by
νλ (t) = Xt jt ∗ (Dλ,t ) ∈ J (Xt ).
, in order to prove Proposition 1. So we assume We use the countably many νλ , λ ∈ VQ by contradiction the following assumption:
(∗) For any general point t ∈ B , the image of Xt tensorized by Q is finitely generated. Then we have the following. such that for any λ ∈ V , Lemma 9. If (∗) holds, there exists λ1 , . . . , λN ∈ VQ Q there exist integers m = 0, m1 , . . . , mN , satisfying the equality mνλ = mi νλi in . i . For any sequence Proof. Choose an ordering λi , i ∈ N, of the elements of VQ (αi )i∈N of integers with only finitely many nonzero elements, let
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
Bα · = t ∈ B ,
165
αi νλi (t) = 0 in J (Xt ) .
i
Then Bα · is an analytic subset of B , so any point in B = B − Bα · Bα · =B
is general. On B , any relation with integral the other hand, by definition, if t ∈ coefficients i αi νλi (t) = 0 in J (Xt ) implies that i αi νλi = 0 in . Lemma 9 follows, taking any t ∈ B at which (∗) holds. Coming back to the section s of P ∗ (Ᏼ3 /F 2 Ᏼ3 )|W ) defined in (2.3), we get the following corollary. Corollary 1. Under the same assumption (∗), for any λ ∈ V , s|B ×λ belongs to the finite vector space K of holomorphic sections of (Ᏼ3 /F 2 Ᏼ3 )|B generated by the image of HC3 |B in (Ᏼ3 /F 2 Ᏼ3 )|B and by liftings ν˜ λi of νλi in (Ᏼ3 /F 2 Ᏼ3 )|B for i = 1, . . . , N . Here we use the isomorphism W ∼ = B × V given by (P , φ). . Indeed, a relation mν = Proof. By Lemma 9, the conclusion is true for λ ∈ VQ λ m ν in is equivalent to a relation i λ i i Ᏼ3 m˜νλ = mi ν˜ λi + α in , F 2 Ᏼ3 |B i
where α ∈ HZ3 and ν˜ λi , ν˜ λ are liftings of our normal functions in (Ᏼ3 /F 2 Ᏼ3 )|B . On the other hand, we have shown in Section 2 that mλ s|B ×λ , mλi s|B ×λi give such liftings. In order to deduce from this that the conclusion is true for any λ ∈ V , we use now in V . To be precise, using a trivialization of the bundle the Zariski density of VQ (Ᏼ3 /F 2 Ᏼ3 )|B , Corollary 1 will follow now from the next lemma. Lemma 10. Let K be a finite-dimensional set of functions on B . Let f be a function on B × V , where V is a connected open set of Ck meeting Rk , such that for any λ ∈ V ∩ Qk , f|B ×λ ∈ K. Then for any λ ∈ V , f|B ×λ ∈ K. Proof. Let k = dim K and let p1 , . . . , pk be points on B such that the restriction map K → ⊕i ᏻpi is an isomorphism. Then we have a basis (ki ) of K such that ki (pj ) = δij . So any element g of K satisfies g = i g(pi )ki . It follows that the function f (b, v) − i f (pi , v)ki (b) on B × V vanishes on B × (V ∩ Qk ), hence everywhere, by the (analytic) Zariski density of B × (V ∩ Qk ) in B × V . Now we use analytic continuation to conclude the following.
166
CLAIRE VOISIN
Corollary 2. Let U be any open subset of U contained in the image of the 2 and that V is open projection W → U . (We recall that W is an open subset of F 1 Ᏼ|V 2 such that (P , φ) is in U .) Then under the same assumption (∗), for any λu ∈ F 1 Ᏼ|U ∗ étale at λu , the section s|Uλ ,0 belongs to P (K), where Uλu ,0 denotes the irreducible u component of Uλ u ∼ = φ −1 (φ(λu )) containing u (which is unic since by hypothesis Uλ u is smooth at u). 2 ) denote the Zariski-dense open subset of F 1 Ᏼ2 where Proof. Let (F 1 Ᏼ|U et |U 2 1 (P , φ) is étale. We can cover (F Ᏼ|U )et by connected open sets Wi isomorphic to Bi × Vi by (P , φ) for some open subsets Bi of B and Vi of H 2 ((Su0 , C)0 ). Then for λu ∈ Wi , Bi × φ(λu ) is open in Uλ u ,0 . So if we show that s|Bi ×φ(λu ) belongs to K, there is a k ∈ K such that
P ∗ k|Bi ×φ(λu ) = s|Bi ×φ(λu ) , and this will be true everywhere on Uλ u ,0 by analytic continuation. So it suffices to prove the following: For any Wi ∼ = Bi × Vi , we have that for any λ ∈ Vi , s|Bi ×λ belongs to K. But using the same argument as in Corollary 1, we see that if this is true for Wi and if Wi ∩ Wj = ∅, this is true as well for Wj . Since this 2 ) is connected, this is true for all W . is true on W by Corollary 1 and (F 1 Ᏼ|U et i Corollary 2 is proved. We conclude with the following corollary. 2 be such that U is irreducible reduced, and Corollary 3. Let λu ∈ F 1 Ᏼ|U λu generically finite over B via p. Then if (∗) is satisfied, for any h ∈ TUλ ,λu , such that
P∗ (h) = 0 in TB,p(u) , we have ds(h) = 0 in H 3 (Xu )/F 2 H 3 (Xu ).
u
This is immediate since P : Uλ u → B is a generic isomorphism, that is, (P , φ) is étale at the generic point of Uλ u . We then can apply the previous corollary and conclude that for some k ∈ K, we have P ∗ (k) = s on some open set of Uλ u . Hence the equality is true everywhere by irreducibility, and it follows that the vertical derivatives of s|Uλ vanish. Proof of Proposition 1. We now show that the hypotheses of Proposition 1 contradict the conclusion of Corollary 3. The hypotheses are as follows: S (i) the map ∇ (η) : H 1 (TSu ) → H 1 (*Su )0 is injective for generic u and η ∈ 0 H (KSu ); S (ii) for generic λ ∈ H 1 (*Su )0 , such that ∇ (λ)(η) = 0, we have that S Ker ∇ (λ) : H 0 Lu |Su −→ H 2 ᏻSu is generated by η.
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
167
Recall from Lemma 6 that the transposed map t
satisfies
∗ F 1 H 3 (Yu ) S ∼ ∇ (η) : H 1 *Su 0 −→ H 1 TSu = 2 3 F H (Yu ) S S Res ◦t ∇ (η) = ∇ η : H 1 *Su 0 −→ H 2 ᏻSu . S
Now hypothesis (i) says that t (∇ (η)) is surjective; furthermore, the condition dim B > S 0 implies dim F 1 H 3 (Xu )/F 2 H 3 (Xu ) > 0. It follows that for generic λ ∈ Ker ∇ η , we S
have t (∇ (η))(λ) = 0 in
F 1 H 3 (Yu ) F 1 H 3 (Xu ) 2 = Ker Res : −→ H ᏻ . Su F 2 H 3 (Xu ) F 2 H 3 (Yu ) S
S
Note also that by definition ∇ η (λ) = ∇ (λ)(η) ∈ H 2 (ᏻSu ), so we conclude from assumption (ii) that we can find λ such that S (a) Ker ∇ (λ) is generated by η, with η generic in H 0 (Lu |Su ); S
(b) t (∇ (η))(λ) = 0 in F 1 H 3 (Xu )/F 2 H 3 (Xu ). S Now recall Lemmas 6 and 7, which say that for η ∈ Ker ∇ (λ), so that η is tangent 1 2 to Vλ˜ at u for any λ˜ ∈ F H (Su )0 over λ, and η is annihilated by p∗ , t
F 1 H 3 (Xu ) S ∇ (η) (λ) ∈ 2 3 F H (Xu )
is equal to ds|Vλ˜ (η). So the hypotheses imply that for any λ˜ ∈ F 1 H 2 (Su )0 over λ, the vertical derivative of s|Vλ˜ is nonzero. In order to contradict Corollary 3, it suffices now to show that we can choose λ˜ so that Vλ˜ is smooth at u and generically finite over B. The first statement follows easily from (a) and (b): indeed, to prove the smoothness of Vλ˜ at u, it suffices to show that S ∇ (λ) : H 1 TSu −→ H 2 ᏻSu is surjective or that its dual t
∗ F 1 H 3 (Yu ) S ∇ (λ) : H 0 Lu |Su −→ H 1 TSu = 2 3 F H (Yu )
is injective. S S But one sees easily, as in Section 2, that Res ◦t (∇ (λ)) is equal to ∇ (λ)|H 0 (Lu|S ) , u and hence its kernel is generated by η by (a). Furthermore, one has the equality t
S F 1 H 3 (Xu ) S , ∇ (λ) (η) =t ∇ (η) (λ) ∈ 2 3 F H (Xu )
168
CLAIRE VOISIN S
and by (b) this is nonzero. So t (∇ (λ)) is injective, as we wanted to prove. What remains is to show that for general λ˜ lifting λ, the variety Vλ˜ is generically finite over B via P . Recall from (2.1) that on Vλ˜ we have a natural section λ˜ 1,1 of the bundle Ᏼ1,1 . Now on the total space of Ᏼ1,1 , let Ᏸ be the discriminant hypersurface; that is, for any u, S Ᏸu = λ ∈ Ᏼu1,1 , ∇ (λ) : H 0 Lu |Su −→ H 2 ᏻSu is not an isomorphism . If q : F 1 Ᏼ2 → Ᏼ1,1 is the natural projection, it follows from Lemma 4 that q −1 (Ᏸ) is equal to the ramification locus of the map (P , φ). This implies that the ramification locus of the map P|Vλ˜ is equal to (λ˜ 1,1 )−1 (Ᏸ). Hence P|Vλ˜ is generically finite if and only if λ˜ 1,1 (Vλ˜ ) is not contained in Ᏸ. Now since η is contained in the vertical tangent space of Vλ˜ at u, it suffices to prove that λ˜ 1,1 ∗ (η) is not tangent to Ᏸu at λ. But the symmetric maps S ∇ (µ) : H 0 Lu |Su −→ H 2 ᏻSu can be viewed as quadrics qµ on P(H 0 (Lu |Su )). Then the assumption on λ means that qλ has η as its only singular point. It follows that the tangent space to Ᏸu at λ is the set {µ ∈ H 1 (*Su )0 , qµ (η) = 0}. Now we use Lemma 3 and conclude that if λ˜ 1,1 ∗ (η) was tangent to Ᏸu at λ for any ˜λ lifting λ, the subspace ∇ Sη (H 0 (KSu )) of H 1 (*Su )0 would be tangent to Ᏸu at λ. Hence we would have the following: For any ω ∈ H 0 (KSu ),
η, ∇
S
S ∇ η (ω) (η) = 0.
(3.9)
This cannot hold for generic η and sufficiently ample L for the following reason: One can show (and this is done in the next section) by describing the variation of Hodge structure of the family of surfaces Su (with fixed Xu ) in terms of the Jacobian ring associated to Su ⊂ Xu (see [7] and [10]) that there is a natural surjective map ψ : H 0 3Lu |Su −→ H 2 ᏻSu and that (3.9) would mean exactly that ψ(η3 ) = 0. But if L is sufficiently ample, the multiplication map S 3 H 0 (Lu ) → H 0 (3Lu ) is surjective, so that ψ(η3 ) = 0 for any η would imply that ψ = 0, which is absurd since H 2 (ᏻSu ) = 0. So we have obtained the desired contradiction with the conclusion of Corollary 3, and this shows that the finiteness assumption (∗) is absurd. The proof of Proposition 1 is now complete.
4. Checking the infinitesimal criterion for any Calabi-Yau threefold. In this section we prove that conditions (i) and (ii) of Proposition 1 are satisfied for a sufficiently large multiple of an ample line bundle on X. This will conclude the proof of Theorem 4. We start with the proof of (i).
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
169
Proposition 2. Let X be a Calabi-Yau threefold and L1 be a line bundle on X. If L1 is sufficiently ample, any sufficiently large multiple L of L1 satisfies the property (i): that is, for generic u ∈ |L| and generic α ∈ H 0 (KSu ), the map S ∇ (α) : H 1 TSu −→ H 1 *Su 0 is injective. Proof. It is known from [9] that the composition of this map with the inclusion H 1 *Su 0 ⊂ H 1 *Su is nothing but the multiplication map by α: H 1 TSu −→ H 1 TSu ⊗ KSu ∼ = H 1 *Su . So the transposed map ∗ ∼ H 1 *Su 0 −→ H 1 TSu = H 1 *Su ⊗ KSu is also the multiplication by α, and we have to show that it is surjective for generic α. We know from [7] and [10] that for sufficiently ample L and smooth S ∈ |L|, the residues on S of the classes of the 3-forms P ω/s 2 generate F 1 H 2 (S)0 , so that their projections modulo H 2,0 (S) generate H 1 (*S )0 , where ω is a generator of H 0 (KX ), P varies in H 0 (2L), and s ∈ H 0 (L) is an equation for S. Hence we have a surjective map H 0 (2L) −→ H 1 (*S )0 .
(4.10)
Similarly, considering residues of meromorphic forms P ω/s 3 , where P ∈ H 0 (3L), we get a surjective map H 0 (3L) −→ H 2 (ᏻS ).
(4.11)
One then shows exactly as in [2] that for α ∈ H 0 (L), one has the commutative diagram α : H 0 (2L)
/ H 0 (3L)
S ∇ α : H 1 (*S )0
/ H 2 (ᏻ ). S
(4.12)
These maps can be obtained as well by looking at the exact sequence 0 −→ ᏻS (−L) −→ *X|S −→ *S −→ 0,
(4.13)
170
CLAIRE VOISIN
which, by taking the second exterior power and tensoring with L, gives 0 −→ *S −→ *X 2|S (L) −→ KS (L) −→ 0.
(4.14)
Using the isomorphism KS (L) ∼ = 2L|S given by ω and the fact that the induced map H 1 *Su −→ H 1 *2X|S (L) ∼ = H 2 *2X is equal to jS ∗ , we get by the long exact sequence induced by (4.14) the desired map H 0 (2L|S ) → H 1 (*S )0 , with kernel H 0 (*X 2|S (L)). Tensoring (4.14) by any line bundle L , we also get maps H 0 2L + L|S −→ H 1 *S (L ) , and in particular H 0 3L|S −→ H 1 *S (L) .
(4.15)
The map (4.11) is then simply obtained by composing the map (4.15) with the map (4.16) δ : H 1 *S (d) −→ H 2 (ᏻS ) deduced from the exact sequence (4.13) twisted by L. It is then obvious that the following diagram is commutative: H 0 2L|S H 1 (*S )0
α
α
/ H 0 3L|S / H 1 *S (L) .
(4.17)
Furthermore, it also follows from the commutativity of diagrams (4.12) and (4.17) that for λ ∈ H 1 (*S )0 , α ∈ H 0 (ᏻS (L)), one has S
∇ α (λ) = δ(αλ).
(4.18)
In order to show the surjectivity of α : H 1 (*S )0 −→ H 1 *S (L) for generic S and α, we do the following. Let L1 = ᏻX (1) be sufficiently ample on X and let φ0 , . . . , φ3 ∈ H 0 (L1 ) define a map φ : X → P3 . For d sufficiently large, let @ ⊂ P3 be defined by σ ∈ H 0 (ᏻP3 (d)) and let S = φ −1 (@) be defined by s = φ ∗ (σ ) ∈ H 0 (ᏻX (d)). Let R ∈ |ᏻX (4)| be the ramification locus of φ. For @ we have the exact sequences analogous to (4.14): 2 0 −→ *@ (k) −→ *P 3 |@ (d + k) −→ K@ (d + k) −→ 0,
(4.19)
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
171
which can be pulled back to S and which give rise to maps (taking into account the isomorphism K@ ∼ = ᏻ@ (d − 4)) (4.20) H 0 ᏻS (2d − 4 + k) −→ H 1 φ ∗ *@ (k) . We have the following lemma. Lemma 11. The diagram H 0 ᏻS (2d − 4 + k)
/ H 1 φ ∗ *@ (k) φ∗
r
H 0 ᏻS (2d + k)
/ H 1 *S (k)
(4.21)
is commutative for an adequate choice of equation r ∈ H 0 (ᏻX (4)) for R. This follows easily from the fact that the composite φ∗ φ∗ TX −−→ φ ∗ (TP3 ) ∼ = φ ∗ *2P3 (4) −−→ *2X (4) ∼ = TX (4) is the multiplication by r, where the choice of r is determined by the isomorphisms KX ∼ = ᏻX and KP3 ∼ = ᏻP3 (−4). As a consequence of Lemma 11, we get the following. Lemma 12. Let d be sufficiently large, and let @, t ∈ H 0 (ᏻ@ (d − 4)) satisfy the following condition: the multiplication map t : H 1 *@ (−4) −→ H 1 *@ (d − 8) is surjective. Then the multiplication map φ ∗ t : H 1 (*S )0 −→ H 1 *S (d − 4) satisfies that Im φ ∗ t contains rH 1 (*S (d − 8)). Proof. From Lemma 11 we conclude that the image of the map φ ∗ : H 1 φ ∗ *@ (d − 4) −→ H 1 *S (d − 4) contains rH 1 (*S (d − 8)). Indeed, we have the commutative diagrams H 0 ᏻS (3d − 8) r
H 0 ᏻS (3d − 4)
/ H 1 φ ∗ *@ (d − 4) φ∗
/ H 1 *S (d − 4)
(4.22)
172
CLAIRE VOISIN
and H 0 ᏻS (3d − 8) r
H 0 ᏻS (3d − 4)
/ H 1 *S (d − 8) r
/ H 1 *S (d − 4) ,
(4.23)
where the surjectivity of the first horizontal map is easy to check. So it suffices to prove that if d is large enough, the assumption on @, t implies that the multiplication map φ ∗ t : H 1 (φ ∗ *@ )0 −→ H 1 φ ∗ *@ (d − 4) is surjective. Now consider the exact sequences 0 −→ J 2d−8 −→ H 0 ᏻ@ (2d − 8) −→ H 1 *@ (−4) −→ 0, 0 −→ J 3d−12 −→ H 0 ᏻ@ (3d − 12) −→ H 1 *@ (d − 8) −→ 0 constructed above, where J ∗ ⊂ H 0 (ᏻ@ (∗)) is the Jacobian ideal of @, that is, the image of H 0 (*2P3 (−4 + ∗ − d)|@ ) under the map induced by (4.19). The hypothesis on t means exactly that J 3d−12 + tH 0 ᏻ@ (2d − 8) = H 0 ᏻ@ (3d − 12) . Now if d is large enough, the multiplication map H 0 ᏻS (4) ⊗ φ ∗ H 0 ᏻ@ (3d − 12) −→ H 0 ᏻS (3d − 8) is surjective. It follows that H 0 ᏻS (4) · φ ∗ J 3d−12 + φ ∗ tH 0 ᏻS (2d − 4) = H 0 ᏻS (3d − 8) . Since H 0 (ᏻS (4)) · φ ∗ J 3d−12 vanishes in H 1 (φ ∗ *@ (d − 4)) and the map H 0 ᏻS (3d − 8) −→ H 1 φ ∗ *@ (d − 4) is surjective, it follows that φ ∗ t : H 1 (φ ∗ *@ )0 −→ H 1 φ ∗ *@ (d − 4) is surjective. We now conclude the proof of Proposition 2. It is quite easy to verify that for generic @, t the condition of Lemma 12 is satisfied. So we have (@, t) such that Im φ ∗ t contains rH 1 (*S (d − 8)). We want to conclude that φ ∗ t : H 1 (*S )0 −→ H 1 *S (d − 4)
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
173
is in fact surjective. Consider the surjective map H 0 (ᏻS (3d − 4)) → H 1 (*S (d − 4)). It has for kernel the space JS3d−4 = ds(u), u ∈ H 0 TX (2d − 4)|S . The fact that Im φ ∗ t contains rH 1 (*S (d − 8)) means then that rH 0 (ᏻS (3d − 8))|φ ∗ t=s=0 is contained in JS3d−4 |φ ∗ t=s=0 . Now let s/ = s + /rφ ∗ t. We first show that for generic t, σ , and φ rH 1 *S/ (d − 8) = H 1 *S/ (d − 4) .
(4.24)
Equivalently, we have to show that the multiplication map r : H 1 TS/ (4) → H 1 TS/ (8)
(4.25)
is injective. Looking at the exact sequences 0 −→ TS/ (4) −→ TX (4)|S/ −→ ᏻS/ (d + 4) −→ 0, 0 −→ TS/ (8) −→ TX (8)|S/ −→ ᏻS/ (d + 8) −→ 0, and using the fact that ᏻX (1) is sufficiently ample, we find that the kernel of the map (4.25) identifies to the set u ∈ H 0 TX (8)|R , ds/ (u)|r=s/ =0 = 0 . Now we have the equality ds/ (u)|r=s/ =0 = ds(u)|r=s/ =0 + /tdr(u)|r=s/ =0 , where the curves {r = s = 0} and {r = s/ = 0} coincide. (Notice that all these derivatives only make sense when restricted to the vanishing locus of the considered equation.) Now clearly for sufficiently large d, general t, and any u in the fixed vector space H 0 (TX (8)|R ), the right-hand side vanishes if and only if ds(u)|r=s/ =0 = 0
and
dr(u)|r=s/ =0 = 0.
But if φ is generic, the surface R is reduced, and the second condition means that u is tangent to it. Then clearly there is at most for each such u a one-dimensional family of curves {r = s = 0} on the surface R which are tangent to u; that is, s satisfies the first condition. Since u varies in the fixed subspace of H 0 (TX (8)|R ) of elements tangent to R, it follows that for d large enough and generic σ , the two conditions above imply that u = 0, so that the map (4.25) is injective.
174
CLAIRE VOISIN
This means, as above, that we have JS3d−4 / |r=s
/ =0
= H 0 ᏻS/ (3d − 4) |r=s
/ =0
,
so that, in particular, JS3d−4 / |φ∗t=r=s
/ =0
= H 0 ᏻS/ (3d − 4) |φ ∗ t=r=s
/ =0
.
But the curve defined by s = φ ∗ t = 0 is equal to the curve defined by s/ = φ ∗ t = 0; the restriction map H 0 TX (2d − 4) −→ H 0 TX (2d− 4)|S is surjective, and for u ∈ H 0 (TX (2d − 4)), we have ds/ (u)|r=s=φ ∗ t=0 = ds(u)|r=s=t=0 .
(4.26)
It follows that we have as well JS3d−4 |φ∗t=r=s=0 = H 0 ᏻS (3d − 4) |φ ∗ t=r=s=0 . Since JS3d−4 |φ∗t=s=0 contains rH 0 (ᏻS (3d − 8))|φ ∗ t=s=0 , this implies that JS3d−4 |φ∗t=s=0 = H 0 ᏻS (3d − 4) |φ ∗ t=s=0 ,
which is equivalent to the fact that φ ∗ t : H 1 (*S )0 → H 1 (*S (d − 4)) is surjective. Finally, it is easy to check that for generic t ∈ H 0 (ᏻ@ (4)), the multiplication map φ ∗ t : H 1 *S (d − 4) −→ H 1 *S (d) is surjective, so we have proved that for generic t ∈ H 0 (ᏻ@ (d)) the multiplication map φ ∗ t : H 1 (*S )0 −→ H 1 *S (d) is surjective. Thus Proposition 2 is proved. It remains now to check condition (ii) in Proposition 1. Proposition 3. Let L1 be ample on the Calabi-Yau threefold X. Then for any sufficiently large multiple L of L1 and any generic S ∈ |L|, α ∈ H 0 (L|S ), and λ ∈ S
H 1 (*S )0 such that ∇ (λ)(α) = 0 in H 2 (ᏻS ), we have that S Ker ∇ (λ) : H 0 (L|S ) −→ H 2 (ᏻS ) is generated by α. We follow this strategy: We again consider a generic map φ : X → P3 , with L1 = ᏻX (1) = φ ∗ (ᏻP3 (1)) sufficiently ample, and surfaces S = φ −1 (@) for generic
175
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
@ ⊂ P3 of degree d sufficiently large. So S = V (s), @ = V (σ ) with s = φ ∗ (σ ). S Next let t ∈ H 0 (ᏻ@ (d)) be generic. Then for α = φ ∗ t, we know that ∇ α : S H 1 (*S )0 → H 2 (ᏻS ) is surjective, so the set of λ ∈ H 1 (*S )0 such that ∇ (λ)(α) = 0 S in H 2 (ᏻS ), which is equal to Ker ∇ α , has the minimal generic dimension. Hence it suffices to prove Proposition 3 for such (S, α). S First, we show that for generic λ ∈ H 1 (φ ∗ *@ )0 such that ∇ (λ)(α) = 0 in H 2 (ᏻS ), S that is, λ ∈ Ker ∇ α , the kernel of S ∇ (λ) : H 0 ᏻS (d) −→ H 2 (ᏻS ) is generated by α and J˜@ := Ker H 0 ᏻS (d) −→ H 1 (T@ ) = Im H 0 φ ∗ (TP3 ) −→ H 0 ᏻS (d) . S
Then we conclude that for generic λ ∈ Ker ∇ α , the kernel of S ∇ (λ) : H 0 ᏻS (d) −→ H 2 (ᏻS ) is generated by α, by showing that the set of quadrics qλ on P(H 0 (ᏻS (d))) for S λ ∈ Ker ∇ α has no base point on P(J˜@ ). Let us introduce the notation Rσ = C[X0 , . . . , X3 ]/Jσ , where Jσ is the ideal gen∂σ . Using (4.19), we get isomorphisms erated by the partial derivatives ∂X i Rσ2d−4 ∼ = H 1 (*@ )0 ,
Rσ2d−4+k ∼ = H 1 *@ (k) ,
(4.27)
for any integer k = 0. Furthermore, Rσ is Gorenstein: we have Rσ4d−8 ∼ = C, and the pairings Rσ2d−4−k × Rσ2d−4+k −→ Rσ4d−8
(4.28)
are perfect. We first show the following. Proposition 4. Assume that ᏻX (1) is sufficiently ample, that φ is generic, and that d is sufficiently large. Let t ∈ H 0 (ᏻ@ (d)) be generic and assume that there exist λ1 ∈ Rσ2d−8 ∼ = H 1 (*@ (−4)), A ∈ H 0 (ᏻ@ (2)) satisfying the following properties: (a) Ker λ1 : Rσd → Rσ3d−8 ∼ = (Rσd )∗ is generated by the image t of t in Rσd ; (b) Aλ1 : Rσd−1 → Rσ3d−7 ∼ = (Rσd−1 )∗ is an isomorphism; 2 d−2 3d−6 ∼ (c) A λ1 : Rσ → Rσ = (Rσd−2 )∗ is an isomorphism; 3 d−3 3d−5 ∼ (d) A λ1 : Rσ → Rσ = (Rσd−3 )∗ is an isomorphism; 4 d−4 3d−4 ∼ (e) A λ1 : Rσ → Rσ = (Rσd−4 )∗ is an isomorphism. 0 Then for ψ ∈ H (ᏻX (4)) generic and Q ∈ H 0 (ᏻX (2)) generic, the element λ = ψφ ∗ (λ1 ) + Qφ ∗ (Aλ1 ) + φ ∗ A2 λ1
176
CLAIRE VOISIN S
of H 1 (φ ∗ *@ )0 satisfies that ∇ (λ)(α) = 0 in H 2 (ᏻS ), where α = φ ∗ (t), and the kernel of S ∇ (λ) : H 0 ᏻS (d) −→ H 2 (ᏻS ) is generated by α and J˜@ . S
It is clear that α is in the kernel of ∇ (λ), since we have λ = f φ ∗ (λ1 ), where λ1 ∈ H 1 (*@ (−4)) satisfies tλ1 = 0 in H 1 (*@ (d − 4)). This implies that αλ = 0 in S H 1 (*S (d)) and a fortiori ∇ (λ)(α) = 0 in H 2 (ᏻS ), since we have by (4.18) S
∇ (λ)(α) = δ(αλ). S
Also J˜@ is contained in the kernel of ∇ (λ). Indeed, since λ ∈ H 1 (φ ∗ *@ )0 , the map S
∇ (λ) : H 1 (TS ) −→ H 2 (ᏻS ), which is given by interior product, clearly factors through H 1 (φ ∗ (T@ )). Let us first prove the following lemma. Lemma 13. Let λ = A2 λ1 ∈ Rσ2d−4 . Assumptions (a),. . . ,(e) on λ1 , A imply the following: (i) λ : Rσd−2 → Rσ3d−6 ∼ = (Rσd−2 )∗ is an isomorphism; (ii) Aλ1 : (Ker λ )d−1 → (Coker λ )3d−7 is an isomorphism; (iii) λ1 : (Ker λ )d → (Coker λ )3d−8 has its kernel generated by t. λ
Here we denote by (Ker λ )∗ (resp., (Coker λ )∗ ) the kernel of the multiplication by : Rσ∗ → Rσ2d−4+∗ (resp., the cokernel of the multiplication by λ : Rσ∗−2d+4 → Rσ∗ ).
Proof. (i) is assumption (c). (ii) Let u ∈ (Ker λ )d−1 and assume Aλ1 u = 0 in (Coker λ )3d−7 . This means that Aλ1 u = A2 λ1 v in Rσ3d−7 for some v ∈ Rσd−3 . By assumption (b), it follows that u = Av. Then A2 λ1 u = 0 implies that A3 λ1 v = 0; by (d), v = 0, so u = 0. We prove (iii) in the same way. In order to prove Proposition 4, we first study the map µλ : H 1 φ ∗ (T@ ) −→ H 1 φ ∗ *@ (d) , which is the factorization of the multiplication map by φ ∗ (λ ) ∈ H 1 (φ ∗ *@ ): µλ : H 0 ᏻS (d) −→ H 1 φ ∗ *@ (d) , using the surjective map
H 0 ᏻS (d) −→ H 1 φ ∗ (T@ ) .
(We use the fact that H 1 (φ ∗ (TP3 )|S ) = 0.) Notice that from (4.18), the composition of µλ with the map δ : H 1 (φ ∗ *@ ) → H 2 (ᏻS ) of (4.16) is equal to the factorization S through H 1 (φ ∗ (T@ )) of ∇ (φ ∗ (λ ))|H 0 (ᏻS (d)) . We have the following lemma.
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
Lemma 14. Let K = (H 0 (ᏻX (1))/H 0 (ᏻP3 (1)))∗ . Choose a splitting H 0 ᏻX (1) ∼ = K ∗ ⊕ H 0 ᏻP3 (1) ;
177
(4.29)
then Ker µλ is naturally isomorphic to (Ker λ )d ⊕ K ∗ ⊗ (Ker λ )d−1 , and Coker µλ is naturally isomorphic to (Coker λ )3d−8 ⊕ K ⊗ (Coker λ )3d−7 . Notice that the map from (Ker λ )d ⊕K ∗ ⊗(Ker λ )d−1 to Ker µλ is the natural one: indeed, (Ker λ )d identifies to the kernel of µλ |φ ∗ (H 1 (T@ )) while (Ker λ )d−1 identifies to the kernel of the analogous map H 1 φ ∗ (T@ )(−1) −→ H 1 φ ∗ *@ (d − 1) restricted to φ ∗ (H 1 (T@ (−1))). Notice also that both statements are dual to each other: indeed, the map µλ is symmetric with respect to the Serre duality isomorphism 1 ∗ ∗ H 1 φ ∗ (T@ ) ∼ = H φ *@ (d) ; so (Ker λ )d is dual to (Coker λ )3d−8 and (Ker λ )d−1 is dual to (Coker λ )3d−7 by the pairings (4.28). So it suffices to prove the second statement, and for this we can replace µλ by µλ . To prove it we first prove the following. Lemma 15. Let Ᏹ be the vector bundle φ∗ ᏻX (2) on P3 , and let be the cokernel of the natural map H 0 ᏻX (2) ⊗ ᏻP3 −→ Ᏹ; then the splitting (4.29) gives an isomorphism ∼ = ᏻP3 (−2) ⊕ K ⊗ ᏻP3 (−1) . Proof. Let ⊂ X × P3 be the graph of φ. Then = R 1 pr2 ∗ Ᏽ ⊗ pr1∗ ᏻX (2) . Now if Q is defined by the exact sequence 0 −→ Q −→ H 0 ᏻP3 (1) ⊗ ᏻP3 −→ ᏻP3 (1) −→ 0, Ᏽ has the resolution
0
3
2
∗ ∗ −→ pr2 Q ⊗ pr1 ᏻX (−1) −→ pr2 Q ⊗ pr1∗ ᏻX (−1) −→ pr2∗ Q ⊗ pr1∗ ᏻX (−1) −→ Ᏽ −→ 0. ∗
One concludes from this that is isomorphic to Ker β :
3
2
Q ⊗ H 3 ᏻX (−1) −→ Q ⊗ H 3 (ᏻX ).
(4.30)
178
CLAIRE VOISIN
Now the dual of the map β is simply the natural map α : Q ⊗ ᏻP3 (1) −→ H 0 ᏻX (1) ⊗ ᏻP3 (1), from which it follows easily that
= (Coker α)∗ ∼ = ᏻP3 (−2) ⊕ K ⊗ ᏻP3 (−1) .
Tensorizing the exact sequence H 0 ᏻX (2) ⊗ ᏻP3 −→ Ᏹ −→ −→ 0
(4.31)
with ᏻ@ (d − 2), we deduce easily from Lemma 15 the following. Corollary 4. The splitting (4.29) gives an isomorphism H 0 ᏻS (d) ∼ = H 0 ᏻ@ (d − 4) ⊕ K ⊗ H 0 ᏻ@ (d − 3) . 0 0 H ᏻX (2) H ᏻ@ (d − 2) Similarly, tensorizing the exact sequence (4.31) with *@ (d − 2) and using Lemma 15, we easily get the following. Corollary 5. The splitting (4.29) gives an isomorphism H 1 φ ∗ *@ (d) 1 1 ∼ H (d − 4) ⊕ K ⊗ H (d − 3) . * * = @ @ H 0 ᏻX (2) φ ∗ H 1 *@ (d − 2) Proof of Lemma 14. By assumption (i) on λ , it follows that Im µλ contains H 0 (ᏻX (2)φ ∗ (H 1 (*@ (d − 2)), since it means that the map λ : H 1 (T@ (−2)) → H 1 (*@ (d − 2)) is surjective. So it suffices to study the cokernel of the induced map H 1 φ ∗ *@ (d) H 0 ᏻS (d) −→ . ρλ : 0 H ᏻX (2) H 0 ᏻ@ (d − 2) H 0 ᏻX (2) φ ∗ H 1 *@ (d − 2) But applying Corollaries 4 and 5, ρλ gives a map H 0 ᏻ@ (d− 4) ⊕ K⊗H 0 ᏻ@ (d− 3) −→ H 1 *@ (d− 4) ⊕ K⊗H 1 *@ (d− 3) . This last map is now easily seen to be the direct sum of the multiplication map by λ ∈ H 1 (*@ )0 , from which we conclude that Coker µλ ∼ = Coker ρλ ∼ = (Coker λ )3d−8 ⊕ K ⊗ (Coker λ )3d−7 , using the isomorphisms H 1 *@ (d − 4) ∼ = Rσ3d−8 , of (4.27).
H 1 *@ (d − 3) ∼ = Rσ3d−7 ,
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
179
Next for Q ∈ H 0 (ᏻX (2)), let λ2 = Qφ ∗ (Aλ1 ) ∈ H 1 (φ ∗ *@ )0 . Again, the multiplication map µλ2 : H 0 ᏻS (d) −→ H 1 φ ∗ *@ (d) induces a symmetric map H 1 φ ∗ (T@ ) −→ H 1 φ ∗ *@ (d) , and hence a symmetric map ∗
∗
∗
µλ2 : φ H (T@ ) ⊕ K ⊗ φ H T@ (−1) 1
1
H 1 φ ∗ *@ (d) , −→ 0 H ᏻX (2) H 1 φ ∗ *@ (d − 2)
that is, by Lemma 14 a map µλ2 : Rσd ⊕ K ∗ ⊗ Rσd−1 −→ Rσ3d−8 ⊕ K ⊗ Rσ3d−7 . We have the following lemma. Lemma 16. The map µλ2 vanishes on Rσd , and on K ∗ ⊗ Rσd−1 it is computed as follows: There is a natural map E : H 0 ᏻX (2) −→ Hom(K ∗ , K) such that µλ2 : K ∗ ⊗ Rσd−1 −→ K ⊗ Rσ3d−7 is equal to E(Q) ⊗ Aλ1 . Proof. The first statement is obvious, since µλ2 (φ ∗ (H 0 (ᏻ@ (d)))) is contained in As for the second one, consider the commutative diagram Qφ ∗ (Aλ1 ) / H 0 ᏻS (3d − 4) H 0 ᏻS (d) H 0 (ᏻX (2)) · φ ∗ (H 1 (*@ (d − 2))).
H 0 ᏻS (d)
µλ2
/ H 1 φ ∗ *@ (d) ,
where λ1 is any lifting of λ1 in H 0 (ᏻ@ (2d −8)); the second vertical map was defined in (4.20). The commutative diagram shows that it suffices to prove more generally the following lemma: Consider the multiplication map H 0 ᏻX (d) H 0 ᏻX (d + k + 2) ∗ −→ Qφ P : 0 H ᏻP3 (2) H 0 ᏻX (d − 2) H 0 ᏻP3 (2) H 0 ᏻX (d + k)
180
CLAIRE VOISIN
for any P ∈ H 0 (ᏻP3 (k)). Using the isomorphism deduced from Lemma 15, H 0 ᏻX (d + k + 2) ∼ = H 0 ᏻP3 (d − 2 + k) ⊕ K ⊗ H 0 ᏻP3 (d − 1 + k) , 0 0 H ᏻP3 (2) H ᏻX (d + k) Qφ ∗ P induces a map K ∗ ⊗ H 0 ᏻP3 (d − 1) −→ K ⊗ H 0 ᏻP3 (d − 1 + k) ; then we have the following. Lemma 17. There is a natural map E : H 0 ᏻX (2) −→ Hom(K ∗ , K), such that this map is equal to E(Q) ⊗ P . Proof. We construct the map E as follows: Let ᏸ be the cokernel of the natural map H 0 ᏻX (1) ⊗ ᏻP3 −→ φ∗ ᏻX (1). Using the equality
ᏸ = R1 pr2 ∗ Ᏽ ⊗ pr1∗ ᏻX (1) ,
where the notation is as in the proof of Lemma 15, and the resolution (4.30), we find that ᏸ is isomorphic to the dual of the cokernel of the natural map Q(1) ⊗ H 0 ᏻX (1) −→ H 0 ᏻX (2) ⊗ ᏻP3 (1). In particular, there is a natural inclusion of ᏸ in H 0 (ᏻX (2))∗ ⊗ ᏻP3 (−1). Tensorizing by ᏻP3 (1) and taking global sections, we get a map ∗ χ : H 0 ᏻX (2) −→ H 0 ᏻX (2) whose image is the set of linear forms vanishing on H 0 (ᏻP3 (1))·H 0 (ᏻX (1)). Recalling that K ∗ = H 0 (ᏻX (1))/H 0 (ᏻP3 (1)), such a linear form χ(Q) obviously induces a symmetric bilinear form on K ∗ and, hence, a map E(Q) : K ∗ → K. The statement concerning the multiplication is then clear: in fact, it clearly suffices to do the case d = l = 0, and then this results from the definition of E. So Lemma 17 (hence also Lemma 16) is proved. We also need the following lemma. Lemma 18. If φ is generic, for generic Q ∈ H 0 (ᏻX (2)), the map E(Q) : K ∗ → K is an isomorphism.
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
181
Proof. Notice that each E(Q) is symmetric and hence defines a quadric qQ on K ∗ . In fact, qQ (k) = χ (Q)(k 2 ), with the notation of the above proof. But we know that the map χ has for image the set of linear forms vanishing on H 0 (ᏻP3 (1))·H 0 (ᏻX (1)). So to prove the lemma, it suffices to show that this set, viewed as a set of quadrics on H 0 (ᏻX (1)), has exactly for base locus H 0 (ᏻP3 (1)). But the base locus of this set of quadrics is exactly the set k ∈ H 0 ᏻX (1) , k 2 ∈ H 0 ᏻP3 (1) · H 0 ᏻX (1) . So we have to prove that for generic φ = (φ0 , . . . , φ3 ) the condition k 2 ∈ φ0 , . . . , φ3 implies that k ∈ φ0 , . . . , φ3 . This is easy: It suffices to degenerate (φ0 , . . . , φ3 ) to the linear system of elements of H 0 (ᏻX (1)) vanishing on a certain number of points of X, and verify that one can do this while keeping the dimension of φ0 , . . . , φ3 ⊂ H 0 (ᏻX (2)) constant. Then for the degenerated system (φ0 , . . . , φ3 ), the result is obvious; this implies the same thing for the generic system. Similarly, let λ3 = ψφ ∗ (λ1 ) ∈ H 1 (φ ∗ *@ )0 , for any ψ ∈ H 0 (ᏻX (4)). Then the multiplication map µλ3 : H 0 ᏻS (d) −→ H 1 φ ∗ *@ (d) induces a map
µλ3 : Rσd ∼ = φ ∗ H 1 (T@ ) −→ H 1 *@ (d − 4) ∼ = Rσ3d−8 ,
where we use Corollary 5 to realize H 1 (*@ (d − 4)) as a quotient of H 1 (φ ∗ *@ (d)). Then we have the following. Lemma 19. There is a nonzero map : H 0 (ᏻX (4)) → C such that µλ3 is equal to (ψ)λ1 . This is not difficult. In fact, ∈ (H 0 (ᏻX (4)))∗ is simply given by the inclusion of C = H 3 (ᏻP3 (−4)) in H 3 (ᏻX (−4)). Proof of Proposition 4. We know from Lemma 13 that Aλ1 : (Ker λ )d−1 → (Coker λ )3d−7 is an isomorphism. By Lemma 18, we also know that for generic Q the map E(Q) : K ∗ → K is an isomorphism. Using Lemmas 14 and 16, we conclude that for generic Q the map induced by µλ2 Ker µλ −→ Coker µλ vanishes on
(Ker λ )d
and induces a (symmetric) isomorphism d−1 3d−7 K ∗ ⊗ Ker λ −→ K ⊗ Coker λ .
Next by Lemma 13, we know that the map λ1 : (Ker λ )d → (Coker λ )3d−8 has for kernel exactly t . Using Lemmas 14 and 19, we conclude that for generic ψ the map induced by µλ3 Ker µλ −→ Coker µλ
182
CLAIRE VOISIN
induces a (symmetric) map
Ker λ
d
3d−8 −→ Coker λ ,
which has for kernel exactly t . But then it follows immediately that for generic Q and ψ and for λ = λ + λ2 + λ3 , the map µλ : H 1 φ ∗ (T@ ) −→ H 1 φ ∗ *@ (d) has its kernel generated by φ ∗ t ∈ H 1 (φ ∗ T@ ). To conclude the proof of Proposition 4, we now simply note that the map δ : H 1 φ ∗ *@ (d) −→ H 2 (ᏻS ) is injective. To see this, it suffices to prove that H 1 (φ ∗ (*P3 (d))|S ) = 0 or that H 2 (φ ∗ (*P3 )) = 0, which is easy. Then we have proved that δ ◦µλ has its kernel generated by φ ∗ (t) and since this map S is equal to the factorization through H 1 (φ ∗ T@ ) of ∇ (λ) : H 0 (ᏻS (d)) → H 2 (ᏻS ), it follows that this last map has its kernel generated by J˜@ and α = φ ∗ t. So Proposition 4 is proved. Next we prove the following lemma. Lemma 20. Assume that for t generic in H 0 (ᏻ@ (d)), there exists λ ∈ H 1 (φ ∗ *@ )0 S such that ∇ (λ) : H 0 (ᏻS (d)) → H 2 (ᏻS ) has its kernel generated by J˜@ and α = S S φ ∗ (t). Then for generic λ ∈ Ker ∇ α ⊂ H 1 (*S )0 the kernel Ker ∇ (λ) : H 0 (ᏻS (d)) → H 2 (ᏻS ) is generated by α. Proof. For any λ ∈ H 1 (*S )0 , the map S ∇ (λ) : H 0 ᏻS (d) −→ H 2 (ᏻS ) is symmetric with respect to Serre duality, so it determines a quadric qλ on P(H 0 (ᏻS (d))). We know by assumption that there is a qλ , which has for singular locus the projective space generated by α and J˜@ , and we want to conclude that the generic qλ singular at α has α as its only singular point. By Bertini, it clearly suffices to prove that the system of quadrics qλ singular at α has no base point on the projective space S P(J˜@ ). Now note that the set Ker ∇ α , which exactly parametrizes this linear system, identifies to S λ ∈ H 1 (*S )0 , λ ⊥ ∇ α H 0 (KS ) , where the symbols ⊥ refer to the pairing on H 1 (*S )0 . Furthermore, by definition, S the condition qλ (u) = 0 is equivalent to λ ⊥ ∇ u (u). Recalling that the map S
∇ u : H 0 (KS ) −→ H 1 (*S )0
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
183
identifies to the composite of the multiplication by u H 0 (KS ) = H 0 ᏻS (d) −→ H 0 ᏻS (2d) and of the map (4.10)
H 0 ᏻS (2d) −→ H 1 (*S )0 , S
we conclude that u is in the base locus of the system of quadrics Ker ∇ α if and only if the image of u2 in H 1 (*S )0 is contained in the image of αH 0 (ᏻS (d)) in H 1 (*S )0 , which has for kernel the space JS2d , image of H 0 (TX (d)|S ) in H 0 (ᏻS (d)) . So the proof of Lemma 20 is concluded by the following lemma. Lemma 21. For generic σ, t, the condition u2 = φ ∗ t.v mod .JS2d for u ∈ J˜@d , v ∈ H 0 (ᏻS (d)) implies that u = 0. The proof of this last lemma is not very difficult, so we do not give it here. From Lemma 20 and Proposition 4, we conclude now with the following. Corollary 6. If for σ, t generic, there exist A, λ1 satisfying the assumptions of Proposition 4, then Proposition 3 is true. Proof of Proposition 3. It remains only to show the existence of A, λ1 satisfying conditions (a) to (e) of Proposition 4. For any integer k we have the map given by the multiplication in the Jacobian ring of σ Rσ2k −→ Homsym Rσ2d−4−k , Rσ2d−4+k , where the subscript “sym” refers to the perfect pairings (4.28). We denote by Dσ2k ⊂ P(Rσ2k ) the discriminant hypersurface for these families of quadrics. It is easy to show that for generic σ and for any 0 ≤ k ≤ 2d − 4, Dσ2k = P(Rσ2k ). This is what we want to show: For generic σ and generic t ∈ Rσd , there exists λ1 ∈ Dσ2d−8 such that Ker λ1 is generated by t. Furthermore, for generic A ∈ H 0 (ᏻ@ (2)), we have Ak λ1 ∈ Dσ2d−8+2k , for 1 ≤ k ≤ 4. Now notice that the degree of Dσ2k is equal to the rank of Rσ2d−4−k . In particular, we have the following: d 0 Dσ2d−6 = rk Rσd−1 < rk Rσd = d 0 Dσ2d−8 , d 0 Dσ2d−4 = rk Rσd−2 < rk Rσd = d 0 Dσ2d−8 , d 0 Dσ2d−2 = rk Rσd−3 < rk Rσd = d 0 Dσ2d−8 , d 0 Dσ2d = rk Rσd−4 < rk Rσd = d 0 Dσ2d−8 . Furthermore, it is easy to show that for σ generic and A generic we have Ak Rσ2d−8 ⊂ Dσ2d−8+2k for 1 ≤ k ≤ 4.
184
CLAIRE VOISIN
Then we contend that the existence of A, λ1 satisfying conditions (a) to (e) follows from the next lemma. Lemma 22. Let σ be generic, and let ⊂ P(Rσd ) × P(Rσ2d−8 ) be defined as = (t, λ1 ), tλ1 = 0 in Rσ3d−8 . Then has only one component gen of dimension at least equal to dim P(Rσ2d−8 )−1. Indeed, we know that for generic t ∈ Rσd , the map t : Rσ2d−8 → Rσ3d−8 is surjective. It follows that the principal component of (the one that dominates P(Rσd )) is exactly of dimension dim P(Rσ2d−8 ) − 1, and it must be equal to gen . So gen is of dimension dim P(Rσ2d−8 ) − 1. But gen has to dominate Dσ2d−8 by the second projection, since has no other component of dimension at least equal to dim Dσ2d−8 . Since dim gen = dim Dσ2d−8 , the second projection gen −→ Dσ2d−8
must be birational, and any other component of is sent to a proper subset of Dσ2d−8 . It follows that Dσ2d−8 is irreducible, and its generic element λ1 satisfies that Ker λ1 is generated by t for generic t ∈ Rσd . But then Dσ2d−8 is also reduced. For degree reasons, we cannot then have Ak Dσ2d−8 ⊂ Dσ2d−8+2k for 1 ≤ k ≤ 4, and since Dσ2d−8 is irreducible, it follows that for generic λ1 ∈ Dσ2d−8 , we have Ak λ1 ∈ Dσ2d−8+2k for 1 ≤ k ≤ 4. So Lemma 22 implies the existence of A, λ1 satisfying conditions (a) to (e). Proof of Lemma 22. One has to prove that there exists no nonempty proper subset Z ⊂ P(Rσd ) such that for z ∈ Z the multiplication map z : Rσ2d−8 → Rσ3d−8 has cokernel of dimension at least equal to k = codim Z. Equivalently, by duality the map z : Rσd → R 2d has a kernel of dimension at least equal to k = codim Z. Let l ≤ d be such that h0 ᏻP3 (l) ≤ k < h0 ᏻP3 (l + 1) . One first verifies that there exists 0 < / < / < 1 such that for d large enough, σ generic, and Z as above, one has /d < l < / d. This follows from the following facts, which are proved by a dimension count: (a) there exists 0 < / < 1 such that for sufficiently large d, generic σ , and any t = 0 ∈ Rσ[/d] , the multiplication map t : Rσd → Rσd+[/d] is injective; (b) there exists B ∈ Rσd−[/d] such that the multiplication map B : Rσd+[/d] → R 2d is injective. It follows from (a) and (b) that BRσ[/d] does not meet Z, which implies that l +1 ≥ /d. Also it follows from (a) and (b) that for any z ∈ Rσd , we have Ker z ∩ BRσ[/d] = {0}. Hence for z ∈ Z, we have k ≤ dim Ker z ≤ h0 (d) − h0 ([/d]) ≤ h0 ([/ d]),
THE GRIFFITHS GROUP OF A GENERAL CALABI-YAU THREEFOLD
185
where / is chosen so as to satisfy the last inequality for large d. This gives the other inequality. Now one shows that for any l < / d, d large enough, and for generic σ , there exists C ∈ Rσd−l−2 such that the multiplication map C : Rσd+l+2 −→ Rσ2d is injective. Consider now the map C : Rσl+2 → Rσd . Then for z ∈ Rσl+2 , we have Ker z : Rσd −→ Rσd+l+2 = Ker Cz : Rσd −→ Rσ2d ; hence, in particular, if Cz ∈ Z, we have dim Ker(z : Rσd → Rσd+l+2 ) ≥ h0 (l). Hence we conclude that if Z = CRσl+2 ∩ Z, we have that the codimension of Z in P(Rσl+2 ) is at most equal to h0 (l +1), and for z ∈ Z , dim Ker(z : Rσd → Rσd+l+2 ) ≥ h0 (l). This is absurd because of the following fact (which is proved by looking at the Fermat equation): The dimension of the subspace Z of P(Rσl+2 ) defined by the condition z ∈ Z ⇐⇒ dim Ker z : Rσd −→ Rσd+l+2 ≥ h0 (l) is not greater than 140, for generic σ . This obviously contradicts the fact that Z ⊂ Z and dim Z ≥ h0 (l + 2) − h0 (l + 1), which is strictly greater than 140 for d large enough, since l > /d. So the existence of such Z for generic σ is absurd, and Lemma 22 is proved. The proof of Proposition 3 is now finished, and together with Propositions 1 and 2, it implies Theorem 4. References [1] [2]
[3] [4]
[5] [6]
[7]
A. Albano and A. Collino, On the Griffiths group of the cubic sevenfold, Math. Ann. 299 (1994), 715–726. J. Carlson and P. Griffiths, “Infinitesimal variations of Hodge structure and the global Torelli problem” in Journées de Géométrie Algébrique (Angers, 1979), ed. A. Beauville, Sijthoff & Noordhoff, Alphen aan den Rijn, 1980, 51–76. H. Clemens, Homological equivalence, modulo algebraic equivalence, is not finitely generated, Inst. Hautes Études Sci. Publ. Math. 58 (1983), 19–38. , “Some results about Abel-Jacobi mappings” in Topics in Transcendental Algebraic Geometry (Princeton, 1981/1982), ed. P. Griffiths, Ann. of Math. Stud. 106, Princeton Univ. Press, Princeton, 1984, 289–304. P. Deligne, Théorie de Hodge, II, Inst. Hautes Études Sci. Publ. Math. 40 (1971), 5–57. R. Donagi and E. Markman, “Cubics, integrable systems, and Calabi-Yau threefolds” in Proceedings of the Hirzebruch 65 Conference on Algebraic Geometry (Ramat Gan, 1993), Israel Math. Conf. Proc. 9, Bar-Ilan Univ., Ramat Gan, 1996, 199–221. M. Green, The period map for hypersurface sections of high degree of an arbitrary variety, Compositio Math. 55 (1985), 135–156.
186 [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18]
CLAIRE VOISIN , Griffiths’ infinitesimal invariant and the Abel-Jacobi map, J. Differential Geom. 29 (1989), 545–555. P. Griffiths, Periods of integrals on algebraic manifolds, I, Amer. J. Math. 90 (1968), 568–626; II, 805–865. , On the periods of certain rational integrals, I, Ann. of Math. (2) 90 (1969), 460–495; II, 496–541. , Infinitesimal variations of Hodge structure, III: Determinantal varieties and the infinitesimal invariant of normal functions, Compositio Math. 50 (1983), 267–324. S.-O. Kim, Noether-Lefschetz locus for surfaces, Trans. Amer. Math. Soc. 324 (1991), 369–384. M. Nori, Algebraic cycles and Hodge-theoretic connectivity, Invent. Math. 111 (1993), 349– 373. , Cycles on the generic abelian threefold, Proc. Indian Acad. Sci. Math. Sci. 99 (1989), 191–196. K. Paranjape, Curves on threefolds with trivial canonical bundle, Proc. Indian Acad. Sci. Math. Sci. 101 (1991), 199–213. Z. Ran, Hodge theory and the Hilbert scheme, J. Differential Geom. 37 (1993), 191–198. C. Voisin, Densité du lieu de Noether-Lefschetz pour les sections hyperplanes des variétés de Calabi-Yau de dimension 3, Internat. J. Math. 3 (1992), 699–715. , Sur l’application d’Abel-Jacobi des variétés de Calabi-Yau de dimension trois, Ann. Sci. École Norm. Sup. (4) 27 (1994), 209–226.
Institut de Mathématiques de Jussieu, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 9994, 75251 Paris Cedex 05, France
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
A CHARACTERIZATION OF RATIONAL SINGULARITIES SÁNDOR J. KOVÁCS
The main purpose of this note is to present a characterization of rational singularities in characteristic 0. The essence of the characterization is that it is enough to require less than the usual definition. Theorem 1. Let φ : Y → X be a morphism of varieties over C, and let ρ :
ᏻX → Rφ∗ ᏻY be the associated natural morphism. Assume that Y has rational singularities and there exists a morphism (in the derived category of ᏻX -modules) ρ : Rφ∗ ᏻY → ᏻX such that ρ ◦ ρ is a quasi-isomorphism of ᏻX with itself. Then X
has only rational singularities. If ρ exists, it could be considered similar to a trace operator. In fact, for any finite morphism of normal varieties, ρ exists because of the trace operator. Note that for the first statement of Theorem 1, φ does not need to be birational. In particular, Theorem 1 implies that quotient singularities are rational, including quotients by reductive groups as in [B, Corollaire]. In the latter case, ρ is given by the Reynolds operator. A well-known and widely used theorem states that in characteristic 0, canonical singularities are Cohen-Macaulay and therefore rational (see [E] and [KMM]). The original proofs are based on a very clever use of Grothendieck duality simultaneously for a resolution and its restriction onto the exceptional divisor and on a double loop induction. Kollár gave a simpler proof in [K2, §11] without using derived categories but still relying on a technically hard vanishing theorem. Recently Kollár and Mori found a simple proof allowing nonempty boundaries. They do not use derived categories either, but restrict to the projective case (see [KM, 5.18]). These proofs are ingenious, but one would like to have a simple natural proof (at least in the “classical” case, when the boundary is empty). As an application of Theorem 1 a simple proof is given here in the “classical” case, but without the projective assumption. This proof seems even simpler than that of Kollár and Mori. Derived categories and Grothendieck duality are used, but in such a simple way that one is tempted to say that this proof is the most natural one. Note also that everything used here was already available when the question was raised for the first time. A statement similar to Theorem 1 was given in [K2, 3.12]. Some ideas of the Received 19 May 1999. Revision received 23 June 1999. 1991 Mathematics Subject Classification. Primary 14B05; Secondary 14E15. Author’s work partially supported by an American Mathematical Society Centennial Fellowship and by National Science Foundation grant number DMS-9818357. 187
188
SÁNDOR J. KOVÁCS
present proof already appear there. In addition, the results of [K1] and [K2] can be used to show that for projective varieties the existence of ρ is actually equivalent to X having rational singularities. Note that the assumption of Theorem 1 implies that φ has to be surjective. Theorem 2. Let φ : Y → X be a surjective morphism of projective varieties over C, and let ρ : ᏻX → Rφ∗ ᏻY be the associated natural morphism. Assume that both X and Y have rational singularities. Then there exists a morphism ρ : Rφ∗ ᏻY → ᏻX such that ρ ◦ ρ is a quasi-isomorphism of ᏻX with itself. Finally, as a byproduct of the proof, a partial generalization of Kempf’s criterion for rational singularities (cf. [KKMS, p. 50]) is presented. Theorem 3. Let φ : Y → X be a surjective morphism of projective varieties over C. Let N = dim Y and n = dim X. Assume that Y has rational singularities. Then X has rational singularities if and only if X is Cohen-Macaulay and R N−n φ∗ ωY ωX . Acknowledgements. I would like to thank János Kollár for valuable comments and the editors for their kind flexibility. Definitions and notation. Throughout the article, the ground field is always C, the field of complex numbers. A variety means a separated variety of finite type over C. A divisor D is called Q-Cartier if mD is Cartier for some m > 0. A normal variety X is said to have log-terminal, (resp., canonical) singularities if KX is Q-Cartier. For any resolution of singularities, f : Y → X, with the collection of exceptional prime divisors {Ei }, there exist ai ∈ Q, ai > −1 (resp., ai ≥ 0) such that KY ≡ f ∗ KX + ai Ei (cf. [KMM] and [CKM]). The index of a normal variety X with KX Q-Cartier is the smallest positive integer m such that mKX is Cartier. For a normal variety X with KX Q-Cartier, there exists locally an index-1 cover: that is, a finite surjective morphism σ : X → X such that X has index 1. A log-terminal variety of index 1 is canonical, and an easy computation shows that finite covers of log-terminal (resp., canonical) singularities are log-terminal (resp., canonical). In particular, the index-1 cover of a log-terminal variety is canonical (see [R, 1.7, 1.9] and [CKM, 6.7, 6.8]). Let X be a normal variety and φ : Y → X a resolution of singularities. X is said to have rational singularities if R i φ∗ ᏻY = 0 for all i > 0 or, equivalently, if the natural map ᏻX → Rφ∗ ᏻY is a quasi-isomorphism. · denote the dualizing complex of X; that is, ω· = f ! C, where f : X → Let ωX X Spec C is the structure map (cf. [H]). Main ingredients. Let φ : Y → X be a proper morphism. Then one has the following: • Grothendieck duality [H, VII]: For all G· -bounded complexes of ᏻY -modules, Rφ R Ᏼom (G· , ω· ) R Ᏼom (Rφ G· , ω· ). ∗
Y
Y
X
∗
X
• Adjointness [H, II.5.10]: For all F · -bounded complexes of ᏻX -modules and
A CHARACTERIZATION OF RATIONAL SINGULARITIES
189
G· -bounded complexes of ᏻY -modules, Rφ∗ R ᏴomY (Lφ ∗ F · , G· ) R ᏴomX (F · , Rφ∗ G· ). If φ is a resolution of singularities, then one has • Grauert-Riemenschneider vanishing [GR]: R i φ∗ ωY = 0 for i > 0. This is referred to as “GR vanishing.” Lemma 1 [KKMS, p. 50], [K2, 11.9]. Let X be a normal variety, and let φ : Y → X be a resolution of singularities. If X is Cohen-Macaulay and ωX φ∗ ωY , then X has rational singularities. Proof. By GR vanishing, ω· Rφ ω· , and then ∗ Y
X
· , ω· ) R Ᏼom (Rφ ω· , ω· ) ᏻX R ᏴomX (ωX X ∗ Y X X · · Rφ R Ᏼom (ω , ω ) Rφ ᏻ . ∗
Y
Y
∗ Y
Y
Proof of Theorem 1. Let π : X˜ → X be a resolution of X and σ : Y˜ → Y be a resolution of Y such that φ ◦ σ factors through π: There exists ψ : Y˜ → X˜ such that φ ◦ σ = π ◦ ψ. Then one has the following commutative diagram: ρ
ᏻX α
/ Rφ∗ ᏻY β
Rπ∗ ᏻX˜
γ
/ Rφ∗ Rσ∗ ᏻ ˜ . Y
Now ρ has a left inverse by assumption and β is a quasi-isomorphism since Y has rational singularities. Therefore (ρ ◦ β −1 ◦ γ ) ◦ α is a quasi-isomorphism of ᏻX with itself, so one may assume that φ is a resolution of singularities. · ) to the quasi-isomorphism Next apply R ᏴomX ( , ωX ρ
ρ
ᏻX −→ Rφ∗ ᏻY −−→ ᏻX .
Then
· −→ Rφ ω· −→ ω· ωX ∗ Y X · ) ⊆ R i φ ω· R i+d φ ω . Now is a quasi-isomorphism as well. Hence hi (ωX ∗ Y ∗ Y · ) = 0 for i > −d, R i+d φ∗ ωY = 0 for i > −d by GR vanishing. Therefore hi (ωX so X is Cohen-Macaulay. The above proof also shows that ωX −→ φ∗ ωY −→ ωX is an isomorphism, so ωX φ∗ ωY . Therefore X has rational singularities by Lemma 1. Theorem 4 [E]. Log-terminal singularities are rational. Proof. Let X be a variety with log-terminal singularities. By Theorem 1 it is enough to prove that the index-1 cover of X has rational singularities. Thus one can
190
SÁNDOR J. KOVÁCS
assume that X has canonical singularities and ωX is a line bundle. Let φ : Y → X be a resolution of singularities of X. By assumption there exists a nontrivial morphism ι : Lφ ∗ ωX φ ∗ ωX → ωY . Its adjoint morphism on X is ωX → Rφ∗ ωY , which is a quasi-isomorphism by GR vanishing. Applying R ᏴomY ( , ωY ) (not R ᏴomY ( , ωY· )) to ι, one obtains Rφ∗ R ᏴomO Y (ωY , ωY )
/ Rφ∗ R ᏴomY (Lφ ∗ ωX , ωY )
/ R ᏴomX (ωX , Rφ∗ ωY )
/ ᏻX .
ρ
Rφ∗ ᏻY
The last quasi-isomorphism uses the fact that Rφ∗ ωY ωX and that ωX is a line bundle. It is easy to see that ρ ◦ ρ acts trivially on ᏻX ; hence Theorem 1 can be applied. The following is a simple consequence of [K1, 7.6]. Theorem 5. Let φ : Y → X be a surjective morphism of projective varieties over C, and assume that both X and Y have rational singularities. Let N = dim Y and n = dim X. Then R N−n φ∗ ωY ωX . Proof. Let π : X˜ → X be a resolution of X and σ : Y˜ → Y a resolution of Y such that φ ◦ σ factors through π; that is, there exists ψ : Y˜ → X˜ such that φ ◦ σ = π ◦ ψ. Then one has the following commutative diagram: R(φ ◦ σ )∗ ωY˜ [N]
δ
α
Rφ∗ ωY [N]
/ Rπ∗ ω ˜ [n] X β
γ
/ ωX [n].
Because both X and Y have rational singularities, α and β are quasi-isomorphisms by GR vanishing. Next take −nth cohomology of these complexes. By [K2, 3.4], R −n (φ ◦ σ )∗ ωY˜ [N] R N−n (π ◦ ψ)∗ ωY˜ π∗ R N−n ψ∗ ωY˜ , so one has π∗ R N−n ψ∗ ωY˜
h−n (δ)
h−n (α)
R N−n φ∗ ωY
/ π∗ ωX˜ h−n (β)
h−n (γ )
/ ωX .
Now h−n (δ) is an isomorphism by [K1, 7.6], so h−n (γ ) is an isomorphism as well.
A CHARACTERIZATION OF RATIONAL SINGULARITIES
191
Lemma 2. Let φ : Y → X be a surjective morphism of projective varieties over C and ρ : ᏻX → Rφ∗ ᏻY the associated natural morphism. Let N = dim Y and n = dim X. Assume that Y has rational singularities. If X is Cohen-Macaulay and R N−n φ∗ ωY ωX , then there exists a morphism ρ : Rφ∗ ᏻY → ᏻX such that ρ ◦ρ is a quasi-isomorphism of ᏻX with itself. Proof. Let σ : Y˜ → Y be a resolution of singularities. Since Y has rational singularities, ᏻY Rσ∗ ᏻY˜ and ωY Rσ∗ ωY˜ . Thus one may assume that Y is smooth. · ω [n] R N−n φ ω [n] is a direct summand of Rφ ω [N] By [K2, 3.1] ωX X ∗ Y ∗ Y Rφ∗ ωY· . Therefore there exist morphisms whose composition is a quasi-isomorphism ·. · ωX −→ Rφ∗ ωY· −→ ωX · ) to this quasi-isomorphism, one concludes that Applying R ᏴomX ( , ωX ρ
ρ
ᏻX −→ Rφ∗ ᏻY −→ ᏻX
is a quasi-isomorphism as well. Corollary. Theorems 2 and 3 hold. References [B] [CKM] [E] [GR] [H] [KMM]
[KKMS] [K1] [K2] [KM] [R]
J.-F. Boutot, Singularités rationnelles et quotient par les groupes réductifs, Invent. Math. 88 (1987), 65–68. H. Clemens, J. Kollár, and S. Mori, Higher-Dimensional Complex Geometry, Astérisque 166 (1988). R. Elkik, Rationalité des singularités canoniques, Invent. Math. 64 (1981), 1–6. H. Grauert and O. Riemenschneider, Verschwindungssätze für analytische Kohomologiegruppen auf komplexen Räumen, Invent. Math. 11 (1970), 263–292. R. Hartshorne, Residues and Duality, Lecture Notes in Math. 20, Springer-Verlag, Berlin, 1966. Y. Kawamata, K. Matsuda, and K. Matsuki, “Introduction to the minimal model problem” in Algebraic Geometry (Sendai, 1985), Adv. Stud. Pure Math. 10, NorthHolland, Amsterdam, 1987, 283–360. G. Kempf, F. Knudsen, D. Mumford, and B. Saint-Donat, Toroidal Embeddings, I, Lecture Notes in Math. 339, Springer-Verlag, Berlin, 1973. J. Kollár, Higher direct images of dualizing sheaves, II, Ann. of Math. (2) 124 (1986), 171–202. , “Singularities of pairs” in Algebraic Geometry (Santa Cruz, 1995), Proc. Sympos. Pure Math. 62, Amer. Math. Soc., Providence, 1997, 221–287. J. Kollár and S. Mori, Birational Geometry of Algebraic Varieties, Cambridge Tracts in Math. 134, Cambridge Univ. Press, Cambridge, 1998. M. Reid, “Canonical 3-folds” in Journées de Géométrie Algébrique d’Angers (Juillet 1979), Sijthoff & Noordhoff, Alphen aan den Rijn, 1980, 273–310.
Department of Mathematics, University of Chicago, Chicago, Illinois 60637 USA;
[email protected];
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
SMOOTHNESS OF PROJECTIONS, BERNOULLI CONVOLUTIONS, AND THE DIMENSION OF EXCEPTIONS YUVAL PERES and WILHELM SCHLAG n 1. Introduction. The distribution νλ of the random series ∞ 0 ±λ , where the 1 signs are chosen independently with probability 2 , has been studied by many authors since the two seminal papers by Erd˝os in 1939 and 1940. It is immediate that νλ is singular for λ < 21 . In [6], Erd˝os showed that νλ is singular for infinitely many λ ∈ ( 21 , 1), namely those λ such that λ−1 is a Pisot number. In [7] he proved that νλ is absolutely continuous for a.e. λ ∈ (1 − δ, 1), where δ < 0.01. It had been observed before by Jessen and Wintner [21] that νλ is either absolutely continuous or singular with respect to Lebesgue measure for any choice of λ. Works of Alexander and Yorke [2], Przytycki and Urba´nski [37], and Ledrappier [27] showed the importance of these distributions in several problems in dynamical systems. Solomyak [39] proved a conjecture made by Garsia in 1962: νλ is absolutely continuous for a.e. λ ∈ ( 21 , 1). In fact, Solomyak showed that νλ has density in L2 for a.e. λ ∈ ( 21 , 1). Peres and Solomyak [34] then gave a simpler proof of this. A survey of the results obtained on Bernoulli convolutions from the 1930s to 1999 is presented in [33]. In the present paper, we show that the density of νλ has a fractional derivative in L2 for a.e. λ ∈ ( 21 , 1), and we obtain a bound on the integrated Sobolev norm. We then use this bound to establish that for any closed interval I ⊂ ( 21 , 1], the set of λ ∈ I such that νλ is singular, has Hausdorff dimension less than 1 (see Theorem 5.4 and Corollary 5.6). In order to prove these results, we were led to develop a general method, which can be applied to improve previous results on Hausdorff dimensions of sums of Cantor sets, distance sets, and self-similar sets with deleted digits. (An exposition of our method for the case of Bernoulli convolutions is in [33].) Heuristically, the measures νλ may be viewed as nonlinear projections of the uniform measure on sequence space. Indeed, the analysis of νλ in [39] and [34] employed techniques developed by Kaufman [23] and Mattilla [30] to study orthogonal projections of sets and measures in Euclidean space. The following (informally stated) principles were first discovered by Marstrand [28], Kaufman [23], [24], and Mattila [29], [30] in the study of orthogonal projections, and then they were extended to a variety of other settings by Falconer [12], Hu and Received 6 January 1999. 1991 Mathematics Subject Classification. Primary 42B25; Secondary 28A78. Y. Peres’s work partially supported by National Science Foundation grant number DMS-9404391. W. Schlag’s work supported by a National Science Foundation Postdoctoral Fellowship at the Mathematical Sciences Research Institute, Berkeley, California. 193
194
PERES AND SCHLAG
Taylor [16], Pollicott and Simon [36], Solomyak [39], [40], [41], Peres and Solomyak [34], [35], and Sauer and Yorke [38]: • if an (m − )-dimensional measure is projected to m-dimensional space, then typically its dimension is preserved; • if an (m + )-dimensional measure is projected to m-dimensional space, then typically the projected measure is absolutely continuous, with a density in L2 . In making these principles precise, the appropriate notion of dimension of measures must be used (correlation or information dimension, but not Minkowski or packing dimension; see Jarvenpää [20] and [38]), and the parametrized family of generalized projections considered must be sufficiently rich (at least m-dimensional). A more delicate requirement is a certain “transversality condition” (e.g., [31, Lemma 3.11]). For Bernoulli convolutions and other self-similar measures, this condition involves bounds on the double zeros of certain power series, and it was first made explicit by Pollicott and Simon [36]. Transversality is crucial in the works of Solomyak [39], [40], [41], Peres and Solomyak [34], [35], and in the present paper. The general principles that underly our work can be informally stated as follows: • if an (m − )-dimensional measure µ is projected to an m-dimensional space, then the set of parameters where the projection of µ has lower dimension than µ itself, has codimension at least ; • if an (m + )-dimensional measure is projected to m-dimensional space, then typically the projected measure has a density with a fractional derivative of order
/2 in L2 , and the set of parameters where the projected measure is singular has codimension at least . The notion of Sobolev dimension of a measure, defined in (2.3) below, yields a unification of these two principles. For orthogonal projections, these strengthened principles have been stated precisely, and established as theorems, by Kaufman [24], Falconer [9], and Mattila [29], except that they did not consider fractional derivatives. Their proofs are based on averaging with respect to an appropriate Frostman measure on parameter space. In the setting of Bernoulli convolutions, this averaging approach is not powerful enough to establish the last principle, so we were forced to develop a different technique. Our methods apply to several concrete problems, which we now describe. Throughout the paper, dim (without subscripts) means Hausdorff dimension. Orthogonal projections. To motivate the general formulations, we start by clarifying the important results of Falconer [9] on exceptional directions for projections in Rd (see Sections 2 and 6). As a corollary, we find that for any Borel set E ⊂ Rd with dim E > 2 and for a.e. direction θ in the sphere S d−1 , the orthogonal projection projθ (E) of E to the line through 0 and θ has nonempty interior. More precisely (see Corollary 6.2), dim θ ∈ S d−1 : projθ (E) has empty interior ≤ d + 1 − dim E.
SMOOTHNESS OF PROJECTIONS
195
Bernoulli convolutions. Let νλ be the distribution series ∞ of the random Symmetric ∞ n , so that ν has the Fourier transform ν (ξ ) = n ξ ). Here we cos(λ ±λ λ λ n=0 n=0 sharpen the result of Solomyak [39] that νλ is absolutely continuous for a.e. λ ∈ ( 21 , 1) by showing that the 2, γ -Sobolev norm satisfies ∞ νλ (ξ )2 |ξ |2γ dξ < ∞ νλ 22,γ = −∞
for a.e. λ < 0.649 such that λ1+2γ > 21 . Moreover, for some constant C > 0 and all small > 0, we have
1 + , 1 : νλ is singular < 1 − C . dim λ ∈ 2 p
Bernoulli convolutions. Let νλ be the distribution of the random sum Asymmetric ∞ n , where the signs are chosen independently with probabilities (p, 1 − p). ±λ n=0 p Peres and Solomyak [35] showed that νλ is absolutely continuous for a.e. λ ∈ (p p (1− p p)1−p , 1), but νλ has a density in L2 for a.e. λ ∈ (p 2 + (1 − p)2 , 1) and not for any p smaller λ. The Pisot numbers still provide infinitely many examples of singular νλ . In the present paper, we show that ∞ p 2 2γ νλ (ξ ) |ξ | dξ < ∞ −∞
for a.e. λ < 0.649 such that λ1+2γ > p 2 + (1 − p)2 . Moreover, if p ∈ ( 13 , 23 ), then for some constant C > 0 and all > 0,
p dim λ ∈ p 2 + (1 − p)2 + , 1 : νλ ∈ L2 < 1 − C ,
p dim λ ∈ p p (1 − p)1−p + , 1 : νλ is singular < 1 − C . Intersections of sets with spheres. Suppose E ⊂ Rd is a Borel set with d ≥ 2. Let S0 ⊂ Rd be a strictly convex, closed C ∞ -hypersurface surrounding the origin. Then dim x ∈ Rd : (x + rS0 ) ∩ E = ∅ for a.e. r > 0 ≤ 1 + d − dim E. This statement fails if S0 is the boundary of a cube, or more generally, if S0 contains a piece of a hyperplane. If S0 is the sphere S d−1 , then the estimate above reduces to a strengthened form of Falconer’s result in [10] about distance sets. For any Borel set A ⊂ Rd with Hausdorff dimension dim A > (d +1)/2, there are points x ∈ A such that the pinned distance set {|x − y| : y ∈ A} has positive Lebesgue measure. Moreover, the set of x ∈ Rd where this fails has Hausdorff dimension at most d + 1 − dim A (see Corollary 8.4). Variants involving the Hausdorff dimension of the radii, as well as estimates where x is restricted to a hyperplane, can be found in Section 8.2. We also obtain results if S0 is assumed to be only in C 2,δ for some 1 > δ > 0.
196
PERES AND SCHLAG
Sums of Cantor sets. Let λ = (1 − λ)
∞
n
an λ : an ∈ {0, 1}
n=0
be the middle-α Cantor set for α = 1 − 2λ. Let K ⊂ R be any compact set. Peres and Solomyak [35] showed that Ᏼ1 (K + λ ) > 0 for almost every λ ∈ (0, 21 ) such that dim K +dim λ > 1, and that dim(K + λ ) = dim K +dim λ for a.e. λ ∈ (0, 21 ) such that dim K + dim λ < 1. In Theorem 5.12, we prove that 1 dim λ ∈ λ0 , : Ᏼ1 (K + λ ) = 0 ≤ 2 − (dim K + dim λ0 ), 2 dim λ ∈ (0, λ1 ) : dim(K + λ ) < dim K + dim λ ≤ dim K + dim λ1 . We also establish similar estimates for Cantor sets whose symbols belong to the Hölder space C 1,δ for some δ > 0. Keane-Smorodinsky {0, 1, 3}-problem. The Hausdorff dimension of the set ∞ i ai λ : ai ∈ {0, 1, 3} (λ) = i=0
has been studied by several authors, since it is perhaps the simplest example of a parametrized family of self-similar sets with overlap. For λ ≤ 41 , it is easy to see that (λ) is self-similar with Hausdorff dimension log 3/(− log λ). Keane, Smorodinsky, and Solomyak [26] proved that (λ) = [0, 3/(1−λ)] if λ > 25 and that dim (λ) < 1 for infinitely many λ ∈ ( 31 , 25 ). Pollicott and Simon [36] showed that for a.e. λ ∈ ( 41 , 31 ), one still has dim (λ) = log 3/(− log λ), and they found a dense subset of ( 41 , 31 ) where the dimension is strictly less than that fraction. Solomyak [39] finally established that for a.e. λ ∈ ( 13 , 25 ), the set (λ) has positive Lebesgue measure. In Theorem 5.10, we establish that 2 log 3 1 1 dim λ ∈ λ0 , : Ᏼ ( (λ)) = 0 ≤ 2 − for any λ0 > , 5 − log λ0 3 1 log 3 1 log 3 dim λ ∈ , λ1 : dim (λ) < ≤ for any λ1 > . 2 − log λ − log λ1 4 The latter inequality improves an estimate from [36]. Self-similar sets in the plane. Consider the self-similar sets ∞ S n Cλ = an λ : a n ∈ S , n=0
SMOOTHNESS OF PROJECTIONS
197
where S = {s1 , . . . , s# } ⊂ C is a fixed set of symbols. In [41] Solomyak showed that Ᏼ2 (CλS ) > 0 for a.e. λ > |l|−1/2 in a region of transversality. In Theorem 8.2, we prove that the dimension of the exceptional set for this property has to be strictly less than 2. Typical C 1 maps trade off dimension for smoothness. Hunt, Sauer, and Yorke [18] defined a Borel set A in a Banach space X to be prevalent if some Borel probability measure ν on X satisfies ν(A + x) = 1 for all x ∈ X. Sauer and Yorke [38] and Hunt and Kaloshin [17] showed that if µ is a Borel probability measure on Rd with correlation dimension at most m, then the set Aµ (d, m) of C 1 maps f : Rd → Rm that map µ to a measure with the same correlation dimension is prevalent. To prove this, they showed that the definition of prevalence holds with ν equal to Lebesgue measure on the set of m×d real matrices with entries bounded by 1 in absolute value. (These matrices are considered as linear maps from Rd to Rm .) This result can be obtained as a special case of Theorem 7.3, which also yields the following complement (see Remark 7.4): Let µ be a Borel probability measure on Rd with correlation dimension greater than m + 2γ . Then for a prevalent set of C 1 maps f : Rd → Rm , the image of µ under f has a density with at least γ fractional derivatives in L2 (Rm ). Consequently, if E ⊂ Rd has dim E > m (respectively, dim E > 2m), then for a prevalent set of maps f : Rd → Rm , the image f (E) has positive Lebesgue measure (respectively, nonempty interior) in Rm . All these examples are special cases of our main result, which is formulated in the following general setting. Given a measure µ on a compact metric space (', d) and a family of maps (λ : ' → Rm parameterized smoothly by λ ∈ Rn with n ≥ m, we show that for a.e. λ the projection of µ under (λ has as much “smoothness” as µ and that the Hausdorff dimension of the exceptional parameters decreases with the loss of smoothness. This requires the family (λ to satisfy some nondegeneracy assumption. In this paper, we impose the aforementioned transversality condition. In n where ω ∈ ' = {−1, +1}N the case of Bernoulli convolutions, (λ (ω) = ∞ ω λ n=0 n (N denotes the nonnegative integers). Here transversality means that the power series (λ (ω) − (λ (τ ) do not have double zeros. Solomyak showed in [39] that this is the case if λ < 0.649. Extending our smoothness results to the entire unit interval then requires arguments that are special to Bernoulli convolutions, namely, breaking the series up into various subseries. The paper is organized as follows. In Section 2, which is still mainly expository, we discuss transversality and state the general projection theorem in one dimension. Our estimate on the Hausdorff dimension of the set of λ for which νλ is a singular measure does not rely on Frostman’s lemma. Instead we use the following fact: Let {hj }∞ j =0 be a family of nonnegative C ∞ -functions on [0, 1] whose derivatives grow at most 1 j h (x) dx < ∞, then one can bound the dimension of exponentially in j . If 0 ∞ j 0 R j the set of x ∈ [0, 1] for which ∞ 0 r hj (x) = ∞ for any 1 ≤ r < R. This is proved in Section 3, see Lemma 3.1. In Section 3 we also deal with the case where {hj }∞ j =0 is assumed to have only finitely many derivatives. This is applied later to analyze
198
PERES AND SCHLAG
sums of Cantor sets with symbols in the Hölder space C 1,δ and to the geometric problem about intersections of sets with dilates of convex surfaces. It turns out that it suffices to assume some finite degree of smoothness on the surfaces. For Bernoulli convolutions, however, the C ∞ case suffices. In Section 4 we prove the general projection theorem in case m = n = 1. Our methods rely on the dyadic decomposition of frequency space (i.e., the LittlewoodPaley decomposition from harmonic analysis), which is recalled at the beginning of Section 4. To make the connection 3.1, which we sketched in the with Lemma previous paragraph, let hj (λ) = 2−j |ξ |∼2j | νλ (ξ )|2 dξ , where νλ is the distribution n j (1+2γ ) h (λ) dλ is controlled by the square of the of, say, ±λ . Then J ∞ j j =0 2 j 2, γ -Sobolev norm of νλ averaged in λ, whereas ∞ j =0 2 hj (λ) = ∞ characterizes 2 those λ ∈ J so that νλ does not have an L -density. Section 4 is split into two subsections. In the first subsection, we discuss the case where the projections have infinitely many derivatives with respect to the parameter, whereas the second subsection deals with the case where the dependence on the parameter has only some finite degree of smoothness. Applications of the one-dimensional scheme are discussed in Section 5. We start with classical Bernoulli convolutions, then consider asymmetric Bernoulli convolutions, the {0, 1, 3}-problem, and finally sums of Cantor sets. In all of these applications, the main new results concern smoothness of the densities of certain measures and the dimension of the set of those parameters for which some generic property fails. Section 6 discusses orthogonal projections in Euclidean space. We give a short proof of the projection theorem in this special case, which is simpler than the available proofs in the literature. Moreover, we obtain sufficient conditions for sets to have a.e. projection with nonempty interior. This section also serves to motivate the statement of the general projection theorem in higher dimensions, which is proved in Section 7. In Section 8, this result is applied to self-similar sets in the complex plane and to distance sets in arbitrary dimensions. We conclude the paper by stating some unsolved problems related to our work. Acknowledgments. We are grateful to P. Mattila and B. Solomyak for useful discussions. 2. A general scheme for families of projections. To motivate this section, we review some simple facts about orthogonal projections in Euclidean space. Definition 2.1. Let µ be a finite measure on Rn , n ≥ 2, with compact support. n−1 , its projection ν onto the line {tθ : t ∈ R} is given by f dν = For any θ ∈ S θ θ f ((x · θ )θ ) dµ(x) for any continuous f . For any α ∈ (0, n), the α-energy of µ is defined to be Ᏹα (µ) =
Rn
Rn
dµ(x)dµ(y) . |x − y|α
199
SMOOTHNESS OF PROJECTIONS
We measure smoothness of measures ν in Rn in terms of the homogeneous Sobolev norm 2 νˆ (ξ )2 |ξ |2γ dξ. ν 2,γ = Rn
Finiteness of ν 2,γ for some γ > 0 means that ν has γ (fractional) derivatives in L2 . The following proposition relates the smoothness of the projected measures to the energy of the original measure. If a, b > 0, then the notation a b means that C −1 a < b < Ca for some absolute constant C. Proposition 2.2. Let µ be a finite measure on Rn , n ≥ 2, with compact support, and let νθ be as in Definition 2.1. Suppose 0 < 1 + 2γ < n. Then S n−1 νθ 22,γ dθ Ᏹ1+2γ (µ). ˆ · θ)θ). Hence Proof. The projected measures νθ clearly satisfy νθ (ξ ) = µ((ξ ∞ ∞ 2 2γ νθ (tθ)2 |t|2γ dt dθ = νθ 22,γ dθ = |µ(tθ)| ˆ |t| dt dθ S n−1
S n−1 −∞
=2
Rn
S n−1 −∞
|µ(ξ ˆ )|2 |ξ |1+2γ −n dξ = cγ ,n Ᏹ1+2γ (µ).
The final equality follows from Plancherel’s theorem and the definition of energy; see Lemma 12.12 in [31]. The main purpose of this section is to study regularity properties of projected measures in a more general setting. Moreover, we bound the dimension of the set of those parameters for which the projected measure has less regularity than generically. In case of projections in the plane, the latter question was considered by Kaufman [24] and Falconer [9]; see also Section 6. For the exceptional set in Proposition 2.2 with γ = 0, Falconer found the estimate dim θ ∈ S 1 : νθ ∈ L2 (R) ≤ 2 − α if Ᏹα (µ) < ∞. However, his method does not apply to Bernoulli convolutions or other examples considered in this paper. Our approach is based on the Littlewood-Paley decomposition from harmonic analysis, which provides a simple characterization of various degrees of smoothness; see Section 4. Moreover, the analogue of Falconer’s bound is obtained without Frostman measures. Definition 2.3. For any finite measure ν on Rn , let its Sobolev dimension be defined as (2.1)
dims (ν) = sup α ∈ R :
|ˆν (ξ )| (1 + |ξ |) 2
Rn
α−n
dξ < ∞ .
Clearly, if 0 < dims (ν) < n, then dims (ν) = sup{α : Ᏹα (ν) < ∞}. In particular, if a Borel set E ⊂ Rn supports a probability measure µ with dims (µ) ≤ n, then
200
PERES AND SCHLAG
dim E ≥ dims (µ). If dims (ν) < n, then dims (ν) is also known as the correlation dimension of the measure ν. If dims (ν) = σ > n, then ν is absolutely continuous and its density has fractional derivatives of order (σ − n)/2 in L2 (Rn ). Throughout this paper, dim (without sub- or superscripts) means Hausdorff dimension. The following definition introduces the general framework we are working with. Definition 2.4. Let (', d) be a compact metric space, let J ⊂ R be an open interval, and let ( : J × ' → R be a continuous map. We assume that for any compact I ⊂ J and # = 0, 1, . . . , there exists a constant C#,I such that d# (2.2) # ((λ, ω) ≤ C#,I dλ for all λ ∈ I and ω ∈ '. Given any finite measure µ on ', let νλ = µ ◦ (−1 λ where (λ (·) = ((λ, ·). The α-energy of µ is defined as Ᏹα (µ) = dµ(ω1 ) dµ(ω2 )/d(ω1 , ω2 )α ' '
.
Informally, one can think of (λ (·) = ((λ, ·) as a family of projections parameterized by λ. We state two results on the smoothness of a typical νλ in terms of Ᏹα (µ) assuming certain transversality conditions. Definition 2.5. Let ', J , and ( be as in Definition 2.4. For any distinct ω1 , ω2 ∈ ' and λ ∈ J , let ((λ, ω1 ) − ((λ, ω2 ) . 3λ (ω1 , ω2 ) = d(ω1 , ω2 ) J is an interval of strong transversality for ( if there exist positive constants C, C# so that for all λ ∈ J and ω1 , ω2 ∈ ', # (i) |(d # /dλ# )3λ (ω1 , ω2 )| ≤ C# |(d/dλ)3λ (ω1 , ω2 ) for # = 2, 3, . . . , (ii) |(d/dλ)3λ | ≥ C. Theorem 2.6. Let J and ( be as in Definition 2.4 and suppose that J is an interval of strong transversality for (. Let µ be a finite positive measure on ' with finite α-energy for some α > 0. Then for any compact I ⊂ J , νλ (ξ )2 dλ ≤ Cα |ξ |−α Ᏹα (µ). (2.3) I
Moreover, dims (νλ ) ≥ α for a.e. λ ∈ J . More precisely, for any σ ∈ (0, α], dim λ ∈ J : dims (νλ ) < σ ≤ σ + min(1 − α, 0). (2.4) The condition of strong transversality turns out to be too restrictive for most applications. Since the cosine function has vanishing derivative at 0, π, it fails for projections onto lines. More importantly, (2.3) fails for Bernoulli convolutions because of Pisot numbers; See Lemma 5.7. The correct notion of transversality in the context of Bernoulli convolutions turns out to be the following one.
SMOOTHNESS OF PROJECTIONS
201
Definition 2.7. Let 3λ be as in Definition 2.5. For any β ∈ [0, 1), we say that J is an interval of transversality of order β for ( if there exists a constant Cβ so that for all λ ∈ J and ω1 , ω2 ∈ ', the condition |3λ (ω1 , ω2 )| ≤ Cβ d(ω1 , ω2 )β implies d 3λ ≥ Cβ d(ω1 , ω2 )β . dλ
(2.5)
In addition, we say that (λ is L-regular on J for some positive integer L or L = ∞, if under the same condition and for some constants Cβ,# , (2.6)
# d −β# dλ# 3λ (ω1 , ω2 ) ≤ Cβ,# d(ω1 , ω2 )
for # = 1, 2, . . . , L.
Our main result in this section is the following theorem. Theorem 2.8. Let ', J , and ( be as in Definition 2.4, and suppose that J is an interval of transversality of order β for ( for some β ∈ [0, 1) and that (λ is ∞-regular on J . Let µ be a finite positive measure on ' with finite α-energy for some α > 0. Then for any compact I ⊂ J , 2 νλ dλ ≤ Cγ Ᏹα (µ) if 0 < (1 + 2γ )(1 + a0 β) ≤ α, (2.7) 2,γ I
where a0 is some absolute constant. Moreover, for any σ ∈ (0, α], (2.8)
dim λ ∈ J : dims (νλ ) ≤ σ ≤ 1 + σ −
α , 1 + a0 β
and for any σ ∈ (0, α − 3β], (2.9)
dim λ ∈ J : dims (νλ ) < σ ≤ σ.
In all our applications of this theorem, we are able to take β arbitrarily small. Moreover, the geometric problems we consider in this paper satisfy transversality with β = 0; see Sections 6 and 8.2. Some of these geometric estimates are known to be optimal. Since they are covered by our general theory, it follows that the corresponding cases of Theorem 2.8 are sharp; see Section 6 for more discussion. However, we do not know whether Theorem 2.8 is optimal in all cases. The basic idea behind the proof of (2.7) is that for any finite measure ν on R and γ ∈ (− 21 , 1), 1/2 ∞ 2
(2.10) ν 2,γ 22j (γ +1) ν x − 2−j , x − ν x, x + 2−j j =−∞ 2 L (dx)
202
PERES AND SCHLAG
as one can easily check by applying Plancherel’s theorem to the right-hand side. However, it is difficult to work with the square-function in (2.10) because of the singularities of the kernel χ[−1,0] − χ[0,1] . As is standard in harmonic analysis, one circumvents this difficulty by considering a smoother kernel. The most convenient form is given in terms of the Littlewood-Paley decomposition; see Lemma 4.1. Theorem 2.8 has a simple corollary concerning the dimension of the exceptional set with respect to absolute continuity without making any assumptions on the energy of µ. The hypothesis is in terms of the upper and lower information dimension of µ. Recall that the lower pointwise dimension is given by πµ− (ω) = lim inf r→0+
log µ(B(ω, r)) , log r
and set
dimI (µ) = πµ− L∞ (dµ)
and
dimI (µ) = essinf dµ(ω) πµ− (ω) .
Notice that Ᏹα (µ) < ∞ implies dimI (µ) ≥ α, but the converse is clearly false. Corollary 2.9. Suppose J is an interval of transversality of order β for ( and that (λ is ∞-regular on J . Then (2.11) (2.12)
dimI (µ) dim λ ∈ J : νλ is singular ≤ 2 − , 1 + a0 β dimI (µ) dim λ ∈ J : νλ is not absolutely continuous ≤ 2 − , 1 + a0 β
where a0 is the constant from Theorem 2.8. Proof. Let α = dimI (µ) and fix some α˜ < α and > 0. Then
log µ B(ω, r) ˜ ' = ω : lim inf > α˜ r→0+ log r ˜ > 0. By Egoroff’s theorem, there exists ' ⊂ ' ˜ so that µ(' ˜ \ ' ) < satisfies µ(')
and
log µ B(ω, r) ≥ α˜ uniformly on ' . lim inf r→0+ log r ( )
Let µ = µ ' and νλ = µ ◦ (−1 ˜ (µ ) < ∞, and thus by λ . By definition, Ᏹα− Theorem 2.8, ( ) dim λ ∈ J : dims νλ ≤ σ ≤ 1 + σ − (α˜ − )(1 + a0 β)−1 .
In particular, (2.13)
( ) dim λ ∈ J : νλ is singular ≤ 2 − (α˜ − )(1 + a0 β)−1 .
SMOOTHNESS OF PROJECTIONS
Because
203
(2−j ) λ ∈ J : νλ is singular ⊂ lim sup λ ∈ J : νλ is singular , j →∞
(2.11) follows by letting α˜ → α and → 0+ in (2.13). The proof of (2.12) is very similar and is omitted. 3. Exceptional parameters for convergence of power series 3.1. The C ∞ case. The dimension bounds (2.4) in case α ≤ 1 and (2.9) are analogous to Kaufman’s result for orthogonal projections and involve Frostman measures as in his original proof. However, if α > 1, estimates (2.4) and (2.8) are derived from the smoothness bounds by means of the following lemma, which we state for parameters in Rn for an arbitrary n ≥ 1. Recall that the length of a multi-index η = (η1 , η2 , . . . , ηn ) ∈ Nn is defined to be |η| = η1 + · · · + ηn , and that ∂ η = ∂ |η| /(∂λ1 )η1 · . . . ·(∂λn )ηn , where λ = (λ1 , . . . , λn ). For applications to Bernoulli convolutions, it suffices to read the following lemma with n = 1. ∞ Lemma 3.1. Let Q ⊂ Rn be a nonempty open set. Suppose {hj }∞ 0 ∈ C (Q) satisfy sup A−j |η| ∂ η hj ∞ ≤ Cη for all multi-indices η ∈ Nn
(3.1)
j ≥0
sup
j ≥0 Q
R j |hj (λ)| dλ ≤ C∗ < ∞,
where A > 1. Suppose that R > rj ≥ 1. (i) If An < R/r, then ∞ j =0 r |hj (λ)| < ∞ for all λ ∈ Q. (ii) If Aα ≤ R/r ≤ An , then (3.2)
∞ dim λ ∈ Q : r j |hj (λ)| = ∞ ≤ n − α. j =0
Proof. Fix some 0 < r < R with An ≥ R/r. It suffices to prove (3.2) for any compact cube Q ⊂ Q. Fix such a Q and define Ej = {λ ∈ Q : |hj (λ)| > (1/j 2 )r −j }. Then ∞ (3.3) r j |hj (λ)| = ∞ ⊂ lim sup Ej . λ ∈ Q : j →∞ j =0
We estimate the (n − α)-Hausdorff measure of lim sup Ej by covering each Ej with cubes of side length A−j . The idea is that any point in Ej has a neighborhood of size A−j on which the average of |hj | is at least C(1/j 2 )r −j . More precisely, for
204
PERES AND SCHLAG
any positive integer N , N N
N (3.4) (−1)i hj (λ + iy) ≤ |y|N sup ∂ η hj ∞ ≤ CN |y|Aj . i |η|=N i=0
For any j = 0, 1, . . . and λ0 ∈ Q , let hj (λ0 + λ)dλ, Ij,N (λ0 ) = [−N Lj,N ,NLj,N ]n where Lj,N > 0 is determined below. In view of (3.4), with some dimensional constant bn , N N bn CN N+n j N (−1)i hj (λ0 + iy) dy Lj,N A ≥ n N +n [−Lj,N ,Lj,N ] i=0 i (3.5) N N 1 ≥ (2Lj,N )n hj (λ0 ) − Ij,N (λ0 ). i in i=1
j 2 r j )−1/N with a suitable constant C , one has In particular, setting Lj,N = A−j (CN N
n 1 2 j −1 j r 2Lj,N ≥ (2Lj,N )n |hj (λ0 )| − 2N Ij,N (λ0 ), 2 and therefore, for any λ0 ∈ Ej , (3.6) Ij,N λ0 ≥ 2−N j −2 r −j (2Lj,N )n if j is sufficiently large (depending on dist(Q , ∂Q)). Fix some positive integer N, and let {Uij : i = 1, . . . , Mj,N } be a covering of Ej with disjoint cubes of side length 2NLj,N . Pick any λij ∈ Uij ∩ Ej . Setting λ0 = λij in (3.6) and summing over i = 1, . . . , Mj,N yields
n Mj,N 2−N j −2 r −j 2Lj,N ≤ 2n hj (λ)dλ ≤ 2nC∗ R −j . Q
Therefore, by the definition of Lj,N , Mj,N
(3.7)
"j ! n 1+(n/N) 2(1+(n/N)) A r ˜ ≤ CN j . R
Let α ∈ (0, 1) satisfy Aα r α/N < R/r. In view of (3.7), Ᏼ
n−α
lim sup Ej
≤ C˜ N lim
k→∞
= C˜ N lim
k→∞
∞
n−α Mj,N 2NLj,N
j =k ∞ 1+(α/N) r j =k
R
α
A
j
j 2(1+(α/N)) = 0.
SMOOTHNESS OF PROJECTIONS
205
Thus (3.2) follows from (3.3) by letting N → ∞. To prove assertion (i), assume that An < R/r and let N be a fixed positive integer such that r n/N An < R/r. Let λ0 ∈ Ej for some sufficiently large j . It follows from (3.6) that
j R , C∗ ≥ R j Ij,N (λ0 ) ≥ C˜ N j −2(1+(n/N)) Ar 1+(n/N) which is a contradiction for large j . Thus lim supj →∞ Ej = ∅, and (i) follows from (3.3). L 3.2. The case of limited regularity. Suppose Q ⊂ Rn and {hj }∞ 0 ⊂ C (Q) for L some positive integer L. Here C denotes the space of functions that are L times continuously differentiable. Under assumption (3.1) for |η| ≤ L, the previous proof with N = L yields (3.2), provided Aα r α/L ≤ R/r ≤ An r n/L . Below we show how to weaken this condition on α under a slightly different assumption. For a positive integer L and δ ∈ [0, 1), we let C L,δ denote the space of functions that are L times continuously differentiable and such that the derivatives of order L satisfy a Hölder condition of order δ (we write C 0,δ = C δ ).
Lemma 3.2. Let Q ⊂ Rn be an open set and suppose {hj }∞ 0 are nonnegative j and uniformly bounded functions satisfying supj ≥0 R Q hj (λ) dλ < C∗ < ∞. Let j Hj = i=0 B i−j hi for some fixed B ≥ A > 1. 1 (i) Let 1 ≤ r < R and r ≤ B. Assume that {hj }∞ 0 ⊂ C (Q) and ∇Hj (λ) ≤ C1 Aj Hj (λ) for all j = 0, 1, 2 . . . , and all λ ∈ Q. j If An < R/r, then ∞ j =0 r hj < ∞ on Q. Otherwise, ∞ dim λ ∈ Q : (3.8) r j hj (λ) = ∞ ≤ n − α, j =0
Aα
≤ R/r. provided (ii) Let B < r < R. Assume that, for some integer L ≥ 1 and δ ∈ [0, 1), one has L,δ (Q) and {hj }∞ 0 ⊂C ∇Hj (λ) ≤ C1 Aj Hj (λ) for all j = 0, 1, 2 . . . and all λ ∈ Q, (3.9)
$ # sup ∂ η Hj (λ1 ) − ∂ η Hj (λ2 ) ≤ CL,δ |λ1 − λ2 |δ Aj (L+δ) Hj (λ1 ) + Hj (λ2 ) .
|η|=L
j If An (r/B)n/(L+δ) < R/r, then ∞ j =0 r hj < ∞ on Q. Otherwise, ∞ dim λ ∈ Q : (3.10) r j hj (λ) = ∞ ≤ n − α, j =0
provided Aα (r/B)n/(L+δ) ≤ R/r.
206
PERES AND SCHLAG
Proof. It suffices to prove these statements for an arbitrary cube Q ⊂ Q. We start with the proof of (3.8). Fix 0 < r0 < r1 ≤ B and T ∈ (0, ∞), and let ∞ (T ) −2 −j i Ej = λ ∈ Q : hj (λ) > j r1 , (3.11) r0 hi (λ) ≤ T i=0
for j = 0, 1, . . . . Clearly, ∞ ∞ ∞ % (T ) i i (3.12) r1 hi (λ) = ∞, r0 hi (λ) < ∞ ⊂ lim sup Ej . λ∈Q : i=0
T =1 j →∞
i=0
M
(T )
j be a covering of Ej with disjoint Fix some positive integer T , and let {Uij }j =1 cubes of side length Wj < A−j for j = 0, 1, . . . . To make an appropriate choice of Wj , we bound the variation of hj on cubes. By the bound on |∇Hj |, d Hj (λ + te) ≤ C1 Aj Hj (λ + te) dt
for any unit vector e and λ, λ + te ∈ Q . By Gronwall’s inequality (see, e.g., [14, Chapter 10]), Hj (λ) ≤ Hj (λ0 ) exp(C1 Aj |λ − λ0 |) for any λ, λ0 ∈ Q . Hence, for all j = 0, 1, . . . , Hj (λ) − Hj (λ0 ) ≤ CAj |λ − λ0 |Hj (λ0 ), (3.13) provided |λ − λ0 | < A−j . Since r0 < B, ∞ j =0
j
r0 Hj (λ0 ) ≤
j i ∞ r0 j B j =0
B
i=0
r0
∞
−1 r0i hi (λ0 ) ≤ 1 − r0 /B r0i hi (λ0 ). i=0
Combining this with (3.13) yields (3.14) # $ $ # hj (λ) − hj (λ0 ) = B −j B j Hj (λ) − Hj (λ0 ) − B j −1 Hj −1 (λ) − Hj −1 (λ0 ) j ∞
−1 A ≤ C 1 − r0 /B |λ − λ0 | r0i hi (λ0 ) r0 i=0
(T ) Uij ∩ Ej
if |λ − λ0 | < A−j . Applying this to any λ0 ∈ and λ ∈ Uij , one concludes that
−1 A j 1 −j 1 −j |λ − λ0 | ≥ 2 r1 hj (λ) ≥ 2 r1 − CT 1 − r0 /B r0 j 2j 2 j −j ˜ if Wj = C1/j (r0 /r1 ) A , where C˜ depends only on T , B, and r0 . Therefore, 1 1 −j −j −j C∗ R ≥ (3.15) hj (λ) dλ ≥ &Mj r1 dλ = 2 Mj Wjn r1 , 2 2j 2j Q i=1 Uij
207
SMOOTHNESS OF PROJECTIONS
and thus, by choice of Wj , Mj ≤ 2C∗ j 2(1+n)
r
1
R
An
j r j n 1
r0
.
Hence,
(T )
Ᏼn−α lim sup Ej
≤ lim
k→∞
j →∞
∞ j =k
Mj Wjn−α
r1 n j r1 j n A 2C∗ j ≤ lim k→∞ R r0 j =k j (n−α) r0 × C˜ n−α j −2(n−α) A−j (n−α) r1 ∞ r
1 α j r1 j α A = C lim j 2(1+α) = 0, k→∞ R r0
∞
2(1+n)
j =k
)α .
Aα
< R/r1 (r0 /r1 provided Therefore, in view of (3.12), ∞ ∞ i i (3.16) r1 hi (λ) = ∞, r0 hi (λ) < ∞ ≤ n − α dim λ ∈ Q : i=0
i=0
)α
if Aα
≤ R/r1 (r0 /r1 and 0 < r0 < r1 ≤ A. To prove (3.8), choose r = ρm > ρm−1 > · · · > ρ2 > 1 > ρ1 . Then
(3.17) λ∈Q:
∞
i
r hi (λ) = ∞ =
i=0
and (3.16) imply that
m %
λ∈Q:
k=2
dim λ ∈ Q :
∞ i=0
∞
ρki hi (λ) = ∞,
∞ i=0
i ρk−1 hi (λ) < ∞
r i hi (λ) = ∞ ≤ n − α
i=0
Aα
)α .
≤ R/r mini=2,...,m (ρi−1 /ρi Letting max ρi /ρi−1 → 1 finishes the proof. If An < R/r1 (r0 /r1 )n , then (3.15) with Mj = 1 leads to a contradiction if j is sufficiently large. Subdividing as in (3.17) therefore shows that the left-hand side of (3.17) is empty if R/r > An , and the proof of (i) is complete. (T ) For the proof of (ii), let r > r0 = B and T ∈ (0, ∞), and define Ej as in (3.11).
if
(T )
The definition of Hk implies that supk≥0 B k Hk (λ0 ) ≤ T for all λ0 ∈ Ej Therefore, by (3.13), |Hk (λ) − Hk (λ0 )| ≤ CT |λ − λ0 |, and thus (3.18)
Hk (λ) ≤ CT B −k
and all j .
208
PERES AND SCHLAG (T )
provided |λ0 − λ| < B −k for all λ0 ∈ Ej , and all j and k. Let N be the smallest integer greater than or equal to L + δ. In view of the definition of Hj , the Hölder estimate (3.9) on ∂ η Hj for |η| = L, and (3.18) with k = j and k = j − 1, N N i (−1) hj (λ0 + iy) i i=0 N N ( ' = B −j (−1)i B j Hj (λ0 + iy) − B j −1 Hj −1 (λ0 + iy) i (3.19) i=0
L+δ ≤ CL,δ Aj |y| max Hj (λ0 + ty) + max Hj −1 (λ0 + ty) 0≤t≤N
0≤t≤N
L+δ
≤ CL,δ T Aj |y| (T )
if λ0 ∈ Ej
B −j
1 ≤ 2 r −j 2j
and |y| ≤ Cj −2/(L+δ) A−j (B/r)j/(L+δ) = Wj , where C = C(L, δ, T ). M
(T )
j Now let {Uij }i=1 be a covering of Ej
(T ) some λij ∈ Uij ∩ Ej Wj , λij + Wj ]n yields
with cubes of side length 2NWj , and fix
for all i and j . Integrating (3.19) with λ0 = λij over [λij −
N
N 1 1 −j n n r Wj ≥ hj (λij )Wj − hj λij + λ dλ i i n [−N Wj ,NWj ] 2j 2 i=1
and thus (3.20)
[−NWj ,N Wj ]
−1 n Wj . hj λij + λ dλ ≥ 2−N 2j 2 r j
Summing (3.20) over i = 1, . . . , Mj implies, in view of our assumption C∗ R −j ≥ Q hj (λ) dλ, that 1 r −j dλ 2nC∗ R −j ≥ 2n hj (λ) dλ ≥ &Mj N +1 j 2 2 Q i=1 Uij (3.21)
n −j 1 = N +1 2 Mj 2NWj r . 2 j Therefore, by definition of Wj , j j n/(L+δ) r r Mj ≤ Cj 2n/(L+δ) Anj . R B Hence,
α/(L+δ) R r (3.22) Ᏼ < = 0 if A B r ∞ i for all T . Since for any r > B, the set {λ ∈ Q : i=0 r hi (λ) = ∞} is contained in n−α
(T ) lim sup Ej j →∞
α
209
SMOOTHNESS OF PROJECTIONS
(3.23) λ∈Q:
∞
r i hi (λ) = ∞,
i=0
∞
B i hi (λ) < ∞ ∪ λ ∈ Q :
i=0
∞
B i hi (λ) = ∞ ,
i=0
(3.10) follows from (3.12), (3.22), and (3.8). To apply (3.8) to (3.23), one needs to check that Aα (r/B)α/(L+δ) < R/r implies that Aα ≤ R/B. However, this is an immediate consequence of B < r. (T ) Finally, if An (r/B)α/(L+δ) < R/r, then (3.21) with Mj = 1 shows that Ej = ∅ j if j is sufficiently large and thus r hj < ∞ on Q, as claimed. 4. Proofs of the general projection theorems 4.1. The C ∞ case. To estimate Sobolev norms, it is convenient to decompose frequency space dyadically. For the sake of completeness, we first recall the construction of such a Littlewood-Paley decomposition; see Stein [43] and Frazier, Jawerth, and Weiss [13]. (Rn ) is the Schwartz space of smooth functions all of whose derivatives decay faster than any power. It is a basic property of the Fourier transform that it preserves . Lemma 4.1. There exists ψ ∈ (Rm ) so that ψˆ ≥ 0, (4.1)
supp ψˆ ⊂ ξ ∈ Rm : 1 ≤ |ξ | ≤ 4 ,
∞
ψˆ 2−j ξ = 1
if ξ = 0.
j =−∞
Moreover, given any finite measure ν on Rm and any γ ∈ R, ∞
(4.2) ν 22,γ ψ2−j ∗ ν (x) dν(x), 22j γ j =−∞
Rm
where ψ2−j (x) = 2j m ψ(2j x). ˆ ) = 1 for |ξ | ≤ 1, and φ(ξ ˆ )=0 Proof. Choose φ ∈ (Rm ) with φˆ ≥ 0, φ(ξ ˆ ˆ ˆ ˆ for |ξ | > 2. Define ψ via ψ(ξ ) = φ(ξ/2) − φ(ξ ). It is clear that ψ(ξ ) ≥ 0 and ˆ ) = 0 if |ξ | < 1 or |ξ | > 4. Equation (4.1) holds since the sum telescopes. that ψ(ξ Moreover, it is clear from (4.1) that there exists some constant Cγ depending only on γ so that for any ξ = 0, Cγ−1 |ξ |2γ ≤
∞
22j γ ψˆ 2−j ξ ≤ Cγ |ξ |2γ .
j =−∞
ˆ −j Since ψ) 2−j (ξ ) = ψ(2 ξ ), Plancherel’s theorem implies
ψ2−j ∗ ν (x) dν(x) = ψˆ 2−j ξ |ˆν (ξ )|2 dξ, Rm
and (4.2) follows. Notice that (4.2) is closely related to (2.10).
210
PERES AND SCHLAG
The following proposition shows how to obtain a dimension bound from a suitable Sobolev estimate. This depends on Lemma 3.1. Proposition 4.2. Let J and νλ be as in Definition 2.4. If I νλ 22,γ dλ < ∞ with some I ⊂ J and γ > − 21 , then for all σ ∈ [0 ∧ 2γ , 1 + 2γ ), one has dim{λ ∈ I : dims (νλ ) ≤ σ } ≤ σ − 2γ . Proof. Let ψ be the Littlewood-Paley function from Lemma 4.1. For any j = 0, 1, 2, . . . , define hj (λ) = 2−j ψ) νλ (ξ )|2 dξ 2−j (ξ )| (4.3) # $
= ψ 2j ((λ, ω1 ) − ((λ, ω2 ) dµ(ω1 ) dµ(ω2 ). ' '
Lemma 4.1 implies that C νλ 22,γ ≥
(4.4)
∞
2j (1+2γ ) hj (λ).
j =0
Moreover, for any > 0, (4.5)
λ ∈ I : dims (νλ ) ≤ σ ⊂
λ∈I :
∞
j 2σ + hj (λ) = ∞
j =0
by (2.1) and Lemma 4.1. Since (2.2) implies that for any j, # = 0, 1, . . . , # j# $ (#) d ψ 2 ((λ, ω1 ) − ((λ, ω2 ) dµ(ω1 ) dµ(ω2 ) hj (λ) = # dλ ' ' l i d ( ≤ C# 2 j # , ≤ C# 2 j # dλi ∞ L (I ×') i=0
the proposition follows from Lemma 3.1 with n = 1, A = 2, r = 2σ + , and R = 22γ +1 as → 0+. Proposition 4.4 below deals with the case α ≤ 1 in Theorems 2.6 and 2.8, see (2.4) and (2.8). In that case it is more efficient to rely on Frostman measures than on the previous proposition. The idea of using Frostman measures appears repeatedly in geometric measure theory; see [31]. First we prove a simple technical lemma that describes the location of the zeros of 3λ . Lemma 4.3. Let J and ( be as in Definition 2.4. Suppose J = (λ0 , λ1 ) is an interval of transversality of order β for ( and that (λ is 1-regular on J . Let Cβ be the constant from (2.5) and set r = d(ω1 , ω2 ). Then
SMOOTHNESS OF PROJECTIONS
211
Nβ % λ ∈ J : |3λ | < Cβ r β = Ij ,
(4.6)
j =1
where Ij are disjoint open intervals of length C −1 r 2β ≤ |Ij | ≤ C and Nβ ≤ Cr 2β . With the possible exception of the at most two intervals touching ∂J , each Ij contains a unique zero λj of 3λ . If λ0 ∈ I1 and I1 does not contain a zero of 3λ , then we let λ1 = λ0 and similarly with λ1 . All constants depend only on β and the constants in Definition 2.7. Proof. Let the intervals Ij be defined by (4.6). Since Cβ r β ≤ |(d/dλ)3λ | ≤ Cβ,1 r −β on each Ij by Definition 2.7, their lengths are as claimed. The other statements are now clear. Proposition 4.4. Let ', J , and ( be as in Definition 2.4. Assume that J is an interval of transversality of order β for (, that (λ is 1-regular on J , and that the measure µ on ' has finite α-energy for some α ∈ (0, 1]. Then νλ = µ ◦ (−1 λ satisfies (4.7) Ᏼσ λ ∈ J : dims (νλ ) < σ = 0 for any σ ∈ (0, α − 3β]. If J is an interval of strong transversality, then (4.7) holds for all σ ∈ (0, α]. Proof. Suppose (4.7) fails for some σ . By outer regularity of Hausdorff measure, there exists 0 > 0 so that
Ᏼσ {λ ∈ J : dims (νλ ) ≤ σ − 0 } > 0. By Frostman’s lemma, there exists a nonzero measure ρ on J so that ρ(I ) ≤ |I |σ for all intervals I and
ρ {λ ∈ J : dims (νλ ) > σ − 0 } = 0. (4.8) Frostman’s lemma can be applied since {λ ∈ J : dims (νλ ) > κ} is an Ᏺσ -set for any κ > 0. Indeed, ∞ ∞ 2 κ+δ−1 % % λ∈J : νλ (ξ ) |ξ | (4.9) λ ∈ J : dims (νλ ) > κ = dξ ≤ M . δ>0 M=1
Since
−∞
iξ (λ (ω) − eiξ (λ0 (ω) dµ(ω) e ' ≤ |ξ | (λ (ω) − (λ0 (ω) dµ(ω) ≤ C|ξ ||λ − λ0 |,
νλ (ξ ) − ν* λ (ξ ) ≤ 0
'
the sets on the right-hand side of (4.9) are closed, as claimed. Thus for small > 0 and with r = d(ω1 , ω2 ),
212
PERES AND SCHLAG
J
(4.10)
∞
∞
dνλ (x) dνλ (y) dρ(λ) σ − −∞ −∞ |x − y| dµ(ω1 ) dµ(ω2 ) dρ(λ) = |((λ, ω1 ) − ((λ, ω2 )|σ − J ' ' dµ(ω1 ) dµ(ω2 ) = χ[|3λ |≥Cβ r β ] |3λ (ω1 , ω2 )|−(σ − ) dρ(λ) d(ω1 , ω2 )σ − ' ' J dµ(ω1 ) dµ(ω2 ) + χ[|3λ |≤Cβ r β ] |3λ (ω1 , ω2 )|−(σ − ) dρ(λ) d(ω1 , ω2 )σ − ' ' J dµ(ω1 ) dµ(ω2 ) ≤C (σ − )(1+β) d(ω 1 , ω2 ) ' ' N β dρ(λ) dµ(ω1 ) dµ(ω2 ) C + σ − λ − λj d(ω1 , ω2 )(σ − )(1+β) ' '
(4.11)
≤C
'
j =1
dµ(ω1 ) dµ(ω2 ) , d(ω1 , ω2 )α '
since σ + 3β ≤ α. The sum is obtained by applying Lemma 4.3 to (4.10). However, (4.11) contradicts (4.8) if < 0 . The easier case of strong transversality is implicit in the above and is omitted. Proof of Theorem 2.6. Fix some compact I ⊂ J , and let ρ ∈ C ∞ (R) be nonnegative so that ρ = 1 on I and supp(ρ) ⊂ J . Fix distinct ω1 , ω2 ∈ ' and let r = d(ω1 , ω2 ). Then for any nonnegative integer N, ∞ ∞ eiξ [((λ,ω1 )−((λ,ω2 )] ρ(λ) dλ = eiξ r3λ (ω1 ,ω2 ) ρ(λ) dλ −∞ −∞ (4.12) ∞ eiξ r3λ (ω1 ,ω2 ) LN (ρ)(λ) dλ, = −∞
where L is the differential operator L(·) = (d/dλ)((−iξ r3λ (ω1 , ω2 ))−1 ·). It follows from Definition 2.5 that |Lk (f )| ≤ Ck |ξ r|−k ki=0 f (i) ∞ for all k. In particular, applying (4.12) with N = 0 and N ≥ α yields ∞
−α iξ [((λ,ω1 )−((λ,ω2 )] e ρ(λ) dλ ≤ Cα |ξ |d(ω1 , ω2 ) . −∞
Therefore, ∞ ∞ νλ (ξ )2 ρ(λ) dλ = eiξ [((λ,ω1 )−((λ,ω2 )] ρ(λ) dλ dµ(ω1 ) dµ(ω2 ) −∞ ' ' −∞ dµ(ω1 ) dµ(ω2 ) ≤ C|ξ |−α . d(ω1 , ω2 )α ' '
SMOOTHNESS OF PROJECTIONS
213
This proves (2.3) and also that I νλ 22,(α−1)/2− dλ < ∞ for any small > 0. The dimension bound (2.4) in case α ≥ 1 thus follows from Proposition 4.2, whereas (2.4) with α ≤ 1 is covered by Proposition 4.4. In view of Propositions 4.2 and 4.4, Theorem 2.8 follows from the Sobolev estimate (2.7). This is proved using the characterization (4.2). The main technical statement is Lemma 4.6 below. The following routine calculus lemma is needed in the proof of that lemma. Lemma 4.5. Let L be a positive integer and suppose h ∈ C L (I ) satisfies h = 0 on some interval I ⊂ R. Let H denote the inverse of h. Then for any positive integer # ≤ L, (4.13)
H (#) =
#−1 t=0 #, $ p$
p
p
−(#+p1 +···+pt ) (# ) h 1 ◦ H 1 · · · · · h(#t ) ◦ H t a#, $ p$ h ◦ H
$ with some integer coefficients a#, $ p$ . The inner sum runs over vectors # = (#1 , . . . , #t ) and p$ = (p1 , . . . , pt ) with integer coordinates. Moreover, if # ≥ 2, one has a#, $ p$ = 0 unless # ≥ #i ≥ 2 for all i, min pj ≥ 1, and #1 p1 + · · · + #t pt ≤ 2(# − 1). Proof. If # = 1, then (4.13) holds with t = 0. The general case now follows easily by induction. Lemma 4.6. Let J and ( be as in Definition 2.4. Assume that J is an interval of transversality of order β for ( and that (λ is ∞-regular on J . Suppose ρ ∈ C ∞ (R) is supported on J . Let ψ be the Littlewood-Paley function from Lemma 4.1. Then for any distinct ω1 , ω2 ∈ ', any integer j , and any positive integer q, (4.14)
∞
−∞
−q ρ(λ)ψ 2 [((λ, ω1 ) − ((λ, ω2 )] dλ ≤ Cq 1 + 2j d(ω1 , ω2 )1+a0 β ,
j
where Cq depends only on q, ρ, and β, and a0 is some absolute constant. Proof. Fix distinct ω1 , ω2 ∈ ' and j, q as above. We may assume that 2j r > 1 where r = d(ω1 , ω2 ). Let φ ∈ C ∞ be nonnegative with φ = 1 on [− 21 , 21 ] and supp(φ) ⊂ [−1, 1]. Then ∞ # $ ρ(λ)ψ 2j ((λ, ω1 ) − ((λ, ω2 ) dλ −∞
(4.15) = ρ(λ)ψ 2j r3λ (ω1 , ω2 ) φ Cβ−1 r −β 3λ dλ
#
$ + ρ(λ)ψ 2j r3λ (ω1 , ω2 ) 1 − φ Cβ−1 r −β 3λ dλ. Here Cβ is the constant from Definition 2.7. By the rapid decay of ψ,
214 PERES AND SCHLAG
#
$ ρ(λ)ψ 2j r3λ (ω1 , ω2 ) 1 − φ C −1 r −β 3λ dλ β
−q
−q ≤ Cq,β |ρ(λ)| 1 + 2j r 1+β dλ ≤ Cq,β 1 + 2j r 1+β . Thus it suffices to estimate the first integral in (4.15). By Lemma 4.3, there exists χ ∈ C ∞ (R) depending only on β so that supp(χ) is compact, χ = 1 on a neighborhood of the origin, and such that
χ r −2β λ − λj = χ r −2β λ − λj φ Cβ−1 r −β 3λ , where these functions have disjoint supports for distinct j . We can therefore write the first integral in (4.15) as
ρ(λ)ψ 2j r3λ φ Cβ−1 r −β 3λ dλ =
Nβ i=1
+
ρ(λ)χ r −2β λ − λj ψ(2j r3λ ) dλ
Nβ −2β
χ r λ − λj ψ 2j r3λ φ Cβ−1 r −β 3λ dλ ρ(λ) 1 −
i=1
Nβ
=
A(j ) + B.
i=1
To estimate the second term B, notice that |3λ | ≥ Cr 3β on the support of the integrand. The rapid decay of ψ therefore implies
−q |B| ≤ Cq,β 1 + 2j r 1+3β . For simplicity, we assume henceforth that i = 1 and set λ = λ1 . By choice of χ, we have |(d/dλ)3λ | ≥ Cβ r β on the support of the integrand of A(1) . In order to change variables in A(1) , we define H via
3λ = u ⇐⇒ λ = λ + H (u), provided χ r −2β λ − λ = 0. (4.16) Let F (u) = ρ(λ + H (u))χ (r −2β H (u))H (u). Thus
A(1) = F (u)ψ 2j ru du 2(q−1) F (#) (0)
u# + O F (2q−1) (u)u2q−1 du ψ(2j ru) (4.17) = #! |u|<(2j r)−1/2 #=0
(1) (1) + O (2j r|u|)−2q−1 |F (u)| du = A1 + A2 . |u|>(2j r)−1/2
SMOOTHNESS OF PROJECTIONS
215
Let |H (u)| ≤ Cβ r −β by the choice of χ, and thus |F (u)| ≤ Cr −β . In particular, (1) |A2 | ≤ Cβ,q (2j r)−q−1 r −β . Since ψ has vanishing moments of all orders,
(1)
A1 = −
2(q−1) −q
F (#) (0) # u du + O F (2q−1) ∞ 2j r ψ 2j ru . #! |u|>(2j r)−1/2 #=0
It remains to estimate F (#) (u). Suppose λ and u are related via (4.16). By Lemma 4.5 (with L = ∞) and (2.6) in Definition 2.7, #−1 (#) H (u) ≤ Cβ,# a $ 3 −(#+p1 +···+pt ) r −β(#1 p1 +···+#t pt ) ≤ Cβ,# r −β(4#−3) . λ #,p$ t=0 #, $ p$
Therefore, by Leibnitz’s rule, (#)
F ≤ Cβ,# r −2β# r −β(4#−3) + r −β(4l+1) ≤ Cβ,# r −6β# . ∞ Thus 2(q−1) (1) A ≤ Cβ,q r −6β# 1
#=0
∞ (2j r)−1/2
−2q−#−1
2j ru
u# du + O r −6(2q−1)β (2j r)−q
−q . ≤ Cβ,q r −6(2q−1)β 2j r Since Nβ ≤ Cβ r −2β , we finally obtain (4.14) with a0 = 12. Proof of (2.7). Fix some γ with 0 < (1 + 2γ )(1 + a0 β) ≤ α. Let ρ be a smooth nonnegative function on the line so that supp(ρ) ⊂ J . Fix any q > 1+2γ . In view of (4.2), the definition of νλ , and Lemma 4.6, (4.18) ∞ νλ 22,γ ρ(λ) dλ −∞
≤
∞
22j γ
j =−∞ ∞
' ' j =−∞
≤ Cβ,γ
≤ Cβ,γ
' ' −∞
as claimed.
'
−∞
ψ2−j ∗ νλ (x) dνλ (x)ρ(λ) dλ
$
# 2j (1+2γ ) ψ 2j ((λ, ω1 ) − ((λ, ω2 ) ρ(λ) dλ dµ(ω1 ) dµ(ω2 )
∞
∞
−q 2j (1+2γ ) 1 + 2j d(ω1 , ω2 )1+a0 β dµ(ω1 ) dµ(ω2 )
dµ(ω1 ) dµ(ω2 ) ≤ Cβ,γ Ᏹα (µ) < ∞, (1+a0 β)(1+2γ ) d(ω 1 , ω2 ) '
216
PERES AND SCHLAG
4.2. The case of limited regularity. We now discuss the case where ( : J ×' → R only has a finite degree of smoothness as a mapping λ ( → (λ . Definition 4.7. Let ' and J be as in Definition 2.4. Suppose ( : J × ' → R is continuous, and let L be a positive integer and δ ∈ [0, 1). We write (λ ∈ C L,δ (J ) if the following conditions are satisfied: given any compact I ⊂ J , we assume that the bounds (2.2) hold for all # = 0, 1, . . . , L and that (4.19) L L d d δ dλL ((λ1 , ω) − dλL ((λ2 , ω) ≤ Cδ,I |λ1 − λ2 |
for all λ1 , λ2 ∈ I and ω ∈ '
with some suitable constant CI,δ . The following definition is similar to Definition 2.7, only here we also allow for the slightly more general case of Hölder continuity. Definition 4.8. Suppose (λ ∈ C L,δ (J ) as in Definition 4.7 and let β ∈ (0, 1). We say that J is an interval of transversality of order β for ( if there exists a constant Cβ so that for all λ1 , λ2 ∈ J and ω1 , ω2 ∈ ', the condition 3λ (ω1 , ω2 ) + 3λ (ω1 , ω2 ) ≤ Cβ d(ω1 , ω2 )β 2 1 implies that (4.20)
d 3λ ≥ Cβ d(ω1 , ω2 )β . dλ 1
In addition, we say that (λ is L, δ-regular on J if under the same condition, with some constants Cβ,# , Cβ,L,δ , (4.21) # d −β# for # = 1, 2, . . . , L, dλ# 3λ1 (ω1 , ω2 ) ≤ Cβ,# d(ω1 , ω2 ) L d dL δ −β(L+δ) . dλL 3λ1 (ω1 , ω2 ) − dλL 3λ2 (ω1 , ω2 ) ≤ Cβ,L,δ |λ1 − λ2 | d(ω1 , ω2 ) Under these conditions, we have the following analogue of Theorem 2.8. Since Lemma 4.3 and Proposition 4.4 only require that (λ is 1-regular on J , we restrict ourselves to a discussion of the Sobolev estimate (2.7) and the dimension bound (2.8) from Theorem 2.8. Notice that the following statements reduce to those estimates if L = ∞. Theorem 4.9. Let ( : J × ' → R be as in Definition 4.7. Assume that J is an interval of transversality of order β for some β ∈ [0, 1) in the sense of Definition 4.8,
217
SMOOTHNESS OF PROJECTIONS
and assume that (λ is L, δ-regular on J with L + δ > 1. Let µ be a finite positive measure on ' with finite α-energy for some α > 1. Then on any compact I ⊂ J , the family of measures νλ = µ ◦ (−1 λ satisfies (4.22) νλ 22,γ dλ ≤ Cγ Ᏹα (µ), I
provided 0 < 1 + 2γ < L + δ and 1 + 2γ ≤
α , 1 + a0 β
where a0 is some absolute constant. Moreover, if σ ∈ (0, 1], then (4.23)
dim λ ∈ J : dims (νλ ) ≤ σ ≤ 1 + σ − min L + δ,
α . 1 + a0 β
If 1 < σ ≤ α, then (4.24)
dim λ ∈ J : dims (νλ ) ≤ σ ≤ 1 − min L + δ,
σ − 1 −1 α −σ 1+ . 1 + a0 β L+δ
Finally, if σ ∈ (0, α − 3β], then dim λ ∈ J : dims (νλ ) < σ ≤ σ. As already mentioned above, the final statement holds since Proposition 4.4 requires only C 1 parameterizations. As in the C ∞ -case, we exploit the Sobolev bound in order to obtain the dimension estimates (4.23) and (4.24). This is accomplished by means of the following proposition, which is based on Lemma 3.2 (cf. Proposition 4.2). Proposition 4.10. Let ', J and ( : J × ' → R be as in Definition 4.7 for some positive integer L and δ ∈ [0, 1). Suppose νλ as in 4.7 satisfies I νλ 22,γ dλ < ∞ with some I ⊂ J and γ > − 21 . (i) If σ ∈ [0 ∧ 2γ , 1 ∧ (1 + 2γ )], then
dim λ ∈ I : dims (νλ ) ≤ σ ≤ σ − 2γ .
(4.25)
(ii) If σ ∈ [1, 1 + 2γ ), then (4.26)
σ − 1 −1 . dim λ ∈ I : dims (νλ ) ≤ σ ≤ 1 − (1 + 2γ − σ ) 1 + L+δ
Proof. Let {hj }∞ j =1 be defined as in (4.3) and set h0 = 1. We verify the hypotheses j of Lemma 3.2 with A = 2 and Hj = i=0 2i−j hi . Recall that the Littlewood-Paley ˆ ) = φ(ξ/2) ˆ ˆ ), where φˆ ≥ 0, φ(ξ ˆ ) = 1 for |ξ | ≤ 1, function ψ is given by ψ(ξ − φ(ξ
218
PERES AND SCHLAG
ˆ ) = 0 for |ξ | > 2; see the proof of Lemma 4.1. In particular, and φ(ξ ∞ $ # −j −1 j ˆ ) νλ (ξ )2 dξ 2 Hj (λ) = 1 + ξ − φ(ξ φˆ 2 −∞ $ j +1 j +1 # (4.27) = 1+ (λ (ω) − (λ (τ ) 2 φ 2 ' ' # $ − φ (λ (ω) − (λ (τ ) dµ(ω) dµ(τ ). ∞ Let χ ∈ (R) satisfy |φ | ≤ χ. (For example, set χ(x) = C −∞ max|y−z|≤1 |φ (z)| η(x − y) dy, where η ∈ C ∞ is a standard bump function.) Thus # $ H (λ) ≤ C + C2j +1 χ 2j +1 (λ (ω) − (λ (τ ) j ' ' × (λ (ω)| + |(λ (τ ) dµ(ω) dµ(τ ) (4.28) ∞ 2
≤ C +C χˆ 2−j −1 ξ νλ (ξ ) dξ ≤ C2j Hj (λ). −∞
To obtain the final inequality in (4.28), notice that
ˆ ∞ φˆ 2−j −1 ξ χˆ 2−j −1 ξ ≤ χ
if |ξ | ≤ 2j +1 .
Hence, (4.28) follows from (4.27) and the rapid decay of χ. ˆ In view of (4.4) and (4.5), inequality (4.25) follows from (3.8) in Lemma 3.2 with n = 1, A = B = 2, R = 21+2γ , and r = 2σ + as → 0+. Estimate (4.26) follows from Lemma 3.2(ii) with the same choice of parameters. It remains to verify the Hölder estimate (3.9). For simplicity we only carry this out for L = 1. The general case is only slightly more technically involved. Let (λ (ω, τ ) = (λ (ω) − (λ (τ ). Hj , as defined above, clearly satisfies (4.29) H (λ1 ) − H (λ2 ) j j j +1
j +1 φ 2 (λ (ω, τ ) ( (ω, τ ) − ( (ω, τ ) dµ(ω) dµ(τ ) ≤ C +2 λ1 λ2 1 ' '
(4.30) +2
j +1
' '
j +1
φ 2 (λ (ω, τ ) − φ 2j +1 (λ (ω, τ ) ( (ω, τ ) dµ(ω) dµ(τ ). λ2 1 2
In view of (4.19), the same argument as in the first part of the proof shows that the right-hand side of (4.29) is bounded by C2j Hj (λ1 )|λ1 − λ2 |δ . If |λ1 − λ2 | > 2−j , then the integral in (4.30) can be again bounded by
C2j Hj (λ1 ) + Hj (λ2 ) ≤ C2j (1+δ) |λ1 − λ2 |δ Hj (λ1 ) + Hj (λ2 ) .
SMOOTHNESS OF PROJECTIONS
219
It remains to estimate (4.30) if |λ1 −λ2 | ≤ 2−j . As above, one constructs χ1 ∈ such that |φ | ≤ χ1 and Cχ1 (x) ≥ max|x−y|≤1 χ1 (y) for all x ∈ R. The integral in (4.30) is therefore bounded by 1 j +1
2j φ 2 (λ (ω, τ ) + 2j +1 t (λ − (λ (ω, τ ) dt C2 1 2 1 ' ' 0
× (λ2 − (λ1 (ω, τ ) dµ(ω) dµ(τ )
χ1 2j +1 (λ1 (ω, τ ) dµ(ω) dµ(τ ) ≤ 22j |λ2 − λ1 |
(4.31)
= C2j |λ2 − λ1 |
' ' ∞
−∞
2
χ1 2−j −1 ξ νλ (ξ ) dξ
≤ C2 |λ2 − λ1 |Hj (λ1 ) ≤ C2j (1+δ) |λ2 − λ1 |δ Hj (λ1 ). 2j
To pass to (4.31), one uses 2j |(λ1 (ω, τ ) − (λ2 (ω, τ )| ≤ C2j |λ2 − λ1 | ≤ C and the properties of χ1 . Proof of Theorem 4.9. By Proposition 4.10, the dimension bounds (4.23) and (4.24) follow from (4.25) and (4.26), respectively, if (4.22) holds. (4.18) shows that this is the case as soon as the inequality (4.14) is valid for some (real) q > 1 + 2γ . In fact, inspection of the proof of Lemma 4.6 reveals that under the conditions of Definition 4.8, one has (4.14) for any q < L + δ. More precisely, the only term in the proof of Lemma 4.6 that does not involve the rapid decay of ψ is the O-part of (1) A1 (see (4.17)). Thus the proof carries over unchanged up to (4.16). Now fix a small
> 0 and some M > 0. Since F ∈ C L−1,δ , as can be seen from the definition of F , Definition 4.7, and Lemma 4.5, (4.32) A(1) =
F (u)ψ 2j ru du
=
|u|<(2j r)−1+
j
F (#) (0)
L−1
ψ 2 ru
#=0
#!
u# +
$ F (L−1) (tu) − F (L−1) (0)
1# 0
(1 − t)L−1 dt uL−1 du (L − 1)!
−M
(1) (1) O 2j r|u| |F (u)| du = A1 + A2 . ×
+
|u|>(2j r)−1+
(1)
Taking M = M( , q) large enough, one obtains |A2 | ≤ Cβ,q (2j r)−q r −β for any q > 0. Estimating the contribution of the Taylor polynomial in (4.32) again involves the rapid decay of ψ. To bound the error term in (4.32), one uses the Hölder condition on F (L−1) :
220
PERES AND SCHLAG
|u|<(2j r)−1+
ψ 2j ru
0
≤ C ,L r −CLβ
$ (1 − t)L−1 dt uL−1 du F (L−1) (tu) − F (L−1) (0) (L − 1)!
1#
−(1− )(L+δ) |u|L−1+δ du ≤ C ,L 2j r 1+Cβ .
|u|<(2j r)−1+
Since > 0 is arbitrary, (4.14) holds for any q < L + δ, as claimed. 5. Applications Classical Bernoulli convolutions. Recall that νλ is the distribution of 5.1. ∞ n . Here we show that the density of ν has fractional derivatives in L2 for ±λ λ 0 almost all λ ∈ ( 21 , 1), and we estimate the dimension of those λ so that νλ is singular with respect to Lebesgue measure. First we recall the definition of δ-transversality from [34]. Definition 5.1. Let δ > 0. We say that J ⊂ R is an interval of δ-transversality for the class of power series (5.1)
g(x) = 1 +
∞
bn x n ,
with bn ∈ {−1, 0, 1}
n=1
if g(x) < δ implies g (x) < −δ for any x ∈ J . Lemma 5.3 establishes the connection between Definitions 2.7 and 5.1. A useful criterion for checking δ-transversality was found in [34]. A power series h(x) is called a (∗)-function if for some k ≥ 1 and ak ∈ [−1, 1], h(x) = 1 −
k−1
x i + ak x k +
∞
xi .
i=k+1
i=1
In [39] Solomyak showed that among all convex combinations of series of the form (5.1), the power series with the smallest double zero must be a (∗)-function. The following lemma from [34] bypasses this fact and reduces the search for intervals of transversality to finding a suitable (∗)-function. Lemma 5.2. Suppose that a (∗)-function h satisfies h(x0 ) > δ
and
h (x0 ) < −δ
for some x0 ∈ (0, 1) and δ ∈ (0, 1). Then [0, x0 ] is an interval of δ-transversality for the class of power series (5.1). In [34] a particular (∗)-function was found that satisfies h(2−2/3 ) > 0.07 and h(2−2/3 ) < −0.09, so transversality in the sense of Definition 5.1 holds on [0, 2−2/3 ] by this lemma. On the other hand, in [39] Solomyak proved that there is a power
221
SMOOTHNESS OF PROJECTIONS
series of the form (5.1) with a double zero at roughly 0.68, whereas 2−2/3 0.63. We return to this issue below. It is easy to see that Bernoulli convolutions are a special case of the general results in Section let ' = {−1, +1}N be equipped with the product measure ∞ 1 2. Indeed, 1 µ = 0 ( 2 δ−1 + 2 δ1 ). For any distinct ω, τ ∈ ', we define |ω ∧ τ | = min{i ≥ 0 : ωi = τi }. Fix some interval J = (λ0 , λ1 ) ⊂ (0, 1) and define ( : J × ' → R via (λ (ω) = ∞ |ω∧τ | n . One n=0 ωn λ . The metric on ' (depending on J ) is given by d(ω, τ ) = λ1 1 α checks that Ᏹα (µ) < ∞ if and only if λ1 > 2 . Lemma 5.3. Suppose J = (λ0 , λ1 ) is an interval of δ-transversality. Then J is an 1+β interval of transversality of order β for ( if λ0 > λ1 . |ω ∧ω |
Proof. Since d(ω1 , ω2 ) = λ1 1 2 , 3λ (ω1 , ω2 ) = 2(λk /λk1 )g(λ) with k = |ω1 ∧ ω2 | and a power series g of the form (5.1). Therefore, " ! k−# k (#) λ λ # (#) 3 ≤ C# k g ∞ + k g ∞ λ λk1 λ1
−βlk −1 −#−1 ≤ C#,β λ1 , ≤ C# k # λ−# 0 (1 − λ1 ) + (1 − λ1 ) since λ1 < 1 and β > 0. Thus condition (2.6) in Definition 2.7 holds. To check (2.5), βk assume |3λ | ≤ δbβ λ1 , where the constant bβ ∈ (0, 1) is determined below. Then 2
λ0 λ1
k
implies |g(λ)| ≤ 21 δbβ (λ−1 0 λ1
|g(λ)| ≤ 2
1+β k ) .
λ λ1
k
βk
|g(λ)| ≤ δbβ λ1
Hence
3 ≥ 2 λ0 λ−1 k |g (λ)| − k |g(λ)| ≥ 2λβk δ − δbβ k λ−1 λ1+β k ≥ δλβk , λ 1 1 1 λ0 2λ0 0 1 provided bβ ≤ 21 [1+supk≥0 k/2λ0 (λ−1 0 λ1 2.7 holds with Cβ = δbβ .
1+β k −1 ) ] .
Thus condition (2.5) in Definition
Theorem 2.8 now implies the following theorem. Theorem 5.4. Suppose J = [λ0 , λ0 ] ⊂ ( 21 , 1) is an interval of δ-transversality for the power series (5.1). Then dims (νλ ) ≥ log 2/(− log λ) for a.e. λ ∈ J . Furthermore, log 2 dim λ ∈ J : νλ ∈ L2 (R) ≤ 2 − . − log λ0
222
PERES AND SCHLAG
Proof. Fix any small β > 0, and partition J into subintervals Ji = [λi , λi+1 ] for 1+β i = 0, . . . , m so that λi ≥ (1 + β)λi+1 . By the previous lemma, all Ji are intervals of transversality of order β for (. Notice that the metric on ' depends on i; in |ω1 ∧ω2 | . In particular, µ has finite αi -energy with respect to di if fact, di (ω1 , ω2 ) = λi+1 i > 21 . Theorem 2.8 therefore implies that λαi+1 dims (νλ ) ≥
log 2 (1 + a0 β)−1 − log λi+1
for a.e. λ ∈ Ji
and dim λ ∈ Ji : νλ ∈ L2 (R) ≤ 2 −
log 2 log 2 (1 + a0 β)−1 . (1 + a0 β)−1 ≤ 2 − − log λi+1 − log λ0
Letting β → 0+ finishes the proof. It is well known that for 0 < λ < 21 , the support of νλ is a Cantor set of dimension log 2/(− log λ). In fact, νλ is a Frostman measure on that set, which implies that dims (νλ ) = log 2/(− log λ) for 0 < λ < 21 . In [39] Solomyak showed that the first double zero for a power series of the form (5.1) lies in the interval [0.649, 0.683]. In particular, the previous theorem applies only up to some point in this interval. It seems natural to conjecture that dims (νλ ) ≥ log 2/(− log λ) for a.e. λ ∈ ( 21 , 1), but our methods do not yield this estimate. Nevertheless, one can show that νλ has some smoothness for a.e. λ ∈ ( 21 , 1). This follows from Theorem 5.4 by “thinning and convolving” (see [39] and [34]). As one expects, the number of derivatives tends to ∞ as λ → 1. Lemma 5.5. For any > 0, there exists γ = γ ( ) > 0 so that 1/√2 (5.2) νλ 22,γ dλ < ∞. (1/2)+
Furthermore, there exists some positive constant γ0 so that (5.3)
2−2
2−2
−k−1
−k
−k+1 νλ 22,2k γ + νλ 22,γ0 dλ < ∞ 0
for k = 1, 2, . . . .
Proof. As mentioned above, [0, λ1 ] is an interval of transversality for the power series (5.1) for some λ1 > 2−2/3 . Fix any λ0 ∈ ( 21 , 2−2/3 ]. Partitioning the interval [λ0 , λ1 ] as in the proof of Theorem 5.4, one obtains from (2.7) that λ1 1 1+2γ (5.4) νλ 22,γ dλ < ∞, provided λ0 > . 2 λ0 , we remove every third term from the original series. More preTo go beyond 2−2/3 / cisely, let (λ (ω) = 3 | n ωn λn and denote the distribution of this series by ν0λ . It was
223
SMOOTHNESS OF PROJECTIONS
shown in [39] and [34] that the class of power series (5.1) that satisfy either b3j +1 = 0 for all j ≥ 0 or √ b3j +2 for all j ≥ 0 have [0, λ3 ] as an interval of δ-transversality for some λ3 > 1/ 2. As for the full series, one easily deduces from Theorem 2.8 that λ3 1+2γ (5.5) > 2−2/3 . 0 νλ 22,γ dλ < ∞, provided λ2 λ2
0λ | and λ1 > 2−2/3 , (5.2) follows from (5.4) and (5.5). Moreover, Since | νλ | ≤ |ν we have shown that there exists some #0 ∈ (2−1/2 , 2−1/4 ) and a γ0 > 0 so that #0 νλ 22,γ0 dλ < ∞. Using νλ (ξ ) = ν* λ2 (ξ )ν* λ2 (λξ ), we conclude that #2 0
#0
1/2
#0
νλ 2,2γ0 dλ =
1/2
#0
≤C
#0
≤C
(5.7)
λ
1/2 dλ
λ
νλ (ξ )4 |ξ |4γ0 dξ
#20
#0
ν*2 (ξ )ν*2 (λξ )2 |ξ |4γ0 dξ
∞
−∞
#0
(5.6)
1/2 dλ
νλ (ξ )2 |ξ |2γ0 dξ dλ < ∞.
#20
Equation (5.6) follows from Cauchy-Schwarz and a change of variables. To pass from (5.6) to (5.7), we basically observe that the inner integral in (5.6) behaves like a sum over ξ ∈ Z. More precisely, we have νλ = νλ χ for some χ ∈ C ∞ (R) with compact support. Consequently, ∞ ∞ νλ (ξ )4 |ξ |4γ0 dξ = νλ ∗ χˆ 4 (ξ )|ξ |4γ0 dξ −∞
= = =
−∞ ∞ ∞ −∞
−∞ #+1 ∞
#∈Z #
1 0 #∈Z
≤
νλ (ξ − η)χ(η) ˆ dη
−∞ ∞ −∞
1 ! 0
#∈Z
≤C
0 ∞ −∞
|ξ |4γ0 dξ
νλ (ξ − η)χ(η) ˆ dη
4
νλ (ξ + # − η)χ(η) ˆ dη
∞ −∞
|ξ |4γ0 dξ 4
νλ (ξ + # − η)χ(η) ˆ dη
! 2 1 ≤χˆ 1 L
4
∞
#∈Z −∞
|ξ + #|4γ0 dξ "2
2 |ξ + #|2γ0
dξ
"2 2 2γ νλ (η) χ(ξ ˆ + # − η)|ξ + #| 0 dη dξ
νλ (η)2 (1 + |η|)2γ0 dη
2 < ∞.
224
PERES AND SCHLAG
Finally, since | νλ (ξ )| ≤ 1,
#0
1/2
#0
νλ 22,γ0 dλ =
1/2
#0 #0
≤C
ν*2 (ξ )ν*2 (λξ )2 |ξ |2γ0 dξ dλ λ
#0 #20
λ
νλ (ξ )2 |ξ |2γ0 dξ dλ < ∞.
We have shown that (5.3) holds for k = 1. The general case now follows easily by induction along the same lines. Corollary 5.6. Let S(λ0 ) = essinf λ∈[λ0 ,1] dims (νλ ). Then S(λ0 ) > 1 for any λ0 > 21 and S(λ0 ) → ∞ as λ0 → 1. Furthermore, for any λ0 > 21 , there exists
(λ0 ) > 0 such that dim λ ∈ (λ0 , 1) : νλ does not have L2 -density < 1 − (λ0 ). Proof. This follows immediately from the previous lemma and Proposition 4.2. As observed by Kahane [22], Erd˝os’s argument yields that (λ0 ) → 1 as λ0 → 1−. To conclude the discussion of classical Bernoulli convolutions, we prove that (2.3) fails. Lemma 5.7. Suppose 1/λ0 is a Pisot number. Then 2 lim sup |ξ | νλ (ξ ) dλ > 0 |ξ |→∞
J
for any interval J containing λ0 . Proof. Recall that lim sup|ξ |→∞ |ν* λ0 (ξ )| > 0; see [6]. Also, iξ ( (ω) e λ − eiξ (λ0 (ω) dµ(ω) νλ (ξ ) − ν* λ0 (ξ ) ≤ ' ≤ |ξ | (λ (ω) − (λ0 (ω) dµ(ω) ≤ C|ξ ||λ − λ0 |. '
Therefore, if C is a sufficiently large constant, νλ (ξ )2 dλ > 0 lim sup |ξ | |ξ |→∞
|λ−λ0 |≤(C|ξ |)−1
as claimed. 5.2. Asymmetric Bernoulli convolutions. In [35] Peres and Solomyak studied n p the distribution νλ of the sum ±λ where the signs are chosen independently p with probabilities (p, 1 − p). For p ∈ [ 13 , 23 ], they showed that νλ is absolutely
225
SMOOTHNESS OF PROJECTIONS
continuous for a.e. λ ∈ [p p (1 − p)1−p , 1) and has Lq -density for a.e. λ ∈ [(p q + (1 − p)q )1/(q−1) , 1), where q ∈ (1, 2]. The results from Section 2 give bounds on the dimension of the exceptional parameters for the absolutely continuous case and if q = 2. As in the symmetric case, ' = {−1, 1}N . Here µ is the product measure that assigns weights p and 1 − p to +1 and −1, respectively. Theorem 5.8. Let p ∈ [ 13 , 23 ]. For all λ0 ∈ (p 2 + (1 − p)2 , 1], there exists =
(λ0 ) such that p dim λ ∈ (λ0 , 1) : νλ does not have L2 -density < 1 − (λ0 ) (5.8) and such that for all λ0 ∈ (p p (1 − p)1−p , 1] p dim λ ∈ (λ0 , 1) : νλ is singular < 1 − (λ0 ). (5.9) Proof. Since [0, 0.649] is an interval of transversality for the power series (5.1), Theorem 2.8 implies, via Lemma 5.3 and a subdivision of the interval as in the proof of Theorem 5.4, that p dim λ ∈ (λ0 , 0.649) : νλ does not have L2 -density < 1 for any λ0 > p2 + (1 − p)2 . To extend this up to 1, we follow [35]. The measure p p n νλ ∗ νλ is the distribution of the random series ∞ 0 an λ , where an ∈ {−2, 0, 2} 2 2 with probabilities p , 2p(1 − p), (1 − p) . By Lemma 5.2 and an explicit choice of (∗)-function, Peres and Solomyak showed that [0, 0.5] is an interval of transversality for the power series (5.1) with bn ∈ {−4, −2, 0, 2, 4}. Since (0.649)2 > 11/27 = maxp∈[1/3,2/3] (p 4 +(2p(1−p))2 +(1−p)4 ), Theorem 2.8 yields in the same fashion as before that 1 p p 2 dim λ ∈ 0.649, √ : νλ ∗ νλ ∈ L (R) 2
p = dim λ ∈ (0.649)2 , 0.5 : ν ∈ L4 (R) < 1. Splitting the random power series
λ
±λn into odd and even indices, one obtains
p p p (ξ )ν* (λξ ). νλ (ξ ) = ν* λ2 λ2
(5.10) Therefore,
1 dim λ ∈ 0.649, √ 2
p 2 : νλ ∈ L (R) < 1.
p p p n The measure νλ ∗ νλ ∗ νλ is the distribution of the random series ∞ 0 an λ , where an ∈ {−3, −1, 1, 3} with probabilities p 3 , 3p 2 (1 − p), 3p(1 − p)2 , (1 − p)3 . It was shown in [35] that [0, 0.415] is an interval of transversality for the corresponding
226
PERES AND SCHLAG
power series. Since 2−3/2 > 245/729 = maxp∈[1/3,2/3] (p 6 +(3p2 (1−p))2 +(3p(1− p)2 )2 + (1 − p)6 ), Theorem 2.8 again implies that
p dim λ ∈ 2−3/2 , 0.415 : νλ ∈ L6 (R) < 1. Splitting the original series modulo 3, one concludes that 1 p 1/3 2 : νλ ∈ L (R) < 1. dim λ ∈ √ , (0.415) 2 By this procedure we have estimated the dimension of λ inside the interval [λ0 , (0.415)1/3 ] for which νλ does not have L2 -density. Since maxp∈[1/3,2/3] (p 2 + (1 − √ p)2 ) = 59 and (0.415)1/3 > 5/9, inequality (5.8) follows from this estimate by applying (5.10) repeatedly. By the strong law of large numbers, the lower pointwise dimension of µ is
πµ− (ω) = − p log p + (1 − p) log(1 − p) µ-a.e. It therefore follows immediately from Corollary 2.9 that p dim λ ∈ (λ0 , 0.649) : νλ is singular < 1 for any λ0 > exp[−(p log p + (1 − p) log(1 − p))]. Since (5.8) is a stronger assertion on the interval [0.649, 1], we finally obtain (5.9). 5.3. {0, 1, 3}-problem. This problem concerns the Hausdorff dimension of the set ∞ ai λi : ai ∈ {0, 1, 3} . (λ) = i=0
Pollicott and Simon [36] showed that for a.e. λ ∈ ( 41 , 13 ), we have dim (λ) = log 3/(− log λ), and Solomyak [39] established that for a.e. λ ∈ ( 13 , 25 ), the set (λ) has positive Lebesgue measure. In this section, we estimate the Hausdorff dimension of the exceptional set of λ with respect to these properties. It is again clear that the general results in Section 2 apply to this case. Indeed, let ' = {0, 1, 3}N be equipped with uniform product measure. For any interval J = (λ0 , λ1 ) ⊂ (0, 1), define the projections ( and the metric d on ' as in Section 5.1. In [39] Solomyak showed that the smallest double zero of the class of power series (5.11)
g(x) = a +
∞ n=1
b n λn
with |bn | ≤ 1, |a| ≥
1 3
occurs in the interval [0.418, 0.437]; see also Corollary 5.2 in [35]. In particular, a simple compactness argument shows that [0, 25 ] is an interval of δ-transversality for the class (5.11) for some δ > 0 in the sense that |g(λ)| < δ implies |g (λ)| > δ for any λ in that interval. As in Lemma 5.3, we establish the connection with Definition 2.7.
SMOOTHNESS OF PROJECTIONS
227
Lemma 5.9. Suppose J = (λ0 , λ1 ) is an interval of δ-transversality for the class of power series (5.11). Then J is an interval of transversality of order β for (, provided 1+β λ0 > λ1 . |ω1 ∧ω2 | g(λ), where g is of Proof. It suffices to notice that 3λ (ω1 , ω2 ) = 3(λλ−1 1 ) the form (5.11). Otherwise the proof is identical with that of Lemma 5.3.
Theorem 2.8 now easily leads to the following result. Let M(λ) = log 3/(− log λ). Theorem 5.10. For any λ0 ∈ (0.25, 0.4) (5.12) (5.13)
dim λ ∈ (0.25, λ0 ) : dim (λ) < M(λ) ≤ M(λ0 ), dim λ ∈ (λ0 , 0.4) : Ᏼ1 ( (λ)) = 0 ≤ 2 − M(λ0 ).
Proof. Suppose λ0 ∈ ( 41 , 13 ). Fix any β ∈ (0, 1). As in the proof of Theorem 5.4 & 1+β we write [ 41 , λ0 ] = m ≤ λi+1 . Applying i=0 Ji with Ji = [λi+1 , λi ] and (1 + β)λi Lemma 5.9 to each Ji , one concludes from Theorem 2.8 that dim λ ∈ Ji : dims (νλ ) < M(λi ) − 3β ≤ M(λi ) ≤ M(λ0 ). Equation (5.12) follows by letting β → 0+. If λ0 ∈ ( 13 , 25 ), one partitions [λ0 , 25 ] in the same fashion. Theorem 2.8 then implies dim{λ ∈ Ji : νλ is singular} ≤ 2 −
M(λ0 ) M(λi+1 ) ≤ 2− . 1 + a0 β 1 + a0 β
Equation (5.13) now follows again by letting β → 0+. 5.4. Sums of Cantor sets. Following [35] we consider homogeneous self-similar sets in R ∞ n Ꮿλ = sn (λ)λ : sn ∈ Ᏸ, λ ∈ J , n=0
where Ᏸ = {s1 , s2 , . . . , sm } is a set of C 1 (J )-functions and J = [λ0 , λ1 ] ⊂ (0, 1). As in [35], we assume the strong separation condition (5.14)
min
min dist si (λ) + λᏯλ , sj (λ) + λᏯλ > h0 > 0.
1≤i<j ≤m λ0 ≤λ≤λ1
This condition implies dim Ꮿλ = log m/(− log λ) < 1, so λ < m−1 . The usual middleα Cantor sets are clearly special cases of the Ꮿλ , and we refer the reader to [35] for more motivation and background. In this section, we are concerned with the problem of estimating the dimension of the sum of two Cantor sets, or more generally, of
228
PERES AND SCHLAG
the sum of any compact set with one of the Ꮿλ . For technical reasons, we split Ꮿλ into the disjoint union of its cylinder sets of a certain fixed length. More precisely, let M0 = maxi=j si − sj L∞ (J ) and M1 = maxi=j si − sj L∞ (J ) and define k0 = *2(M0 + M1 )λ1 / h0 (1 − λ1 )2 +. Then every Ꮿλ is the union of mk0 congruent subsets that are obtained by fixing the first k0 symbols sn . Slightly abusing notation, we use Ꮿλ to denote one of these subsets. Since we are only concerned about questions relating to dimension, this makes no difference. To see that the case at hand is again a special case of the results in Section 2, fix a compact set K ⊂ R. Define ' = K × {1, 2, . . . , m}N and denote a typical point of ' n by (x, ω). Let (λ (x, ω) = x + ∞ n=0 sωn (λ)λ be the projections ( : J ×' → R. Let ρ be a Frostman measure on K. Define the measure µ on ' to be µ = ρ ×µ0 , where µ0 is the uniform product measure on the sequence space {1, 2, . . . , m}N . Finally, we let
|ω∧τ | d (x, ω), (y, τ ) = |x − y| + λ1 be the metric on '. By definition, ( : J × ' → R is continuous and C 1 in λ. In the following lemma, we show that the conditions in Definition 2.7 are satisfied. Lemma 5.11. Suppose s1 , s2 , . . . , sm ∈ C L,δ (J ) for some positive integer L and some δ ∈ [0, 1). Then J = [λ0 , λ1 ] is an interval of transversality of order β for ( 1+β (in the sense of Definition 4.8), provided λ1 > λ0 . Proof. Fix some (x, ω) and (y, τ ) ∈ '. Let k = |ω ∧ τ | and r = d((x, ω), (y, τ )). Notice that k > k0 by our assumption above. Then
x − y + λk g(λ) 3λ (x, ω), (y, τ ) = , |x − y| + λk1 where |g(λ)| > h0 by (5.14). Now suppose
β |3λ | ≤ Cβ |x − y| + λk1 = Cβ r β , (5.15) where the constant Cβ is to be determined. In fact, Cβ can be chosen so that |x − y| ≤ C1 λk1 with some constant C1 that depends only on Ᏸ and λ1 . More precisely, suppose that |x − y| ≥
2λk1 2λk1 max di − dj L∞ (J ) = M0 . 1 − λ1 i=j 1 − λ1
Then |3λ | ≥
|x − y| − λk1 M0 (1 − λ1 )−1 |x − y| + λk1
≥ C,
which contradicts (5.15) if Cβ is sufficiently small. Consequently, r λk1 and |3λ | ≤ kβ −βk (#) Cβ λ1 . It is then easy to see that |3λ | ≤ Cβ,# λ1 for suitable constants Cβ,# and (L) all # = 1, 2, . . . , L and also that a Hölder condition of order δ on 3λ holds if δ > 0.
SMOOTHNESS OF PROJECTIONS
229
Hence, (4.21) in Definition 4.8 is satisfied. Moreover, λk k
3 = g(λ) + g (λ) ≥ C λ0 λ−1 k kh0 − M0 + M1 ≥ Cλkβ ≥ Cβ r β λ 1 1 2 r λ λ1 (1 − λ1 ) 1+β
if Cβ is sufficiently small, since λ0 > λ1
and k > k0 . Thus (4.20) also holds.
Under the strong separation assumption (5.14), Peres and Solomyak showed in [35] that dim(K + Ꮿλ ) = dim K + dim Ꮿλ for a.e. λ such that dim K + dim Ꮿλ < 1. Furthermore, they also showed that K + Ꮿλ has positive Lebesgue measure for a.e. λ such that dim K + dim Ꮿλ > 1. The following theorem estimates the dimension of the exceptional values of λ in these statements. This is a simple consequence of Theorem 4.9. Theorem 5.12. Suppose K ⊂ R is compact and the sets Ꮿλ satisfy the strong separation condition (5.14) on J = [λ0 , λ0 ] ⊂ (0, 1). Then (5.16) dim λ ∈ J : dim(K + Ꮿλ ) < dim K + dim Ꮿλ ≤ dim K + dim Ꮿλ0 . If the symbols s1 , . . . , sm are in C L,δ with L + δ > 1, then dim λ ∈ J : Ᏼ1 (K + Ꮿλ ) = 0 ≤ 2 − min(dim K + dim Ꮿλ0 , L + δ). (5.17) Proof. Fix some > 0 and let ρ be a Frostman measure on K with exponent dim K − . Let the measure µ on ' be defined as above. Fix β ∈ (0, 1) and partition & 1+β J= N n=1 Ji , where each Ji = [λi , λi+1 ] satisfies λi ≥ (1 + β)λi+1 . It is easy to see that µ has finite αi -energy with respect to the metric di ((x, ω), (y, τ )) = |x − y| + |ω∧τ | λi+1 on ' if αi < dim K − + dim Ꮿλi+1 . Indeed, setting σi = dim Ꮿλi+1 , which is i the same as λσi+1 m = 1, one obtains dρ(x)dρ(y) dµ(ω) dµ(τ )
Ᏹαi (µ) = |ω∧τ | αi ' ' K K |x − y| + λi+1 ∞ dρ(x) dρ(y)
αi k ≤ m−k K K λi+1 + |x − y| k=0 dρ(x) dρ(y) ≤C < ∞, α −σ K K |x − y| i i provided 0 < αi − σi < dim K − . Therefore, by Lemma 5.11 and (2.9), dim λ ∈ Ji : dim(K + Ꮿλ ) < dim K − + σi − 3β ≤ dim λ ∈ Ji : dims (νλ ) < dim K − + σi − 3β ≤ dim K − + dim Ꮿλi+1 ≤ dim K − + dim Ꮿλ0 , and (5.16) follows by letting β → 0+ and then → 0+. The second statement (5.17) follows in a similar fashion from (4.23).
230
PERES AND SCHLAG
6. Orthogonal projections onto planes in Euclidean space. It is easy to see that Theorem 2.8 covers various classical results on the projection of planar sets onto lines (see [28], [23], [24], [9], [31]). Let ' = R2 and ((θ, x) = projθ (x) = (x ·θ)θ, where we consider J ⊂ S 1 as an interval of angles. Here projθ denotes the projection onto the line {tθ : t ∈ R}. Note that 3θ (x, y) = cos((θ, x − y)) for any x = y so that Definition 2.7 holds with β = 0. Suppose E ⊂ R2 is a Borel set. Applying Theorem 2.8 to a suitable Frostman measure supported on a compact subset of E, we therefore obtain Kaufman’s [24] and Falconer’s [9] theorems in the plane, that is, dim θ ∈ S 1 : dim projθ (E) ≤ t ≤ t and dim θ ∈ S 1 : Ᏼ1 (projθ (E)) = 0 ≤ 2 − dim E. Kaufman and Mattila [25] proved that the first bound is optimal. As noted by Falconer [9], their example can be easily modified to show that the second bound is sharp, too; see also Falconer [11, Theorem 8.17]. In particular, this implies that the estimates stated in Theorem 2.8 are optimal if β = 0, α ≤ 2, and σ ≤ 1. Moreover, it is easy to see that in this case (2.7) holds with β = 0 and equality; see also Proposition 2.2. It turns out that the calculation from the proof of that proposition can be generalized to higher dimensions. G(d, k) denotes the Grassmann manifold of all k-planes in Rd passing through the origin. Recall that dim G(d, k) = k(d −k); see [31]. For any finite measure µ on Rd , we define its projections νπ onto any k-plane π through the origin as usual, that is, given any continuous f ,
f dνπ = f projπ (x) dµ(x). Proposition 6.1. Let d ≥ 2 and k be positive integers. Suppose µ is a compactly supported measure in Rd . Then (6.1) dim π ∈ G(d, k) : dims (νπ ) < σ ≤ k(d − k) + σ − dims (µ). Proof. Let α = dims (µ). Suppose (6.1) fails. By Frostman’s lemma, there exists a nonzero positive measure ρ so that for some > 0,
ρ B(π, r) ≤ r k(d−k)+σ −α+ for any ball B(π, r) ⊂ G(d, k), (6.2)
ρ {π ∈ G(d, k) : dims (µπ ) ≥ σ } = 0. The measurability hypothesis in Frostman’s lemma can be verified as in Proposition 4.4, and we skip the details. We may assume that supp(µ) ⊂ B(0, 1). Fix a ˆ Clearly, φ ∈ so that φ = 1 on B(0, 1). Hence, µ = φµ and thus µˆ = φˆ ∗ µ. 2 ˆ ˆ ∗ |µ| ˆ ˆ ≤ φ L1 |φ| ˆ 2 νπ (ξ ) = µ(proj π (ξ )) and supp(νπ ) = projπ (supp(µ)). Also, |µ| by Cauchy-Schwarz. Therefore,
SMOOTHNESS OF PROJECTIONS
(6.3)
G(d,k) π
231
νπ (ξ )2 (1 + |ξ |)σ −k d Ᏼk (ξ ) dρ(π)
≤C
G(d,k) π
(6.4)
≤ CN
(6.5)
≤ Cα,d
Rd
φˆ ∗ |µ| ˆ 2 (ξ )(1 + |ξ |)σ −k d Ᏼk (ξ ) dρ(π)
2 µ(η) ˆ
Rd
G(d,k) π
(1 + |ξ − η|)−N (1 + |ξ |)σ −k d Ᏼk (ξ ) dρ(π) dη
2 µ(η) ˆ (1 + |η|)α−d− dη < ∞.
Here N is a sufficiently large integer. To pass from (6.4) to (6.5), first notice that for any r < |η|, the set {π ∈ G(d, k) : dist(π, η) ≤ r} is an (r|η|−1 )-neighborhood of a smooth manifold of codimension d − k in G(d, k) and is therefore contained in the union of no more than (|η|r −1 )(k−1)(d−k) balls of radius r|η|−1 . Thus by (6.2),
ρ {π ∈ G(d, k) : dist(π, η) ≤ r} ≤ C(|η|r −1 )k−σ +α−d− . Considering the contributions from the dyadic shells {ξ ∈ π : 2j ≤ |ξ − η| ≤ 2j +1 } separately, the inner integrals in (6.4) can therefore be estimated as (1 + |ξ − η|)−N (1 + |ξ |)σ −k d Ᏼk (ξ ) dρ(π) G(d,k) π
≤ C(1 + |η|)σ −k ρ {π ∈ G(d, k) : dist(π, η) ≤ 1}
2−j (N −k) (1 + |η|)σ −k ρ π ∈ G(d, k) : dist(π, η) ≤ 2j +C 1≤ 2j <(1/2)|η|
+C
2j ≥(1/2)|η|
2−j N
|ξ |≤2j +1
(1 + |ξ |)σ −k d Ᏼk (ξ ) ≤ C(1 + |η|)α−d− ,
if N is sufficiently large, as claimed. However, the finiteness of (6.3) contradicts (6.2), and the proposition follows. This proposition has the following simple corollary. Corollary 6.2. Suppose E ⊂ Rd is a Borel set with dim E > 2k. Then for a.e. π ∈ G(d, k) the projection of E onto π has nonempty interior. More precisely, dim π ∈ G(d, k) : projπ (E) has empty interior ≤ k(d − k) + 2k − dim E. Proof. Let µ be a Frostman probability measure on E, that is, µ is supported on a compact subset of E and dims (µ) = dim E. If νπ = projπ (µ) has a continuous density, its support has nonempty interior and therefore so does projπ (E). Applying Cauchy-Schwarz to Rk |fˆ(ξ )| dξ shows that for any f ∈ L2 (Rk ), f 2,(1/2)(k+ ) < ∞ ,⇒ f ∈ C Rk ;
232
PERES AND SCHLAG
see Stein [42, Chapter 5] for more precise estimates. The corollary therefore follows by letting σ → 2k+ in (6.1). The following example, described to us by Mattila, shows that there are Borel sets E of dimension two in R3 so that dim{θ ∈ S 2 : projθ (E) contains an interval} ≤ 1. Take a Besicovitch set A in R2 , that is, a set of measure zero that contains a line in every direction; see [31, Theorem 18.11]. Define E = R2 \ ∪r∈Q2 (r + A) as a subset of the (x1 , x2 )-coordinate plane of R3 . It clearly has dimension two, but its projections onto lines that do not lie in the (x1 , x2 )-plane do not contain intervals. If k > 1, we do not know of an example of a 2k-dimensional set E with the property that γd,k ({π ∈ G(d, k) : projπ (E) has empty interior}) > 0, where γd,k denotes Haar measure on G(d, k). It seems unlikely that the Sobolev embedding theorem would give the sharp bound for k > 1. On the other hand, it is easy to see that Corollary 6.2 allows us to recover Falconer’s theorem [8] about the nonexistence of Besicovitch (d, k)-sets for k > d/2. Recall that a (d, k)-set is defined to be a set of measure zero that contains some translate of every k-plane. Suppose that A ⊂ Rd is such a set for some choice of d ≥ 3 and k. Then E = Rd \ ∪r∈Qd (A + r) has dimension d but the projection of E onto any (d − k)-dimensional plane has empty interior. This contradicts Corollary 6.2 if d > 2(d − k), as claimed. However, Bourgain [3] proved a much stronger result, namely, that (d, k)-sets do not exist for k + 2k−1 ≥ d. It therefore seems that the case k > 1 in Corollary 6.2 is not optimal. 7. The general projection theorems in higher dimensions. In this section we obtain a higher-dimensional version of Theorem 4.9. The proof closely resembles that of Theorem 4.9 in Section 4. In particular, we need to prove higher-dimensional versions of various technical lemmas. We first introduce some terminology. Definition 7.1. Let (', d) be a compact metric space, let Q ⊂ Rn be an open connected set, and let ( : Q × ' → Rm be a continuous map with n ≥ m. The length of a multi-index η = (η1 , η2 , . . . , ηn ) ∈ Nn is defined to be |η| = η1 + · · · + ηn and ∂ η = ∂ |η| /(∂λ1 )η1 · · · · · (∂λn )ηn where λ = (λ1 , . . . , λn ). Let L be a positive integer and δ ∈ [0, 1). We assume that for any compact Q ⊂ Q and any multi-index η = (η1 , η2 , . . . , ηn ) ∈ Nn , there exist constants Cη,Q and Cδ,Q such that η ∂ ((λ1 , ω) ≤ Cη,Q , provided |η| ≤ L, and sup ∂ η ((λ1 , ω) − ∂ η ((λ2 , ω) ≤ Cδ,Q |λ1 − λ2 |δ
|η|=L
for all λ1 , λ2 ∈ Q and ω ∈ '. We denote these conditions by (λ ∈ C L,δ (Q). Given any finite measure µ on ', let νλ = µ ◦ (−1 λ for all λ ∈ Q, where (λ (·) = ((λ, ·). The α-energy of µ is Ᏹα (µ) = ' ' dµ(ω1 )dµ(ω2 )/d(ω1 , ω2 )α .
SMOOTHNESS OF PROJECTIONS
233
As in the previous sections, we need a transversality condition. Definition 7.2. Let (λ ∈ C L,δ be as in Definition 7.1 for some positive integer L and some δ ∈ [0, 1). Let 3λ (ω1 , ω2 ) =
((λ, ω1 ) − ((λ, ω2 ) . d(ω1 , ω2 )
For any β ∈ [0, 1), we say that Q is a region of transversality of order β for ( if there exists a constant Cβ so that for all λ1 , λ2 ∈ Q and distinct ω1 , ω2 ∈ ', the condition |3λ1 (ω1 , ω2 )| + |3λ2 (ω1 , ω2 )| ≤ Cβ d(ω1 , ω2 )β implies that $ # (7.1) det D3λ1 (D3λ1 )t ≥ Cβ2 d(ω1 , ω2 )2β . In addition, we say that (λ is L, δ-regular on Q if under the same condition, for some constants Cβ,η , Cβ,L,δ , (7.2) |∂ η 3λ1 (ω1 , ω2 )| ≤ Cβ,η d(ω1 , ω2 )−β|η| for any nonzero multi-index η of length |η| ≤ L sup ∂ η 3λ1 (ω1 , ω2 ) − ∂ η 3λ2 (ω1 , ω2 ) ≤ Cβ,L,δ |λ1 − λ2 |δ d(ω1 , ω2 )−β(L+δ) .
|η|=L
D3λ above denotes the Jacobi matrix of all first derivatives of 3λ with respect to λ (the number of rows of D3λ is m). Our main result in this section is the following theorem. Theorem 7.3. Let (λ ∈ C L,δ be as in Definition 7.1 with L + δ > 1. Assume that Q ⊂ Rn is a region of transversality of order β for ( and that (λ is L, δ-regular on Q in the sense of Definition 7.2. Suppose µ is a finite positive measure on ' with m finite α-energy for some α > 0, and let νλ = µ ◦ (−1 λ be the projection of µ onto R under ( for any λ ∈ Q. Then for any compact Q ⊂ Q, (7.3) νλ 22,γ dλ ≤ Cγ Ᏹα (µ), Q
provided 0 < (m+2γ )(1+a0 β) ≤ α and 2γ < L+δ −1. Moreover, if σ ∈ (0, α ∧m], then α dim λ ∈ Q : dims (νλ ) ≤ σ ≤ n + σ − min (7.4) ,L+δ ; 1 + a0 β if σ ∈ (m, α], then (7.5)
dim{λ ∈ Q : dims (νλ ) ≤ σ } ≤ n − min
σ − m −1 α ,L+δ −σ 1+ . 1 + a0 β L+δ
234
PERES AND SCHLAG
Finally, if σ ∈ (0, α − a0 β], then (7.6)
dim{λ ∈ Q : dims (νλ ) < σ } ≤ n + σ − m.
The constant a0 depends only on m, n, and δ. It is easy to check that projections onto planes in Rd satisfy Definition 7.2 with Q ⊂ G(d, k) being a coordinate chart, (λ the Euclidean projection from ' = supp(µ) onto the plane given by λ, and L = ∞, β = 0. Hence, Proposition 6.1 follows from Theorem 7.3 with n = k(d − k) and m = k. If β = 0 and L = ∞, Proposition 2.2 shows that (7.3) is sharp. Remark 7.4. Equation (7.3) implies the following statement from the introduction: Let µ be a Borel probability measure on Rd with correlation dimension greater than m + 2γ . Then for a prevalent set of C 1 maps f : Rd → Rm , the image of µ under f has a density with at least γ fractional derivatives in L2 (Rm ). Let A = {f ∈ C 1 (Rd , Rm ) : µ ◦ f −1 have a density in L2,γ (Rm )}, and let ᏸ be Lebesgue measure on the matrices M d×m (R) with entries in [− 21 , 21 ] (we identify linear maps Rd → Rm with their matrices). We need to check that ᏸ(f + A) = 0 for any f ∈ C 1 (Rd , Rm ). Fix some f ∈ C 1 (Rd , Rm ). To apply Theorem 7.3, we may assume that supp(µ) = ' is compact. Let Q = [−1, 1]dm ⊂ M d×m (R) and (λ (ω) = −f (ω) + λω. Since Ᏹm+2γ (µ) < ∞ by assumption, one has νλ 22,γ d ᏸ(λ) < ∞, as desired, provided transversality holds with β = 0 and regularity holds with L = ∞. Regularity is obvious. To check transversality, notice that Dλ 3λ (ω, τ ) · λ0 = λ0 ·
ω−τ |ω − τ |
for all λ0 ∈ M d×m (R). In particular, the linear map λ0 ( → Dλ 3λ (ω, τ ) · λ0 is surjective and therefore rank[Dλ 3λ (ω, τ )] = m. In fact, estimate (7.1) holds with β = 0. The following proposition is a higher-dimensional analogue of Proposition 4.10. Proposition 7.5. Let (λ ∈ C L,δ (Q) be as in Definition 7.1 for some positive integer L and δ ∈ [0, 1). Suppose the family of measure {νλ }λ∈Q on Rm , as given by Definition 7.1, satisfies Q νλ 22,γ dλ < ∞ with some Q ⊂ Q and γ > −m/2. (i) If σ ∈ [0 ∧ 2γ , m ∧ (m + 2γ )], then dim λ ∈ Q : dims (νλ ) ≤ σ ≤ n + σ − (m + 2γ ). (7.7) (ii) If σ ∈ [m, m + 2γ ), then (7.8)
σ − m −1 . dim λ ∈ Q : dims (νλ ) ≤ σ ≤ n − (m + 2γ − σ ) 1 + L+δ
SMOOTHNESS OF PROJECTIONS
235
Proof. Let ψ be the Littlewood-Paley function from Lemma 4.1. For any j = 1, 2, . . . , we define 2 −j m hj (λ) = 2 ψ) 2−j (ξ ) νλ (ξ ) dξ =
' '
# $
ψ 2j ((λ, ω1 ) − ((λ, ω2 ) dµ(ω1 ) dµ(ω2 ),
j whereas h0 = 1. Let Hj = i=0 2m(j −i) hi . As in the proof of Proposition 4.10, we check that these functions satisfy the hypotheses of Lemma 3.2 with A = 2 and B = 2m . In fact, up to replacing 2 with 2m in various places, the argument remains the same and we omit the details. In view of (4.4) and (4.5), the proposition now follows from Lemma 3.1 with r = 2σ + and R = 22γ +m as → 0+. Roughly speaking, transversality means that 3λ can be inverted where it is small. However, one needs to interpret this statement more carefully in the higher-dimensional case. First, we are dealing with maps from Rn → Rm with n ≥ m. Second, even if m = n, we cannot expect to break up a region of transversality into disjoint cubes on which either 3λ is large or invertible as in the one-dimensional case; see Lemma 4.3. Third, any decomposition of Q has to provide estimates involving the parameters in Definition 7.2. The following quantitative version of the inverse function theorem is used for that purpose. It is undoubtedly well known, but we provide a proof for the sake of completeness. Lemma 7.6. Suppose f : U ⊂ Rm → Rm is C 1 and that Df (x) − I ≤ 21 on B(x0 , r) ⊂ U for some r > 0. Then f : B(x0 , r/3) → f (B(x0 , r/3)) is a diffeomorphism, and f (B(x0 , ρ)) ⊃ B(f (x0 ), ρ/2) for any 0 < ρ ≤ r. Proof. Without loss of generality, x0 = f (x0 ) = 0. Define Ty (x) = y − f (x) + x and fix some ρ ∈ (0, r]. We claim that Ty : B(0, ρ) → B(0, ρ) is a contraction, provided |y| ≤ ρ/2. Indeed, if x, x ∈ B(0, ρ), then 1 Ty (x) ≤ 1 ρ + I − Df (sx) ds |x| ≤ ρ, 2 0
Ty (x) − Ty (x ) = x − x − f (x) − f (x )
1
≤ 0
I − Df x + s(x − x ) ds |x − x | ≤ 1 |x − x |. 2
Hence, for any y ∈ B(0, ρ/2), there is a unique x ∈ B(0, ρ) so that f (x) = y. In particular, f is one-to-one on B(0, ρ) if f (B(0, ρ)) ⊂ B(0, r/2). Since |f (x)| ≤ 0
1
Df (sx) ds |x| ≤ 3 |x|, 2
236
PERES AND SCHLAG
it suffices to take ρ = r/3, as claimed. The following lemma is a precise statement to the effect that 3λ is invertible where it is small. This is true only locally. However, we can cover the set of small values of 3λ by a collection of balls {Bj } whose size and number can be controlled and on each of which 3λ can be inverted in the following sense: there exist m coordinate directions (depending on Bj ) so that the restriction of 3λ to the intersection of Bj with any hyperplane in those directions is invertible. Moreover, we have uniform bounds on the derivatives of the inverse. Lemma 7.7. Let (λ ∈ C 1,δ (Q) for some 0 < δ < 1. Suppose that Q ⊂ Rn is a region of transversality of order β for ( and that (λ is 1, δ-regular on Q; see Definition 7.2. Let U ⊂ Q be open and bounded with dist(U, ∂Q) > 0. Then there exist constants C0 , C1 , C2 depending only on β, n, m, U, Q so that for any distinct ω1 , ω2 ∈ ', there exist λ1 , . . . , λN ∈ Q with the following properties. Writing r = d(ω1 , ω2 ) for simplicity, we have N
% B λj , C 1 r b 0 β , λ ∈ U : |3λ | ≤ C0 r b0 β ⊂
(7.9)
j =1
(7.10)
N %
B λj , 2C1 r b0 β ⊂ λ ∈ Q : |3λ | ≤ Cβ r β ,
j =1
where b0 = (2 + m)δ −1 and N ≤ C2 r −b0 n . Cβ is the constant from Definition 7.2. Moreover, on each ball Bj = B(λj , 2C1 r b0 β ), we can select n − m coordinate directions 1 ≤ i1 < · · · < in−m ≤ n so that for any choice of y$ = (y1 , . . . , yn−m ), Fy$ = 3λ λ ∈ Bj : λi1 = y1 , . . . , λin−m = yn−m is a diffeomorphism satisfying | det(DFy$ )−1 | ≤ C2 r −β and
−1 (7.11) DFy$ ≤ C2 r −mβ . Proof. Fix distinct ω1 , ω2 ∈ '. By Definition 7.2, we can choose C0 and C3 so that E = λ ∈ U : |3λ | ≤ C0 r (2+m)β/δ satisfies (7.12)
E = λ ∈ Q : dist(E, λ) ≤ C3 r (2+m)β/δ ⊂ λ ∈ Q : |3λ | ≤ Cβ r β .
Now fix any λ0 ∈ E. By Definition 7.2, (7.1), and the Cauchy-Binet formula, there exist m coordinate directions, say, the first m, so that the determinant of the first m×m-minor of D3λ0 is bounded below by Cr β . Moreover, by (7.12) and the Hölder bounds on D3λ in (7.2), this continues to hold on all of B(λ0 , C4 r (2+m)β/δ ) for an
237
SMOOTHNESS OF PROJECTIONS
appropriate choice of C4 . Let sm (λ) ≥ sm−1 (λ) ≥ · · · ≥ s1 (λ) > 0 be the singular values of the first m × m-minor of D3λ . We have shown that sm (λ) ≤ Cr −β ,
(sm sm−1 · · · · · s1 )(λ) ≥ Cr β
on B(λ0 , C4 r b0 β ). Thus s1 ≥ Cr mβ on B(λ0 , C4 r b0 β ). As above, let
Fy$ (λ1 , . . . , λm ) = 3λ λ ∈ B λ0 , C4 r b0 β : λm+1 = y1 , . . . , λn = yn−m for any choice of y$ = (y1 , . . . , yn−m ). By the lower bound on the singular values, (DFy$ )−1 ≤ Cr −mβ on the domain of Fy$ . In particular, with λ = (λ1 , . . . , λm ),
DFy$ λ −1 ◦ DFy$ λ − Im×m ≤ Cr −(m+1+δ)β λ − λ δ . 0
0
In view of Lemma 7.6, there exists a constant C1 so that Fy$ is a diffeomorphism on
λ ∈ B λ0 , 2C1 r b0 β : λm+1 = y1 , . . . , λn = yn−m
for any choice of y$. The lemma follows by applying Wiener’s covering lemma to a covering of E with balls of size 13 C1 r b0 β . We now prove the higher-dimensional version of Proposition 4.10. Proposition 7.8. Let (λ ∈ C 1,δ (Q) for some 0 < δ < 1. Suppose that Q ⊂ Rn is a region of transversality of order β for ( and that (λ is 1, δ-regular on Q; see Definition 7.2. Assume that µ has finite α-energy for some α ∈ (0, m]. Then (7.13)
Ᏼσ +n−m {λ ∈ Q : dims (νλ ) < σ } = 0
for any σ ∈ (0, α − a0 β] with a constant a0 depending only on m, n, and δ. Proof. It suffices to prove (7.13) for any subcube Q ⊂ Q with dist(Q , ∂Q) > 0. As in the proof of Proposition 4.4, we assume that (7.13) fails. By Frostman’s lemma there exists a nonzero measure ρ on Q so that ρ(V ) ≤ | diam(V )|σ +n−m for all Borel sets V ⊂ Q and (7.14)
ρ λ ∈ Q : dims (νλ ) > σ − 0 = 0
for an appropriate choice of 0 > 0. Fix any small > 0 and let r = d(ω1 , ω2 ). In view of Lemma 7.7,
238
PERES AND SCHLAG
Q
(7.15)
(7.16)
(7.17)
dνλ (x) dνλ (y) dρ(λ) σ − Rm Rm |x − y| dµ(ω1 ) dµ(ω2 ) = σ − dρ(λ) Q ' ' ((λ, ω1 ) − ((λ, ω2 ) −(σ − ) dµ(ω1 )dµ(ω2 ) χ[|3λ |≥C0 r b0 β ] 3λ (ω1 , ω2 ) dρ(λ) = d(ω1 , ω2 )σ − ' ' Q −(σ − ) dµ(ω1 )dµ(ω2 ) dρ(λ) + χ[|3λ |≤C0 r b0 β ] 3λ (ω1 , ω2 ) d(ω1 , ω2 )σ − ' ' Q dµ(ω1 )dµ(ω2 ) ≤C (σ − )(1+b0 β) ' ' D(ω1 , ω2 ) N dρ(λ) dµ(ω1 ) dµ(ω2 ) +C σ − d(ω , ω )σ − b β 1 2 ' ' j =1 B (λj ,C1 r 0 ) |3λ | dµ(ω1 ) dµ(ω2 ) ≤C . d(ω1 , ω2 )α ' '
To pass from (7.15) to (7.16), first notice that
∞
dρ(λ) ≤ 2i(σ − ) ρ λ ∈ B λj , C1 r b0 β : |3λ | ≤ 2−i . σ − b β |3 | λ B (λj ,C1 r 0 ) i=−∞
To bound the measure on the right-hand side, we use (7.11) and (7.2). In fact, those estimates imply that the set {λ ∈ B(λj , C1 r b0 β ) : |3λ | ≤ 2−i } is the union of (1 + r b0 β /2−i r β )n−m many balls of diameter no larger than C min(r b0 β , 2−i r −mβ ). Thus dρ(λ) σ − b β B (λj ,C1 r 0 ) |3λ | ≤C
∞
σ +n−m
n−m ˜ 1 + r (b0 −1)β 2i 2i(σ − ) min r b0 β , 2−i r −mβ r −Cβ ,
i=−∞
˜ where C˜ = C(m, n, σ, δ). Inserting this bound into (7.16) and using the bound on N given by Lemma 7.7 implies (7.17) for an appropriate choice of a0 . However, (7.17) contradicts (7.14) if < 0 . In view of Propositions 7.5 and 7.8, Theorem 7.3 follows once the Sobolev estimate (7.3) is established. As in Section 4, this requires a technical lemma about averages involving Littlewood-Paley functions; see Lemmas 4.6 and 7.10. The proof of that lemma uses the following calculus fact. Lemma 7.9. Suppose h = (h1 , . . . , hm ) : U ⊂ Rm → Rm is a C L -diffeomorphism on the open set U , and let H denote the inverse of h. For any nonzero multi-index
239
SMOOTHNESS OF PROJECTIONS
η = (η1 , . . . , ηm ) ∈ Nm with |η| ≤ L, (7.18) ∂ η H =
|η|−1 t=0
−|η|−p
Dh ◦ H
p σ ,...,σ ,#$ p 1
p
1 σ
v$η,p σ1 , . . . , σp , #$ ∂ i h#i ◦ H, i=1
where the third sum runs over multi-indices σi ∈ Nm and #$ ∈ {1, 2, . . . , m}p . If |η| ≥ 2, $ ∈ Rm vanish unless |η| ≥ |σi | ≥ 2 for all i and the vectors v$η,p (σ1 , . . . , σp , #) p i=1 |σi | ≤ 2(|η| − 1). Proof. If η = (1, 0, . . . , 0) = e1 , we clearly have ∂ η H = (Dh ◦ H )e1 . Hence the lemma holds with |η| = 1. For the inductive step, we differentiate (7.18): ∂ η+e1 H =
|η|−1 t=0
p σ ,...,σ ,#$ p 1
−|η|−p−1 e ∂ 1 Dh ◦ H v$η,p σ1 , . . . , σp , #$ −(|η|+ p) Dh ◦ H ×
−|η|−p v$η,p σ1 , . . . , σp , #$ ∂ σi h#i ◦ H + Dh ◦ H
p 1 i=1
×
p j =1
p 1
∂ σ i h # i ◦ H ∂ e1 ∂ σ j h # j
i=1,i=j
Since for any multi-index σ
−1 ∂ e1 ∂ σ h ◦ H = Dh ◦ H
◦H .
aτ ∂ τ h ◦ H
|τ |=|σ |+1
Rm ,
(7.18) holds for all multi-indices η with with suitable constant vectors a τ ∈ p |η | = |η| + 1. Also notice that i=1 |σi | has increased by at most two, as claimed. Lemma 7.10. Let (λ ∈ C L,δ (Q) for some positive integer L and some δ ∈ [0, 1). Suppose that Q ⊂ Rn is a region of transversality of order β for ( and that (λ is L, δ-regular on Q; see Definition 7.2. Let ρ ∈ C ∞ (Rn ) be supported inside Q. If ψ is the Littlewood-Paley function in Rm from Lemma 4.1, then for any distinct ω1 , ω2 ∈ ', any integer j , and any 0 ≤ q < L + δ + m − 1, $
−q j# (7.19) ρ(λ)ψ 2 ((λ, ω1 ) − ((λ, ω2 ) dλ ≤ Cq 1 + 2j d(ω1 , ω2 )1+a0 β , where Cq depends only on q, m, n, ρ, β, L, δ and a0 only depends on m, n, and L, δ. Proof. Fix distinct ω1 , ω2 ∈ ' and j, q as above. We may assume that 2j r > 1, where r = d(ω1 , ω2 ). Let φ ∈ C ∞ be nonnegative with φ = 1 on [−1, 1] and supp(φ) ⊂ [−2, 2]. Then
240
PERES AND SCHLAG
(7.20)
$ # ρ(λ)ψ 2j ((λ, ω1 ) − ((λ, ω2 ) dλ
= ρ(λ)ψ 2j r3λ (ω1 , ω2 ) φ Cβ−1 r −β 3λ dλ
$
# + ρ(λ)ψ 2j r3λ (ω1 , ω2 ) 1 − φ Cβ−1 r −β 3λ dλ.
Here Cβ is the constant from Definition 7.2. By the rapid decay of ψ,
#
$ ρ(λ)ψ 2j r3λ (ω1 , ω2 ) 1 − φ C −1 r −β 3λ dλ β
−q
−q ≤ Cq,β |ρ(λ)| 1 + 2j r 1+β . dλ ≤ Cq,β 1 + 2j r 1+β Thus it suffices to estimate the first integral in (7.20). To this end we introduce a partition of unity subordinate to the cover given by Lemma 7.7 with U = {Q : ρ > 0}. More precisely, by the standard construction of a partition of unity, there exist χj ∈ C ∞ (Rn ) so that supp(χj ) ⊂ Bj = B(λj , 2C1 r b0 β ) for j = 1, . . . , N and so that N b0 β }; see (7.9). Moreover, i=1 χj = 1 on {λ ∈ supp(ρ) : |3λ | ≤ C0 r (7.21) sup ∂ η χj ∞ ≤ Cη r −|η|b0 β j
for all multi-indices η ∈ Nn . By (7.10), χj (λ) = χj (λ)φ(Cβ−1 r −β 3λ ) for all j . We can therefore write the first integral in (7.20) as
ρ(λ)ψ 2j r3λ φ Cβ−1 r −β 3λ dλ =
N i=1
+
ρ(λ)χj (λ)ψ 2j r3λ dλ 2
ρ(λ) 1 −
N i=1
=
N
3
χj (λ) ψ 2j r3λ φ Cβ−1 r −β 3λ dλ
A(j ) + B.
i=1
To estimate the second term B, notice that |3λ | ≥ C0 r b0 β on the support of the integrand. The rapid decay of ψ therefore implies |B| ≤ Cq (1 + 2j r 1+b0 β )−q . For simplicity, we assume henceforth that i = 1. By Lemma 7.7, the map λ ( → 3(λ ,λ ) = Fλ (λ ) is a diffeomorphism on {λ ∈ Rm : (λ , λ ) ∈ Bj }, where λ = (λ1 , . . . , λm ) and λ = (λm+1 , . . . , λn ) (possibly after a permutation of the coordinates). Let Hλ denote the inverse of Fλ . Assuming as we may that ρ(λ) = ρ1 (λ )ρ2 (λ ), we let
Gλ (u) = ρ1 Hλ (u) χ1 Hλ (u) det DHλ (u) .
SMOOTHNESS OF PROJECTIONS
241
Clearly, Gλ ∈ C L−1,δ . Since q < L + δ + m − 1, there exists an integer M so that q −m−δ < M ≤ L−1. Fix > 0 such that (M +m+δ)(1− ) > q and rewrite A(1) as A(1) =
Rn−m
=
Rn−m
ρ2 λ
ρ2 λ
Rm
Gλ (u)ψ 2j ru du dλ
ψ 2j ru
|u|<(2j r)−1+
1 ∂ η G (0) η $ # u λ × uη + du dλ ∂ η Gλ (tu) − ∂ η Gλ (0) (1 − t)M dt η! η! 0 +
|η|≤M
Rn−m (1)
ρ2 λ
|η|=M
|u|>(2j r)−1+
O
−2q−m Gλ (u)du dλ
2j r|u|
(1)
= A 1 + A2 . (1)
By Lemma 7.7, Gλ ∞ ≤ Cr −β . In particular, |A2 | ≤ Cq (2j r)−q−m r −β . Since ψ has vanishing moments of all orders, (1)
(7.22)
∂ η Gλ (0) η u du dλ ψ 2j ru η! Rn−m |u|>(2j r)−1+ |η|≤M " ! η j −(M+m+δ)(1− ) dλ . ρ2 λ O sup ∂ Gλ δ 2 r + ρ2 λ
A1 = −
Rn−m
|η|=M
C
It remains to estimate ∂ η Gλ (u), which can be done uniformly in λ . Fix any λ and let u = Fλ (λ ). By Lemma 7.9, (7.2) in Definition 7.2, and Lemma 7.7, |η|−1 η ∂ Hλ (u) ≤ Cη t=0
p σ ,...,σ ,#$ p 1
−1 |η|+p −2β(|η|−1) r v$η,p σ1 , . . . , σp , #$ DFλ
≤ Cη r −2|η|(m+1)β . Therefore, by Leibnitz’s rule and (7.21), ∂ η Gλ ∞ ≤ Cη r −C|η|β where C = C(b0 , (1) m). Plugging this into (7.22) and exploiting the rapid decay of ψ, we obtain |A1 | ≤ Cq r −a0 Mβ (2j r)−(M+m+δ)(1− ) . Since N ≤ Cr −b0 nβ and (M +m+δ)(1− ) > q, this finally yields (7.19). Proof of (7.3). Fix some γ with 0 < (m + 2γ )(1 + a0 β) ≤ α and 2γ < L + δ − 1. Let ρ be a smooth nonnegative function on Rn so that supp(ρ) ⊂ Q and fix some
242
PERES AND SCHLAG
q ∈ (m + 2γ , L + δ + m − 1). In view of (4.2), the definition of νλ , and Lemma 7.10, νλ 22,γ ρ(λ) dλ
∞
22j γ
j =−∞ ∞
≤
2
' ' j =−∞ ∞
≤ Cγ
' ' −∞
∞ −∞
(ψ2−j ∗ νλ )(x) dνλ (x)ρ(λ) dλ $ j# dµ(ω1 )dµ(ω2 ) ) − ((λ, ω ) ρ(λ) dλ ((λ, ω ψ 2 1 2
j (m+2γ )
−q 2j (m+2γ ) 1 + 2j d(ω1 , ω2 )1+a0 β dµ(ω1 )dµ(ω2 )
≤ Cγ
'
dµ(ω1 ) dµ(ω2 ) ≤ C Ᏹα (µ) < ∞, (1+a0 β)(m+2γ ) ' d(ω1 , ω2 )
as claimed. 8. Applications of the higher-dimensional projection theorems 8.1. Self-similar sets in the complex plane. In [41], Solomyak considered sets of the form ∞ S n (8.1) an λ : a n ∈ S , Cλ = n=0
where S = {s1 , . . . , s# } is a set of digits and λ ∈ D = {z ∈ C : |z| < 1} (λn of course denotes complex powers). Let R = S − S, ∞ n bn λ : b n ∈ R ᏮR =
n=0
ᏹR = λ ∈ D : there exists a nonzero f ∈ ᏮR with f (λ) = 0
0 R = λ ∈ D : there exists a nonzero f ∈ ᏮR with f (λ) = f (λ) = 0 . ᏹ If λ ∈ D \ ᏹR , it is easy to see that CλS satisfies a strong separation condition as in (5.14) and that therefore dim CλS = log #/(− log |λ|). The case λ ∈ ᏹR was studied in 0 R . It was shown there that for any [41], assuming the transversality condition λ ∈ ᏹ 0 R satisfies fixed choice of digit set, S ⊂ C a.e. λ ∈ ᏹR \ ᏹ dim CλS =
log # − log |λ|
if |λ| < #−1/2
and
Ᏼ2 CλS > 0
if |λ| > #−1/2 .
It is straightforward to see that Theorem 7.3 applies to this case. First we verify that conditions (7.2) and (7.1) in Definition 7.2 are satisfied with L = ∞. This is very similar to Lemma 5.3.
243
SMOOTHNESS OF PROJECTIONS
To specialize from Section 7, let 0 < R1 < R2 < 1 and 0 R , R1 < |λ| < R2 Q ⊂ λ ∈ D : λ ∈ ᏹ 0 is relatively closed in D since ᏮR be a fixed closed cube. As observed in [41], ᏹ is a normal family. With this choice of Q, let ' = {1, . . . , l}N be equipped with |ω∧τ | and uniform product measure µ. Finally, let ((λ, ω) = the metric d(ω, τ ) = R2 ∞ n n=0 sωn λ . 1+β
Lemma 8.1. Let R2 > R1 . Then Q is a region of transversality of order 2β for ( in the sense of Definition 7.2. Proof. There exists δ = δ(Q) ∈ (0, 1) such that for any g ∈ ᏮR and λ ∈ Q, |g(λ)| < δ
implies
|g (λ)| > δ.
This follows since ᏮR is a normal family. Clearly, 3λ (ω, τ ) = (λR2−1 )|ω∧τ | g(λ) with some g = gω,τ ∈ ᏮR . Fix ω, τ ∈ ' and let k = |ω ∧ τ |. Therefore, " ! η ∂ 3λ ≤ Cη k |η| |λ|k−|η| R −k g ∞ + |λ|k R −k sup ∂ σ g ∞ 2
−|η|
≤ Cη k |η| R1
−β|η|k
≤ Cη,β R2
2
|σ |=|η|
(1 − R2 )−1 + (1 − R2 )−|η|−1
,
since R2 < 1 and β > 0. Thus condition (7.2) in Definition 7.2 holds with L = ∞. βk To check (7.1), assume |3λ | ≤ δbβ R2 , where the constant bβ ∈ (0, 1) is determined below. Then
k
k βk R1 R2−1 |g(λ)| ≤ |λ|R2−1 |g(λ)| ≤ δbβ R2 implies |g(λ)| ≤ δbβ (R1−1 R2
1+β k ) .
Hence, (here 3λ denotes the complex derivative)
3 ≥ R1 R −1 k g (λ) − R −1 k|g(λ)| λ 1 2
βk 1+β k ≥ R2 δ − δbβ R1−1 k R1−1 R2 βk
δR2 βk ≥ ≥ δbβ R2 2 if bβ ≤ 21 [1 + supk≥0 (k/R1 )(R1−1 R2 )k ]−1 . Since the Cauchy-Riemann equations imply that det[D3λ ] = |3λ |2 , condition (7.1) in Definition 7.2 holds with 2β instead of β and Cβ = (δbβ )2 . 1+β
244
PERES AND SCHLAG
Theorem 8.2. Let 0 < r < R < 1. Then 0 R : r < |λ| < R, Ᏼ2 C S = 0 ≤ 4 − log # , dim λ ∈ ᏹR \ ᏹ λ − log r log # 0 R : r < |λ| < R, dim C S < log # . ≤ dim λ ∈ ᏹR \ ᏹ λ − log R − log |λ| Proof. Fix any β ∈ (0, 1). There exist countably many cubes Qj ⊂ C so that % 0 R : r < |λ| < R = λ ∈ D\ᏹ Qj j 1+β
and Qj ⊂ {rj < |λ| < Rj } with rj > Rj . By the previous lemma, each Qj is a region of transversality of order 2β for (. It is easy to check that with the choice of |ω∧τ | , we have Ᏹα (µ) < ∞ if Rjα l > 1. Since supp(νλ ) = CλS , metric dj (ω, τ ) = Rj Theorem 8.2 now follows by applying Theorem 7.3 and letting β → 0+. 8.2. Intersections of sets with spheres. Let S0 ⊂ Rd , d ≥ 2, be a closed, strictly convex C ∞ -hypersurface surrounding the origin (the differentiability assumption will be relaxed later). Thus S0 = {ρ(u)u : u ∈ S d−1 } for some function ρ : S d−1 → (0, ∞) in C ∞ . Define f ∈ C ∞ (Rd \ {0}) to be f (x) = |x|/ρ(x/|x|). In particular, f is homogeneous of degree one and {x ∈ Rd : f (x) = r} = rS0 . Moreover, by the assumption of strict convexity, there exists κ > 0 so that for any x = 0 and any tangent vector v of S0 , (8.2)
2 4 2 5 D f (x)v, v ≥ κ |v| , |x|
whereas D 2 f (x)x = 0 by homogeneity. For any Borel set E ⊂ Rd and σ < 1, let Bσ = Bσ (E) = x ∈ Rd : dim f (−x + E) < σ . For σ = 1, we set
B1 = B1 (E) = x ∈ Rd : Ᏼ1 f (−x + E) = 0 .
Our main result in this section concerns the dimension of Bσ . Theorem 8.3. Let S0 be a strictly convex, C ∞ -hypersurface surrounding the origin and define f and Bσ as above. If E ⊂ Rd is a Borel set and σ ∈ [0, 1 ∧ dim E], then (8.3)
dim Bσ ≤ d + σ − max(1, dim E).
Furthermore, for any hyperplane H ⊂ Rd , (8.4)
dim(Bσ ∩ H ) ≤ d − 1 + σ − max(1, dim E).
SMOOTHNESS OF PROJECTIONS
245
Proof. By classical theorems of Marstrand and Mattila, (8.3) follows from (8.4) (see Theorem 10.10 in [31]). For the sake of illustration, however, we begin by showing directly that (8.3) follows from Theorem 7.3. In fact, we first consider the case S0 = S d−1 because it is easy to check the conditions in Definition 7.2 for Euclidean spheres. In this special case, it is more convenient to work with the square of the Euclidean norm, that is, f (x) = |x|2 . Define ( : Rd ×E → R to be ((λ, x) = |x −λ|2 , and let 3λ (x, y) = |x −y|−1 [|x −λ|2 −|y −λ|2 ] = /x −y/|x −y|, x +y −2λ0. Therefore, |∂ η 3λ (x, y)| ≤ 2 for any nonzero multi-index η, and condition (7.2) holds with β = 0. Moreover, |D3λ (x, y)| = 2|x −y/|x −y|| = 2, and (7.1) is also satisfied with β = 0. Hence, Definition 7.2 holds with Q = Rd , ' ⊂ E an arbitrary compact set, L = ∞, and β = 0. For any > 0, we can choose a Frostman measure µ on ' with exponent > dim E − . Since f (−λ + E) ⊃ supp(νλ ), (8.3) with S0 = S d−1 follows from Theorem 7.3 as → 0+. For general S0 as described above, fix a small > 0. Dilating, if necessary, we have dim(E \Q0 ) > dim E − for any cube Q0 of side length 2. Now fix a cube Q ⊂ Rd of side length 1. To bound dim(Bσ ∩Q), we may therefore assume that E ⊂ 2rQ\(rQ) for some r ≥ 2. Thus, if |v| = 1, x ∈ E, and λ ∈ Q, (8.2) implies that 4 5 ∇f (x − λ), v = 0 ,⇒ D 2 f (x − λ)v ≥ ρ (8.5) for some ρ > 0 depending on r and κ. Now define (λ : E → R to be (λ (x) = f (x −λ), and let 3λ (x, y) = |x −y|−1 [f (x −λ)−f (y −λ)]. It is clear that condition (7.2) holds with β = 0. To check (7.1), notice that # $ 4 5 3λ (x, y) = |x − y|−1 f (x − λ) − f (y − λ) = ∇f (x − λ), v + O(|x − y|), where v = x − y/|x − y|. Moreover, (8.6)
D3λ (x, y) = −D 2 f (x − λ)v + O(|x − y|),
where D denotes differentiation with respect to λ. In view of (8.5), there exists some small constant C0 such that 3λ (x, y) < C0 ρ ,⇒ |D3λ (x, y)| > ρ , (8.7) 2 provided λ ∈ Q and x, y ∈ E satisfy |x − y| < C0 ρ. Now let ' = E ∩ B(x0 , C0 ρ/2) be equipped with the usual Euclidean metric where x0 is arbitrary. We have shown that Definition 7.2 is satisfied with β = 0 with this choice of ', Q, and (λ . Therefore, selecting x0 so that dim(E ∩B(x0 , C0 ρ)) > dim E − and letting µ be an appropriate Frostman measure, we conclude that (8.3) follows from (7.4) and (7.6) with L = ∞, β = 0, n = d, m = 1, and α = dim E − as → 0+. The proof of (8.4) proceeds by induction in the dimension d. First we need to verify that transversality holds in all dimensions. Fix a hyperplane H ⊂ Rd . Without loss of generality, 0 ∈ H . Let e1 , . . . , ed−1 be a basis in H , and define (λ (x) =
246
PERES AND SCHLAG
−1 d−1 , and f (x − d−1 i=1 λi ei ), 3λ (x, y) = |x − y| [(λ (x) − (λ (y)] for any λ ∈ R d distinct x, y ∈ R . Clearly, with p = x − λ1 e1 − · · · − λd−1 ed−1 , 4 5 (8.8) 3λ (x, y) = ∇f (p), v + O(|x − y|), 5 4 5 4 (8.9) D3λ (x, y) = − D 2 f (p)v, e1 , . . . , D 2 f (p)v, ed−1 + O(|x − y|), where v = x − y/|x − y| and D denotes differentiation with respect to λ. We claim that for any v ∈ S d−1 and any z ∈ H , (8.10)
d−1 4 2 5 5 D f (z)v, ei 2 > 0. ∇f (z), v = 0 ,⇒
4
i=1
In fact, we show that (8.10) can only fail if z ∈ H . More precisely, suppose the sum in (8.10) vanishes for some z ∈ Rd , z = 0, and some v ∈ S d−1 . Without loss of generality, f (z) = 1. Then the functionals u ( → /D 2 f (z)u, ei 0, i = 1, . . . , d − 1, on the tangent space to S at z are linearly dependent. Thus d−1
(8.11)
ξi D 2 f (z)ei ∇f (z)
i=1
for some (ξ1 , . . . , ξd−1 ) = (0, . . . , 0). Let e = d−1 i=1 ξi ei . Clearly, e = 0. It is easy to verify that (8.11) implies that /D 2 f (z)e, e0 = 0. First notice that by strict convexity of S, there is a parameterization z+te = r(t)γ (t) for small t where γ is a smooth curve in S with γ (0) = z and r is a smooth positive function with r(0) = 1. Let ∂e f = /∇f, e0 be the directional derivative. Since ∂e f is homogeneous of degree zero, 7 6 5 4 2 d2 d d = ∂e f (γ (t)) = D 2 f (z)e, γ (0) = 0, D f (z)e, e = 2 f (z + te) t=0 t=0 dt dt dt by (8.11). In view of (8.2) and D 2 f (z)z = 0, we conclude that e z and thus z ∈ H , as claimed. To check transversality, we need the following quantitative version of (8.10) (cf. (8.5)). For any ρ0 > 0 there exists ρ1 > 0, depending on κ from (8.2), so that for any z ∈ Rd with f (z) = 1 and dist(z, H ) > ρ0 , we have (8.12)
d−1 4 4 2 5 5 ∇f (z), v < ρ1 ,⇒ D f (z)v, ei 2 > ρ1 . i=1
Indeed, by (8.10) and compactness, (8.13)
min
min
z∈S0 :dist(z,H )>ρ0 v∈Tz S0 :|v|=1
d−1 4 2 5 D f (z)v, ei 2 > ρ2 > 0 i=1
(Tz S0 denotes the tangent space to S0 at z). If |/∇f (z), v0| < ρ1 , then there exists v˜ ∈ Tz S0 , |v| ˜ = 1, so that |v − v| ˜ ≤ Cρ1 /|∇f (z)| ≤ Cρ1 and (8.12) follows from (8.13), provided ρ1 is small compared to ρ2 .
SMOOTHNESS OF PROJECTIONS
247
We start by proving (8.4) for d = 2. Fix some e ∈ S 1 , and let # ⊂ R2 be the line through the origin in direction e. If dim(E ∩ #) = dim E, then dim r > 0 : (x + rS) ∩ E = ∅ = dim E for all x ∈ # and (8.4) holds with H = #. Otherwise, for any > 0, there exists ρ0 > 0 and R > 0 so that dim(E ∩ B(0, R) \ #ρ0 ) > dim E − , where #ρ0 denotes a ρ0 -neighborhood of #. Now fix such > 0 and ρ0 > 0, and let J ⊂ # be an interval of length one. In view of (8.8), (8.9), and (8.12), there exist ρ > 0 and C0 (depending on J , κ, ρ0 , R) so that 3λ (x, y) < ρ ,⇒ D3λ (x, y)2 > ρ for any x, y ∈ B(0, R)\#ρ0 satisfying |x −y| < C0 ρ. Let ' = E ∩B(x0 , C0 ρ/2)\#ρ0 , where x0 ∈ B(0, R) is selected so that dim ' > dim E − . We conclude that with this choice of ' and (, J satisfies Definition 2.7 with L = ∞ and β = 0. Since f (− d−1 i=0 λi ei + E) ⊃ supp(νλ ), (8.4) therefore follows from Theorem 2.8 with α = dim E − as → 0+. Now suppose that (8.4) holds up to d − 1 for some d ≥ 3, and fix a hyperplane H ⊂ Rd passing through the origin. If dim(E ∩H ) = dim E, then (8.4) follows from (8.3) in dimension d −1. (Recall that the latter estimate follows from (8.4) in dimension d − 1.) Otherwise, for any > 0, there exists ρ0 > 0 so that dim(E \ H ρ0 ) > dim E − , where H ρ0 denotes a ρ0 -neighborhood of H . By the same argument we gave for d = 2, we conclude from (8.8), (8.9), and (8.12) that Definition 7.2 holds for any bounded cube Q ⊂ H and a suitable choice of '. Hence, (8.4) follows from Theorem 7.3 with L = ∞, β = 0, n = d − 1, m = 1, and α = dim E − as → 0+. Suppose E ⊂ # ⊂ Rd for some line # is a set of dimension 1 but with Ᏼ1 (E) = 0. Then it is easy to see that for all x ∈ Rd , (x +rS d−1 )∩E = ∅ for a.e. r. This implies that there are no nontrivial bounds for σ = 1 and dim(E) ≤ 1 in Theorem 8.3. In [10] Falconer showed that for any Borel set A ⊂ Rd with dim A > (d + 1)/2, the distance set D(A) = {|x − y| : x, y ∈ A} has positive measure. In [32] Mattila and Sjölin showed that under the same assumption, D(A) has nonempty interior. On the other hand, for d = 2 Bourgain [4] proved that D(A) has positive measure if dim A > 13 9 and Wolff [46] improved this to dim A > 43 . (8.3) implies the following sharpened version of Falconer’s theorem concerning “pinned” distance sets Dx (A) = {|x − y| : y ∈ A}. Corollary 8.4. Suppose A ⊂ Rd with dim A > (d + 1)/2. Then (d + 1) dim x ∈ Rd : Dx (A) has Lebesgue measure zero ≤ d + 1 − dim A < . 2 Proof. Apply (8.3) to f (x) = |x| and E = A.
248
PERES AND SCHLAG
Similarly, we obtain a sufficient condition for pinned distance sets to contain an interval. Using (7.4) with σ > 2 in the proof of Theorem 8.3 yields the following statement: Let A ⊂ Rd with d ≥ 3 satisfy dim A > (d + 2)/2. Then (d + 2) dim x ∈ Rd : Dx (A) has empty interior ≤ d + 2 − dim A < . 2 We now discuss the case where S0 ∈ C k,δ for some positive integer k and δ ∈ [0, 1). Theorem 8.5. Let S0 be a strictly convex C k,δ -hypersurface surrounding the origin with k + δ > 2, and define f and Bσ as above. If E ⊂ Rd is a Borel set and σ ∈ [0, 1 ∧ dim E], then
(8.14) dim Bσ ≤ d + σ − max 1, min(dim E, k + δ − 1) . Furthermore, for any hyperplane H ⊂ Rd , (8.15)
dim(Bσ ∩ H ) ≤ d − 1 + σ − max 1, min(dim E, k + δ − 1) .
In particular, if k + δ ≥ dim E + 1, then (8.3) and (8.4) remain valid. Proof. Since f ∈ C L,δ by assumption, (8.6), (8.8), and (8.9) show that 3λ ∈ in the sense of Definition 7.1. The proof of Theorem 8.3 therefore yields that 3λ as defined above satisfies Definition 7.2 with L = k −1 and β = 0, and as before, Theorem 7.3 implies (8.14) and (8.15). C k−1,δ
Finally, we give some examples to show that Theorem 8.3 can fail under weaker assumptions. Suppose S0 = ∂[−1, 1]d is the boundary of the unit cube. Let K ⊂ [0, 1] be a Cantor set and let E = K × [−1, 1]d−1 . It is easy to see that Bσ (E) ⊃ [3, ∞) × [−1, 1]d−1 for any dim K < σ ≤ 1. The bound in (8.3) would therefore require that d ≤ 1 + σ − dim K, which is clearly false. A similar example shows that (8.3) fails if S0 contains an arbitrarily small piece of a hyperplane. In view of (8.4), one might ask whether (8.16)
dim(B1 ∩ π) ≤ 1 + dim π − dim E
for any plane π ⊂ Rd . It turns out that this fails if dim π ≤ d − 2, even if S0 = S d−1 . For simplicity, let d = 3 and let (8.17) E = (0, r cos θ, r sin θ) : θ ∈ [0, 2π], r ∈ K , where K ⊂ [0, 1] is a Cantor set. Clearly, for any x ∈ R, dim{r > 0 : (x, 0, 0)+rS 2 ∈ E} = dim K, whereas (8.16) would require that dim(B1 ∩ {(x, 0, 0) : x ∈ R}) ≤ 2 − dim E = 1 − dim K. 9. Questions and comments 9.1. Bernoulli convolutions. (i) For which intervals J ⊂ ( 21 , 1) is ∞?
J
νλ 42 dλ <
SMOOTHNESS OF PROJECTIONS
249
(ii) Is the distribution of the zeros of (λ (ω)−(λ (τ ) on an interval of transversality absolutely continuous? 9.2. Geometric problems. (i) According to Corollary 6.2, a.e. projection of a Borel set E ⊂ Rd with dim E > 2k has nonempty interior. Can 2k be replaced by a smaller threshold if k > 1? (ii) Suppose that µ is a finite planar measure satisfying the Frostman condition µ(B(x, r)) ≤ r α for all x ∈ R2 and r > 0. Let νθ be the projection of µ onto a line in direction θ . For p ∈ [1, 2/(2 − α)), the Sobolev embedding theorem implies that the projected measures νθ have a density in Lp for a.e. θ ∈ S 1 . Can the threshold 2/(2 − α) be increased? (iii) Is Theorem 8.3 optimal? Heuristically speaking, estimates for distance sets are related to the Erd˝os problem of bounding the minimum number gd (n) of different distances determined by n points in R d . It is known that C −1 n4/5− < g2 (n) < 8 Cn/ log n; see [5] and the discussion in [1, Chapter 12]. The analogy between cardinality and dimension suggests that dim(E) > 5/4 ⇒ Ᏼ1 (D(E)) > 0. Generally speaking, however, it is presently unclear how to make such a deduction. Nevertheless, T. Wolff [44], [45] has successfully applied sophisticated methods from combinatorial geometry to obtain Hausdorff dimension estimates. In particular, he showed that a Borel set in the plane that contains a circle of every radius must have Hausdorff dimension 2. For pinned distance sets as in Corollary 8.4, the relevant combinatorial problem seems to be estimating the minimum number of distinct distances between n points and one “typical” point. More precisely, it is known (see [1, Corollary 12.11]) that for any n points p1 , p2 , . . . , pn ∈ R2 , one has card{|pi − pj | : j = 1, 2, . . . , n} ≥ Cn3/4 for at least n/2 choices of i. (This appears to be the best-known bound. In particular, the authors of [5] point out that their method does not show that there exists a point from which there are n4/5− different distances.) The analogy between cardinality and dimension alluded to above then suggests that dim(E) > 43 ⇒ Ᏼ1 (Dx (E)) > 0 for most points x ∈ E. Note that Corollary 8.4 requires dim(E) > 23 for the same conclusion in R2 . In the recent preprint [46], Wolff shows that dim(E) > 43 implies that Ᏼ1 (D(E)) > 0. His methods are in the spirit of [4] and involve Bochner-Riesz-type arguments. (iv) Does Theorem 8.3 hold if S0 ∈ C k,δ with k +δ < dim E +1 (cf. Theorem 8.5)? In particular, is Theorem 8.3 valid if S0 is only assumed to be a continuous, strictly convex hypersurface surrounding the origin? References [1] [2] [3]
P. Agarwal and J. Pach, Combinatorial Geometry, Wiley-Intersci. Ser. Discrete Math. Optim., Wiley, New York, 1995. J. C. Alexander and J. A. Yorke, Fat Baker’s transformations, Ergodic Theory Dynam. Systems 4 (1984), 1–23. J. Bourgain, Besicovitch type maximal operators and applications to Fourier analysis, Geom. Funct. Anal. 1 (1991), 147–187.
250 [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
[14] [15] [16] [17] [18]
[19] [20] [21] [22]
[23] [24] [25] [26] [27]
[28] [29]
PERES AND SCHLAG , Hausdorff dimension and distance sets, Israel J. Math. 87 (1994), 193–201. F. Chung, E. Szemerédi, and W. Trotter, The number of different distances determined by a set of points in the Euclidean plane, Discrete Comput. Geom. 7 (1992), 1–11. ˝ On a family of symmetric Bernoulli convolutions, Amer. J. Math. 61 (1939), 974–976. P. Erdos, , On the smoothness properties of a family of Bernoulli convolutions, Amer. J. Math. 62 (1940), 180–186. K. J. Falconer, Continuity properties of k-plane integrals and Besicovitch sets, Math. Proc. Cambridge Philos. Soc. 87 (1980), 221–226. , Hausdorff dimension and the exceptional set of projections, Mathematika 29 (1982), 109–115. , On the Hausdorff dimension of distance sets, Mathematika 32 (1985), 206–212. , The Geometry of Fractal Sets, Cambridge Tracts in Math. 85, Cambridge University Press, Cambridge, 1986. , The Hausdorff dimension of self-affine fractals, Math. Proc. Camb. Philos. Soc. 103 (1988), 339–350. M. Frazier, B. Jawerth, and G. Weiss, Littlewood-Paley Theory and the Study of Function Spaces, CBMS Regional Conf. Ser. in Math. 79, Conf. Board Math. Sci., Washington, Amer. Math. Soc., Providence, 1991. D. Funaro, Polynomial Approximation of Differential Equations, Lecture Notes in Phys. New Ser. m Monogr. 8, Springer-Verlag, Berlin, 1992. A. M. Garsia, Arithmetic properties of Bernoulli convolutions, Trans. Amer. Math. Soc. 102 (1962), 409–432. X. Hu and S. J. Taylor, Fractal properties of products and projections of measures in Rd , Math. Proc. Cambridge Philos. Soc. 115 (1994), 527–544. B. Hunt and V. Y. Kaloshin, How projections affect the dimension spectrum of fractal measures, Nonlinearity 10 (1997), 1031–1046. B. Hunt, T. D. Sauer, and J. A. Yorke, Prevalence: A translation-invariant “almost every” for infinite-dimensional spaces, Bull. Amer. Math. Soc. (N.S.) 27 (1992), 217–238; Addendum, Bull. Amer. Math. Soc. (N.S.) 28 (1993), 306–307. J. E. Hutchinson, Fractals and self-similarity, Indiana Univ. Math. J. 30 (1981), 713–747. M. Jarvenpää, On the upper Minkowski dimension, the packing dimension, and orthogonal projections, Ann. Acad. Sci. Fenn. Ser. A I Math. Dissertationes, 1994. B. Jessen and A. Wintner, Distribution functions and the Riemann zeta function, Trans. Amer. Math. Soc. 38 (1935), 48–88. J. P. Kahane, “Sur la distribution de certaines séries aléatoires” in Colloque de Théorie des Nombres (Univ. Bordeaux, Bordeaux, 1969), Bull. Soc. Math. France 25 (1971), 119– 122. R. Kaufman, On Hausdorff dimension of projections, Mathematika 15 (1968), 153–155. , An exceptional set for Hausdorff dimension, Mathematika 16 (1969), 57–58. R. Kaufman and P. Mattila, Hausdorff dimension and exceptional sets of linear transformations, Ann. Acad. Sci. Fenn. Ser. A I Math. 1 (1975), 387–392. M. Keane, M. Smorodinsky, and B. Solomyak, On the morphology of γ -expansions with deleted digits, Trans. Amer. Math. Soc. 347 (1995), 955–966. F. Ledrappier, “On the dimension of some graphs” in Symbolic Dynamics and Its Applications (New Haven, Conn., 1991), Contemp. Math. 135, Amer. Math. Soc., Providence, 1992, 285–293. J. Marstrand, Some fundamental geometrical properties of plane sets of fractional dimensions, Proc. London Math. Soc. (3) 4 (1954), 257–302. P. Mattila, Hausdorff dimension, orthogonal projections and intersections with planes, Ann. Acad. Sci. Fenn. Ser. A I Math. 1 (1975), 227–244.
SMOOTHNESS OF PROJECTIONS [30] [31] [32] [33]
[34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46]
251
, Orthogonal projections, Riesz capacities and Minkowski content. Indiana Univ. Math. J. 39 (1990), 185–198. , Geometry of Sets and Measures in Euclidean Spaces, Cambridge Stud. Adv. Math. 44, Cambridge University Press, Cambridge, 1995. P. Mattila and P. Sjölin, Regularity of distance measures and sets, to appear. Y. Peres, W. Schlag, and B. Solomyak, “Sixty years of Bernoulli convolutions” in Fractals and Stochastics, II, Proceedings of the Greifswald 1998 Conference, ed. Bandt, Graf, and Zähle, to appear. Y. Peres and B. Solomyak, Absolute continuity of Bernoulli convolutions, a simple proof, Math. Res. Lett. 3 (1996), 231–239. , Self-similar measures and intersections of Cantor sets, Trans. Amer. Math. Soc. 350 (1998), 4065–4087. M. Pollicott and K. Simon, The Hausdorff dimension of λ-expansions with deleted digits, Trans. Amer. Math. Soc. 347 (1995), 967–983. F. Przytycki and M. Urbansky, On Hausdorff dimension of some fractal sets, Studia Math. 93 (1989), 155–186. T. D. Sauer and J. A. Yorke, Are the dimensions of a set and its image equal under typical smooth functions? Ergodic Theory Dynam. Systems 17 (1997), 941–956. B. Solomyak, On the random series ±λn (an Erd˝os problem), Ann. of Math. (2) 142 (1995), 611–625. , On the measure of arithmetic sums of Cantor sets, Indag. Math. (N.S.) 8 (1997), 133–141. , Measure and dimension for some fractal families, Math. Proc. Cambridge Philos. Soc. 124 (1998), 531–546. E. Stein, Singular integrals and differentiability properties of functions, Princeton Math. Ser. 30, Princeton University Press, Princeton, 1970. , Harmonic Analysis, Princeton University Press, Princeton, 1993. T. Wolff, A Kakeya type problem for circles, Amer. J. Math. 119 (1997), 985–1026. , “Recent work connected with the Kakeya problem” in Prospects in Mathematics (Princeton, NJ, 1996), Amer. Math. Soc., Providence, 1999. , Decay of circular means of Fourier transforms of measures. Preprint (1999). To appear in Internat. Math. Res. Notices 1999, 547–567.
Peres: Department of Mathematics, Hebrew University, Jerusalem, Israel and Department of Statistics, University of California, Berkeley, California 94720-3860, USA; peres @math.huji.ac.il Schlag: Mathematical Sciences Research Institute, 1000 Centennial Drive, Berkeley, CA 94720-5070, USA; Current: Department of Mathematics, Princeton University, Fine Hall, Princeton New Jersey 08544, USA;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS OF p-LAPLACIAN TYPE JUHA KINNUNEN and JOHN L. LEWIS
1. Introduction. In this work, we study regularity of solutions to second-order parabolic systems: (1.1)
∂ui = div Ai (x, t, ∇u) + Bi (x, t, ∇u), ∂t
i = 1, . . . , N.
In particular, we are interested in systems of p-Laplacian type. We present more precise structural assumptions later, but the principal prototype that we have in mind is the p-parabolic system ∂ui = div |∇u|p−2 ∇ui , ∂t
i = 1, . . . , N,
with 1 < p < ∞. As usual, solutions to (1.1) are taken in a weak sense, and they are assumed to belong to a parabolic Sobolev space. A good source for the regularity theory is [D]. In the elliptic case when the system is (1.2)
div Ai (x, t, ∇u) + Bi (x, t, ∇u) = 0,
i = 1, . . . , N,
it is known that solutions locally belong to a slightly higher Sobolev space than assumed a priori. This self-improving property was first observed by Elcrat and Meyers in [ME] (see also [Gi] and [Str]). Their argument is based on reverse Hölder inequalities and a modification of Gehring’s lemma [Ge], which originally was developed to study the higher integrability of the Jacobian of a quasiconformal mapping. In the elliptic case, higher integrabilty results play a decisive role in studying the regularity of solutions (see [GM] and [Gi]). The purpose of this work is to obtain higher integrablity results in the p-parabolic setting. We prove that the gradient of a weak solution to (1.1) satisfies a reverse Hölder inequality for p > 2n/(n+2). The critical exponent 2n/(n+2) occurs also in parabolic regularity theory (see [D]). We note that reverse Hölder inequalities and the local higher integrability for weak solutions were already proved for p = 2 in [GS] (see also [C]). Our result appears to be new even in the scalar case if p = 2. Received 12 November 1998. Revision received 3 June 1999. 1991 Mathematics Subject Classification. Primary 35K55. Lewis supported by National Science Foundation grant number DMS-9876881. 253
254
KINNUNEN AND LEWIS
One of the difficulties in proving our main result is that a solution does not remain a solution under multiplication by a constant that is neither 0 nor 1. Since reverse Hölder inequalities are invariant under multiplication by a constant, we have to choose a class of cylinders whose side lengths depend on the size of the function in order to obtain a reverse Hölder inequality as in [Ge] and then higher integrability. It seems to us that our results can be used to extend partial regularity results in [GM] for nonlinear elliptic systems to cover some parabolic systems. For p = 2, this was done in [GS], but our method also applies when p = 2. 2. Preliminaries. In order to be more precise about the structure and solutions of the system (1.1), we need some notation. Let Ω ⊂ Rn be an open set and let W 1,p (Ω) denote the Sobolev space of real-valued functions g such that g ∈ Lp (Ω) and the distributional first partial derivatives ∂g/∂xi , i = 1, 2, . . . , n, exist in Ω and belong to Lp (Ω). The space W 1,p (Ω) is equipped with the norm n ∂g
g 1,p,Ω = g p,Ω + ∂x . i p,Ω i=1
Given O
⊂ Rn
open, N a positive integer, and −∞ ≤ S < T ≤ ∞, let u = (u1 , . . . , uN ) : O × (S, T ) → RN
and suppose that whenever −∞ ≤ S < S1 < T1 < T ≤ ∞ and Ω ⊂ O, we have u ∈ L2 Ω × [S1 , T1 ] ∩ Lp [S1 , T1 ]; W 1,p (Ω) . (2.1) Here the notation Lp [S1 , T1 ]; W 1,p (Ω) means that for almost every t, S1 < t < T1 , with respect to one-dimensional Lebesgue measure, the function x → u(x, t) is in W 1,p () componentwise, and T1 N p p ui (·, t)p (2.2) dt < ∞. |||u|||p,Ω = u p,Ω×(S1 ,T1 ) + 1,p,Ω S1 i=1
Let ∇u denote the distributional gradient of u (taken componentwise) in the x variable only. We suppose that A = (A1 , . . . , AN ) where Ai = Ai (x, t, ∇u) : O × (S, T ) × RnN −→ Rn , and B = (B1 , . . . , BN ) where Bi = Bi (x, t, ∇u) : O × (S, T ) × RnN −→ R are Lebesgue (n + 1)-measurable functions on O × (S, T ). This is the case, for example, if Ai and Bi , i = 1, 2, . . . , N , satisfy the well-known Carathéodory-type conditions. We assume that there exist positive constants ci , i = 1, 2, 3, such that (2.3)
|Ai | ≤ c1 |∇u|p−1 + h1 ,
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
255
|Bi | ≤ c2 |∇u|p−1 + h2 ,
(2.4) and
N
(2.5)
Ai , ∇ui ≥ c3 |∇u|p − h3 ,
i=1
for i = 1, 2, . . . , N and almost every (x, t) ∈ O × (S, T ). Here ·, · denotes the standard inner product in Rn , and hi , i = 1, 2, 3, are measurable functions in O × (S, T ) so that (|h1 | + |h2 |)p/(p−1) + |h3 | = c4 < ∞, (2.6) q ,O×(S,T ) where q > 1. Finally, u satisfying (2.1) is said to be a weak solution in O ×(S, T ) to the nonlinear parabolic system ∂ui = div Ai (x, t, ∇u) + Bi (x, t, ∇u), ∂t
i = 1, . . . , N,
if the structural conditions (2.3)–(2.6) hold and (2.7)
T S
N O i=1
− ui
∂φi + Ai , ∇φi − Bi φi dx dt = 0 ∂t
for every test function φ = (φ1 , . . . , φN ) ∈ C0∞ (O × (S, T )). The following theorem is our main result. Theorem 2.8. Let p > 2n/(n + 2) and suppose that u is a weak solution to (1.1). Then there exists ε > 0 such that u ∈ L2 Ω × [S1 , T1 ] ∩ Lp+ε [S1 , T1 ]; W 1,p+ε (Ω) , where ε > 0 depends only on n, p, q , and ci , for i = 1, 2, 3, while |||u|||p+ε,Ω depends on these quantities as well as N, Ω, S1 , T1 , and c4 . The proof of our main result follows from two propositions in Section 4. 3. Fundamental estimates. In this section, we state and outline the proofs of some Sobolev- and Caccioppoli-type lemmas that are used in the proof of the main result. To do this, we need some notation. Given r, s > 0, (x, t) ∈ Rn+1 , let Dr (x) = {y ∈ Rn : |y − x| < r} denote the open ball in Rn and let Qr,s (x, t) = Dr (x) × (t − s, t + s)
256
KINNUNEN AND LEWIS
denote a cylinder in Rn+1 . Let |E| denote the Lebesgue (n + 1)-measure of the measurable set E, and if f is integrable on E with 0 < |E| < ∞, then the integral average of f over E is 1 f dx dt = f dx dt. |E| E E If Qρ,s (z, τ ) ⊂ O × (S, T ), then −1 Iρ (t) = Iρ (t, u, z, τ ) = m Dρ (z)
Dρ (z)
u(x, t) dx,
whenever τ −s < t < τ +s. Here m denotes Lebesgue measure in Rn , and the integral is taken componentwise. Lemma 3.1. Suppose that u is a weak solution to (1.1). If Q4ρ,s (z, τ ) ⊂ O × (S, T ), then there exists ρ , ρ < ρ < 2ρ, and a constant c depending on p, n, c1 , and c2 , such that Iρ(t2 ) − Iρ(t1 ) ≤ csρ −1 |∇u|p−1 + |h1 | + |h2 | dx dt Q2ρ,s (z,τ )
for almost all ti , τ − s < ti < τ + s, i = 1, 2. Proof. To prove this lemma, let δ, η > 0 be small, ρ < ρ < 2ρ, t1 < t2 , ψ1 ∈ C0∞ (t1 − η, t2 + η) with ψ1 = 1 on the interval (t1 , t2 ), and let ψ2 ∈ C0∞ (Dρ+δ (z)) be a radial function with ψ2 = 1 on Dρ(z). For fixed j = 1, 2, . . . , N , we put φj = ψ1 ψ2 and φi = 0 otherwise. Using (2.7) and letting first η → 0 and then δ → 0, we get from well-known Sobolev-type arguments that for almost every t1 , t2 , and ρ , as above, m Dρ(z) Iρ(t2 , uj ) − Iρ(t1 , uj )
−1 x − z, Aj (x, t) |x − z| dσ (x) dt + = ∂Dρ(z)×(t1 ,t2 )
Dρ(z)×(t1 ,t2 )
Bj dx dt.
Here σ denotes (n − 1)-dimensional surface area on ∂Dρ(z). Choose ρ , ρ < ρ < 2ρ, so that |∇u|p−1 + |h1 | + |h2 | dσ dt ∂Dρ(z)×(t1 ,t2 )
≤ 100ρ −1
D2ρ (z)×(t1 ,t2 )
|∇u|p−1 + |h1 | + |h2 | dx dt.
Using this choice, (2.3) and (2.4) in the above inequality, and summing over j = 1, 2, . . . , N , we deduce the claim.
257
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
The following lemma is a Caccioppoli-type estimate for parabolic systems of pLaplacian type. For short, we write p/(p−1) hp = |h1 | + |h2 | + |h3 |. Lemma 3.2. Let u be a weak solution to (1.1) and a = (a1 , . . . , aN ) ∈ RN . Then there exists a constant c depending on n, N , p, ci for i = 1, 2, 3, 4, such that if Qρ1 ,s1 (z, τ ) ⊂ Qρ2 ,s2 (z, τ ) ⊂ O × (S, T ) with 0 < ρ2 , s2 < 1, then we have |∇u|p dx dt + ess sup |u − a|2 dx Qρ1 ,s1 (z,τ )
≤ c(s2 − s1 )−1
t∈(τ −s1 ,τ +s1 ) Dρ1 (z)
Qρ2 ,s2 (z,τ )
+ c(ρ2 − ρ1 )−p
|u − a|2 dx dt
Qρ2 ,s2 (z,τ )
|u − a|p dx dt + c
Qρ2 ,s2 (z,τ )
hp dx dt.
Proof. Lemma 3.2 follows from a standard Caccioppoli-type estimate obtained from (2.7) by formally choosing test functions of the form p
φi = (u − a)i ψi , i = 1, 2, . . . , N, where ψi ∈ C0∞ Qρ2 ,s2 (z, τ ) is a cutoff function with ψi = 1 on Qρ1 ,s1 (z, τ ), 0 ≤ ψi ≤ 1, and −1 −1 ∂ψi < 1000. |∇ψi | ∞ + (s2 − s1 ) (ρ2 − ρ1 ) ∂t ∞ There is a difficulty with the test functions φi since the solution usually has a very modest degree of regularity with respect to the time variable. We refer the reader to [D, pp. 24–27] for an argument to overcome this difficulty. Next we prove a Sobolev-type inequality. Lemma 3.3. Let 1 ≤ ν < ∞ and suppose that u ∈ Lν ((τ − 2s, τ + 2s); 2ρ (z))). Then there is a constant c depending on n and ν such that
W 1,ν (D
Qρ,s (z,τ )
u(x, t) − Iρ (t) ν(1+2/n) dx dt
≤c
Q2ρ,2s (z,τ )
|∇u(x, t)|ν dx dt ·
ess sup
t∈(τ −2s,τ +2s) D2ρ (z)
u(x, t) − Iρ (t) 2 dx
ν/n
.
Proof. Let t, τ −2s < t < τ +2s be such that x → u(x, t) belongs to W 1,ν (D2ρ (z)), and denote ρ ∗ = 2ρ. Let ψ ∈ C0∞ (Q2ρ,2s (z, τ )) be a cutoff function such that ψ = 1 on Qρ,s (z, τ ) and |∇ψ| ≤ 10/ρ. Let v(x, t) = u(x, t) − Iρ (t) ψ(x, t).
258
KINNUNEN AND LEWIS
Hölder’s inequality implies that J= v(x, t)ν(1+2/n) dx Dρ ∗ (z)
≤
1/n v(x, t) dx 2
Dρ ∗ (z)
Dρ ∗ (z)
v(x, t)
(ν+(2/n)(ν−1))n/(n−1)
dx
(n−1)/n
.
We use Sobolev’s theorem for functions in W 1,1 (Dρ ∗ (z)) to deduce that there is constant c = c(n) such that (n−1)/n v(x, t)(ν+(2/n)(ν−1))n/(n−1) dx Dρ ∗ (z)
≤c
Dρ ∗ (z)
v(x, t)(ν−1)(1+2/n) ∇v(x, t) dx
≤c
Dρ ∗ (z)
Thus J ≤ cJ
(ν−1)/ν
v(x, t)
Dρ ∗ (z)
Clearly,
ν
Dρ ∗ (z)
ν(1+2/n)
dx
(ν−1)/ν Dρ ∗ (z)
∇v(x, t) ν dx
1/ν
|∇v(x, t)| dx
≤ cρ
∗ −1
1/ν Dρ ∗ (z)
Dρ ∗ (z)
Dρ ∗ (z)
1/ν .
1/n v(x, t) dx
.
2
+
∇v(x, t) ν dx
u(x, t) − Iρ (t) ν dx
∇u(x, t) ν dx
1/ν
1/ν .
Poincaré’s inequality in W 1,ν (Dρ ∗ (z)) implies that 1/ν u(x, t) − Iρ (t) ν dx Dρ ∗ (z)
≤
Dρ ∗ (z)
u(x, t) − Iρ ∗ (t) ν dx
≤c and hence
Dρ ∗ (z)
Dρ ∗ (z)
1/ν
u(x, t) − Iρ ∗ (t) ν dx
∇v(x, t) ν dx
1/ν + Iρ ∗ (t) − Iρ (t) m Dρ ∗ (z)
1/ν
1/ν
≤ cρ
∗
≤c
Dρ ∗ (z)
Dρ ∗ (z)
∇u(x, t) ν dx
∇u(x, t) ν dx
1/ν .
1/ν ,
259
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
The same argument as above gives 1/n v(x, t)2 dx ≤c Dρ ∗ (z)
≤c
Dρ ∗ (z)
Dρ ∗ (z)
u(x, t) − Iρ (t) 2 dx
1/n
u(x, t) − Iρ ∗ (t) 2 dx
1/n .
Collecting the obtained estimates, we arrive at ν/n ∇u(x, t) ν dx u(x, t) − Iρ ∗ (t) 2 dx J ≤c . Dρ ∗ (z)
Dρ ∗ (z)
The claim follows by integrating this inequality with respect to t over the interval (τ − 2s, τ + 2s). Observe that the proof applies to the case n = 1 as well. The following two lemmas are essential tools in proving our main result. We divide the discussion into two parts depending on whether p ≥ 2 or 2n/(n + 2) < p < 2. Lemma 3.4. Let u be a weak solution to (1.1) with p ≥ 2. Suppose that λ > 0, s = λ2−p ρ 2 , and Q40ρ,402 s (z, τ ) ⊂ O × (S, T ). Denote Q = Qρ,s (z, τ ), Q = Q4ρ,42 s (z, τ ), and Q = Q20ρ,202 s (z, τ ). If there is c5 ≥ 1 such that −1 p p p c5 λ ≤ |∇u| + h dx dt ≤ c5 |∇u|p + hp dx dt ≤ c52 λp , Q
Q
then there is c ≥ 1 such that p |∇u| dx dt ≤ c Q
q
Q
|∇u| dx dt
p/q
+c
Q
hp dx dt,
where q = max{p − 1, pn/(n + 2)}. The constant c has the same dependence as the constant in Lemma 3.2, except that it also depends on c5 . Proof. First suppose that p ≥ 2. From Lemma 3.2 with ρ1 = ρ, s1 = s, ρ2 = 2ρ, and s2 = 2s, we have |∇u|p dx dt ≤ cs −1 |u − a|2 dx dt Qρ,s (z,τ )
(3.5)
Q2ρ,2s (z,τ )
+ cρ −p
Q2ρ,2s (z,τ )
|u − a|p dx dt + c
= T 1 + T2 + c
Q2ρ,2s (z,τ )
Q2ρ,2s (z,τ )
hp dx dt
hp dx dt.
Since p ≥ 2, we may estimate T1 in terms of T2 using Hölder’s and Young’s inequalities and the assumption that s = λ2−p ρ 2 as p−2 −2 (3.6) ρ |u − a|2 dx dt ≤ λp /(4c5 ) + cT2 , T1 ≤ cλ Q2ρ,2s (z,τ )
260
KINNUNEN AND LEWIS
where c ≥ 1 has the same dependence as c in Lemma 3.2, except that it also depends on c5 . Hence it is enough to estimate T2 . By Lemma 3.1, we choose ρ , 2ρ < ρ < 4ρ, so that Iρ(t) − Iρ(ξ ) ≤ csρ −1 (3.7) |∇u|p−1 + |h1 | + |h2 | dx dt, Q
= Qρ,2s (z, τ ), and in (3.5) take for almost every ξ , τ − 2s < ξ < τ + 2s. Let Q a = a Q = a1 Q , . . . , aN Q , where ai Q = ui dx dt, Q
for i = 1, 2, . . . , N . Then we have −p u − Iρ(t) p dx dt + cρ −p (3.8) T2 ≤ cρ Q
ess sup
t∈(τ −2s,τ +2s)
p Iρ(t) − a Q .
We begin with estimating the second term on the right side of (3.8). Using (3.7), we have τ +2s Iρ(t) − a Q Iρ(t) − Iρ(ξ ) dξ ≤ (4s)−1 (3.9) τ −2s ≤ csρ −1 |∇u|p−1 + |h1 | + |h2 | dx dt, Q
and hence using the definition of λ, we obtain cρ −p
ess sup
t∈(τ −2s,τ +2s)
p Iρ(t) − a Q ≤ cλp(2−p) ≤c
(3.10)
≤c
|∇u|p−1 + |h1 | + |h2 | dx dt
Q
|∇u|p−1 + |h1 | + |h2 | dx dt
Q
Q
|∇u|
p−1
p/(p−1) dx dt +c
p
p/(p−1)
Q
hp dx dt.
Observe that the assumption p ≥ 2 is used in the second inequality above. Next we estimate the first term on the right side of (3.8). Lemma 3.3 implies that (3.11) u − Iρ(t) p dx dt ≤ c |∇u|q dx dt Q
Q
ess sup
t∈(τ −4s,τ +4s) Dρ(z)
= Q2 where q = pn/(n + 2), Q = 2 ρ. ρ ,4s (z, τ ), and ρ
u − Iρ(t) 2 dx
q/n
,
261
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
We estimate the essential supremum on the right side of (3.11). Let Q∗ = Q10ρ,10s (z, τ ). Clearly, u − Iρ(t) 2 dx ≤ c u − a Q∗ 2 dx + cm Dρ(z) a Q∗ − Iρ(t) 2 Dρ(z)
≤c
Dρ(z)
Dρ(z)
u − a Q∗ 2 dx.
Hence, using Lemma 3.2 with a = a(Q∗ ), we have u − Iρ(t) 2 dx ess sup t∈(τ −4s,τ +4s) Dρ(z)
≤c
(3.12)
ess sup
t∈(τ −4s,τ +4s) Dρ(z)
≤ cs −1 where s (3.13)
and
(3.14)
−1
Q∗
Q∗
u − a Q∗ 2 dx
u − a Q∗ 2 dx dt + cρ −p
u − a Q∗ 2 dx dt ≤ cs −1 + cs
+ cρ
Q∗
ess sup
t∈(τ −10s,τ +10s)
Q∗ −p ∗
Q
ess sup
t∈(τ −10s,τ +10s)
τ −10s
≤ cs −1 ρ 2
(3.15)
I10ρ (t) − a Q∗ 2
u − I10ρ (t) p dx dt
By Poincaré’s inequality in W 1,2 (D10ρ (z)), we have τ +10s u − I10ρ (t) 2 dx dt = s −1 s −1 Q∗
Q∗
u − a Q∗ p dx dt,
u − I10ρ (t) 2 dx dt
−1
p ρ −p u − a Q∗ dx dt ≤ cρ −p Q∗
Q∗
≤ cs −1 ρ 2
D10ρ (z)
Q∗
I10ρ (t) − a Q∗ p .
u − I10ρ (t) 2 dx dt
|∇u|2 dx dt
Q∗
|∇u|p dx dt
2/p
|Q∗ |
≤ cρ n+2 λ2 . Here we used the assumption that p ≥ 2 again. Exactly the same argument gives u − I10ρ (t) p dx dt ≤ c ρ −p (3.16) |∇u|p dx dt ≤ cρ n+2 λ2 . Q∗
Q∗
262
KINNUNEN AND LEWIS
Using Lemma 3.1, we choose ρ , 10ρ < ρ < 20ρ, such that Iρ(t) − Iρ(ξ ) ≤ csρ −1 |∇u|p−1 + |h1 | + |h2 | dx dt, Q
when τ − 10s < ξ < τ + 10s. This implies that I10ρ (t) − a Q∗ 2 ess sup s −1 Q∗ t∈(τ −10s,τ +10s)
(3.17)
2 n−2
≤ cs ρ
Q
p
p
2(p−1)/p
|∇u| + h dx dt
≤ cρ n+2 λ2 . A similar argument (see (3.10)) also gives I10ρ (t) − a Q∗ p ess sup ρ −p Q∗ t∈(τ −10s,τ +10s) n
(3.18)
≤ cρ sλ
p(2−p)
Q
|∇u|
p−1
+ |h1 | + |h2 | dx dt
p
≤ cρ n+2 λ2 . Using (3.12)–(3.18), we conclude that u − Iρ(t) 2 dx ≤ cρ n+2 λ2 . ess sup t∈(τ −4s,τ +4s) Dρ(z)
By (3.11) and Young’s inequality, we see that the first term on the right side of (3.8) can be estimated as p cρ −p u − Iρ(t) dx dt ≤ cλ2q/n |∇u|q dx dt Q
Q
≤c
q
Q
|∇u| dx dt
p/q
+ λp /(4c5 ).
Finally, using (3.5), (3.6), (3.8), and (3.10), we have (3.19)
p
Q
p
|∇u| dx dt ≤ λ /(2c5 ) + c +c
Q
q
Q
|∇u| dx dt
|∇u|p−1 dx dt
p/q
p/(p−1)
+c
Q
hp dx dt.
The claim follows from this estimate by absorbing the term containing λp into the left side.
263
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
Next we prove an analogue of Lemma 3.4 for 2n/(n + 2) < p < 2. Lemma 3.20. Let u be a weak solution to (1.1) with 2n/(n+2) < p < 2. Suppose that λ > 0, s = λ2−p ρ 2 , and Q40ρ,402 s (z, τ ) ⊂ O × (S, T ). Denote Q = Qρ,s (z, τ ), Q = Q4ρ,42 s (z, τ ), and Q = Q20ρ,202 s (z, τ ). If there is c6 ≥ 1 such that 2 −1 p c6 λ ≤ |∇u|p + s −1 u − a(Q) + hp dx dt Q
2 |∇u|p + s −1 u − a Q + hp dx dt ≤ c62 λp , ≤ c6 Q
then there is c ≥ 1 such that |∇u|p dx dt ≤ c Q
Q
|∇u|q dx dt
p/q
+c
Q
hp dx dt,
where q = 2n/(n + 2). The constant c has the same dependence as the constant in Lemma 3.2, except that it also depends on c6 . Proof. We use the same notation as in the proof of Lemma 3.4. Clearly, (3.5) also holds this case. Since p < 2, we use Hölder’s inequality to estimate T2 in (3.5) in terms of T1 and obtain (3.21)
T2 ≤ c ρ −2
Q2ρ,2s (z,τ )
|u − a|2 dx dt
p/2
p/2
≤ cλ(1−p/2)p T1
≤ λp /(4c6 ) + cT1 .
To estimate T1 by Lemma 3.1, we choose ρ , 2ρ < ρ < 4ρ, so that (3.7) holds. Let Q = Qρ,2s (z, τ ). Using the same argument that led to (3.8), we see that 2 −1 u − Iρ(t) 2 dx dt + cs −1 ess sup Iρ(t) − a Q . T1 ≤ cs (3.22) Q
t∈(τ −2s,τ +2s)
Using (3.9), the definition of λ, and Young’s inequality, we obtain s −1
ess sup
t∈(τ −2s,τ +2s)
≤ cλ2−p (3.23)
2 Iρ(t) − a Q
|∇u|p−1 + |h1 | + |h2 | dx dt
Q
≤ λp /(4c6 ) + c ≤ λp /(4c6 ) + c
2
|∇u|p−1 + |h1 | + |h2 | dx dt
Q
Q
|∇u|p−1 dx dt
p/(p−1)
p/(p−1)
+c
Q
hp dx dt.
264
KINNUNEN AND LEWIS
To estimate the first term on the right side of (3.22), we use Lemma 3.3 and argue first as in (3.11) to get u − Iρ(t) 2 dx dt (3.24)
Q
≤c
q
Q
|∇u| dx dt
u − Iρ(t) 2 dx
ess sup
t∈(τ −4s,τ +4s) Dρ(z)
q/n
,
= Q2 = 2 ρ . The essential supremum on where q = 2n/(n + 2), Q ρ ,4s (z, τ ), and ρ the right side of (3.24) is then estimated as in (3.12), and we obtain u − Iρ(t) 2 dx ess sup (3.25)
t∈(τ −4s,τ +4s) Dρ(z)
≤ cs
−1
Q∗
u − a Q∗ 2 dx dt + cρ −p
Q∗
u − a Q∗ p dx dt,
where Q∗ = Q10ρ,10s (z, τ ) as before. Using the assumption of the lemma and remembering that s = λ2−p ρ 2 , we have −1 u − a Q∗ 2 dx dt ≤ cλp Q∗ ≤ cρ n+2 λ2 . s Q∗
The second term on the right side of (3.25) can be estimated exactly the same way as in the case p ≥ 2; see (3.14), (3.16), and (3.18). We conclude that u − Iρ(t) 2 dx ≤ cρ n+2 λ2 . ess sup t∈(τ −4s,τ +4s) Dρ(z)
By (3.24) and Young’s inequality, we arrive at 2 −1 p−q cs u − Iρ(t) dx dt ≤ cλ |∇u|q dx dt (3.26)
Q
Q
≤c
q
Q
|∇u| dx dt
p/q
+ λp /(4c6 ).
The claim now follows from using (3.22), (3.23), and (3.26) as before by absorbing the term containing λp into the left side. This completes the proof. Remark 3.27. We record for future reference that the constant c in Lemmas 3.4 and 3.20 remain bounded above if p is in a compact subset of (2n/(n + 2), ∞). This is easily seen by analyzing the constants in Lemmas 3.1, 3.2, and 3.3. 4. Reverse Hölder inequalities. In this section, we show that gradients of weak solutions of (1.1) satisfy a reverse Hölder inequality provided p > 2n/(n + 2). As in the previous section, slightly different arguments are needed to handle the cases p ≥ 2 and 2n/(n + 2) < p < 2. First we study the case p ≥ 2.
265
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
Proposition 4.1. Let u be a weak solution to (1.1) when p ≥ 2 and suppose that Q4R,(4R)p (z, τ ) ⊂ O × (S, T ), where 0 < R < 1. Then there exist ε > 0 and c ≥ 1 having the same dependence as the corresponding constants in Theorem 2.8 with QR,R p (z,τ )
|∇u|
p+ε
1/(p+ε) σp−1 dx dt ≤ cR + cR
−1
p
Q2R,(2R)p (z,τ )
|∇u| dx dt
+c
Q2R,(2R)p (z,τ )
h
p+ε
σ 1/(p+ε)
dx dt
,
where σ = (2 + ε)/(2(p + ε)). Proof. To prove Proposition 4.1, we assume, as we may, that R = 1 and (z, τ ) = (0, 0), since otherwise we consider v(x, t) = u(z + Rx, τ + R p t) for (x, t) ∈ Q4,4p (0, 0). It is easily seen that v is a weak solution to a partial differential equation similar to and with the same structure as (1.1). Proving Proposition 4.1 for v with R = 1 relative to (0,0) and then transforming back, we get Proposition 4.1 for u. = Q2,2p (0, 0). To begin the proof of Proposition 4.1, we For short, we denote Q into Whitney-type cylinders divide Q Qi = Qri ,r 2 (zi , τi ), i
i = 1, 2, . . . ,
Let us so that ri is comparable to the parabolic distance of Qi to the boundary of Q. n+1 recall that the parabolic distance of sets E, F ⊂ R is inf |x − y| + |t − s|1/2 : (x, t) ∈ E and (y, s) ∈ F . Moreover, the cylinders Qi , i = 1, 2, . . . , are of bounded overlap and Q5ri ,(5ri )2 (zi , τi ) ⊂ Q. we define Next for (x, t) ∈ Q, g(x, t) = |∇u| + h (x, t) and
f (x, t) = c −1 min |Qi |1/2 : (x, t) ∈ Qi g(x, t),
where c ≥ 1 is chosen later. Let
λ20 =
Q
|g|p dx dt
266
KINNUNEN AND LEWIS
and λ > max{λ0 , 1} = λ0 . Suppose that (x, t) ∈ Qi with |f (x, t)| > λ. We set α = |Qi |−1
and
γ = α 1−p/2 λ2−p .
c large enough, we have If ri /20 ≤ r ≤ ri , then for p −1 |g| dx dt ≤ cγ α (4.2) |g|p dx dt ≤ c p λp α p/2 . Q
Qr,γ r 2 (x,t)
By Lebesgue’s differentiation theorem, we have for almost every such (x, t), that |g|p dx dt > c p λp α p/2 . (4.3) lim r→0
Qr,γ r 2 (x,t)
From (4.2), (4.3), and continuity of the integral, we see that there exists ρ, 0 < ρ < ri /20, such that (4.4) |g|p dx dt = c p λp α p/2 Qρ,γρ 2 (x,t)
and (4.5)
Qr,γ r 2 (x,t)
|g|p dx dt ≤ c p λp α p/2
for ρ ≤ r ≤ ri . Let s = γρ 2 and denote Q = Qρ,s (x, t),
Q = Q4ρ,42 s (x, t),
and
Q = Q20ρ,202 s (x, t).
Since λ, α > 1 and p ≥ 2, we have γ ≤ 1. This implies that Q ⊂ Q. Now (4.4) and (4.5) imply that there is a constant c ≥ 1 such that −1 p p/2 p c λ α (4.6) ≤ |g| dx dt ≤ c |g|p dx dt ≤ c2 λp α p/2 . Q
Observe that
Q
2−p 2 ρ . s = γρ 2 = λα 1/2
Thus we can apply Lemma 3.4 with λ replaced by λα 1/2 . Note that c5 in this case depends only on n and p. From Lemma 3.4, we conclude for pn q = max p − 1, n+2
267
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
and c ≥ 1 that (4.7)
p
|∇u| dx dt ≤ c
Q
q
Q
|∇u| dx dt
p/q
+c
Q
hp dx dt.
Using (4.6) and (4.7), we have c−1 λp ≤ |f |p dx dt Q
≤c
(4.8)
≤ c2
Q
Q
|f |q dx dt
p/q
+c
Q
k dx dt
|f |p dx dt ≤ c3 λp ,
where k(x, t) = min |Qi |p/2 : (x, t) ∈ Qi hp (x, t). : |f (x, t)| > λ and η > 0. Then by (4.8), we obtain Let G(λ) = (x, t) ∈ Q Q
|f |q dx dt
p/q
−1 ≤ cηp λp + Q ≤ cη
(4.9)
p
q
|f | dx dt
Q
p−q
+ cλ
Q ∩G(ηλ)
Q
−1
p/q
|f |q dx dt
+ cη
Q ∩G(ηλ)
p
p/q
Q
k dx dt
|f |q dx dt.
A similar argument gives (4.10)
Q
k dx dt ≤ cηp
+ cηp
Q
Q
|f |q dx dt
p/q
−1 k dx dt + c Q
Q ∩G(ηλ)
k dx dt.
Choosing η > 0 small enough in (4.9), (4.10), and absorbing terms, we arrive at p/q q |f | dx dt + k dx dt Q Q (4.11) −1 p−q −1 q |f | dx dt + c Q k dx dt. Q ≤ cλ Q ∩G(ηλ)
Q ∩G(ηλ)
An examination of the proof of the well-known Vitali-type covering lemma shows
268
KINNUNEN AND LEWIS
that we can choose pairwise disjoint cylinders Qi = Q4ρi ,γ (4ρi )2 (xi , ti ),
i = 1, 2, . . . ,
such that almost everywhere G(λ) ⊂
∞ i=1
where
Qi ⊂ Q,
Qi = Q20ρi ,γ (20ρi )2 (xi , ti ),
i = 1, 2, . . . .
From (4.8) and (4.11), we deduce that for some small η > 0, we have c−1 λp ≤ |f |p dx dt Qi
p−q
≤ cλ
(4.12)
q
Qi
|f | dx dt + c
≤ c2 λp−q |Qi |−1
Qi ∩G(ηλ)
≤ c 3 λp .
Qi
k dx dt
|f |q dx dt + c2 |Qi |−1
Qi ∩G(ηλ)
k dx dt
Multiplying (4.12) by |Q | and summing over i, we get from (4.12) and disjointness of the cylinders Qi , i = 1, 2, . . . , that |f |p dx dt ≤ |f |p dx dt G(λ)
Qi
i
≤ cλp−q
(4.13)
≤ cλ
p−q
i
Qi ∩G(ηλ)
|f |q dx dt +
q
G(ηλ)
i
|f | dx dt + c
G(ηλ)
Qi ∩G(ηλ)
k dx dt
k dx dt.
We can now apply a standard argument to complete the proof of Proposition 4.1. For completeness, we sketch it. Using Fubini’s theorem and (4.13), we have |f |p+ε dx dt G(λ0 ) ∞ ε−1 p ε λ |f | dx dt dλ + λ0 |f |p dx dt =ε λ0 G(λ) G(λ0 ) ∞ λε−1+p−q |f |q dx dt dλ ≤ cε λ0
+ cε
G(ηλ)
∞ λ0
λ
ε−1+p−q
G(ηλ)
k dx dt
ε dλ + λ0
G(λ0 )
|f |p dx dt
269
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
≤
cε ε+p−q
G(
λ0
)
|f |p+ε dx dt + c
G(
λ0
)
|f |ε k dx dt + λ0
ε
G(λ0 )
|f |p dx dt.
By Young’s inequality, we obtain |f |ε k dx dt ≤ ε |f |ε+p dx dt + c k 1+ε/p dx dt. G(λ0 ) G(λ0 ) G(λ0 ) Choosing ε > 0 small enough, we may absorb the integrals involving |f |p+ε into the left side and we obtain p+ε ε p dx dt ≤ cλ0 |f | |f | dx dt + c k 1+ε/p dx dt. G(λ0 ) G(λ0 ) G(λ0 ) Observe that there is a difficulty in moving terms to the left side since they may be infinite. This technical problem can be treated, for example, by truncating the function f . To be more precise, let > > λ0 and denote f> = min{|f |, >}. If |f | is replaced by f> in the definition of G(λ), we see that (4.13) holds with |f | replaced by f> , and we can go through the above argument. Now all absorbed terms are finite, and we obtain the claim passing to the limit as > → ∞. Thus p+ε ε p |f | dx dt ≤ λ0 |f | dx dt + |f |p+ε dx dt (λ ) Q Q\G G λ ( ) 0 0 ε |f |p dx dt + c k 1+ε/p dx dt. ≤ cλ0 Q
Q
Proposition 4.1 follows easily from this inequality. Then we prove a counterpart of Proposition 4.1 when 2n/(n + 2) < p < 2. Proposition 4.14. Let u be a weak solution to (1.1) when 2n/(n + 2) < p < 2 and suppose that Q4R,(4R)p (z, τ ) ⊂ O × (S, T ), where 0 < R < 1. Then there exist ε > 0 and c ≥ 1 having the same dependence as the corresponding constants in Theorem 2.8 with 1/(p+ε) |∇u|p+ε dx dt QR,R p (z,τ )
≤ cR −1 + cR
−1
Q2R,(2R)p (z,τ )
u − a Q2R,(2R)p (z, τ ) 2 dx dt
+c
p+ε
Q2R,(2R)p (z,τ )
h
1/(p+ε) dx dt
where ν = (2ε + (n + 2)p − 2n)/((p + ε)((n + 2)p − 2n)).
,
ν
270
KINNUNEN AND LEWIS
= Q2,2p (0, 0) into Whitney-type cylinders Qi , i = Proof. We again divide Q 1, 2, . . . , exactly in the same way as in the proof of Proposition 4.1. Next for (x, t) ∈ put Q, g(x, t) = |∇u| + h (x, t). Let ((n+2)p−2n)/2 λ0
(4.15)
=
Q
2 u − a Q dx dt
we define and λ > max{λ0 , 1} = λ0 . For (x, t) ∈ Q, f (x, t) = c −1 min |Qi |σ : (x, t) ∈ Qi g(x, t), where σ=
2n + 8 (n + 2) (n + 2)p − 2n
and c ≥ 1 are chosen later. Suppose (x, t) ∈ Qi with |f (x, t)| > λ. Put α = |Qi |−σ and let γ = (λα)2−p . Again by Lebesgue’s differentiation theorem, we have for almost every such (x, t) that lim |g|p dx dt > c p λp α p . r→0
Qr,γ r 2 (x,t)
and for Also, if r = (λα)p/2−1 ri with ri /20 ≤ ri ≤ ri , then Qr,γ r 2 (x, t) ⊂ Q c large enough, we have from Lemma 3.2 and Hölder’s inequality, 2 |g|p + γ −1 r −2 u − a Qr,γ r 2 (x, t) dx dt Qr,γ r 2 (x,t)
−(n+4) (λα)(1−p/2)n ≤ c(n)ri
Q
2 dx dt < hp + 1 + u − a Q c p λp α p ,
since p > 2n/(n + 2). We again use continuity of the integral to find ρ, 0 < ρ < (λα)p/2−1 ri /20 such that 2 (4.16) |g|p + γ −1 ρ −2 u − a Qρ,γρ 2 (x, t) dx dt = c p λp α p Qρ,γρ 2 (x,t)
and (4.17)
Qr,γ r 2 (x,t)
2 |g|p + γ −1 r −2 u − a Qr,γ r 2 (x, t) dx dt ≤ c p λp α p
for ρ ≤ r ≤ (λα)p/2−1 ri . From (4.16) and (4.17), we see that Lemma 3.20 can be applied with Q = Qρ,γρ (x, t) and λ replaced by λα. We can now repeat the proof of Proposition 4.1 essentially verbatim from (4.8) on in order to get Proposition 4.14. We omit the details.
HIGHER INTEGRABILITY FOR PARABOLIC SYSTEMS
271
The proof of Theorem 2.8 follows easily from Propositions 4.1, 4.14, and Lemma 3.3. References [C] [D] [Ge] [Gi] [GM] [GS] [ME] [Str]
H. J. Choe, On the regularity of parabolic equations and obstacle problems with quadratic growth nonlinearities, J. Differential Equations 102 (1993), 101–118. E. DiBenedetto, Degenerate Parabolic Equations, Universitext, Springer-Verlag, New York, 1993. F. W. Gehring, The Lp -integrability of the partial derivatives of a quasiconformal mapping, Acta Math. 130 (1973), 265–277. M. Giaquinta, Multiple Integrals in the Calculus of Variations and Nonlinear Elliptic Systems, Ann. of Math. Stud. 105, Princeton University Press, Princeton, 1983. M. Giaquinta and G. Modica, Regularity results for some classes of higher order nonlinear elliptic systems, J. Reine Angew. Math. 311/312 (1979), 145–169. M. Giaquinta and M. Struwe, On the partial regularity of weak solutions of nonlinear parabolic systems, Math. Z. 179 (1982), 437–451. N. G. Meyers and A. Elcrat, Some results on regularity for solutions of non-linear elliptic systems and quasi-regular functions, Duke Math. J. 42 (1975), 121–136. E. W. Stredulinsky, Higher integrability from reverse Hölder inequalities, Indiana Univ. Math. J. 29 (1980), 407–413.
Kinnunen: Department of Mathematics, P.O. Box 4, FIN-00014 University of Helsinki, Finland;
[email protected] Lewis: Department of Mathematics, University of Kentucky, Lexington, Kentucky 40506-0027, U.S.A.;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
INTEGRALITY AND SYMMETRY OF QUANTUM LINK INVARIANTS THANG T. Q. LE
0. Introduction. Quantum invariants of framed links whose components are colored by modules of a simple Lie algebra g are Laurent polynomials in v 1/D (with integer coefficients), where v is the quantum parameter and D an integer depending on g. We show that quantum invariants, with a suitable normalization, are Laurent polynomials in v 2 . We also establish two symmetry properties of quantum link invariants at roots of unity. The first asserts that quantum link invariants, at rth roots of unity, are invariant under the action of the affine Weyl group Wr , which acts on the weight lattice. A fundamental domain of Wr is the fundamental alcove C¯ r , a simplex. Let G be the center of the corresponding simply connected complex Lie group. There is a natural action of G on C¯ r . The second symmetry property, in its simplest form, asserts that quantum link invariants are invariant under the action of G if the link has zero linking matrix. The second symmetry property generalizes symmetry principles of Kirby and Melvin (the sl2 case) and Kohno and Takata (the sln case) to arbitrary simple Lie algebra. 0.1. Quantum invariants. Suppose L is a framed link with m ordered components and M1 , . . . , Mm are modules of a simple complex Lie algebra g. Then the quantum invariant JL (M1 , . . . , Mm ) is a rational function in the variable v 1/D , where v is the quantum parameter and D is a number depending on g. (See [RT1], [Tu]; we recall the definition of quantum invariants in §1.) The Jones polynomial (see [Jo]) is the simplest in the family of quantum link invariants: When g = sl2 and the modules equal the fundamental representation, JL is the Jones polynomial, with a suitable change of variable. The reader should be able to relate v to any other variable if it is known that the quantum integer [n] is given by [n] =
v n − v −n . v − v −1
0.2. Integrality. A priori JL is a rational function in v 1/D . Lusztig’s result on the integrality of the R-matrix implies that JL is a Laurent polynomial in v 1/D with integer coefficients (see a detailed proof in §1.4.2 below). We study the integrality of the exponents of v. One of our main results shows that JL is essentially a Laurent Received 13 April 1999. 1991 Mathematics Subject Classification. Primary 57M25; Secondary 17B37. Author’s work partially supported by National Science Foundation grant number DMS-9626404. 273
274
THANG T. Q. LE
polynomial in v 2 . More precisely, suppose all the modules M1 , . . . , Mm are irreducible; then JL belongs to v p Z[v ±2 ], where p is a rational number determined by the linking matrix of L (see the strong integrality theorem in §2). Thus one can get rid of fractional and odd powers of v by using a suitable normalization. For example, suppose that the normalization of the quantum invariant is chosen so that the value of the unknot is 1; then the value of any unframed knot is in Z[v ±2 ]. The strong integrality theorem does not follow directly from the integrality result of Lusztig. To prove it, we have to use a geometric lemma about special presentation of links and a result of Andersen on quantum groups at roots of unity. 0.3. Symmetry I. To formulate the symmetry properties, it is more convenient to use another normalization of quantum invariants, QL (M1 , . . . , Mm ) := JL (M1 , . . . , Mm )JU (m) (M1 , . . . , Mm ), where U (m) is the trivial link with m components and each has zero framing. This normalization is the one used in the definition of quantum 3-manifold invariants. Since irreducible g-modules are parametrized by the set X+ of dominant weights, both JL and QL can be considered as functions from (X+ )m to Z[v ±1/D ]. The set X+ is the part of the weight lattice X which lies in a Euclidean space h∗R . For each positive integer r there is defined the fundamental alcove C¯ r , which is a simplex in h∗R (see §2). The reflections along the facets of C¯ r generate the affine Weyl group Wr , for which C¯ r is a fundamental domain. The affine Weyl group plays an important role in the theory of affine Lie algebras (see [Kac]). We show that when v 2 is a primitive rth root of unity, quantum invariants have very nice symmetry properties expressed in the first and the second symmetry principles. The first asserts that QL is componentwise invariant under the action of the affine Weyl group. More precisely, when v 2 is an rth root of unity, ¯ wm ·µm = QL ¯ µm , ¯ w1 ·µ1 , . . . , ¯ µ1 , . . . , QL ¯ µ is the simple module of highest weight µ. Here the where w1 , . . . , wm ∈ Wr and dot means the dot action, and all µ1 , . . . , µm , w1 · µ1 , . . . , wm · µm are in X+ . For a stronger statement that describes the maximal group of symmetry, see §2. Thus when considering quantum invariants at roots of unity, one could restrict the colors—that is, the modules assigned to components of links—to C¯ r , a fundamental domain of the affine Weyl group. The simplex C¯ r contains only a finite number of elements in X+ . For example, the sum over all weights in X+ could be replaced by the sum over all weights in C¯ r . This happens in the theory of quantum 3-manifold invariants. 0.4. Symmetry II. Let G be the (necessarily finite abelian) center of the simply connected complex Lie group associated with g. The group G is also known as the fundamental group; it is isomorphic to the quotient of the weight lattice by the root
QUANTUM LINK INVARIANTS
275
lattice. There is a natural action of G on C¯ r (see §2). Suppose v 2 is a primitive rth root of unity. In its simplest form, the second symmetry principle says that QL is invariant under the action of G if the linking matrix of L is zero. In general, under the action of G, QL is multiplied by a twisting factor determined by the linking matrix of L. More precisely, suppose µ1 , . . . , µm ∈ C¯ r , g1 , . . . , gm ∈ G; then ¯ g1 ·µ1 , . . . , ¯ µ1 , . . . , ¯ gm ·µm = v rt QL ¯ µm , QL where the dot action is, as usual, the one shifted by the half-sum of positive roots and where t = (r − h) lij gi | gj + 2 lij gi | µj , 1≤i, j ≤m
1≤i, j ≤m
with h being the Coxeter number (see Table 1). Here (lij ) is the linking matrix, and (· | ·)’s are scalar products naturally defined using the standard scalar product on g. For a stronger statement, see §2. The action of G is induced from that of the extended affine Weyl group. The second symmetry principle, in fact, describes how quantum invariants behave under actions of the extended affine Weyl group. For g = sl2 , the second symmetry principle was discovered by Kirby and Melvin [KM] and for g = sln by Kohno and Takata [KT1]. Our contribution in these cases is the explicit relation between the twisting factor and the scalar product of g. This relation makes the second symmetry principle more understandable and easier to deal with. We also consider all primitive rth roots of unity, not only e2πi/r . Our proof is different from those of [KM] and [KT1], though it borrows some ideas from [KM]. In order to handle all simple Lie algebras, we have to use deep results of Lusztig and Andersen on quantum groups. 0.5. In [KM] and [KT2], the second symmetry principle was used to define a finer version—the projective version—of quantum 3-manifold invariants. The values of the projective version, so far defined only for g = sln , were proved to be algebraic integers (see [Mu], [MR], [TY]). Then Ohtsuki showed that, for g = sl2 , the projective version has a perturbative expansion, which is a power series invariant of rational homology 3-spheres; see [Oh1] (see also [Le2] for the sln case). This result led Ohtsuki to the definition of finite-type invariants of 3-manifolds (see [Oh2]). In a forthcoming paper [Le3], we will generalize these results to arbitrary simple Lie algebras. 0.6. Various properties of quantum link invariants were proved by first establishing the properties for fundamental modules and then using cablings (see, e.g., [MW], [Yo]). This approach has been widely used for classical Lie algebras (series ABCD), since the invariants corresponding to fundamental representations are essentially the Homflypt and the Kauffman polynomials, which have simple skein relations. The case of exceptional Lie algebras has not been well studied. We do not use that approach in this paper. To uniformly handle all simple Lie algebras we extensively utilize results in quantum group theory.
276
THANG T. Q. LE
Table 1 A"
B" " odd
B" " even
C"
D" " odd
D" " even
E6
E7
E8
F4
G2
d
1
2
2
2
1
1
1
1
1
2
3
D
"+1
2
1
1
4
2
3
2
1
1
1
G
Z"+1
Z2
Z2
Z2
Z4
Z2 × Z 2
Z3
Z2
1
1
1
h
"+1
2"
2"
2"
2" − 2
2" − 2
12
18
30
12
6
h∨
"+1
2" − 1
2" − 1
" + 1 2" − 2
2" − 2
12
18
30
9
4
0.7. The paper is organized as follows. In §1 we recall necessary facts about quantum groups and the definition of quantum link invariants. The integrality theorem, the two symmetry principles, their refinements, and their corollaries are presented in §2. Finally, §3 contains proofs of main theorems. Acknowledgements. The author would like to thank M. Finkelberg, C. Kassel, G. Masbaum, T. Ohtsuki, T. Takata, and H. Wenzl for helpful and stimulating discussions. He is grateful to H. Andersen for explaining many results in quantum group theory. He also thanks W. Menasco, who provided a proof of Proposition 3.6, and the referee, for valuable corrections and comments. 1. Quantum groups and quantum link invariants 1.1. Quantum groups. We recall here some facts from the theory of quantum groups, following [Lu2] (see also [Ka]). We do not use the h-adic version, so the R-matrix does not lie in the quantum group. 1.1.1. Cartan matrix and roots. Let (aij )1≤i,j ≤" be the Cartan matrix of a simple complex Lie algebra g. There are relatively prime integers d1 , . . . , d" in {1, 2, 3} such that the matrix (di aij ) is symmetric. Let d be the maximal of (di ). The reader uncomfortable with Lie algebra theory might want to consider only the case d = 1, that is, the simply laced case (series ADE), for which many formulas become much simpler. The values of d and other data for various Lie algebras are listed in Table 1. We fix a Cartan subalgebra h of g and basis roots α1 , . . . , α" in the dual space h∗ . Let h∗R be the R-vector space spanned by α1 , . . . , α" . The root lattice Y is the Z-lattice generated by αi , i = 1, . . . , ". Define the scalar product on h∗R so that (αi | αj ) = di aij . Then (α | α) = 2 for every short root α. Let Z+ be the set of all nonnegative integers. The weight lattice X (resp., the set of dominant weights X+ ) is the set of all λ ∈ h∗R such that λ, αi := (2(λ | αi ))/(αi | αi ) ∈ Z (resp., λ, αi ∈ Z+ ) for i = 1, . . . , ". Let λ1 , . . . , λ" be the fundamental weights; that is, the λi ∈ h∗R are defined by λi , αj = δij or (λi | αj ) = di δij . Then X is the
277
QUANTUM LINK INVARIANTS
Z-lattice generated by λ1 , . . . , λ" . The root lattice Y is a subgroup of the weight lattice X, and the quotient G = X/Y is called the fundamental group. If µ ∈ X and α ∈ Y , then (µ | α) is always an integer. Let ρ be the half-sum of all positive roots. Then ρ = λ1 + · · · + λ" ∈ X+ . Finitedimensional simple g-modules are parametrized by X+ : for every λ ∈ X+ , there ¯ λ. corresponds a unique simple g-module 1.1.2. The Hopf algebra ᐁ and its integral form. Consider the algebra Ꮽ = Z[v, v −1 ] and its fractional field Q(v), where v is an indeterminate. The Hopf algebra ᐁ, known as the quantum group associated with g, is defined over Q(v) and is generated by Ei , Fi , Kα , with i = 1, . . . , " and α ∈ Y , subject to some relations. We refer the reader to [Lu2] for the set of relations and the definitions of the coproduct . and the antipode S; the precise formulas are not used in the sequel. Note that the coproduct in [Lu2] is the same as the one in [Tu], but opposite to the one in [Ka] and [KM]; correspondingly, our antipode is the inverse of that in [Ka] and [KM]. In [Lu2], the two lattices X, Y are in different spaces, dual to each other. Here we consider both X and Y as subsets of the same space h∗R (using the scalar product). One of the relations says that Kα+β = Kα Kβ = Kβ Kα and K0 = 1. Hence K−α = Kα−1 . Lusztig introduced an integral version Ꮽ ᐁ of ᐁ, similar to the Kostant Z-form of classical Lie algebras. For each positive integer p, let (p)
Ei
p
=
Ei , [p]i !
(p)
Fi
p
=
Fi , [p]i !
where [p]i ! =
p v di n − v −di n n=1
(p)
v di − v −di
.
(p)
Then Ꮽ ᐁ is the Ꮽ-subalgebra of ᐁ generated by Ei , Fi , Kα , with i = 1, . . . , n, p ∈ Z+ , and α ∈ Y . It is known that Ꮽ ᐁ inherits the Hopf algebra structure of ᐁ. 1.2. Category of ᐁ-modules 1.2.1. Finite-dimensional ᐁ-modules of type 1. Suppose M is a ᐁ-module. For every ν ∈ X, let M ν = x ∈ M | Kα (x) = v ν,α x for every root α . The subspace M ν is called the subspace of weight ν; its elements are vectors of weight ν. Let Ꮿ be the category of finite-dimensional (over Q(v)) ᐁ-modules M such that Mν. M= ν∈X (p)
(p)
It is known that on every M ∈ Ꮿ, both Ei and Fi equal zero for sufficiently large p. A morphism in Ꮿ is just a ᐁ-linear homomorphism. If M, N are in Ꮿ, then M ⊗N
278
THANG T. Q. LE
is also in Ꮿ. Thus Ꮿ is a tensor category (also known as a monoidal category; see, e.g., [Ka], [Tu]). The category Ꮿ is semisimple, and its simple objects are parametrized by the set of dominant weights X+ . For every λ ∈ X+ , there is a unique simple ᐁ-module λ ∈ Ꮿ, (p) with a vector x of weight λ such that Ei (x) = 0 for every i = 1, . . . , " and p ≥ 1. The module λ , called the module of highest weight λ, can be considered as the ¯ λ of the Lie algebra g. Every ᐁ-module deformation of the corresponding module in Ꮿ is the direct sum of simple modules of the form λ . The decomposition of the tensor product of two simple ᐁ-modules in Ꮿ is exactly the same as that of the tensor product of corresponding g-modules. Hence, the tensor category Ꮿ is tensorly equivalent to the tensor category of finite-dimensional g-modules. If ν is a weight of λ , then λ−ν is a sum of positive roots. In particular, λ−ν ∈ Y . 1.2.2. Dual modules. As usual, using the antipode S, for every M ∈ Ꮿ one can define the dual ᐁ-module M ∗ ∈ Ꮿ. By definition, M ∗ = HomQ(v) (M, Q(v)), and for every a ∈ ᐁ, f ∈ M ∗ , x ∈ M, one has (af )(x) = f (S(a)x). The dual of µ is ν , where ν = −w0 (µ) and w0 is the longest element of the Weyl group. 1.2.3. The element K˜ ±2ρ . For β ∈ Y , β = "i=1 ki αi , let K˜ β = ni=1 Kki di αi . Replacing K by K˜ has the following effect: If x is a vector of weight ν, then K˜ β (x) = v (ν|β) (x) (replacing the bracket λ, β by the scalar product (λ | β)). Note that 2ρ, as the sum of all positive roots, is always in the root lattice Y . Hence, (2ρ | µ) ∈ Z for every µ ∈ X. The elements K˜ ±2ρ play an important role. 1.2.4. The evaluation and coevaluation maps. The ground field Q(v) is the ᐁmodule λ , with λ = 0. The algebra ᐁ acts on Q(v) via the co-unit. The module Q(v) is the unit of the tensor product in Ꮿ. The left evaluation map evl : M ∗ ⊗ M → Q(v), defined by evl (f ⊗ x) = f (x), is ᐁ-linear. But the map M ⊗ M ∗ → Q(v) defined by (x ⊗ f ) → f (x) is not ᐁ-linear. However, the one twisted by K˜ −2ρ is: The map evr : M ⊗ M ∗ → Q(v), defined by evr (x ⊗ f ) = f (K˜ −2ρ x), is ᐁ-linear. Similarly, the coevaluation maps xs ⊗ xs∗ , coevl : Q(v) −→ M ⊗ M ∗ , defined by coevl (1) = s
∗
coevr : Q(v) −→ M ⊗ M,
defined by coevr (1) =
s
are ᐁ-linear. Here {xs } is a basis of M and
{xs∗ }
xs∗ ⊗ K˜ 2ρ (xs ),
is the dual basis in M ∗ .
1.2.5. Canonical basis. Lusztig and Kashiwara introduced a canonical basis Bλ for the ᐁ-module λ . The set Bλ is a Q(v)-basis of the Q(v)-vector space λ . Let Ꮽ λ be the Ꮽ-lattice in λ generated by Bλ . Then Ꮽ ᐁ leaves the lattice Ꮽ λ invariant, Ꮽ ᐁ(Ꮽ λ ) ⊂ Ꮽ λ . The set Bλ consists of weight vectors; that is, the intersection Bλ ∩ (λ )ν is a Q(v)-basis of the vector space (λ )ν for every weight ν.
QUANTUM LINK INVARIANTS
279
1.3. The braiding and the twist 1.3.1. The quasi-R-matrix. Let 7 be the quasi-R-matrix of [Lu2]; it is an infinite sum 7= as ⊗ b s , (1.1) s
where as , bs are in ᐁ. So 7 belongs to an appropriate completion of ᐁ ⊗ ᐁ. On every M ∈ Ꮿ, all the as , bs , except for a finite number of them, act as zero. Hence it makes sense to consider the operator 7 : M ⊗ N → M ⊗ N for every M, N in Ꮿ. In particular, there is 7 : λ ⊗µ → λ ⊗µ . Lusztig proved that 7 is invertible and that both 7, 7−1 leave Ꮽ λ ⊗Ꮽ Ꮽ µ invariant. Let us take the set x ⊗ y, with x ∈ Bλ and y ∈ Bµ as a basis of λ ⊗ µ , and call it the tensor product basis. Then, in this basis, the matrices of 7, 7−1 have entries in Ꮽ = Z[v, v −1 ]. Moreover, Lusztig proved that 7, 7−1 can be defined over Ꮽ. This implies that the as , bs in the formula (1.1) of 7 can be chosen in Ꮽ ᐁ. 1.3.2. Braiding. Let D be the least positive integer such that D(µ | ν) ∈ Z for every µ, ν ∈ X. Equivalently, D is the least positive integer such that DX ⊂ Y . The number D (see Table 1) is always a divisor of the determinant of the Cartan matrix. Let v 1/D be a new variable such that (v 1/D )D = v. To define the braiding, we extend the ground field to Q(v 1/D ) by taking tensor products of ᐁ and every module in Ꮿ with Q(v 1/D ). By abuse of notation, we still use ᐁ, Ꮿ to denote the corresponding objects (after taking tensor products). For two ᐁ-modules M, N in Ꮿ, let : : M ⊗ N → M ⊗ N be defined by :(x ⊗ y) = v (ν|µ) x ⊗ y, if x ∈ M ν and y ∈ N µ . Note that (ν | µ) is always in (1/D)Z. Let σ : M ⊗N → N ⊗M be the flip: σ (x ⊗y) = y ⊗x. Then the braiding operator c = c(M, N) : M ⊗ N → N ⊗ M is defined by c(M, N) = σ :7−1 : M ⊗ N −→ N ⊗ M. Then c commutes with the action of ᐁ. Actually, c is equal to the inverse of the commutativity isomorphism in [Lu2, Chapter 32]. The map : is called the diagonal part of the braiding. It is because of the diagonal part that we need to extend the ground field to Q(v 1/D ). It is clear now that if we take the tensor product bases as the bases of λ ⊗ µ and µ ⊗ λ , then the matrix of the braiding c has entries in Z[v ±1/D ]. A slightly stronger result is given below. Suppose M = µ and N = ν . If µ , ν are weights of M, N, respectively, then both µ−µ and ν −ν are in the root lattice Y . Since the scalar product of an element in Y and an element in X is always an integer, we see that (µ | ν ) ≡ (µ | ν) (mod Z).
280
THANG T. Q. LE
Hence, in any basis that is the tensor product of bases consisting of weight vectors, the matrix of : has entries in v (µ|ν) Ꮽ. Thus we have the following. Proposition 1.1. In the product bases, the matrix of the braiding c : µ ⊗ν → ν ⊗ µ has entries in v (µ|ν) Ꮽ = v (µ|ν) Z[v ±1 ], and its inverse has entries in v −(µ|ν) Ꮽ. 1.3.3. The twist. Recall that 7 = as ⊗bs , where the sum is infinite and as , bs ∈ ᐁ. Let (see [Lu2, Chapter 6]) S(as )bs . == s
This sum should be considered as an element of some completion of ᐁ. Since on M ∈ Ꮿ only a finite number of terms in the sum survive, one can define = : M → M. For every M ∈ Ꮿ, let θ = θ (M) : M → M be defined by θ (x) = v (ν+2ρ|ν) =(x) if x ∈ M ν . Then θ is invertible, commutes with ᐁ-actions, and is known as a quantum Casimir element (see [Lu2, Chapter 6]). We call it the twist. Moreover, for any M, N ∈ Ꮿ, one has that
−1 θ (M ⊗ N) θ (M) ⊗ θ(N) = c(N, M)c(M, N), (1.2) whose proof is similar to that of [Ka, Proposition VIII.4.5]. The twist θ can also be described as follows. First, note that θ(M N) = θ(M) θ (N). Every M in Ꮿ can be uniquely expressed in the form M(λ) , M= λ∈X+
where M(λ) is the direct sum of a finite number of copies of the simple module λ . The twist θ acts on M(λ) as the scalar v (λ+2ρ|λ) times the identity. 1.3.4. Ꮿ is a ribbon category. It is known that Ꮿ, together with the braiding c and the twist θ , is a ribbon category (see [Ka], [Tu]). Actually, Ꮿ is the same as the category Uh (g)-Modf r in [Ka, Chapter XVII] or the category ᐂq g in [Tu, Chapter XI]. Both [Ka] and [Tu] use the h-adic version of quantum group, which is not suitable for studying the roots of unity case. 1.3.5. The variable q. In knot theory, another variable q = v 2 is usually used. The reader should not confuse this q with the quantum parameter used in the definition of quantum groups by several authors. For example, our q is equal to q 2 in [Ka] and [Tu]. In the expression “quantum invariant at an rth root of unity,” the rth root of unity is q (but not v). 1.4. Quantum link invariants. It is known that any ribbon category gives rise to operator invariants of framed links, whose components are colored by objects of the
281
QUANTUM LINK INVARIANTS
D0
D1
T ◦ T =
D2
T
D4
D3
T ⊗ T = T T
T
Figure 1. Elementary tangle diagrams, composition, and tensor product
category. We first review the definition, following [KM] and [Tu]. 1.4.1. Framed tangles. A tangle T is an oriented 1-manifold properly embedded (up to isotopy) in R2 ×[0, 1], with ∂T ⊂ 0 ×R×∂[0, 1]. Define ∂− T = T ∩(R2 ×0) and ∂+ T = T ∩ (R2 × 1), and call T a (k, l)-tangle if |∂− T | = k and |∂+ T | = l. Thus a link is a (0, 0)-tangle. A framed tangle is a tangle T equipped with a normal vector field that is standard (1, 0, 0) on ∂T . As usual, we consider framed tangles up to isotopy relative to the boundary. In R3 , there is a natural way to identify framings of a component with integers. A diagram of a tangle is its regular projection on 0×R2 , together with the information on over- or undercrossings. A diagram defines a blackboard framing in which the normal vector is always (1, 0, 0). A diagram of T is good if the blackboard framing is coincident with the framing of T . It is well known that every tangle diagram can be factored into the elementary diagrams D0 –D4 depicted in Figure 1 using the composition ◦ (when defined) and the tensor product ⊗ of diagrams. 1.4.2. Operator invariants of colored framed tangles. A coloring of a tangle T is an assignment of an object in the category Ꮿ to each component of T . This induces a coloring of ∂T as follows: If C is an arc of color M, then assign M to each endpoint of C where C is oriented down and assign the dual object M ∗ to each endpoint where C is oriented up. Tensoring from left to right, this gives the boundary objects T± assigned to ∂± T . By convention, the empty product is the unit in Ꮿ (the ground ring Q(v 1/D )). There exists a unique ᐁ-linear operator JT : T− → T+ , assigned to each colored framed tangle T , that satisfies JT ◦T = JT ◦ JT , JT ⊗T = JT ⊗ JT , and, for the tangles given by the elementary diagrams with blackboard framing, JD0 = id, JD2 = coevl ,
JD1 = evl ,
JD2 = coevr ,
JD1 = evr , JD3 = c,
JD4 = c−1 .
282
THANG T. Q. LE
Here we assume that D1 , D2 have orientation pointing from left to right, while D1 , D2 are the same tangles D1 , D2 but with reverse orientation. In particular, if L is a framed link with m ordered components, then JL (M1 , . . . , Mm ), for M1 , . . . , Mm ∈ Ꮿ, is in Q(v 1/D ). We see that only the braiding c and K˜ ±2ρ take part in the construction of JT . Many problems are thus reduced to questions about c and K˜ ±2ρ . Since in the product bases the matrices of c and K˜ ±2ρ have entries in Z[v ±1/D ], we see that JL (µ1 , . . . , µm ) is always a Laurent polynomial in v ±1/D , that is, JL ∈ Z[v ±1/D ]. Masbaum and Wenzl in [MW] proved this fact for g = sln using idempotent decompositions. Later we see that every link can be decomposed into pure braids and tangle diagrams without crossing points. We then prove that JT , when T is a pure braid, can be expressed through the twist θ only (no need to use the braiding). Thus one needs to use only the twist and K˜ ±2ρ . Both are simple, since their actions on highest-weight modules are easily described. 1.4.3. Relation to the Kontsevich integral. Quantum invariants of links can also be defined through the Kontsevich integral; see [LM] and [Ka, Chapter XX]. Roughly speaking, one first takes the (framed version) Kontsevich integral of a link L, then ¯ µ1 , . . . , ¯ µm of the Lie algebra plugs in the weight system coming from the modules g. The result, after a change of variable, is JL (µ1 , . . . , µm ). This approach avoids the theory of quantum groups (although the Kontsevich integral has its origin in quantum group theory). Some properties of quantum link invariants can be easily seen from this point of view. Some other properties, such as the ones proved in this paper, are easier to prove using quantum group theory. Actually, we do not know how to prove the results of this paper by using the Kontsevich integral theory. 1.4.4. The trivial knot. Suppose U is the trivial knot. Then JU (M) is called the quantum dimension of M; its value is well known: v (µ+ρ|α) − v −(µ+ρ|α) JU (µ ) = v (ρ|α) − v −(ρ|α) positive roots α (1.3) v 2(µ+ρ|α) − 1 −(µ|2ρ) =v v 2(ρ|α) − 1 positive roots α
2(µ+ρ|w(ρ)) w∈W sn(w)v = , 2(ρ|w(ρ)) w∈W sn(w)v
(1.4)
where W is the Weyl group and sn(w) is the sign of the linear transformation w. Noting that (µ | 2ρ) is always an integer, we get the following. Corollary 1.2. One has that JU (µ ) is either in Z[v 2 , v −2 ] or in vZ[v 2 , v −2 ]. More precisely, JU (µ ) ∈ v (µ|2ρ) Z[v 2 , v −2 ].
QUANTUM LINK INVARIANTS
283
1.4.5. Sum, tensor product, and framing. The following facts are well known; see [Ka], [Tu]. One has the sum formula JL (M ⊕ M , . . . ) = JL (M, . . . ) + JL (M , . . . ),
(1.5)
where the dots denote the colors of the components other than the first. Let T (2) be the link obtained from T by replacing the first component by two of its parallel push-offs (using the framing). Then one has the tensor product formula JT (M ⊗ N, . . . ) = JT (2) (M, N, . . . ).
(1.6)
Suppose L is obtained from L by increasing the framing of the first component by 1. Then one has the framing formula JL (µ , . . . ) = v (µ+2ρ|µ) JL (µ , . . . ).
(1.7)
1.4.6. (1, 1)-tangles. Suppose that T is a (1, 1)-tangle and that the open component of T is the first component whose color is M. Then JT is an operator from M to M, commuting with the action of ᐁ. When M is a simple ᐁ-module, JT is a scalar operator, and thus there is a scalar invariant J˜T (M, . . . ) ∈ Z[v ±1/D ] such that JT (M, . . . ) = J˜T (M, . . . ) × id . If we close the (1, 1)-tangle T to get a framed link L, then JL (M, . . . ) = J˜T (M, . . . ) × JU (M).
(1.8)
2. Integrality and symmetry 2.1. Integrality. By integrality we mean the integrality of the coefficients and the exponents of v in JL . Recall that D is the least natural number such that (µ | µ ) ∈ (1/D)Z for every µ, µ in the weight lattice X. 2.1.1. Weak integrality. We have seen that JL (M1 , . . . , Mm ) is always in Z[v ±1/D ]. A little stronger statement is the following. Proposition 2.1 (Weak integrality). The quantum invariant JL (µ1 , . . . , µm ) fractional) number delies in v f Z[v ±1 ] = q f/2 Z[q ±1/2 ], where f is a (generally termined by the linking matrix (lij )1≤i,j ≤m of L: f = 1≤i, j ≤m lij (µi | µj ). Proof. If x ∈ M ν , then K˜ ±2ρ (x) = v (±2ρ|ν) x. Note that (2ρ | ν) is an integer, since 2ρ, as the sum of all positive roots, is always in the root lattice Y . It follows that in any basis consisting of weight vectors, K˜ ±2ρ has entries in Z[v ±1 ]. The braiding c : µ ⊗ ν → ν ⊗ µ has entries in v (µ|ν) Ꮽ (see Proposition 1.1) in the product bases; its inverse has entries in v −(µ|ν) Ꮽ. Counting the positive and negative crossing points of a good diagram of L gives the desired result.
284
THANG T. Q. LE
2.1.2. Strong integrality. The weak integrality says that the quantum invariant is essentially a Laurent polynomial in v. The fractional power can be eliminated by a suitable normalization. We have here a stronger statement, which says that the quantum invariant is essentially a Laurent polynomial in v 2 . Theorem 2.2 (Strong integrality). The invariant JL (µ1 , . . . , µm) is in v p Z[v ±2 ] = q p/2 Z[q ±1 ], where p is a (generally fractional) number determined by the linking matrix lij of L: 1 p= lij µi | µj + (lii + 1)(2ρ | µi ) ∈ Z. D 1≤i, j ≤m
1≤i≤m
The proof, which is presented in §3.6, is much more difficult than that of the weak integrality. We have to use a geometric lemma about special presentation of links together with a result of Andersen on quantum groups at roots of unity. The use of quantum groups at roots of unity seems unnatural, since the strong integrality does not have anything to do with roots of unity (see also the remark in §3.6.7). Andersen constructed an algebra homomorphism from the quantum group at v = −ε to the quantum group at v = ε, where ε is some root of unity. Heuristically, this implies some kind of symmetry between −v and v, which leads to the fact that JL depends essentially only on v 2 . Remarks. (a) The factor q p/2 could be understood as the contribution of the diagonal part of the R-matrix. (b) If the link L is replaced by a tangle T , then one cannot get such nice results about the exponents as in the strong integrality theorem. (c) If we use the normalization JˆL µ1 , . . . , µm := v −p JL µ1 , . . . , µm , then JˆL is a link invariant with values in Z[v 2 , v −2 ]. Note that we can define JˆL only for simple modules in Ꮿ. The normalization JˆL does not behave well under the action of the Weyl group (see below), and we do not use it in the sequel. Corollary 2.3. Consider the knot case. Let JL (µ ) be the nonframed version of the quantum invariant of knots, normalized so that the unknot takes value 1, that is, JL (µ ) :=
JL0 (µ ) , JU (µ )
where L0 is the framed knot with framing zero and of knot type L. Then JL (µ ) ∈ Z[v ±2 ]. Remark. When the link L has more than one component, then in general, the quotient JL /JU (m) is not a Laurent polynomial, but rather a rational function in v 1/D . The following corollary is useful in the theory of quantum invariants of 3-manifolds.
QUANTUM LINK INVARIANTS
285
(See [Le3]; for the case g = sln , the corollary was proved in [Le2], using cabling.) Corollary 2.4. If all the µj ’s are in the root lattice, then JL (µ1 , . . . , µm ) is in Z[v ±2 ] = Z[q ±1 ]. Proof. The second term in the expression of the exponent p is in 2Z, since (ρ | µj ) is in Z. The first term is lij µi | µj = lii (µi | µi ) + 2 lij µi | µj . ij
i
i>j
Since (α | α) is even for every α in the root lattice, we see that the first term is in 2Z, too. Hence, the exponent p is an even number. 2.2. The first symmetry principle. Recall that q = v 2 . We show that if q is an rth root of unity, then JL has nice symmetry. 2.2.1. The Weyl group and the affine Weyl group. Let C be the fundamental chamber: C = x ∈ h∗R | 0 ≤ (x | αi ), i = 1, . . . , " . Then X+ = X ∩ C. The Weyl group W , by definition, is generated by reflections along the facets of C. It is a finite subgroup of the orthogonal group of h∗R , and C is a fundamental domain of it. Let Wr be the group of affine transformation of h∗R generated by W and the translation group rY . Since the root lattice is invariant under the action of W , one has Wr = W rY. Let α0 be the highest short root. When d = 1, that is, when all the roots have the same length, α0 is simply the highest root. The fundamental alcove is defined by Cr = x ∈ C | (x | α0 ) < r = x ∈ h∗R | 0 ≤ x, α < r for every positive root α . Its topological closure C¯ r is an "-simplex and is a fundamental domain of the affine Weyl group Wr . (See, e.g., [Kac, Chapter 6]; one has to apply the theory in [Kac] to the dual root lattice.) Moreover, Wr is generated by the reflections along the facets of C¯ r . 2.2.2. JL as a function on the weight lattice. Let us define, for µ1 , . . . , µm ∈ ρ + X+ ,
JL (µ1 , . . . , µm ) := JL µ1 −ρ , . . . , µm −ρ ∈ Z v ±1/D . The shift by ρ is more convenient for us. The formula is good only when all the µ1 , . . . , µm are in ρ + X+ = X ∩ C ◦ , where C ◦ is the interior of the fundamental chamber C. We extend the definition to every point in X as follows. If one of the µj is on the boundary of the chamber C, then let JL (µ1 , . . . , µm ) = 0. For every µ ∈ X, there exists w ∈ W such that w(µ) ∈ C; moreover, if w(µ) is in the interior of C, then such a w is unique. For arbitrary µ1 , . . . , µm ∈ X, choose
286
THANG T. Q. LE
w1 , . . . , wm ∈ W such that wj (µj ) ∈ X+ . Then define (recall that sn(w) is the sign of w) JL (µ1 , . . . , µm ) = sn(w1 ) · · · sn(wm )JL w1 (µ1 ), . . . , wm (µm ) . Using formulas (1.3), (1.4) for the unknot, we see that the formula
JU (µ) = v −(µ−ρ|ρ)
positive roots α
v 2(µ|α) − 1 v −2(ρ|α) − 1
(2.1)
is valid for every µ, not only in ρ + X+ , but also in X. 2.2.3. Another normalization. Let U (m) be the zero-framing trivial link of m components. Recall that QL (µ1 , . . . , µm ) := JL (µ1 , . . . , µm ) × JU (m) (µ1 , . . . , µm ). This normalization is more suitable for the study of quantum 3-manifold invariants and helps us to get rid of the ± sign in many formulas. Then QL is componentwise invariant under the action of the Weyl group: For every w1 , . . . , wm ∈ W , QL w1 (µ1 ), . . . , wm (µm ) = QL (µ1 , . . . , µm ). 2.2.4. First symmetry principle. Recall that q = v 2 . Suppose f, g belong to the same q a Z[q ±1 ], where a ∈ (1/2D)Z. We say that f = g at primitive rth roots of unity and write (r)
f =g if, for every primitive rth root of unity ξ , one has q −a f |q=ξ = q −a g|q=ξ . (r)
There is no need to fix a 2Dth root of ξ . When writing f = g, we always assume that f and g belong to the same q a Z[q ±1 ]. Theorem 2.5 (First symmetry principle). At primitive rth roots of unity, the quantum invariant QL is componentwise invariant under the action of the affine Weyl group Wr . This means, for every w1 , . . . , wm ∈ Wr , (r) QL w1 (µ1 ), . . . , wm (µm ) = QL (µ1 , . . . , µm ). (2.2) (r) If one of the µ1 , . . . , µm is on the boundary of C¯ r , then JL (µ1 , . . . , µm ) = 0.
Note that by the strong integrality, the left-hand side and the right-hand side of (2.2) belong to the same q a Z[q ±1 ]. We also show that JL is componentwise skew-invariant under the affine Weyl group: For every w1 , . . . , wm ∈ Wr , (r) JL w1 (µ1 ), . . . , wm (µm ) = sn(w1 ) · · · sn(wm )JL (µ1 , . . . , µm ).
287
QUANTUM LINK INVARIANTS
0
λ1 (a)
α1
0
rλ1 /2
rλ1
(b) Figure 2. The A1 case
Remark. One can drop the “primitive” in the statements of the theorem. 2.3. The second symmetry principle. Recall that C¯ r is a fundamental domain of the action of W on h∗R . Because of the first symmetry principle, at primitive rth roots of unity, it is enough to consider JL (µ1 , . . . , µm ) with µj in C¯ r ∩ X, a finite set. It turns out that we can do better. There is a finite group G acting on C¯ r , and although JL is not really invariant under this action, it behaves quite nicely. 2.3.1. The extended affine Weyl group and the center group G. Recall that Wr = W rY . Note that X is invariant under the action of the Weyl group. Let Wˆ r be the group generated by W and translation by rX. Then Wˆ r = W rX. If λ ∈ X and w ∈ W , then w(λ)−λ is in Y . This implies Wr is a normal subgroup of Wˆ r . We have an exact sequence 1 −→ Wr −→ Wˆ r −→ G −→ 1, where G = X/Y is the fundamental group of the root data. It is known that G is isomorphic to the center of the simply connected complex Lie group associated with g and that |G| = det(aij ). The group G for various Lie algebras is listed in Table 1. Taking the action of Wˆ r modulo the action of Wr , we get an action of G on C¯ r that can be described explicitly as follows. Suppose g˜ ∈ X is a lift of g ∈ G = X/Y . There is a unique w ∈ Wr such that w(C¯ r + r g) ˜ = C¯ r . Then, for µ ∈ C¯ r , g(µ) = w µ + r g˜ ∈ C¯ r . For g ∈ D and µ ∈ X, we define a scalar product (g | µ) := (g˜ | µ), which is well defined as an element in (1/D)Z/Z. Similarly, for g1 , g2 ∈ G, let (g1 | g2 ) = (g˜ 1 | g˜ 2 ) ∈ (1/D)Z/Z. 2.3.2. Examples of G and its action. Here we give examples of the cases A1 , A2 , and B2 . When g = sl2 (the A1 case), the space h∗R is one-dimensional. The basis root and the weight are depicted in Figure 2(a). The simplex C¯ r is the interval [0, rλ1 ]. The nontrivial element of G = Z2 acts as the reflection about the midpoint rλ1 /2; see Figure 2(b). When g = sl3 (the A2 case), the space h∗R is two-dimensional and d = 1. The basis roots and weights are depicted in Figure 3(a). The simplex C¯ r is the equilateral
288
THANG T. Q. LE
rλ2
α2
rλ1
α0
λ2
λ1 α1
0
(a)
(b) Figure 3. The A2 case
rλ1 /2
α1
α0 = λ 1
rλ2
β0 λ2 α2
(a)
0
Figure 4. The B2 case
(b)
triangle with vertices at 0, rλ1 , and rλ2 . The group G = Z3 acts as rotations by 2π ik/3 about the center point; see Figure 3(b). When g is of B2 type, the space h∗R is again two-dimensional, with d1 = 2, d2 = 1. The basis roots and weights are depicted in Figure 4(a). The simplex C¯ r is the triangle with vertices at 0, rλ1 /2, and rλ2 . The nontrivial element of G = Z2 acts as the reflection about the dashed line that separates C¯ r into two equal halves; see Figure 4(b). 2.3.3. Second symmetry principle Theorem 2.6. Suppose µ1 , . . . , µm ∈ C¯ r and g1 , . . . , gm ∈ G. Then QL g1 (µ1 ), . . . , gm (µm ) = v rt QL (µ1 , . . . , µm ),
(2.3)
at primitive rth roots of unity. Here t depends only on the linking matrix (lij ) of L:
QUANTUM LINK INVARIANTS
t = (r − h)
lij gi | gj + 2
1≤i, j ≤m
289
lij gi | µj − ρ ,
1≤i, j ≤m
with h being the Coxeter number of the Lie algebra g (see Table 1). Remark. The factor v rt = q rt/2 makes both sides of (2.3) belong to the same q a Z[q ±1 ]. For the special cases g = sl2 and g = sln , the theorem was proved by Kirby and Melvin [KM] and Kohno and Takata [KT1]. In [KM] and [KT1], the twisting factor v rt is derived by direct computations. Here we express the twisting factor through the scalar product in h∗R . We also have the result for every primitive rth root of unity. Since Wˆ r = W rX and since QL is componentwise invariant under W , (2.3) is equivalent to the statement that for x1 , . . . , xm ∈ X, QL (µ1 + rx1 , . . . , µm + rxm )
(r) r[(r−h) lij (xi |xj )+2 lij (xi |µj −ρ)]
=v
QL (µ1 , . . . , µm ).
(2.4)
Corollary 2.7. Suppose L has zero linking matrix. Then QL (µ1 , . . . , µm ), at primitive rth roots of unity, is invariant under componentwise action of Wˆ r . Corollary 2.8. If µj − ρ is in the root lattice and µj ∈ C¯ r , then (r) QL g1 (µ1 ), . . . , gm (µm ) = v r[ lij (gi |gj )] QL (µ1 , . . . , µm ). The corollary follows from Theorem 2.6, since the second term in the expression of t in the theorem is in 2Z. These corollaries have application in the study of quantum 3-manifold invariants. In a subsequent paper, we will use Corollary 2.8 to define the projective version of quantum invariants and its perturbative expansion. 2.4. Refined versions of symmetry principles. When (r, d) = 1, we can strengthen both symmetry principles. Actually, we formulate the result so that it includes the (d, r) = 1 case. In the refined versions, the symmetry groups are larger, actually, the largest possible. For example, if one wants to construct quantum invariants of 3manifolds, one has to use the refined versions because of the nondegeneracy property in Proposition 2.10 below. 2.4.1. Refined versions of Cr , Wr , Wˆ r . If (r, d) = 1, let X = X and Y = Y . If (r, d) = 1, then (r, d) = d > 1. In this case, let X (resp., Y ) be the Z-lattice generated by λi /di (resp., αi /di ), i = 1, . . . , ". In other words, if (r, d) = 1, then X is the lattice dual to Y with respect to our scalar product, that is, X = {x ∈ h∗R | (x | y) ∈ Z, for every y ∈ Y }, and similarly, Y is the lattice dual to X. If (d, r) = 1, then X , Y is a realization of the root lattice and the weight lattice of the dual root system whose Cartan matrix is the transpose of the original one.
290
THANG T. Q. LE
In any case, X ⊂ X , Y ⊂ Y . Note that X and Y are invariant under W , and if x ∈ X , then x − w(x) ∈ Y for every w ∈ W . Let us define Wr = W rY ,
Wˆ r = W rX .
Then Wr is a normal subgroup of Wˆ r , and we have an exact sequence 1 −→ Wr −→ Wˆ r −→ G −→ 1,
(2.5)
where G = X /Y . Note that G is always isomorphic to G. The lattices X , Y thus depend on the root data (of the Lie algebra g) and whether (r, d) = 1 or not. There is a unifying definition good for every r, as described in the following lemma, which is easy to prove. Lemma 2.9. One has that rX = x ∈ X | (x | y) ∈ rZ for every y ∈ Y , rY = y ∈ Y | (x | y) ∈ rZ for every x ∈ X . 2.4.2. The fundamental domain of Wr . If (r, d) = 1, let Cr = Cr . Otherwise, let Cr = x ∈ C | (x | β0 ) < r , where β0 is the long highest root. The closure C¯ r is a simplex and a fundamental domain of Wr . In any case Cr ⊂ Cr , and if (d, r) = 1, then Cr is strictly less than Cr . In fact, if (r, d) = 1, then it can be shown that the volume of Cr is "i=1 (d/di ) times that of Cr . One important property of Cr is the following nondegeneracy property, proved in [AP]. Proposition 2.10 (see [AP]). Suppose µ ∈ C¯ r and ε 2 is a primitive rth root of unity. Then the quantum dimension JU (µ)|v=ε = 0 if and only if µ is on the boundary of C¯ r . Proof. This follows from the explicit formula of the quantum dimension (2.1). This proposition shows that the set Cr (but not Cr in general) is exactly what should be used in the construction of the topological quantum field theory. 2.4.3. Actions of G and the quadratic form on G . Again, the exact sequence (2.5) leads to an action of G on C¯ r . Explicitly it can be described as follows. Suppose g˜ ∈ X ˜ = C¯ r . is a lift of g ∈ G = X /Y . There is a unique w ∈ Wr such that w(C¯ r + r g) Then g(µ) : = w(µ + r g) ˜ for every µ ∈ C¯ r . For g ∈ G and µ ∈ X (not X here), let us define the product (g | µ) : = (g˜ | µ), which is well defined as an element in Q modulo Z. If ζ is a 2Drth root of unity, then ζ 2Dr(g|µ) is well defined as a complex number.
291
QUANTUM LINK INVARIANTS
rλ1 /2
α1
α0 = λ 1
rλ1 /2
rλ2 /2
β0 λ2
(a)
rλ2
α2
0
0 (b): Cr
(c): Cr
Figure 5. The B2 case
Similarly, for g1 , g2 ∈ G , let (g1 | g2 ) = (g˜ 1 | g˜ 2 ), defined as an element in Q modulo (1/(d, r))Z. If ζ is a 2Drth root of unity and k an integer divisible by (d, r), then ζ 2Drk(g1 |g2 ) is well defined as a complex number. A little bit more difficult to see is that ζ Dkr(g|g) is well defined as a complex number for every g ∈ G . But this follows from the evenness of the quadratic form (· | ·) on the root lattice. When (r, d) = 1, we could drop all the primes. Note that all the “prime” versions, such as X , Y , Cr , Wr , . . . , depend on the root data and whether (r, d) = 1 or not. The group G is isomorphic to G; however, the scalar product on G depends on whether (r, d) = 1 or not. Let us consider an example when (r, d) = 1. Then d = 2 or 3. If we want G to not be trivial, then we are left with only two cases: the B" or C" series of Lie algebras. Let us consider the case of B2 (see Figure 5). In this case Cr is a half of Cr such that Cr is the triangle with vertices at 0, rλ1 /2, rλ2 /2. The nontrivial element of G = Z2 acts as the reflection about the dashed line in Figure 5(c). 2.4.4. Refined symmetry principles. Both symmetry principles remain valid if we replace Wr , Cr by Wr , Cr . However, one has to take care of the Coxeter numbers. Theorem 2.11 (Refined first symmetry principle). At primitive rth roots of unity, QL is componentwise invariant under actions of Wr : For µ1 , . . . , µm ∈ X, w1 , . . . , wm ∈ Wr , (r) QL w1 (µ1 ), . . . , wm (µm ) = QL (µ1 , . . . , µm ). (2.6) (r)
If one of the µj is on the boundary of C¯ r , then JL (µ1 , . . . , µm ) = 0. Since Wr ⊂ Wr , the refined version implies the nonrefined version. We also prove that JL is componentwise skew-invariant under Wr : For w1 , . . . , wm ∈ Wr , (r) (2.7) JL w1 (µ1 ), . . . , wm (µm ) = sn(w1 ) · · · sn(wm )JL (µ1 , . . . , µm ). Certainly this identity implies (2.6). It also implies the second statement of the
292
THANG T. Q. LE
theorem: If one of µ1 , . . . , µm , say, µ1 , is on the boundary of C¯ r , then there is reflection along a facet of C¯ r that fixes µ1 . The sign of any reflection is −1. Hence, (r)
JL (µ1 , . . . , µm ) = 0. Another way to prove the second statement is the following. By (1.8), JL = J˜T JU (µ1 ). If µ1 is on the boundary of C¯ r , then by Lemma 2.10, (r)
(r)
JU (µ1 ) = 0. Hence JL = 0. Again, because of the first symmetry, it is enough to restrict the colors to C¯ r when considering quantum invariants. Let h∨ be the dual Coxeter number of g (see Table 1). Theorem 2.12 (Refined second symmetry principle). Suppose µ1 , . . . , µm ∈ C¯ r and g1 , . . . , gm ∈ G . Then, at primitive rth roots of unity, QL g1 (µ1 ), . . . , gm (µm ) = v rt QL (µ1 , . . . , µm ), (2.8) where t is determined by the linking matrix of L: lij (gi | gj ) + 2 lij (gi | µj − ρ), t = (r − h ) i,j
i,j
with
h =
h dh∨
if (d, r) = 1, if (d, r) = 1.
Note that v rt in (2.8) is well defined as a complex number; see §2.4.3. Again, the factor v rt = q rt /2 makes both sides of (2.8) belong to the same q a Z[q ±1 ]. Since the action of G is obtained from the action of the extended affine Weyl group Wˆ r , let us describe how QL behaves under the action of Wˆ r . Recall that Wˆ r = W rX and that QL is componentwise invariant under W . We need only to describe how QL behaves under the translation group rX . Suppose x1 , . . . , xm ∈ X ; then, QL (µ1 + rx1 , . . . , µm + rxm ) (r)
= QL (µ1 , . . . , µm )v r[(r−h )
lij (xi |xj )+2
lij (xi |µj −ρ)]
.
(2.9)
The theorem certainly follows from this statement. 3. Proofs 3.1. Quantum groups at roots of unity. We recall the theory of quantum groups at roots of unity, following [An], [AP], and [Lu2] and then prove some auxiliary facts. 3.1.1. Quantum group at roots of unity and its category of modules. Suppose ε ∈ C is a number such that ε2 is an rth primitive root of unity. Then ε is either a primitive 2rth root of unity or a primitive rth root of unity. The latter can happen only when r is odd. Fix a number ζ such that ζ D = ε. If a ∈ (1/D)Z, then by ε a we mean ζ Da .
QUANTUM LINK INVARIANTS
293
Let ε ᐁ be the algebra Ꮽ ᐁ ⊗Ꮽ C, where C is considered as an Ꮽ-algebra by mapping v to ε. Then ε ᐁ is a Hopf C-algebra, called a quantum group at a root of unity (Lusztig’s version). The Cartan subalgebra of Ꮽ ᐁ is not generated (over Ꮽ) by the Kα alone; one needs the elements d −di t 1−s v Kαi i − v 1−s Kαi Kαi = ∈ Ꮽ ᐁ, t v sdi − v −sdi
s=1
where i = 1, . . . , " and t = 1, 2, 3 . . . . For a ε ᐁ-module M and a weight ν ∈ X, let ν, αi Kαi ν ν,α (x) = M = x ∈ M | Kα (x) = ε x and x , t t i where for x ∈ Z, t ∈ Z, t > 0, t ε di (x−s+1) − ε −di (x−s+1) x := . t i ε sdi − ε −sdi s=1
Let ε Ꮿ be the category of finite-dimensional ε ᐁ-modules M such that M = ⊕ν∈X M ν (p)
(p)
and that Ei (x) = Fi (x) = 0 on M for sufficiently large p. Using the same formulas as in the case over Q(v), we define dual modules and the evaluation and coevaluation maps. We define the twist ε θ and the braiding ε c using the same formulas of θ and c, replacing v 1/D by ζ . Then (ε Ꮿ, ε θ, ε c) is a ribbon category. In particular,
−1
ε θ (M, N) ε θ (M) ⊗ ε θ(N)
= ε c(N, M) × ε c(M, N).
(3.1)
3.1.2. Simple modules. In general, ε Ꮿ is not semisimple: there are modules in ε Ꮿ that are indecomposable, but not simple. Since Ꮽ λ is invariant under Ꮽ ᐁ, there is defined ε λ =Ꮽ λ ⊗C, which is a ε ᐁmodule in ε Ꮿ. Here λ is in X+ . Since ε is a root of unity, ε λ may not be simple. But ε λ always has a unique quotient Lλ that is a simple ε ᐁ-module. This ε ᐁ-module Lλ ∈ ε Ꮿ is also of highest weight λ. If λ = µ, then Lλ is not isomorphic to Lν . If λ ∈ Cr , then ε λ is a simple ε ᐁ-module, that is, Lλ = ε λ . 3.1.3. Composition factors and the twist ε θ. In general, if M ∈ ε Ꮿ, then M may not be a direct sum of simple modules. However, there is a decreasing sequence of submodules M = M0 ⊃ M1 ⊃ · · · ⊃ Mn = 0 such that Mi /Mi+1 is simple. The quotient Mi /Mi+1 is called a composition factor of M.
294
THANG T. Q. LE
In [AP], as a corollary of the linkage principle, it was proved that for every M ∈ ε Ꮿ, M= M[µ] . (3.2) µ∈Cr
Here M[µ] is the maximal submodule of M such that each composition factor of it is isomorphic to Lν with ν in the Wr -orbit of µ under the dot action, that is, ν = w(µ + ρ) − ρ for some w ∈ Wr . Lemma 3.1. If ν is in the Wr -orbit of µ (under the dot action), then ε(µ+2ρ|µ) = ε (ν+2ρ|λ) . Proof. Suppose ν = w(µ + ρ) − ρ. Using the fact that ε 1/D is a 2rDth root of unity, it is easy to check the statement for the case when w is in W and the case when w is a translation by a vector in rY . Note that the twist ε θ acts as the scalar ε(µ+2ρ|µ) on ε µ (see §1.3.3); hence, it acts as the same scalar on any composition factor of ε µ . Thus we get the following. Proposition 3.2. In the above notation, the twist ε θ acts as scalar ε(µ+2ρ|µ) on M[µ] . 3.1.4. Quantum link invariants. Recall that ε Ꮿ is a ribbon category. Thus if L is a framed link, then there is defined the invariant ε J L (M1 , . . . , Mm ) ∈ C, where M1 , . . . , Mm are in ε Ꮿ. Although we use the notation with ε, it is understood that ε J T depends on the choice of ζ , a Dth root of ε, since the twist and the braiding do. However, this is not essential, since one can always get rid of fractional powers of v by a suitable normalization. Obviously when Mj = ε µj , then ε J L ε µ1 , . . . , ε µm = JL µ1 , . . . , µm v 1/D =ζ , where the right-hand side means the value of JL (µ1 , . . . , µm ) at v 1/D = ζ . Many modules are not direct sums of ε λ . We can also define invariants of framed links colored by these modules. The presence of these modules helps us to relate values of quantum invariants at various µ. The simplest argument goes as follows. Suppose T is a framed (1, 1)-tangle whose open component is colored by λ . Then JT (λ ) is a scalar operator from λ to λ ,
JT (λ ) = J˜T (λ ) id, with J˜T (λ ) ∈ Z v ±1/D . Hence, when specialized at v 1/D = ζ , the map ε J T (ε λ ) : ε λ → ε λ is also a scalar operator (although ε λ may not be irreducible): ε J T ε λ = ε J˜T ε λ id .
QUANTUM LINK INVARIANTS
295
Here ε J˜T (ε λ ) is a complex number obtained from J˜L (λ ) by putting v 1/D = ζ . It follows that if M is a composition factor of ε λ , then ε J T (M) : M → M is also a scalar operator with the same scalar ε J˜T (ε λ ). In particular, (3.3) ε J˜T Lλ = ε J˜T ε λ . Proposition 3.3. If ε λ and ε µ have a common composition factor and if T is a (1, 1)-tangle, then ε J˜T ε λ = ε J˜T ε µ . 3.2. Lemmas on quantum dimensions and signs Lemma 3.4. Recall that h = h if (d, r) = 1 and h = dh∨ if (d, h) = 1. (a) For every x1 , x2 ∈ X , the number h (x1 | x2 ) is an integer. (b) For every x ∈ X , one has (2ρ | x) ≡ h (x | x) (mod 2)
and
(2ρ | x) ≡ dh∨ (x | x) (mod 2).
(c) For every x1 , x2 ∈ X (not X ), one has (h − h)(x1 | x2 ) ∈ Z,
and
(h − h)(x1 | x1 ) ∈ 2Z.
Proof. (a) Suppose (d, r) = 1. Then X = X, and (x1 | x2 ) ∈ (1/D)Z. The values of h = h and D in Table 1 show that h is divisible by D. Hence h (x1 | x2 ) ∈ Z. Now suppose (d, r) = 1; then (d, r) = d > 1. This means g is of type B, C, F4 , or G2 . Note that X is the Z-lattice generated by λi /di , and (λi /di | λj /dj ) = (1/di )(A−1 )ij , where A−1 is the inverse of the Cartan matrix. Explicit calculation shows that (λi /di | λj /dj ) ∈ (1/D )Z, where D = 2 for B" , F2 , G2 , and C" with " even, and D = 4 for C" with " odd. In any case, D divides dh∨ , and hence dh∨ /D ∈ Z. (b) Note that if the statement is true for x = x1 and x = x2 , then it is true for x = x1 +x2 . Hence, it is enough to restrict oneself to the case when x is in a basis set. If (d, r) = 1, a basis set is {λi , i = 1, . . . , "}; if (d, r) = 1, a basis set is {λi /di , i = 1, . . . , "}. One can easily check the statement for each simple Lie algebra. (c) If (r, d) = 1, then h = h and both statements are trivial. Suppose (r, d) = 1; then h = dh∨ . Again one needs only to verify the statements when x1 , x2 is in a basis set of X, say, x1 = λi , x2 = λj . Recalling that (λi | λj ) = (A−1 )ij dj , one can easily check both statements. Lemma 3.5. Recall that U is the unknot. Let µ ∈ X. At primitive rth roots of unity, one has JU (µ + ry) = JU (µ)
and
JU (µ + rx) = (−1)(2ρ|x) JU (µ) JU (w(µ)) = sn(w)JU (µ)
QU (µ + ry) = QU (µ) for every y ∈ Y , and and
QU (µ + rx) = QU (µ) for every x ∈ X , QU (w(µ)) = QU (µ) for every w ∈ Wr .
296
THANG T. Q. LE
For the QL version, the statements are much simpler, since there is no sign. Proof. The first two identities for JU follow from the formula (2.1), if we remember that r(x | α) ∈ rZ for every x ∈ X , α ∈ Y (see Lemma 2.9). The third identity for JU follows from the first one and the fact that JU is skew-invariant under the action of the Weyl group W . All the identities for QU follow from the corresponding ones for JU . 3.3. Proof of refined first symmetry principle. The proof utilizes results from [AP] in the theory of quantum groups. We focus only on the first component of L. Suppose the color of this component is µ. Cut the link at a point on the component to get a (1, 1)-tangle T . Then by formula (1.8) we have JL (µ) = J˜T (µ)JU (µ). Since at primitive rth roots of unity we have (see Lemma 3.5) JU w(µ) = sn(w)JU (µ), it is enough to show that J˜T (w(µ)) = J˜T (µ) at primitive rth roots of unity. Here w ∈ Wr . From [AP, Section 3] and [Ja, Chapter II] we know that if λ = w(µ), then there is a sequence of µ1 , . . . , µs such that Lλ−ρ and Lµ−ρ are composition factors of ε µ1 and ε µs , respectively, and two consecutive ε µj , ε µj +1 have a common composition factor. It follows from Proposition 3.3 that J˜(λ) = J˜(µ) at primitive rth roots of unity. This proves the refined first symmetry principle. 3.4. Proof of refined second symmetry principle. As argued in §2.4.4, we need to prove (2.9). We use a result of Lusztig, which we first recall. 3.4.1. A tensor product theorem. Recall that ε2 is a primitive rth root of unity. Let r , i = 1, . . . , " . Xr = x ∈ X+ | x, αi < (r, di ) One can check that (Cr ∩ X+ ) ⊂ Xr . It is easy to check that if ξ ∈ X+ , then there exists unique λ ∈ Xr and ν ∈ rX ∩X+ such that x = λ + ν. We denote ξ (0) = λ ∈ Xr and ξ (1) = ν/r ∈ X . Lusztig [Lu1] proved that, as ε ᐁ-modules, Lξ ∼ = Lλ ⊗ Lν . This is quite nontrivial and very different from the classical case. It is similar to Steinberg’s tensor product theorem for algebraic groups over fields of positive characteristic. In [Lu1] the proof is given only for the case (r, d) = 1. However, the proof can be generalized to the case (r, d) = 1 (see the arguments of [AP, Theorem 3.12]).
QUANTUM LINK INVARIANTS
297
3.4.2. The square of the braiding on Lλ ⊗ Lν . Assume that, as in §3.4.1, λ ∈ Xr and ν ∈ (rX ∩ X+ ). The square of the braiding ε c(Lν , Lλ ) ε c(Lλ , Lν ) is an operator acting on Lλ ⊗Lν , commuting with the action of ε ᐁ. But Lλ ⊗Lν = Lλ+ν is a simple module. Hence, the square of the braiding is a scalar operator, ε c(Lν , Lλ ) ε c(Lλ , Lν ) = bλ,ν id,
where bλ,ν ∈ C is a constant. For the tangle diagrams D3 , D4 of Figure 1 corresponding to c, c−1 , we have D3 = D32 D4−1 . It follows that ε J D3 (Lλ , Lν ) = bλ,ν ε J D4 (Lλ , Lν ).
(3.4)
This means the operator of a positive crossing and the one of a negative crossing are proportional. The proportional factor bλ,ν can be calculated as follows. By (3.1),
−1 (ε c)2 = ε θ(Lλ ⊗ Lν ) ε θ(Lλ ) ⊗ ε θ(Lν ) . Using Lλ ⊗ Lν = Lλ+ν and the fact that ε θ acts on Lµ as the scalar ε (µ+2ρ|µ) (see §1.3.3), we see that bλ,ν = ε 2(λ|ν) .
(3.5)
3.4.3. The square of the braiding on Lν ⊗ Lν . We continue to assume that, as in the previous subsection, ν ∈ rX ∩ X+ . We show that (ε c)2 acts as a scalar operator on Lν ⊗ Lν . It is enough to show that ε θ(Lν ⊗ Lν ) is a scalar operator, since
−1 c2 = θ(Lν ⊗ Lν ) θ(Lν ) ⊗ θ(Lν ) . The structure of the module Lν and its tensor powers can be understood by classical Lie theory, via the quantum Frobenius map (see [Lu2, Chapter 35]). Every weight of Lν must be of the form ν − α, where α ∈ rY . The tensor product Lν ⊗ Lν is completely reducible: L ν ⊗ L ν = ⊕ τ Lτ , where τ ∈ 2ν − rY . The twist θ acts on Lτ as a scalar operator, with the scalar ε(τ +2ρ|τ ) (see Proposition 3.2). When τ ∈ 2ν −rY , it is easy to see that the scalar is always equal to ε (2ν+2ρ|2ν) . This means θ (Lν ⊗ Lν ) is a scalar operator with the scalar ε (2ν+2ρ|2ν) . So we have c2 = bν id, where the value of bν can be calculated (bν = ε2(ν|ν) ; we do not need this value). It follows that JD3 (Lν , Lν ) = bν JD4 (Lν , Lν ). For example, suppose T is a (1, 1)-tangle with framing zero. Let us calculate ˜ J ε T (Lν ). We switch over- or undercrossing at some points in a good diagram of T to get the trivial (1, 1)-tangle. With each switching we have to multiply the quantum invariant by bν or (bν )−1 . Since the framing is zero, we conclude that ε J˜T (Lν ) = 1.
298
THANG T. Q. LE
3.4.4. Reduction to the framing-zero case. We show here that if (2.9) is true for a link L, then it holds true for any link obtained from L by altering the framing of the components. It is sufficient to consider the case when we increase the framing of the first component by 1. Then the left-hand side of (2.9) is multiplied by a LHS = ε (µ1 +rx1 +ρ|µ1 +rx1 −ρ) and the right-hand side by
a RHS = ε (µ1 +ρ|µ1 −ρ) ε r[(r−h )(x1 |x1 )+2(x1 |µ1 −ρ)] . Hence, to show that a LHS = a RHS it is enough to prove that
1 = ε r[h (x1 |x1 )+2(x1 |ρ)] , which follows from Lemma 3.4. (The term in the square bracket of the exponent is divisible by 2 by Lemma 3.4.) 3.5. A special case. By virtue of the result of the previous subsection, we assume from now on that 0 = l11 = l22 = · · · . Suppose ξ ∈ X+ . Then ξ = ξ (0) + rξ (1) (see the notation in §3.4.1). In this subsection we assume that µ2 , . . . , µm ∈ X+ . We show that (r) QL ξ , µ2 , . . . , µm = v 2rκ QL ξ (0) , µ2 , . . . , µm , (3.6) where κ=
l1j ξ (1) | µj .
j
By Lemma 2.9, r(ξ (1) | α) ∈ rZ for every α ∈ Y . Since w(µj ) − µj is in Y (1) (1) for every w ∈ Wr , we have that ε2r(ξ |µj ) = ε2r(ξ |w(µj )) . It follows that ε 2rκ is invariant under the action of Wr . Thus using the refined first symmetry principle, we see that to prove (3.6) we can assume that µ2 , . . . , µm are in Cr . In this case ε µj = Lµj , j = 2, . . . , m. Hence, to prove (3.6) one just needs to show that 2rκ (3.7) ε QL ε ξ , Lµ2 , . . . , Lµm = ε ε QL ε ξ (0) , Lµ2 , . . . , Lµm . Cut L at a point on the first component to get a (1, 1)-tangle T . From Lemma 3.5 we know that ε QU (ε ξ ) = ε QU (ε ξ (0) ). Hence, by formula (1.8), identity (3.7) is equivalent to 2rκ ˜ (3.8) ε J˜T ε ξ , Lµ2 , . . . , Lµm = ε ε J T ε ξ (0) , Lµ2 , . . . , Lµm . Using (3.3) we can replace ε ξ and ε ξ (0) by, respectively, Lξ and Lξ (0) in (3.8). The Lusztig theorem says Lξ = Lν ⊗ Lξ (0) , where ν = rξ (1) . By the tensor product formula (1.6), we have ε J˜T Lξ , Lµ2 , . . . , Lµm = ε J˜T (2) Lν , Lξ (0) , Lµ2 , . . . , Lµm .
299
QUANTUM LINK INVARIANTS
There are two parallel push-offs of the first component of T ; let us denote the one colored with Lν by K. If we remove K, then from T (2) we get T . In the tangle diagram T (2) , consider a crossing point of K with the j th component whose color is Lµj . By formula (3.4), switching over- or undercrossing results in a factor bµj ,ν or its inverse. Switching over- or undercrossing to unlink the component K from other components, from T (2) we get T . Then we have ε J˜T (2)
" l Lν , Lξ (0) , Lµ2 , . . . , Lµm = bµj ,ν 1j × ε J˜T Lν , Lξ (0) , Lµ2 , . . . , Lµm
j =1
=
"
bµj ,ν
l1j
ε J˜K (Lν )ε J˜L
j =1
Lξ (0) , Lµ2 , . . . , Lµm . (3.9)
In §3.4.3 we showed that ε J˜K (Lν ) = 1. Using the values of bµ,ν in (3.5), from (3.9) we get (3.8). 3.5.1. The case when x2 = · · · = xm = 0. We prove (2.9) by assuming that x2 = · · · = xm = 0. Recall that the framings of L are zero. In this case, (2.9) reads (r) QL µ1 + rx1 , µ2 , . . . , µm = v 2r [ j l1j (x1 |µj −ρ)] QL µ1 , µ2 , . . . , µm .
(3.10)
By the refined first symmetry principle, QL is invariant under the translation by rY . Hence, we can further assume that µ1 + rx1 and all µ1 , . . . , µm are in the interior of the fundamental chamber C, that is, they are in ρ + X+ . Replacing µj by µj − ρ, we see that (3.10) is equivalent to (r) QL µ1 +rx1 , µ2 , . . . , µm = v 2r[ j l1j (x1 |µj )] QL µ1 , µ2 , . . . , µm . (3.11)
Note that (µ1 +rx1 )(0) = (µ1 )(0) and (µ1 +rx1 )(1) = (µ1 )(1) +x1 . Applying formula (3.6) for ξ = µ1 +rx1 and for ξ = µ1 and then comparing the right-hand sides of the resulting identities, we get (3.11). 3.5.2. End of proof of second symmetry principle. We continue to assume that the framings are 0, l11 = · · · = lmm = 0. The result of the previous subsection certainly holds true if we replace the first component by any component. Successively adding rx1 to µ1 , rx2 to µ2 , and so on, we get (r) QL µ1 + rx1 , . . . , µm + rxm = v 2rτ QL µ1 , . . . , µm , where τ=
i,j
lij xi | µj − ρ + lij xi | rxj . i>j
(3.12)
300
THANG T. Q. LE
...
...
β
β
Figure 6. Braid closure
By Lemma 3.4, h (xi | xj ) ∈ Z, and hence 2r xi | xj ≡ 2 r − h xi | xj (mod 2), and eventually 2r
xi | xj ≡ (r − h ) lij xi | xj (mod 2).
i>j
i,j
This means that when v 2 isan rth root of unity, the second term in the formula of τ can be replaced by (r −h ) i,j lij (xi | xj ), and (3.12) becomes (2.9). This completes the proof of the refined second symmetry principle. Consider the nonrefined version. Dividing the right-hand side of (2.9) by the right −h) l (x |x ) ij i j . By Lemma 3.4(c), if all the x ’s hand side of (2.4), the quotient is v r(h i are in X and v 2r = 1, then v r(h −h) lij (xi |xj ) = 1. Hence (2.9) implies (2.4), which, in turn, implies the nonrefined version of the second symmetry principle. 3.6. Proof of the strong integrality 3.6.1. Presentation of links as plat closures of pure braids Proposition 3.6. Every nonframed link has a diagram of the form Tu ◦ T ◦ Tl , where Tu and Tl do not have any crossing and T is a pure braid. Proof (W. Menasco). First consider the case when L has only one component, that is, L is a knot. Then L is the braid closure of a braid β (see Figure 6). The natural projection from the braid group to the symmetric group maps β to an element with only one cycle, since L is a knot. Any two such elements are conjugate in the symmetric group. Since braids of the same conjugacy class have the same closure, we may assume that the projection of β onto the symmetric group is the permutation (12 · · · n). This means, after some over- or undercrossing switchings, from β we get a braid isotopic to β described in Figure 7(a). The isotopy can be assumed to be horizontal. The closure of β is presented in Figure 7(b); it is a trivial knot. It could be horizontally isotoped into the diagram in Figure 7(c) and, eventually, into the one in
301
QUANTUM LINK INVARIANTS
···
···
(a)
(b)
(c)
(d)
Figure 7. The trivial knot
a pure braid
(a)
a pure braid
(b)
(c)
Figure 8. The plat closure
Figure 7(d). Now from this picture we go back to L by horizontal isotopy inside the parallel strip, as indicated in Figure 8(a), and undo the over- or undercrossing switchings using finger moves (see Figure 8(b)). We get the desired presentation (see Figure 8(a)). The proof for the case when L has many components is quite similar. The result, for a link of two components, is described in Figure 8(c). 3.6.2. Quantum invariants of pure braid. Suppose T is a pure braid whose components are colored by M1 , . . . , Mn ∈ Ꮿ. Then JT (M1 , . . . , Mn ) is an operator acting on the vector space M1 ⊗ · · · ⊗ Mn . We show here that JT can be expressed though the twist θ alone. If T is the square of D1 , that is, T is a full twist (see Figure 9(a)), then (see §1.3.3)
−1 JT (M1 , M2 ) = θ(M1 ) ⊗ θ(M2 ) θ(M1 ⊗ M2 ). Hence, in this case JT can be expressed through θ alone. Similarly, if T = D22 , then JT can be expressed through θ. If T is the tangle in Figure 9(b), which is obtained from the one in Figure 9(a) by taking parallels, then JT can also be expressed through θ alone, by the tensor product formula. Here the band stands for a bunch of parallel lines. Now we claim that every pure braid can be obtained from those in Figure 9(b) and their mirror images by using composition and tensor product. In fact, the pure
302
THANG T. Q. LE
(a)
(b)
(c)
(d)
Figure 9. The full twist and its parallels
braids depicted in Figure 9(c) and their mirror images generate every pure braid (using composition and tensor product). But these pure braids can be expressed through the ones of Figure 9(b), as shown in Figure 9(d) for a simple case. Hence JT , when T is a pure braid, can be expressed through θ. Using Lemma 3.6 we see that for a link L, up to a framing factor, JL can be expressed through θ and K±2ρ . 3.6.3. The map ϕ. Let ϕ : Z[v ±1/D ] → Z[v ±1/D ] be the algebra homomorphism defined by ϕ(v 1/D ) = eπi/D v 1/D . Then ϕ(v) = −v and ϕ 2D = id. Hence, the space Z[v ±1/D ] decomposes into eigenspaces of ϕ, whose eigenvalues are 2Dth roots of unity, eaπi , with a = 0, 1/D, . . . , (2D − 1)/D. The eigenspaces of ϕ are v a Z[v ±2 ]. Also x ∈ Z[v ±1/D ] is in the eigenspace v a Z[v ±2 ] if and only if ϕ(x) = eaπi x. Thus to prove the strong integrality theorem, one needs to show that ϕ JL µ1 , . . . , µm = eaπi JL µ1 , . . . , µm , (3.13) with a=
i,j
1 lij µi | µj + (lii + 1)(2ρ | µi ) ∈ Z. D i
Since both sides of (3.13) are Laurent polynomials in v 1/D , it is enough to prove (3.13) when v 1/D = e2πi/2Dr for every sufficiently large odd r. 3.6.4. The algebra homomorphism ϕ. ¯ Let us fix an odd integer r. Let ε be a primitive 2rth root of unity. Then −ε is a primitive rth root of unity. Andersen in [An] showed that there is an algebra homomorphism ϕ¯ : −ε ᐁ −→ ε ᐁ with the following properties. If M is a ε ᐁ-module, then pulling back via ϕ, ¯ we get a −ε ᐁ-module ϕ¯ ∗ (M). If M is the highest-weight module of highest weight µ, that is, M = ε µ , then ϕ¯ ∗ (M) is also the highest-weight −ε ᐁ-module of highest weight µ, that is, ϕ¯ ∗ (ε µ ) = −ε µ . Note that both M and ϕ¯ ∗ (M) have the same underlying vector space over C.
QUANTUM LINK INVARIANTS
303
In order to consider quantum invariants of links, we need to fix a Dth root of ε and a Dth root of −ε. Fix an arbitrary Dth root ζ of ε, and choose ζ = eπi/D ζ = ϕ(v)|v 1/D =ζ as the Dth root of −ε. Now we can define ε θ, ε c, −ε θ, −ε c. In general, ϕ¯ does not commute with the coproduct. If M and N are two ε ᐁmodules, there may be two different −ε ᐁ-module structures on M ⊗C N, one via the coproduct of −ε ᐁ (the usual one) and one via ϕ¯ and the coproduct of ε ᐁ. However, we have the following. Lemma 3.7. Let Mj = ε µj , j = 1, . . . , n. The space M1 ⊗ · · · ⊗ Mn has two −ε ᐁ-module structures as described above. Then the twist −ε θ acts the same way in the two different module structures. Similarly, every Kβ , β ∈ Y acts the same way in the two different module structures. Proof. The statement for Kβ follows from the fact that for Kβ , the map ϕ¯ commutes with ., ϕ(.(K ¯ ¯ β )), which, in turns, follows from the definition β )) = .(ϕ(K l+1 of ϕ¯ : ϕ(K ¯ β ) = Kβ (see [An]). The statement for θ follows from the fact that the action of θ is totally determined by the highest weight (see Proposition 3.2). One needs to decompose M1 ⊗ · · · ⊗ Mn using (3.2) and applying Proposition 3.2. 3.6.5. The action of the twist. Again let Mj = ε µj , j = 1, . . . , n. There are two actions of −ε ᐁ on M1 ⊗ · · · ⊗ Mn . By the result of the previous subsection, the twist −ε θ acts the same way in the two structures. On this same vector space, M1 ⊗· · ·⊗Mn acts the twist ε θ of ε ᐁ. εθ
Proposition 3.8. Let Mj = ε µj . On M1 ⊗ · · · ⊗ Mn , the two operators −ε θ and are proportional: −ε θ
= eπi(µ1 +···+µn +2ρ|µ1 +···+µn ) ε θ.
Proof. As ε ᐁ-modules, one has (see (3.2)) M1 ⊗ · · · ⊗ Mn =
M[ν] .
ν∈Cr
On M[ν] , ε θ acts as the scalar ζ D(ν+2ρ|ν) (see Proposition 3.2). On that same subspace M[ν] , −ε θ acts (through ϕ) ¯ as the scalar (ζ )D(ν+2ρ|ν) = eπi(ν+2ρ|ν) ζ D(ν+2ρ|ν) . Hence on M[ν] , −ε θ
= eπi(ν+2ρ|ν) ε θ.
(3.14)
Note that µ1 + · · · + µn − ν is in the root lattice. Using the fact that the scalar product of a vector in the root lattice and a vector in the weight lattice is always in Z, one can easily show that (ν + 2ρ | ν) ≡ µ1 + · · · + µn + 2ρ | µ1 + · · · + µn (mod 2Z).
304
THANG T. Q. LE
It follows that the proportional factor eπi(ν+2ρ|ν) in (3.14) does not depend on ν and is always equal to eπi(µ1 +···+µn +2ρ|µ1 +···+µn ) . This proves the proposition. 3.6.6. Pure braid. Suppose that a framed tangle T has a good diagram that is a pure braid on n strands, and suppose that the strands are colored by Mj , which are ε ᐁ-modules. Then ε JT (M1 , . . . , Mn ) is an operator acting on M1 ⊗ · · · ⊗ Mn . Using ϕ, ¯ one can consider T to be colored by −ε ᐁ-modules ϕ¯ ∗ (Mj ). Hence, there is defined the operator −ε JT acting on the same space M1 ⊗ · · · ⊗ Mn . Proposition 3.9. Suppose tij is the linking number of the ith and the j th components of the pure braid T . Suppose also that Mj = ε µj . Then on M1 ⊗ · · · ⊗ Mn the two operators −ε JT and ε JT are proportional:
where b =
1≤i<j ≤n 2tij
−ε JT
= ebπi ε JT ,
µi | µj ).
Proof. Since the pure braids in Figure 9(b) generate every pure braid, we assume T is as in Figure 9(b). We suppose that the band has n−1 parallel lines whose colors are M1 , . . . , Mn−1 , and the remaining line has color Mn . Then by the tensor product formula (1.6), ±ε JT (M1 , . . . , Mn−1 , Mn )
=
−1
±ε θ (M1 ⊗ · · · ⊗ Mn−1 ) ⊗ ±ε θ(Mn )
±ε θ(M1 ⊗ · · · ⊗ Mn−1 ⊗ Mn ).
Using the relation between ε θ and −ε θ in Proposition 3.8, we get the desired result. 3.6.7. End of proof of the strong integrality theorem. As noted in §3.6.3, we need to prove (3.13). Using the framing formula (1.7), it is easy to check that if (3.13) holds true for a framed link L, then it does for every framed link obtained from L by altering the framing. Hence, we may assume L has any framing we wish. By Lemma 3.6, we can assume that L, as a nonframed link, has a diagram D = Tu ◦ T ◦ Tl , where T is a pure braid diagram, and Tu and Tl do not have any crossing points. Alter the framing of L so that D is a good diagram of it. Then the framing ljj of the j th component is always even, since T is a pure braid. We know that the operators −ε JT and ε JT are proportional. Now we prove that ε JTu and ε JTl are proportional to −ε JTu and −ε JTl , respectively. Let us consider a diagram corresponding to a maximal or a minimal point. The corresponding operator involves only K˜ ±2ρ . Suppose the component is colored by ε λ . Then ε K˜ ±2ρ (x) = ε ±(2ρ|ν) x if x ∈ (ε λ )ν . Similarly, −ε K˜ ±2ρ (x) = (−ε)±(2ρ|ν) x. Note that if ν is a weight, then µ − ν ∈ Y . Using the fact that (2ρ | α) ∈ 2Z for every α ∈ Y , we see that ˜ ±2ρ is proportional to ε K˜ ±2ρ on ε λ , with the proportional factor (−1)(2ρ|µ) . −ε K The proportional factor does not depend on the sign of ±2ρ. Thus ε JTu and ε JTl are proportional to −ε JTu and −ε JTl , respectively.
305
QUANTUM LINK INVARIANTS
Now it is clear that κ := scalar factors:
−ε JL /ε JL
−ε JL ε JL
=
−ε JT ε JT
can be presented as the product of three ×
−ε JTu ε JTu
×
−ε JTl ε JTl
.
(3.15)
The first factor can be calculated using Proposition 3.9. Let us calculate the product of the second and third factors. If we replace T by the trivial pure braid T , then we get a trivial link L of m components. The value of JL is known, and one has −ε JL ε JL
= (−1)(2ρ|µ1 +···+µm ) = eπi(2ρ|µ1 +···+µm ) .
Here µ1 , . . . , µm are colors of the link. Applying (3.15), with T replaced by T , we see that the product of the second and the third factor is eπi(2ρ|µ1 +···+µm ) . Using Proposition 3.9 to calculate the first factor, we see that
κ = eπi[
1≤i, j ≤m lij (µi |µj )+(2ρ|µ1 +···+µm )]
.
Remember that lii is even, and (2ρ | µi ) is always an integer. We can alter the second term in the square bracket to get the value κ = eπi[
1≤i, j ≤m lij (µi |µj )+(2ρ|(l11 +1)µ1 +···+(lmm +1)µm )]
= eaπi ,
where eπia is the one in (3.13). Thus we have −ε JL
= eπia ε JL .
(3.16)
If we replace v 1/D by ζ in (3.13), then the right-hand side becomes eπia ε JL . The left-hand side, remembering that ϕ is an algebra homomorphism, is JL |v 1/D =ζ . The latter is −ε JL . Hence (3.16) implies that (3.13) holds true if v 1/D = ζ . Since ζ can take any 2Drth root of unity with r odd, (3.13) must hold true for every v 1/D . This completes the proof of the strong integrality theorem. Remark. As noted earlier, the use of roots of unity in the proof of the strong integrality seems very artificial. One could avoid roots of unity if the following question has an affirmative answer. Question. Is it true that in the product of the canonical bases of µ1 ⊗ · · · ⊗ µm , the twist θ has entries in v (µ1 +···+µm +2ρ|µ1 +···+µm ) Z[v ±2 ]? References [An] [AP] [Ja]
H. Andersen, Quantum groups at roots of ±1, Comm. Algebra 24 (1996), 3269–3282. H. Andersen and J. Paradowski, Fusion categories arising from semisimple Lie algebras, Comm. Math. Phys. 169 (1995), 563–588. J. Jantzen, Representations of Algebraic Groups, Pure Appl. Math. 131, Academic Press, Boston, 1987.
306 [Jo] [Kac] [Ka] [KM] [KT1] [KT2]
[Ko] [Le1] [Le2] [Le3] [LM] [Lu1]
[Lu2] [MR] [MW] [Mu] [Oh1] [Oh2]
[RT1] [RT2] [TY] [Tu] [Yo]
THANG T. Q. LE V. Jones, Hecke algebra representations of braid groups and link polynomials, Ann. of Math. (2) 126 (1987), 335–388. V. Kac, Infinite-Dimensional Lie Algebras, 3d ed., Cambridge Univ. Press, Cambridge, 1990. C. Kassel, Quantum Groups, Grad. Texts in Math. 155, Springer-Verlag, New York, 1995. R. Kirby and P. Melvin, The 3-manifold invariants of Witten and Reshetikhin-Turaev for sl(2, C), Invent. Math. 105 (1991), 473–545. T. Kohno and T. Takata, Symmetry of Witten’s 3-manifold invariants for sl(n, C), J. Knot Theory Ramifications 2 (1993), 149–169. , “Level-rank duality of Witten’s 3-manifold invariants” in Progress in Algebraic Combinatorics (Fukuoka, 1993), Adv. Stud. Pure Math. 24, Math. Soc. Japan, Tokyo, 1996, 243–264. M. Kontsevich, “Vassiliev’s knot invariants” in I. M. Gel’fand Seminar, Adv. Soviet Math. 16, Part 2, Amer. Math. Soc., Providence, 1993, 137–150. T. T. Q. Le, On denominators of the Kontsevich integral and the universal perturbative invariant of 3-manifolds, Invent. Math. 135 (1999), 689–722. , On perturbative PSU(n) invariants of rational homology 3-spheres, to appear in Topology. , Relation between quantum and finite type invariants, in preparation. T. T. Q. Le and J. Murakami, The universal Vassiliev-Kontsevich invariant for framed oriented links, Compositio Math. 102 (1996), 41–64. G. Lusztig, “Modular representations and quantum groups” in Classical Groups and Related Topics (Beijing, 1987), Contemp. Math. 82, Amer. Math. Soc., Providence, 1989, 59–77. , Introduction to quantum groups, Progr. Math. 110, Birkhäuser, Boston, 1993. G. Masbaum and J. Roberts, A simple proof of integrality of quantum invariants at prime roots of unity, Math. Proc. Cambridge Philos. Soc. 121 (1997), 443–454. G. Masbaum and H. Wenzl, Integral modular categories and integrality of quantum invariants at roots of unity of prime order, preprint, 1997. H. Murakami, Quantum SO(3)-invariants dominate the SU(2)-invariant of Casson and Walker, Math. Proc. Cambridge Philos. Soc. 117 (1995), 237–249. T. Ohtsuki, A polynomial invariant of rational homology 3-spheres, Invent. Math. 123 (1996), 241–257. , “A filtration of the set of integral homology 3-spheres” in Proceedings of the International Congress of Mathematicians (Berlin, 1998), Vol. 2, Documenta Mathematica, Bielefeld, 1998, 473–482; available from http://www.mathematik.unibielefeld.de/documenta/. N. Yu. Reshetikhin and V. Turaev, Ribbon graphs and their invariants derived from quantum groups, Comm. Math. Phys. 127 (1990), 1–26. , Invariants of 3-manifolds via link polynomials and quantum groups, Invent. Math. 103 (1991), 547–597. T. Takata and Y. Yokota, The PSU(n) invariants of 3-manifolds are polynomials, preprint, Kyushu University, 1996. V. G. Turaev, Quantum Invariants of Knots and 3-Manifolds, de Gruyter Stud. Math. 18, de Gruyter, Berlin, 1994. Y. Yokota, Skeins and quantum SU(N) invariants of 3-manifolds, Math. Ann. 307 (1997), 109–138.
Department of Mathematics, State University of New York, Buffalo, New York 14214, USA;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY MICHELE GRASSI Introduction. To a smooth manifold M one can associate in a natural way a new smooth manifold, the manifold of k-jets of n-dimensional submanifolds of M, (k) indicated by Gn (M), which parametrizes in a smooth way the k-jets of immersed (k) submanifolds of M. On Gn (M) one can build in a canonical way a differential ideal, (k) denoted Ᏽ(k) . The cohomology associated to the complex Gn (M)/Ᏽ(k) is called characteristic cohomology. These ideas, which in part go back to [C], are explained here. A more detailed introduction to them can be read, for example, in the introduction to [BG1] or in [BGH1] (see also [BG2], [BGH2], [BGH3]). Characteristic cohomology appears in this picture as a cohomological tool to study n-dimensional submanifolds of M. In this context one should think of submanifolds as solutions to systems of PDEs (partial differential equations). For example, they could be integral manifolds of an integrable distribution or of a differential ideal. Characteristic cohomology (or a variation of it) can then be used to provide invariants for the system of PDEs. The notation for the qth characteristic cohomology group over a smooth manifold M is (k) H q ∗ G(k) n (M) /Ᏽ , d . A first step in the study of characteristic cohomology is to see if something such as a Poincaré lemma holds for it; that is, if the characteristic cohomology vanishes (k) on contractible open subsets of Gn (M). More precisely, it has been conjectured by (k) Griffiths [Gri] that for any contractible open set ᐁ ⊂ Gn (M) one has H q ∗ (ᐁ)/Ᏽ(k) (ᐁ), d = (0) when 0 ≤ q < n. For dim(M) − n = k = 1 the result is classical. Griffiths and his collaborators [Gri] proved it when n = 1 and k, dim(M) is arbitrary, when k = 1 and n, dim(M) is arbitrary, and finally when dim(M) = 3, k = 2, and n = 2. In Section 5, we prove precisely what was conjectured for all k, n, and dim(M). For index q = n, the result is no longer true. This is because on a smooth manifold, one can have many independent functionals on n-dimensional submanifolds, even locally. The extremely rich structure of the nth characteristic cohomology group is probably the next object to research. Some intriguing conjectures on this topic, formulated by Griffiths, suggest possible approaches. Following this line of ideas, Received 13 March 1998. Revision received 23 June 1999. 1991 Mathematics Subject Classification. Primary 35A27, 13C99; Secondary 18G40, 35A15, 58D10. 307
308
MICHELE GRASSI
Griffiths has obtained some promising partial results, which are unfortunately still short of a general answer. In Section 6, we include some simple facts about the nth characteristic cohomology group. In our proof, there is a strong interplay among rather distant areas of mathematics. As we have seen, the motivation for studying characteristic cohomology comes from differential geometry and PDE theory. However, when one localizes to a small open set, things change abruptly. By the right choice of a weight defined on the bundle of differential forms on the open set, the characteristic weight, one obtains a filtration on the complex computing characteristic cohomology, and the associated graded complex is a completely algebraic object (see Lemma 4.7). The algebraic problem at this point is not solved easily. In Lemma 4.13, we translate the problem into one in which we must compute the Koszul homology group of a family of modules over a polynomial ring. To define these modules, we choose to introduce some basic notions from Hopf algebra theory. From the perspective of commutative algebra, the modules in question are of an unusual nature. Therefore, to compute their Koszul homology, one has to develop some ad hoc homological tools— here, Theorem 2.9. Using it, we may prove the main vanishing theorem for Koszul homology, Theorem 3.9, which gives the desired local vanishing for characteristic cohomology, Theorem 5.1. Finally, a word of caution: There are some other results (see, for example, [BG1, p. 555]) that resemble Theorem 5.1. However, the fundamental difference between the result in [BG1] and Theorem 5.1 is that in [BG1] the authors consider the infinite prolongation case (see also Section 5). Equivalently, one can allow the order of jets to increase by a (fixed and often easily computable) amount. Still equivalently, one could ask that the forms in question come from lower order jets. This approach not only simplifies the transition to an algebraic problem but more importantly leads to an algebraic problem of a different nature. In the language of this work, such an algebraic problem would amount to proving the vanishing of Koszul homology for a free module, which is a known algebraic fact. In [BG1], however, the authors consider characteristic cohomology in a different context (see, for example, [BG1, Theorem 2, p. 562]) to which the methods of this work would not apply, at least directly. Acknowledgements. A special thank you goes to Professor M. L. Green for his support, help, and patience in 1996–1997 during the preparation of my Ph.D. thesis at the University of California Los Angeles (UCLA), from which this paper has been derived. I would also like to thank Professor P. A. Griffiths for a very productive week at the Institute of Advanced Study that I spent with him in the spring of 1997, discussing some of these topics. Also, my thanks go to Professor V. S. Varadarajan for his many courses and seminars at UCLA. Notation. The notation that we use is fairly standard. For concepts on differential geometry, see, for example, [BCGGG]. On commutative algebra, see [G] or [BH]. On homological algebra, see, for example, [HS]. Basic notions of Hopf algebra theory can
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
309
be found in any reference text, for example, [Mo] or [Sw]. We used the terms lemma and theorem for our results (unless otherwise noted), while we used proposition and remark for results that are already known (for which sometimes we couldn’t find a reference) or that are straightforward applications of well-known facts. As there are many module and algebra structures involved, to avoid confusion we indicate the category of the operation. This is especially true of the tensor product. Often we use the notation M ⊗A N or M ⊗f N when the tensor product is done with respect to the ring A or with respect to the B-module structure induced by the ring morphism f : B → A, if M and N are A-modules. We also use this notation for the multiplication by ideals, as in I ·A M or I ·f M. We use a, b, cA to denote the submodule generated by the elements a, b, and c over the ring A. We explain the less standard notation and terminology in the first section. 1. Definitions and basic properties. The definition of k-jet of an immersed manifold f : N → M at a point p ∈ M can be found in [BCGGG, pp. 18–26]. We also define it here. Definition 1.1. (1) Two germs of immersed manifolds f, g : (Rn , 0) → (M, p) are said to have the same k-jet at 0 if, for some (and therefore any) choice of coordinates on M around p, the corresponding f˜, g˜ : (Rn , 0) → (Rdim(M) , 0) have equal k-jets (in the ordinary sense) at the origin. In this case, f ≡(k) g. (2) The space of k-jets of n-dimensional parametrized immersed submanifolds of k (R n , M), is the set of equivalence classes of immersed germs M at p, indicated by J0,p n f : (R , 0) → (M, p) with respect to ≡(k) . (3) Consider the group Aut(Rn , 0) of germs of diffeomorphisms of Rn leaving the origin fixed. There is a natural action of the group Aut(Rn , 0) on the vector space k (R n , M). This allows one to consider the orbit space of J k (R n , M) with respect J0,p 0,p k (R n , M)/ Aut(R n , 0). to this action, indicated by J0,p (k) (4) The space of k-jets of n-dimensional submanifolds of M, indicated by Gn (M), as a set is n k G(k) J0,p R , M / Aut Rn , 0 . n (M) = p∈M
Here we want to stress that we use nonparametrized k-jets. See [BCGGG, pp. 18–26] for the related concept of a jet bundle J k (N, M) between (k) two smooth manifolds of dimensions n and m, respectively. On Gn (M) there is a natural smooth structure. The following elementary proposition is included to identify (k) a choice of coordinate neighborhoods on Gn (M). Proposition 1.2. Let M = M n+s be an (n + s)-dimensional manifold. Then admits a smooth structure.
(k) Gn (M)
310
MICHELE GRASSI
Proof. Let p ∈ M be a point. Let f : Rn → M be a smooth map f (0) = p with injective differential at 0, and let f˜ : (Rn , 0) → (M, p) be its germ. Let (x i , zα ) = (x 1 , . . . , x n , z1 , . . . , zs ) be local coordinates in an open set of M containing p, such that f ∗ (dx 1 ∧ · · · ∧ dx n ) = 0 at p. Then the image of f˜ can be expressed as a graph of the form zα = zα (x 1 , . . . , x n ), α = 1, . . . , s. We now associate to the germ f˜ the set of numbers ∂zα ∂ k zα i α (p) . x (p), z (p), i (p), . . . , i ∂x ∂x 1 · · · ∂x ik α=1,...,s; i,i1 ,...,ik =1,...,s To complete the proof, one needs to show that with a different choice of coordinates near p, the numbers above vary smoothly in an open set. We omit this simple verification. (k)
Locally, we can take the following coordinates on Gn (M):
x i , zα , piα , . . . , piα1 ,...,ik
α=1,...,s; i,i1 ,...,ik =1,...,s
,
where the p α are symmetric in the lower indices and stand for the partial derivatives described in the previous proof. Definition 1.3. Let f : N → M be an n-dimensional immersed submanifold (i.e., N has dimension n and df is everywhere injective). Then there is a canonical lifting (k) f (k) : N → Gn (M), defined by associating to each point q ∈ N the k-jet of the germ of f (N) at p = f (q). (1)
(k)
Remark 1.4. Gn (M) ∼ = G(n, T(M)), where Gn (M) is the Grassmann bundle over the tangent bundle. Definition 1.5. The natural maps from the space of i-jets to the space of j -jets, (j ) (i) for i ≥ j , are indicated with πi,j : Gn (M) → Gn (M). A differential ideal on a manifold M is simply a homogeneous (two-sided) ideal inside the algebra of global differential forms, which is closed under exterior differentiation (see, for example, [BCGGG, p. 16]). The notion of an integral manifold is associated to a differential ideal Ᏽ. This is an immersion f : N → M such that (k) f ∗ (Ᏽ) = (0) (see, for example, [BCGGG, p. 16]). On Gn (M) there is a canonical (k) differential ideal, which we denote by Ᏽ . Because a differential ideal is a module over the smooth functions, its restrictions over open sets generate a (fine) sheaf. Therefore the ideal is determined by its local sections, and it is enough to give it over coordinate neighborhoods. We do this for Ᏽ(k) , using the neighborhoods identified (k) above for Gn (M). Definition 1.6. The contact ideal, indicated by Ᏽ(k) , is the differential ideal on generated locally by the forms (using Einstein’s summation convention)
(k) Gn (M)
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
311
α α α i θ = dz − pi dx ··· α θi1 ,...,ik−1 = dpiα1 ,...,ik−1 − piα1 ,...,ik−1 ,l dx l . We call I (k) the (algebraic) ideal generated by the forms above. Proposition 1.7. The contact ideal has all the liftings of n-dimensional immersed submanifolds of M as integral manifolds. Proof. It is enough to check this locally. Definition 1.8. We define the local characteristic cohomology of degree k over ᐁ as H ∗ ( ∗(ᐁ)/Ᏽ(k) , d). Definition 1.9. We call p = pᐁ : ∗ (ᐁ) → ∗ (ᐁ)/Ᏽ(k) the natural projection map. There are many connections of differential ideals with PDEs and with Lagrangians. Consult [BGH1], [BCGGG], and [BG1]. 2. A theorem on Tor. In Sections 2 and 3, we consider some (apparently) unrelated algebraic constructions and results. As usual, k[x] is the algebra of polynomials over the ring k in the variables x = x1 , . . . , xn . One can put a Hopf algebra structure on the k-algebra k[x] in the following way (see [Sw] for the definition of Hopf algebra). Definition 2.1. The comultiplication % : k[x] → k[x] ⊗k k[x] is defined as the k-algebra homomorphism that sends xi to xi ⊗ 1 + 1 ⊗ xi for all i = 1, . . . , n. The antipode S : k[x] → k[x] is defined as the k-algebra homomorphism that sends xi to −xi for all i = 1, . . . , n. All the other elements of the Hopf algebra structure are canonical. Following the standard practice, we indicate by M[a] the graded module M with the degrees shifted by the integer a (i.e., M[a]i := Mi+a ). Remark 2.2. We have (k[x][a]) ⊗k (k[x][b]) ∼ = k[x] ⊗k k[x][a + b] as graded k[x] ⊗k k[x]-modules. Lemma 2.3. (k[x]⊗k k[x])⊗k[x] k inherits a k-algebra structure from k[x]⊗k k[x]. Proof. By definition, k[x] ⊗k k[x] ⊗k[x] k = k[x] ⊗k k[x] /m ·% k[x] ⊗k k[x] = k[x] ⊗k k[x] /%(m) k[x] ⊗k k[x] , where m is the maximal (irrelevant) ideal of k[x]. This identification shows that (k[x] ⊗k k[x]) ⊗k[x] k is a quotient of k[x] ⊗k k[x] by the ideal generated by {%(xi ) |
312
MICHELE GRASSI
i = 1, . . . , n}. From this description, it is clear that (k[x] ⊗k k[x]) ⊗k[x] k inherits a k-algebra structure from k[x] ⊗k k[x]. Lemma 2.4. The maps ψl : k[x] −→ k[x] ⊗k k[x] ⊗k[x] k, ψr : k[x] −→ k[x] ⊗k k[x] ⊗k[x] k,
φl : k[x] ⊗k k[x] ⊗k[x] k −→ k[x], φr : k[x] ⊗k k[x] ⊗k[x] k −→ k[x],
acting as ψl (p(x)) = (p(x)⊗1)⊗1, φl ((p(x)⊗q(x))⊗1)=p(x)S(q(x)), ψr (p(x)) = (1 ⊗ p(x)) ⊗ 1, φr ((p(x) ⊗ q(x)) ⊗ 1) = S(p(x))q(x), induce isomorphisms of k-algebras, and ψl φl = φl ψl = ψr φr = φr ψr = Id. Proof. We prove this for ψl and φl ; the case of ψr and φr is just the same. First, let us show that φl is well defined. Indeed, if φ˜ l is the map Id ⊗S from k[x] ⊗k k[x] to k[x] (which is clearly well defined), we have that φ˜ l is a (surjective) morphism of k-algebras. Moreover, φ˜ l (%(xi )) = 0 ∀i, so it induces a map on (k[x]⊗k k[x])⊗k[x] k, which is φl . This is enough to show that φl is well defined and surjective. Also, ψl is well defined, as k[x] is a free k-algebra. It is also clear that φl ψl = Id. To show that ψl φl = Id, it is then enough to prove that ψl is surjective. As the maps are clearly homogeneous morphisms of graded k-algebras, to prove surjectivity it is enough to consider “monomials” of the form (m1 ⊗ m2 ) ⊗ 1, where m1 and m2 are monomials of k[x], and show that these are in the image of ψl . We do this by induction on d = deg(m2 ). If d = 0, the statement is clear because the tensor products are over k. Suppose now d > 0, and the statement is known for d − 1. Then suppose, without loss of generality, that x1 divides m2 , m2 = x1 n. We have (m1 ⊗m2 )⊗1 = (%(x1 )m1 ⊗n)⊗1−(x1 m1 ⊗n)⊗1 = −(x1 m1 ⊗n)⊗1. By induction, this last element is in the image of ψl . Definition 2.5. Given a module M over the k-algebra k[x], we define a new k[x] = M ⊗S k[x], with the k[x]module structure on the k-module M as follows: M module structure induced from the multiplication on the right factor. Remark 2.6. is an exact homogeneous covariant additive functor. Moreover, = Id ∼ ∼ and k[x][a] =k[x] k[x][a]. If i is an ideal in k[x], stable under S, then k[x]/i =k[x] τ k[x]/i. In particular, this holds for i = m . Lemma 2.7. Let M and N be finitely generated graded k[x]-modules. Then % induces a k[x]-module structure on the k[x] ⊗k k[x]-module M ⊗k N. With this structure, (M ⊗k N) ⊗k[x] k ∼ =k M ⊗k[x] N. Moreover, (M ⊗k N) ⊗k[x] k is naturally a (k[x] ⊗k k[x]) ⊗k[x] k-module. Using ψl and ψr , it obtains two k[x]-module structures.
With the With the structure induced by ψl , (M ⊗k N) ⊗k[x] k ∼ =k[x] M ⊗k[x] N.
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
313
⊗k[x] N. structure induced by ψr , (M ⊗k N) ⊗k[x] k ∼ =k[x] M Proof. We prove only that with the structure induced by ψl ,
(M ⊗k N) ⊗k[x] k ∼ =k[x] M ⊗k[x] N. All the other statements follow easily from this. We define a k-linear map , from
by extending the natural quotient map , ˜ from M ⊗k (M ⊗k N) ⊗k[x] k to M ⊗k[x] N, ∼
N =k M ⊗k N to M ⊗k[x] N: / M ⊗k[x] N
M ⊗k N
(M ⊗k N) ⊗k[x] k _ _/ M ⊗k[x] N.
We have to check that the map is well defined, bijective, and a morphism of k[x]modules. We omit the simple verifications that the map is well defined and injective. To prove that it is surjective, we proceed exactly as before to define ,−1 : M ⊗k[x]
→ (M ⊗k N) ⊗k[x] k by inducing a map from the natural quotient map ,˜−1 from N
to (M ⊗k N) ⊗k[x] k: M ⊗k N ∼ =k M ⊗k N M ⊗k N
/ (M ⊗k N) ⊗k[x] k
_ _/ (M ⊗k N) ⊗k[x] k. M ⊗k[x] N
We have by inspection that ,,−1 = ,−1 , = Id, and therefore , is also surjective. ˜ clearly is, It remains to be checked that , is a map of k[x]-modules. However, , with respect to multiplication on the left factor on M ⊗k N. Through the quotient map to (M ⊗k N) ⊗k[x] k, this induces the structure induced by ψl on (M ⊗k N) ⊗k[x] k. Therefore , is also a map of k[x]-modules, as desired. Remark 2.8. We have k[x] ⊗k k[x] ∼ =k[x]
∞
k[x][−i]
n+i−1 i
i=0
where k[x][i] is the k[x]-module k[x] with degrees shifted by i. In particular, k[x]⊗k k[x] is a free k[x]-module. The following theorem is fundamental for the proof of the local vanishing theorem. We have not encountered any similar result in the literature. Theorem 2.9. Let M and N be finitely generated graded k[x]-modules. Then % induces a structure of k[x]-module on the k[x] ⊗k k[x]-module M ⊗k N. With this k[x]
j. structure, and for all i, j , Tor ik[x] (M ⊗k N, k)j ∼ =k Tor i (M, N)
314
MICHELE GRASSI
Proof. Let (C∗ , d1 ) be a minimal resolution of M and (D∗ , d2 ) of N. We then consider the double complex C∗ ⊗k D∗ : ↓ →
C i ⊗ k Dj
··· → ···
↓ →
C i ⊗ k D0
↓
···
↓
···
···
···
↓
···
↓
→ C 0 ⊗ k Dj
→ ···
→ C0 ⊗k D0 .
A standard spectral sequence argument shows that the total complex of this double complex is a free-graded resolution (not minimal) of M ⊗k N over k[x] ⊗k k[x]. Indeed, the E1 term of one of the two spectral sequences looks like ··· ···
(0)
··· ···
··· ···
M ⊗ k Dj
···
··· ···
··· →
···
(0) ···
→
M ⊗ k D0 ,
and the E2 term has all zeroes. Note that the tensor products are over k, not over k[x]. From Remark 2.8, the resolution is free-graded as it is over k[x]. It follows that the total complex associated to the double complex (C∗ ⊗k D∗ )⊗k[x] k computes the Tor group in the left-hand side of the equality that we want to show. Using the first standard filtration of this double complex and computing the spectral sequence p,q associated to it, we get the expression for the E1 term: E1 = Hq ((Cp ⊗k D∗ ) ⊗k[x] k, (1⊗d2 )⊗1). By using the map ψr to induce k[x]-structures, the complex ((Cp ⊗k D∗ ) ⊗k[x] k, (1 ⊗ d2 ) ⊗ 1) is easily seen by Lemma 2.7 to be isomorphic over k[x] to p ⊗k[x] D∗ , 1 ⊗ d2 ). Hence, the E0 term may be taken to look like (C ↓ p ⊗k[x] Dq → C
→
··· → ···
↓ →
p ⊗k[x] D0 C
↓
···
↓
···
···
···
↓
···
↓
0 ⊗k[x] Dq C
→ ···
0 ⊗k[x] D0 . → C p,q
p is still free over k[x] from Remark 2.6, that E As a consequence, we have, as C 1
=
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
315
p,0 p ⊗k[x] N. Now, from Lemma 2.7 C p ⊗k[x] N ∼
0, if q > 0, and E1 ∼ =C =k Cp ⊗k[x] N. p,0 ∼
using ψl . By Moreover, still using Lemma 2.7, we see that E1 =k[x] Cp ⊗k[x] N, inspection, the differential of E1 is d1 ⊗ 1, under the above isomorphism. From this it follows that p,q
E2
∼ =k 0
if q > 0,
p,0
E2
∼ =k Tor pk[x] M, Nˆ .
Therefore the spectral sequence degenerates at the second term, and we obtain the desired result. 3. A vanishing theorem for Koszul homology. If M is a k[x]-module, the Hopf algebra structure introduced before puts a k[x]-module structure on M ⊗k M, as we have seen, by imposing p(x)(m ⊗ n) = %(p(x))(m ⊗ n). We want to extend this method to build some other module structures. Definition 3.1. % induces a map of k-algebras %p : k[x] → p k (k[x]) by the rule %q+1 (xi ) = xi ⊗ 1 ⊗ · · · ⊗ 1 + 1 ⊗ %q (xi ) (using the isomorphism p k (k[x]) = k[x] ⊗k p−1 k (k[x])). If M is a k[x]-module, there is a k[x]-module structure on p−1 p k (M) = M ⊗k k (M) induced by %p through the rule p(x)(m1 ⊗ · · · ⊗ mp ) = %p (p(x))(m1 ⊗ · · · ⊗ mp ). p Remark 3.2. We have xi (m1 ⊗ · · · ⊗ mp ) = j =1 m1 ⊗ · · · (xi mj ) · · · ⊗ mp . Now, note that there is an action of the symmetric group p over p k (M), given as follows: σ (v1 ⊗ · · · ⊗ vp ) = 2(σ )vσ (1) ⊗ · · · ⊗ vσ (p) , where 2(σ ) is the sign of σ . Relative to this action, there is an operator π∧ = (1/p!) σ ∈p σ . Here we are using the fact that p! is invertible in k. p Lemma 3.3. π∧ is a k[x]-endomorphism of k (M). Proof. We see that xi π∧ (v1 ⊗ · · · ⊗ vp ) 1 2(σ )vσ (1) ⊗ · · · ⊗ vσ (p) = xi p! = =
1 p! 1 p!
σ ∈ p
σ ∈ j =1 p σ ∈ j =1
2(σ )vσ (1) ⊗ · · · ⊗ xi vσ (j ) ⊗ · · · ⊗ vσ (p) j j j j 2(σ ) 1 − δσ (1) + δσ (1) xi vσ (1) ⊗ · · · ⊗ 1 − δσ (p) + δσ (p) xi vσ (p)
p 1 j j j j = 2(σ ) 1 − δσ (1) + δσ (1) xi vσ (1) ⊗ · · · ⊗ 1 − δσ (p) + δσ (p) xi vσ (p) p! j =1
σ ∈
316
MICHELE GRASSI
=
p
π∧ v1 ⊗ · · · ⊗ xi vj ⊗ · · · ⊗ vp = π∧ xi v1 ⊗ · · · ⊗ vp .
j =1
p As usual, this operatorsatisfies π∧ ◦ π∧ = π∧ , and its image inside k (M) is isomorphic (over k) to p k (M). Moreover, the inclusion p k (M) → p k (M) provides a splitting of the exact sequence of k[x]-modules 0 −→ Ker(π∧ ) −→
p
p π∧ k (M) −−→ k (M) −→ 0.
We have therefore proved the following lemma. p p Lemma 3.4. k (M) is a split-graded sub-k[x]-module of k (M). p Remark 3.5. The module structure on k (M) is made so that
xi m1 ∧ · · · ∧ mp =
p
m1 ∧ · · · x i mj · · · ∧ m p ,
j =1
where m1 ∧ · · · ∧ mp = π∧ (m1 ⊗ · · · ⊗ mp ). Proof. This is an immediate consequence of Remark 3.2 and Lemma 3.3. Definition 3.6. If M is a k[x]-module, with Kt (x, M)j , we indicate the j th graded part of the homology of the complex obtained tensoring the Koszul complex over x = (x1 , . . . , xn ) with M over k[x]. This is a standard definition. These objects are called Koszul homology groups of M (over the polynomial ring k[x]). Note that the Koszul complex has all the maps of degree 1, so it is not, properly speaking, in the category of graded k[x]-modules. k[x] Proposition 3.7. If M is a k[x]-module, then Kt (x, M)j ∼ =k[x] Tor t (M, k)j +t .
Proof. This is just an application of the fact that the Koszul complex is a free resolution of k over k[x]. The above proposition is standard. For a general introduction to Koszul homology, see [BH, p. 39] or [G]. Corollary 3.8. Kt (x, −)j is an additive functor from the category of k[x]modules to the category of k-modules. The main result of this section is the following theorem. Theorem 3.9. Let W be a vector space of dimension s over k, and let m be the irrelevant maximal ideal in k[x]. Then Kt (x, p k (W ⊗k k[x]/mτ ))j = 0 whenever t ≥ p and j < p(τ − 1). The proof is at the end of this section.
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
317
Remark 3.10. The previous result can be easily improved as follows: In the notation of the theorem above, Kt (x, p k (W ⊗k k[x]/mτ ))j = 0 whenever t ≥ p and j = p(τ − 1), or t < p and j = t (τ − 1). Lemma 3.11. For a k[x]-module M, let us call mt (M) the minimum value j for which Kt (x, M)j = 0. Then, for t ≥ 1, mt M ⊗k k[x]/mτ ≥ mt−1 (M) + (τ − 1). Proof. From Theorem 2.9, Remark 2.6, and Proposition 3.7, we have that Kt (x, k[x] τ )j +t . Given a minimal resolution (C∗ , d) of M ⊗k k[x]/mτ )j ∼ =k Tor t (M, k[x]/m M, clearly Ct is generated in a degree greater than or equal to mt (M) + t. For a homogeneous element x in Ct to be sent into mρ Ct−1 by d, it must be that deg(x) ≥ mt−1 (M) + t − 1 + ρ or d(x) = 0. Therefore deg(x) < mt−1 (M) + t − 1 + ρ and d(x) ∈ mρ Ct−1 imply that d(x) = 0, and hence x = d(x ) for some x ∈ C t+1 . So, to get a nonzero homology element in the middle term of the complex Ct+1 ⊗k[x] k[x]/mτ −→ Ct ⊗k[x] k[x]/mτ −→ Ct−1 ⊗k[x] k[x]/mτ , we must go to degree at least mt−1 (M) + τ + t − 1. This, together with the above identification of the Koszul homology group as a Tor group, proves that mt (M ⊗k k[x]/mτ ) + t ≥ mt−1 (M) + τ + t − 1 as desired. Lemma 3.12. Kt (x, p k (W ⊗k[x]/mτ ))j = 0 whenever t ≥ p and j < p(τ −1). Proof. We prove this lemma by induction on p. If p = 1, we have to prove that Kt (x, k[x]/mτ )j = 0 whenever t ≥ 1 and j < τ − 1. This is clear from Lemma 3.11 applied to M = k. Suppose now that we know the statement for p − 1 ≥ 1. We have p p−1 τ ∼ τ τ k W ⊗ k[x]/m =k[x] k W ⊗ k[x]/m ⊗k W ⊗k k[x]/m p−1 τ τ ∼ =k[x] k W ⊗ k[x]/m ⊗k k[x]/m ⊗k W. From Lemma 3.11, we have that p−1 p−1 τ τ τ mt k W ⊗ k[x]/m ⊗k k[x]/m ≥ mt−1 k W ⊗ k[x]/m + (τ − 1). By induction we conclude therefore that p−1 τ τ mt k W ⊗ k[x]/m ⊗k k[x]/m ≥ p(τ − 1) for t ≥ p.
318
MICHELE GRASSI
This concludes the proof, because clearly mt
p−1 k
τ
W ⊗ k[x]/m ⊗k k[x]/mτ ⊗k W p−1 τ τ = mt k W ⊗ k[x]/m ⊗k k[x]/m .
Proof of Theorem 3.9. The proof of Theorem 3.9 follows immediately if we observe that, from Corollary 3.8 and Lemma 3.4 applied to the case M = W ⊗k[x]/mτ , Ktx,
p−1 τ W ⊗ k[x]/mτ ∼ =k[x] Ktx, Ker(π8 ) ⊕k[x] k W ⊗ k[x]/m
p−1 k
j
j
∼ = Kt x, Ker(π8 ) j ⊕k[x] Ktx,
p−1 k
W ⊗ k[x]/mτ .
j
We apply Lemma 3.4 to the case M = W ⊗ k[x]/mτ . 4. The algebraic reduction of the vanishing problem. We now reduce the problem of local vanishing of characteristic cohomology to an algebraic problem. This is done in two steps, obtaining a first algebraic reduction and then a second algebraic reduction. We use the notation of Proposition 1.6. In particular, we use Einstein’s summation convention for repeated indices. Proposition 4.1. We have dθiα1 ,...,it = −θiα1 ,...,it ,l ∧ dx l for all 0 ≤ t < k − 1 (if t = 0, then by dθiα1 ,...,it we mean dθ α ) and dθiα1 ,...,ik−1 = −dpiα1 ,...,ik−1 ,l ∧ dx l . Proof. If k < 1, then there is nothing to prove. Suppose therefore k ≥ 1. If t < k − 1, we have dθiα1 ,...,it = −dpiα1 ,...,it ,l ∧ dx l = −(θiα1 ,...,it ,l + piα1 ,...,it ,l,h dx h ) ∧ dx l = −θiα1 ,...,it ,l ∧ dx l − piα1 ,...,it ,l,h dx h ∧ dx l . However, as piα1 ,...,it ,l,h is symmetric in l, h and dx h ∧ dx l is antisymmetric in l, h, the second summand is zero, while the first one is as desired. For the case t = k − 1, we just have to observe that d 2 = 0 and that d is a derivation of the exterior algebra. Given an element in q (ᐁ), we can always write it in pa unique way as a sum of p0
α
k−1
k−1 “monomials” of the form m = θ α0 ∧ · · · ∧ θ α0 ∧ · · · ∧ θ k−1 1
v ∈ C ∞ (ᐁ) < dx i , dpiα1 ,...,ik > ∩ r (ᐁ).
i1
k−1 ,...,ik−1
∧ v with nonzero
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
319
Definition 4.2. For any element v ∈ q (ᐁ), we define ∂(v) ∈ q+1 (ᐁ) by ∂ is a derivation of ∗ (ᐁ), α ∂ θ α for all i1 , . . . , it , t ≤ k − 1, i1 ,...,it = d θi1 ,...,it ∞ ∂ C (ᐁ) = (0), α ∂ dpi1 ,...,ik = 0 for all i1 , . . . , ik . For any monomial m as before, we define also the following. Definition 4.3. If m = 0, pj (m) = :{different θiα1 ,...,ij in m}. We define p(m) = (p0 (m), . . . , pk−1 (m)) and |p(m)| = p0 (m) + · · · + pk−1 (m). Note that we do not put pk in p. Clearly, we have r = q − |p(m)|. Definition 4.4 (Characteristic weight). (1) Given p = (p0 , . . . , pk−1 ), we define w(p) = kp0 + (k − 1)p1 + · · · + pk−1 . (2) For any monomial m as before, different from 0, we define w(m) = w(p(m)). (3) If v ∈ ∗ (ᐁ), we define w(v) = min(w(m) | m appears in the expression of v). We say that v is w-homogeneous if all the monomials in its expression have the same w. We call w the characteristic weight. Lemma 4.5. For any v that is w-homogeneous, we have w(∂(v)) = w(v) − 1 if ∂(v) = 0, and w((d − ∂)(v)) > w(v) − 1 if (d − ∂)(v) = 0. Proof. For a typical element m as above, we have pk−1 p0 1 αk−1 d θ α0 ∧ · · · ∧ θ α0 ∧ · · · ∧ θ k−1 ∧ v k−1 =
i1
,...,ik−1
α
pk−1
k−1 (−1)i−1 θ α0 ∧ · · · ∧ d(θi ) ∧ · · · ∧ θ k−1 1
i1
i≤|p| |p| α01
+ (−1) θ
∧···∧θ
p
α0 0
k−1 ,...,ik−1
∧v
p
k−1 αk−1
∧ · · · ∧ θ k−1 i1
k−1 ,...,ik−1
∧ d(v).
We have indicated formally with θi the ith element in the wedge product above. It is clear that the first summation is ∂(m), while the second one is (d −∂)(m). In the first summation all summands have w decreased by 1, while in the last summand w stays the same or increases. This proves the statement. Definition 4.6. We have F w,q = {v ∈ q (ᐁ) | w(v) ≥ w}. Lemma 4.7. The complex F w+1,q−1 /F w+2,q−1 → F w,q /F w+1,q → F w−1,q+1 / is isomorphic to a split subcomplex of the following complex:
F w,q+1
q−1
U ⊗R C ∞ (ᐁ) −→
q
U ⊗R C ∞ (ᐁ) −→
q+1
U ⊗R C ∞ (ᐁ),
320
MICHELE GRASSI
where U = ωi , ηiα1 ,...,ik , ηiα1 ,...,ik−1 , . . . , ηα R and the differential is the derivation of the exterior algebra of U acting as ∂(ηiα1 ,...,iτ ) = −ηiα1 ,...,iτ ,l ∧ ωl , if τ ≤ k − 1, and as ∂(ηiα1 ,...,ik ) = 0 and ∂(ωi ) = 0. Proof. Let us define Ij to be the vector space span of the ηiα1 ,...,ij , and let J be the
span of the ωi , ηiα1 ,...,ik . The identification we are looking into is F w,q ∼ = F w+1,q
p0
q−|p|
pk−1
I 0 ⊗R · · · ⊗ R
Ik−1 ⊗R
J ⊗R C ∞ (ᐁ).
p:w(p)=w
The map providing this isomorphism is −→ ηiα1 ,...,ij θα i1 ,...,ij dpiα1 ,...,ik −→ ηiα1 ,...,ik i dx −→ ωi . Then we extend this by linearity to all the exterior algebras. Observe now that in the quotient complex F w+1,q−1 F w,q F w−1,q+1 −→ w+1,q −→ , w+2,q−1 F F F w,q+1 the induced differential comes only from ∂, as d − ∂ is mapped to zero. Moreover, under the above isomorphism, the ∂ of the quotient complex is sent to the ∂ of the “algebraic” complex q−1
∞
U ⊗R C (ᐁ) −→
q
∞
U ⊗R C (ᐁ) −→
q+1
U ⊗R C ∞ (ᐁ).
This just means that we have built a map of complexes. In the algebraic complex, we can define a weight w (see also Definition 4.8), and the image is just obtained by fixing w + q. From this it follows that the image of the above map is actually a split subcomplex. In view of the result of Lemma 4.7, it is interesting to isolate the algebraic part of the complex considered there. Definition 4.8 (First algebraic reduction). The first algebraic reduction is the complex · · · → q−1 U → q U → q+1 U → · · · , where U is a k-vector space U = ωi , ηiα1 ,...,ik , ηiα1 ,...,ik−1 , . . . , ηα , = ωi , and the differential is the derivation of the exterior algebra of U acting as ∂(ηiα1 ,...,iτ ) = −ηiα1 ,...,iτ ,l ∧ ωl if τ ≤ k − 1, and as ∂(ηiα1 ,...,ik ) = 0 and ∂(ωi ) = 0. Let us define Ij to be the vector space span of the ηiα1 ,...,ij , J be the span of the ωi , ηiα1 ,...,ik , and the span of just the ωi . We then
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
321
define a weight w that on m ∈ p0 I0 ⊗R · · · ⊗R pk−1 Ik−1 ⊗R q−|p| J takes the value w(m) = kp0 + · · · + pk−1 . We indicate the grade-w piece of the complex as · · · → ( q−1 U )w+1 → ( q U )w → ( q+1 U )w−1 → · · · . Theorem 4.9. H q (Ᏽ∗ (ᐁ), d) = 0 for 0 ≤ q ≤ n for any open set ᐁ small enough and contractible, provided that the complex q q−1 q+1 · · · −→ U U U −→ −→ −→ · · · w+1
w
w−1
(the grade-w piece of the first algebraic reduction) is exact at the indicated place when w > 0 and q ≤ n. Proof. We may as well assume that ᐁ is included in a standard chart for Gkn (M n+s ) and that it is smooth and topologically trivial. Suppose that v ∈ Ᏽq (ᐁ) = Ᏽ(ᐁ) ∩
q (ᐁ), with dv = 0, v = 0, and q ≤ n. Then modulo d Ᏽ, v is in the ideal generated by the θ ’s. Indeed, the only problem can arise from multiples of the dθ but not of the θ . However, any multiple of a dθ is equivalent modulo d Ᏽ to a multiple of θ, as can be verified easily. For our purposes (proving that v is a boundary), we can replace v by this new element, obtained by adding a boundary to it. We may assume, therefore, without loss of generality, that for some w with w > 0, v ∈ F w,q . Let w(v) be the smallest such v ∈ F w(v),q \F w(v)+1,q . From the exactness of the first algebraic reduction, we see that there is a v ∈ F w+1,q−1 with v −d(v ) =∈F w+1,q . Proceeding by induction, we get that there is a v1 such that v − d(v1 ) =∈ w F w,q = (0). This proves that v can be reduced to zero via boundaries, as desired. In the following, we give a further algebraic translation of the first algebraic rep duction. Consider first the k[x]-module W ⊗k k[x]. We grade ∧k (W ⊗k k[x]/mk+1 ) using the total degree, as usual. Referring to Definition 4.8, we give the following. Definition 4.10. We have( q U )tw = {v ∈ ( q U )w | t elements from appear}. Lemma 4.11. The two following complexes are isomorphic: q t U )w → ( q+1 U )t+1 (1) · · · → ( q−1 U )t−1 w+1 → ( w−1 → · · · ; (2) the graded piece of a Koszul complex: n−(t−1)
q−(t−1) · · · −→ W ⊗k k[x]/mk+1
⊗k ∧k
(q−t)k−(w+1)
n−t
q−t −→
⊗k ∧k W ⊗k k[x]/mk+1
n−(t+1)
−→
(q−t)k−w
q−(t+1)
⊗k ∧k W ⊗k k[x]/mk+1
(q−t)k−(w−1)
−→ · · · .
322
MICHELE GRASSI
Proof. Pick a basis {eα | α = 1, . . . , s} for W . Remember that we have already an ordered basis {ωi | i = 1, . . . , n} for . Pick an element of the form ωJ ⊗ ((eα1 ⊗ q−t k+1 m1 ) ∧ · · · ∧ (eα1 ⊗ mq−t )) ∈ n−t ⊗k (∧k (W ⊗k k[x]/m )), where the mi ’s are monomials of degree |mi | smaller than or equal to k, with i |mi | = pk −w, and J is an ordered multiindex of length n−t. Write mi = xEi , where the Ei are multiindices of length |mi |. This element is sent to B(ωJ ⊗ ((eα1 ⊗ m1 ) ∧ · · · ∧ (eα1 ⊗ mq−t ))) = α α1 ∧· · ·∧ηEq−t in the corresponding space of the first complex, where (−1)n−t ∗ωJ ∧ηE q−t 1 ∗ is the usual Hodge ∗ operator on the space ∗ . To define ∗, we use the given ordered basis for (as in the classical case one uses an orthonormal ordered basis). Then B is extended by k-linearity, and in this way B becomes an isomorphism over k. A direct computation shows that an element of degree j is sent to an element with w = (q −t)k −j , and therefore the degrees in the two complexes correspond correctly. We have only to check that B commutes with the differentials in the two complexes. By k-linearity, it is enough to prove this for elements of the kind used above. In the following, we indicate with C the contraction operator, and with (∂/∂ωi ) the dual to ωi with respect to the given bilinear form on . We call d the differential in the second complex. We have ∂ B ωJ ⊗ eα1 ⊗ m1 ∧ · · · ∧ eα1 ⊗ mq−t
α1 αq−t ∧ · · · ∧ η = ∂ (−1)n−t ∗ ωJ ∧ ηE Eq−t 1 = (−1)
n−t
q−t αh α1 α (−1)h−1 ∗ ωJ ∧ ηE ∧ · · · ∧ ∂ ηE ∧ · · · ∧ ηEq−t q−t h 1 h=1
= (−1)n−t
q−t n αh J α1 α (−1)h ∧ ωi ∧ · · · ∧ ηEq−t ∗ ω ∧ ηE1 ∧ · · · ∧ ηE q−t h ∪{i} h=1
=
i=1
q−t n h=1 i=1
α αh α1 (−1)n−t ∗ ωJ ∧ ωi ∧ ηE ∧ · · · ∧ ηE ∧ · · · ∧ ηEq−t q−t h ∪{i} 1
q−t n α J αq−t αh n−t i 1 = (−1) ηE1 ∧ · · · ∧ ηEh ∪{i} ∧ · · · ∧ ηEq−t ∗ω ∧ω ∧ i=1
=
n i=1
h=1
q−t α J αq−t αh i 1 ω ∧ ∗ω ∧ ηE1 ∧ · · · ∧ ηEh ∪{i} ∧ · · · ∧ ηEq−t h=1
(see Proposition 4.12) q−t n ∂ αq−t αh α1 n−t+1 J (−1) ∗ Cω ∧ ηE1 ∧ · · · ∧ ηEh ∪{i} ∧ · · · ∧ ηEq−t = ∂ωi i=1
h=1
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
323
q−t ∂ J =B C ω ⊗ eα1 ⊗ xE1 ∧ · · · ∧ eαh ⊗ xEh ∪{i} ∧· · ·∧ eαq−t ⊗xEq−t i ∂ω h=1 q−t ∂ J E1 Eq−t =B C ω ⊗ xi eα1 ⊗ x ∧ · · · ∧ eαq−t ⊗ x ∂ωi h=1
= B d ωJ ⊗ eα1 ⊗ xE1 ∧ · · · ∧ eαq−t ⊗ xEq−t as desired. The following simple fact is standard from Hodge theory.
Proposition 4.12. If ωJ has degree n − t (with the notation as in the previous proof) and dim( ) = n, then ωi ∧ (∗ωJ ) = (−1)n−t+1 ∗ ((∂/∂ωi ) C ωJ ). Theorem 4.13 (Second algebraic reduction). We have H q (Ᏽ∗ (ᐁ), d) = 0 for 0 ≤ q≤ n for any open set ᐁ small enough and contractible, provided that Kn−t (x, q−t (W ⊗k k[x]/mk+1 ))j = 0 whenever n − t ≥ q − t and j < (q − t)k. Proof. From Theorem 4.9, it is enough to show that the complex q q−1 q+1 −→ −→ −→ · · · U U U · · · −→ w+1
w
w−1
(the grade w piece of the first algebraic reduction as described in Definition 4.8) is exact at the indicated place when w > 0 and q ≤ n, provided that Kn−t (x, q−t (W ⊗k k[x]/mk+1 ))j = 0 whenever n − t ≥ q − t and j < (q − t)k. From Lemma 4.11 this is true, because the above complex and the Koszul complex computing the Koszul group are isomorphic in a way that correctly matches degrees. 5. The local vanishing theorem Theorem 5.1 (Local vanishing of characteristic cohomology). For any open set q (k) ᐁ ⊆ Gn (M), we have that p∗ : HDR (ᐁ) → H q ( ∗ /Ᏽ∗ (ᐁ), d) is an isomorphism for 0 ≤ q < n and is injective for q = n. (k)
Proof. First of all, we note that, for ᐂ ⊆ Gn (M) small enough and contractible, = 0 for 0 ≤ q ≤ n. Indeed, from Theorem 4.13, this is equivalent to showing that Kn−t (x, q−t (W ⊗k k[x]/mk+1 ))j = 0 whenever n − t ≥ q − t and j < (q −t)k. This is the content of Theorem 3.9. Let us now consider the complex of sheaves 0 → ( Ᏽ(k) )0 → · · · → ( Ᏽ(k) )n → · · · . The previous vanishing result reveals that this complex of sheaves is exact up to level n. Moreover, all the sheaves involved are fine. Therefore the first spectral sequence of hypercohomology abuts to zero in a degree smaller than or equal to n, while the second one abuts to the cohomology of the global sections of the previous complex over ᐁ. This shows that H q (Ᏽ∗ (ᐁ), d) = 0 H q (Ᏽ∗ (ᐂ), d)
324
MICHELE GRASSI
for 0 ≤ q ≤ n. We conclude using the long exact sequence in cohomology associated to the exact sequence of sheaves 0 −→ Ᏽ∗ , d −→ ∗ (ᐁ), d −→ ∗ (ᐁ)/Ᏽ∗ , d −→ 0. In the remainder of this section we consider the infinite prolongation case, men(k) tioned in the introduction. We have seen that there are natural maps πk,h : Gn (M) → (h) Gn (M), k > h. These maps are smooth and form an inverse system in the category ∗ : ∗ (G(h) (M)) → ∗ (G(k) (M)), k > h, of smooth manifolds. We have also that πk,h n n is a well-defined map of differential graded algebras, which sends Ᏽ(h) into Ᏽ(k) . We therefore have an array of induced maps of complexes that in the end provide a natural map between characteristic cohomologies (h) (k) ∗ q πk,h : H q ∗ G(h)
∗ G(k) n (M) /Ᏽ , d −→ H n (M) /Ᏽ , d . ∗ form a direct system in the category of differential graded algeThe maps πk,h (∞) (k) bras. We want to consider the object Gn (M) = lim ← Gn (M). This limit does k not exist in the category of smooth finite-dimensional manifolds. (For example, (k) (k+1) (M) there is always a dimensional increase.) However, from Gn (M) to Gn it exists as an infinite-dimensional smooth manifold. We are really interested in (k) lim → H q ( ∗ (Gn (M))/Ᏽ(k) , d), which is well defined in the category of vector k
(∞)
spaces. This is the same if, instead of all of Gn (M), we consider an open set −1 ᐁ inside it, eventually stable under πk+1,k , so that we can define for any q and any such ᐁ the vector space lim → H q ( ∗ (ᐁ)/Ᏽ(k) , d). Note also the simple fact that k
lim → H q ( ∗ (ᐁ)/Ᏽ(k) , d) = H q (lim → ( ∗ (ᐁ)/Ᏽ(k) , d)). We have natural maps k
∗ π∞,k :H
q
k
∗ (ᐁ) /Ᏽ(k) , d −→ lim H q ∗ (ᐁ) /Ᏽ(h) , d , →
h
∗ : ∗ (ᐁ)/Ᏽ(k) → lim→ ∗ (ᐁ)/Ᏽ(h) . With this induced by the natural maps π∞,k h notation, we can now state and prove the following theorem.
Theorem 5.2 (Vinogradov). For any open set ᐁ of the form above, which from some level k onward is contractible, lim → H q ( ∗ (ᐁ)/Ᏽ(k) , d) = (0) for 0 < q < n. k
You can find a proof of this theorem in [BG1, pp. 555–556]. Here we sketch how one can use the machinery developed in this paper to get a fairly simple proof. Proof. Let 0 < q < n, and let φ ∈ lim → H q ( ∗ (ᐁ)/Ᏽ(k) , d), d(φ) = 0. We want k to prove that there exists a ψ with φ − d(ψ) ∈ Ᏽ∞ . Take a k ' 0 so that there are (k) a contractible open set ᐁk ⊂ Gn and a form φk ∈ q (ᐁk )/Ᏽ(k) that in the limit reduce to ᐁ and φ, respectively. These two objects exist from the definition of a limit ∗ and that of ᐁ. Moreover, from the injectivity of πk+1,k for all k, d(φk ) ∈ Ᏽ(k) . From
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
325
Theorem 5.1, there exists a ψk ∈ q−1 (ᐁk ) such that φk −d(ψk ) ∈ Ᏽk . However, this equation can be “passed to the limit,” obtaining exactly what we wanted. 6. The nth cohomology group. Here we discuss the cohomology group H n q q (k) ( ∗ /Ᏽ, d). Because we proved that, ∀q < n, H q ( ∗ /Ᏽ, d) ∼ = HDR (Gn ) ∼ = HDR (G (n, T(M))), this is the next interesting case to study. It turns out that this case is qualitatively different from the previous ones, because, for example, we don’t have local vanishing. The results in this section have been obtained by different classical methods—basically, integrating by parts. We will stick to the algebraic notation. (k)
Definition 6.1. A point transformation of Gn (M) is a smooth automorphism that sends the ideal Ᏽ(k) to itself. Remark 6.2. A point transformation is easily seen to send I (k) to itself. If a transformation sends I (k) to itself, it sends the differential ideal that I (k) generates, Ᏽ(k) , to itself. Therefore, point transformations could also be defined as the automorphisms that leave I (k) fixed. (k)
Definition 6.3. A Lagrangian density on Gn (M) is defined as a (local) section (k) (k) over Gn (M) of the bundle D (k) ⊂ n (Gn (M))/Ᏽ(k) , inside the pullback of n (M). Locally, any such section has a lifting of the form L(x i , zα , . . . , piα1 ,...,ik )d x, ¯ where by 1 n d x¯ we mean dx ∧ · · · ∧ dx . Lemma 6.4. For any Lagrangian density L(x i , zα , . . . , piα1 ,...,ik )d x¯ as before, we ¯ ∈ (Ᏽk ). have d(L(x i , zα , . . . , piα1 ,...,ik )d x) Proof. In the expression for d(L(x i , zα , . . . , piα1 ,...,ik )d x), ¯ we get a summation of terms, one for each partial derivative of L with respect to its arguments. There are basically three different cases. (1) The partial derivatives with respect to the x i ’s. As d x¯ involves all the x i ’s, we do not get any contribution from this. (2) The partial derivatives with respect to the zα or the pIα ’s for |I | < k: For these, remember the formulas α α α i θ = dz − pi dx , ··· α θi1 ,...,ik−1 = dpiα1 ,...,ik−1 − piα1 ,...,ik−1 ,l dx l . From these, it follows that α α ¯ θ ∧ d x¯ = dz ∧ d x, ··· α θi1 ,...,ik−1 ∧ d x¯ = dpiα1 ,...,ik−1 ∧ d x, ¯
326
MICHELE GRASSI
and therefore all the summands coming from these partials are actually in I (k) . (3) The partial derivatives with respect to the pIα ’s for |I | = k. From the formulas above, we know that dθiα1 ,...,ik−1 = −dpiα1 ,...,ik−1 ,l ∧ dx l . Therefore, indicating with ˆ i ∧· · ·∧dx n and for I = I ∪{i0 }, we get |I |+1 = |I | = k x¯[i] = (−1)i−1 dx 1 ∧· · ·∧ dx and dpIα ∧ d x¯ = ±dθIα ∧ d x¯[i0 ] ∈ Ᏽ(k) .
With reference to the nth characteristic cohomology group, some of the most interesting problems arise from studying the following. Conjecture/Question [Gri]. Is it true that any class in H n+1 (Ᏽ, d), which comes from H n ( ∗ /Ᏽ, d) under the natural coboundary map, has a natural point-equivariant representative? At this time this conjecture remains open. Theorem 6.5. There is a unique map of presheaves n+1 PC : H n+1 (Ᏽ, d) −→ Z I (1) ∧ nM + I ∧ I , which, once composed with the natural map n+1 Z I (1) ∧ nM + I ∧ I −→ H n+1
Ᏽ ,d , Ᏽ2
induces the natural map H n+1 (Ᏽ, d) → H n+1 (Ᏽ/Ᏽ2 , d) at the level of sheaves. At the level of global sections, PC : H n+1 (Ᏽ, d) → (I /I 2 ) is Aut(M)-equivariant. Lemma 6.6. Given a local Lagrangian density Ld x, ¯ there exists 8 ∈ (I )n such (1) n+1 that d(Ld x¯ − 8) ∈ (I ∧ d x¯ + I ∧ I ) . Proof. In terms of the filtration F ω,n+1 , we can modify Ld x¯ by an element 8 ∈ (I )n so that d(Ld x) ¯ lands inside F k,n+1 . For this, it is clearly enough to show that the graded piece · · · −→
n
t−1 U
n+1 t n+2 t+1 U U −→ −→ −→ · · ·
ω+1
ω
ω−1
of the first algebraic approximation is acyclic at the indicated place whenever ω < k. From Lemma 4.13, this is equivalent to showing that for all t n−t+1
k+1 Kn−t x, =0 W ⊗k k[x]/m j
whenever j > k. This is in Remark 3.10. Lemma 6.7. We have I (1) ∧ d x¯ + I ∧ I = I (1) ∧ nM + I ∧ I .
LOCAL VANISHING OF CHARACTERISTIC COHOMOLOGY
327
Proof. The inclusion I (1) ∧ d x¯ + I ∧ I ⊂ I (1) ∧ nM + I ∧ I is clear. For the other direction, note that dx i , dzα C ∞ (G(k) (M)) = dx i , θ α C ∞ (G(k) (M)) . n
n
Lemma 6.8. If 81 and 82 satisfy the properties of Lemma 6.6, then d(81 −82 ) ∈ I ∧I. Proof. This is clear because d(I ) ∩ (I (1) ∧ d x¯ + I ∧ I ) ⊂ I ∧ I . Proof of Theorem 6.5. From Lemmas 6.6, 6.7, and 6.8, we see that there is a well-defined map PC : H n+1 (Ᏽ, d) → Z(I (1) ∧ nM + I ∧ I )n+1 . Moreover, from its definition in Lemma 6.6, we see that, once composed with the natural map (Z(I (1) ∧
nM + I ∧ I ))n+1 → H n+1 (Ᏽ/Ᏽ2 , d), PC induces the natural map H n+1 (Ᏽ, d) → H n+1 (Ᏽ/Ᏽ2 , d) at the level of sheaves. The equivariance with respect to Aut(M) is an immediate consequence of the local uniqueness, which gives PC canonically on every small open set. References [BT] [BH] [BCGGG]
[BG1] [BG2] [BGH1]
[BGH2] [BGH3] [BGH4] [C] [Gr] [G] [Gri] [HS] [Mo]
R. Bott and L. W. Tu, Differential Forms in Algebraic Topology, Grad. Texts in Math. 82, Springer-Verlag, New York, 1982. W. Bruns and J. Herzog, Cohen-Macaulay Rings, Cambridge Stud. Adv. Math. 39, Cambridge Univ. Press, Cambridge, 1993. R. L. Bryant, S. S. Chern, R. B. Gardner, H. L. Goldschmidt, and P. A. Griffiths, Exterior Differential Systems, Math. Sci. Res. Inst. Publ. 18, Springer-Verlag, New York, 1991. R. L. Bryant and P. A. Griffiths, Characteristic cohomology of differential systems, I: General theory, J. Amer. Math. Soc. 8 (1995), 507–596. , Characteristic cohomology of differential systems, II: Conservation laws for a class of parabolic equations, Duke Math. J. 78 (1995), 531–676. R. L. Bryant, P. A. Griffiths, and L. Hsu, “Toward a geometry of differential equations” in Geometry, Topology, and Physics for Raoul Bott, Conf. Proc. Lecture Notes Geom. Topology 4, International Press, Cambridge, 1995, 1–76. , Hyperbolic exterior differential systems and their conservation laws, I, Selecta Math. (N.S.) 1 (1995), 21–112. , Hyperbolic exterior differential systems and their conservation laws, II, Selecta Math. (N.S.) 1 (1995), 265–323. , Poincaré-Cartan forms and the geometry of first order functionals, I, Institute for Advanced Study preprint, 1997. E. Cartan, Les systèmes différentiels extérieurs et leur applications géométriques, Actualiatés Sci. Indust. 994, Hermann, Paris, 1945. M. Grassi, Characteristic cohomology of smooth manifolds, Ph.D. thesis, UCLA, 1997. M. L. Green, “Koszul cohomology and geometry” in Lectures on Riemann Surfaces (Trieste, 1987), World Scientific Press, Teaneck, N.J., 1989, 177–200. P. A. Griffiths, personal communication, January 23, 1997. P. J. Hilton and U. Stammbach, A Course in Homological Algebra, Grad. Texts in Math. 4, Springer-Verlag, New York, 1971. S. Montgomery, Hopf Algebras and Their Actions on Rings, CBMS Reg. Conf. Ser. Math. 82, Amer. Math. Soc., Providence, 1993.
328 [Sw]
MICHELE GRASSI M. E. Sweedler, Hopf Algebras, Math. Lecture Note Ser., Benjamin, New York, 1969.
Scuola Normale Superiore, P.zza dei Cavalieri 7, 56100 Pisa, Italy;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
SYMPLECTIC MODULAR SYMBOLS PAUL E. GUNNELLS
1. Introduction 1.1. Let G be a semisimple algebraic group defined over Q of Q-rank , and let X be the associated symmetric space. Let ⊂ G(Q) be a torsion-free arithmetic subgroup. Then H ∗ (; Z) = H ∗ (\X; Z), and this cohomology vanishes for ∗ > N, where N = dim(X) − , the cohomological dimension of . The theory of modular symbols as formulated by Ash [2] constructs an explicit spanning set for H N (; Z) as follows. Let Ꮾ be the Tits building associated to G [17]. By the Solomon-Tits theorem, Ꮾ has the homotopy type of a wedge of ( − 1)spheres, and thus H˜ ∗ (Ꮾ; Z) is nonzero only in dimension −1. Using the Borel-Serre compactification of the locally symmetric space \X, we may construct a map (1)
: H−1 (Ꮾ; Z) −→ H N (; Z)
that is surjective (cf. §2). Because the left-hand side of (1) is generated by fundamental classes of apartments of Ꮾ, this provides a geometric spanning set for H N (). These cohomology classes (or rather, their duals in homology) are called modular symbols. 1.2. The modular symbols provide a spanning set for H N (; Z), but they do not provide a finite spanning set, a distinction that is important for applications. However, suppose K/Q is a number field with euclidean ring of integers ᏻ, and let G(Q) = SLn (K) and ⊂ SLn (ᏻ). Then in [6], Ash and Rudolph determine an explicit finite spanning set—the unimodular symbols—and present an algorithm to write a modular symbol as a sum of unimodular symbols (cf. §2.9). This algorithm, in conjunction with certain explicit cell complexes, can be used to compute the action of the Hecke operators on H N (). In turn, through work of Ash, Pinch, and Taylor [5], Ash and McConnell [4], and van Geemen and Top [18], corroborative evidence has been produced for certain aspects of the “Langlands philosophy.” In particular, in the case of ⊂ SL3 (Z), many examples of representations of the absolute Galois group ¯ Gal(Q/Q) have been found that appear to be associated to cohomology classes of . 1.3. In this paper we solve the finiteness problem for the symplectic group: G(Q) = Sp2n (K) and of finite index in Sp2n (ᏻ), where ᏻ is euclidean. We characterize a finite spanning set of H N (; Z) and present an algorithm (Theorem 4.11) Received 7 May 1998. Revision received 28 July 1999. 1991 Mathematics Subject Classification. Primary 11F75. Author’s work partially supported by National Science Foundation grant number DMS-9627870. 329
330
PAUL E. GUNNELLS
that generalizes the modular symbol algorithm of [6]. To do this, we prove a relation in the homology of the symplectic building (Theorem 3.16; see Examples 3.9 and 3.10). 1.4. We conclude this introduction by discussing the arithmetic nature of H N () in the symplectic case and by indicating possible applications of Theorem 4.11. First, we consider the complex cohomology. It is known that for n > 1, the groups H N (; C) do not contain cuspidal classes (that is, cohomology classes corresponding to cuspidal automorphic forms). Moreover, work of Schwermer [13], [14], [15] shows the following: • For n > 1, there are Eisenstein cohomology classes in the top degree. These classes are constructed using Eisenstein series and characters attached to the maximal split torus of the minimal parabolic subgroup of G. For n > 3, these are all of the Eisenstein classes appearing in the top degree. • Furthermore, for Sp4 , there are additional classes constructed using Eisenstein series and cuspidal classes for SL2 . Hence, for complex coefficients these classes are either easy to understand arithmetically or are better studied on a lower rank group using the classical theory of modular symbols. Nevertheless, there is one possibility that might be of interest. In the case of Sp6 , one cannot exclude the possibility that there is an Eisenstein cohomology class in the top degree associated to a cuspidal class for Sp4 whose infinity type is not a discrete series representation. It might be interesting to show the existence of this class and to study its arithmetic nature using the results of this paper. (I am grateful to the referee for this suggestion.) 1.5. Next, let p be a prime and let Fp be the corresponding finite field. Consider the mod p cohomology H N (; Fp ) or, more generally, H N (; M), where M is an Fp -module with -action. In this setting, the situation is much less clear. For example, there may be classes that lift to torsion classes in integral cohomology and thus are not associated to automorphic forms in any obvious manner. For discussion and examples of this, see [1] for ⊂ GL3 (Z) and [3], [7] for GLn . One would like to know if such classes arise in the symplectic case and, if so, if the corresponding Hecke eigenclasses are attached to Galois representations. The algorithm in Theorem 4.11, in conjunction with the cell complex described in [12], provides the means to explore this question for H 4 (), where ⊂ Sp4 (Z). 1.6. Finally, in another paper [10], we describe an algorithm to compute the Hecke action on H 5 (), where is a subgroup of SL4 (Z). This cohomology group, whose degree is one less than the cohomological dimension, is within the cuspidal range. One would like to have a symplectic version of this algorithm that could compute the Hecke action on cuspidal classes in H 3 of subgroups of Sp4 (Z) (i.e., on Siegel modular forms of weight 3). The algorithm described in this paper is an essential first step towards solving this problem.
SYMPLECTIC MODULAR SYMBOLS
331
1.7. Acknowledgments and related work. This paper relies heavily on the results of [2] and [6]. We thank Avner Ash for much advice and encouragement. In [12] the authors describe a deformation retract W of the symmetric space Sp4 (R)/ U(2), which may be used to compute H ∗ (; Z), where is a finite index subgroup of Sp4 (Z). The combinatorial data (from [11]) that is used to describe the cell decomposition of W inspired the results in Example 3.9 and led to Theorem 3.16. We thank Bob MacPherson and Mark McConnell for many conversations. Finally, we thank Mark McConnell, Richard Scott, and the referee, who made helpful comments that improved this article. 2. Minimal modular symbols. In this section we review results on minimal modular symbols. These are due to Ash and Rudolph [6] for G = SLn and to Ash [2] for any semisimple Q-group G. Our exposition closely follows these sources. For general results about buildings, the reader may consult Tits [17]. 2.1. Let G, X, and be as specified in the introduction, and let e ∈ X be the distinguished basepoint. Let T be a maximal Q-split torus of G stable under the Cartan involution corresponding to e, and let A = T (R)0 . Since is the Q-rank of G, we have that A ∼ = (R>0 ) . Let X¯ be the partial compactification of X constructed by Borel and Serre [8]. Then the closure Z of Ae in X¯ is homeomorphic to a closed ball of dimension , and ∂Z ¯ Let [I ] ∈ H−1 (∂ X) ¯ be the fundamental class of ∂Z; if m ∈ G(Q), let lies in ∂ X. [m] be the fundamental class of ∂(mZ). Let Ꮾ be the Tits building associated to G. According to [8, §8.4.3], there is a homotopy equivalence h : Ꮾ → ∂ X¯ that takes a distinguished apartment A0 homeomorphically onto ∂Z. This map is compatible with the natural G(Q)-action on Ꮾ ¯ In particular, if m ∈ G(Q), then h(mA0 ) = ∂(mZ). Since G(Q) acts transiand ∂ X. tively on apartments, and H−1 (Ꮾ; Z) is generated by the fundamental classes of the apartments, we have shown the following lemma. ¯ Z). 2.2. Lemma [2]. The classes [m] generate H−1 (∂ X; 2.3. Now let π : X¯ → \X¯ be the projection. Using π and the long exact se¯ ∂ X), ¯ we obtain quence of the pair (X,
(2)
∼ π∗ ¯ ∂ X¯ −− ¯ ∂ \X¯ → H \X, H−1 ∂ X¯ −→ H X, ∼ D −→ H N \X¯ −→ H N (\X).
¯ the last isomorphism Here, the first isomorphism follows from the contractibility of X, ¯ and the map D is Lefis induced by the canonical homotopy equivalence X → X, schetz duality. Since is assumed torsion-free, D is an isomorphism with integral coefficients, and H N (\X; Z) can be identified with H N (; Z). Let [m]∗ ∈ H N (; Z) be the image of [m] under the sequence (2). One of the main results of [2] is that π∗
332
PAUL E. GUNNELLS
is surjective, which with Lemma 2.2 implies the following theorem. 2.4. Theorem [2]. As m varies over G(Q), the classes [m]∗ generate H N (; Z). 2.5. Let K/Q be a number field with ring of integers ᏻ, and let G be the algebraic Q-group such that G(Q) = SLn (K). Here = n−1. We want to describe the classes [m] using the combinatorics of Ꮾ. Let V = K n . We assume that the basis {e1 , . . . , en } ⊂ V are eigenvectors for the torus T from §2.1. Then Ꮾ is a simplicial complex with a vertex for every proper nonzero subspace F of V . A set {F1 , . . . , Fk } of vertices of Ꮾ spans a (k −1)-simplex in Ꮾ if and only if the corresponding subspaces can be arranged in a proper flag: 0 ⊂ F 1 ⊂ · · · ⊂ Fk ⊂ V . The action of SLn (K) on V induces an action on Ꮾ and on H∗ (Ꮾ; Z). To construct the classes [m], we use an auxiliary simplicial complex. Let [[n]] be the set {1, . . . , n}. Let ∂!n−1 be the barycentric subdivision of the boundary complex of the (n − 1)-simplex. In other words, ∂!n−1 is a simplicial complex with vertices corresponding to the proper nonempty subsets I of [[n]], and a collection of subsets I1 , . . . , Ik corresponds to a (k − 1)-simplex in ∂!n−1 if and only if they can be arranged in a proper flag: I1 ⊂ · · · ⊂ Ik . We may orient ∂!n−1 using the standard ordering on [[n]]. Given n points in V {0}, we may construct a class in Hn−2 (Ꮾ; Z) (following [6]). Given v1 , . . . , vn ∈ V {0}, we define a simplicial map φ : ∂!n−1 −→ Ꮾ by sending the vertex I ⊂ [[n]] to the vertex of Ꮾ corresponding to the subspace spanned by {vi | i ∈ I }. If ξ is the fundamental class of the geometric realization of ∂!n−1 , then φ∗ (ξ ) is a class in Hn−2 (Ꮾ; Z). Thus, under the composition φ h ¯ ∂!n−1 −→ Ꮾ −→ ∂ X,
¯ Z). we have constructed a class in Hn−2 (∂ X; 2.6. Definition [6]. The modular symbol associated to v1 , . . . , vn ∈ V {0} is the ¯ Z) constructed above and is denoted [v1 , . . . , vn ]. If m ∈ Mn (K) class in Hn−2 (∂ X; (n×n matrices over K), then by [m] we mean the modular symbol constructed using the columns of m. 2.7. Proposition. If m ∈ SLn (K), then the construction of [m] given in Definition 2.6 coincides with that given in §2.1. In particular, the classes [m] span ¯ Z). Hn−2 (∂ X; Proof. Recall that we have a homotopy equivalence h : Ꮾ → ∂ X¯ taking a distinguished apartment A0 homeomorphically onto ∂Z (§2.1). This apartment is in
333
SYMPLECTIC MODULAR SYMBOLS
fact φ(∂!n−2 ), where ∂!n−2 is constructed using the basis e1 , . . . , en ∈ V . These ¯ so the identifications are compatible with the action of SLn (K) on V , Ꮾ, and ∂ X, result follows. Modular symbols have the following properties. ¯ Z) satisfies the following. 2.8. Proposition [6]. The map [ ] : Mn (K) →Hn−2 (∂ X; (1) [v1 , . . . , vn ] = (−1)|τ | [τ (v1 ), . . . , τ (vn )], where τ ∈ Sn is a permutation on n letters and |τ | is the length of τ . (2) If q ∈ K, then [qv1 , v2 , . . . , vn ] = [v1 , . . . , vn ]. (3) If the vi are linearly dependent, then [v1 , . . . , vn ] = 0. (4) If v0 , . . . , vn ∈ V {0}, then (−1)i v0 , . . . , vˆi , . . . , vn = 0. i
Furthermore, [ ] is surjective. 2.9. Now assume that ᏻ is a euclidean ring with respect to a multiplicative norm : ᏻ → Z≥0 . Using multiplicitivity, extend the norm to a map : K → Q≥0 . We recall how to identify a -finite spanning set of modular symbols. By a primitive vector, we mean a vector v ∈ ᏻn such that the greatest common divisor of the entries of v is a unit. Let L be an ᏻ-submodule of ᏻn of rank k ≤ n. Since ᏻ is a principal ideal domain, L has a free ᏻ-basis B = {v1 , . . . , vk }. Choose W = {wk+1 , . . . , wn } ⊂ ᏻn such that B W is a K-basis of K n and W may be extended to an ᏻ-basis of ᏻn . We define the index of L by i(L) := det(v1 , . . . , vk , wk+1 , . . . , wn ). It is easy to see that i(L) is independent of the choices above and that i(L) = 1
if and only if (L ⊗ᏻ K) ∩ ᏻn = L.
Furthermore, if L has rank n and is the usual norm NK/Q : K → Q, then i(L) = [ᏻn : L]. We write i(v1 , . . . , vk ) for i(L) if we are given a specific set of linearly independent vectors generating L. If the vi are linearly dependent, we define i(v1 , . . . , vk ) = 0. 2.10. Definition. Let v1 , . . . , vk ∈ ᏻn be linearly independent primitive vectors. Then a candidate for the vi is a primitive x ∈ ᏻn {0}, satisfying 0 ≤ i(x, v1 , . . . , vˆi , . . . , vk ) < i(v1 , . . . , vk ),
1 ≤ i ≤ k.
The following proposition is a fundamental result. 2.11. Proposition [6]. Let v1 , . . . , vk be a linearly independent set of primitive vectors. If i(v1 , . . . , vk ) > 1, then a candidate x for the vi exists.
334
PAUL E. GUNNELLS
Proof. We give the proof since our statement differs slightly from that found in [6]. First, assume k = n. Let L be the lattice spanned by the vi , and let w be a primitive vector in ᏻn that is not in L. Note that w exists since i(L) > 1. Let A ∈ Mn (ᏻ) be the matrix with columns v1 , . . . , vn , and let Ai [w] be the matrix obtained by replacing the ith column of A with w. Since ᏻ is euclidean, there exist αi , βi ∈ ᏻ such that det Ai [w] = αi det A + βi , where 0 ≤ βi < det A. Now let x = w − i αi vi . By our choice of w, the vector x is nonzero. It is easy to check that det Ai [x] = βi . Since 0 ≤ βi < det A and det Ai [x] = i(x, v1 , . . . , vˆi , . . . , vn ), the result follows. Now assume k < n. Since ᏻ is euclidean, L is a free module. Hence, we can choose an isomorphism L ⊗ K → K k that carries (L ⊗ K) ∩ ᏻn onto ᏻk . This brings us back to the full rank setting, and we may argue as above. Notice that x can be written as qi vi , with qi ∈ K satisfying 0 ≤ qi < 1. Furthermore, since the vi are linearly independent, at least one qi = 0. ¯ 2.12. Theorem [6]. As m ranges over SLn (ᏻ), the classes [m] generate Hn−2 (∂ X; Z). Hence, if ⊂ SLn (ᏻ) is torsion-free and of finite index, then the classes [m]∗ provide a finite spanning set of H N (; Z). Proof. Repeatedly applying Propositions 2.11 and 2.8(4), we may write any class [m] as a sum [mi ], where the determinant of each mi is a unit. Then applying Proposition 2.8(2), we may divide the first column of mi by the determinant of det mi to get mi ∈ SLn (ᏻ), satisfying [mi ] = [mi ]. 2.13. Definition. The classes {[m] | m ∈ SLn (ᏻ)} are called unimodular symbols. 3. Symplectic modular symbols. In this section we generalize the results of §2.5 to the symplectic case. In §§3.1 and 3.2 we recall well-known facts about symplectic geometry and the building associated to the symplectic group. In §3.4 we translate results of §2.5 to the symplectic setting, and in §§3.11–3.15 we describe a relation in the homology of the building that is crucial to our finiteness result. Throughout this section we do not assume that the ground field K has a euclidean ring of integers. 3.1. First, we recall some elementary facts about symplectic geometry to fix notation. Fix a field K, and let V be the vector space K 2n . If k ∈ Z, we use the notation k¯ for 2n + 1 − k. We fix a nondegenerate alternating bilinear form , : V → K. A basis v1 , . . . , vn , vn¯ , . . . , v1¯ of V is said to be symplectic if if j = ı¯ and i < j , 1 vi , vj = −1 if j = ı¯ and i > j , 0 otherwise.
SYMPLECTIC MODULAR SYMBOLS
335
We let G = Sp2n (K) be the subgroup of GL2n (K) preserving the form. Thus, Sp2n (K) = {g ∈ GL2n (K) | gv, gw = v, w}. The group G has Q-rank = n. Given any x ∈ V , we define x ⊥ to be the set {y ∈ V | x, y = 0}. If x is nonzero, then x ⊥ is a hyperplane containing x. A subspace F ⊂ V is called isotropic if v, w = 0 for all v, w ∈ F . Any one-dimensional subspace is isotropic, and the largest dimension an isotropic subspace may have is n. An n-dimensional isotropic subspace is called a Lagrangian subspace. 3.2. From now on we exclusively use Ꮾ to denote the building associated to Sp2n (K). We have the following description of Ꮾ as a simplicial complex, analogous to that found in §2.5: vertices of Ꮾ correspond to nonzero isotropic subspaces of V , and simplices of Ꮾ correspond to flags of nonzero isotropic subspaces. To describe the geometry of Ꮾ we must use cross-polytopes instead of simplices, and so we recall their definition. Let e1 , . . . , en be the standard basis of Rn . Then a cross-polytope on 2n vertices is a polytope isomorphic to the convex hull of the points ±e1 , . . . , ±en . Examples are the square (n = 2) and the octahedron (n = 3). Since the proper faces of cross-polytopes are simplicial, we may regard their boundary complexes as simplicial complexes. Let ∂βn be the first barycentric subdivision of the boundary complex of the cross-polytope on 2n vertices. To describe the struc¯ We order this set by ture of ∂βn , we use the notation [[n]]± := {1, . . . , n, n, ¯ . . . , 1}. ¯ 1 < · · · < n < n¯ < · · · < 1. 3.3. Definition (cf. [9]). A nonempty subset I ⊂ [[n]]± is called isotropic if for all i, j ∈ I , we have i = ¯. Note that #I ≤ n for any isotropic I . As in the SLn case, vertices of ∂βn correspond to isotropic subsets of [[n]]± . If we take β to be the convex hull of the points ±e1 , . . . , ±en , then the set I corresponds to the linear span of {ei | i ∈ I }, where eı¯ := −ei . Similarly, simplices of ∂βn correspond to proper flags of isotropic subsets. 3.4.
Now suppose that we are given 2n points v1 , . . . , vn , vn¯ , . . . , v1¯ ∈ V {0}.
3.5. Definition. The set v1 , . . . , vn , vn¯ , . . . , v1¯ is said to satisfy the isotropy condition if, for every isotropic subset I ⊂ [[n]]± , the subspace spanned by {vi | i ∈ I } is isotropic. Notice that this condition is strictly weaker than the requirement that the vi form a symplectic basis. In particular, the columns of any m ∈ Sp2n (K) satisfy this condition. Following §2.5, we define a simplicial map φ : ∂βn −→ Ꮾ by sending the vertex corresponding to an isotropic I ⊂ [[n]]± to the vertex of Ꮾ corresponding to the linear span of {vi | i ∈ I }. Via the composition φ ◦h, we have an induced map on homology taking the fundamental class ξ of the geometric realization ¯ Z). of ∂βn to a class in Hn−1 (∂ X;
336
PAUL E. GUNNELLS
3.6. Definition. Given v1 , . . . , vn , vn¯ , . . . , v1¯ ∈ V {0} satisfying the isotropy con¯ Z) denote the class constructed above. dition, let [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] ∈ Hn−1 (∂ X; This class is called a symplectic modular symbol. If the columns of m ∈ M2n (K) satisfy the isotropy condition, we denote the symplectic modular symbol corresponding to its columns by [m]. Symplectic modular symbols satisfy properties similar to those satisfied by the special linear symbols. For instance, minor modification of the proof of Proposition 2.7 shows that the construction of [m] in Definition 3.6 agrees with that of §2.1, and thus ¯ Z). Furthermore, we have the following analog the modular symbols span Hn−1 (∂ X; of the first parts of Proposition 2.8, whose proof is easily checked. 3.7. Proposition. Symplectic modular symbols enjoy the following properties. (1a) Let τ ∈ Sn be a permutation on n letters. Then
v1 , . . . , vn ; vn¯ , . . . , v1¯ = τ (v1 ), . . . , τ (vn ); τ (vn¯ ), . . . , τ v1¯ ,
where τ (vk ) := vτ (k) and τ (vk¯ ) := v τ (k) for k ∈ [[n]]. (1b) We also have [v1 , v2 , . . . , vn ; vn¯ , . . . , v2¯ , v1¯ ] = −[v1¯ , v2 , . . . , vn ; vn¯ , . . . , v2¯ , v1 ]. (2) If q ∈ K, then [qv1 , v2 , . . . , vn ; vn¯ , . . . , v1¯ ] = [v1 , . . . , vn ; vn¯ , . . . , v1¯ ]. (3) If the vi are linearly dependent, then [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] = 0. 3.8. The symplectic analogue of Proposition 2.8(4) is more complicated and forms one of the main results of this article. To motivate the result, we illustrate it for Sp4 (K) and Sp6 (K). In light of the homotopy equivalence Ꮾ → ∂ X¯ (§2.1), we work ¯ in H∗ (Ꮾ) rather than H∗ (∂ X). 3.9. Example. Let G = Sp4 (K). Then the simplicial complex Ꮾ is a graph with two types of vertices, corresponding to the one- and two-dimensional isotropic subspaces of V . Let [m] = [v1 , v2 ; v2¯ , v1¯ ] be a symplectic modular symbol for Sp4 (K). We may picture the subspace configuration in V = K 4 determined by {vi } by passing to P3 (K). Figure 1 shows the projectivized configuration on the left, along with the apartment in Ꮾ corresponding to [m] on the right. In Ꮾ we represent the one-dimensional (respectively, two-dimensional) isotropic subspaces of V by solid (respectively, hollow) vertices. By abuse of notation we use the same symbol for a point in K 4 , the point it determines in P3 (K), and the vertex it determines in Ꮾ. The lines on the left of Figure 1 are the projectivizations of the Lagrangian planes determined by the vi ; we have not drawn the images of the two non-Lagrangian planes. Now choose x ∈ V {0}. We use x to construct a class in H1 (Ꮾ; Z) homologous to [m]. Recall that x ⊥ is the set of all y ∈ V satisfying x, y = 0. In P3 (K), x ⊥ becomes a hyperplane that, for generic x, determines four new points by intersection with the original Lagrangian lines (see Figure 2). These new points and lines determine four
337
SYMPLECTIC MODULAR SYMBOLS
v1 v1 v2¯
v2¯
v1¯
v2
v2 v1¯ Figure 1. A configuration in P3 (K) and its corresponding apartment
v1 x v2¯
v1¯ v2
Figure 2. Constructing new points
new apartments in Ꮾ, and provide a relation [m] = 4i=1 [mi ] in homology, as in Figure 3. Now suppose that x is not generic. Then, for some subset I ⊂ [[2]]± , we have x, vi = 0 for i ∈ I . Up to symmetry there are three nontrivial possibilities, which we depict in Figure 4. The corresponding apartments appear in Figure 5. Note that in these cases there are fewer apartments in the images of the relations. 3.10. Example. Consider the case G = Sp6 (K). Now, Ꮾ has three kinds of vertices, corresponding to isotropic points, lines, and planes in P5 (K). The configuration in P5 (K) corresponding to a symplectic modular symbol m is combinatorially an octahedron with isotropic faces. We choose a generic point x and construct twelve new points by intersecting x ⊥ with the isotropic lines of the octahedron. In Figure 6 we show the configuration corresponding to [m] along with these constructed points,
338
PAUL E. GUNNELLS
v1
a v1
a
x v2¯ c
v2¯
v1¯
b
b v2
x c
v2 d
d
v1¯ Figure 3. A homology relation
v1 v2¯
v1 v2¯
v1¯
x
v1 v1¯
x
v2
x v1¯
v2¯
v2
v2
¯ {1, 2}, and {1} Figure 4. I = {1, 1},
v1
v2¯
v1
x v1¯
v2
v1 x
v2¯ v1¯
v2
x
v2¯
v2
v1¯
Figure 5
which are shown as hollow dots. (Although the configuration properly lives in P5 (K), we show it in three dimensions for clarity.) Notice that the constructed points satisfy nontrivial linear dependencies: three constructed points are collinear if they lie on an isotropic plane corresponding to a facet of the original octahedron. (These dependen cies are only true in P5 (K).) These dependencies ensure the relation [m] = [mi ] in homology.
SYMPLECTIC MODULAR SYMBOLS
339
Figure 6. A configuration in P5 (K)
3.11. Returning to the general case, let [m] = [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] be a symplectic modular symbol. Let x ∈ V {0}, and let x ⊥ be as above. Let Dx ⊂ [[n]]± be the set of indices such that x, vi = 0 if and only if i ∈ Dx . Given distinct i, j ∈ [[n]]± with i = ¯ and not both i, j ∈ Dx , define xij by
(3) xij = x, vi vj − x, vj vi . If both i and j lie in Dx , then xij is not defined. The point xij lies on the intersection of x ⊥ with the isotropic plane spanned by vi and vj . Now, we define new matrices built from m, x, and the xij . 3.12. Definition. Let [m] be a symplectic modular symbol. Choose x ∈ V {0}, and construct the xij as in (3). If i ∈ Dx , define the matrix mi to be the matrix obtained by altering m according to the following rules: (1) replace vı¯ by x; (2) for j ∈ [[n]]± {i, ı¯}, replace vj by xij . A priori [mi ] may not be a symplectic modular symbol, because its columns might not satisfy the isotropy condition. Hence, we state the following proposition. 3.13. Proposition. For each i ∈ [[n]]± Dx , each [mi ] is a symplectic modular symbol. Proof. It is easy to check that xij , xik = 0 and vi , xij = 0 in all the necessary cases. Since x, xij = 0 by construction, the result follows. As indicated in Example 3.10, the xij satisfy linear dependencies, which we record in the following lemma. 3.14. Lemma. Suppose I = {i, j, k} is an isotropic subset, and #(I ∩ Dx ) ≤ 1. Then the points xij , xj k , and xik are linearly dependent.
340
PAUL E. GUNNELLS
Proof. The points satisfy the identity
x, vk xij = x, vj xik − x, vi xj k . 3.15.
We come now to the main result of this section.
3.16. Theorem. Let [m] = [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] be a symplectic modular symbol for Sp2n (K), and choose x ∈ V {0}. Define Dx as above, and let [mi ] be the ¯ Z), symplectic modular symbols constructed in Definition 3.12. Then in Hn−1 (∂ X; we have [m] = (4) [mi ]. i∈[[n]]± Dx
Proof. As in Examples 3.9 and 3.10, we prove the relation in Hn−1 (Ꮾ; Z). Let A (respectively, Ai ) be the apartment corresponding to the symplectic modular symbol [m] (respectively, [mi ]). We think of these apartments as being explicit simplicial cycles in Ꮾ, and we will show that [m] = i [mi ] by examining these cycles. We begin by fixing some notation. Let (a1 , . . . , ak ) be an ordered tuple of linearly independent points of V lying in a Lagrangian subspace, and let F (a1 , . . . , ak ) ⊂ V be their linear span. Let σ (a1 , . . . , ak ) ⊂ Ꮾ be the simplex corresponding to the flag 0 ⊂ F (a1 ) ⊂ F (a1 , a2 ) ⊂ · · · ⊂ F (a1 , . . . , ak ) ⊂ V . Recall that the (closed) star of a simplex σ in a simplicial complex is the set of all simplices σ meeting σ , as well as the faces of all such σ . Also recall that maximal simplices in Ꮾ are called chambers. First, assume that the point x is generic with respect to the vi , so that Dx = ∅. Consider the column vector vi from [m]. The chambers in A appearing in the star of vi are the simplices of the form (5) σ vi , vk1 , . . . , vkn−1 ⊂ A, where {i, k1 , . . . , kn−1 } ⊂ [[n]]± is isotropic. These chambers correspond to the flags (6) 0 ⊂ F (vi ) ⊂ F vi , vk1 ⊂ · · · ⊂ F vi , vk1 , . . . , vkn−1 ⊂ V . On the right-hand side of (4), in Ai we have the chambers (7) σ vi , xi,k1 , . . . , xi,kn−1 ⊂ Ai . These chambers correspond to the flags 0 ⊂ F (vi ) ⊂ F vi , xi,k1 ⊂ · · · ⊂ F vi , xi,k1 , . . . , xi,kn−1 ⊂ V . (8) The sets of flags in (6) and (8) coincide by the definition of the xij , and these chambers appear with the same orientations on both sides of (4). Taking all permutations of
SYMPLECTIC MODULAR SYMBOLS
341
{k1 , . . . , kn−1 } in (5)–(8), we obtain all chambers in the star of vi . Hence, any chamber in A appears once in a unique Ai with the same orientation. Now, we claim that each of the remaining chambers in the Ai appears exactly twice with opposite orientations. Any such chamber must appear in the star of x or xij , and we first consider the star of x. Choose an apartment Ai and a point xij , and let I ⊂ [[n]]± be a maximal isotropic subset of the form {i, j, k1 , . . . , kn−2 }. Let FI be the Lagrangian subspace, corresponding to I . Consider the chambers with x and xij as vertices, and with all vertices other than x corresponding to subspaces lying in FI . These chambers have the form σ x, xij , xi,k1 , . . . , xi,kn−2 ⊂ Ai (9) or (10)
σ x, xij , xj,k1 , . . . , xj,kn−2 ⊂ Aj .
In (9) and (10), we allow all permutations of {k1 , . . . , kn−2 }. By Lemma 3.14, these chambers correspond to the same isotropic flag. Furthermore, it is not difficult to see that the chambers in (9) and (10) appear in Ai and Aj with opposite orientations. We may apply this argument to any pair {i, j } and any FI with {i, j } ⊂ I , and so all the chambers in the star of x in the Ai cancel each other in (4). To complete the proof for generic x, we investigate any remaining chambers on the right-hand side of (4). These must appear in the stars of the xij . Fix xij , and let I = {i, j, k1 , . . . , kn−2 } and FI be as above. The chambers meeting the star of xij and with all vertices corresponding to subspaces lying in FI are of two types: first, σ xij , xi,k1 , . . . , xi,kn−2 , vi ⊂ Ai (11) and (12)
σ xij , xj,k1 , . . . , xj,kn−2 , vj ⊂ Aj ,
and second, (13)
σ xij , xi,k1 , . . . , xi,kn−2 , x ⊂ Ai
and (14)
σ xij , xj,k1 , . . . , xj,kn−2 , x ⊂ Aj .
In (11)–(14), we allow all permutations of the right n−1 vertices. By Lemma 3.14, the isotropic flags corresponding to (11) and (12) (respectively, (13) and (14)) coincide, and checking orientations shows that these cancel in pairs. This accounts for all the chambers on both sides of (4), and hence the result follows for generic x. Now assume that Dx = ∅. Write Dx = I J , where I = I¯ and J ∩ J¯ = ∅. We claim it is sufficient to assume I = ∅. Indeed, let S = [[n]]± I , and let V ⊂ V be
342
PAUL E. GUNNELLS
the span of {vi | i ∈ S}. Then x ∈ V , and V is a symplectic space whose form , is the restriction of , . Furthermore, the vectors {vi | i ∈ S} define an apartment in the building Ꮾ associated to (V , , ) and thus determine a symplectic modular symbol = (1/2)#S. Writing [m] = [m ] ∈ Hn −1 (Ꮾ ; Z), where n [mi ] in Hn−1 (Ꮾ; Z) is equivalent to writing [m ] = [mi ] in Hn −1 (Ꮾ; Z). Hence, by induction we may assume that Dx contains no subset I with I¯ = I . This is the same as Dx being isotropic, and so up to symmetry the only invariant of Dx is its cardinality. As before, we proceed by investigating the stars of vertices. We merely indicate which chambers cancel which and omit the details. We first consider the case that #Dx ≤ n − 1. Any chamber in the star of vi has the form σ vi , vk1 , . . . , vkn−1 ⊂ A, (15) where {i, k1 , . . . , kn−1 } ⊂ [[n]]± is isotropic. This is matched on the right side of (4) by the chamber σ vi , vk1 , . . . , vkj , xkj ,kj +1 , . . . , xkj ,kn−1 ⊂ Akj . (16) In (16), kj is the first subscript reading from the left that does not appear in Dx . (The possibility that i = kj is included.) This chamber appears with the same orientation as that of (15), and chambers of this form account for all chambers on the left side of (4). In the star of x, the chamber σ x, xij , xi,k1 , . . . , xi,kn−2 ⊂ Ai (17) is canceled by the chamber σ x, xij , xj,k1 , . . . , xj,kn−2 ⊂ Aj , (18) exactly as in the generic case. If #Dx < n − 1, then every chamber in the star of x is of this form up to symmetry. If #Dx = n − 1, the remaining chambers in the star of x have the form σ x, vi1 , . . . , vin−1 , (19) where Dx = {i1 , . . . , in−1 }. This chamber appears in Ak and Ak¯ with opposite orientations, where k and k¯ are the unique elements that extend Dx to an isotropic subset. The remaining chambers on the right side of (4) appear in the stars of the xij where {i, j } ⊂ Dx . These cancel exactly as in (11)–(14). Finally we consider the case #Dx = n. This case is slightly different, since x is in the span of the {vi | i ∈ Dx }. Again we begin with the star of the vi and consider the chamber σ vi , vk1 , . . . , vkn−1 .
SYMPLECTIC MODULAR SYMBOLS
343
If Dx = {i, k1 , . . . , kn−2 }, then this chamber is matched by the chamber (20) σ vi , vk1 , . . . , vkj , xkj ,kj +1 , . . . , xkj ,kn−1 ⊂ Akj , exactly as in (16). Otherwise, if Dx = {i, k1 , . . . , kn−2 }, then this chamber is matched by σ vi , vk1 , . . . , vkn−2 , x ⊂ Akn−1 . Now consider the star of x. If {i, j }∩Dx = ∅, then we have cancellation as in (17) and (18). Otherwise, write Dx = {i1 , . . . , in }. Then in the remaining chambers, σ x, vi1 , . . . , vin−1 ⊂ Ain cancels σ x, vi1 , . . . , vin−2 , vin ⊂ Ain−1 . Finally, the chambers in the star of the xij with {i, j } ⊂ Dx cancel exactly as in (11)–(14). This completes the proof. 3.17. Remark. Theorem 3.16 can be proven in more generality than stated here. For example, the proof applies to buildings associated to the odd orthogonal groups SO2n+1 and can be modified to work for buildings associated to the even orthogonal groups SO2n . Using the notion of a “perspectivity” (see [16]), we may prove a similar result for buildings of type G2 . We would like to have a “building-theoretic” proof of Theorem 3.16. 4. Finiteness. Throughout this section we assume that ᏻ is a euclidean ring with respect to the norm : ᏻ → Z≥0 . We also assume, using Proposition 3.7(2), that all modular symbols have integral columns. ¯ Z) a unimodular symbol. 4.1. If m ∈ Sp2n (ᏻ), we call the class [m] ∈ Hn−1 (∂ X; ¯ Z). The goal of this section is to prove that the unimodular symbols span Hn−1 (∂ X; We begin with a notion that lets us measure how far a symplectic modular symbol is from being unimodular. In contrast to the special linear case (§2.9), we use the symplectic pairing rather than the determinant as a measure of non-unimodularity. 4.2. Definition. Let [m] be a symplectic modular symbol with primitive columns. The depth of [m] = [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] is the number d(m) := Max vi , vı¯ . i∈[[n]]
Notice that d(m) = 1 implies that each vi , vı¯ ∈ ᏻ× . Hence, if d(m) = 1, we may divide the columns of m by appropriate units to obtain m ∈ Sp2n (ᏻ) satisfying ¯ Z), [m] = [m ]. Thus, in order to show that the unimodular symbols span Hn−1 (∂ X;
344
PAUL E. GUNNELLS
it is sufficient to show that through a homology we may replace a modular symbol with depth greater than 1 with a cycle of modular symbols, each of which has smaller depth. 4.3. Lemma. Let [m] = [v1 , . . . , vn ; vn¯ , . . . , v1¯ ] be a symplectic modular symbol with primitive columns. If d(m) > 1, then i(v1 , . . . , vn ; vn¯ , . . . , v1¯ ) > 1, and there exists a candidate for the vi . Proof. Assume that i(v1 , . . . , vn , vn¯ , . . . , v1¯ ) = 1. Then the lattice generated by the v1 is ᏻn , and thus m ∈ GL2n (ᏻ). Therefore vi , vı¯ ∈ ᏻ× , det(m) = i∈[[n]]
and d(m) = 1, a contradiction. This implies i(v1 , . . . , vn , vn¯ , . . . , v1¯ ) > 1, and a candidate exists by Proposition 2.11. 4.4. Suppose that [m] = [v1 , . . . , vn , vn¯ , . . . , v1¯ ] is a symplectic modular symbol with d(m) > 1, and let x be a candidate for m. As in §2.9, write qi v i x= i∈[[n]]±
with qi ∈ K satisfying 0 ≤ qi < 1. Define [mi ] as in Definition 3.12, so that (21) [mi ]. [m] = i∈[[n]]± Dx
Notice that for i ∈ [[n]]± Dx , we have x, vi = qı¯ vı¯ , vi < vı¯ , vi ≤ d(m), so that the norm of at least one of the symplectic pairings in each mi has decreased. However, it is not true that d(mi ) < d(m) in general, and so more than just the construction of x is required to show that the unimodular symbols span. The following example illustrates our strategy. 4.5. Example. Consider the case of G = Sp6 (K). As in Example 3.10, [m] corresponds to an octahedron in P5 (K). Suppose that x is a candidate for [m] and that Dx = ∅. Construct the modular symbols [mi ]. Then the configuration in P5 (K) corresponding to [m1 ] is the octahedron with vertices v1 , x, and the four constructed points lying on the lines in the original configuration in the link of v1 . (See Figure 7.) The point x has been chosen so that x, v1 < d(m), but in general x12 , x12¯ and x13 , x13¯ will be larger than d(m). However, we claim that i(x12 , x13 , x13¯ , x12¯ ) > 1 and that we may identify the quadruple (x12 , x13 , x13¯ , x12¯ ) with a modular symbol [m ] for Sp4 (K). This means we may argue inductively and use the reduction of this “link” modular symbol to reduce [m1 ]. (See Figure 8.)
345
SYMPLECTIC MODULAR SYMBOLS
v1 x13¯ v3¯
x12¯
x12
v2
v2¯ x13 x
v3
Figure 7. The configuration corresponding to [m1 ]
Figure 8. Reducing the link modular symbol
4.6. To study the [mi ], we first prove a version of the Hermite normal form for our matrix representatives. 4.7. Lemma. Let m ∈ M2n (ᏻ) be a matrix with nonzero, primitive columns, and assume that m satisfies the isotropy condition. Then there is a matrix γ ∈ Sp2n (ᏻ) such that γ m is upper-triangular and γ m satisfies the isotropy condition. Proof. The left action of Sp2n (ᏻ) corresponds to row operations on m, so we begin by listing the elementary symplectic row operations. Let {ri | i ∈ [[n]]± } denote the rows of m, and use the notation a ← b to mean that the row vector a is to be replaced by the expression b. Then we find that we may effect the following: (T1)
ri ←− ri + r ı¯ .
346
PAUL E. GUNNELLS
For 1 ≤ i < k ≤ n, ri ←− ri + rk ,
(T2)
r k¯ ←− r k¯ − r ı¯ ,
and ri ←− ri + r k¯ ,
(T3)
rk ←− rk + r ı¯ .
Also ri ←− r ı¯ ,
(P1)
r ı¯ ←− −ri .
If τ ∈ Sn , then ri ←− rτ (i) .
(P2)
Here T stands for transvection and P stands for permutation. In (P2) we mean that τ permutes the first n rows among themselves, with the action on the last n rows determined by τ (¯ı ) := τ (i). Furthermore, the inverses and transpositions of these operations are also elementary row operations. Now, using operations of type (T1) and (P1), we may carry m into the form
m11 .. . mn,1 0 . .. 0
m12 .. .
···
m12 ¯
···
mn,2 mn,2 ¯ .. .
··· ···
m11¯ .. . mn,1¯ . mn, ¯ 1¯ .. . m1¯ 1¯
In the first column, the entry in row i is the greatest common divisor of the original entries in row i and row ı¯. Next, using operations of type (T2) and (P2), we can take m into the form m11 m12 · · · m11¯ 0 m22 · · · m21¯ .. .. . .. . . . 0
m12 ¯
···
m1¯ 1¯
SYMPLECTIC MODULAR SYMBOLS
347
Because left multiplication by γ ∈ Sp2n (ᏻ) preserves the isotropy condition, it follows that m actually has the form m11 m12 · · · m11¯ 0 m22 · · · m21¯ .. .. .. . . . . .. . m22 · · · m2¯ 1¯ ¯ 0 ··· 0 m1¯ 1¯ Now we may apply the induction hypothesis to the middle (2n − 2) × (2n − 2) block to complete the proof of the lemma. 4.8. Let [m] be a symplectic modular symbol with m upper-triangular, and let x be a candidate for m. Without loss of generality, we may assume v1 = e1 by Proposition 3.7(3) and that 1 ∈ Dx . We want to study the modular symbol [m1 ] from (21) in greater detail. 4.9. Lemma. Suppose that m is upper-triangular with v1 = e1 . Let the points x1,j j ∈ [[n]]± 1, 1¯ and the matrix m1 be defined as in Definition 3.12. Let X be the 2n×(2n−2) matrix with columns the vectors x12 , . . . , x1,n , x1,n¯ , . . . , x12¯ . That is, X is the matrix obtained by deleting the first and last columns from m1 . ¯ by Define points wj for j ∈ [[n]]± {1, 1}
wj = ej , x e1 − e1 , xej , and let W be the 2n × (2n − 2) matrix with columns the vectors w2 , . . . , wn , wn¯ , . . . , w2¯ . Let m be the central (2n − 2) × (2n − 2) block of m1 . Then X = W m . Proof. Given a column vector v, let v k be the entry in the kth row. Suppose first that j ≤ n. Then, by computation, using Definition 3.12, we find ¯
1 = x 2 m2,j + · · · + x ¯ mjj x1,j
and
¯
k x1,j = −x 1 mkj
for 2 ≤ k ≤ j .
348
PAUL E. GUNNELLS
On the other hand, W has the form ¯ ¯ x3 x2 1¯ −x 0 ¯ 0 −x 1 . .. . . . 0 ···
··· ··· .. . ..
.
0
x2 0 .. .
, 0 ¯ −x 1
and so x1,j is this matrix times the j th column of m . The computation is similar if j > n, so the result follows. 4.10.
We come now to the main result of this article.
¯ Z). 4.11. Theorem. As m ranges over Sp2n (ᏻ), the classes [m] span Hn−1 (∂ X; Proof. By the paragraph following Definition 4.2, it is sufficient to show that if [m] satisfies d(m) > 1, then [m] = [mα ], where d(mα ) < d(m). We proceed by induction. Since SL2 (ᏻ) = Sp2 (ᏻ), the statement is true by the usual modular symbol algorithm. So we assume the statement is true for Sp2k (ᏻ) with k < n. Assume that d(m) > 1, and let x be a candidate for m. Write (22) [mi ]. [m] = i∈[[n]]± Dx
We permute the columns of each [mi ] so that vi is the first column. Choose an [mi ] from the right-hand side of (22). Permuting labels if necessary, we can assume i = 1. Because candidate selection and the relation (4) are Sp2n (ᏻ)-equivariant, using Lemma 4.7 we may assume m1 is upper-triangular. As in Lemma 4.9, we construct the matrix m and the vectors wj . Since m is a (2n − 2) × (2n − 2) matrix satisfying the isotropy condition, we have that [m ] corresponds to a class in Hn−2 (Ꮾ ; Z), where Ꮾ is the building associated to Sp2n−2 (K). By the induction hypothesis, we may write (23) mα , [m ] = α∈A
where mα ∈ Sp2n−2 (ᏻ), and the sum is finite. We may use the [mα ] to write (24) [mα ] [m1 ] = α∈A
as follows. Each mα corresponds to an endomorphism of the ᏻ-module generated by the wj . We may apply mα to W to produce a 2n × (2n − 2) matrix Wα . Then mα is the matrix with first column e1 , last column x, and with middle columns Wα . The
SYMPLECTIC MODULAR SYMBOLS
349
induction hypothesis asserts that the columns of Wα form an ᏻ-basis of the ᏻ-module generated by the wj . In particular,
d(mα ) ≤ wj , w¯ = x, v1 2 . We claim that we may reduce each [mα ] further to write [mα ] = [mαβ ], β∈B
where d(mαβ ) ≤ x, v1 . To see this, consider the j and ¯ columns of W : t ¯ wj = x ¯ , 0, . . . , 0, −x 1 , 0, . . . , 0 and t ¯ w¯ = − x j , 0, . . . , 0, −x 1 , 0, . . . , 0 , ¯
where −x 1 appears in the j th and ¯th rows, respectively. Assume first that these ¯ vectors are primitive. Then some multiple of wj added to w¯ will be divisible by x 1 . This means wj and w¯ generate a lattice L satisfying ¯ i(L) ≥ x 1 = x, v1 . Therefore, we may apply the SL2 -modular symbol algorithm to the pair (wj , w¯ ) to reduce L to (L ⊗ K) ∩ ᏻn . If either wj or w¯ is not primitive, then a simple modification of this argument achieves the same end. Applying this to each pair (wj , w¯ ) and adapting the construction that passes from (23) to (24), we find [mα ] = (25) [mαβ ], β∈B
with d(mαβ ) ≤ x, v1 . Together, (24) and (25) imply [m1 ] = [mαβ ]
with d(mαβ ) ≤ x, v1 < d(m).
α∈A β∈B
Because this argument may be applied to any of the [mi ] from (22), this completes the proof of the theorem.
350
PAUL E. GUNNELLS
4.12. Corollary. If ⊂ Sp2n (ᏻ) is torsion-free of finite index, and N is the cohomological dimension of , then the classes [m]∗ provide a finite spanning set of H N (; Z). References [1]
[2] [3] [4] [5] [6] [7]
[8] [9] [10] [11]
[12] [13] [14] [15] [16] [17] [18]
G. Allison, A. Ash, and E. Conrad, Galois representations, Hecke operators and the mod-p cohomology of GL(3, Z) with twisted coefficients, Experiment. Math. 7 (1998), 361– 390. A. Ash, A note on minimal modular symbols, Proc. Amer. Math. Soc. 96 (1986), 394–396. , Galois representations attached to mod p cohomology of GL(n, Z), Duke Math. J. 65 (1992), 235–255. A. Ash and M. McConnell, Experimental indications of three-dimensional Galois representations from the cohomology of SL(3, Z), Experiment. Math. 1 (1992), 209–223. 4 extension of Q attached to a non-selfdual automorA. Ash, R. Pinch, and R. Taylor, An A phic form on GL(3), Math. Ann. 291 (1991), 753–766. A. Ash and L. Rudolph, The modular symbol and continued fractions in higher dimensions, Invent. Math. 55 (1979), 241–250. A. Ash and W. Sinnott, An analogue of Serre’s conjecture for Galois representations and Hecke eigenclasses in the mod-p cohomology of GL(n, Z), preprint, available at http://can.dpmms.cam.ac.uk/Algebraic-Number-Theory/0185/index.html. A. Borel and J.-P. Serre, Corners and arithmetic groups, Comm. Math. Helv. 48 (1973), 436–491. I. M. Gelfand and V. V. Serganova, On the general definition of a matroid and a greedoid, Dokl. Akad. Nauk SSSR 292 (1987), 15–20. P. Gunnells, Computing Hecke eigenvalues below the cohomological dimension, to appear in Experiment. Math. R. MacPherson and M. McConnell, “Classical projective geometry and modular varieties” in Proceedings of the JAMI Inaugural Conference, Johns Hopkins University Press, Baltimore, 1989, 237–290. , Explicit reduction theory for Siegel modular threefolds, Invent. Math. 111 (1993), 575–625. J. Schwermer, On arithmetic quotients of the Siegel upper half space of degree two, Compositio Math. 58 (1986), 233–258. , Eisenstein series and cohomology of arithmetic groups: the generic case, Invent. Math. 116 (1994), 481–511. , On Euler products and residual Eisenstein cohomology classes for Siegel modular varieties, Forum Math. 7 (1995), 1–28. J. Tits, Sur la trialité et certains groupes qui s’en déduisent, Inst. Hautes Études Sci. Publ. Math. (1959), 13–60. , Buildings of Spherical Type and Finite BN-Pairs, Lecture Notes in Math. 386, SpringerVerlag, Berlin, 1974. B. van Geemen and J. Top, A non-selfdual automorphic representation of GL3 and a Galois representation, Invent. Math. 117 (1994), 391–401.
Department of Mathematics, Columbia University, New York, New York 10027, USA;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
EXCEPTIONAL INTEGERS FOR GENERA OF INTEGRAL TERNARY POSITIVE DEFINITE QUADRATIC FORMS RAINER SCHULZE-PILLOT
0. Introduction. In [2] it was shown that in a certain sense most integers represented by some form in the genus of a given integral ternary positive definite quadratic form are represented by all forms in this genus. More precisely, let (L, q) be a Zlattice of rank 3 with integral positive definite quadratic form. Then all sufficiently large integers a that are represented primitively by some lattice in the spinor genus of (L, q) are represented by all lattices in that spinor genus (see corollary to Theorem 3 in [2]). The theorems of that article actually imply a slightly sharper characterization of the set of exceptional integers that are represented by some forms in the genus of L but not by all of them. Since there seems to be some interest to have available a characterization of this set that is as sharp as possible, I give such a description and some examples in this note. I also comment on the question of effectivity of the results and on results for primitive representations. As in [2], a special role is played by the integers t in the square class of a primitive spinor exception, that is, in the square class of an integer represented primitively by some but not by all spinor genera in the genus of (L, q). The Fourier coefficients of the spinor generic theta series at integers tp 2 for primes p in certain arithmetic progressions do not grow for growing p. In view of the positivity of the Fourier coefficients of theta series, this implies that the Shimura lift with respect to such a square class of the difference of the theta series of lattices in the same spinor genus omits these primes in its Fourier expansion. One might therefore be tempted to speculate about a connection to CM-forms. By looking at an example, we see, however, that this is not the case; in general one has to expect that one is looking at the sum of a cusp form and of its quadratic twist. Acknowledgement. I thank Peter Sarnak for stimulating discussions on the questions mentioned above. 1. Exceptional integers. Let (L, q) as above be a quadratic lattice of level N, that is, for the dual lattice L# , we have q(L# )Z = N −1 Z. Let d denote the discriminant of (L, q). Let T denote the (finite) set of primes p for which (L, q) remains anisotropic over the p-adic completion Qp (for p ∈ T , we have p | N) and write q(L) for the set of numbers represented by some lattice in the genus of (L, q) (or equivalently, locally everywhere by (L, q)), qr (L) for the set of t ∈ q(L) that are divisible at Received 5 March 1999. 1991 Mathematics Subject Classification. Primary 11E45; Secondary 11E12, 11E20, 11F27. 351
352
RAINER SCHULZE-PILLOT
most to the rth power by the primes in T , and q ∗ (L) for the set of t ∈ q(L) that are represented primitively by some lattice in the genus of (L, q). For t ∈ qr (L), there is an integer m (which is bounded) consisting only of primes in T such that t/m2 ∈ q ∗ (L); all statements about representation of sufficiently large t ∈ qr (L) are therefore immediately reduced to the corresponding statements for sufficiently large t ∈ q ∗ (L). Recall the following facts about representation of numbers by spinor genera of ternary lattices (see [3], [8], [10], and [11]): • There is a finite set of square classes ti Z2 (the spinor exceptional square classes) such that numbers in q(L) outside these square classes are represented (primitively if they are in q ∗ (L)) by all spinor genera in the genus of L (i.e., in each spinor genus, there is at least one lattice representing the number), and all spinor genera have the same measure of representation (or Darstellungsmaß) for such an integer. An integer t whose square-free part does not divide d is not in any of these exceptional square classes. • For each of the spinor exceptional square classes, the set of spinor genera in the genus of (L, q) is divided into two half-genera containing equally many spinor genera such that all spinor genera in the same half-genus (primitively) represent the same numbers in that square class (and with equal representation measures). The numbers that are (primitively) represented only by one of the half-genera are called the (primitive) spinor exceptions of the genus; if t is a primitive spinor exception and m is an integer prime to the level N , then tm2 is a primitive spinor exception too. The sets of (primitive) spinor exceptions have√been explicitly determined in [10] and [4]. • For each ti from above, let Ei = Q( −dti ). If p is a prime that splits in Ei /Q, then for all t ∈ ti Z2 , the integer tp2 is a (primitive) spinor exception if and only if t is a (primitive) spinor exception and t and tp2 are represented by the same spinor genera in the genus of (L, q). • Let ti , Ei be as above and let p be a prime that is inert in Ei /Q. Let t ∈ ti Z2 be a primitive spinor exception of the genus of (L, q) represented by the halfgenus of spn(L). If p |N, then tp2 is primitively represented by the other halfgenus not containing spn(L) (but not by the half-genus of L). In particular, the tm2 with (m, N ) = 1, for which at least one prime factor of m is inert in Ei /Q, are primitive spinor exceptions but not spinor exceptions. If p | N , then either there is a ν0 depending on N such that tp2ν is not a primitive spinor exception for ν ≥ ν0 or the tp 2ν behave in the the same way as in the case p |N. • Let ti , Ei be as above and let t ∈ ti Z2 , p be a prime. If p is ramified in Ei /Q and p 2ν divides t for ν ∈ N large enough (depending on N), then t is neither a spinor exception nor a primitive spinor exception. We proved in [2] for positive definite (L, q) that all sufficiently large integers that are primitively represented by the spinor genus of (L, q) are represented by all lattices in that spinor genus and gave an asymptotic formula for the number of representations. However, we made no statement about the representation behaviour
EXCEPTIONAL INTEGERS OF TERNARY QUADRATIC FORMS
353
of those integers that are primitive spinor exceptions but not spinor exceptions: If they are sufficiently large, they are represented by all lattices in the spinor genera representing them primitively and by at least one lattice in each of the spinor genera in the other half of the set of spinor genera in the genus. The following theorem shows that most of these integers are also represented by all lattices in the genus with possibly (and usually) infinitely many exceptions. Theorem. There is some constant c depending on N and r such that all t ∈ qr (L) with t ≥ c are represented by all lattices in the genus of (L, q) unless one of the following conditions is satisfied: (i) t is a spinor exceptional integer, in which case it is represented by all lattices in the half-genus representing t and by no lattice in the other half-genus; (ii) t/p 2 is a spinor exceptional integer for some prime p that is inert in the quadratic extension E/Q associated to the square class of t, in which case t is represented by all lattices in the half-genus not representing t/p 2 and by precisely those lattices in the other half-genus that represent t/p 2 (hence by all of them if t/p 2 ≥ c holds). Proof. We assume without loss of generality that t ∈ q ∗ (L). By the results of [2], there is a constant c0 depending on N such that all integers t ≥ c0 in qr (L) that are primitively represented by some lattice in the spinor genus of (L, q) are represented by all lattices in the spinor genus of (L, q). Choosing c ≥ c0 , we therefore have only to deal with t that are represented by the spinor genus of (L, q) but are not represented primitively by that spinor genus. Let E/Q be the quadratic extension associated to the square class of t. Let tˆ ∈ q ∗ (L) ∩ t (Q× )2 be such that all representations of tˆ by the lattices in gen(L) are primitive; we call such a number a primitive element of q ∗ (L) and notice that such numbers are almost square-free in the following sense: There is some constant c1 depending on N such that primitive elements of q ∗ (L) are divisible by m2 only for m ≤ c1 (this is an easy consequence of the well-known representation properties of local lattices [9]). Hence there is a constant c2 depending on N such that tˆ ≤ c2 holds for all primitive elements tˆ of q ∗ (L) that are in one of the spinor exceptional square classes. We can choose tˆ such that it divides our given t, write t = tˆm2 , and decompose m = ma mr mi ms , where ma is the largest divisor of m consisting only of primes p for which Lp is anisotropic, mr is the largest divisor of m consisting only of primes ramified in E/Q and prime to ma , and mi is the largest divisor of m consisting only of primes inert in E/Q and prime to ma . By assumption, the “anisotropic part” ma of m is bounded, and we can restrict ourselves to the case ma = 1; in particular, we can assume t ∈ q ∗ (L). Since, by assumption, t is not represented primitively by the spinor genus of L, it is a primitive spinor exception. The results of [4] quoted above imply then that mr is bounded as well, and we can restrict to mr = 1 (the restrictions made being justified by suitably enlargening c0 ). Let K be a lattice in the genus of L representing tˆ. The integer t1 := tˆm2s is then a spinor exception that is primitively represented by some lattice K in the half-genus
354
RAINER SCHULZE-PILLOT
of K; it is (primitively) represented by all classes in that genus if t ≥ c0 by the results of [2]. We are left with the case that t1 ≤ c0 and mi = 1 hold. We choose c ≥ c02 . If mi is composite, it has at least one proper divisor mi such that t1 m 2i ≥ c0 holds. Hence t1 m 2i is represented (primitively) by all classes in one half-genus H1 in the genus of L, and for any prime p | (mi /mi ), we see that t1 m 2i p 2 is represented (primitively) by all classes in the other half-genus H2 of the genus of L and represented (but not primitively) by all classes in H1 . Then, of course, t is represented by all classes in the genus of L as well, and our assertion is proved. Remark 1. An anologous result is true for representations with additional congruence conditions. The proof is the same as above. Remark 2. The result we gave in [2] is not effective. It would be no principal problem to make the estimates on the error term in our asymptotic formula effective; in fact, for the estimates of Fourier coefficients of modular forms of half-integral weight greater than or equal to 5/2, this was done in the Diplom thesis (or Diplomarbeit) of M. Bienert [1]. However, as already remarked in [2], the growth of the main term in our asymptotic formula is of the same order of magnitude as the growth of the class number of imaginary quadratic fields. The effective bound of Goldfeld [5], following from the work of Gross and Zagier, for the class number is much too weak for the present purpose. It is, however, well known that under the assumption of the generalized Riemann hypothesis, the class number bound h(d) d 1/2− can be made effective. In fact, we have (from Siegel’s proof of his class number estimate), √ for any 0 < < 1 for which the Dedekind zeta function ζD (s) of Q( −D) satisfies ζD (1 − /2) ≤ 0, the estimate wD 1/2−(1−2) h(D) > 4πe4π √ (where w is the number of units of Q( −D). Under the assumption of the generalized Riemann hypothesis, our asymptotic formula (and the sharpening in the theorem above) therefore becomes effective too. The author is frequently asked whether the results of [2] also hold for primitive representations. We take this occasion to state this fact in a corollary. Corollary. There is some constant c∗ depending on N and r, such that all t ∈ qr (L) with t ≥ c∗ that are represented primitively by some lattice in the spinor genus of (L, q) are represented primitively by all lattices in the spinor genus of (L, q). The same is true for representations with additional congruence conditions. Moreover, the primitive representations of t satisfying these conditions by (L, q) are asymptotically equidistributed in the sense of [2, Theorem 3] on the ellipsoid surface {x ∈ L ⊗ R | q(x) = t}. Proof. Denote by r(L, q, t) = #{x ∈ L | q(x) = t} the number of representations of t by (L, q) and by r ∗ (L, q, t) the number of primitive representations. By the
EXCEPTIONAL INTEGERS OF TERNARY QUADRATIC FORMS
355
Möbius inversion formula, we have r ∗ (L, q, t) =
d 2 |t
As usual, we denote by
t µ(d)r L, q, 2 . d
r spn(L, q), t :=
{K} r(K, q, t)/|O(K)|
{K} 1/|O(K)|
(where the summation goes over a set of representatives of the classes of lattices in the spinor genus of L and where |O(K)| is the order of the group of units or isometries of (K, q)) the weighted mean of the representation numbers of t by the lattices in the spinor genus of L and analogously for the averaged primitive representation number r ∗ (spn(L, q), t). Obviously, we then have t µ(d)r spn(L, q), 2 , r ∗ spn(L, q), t = d 2 d |t
and hence
t t r ∗ (L, q, t) − r ∗ spn(L, q), t = µ(d) r L, q, 2 − r spn(L, q), 2 . d d 2 d |t
The number of terms in the sum on the right-hand side is at most the number of divisors of t, hence O(t ) for all > 0. Each summand µ(d)(r(L, q, t/d 2 ) − r(spn(L, q), t/d 2 )) is estimated as a cusp form coefficient in the same way as in [2], and the main term r ∗ (spn(L, q), t) is at least of the order of magnitude of t 1/2− as in [2]. This proves the first assertion; the remainder of the corollary is proved in the same way as in [2]. 2. Shimura correspondence. As usual, ϑ(K, z) = x∈K exp(2πiq(x)z) is the theta series of the lattice K, and {K} ϑ(K, z)/|O(K)| ϑ spn(L), z := {K} 1/|O(K)| (where the summation goes over a set of representatives of the classes of lattices in the spinor genus of L and where |O(K)| is the order of the group of units or isometries of (K, q)) is the theta series of the spinor genus of L. We proved in [11], [12] that the Shimura lift with respect to any t of ϑ(L, z)−ϑ(spn(L), z) is cuspidal. Let us consider the following example taken from [7]: We look at the lattice K giving the quadratic form 4x 2 + 48y 2 + 49z2 + 48yz + 4xz and the lattice K giving the quadratic form x 2 + 48y 2 + 144z2 . The forms are in the same spinor genus, with another spinor genus in the same genus consisting of the lattice L with quadratic
356
RAINER SCHULZE-PILLOT
form 9x 2 + 16y 2 + 48z2 and the lattice L with quadratic form 16x 2 + 25y 2 + 25z2 + 14yz + 16xz + 16xy. The exceptional square class√is just the set of integral squares, and the associated quadratic extension is E = Q( −3) by [10]. Write f (z) = ϑ(K , z) − ϑ(K, z) and g(z) = ϑ(L, z) − ϑ(L , z). By results from [12], f, g are “good” cusp forms, that is, forms whose Shimura lifting is cuspidal. The explicit calculation of T (p 2 ) acting on theta series given in [12] also shows that T (p 2 )f is a scalar multiple λp f of f for p ≡ 1 mod 3 and λp g of g for p ≡ −1 mod 3, and vice versa with λp replaced by some µp for g. This follows from the fact that T (p2 )f is in the first case, a cusp form in the one-dimensional space of cusp forms generated by ϑ(K, z), ϑ(K , z), and in the second case, a cusp form in the one-dimensional space of cusp forms generated by ϑ(L, z), ϑ(L , z). In fact, an explicit (computer-assisted) calculation of the theta series and their T (p 2 )-images for the first few primes p shows that we have T (p2 )f = λp f , T (p 2 )g = λp g (respectively, T (p 2 )g = λp f , T (p 2 )f = λp g) with λp = −2, 0, −4, −2, 4 for p = 5, 7, 11, 13, 19. But since for p, p ≡ 1 mod 3 and q, q ≡ −1 mod 3, we have λp µq = λq µp and λq µq = µq λq , by the commutativity of the Hecke algebra, we see that λp = µp holds for all p = 2, 3. The associated eigenforms f + g, f − g and their Shimura lifts are thus finally seen to have Hecke eigenvalues λp , χ (p)λp , where χ(p) = (−3/p) is the quadratic character associated to E/Q, that is, they are quadratic twists of each other. In general, the situation may be slightly more complicated, but the apparent lacunarity of the Fourier coefficients of ϑ(spn(L))−ϑ(L) in the square class of a primitive spinor exception is also caused by adding up character twists of cusp forms for the quadratic character associated to the square class in question. References [1] [2]
[3] [4] [5] [6] [7] [8] [9] [10]
M. Bienert, Fourier-Koeffizienten von Modulformen halbganzen Gewichts vom Nebentyp, Diplomarbeit, Universität zu Köln, 1996. W. Duke and R. Schulze-Pillot, Representation of integers by positive ternary quadratic forms and equidistribution of lattice points on ellipsoids, Invent. Math. 99 (1990), 49–57. A. Earnest, Representation of spinor exceptional integers by ternary quadratic forms, Nagoya Math. J. 93 (1984), 27–38. A. Earnest, J. S. Hsia, and D. Hung, Primitive representations by spinor genera of ternary quadratic forms, J. London Math. Soc. (2) 50 (1994), 222–230. D. Goldfeld, Gauss’s class number problem for imaginary quadratic fields, Bull. Amer. Math. Soc. (N.S.) 13 (1985), 23–37. J. S. Hsia, Representations by spinor genera, Pacific J. Math. 63 (1976), 147–152. B. W. Jones and G. Pall, Regular and semiregular positive definite ternary quadratic forms, Acta Math. 70 (1940), 165–191. M. Kneser, Darstellungsmaße indefiniter quadratischer Formen, Math. Z. 77 (1961), 188–194. O. T. O’Meara, Introduction to Quadratic Forms, 2d ed., Grundlehren Math. Wiss. 117, Springer-Verlag, New York, 1971. R. Schulze-Pillot, Darstellung durch Spinorgeschlechter ternärer quadratischer Formen, J. Number Theory 12 (1980), 529–540.
EXCEPTIONAL INTEGERS OF TERNARY QUADRATIC FORMS [11] [12]
357
, Darstellungsmaße von Spinorgeschlechtern ternärer quadratischer Formen, J. Reine Angew. Math. 352 (1984), 114–132. , Thetareihen positiv definiter quadratischer Formen, Invent. Math. 75 (1984), 283–299.
Fachbereich 9 Mathematik, Bau 27, Universität des Saarlandes, Postfach 15 11 50, D-66041 Saarbrücken, Germany;
[email protected]
Vol. 102, No. 2
DUKE MATHEMATICAL JOURNAL
© 2000
FORMAL CHOW GROUPS, p-DIVISIBLE GROUPS, AND SYNTOMIC COHOMOLOGY TAKAO YAMAZAKI
Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 359 2. Syntomic complex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363 3. Calculation of PD-algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 368 4. Surjectivity of symbol map . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 370 5. The Leray spectral sequence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 374 6. Syntomic cohomology and p-divisible groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . 379 7. K-cohomological functor and p-divisible groups . . . . . . . . . . . . . . . . . . . . . . . . . 384 Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 386 1. Introduction. Let K be an absolutely unramified complete discrete valuation field of mixed characteristic (0, p) with perfect residue field k, and V a smooth projective variety over K. In this article, we consider the Chow groups CHr (V ) in the infinitesimal method, which was proposed by Spencer Bloch (cf. [5, p. 24]). We review this method in what follows. First of all, recall the modified version of the Bloch-Quillen formula (cf. [16]) 1 1 M ∼ CHr (V ) ⊗ Z , ⊗Z = H r V , r,V (r − 1)! (r − 1)! M the Zariski sheaf of Milnor K -groups where, for any scheme Z, we denote by r,Z r on Z, that is, the sheafification of the presheaf that associates the Milnor K-group KrM ( (U, ᏻU )) to an open subscheme U of Z. Here, for any commutative ring R, we denote by KrM (R) the group R ∗⊗r /H, where H is the subgroup of R ∗⊗r generated by elements of the form x1 ⊗ · · · ⊗ xr with xi + xj = 0 or 1 for some i = j . If r is equal to the dimension of V , the above formula is valid without tensoring Z[1/(r − 1)!] (cf. [10]). Now assume that V admits a smooth projective model X over the valuation ring W of K (i.e., W = W (k) is the ring of Witt vectors of k). Let Ꮽ be the category of Artinian local W -algebras that have k as their residue field. For any object A of Ꮽ,
Received 27 April 1999. Revision received 20 September 1999. 1991 Mathematics Subject Classification. Primary 14G20; Secondary 11G25, 14F30, 19E20. Author’s research supported by Japan Society for the Promotion of Science (JSPS) Research Fellowships for Young Scientists. 359
360
TAKAO YAMAZAKI
we write XA = X ⊗W A. Set Y = Xk . For r > 0, define M M M r,X = ker r,X −→ r,Y , A ,Y A where the map is induced by the reduction map A → k. Note that since XA and Y M M as have the same underlying topological space, we can regard both r,X and r,Y A sheaves on (XA )Zar = YZar . For r > 0 and q ≥ 0, we define the K-cohomological M ) as functor Hˆ q (r,X M M (A) = H q XA , r,X . Hˆ q r,X A ,Y This is a covariant functor from Ꮽ to the category of abelian groups. According to the M ) as the formal completion of CHr (V ). Bloch-Quillen formula, we regard Hˆ r (r,X It seems natural to expect that it reflects an infinitesimal behavior of CHr (V ) at M ), mainly when the origin. The aim of this article is to study the functor Hˆ q (r,X q = dim V . As an application of our theory, we prove the following theorem, which was conjectured by Bloch (cf. [5, p. 24]). Let Ꮽ0 be the full subcategory of Ꮽ consisting of m m all objects A such that A/pA ∼ = k[T1 , . . . , Ts ]/(T1 1 , . . . , Ts s ) for some s ≥ 0 and m1 , . . . , ms ≥ 1. (For more explanation of this category, see (1) below.) Theorem A. Assume p ≥ 3. Let X be a projective smooth scheme over W of relative dimension d. Assume that d < p and that the de Rham cohomology groups j HdR (X/W ) have no torsion for all j . Let 0 < r < p. When r = d, we assume that Y is of Hodge-Witt type (cf. Remark 6.11; more generally, we only need to assume that d+r−1 the de Rham cohomology group HdR (X/W ) is of Hodge-Witt type as a filtered Dieudonné module—cf. Definition 6.10). Let Gc be the p-divisible group over W , which is characterized by the condition in Remark 1.1 below. Then there exists a surjective homomorphism (functorial in A) M (A) −→ Gc (A) Hˆ d r,X for any object A of Ꮽ0 . d+r−1 Remark 1.1. We regard the de Rham cohomology group HdR (X/W ) as a filtered Dieudonné module (cf. remark after Definition 5.1). Let N be the level [r −1, d+r−1 r]-part of HdR (X/W ) (cf. Proposition 6.12 for the definition). Its translation N[r − 1] is a filtered Dieudonné module of level [0, 1], and so it corresponds to a p-divisible group G via the Dieudonné functor (cf. (15)). Then Gc is the connected component of G. The tangent space Lie Gc of Gc is isomorphic to H d (X, "r−1 X/W ). One should regard the homomorphism in Theorem A for r = d as the formal completion of the Albanese map
A0 (V ) −→ AlbV (K),
FORMAL CHOW GROUPS
361
deg
where A0 (V ) = ker(CHd (V ) −→ Z), and AlbV (K) is the group of K-rational points of the Albanese variety AlbV of V . M ) is to use the syntomic cohomology theory of Our method of studying Hˆ q (r,X Fontaine-Messing. Let Ꮽsyn be the full subcategory of Ꮽ consisting of all objects A such that A/pA is syntomic over k. In Section 2, for any object A of Ꮽsyn we define a group H q ((XA , Y ), (r)), which we call the relative syntomic cohomology group. This is a slight modification of the syntomic cohomology group of Fontaine-Messing. We need to restrict the category Ꮽ to the smaller one Ꮽsyn , since we cannot define the group H q ((XA , Y ), (r)) if A does not belong to Ꮽsyn . We further restrict the category Ꮽsyn to the much smaller one Ꮽ0 , for a technical reason (since Ꮽsyn contains rings that the author could not control). We remark that the following rings belong to Ꮽ0 :
• OK /p n OK where n is any positive integer and OK is the integer ring of a totally ramified finite extension of K; (1) • the affine ring of a connected finite flat group scheme over W . In Section 3, we make some preliminary calculation about rings in Ꮽ0 . In the following theorem, which is proven in Section 4, we compare the Kcohomological functor with the relative syntomic cohomology group. Theorem B (Corollary 4.6). Let X be a quasi-projective smooth scheme over W of relative dimension d. Let 0 < r < p. For any object A of Ꮽ0 , there is a canonical surjective homomorphism (functorial in A) M Hˆ d r,X (A) −→ H d+r (XA , Y ), (r) . In the appendix, we show that the kernel of this map is essentially zero, in the sense of Stienstra (see Definition A.1, taken from [18, p. 2]). However, we do not know whether this map is in fact an isomorphism or not (cf. Conjecture 4.2, Remark 4.7). M ) for q = d. We also do not know anything about Hˆ q (r,X Our next objective is to analyze the syntomic cohomology groups. In Section 5, we construct a Leray spectral sequence for the syntomic cohomology groups. The E2 -terms are given by the relative syntomic cohomology groups with coefficients (cf. Definition 5.2), which are written as H i ((A, k), (M, r)), where M is a filtered Dieudonné module (cf. Definition 5.1) and A is an object of Ꮽsyn . The de Rham j cohomology group HdR (X/W ) has a natural structure of a filtered Dieudonné module (cf. remark after Definition 5.1). Theorem C (Corollary 5.5). Let X be a projective smooth scheme over W of relative dimension d. Assume that d < p and that the de Rham cohomology groups j HdR (X/W ) have no torsion for all j . Let 0 < r < p. Then, for any object A of Ꮽsyn there exists a spectral sequence
362
TAKAO YAMAZAKI
j i,j E2 = H i (A, k), HdR (X/W ), r ⇒ H i+j (XA , Y ), (r) . This spectral sequence induces a filtration on H q ((XA , Y ), (r)). We find an analogous filtration in Stienstra [18, p. 5]. We might wonder whether this filtration corresponds to the “motivic filtration” of Deligne-Beilinson. Roughly speaking, the following theorem says that the top-graded quotient is prorepresentable by a pdivisible group. Theorem D (Corollary 6.8). Assume p ≥ 3. Let M be a filtered Dieudonné module corresponding to a p-divisible group G via the Dieudonné functor (15). Let A be an object of Ꮽ0 . Then there exists a canonical isomorphism ∼ = Gc (A) −−→ H 1 (A, k), (M, 1) , where Gc is the connected component of G. This theorem is proven in Section 6. Taking care of a switch of the twist, we show that those three theorems imply Theorem A in Section 7. In the appendix, we consider the injectivity of the map in Theorem B. Such a problem might be considered in the equal characteristic situation, which was also proposed by Bloch. He obtained a result in the case of characteristic zero (cf. [2], [3], [4, Chapter 6]). Stienstra got a very general result in the positive characteristic case, by using his theory of the formal de Rham-Witt complex (cf. [18], [17], [4, Chapter 6]). Though we do not use their results, we might see some analogy with those works, as mentioned above. According to Stienstra’s theory [18], it would be interesting to interpret our theory in the context of a generalization of the Dieudonné theory, in which Theorem D plays a basic role. Perhaps further study of the syntomic cohomology groups (like H 2 ((A, k), (M, 2))) will be a guide for a study of a deeper part of the Chow groups (like the albanese kernel). For this point, see Proposition 7.1. Acknowledgements. This research is a subject of my Ph.D. thesis, which is directed by Kazuya Kato. Without his advice, this paper would not exist. I would like to express my deep gratitude to him. I am also grateful to Kanetomo Sato, Jinya Nakamura, and especially Jan Stienstra for a lot of fruitful discussions. In particular, the context of the appendix arose from discussion with Stienstra. I heartily thank Thomas Geisser, who carefully read the earlier version of this manuscript and gave a lot of comments which improved this paper very much. I acknowledge once again the great influence of profound works of Spencer Bloch and Jan Stienstra. Conventions 1.2. For a ∈ Q, write [a] = inf{n ∈ Z | n ≥ a}. For an abelian group A and an integer n, we write n A = {a ∈ A | na = 0}. Let k denote a fixed perfect field of characteristic p > 0, W = W (k) the ring of Witt vectors of k, and σ the frobenius on W . For a ring or a scheme S, Sn denotes S ⊗Z Z/p n Z and "1S denotes the absolute
363
FORMAL CHOW GROUPS
differential module "1S/Z . If S is a ring, Sˆ denotes the ring lim Sn . If S is a scheme, ← − Sˆ denotes the formal scheme lim Sn . − → A PD-envelope of a W -algebra (or a W -scheme) is the one with respect to the canonical divided power structure on pW . If D is a PD-algebra (or a PD-scheme) with PD-ideal (sheaf) JD , JD[r] denotes the rth divided power ideal (sheaf). For r ≤ 0, we use the convention JD[r] = D (or ᏻD ). All sheaves and cohomology groups are taken in the Zariski site, unless stated otherwise. 2. Syntomic complex. In this section, we define a relative syntomic complex for a W -scheme. This is a slight modification of the theory of Fontaine-Messing and Kato. Our main references are [11] and [12]. Let U be a W -scheme such that p N ᏻU = 0 for some N > 0. We assume that U1 is syntomic over k, and that there exists a W -immersion U +→ Z with Z a smooth W -scheme endowed with a Frobenius φ : Z → Z, that is, a morphism of schemes such that φ ⊗ Z/pZ : Z1 → Z1 is the absolute Frobenius and such that the diagram φ
Z Spec W
σ
/Z / Spec W
commutes (cf. [11, I.1.2]). Let D = DU (Z) be the PD-envelope with PD-ideal sheaf JD , and let Dˆ = lim Dn be its p-adic completion endowed with the induced PD− → structure on JDˆ (cf. Conventions 1.2). Note that the PD-envelope DUn (Zn ) is canonically isomorphic to Dn (= D ⊗ Z/pn Z), whose PD-ideal sheaf JDn is the image of JD (cf. [1, 3.33]). We also remark that the underlying topological spaces of Dˆ and U are the same. ˆ q be the continuous differential module lim "q . For r ≥ 0 and n > 0, we Let " Z ← − Zn denote the complexes of sheaves on UZar (cf. [1, 7.23]) d
d
d
ˆ 1Z −→ J [r−2] ⊗ᏻ " ˆ 2Z −→ · · · J [r] −→ J [r−1] ⊗ᏻZˆ " ˆ ˆ ˆ Zˆ D
D
D
and d
d
d
⊗ᏻZn "1Zn −→ JD[r−2] JD[r]n −→ JD[r−1] ⊗ᏻZn "2Zn −→ · · · n n [r] by J[r] U,Z and by Jn,U,Z , respectively.
Lemma 2.1. (i) For 0 ≤ r < p and n > 0, we have r [0] φ J[r] n,U,Z ⊂ p Jn,U,Z . (ii) The following sequence is exact for any n, m > 0: pm
pn
[0] [0] [0] J[0] m+n,U,Z −−→ Jm+n,U,Z −−→ Jm+n,U,Z −−→ Jn,U,Z −→ 0. can
364
TAKAO YAMAZAKI
Proof. Since D ∼ = DU1 (Z), the proof of [11, I.1.3] works in this situation as well. Because of the lemma, we can define [0] φr = p −r φ : J[r] n+r,U,Z −→ Jn,U,Z
and [0] φr : J[r] U,Z −→ JU,Z
as the inverse limit. We remark that an analogous sequence of (ii) for J[r] n,U,Z with
[0] r > 0 is not exact, and hence we cannot define φr : J[r] n,U,Z → Jn,U,Z .
Definition 2.2. Let 0 ≤ r < p. We define a complex (r)U,Z of Zariski sheaves on U to be the mapping fiber of [0] 1 − φr : J[r] U,Z −→ JU,Z .
More precisely, the degree-i part of (r)U,Z is [r−i] ˆ iZ ⊕ ᏻ ˆ ⊗ᏻ " ˆ i−1 , Jˆ ⊗ᏻZˆ " Z D Zˆ D
and the boundary map is given by (x, y) −→ dx, (1 − φr )(x) − dy ˆ i , y ∈ ᏻ ˆ ⊗" ˆ i−1 ). We denote by H q (U, (r)) the qth hypercoho(x ∈ J [r−i] ⊗" Z Z D Dˆ mology group of the complex (r)U,Z . To justify the notation, we remark that (r)U,Z is independent of the choice of Z in the derived category D(UZar ). This can be shown by the same way as in [11, p. 212] (cf. also [12, p. 412]), by using [1, 7.24]. For the same reason, we write the cohomology sheaf of the complex (r)U,Z as Ᏼq ((r)U ), instead of Ᏼq ((r)U,Z ). We define the symbol map KrM (U, OU ) −→ H r U, (r) in completely the same way as in [11, I, Section 3] (also cf. [12, pp. 412–413]). For the sake of completeness, we copy it here. Let r and r be nonnegative integers satisfying r + r < p. We define the product structure (r)U,Z × (r )U,Z −→ (r + r )U,Z
by (x, y)(x , y ) = xx , (−1)i xy − yφ(x ) ,
FORMAL CHOW GROUPS
365
ˆ i )⊕(ᏻ ˆ ⊗ ⊗" where (x, y) and (x , y ) are local sections of the degree-i part (J [r−i] Z D Dˆ ] [r−i ˆ i )⊕(ᏻ ˆ ⊗ " ˆ i −1 ), respectively. This product ˆ i−1 ) and the degree-i part (J ⊗" " Z
structure induces a product
Dˆ
Z
Z
D
H i U, (r) ⊗ H i U, (r ) −→ H i+i U, (r + r ) , which satisfies an associative law. Denote the immersion U → Z by i. Let Ꮿ be the complex ker i −1 ᏻ∗Z −→ ᏻ∗U −→ i −1 ᏻ∗Z , which is canonically quasi-isomorphic to ᏻ∗U [−1]. We define a morphism of complexes Ꮿ → (1)U,Z as follows: The degree-zero part ker(i −1 ᏻ∗Z → ᏻ∗U ) → JDˆ is defined by x −→ log(x) ¯ =
∞ (−1)i (i − 1)!(x¯ − 1)[i] , i=1
where x¯ is the image of x in ᏻDˆ and [ ] denotes the PD-structure. (Note that x¯ − 1 is ˆ 1 ) ⊕ ᏻ ˆ is defined by in JDˆ .) The degree-one part i −1 ᏻ∗Z → (ᏻDˆ ⊗ " Z D ¯ x¯ p . x −→ 1 ⊗ dx/x, p −1 log φ(x)/ (Note that φ(x)/ ¯ x¯ p − 1 is in p ᏻDˆ and that log(φ(x)/ ¯ x¯ p ) is in p ᏻDˆ .) This morphism of complexes induces a canonical homomorphism U, ᏻ∗U = H 1 (U, Ꮿ) −→ H 1 U, (1) . Hence, by the product structure we obtain a homomorphism ⊗r U, ᏻ∗U −→ H r U, (r) for r < p. It is shown in [11, I, 3.2] that this homomorphism factors through KrM ( (U, ᏻ∗U )). By sheafification, we get a morphism of sheaves on UZar ,
M r,U −→ Ᏼr (r)U ,
which we call the symbol map. Let X be a quasi-projective smooth W -scheme and A an object of Ꮽsyn . Since we assumed X to be quasi-projective, we can find a smooth W -scheme Z endowed with a Frobenius φ and an immersion XA +→ Z. Hence we can apply the above definitions to the scheme XA . Let Y = Xk . Note that we have not only (r)XA ,Z but also (r)Y,Z by considering the composition Y +→ XA +→ Z.
366
TAKAO YAMAZAKI
Definition 2.3. Let 0 ≤ r < p. We define the complex (r)(XA ,Y ),Z to be the mapping fiber of the natural map (r)XA ,Z −→ (r)Y,Z .
We denote by H q ((XA , Y ), (r)) and Ᏼq ((r)(XA ,Y ) ) the qth hypercohomology group and the qth cohomology sheaf of the complex (r)(XA ,Y ),Z , respectively. Again, these hypercohomology groups and cohomology sheaves are independent of the choice of Z and φ. We remark that the map (r)XA ,Z → (r)Y,Z is injective. Hence, if we write the cokernel of this map by Ꮿ, (r)(X,Y ),Z is quasi-isomorphic to Ꮿ[−1]. But this mapping fiber definition is useful in Section 5. The following lemma, which can be seen easily, may be helpful in considering this complex. Lemma 2.4. Let Ꮿ1
fC
g1
Ᏸ1
/ Ꮿ2 g2
fD
/ Ᏸ2
be a commutative diagram of complexes. Writing the mapping fiber of a morphism h of complexes by Ᏺ(h), there are two morphisms of complexes: f = (fC , fD ) : Ᏺ(g1 ) −→ Ᏺ(g2 ), g = (g1 , g2 ) : Ᏺ(fC ) −→ Ᏺ(fD ). Then we have a canonical isomorphism Ᏺ(f ) ∼ = Ᏺ(g).
Consider the diagram 0
/ M r,XA ,Y
0
/ Ᏼr (r)(X ,Y ) A
α
/ M r,XA
/ M r,Y
/ Ᏼr (r)X A
/ Ᏼr (r)Y ,
/0
where the middle and the right vertical arrows are the symbol maps. The upper row is exact by definition, and the right square is commutative. Lemma 2.5 shows the lower row is also exact. Hence we can define the symbol map M r,X −→ Ᏼr (r)(XA ,Y ) A ,Y by the unique map α that makes the left square commutative.
367
FORMAL CHOW GROUPS
Lemma 2.5. The complex (r)Y,Z is acyclic outside [r, ∞). Define n (r)Y,Z by the exact sequence pn
0 −→ (r)Y,Z −−→ (r)Y,Z −→ n (r)Y,Z −→ 0. It is enough to show that n (r)Y,Z is acyclic outside [r, ∞). We prove a more precise lemma as follows. Lemma 2.6. Let 3 : Yet → YZar be the natural morphism of sites. Then n (r)Y,Z is quasi-isomorphic to R3∗ Wn "rY,log [−r]. Proof. By a standard induction argument, this is reduced to the case n = 1. In the étale site, "rY,log [−r] is quasi-isomorphic to the complex deg(r)
"rY
deg(r+1) 1−F
(2)
/ "r /d"r−1 , Y Y
where F is the inverse Cartier operator. Since each component of complexes 1 (r)Y,Z and (2) is a coherent sheaf, it suffices to prove that 1 (r)Y,Z is quasi-isomorphic to the complex (2) for the étale site. Then, we can assume that X itself has a Frobenius φ and that Z = X. By definition, 1 (r)Y,X is the total complex associated to the double complex p r ᏻX /p r+1 ᏻX
/ p r−1 "1 /p r "1 X X
−φr
/ ···
/ "r Y
/ ···
/ "r Y
−φr
1−F
/ "1 Y
ᏻY
/ "r+1 Y
/ ···
1
/ "r+1 Y
/ ··· .
If 0 ≤ q < r, we have a commutative diagram q
q
p r−q "X /p r−q+1 "X
−φr
/ "q Y
∼ =
q "Y
−F
/ "q . Y q
q
d
q+1
The lower arrow is injective, and its cokernel is "Y / ker("Y − → "Y ). Using this repeatedly, we see that 1 (r)Y,X is quasi-isomorphic to the total complex associated to 0
/ ···
/0
/ "r Y
/ ···
/0
/ "r /d"r−1 Y Y
1−F
0
From this, the assertion follows.
/ "r+1 Y
/ ···
1
/ "r+1 Y
/ ··· .
368
TAKAO YAMAZAKI
3. Calculation of PD-algebra. In this section, we calculate PD-envelope algebras of Artinian rings. The results are used in Sections 4 and 6. In this section, except in Remark 3.5, we use the following convention: s ≥ 0; m1 , . . . , ms ≥ 1; m ≥ 2.
A = k[T1 , . . . , Ts , T ] T1m1 , . . . , Tsms , T m .
m1 ms m−1 = k[T , . . . , T , T ] T , . . . , T , T . A 1 s s 1 B = W [T1 , . . . , Ts , T ]. p φ : B −→ B; Tj −→ Tj , T −→ T p , a −→ σ (a) (a ∈ W ). (3) m m m 1 D = DB T , . . . , T s , T ⊃ JD , s 1 D = DB T1m1 , . . . , Tsms , T m−1 ⊃ JD , C = DW [T1 ,...,Ts ] T1m1 , . . . , Tsms ⊃ JC , are the PD-envelope algebras and their PD-ideals. Note that there is a natural map D +→ D induced by the obvious maps B → A → A . It is easily seen that D= CT [i,m] . i≥0
Here we use the following notation: T [i,m] denotes γa (T m )T b , where γ denotes the PD-structure a = [i/m] and b = i −am. The sub-C-module of D generated by T [i,m] is written as CT [i,m] . If i < pm, then T [i,m] = T i /[i/m]!. Noting this, we have for 0 ≤ r < p, rm−1
[r−[i/m]] i J [r] = J T ⊕ CT [i,m] . D
i=0
C
i≥rm
If we replace m by m − 1 in the right-hand side of the above two equations, we get a description of D and JD[r] . We consider groups JD[r] /JD[r] . In the case of r = 0, since CT [i,m−1] /CT [i,m] = 0 for i < p(m − 1), we have D ∼ = D
i≥p(m−1)
CT [i,m−1] . CT [i,m]
(4)
ˆ [i,m−1] in Dˆ . This is a subgroup of Let S be the (p-adic) closure of ⊕i≥(m−1)p CT for any 0 ≤ r < p. The following lemma can be shown by elementary calculation.
J [r] Dˆ
369
FORMAL CHOW GROUPS
Lemma 3.1. Let 0 ≤ r < p. For any element x of S, φr (x) again belongs to S, and j φr (x) = φr ◦ · · · ◦ φr (x) (j times) converges p-adically to zero as j → ∞. Corollary 3.2. The map 1 − φr :
J [r] ˆ
D J [r] Dˆ
−→
Dˆ Dˆ
is a surjection for any 0 ≤ r < p and is an isomorphism if r = 0. Proof. From (4), it is enough to show that, for any x ∈ S, the class of x in Dˆ /Dˆ j is in the image of 1 − φr . But the above lemma shows that y = ∞ j =0 φr (x) is a , and it is clear that (1 − φr )(y) = x. When r = 0, well-defined element of S ⊂ J [r] Dˆ this correspondence defines the inverse map of 1 − φ. [r−[i/(m−1)]]
[r−[i/m]]
Next, we consider the case of r = 1. Since (JC T [i,m−1] )/(JC if 0 ≤ i < m − 1 or if m ≤ i < p(m − 1), we can write JD ∼ CT m−1 CT [i,m−1] ⊕ . = JD CT [i,m] JC T m−1
T [i,m] ) = 0
i≥p(m−1)
We remark that the second term is the same as the right-hand side of (4). This shows that the inclusion map JD → D induces a surjection JD /JD → D /D, which has a splitting (inclusion to the second term in the above presentation) ι:
D JD −→ . D JD
Passing to the limits, we have ˆ m−1 Dˆ JDˆ ∼ CT ⊕ , = JDˆ JCˆ T m−1 Dˆ
ι:
J ˆ Dˆ +→ D . JDˆ Dˆ
Now the following lemma is seen easily. Lemma 3.3. There exists an exact sequence 0 −→
ˆ m−1 s JDˆ 1−φ1 Dˆ CT −→ 0, −−→ −−−→ JDˆ JCˆ T m−1 Dˆ
ˆ m−1 , the image of the class of x via s is the class of where, for x ∈ CT (Since φ1 (x) ∈ S, this is a well-defined element of JDˆ by Lemma 3.1.)
∞
j j =0 φ1 (x).
370
TAKAO YAMAZAKI
Remark 3.4. Assume rm ≤ p(m − 1). (If r ≤ p/2, this assumption holds for any m ≥ 2.) Then the above argument works for J [r] /J [r] as well. In fact, under this Dˆ Dˆ assumption, there exists a description [r−[i/(m−1)]] i rm−1 J T J [r] ˆ Cˆ Dˆ ∼ ⊕ D , = [r−[i/m]] i Dˆ J [r] J T i=0
Dˆ
Cˆ
and an exact sequence 0 −→
rm−1 i=0
[r−[i/(m−1)]] i T C [r−[i/m]] i Jˆ T C
Jˆ
sr
−→
J [r] ˆ
D J [r] Dˆ
1−φr
−−−→
Dˆ −→ 0, Dˆ
where sr is defined by the similar way. (If r = 1, s1 is nothing but s of Lemma 3.3.) The following remark is used in Sections 4 and 6. Remark 3.5. Let A be an object of Ꮽsyn such that pN+1 A = 0. Take a surjective W homomorphism B → A where B is a smooth W -algebra endowed with a Frobenius φ. Let A = A/p N A. Let D = DB (ker(B → A)) be the PD-envelope algebra. The PD-ideal JD of D contains pN+1 . We see that DB (ker(B → A )) ∼ = D by ignoring the PD-structure, and the PD-ideal JD of this PD-envelope algebra is JD + p N D. 4. Surjectivity of symbol map. In this section, let X be a smooth quasi-projective scheme over W . The aim of this section is to prove the following theorem and its Corollary 4.6. Theorem 4.1. Let A be an object of Ꮽ0 and 0 < r < p. (i) For any q > r, we have Ᏼq (r)(XA ,Y ) = 0. (ii) The symbol map
M r,X −→ Ᏼr (r)(XA ,Y ) A ,Y
is a surjection. In the appendix, we consider the kernel of the symbol map in (ii). Here we propose an optimistic conjecture, or, perhaps, a hope, as follows. Conjecture 4.2. Let 0 < r < p. For any object A of Ꮽsyn , (i) Ᏼq ((r)(XA ,Y ) ) = 0 for any q > r; M (ii) the symbol map r,X → Ᏼr ((r)(XA ,Y ) ) is an isomorphism. A ,Y Instead of proving Theorem 4.1, we prove a slightly more general proposition.
371
FORMAL CHOW GROUPS
Proposition 4.3. Let 0 < r < p. (i) If A is an object of Ꮽsyn , then we have Ᏼq ((r)XA ) = 0 for q ≥ r + 2. (ii) If A is an object of Ꮽ0 , then Ᏼr+1 (r)XA −→ Ᏼr+1 (r)Y is an isomorphism, and the sequence M r,X −→ Ᏼr (r)XA −→ Ᏼr (r)Y −→ 0 A ,Y is exact, where the first map is the symbol map. For a smooth quasi-projective scheme X over W , Take a closed immersion X → Z over W such that Z is smooth over W and endowed with a Frobenius φ. Let T = DX (Z) be the PD-envelope with PD-ideal sheaf JT .
(5)
For an object A of Ꮽsyn , Take a surjective W -homomorphism B → A such that B is smooth over W and endowed with a Frobenius φ. Let D = DB (ker(B → A)) be the PD-envelope with PD-ideal JD .
(6)
Then the fiber product XA → Z ⊗W B = ZB is a closed W -immersion and ZB is smooth over W endowed with the Frobenius φ ⊗ φ. It can be seen that DXA (ZB ) ∼ = T ⊗W D with PD-ideal JT ⊗ D + ᏻT ⊗ JD , by using the universality of PD-envelope algebras and the flatness of D and X over W . Hence, to calculate the cohomology sheaves Ᏼq ((r)XA ), we can use [r]
[0]
1−φr
(r)XA ,ZB = mapping fiber of JXA ,ZB −−−→ JXA ,ZB , [0] where J[r] XA ,ZB and JXA ,ZB is the complex
ˆ 1Z −→ J [r−2] ⊗ᏻ " ˆ 2Z −→ · · · −→ JT[r−1] ⊗ᏻZˆ " JT[r]⊗D ˆ ˆ ˆ B B Zˆ ⊗D T ⊗D B
B
and ˆ 1Z −→ (ᏻT ⊗D) ˆ 2Z −→ · · · , ˆ −→ (ᏻT ⊗D) ˆ ˆ ᏻT ⊗D ⊗ᏻZˆ " ⊗ᏻZˆ " B B B
B
respectively. If q ≥ r + 1, q
q
ˆ ˆ ˆ ˆ 1 − φr : (ᏻT ⊗D) ⊗" ZB −→ (ᏻT ⊗D) ⊗ "ZB is an isomorphism, because it has the inverse map implies Proposition 4.3(i).
j j ≥0 φr
(note φr = p q−r φq ). This
372
TAKAO YAMAZAKI
To prove Proposition 4.3(ii), we first work under the assumptions and notation of (3), and we consider Ᏼq ((r)XA ) → Ᏼq ((r)XA ). Define a complex Ꮿ by exactness of 0 −→ (r)XA ,ZB −→ (r)XA ,ZB −→ Ꮿ −→ 0.
(7)
This complex Ꮿ is equal to the mapping fiber of J[r] X ,ZB A
J[r] XA ,ZB
1−φr
−−−→
J[0] X ,ZB A
,
J[0] XA ,ZB
[r] [0] [0] where J[r] X ,ZB /JXA ,ZB and JX ,ZB /JXA ,ZB are the complexes A
A
deg(r−2)
/
···
deg(r−1)
ˆ r−2 JT[2]⊗D ⊗" ZB ˆ ˆ r−2 JT[2]⊗D ⊗" ZB ˆ
/
deg(r)
ˆ r−1 JT[1]⊗D ⊗" ZB ˆ ˆ r−1 JT[1]⊗D ⊗" ZB ˆ
/
ˆr ˆ ) ⊗ " (OT ⊗D ZB ˆr ˆ (OT ⊗D) ⊗" ZB
/ ···
and ···
/
ˆ r−2 ˆ ) ⊗ " (OT ⊗D ZB ˆ r−2 ˆ (OT ⊗D) ⊗" ZB
/
ˆ r−1 ˆ ) ⊗ " (OT ⊗D ZB ˆ r−1 ˆ (OT ⊗D) ⊗" ZB
/
ˆr ˆ ) ⊗ " (OT ⊗D ZB ˆr ˆ (OT ⊗D) ⊗" ZB
/ ··· ,
respectively. We denote by αr,q the map [r−q] ˆq JT ⊗D ⊗" ZB ˆ
[r−q] ˆq JT ⊗D ⊗" ZB ˆ
1−φr
−−−→
q
ˆ ˆ ) ⊗ " (OT ⊗D ZB ˆq ˆ (OT ⊗D) ⊗" ZB
.
By using Corollary 3.2, we see that αr,q is a surjection for any q and is an isomorphism if q ≥ r, so that Ᏼq (Ꮿ) = 0 if q ≥ r. By (7), this implies that Ᏼq (r)XA −→ Ᏼq (r)XA is an isomorphism if q ≥ r + 1 and is a surjection if q = r. By using Lemma 3.3, one gets isomorphisms
ˆ r−1 ∼ ᏻX ⊗ (C/JC ) ⊗ " ZB =
m−1 ˆ ᏻT ⊗CT
ˆ r−1 ⊗" ZB m−1 ˆ ˆ JT ⊗C + ᏻT ⊗JC T
∼ =
−→ ker(αr,r−1 ),
373
FORMAL CHOW GROUPS
j where the last map “ ∞ j =0 φ1 ” is defined by the similar way as in Lemma 3.3. By (7), we get a surjection ˆ r−1 −→ ker Ᏼr (r)XA −→ Ᏼr (r)X . ᏻX ⊗ (C/JC ) ⊗ " (8) ZB A By induction, for A as in (3), the proof of Proposition 4.3(ii) is completed by the following. Lemma 4.4. Let a ∈ ᏻX ⊗(C/JC ) and b1 , . . . , br−1 ∈ (ᏻX ⊗A)∗ be local sections. The image of {1 + aT m−1 , b1 , . . . , br−1 } under the symbol map coincides with the image of a ⊗ d b˜1 /b˜1 ∧ · · · ∧ d b˜r−1 /b˜r−1 under (8), where b˜i is a local lift of bi to OZ∗ B (i = 1, . . . , r − 1). Proof. Once we see the lemma for r = 1, the case r > 1 follows immediately from ˆ of the definition of the symbol and that of (8). Assume r = 1. Take a lift c ∈ ᏻT ⊗C j m−1 ) is well defined in ᏻT ⊗D a. Then a section ξ = j ≥0 φ1 (cT ˆ (cf. Lemma 3.1). The section ˆ1 dξ, (1 − φ1 )(ξ ) ∈ ᏻT ⊗D ˆ ⊗ "ZB ⊕ ᏻT ⊗D ˆ ˆ1 is in (ᏻT ⊗D its cohomology class is the image of aT m−1 under ˆ ⊗ "ZB ) ⊕ ᏻT ⊗D ˆ , and (8). Once we see the fact that n≥0 ξ n /n! is a well-defined section of ᏻT ⊗D ˆ , we complete the proof of the lemma, by taking this as a lift of 1 + aT m−1 . To see this fact, we first note that n 1 j j cp φ1 T m−1 n! n≥0
j ≥0
is a well-defined section of ᏻT ⊗D ˆ . (This is a consequence of the fact that the Artin j Hasse exponential n≥0 (1/n!)( j ≥0 X p /p j )n is in Z(p) [[X]].) By the way, it can be seen that j j j cp φ1 T m−1 − φ1 cT m−1 j ≥0
is in p ᏻT ⊗D ˆ . Then the fact follows. Next, we work under the assumptions and notation as in Remark 3.5. Again, we ∼ define a complex Ꮿ by exactness of the sequence (7). We remark that J[0] XA ,ZB = J[0] X ,ZB . Then it turns out that Ꮿ is equal to A
J [r] ˆ T ⊗D JT[r]⊗D ˆ
−→
J [r−1] ˆ T ⊗D
ˆ 1Z ⊗" B JT[r−1] ˆ ⊗D
−→
J [r−2] ˆ T ⊗D JT[r−2] ˆ ⊗D
ˆ 2Z −→ · · · . ⊗" B
374
TAKAO YAMAZAKI
Hence, we get a surjection ˆ r−1 ᏻX ⊗ p N A ⊗ " ZB
J ˆ ∼ T ⊗D ˆ r−1 −→ ker Ᏼr (r)XA −→ Ᏼr (r)X . ⊗" = ZB A JT ⊗D ˆ
(9)
Lemma 4.5. Let a ∈ ᏻX ⊗(C/JC ) and b1 , . . . , br−1 ∈ (ᏻX ⊗A)∗ be local sections. The image of {1+p N a, b1 , . . . , br−1 } under the symbol map coincides with the image of pN a ⊗ d b˜1 /b˜1 ∧ · · · ∧ d b˜r−1 /b˜r−1 under (9), where b˜i is a local lift of bi to OZ∗ B (i = 1, . . . , r − 1). The proof is similar to and easier than that of Lemma 4.4. By induction, this lemma completes the proof of Proposition 4.3. Corollary 4.6. Let d be the relative dimension of X over W . Let 0 < r < p. For any object A of Ꮽ0 , there is a canonical surjective homomorphism M Hˆ d r,X (A) −→ H d+r (XA , Y ), (r) . Proof. Consider the spectral sequence i,j E2 = H i Y, Ᏼj (r)(XA ,Y ) ⇒ H i+j (XA , Y ), (r) . i,j
We have E2 = 0 if j > r, by Theorem 4.1(i)—or if i > d, since the Zariski cohomological dimension of Y is d. Hence we have H d+r ((XA , Y ), (r)) = E2d,r . Since H d (Y, −) is a right exact functor, the corollary follows from Theorem 4.1(ii). Remark 4.7. The above proof shows that, if Conjecture 4.2 is true, the map in Corollary 4.6 is an isomorphism for any object A of Ꮽsyn . 5. The Leray spectral sequence. In this section, let X be a smooth projective scheme over W . This section is devoted to constructing a spectral sequence of Leray type for the relative syntomic cohomology groups H q ((XA , Y ), (r)). The following definition is taken from Wintenberger [19]. Definition 5.1. A filtered Dieudonné module over W of finite type is a W -module M endowed with (i) a decreasing filtration (M i )i∈Z such that M i = M(i % 0), M i = 0(i & 0); (ii) a family of σ -linear homomorphism (φi : M i → M)i∈Z such that φi |M i+1 = pφi+1 for all i ∈ Z; satisfying the conditions (iii) M i is of finite type over W for all i ∈ Z; (iv) M i is a direct summand of M as a W -module for all i ∈ Z; (v) M = i∈Z φi (M i ).
375
FORMAL CHOW GROUPS
We denote by MFW,tf the category of filtered Dieudonné modules of finite type over W . This is an abelian category (cf. [19]). The following fact, due to Fontaine [8] and Kato [11], plays an essential role in this section: If X is a smooth projective scheme over W of relative dimension d < p, then q ˆ " ˆ · ) has the structure of the de Rham cohomology group M = HdR (X/W ) ∼ = Hq (X, X an object of MFW,tf , defined as follows. The filtration (M i )i∈Z is the Hodge filtration, ˆ " ˆ " ˆ " ˆ ·≥i ). (Note that the natural map Hq (X, ˆ ·≥i ) → Hq (X, ˆ· ) that is, M i = Hq (X, X X X is injective; cf. [11, 2.5].) Let Z, φ be as in Section 4 (5). The homomorphism φi : M i → M(i < p) is defined as the composition φi ˆ " ˆ J[i] −− ˆ J[0] ∼ ˆ ·≥i ∼ M i = Hq X, → Hq X, = Hq X, X X,Z X,Z = M. Note that M p = 0. Let M be an object of MFW,tf . Assume that M is torsion-free. Let A be an object of Ꮽsyn and B, φ, D as in Section 4 (6). For 0 ≤ r < p, we define the syntomic complex (M, r)A,B and the relative syntomic complex (M, r)(A,k),B with coefficients in M. First we define for r ≥ 0 J (M)[r] A,B =
r i=0
ˆ M i ⊗W J [r−i] ⊂ M ⊗W D. ˆ D
We remark that this group is the inverse limit of the rth filtration subgroups of the sections of the crystals associated to M (on the big crystalline site of Spec(W )) at the PD-thicking (Spec(A), Spec(Dn )). We have a connection [r−1] ˆ1 ∇ = 1 ⊗ d : J (M)[r] A,B −→ J (M)A,B ⊗Bˆ "B
and its expansion ∇ [r−q] [r−q−1] ˆ q −− ˆ q+1 ⊗Bˆ " J (M)A,B ⊗Bˆ " B → J (M)A,B B
defined in the usual way. We denote the complex of abelian groups ∇
∇
∇
ˆ 1 −→ J (M)[r−2] ⊗ ˆ " ˆ 2 −→ · · · −→ J (M)[r−1] J (M)[r] A,B − A,B ⊗Bˆ "B − A,B B B − by J(M)[r] A,B . Since M is torsion-free, we can define for 0 ≤ r < p [0] φr = p −r φ : J(M)[r] A,B −→ J(M)A,B ,
whose map on the degree-q component is the unique map that coincides with φi ⊗ [r−q−i] ˆ q for any i. φr−q−i ⊗ φq on M i ⊗ J ˆ ⊗" B D
376
TAKAO YAMAZAKI
Definition 5.2. Let 0 ≤ r < p. We define the complex (M, r)A,B of abelian groups to be the mapping fiber of the map [0] 1 − φr : J(M)[r] A,B −→ J(M)A,B .
We denote by H q (A, (M, r)) the qth cohomology group of (M, r)A,B . We also define the complex (M, r)(A,k),B as the mapping fiber of (M, r)A,B −→ (M, r)k,B ,
where the latter complex is defined by using the composition B → A → k. We denote by H q ((A, k), (M, r)) the qth cohomology group of (M, r)(A,k),B . Again, those cohomology groups are independent of the choice of B, which can be seen by the same method as in Section 2 and [1, 7.24]. First we study the nonrelative syntomic cohomology groups. Theorem 5.3. Let X be a smooth projective scheme over W of relative dimension d. Let A be an object of Ꮽsyn . Assume d < p and the de Rham cohomology group q HdR (X/W ) is torsion-free for any q. Then, for 0 ≤ r < p, there exists a spectral sequence j i,j E2 = H i A, HdR (X/W ), r ⇒ H i+j XA , (r) . Let π : (XA )Zar → (Spec A)Zar be the map of sites induced by the structure morphism. Let Ꮿ = F 0 Ꮿ ⊃ F 1 Ꮿ ⊃ F 2 Ꮿ . . . be a finite filtration (by subcomplexes) of a complex Ꮿ of Zariski sheaves on XA , whose components in negative degrees are ·,j trivial. The following fact, due to Deligne, is explained in [13, 3.3]: Let E1 (Ꮿ) be the complex of E1 -terms of the spectral sequence of a finitely filtered complex Ꮿ: i,j
E1 (Ꮿ) = Ri+j π∗ (gr i Ꮿ) ⇒ Ri+j π∗ (Ꮿ), where gr i Ꮿ = F i Ꮿ/F i+1 Ꮿ. We remark that the boundary map of this complex is induced by the connecting homomorphism of 0 −→ gr i+1 Ꮿ −→ F i Ꮿ/F i+2 Ꮿ −→ gr i Ꮿ −→ 0.
(10)
Then there exists (another) spectral sequence ·,j i,j E2 = Ri E1 (Ꮿ) ⇒ Ri+j ◦ R0 π∗ Ꮿ, where = (Spec A, −). Let Z, T , B, D, and φ be as in Section 4, (5) and (6). We define a filtration ·,j j i F (r)XA ,ZB and show (E1 ((r)XA ,ZB )) = (HdR (X/W ), r)A,B . We also show
377
FORMAL CHOW GROUPS ·,j
that each component of E1 ((r)XA ,ZB ) is a quasi-coherent sheaf. Then, the above fact implies the theorem. We start with an exact sequence ˆ 1Z −→ "1 ⊗W Bˆ −→ 0. ˆ 1B −→ " 0 −→ ᏻZˆ ⊗W " B Zˆ [r−q] ˆ q as ⊗ᏻZˆ " Using this, we define a filtration F i (i ≥ 0) on JT ⊗D ZB ˆ
B
[r−q] ˆq ⊗ᏻZˆ " F i JT ⊗D ZB ˆ B [r−q] ˆ q−i ⊗ᏻ ˆ iB −→ J [r−q] ⊗ᏻ " ˆq . = Im JT ⊗D ⊗ " " ᏻ ⊗ ᏻ W ˆ ZB ZB Z ˆ ˆ Zˆ Zˆ Zˆ T ⊗D B
B
B
J[r] XA ,ZB
is defined to be the complex The filtration F i (i ≥ 0) on ∇ ∇ ∇ [r−1] [r−2] i 1 i 2 ˆ ˆ " " − − → F ⊗ − − → F ⊗ −→ · · · . J J F i JT[r]⊗D ᏻZˆ ᏻZˆ ZB ZB − ˆ ˆ ˆ T ⊗D T ⊗D B
Some linear algebra shows that deg(i−1)
B
gr i J[r] XA ,ZB
is isomorphic to the complex
deg(i)
· · · −→ 0 −→ JT[r−i] ⊗ᏻZˆ ˆ ⊗D
B
deg(i+1)
[r−i−1] ⊗ᏻZˆ ˆ T ⊗D B
ˆ i −→ J ᏻZˆ ⊗W " B
ˆ 1 ⊗W " ˆ i −→ · · · . " Z B
i [0] Let 0 ≤ r < p. We remark that φr (F i J[r] XA ,ZB ) ⊂ F JXA ,ZB . We now define for i ≥ 0 a filtration F i (r)XA ,ZB to be the mapping fiber of i−1 [0] F i J[r] JXA ,ZB . XA ,ZB −−−→ F 1−φr
[0] (We use the convention F −1 J[0] XA ,ZB = JXA ,ZB .) The above remark shows that 0 [r] [0] gr i (r)XA ,ZB ∼ = the mapping fiber of gr i JXA ,ZB −→ gr i−1 JXA ,ZB [r] [0] ∼ = gr i JXA ,ZB ⊕ gr i−1 JXA ,ZB [−1]. ·,j
Hence, the degree-i component of the complex E1 ((r)XA ,ZB ) is [r] [0] Ri+j π∗ gr i (r)XA ,ZB ∼ = Ri+j π∗ gr i JXA ,ZB ⊕ Ri+j −1 π∗ gr i−1 JXA ,ZB . The following lemma shows that this is a quasi-coherent sheaf and that the global j section of this sheaf is equal to the degree-i component of (HdR (X/W ), r)A,B . Lemma 5.4. We have an isomorphism of sheaves on Spec(A), j [r−i] ∼ ˆi Ri+j π∗ gr i J[r] XA ,ZB = J HdR (X/W ) A,B ⊗Bˆ "B .
378
TAKAO YAMAZAKI
Proof. Since this lemma is independent of φ, we can assume Z = T = X. We remark that ·−i ∼ i+j π∗ " ˆ ⊗W J [r−·] ⊗ ˆ " ˆi Ri+j π∗ gr i J[r] X XA ,XB = R B B ˆ D
∼ ˆ ·X ⊗W J [r−i−·] ⊗ ˆ " ˆi = Rj π∗ " B B . Dˆ
First we deal with the case r = 0. In this case, we have · · ˆ ˆ X ⊗W Dˆ ⊗ ˆ " ˆ ˆi ∼ j ˆi R j π∗ " B B = R π∗ "X ⊗W D ⊗Bˆ "B , ˆ ˆ ˆi ˆ · ⊗W Dˆ ⊗ ˆ " ˆi because the boundary map in " X B B is a D-linear map and D ⊗ "B is a ˆ free D-module. This proves the lemma in the case r = 0. Furthermore, the spectral sequence · ˆ iB ⇒ Ra+b π∗ " ˆ X ⊗W Dˆ ⊗ ˆ " ˆ aX ⊗ Dˆ ⊗ " ˆi E1a,b = Rb π∗ " (11) B B degenerates at E1 , by [11, II.2.5]. Consider the natural morphism of spectral sequences from · ˆ iB ⇒ Ra+b π∗ " ˆ X ⊗W J [r−i−·] ⊗ ˆ " ˆ aX ⊗ J [r−i−a] ⊗ " ˆi E1a,b = Rb π∗ " (12) B B ˆ ˆ D
D
to (11). We see that the maps on E1 -terms are injective. Using the fact that the spectral sequence (11) degenerates at E1 , we see that (12) also degenerates at E1 , and that the map · · ˆ X ⊗W J [r−i−·] ⊗ ˆ " ˆ X ⊗W Dˆ ⊗ ˆ " ˆ iB −→ Ra+b π∗ " ˆ iB Ra+b π∗ " Dˆ
B
B
is injective. The lemma follows. ·,j
It remains to show that the boundary maps of (E1 ((r)XA ,ZB )) coincide with j those of (HdR (X/W ), r)A,B . But this follows from the theory of Gauss-Mannin connection for crystals, except a part coming from 1 − φr , and this part can be seen directly from the definition of the filtration and (10). Next, we deduce a Leray spectral sequence for the relative syntomic cohomology groups. Corollary 5.5. Assume X, d, A, and r satisfy the hypothesis of Theorem 5.3. Let Y = Xk . Then, for 0 ≤ r < p, there exists a spectral sequence j i,j E2 = H i (A, k), HdR (X/W ), r ⇒ H i+j (XA , Y ), (r) . Proof. We use the same method as in the proof of the theorem. Let F i (r)XA ,ZB and F i (r)Y,ZB be as in the above proof. We define a subcomplex F i (r)(XA ,Y ),ZB of (r)(XA ,Y ),ZB as the mapping fiber of the natural map F i (r)XA ,ZB −→ F i−1 (r)Y,ZB .
FORMAL CHOW GROUPS
379
(We use the convention F −1 (r)Y,ZB = (r)Y,ZB .) Then gr i (r)(XA ,Y ),ZB ∼ = gr i (r)XA ,ZB ⊕ gr i−1 (r)Y,ZB , and we see that j ·,j E1 (r)(XA ,Y ),ZB = HdR (X/W ), r (A,k),B . The corollary follows. 6. Syntomic cohomology and p-divisible groups. In this section, we study the syntomic cohomology groups H q (A, (M, r)) and the relative syntomic cohomology groups H q ((A, k), (M, r)). We begin with a definition taken from [11, II.2.8]. Definition 6.1. Let M be an object of MFW,tf , and let I be a closed interval in R of the form [i, j ] or [i, ∞)(i, j ∈ Z). We say that M is of level I if M i = M for i ≤ inf(I ) and if M i+1 = 0 for i ≥ sup(I ). Proposition 6.2. Let M be a torsion-free object of MFW,tf of level [0, ∞) and let 0 ≤ r < p. (i) If q ≥ r + 2, then we have for any object A of Ꮽsyn H q A, (M, r) = 0. (ii) If q ≥ r + 1, then we have for any object A of Ꮽ0 H q (A, k), (M, r) = 0. (iii) If q ≥ 2, we have
H q k, (M, r) = 0.
In particular, if 1 ≤ r < p and if A is an object of Ꮽ0 , we have H r+1 (A, (M, r)) = 0. Proof. Let A, B, and φ be as in Section 4 (6). If q > r, the degree-q component [0] of the map 1 − φr : J(M)[r] A,B → J(M)A,B is an isomorphism. Hence (i) follows. To prove (ii), let A, A , B, and φ be as in Section 3 (3) or Remark 3.5. By the distinguished triangle (M, r)(A,k),B −→ (M, r)A,B −→ (M, r)k,B −→ (+1),
(13)
it is enough to show that
H q A, (M, r) −→ H q k, (M, r)
is an isomorphism if q = r + 1 and is a surjection if q = r. Define a complex Ꮿ by exactness of 0 −→ (M, r)A,B −→ (M, r)A ,B −→ Ꮿ −→ 0. [q]
If we put M q in place of JT in the proof of Proposition 4.3, we can prove that Ꮿ is acyclic outside [0, r − 1]. Hence (ii) follows. We also remark that the degree-(r − 1)
380
TAKAO YAMAZAKI
component of Ꮿ is isomorphic to ˆ r−1 . M/M 1 ⊗ ker(A −→ A ) ⊗ " B
(14)
For (iii), we use the obvious surjection W → k with Frobenius φ = σ . Then
ˆ 1 is trivial. (r)k,W is acyclic outside [0, 1], since " W
In the rest of this section, we study a relation between the rational points of a p-divisible group and the syntomic cohomology groups. Assume p = 2. Then we have the contravariant equivalence functor ILM of [9, 9.11], between the category of p-divisible groups over W and the full subcategory of MFW,tf consisting of objects M, which is torsion-free and is of level [0, 1]. (Strictly speaking, the subject of [9] is not p-divisible groups but finite flat group schemes. However, the proof in [9] also works for p-divisible groups, by using the corresponding result of [7, Chapter IV, 1.6] instead of [6].) But since we need a covariant theory, we define, for a p-divisible group G, the object MG of MFW,tf corresponding to G as MG = Hom ILM(G)[1], W . (15) Here we use the following notation: Hom denotes the Hom-object in MFW,tf (cf. [19, 1.7]). W is the object of MFW,tf defined as W q = W (resp., 0) if q ≤ 0 (resp., q > 0), and φi = p −i σ (i ≤ 0). For an object M of MFW,tf , M = M[q] is the translation of M, that is, an object of MFW,tf defined as M i = M i+q and φ i = φi+q . We use the 1 ∼ Lie G. fact that there is a canonical isomorphism MG /MG = Instead of the symbol map in Proposition 4.3, we have the following theorem. Theorem 6.3. Assume p = 2. Let G be a connected p-divisible group over W and MG the object of MFW,tf corresponding to G. Then we have a canonical homomorphism for any object A of Ꮽsyn G(A) −→ H 1 A, (MG , 1) . Furthermore, if A belongs to Ꮽ0 , then this is an isomorphism and H q (A, (MG , 1)) = 0 unless q = 1. Consequently, for any object A of Ꮽ0 we have G(A) if q = 1, H q (A, k), (MG , 1) ∼ = H q A, (MG , 1) ∼ = 0 if q = 1. Note G(k) = 0 since G is connected. The definition of this map is as follows. Let x ∈ G(A). For each n > 0, define Fn as the pullback of the extension 0
/
pn G
/G O
0
/
pn G
/ Fn
pn
/G O
/0
/Z
/ 0,
where the right vertical arrow is defined as 1 → x. (This pullback is taken in the
381
FORMAL CHOW GROUPS
category of abelian sheaves on the flat site of Spec(A).) Let En = Fn /p n Fn . Then we have a short exact sequence 0 −→ pn G −→ En −→ Z/pn Z −→ 0, so that En is a finite flat group scheme over A. Furthermore, E = (En )n>0 forms a p-divisible group over A, and there is an exact sequence 0 −→ G −→ E −→ Qp /Zp −→ 0. Take B, D, and φ as in Section 4 (6). By taking the section of the associated crystalline Dieudonné module (and its canonical filtration) of the above sequence at a PD-thicking (Spec(A), Spec(Dn )), we have a commutative diagram with exact rows 0
/
0
/ M 1 ⊗W Dn + M G ⊗W J D n G
M G ⊗W Dn
/ Nn
/ Dn
/ 0
/ N1 n
/ Dn
/ 0,
where Nn ⊃ Nn1 is the one associated to E. By passage to the limit, we have 0
/
0
/ M 1 ⊗W Dˆ + MG ⊗W J ˆ G D
MG ⊗W Dˆ
/ N
/ Dˆ
/ 0
/ N1
/ Dˆ
/ 0.
Take a ∈ N 1 such that its image in Dˆ is 1. Since (1 − φ1 )(1) = 0 and ∇(1) = 0, we have 1 ˆ 1B . ⊗ Dˆ + MG ⊗ JDˆ n ⊕ Dˆ ⊗ " (1 − φ1 )(a), ∇(a) ∈ MG The image of x is defined to be the class of this element in H 1 (A, (MG , 1)). The proof of Theorem 6.3 is similar to that of Proposition 4.3. Using the notation 1 ∼ Lie G, we have as in the proof of Proposition 6.2, by (14) and the fact MG /MG = H 0 (Ꮿ) ∼ = Lie G ⊗W ker(A −→ A ). But the former group is isomorphic to ker(G(A) → G(A )). By induction, Lemma 6.4 completes the proof. The last assertion is deduced from (13). Lemma 6.4. We have H q (k, (MG , 1)) = 0 for any q. Proof. To calculate H q (k, (MG , 1)), we use the obvious surjection W → k with φ = σ . Then the complex (MG , 1)k,W is 1−φ1
1 + pMG −−−→ MG . MG
382
TAKAO YAMAZAKI
1 + pM → M is an isomorphism. There is a σ −1 We must show that 1 − φ1 : MG G G linear map V : MG → MG such that φ ◦ V = V ◦ φ = p (cf. [9, 9.8]). This map 1 ) (cf. Definition is characterized by V (φ0 (x) + φ1 (y)) = px + y (x ∈ MG , y ∈ MG 1 5.1(v)). This shows V (MG ) = MG + pMG . We also get 1 1 + pMG −→ MG + pMG V ◦ (1 − φ1 ) = V − 1 : MG
(1 − φ1 ) ◦ V = V − 1 : MG −→ MG . Since G is connected, the map V is topologically nilpotent. Hence both of V − 1 are isomorphisms. Thus the lemma follows. Next, we consider an étale p-divisible group. Proposition 6.5. Assume p = 2. Let G be an étale p-divisible group over W , and MG the object of MFW,tf corresponding to G. Then we have for any object A of Ꮽ0 1−φ1 1 −− −→ MG if q = 0, ker MG = MG q 1−φ 1 H A, (MG , 1) = coker MG = M 1 −−−→ MG if q = 1, G 0 if q ≥ 2. In particular, for any q ≥ 0 we have H q (A, k), (MG , 1) = 0. Proof. Since G is an étale p-divisible group, MG is of level [1, 1]. Then the following two lemmas complete the proof. Lemma 6.6. Let 0 ≤ s ≤ r < p. Let M be a torsion-free object of MFW,tf of level [s, ∞) and A an object of Ꮽsyn . Take B as in Section 4 (6). Then we have (M, r)A,B = M[s], r − s A,B , (M, r)(A,k),B = M[s], r − s (A,k),B . The proof is clear from the definition. Lemma 6.7. Let M be a torsion-free object of MFW,tf object A in Ꮽ0 , we have 1−φ 0 ker M −−−−→ M 1−φ 0 H q A, (M, 0) = coker M −− −−→ M 0 In particular, for any q ≥ 0 we have
of level [0, ∞). For any
if q = 0, if q = 1, if q ≥ 2.
FORMAL CHOW GROUPS
383
H q (A, k), (M, 0) = 0. Proof. As in the proof of Proposition 6.2, we see that H q (A, (M, 0)) = To calculate the latter group, we can use the natural surjection W → k with φ = σ . Then the complex (M, 0)k,W is H q (k, (M, 0)).
1−φ 0
M −−−−→ M. Hence the equality follows. The last assertion follows from (13). Corollary 6.8. Assume p = 2. Let G be a p-divisible group over W , and MG the object of MFW,tf corresponding to G. Then we have for any object A of Ꮽ0 H
q
Gc (A) if q = 1, (A, k), (MG , 1) = 0 if q = 1,
where Gc is the connected component of G. Proof. Let MGc be the object of MFW,tf corresponding to Gc . Then MG /MGc corresponds to the étale part of G. Taking B as in Section 4 (6), we have an exact sequence of complexes 0 −→ (MGc , 1)(A,k),B −→ (MG , 1)(A,k),B −→ (MG /MGc , 1)(A,k),B −→ 0. By the deduced long exact sequence, the corollary follows from Theorem 6.3 and Proposition 6.5. Remark 6.9. The sequence 0 −→ H 1 (A, k), (MG , 1) −→ H 1 A, (MG , 1) −→ H 1 k, (MG , 1) −→ 0 (16) is not exact in general. An example for a failure of the injectivity of the first map is given by A = Wn with n ≥ 2 and by a p-divisible group G that is a nontrivial extension of Qp /Zp by Qp /Zp (1). This is a big difference from the equal characteristic situation, which was considered in [3] and [18]. In fact, if A is an object of Ꮽsyn such that the structure morphism W → A factors through k, it is clear that (16) is a split exact sequence. We also remark that, if G is a trivial extension of an étale p-divisible group by a connected p-divisible group, for any object A of Ꮽ0 , (16) is also a split exact sequence by Theorem 6.3 and Proposition 6.5. (Remember that any p-divisible group over k is a trivial extension of an étale p-divisible group by a connected p-divisible group.) The following definition is taken from [11, II.2.12].
384
TAKAO YAMAZAKI
Definition 6.10. Let M be an object of MFW,tf . We say that M is of Hodge-Witt type if there is a finite sequence of subobjects (Mi )0≤i≤N in MFW,tf such that 0 = M 0 ⊂ M 1 ⊂ · · · ⊂ MN = M and such that, for each i, Mi+1 /Mi is of level [mi , mi + 1] for some integer mi . The following remark is proven in [11, II.2.15]. Remark 6.11. For X a smooth projective scheme over W of relative dimension d < p, the following conditions are equivalent. (i) Y is of Hodge-Witt type, that is, H q (Y, W "sY ) is of finite type over W for all q and s. q (ii) HdR (X/W ) is of Hodge-Witt type for all q. Proposition 6.12. Assume p = 2. Let M be a torsion-free object of MFW,tf of level [0, ∞) and of Hodge-Witt type. By [11, II.2.14], we have the “level-[0, 1] part” of M, that is, a subobject N of M such that N is of level [0, 1] and M/N is of level [1, ∞). Let Gc be the connected component of the p-divisible group corresponding to N. (Though the choice of N may not be unique for the level-[1, 1] part, Gc itself is independent of this choice. We have Lie Gc ∼ = N/N 1 ∼ = M/M 1 .) Then there exists a canonical isomorphism Gc (A) −→ H 1 (A, k), (M, 1) for any object A of Ꮽ0 . Furthermore, H q ((A, k), (M, 1)) = 0 unless q = 1. Proof. For q ≥ 0, we have H q ((A, k), (M/N, 1)) ∼ = H q ((A, k), (M/N[1], 0)) = 0 by Lemmas 6.6 and 6.7. From an exact sequence 0 −→ (N, 1)(A,k),B −→ (M, 1)(A,k),B −→ (M/N, 1)(A,k),B −→ 0, we have H q (A, k), (M, 1) ∼ = H q (A, k), (N, 1) . Then the proposition follows from Corollary 6.8. 7. K-cohomological functor and p-divisible groups. In this section, we prove Theorem A. Let the notation and assumptions be as in Theorem A. By Corollary 5.5, we have a spectral sequence j i,j E2 = H i (A, k), HdR (X/W ), r ⇒ H i+j (XA , Y ), (r) . i,j
We have E2 = 0 if i > r by Proposition 6.2. Since j ˆ " ˆ " ˆ ·X ∼ ˆ ·≥j −d , HdR (X/W ) = Hj X, = Hj X, X
FORMAL CHOW GROUPS
385
j
we see that HdR (W/X) is of the level [j − d, d]. Hence we have by Lemma 6.6 j i,j E2 = H i (A, k), HdR (X/W ), r j = H i (A, k), HdR (X/W )[j − d], r − j + d . i,j
Thus Proposition 6.2 shows E2 = 0 if i > r − j + d. Moreover, E20,d+r = 0 by 1,d+r−1 and that there exists a Lemma 6.7. These facts imply that E21,d+r−1 = E∞ canonical surjection d+r−1 H d+r (XA , Y ), (r) −→ H 1 (A, k), HdR (X/W ), r . d+r−1 (X/W ) to be of Hodge-Witt type, Proposition 6.12 and Since we assume HdR Lemma 6.6 provide canonical isomorphisms d+r−1 Gc (A) ∼ (X/W )[r − 1], 1 = H 1 (A, k), HdR d+r−1 ∼ (X/W ), r . = H 1 (A, k), HdR
These two maps and Corollary 4.6 complete the proof. Proposition 7.1. Let the notation and assumptions be as in Theorem A. Assume further d = r = 2. If we assume Conjecture 4.2, we have an analogue of Bloch’s conjecture about the Albanese kernel; that is, admitting Conjecture 4.2, the following conditions are equivalent: (i) H 2 (X, ᏻX ) = 0. M )(A) → G(A)) = 0 for any object A of Ꮽ . (ii) ker(Hˆ 2 (2,X 0 Proof. Let A be an object A of Ꮽ0 . By Remark 4.7, we have M H 2 2,X (A) ∼ = H 4 (XA , Y ), (2) . 3 (X/W ), 2)) Using the notation in the above proof, we have E20,3 = H 0 ((A, k), (HdR i,j = 0 by Proposition 6.12, and E2 = 0 if i ≥ 3 by Proposition 6.2. Hence we have 2,2 E22,2 = E∞ . From Remark 4.7, we have 2 M ker H 2,X (A) −→ G(A) 3 ∼ (X/W ), 2 = ker H 4 (XA , Y ), (2) −→ H 1 (A, k), HdR 2 ∼ (X/W ), 2 . = H 2 (A, k), HdR 2 (X/W ) is of level [1, 2], and we have, by Lemma 6.6 and Assume (i). Then HdR Proposition 6.2, 2 2 H 2 (A, k), HdR (X/W ), 2 ∼ (X/W )[1], 1 = 0, = H 2 (A, k), HdR
that is, (ii) holds. Assume H 2 (X, ᏻX ) = 0. Then the following lemma completes the proof.
386
TAKAO YAMAZAKI
Lemma 7.2. Let M be a torsion-free object of MFW,tf of level [0, ∞). Assume M 0 = M 1 . We set A = Wn [T ]/(T e ). Then for sufficiently large e and n, we have H 2 (A, k), (M, 2) = 0. Proof. Set r = rank W (M), r1 = rank W (M 1 ). Let n ≥ 3 and e > ps with s > r1 /(r − r1 ). Since H 2 (k, (M, 2)) = 0 by Proposition 6.2, it is enough to show that H 2 (A, (M, 2)) = 0. Let B = W [T ] endowed with the Frobenius φ : T → T p . Let D = DB ((p n , T e )) be the PD-envelope algebra and Dˆ its p-adic completion with the induced PD-structure on JDˆ . Define a complex 1 (M, 2)A,B by the exactness of p
0 −→ (M, 2)A,B −→ (M, 2)A,B −→ 1 (M, 2)A,B −→ 0. Since H 3 (A, (M, 2)) = 0 by Proposition 6.2, it is enough to show that H 2 (A, ˆ 2 = 0, we have 1 (M, 2)) = 0. Noting " B H 2 A,1 (M, 2) (1−φ2 ,∇) ˆ −− −−−−→ M ⊗ Dˆ ⊗ "1B1 . = coker M 1 ⊗ Dˆ + M ⊗ JDˆ ⊗ "1B1 ⊕ (M ⊗ D) i Let F be the image of ⊕si=1 M ⊗ W T p −1 dT under the natural map M ⊗ Dˆ ⊗ "1B → ˆ ∩ F = 0 and (1 − φ2 )(M ⊗ M ⊗ Dˆ ⊗ "1B1 . Then dimk F = rs. We have ∇(M ⊗ D) 1 1 JDˆ ⊗ "B1 ) ∩ F = 0. We also have dimk ((1 − φ2 )(M ⊗ Dˆ ⊗ "1B1 ) ∩ F ) ≤ r1 (s + 1). Since rs > r1 (s + 1) by definition, this completes the proof.
Remark 7.3. Let the notation and assumption be as in Theorem A, but do not d+r−1 assume HdR (X/W ) to be of Hodge-Witt type. For r = d, we have a composition map 2d M −→ H 2d XA , (d) −→ H 0 A, HdR (X/W ), d ∼ H d XA , d,X = Zp , A 2d (X/W ) = W [−d] (cf. Lemmas 6.6 and 6.7). This map can be viewed because of HdR as the formal completion of the degree map
CHd (X ⊗W K) −→ Z, where K is the field of fractions of W . Appendix In this appendix, we show that the map in Theorem B is an isomorphism modulo essentially zero functors. We need the following definition, which is taken from Stienstra [18, p. 2].
FORMAL CHOW GROUPS
387
Definition A.1. A functor F from Ꮽsyn to the category of abelian groups is called essentially zero if, for any object A of Ꮽsyn , there exists a surjection A → A in Ꮽsyn such that the induced map F (A ) → F (A) is zero. Theorem A.2. Let the notation and assumptions be as in Theorem B. Then the functors M A −→ ker Hˆ d r,X (A) −→ H d+r (XA , Y ), (r) , M A −→ coker Hˆ d r,X (A) −→ H d+r (XA , Y ), (r) are essentially zero. Remember that a functor F from Ꮽsyn to the category of abelian groups is called formally smooth if, for any surjection A → A in Ꮽsyn , the induced map F (A ) → F (A) is surjective. This theorem suggests that, roughly speaking, at least the “formally M ) and H d+r ((X , Y ), (r)) are isomorphic. So, the smooth parts” of functors Hˆ d (r,X . author wishes that this theorem play a role when one considers a generalization of the Dieudonné theory, which might be an analogue of Stienstra’s theory [18]. Proof. If A → A¯ is a surjection in Ꮽsyn , and if the theorem is true for A, then it ¯ It therefore suffices to prove the theorem for is also true for A.
A = Wn [T1 , . . . , Ts ] T1m1 , . . . , Tsms with s ≥ 0, n, m1 , . . . , ms ≥ 1. Thus, the theorem about the cokernel is already proven in Theorem B. In the rest of this appendix, we fix such an A and show M ker Hˆ d r,X (A) −→ H d+r (XA , Y ), (r) = 0. By the proof of Corollary 4.6, it is enough to show that the symbol map M r,X −→ Ᏼr (r)(XA ,Y ) A ,Y is an isomorphism (cf. Remark 4.7). Let R be a p-adically complete local ring that is essentially smooth over W . We use the notation RA = R ⊗W A, Rk = R ⊗W k, and so forth. We can define the syntomic cohomology groups H r ((RA , Rk ), (r)) and the symbol map, as in Section 2. What we need is to prove the following. Proposition A.3. The symbol map induces an isomorphism ker KrM (RA ) −→ KrM (Rk ) −→ H r (RA , Rk ), (r) . Since this proposition is proved by following Kurihara’s proof in [14], [15], we only sketch it. We can define the groups H r ((RA , RWn ), (r)), by replacing k by
388
TAKAO YAMAZAKI
Wn in the definition of the relative syntomic cohomology groups. Noting that the surjection A → Wn (defined by Ti → 0 for any i) has a section Wn → A (induced by the natural map W → A), we have a commutative diagram with exact rows 0
/ K M RA , RW n r
0
r
κ1
/H
RA , RWn , (r)
/ K M RW , R k n r
/ K M (RA , Rk ) r / H (RA , Rk ), (r)
r
/0
κ2
/H
r
RWn , Rk , (r)
/ 0.
(Here we use an ad hoc notation KrM (RA , Rk ) = ker(KrM (RA ) → KrM (Rk )), etc., since it corresponds to our notation of the relative syntomic cohomology groups. This is not the standard notation.) Now the proposition is reduced to showing that κ1 and κ2 are isomorphisms. First we consider κ2 . Using the natural surjection R → Rn , we can see that d ˆ r−1 /p n " ˆ r−2 −→ ˆ r−1 . H r RWn , Rk , (r) ∼ p" = coker p 2 " R R R Kurihara constructed the exponential homomorphism (cf. [14, 0.1, 3.3], [15, 2.10])
ˆ r−2 −→ KrM (R, Rk )ˆ. ˆ r−1 d p 2 " p" R R (Here “ ˆ ” means the completion with respect to some filtration; cf. loc. cit.) By compositing it with the natural map KrM (R, Rk )ˆ→ KrM (RWn , Rk ), we get the inverse map of κ2 . Next, we consider κ1 . Choose a p-base of R/pR, and take their lift I . One can find a Frobenius φ on R such that φ(X) = X p for any X ∈ I . Let B = R[T1 , . . . , Ts ], and p extend φ to B by φ(Ti ) = Ti for any i. Let (D, J ) be the PD-envelope of B with respect to the kernel of the natural surjection B → RA . Define ˆq , ˆ "qB = ker D ⊗ ˆ "qB −→ " UD ⊗ R ˆq . ˆ "qB = ker J ⊗ ˆ "qB −→ " UJ ⊗ R Then we can see H r RA , RWn , (r) (1−φr ,d) ∼ ˆ "r−1 ˆ "r−1 ˆ "r−2 −−−−−−→ U D ⊗ ⊕ UD ⊗ . = coker U J ⊗ B B B Following Kurihara’s proof (cf. [14], [15]), we have a homomorphism M ˆ "r−1 E : UD ⊗ B −→ Kr RA , RWn
389
FORMAL CHOW GROUPS
such that
dX1 dXr−1 E a ··· X1 Xr−1
= s(a), X1 , . . . , Xr−1 ,
a ∈ (T1 , . . . , Ts )B,
Xi ∈ I ∪ {T1 , . . . , Ts } ˆ "r−1 = 0, E UJ ⊗ B
for any i,
j where s = exp( ∞ j =0 φ1 ) (cf. [15, 1.1]). By [15, 1.3], we see that E factors through r H ((RA , RWn ), (r)) and gives the inverse map of κ1 . References [1] [2]
[3] [4] [5] [6] [7] [8]
[9] [10]
[11]
[12] [13] [14] [15] [16] [17]
P. Berthelot and A. Ogus, Notes on Crystalline Cohomology, Princeton Univ. Press, Princeton, 1978. S. Bloch, “Algebraic K-theory and algebraic geometry” in Algebraic K-Theory, I: Higher KTheories (Seattle, Wash., 1972), Lecture Notes in Math. 341, Springer-Verlag, Berlin, 1973, 259–265. , K2 of Artinian Q-algebras, with application to algebraic cycles, Comm. Algebra 3 (1975), 405–428. , Lectures on Algebraic Cycles, Duke Univ. Math. Ser. 4, Duke Univ. Mathematics Department, Durham, N.C., 1980. , “p-adic étale cohomology” in Arithmetic and Geometry, Vol. 1, Progr. Math. 35, Birkhäuser, Boston, 1983, 13–26. J.-M. Fontaine, Groupes finis commutatifs sur les vecteurs de Witt, C. R. Acad. Sci. Paris Sér. A-B 280 (1975), A1423–A1425. , Groupes p-divisibles sur les corps locaux, Astérisque 47–48, Soc. Math. France, Paris, 1977. , “Cohomologie de de Rham, cohomologie cristalline et représentations p-adiques” in Algebraic Geometry (Tokyo/Kyoto, 1982), Lecture Notes in Math. 1016, SpringerVerlag, Berlin, 1983, 86–108. J.-M. Fontaine and G. Laffaille, Construction de représentations p-adiques, Ann. Sci. École Norm. Sup. (4) 15 (1982), 547–608. K. Kato, “Milnor K-theory and the Chow group of zero cycles” in Applications of Algebraic K-Theory to Algebraic Geometry and Number Theory, Contemp. Math. 55, Amer. Math. Soc., Providence, 1986, 241–253. , “On p-adic vanishing cycles (application of ideas of Fontaine-Messing)” in Algebraic Geometry (Sendai, 1985), Adv. Stud. Pure Math. 10, North-Holland, Amsterdam, 1987, 207–251. , The explicit reciprocity law and the cohomology of Fontaine-Messing, Bull. Soc. Math. France 119 (1991), 397–441. N. Katz, Nilpotent connections and the monodromy theorem: Applications of a result of Turrittin, Inst. Hautes Études Sci. Publ. Math. 39 (1970), 175–232. M. Kurihara, The exponential homomorphisms for the Milnor K-groups and an explicit reciprocity law, J. Reine Angew. Math. 498 (1998), 201–221. , The Milnor K-groups of a local ring over a ring of p-adic integers, preprint. C. Soulé, Opérations en K-théorie algébrique, Canad. J. Math. 37 (1985), 488–550. J. Stienstra, “The formal completion of the second Chow group, a K-theoretic approach” in Journées de géométrie algébrique de Rennes (Rennes, 1978), Vol. 2, Astérisque 64, Soc. Math. France, Paris, 1979, 149–168.
390 [18] [19]
TAKAO YAMAZAKI , Cartier-Dieudonné theory for Chow groups, J. Reine Angew. Math. 355 (1985), 1–66; Correction, J. Reine Angew. Math. 362 (1985), 218–220. J.-P. Wintenberger, Un scindage de la filtration de Hodge pour certaines variétés algébriques sur les corps locaux, Ann. of Math. (2) 119 (1984), 511–548.
Institute of Mathematics, University of Tsukuba, Tendoudai 1-1-1, Tsukuba-shi, Ibaraki, 305-8571, Japan;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
VARIATIONAL PROPERTIES OF A NONLINEAR ELLIPTIC EQUATION AND RIGIDITY M. L. BIALY and R. S. MACKAY
1. Introduction and main results. In this paper, we discuss variational properties of classical solutions of the nonlinear equation u = −Vu (u, x1 , . . . , xn ) , which is the Euler-Lagrange equation of the functional 1 I (u) = (∇u)2 − V (u, x1 , . . . , xn ) dx1 · · · dxn . 2
(1.1)
(1.2)
The main question addressed here is the following: Under what conditions on the potential V are all classical solutions of (1.1) globally minimising for the functional (1.2)? By a global minimiser we mean a smooth function on a domain of Rn minimising the integral (1.2) over all bounded subdomains with smooth boundaries with respect to smooth functions with the same boundary values. The motivation for this question comes from variational problems of classical mechanics. In geodesic problems it is well known that all geodesics are globally minimising on manifolds of negative sectional curvature. However, it was shown first by Hopf [12] and Green and Gulliver [9] that the situation is completely different for Riemannian (two) tori or for Riemannian planes that are flat outside a compact set. They proved that for these manifolds, there always exist geodesics with conjugate points, which are therefore nonminimal, unless the metric is flat. We refer the reader to [4] and [6] for higher-dimensional generalisations of Hopf and Green-Gulliver theorems and to [8], [7], [13], and [10] for very important previous developments. It was observed first in [2] that the Hopf phenomenon is not entirely Riemannian. In [3] it was shown that a similar type of rigidity holds for Newton’s equations with periodic or compactly supported potentials. For the equation (1.1) with periodic potentials, it was shown in [15] and [16] that far-reaching generalisations of Aubry-Mather and KAM (Kolmogorov-ArnoldMoser) theories apply. Using these theories, one can construct families of minimal solutions that form laminations or sometimes even foliations of the configuration torus. It is an interesting open question, however, if it happens that, for all slope Received 3 February 1999. 1991 Mathematics Subject Classification. Primary 35J60; Secondary 58F18, 53C99. Authors’ work supported by Engineering and Physical Sciences Research Council grant number GR/M11349. 391
392
BIALY AND MACKAY
vectors, these laminations are genuine foliations. This question is tightly related to the one we are addressing in this paper, since all the leaves of a foliation are globally minimal. We assume throughout this paper that the potential V is compactly supported. The main reason for this is to make the analogy with the aforementioned situations of ≤ 0 everywhere which Hopf rigidity and to exclude, in particular, the case with Vuu is analogous to nonpositive curvature. Nevertheless, several of our results apply in more generality, as we indicate. We show that in case of dimensions greater than 2, there are many potentials such that all solutions of (1.1) are minimising (see Theorem 1). In dimension 2, however, at least for radially symmetric potentials, there always exist nonminimal solutions of (1.1) unless V vanishes identically (see Theorem 2). We state Theorems 1 and 2 here. Theorem 1. For n ≥ 3, let V be a compactly supported potential on Rn+1 . As (u, x) ≤ U (x) for some function U such that either sume that Vuu (A) U (x) ≤ ((n − 2)/2)2 / x − x0 2 for some point x0 ∈ Rn , or (B) U n/2 ≤ (n(n − 2)/4)|Sn | where |Sn | is the volume of the unit n-sphere. Then any solution of (1.1) is globally minimising. Theorem 2. Let V (u, x1 , x2 ) be a radially symmetric compactly supported potential (n = 2). There always exist radial nonminimal solutions of (1.1) unless V vanishes identically. Our approach to the proof of Theorem 2 is based on the reduction to a Newton equation with the potential supported in the semistrip. It turns out that a technique analogous to Hopf’s can be applied for such a shape of support. In Section 3, we consider the case of radially symmetric potentials for n ≥ 3. We illustrate the minimality property of radial solutions of (1.1), organising them in foliations (see Theorem 3). Acknowledgements. This paper was written while the first author was visiting the University of Cambridge. We would like to thank the EPSRC for their support. 2. Proof of Theorem 1. Let u be a solution of (1.1) in a domain , and let υ be any function with the same boundary values. Set ξ = υ − u and write (∇ξ )2 I (υ) − I (u) = − V (u + ξ, x) − V (u, x) d n x. ∇u∇ξ + (2.1) 2 Integrating the first term by parts and using the equation (1.1), we have ∇u∇ξ dx = Vu (u, x)ξ d n x.
By the assumption on V , we have 1 V (u + ξ, x) − V (u, x) − Vu (u, x)ξ ≤ U (x)ξ 2 . 2
VARIATIONAL PROPERTIES AND RIGIDITY
393
Substituting this in (2.1), we obtain I (υ) − I (u) ≥
(∇ξ )2 1 − U (x)ξ 2 d n x. 2 2
(2.2)
We show that under hypothesis (A) or (B), the last integral is always positive unless ξ ≡ 0. Indeed, for case (A), introduce spherical coordinates centred at x0 , r = x − x0 . The required assertion follows from the next lemma. Lemma 1. For any function ξ(r) defined on [r1 , r2 ] (0 ≤ r1 < r2 ≤ ∞) with ξ(r2 ) = 0, it follows that
r2 r1
r
n−1
2 n−2 2 ξ2 ξ − dr ≥ 0 2 r2
with equality for ξ ≡ 0 only. Proof. Introduce the new function ϕ(r) = ξ(r) · r (n/2)−1 and substitute into the integral. We have
r2 r1
r
n−1
2 2 − n −n/2 (n − 2)2 −n 2 r r ϕ dr ϕ + ϕ − r 2 4 r2 r2 2 2 n−2 2 ϕ (r1 ) + r ϕ dr. r · ϕ + (2 − n)ϕϕ dr = = 2 r1 r1
1−(n/2)
Since n ≥ 3, we see that the last expression is always nonnegative and equals zero only for ξ ≡ 0. This completes the proof of Lemma 1 and of the theorem in case (A). Alternatively, under (B), Sobolev’s inequality (e.g., [14, §8.3]) implies that |∇ξ |2 d n x ≥ Sn ξ 22n/(n−2) , where
n(n − 2) n |S |. 4 Then Hölder’s inequality applied to the second term of the integrand (as in [14, §11.3]) yields U ξ 2 d n x ≤ ξ 2 U n/2 . n/(n−2) Sn =
Thus,
1 Sn − U n/2 ξ 22n/(n−2) . 2 This completes the proof in case (B). I (v) − I (u) ≥
394
BIALY AND MACKAY
Remark 1. Lemma 1 and its proof have a precursor in [5, Ch. 6, §5], and the whole of Theorem 1 fits in the general domain of “absence of bound states,” described, for example, in [17, §8.3]. Remark 2. In addition, our proof shows that any solution is a nondegenerate minimum for (1.2). In a different way, one can say this as follows: For any solution u on , the linearised equation ξ + Vuu (u, x)ξ = 0 has only the trivial solution satisfying the zero boundary conditions ξ |∂ = 0. 3. Radial potentials. Let us now consider the case of radial, compactly supported (u, r) ≤ 1/4r 2 . Radial solutions u(r) of (1.1) are potentials V (u, r) satisfying Vuu given by the following: u +
n−1 u + Vu (u, r) = 0. r
It follows from Theorem 1(A) that for n ≥ 3, all radial solutions are without conjugate points. Moreover, Lemma 1 gives that the points r1 and ∞ are not conjugate for any r1 > 0. This fact enables us to organise the solutions into foliations in the following way. Denote by NA the class of all those solutions that can be written as u(r, α) = (α/r n−2 ) + A for some α outside the support of V . (u, r) ≤ Theorem 3. For compactly supported radial potential V satisfying Vuu 2 1/4r , the set NA is totally ordered and the graphs of the solutions define a smooth foliation of Rn+1 − R(u) × {x = 0}.
Proof of Theorem 3. Given A, we have to show that the function u(r, α) is monotone in α. Indeed, the function ξ(r) = ∂u/∂α is a solution of the linearised equation. Note that by definition, ξ(∞) = 0. Then the nonconjugacy property implies that ξ > 0, and then it is easy to complete the proof. Remark 3. In some cases, there is another way of organising the set of all solutions into foliations. Assume we are given a radial potential with V (u, r) ≡ 0 for 0 < r ≤ r0 and r ≥ R0 satisfying the inequality of Theorem 3. Define the set MA of all those solutions that can be written as u(r, α) = (A/r n−2 )+α when r ≤ r0 . Then one shows that for any given A the set MA is ordered and the graphs of the solutions smoothly foliate the space Rn+1 −R(u)×{x = 0}. In addition, M0 foliates the whole of Rn+1 . In order to check the order property, one shows that the linearised equation has no focal points in the following sense: Any solution satisfying ξ˙ (r1 ) = ξ(r2 ) = 0 is trivial, provided r1 < r2 (this is not necessarily true for r1 > r2 ). Then one proceeds exactly as in the proof of Theorem 3. 4. Rigidity for the case n = 2. Let u(r) be a radial solution of the equation (1.1) for compactly supported rotationally symmetric V (u, r), V (u, r) ≡ 0 for |u| ≥ U or
VARIATIONAL PROPERTIES AND RIGIDITY
395
r > R. With the substitution r = et , the equation for u as a function of t can be written in the Hamiltonian form u˙ = p, p˙ = −e2t Wu (u, t).
(4.1)
The Hamiltonian function of (4.1) is 1 H = p2 + e2t W (u, t) 2 with the function W (u, t) = V (u, et ). Note that the support of W is contained in the semistrip " = {|u| ≤ U, t ≤ T = ln R}. We prove the following rigidity result, which implies Theorem 2. Theorem 4. There always exist solutions with conjugate points for (4.1), unless W vanishes identically. The strategy of the proof follows the original one of Hopf, but it takes special care about the noncompactness of the situation, which requires careful estimates on ω given in Lemmas 2 and 3. In what follows, we assume that all solutions of (4.1) are without conjugate points. The first step in the proof is the following very well-known construction: If the solution u(t) has no conjugate points, then one can easily construct a nonvanishing solution ξ of the linearised Jacobi equation ξ + e2t Wuu u(t), t ξ = 0. Having such a ξ(t) for every u(t), one defines the function ω(p, u, t) by the formula ω(p, u, t) =
ξ˙ (t) ξ(t)
when p = u(t), ˙ u = u(t).
Then ω satisfies the Riccati equation along the flow of (4.1): ω˙ + ω2 + e2t Wuu u(t), t = 0.
(4.2)
Here · stands for the derivative along the flow. It should be mentioned that by the construction of ξ and ω, the function ω is a priori only measurable (and smooth along the flow). Introduce K by the following:
(u, t). K= sup Wuu (t,u)&"
We need the following two lemmas specifying the behaviour of the function ω at infinity.
396
BIALY AND MACKAY
Lemma 2. The following statements hold true: |ω(p, u, t)| ≤ KeT
for all (p, u, t), 1 for t > T . 0 ≤ ω(p, u, t) < t −T
(4.3) (4.4)
There exists a constant K˜ such that for all (p, u, t) with t < T , ˜ t/4 < ω(p, u, t) < Ket . −Ke
(4.5)
Lemma 3. The function ω satisfies the following inequalities: (I) For t ≤ T , then 0 ≤ ω < Ket−((u−U )/p) ;
if u > U, p > 0, if u < −U, p < 0,
then 0 ≤ ω < Ke
if u > U − p(T − t), p ≤ 0
or
(4.6)
t+((−U −u)/p)
;
(4.7)
u < −U − p(T − t), p ≥ 0,
then ω ≡ 0. (4.8)
(II) For t > T , if u > U + p(t − T ), p > 0,
then 0 ≤ ω < Ket−((u−U )/p) ;
(4.9)
if u < U + p(t − T ), p < 0,
then 0 ≤ ω < Ket+((−u−U )/p) ;
(4.10)
if u < −U, p ≥ 0
or
u > U, p ≤ 0,
then ω ≡ 0.
(4.11)
We postpone the proof of Lemmas 2 and 3 and first finish the proof of the theorem. Proof of Theorem 4. In order to achieve decay also in the p direction, introduce the Gibbs density α(p, u, t) = e−H = e−1/2p
2 −e2t W (u,t)
.
We have α˙ = −e−H H˙ = −e−H Ht = −α e2t W t .
(4.12)
Now recall that the Hamiltonian flow of (4.1) preserves the Liouville measure dµ = dp du. Multiply the Riccati equation (4.2) by α, and write it in the form ( ˙
αω − αω ˙ + αω2 + αe2t Wuu = 0.
Substitute equation (4.12) to obtain ( ˙ αω + αω e2t W t + αω2 + αe2t Wuu (u, t) = 0.
397
VARIATIONAL PROPERTIES AND RIGIDITY
Owing to the estimates of Lemmas 2 and 3, one can integrate this equation over the whole (p, u) space. In addition, use the invariance of the measure and write 2t d 2 αω dµ + αω e W t dµ + αω dµ + αe2t Wuu dµ = 0. (4.13) dt Using integration by parts, the last term can be replaced by α(e2t Wu )2 dµ. Now integrate the last equation (4.13) for −A ≤ t ≤ A for a large constant A and pass to the limit A → +∞. Note that by the uniform estimate of Lemma 2, the term αω dµ|A −A vanishes in the limit. So we have 2 αω W e2t t dµ dt + αω2 dµ dt + α e2t W dµ dt = 0. (4.14) By the Cauchy-Schwarz inequality, we can estimate the first integral of (4.14) by
αω W e2t t dµ dt ≥ −
αω2 dµ dt
2 α W e2t t dµ dt
1/2 .
With the notation x = ( αω2 dµ dt)1/2 , we have the quadratic inequality 2t 2 2 e Wu α dµ dt ≤ 0. x 2 − x · α W e2t t dµ dt + Then its discriminant must be nonnegative: 2t 2 2 4 α e Wu dµ dt ≤ α e2t W t dµ dt.
(4.15)
The final argument in the proof is the following rescaling trick. It is similar to one invented in [3] for periodic potentials. Consider the family of Hamiltonians for every natural number N: HN =
1 1 1 H (Np, Nu, t) = p2 + 2 e2t W (Nu, t) . 2 N2 N
It can be immediately checked that the property of having all solutions without conjugate points remains valid for all N. Thus the inequality (4.15) implies the following inequalities for all N: 4
e−(1/N
2 1 Wu (Nu, t) du dt N 2 1 2t −(1/N 2 )W (Nu,t)e2t e W (Nu, t) t du dt. ≤ e N2
2 )W (Nu,t)e2t
e2t
(Here the integration with respect to p has been performed on both sides.)
398
BIALY AND MACKAY
Change the variable in both integrals to υ = Nu. We obtain the inequality: 4 N3
e−(1/N
2 )W (υ,t)e2t
1 ≤ 5 N
2 e2t Wu (υ, t) dυ dt
e−(1/N
2 )W (υ,t)e2t
2 e2t W (υ, t) t dυ dt.
Now it is clear that if W is not zero identically, then the left side is of order 1/N 3 while the right side is of order 1/N 5 as N → ∞. This then proves that W ≡ 0 identically. The proof of the theorem is complete. Proof of Lemma 2. By the definition of K, we have ω˙ ≤ K 2 e2t − ω2
for t ≤ T
and ω˙ = −ω2
for t > T .
The proof is based on the following elementary fact. Fact. Any solution of the inequality ω˙ ≤ B 2 − ω2 ,
ω (t0 ) = ω0
blows up if |ω0 | > B. Moreover, the blow-up time t∗ is estimated as follows for ω0 > B,
t 0 + < t ∗ < t0 ;
for ω0 < −B,
t0 < t∗ < t0 + , ω0 − B 1 ln . where = 2B ω0 + B To prove (4.3), we take B = KeT and obtain the required estimate. In the same manner, we obtain (4.4) and the right-hand side of (4.5). Let us prove the rest of (4.5). Pick two moments of time τ0 < τ1 < 0. On the segment τ ∈ [τ0 , τ1 ], we have ω˙ ≤ B 2 − ω2 with B = Keτ1 . If ω (τ0 ) = ω0 is too negative, then the blow-up happens before τ1 , and this is impossible. Thus we obtain 1 ω0 − B = ln > τ 1 − τ0 . 2B ω0 + B This implies 1 + e2Bd . e2Bd − 1 Choose τ1 = τ0 /2; then this inequality can be written in the form: ω0 ≥ −B
ω0 ≥ −eτ0 /4 · f (τ0 ) ,
VARIATIONAL PROPERTIES AND RIGIDITY
where
399
τ /2
f (τ0 ) = Keτ0 /4 ·
1 + e−Kτ0 e 0
τ /2
1 + e−Kτ0 e 0
.
An easy calculation shows that for τ0 → −∞, f → 0, and thus f (τ0 ) is bounded ˜ Since τ0 was arbitrary, (4.5) follows. from above by some positive constant K. Proof of Lemma 3. Any point (p, u, t) with |u| > U is situated outside the support " and so moves in the straight line u(t) = u+pt unless it hits ". If it does not touch ", then ω vanishes identically. This is the case in both (4.8) and (4.11). In the cases of (4.6)–(4.7) and (4.9)–(4.10), the line u(t) = u + pt hits " in backward time. Since ω˙ = −ω2 for the free motion, it follows that ω(p, u, t) has to be nonnegative and can be bounded from above by Ket˜, where t˜ is the time of entering ". This completes the proof of Lemma 3. Although by Theorem 4 there are always solutions with conjugate points, one cannot claim that they appear on those radial solutions that are regular at zero. The next example shows that all regular solutions may remain minimal. Example 4.1. Consider two compactly supported functions ,(u) and -(t). Define the function 2 1 ˙ ˙ + -(t) ,(u) . W (u, t) = −e−2t -(t),(u) 2 Then W (u, t) is compactly supported, and one can easily verify that all the solutions of ˙ u˙ = f (u, t) with f (u, t) = ,(u)-(t)
(4.16)
are the solutions of the Newton equation ü = −e2t Wu (u, t). The solutions of (4.16) form a foliation of R(u) × R(t) and then, by Weierstrass’s theorem of calculus of variations (see, e.g., [11, §23]), are globally minimal. It is clear from the construction that all corresponding radial solutions u(r) of (1.1) are regular at zero. 5. Discussion and open problems. The results of this paper can be generalised in several directions. We have formulated our results in the context of compactly supported potentials V on R × Rn , but most of them extend to some noncompactly supported cases. Theorem 1 and its proof apply verbatim to V of noncompact support, in particular to V periodic in u, but periodicity in x is excluded by each of the hypotheses (A) and (B). Theorem 2 can be generalised to V periodic in u and compactly supported and radially symmetric in x. Our work, however, leaves open some very natural questions. (1) To what other radially symmetric cases can Theorem 2 be extended?
400
BIALY AND MACKAY
(2) What results can be obtained for potentials periodic in both u and x? The fundamental papers [15] and [16] by Moser imply, for our equation, the existence of minimal laminations for all irrational slopes which for certain slopes are foliations. In this context, Question 2 can be reformulated as follows: Are there other periodic potentials except those with Vu (u, x) = 0 such that for any slope there is a smooth foliation of Tn+1 by minimal solutions? We refer the reader to the survey article by Bangert [1] for detailed discussions of this and related questions. (3) Does Theorem 2 extend to nonradial potentials V ? The proof of Theorem 2 is based on the reduction to the case of Hamiltonian systems and so cannot be generalised in a straightforward way to nonradial potentials V . However, it is very reasonable to expect that the equation perturbed by a compactly supported W , u = −Vu (u, r) + εWu (u, x1 , x2 ), always has nonminimal solutions for ε sufficiently small. In the case of Hamiltonian systems, it would follow from the theorem on continuous dependence of solutions on the initial values. (4) Regarding Theorem 4, if the support of W (u, t) were compact, then absence of conjugate points would imply that every trajectory on the (u, t)-plane leaves the support of W in the same direction as it entered it. It would be interesting to know if this a priori weaker assumption already implies that W is zero. Related questions in the Riemannian case were considered in [6]. References [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
V. Bangert, “Minimal foliations and laminations” in Proceedings of the International Congress of Mathematicians, Vol. 1, 2 (Zürich, 1994), Birkhäuser, Basel, 1995, 453–464. M. Bialy, Convex billiards and a theorem by E. Hopf, Math. Z. 214 (1993), 147–154. M. Bialy and L. Polterovich, Hopf-type rigidity for Newton equations, Math. Res. Lett. 2 (1995), 695–700. D. Burago and S. Ivanov, Riemannian tori without conjugate points are flat, Geom. Funct. Anal. 4 (1994), 259–269. R. Courant and D. Hilbert, Methods of Mathematical Physics, Vol. 1, Interscience Publishers, New York, 1953. C. Croke, Rigidity and the distance between boundary points, J. Differential Geom. 33 (1991), 445–464. C. Croke and A. Fathi, An inequality between energy and intersection, Bull. London Math. Soc. 22 (1990), 489–494. C. Croke and B. Kleiner, On tori without conjugate points, Invent. Math. 120 (1995), 241–257. L. Green and R. Gulliver, Planes without conjugate points, J. Differential Geom. 22 (1985), 43–47. J. Heber, On the geodesic flow of tori without conjugate points, Math. Z. 216 (1994), 209–216. D. Hilbert, Mathematical problems, Bull. Amer. Math. Soc. 8 (1902), 437–479. E. Hopf, Closed surfaces without conjugate points, Proc. Nat. Acad. Sci. U.S.A. 34 (1948), 47–51. A. Knauf, Closed orbits and converse KAM theory, Nonlinearity 3 (1990), 961–973.
VARIATIONAL PROPERTIES AND RIGIDITY [14] [15] [16] [17]
401
E. Lieb and M. Loss, Analysis, Grad. Stud. Math. 14, Amer. Math. Soc., Providence, 1997. J. Moser, Minimal solutions of variational problems on a torus, Ann. Inst. H. Poincaré Anal. Non Linéaire 3 (1986), 229–272. , A stability theorem for minimal foliations on a torus, Ergodic Theory Dynam. Systems 8 (1988), Charles Conley Memorial Issue, 251–281. M. Reed and B. Simon, Methods of Modern Mathematical Physics, IV: Analysis of Operators, Academic Press, New York, 1978.
Bialy: School of Mathematical Sciences, Tel-Aviv University, Tel-Aviv 69978, Israel Mackay: Nonlinear Centre, Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Silver Street CB3 9EW, United Kingdom
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
EXAMPLES WITH BOUNDED DIAMETER GROWTH AND INFINITE TOPOLOGICAL TYPE X. MENGUY
0. Introduction. In this paper, we construct an example of an open manifold with positive Ricci curvature, infinite topological type, and bounded diameter growth. Let M n be an open manifold with base point p ∈ M n . For r > 0, denote by C(p, r) the union of the unbounded components of M n \Br (p). U. Abresch and D. Gromoll [AG] defined diameter growth as 3 diam(p, r) = sup diam k , C p, r , (0.1) 4 where the supremum is taken over all components k of ∂C(p, r) and diam(k , C(p, (3/4)r)) is the diameter of k calculated with respect to the intrinsic distance of C(p, (3/4)r). Here, we show the following theorem. Theorem 0.2. There exists a complete 4-dimensional manifold with (0.3)
positive Ricci curvature,
(0.4)
infinite topological type, and
(0.5)
bounded diameter growth.
This result can be generalized to any dimension greater than 5 by taking products with any compact manifold of positive Ricci curvature. If we take a product with a circle, we get a 5-dimensional example satisfying (0.4) and (0.5) but whose Ricci curvature is nonnegative. Theorem 0.2 is the first example of a manifold satisfying (0.3), (0.4), and (0.5). Previously, J. Sha and D. G. Yang [ShYg] constructed manifolds satisfying (0.3) and (0.4); see also the examples of M. T. Anderson [An]. Examples with Ricci-flat Kähler metrics were later given by Anderson, Kronheimer, and LeBrun [AnKL]. All these examples are collapsing, namely, limr→0 r −n Vol(Br (p)) = 0. Among them, the smallest diameter growth is due to [ShYg] and is equivalent to r 2/3 . More recently, G. Perelman [P] constructed 4-dimensional compact manifolds with positive Ricci curvature, large volume, and large Betti numbers. We used these constructions in [M2] to give examples of manifolds with (0.3), (0.4), and Euclidean volume growth. In contrast to the examples of Sha and Yang, the sectional curvature of our example in Theorem 0.2 is not bounded from below. Indeed, according to a result of Abresch Received 11 May 1999. 1991 Mathematics Subject Classification. Primary 53C20; Secondary 53C21. 403
404
X. MENGUY
and Gromoll [AG], an n-dimensional manifold with nonnegative Ricci curvature, sectional curvature bounded from below, and diameter growth smaller than r 1/n has finite topological type. Note also that in dimensions 2 and 3 any complete manifold satisfying (0.3) is diffeomorphic to Euclidean space. This follows from the work of S. Cohn-Vossen [C] and of R. Schoen and S. T. Yau [SY], respectively. The idea of this paper is to start with a cylinder over a lemon (that is, a spherical suspension over a small 2-sphere). We replace some cylindrical sectors by spherical sectors such that locally it looks like a singular 4-sphere. We then remove a ball centered on the singular set and glue in a CP 2 with a point removed via the following metric of Perelman [P]. Let (S n , g) be a rotationally symmetric metric with sectional curvature greater than 2 , where dσ 2 1 on the n-sphere. That is, g = dt 2 + B 2 (t)dσn−1 n−1 is the metric of the unit (n − 1)-sphere. We denote by [0, πr0 ] the range of t and denote by R0 the maximal value of B(t) on [0, πr0 ]. For any given ρ > 0 with (0.6)
(n−1)/n
R0
< ρ < r0 ,
there exists a metric of positive Ricci curvature on S n × [0, 1] with the following properties: • S n × {0} is isometric to a round sphere of radius ρ/λ; • the principal curvatures of S n × {0} are greater than −λ; • S n × {1} is isometric to (S n , g) and its principal curvatures are greater than 1. We refer to a metric of this form as a neck of Perelman. In our construction, we look for removing balls Bri (oi ) and gluing in a rescaled neck of Perelman. That is why we make the following definition. Definition 0.7. Let N n be a compact hypersurface in M n+1 . We denote by II its second fundamental form, SN n its unit tangent bundle, and KN its intrinsic curvature. We say that N n satisfies Perelman’s property if for all x ∈ N and all linearly independent X, Y in Sx N n , KN (X ∧ Y ) > maxn II2 (Z, Z). Z∈SN
It is also possible to construct a metric with positive Ricci curvature and convex boundary on CP n with a point removed (see [P] for details). Therefore, if the boundary of a ball in M n+1 has a rotationally symmetric metric and satisfies Perelman’s property, we can rescale it by the inverse of the maximum of the principal curvatures, use a neck of Perelman if R0 (which appears in the definition of the neck) is sufficiently small, and glue in a CP n . What is important is that the gluing has positive Ricci curvature, thanks to the neck of Perelman. See [M2] for another construction using a neck of Perelman. Note that in higher even dimension, we can modify slightly the construction of Theorem 0.2 and glue in CP n ’s instead of CP 2 ’s. The main point of this paper then is to smooth the metric near the singular set and check Perelman’s property for some balls Bri (oi ). From C. Sormani [So] we
DIAMETER GROWTH AND INFINITE TOPOLOGICAL TYPE
405
know that the annuli Br+1 (p) \ Br (p) are converging to cylinders (see also [ChCo] for almost rigidity results in the case of almost maximal volume). Therefore, our glued-in CP 2 ’s have to be smaller and smaller. If we apply the stability theorem of T. H. Colding [Co1], we know that at least one cross section of the cylinders has to be a singular space, and in our case the singular cross section looks like a lemon. See also [Co2] for background and general results about manifolds with lower-bounded Ricci curvature. The main difference between this construction and the one in [M2] is the way we smooth the metric near the singular set. In [M2] we smoothed the metric slowly and progressively in order to have both positive Ricci curvature and Perelman’s property. Namely, the metric of the main block was written as g#(r) (x), where #(r) = 1/ logβ r and r denotes the distance from a base point (see [M2, Sections 1.2 and 1.3]; compare also with Section 1.2 of this paper). Roughly speaking, # measures the difference between g# (x) and the metric of a lemon. In the case of bounded diameter growth, the curvatures of the spherical sectors are big, and therefore the width of these sectors decreases rapidly. As a consequence the radii of the balls Bri (oi ) and the parameter #(r) decrease rapidly too. If we apply the same smoothing as in [M2], some very negative Ricci curvatures are created along the singular set. The new idea of this paper is to use the neck of Perelman not only to glue in CP 2 ’s but also to smooth the metric. The parameter # now depends on the balls Bri (oi ) that are removed and not on r. The function #(r) is constant except inside Bri (oi ), where it changes a lot. Therefore, the created negative Ricci curvature is localized inside the balls Bri (oi ). However, a slight modification of the metric g# (x) used in [M2] enables us to keep very good control of the principal curvatures of ∂Bri (oi ) (see Lemma 1.26 and Section 1.4; compare with [M2, Section 1.6] and [P, Section 3]). After removing Bri (oi ) and gluing in a CP 2 with a point removed, the Ricci curvature is positive. Finally, we point out that the construction of Theorem 0.2 can be adapted to the noncollapsing case. This then gives a new, shorter, and more elegant construction than the one of [M2]. Acknowledgment. This paper is part of my dissertation [M1]. I would like to thank sincerely Professor T. H. Colding for his advice, remarks, and encouragement. 1. Construction 1.1. Main block. The metric of our manifold is given by (1.1)
ds 2 = dt 2 + u2 (t) dx 2 + f (t, x)2 dσ 2 ,
where dσ 2 is the standard metric of the unit 2-sphere. T , X, 1 , and 2 form an orthonormal basis corresponding to the forms dt 2 , dx 2 , dσ 2 . Let ti ≥ 2 be any increasing sequence going to infinity to be determined. We define inductively the function u(t) and sequences ui , ui , Ki , ψi , ri , oi as follows:
406
X. MENGUY
(1.2)
u1 = u(t1 ) = 1
(1.3)
ui = u(ti )
(1.4) (1.5) (1.6) (1.7) (1.8) (1.9)
and
u1 = ut (t1 ) =
and ui = ut (ti ); 1 − u 2 i Ki = ; u2i
2 Ki ψi = 1 − ui ; sin
1 ; t12
π ψi ri < √ − ; 2 4 Ki oi = (t = ti + ri , x = 0);
1 Ki t − ti + ψi , for ti < t < ti + 2ri ; u(t) = √ sin Ki ti+ 2 ut (t) = ut ti + 2ri , for ti + 2ri < t < ti+1 . t
The function f (x, t) is defined for t > t1 in Section 1.2. The metric is defined near the origin, that is, for t < t1 , in Section 1.6. From now on, we suppose that ri is sufficiently rapidly decreasing so that the diameter growth is bounded. 1.2. Smoothing near the singular set. In this subsection, we construct the C 2 function f (t, x) in order to have both Perelman’s property at the boundary of Bri (oi ) and positive Ricci curvature everywhere outside these balls. First, we define a function f# (x) as follows: Let R and # be two constants such that (1.10)
Let θ be a C 2 function such that (1.11)
1 0<#< . 2
0 < R < 1,
θ=
1, for x ≤ 0, 0, for x ≥ 1,
and set (1.12)
θ# (x) = θ
x −# . # 1/4 − #
Let b, l, δ be positive constants to be determined. For 0 ≤ x ≤ b, set (1.13)
f# (x) =
1 sin(lx); l
DIAMETER GROWTH AND INFINITE TOPOLOGICAL TYPE
407
for b ≤ x ≤ #, set (1.14)
f# (x) =
and for # ≤ x ≤ π/2, set (1.15)
x 1 sin(lb) exp (1 − #) log ; l b
f# (x) = R sin x + δθ# (x) .
Extend f# to π/2 ≤ x ≤ π by symmetry so that f# (π − x) = f# (x). We determine b, l, and δ such that f# is C 1 at x = b and x = #. From (1.13), (1.14), and (1.15), we find the following C 1 gluing conditions: (1.16) (1.17) (1.18)
1 sin(lb)# 1−# b#−1 = R sin(# + δ), l (f# )x 1−# = cot(# + δ), (#) = f# # (f# )x 1−# . (b) = l cot(lb) = f# b
f# (#) =
To solve the system, we look for a solution expandable in power series of #. We first solve the two last equations. Equation (1.17) gives δ as a power series of #. The last equation (1.18) gives the product lb as a power series of # times a square root of #: (1.19) (1.20)
δ = # 2 + L.O.T., √ lb = 3# + L.O.T.
(L.O.T. means lower-order terms). The first equation (1.16) then gives l: # # lb sin lb l# = (1.21) . lb R sin(# + δ) # From (1.19), (1.20), and (1.21), we see that l is going to infinity when # is going to zero. The following lemma is useful in proving that the Ricci curvature of the main block is positive outside the balls Bri (oi ). Lemma 1.22. For all η > 0, there exists #0 > 0 such that if # < #0 , then (1.23) (1.24)
(f# )xx ≥ 1 − η, f# 1 − (f# )2x ≥ 1 − η. f#2 −
Proof. Note first that it suffices to prove (1.23), since (1.24) follows from (1.23) and the Ricatti comparison theorem. From (1.13), we have −(f# )xx /f# = l 2 ≥ 1 for
408
X. MENGUY
0 < x < b and sufficiently small #. From (1.14), we get −(f# )xx /f# = #(1−#)/x 2 > 1 for b < x < #. For # < x < 1, we calculate from (1.15) that (1.25)
−
2 (f# )xx = 1 + δθ# (x) − cot x + δθ# (x) δθ# (x). f#
From (1.12), (1.19), and (1.25), we see that | − (f# )xx /f# − 1| is as small as we want for sufficiently small #. For 1 ≤ x ≤ π/2, −(f# )xx /f# = 1, which proves eventually the lemma. The following lemma is crucial to control the principal curvatures of ∂Bri (oi ). Lemma 1.26. There exists #0 > 0 such that if # < #0 , then for 0 ≤ x ≤ 1, (f# )x (1.27) − cot x < 2#. A(x) = tan(x) f# Proof. From Lemma 1.22 and a standard comparison argument, we get that 1 (f# )x < . f# x
(1.28) For b ≤ x ≤ #, (1.14) gives
1−# (f# )x , = f# x
(1.29) and for x ≤ b, (1.13) gives (1.30)
1−# (f# )x . ≥ f# x
Therefore, from (1.28), (1.29), and (1.30), we get that (1.31)
A(x) ≤
# tan x < 2# x
for x ≤ # and # sufficiently small. For # ≤ x ≤ 1, we get from (1.15) that (1.32) and thus (1.33)
(1.34)
(f# )x = cot x + δθ# (x) 1 + δθ #(x) , f# tan(x) 1 + δθ# (x) − 1 , A(x) = tan x + δθ# (x) A(x) < 2#,
for # sufficiently small. This proves the lemma.
DIAMETER GROWTH AND INFINITE TOPOLOGICAL TYPE
409
Remark 1.35. We see that f# can be smoothed near x = b and x = # to a C 2 function satisfying Lemmas 1.22 and 1.26. We now define the function f (x, t) for t > t1 . Let #i be a decreasing sequence converging to zero to be determined. For ti + 3ri /2< t < ti+1 + ri+1 /2, we set (1.36)
f (t, x) = f#i+1 (x),
and for ti + ri /2 < t < ti + 3ri /2, we set (1.37)
f (t, x) = f#(t) (x),
where #(t) is bounded by #i+1 and #i . For t1 < t < t1 + r1 /2, we set (1.38)
f (t, x) = f#1 (x).
1.3. Calculation of curvatures. In this subsection, we prove that the Ricci curvature of the main block is positive outside the balls Bri (oi ) provided that #i is sufficiently rapidly decreasing. Indeed, we assume that the sequence #i is decreasing so rapidly that ft = 0 outside Bri (oi ). We then compute the curvature outside the ball Bri (oi ) as follows: utt (1.39) K(X, T ) = K 1 , T = K 2 , T = − , u fxx ut 2 (1.40) , K X, 1 = K X, 2 = − 2 − u fu 1 − (fx )2 ut 2 (1.41) − . K 1 , 2 = u u2 f 2 The mixed curvature is zero. From Lemma 1.22, (1.8), (1.9), and the above calculations, the Ricci curvature is then positive, provided that #1 is small enough. 1.4. Establishing Perelman’s property. In this subsection, we prove that if #i is sufficiently rapidly decreasing, the intrinsic curvatures of ∂Bri (oi ) are greater than the square of the principal curvatures. This permits us to glue in a CP 2 with a point removed via a neck of Perelman. As in Section 1.3, we suppose that #i is sufficiently rapidly decreasing such that ft = 0 on ∂Bri (oi ). Note that in this case for (t, x) ∈ ∂Bri (oi ), f (t, x) = f# (x) with # = #i or #i+1 . We denote by N the outward pointing unit vector to T ∂Bri (oi ). Let θ be given by (1.42)
N = X sin θ + T cos θ.
Define Y by (1.43)
Y = X cos θ − T sin θ.
First, we calculate the second fundamental form of T ∂Bri (oi ):
410 (1.44)
X. MENGUY
ut (f# )x cos θ + sin θ. II 1 , 1 = II 2 , 2 = ∇1 N | 1 = u f# u
Using the√fact that the metric of the T -X-plane is locally isometric to the 2-sphere of radius 1/ Ki for ti < t < ti + 2ri (see (1.8)), we get
II(Y, Y ) = Ki cot K i ri . (1.45) II(·,·) is diagonal in the orthonormal basis 1 , 2 , and Y . The intrinsic curvatures are then calculated by (1.44), (1.45), and the Gauss equations so that (1.46) and
(1.47)
2 (f# )x ut K N 1 , 2 = K 1 , 2 + cos θ + sin θ u f# u KN Y, 1 = KN Y, 2 = K X, 1 cos2 θ + K T , 1 sin2 θ
u (f# )x t Ki cos θ + sin θ . + Ki cot u f# u
The mixed curvature is zero. We note √ that if f# (x) = sin x, the metric is locally isometric to the 4-sphere of radius 1/ Ki (see (1.8)), and therefore
ut cot x cos θ + sin θ = Ki cot Ki r i . (1.48) u u Combining (1.44) and (1.48), we get
(1.49)
(f ) sin θ # x K i ri = − cot x II 1 , 1 − Ki cot f# u cot x| sin θ| ≤ A(x) u
u t ≤ A(x) Ki cot Ki ri + | cos θ| u 1
≤ A(x) ; Ki cot K i ri + u
consequently, by Lemma 1.26,
4# (1.50) K i ri ≤ . II 1 , 1 − Ki cot ri Conclusion. From (1.45), (1.46), (1.47), (1.50), and Section 1.3, we conclude that Perelman’s property is satisfied, provided that #i is sufficiently rapidly decreasing.
DIAMETER GROWTH AND INFINITE TOPOLOGICAL TYPE
411
1.5. Gluing in CP 2 ’s. To glue a CP 2 in at ∂Bri (oi ) via a neck of Perelman, we need to check that ∂Bri (oi ) rescaled by (1.51)
Ki cot
4# K i ri + ri
satisfies (0.6) for sufficiently small #. If #i and #i+1 are going to zero, then the metric of Bri (oi ) is converging to (1.52)
ds 2 = dt 2 + u2 (t) dx 2 + R 2 sin2 (x)dσ 2 .
Bri (oi ) is then a ball of a singular 4-sphere with center on the singular set. Therefore, after rescaling by (1.50) with # = 0, we get that
r0 = cos Ki ri , (1.53)
R0 = R cos Ki ri . (1.54) Therefore, (0.6) is satisfied if #i is sufficiently rapidly decreasing and if R is smaller than some fixed constant. Conclusion. Combining the result of this section with the conclusion of Section 1.4, we can glue a CP 2 in at ∂Bri (oi ) via a neck of Perelman, provided that #i is sufficiently rapidly decreasing and R is small enough. 1.6. Smoothing near the origin. The submanifold defined by t = t1 satisfies Perelman’s property if t1 ≥ 2. Indeed, (1.55) (1.56) (1.57)
ut II(X, X) = II 1 , 1 = II 2 , 2 = , u fxx KN X, 1 = KN X, 2 = − 2 , fu 2 1 − (fx ) . KN 1 ∧ 2 = (uf )2
From (1.56), (1.57), and Lemma 1.22, the intrinsic curvatures are greater than 1/(2u2 ) for sufficiently small #1 ; from (1.55) and (1.2), the principal curvatures are smaller than 1/(4u). For t ≤ t1 , we then can continue the manifold via a neck of Perelman and glue the end of the neck in a ball of R4 in order to complete the metric. References [AG] [An]
U. Abresch and D. Gromoll, On complete manifolds with nonnegative Ricci curvature, J. Amer. Math. Soc. 3 (1990), 355–374. M. T. Anderson, Short geodesics and gravitational instantons, J. Differential Geom. 31 (1990), 265–275.
412 [AnKL] [ChCo] [C] [Co1] [Co2]
[M1] [M2] [P]
[SY]
[ShYg] [So]
X. MENGUY M. T. Anderson, P. Kronheimer, and C. LeBrun, Complete Ricci-flat Kähler manifolds of infinite topological type, Comm. Math. Phys. 125 (1989), 637–642. J. Cheeger and T. H. Colding, Lower bounds on Ricci curvature and the almost rigidity of warped products, Ann. of Math. (2) 144 (1996), 189–237. S. Cohn-Vossen, Totalkrümmung und geodätische Linien auf einfach zusammenhängenden offenen vollständigen Fläschenstücken, Recueil Math. Moscou 43 (1936), 139–163. T. H. Colding, Ricci curvature and volume convergence, Ann. of Math. (2) 145 (1997), 477–501. , “Spaces with Ricci curvature bounds” in Proceedings of the International Congress of Mathematicians (Berlin, 1998), Vol. 2, Documenta Mathematica, Bielefeld, 1998, 299–308; available from http://www.mathematik.uni-bielefeld.de/documenta/. X. Menguy, Examples of manifolds and spaces with positive Ricci curvature, Ph.D. thesis, Courant Institute, New York University, 2000. , Noncollapsing examples with positive Ricci curvature and infinite topological type, to appear in Geom. Funct. Anal. G. Perelman, “Construction of manifolds of positive Ricci curvature with big volume and large Betti numbers” in Comparison Geometry (Berkeley, 1993–94), Math. Sci. Res. Inst. Publ. 30, Cambridge Univ. Press, Cambridge, 1997, 157–163. R. Schoen and S. T. Yau, “Complete three-dimensional manifolds with positive Ricci curvature and scalar curvature” in Seminar on Differential Geometry, Ann. of Math. Stud. 102, Princeton Univ. Press, Princeton, 1982, 209–228. J. Sha and D.-G. Yang, Examples of manifolds of positive Ricci curvature, J. Differential Geom. 29 (1989), 95–103. C. Sormani, The almost rigidity of manifolds with lower bounds on Ricci curvature and minimal volume growth, to appear in Comm. Anal. Geom.
Courant Institute of Mathematical Sciences, 251 Mercer Street, New York, New York 10012, USA;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
RIGID LOCAL SYSTEMS, HILBERT MODULAR FORMS, AND FERMAT’S LAST THEOREM HENRI DARMON
Contents 1. Frey representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415 1.1. Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415 1.2. Classification: The rigidity method. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 416 1.3. Construction: Hypergeometric abelian varieties . . . . . . . . . . . . . . . . . . . . . . 419 2. Modularity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 428 2.1. Hilbert modular forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 428 2.2. Modularity of hypergeometric abelian varieties . . . . . . . . . . . . . . . . . . . . . . 430 3. Lowering the level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 434 3.1. Ribet’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 434 3.2. Application to x p + y p = zr . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 435 4. Torsion points on abelian varieties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 446 Introduction. Historically, two approaches have been followed to study the classical Fermat equation x r + y r = zr . The first, based on cyclotomic fields, leads to questions about abelian extensions and class numbers of K = Q(ζr ) and values of the Dedekind zeta-function ζK (s) at s = 0. Many open questions remain, such as Vandiver’s conjecture that r does not divide the class number of Q(ζr )+ . The second approach is based on modular forms and the study of 2-dimensional representations of ¯ Gal(Q/Q). Even though 2-dimensional representations are more subtle than abelian ones, it is by this route that Fermat’s last theorem was finally proved (cf. [Fre], [Se2], [Ri2], [W3], and [TW]; or [DDT] for a general overview). This article examines the equation x p + y q = zr .
(1)
¯ Certain 2-dimensional representations of Gal(K/K), where K is the real subfield of a cyclotomic field, emerge naturally in the study of equation (1), giving rise to a blend of the cyclotomic and modular approaches. The special values ζK (−1), which Received 23 September 1998. Revision received 23 June 1999. 1991 Mathematics Subject Classification. Primary 11G18; Secondary 11D41, 11F80, 11G05. Author’s work partially supported by grants from Natural Sciences and Engineering Research Council of Canada and from Fonds pour la Formation de Chercheurs et l’Aide à la Recherche, and by an Alfred P. Sloan research award. 413
414
HENRI DARMON
in certain cases are related to the class numbers of totally definite quaternion algebras over K, appear as obstructions to proving that (1) has no solutions. The condition that r is a regular prime also plays a key role in the analysis leading to one of our main results about the equation x p + y p = zr (see Theorem 3.22). One is interested in primitive solutions (a, b, c) to equation (1), that is, those satisfying gcd(a, b, c) = 1. (Such a condition is natural in light of the abc conjecture, for example; see also [Da2].) A solution is called nontrivial if abc = 0. It is assumed from now on that the exponents p, q, and r are prime and that p is odd. Let (a, b, c) be a nontrivial primitive solution to equation (1). One wishes to show that it does not exist. The program for obtaining the desired contradiction, following the argument initiated by Frey and brought to a successful conclusion by Wiles in the case of x p + y p = zp , can be divided into four steps. Step 1 (Frey, Serre). Associate to (a, b, c) a mod p Galois representation ¯ ρ : Gal(K/K) −→ GL2 (F) having “very little ramification,” that is, whose ramification can be bounded precisely and a priori independently of the solution (a, b, c). Here K is a number field and F is a finite field. For the Fermat equation x p + y p = zp , one may take K = Q and F = Z/pZ: the representation ρ is then obtained by considering the action of GQ on the p-division points of the Frey elliptic curve y 2 = x(x −a p )(x +b p ). As explained in Section 1, one is essentially forced to take K = Q(ζq , ζr )+ and F, the residue field of K at a prime above p, in studying equation (1). Step 2 (Wiles). Prove that ρ is modular, that is, arises from a Hilbert modular form on GL2 (AK ). In the setting of Fermat’s equation, Wiles proves that all semistable elliptic curves over Q arise from a modular form, which implies the modularity of ρ. Step 3 (Ribet). Assuming step 2, show that ρ comes from a modular form of small level, and deduce (in favorable circumstances) that its image is small, that is, contained in a Borel subgroup or in the normalizer of a Cartan subgroup of GL2 (F). In the setting of Fermat’s equation, Ribet showed that ρ has to be reducible; for reasons that are explained in Section 3, one cannot rule out the case where the image of ρ is contained in the normalizer of a Cartan subgroup when dealing with equation (1). Step 4 (Mazur). Show that the image of ρ is large; for example, that it contains SL2 (F). Historically, this is the step in the proof of Fermat’s last theorem that was carried out first, in the seminal papers [Ma1] and [Ma2], which also introduced many of the tools used in steps 2 and 3. In the classical setting, combining the conclusions of steps 3 and 4 leads to a contradiction and shows that (a, b, c) does not exist, thus proving Fermat’s last theorem. In [Da1] and [DMr], it was observed that the program above can be used to show that x p + y p = zr has no nontrivial primitive solutions when r = 2, 3 and p ≥ 6 − r (the
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
415
result for r = 3 being conditional on the Shimura-Taniyama conjecture, which is still unproved for certain elliptic curves whose conductor is divisible by 27). The purpose of this article is to generalize the analysis to the general case of equation (1). Sections 1, 2, 3, and 4 describe the generalizations of steps 1, 2, 3, and 4, respectively. As a concrete application, the main results of Section 3 relate solutions to x p +y p = zr to questions about p-division points of certain abelian varieties with real multiplications by Q(cos(2π/r)). Alas, our understanding of these questions (and of the arithmetic of Hilbert modular forms over totally real fields) is too poor to yield unconditional statements. For the time being, the methods of this paper should be envisaged as a way of tying equation (1) to questions that are more central, concerning Galois representations, modular forms, and division points of abelian varieties. Acknowledgements. The author is grateful to F. Diamond, J. Ellenberg, A. Kraus, and K. Ribet for their helpful comments, and to N. Katz and J.-F. Mestre for pointing out a key construction used in Section 1. The author greatly benefitted from the support of le Centre interuniversitaire en calcul mathématique algébrique (CICMA) and the hospitality of the Université Paris VI (Jussieu) and the Institut Henri Poincaré, where the work on this paper was started, and of the Eidgenössische Technische Hochschule (ETH) in Zürich, where it was completed. 1. Frey representations ¯ 1.1. Definitions. If K is any field of characteristic zero, write GK := Gal(K/K) for its absolute Galois group. Typically, K is a number field; let K(t) be the function field over K in an indeterminate t. The group GK(t) fits into the exact sequence 1 −→ GK(t) −→ GK(t) −→ GK −→ 1. ¯ Let F be a finite field, embedded in a fixed algebraic closure of its prime field. Definition 1.1. A Frey representation associated to the equation x p +y q = zr over K is a Galois representation = (t) : GK(t) −→ GL2 (F) satisfying the following conditions. (1) The restriction of to GK(t) has trivial determinant and is irreducible. Let ¯ −→ PSL2 (F) ¯ geom : GK(t) ¯ be the projectivization of this representation. (2) The homomorphism ¯ geom is unramified outside {0, 1, ∞}. (3) It maps the inertia groups at 0, 1, and ∞ to subgroups of PSL2 (F) of order p, q, and r, respectively. The characteristic of F is also called the characteristic of the Frey representation. One should think of = (t) as a 1-parameter family of Galois representations of
416
HENRI DARMON
GK indexed by the parameter t. Condition (1) in Definition 1.1 ensures that this family has constant determinant but is otherwise “truly varying” with t. The motivation for the definition of (t) is the following. Lemma 1.2. There exists a finite set of primes S of K depending on in an explicit way, such that, for all primitive solutions (a, b, c) to the generalized Fermat equation x p + y q = zr , the representation ρ := (a p /cr ) has a quadratic twist that is unramified outside S. Sketch of proof. Let ¯ : GK(t) −→ PGL2 (F) be the projective representation deduced from . The field fixed by the kernel of ¯ is a finite extension of K(t), whose Galois group is identified with a subgroup G of PGL2 (F) by ; ¯ in other words, it is the function field of a G-covering of P1 over K. This covering is unramified outside {0, 1, ∞} and its ramification indices are p, q, and r above those three points: it is a G-covering of “signature (p, q, r)” in the sense of [Se3, Sec. 6.4]. The lemma now follows from a variant of the Chevalley-Weil theorem for branched coverings (see, e.g., [Be] or [Da2]). Definition 1.3. Two Frey representations 1 and 2 attached to equation (1) are said to be equivalent if their corresponding projective representations ¯ 1 and ¯ 2 differ ¯ that is, if 1 is conjugate (over F) ¯ to a central by an inner automorphism of PGL2 (F), twist of 2 . To a Frey representation we assign a triple (σ0 , σ1 , σ∞ ) of elements in PSL2 (F) of orders p, q, and r satisfying σ0 σ1 σ∞ = 1 as follows (cf. [Se3, Ch. 6]). The element σj is defined as the image by ¯ geom of a generator of the inertia subgroup of GK(t) at ¯ t = j . The elements σ0 , σ1 , and σ∞ are well defined up to conjugation, once primitive p, q, and rth roots of unity have been chosen. One can choose the decomposition groups in such a way that the relation σ0 σ1 σ∞ = 1 is satisfied (cf. [Se3, Th. 6.3.2]). The triple (σ0 , σ1 , σ∞ ) is then well defined up to conjugation. If Cj is the conjugacy class of σj in PSL2 (F), one says that the Frey representation is of type (C0 , C1 , C∞ ). For the following definition, assume that the exponents p, q, and r are odd, so that σ0 , σ1 , and σ∞ lift to unique elements σ˜ 0 , σ˜ 1 , and σ˜ ∞ of SL2 (F) of orders p, q, and r, respectively. Definition 1.4. The Frey representation attached to x p + y q = zr is said to be odd if σ˜ 0 σ˜ 1 σ˜ ∞ = −1, and is said to be even if σ˜ 0 σ˜ 1 σ˜ ∞ = 1. 1.2. Classification: The rigidity method. If n is an integer, let ζn denote a primitive√nth root of unity. Given an odd prime p, write p ∗ := (−1)(p−1)/2 p, so that Q( p ∗ ) is the quadratic subfield of Q(ζp ). We now turn to the classification of Frey representations, beginning with the classical Fermat equation.
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
417
The equation x p + y p = zp Theorem 1.5. Let p be an odd prime. There is a unique Frey representation (t) of characteristic p (up to equivalence) associated to the Fermat equation x p + y p = zp . One may take K = Q and F = Fp , and the representation (t) is odd. Remark. This theorem is originally due to Hecke [He], where it is expressed as a characterization of a certain field of modular functions of level p. Proof of Theorem 1.5. Set F = Fp . We begin by classifying conjugacy classes of triples σ0 , σ1 , and σ∞ of elements of order p in PSL2 (F) satisfying σ0 σ1 σ∞ = 1. There are two conjugacy classes of elements of order p in PSL2 (F), denoted pA and pB, respectively. The class pA (resp., pB) is represented by an upper-triangular unipotent matrix whose upper right-hand √ ∗ entry is a square (resp., a nonsquare). These two classes are rational over Q( p ) in the sense of √ [Se3, Sec. 7.1], and they are interchanged by the nontrivial element in Gal(Q( p ∗ )/Q) as well as by the nontrivial outer automorphism of PSL2 (F). Lift σ0 , σ1 , and σ∞ to elements σ˜ 0 , σ˜ 1 , and σ˜ ∞ of order p in SL2 (F). The group SL2 (F) acts on the space V = F2 of column vectors with entries in F. Since σ˜ j is unipotent, there are nonzero vectors v1 and v2 in V which are fixed by σ˜ 0 and σ˜ 1 , respectively. Because σ0 and σ1 do not commute, the vectors for V . Scale v2 so that σ˜ 0 is expressed by the matrix 1 1 v1 and v2 form 1a 0basis in this basis; let be the matrix representing σ˜ 1 . Since σ˜ ∞ has trace 2, the 01 x 1 −1 relation σ˜ 0 σ˜ 1 = σ˜ ∞ forces x = 0, which is impossible since σ1 is of order p. Hence −1 there are no even Frey representations of characteristic p. The relation σ˜ 0 σ˜ 1 = −σ˜ ∞ gives x = −4. Note that the resulting elements σ0 , σ1 , and σ∞ belong to the same conjugacy class in PSL2 (F). It is well known that they generate PSL2 (F). Hence there are exactly two distinct conjugacy classes of surjective homomorphisms geom
¯ A
geom
, ¯ B
: GQ(t) −→ PSL2 (F), ¯
of type (pA, pA, pA) and (pB, pB, pB), respectively, which are interchanged by the outer automorphism of PSL2 (F). By the rigidity theorem of Bely˘ı, Fried, Thompson, geom geom and Matzat (cf. [Se3, Sec. 7]), ¯ A and ¯ B extend uniquely to homomorphisms ¯ A , ¯ B : GQ(t) −→ PGL2 (F) = Aut PSL2 (F) , which are conjugate to each other. Thus there is at most one Frey representation attached to x p + y p = zp , whose corresponding projective representation ¯ is conjugate to ¯ A and ¯ B . To prove the existence of , it is necessary to show that ¯ A (say) lifts to a linear representation GQ(t) → GL2 (F). Choose a set-theoretic lifting s of ¯ A to GL2 (F) satisfying det(s(x)) = χ(x), where χ is the mod p cyclotomic character, and note that such a lifting satisfies s(x)s(y) = ±s(xy). Hence, the obstruction to lifting ¯ A to a homomorphism into GL2 (F) is given by a cohomology class c(x, y) := s(x)s(y)s(xy)−1 in H 2 (Q(t), ±1). We note that (for j = 0, 1, and ∞) the
418
HENRI DARMON
homomorphism ¯ A maps the decomposition group at t = j to the normalizer of σj , which is the image in PGL2 (Fp ) of a Borel subgroup B of upper-triangular matrices. Since the inclusion F× p → B splits, it follows that the restrictions c0 , c1 , and c∞ of the cohomology class c in H 2 (Q((t)), ±1), H 2 (Q((t −1)), ±1), and H 2 (Q((1/t)), ±1) vanish. In particular, c has trivial “residues” at t = 0, 1, ∞ in the sense of [Se4, Ch. II, Annexe, Sec. 2]. Hence, c is “constant,” that is, comes from H 2 (Q, ±1) by inflation (see [Se4, Ch. II, Annexe, Sec. 4]). But note that H 2 (Q, ±1) injects into H 2 (Q((t)), ±1), since a nontrivial conic over Q cannot acquire a rational point over Q((t)). Therefore, the class c vanishes, and the result follows. (For an alternate and, perhaps, less roundabout argument, see [By].) The equation x p + y p = zr . Let us now turn to the equation x p + y p = zr , where r and p are distinct primes. One is faced here with the choice of considering Frey representations either of characteristic p or of characteristic r. From now on, we adopt the convention that the prime p is always used to denote the characteristic of the Frey representation, so that the equations x p + y p = zr and x r + y r = zp require seperate consideration. The following theorem is inspired from the proof given in [Se3, Prop. 7.4.3 and 7.4.4] for the case r = 2 and r = 3; the general case follows from an identical argument (see also [DMs]). Theorem 1.6. Suppose that r and p are distinct primes and that p = 2. There exists a Frey representation of characteristic p over K associated to x p + y p = zr if and only if (1) the field F contains the residue field of Q(ζr )+ at a prime p above p, and (2) the field K contains Q(ζr )+ . When these two conditions are satisfied, there are exactly r − 1 Frey representations up to equivalence. When r = 2, exactly (r − 1)/2 of these are odd and (r − 1)/2 are even. Proof. for condition (3) in Definition 1.1 to be satisfied, it is necessary that PSL2 (F) contain an element of order r. This is the reason for condition (1) in Theorem 1.6. Condition (2) arises from the fact that (for r = 2) the (r − 1)/2 distinct conjugacy classes of elements of order r in PSL2 (F) are rational over Q(ζr )+ (in the sense of [Se3, Sec. 7.1]) and are not rational over any smaller extension. Assume conversely that conditions (1) and (2) are satisfied. Let σ0 , σ1 , and σ∞ be chosen as in the proof of Theorem 1.5, and let σ˜ j be the lift of σj to SL2 (F) of order p when j = 0, 1. Finally, let σ˜ ∞ be a lift of σ∞ to an element of order r if r is odd and to an element of order 4 if r = 2. Let ω¯ ∈ F be the trace of σ˜ ∞ . When r = 2, one has ω¯ = 0, and when r is odd, ω¯ is of the form ϕ(ζr + ζr−1 ) where ϕ is a homomorphism from Z[ζr + ζr−1 ]+ to F. Note that there are exactly (r − 1)/2 such ϕ’s. One now finds, as in the proof of Theorem 1.5, that (σ˜ 0 , σ˜ 1 , σ˜ ∞ ) is conjugate to one of the following two triples:
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
1 0 1 0
1 1 , −(2 + ω) ¯ 1 1 1 , −(2 − ω) ¯ 1
419
−1 1 0 , , −2 − ω¯ 1 + ω¯ 1 1 −1 0 . , 2 − ω¯ −1 + ω¯ 1
When r = 2, these triples are equal in PSL2 (F). When r is odd, they are distinct. An argument based on rigidity as in the proof of Theorem 1.5 produces (r − 1) inequivalent homomorphisms from GK(t) to GL2 (F), yielding the desired odd and even Frey representations. These Frey representations are constructed explicitly in Section 1.3 (cf. Lemma 1.9 and Theorem 1.10). The equation x r + y r = zp Theorem 1.7. Suppose that r and p are distinct odd primes. There exists a Frey representation of characteristic p over K associated to x r + y r = zp if and only if (1) the field F contains the residue field of Q(ζr )+ at a prime p above p, and (2) the field K contains Q(ζr )+ . When these two conditions are satisfied, there are exactly (r − 1)(r − 2)/2 inequivalent Frey representations: (r − 1)2 /4 odd representations and (r − 1)(r − 3)/4 even representations. Although the conclusion is somewhat different, the proof of Theorem 1.7 follows the same ideas as the proof of Theorem 1.6. Each triple (C0 , C1 , pA), where C0 and C1 each range over the (r − 1)/2 possible conjugacy classes of elements of order r in PSL2 (F), gives rise to a unique odd and even projective representation of GK(t) of type (C0 , C1 , pA), with one caveat: There is no even representation of type (C0 , C1 , pA) when C0 = C1 . The equation x p + y q = zr . We finally come to the general case of equation (1). Assume that the exponents p, q, and r are distinct primes and that p is odd. Theorem 1.8. There exists a Frey representation of characteristic p over K associated to x p + y q = zr if and only if (1) the field F contains the residue fields of Q(ζq )+ and of Q(ζr )+ at a prime p above p, and (2) the field K contains Q(ζq )+ and Q(ζr )+ . When these two conditions are satisfied, there are (r − 1)(q − 1)/2 inequivalent Frey representations over Q(ζq , ζr )+ . If q, r = 2, then (r − 1)(q − 1)/4 of these are odd and (r − 1)(q − 1)/4 are even. The proof is the same as for Theorems 1.5, 1.6, and 1.7. 1.3. Construction: Hypergeometric abelian varieties The equation x p + y p = zp . One can construct the Frey representation (t) of Theorem 1.5 explicitly, by considering the Legendre family of elliptic curves
420
HENRI DARMON
J = J (t) : y 2 = x(x − 1)(x − t). It is an elliptic curve over Q(t) which has multiplicative reduction at t = 0 and 1 and has potentially multiplicative reduction at t = ∞. The module J [p] of its p-division points is a 2-dimensional F-vector space on which GQ(t) acts linearly. The corresponding representation (t) is the Frey representation of characteristic p attached to x p + y p = zp . The equation x p +y p = zr . When r = 2, let C2 (t) be the elliptic curve over Q(t) given by the equation C2 (t) : y 2 = x 3 + 2x 2 + tx.
(2)
Lemma 1.9. The mod p Galois representation associated to C2 is the Frey representation associated to x p + y p = z2 . The proof of this lemma is omitted. It follows the same ideas but is simpler than the proof of Theorem 1.10 for the case of odd r, for which all the details are given. j −j Suppose now that r is an odd prime. Let ωj = ζr + ζr , and write ω for ω1 , so that K = Q(ω) is the real subfield of the cyclotomic field Q(ζr ). Let ᏻK denote its ring of integers, and let d = (r − 1)/2 be the degree of K over Q. Let g(x) = dj =1 (x +ωj ) be the characteristic polynomial of −ω, and let f (x) be an antiderivative of ±rg(x)g(−x); for example, we take f (x) = xg x 2 − 2 = g(−x)2 (x − 2) + 2 = g(x)2 (x + 2) − 2. Following [TTV], consider the following hyperelliptic curves over Q(t) of genus d: Cr− (t) : y 2 = f (x) + 2 − 4t, Cr+ (t) : y 2 = (x + 2) f (x) + 2 − 4t .
(3) (4)
Let Jr− = Jr− (t) and Jr+ = Jr+ (t) be their Jacobians over Q(t). In [TTV], Tautz, Top, and Verberkmoes show that these families of hyperelliptic curves have real multiplications by K, that is, that ± EndQ(t) Jr ᏻK . (5) ¯ Their proof shows that the endomorphisms of Jr± are in fact defined over K, and that the natural action of Gal(K/Q) on EndK(t) (Jr± ) and on ᏻK are compatible with the identification of equation (5), which is canonical (see also [DMs]). Fix a residue field F of K at a prime above p, and let ϕ be a homomorphism of ᏻK to F. The module Jr± [p] ⊗ϕ F is a 2-dimensional F-vector space on which GK acts F-linearly. By choosing an F-basis for this vector space, one obtains Galois representations (depending on the choice of ϕ, although this dependence is supressed from the notation) r± (t) : GK(t) −→ GL2 (F).
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
421
Theorem 1.10. The representations r− (t) and r+ (t) (as ϕ varies over the (r − 1)/2 possible homomorphisms from ᏻK to F) are the r − 1 distinct Frey representations of characteristic p associated to x p +y p = zr . The representations r− are odd, and the representations r+ are even. Proof. (See also [DMs, Prop. 2.2 and 2.3].) Observe the following. (1) Outside of t = 0, 1, ∞, the curves Cr± (t) have good reduction. Hence r± (t) satisfies condition (2) in Definition 1.1 of a Frey representation. (2) The Cr± (t) are Mumford curves over Spec(K[[t]]) and Spec(K[[t −1]]), that is, the special fiber of Cr± (t) over these bases is a union of projective lines intersecting transversally at ordinary double points. For example, replacing y by 2y +(x +2)g(x) yields the following equation for Cr+ (t) over Spec(K[[t]]), whose special fiber is the union of two projective lines crossing at the d + 1 ordinary double points (x, y) = (−2, 0), (−ωj , 0): y 2 + (x + 2)g(x)y + t (x + 2) = 0.
(6)
Likewise, replacing y by 2y + xg(−x) gives the following equation for Cr+ (t) over Spec(K[[t − 1]]): y 2 + xg(−x)y + g(−x)2 + (x + 2)(t − 1) = 0.
(7)
Its special fiber is a projective line with the d ordinary double points (x, y) = (ωj , 0). A similar analysis can be carried out for Cr− (t). By Mumford’s theory, the Jacobians Jr± (t) have purely toric reduction at t = 0 and t = 1, and hence r± maps the inertia at these points to unipotent elements of SL2 (F). (3) The curve Cr− (t) has a quadratic twist that acquires good reduction over K[[(1/t)1/r ]], while Cr+ (t) acquires good reduction over this base. For example, setting t˜ = (1/t)1/r and replacing x by 1/x and y by (2y + 1)/x (r+1)/2 in equation (4) for Cr+ (t) gives the model y 2 + y = x r + t˜h(x, y, t˜/2),
(8) r− (resp., r+ ) SL2 (F) whose
where h is a polynomial with coefficients in Z. Therefore maps the inertia at t = ∞ to an element of order 2r (resp., r) of image in PSL2 (F) is of order r. It follows from (2) and (3) that r± (t) satisfies condition (3) in Definition 1.1. (4) A strong version of condition (1) in Definition 1.1 now follows from the following group-theoretic lemma. Lemma 1.11. Let σ0 , σ1 , and σ∞ be elements of PSL2 (F) of order p, p, and r satisfying σ0 σ1 σ∞ = 1. Then σ0 , σ1 , and σ∞ generate PSL2 (F) unless (p, r) = (3, 5) and σ˜ 0 σ˜ 1 σ˜ ∞ = −1, in which case they generate an exceptional subgroup isomorphic to A5 ⊂ PSL2 (F9 ). Proof. Let G be the subgroup of PSL2 (F) generated by the images of σ0 , σ1 , and σ∞ . The proper maximal subgroups of PSL2 (F) are conjugate to one of the groups
422
HENRI DARMON
in the following list (cf., e.g., [Hu, Ch. II.8, Th. 8.27]): (1) the Borel subgroup of upper-triangular matrices; (2) the normalizer of a Cartan subgroup; (3) a group isomorphic to PSL2 (F ) or PGL2 (F ) for some F ⊂ F; (4) one of the exceptional subgroups A4 , S4 , or A5 . The fact that G contains two unipotent elements that do not commute rules out the possibility that G is contained in a Borel subgroup or in the normalizer of a Cartan subgroup, and the fact that it contains an element of order r rules out the groups isomorphic to PSL2 (F ) or PGL2 (F ). Obviously, G can be contained in one of the exceptional subgroups only if both p and r are less than or equal to 5, that is, if (r, p) = (2, 3), (2, 5), (3, 5), or (5, 3). In the first three cases, G is isomorphic to PSL2 (Fp ). (Note that PSL2 (F3 ) A4 and that PSL2 (F5 ) A5 .) When (r, p) = (5, 3) and σ˜ 0 σ˜ 1 σ˜ ∞ = −1, one checks directly that G is isomorphic to the exceptional subgroup A5 ⊂ PSL2 (F9 ). The equation x r +y r = zp . Choose a parameter j ∈ {1, 3, 5, . . . , r −2}, and define curves over the function field Q(t) by the equations j +2 − 2r 2 j −2 x − 1 , Xr,r (t) : y = u x x −u j +2 t + r 2 j −2 x − 1 Xr,r (t) : y = u x . , u= x −u t −1 A role is played in our construction by the Legendre family J (t) of elliptic curves, whose equation we write in the more convenient form: x − 1 j +2 J (t) : y 2 = u2 x j −2 . x −u These curves are equipped with the following structures. − and X + defined by (1) A canonical action of µr on Xr,r r,r ζ (x, y) = (x, ζy),
ζ ∈ µr .
− , X + , and J defined by (2) An involution τ on Xr,r r,r
τ (x, y) = (u/x, 1/y). + and has no fixed points on X − and on J . This involution has two fixed points on Xr,r r,r − − + are defined by (3) Maps π : Xr,r → J and πr : Xr,r → Xr,r π(x, y) = (x, y r ); πr (x, y) = x, y 2 .
These maps obey the rules τ ζ = ζ −1 τ,
π ζ = π,
πr ζ = ζ 2 πr ,
τ π = πτ,
τ πr = πr τ.
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
423
Let ± ± Cr,r = Xr,r /τ,
J = J /τ.
− to J and C + , The maps π and πr commute with τ and hence induce maps from Cr,r r,r respectively, which are denoted by the same letters by abuse of notation. Write π ∗ + , and C − induced by π and and πr∗ for the maps between the Jacobians of J , Cr,r r,r + denote the Jacobian of πr , respectively, by contravariant functoriality. Finally let Jr,r + , and let J − be the quotient of the Jacobian Jac(C − ) of C − defined by Cr,r r,r r,r r,r
− ∗ + − Jr,r := Jac Cr,r π (J ) + πr∗ Jr,r . + (resp., J − ) have dimension equal to Proposition 1.12. The abelian varieties Jr,r r,r (r − 1)/2 when j ∈ {1, 3, 5, . . . , r − 4} (resp., j ∈ {1, 3, 5, . . . , r − 2}). In these cases there is a natural identification
± EndK Jr,r = ᏻK , which is compatible with the action of Gal(K/Q) on each side. ± is a direct calculation based on Proof. The computation of the dimension of Jr,r ± , let the Riemann-Hurwitz formula. To study the endomorphism rings of Jr,r ± ± ± ηζ : Xr,r −→ Cr,r × Cr,r ± to C ± given by η := (pr, pr ◦ζ ), where pr is the be the correspondence from Cr,r ζ r,r ± ± . The resulting endomorphism η of Pic(C ± ) is natural projection of Xr,r to Cr,r ζ r,r defined (on effective divisors) by the equation
ηζ (pr P ) = pr(ζ P ) + pr(ζ −1 P ). The commutation relations between ζ , π, and πr show that π ηζ = 2π,
π r η ζ = η ζ 2 πr .
+ ) of Jac(C − ) are preserved by these correHence, the subvarieties π ∗ (J ) and πr∗ (Jr,r r,r − as well as of J + . The assignment spondences, which induce endomorphisms of Jr,r r,r ± ). It is an isomorphism since J ± has ζ → ηζ yields an inclusion of ᏻK into End(Jr,r r,r multiplicative reduction at t = ∞ and hence is not of complex multiplication (CM) type. The result follows. ± be the Galois repreChoose as before a homomorphism ϕ : ᏻK → F, and let r,r ± [p] ⊗ F. Note that sentations obtained from the action of GK(t) on the modules Jr,r ϕ ± depend on the choice of the parameter j as well as on the the representations r,r choice of ϕ.
424
HENRI DARMON
− , as j ranges over {1, 3, . . . , r −2} and Theorem 1.13. (1) The representations r,r ϕ over the different homomorphisms ᏻK → F, are the (r − 1)2 /4 distinct odd Frey representations attached to x r + y r = zp . + , as j ranges over {1, 3, . . . , r − 4} and ϕ over the (2) The representations r,r different homomorphisms ᏻK → F, are the (r − 1)(r − 3)/4 distinct even Frey representations attached to x r + y r = zp .
Proof. See, for example, [Ka, Th. 5.4.4] or [CW]. ± , as functions of the variable Remarks. (1) The periods of the abelian varieties Jr,r t, are values of certain classical hypergeometric functions. These functions arise as solutions of a second-order differential equation having only regular singularities at t = 0, 1, and ∞ and monodromies of order r at 0 and 1 and quasi-unipotent monodromy (with eigenvalue −1 for the odd Frey representation and 1 for the even Frey representation) at t = ∞. (2) Katz’s proof, which is based on his analysis of the behaviour of the local monodromy of sheaves under the operation of “convolution on Gm ,” is significantly more general than the rank 2 case used in our application. It also gives a motivic construction of rigid local systems over P1 − {0, 1, ∞} of any rank. Katz’s “hypergeometric motives” suggest the possibility of connecting equation (1) to higher-dimensional Galois representations, for which questions of modularity are less well understood. (3) In computing finer information such as the conductors of the Frey representa± (a r /c p ) at the “bad primes,” it may be desirable to have a direct proof of tions r,r Theorem 1.13 along the lines of the proof of Theorem 1.10. The details, which are omitted, will be given in [DK].
The equation x p + y q = zr . The notion of “hypergeometric abelian variety” explained in [Ka, Th. 5.4.4] and [CW, Sec. 3.3] also yields a construction of the (r − 1)(q − 1)/2 Frey representations of characteristic p over K = Q(ζq , ζr )+ associated to x p + y q = zr , when p, q, r are distinct primes and p is odd. We do not describe the construction here, referring instead to [Ka, Sec. 5.4] for the details. All that is used in the sequel is the following theorem. − and Theorem 1.14. If q, r = 2 (resp., q = 2), there exist abelian varieties Jq,r + (resp., J ) over Q(t) of dimension (r − 1)(q − 1)/2 satisfying Jq,r 2,r
± EndK Jq,r = ᏻK
(resp., End(J2,r ) = ᏻK ),
whose mod p representations give rise to all the Frey representations in characteristic p associated to x p + y q = zr . More precisely, fix a residue field F of K at p, and let ϕ be a homomorphism of Q(ζq )+ Q(ζr )+ to F. There are (r − 1)(q − 1)/4 (resp., ± (resp., (r − 1)/2) such ϕ’s. Extending ϕ to a homomorphism ᏻK −→ F, let q,r ± 2,r ) be the Galois representation obtained from the action of GK(t) on Jq,r [p] ⊗ϕ F ± (resp., ) are the distinct Frey (resp., J2,r [p] ⊗ϕ F). Then the representations q,r 2,r
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
425
− representations of characteristic p attached to x p +y q = zr . The representations q,r + (resp., q,r ) are odd (resp., even).
Frey abelian varieties. We may now assign to each solution (a, b, c) of equation (1) a Frey abelian variety, obtained as a suitable quadratic twist of the abelian variety ± (a r /c p ) for x r + y r = J (a p /b p ) for x p + y p = zp , Jr± (a p /cr ) for x p + y p = cr , Jr,r p ± p r p q r z , and Jq,r (a /c ) for x + y = z . These twists are chosen in such a way as to make the corresponding mod p representations as “little ramified” as possible, in accord with Lemma 1.2. The equation x p + y p = zp . If (a, b, c) is a solution to the Fermat equation x p + p p 2 p p = zr , the elliptic curve J (a √ /c ) has equation y = x(x − 1)(x − a /c ), which is a quadratic twist (over Q( c)) of the familiar Frey curve yq
J (a, b, c) : y 2 = x(x + a p )(x − b p ). Let ρ be the associated mod p representation of GQ . The equation x p + y p = zr . When r = 2, we associate to a solution (a, b, c) of equation (1) the following twist of C2 (a p /c2 ): C2 (a, b, c) : y 2 = x 3 + 2cx 2 + a p x.
(9)
When r is odd, the Frey hyperelliptic curves Cr− (a, b, c) and Cr+ (a, b, c) are given by the equations Cr− (a, b, c) : y 2 = cr f (x/c) − 2(a p − b p ), Cr+ (a, b, c) : y 2 = (x + 2c) cr f (x/c) − 2(a p − b p ) .
(10) (11)
− c) is a nontrivial quadratic twist of Cr− (a p /cr ) (over the field Note √ that Cr (a, b, + Q( c)), while Cr (a, b, c) is isomorphic to Cr+ (a p /cr ) over Q. Here are the equations of Cr− (a, b, c) for the first few values of r:
r =3:
y 2 = x 3 − 3c2 x − 2(a p − b p ).
r =5:
y 2 = x 5 − 5c2 x 3 + 5c4 x − 2(a p − b p ).
r =7:
y 2 = x 7 − 7c2 x 5 + 14c4 x 3 − 7c6 x − 2(a p − b p ).
Let Jr± (a, b, c) be the Jacobian of Cr± (a, b, c), and let ρr± be the corresponding mod p Galois representations (which depend, as always, on the choice of a homomorphism ϕ from ᏻK to F). The representation ρr± is a quadratic twist of r± (a p /cr ). ± (a, b, c) or C ± (a, b, c), as we have no We do not write down the equations for Cr,r q,r further use for them in this paper. A more careful study of the Frey abelian varieties ± (a, b, c) associated to x r + y r = zp will be carried out in [DK]. Jr,r
426
HENRI DARMON
Conductors. We say that a Galois representation ρ : GK → GL2 (F) is finite at a prime λ if its restriction to a decomposition group at λ comes from the Galois action on the points of a finite flat group scheme over ᏻK,λ . When ᐉ = p, this is equivalent to ρ being unramified. Let N(ρ) denote the conductor of ρ, as defined, for example, in [DDT]. In particular, N(ρ) is divisible precisely by the primes for which ρ is not finite. The equation x p + y p = zp . By interchanging a, b, and c and changing their signs if necessary so that a is even and b ≡ 3 (mod 4), one finds that the conductor of ρ := (a, b, c) is equal to 2 (cf. [Se2]). The presence of the extraneous prime 2 in the conductor (in spite of the fact that all the exponents involved in the Fermat equation are odd) can be explained by the fact that the Frey representation used to construct ρ is odd, so that one of the monodromies of (t) is necessarily of order 2p. In contrast, we will see that the Galois representations obtained from even Frey representations are unramified at 2. The equation x p + y p = zr . Let r = (2 − ω) be the (unique) prime ideal of K above r. Proposition 1.15. (1) The representation ρr− is finite away from r and the primes above 2. (2) The representation ρr+ is finite away from r. Proof. The discriminants 2± of the polynomials used in equations (10) and (11) to define Cr± (a, b, c) are 2− = (−1)(r−1)/2 22(r−1) r r (ab)((r−1)/2)p , 2+ = (−1)(r+1)/2 22(r+1) r r a ((r+3)/2)p b((r−1)/2)p . If 3 is a prime that does not divide 2± , then Cr± (a, b, c) has good reduction at 3; hence ρr± is finite at all primes above 3. So it is enough to consider the primes that divide 2ab. Suppose first that 3 = 2 divides a, and let λ denote any prime of K above 3. Let Kλ be the completion of K at λ and ᏻλ its ring of integers, and denote by ± ρr,λ the restriction of ρr± to an inertia group Iλ ⊂ Gal(K¯ λ /Kλ ) at λ. We observe that ± ± ρr,λ = r± (a p /cr )|Iλ , since 3 does not divide c. To study ρr,λ , we consider the abelian ± variety Jr over Kλ ((t)). Let M be the finite extension of Kλ ((t)) cut out by the Galois representation r± on the p-division points of Jr± . From the proof of Theorem 1.10, one knows that Cr± is a Mumford curve over Kλ [[t]]. Hence, its Jacobian Jr± is equipped with a (t)-adic analytic uniformization 1 −→ Q −→ T −→ Jr± Kλ ((t)) −→ 1, where T (Kλ ((t))× )d is a torus and Q is the sublattice of multiplicative periods. Hence M is contained in L((t 1/p )), where L is a finite extension of Kλ . Because
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
427
Jr± extends to an abelian scheme over the local ring ᏻλ ((t)), the extension L/Kλ is unramified when 3 = p and comes from a finite flat group scheme over ᏻλ when 3 = p. ± But the extension of Kλ cut out by ρr,λ is contained in L(t 1/p ), where t = a p /cr . Since ord3 (t) ≡ 0 (mod p), this extension is unramified at λ when 3 = p and comes from a finite flat group scheme over ᏻλ when 3 = p. The proof when 3 = 2 divides b proceeds in an identical manner, considering this time Cr± (t) over Kλ ((t −1)) and using the fact that ord3 ((a p /cr ) − 1) = ord3 (−b p /cr ) ≡ 0 (mod p) to conclude. Consider finally the case where 3 = 2. If 2 does not divide ab, then c is even. Making the substitution (x, y) = (1/u, (2v + 1)/u(r+1)/2 ), the equation of Cr+ (a, b, c) becomes
(a p − b p ) r u + (lower-order terms in u). 2 The coefficients involved in this equation are integral at 2, and (a p − b p )/2 is odd; hence, Cr+ (a, b, c) has good reduction at 2, and therefore, ρr+ is unramified at λ. If 2 divides ab, suppose without loss of generality that it divides a, and note that the equation (6) for Cr+ (t) also shows that Cr+ (a, b, c) is a Mumford curve over Kλ . The result follows. v 2 + v = 4c(a p − b p )ur+1 −
Remark. The reader will find in [Ell] a more general criterion for the Galois representations arising from division points of Hilbert-Blumenthal abelian varieties to be unramified, which relies on Mumford’s theory in an analogous way. Proposition 1.15 implies that the conductor of ρr+ is a power of r and that the conductor of ρr− is divisible only by r and by primes above 2. We now study the exponent of r that appears in these conductors. Proposition 1.16. (1) If r divides ab, then the conductor of ρr− and ρr+ at r divides r. (2) If r does not divide ab, then the conductor of ρr− and ρr+ at r divides r3 . Proof. We treat the case of ρr+ , since the calculations for ρr− are similar. By making the change of variable x = (2 − ω)u − 2, y = (2 − ω)d+1 v in equation (6), one finds the new equation for Cr+ : 2 − ωj t v+ u = 0. (12) u− Cr+ : v 2 + u 2−ω (2 − ω)d j
Setting t˜ = t/(2 − ω)d , one sees that Cr+ (t˜) is a Mumford curve over Spec(ᏻr[[t˜]]). (The singular points in the special fiber have coordinates given by (u, v) = (0, 0) and ((2 − ωj )/(2 − ω), 0), which are distinct since (2 − ωj )/(2 − ω) ≡ j 2 (mod r).) One concludes that when ordr(t) > d, the representation r+ (t) is ordinary at r, and its conductor divides r. When r divides a one has ordr(a p /cr ) ≥ pd > d. A similar reasoning works when r divides b, and so part (1) of Proposition 1.16 follows. Part (2) is proved by analyzing Jr± (t) over Spec(ᏻr[t, 1/(1−t), 1/t]). The conductor of Jr± over this base is constant, and one finds that the conductor of ρr± is equal to r3 .
428
HENRI DARMON
By combining the analysis of Propositions 1.15 and 1.16, we have shown the following theorem. Theorem 1.17. (1) The conductor of ρr− is of the form 2u rv , where u = 1 if ab is even. One has v = 1 if r divides ab, and v ≤ 3 otherwise. (2) The conductor of ρr+ divides r if r divides ab, and r3 otherwise. 2. Modularity 2.1. Hilbert modular forms. Let K be a totally real field of degree d > 1, and let ψ1 , . . . , ψd be the distinct real embeddings of K. They determine an embedding of the group ; = SL2 (K) into SL2 (R)d by sending a matrix ac db to the d-tuple ai bi d ci di i=1 , where aj = ψj (a) and likewise for bj , cj , and dj . Through this embedding, the group ; acts on the product Ᏼd of d copies of the complex upper half-plane by Möbius transformations. More precisely, if τ = (τ1 , . . . , τd ) belongs to Ᏼd , then a i τi + b i d . Mτ := ci τi + di i=1 If f is a holomorphic function on Ᏼd and γ ∈ GL2 (K), we define (f |2 γ )(τ ) = det(γ ) (ci τi + di )−2 f (γ τ ). Let ; be a discrete subgroup of GL2 (K). Definition 2.1. A modular form of weight 2 on ; is a holomorphic function on
Ᏼd which satisfies the transformation rule
f |2 γ = f, for all γ in ;. A function that vanishes at the cusps is called a cusp form on ;. The space of modular forms of weight 2 on ; is denoted M2 (;), and the space of cusp forms is denoted S2 (;). Let n be an ideal of K. We now introduce the space S2 (n) of cusp forms of weight 2 and level n, as in [W1, Sec. 1.1]. For this, choose a system c1 , c2 , . . . , ch of representative ideals for the narrow ideal classes of K. Let d denote the different of K, and assume that the ci have been chosen relatively prime to nd. Define
a b −1 ∈ GL+ ;i (n) := M = 2 (K) | a, d ∈ ᏻK , b ∈ (ci d) , c d c ∈ ci dn, ad − bc ∈ ᏻ× K . Definition 2.2. A cusp form of weight 2 and level n is an h-tuple of functions (f1 , . . . , fh ), where fi ∈ S2 (;i (n)).
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
429
Denote by S2 (n) the space of cusp forms of weight 2 and level n. To the reader acquainted with the case K = Q, the definition of S2 (n) may appear somewhat contrived. It becomes more natural when one considers the adelic interpretation of modular forms of level n as a space of functions on the coset space GL2 (AK )/GL2 (K). As in the case where K = Q, the space S2 (n) is a finitedimensional vector space and is endowed with an action of the commuting self-adjoint Hecke operators Tp for all prime ideals p of K which do not divide n (cf. [W1, Sec. 1.2]). A modular form f ∈ S2 (n) is called an eigenform if it is a simultaneous eigenvector for these operators. In that case one denotes by ap(f ) the eigenvalue of Tp acting on f . Let Kf be the field generated by the coefficients ap(f ). It is a finite totally real extension of Q. If λ is any prime of Kf , let Kf,λ be the completion of Kf at λ and let ᏻf,λ be its ring of integers. Eigenforms are related to Galois representations of GK thanks to the following theorem. Theorem 2.3. Let f be an eigenform in S2 (n). There is a compatible system of λ-adic representations ρf,λ : GK −→ GL2 (ᏻf,λ ) for each prime λ of Kf , satisfying trace ρf,λ frobq = aq(f ) ,
det ρf,λ frobq = Norm(q),
for all primes q of K which do not divide nλ. Sketch of proof. When K is of odd degree or when K is of even degree and there is at least one finite place where f is either special or supercuspidal, this follows from work of Shimura, Jacquet and Langlands, and Carayol (cf. [Ca]). In this case, the representation ρf,λ can be realized on the λ-adic Tate module of an abelian variety over K. (It is a factor of the Jacobian of a Shimura curve associated to a quaternion algebra over K which is split at exactly one infinite place.) In the general case, the theorem is due to Wiles [W2] (for ordinary forms) and to Taylor [Tay] for all f . The constructions of [W2] and [Tay] are more indirect than those of [Ca]: They exploit congruences between modular forms to reduce to the situation that is already dealt with in [Ca], but they do not realize ρf,λ on the division points of an abelian variety (or even on the étale cohomology of an algebraic variety). A different construction, by Blasius and Rogawski [BR], exhibits the Galois representations in the cohomology of Shimura varieties associated to an inner form of U (3). Let A be an abelian variety over K with real multiplications by a field E. More precisely, one requires that E is a finite extension of Q whose degree is equal to the dimension of A, and one also requires that A is equipped with an inclusion: E −→ EndK (A) ⊗ Q.
430
HENRI DARMON
Following the terminology of Ribet, call A an abelian variety of GL2 -type over K. It gives rise to a compatible system ρA,λ of 2-dimensional λ-adic representations of GK for each prime λ of E by considering the action of GK on (T3 (A) ⊗ Q3 ) ⊗E Eλ . The conductor of A is defined to be the Artin conductor of ρA,λ for any prime λ of good reduction for A. (One can show that this does not depend on the choice of λ.) The following conjecture is the natural generalization of the Shimura-Taniyama conjecture in the setting of abelian varieties of GL2 -type. Conjecture 2.4 (Shimura and Taniyama). If A is an abelian variety of GL2 -type over K of conductor n, then there exists a Hilbert modular form f over K of weight 2 and level n such that ρf,λ ρA,λ for all primes λ of E. If A satisfies the conclusion of Conjecture 2.4, one says that A is modular. Remark. To prove that A is modular, it is enough to show that it satisfies the conclusion of Conjecture 2.4 for a single prime λ of E. Conjecture 2.4 appears to be difficult in general, even with the powerful new techniques introduced by Wiles in [W3]. In connection with equation (1), one is particularly interested in Conjecture 2.4 for hypergeometric abelian varieties. Conjecture 2.5. For all t ∈ Q, the hypergeometric abelian variety J (t) (resp., ± (t), J ± (t)) attached to the equation x p + y p = zp (resp., x p + y p = zr , Jr± (t), Jr,r q,r r r x +y = zp , x p +y q = zr ) is modular over Q (resp., Q(ζr )+ , Q(ζr )+ , Q(ζq , ζr )+ ). 2.2. Modularity of hypergeometric abelian varieties The modularity of J . The modularity of the curves in the Legendre family J follows from Wiles’s proof of the Shimura-Taniyama conjecture for semistable elliptic curves. To prove that J is modular, Wiles begins with the fact that the mod 3 representation J [3] is modular. This follows from results of Langlands and Tunnell on base change; the key fact being that GL2 (F3 ) is solvable. Wiles then shows (at least when the representation J [3] is irreducible and semistable) that every “sufficiently well-behaved” lift of J [3] is also modular. This includes the representation arising from the 3-adic Tate module of J , and hence J itself is modular. ± . When r = 2, the abelian variety J is an elliptic The modularity of Jr± and Jr,r 2 curve (which arises from the universal family on X0 (2)) and its modularity follows from the work of Wiles and its extensions [Di1]. − are elliptic curves, so that Likewise, when r = 3, the abelian varieties Jr± and Jr,r their modularity follows from the Shimura-Taniyama conjecture. It is still conjectural in this case, in spite of the progress made toward the Shimura-Taniyama conjecture − in [Di1] and [CDT]: For many values of t, the conductors of J3± (t) and J3,3 (t) are divisible by 27.
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
431
For r > 3, the prime 3 is never split in Q(ζr )+ , so that the image of the Galois ± [3] is contained in a product of groups isomorrepresentation acting on Jr± [3] or Jr,r phic to GL2 (F3s ) with s > 1. Because GL2 (F3s ) is not solvable when s > 1, it seems ± [3] and use the prime 3 as difficult to directly prove the modularity of Jr± [3] or Jr,r in Wiles’s original strategy. Consider instead the prime r of norm r. Since GQ fixes r, it acts naturally on ± [r] of r-torsion points of J ± and J ± . Furthermore, these the modules Jr± [r] and Jr,r r r,r modules are 2-dimensional Fr -vector spaces, and the action of GQ on them is Fr linear. − [r] are isomorphic to a quadratic Theorem 2.6. (1) The modules Jr− [r] and Jr,r twist of the mod r representation associated to the Legendre family J . + [r] are reducible Galois representations. (2) The modules Jr+ [r] and Jr,r
Proof. By the same arguments as in the proof of Theorem 1.10, one shows that the − [r] (resp., J + [r] and J + [r]), if irreducible, representations attached to Jr− [r] and Jr,r r r,r are Frey representations associated to the Fermat equation x r + y r = zr which are odd (resp., even). By Theorem 1.5, there is a unique odd Frey representation (up to twisting by a quadratic character) associated to x r + y r = zr , which is the one associated to the r-torsion points on the Legendre family J (t). Part (1) follows. Since there are no even Frey representations associated to x r + y r = zr , the reducibility + [r] follows as well. (Alternately, in [DMs, Prop. 2.3], an explicit of Jr+ [r] and Jr,r r-isogeny from Jr+ (t) to Jr+ (−t) defined over K is constructed, which shows that the corresponding representation is reducible and, in fact, that Jr+ has a K-rational torsion point of order r.) ± be the conductors of the G -representations J ± [r] and J ± [r]. Let Nr± and Nr,r Q r r,r ± [r] arise from a classical Corollary 2.7. The GQ -representations Jr± [r] and Jr,r ± ± modular form f0 on ;0 (Nr ) and ;0 (Nr,r ).
Proof. Since the elliptic curve J : y 2 = x(x − 1)(x − t) is modular for all t ∈ Q, it is associated to a cusp form on ;0 (NJ ) where NJ is the conductor of J (t). The lowering-the-level result of Ribet [Ri2] ensures that there is a form f0 of level Nr− − ) attached to J − [r] (resp., J − [r]). In the case of the even Frey represen(resp., Nr,r r r,r tations, the appropriate modular form f0 can be constructed directly from Eisenstein series. ± [r] to G , Consider now the restriction of the Galois representations Jr± [r] and Jr,r K which we denote with the same symbol by abuse of notation.
Theorem 2.8. There are Hilbert modular forms f over K giving rise to Jr± [r] or
± [r]. Jr,r
Proof. This is a consequence of cyclic base change, taking f to be the base change lift of f0 from Q to K.
432
HENRI DARMON
In light of Theorem 2.8, what is needed now is a “lifting theorem” in the spirit of [TW] and [W3] for Hilbert modular forms over K, which would allow us to ± . The methods of conclude the modularity of the r-adic Tate module of Jr± and Jr,r [TW] are quite flexible and have recently been partially extended to the context of Hilbert modular forms over totally real fields by a number of mathematicians, notably Fujiwara [Fu] and Skinner and Wiles [SW1]–[SW3]. Certain technical difficulties ± in full generality. prevent one from concluding the modularity of Jr± and Jr,r (1) When r does not divide ab, the r-adic Tate module of Jr± is neither flat nor ordinary at r. One needs lifting theorems that take this into account. The work of Conrad, Diamond, and Taylor [CDT] is a promising step in this direction, but many technical difficulties remain to be resolved. Even when r = 3, one cannot yet prove − that the elliptic curves J3± (t) and J3,3 (t) are modular for all t ∈ Q. (2) The reducibility of the representation Jr+ [r] may cause some technical difficulties, although the recent results of Skinner and Wiles [SW1]–[SW3] go a long way toward resolving these difficulties in the ordinary case. As an application of the results of Skinner and Wiles, we have the following theorem. Theorem 2.9. (1) If r divides ab, then the abelian varieties Jr± (a, b, c) are modular. ± (a, b, c) are modular. (2) If r divides c, then the abelian varieties Jr,r ± (a, b, c) have multiplicative reProof. The abelian varieties Jr± (a, b, c) and Jr,r duction at r, by the proof of Proposition 1.16. Hence the r-adic Tate modules Tr± ± of these varieties, viewed as a representation of G , are ordinary at r. Since and Tr,r K + are reducible, the modularity of the residual representations attached to Tr+ and Tr,r the associated r-adic representations follows from [SW3, Sec. 4.5, Th. A]. (Note that the five hypotheses listed in this theorem are satisfied in our setting, with k = 2 and A = 1, since the field denoted there by F (χ1 /χ2 ) is equal to the cyclotomic field − , the associated residual representation is never Q(ζr ).) In the case of Tr− and Tr,r reducible when r > 5 by the work of Mazur, and the modularity of the associated r-adic representations follows from [SW2, Sec. 5, Th. 5.1]. ± . Let K = Q(ζ , ζ )+ , and let q be a prime of K above q. The modularity of Jq,r q r This prime is totally ramified in K/Q(ζr )+ . Denote by q also the unique prime of Q(ζr )+ below q, and let F be the common residue field of Q(ζr )+ and K at q. ± [q ] As in the previous section, one notes that the action of GK on the module Jq,r extends to an F-linear action of GQ(ζr )+ . ± [q] is isomorphic to a quadratic twist of J ± [q] Theorem 2.10. The module Jq,r r as a GQ(ζr )+ -module.
Proof. The proof is exactly the same as the proof of Theorem 2.6. ± [q]. Corollary 2.11. If Jr± is modular, then so is Jq,r
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
433
Proof. The proof is the same as for Corollary 2.7 and Theorem 2.8; this time applying cyclic base change from Q(ζr )+ to K. Modularity of J [3] | | ↓ Modularity of J ↓ Modularity of J [r] | | ↓
←−
Base change
←−
Wiles lifting
−→
x r + y r = zr
←−
Theorem 2.6 & base change
Theorem 2.6 ↓
Modularity of Jr− [r] − [r ] and Jr,r | | ↓
Modularity of Jr+ [r] + [r ] and Jr,r ←−
Generalized Wiles lifting?
−→
Modularity of Jr− − and Jr,r
Modularity of Jr+ + and Jr,r
↓
↓ Jr− [q]
Modularity of − [q ] and Jr,r | | ↓
−→ ←−
xq + yq
xr + yr
= zr , = zq
Theorem 2.10 & base change
←−
Modularity of Jr+ [q] + [q ] and Jr,r
−→
| | ↓
− [q ] Modularity of Jq,r
| | ↓
+ [q ] Modularity of Jq,r
←−
Generalized Wiles lifting?
−→
− Modularity of Jq,r
| | ↓ + Modularity of Jq,r
↓ Modularity of
| | ↓
↓ − [p ] Jq,r
−→
xp + yq
= zr
←−
+ [p ] Modularity of Jq,r
Figure 1
Corollaries 2.7 and 2.11 suggest an inductive strategy for establishing the modu± , and J ± : combining a series of base changes with successive larity of J , Jr± , Jr,r q,r
434
HENRI DARMON
applications of Wiles-type lifting theorems (at the last step, for Hilbert modular forms over Q(ζq , ζr )+ ). This strategy, and its connections with Fermat’s equation and its variants, is summarized in Figure 1. 3. Lowering the level 3.1. Ribet’s theorem. Let ρ : GK → GL2 (F) be a Galois representation of GK with values in GL2 (F), where F is a finite field. If f is a Hilbert modular form that is an eigenform for the Hecke operators, denote by ᏻf the ring generated by the associated eigenvalues. Definition 3.1. We say that ρ is modular if there exists a Hilbert modular form f over K and a homomorphism j : ᏻf → F such that, for all primes q that are unramified for ρ, trace ρ frobq = j aq(f ) . If f can be chosen to be of weight k and level n, we say that ρ is modular of weight k and level n. The following is a generalization of Serre’s conjectures [Se2] to totally real fields, in a simple special case. Conjecture 3.2. Suppose that ρ : GK −→ GL2 (F) is an absolutely irreducible Galois representation, where F is a finite field of characteristic p. Suppose also that (1) ρ is odd, and its determinant is the cyclotomic character; (2) ρ is finite at all primes p dividing p; (3) the conductor of ρ in the sense of [Se2] is equal to n. Then ρ is modular of weight 2 and level n. This conjecture also seems quite difficult. (For example, the argument in [Se2, Sec. 4, Th. 4] shows that Conjecture 3.2 implies the generalized Shimura-Taniyama Conjecture 2.4.) The following conjecture, which extends a result of Ribet [Ri2] to totally real fields, should be more approachable. Conjecture 3.3. Suppose that ρ satisfies the assumptions of Conjecture 3.2 and that it is modular of weight 2 and some level. Then ρ is modular of weight 2 and level n. The following partial result is proved in [Ja] and [Ra], building on the methods of [Ri2]. Theorem 3.4. Let ρ : GK −→ GL2 (F) be an irreducible mod p representation associated to a Hilbert cuspidal eigenform f of weight 2 and level nλ, where n, λ,
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
435
and p are mutually relatively prime and λ is a prime of K. If [K : Q] is even, assume that f is either special or supercuspidal at a finite prime q not dividing p and λ. If ρ is unramified at λ, then ρ comes from a Hilbert cuspidal eigenform g of weight 2 and level n. 3.2. Application to x p + y p = zr . In the remainder of this paper, we focus our attention on the equation x p + y p = zr and attack it by studying the representations ρr+ = + (a p /cr ) (and, toward the end, ρr− ) attached to the p-torsion of Jr± (a, b, c). Theorem 3.5. (1) If r divides ab, then ρr+ (resp., ρr− ) comes from a modular form of weight 2 and level dividing r (resp., 2u r, for some u). (2) If r does not divide ab, assume further that Jr± (t) is modular and that Conjecture 3.3 holds for Hilbert modular forms over K. Then ρr+ (resp., ρr− ) comes from a modular form of weight 2 and level dividing r3 (resp., 2u r3 , for some u). Proof. The modularity of Jr± (a, b, c) (which, when r divides ab, follows from Theorem 2.9) implies that ρr± is modular of weight 2 and some level. By Theorem 1.17, ρr+ has conductor dividing r when r | ab and dividing r3 in general, and it satisfies all the other hypotheses in Conjecture 3.2; a similar statement holds for ρr− . Conjecture 3.3 implies the conclusion. Note that, when r | ab, the Hilbert modular form f associated to Jr+ (a, b, c) is special or supercuspidal at r, so that the hypotheses of Theorem 3.4 are satisfied. Hence, Theorem 3.4 can be applied to remove all the unramified primes from the level of the associated modular form, proving part (1) of Theorem 3.5 unconditionally. Remark. Theorem 3.5 suggests that the analysis of the solutions (a, b, c) to x p + y p = zr splits naturally into two cases, depending on whether or not r divides ab. The following definition is inspired by Sophie Germain’s classical terminology. Definition 3.6. A primitive solution (a, b, c) of x p + y p = zr is called a first case solution if r divides ab, and a second case solution otherwise. Remark. As with Fermat’s last theorem, the first case seems easier to deal with than the second case (cf. Theorem 3.22). Our hope is that Theorem 3.5 forces the image of ρr± to be small (at least for some values of r). Before pursuing this matter further, observe that equation (1) has (up to sign) three trivial solutions: (0, 1, 1), (1, 0, 1), and (1, −1, 0). Proposition 3.7. (1) If (a, b, c) = (0, 1, 1) or (1, 0, 1), then Jr+ and Jr− have degenerate reduction, and the representations ρr± are therefore reducible. (2) If (a, b, c) = (1, −1, 0), then Jr± have complex multiplication by Q(ζr ), and hence the image of ρr± is contained in the normalizer of a Cartan subgroup of GL2 (F). Proof. This can be shown by a direct calculation. For example, the curve Cr+ (1, −1, 0) has equation
436
HENRI DARMON
y 2 = x r+1 − 4x. Making the substitution (x, y) = (−1/u, (2v +1)/u(r+1)/2 ), one obtains the equation v 2 + v = ur , and one recognizes this as the equation for the hyperelliptic quotient of the Fermat curve x r + y r = zr that has complex multiplication by Q(ζr ). Proposition 3.7 suggests the following question. Question 3.8. Can one show that the image of ρr± (a, b, c) is necessarily contained in a Borel subgroup or in the normalizer of a Cartan subgroup of GL2 (F)? The case r = 2 and 3. For r = 2 (resp., r = 3) one can answer this question in the affirmative, by noting that ρ2 (a, b, c) (resp., ρ3+ (a, b, c)) is modular of level dividing 32 (resp., 27). (One needs to assume the Shimura-Taniyama conjecture for r = 3.) The space of classical cusp forms of weight 2 and level 32 (resp., 27) is 1-dimensional. In fact, X0 (32) (resp., X0 (27)) is an elliptic curve with complex multiplication by Q(i) (resp., Q(ζ3 )). (It is also a quotient of the Fermat curve x 4 + y 4 = z4 (resp., x 3 + y 3 = z3 ).) So the Galois representations arising from nontrivial primitive solutions of x p + y p = z2 and x p + y p = z3 are either reducible or of dihedral type. This was proved in [Da1] (see also [DMr]). Answering Question 3.8, for specific values of r > 3 and p, requires a computation of all the Hilbert modular forms over K of weight 2 and level dividing r3 . We limit ourselves to the simpler case where K has narrow class number 1. Remark. It is known that K has narrow class number 1 for all r < 100 except r = 29, when the narrow class number is equal to 8. (The author is grateful to Cornelius Greither for pointing out these facts.) We now give a formula for the dimension of S2 (1) and S2 (r k ), with k = 1, . . . , 3 under the narrow class number 1 assumption. To do this we need to introduce some notation. • Recall that d = (r − 1)/2 denotes the degree of K over Q. • Set δ2 = 2 if r ≡ 1 (mod 4) and δ2 = 0 if r ≡ 3 (mod 4). Likewise let δ3 = 2 if r ≡ 1 (mod 3) and δ3 = 0 if r ≡ 2 (mod 3). • Let ζK (s) be the Dedekind zeta-function of K. The main contribution to the dimension of S2 (r k ) is given by the special value ζK (−1), a rational number that can be computed from the formula: (−1)d B2,χ ζK (−1) = , 12 χ 2
r
B2,χ
1 = χ(a)a 2 , r a=1
where the product is taken over all nontrivial even Dirichlet characters χ : (Z/rZ)× / !±1" → C× of conductor r.
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
437
• Let h− be the minus part of class number of Q(ζr ). This number can be evaluated also as a product of generalized Bernoulli numbers: h− = (−1)d 2r
B1,χ χ
2
r
,
B1,χ =
1 χ(a)a, r a=1
where the product this time is taken over the odd Dirichlet characters √ of conductor r. • Let h(a) be the class number of the quadratic extension K( a), and (for d < 0) √ × × √ let q(a) be the index of ᏻ× K ᏻQ( a) in ᏻK ( a) . One has q(a) = 1 or 2, and q(a) = 1 if r ≡ 3 (mod 4). Only the ratios h(−1)/q(−1) and h(−3)/q(−3) are involved in the formula for the dimension of S2 (r k ). Let χ4 and χ3 denote the nontrivial Dirichlet character mod 4 and 3, respectively. When K has narrow class number 1, these ratios are given by the formulae B1,χχ h(−1) 4 = (−1)d+1 , q(−1) 2 χ
B1,χχ h(−3) 3 = (−1)d+1 , q(−3) 2 χ
where the products are taken over the nontrivial even Dirichlet characters of conductor r. Recall that B1,χχ4
4r 1 = aχχ4 (a), 4r
B1,χχ3
a=1
3r 1 = aχχ3 (a). 3r a=1
Table 1 lists these invariants for the first few values of r. Table 1
Let
r
d
ζK (−1)
h−
5
2
1/30
1
1
1
7 11 13 17 19
3 5 6 8 9
−1/21 −20/33 152/39 18688/51 −93504/19
1 1 1 1 1
1 1 3 8 19
1 1 2 5 9
h(−1)/q(−1) h(−3)/q(−3)
χ (n) = 1 + (−1)d dim S2 (n) .
Under the assumption that K has narrow class number 1, this is the arithmetic genus of the Hilbert modular variety Ᏼd / ;0 (n); cf. [Fr, Ch. II, Sec. 4, Th. 4.8]. Theorem 3.9. Assume that K has narrow class number 1. Then χ(r k ) (and hence, the dimension of S2 (r k )) is given by the formulae
438
HENRI DARMON
χ (1) =
h(−3) ζK (−1) r − 1 − h(−1) h + + , + d−1 2r 4q(−1) 3q(−3) 2
χ (r) = (r + 1)
ζK (−1) r − 1 − h(−1) h(−3) h + δ2 + δ3 , + 2r 4q(−1) 3q(−3) 2d−1
ζK (−1) h(−1) h(−3) χ r k = r k−1 (r + 1) d−1 + δ2 + δ3 . 4q(−1) 3q(−3) 2 Proof. The formula for χ (1) is given in [We, Th. 1.14 and 1.15]. A routine calculation then yields the formula for χ(r k ), after noting that (1) an elliptic fixed point of order 2 (resp., 3) on Ᏼd for the action of SL2 (ᏻK ) lifts to δ2 (resp., δ3 ) elliptic fixed points on Ᏼd / ;0 (r k ) for k ≥ 1; (2) an elliptic fixed point of order r lifts to a unique elliptic fixed point modulo ;0 (r), and there are no elliptic fixed points of order r on Ᏼd / ;0 (r k ) when k > 1. Noting that K has narrow class number 1 when r < 23, Theorem 3.9 allows us to compute the dimensions for the relevant spaces of cusp forms (see Table 2). Table 2 r
dim S2 (1) dim S2 (r) dim S2 r2 dim S2 r3
5
0
0
0
2
7 11 13 17 19
0 0 1 6 12
0 1 4 55 379
1 6 24 879 7300
5 56 290 14895 138790
The case r = 5 and√7. When r = 5, the action of the Hecke operators on the spaces S2 (n) over K = Q( 5) can be calculated numerically by exploiting the JacquetLanglands correspondence between forms on GL2 (K) and on certain quaternion algebras. Let B be the (unique, up to isomorphism) totally definite quaternion algebra over K which is split at all finite places. The algebra B can be identified with the standard Hamilton quaternions over K, since 2 is inert in K:
√ B = x + yi + zj + wk, x, y, z, w ∈ Q 5 . The class number of B is equal to 1: The maximal orders in B are all conjugate to the ring of icosians 1 1 R = Z ω, i, j, k, (1 + i + j + k), (i + ωj + ωk) ¯ , 2 2
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
439
whose unit group R × is isomorphic to the binary icosahedral group of order 120 (cf., e.g., [CS, Ch. 8, Sec. 2.1]). Let Rn be an Eichler order of level n in R, and write ˆ Rˆ n := Rn ⊗ Z,
ˆ Bˆ = B ⊗ Z.
The Jacquet-Langlands correspondence shows that S2 (n) is isomorphic as a Hecke module to the space L2 Rˆ n× \ Bˆ × /B × , on which the Hecke operators act in the standard way. Table 3 lists the eigenvalues of the Hecke operators Tp acting on S2 (r3 ), for all the primes p of K of norm less than or equal to 50.√It turns out that the two eigenforms in S2 (r3 ) are conjugate to each other over Q( 5), so we have only displayed the eigenvalues of one of the two eigenforms. Table 3 p
(2)
ap(f ) 0
(3) 0
p
(6 + ω)
ap(f )
0
(3 − ω) (4 + ω) √ √ (−1 − 5 5)/2 (−1 + 5 5)/2
(4 − ω)
(5 + ω)
(5 − ω)
0
0
0
(7 + 2ω) (5 − 2ω) (7 + ω) (6 − ω) √ √ √ √ (−11 + 5 5)/2 (−11 − 5 5)/2 (9 + 5 5)/2 (9 − 5 5)/2
(7) 0
Observe that ap(f ) = 0 for all the primes p that are inert in the quadratic extension Q(ζ5 )/Q(ω). This suggests that f is actually of CM type, and it corresponds to an abelian variety of dimension 2 with complex multiplication by Q(ζ5 ). In fact, this can be proved: The abelian variety J5+ (1, −1, 0) = Jac y 2 + y = x 5 has complex multiplication by Q(ζ5 ), and its Hasse-Weil L-function is a product of Hecke L-series attached to Grössen characters of Q(ζ5 ) of conductor (1 − ζ5 )2 . + A direct √ 3 calculation shows √ that J5 (1, −1, 0) is associated to the two eigenforms in S2 ( 5 ) over GL2 (Q( 5)). When r = 7, we did not carry out a numerical investigation of the Hecke eigenforms of level r2 and r3 , but this turns out to be unnecessary in identifying the modular forms that arise in these levels. Let A be the (unique, up to isogeny) √ elliptic curve over Q of conductor 49, which has complex multiplication by Q( −7). It corresponds to a cusp form over Q of level 49. Its base change lift to K = Q(cos(2π/7)) is the unique modular form of level r2 . The space S2 (r3 ) contains a 2-dimensional space of old forms, and hence there are three eigenforms of level r3 . These must consist of the Hilbert modular forms associated to the Fermat quotient J7+ (1, −1, 0) : y 2 + y = x 7 .
440
HENRI DARMON
So when r = 5 and 7, the spaces S2 (r3 ) contain only eigenforms of CM type associated to hyperelliptic Fermat quotients or CM elliptic curves. Hence, we have the following theorem. Theorem 3.10. Let r = 5 or 7, and let (a, b, c) be a nontrivial primitive solution to the equation x p + y p = zr , where p = r is an odd prime. Let p be any prime of K = Q(cos(2π/r)) above p, and write F := ᏻK /p. Then we have the following. (1) If (a, b, c) is a first case solution, the mod p representation associated to Jr+ (a, b, c) is reducible. (2) If (a, b, c) is a second case solution, assume further that Jr+ (a, b, c) is modular and that Ribet’s lowering-the-level theorem (see Conjecture 3.3) holds for Hilbert modular forms over K. Then the mod p representation associated to Jr+ (a, b, c) is either reducible or its image is contained in the normalizer of a Cartan subgroup of GL2 (F). Following [Se1], one can use the fact that Jr+ is semistable to obtain more precise information in the first case. Proposition 3.11. If r = 5 or 7 and (a, b, c) is a first case solution to x p + y p = then Jr+ (a, b, c) is Q-isogenous to an abelian variety having a rational point of order p. zr ,
Proof. Choose a prime p of K above p, and let χ1 : GK −→ F× be the character giving the action of GK on the K-rational 1-dimensional F-vector subspace L of Jr+ [p]. Let χ2 be the character of GK describing its action on Jr+ [p]/L. The local analysis in [Se1, Sec. 5.4., Lem. 6] shows that χ1 and χ2 are unramified outside the primes above p. Also, the set of restrictions {χ1 |Ip , χ1 |Ip } to an inertia group Ip at a prime p above p is equal to {χ, 1}, where χ is the cyclotomic character giving the action of Ip on the pth roots of unity. (Use the corollary to Proposition 13 of [Se1].) Hence, one of χ1 or χ2 is everywhere unramified. (When there is a single prime of K above p, this is immediate. If p is split in K, one observes, by analyzing the image × of the map ᏻ× K → (ᏻK ⊗ Fp ) and using class field theory, that the inertia groups at the various p , in the Galois group of the maximal tamely ramified abelian extension of K unramified outside p, have nontrivial intersection and, in fact, are equal for all but finitely many p.) Since K has class number 1, one of χ1 or χ2 is trivial. If χ1 = 1, then Jr+ [p] has a K-rational point whose trace gives a point of order p in Jr+ (a, b, c). If χ2 = 1, the module L˜ generated by the ᏻK [GQ ]-translates of L is a Q-rational subgroup of Jr+ [p] which is of rank 1 over ᏻK ⊗ Fp . The quotient Jr+ /L˜ has a rational point of order p. Corollary 3.12. If 3 is a prime satisfying 3 < p 1/d − 2p 1/2d + 1, then 3 divides ab. + good reduction at 3 and #Jr+ (F3 ) < Proof. √ 2d If 3 does not divide ab, then Jr has + (1+ 3) by the Weil bounds. Hence, p > #Jr (F3 ). This contradicts Proposition 3.11,
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
441
since the prime-to-3 part of the torsion subgroup of Jr+ (Q) injects into Jr+ (F3 ) (and likewise for any abelian variety isogenous to Jr+ ). Theorem 3.13. Suppose r = 5 or 7. There exists a constant Cr− depending only on r such that, if p ≥ Cr− and (a, b, c) is a first case solution to x p + y p = zr , the Galois representation ρr− is reducible. (In this case, there is a quotient of Jr− (a, b, c) over Q which has a rational point of order p.) Proof. By Corollary 3.12, if p is large enough, then 2 divides ab, so that the abelian variety Jr− (a, b, c) is semistable at 2 and, hence, everywhere (see the proof of Proposition 1.15). The mod p representation associated to Jr− (a, b, c), if irreducible, is therefore equal to the mod p representation associated to a Hilbert modular form f over K in S2 (2r), by Theorem 3.5. Corollary 3.12 further implies that if 3 ≤ p1/d − 2p 1/2d + 1 is a rational prime, then 3 divides ab, so that Jr− (a, b, c) has multiplicative reduction at any prime λ of K above 3. By using the Tate uniformization of Jr− (a, b, c) at λ, we find that aλ (f ) ≡ norm(λ) + 1 (mod p),
for all 3 ≤ p 1/d − 2p 1/2d + 1.
For each f , there is a constant Cf− such that this statement fails whenever p > Cf− , since the mod p representations attached to f are irreducible for almost all p. Now take Cr− to be the maximum of the Cf− as f runs over the normalized eigenforms in S2 (2r). The statement in parentheses follows by applying to Jr− the same arguments used in the proof of Proposition 3.11. Remark. Although the statement of Theorem 3.13 involves only ρr− , note the crucial role played in its proof by the representation ρr+ via Corollary 3.12. This illustrates how information gleaned from one Frey representation may sometimes be used to yield insights into a second a priori unrelated Frey representation associated to the same generalized Fermat equation. The case r = 11. When r = 11, there is a 44-dimensional space of newforms of level r3 , and studying the equation x p +y p = z11 would require computing the Fourier coefficients associated to these newforms. We content ourselves with the following result, which requires only dealing with S2 (r). Theorem 3.14. Let (a, b, c) be a first case solution to the equation x p +y p = z11 , where p > 19 is prime, and let p be any prime of K = Q(cos(2π/11)) above p. Then + + the mod p representation associated to J11 (a, b, c) is reducible, and in fact J11 (a, b, c) has a rational point of order p. Proof. Let p be any ideal of K above p, and let ρp denote the mod p representation + associated to J11 (a, b, c). Suppose that it is irreducible. By exploiting the action of Gal(K/Q), it follows that ρp is irreducible for all choices of p. Theorem 3.5 implies that ρp is modular of level dividing r. Table 2 shows that the space of cusp forms of this level is 1-dimensional. In fact, the unique normalized eigenform f of level r is
442
HENRI DARMON
the base change lift to K = Q(cos(2π/11)) of the modular form f = η(z)2 η(11z)2 of level 11 associated to the elliptic curve X0 (11). Consider the prime ideal (2) of K above 2, of norm 32. Then a(2) (f) = a32 (f ) = 8. This implies that
+ a(2) := a(2) J11 (a, b, c) ≡ 8 (mod p)
for all primes p above p. Taking norms, one finds p 5 divides normK/Q (a(2) − 8). By the Weil bounds, we have √ 5 | normK/Q (a(2) − 8)| ≤ 2 32 + 8 . √ Since p > 20 > 8(1 + 2), we must have a(2) = 8. But this leads to a contradiction. + For, if 2 divides ab, then J11 (a, b, c) has purely toric reduction at 2 and a(2) = ±1. If + ab is odd, then J11 (a, b, c) has good reduction at (2), and 11 divides norm(32 + 1 − + a(2) ) = 255 since J11 (a, b, c) has a K-rational point of order 11 (by Theorem 2.6). It + follows that the mod p representations associated to J11 are reducible. The proof of + Proposition 3.11 now shows that J11 (Q) has a point of order p, since Q(cos(2π/11)) has class number 1. Corollary 3.15. If 3 is a prime satisfying 3 < p 1/5 − 2p1/10 + 1, then 3 divides ab. The proof of this corollary is the same as for Corollary 3.12. Finally, we record the following theorem. − − such that, if p ≥ C11 and (a, b, c) Theorem 3.16. There exists a constant C11 − p p 11 is a first case solution to x + y = z , the Galois representation ρ11 is reducible. − (In this case, there is a quotient of J11 (a, b, c) over Q which has a rational point of order p.)
The proof is the same as for Theorem 3.13. The case r = 13. When r = 13 there is a unique normalized cusp form of level 1, which is the base change lift of the cusp form associated to the elliptic curve X1 (13). (Note that this curve acquires good reduction over Q(cos(2π/13)).) This modular form does not pose any obstructions to studying first case solutions to x p +y p = z13 , since the representation attached to a solution of the equation is ramified at r. On the other hand, the 2-dimensional space of newforms of level r would have to be studied more carefully in order to understand the (first case) solutions to x p +y p = z13 . The numerical calculation of eigenforms in S2 (r) becomes increasingly difficult as
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
443
r gets larger, and it has not been carried out even for r = 13. One can go further without such explicit numerical calculations (cf. Theorem 3.22) by studying congruences (modulo r) for modular forms. General r. Let 3 be a rational prime. The 3-adic Tate module T3 (Jr± (t)) ⊗ Q3 is a 2-dimensional K3 := K ⊗ Q3 -vector space. When t is rational, the linear action of GK on this vector space extends to a GQ -action that is GK -semilinear; that is, it satisfies σ (αv) = α σ σ (v), for all α ∈ K3 , σ ∈ GQ . Letting aq(Jr± ) := trace(ρJ,3 (frobq)), it follows that σ aq Jr± = aqσ Jr± .
(13)
This motivates the following definition. Definition 3.17. A Hilbert modular form over K of level n is called a Q-form if for all ideals q of K which are prime to n, it satisfies the relation aq(f )σ = aqσ (f ),
for all σ ∈ GQ .
(In particular, this implies that the Fourier coefficients aq(f ) belong to K.) Equation (13) implies the following lemma, which reflects the fact that the abelian varieties Jr± (t) with t ∈ Q are defined over Q (even though their endomorphism rings are only defined over K). Lemma 3.18. For all t ∈ Q, if the abelian varieties Jr− (t) and Jr+ (t) are modular, then they are associated to a modular Q-form over K. Let f be an eigenform in S2 (n), and let λ be a prime in the ring of Fourier coefficients ᏻf . Denote by ρf,λ the λ-adic representation associated to f by Theorem 2.3 and let V be the underlying Kf,λ -vector space. Choose a GK -stable ᏻf,λ -lattice H in ¯ := H/λH gives a 2-dimensional representation ρ¯f,λ for GK over the V . The space H residue field kf,λ := ᏻf,λ /λ. In general, this representation depends on the choice of lattice, but its semisimplification does not. One says that ρf,λ is residually irreducible if ρ¯f,λ is irreducible for some (and hence all) choices of lattice H. Otherwise one says that ρf,λ is residually reducible. In the latter case, the semisimplification of ρ¯f,λ is a direct sum of two 1-dimensional characters × , χ1 , χ2 : GK −→ kf,λ
whose product is the cyclotomic character × χ : GK −→ (Z/3Z)× ⊂ kf,λ
giving the action of GK on the 3th roots of unity.
444
HENRI DARMON
Let f be a Q-form over K in the sense of Definition 3.17, so that in particular its Fourier coefficients are defined over K. We say that f is r-Eisenstein if its associated r-adic representation ρf,r is residually reducible. Proposition 3.19. There exists a constant Cr+ depending only on r such that, for any first case solution (a, b, c) to equation (1) with p > Cr+ , one of the following holds: (1) the representation ρr+ is reducible, or (2) it is isomorphic to the mod p representation attached to an r-Eisenstein Q-form in S2 (r). Proof. Let g be any eigenform in S2 (r). If g is not a Q-form, then there exists a prime q of ᏻg and a σ ∈ GQ such that aq(g)σ = aqσ (g). If g is a Q-form but is not r-Eisenstein, then there is a prime q of K such that r does not divide aq(g) − N q − 1. In either case, one has aq(g) = aq(f ), for all modular forms f that correspond to a Jr+ (t) with t ∈ Q. Indeed, such an f is a Q-form and is r-Eisenstein by Theorem 2.6. If f ≡ g for some prime p of ᏻg K above p, then taking norms gives p divides NormKg K/Q aq(g) − aq(f ) = 0. Let dg := [Kg : Q]. Applying the Weil bounds, one finds NormK K/Q aq(g) − aq(f ) ≤ 16 NormK/Q (q) (r−1)dg /4 , g so that
(r−1)dg /4 p ≤ Cg := 16 NormK/Q (q) .
In particular, if p > Cg , the representation ρr+ is not equivalent to ρ¯g,p for any prime of ᏻg K above p. Now set Cr+ := maxg Cg , where the maximum is taken over all eigenforms g in S2 (r) which are either not Q-forms or are not r-Eisenstein. If p > Cr+ and (a, b, c) is a first case solution to x p + y p = zr and if the associated representation ρr+ is irreducible, then it is associated by Theorem 3.5 to a Hilbert modular eigenform in S2 (r). This form must be an r-Eisenstein Q-form by the choice of Cr+ . In light of Proposition 3.19, it becomes important to understand whether there exist r-Eisenstein Q-forms in S2 (r). Proposition 3.20. Suppose that r is a regular prime. Then there are no rEisenstein Q-forms over K of level 1 or r. Proof. Suppose on the contrary that f is a Q-form in S2 (r) and that ρf,r is residually reducible. Let χ1 and χ2 be the characters of GK which occur in the ¯ for some (and hence all) GK -stable lattices H in V . Because semisimplification of H,
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
445
f is a Q-form, it follows that χ1 and χ2 are powers of the cyclotomic character χ with values in !±1" ⊂ F× r . Furthermore, χ1 χ2 = χ. Hence, we may assume without loss of generality that χ1 = 1 and χ2 = χ. By [Ri1, Prop. 2.1], there exists a GK -stable lattice H for which χ A0 1 A0 , (14) ρ¯f,r 1 = 0 χ2 0 χ and it is not semisimple. This implies that A := A0 /χ is a nontrivial cocycle in H 1 (K, Z/rZ(−1)). Proposition 3.20 now follows from the next lemma. Lemma 3.21. The cocycle A is unramified. Proof. The cocycle A is unramified at all places v = r because v does not divide the level of f . It is also unramified at r: If f is of level 1, this is because ρ¯f,r comes from a finite flat group scheme over K. If f is of level r, then, by [W2, Th. 2], the restriction of the representation ρ¯f,r to a decomposition group Dr at r is of the form χ A . ρ¯f,r|Dr 0 1 But the restriction of χ to Dr is nontrivial. Comparing the equation above to equation (14), it follows that the local representation ρ¯f,r|Dr splits. Therefore, the cocycle A is locally trivial at r. This completes the proof of Lemma 3.21. Proposition 3.20 now follows directly: The cocycle A cuts out an unramified cyclic extension of Q(ζr ) of degree r, which does not exist if r is a regular prime. Theorem 3.22. Let r be a regular prime. Then there exists a constant Cr+ (depending only on r) such that, for all p > Cr+ and all first case solutions (a, b, c) to x p + y p = zr , the mod p representation associated to Jr+ (a, b, c) is reducible. Proof. Combine Propositions 3.19 and 3.20. Remarks. (1) The value of the constant Cr+ depends on the structure of the space of Hilbert modular forms over Q(cos(2π/r)) of level r. It would be possible in principle to write down a crude estimate for Cr+ by using the Chebotarev density theorem and known estimates for the size of fourier coefficients of Hilbert modular eigenforms, but we have not attempted to do this. (2) The consideration of r-Eisenstein Q-forms so crucial for the proof of Theorem 3.22 is only likely to be of use in studying first case solutions. Indeed, there typically exist r-Eisenstein Q-forms on S2 (r3 ); for example, the base change lifts from Q to K of certain r-Eisenstein forms on X0 (r 2 ) or (more germane to the present discussion) the form in S2 (r3 ) associated to the CM abelian variety Jr+ (1, −1, 0). (3) The arguments based on r-Eisenstein Q-forms yield no a priori information about the Galois representations ρr− , since the mod r representation attached to Jr− (a, b, c) is irreducible. (It is isomorphic to a twist of the representation coming
446
HENRI DARMON
from the r-torsion of the Frey curve y 2 = x(x − a p )(x + b p ), by Theorem 2.6.) Nonetheless, one can still show the following theorem. Theorem 3.23. Assume further that K has class number 1. Then (1) Jr+ (a, b, c) is isogenous to an abelian variety having a rational point of order p; (2) there exists a further constant Cr− such that if p > Cr− , the abelian variety Jr− (a, b, c) is isogenous to an abelian variety having a rational point of order p. Proof. The proof of (1) is the same as for Proposition 3.11, and (2) follows from the same reasoning as for Theorem 3.13. 4. Torsion points on abelian varieties. Ultimately, one wishes to extract a contradiction from theorems like Theorems 3.10, 3.13, 3.14, 3.16, 3.22, and 3.23 by proving that when p is large enough (relative to r perhaps), the image of ρr± is large; for example, that this image contains SL2 (F) or, at the very least, that the abelian varieties Jr± (a, b, c), when semistable, cannot contain a rational point of order p. The following folklore conjecture can be viewed as a direct generalization of a conjecture of Mazur for elliptic curves. Conjecture 4.1. Let E be a totally real field and K a number field. There exists a constant C(K, E) depending only on K and E, such that for any abelian variety A of GL2 -type with EndK (A) ⊗ Q = EndK¯ (A) ⊗ Q E, and all primes p of E of norm greater than C(K, E), the image of the mod p representation associated to A contains SL2 (F). This conjecture seems difficult. The set of abelian varieties of GL2 -type with End(A) ⊗ Q E is parametrized by a d-dimensional Hilbert modular variety, and very little is known about the Diophantine properties of these varieties. When r = 2 and r = 3, one has K = E = Q since the representations ρr± arise from elliptic curves. Much of Conjecture 4.1 can be proved thanks to the ideas of Mazur [Ma1], [Ma2]. • Theorem 8 of [Ma1] implies that the image of ρr± is not contained in a Borel subgroup of GL2 (Fp ) when p > 5. • A result of Momose [Mo] building on the ideas in [Ma1] implies that this image is not contained in the normalizer of a split Cartan subgroup if p > 17. • Finally, a result of Merel and the author [DMr] implies that the image of ρr+ is not contained in the normalizer of a nonsplit Cartan subgroup. (We were unable to prove a similar result for ρr− .) Combining these results with an ad hoc study (carried out by Bjorn Poonen [Po], using traditional descent methods) of the equations x p + y p = zr (r = 2, 3) for small values of p yields the desired contradiction. Thus, the main result of [DMr] provides an (essentially) complete analogue of Fermat’s last theorem for equation (1) when
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM
447
r = 2 or 3, which one would like to emulate for higher values of r. Theorem 4.2 [DMr]. (1) The equation x p + y p = z2 has no nontrivial primitive solutions when p ≥ 4. (2) Assume the Shimura-Taniyama conjecture. Then the equation x p +y p = z3 has no nontrivial primitive solutions when p ≥ 3. Return now to the case r > 3. The following special case of Conjecture 4.1, which is sufficient for the applications to equation (1), seems more tractable. Conjecture 4.3. There exists a constant Br depending only on r, such that for any t ∈ Q and all primes p of K = Q(ζr )+ of norm greater than Br , the image of the mod p representation of GK associated to Jr± (t) is neither contained in a Borel subgroup nor in the normalizer of a Cartan subgroup of GL2 (F). A natural approach to this conjecture is to study the curves X0± (p), Xs± (p), and which classify the abelian varieties Jr± (t) with a rational subgroup, a “normalizer of split Cartan subgroup structure,” and a “normalizer of nonsplit Cartan subgroup structure” on the p-division points, where p is an ideal of the field K. For the moment, we know very little about the arithmetic of these curves, except when r = 2 and r = 3 when they are closely related to classical modular curves. When r > 3, they appear as quotients of the upper half-plane by certain nonarithmetic Fuchsian groups described in [CW]. We content ourselves here with giving a formula for the genus of these curves. Let J = ±1 be defined by the condition Np ≡ J (mod r). ± (p ) Xns
Lemma 4.4. (1) The genus of X0± (p) is equal to 1 1 2 1 J 1− − Np − 1− . 2 r p 2 r (2) The genus of Xs± (p) is equal to 1 1 2 1 J +1 1− − Np(Np + 1) − 1− + 1. 4 r p 4 r ± (p) is equal to (3) The genus of Xns 1 1 2 1 J −1 1− − Np(Np − 1) + 1− + 1. 4 r p 4 r
Proof. The curves above are branched coverings of the projective line with known degrees and ramification structure. The calculation of the genus follows by a direct application of the Riemann-Hurwitz genus formula. Example. When r = 5 and p = (3), one finds that the curves X0− (3) and X0+ (3) are of genus 1; that is, they are elliptic curves over Q. A direct calculation reveals that X0+ (3) is an elliptic curve of conductor 15, denoted by 15E in Cremona’s tables. √ By looking up the curve 15E twisted by Q( 5), one finds that 15E has finite
448
HENRI DARMON
√ Mordell-Weil group over Q( 5). Does J0+ (p) always have a nonzero quotient with finite Mordell-Weil group over Q(ζr )+ , at least when p is large enough? References [Be] [By] [BR] [Ca] [CW] [CDT] [CCNPW]
[CS] [Da1] [Da2] [DDT]
[DK] [DMr] [DMs] [Di1] [Di2] [Ell] [Fr] [Fre] [Fu] [He]
[Hu] [Ja]
S. Beckmann, On extensions of number fields obtained by specializing branched coverings, J. Reine Angew. Math. 419 (1991), 27–53. G. V. Bely˘i, On extensions of the maximal cyclotomic field having a given classical Galois group, J. Reine Angew. Math. 341 (1983), 147–156. D. Blasius and J. D. Rogawski, Motives for Hilbert modular forms, Invent. Math. 114 (1993), 55–87. H. Carayol, Sur les représentations 3-adiques associées aux formes modulaires de Hilbert, Ann. Sci. École Norm. Sup. (4) 19 (1986), 409–468. P. Cohen and J. Wolfart, Modular embeddings for some nonarithmetic Fuchsian groups, Acta Arith. 56 (1990), 93–110. B. Conrad, F. Diamond, and R. Taylor, Modularity of certain potentially Barsotti-Tate Galois representations, J. Amer. Math. Soc. 12 (1999), 521–567. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker, and R. A. Wilson, Atlas of Finite Groups: Maximal Subgroups and Ordinary Characters for Simple Groups, Clarendon Press, Oxford, 1985. J. H. Conway and N. J. A. Sloane, Sphere Packings, Lattices and Groups, 2d ed., Grundlehren Math. Wiss. 290, Springer, New York, 1993. H. Darmon, The equations x n +y n = z2 and x n +y n = z3 , Internat. Math. Res. Notices 1993, 263–274. , Faltings plus epsilon, Wiles plus epsilon, and the generalized Fermat equation, C. R. Math. Rep. Acad. Sci. Canada 19 (1997), 3–14. H. Darmon, F. Diamond, and R. Taylor, “Fermat’s last theorem” in Current Developments in Mathematics (Cambridge, Mass., 1995), International Press, Cambridge, Mass., 1994, 1–154. H. Darmon and A. Kraus, On the equations x r + y r = zp , in preparation. H. Darmon and L. Merel, Winding quotients and some variants of Fermat’s last theorem, J. Reine Angew. Math. 490 (1997), 81–100. H. Darmon and J-F. Mestre, Courbes hyperelliptiques à multiplications réelles et une construction de Shih, CICMA preprint; to appear in Canad. Math. Bull. F. Diamond, On deformation rings and Hecke rings, Ann. of Math. (2) 144 (1996), 137–166. , The Taylor-Wiles construction and multiplicity one, Invent. Math. 128 (1997), 379–391. J. Ellenberg, Hilbert modular forms and the Galois representations associated to Hilbert-Blumenthal abelian varieties, Ph.D. thesis, Harvard University, 1998. E. Freitag, Hilbert Modular Forms, Springer, Berlin, 1990. G. Frey, “Links between solutions of A−B = C and elliptic curves” in Number Theory (Ulm, 1987), Lecture Notes in Math. 1380, Springer, New York, 1989, 31–62. K. Fujiwara, Deformation rings and Hecke algebras in the totally real case, preprint. E. Hecke, Die eindeutige Bestimmung der Modulfunktionen q-ter Stufe durch algebraische Eigenschaften, Math. Ann. 111 (1935), 293–301; in Mathematische Werke, 3d ed., Vandenhoeck & Ruprecht, Göttingen, 1983, 568–576. B. Huppert, Endliche Gruppen, I, Grundlehren Math. Wiss. 134, Springer, Berlin, 1967. F. Jarvis, On Galois representations associated to Hilbert modular forms, J. Reine Angew. Math. 491 (1997), 199–216.
HILBERT MODULAR FORMS AND FERMAT’S LAST THEOREM [Ka] [Ma1] [Ma2] [Me]
[Mo] [Po] [Ra] [Ri1] [Ri2] [Se1] [Se2] [Se3] [Se4] [SW1] [SW2] [SW3] [TTV] [Tay] [TW] [Vi] [We] [W1] [W2] [W3]
449
N. Katz, Exponential Sums and Differential Equations, Ann. of Math. Stud. 124, Princeton Univ. Press, Princeton, 1990. B. Mazur, Modular curves and the Eisenstein ideal, Inst. Hautes Études Sci. Publ. Math. 47 (1977), 33–186. , Rational isogenies of prime degree, Invent. Math. 44 (1978), 129–162. J.-F. Mestre, “Familles de courbes hyperelliptiques à multiplications réelles” in Arithmetic Algebraic Geometry (Texel, 1989), Progr. Math. 89, Birkhäuser, Boston, 1991, 193–208. F. Momose, Rational points on the modular curves Xsplit (p), Compositio Math. 52 (1984), 115–137. B. Poonen, Some Diophantine equations of the form x n + y n = zm , Acta Arith. 86 (1998), 193–205. A. Rajaei, On levels of mod 3 Hilbert modular forms, Ph.D. thesis, Princeton University, 1998. K. Ribet, A modular construction of unramified p-extensions of Q(µp ), Invent. Math. 34 (1976), 151–162. ¯ , On modular representations of Gal(Q/Q) arising from modular forms, Invent. Math. 100 (1990), 431–476. J.-P. Serre, Propriétés galoisiennes des points d’ordre fini des courbes elliptiques, Invent. Math. 15 (1972), 259–331. , Sur les représentations modulaires de degré 2 de Gal(Q/Q), Duke Math. J. 54 (1987), 179–230. , Topics in Galois Theory, Res. Notes Math. 1, Jones and Bartlett, Boston, 1992. , Galois Cohomology, rev. ed., Springer, Berlin, 1997. C. Skinner and A. Wiles, Ordinary representations and modular forms, Proc. Nat. Acad. Sci. U.S.A. 94 (1997), 10520–10527. , Nearly ordinary deformations of irreducible residual representations, to appear. , Residually reducible representations and modular forms, to appear. W. Tautz, J. Top, and A. Verberkmoes, Explicit hyperelliptic curves with real multiplication and permutation polynomials, Canad. J. Math. 43 (1991), 1055–1064. R. Taylor, On Galois representations associated to Hilbert modular forms, Invent. Math. 98 (1989), 265–280. R. Taylor and A. Wiles, Ring-theoretic properties of certain Hecke algebras, Ann. of Math. (2) 141 (1995), 553–572. M.-F. Vignéras, Arithmétique des algèbres de quaternions, Lecture Notes in Math. 800, Springer, Berlin, 1980. D. Weisser, The arithmetic genus of the Hilbert modular variety and the elliptic fixed points of the Hilbert modular group, Math. Ann. 257 (1981), 9–22. A. Wiles, On p-adic representations for totally real fields, Ann. of Math. (2) 123 (1986), 407–456. , On ordinary λ-adic representations associated to modular forms, Invent. Math. 94 (1988), 529–573. , Modular elliptic curves and Fermat’s last theorem, Ann. of Math. (2) 141 (1995), 443–551.
Department of Mathematics and Statistics, McGill University, 805 Sherbrooke Street West, Montréal, Québec H3A 2K6, Canada;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
ON FAMILY RIGIDITY THEOREMS, I KEFENG LIU and XIAONAN MA 0. Introduction. Let M, B be two compact smooth manifolds, and let π : M → B be a submersion with compact fiber X. Assume that a compact Lie group G acts fiberwise on M, that is, the action preserves each fiber of π. Let P be a family of elliptic operators along the fiber X, commuting with the action of G. Then the family index of P is (0.1)
Ind(P ) = Ker P − Coker P ∈ KG (B).
Note that Ind(P ) is a virtual G-representation. Let chg (Ind(P )) with g ∈ G be the equivariant Chern character of Ind(P ) evaluated at g. In this paper, we first prove a family fixed-point formula that expresses chg (Ind(P )) in terms of the geometric data on the fixed points X g of the fiber of π. Then by applying this formula, we generalize the Witten rigidity theorems and several vanishing theorems proved in [Liu3] for elliptic genera to the family case. Let G = S 1 . A family elliptic operator P is called rigid on the equivariant Chern character level with respect to this S 1 -action, if chg (Ind(P )) ∈ H ∗ (B) is independent of g ∈ S 1 . When the base B is a point, we recover the classical rigidity and vanishing theorems. When B is a manifold, we get many nontrivial higher-order rigidity and vanishing theorems by taking the coefficients of certain expansion of chg . For the history of the Witten rigidity theorems, we refer the reader to [T], [BT], [K], [L2], [H], [Liu1], and [Liu4]. The family vanishing theorems that generalize those vanishing theorems in [Liu3], which in turn give us many higher-order vanishing theorems in the family case. In a forthcoming paper, we extend our results to general loop group representations and prove much more general family vanishing theorems that generalize the results in [Liu3]. We believe there should be some applications of our results to topology and geometry, which we hope to report on a later occasion. This paper is organized as follows. In Section 1, we prove the equivariant family index theorem. In Section 2, we prove the family rigidity theorem. In the last part of Section 2, motivated by the family rigidity theorem, we state a conjecture. In Section 3, we generalize the family rigidity theorem to the nonzero anomaly case. As corollaries, we derive several vanishing theorems. Acknowledgements. We would like to thank W. Zhang for many interesting discussions. Part of this work was done while the second author was visiting the Institut des Hautes Études Scientifiques. He would like to thank Professor J. P. Bourguignon and IHES for their hospitality. Received 23 April 1999. 1991 Mathematics Subject Classification. Primary 14D05; Secondary 57R91. 451
452
LIU AND MA
1. Equivariant family index theorem. The purpose of this section is to prove an equivariant family index theorem. As pointed out by Atiyah and Singer, we can introduce equivariant families by proceeding as in [AS1] and [AS2]. Here we prove it directly by using the local index theory as developed by Bismut. This section is organized as follows: In Section 1.1, we state our main result, Theorem 1.1. In Section 1.2, by using the local index theory, we prove Theorem 1.1. 1.1. The index bundle. Let M, B be two compact manifolds, let π : M → B be a fibration with compact fiber X, and assume that dim X = 2k. Let T X denote the relative tangent bundle. Let W be a complex vector bundle on M and let hW be a Hermitian metric on W . Let hT X be a Riemannian metric on T X and let ∇ T X be the corresponding LeviCivita connection on T X along the fiber X. Then the Clifford bundle C(T X) is the bundle of Clifford algebras over M whose fiber at x ∈ M is the Clifford algebra C(Tx X) of (T X, hT X ). We assume that the bundle T X is spin as a bundle on M. Let = + ⊕ − be the spinor bundle of T X. We denote by c(·) the Clifford action of C(T X) on . Let ∇ be the connection on induced by ∇ T X . Let ∇ W be a Hermitian connection on (W, hW ) with curvature R W . Let ∇ ⊗W be the connection on ⊗W along the fiber X: ∇ ⊗W = ∇ ⊗ 1 + 1 ⊗ ∇ W .
(1.1)
For b ∈ B, we denote by Eb , E±,b the set of Ꮿ∞ -sections of ⊗W , ± ⊗W over the fiber Xb . We regard the Eb as the fiber of a smooth Z2 -graded infinite-dimensional vector bundle over B. Smooth sections of E over B are identified to smooth sections of ⊗ W over M. Let {ei } be an orthonormal basis of (T X, hT X ); let {ei } be its dual basis. Definition 1.1. Define the twisted Dirac operator to be c(ei )∇e⊗W (1.2) . DX = i i
DX
Then is a family Dirac operator that acts fiberwise on the fibers of π. For b ∈ B, DbX , denote the restriction of D X to the fiber Eb . D X interchanges E+ and E− . Let X be the restrictions of D X to E . By Atiyah and Singer [AS2], the difference D± ± bundle over B, X X Ind D X = Ker D+,b (1.3) − Ker D−,b , is well defined in the K-group K(B). Now let G be a compact Lie group that acts fiberwise on M. We consider that G acts as identity on B. Without loss of generality, we can assume that G acts on (T X, hT X ) isometrically. We also assume that the action of G lifts to and W , and that the G-action commutes with ∇ W .
ON FAMILY RIGIDITY THEOREMS, I
453
In this case, we know that Ind(D X ) ∈ KG (B). Now we start to give a proof of a local family fixed-point formula that extends [AS2, Proposition 2.2]. with j = 1, . . . , r, a finite number of sections Proposition 1.1. There exist Vj ∈ G (sij +1 , . . . , sij +1 ) with ij +1 −ij = dim Vj of Ꮿ∞ (B, E− ) such that we can find a basis {ej,l } of Vj , under which the map D +,b : Ꮿ∞ (B, E+,b ) ⊕ ⊕rj =1 Vj → Ꮿ∞ (B, E−,b ) given by X X (1.4) D +,b s + "j,l λj,l ej,l = D+ s + "j,l λj,l sij +l X
is G-equivariant and surjective. The vector spaces Ker D +,b form a G-vector bundle X X Ker D + on B, and the element [Ker D + ] − ⊕rj =1 Vj ∈ KG (B) depends only on D X and not on the choice of {Vj } and the sections {si }. Proof. Given b0 ∈ B, we can find a > 0 and a ball U (b0 ) ⊂ B around b0 , such that for any b ∈ U (b0 ), a is not an eigenvalue of DbX,2 . [0,a[ [0,a[ ⊕E−b be the direct sum of the eigenspaces of DbX,2 associated Let Eb[0,a[ = E+b to the eigenvalues λ ∈ [0, a[. By [BeGeV, Proposition 9.10], E [0,a[ forms a finitedimensional subbundle E [0,a[ ⊂ E over U (b0 ). Clearly, E [0,a[ is a G-vector bundle on U (b0 ). By [S, Proposition 2.2], we have an isomorphism of vector bundles on B, [0,a[ E [0,a[ = ⊕V ∈G (1.5) ⊗V, Hom G V , E denotes the space of all irreducible representations of G. We can also find where G [0,a[ ti,k ∈ Ꮿ∞ (U (b0 ), HomG (V , E− )) such that for b ∈ U (b0 ), the elements ti,l form a [0,a[ basis of HomG (V , E− )b . Let {ei,l } be a basis of Vi . Then we can choose the sections [0,a[ ti,k ei,l ∈ Ꮿ∞ (B, E− ) to be our si . This proves the first part of the proposition locally. The global version now follows easily by extending the above local sections of Ꮿ∞ (U (b0 ), E− ) together with a use of the partition of unity argument. This is essentially the same as the proof of [AS2, Proposition 2.2]. By [S, Proposition 2.2], we have X (1.6) ⊗V Ind D X = ⊕V ∈G Hom G V , Ind D and HomG (V , Ind(D X )) ∈ K(B). We denote by (Ind(D X ))G ∈ K(B) the G-invariant part of Ind(D X ). By composing the action of G and the Chern character of HomG (V , Ind(D X )), we get the equivariant Chern character chg (Ind(D X )) ∈ H ∗ (B). Definition 1.2. We say that the operator D X is rigid on the equivariant Chern character level if chg (Ind(D X )) is constant on g ∈ G. More generally, we say that D X is rigid on the equivariant K-Theory level if Ind(D X ) = (Ind(D X ))G . In the rest of this paper, when we say that D X is rigid, we always mean that D X is rigid on the equivariant Chern character level.
454
LIU AND MA
Now let us calculate the equivariant Chern character chg (Ind(D X )) in terms of the fixed-point data of g. Let T H M be a G-equivariant subbundle of T M such that T M = T H M ⊕ T X.
(1.7)
Let P T X denote the projection from T M to T X. If U ∈ T B, let U H denote the lift of U in T H M, so that π∗ U H = U . Let hT B be a Riemannian metric on B, and assume that W has the Riemannian metric hT M = hT X ⊕ π ∗ hT B . Note that our final results are independent of hT B . Let ∇ T M , ∇ T B denote the corresponding Levi-Civita connections on M and B. Put ∇ T X = P T X ∇ T M , which is a connection on T X. As shown in [B1, Theorem 1.9], ∇ T X is independent of the choice of hT B . Now the connection ∇ T X is well defined on T X and on M. Let R T X be the corresponding curvature. We denote by ∇ and ∇ ⊗W the corresponding connections on and ⊗W induced by ∇ T X and ∇ W . Take g ∈ G and set (1.8)
M g = {x ∈ M, gx = x}.
Then π : M g → B is a fibration with compact fiber X g . By [BeGeV, Proposition 6.14], T X g is naturally oriented in M g . Let N denote the normal bundle of M g ; then N = T X/T Xg . We denote the differential of g by dg, which gives a bundle isometry dg : N → N. Since g lies in a compact abelian Lie group, we know that there is an orthogonal decomposition N = N(π ) ⊕ ⊕0<θ <π N(θ ), where dg|N(π) = − id, and for each θ, 0 < θ < π, N(θ) is a complex vector bundle on which dg acts by multiplication by eiθ , and dim N(π) is even. So N(π ) is also naturally oriented. As the Levi-Civita connection ∇ T M preserves the decomposition T M = T M g ⊕0<θ ≤π N(θ ), the connection ∇ T X also preserves the decomposition T X = T Xg g ⊕0<θ ≤π N(θ ) on M g . Let ∇ T X , ∇ N , ∇ N (θ) be the corresponding induced connecg tions on T X g , N, N(θ ), and let R T X , R N , R N (θ) be the corresponding curvatures. Here we consider N(θ ) as a real vector bundle. Then we have the decompositions: (1.9)
g
RT X = RT X ⊕ RN ,
R N = ⊕θ R N (θ) .
Definition 1.3. For 0 < θ ≤ π, we write −R W , chg W, ∇ W = Tr g exp 2πi g (i/4π)R T X g g T X 1/2 (1.10) = det A T X ,∇ g , sinh (i/4π)R T X θ N(θ ), ∇ N(θ ) = A
i (1/2) dim N (θ) det 1/2
1
. 1 − g exp (i/2π)R N (θ)
ON FAMILY RIGIDITY THEOREMS, I
455
X g ), A θ (N (θ)) denote the corresponding cohomology classes on M g . Let chg (W ), A(T If we denote by {xj , −xj } (j = 1, . . . , l) the Chern roots of N(θ), T Xg such that .xj define the orientation of N(θ) and T X g , then
(1.11)
.j xj /2 T Xg = A , sinh(xj /2) 1 e(1/2)(xj +iθ) θ N(θ ) = 2−l .l = .lj =1 x +iθ . A j =1 sinh 1/2(xj + iθ) ej −1
We denote by π∗ : H ∗ (M g ) → H ∗ (B) the intergration along the fiber X g . Theorem 1.1. We have the following identity in H ∗ (B):
T X g chg (W ) . θ N(θ) A (1.12) chg Ind D X = π∗ .0<θ≤π A 1.2. A heat kernel proof of Theorem 1.1. As Atiyah and Singer indicated in the end of [AS2], we can proceed as in [AS1] and [AS2] to introduce an equivariant family, and then to find a formula for the equivariant Chern character of the index bundle. Here we use a different approach by combining the local relative index theory and the equivariant technique to give a direct proof of the local version of Theorem 1.1. We denote by 0 ∇ = ∇ T X ⊕ π ∗ ∇ T B the connection on T M. Let S = ∇ T M − 0 ∇. By [B1, Theorem 1.9], S(·)·, ·hT M is a tensor independent of hT B . For U ∈ T H M, we define a horizontal 1-form k on M by k(U ) = (1.13) S(U )ei , ei . i
Definition 1.4. Let ∇ E denote the connection on E such that if U ∈ T B and s is a smooth section of E over B, then (1.14)
∇UE s = ∇U⊗W s. H
If U, V are smooth vector fields on B, we write
T U H , V H = −P T X U H , V H , (1.15) which is a tensor. Let f1 , . . . , fm be a basis of T B, and let f 1 , . . . , f m be the dual basis. Define (1.16)
1 c(T ) = "α,β f α f β c T fαH , fβH . 2
Definition 1.5. For t > 0, let At be the Bismut superconnection constructed in [B1, §3]: √ 1 1 (1.17) At = tD X + ∇ E + k − √ c(T ). 2 4 t
456
LIU AND MA
It is clear that At is also G-invariant. Let dvX denote the Riemannian volume element on the fiber X. Let 4 be the scaling homomorphism from 5(T ∗ B) into itself: ω → (2πi)−(deg ω)/2 ω. Theorem 1.2. For any t > 0, the form 4 Tr s [g exp(−A2t )] is closed and its cohomology class is independent of t and represents chg (Ind(D X )) in cohomology. Proof. Just proceed as in [B1, §2(d)]. Theorem 1.3. We have the following identity: (1.18)
lim 4 Tr s g exp − A2t t→0 T X g , ∇ T Xg .0<θ≤π A θ N(θ), ∇ N (θ) chg W, ∇ W . = A Xg
Proof. If A is a smooth section of T ∗ X ⊗ 5(T ∗ B) ⊗ End( ⊗ W ), we use the notation
2
∇e⊗W + A(ei ) i
=
2k i=1
2 2k T X + A(ei ) − ∇ ⊗W ∇e⊗W 2k T X − A "i=1 ∇ei ei . i "i=1 ∇e ei i
⊗W on the fiber X as given by Let ∇t be the connection on 5(T ∗ B)⊗ (1.19)
1 1 ∇t = ∇ ⊗W + √ S(·)ej , fαH c(ej )f α + S(·)fαH , fβH f α f β . 4t 2 t
Let K X denote the scalar curvature of the fiber (X, hT X ). By the Lichnerowicz formula [B1, Theorem 3.5], we get
(1.20)
2 t X t A2t = −t ∇t,e + K + c(ei )c(ej )R W (ei , ej ) i 4 2 √ 1 + tc(ei )f α R W ei , fαH + f α f β R W fαH , fβH . 2
Let Pu (x, x , b)(b ∈ B, x, x ∈ Xb ) be the smooth kernel associated to exp(−A2t ) with respect to dvX (x ). Then
4 Tr s g exp − A2t = (1.21) 4 Tr s gPt g −1 x, x, b dvX (x). X
By using standard estimates on the heat kernel, for b ∈ B, we can reduce the problem g of calculating the limit of (1.21) when t → 0 to an open neighbourhood ᐁε of Xb g in Xb . Using normal geodesic coordinates to Xb in Xb , we identify ᐁε to an εneighbourhood of Xg in NXg /X . We know that, if (x, z) ∈ NXg /X with x ∈ X g ,
ON FAMILY RIGIDITY THEOREMS, I
457
then g −1 (x, z) = x, g −1 z .
(1.22)
Let dvXg (x), dvNXg /X,x with x ∈ X g be the corresponding volume forms on T Xg and NXg /X induced by hT X . Let k(x, z)(x ∈ X g , z ∈ NXg /X , |z| < ε) be defined by dvX = k(x, z) dvXg (x) dvNXg /X (z).
(1.23) Then it is clear that
k(x, 0) = 1. By the discussion following (1.21) and (1.23), we get
lim 4 Tr s g exp − A2t t→0
= lim 4 Tr s gPt g −1 x, x dvX (x) t→0 ᐁε/8 (1.24)
4 Tr s gPt g −1 (x, Y ), (x, Y ) = lim t→0 x∈X g
|Y |≤ε/8,Y ∈NXg /X
× k(x, Y ) dvXg (x) dvNXg /X (Y ). g
By taking x0 ∈ Xb and using the finite propagation speed as in [B2, §11b], we may assume that in Xb we have the identification (T X)x0 R2k with 0 ∈ R2k representing x0 , and that the extended fibration over R2k coincides with the given fibration restricted to B(0, ε). ⊗W by parallel transport Take any vector Y ∈ R2k . We can trivialize 5(T ∗ B)⊗ along the curve u → uY with respect to ∇t . Let ρ(Y ) be a Ꮿ∞ -function over R2k , which is equal to 1 if |Y | ≤ ε/4, and equal to 0 if |Y | ≥ ε/2. Let T X be the ordinary Laplacian operator on (T X)x0 . Let Hx0 be ⊗W )x0 over (T X)x0 . the vector space of smooth sections of the bundle (5(T ∗ B)⊗ For t > 0, let L1t be the operator acting on Hx0 : (1.25)
L1t = 1 − ρ 2 (Y ) − tT X + ρ 2 (Y )A2t .
For t > 0, s ∈ Hx0 , we write (1.26)
Y Ft s(Y ) = s √ , t
L2t = Ft−1 L1t Ft .
Let {e1 , . . . , e2l } be an orthonormal basis of (T Xg )x0 , and let {e2l +1 , . . . , e2k } be an 2 orthonormal basis of NXg /X,x0 . Let L3t be the operator obtained from √ Lt by √ replacing j the Clifford variables c(ej ) with 1 ≤ j ≤ 2l by the operators (e / t) − t iej .
458
LIU AND MA
Let Pti (Y, Y ) with Y, Y ∈ (T X)x0 and |Y | < ε/4, i = 1, 2, 3 be the smooth kernel associated to exp(−Lit ) with respect to the volume element dvT Xx0 (Y ). By using the finite propagation speed method, there exist c, C > 0, such that for Y ∈ NXg /X,x0 , |Y | ≤ ε/8, and t ∈]0, 1], we have −1 Pt g Y, Y k(x0 , Y ) − P 1 g −1 Y, Y ≤ c exp − C . (1.27) t t2
For α ∈ C(ej , iej )(1≤j ≤2l ) , let [α]max ∈ C be the coefficient of e1 ∧ · · · ∧ e2l in the expansion of α. Then, as in [B2, Proposition 11.12], if Y ∈ NXg /X , (1.28) −1 max
1 −1 (1/2) dim Xg (−1/2) dim NXg /X 3 g Y Y t Tr s gPt . Tr s gPt g Y, Y = (−2i) √ ,√ t t T X , R W , . . . be the corresponding restrictions of R T X , R W , . . . to M g . Let Let R|M g |M g ∇ej be the ordinary differentiation operator on (T X)x0 in the direction ej . By [ABoP, Proposition 3.7] and (1.20), we have, as t → 0,
L3t −→ L30 = −
(1.29)
2k
∇ej +
j =1
2 1 TX W R|M g Y, ej + R|M g. 4
By proceeding as in [B2, §11g–§11i], we obtain the following: there exist some constants γ > 0, c > 0, C > 0, r ∈ N such that for t ∈]0, 1] and Y, Y ∈ (T X)x0 , we have 3 P Y, Y ≤ c 1 + |Y | + Y r exp − C Y − Y 2 , t (1.30) 3 P − P 3 Y, Y ≤ ct γ 1 + |Y | + Y r exp − C Y − Y 2 . t
0
From (1.28) and (1.30), we get (1.31) lim
t→0
|Y |≤ε/8 Y ∈NXg /X
4 Tr s gPt1 g −1 Y, Y dvNXg /X (Y )
√ (−2i)(1/2) dim X t→0 |Y |≤ε/8 t Y ∈NXg /X
= lim
=
NXg /X
g
max
4 Tr s gPt3 g −1 Y, Y dvNXg /X (Y )
max
g (−2i)(1/2) dim X 4 Tr s gP03 g −1 Y, Y dvNXg /X (Y ).
Now we define (1.32)
A=−
2k
∇ej +
j =1
2 1 TX R|M g Y, ej . 4
ON FAMILY RIGIDITY THEOREMS, I
459
By Mehler’s formula [G], the smooth kernel q(Y, Y ) for Y, Y ∈ T X associated to exp(−A) is given by R T X/2 1 R T X/2 −k 1/2 exp − Y, Y q Y, Y = (4π ) det 4 tanh R T X/2 sinh R T X/2 (1.33) 1 R T X/2 R T X/2 1 TX Y , Y + eR /2 Y, Y . − 4 tanh R T X/2 2 sinh R T X/2 From (1.9) and (1.33), we deduce for Y ∈ NXg /X , −1 R T X/2 −k 1/2 q g Y, Y = (4π ) det sinh R T X/2 (1.34) N R N/2 −1 R N/2 1 cosh R /2 − e g Y, Y . × exp − 2 sinh R N/2 On the other hand, for Y ∈ N(θ), we have N N/2 R N eR /2 1 R N N g −1 Y, Y = eR /2 g −1 + e−R /2 g Y, Y . (1.35) 2 sinh R N/2 sinh R N/2 2 It is easy to see that N 1 N R 1 N N N − eR /2 g −1 + e−R /2 g = 1 − g −1 eR /2 − e−R /2 g . (1.36) cosh 2 2 2 From (1.9), (1.34)–(1.36), we get (1.37) NXg /X
q g −1 Y, Y dvNXg /X (Y )
= (4π )
g −1 R T X /2 −1 1/2 −R N det 1/2 1− g|N . det 1− ge g sinh R T X /2
−(1/2) dim X g
det
1/2
We may and do assume that on the basis {em }2l +1≤m≤2k , the matrix of g has diagonal blocks cos(θj ) − sin(θj ) , 0 < θj ≤ π. sin(θj ) cos(θj ) Then one verifies easily that the action of g on is given by θj θj g = .l +1≤j ≤k cos + sin c(e2j −1 )c(e2j ) . (1.38) 2 2
460
LIU AND MA
By (1.29) and (1.38), we know that
(1.39)
Tr s gP03 g −1 Y, Y −1 θj W = .l +1≤j ≤k − 2i sin Tr g exp − R|M q g Y, Y . g 2
From (1.24), (1.27), (1.31), (1.37), and (1.39), we finally arrive at the wanted formula (1.18). By Theorems 1.2 and 1.3, we now have the complete proof of Theorem 1.1. 2. Family rigidity theorem. This section is organized as follows. In Section 2.1, we state our main theorem of the paper: the family rigidity theorem. In Section 2.2, we prove it by using the equivariant family index theorem and the modular invariance. In Section 2.3, motivated by the family Witten rigidity theorem, we state a conjecture about a K-theory level rigidity theorem for elliptic genera. Throughout this section, we use the notation of Section 1 and take G = S 1 . 2.1. Family rigidity theorem. Let π : M → B be a fibration of compact manifolds with fiber X and dim X = 2k. We assume that the S 1 acts fiberwise on M, and T X has an S 1 -equivariant spin structure. As in [AH], by lifting to the double cover of S 1 , the second condition is always satisfied. Let V be a real vector bundle on M with structure group Spin(2l). Similarly, we can assume that V has an S 1 -equivariant spin structure without loss of generality. The purpose of this part is to prove the elliptic operators introduced by Witten [W] are rigid in the family case, at least at the equivariant Chern character level. Let us recall them more precisely. For a vector bundle E on M, let
(2.1)
St (E) = 1 + tE + t 2 S 2 E + · · · , 5t (E) = 1 + tE + t 2 52 E + · · ·
be the symmetric and exterior power operations in K(M)[[t]]. Let ∞ @q (T X) = ⊗∞ n=1 5q n (T X) ⊗m=1 Sq m (T X),
(2.2)
∞ @q (T X) = ⊗∞ n=1 5−q n−1/2 (T X) ⊗m=1 Sq m (T X), ∞ @−q (T X) = ⊗∞ n=1 5q n−1/2 (T X) ⊗m=1 Sq m (T X).
We also define the following elements in K(M)[[q 1/2 ]]:
ON FAMILY RIGIDITY THEOREMS, I
461
∞ @q (T X | V ) = ⊗∞ n=1 5q n (V ) ⊗m=1 Sq m (T X),
(2.3)
∞ @q (T X | V ) = ⊗∞ n=1 5−q n−1/2 (V ) ⊗m=1 Sq m (T X), ∞ @−q (T X | V ) = ⊗∞ n=1 5q n−1/2 (V ) ⊗m=1 Sq m (T X), ∞ @∗q (T X | V ) = ⊗∞ n=1 5−q n (V ) ⊗m=1 Sq m (T X).
Let p1 (·)S 1 denote the first S 1 -equivariant Pontrjagin class, and let (V ) = + (V )⊕ − (V ) be the spinor bundle of V . In the following sections, we denote by D X ⊗ W the Dirac operator on ⊗ W as defined in Section 1. We also write dsX = D X ⊗(T X). The following theorem is the family analogue of the Witten rigidity theorems as proved in [BT], [T], and [Liu4]. Theorem 2.1. (a) The family operators dsX ⊗@q (T X), D X ⊗@q (T X), and D X ⊗ @−q (T X) are rigid. (b) If p1 (V )S 1 = p1 (T X)S 1 , then D X ⊗ (V ) ⊗ @q (T X | V ), D X ⊗ (+ (V ) − − (V )) ⊗ @∗q (T X | V ), D X ⊗ @q (T X | V ), and D X ⊗ @−q (T X | V ) are rigid. 2.2. Proof of the family rigidity theorem. For τ ∈ H = {τ ∈ C; Im τ > 0}, q = e2πiτ , let ∞ n−1/2 2πiv e .n=1 1 + q n−1/2 e−2πiv , θ3 (v, τ ) = c(q).∞ n=1 1 + q ∞ n−1/2 2πiv θ2 (v, τ ) = c(q).∞ e .n=1 1 − q n−1/2 e−2πiv , n=1 1 − q (2.4) ∞ n 2πiv θ1 (v, τ ) = c(q)q 1/8 2 cos(πv).∞ .n=1 1 + q n e−2πiv , n=1 1 + q e ∞ n 2πiv θ (v, τ ) = c(q)q 1/8 2 sin(πv).∞ .n=1 1 − q n e−2πiv n=1 1 − q e n be the classical Jacobi theta functions (see [Ch]), where c(q) = .∞ n=1 (1 − q ). Let g = e2πit ∈ S 1 be a generator of the action group. Let {Mα } be the fixed submanifolds of the circle action. Let π : Mα → B be a submersion with fiber Xα . We have the following equivariant decomposition of T X:
(2.5)
T X|Mα = N1 ⊕ · · · ⊕ Nh ⊕ T Xα .
Here Nγ is a complex vector bundle such that g acts on it by e2πimγ t . We denote the j Chern roots of Nγ by {2π ixγ }, and the Chern roots of T Xα ⊗R C by {±2πiyj }. We write dimC Nγ = d(mγ ) and dim Xα = 2kα . Similarly, let (2.6)
V|Mα = V1 ⊕ · · · ⊕ Vl0
be the equivariant decomposition of V restricted to Mα . Assume that g acts on Vv by j e2πinv t , where some nv may be zero. We denote the Chern roots of Vv by 2πiuv . Let us write dimR Vv = 2d(nv ).
462
LIU AND MA
For f (x) a holomorphic function, we denote by f (y)(T X g ) = .j f (yj ) the symmetric polynomial that gives characteristic class of T Xg , and we use the same notation for Nγ . Now we define some functions on C × H with values in H ∗ (B): (2.7)
θ1 (y, τ ) g −1 θ1 (xγ + mγ t, τ ) T X .γ i (Nγ ) , Fds (t, τ ) = "α π∗ 2πy θ(y, τ ) θ(xγ + mγ t, τ ) θ2 (y, τ ) g −1 θ2 (xγ + mγ t, τ ) T X .γ i (Nγ ) , FD (t, τ ) = "α π∗ 2πy θ(y, τ ) θ(xγ + mγ t, τ ) θ3 (xγ + mγ t, τ ) θ3 (y, τ ) F−D (t, τ ) = "α π∗ 2πy T Xg .γ i −1 (Nγ ) , θ(y, τ ) θ(xγ + mγ t, τ ) 2πiy V −k g .v θ1 (uv + nv t, τ )(Vv ) Fds (t, τ ) = i "α π∗ TX , θ (y, τ ) .γ θ(xγ + mγ t, τ )(Nγ ) 2πiy V −k g .v θ2 (uv + nv t, τ )(Vv ) FD (t, τ ) = i "α π∗ TX , θ (y, τ ) .γ θ(xγ + mγ t, τ )(Nγ ) .v θ3 (uv + nv t, τ )(Vv ) 2πiy V F−D T Xg , (t, τ ) = i −k "α π∗ θ (y, τ ) .γ θ(xγ + mγ t, τ )(Nγ ) 2πiy V −k+l g .v θ(uv + nv t, τ )(Vv ) FD ∗ (t, τ ) = i "α π∗ TX . θ(y, τ ) .γ θ(xγ + mγ t, τ )(Nγ )
By Theorem 1.1 and [LaM, page 238], we get, for t ∈ [0, 1] \ Q and g = e2πit , (2.8) Fds (t, τ ) = chg Ind dsX ⊗ @q (T X) , FD (t, τ ) = q −k/8 chg Ind D X ⊗ @q (T X) , F−D (t, τ ) = q −k/8 chg Ind D X ⊗ @−q (T X) , FdVs (t, τ ) = c(q)l−k q (l−k)/8 chg Ind D X ⊗ (V ) ⊗ @q (T X | V ) , FDV (t, τ ) = c(q)l−k q −k/8 chg Ind D X ⊗ @q (T X | V ) V (t, τ ) = c(q)l−k q −k/8 chg Ind D X ⊗ @−q (T X | V ) , F−D FDV ∗ (t, τ ) = (−1)l c(q)l−k q (l−k)/8 chg Ind D X ⊗ + (V )− − (V ) ⊗@∗q (T X | V ) . Considered as functions of (t, τ ), we can obviously extend these F and F V to meromorphic functions on C×H. Note that these functions are holomorphic in τ . The rigidity theorems are equivalent to the statement that these F and F V are independent of t. As explained in [Liu4], we prove it in two steps: (i) we show that these F, F V
ON FAMILY RIGIDITY THEOREMS, I
463
are doubly periodic in t; (ii) we prove they are holomorphic in t. Then it is trivial to see that they are constant in t. Lemma 2.1. (a) For a, b ∈ 2Z, Fds (t, τ ), FD (t, τ ), and F−D (t, τ ) are invariant under the action U : t −→ t + aτ + b.
(2.9)
V (t, τ ), and F V (t, τ ) are (b) If p1 (V )S 1 = p1 (T X)S 1 , then FdVs (t, τ ), FDV (t, τ ), F−D D∗ invariant under U .
Proof. Recall that we have the following transformation formulas of theta- functions (see [Ch]):
(2.10)
θ (t + 1, τ ) = −θ(t, τ ),
θ(t + τ, τ ) = −q −1/2 e−2πit θ(t, τ ),
θ1 (t + 1, τ ) = −θ1 (t, τ ),
θ1 (t + τ, τ ) = q −1/2 e−2πit θ1 (t, τ ),
θ2 (t + 1, τ ) = θ2 (t, τ ),
θ2 (t + τ, τ ) = −q −1/2 e−2πit θ2 (t, τ ),
θ3 (t + 1, τ ) = θ3 (t, τ ),
θ3 (t + τ, τ ) = q −1/2 e−2πit θ3 (t, τ ).
From these, for θv = θ, θ1 , θ2 , θ3 and (a, b) ∈ (2Z)2 , l ∈ Z, we get (2.11)
2 2 2 θv x + l(t + aτ + b), τ = e−πi(2lax+2l at+l a τ ) θv (x + lt, τ ),
which proves (a). To prove (b), note that since p1 (V )S 1 = p1 (T X)S 1 , we have (2.12)
2 2 "v,j ujv + nv t = "j (yj )2 + "γ ,j xγj + mγ t .
This implies the equalities (2.13)
"v,j nv ujv = "γ ,j mγ xγj , "γ m2γ d(mγ ) = "v n2v d(nv ),
which, together with (2.11), prove (b). For g = ac db ∈ SL2 (Z), we define its modular transformation on C × H by t aτ + b g(t, τ ) = , (2.14) . cτ + d cτ + d The two generators of SL2 (Z) are 0 −1 (2.15) S= , 1 0
T =
1 1 , 0 1
464
LIU AND MA
which act on C × H in the following way: S(t, τ ) =
(2.16)
t 1 ,− , τ τ
T (t, τ ) = (t, τ + 1).
Let Eτ be the scaling homomorphism from 5(T ∗ B) into itself: β → τ (1/2) deg β β. Lemma 2.2. (a) We have the following identities: Fds
(2.17) F−D
t 1 ,− τ τ t 1 ,− τ τ
= i k Eτ FD (t, τ ),
Fds (t, τ + 1) = Fds (t, τ ),
= i k Eτ F−D (t, τ ),
FD (t, τ + 1) = F−D (t, τ )e−(πi/4)k .
(b) If p1 (V )S 1 = p1 (T X)S 1 , then we have (2.18) 1 τ (l−k)/2 k V t ,− = i Eτ FDV (t, τ ), Fds τ τ i t 1 τ (l−k)/2 k V V ,− = i Eτ F−D (t, τ ), F−D τ τ i 1 τ (l−k)/2 k−l V t i Eτ FDV ∗ (t, τ), FD ∗ , − = τ τ i
FdVs (t, τ + 1) = e−(πi/4)(k−l) FdVs (t, τ ), V FDV (t, τ + 1) = e−(πi/4)k F−D (t, τ ),
FDV ∗ (t, τ + 1) = e−(πi/4)(k−l) FDV ∗ (t, τ).
Proof. By [Ch], we have the following transformation formulas for the Jacobi theta-functions: 1 τ πit 2 /τ θ(t, τ ), e i i 1 t τ πit 2 /τ θ1 ,− = e θ2 (t, τ ), τ τ i τ πit 2 /τ t 1 θ2 ,− = e θ1 (t, τ ), τ τ i t 1 τ πit 2 /τ θ3 ,− = e θ3 (t, τ ), τ τ i
θ
(2.19)
t 1 ,− τ τ
=
θ(t, τ + 1) = eπi/4 θ(t, τ ), θ1 (t, τ + 1) = eπi/4 θ1 (t, τ ), θ2 (t, τ + 1) = θ3 (t, τ ), θ3 (t, τ + 1) = θ2 (t, τ ).
The action of T on the functions F and F V are quite simple, and we leave the proof to the reader. Here we only check the action of S. By (2.19), we get
ON FAMILY RIGIDITY THEOREMS, I
(2.20) 1 t Fds ,− τ τ
465
θ1 (y, −1/τ ) g −1 θ1 xγ + mγ (t/τ , −1/τ ) (Nγ ) T X .γ i 2πy θ (y, −1/τ ) θ xγ + mγ (t/τ , −1/τ )
= " α π∗
k −kα
= "α i τ
θ1 (τy, τ ) g −1 θ1 τ xγ + mγ t, τ (Nγ ) . 2π τy T X .γ i θ(τy, τ ) θ τ xγ + mγ t, τ
π∗
If α is a differential form on B, we denote by {α}(p) the component of degree p of α. It is easy to see that (2.17) for Fds follows from the following identity: τ (2.21)
−kα
(2p) θ1 (τy, τ ) g −1 θ1 τ xγ + mγ t, τ (Nγ ) T X .γ i π∗ τy θ (τy, τ ) θ τ xγ + mγ t, τ (2p) θ1 (y, τ ) p g −1 θ1 xγ + mγ t, τ (Nγ ) T X .γ i = τ π∗ y . θ (y, τ ) θ xγ + mγ t, τ
By looking at the degree-2(p + kα ) part, that is, the (p + kα )th homogeneous terms of the polynomials in x and y, on both sides, we immediately get (2.21). From (2.7), (2.20), and (2.21), we obtain (2.22)
Fds
t 1 ,− τ τ
(2p)
(2p) = i k τ p FD (t, τ ) ,
which completes the proof of (2.17) for Fds . The other identities in (2.17) can be verified in the same way. By using (2.12), (2.19), and the same trick as in the proof of (2.17), we can obtain the identities in (2.18). This completes the proof of Lemma 2.2. The following lemma implies that the index theory comes in to cancel part of the poles of the functions F and F V . Lemma 2.3. If T X and V are spin, then all of the F and F V above are holomorphic in (t, τ ) for (t, τ ) ∈ R × H. Proof. Let z = e2πit and l = dim M. We consider these F and F V as meromorphic functions of two complex variables (z, q) with values in H ∗ (B). (i) Let N = max α,γ |mγ |. Denote by DN ⊂ C2 the domain (2.23)
|q|1/N < |z| < |q|−1/N ,
0 < |q| < 1.
Let fα be the contribution of the component M α in the functions F ’s and c(q)k−l F V ’s.
466
LIU AND MA
Then in DN , by (2.4) and (2.7), it is easy to see that fα has expansions of the form −l d(mγ ) ∞ (2.24) "n=0 bα,n (z)q n , q −a/8 .γ zmγ − 1 ∞ b n where a is an integer, hα (z, q) = "n=1 α,n (z)q is a holomorphic function of (z, q) ∈ DN , and bα,n (z) are polynomial functions of z. So as meromorphic functions, these F and c(q)k−l F V have expansions of the form
(2.25)
∞ q −a/8 "n=0 bn (z)q n
with bn (z) rational function of z, which can only have poles on the unit circle |z| = 1 ⊂ DN . Now if we multiply these F and c(q)k−l F V by l d(mγ ) (2.26) , f (z) = .α .γ 1 − zmγ we get holomorphic functions that have convergent power series expansions of the form (2.27)
∞ q −a/8 "n=0 cn (z)q n
with {cn (z)} polynomial functions of z in DN . By comparing the above two expansions, we get for n ∈ N, (2.28)
cn (z) = f (z)bn (z).
(ii) On the other hand, we can expand the Witten element @ into formal power ∞ R q n with R ∈ K(M). So for t ∈ [0, 1] \ Q, z = e2πit , we series of the form "n=0 n n get a formal power series of q for these F and c(q)k−l F V : ∞ q −a/8 "n=0 (2.29) chz Ind D X ⊗ Rn q n with a ∈ Z. By (1.6), we know that N(n) chz Ind D X ⊗ Rn = "m=−N(n) am,n zm , (2.30) with am,n ∈ H ∗ (B), and N (n) being some positive integer depending on n. By comparing (2.7), (2.25), and (2.30), we get for t ∈ [0, 1] \ Q, z = e2πit , (2.31)
N(n)
bn (z) = "m=−N(n) am,n zm .
Since both sides are analytic functions of z, this equality holds for any z ∈ C. By (2.28), (2.31), and the Weierstrass preparation theorem, we deduce that (2.32)
∞ q −a/8 "n=0 bn (z)q n =
1 ∞ q −a/8 "n=1 cn (z)q n f (z)
is holomorphic in DN . Obviously, R × H lies inside this domain. The proof of Lemma 2.3 is complete.
ON FAMILY RIGIDITY THEOREMS, I
467
Proof of the family rigidity theorem for spin manifolds. We prove that these F and F V are holomorphics on C × H, which implies the rigidity theorem we want to prove. We denote by F one of the functions: F , F V , Eτ F , and Eτ F V . From their expressions, we know the possible polar divisors of F in C × H are of the form t = (n/ l)(cτ + d) with n, c, d, l intergers and (c, d) = 1 or c = 1 and d = 0. can dWe always find intergers a, b such that ad − bc = 1. Then the matrix g = −b ∈ SL (Z) induces an action 2 −c a (2.33)
F g(t, τ ) = F
t dτ − b , . −cτ + a −cτ + a
Now, if t = (n/ l)(cτ + d) is a polar divisor of F (t, τ ), then one polar divisor of F (g(t, τ )) is given by (2.34)
t n dτ − b = c +d , −cτ + a l −cτ + a
which gives exactly t = n/ l. But by Lemma 2.2, we know that up to some constant, F (g(t, τ )) is still one of these F , F V , Eτ F , and Eτ F V . This contradicts Lemma 2.3; therefore, this completes the proof of Theorem 2.1. 2.3. A conjecture. Motivated by the family rigidity theorem, Theorem 2.1, we and Zhang would like to make the following conjecture. Conjecture 2.1. The operators considered in Theorem 2.1 are rigid on the equivariant K-theory level. This means that as elements in KG (B), the equivariant index bundles of those elliptic operators actually lie in K(B). Note that this conjecture is more refined than Theorem 2.1, since the equivariant Chern character map is not an isomorphism. In [Z], Zhang proved this for the canonical Spinc -Dirac operator on almost complex manifolds. 3. Jacobi forms and vanishing theorems. In this section, we generalize the rigidity theorems in the previous section to the nonzero anomaly case, from which we derive a family of holomorphic Jacobi forms. As corollaries, we get many family vanishing theorems, especially a family U-vanishing theorem for loop space. This section generalizes some results of [Liu3, §3] to the family case. This section is organized as follows: In Section 3.1, we state the generalization of the rigidity theorems to the nonzero anomaly case. In Section 3.2, we prove this result. In Section 3.3, as corollaries, we derive several family vanishing theorems. We keep the notation of Section 2.
468
LIU AND MA
3.1. Nonzero anomaly. Recall that the equivariant cohomology group HS∗1 (M, Z) of M is defined by HS∗1 (M, Z) = H ∗ M ×S 1 ES 1 , Z , (3.1) where ES 1 is the usual universal S 1 -principal bundle over the classifying space of S 1 . So HS∗1 (M, Z) is a module over H ∗ (BS 1 , Z) induced by the projection π : M ×S 1 ES 1 → BS 1 . Let p1 (V )S 1 , p1 (T X)S 1 ∈ HS∗1 (M, Z) be the equivariant first Pontrjagin classes of V and T X, respectively. Also recall that H ∗ BS 1 , Z = Z[[u]] (3.2) with u being a generator of degree 2. In this section, we suppose that there exists some integer n ∈ Z such that (3.3)
p1 (V )S 1 − p1 (T X)S 1 = n · π ∗ u2 .
As in [Liu3], we call n the anomaly to rigidity. Following [Liu4], we introduce the following elements in K(M)[[q 1/2 ]]: ∞ @q (T X | V )v = ⊗∞ n=1 5q n (V − dim V ) ⊗m=1 Sq m (T X − dim X),
(3.4)
∞ @q (T X | V )v = ⊗∞ n=1 5−q n−1/2 (V − dim V ) ⊗m=1 Sq m (T X − dim X), ∞ @−q (T X | V )v = ⊗∞ n=1 5q n−1/2 (V − dim V ) ⊗m=1 Sq m (T X − dim X), ∞ @∗q (T X | V )v = ⊗∞ n=1 5−q n (V − dim V ) ⊗m=1 Sq m (T X − dim X).
For g = e2πit , q = e2πiτ , with (t, τ ) ∈ R × H, we denote the equivariant Chern character of the index bundle of D X ⊗ (V ) ⊗ @q (T X | V )v , D X ⊗ @q (T X | V )v , D X ⊗@−q (T X | V )v , and D X ⊗(+ (V )−− (V ))⊗@∗q (T X | V )v by 2l FdVs ,v (t, τ ), V (t, τ ), F V l V FD,v −D,v (t, τ ), and (−1) FD ∗ ,v (t, τ ), respectively. Similarly, we denote by H (t, τ ) the equivariant Chern character of the index bundle of D X ⊗ ⊗∞ m=1 Sq m (T X − dim X). Later we consider these functions as the extensions of these functions from the unit circle with variable e2πit to the complex plane with values in H ∗ (B). For α a differential form on B, we denote by {α}(p) the degree-p component of α. The purpose of this section is to prove the following theorem, which generalizes the family rigidity theorems to the nonzero anomaly case. Theorem 3.1. Assume p1 (V )S 1 − p1 (T X)S 1 = n · π ∗ u2 with n ∈ Z. Then for V (t, τ )}(2p) , {F V (2p) are holomorpic Jacobi p ∈ N, {FdVs ,v (t, τ )}(2p) , {FD,v −D,v (t, τ )} 2 forms of index n/2 and weight k +p over (2Z) F with F equal to F0 (2), F 0 (2), Fθ , respectively, and {FDV ∗ ,v (t, τ )}(2p) is a holomorphic Jacobi form of index n/2 and weight k − l + p over (2Z)2 SL2 (Z).
ON FAMILY RIGIDITY THEOREMS, I
469
See Section 3.2 for the definitions of these modular subgroups, F0 (2), F 0 (2), and Fθ . 3.2. Proof of Theorem 3.1. Recall that a (meromorphic) Jacobi form of index m and weight l over L F, where L is an integral lattice in the complex plane C preserved by the modular subgroup F ⊂ SL2 (Z), is a (meromorphic) function F (t, τ ) on C × H such that t aτ + b 2 F , = (cτ + d)l e2πim(ct /(cτ +d)) F (t, τ ), cτ + d cτ + d (3.5) F (t + λτ + µ, τ ) = e−2πim(λ τ +2λt) F (t, τ ), where (λ, µ) ∈ L and g = ac db ∈ F. If F is holomorphic on C × H, we say that F is a holomorphic Jacobi form. Now, we start to prove Theorem 3.1. Let g = e2πit ∈ S 1 be a generator of the action group. For α = 1, 2, 3, let 2
θ (0, τ ) =
(3.6)
∂ θ(t, τ )|t=0 , ∂t
θα (0, τ ) = θα (t, τ )|t=0 .
By applying Theorem 1.1, we get
(3.7)
FdVs ,v (t, τ ) = (2π )−k
θ (0, τ )k V F (t, τ ), θ1 (0, τ )l ds
V (t, τ ) = (2π )−k FD,v
θ (0, τ )k V F (t, τ ), θ2 (0, τ )l D
V F−D,v (t, τ ) = (2π )−k
θ (0, τ )k V F (t, τ ), θ3 (0, τ )l −D
FDV ∗ ,v (t, τ ) = (2π )l−k θ (0, τ )k−l FDV ∗ (t, τ ), θ (0, τ )k 2πiy −k g TX . H (t, τ ) = (2π i) "α π∗ θ(y, τ ) .γ θ(xγ + mγ t, τ )(Nγ ) Lemma 3.1. If p1 (V )S 1 − p1 (T X)S 1 = n · π ∗ u2 , we have (3.8) FdVs ,v V F−D,v
FDV ∗ ,v
1 t ,− τ τ t 1 ,− τ τ t 1 ,− τ τ
= τ k eπint
2 /τ
V Eτ FD,v (t, τ ),
= τ k eπint
2 /τ
V Eτ F−D,v (t, τ ),
= τ k−l eπint
2 /τ
Eτ FDV ∗ ,v (t, τ ),
FdVs ,v (t, τ + 1) = FdVs ,v (t, τ ), V V FD,v (t, τ + 1) = F−D,v (t, τ ),
FDV ∗ ,v (t, τ + 1) = FDV ∗ ,v (t, τ ).
470
LIU AND MA
If p1 (T X)S 1 = −n · π ∗ u2 , then 1 t 2 ,− = τ k eπint /τ Eτ H (t, τ ), (3.9) H τ τ
H (t, τ + 1) = H (t, τ ).
Proof. First recall that the condition on the first equivariant Pontrjagin classes implies the equality 2 2 "v,j ujv + nv t − "j (yj )2 + "γ ,j xγj + mγ t (3.10) = n · t 2, which gives the equalities "v n2v d(nv ) − "γ m2γ d(mγ ) = n, "v,j nv ujv = "γ ,j mγ xγj , 2 2 "v,j ujv = "j (yj )2 + "γ ,j xγj .
(3.11)
The action of T on the functions F and F V is quite easy to check, and we leave this detail to the reader. We only check the action of S. By (2.7), (2.19), (3.7), and (3.11), we have (3.12) t 1 ,− = (2π i)−k "α π∗ FdVs ,v τ τ k .v θ1 uv + nv (t/τ ), −1/τ (Vv ) θ 0, −1/τ 2πiy g TX × l θ y, −1/τ .γ θ xγ + mγ (t/τ ), −1/τ )(Nγ θ1 0, −1/τ = (2π i)−k τ k eπi(nt /τ ) "α π∗ θ (0, τ )k 2π iy g .v θ1 (τ uv + nv t, τ )(Vv ) × TX . θ1 (0, τ )l θ (τy, τ ) .γ θ(τ xγ + mγ t, τ )(Nγ ) 2
As in (2.21), by comparing the (p + kα )th homogeneous terms of the polynomials in x, y, and u on both sides, we find the following equation
(3.13)
π∗
.v θ1 (τ uv + nv t, τ )(Vv ) (2p) 2π iy T Xg θ (τy, τ ) .γ θ(τ xγ + mγ t, τ )(Nγ ) (2p) 2πiy p g .v θ1 (uv + nv t, τ )(Vv ) TX = τ π∗ . θ(y, τ ) .γ θ(xγ + mγ t, τ )(Nγ )
By (3.12) and (3.13), we get the equation (3.8) for FdVs ,v . We leave the other cases to the reader.
ON FAMILY RIGIDITY THEOREMS, I
471
Recall the three modular subgroups a b ∈ SL2 (Z) | c ≡ 0 (mod 2) , F0 (2) = c d a b 0 ∈ SL2 (Z) | b ≡ 0 (mod 2) , F (2) = (3.14) c d 0 1 1 0 a b a b (mod 2) . or ∈ SL2 (Z) | ≡ Fθ = 1 0 0 1 c d c d Lemma 3.2. If p1 (V )S 1 − p1 (T X)S 1 = n · π ∗ u2 , then for p ∈ N, {FdVs ,v (t, τ )}(2p) V (t, τ )}(2p) is a Jacobi form over (2Z)2 is a Jacobi form over (2Z)2 F0 (2); {FD,v V F 0 (2); {F−D,v (t, τ )}(2p) is a Jacobi form over (2Z)2 Fθ . If p1 (T X)S 1 = −nπ ∗ u2 , then {H (t, τ )}(2p) is a Jacobi form over (2Z)2 SL2 (Z). All of them are of index n/2 and weight k + p. The function {FDV ∗ ,v (t, τ )}(2p) is a Jacobi form of index n/2 and weight k − l + p over (2Z)2 SL2 (Z). Proof. By (2.19) and (3.7), we know that these F V and H satisfy the second equation of the definition of Jacobi forms (3.5). Recall that T and ST 2 ST generate F0 (2), and also F 0 (2) and Fθ are conjugate to F0 (2) by S and T S, respectively. By Lemma 3.1 and the above discussion, for F V and H , we easily get the first equation of (3.5). For g = ac db ∈ SL2 (Z), let us use the notation t aτ + b −l −2πimct 2 /(cτ +d) F (g(t, τ ))|m,l = (cτ + d) e F (3.15) , cτ + d cτ + d to denote the action of g on a Jacobi form F of index m and weight l. By Lemma 3.1, for any function in F ∈ {{F V }(2p) , H (2p) }, its modular transformation {F }(2p) (g(t, τ ))|n/2,k+p (or {F }(2p) (g(t, τ ))|n/2,k−l+p ) is still one of the {{F V }(2p) } and H (2p) . Similar to Lemma 2.3, we have the following lemma. Lemma 3.3. For any g ∈ SL2 (Z), let F (t, τ ) be one of the {F V }(2p) or H (2p) . Then F (g(t, τ ))|n/2,k+p is holomorphic in (t, τ ) for t ∈ R and τ ∈ H. As in Lemma 2.3, this is the place where the index theory comes in to cancel part of the poles of these functions. Of course, to use the index theory, we must use the spin conditions on T X and V . Now we recall the following result [Liu3, Lemma 3.4]. Lemma 3.4. For a (meromorphic) Jacobi form F (t, τ ) of index m and weight k over L F, assume that F may only have polar divisors of the form t = (cτ + d)/ l in C×H for some integers c, d and l = 0. If F (g(t, τ ))|m,k is holomorphic for t ∈ R, τ ∈ H for every g ∈ SL2 (Z), then F (t, τ ) is holomorphic for any t ∈ C and τ ∈ H.
472
LIU AND MA
Proof of Theorem 3.1. By Lemmas 3.1, 3.2, and 3.3, we know that the {F V }(2p) and H (2p) satisfy the assumptions of Lemma 3.4. In fact, all of their possible polar divisors are of the form l = (cτ + d)/m where c, d are integers and m is one of the exponents {mj }. The proof of Theorem 3.1 is complete. 3.3. Family vanishing theorems for loop space. The following lemma is established in [EZ, Theorem 1.2]. Lemma 3.5. Let F be a holomorphic Jacobi form of index m and weight k. Then for fixed τ , F (t, τ ), if not identically zero, has exactly 2m zeros in any fundamental domain for the action of the lattice on C. This tells us that there are no holomorphic Jacobi forms of negative index. Therefore, if m < 0, F must be identically zero. If m = 0, it is easy to see that F must be independent of t. Combining Lemma 3.5 with Theorem 3.1, we have the following result. Corollary 3.1. Let M, B, V , and n be as in Theorem 3.1. If n = 0, the equivariant Chern characters of the index bundle of the elliptic operators in Theorem 3.1 are independent of g ∈ S 1 . If n < 0, then these equivariant Chern characters are identically zero; in particular, the Chern character of the index bundle of these elliptic operators is zero. Another quite interesting consequence of the above discussions is the following family U-vanishing theorem for loop space. Theorem 3.2. Assume that M is connected and the S 1 -action is nontrivial. If p1 (T X)S 1 = n · π ∗ u2 for some integer n, then the equivariant Chern character of the index bundle, especially the Chern character of the index bundle, of the elliptic m operator D X ⊗ ⊗∞ m=1 Sq (T X − dim X) is identically zero. Proof. In fact, by (3.11), we know that (3.16)
"j m2j d(mj ) = n.
So the case n < 0 can never happen. If n = 0, then all the exponents {mj } are zero, so the S 1 -action cannot have a fixed point. By (2.7) and (3.7), we know that H (t, τ ) is zero. For n > 0, we can apply Lemmas 3.1, 3.4, and 3.5 to get the result. m As remarked in [Liu3], the fact that the index of D X ⊗ ⊗∞ m=1 Sq (T X − dim X) is U-vanishing theorem of zero may be viewed as a loop space analogue of the famous Atiyah and Hirzebruch [AH] for compact connected spin manifolds with nontrivial S 1 -action. The reason is that this operator corresponds to the Dirac operator on loop space LX, while the condition on p1 (T X)S 1 is a condition for the existence of an equivariant spin structure on LX. This property is one of the most interesting and surprising properties of loop space. Now, under the condition of Theorem 3.2, a very
ON FAMILY RIGIDITY THEOREMS, I
473
interesting question is to know when the index bundle of this elliptic operator is zero in K(B). References [A] [ABoP] [AH] [ASe] [AS1] [AS2] [BeGeV] [B1] [B2] [BT] [Ch] [EZ] [G] [H] [K] [L1]
[L2] [LaM] [Liu1] [Liu2] [Liu3] [Liu4] [O]
[S]
M. F. Atiyah, Collected Works, Oxford Sci. Publ., Oxford Univ. Press, New York, 1988. M. F. Atiyah, R. Bott, and V. K. Patodi, On the heat equation and the index theorem, Invent. Math. 19 (1973), 279–330. M. F. Atiyah and F. Hirzebruch, “Spin manifolds and group actions” in Collected Works, Vol. 3, Oxford Sci. Publ., Oxford Univ. Press, New York, 1988, 417–429. M. F. Atiyah and G. Segal, The index of elliptic operators, II, Ann. of Math. (2) 87 (1968), 531–545. M. F. Atiyah and I. M. Singer, The index of elliptic operators, I, Ann. of Math. (2) 87 (1968), 484–530. , The index of elliptic operators, IV, Ann. of Math. (2) 93 (1971), 119–138. N. Berline, E. Getzler, and M. Vergne, Heat Kernels and Dirac Operators, Grundlehren Math. Wiss. 298, Springer, Berlin, 1992. J. -M. Bismut, The Atiyah-Singer index theorem for families of Dirac operators: Two heat equation proofs, Invent. Math. 83 (1985), 91–151. , Equivariant immersions and Quillen metrics, J. Differential Geom. 41 (1995), 53–157. R. Bott and C. Taubes, On the rigidity theorems of Witten, J. Amer. Math. Soc. 2 (1989), 137–186. K. Chandrasekharan, Elliptic Functions, Grundlehren Math. Wiss. 281, Springer, Berlin, 1985. M. Eichler and D. Zagier, The Theory of Jacobi Forms, Progr. Math. 55, Birkhäuser, Boston, 1985. E. Getzler, A short proof of the local Atiyah-Singer index theorem, Topology 25 (1986), 111–117. F. Hirzebruch, T. Berger, and R. Jung, Manifolds and Modular Forms, Aspects Math. E20, Friedr. Vieweg and Sohn, Braunschweig, 1992. I. Krichever, Generalized elliptic genera and Baker-Akhiezer functions, Math. Notes 47 (1990), 132–142. P. S. Landweber, “Elliptic cohomology and modular forms” in Elliptic Curves and Modular Forms in Algebraic Topology (Princeton, 1986), Lecture Notes in Math. 1326, Springer, Berlin, 1988, 55–68. , ed., Elliptic Curves and Modular Forms in Algebraic Topology (Princeton, 1986), Lecture Notes in Math. 1326, Springer, Berlin, 1988. H. B. Lawson and M.-L. Michelsohn, Spin Geometry, Princeton Math. Ser. 38, Princeton Univ. Press, Princeton, 1989. K. Liu, On SL2 (Z) and topology, Math. Res. Lett. 1 (1994), 53–64. , Modular invariance and characteristic numbers, Comm. Math. Phys. 174 (1995), 29–42. , On modular invariance and rigidity theorems, J. Differential Geom. 41 (1995), 343–396. , On elliptic genera and theta-functions, Topology 35 (1996), 617–640. S. Ochanine, “Genres elliptiques équivariants” in Elliptic Curves and Modular Forms in Algebraic Topology (Princeton, 1986), Lecture Notes in Math. 1326, Springer, Berlin, 1988, 107–122. G. Segal, Equivariant K-theory, Inst. Hautes Études Sci. Publ. Math. 34 (1968), 129–151.
474 [T] [W]
[Z]
LIU AND MA C. Taubes, S 1 -actions and elliptic genera, Comm. Math. Phys. 122 (1989), 455–526. E. Witten, “The index of the Dirac operator in loop space” in Elliptic Curves and Modular Forms in Algebraic Topology (Princeton, 1986), Lecture Notes in Math. 1326, Springer, Berlin, 1988, 161–181. W. Zhang, Symplectic reduction and family quantization, preprint.
Liu: Department of Mathematics, Stanford University, Stanford, California, 94305, USA;
[email protected] Ma: Institut für Mathematik, Humboldt-Universität zu Berlin, Unter den Linden 6, D-10099 Berlin, Germany;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
EXPONENTIAL DECAY IN THE FREQUENCY OF ANALYTIC RANKS OF AUTOMORPHIC L-FUNCTIONS D. R. HEATH-BROWN and P. MICHEL
This note should be seen as an appendum to the work of E. Kowalski and the second author [KM1] and deals with the problem of bounding, unconditionally, the order of vanishing at the critical point in a family of L-functions. This problem is illustrated in the particular case of L-functions of weight-2 primitive modular forms of prime level. Recall the notation from [KM1]: For q a prime number, let S2 (q)∗ be the set of primitive forms of weight 2 and level q, normalized so that their first Fourier coefficient is 1; for f (z) = λf (n)n1/2 e(nz) ∈ S2 (q)∗ , λf (1) = 1, n≥1
let L(f, s) :=
λf (n) n≥1
ns
,
the associated (normalized) L-function; it admits analytic continuation to C with a functional equation relating L(f, s) to L(f, 1−s), and we call rf := ords=1/2 L(f, s) the analytic rank of f . In [KM1] the following was proved: There exists an absolute constant C1 ≥ 0 such that, for all q prime, f ∈S2
rf ≤ C1 |S2 (q)∗ |.
(q)∗
After that, much progress has been made concerning this question. In particular, in [KMV], a sharp explicit value was given for the constant C1 (C1 = 1.1891 for q large enough). In the course of the proof, a uniform bound for the square of the ranks was obtained: rf2 ≤ C2 |S2 (q)∗ |. f ∈S2 (q)∗
However, the latter improvement used only a slight variant of the methods of [KM1]. In fact, it is possible to pursue this idea further and it turns out that much more is true; this is the subject of the present note. Received 10 August 1999. 1991 Mathematics Subject Classification. Primary 11F66; Secondary 11G40, 11M99. 475
476
HEATH-BROWN AND MICHEL
Theorem 0.1. There exist absolute constants A, B > 0 such that, for all q prime, eArf ≤ B|S2 (q)∗ |. f ∈S2 (q)∗
Remark. It is possible to prove a bit more than this: Namely, one may replace rf by the number of zeros (counted with their multiplicity) in a small circle of radius ∼ 1/ log q centred at 1/2. As a corollary we can look at the proportion of forms f with analytic rank larger than a given r. This proportion decays exponentially with r. Corollary 0.2. There exist absolute constants A, B > 0 such that, for all r ≥ 0, |{f ∈ S2 (q)∗ , rf ≥ r}| ≤ Be−Ar . |S2 (q)∗ | With more work, the above constants A and B could be computed explicitly and optimized (see, e.g., [KM2] for the basic material necessary to do this). We may also remark that these results are very similar to the ones obtained by the first author and Fouvry concerning the distribution of the (arithmetic) rank in explicit families of twists of elliptic curves (see [HB1], [HB2], [F]). Surprisingly, Theorem 0.1 is based on a very general principle and one can also obtain similar exponential upper bounds for the analytic ranks of a wide class of families of L-functions, including the most basic example of the Dirichlet character L-functions. By refining slightly the work of Soundararajan [So], one can also treat the family of L-functions attached to quadratic characters. The general principle is that, as soon as one is able to control the mollified square of a family of L-functions (slightly to the right of the critical line), one obtains an exponential bound for the average of the analytic ranks. As there is no real loss and no greater difficulty in axiomatizing this principle, we do so, and deduce Theorem 0.1 as a special case. Consider a finite probability space (, µ), where µ(ω) > 0 for every ω ∈ . For each ω ∈ , suppose we are given a function hω (s) that is holomorphic in the halfplane Re(s) ≥ 0. Moreover, assume the following hypothesis on the variance of the function hω (s) − 1. Hypothesis 0.3. For some B, C > 0, M > 2, we have the bound |hω (σ + it) − 1|2 µ(ω) ≤ C(1 + |t|)B M −σ ω∈
uniformly for σ ≥ (2 log M)−1 . Note that, in view of this hypothesis and the fact that µ(ω) > 0, we have hω (s) = 1 + Oω (1 + |t|)B/2 M −σ/2
(1)
EXPONENTIAL DECAY OF ANALYTIC RANKS OF L-FUNCTIONS
477
for each hω . Thus, hω (s) is nonvanishing for sufficiently large σ . For any α ≥ 0 and t1 < t2 ∈ R, we may therefore define N(ω, α, t1 , t2 ) to be the number of zeros ρ of hω (s) such that Re(ρ) ≥ α, t1 ≤ Im(ρ) ≤ t2 . Clearly, N (ω, α, t1 , t2 ) is finite. Our general result gives an upper bound for the 2kth power of N (ω, α, t1 , t2 ) on average. Theorem 0.4. With the above notation, assume that Hypothesis 0.3 is satisfied. Then for all k ≥ 1, for all α ≥ (log M)−1 , and all t1 < t2 , we have ω∈
N (ω, α, t1 , t2 )2k µ(ω)
C(k!)2 48
k α log M
2k 1 + |t| +
16k log M
B
M −α/2 1 + (t2 − t1 ) log M ,
where we have set |t| := Max(|t1 |, |t2 |). The constant involved in the Vinogradov symbol is absolute. Remark. We have not tried to optimize the various explicit constants present in this bound, nor the exponent 1/2 in M −α/2 . In the next section, we prove this general result. Then, in the following one, we explain how Theorem 0.1 can be deduced from it. The necessary Hypothesis 0.3 in our setting is provided by [KM1, Proposition 4]. Acknowledgment. We would like to thank E. Kowalski and P. Sarnak for encouraging us to publish this note. This work began while the second author visited the Oxford Mathematical Institute. He wishes to thank the institute for its hospitality and support. He is also supported by a membership at the Institut Universitaire de France. 1. Proof of Theorem 0.4. Obviously, we may assume t2 −t1 = 1/ log M. We start, as in [KM1], by recalling the following lemma of Selberg [S]. Lemma 1.1 (Selberg, [S, Lemma 14]). Let h be a function holomorphic in the region s ∈ C | Re(s) ≥ α , t1 ≤ Im(s) ≤ t2 and satisfying
π h(s) = 1 + o exp − Re(s) t2 − t1
(2)
in this region, uniformly as Re(s) → +∞. Denoting the zeros of f in the interior of the region by ρ = β + iγ , we have
478
HEATH-BROWN AND MICHEL
γ −t β − α 2 t2 − t1 sin π 1 sinh π t2 − t1 t2 − t1 ρ
t −t sin π 1 log |h(α + it)| dt t2 − t 1 t1 +∞ σ − α sinh π log h σ + it1 + log h σ + it2 dσ + t2 − t 1 α
=
t2
(where the zeros are counted according to multiplicity). We apply this lemma to each hω (s). We take α = α/2, and we also take = t2 + (' − 1)/2 log M, t1 = t1 − (' − 1)/2 log M, where ' = 16k. Thus t2 − t1 = '/ log M. Note that ' > 2π , so that the condition of Lemma 1.1 is satisfied by (1). It therefore follows by positivity that
t2
α log M t2 − t1 N (ω, α, t1 , t2 ) ' πα log M π(' − 1) ≤ 2 t2 − t1 N (ω, α, t1 , t2 ) sin sinh 2' 2' t 2 t −t ≤ sin π 1 log |h(α/2 + it)| dt t2 − t 1 t1 +∞ σ − α/2 h σ + it dσ. + log h σ + it sinh π + log 1 2 t2 − t1 α/2 The reader should observe that the function log |h(s)| might take large negative values. Since we wish to apply Hölder’s inequality, we first eliminate these large values by using the inequality log |1 + x| ≤ log(1 + |x|). This yields α log M t2 − t1 N (ω, α, t1 , t2 ) ' t 2 log 1 + |h(α/2 + it) − 1| dt ≤ t1
+
σ − α/2 exp π log 1+ h σ + it1 − 1 + log 1+ h σ + it2 − 1 dσ t2 − t1
+∞ α/2
:= I0 (ω) + I1 (ω) + I2 (ω), say. Next we take the 2kth power of this inequality, getting 2k α log M t2 − t1 N (ω, α, t1 , t2 ) ≤ 32k−1 I0 (ω)2k + I1 (ω)2k + I2 (ω)2k . '
(3)
We only treat I1 (ω)2k , the other terms being similar. By Hölder’s inequality we have
EXPONENTIAL DECAY OF ANALYTIC RANKS OF L-FUNCTIONS
2k−1 2k σ − α/2 dσ ≤ exp −π 2k − 1 t2 − t1 α/2 +∞ 2k σ − α/2 × exp 4kπ log 1 + h σ + it1 − 1 dσ. t2 − t 1 α/2
I1 (ω)
2k
479
+∞
The first factor is bounded by (t2 − t1 )2k−1 . For the second term we use the fact that log(1 + |x|)k ≤ k!|x|, which follows from the inequality y k ≤ k!(ey −1). This allows us to bound the second integral by +∞ 2 σ − α/2 (k!)2 h σ + it1 − 1 dσ. exp 4kπ t2 − t 1 α/2 Since 4π/16 < 1, the above integral is convergent by (1). Next, averaging over ω and using Hypothesis 0.3 (since α/2 ≥ (2 log M)−1 ), we obtain that 16k B −α/2 C 2k 2 2k I1 (ω) µ(ω) ≤ (k!) t2 − t1 1 + |t| + M . '(1 − π/4) log M ω∈
This finishes the bound for ω∈ I1 (ω)2k µ(ω). We may bound the terms involving I0 (ω) and I2 (ω) similarly, thereby completing the proof of Theorem 0.4. 2. Application to zeros of L-functions. Next we turn to the proof of Theorem 0.1, in which we combine the preceding result with the method of Riemann-Weil-Mestre explicit formulae (see [KM1]). Fix a smooth, even, nonnegative test function φ, compactly supported in [−1, 1], such that ˆ Re φ(s) ≥ 0, for | Re(s)| ≤ 1, where ˆ φ(s) =
φ(x)esx dx R
ˆ denotes the Laplace transform of φ. We assume that φ is normalized so that φ(0) = 1. Let λ > 1/1000 be a parameter to be fixed later. For f ∈ S2 (q)∗ we have (see [KM1]) the upper bound λf (p) log p log p λrf ≤ φ(0) log q − 2 φ λ p 1/2 p 1 (4) ˆ + Oφ (λ) Re φ λ ρ − − 2λ 2 Re(ρ−(1/2))≥1/λ
:= φ(0) log q − 2S(f ) − 2λ-(f ) + Oφ (λ),
480
HEATH-BROWN AND MICHEL
say, where p runs over primes and ρ runs over the zeros of L(f, s) of real part greater than 1/2 + 1/λ. We aim to show that there is an absolute constant C such that rf2k ≤ (Ck)2k |S2 (q)∗ | (5) f
for every integer k ≥ 1. Then, if the constant A > 0 is chosen so that the series (ACk)2k k≥0
is convergent, we have
(2k)!
:= B
exp(Arf ) ≤ B|S2 (q)∗ |
f
as desired. By Stirling’s formula, we see that any A < (eC)−1 is admissible. We first make a trivial reduction. If one takes λ = 1 and chooses φ appropriately in (4), one finds that rf ≤ 2 log q for q large enough. Thus in order to prove (5) we may suppose that k ≤ log q. We now take λ to be of the form λ = θ log q/k, where θ > 1/1000 is an absolute constant to be determined later. It now remains to obtain bounds for S(f )2k and -(f )2k . f
f
2.1. The bound for f S(f )2k . In order to stay at an elementary level as much as possible, we first consider the “harmonic” average, and we prove the bound f
1 S(f )2k (Cλ)2k 4π(f, f )
(6)
for some absolute C and any integer k ≥ 1. Here (∗, ∗) is the Petersson inner product on the space of weight-2 forms for /0 (q). We may then reduce to the classical average by means of the Cauchy-Schwarz inequality 1/2 1/2 1 S(f )4k (Cλ)2k |S2 (q)∗ | S(f )2k ≤ 4π(f, f ) 4π(f, f ) f
f
f
(7) using the bound
4π(f, f ) |S2 (q)∗ |,
f
for which see [M], for example. The advantage of using harmonic averaging is that
EXPONENTIAL DECAY OF ANALYTIC RANKS OF L-FUNCTIONS
481
we can apply Petersson’s trace formula, which we state in the following simple form (see, e.g., [KM1]). Lemma 2.1. For m ≥ 1, we have
1 λf (m) = δm,1 + O m1/2 q −3/2 . 4π(f, f ) ∗
(8)
f ∈S2 (q)
Expanding S(f )2k , we see that all we need is to bound a linear combination of terms of the form
···
p1 p2
pj
f
j λf (pi ) log pi φ(log pi /λ) αi 1 . 1/2 4π(f, f ) p i=1
i
Here the pi run over distinct primes, and α1 , α2 , . . . , αj are fixed positive integers such that α1 + · · · + αj = 2k. In order to apply (8) we need to rewrite λf (p)α in the form λf (p)α = a0,α + a1,α λf (p) + · · · + aα,α λf (p α ), which may be done using the identity (2 cos θ )α = a0,α + a1,α 2 cos θ + · · · + aα,α 2 cos αθ. Note in particular that the coefficients ai,α are nonnegative and that a0,α = 0 if α is odd. Expanding the various λf (pi )αi and applying (8), we see that we get a diagonal contribution (coming from the δm,1 term in (8)) if and only if all the αi are even. This contribution is given by p1 p2
···
j pj i=1
a0,αi
≤ 22k
log pi φ(log pi /λ)
αi
1/2
pi p1 p2
···
j log pi φ(log pi /λ) αi pj i=1
1/2
pi
.
We may now recombine the various sums to show that the diagonal contribution to the main sum f (1/4π(f, f ))S(f )2k is bounded by k φ(log p/λ)2 log2 p ≤ (Cλ)2k , 22k p λ
p≤e
for some constant C depending only on our choice of φ.
482
HEATH-BROWN AND MICHEL
Similarly, the total contribution coming from the error term in Petersson’s formula is found to be 2k
q −3/2 φ(log p/λ) log p C 2k e2kλ q −3/2 C 2k q 2θ−3/2 . p≤eλ
Hence, as long as θ < 3/4, we may conclude that f
1 S(f )2k (Cλ)2k , 4π(f, f )
for some absolute constant C. This completes the proof of (6). Remark. It is also possible to prove (6) directly by means of the trace formula for Hecke operators. 2.2. The bound for f -(f )2k . We begin by noting that repeated integration by parts produces an estimate 1
ˆ φ(s)
l
Re(s) l e
1 + | Im(s)|
for any integer l ≥ 1. If we split the strip {1/λ ≤ Re(ρ − 1/2) ≤ 1} into rectangles of width 1/λ, of the form 1 m m+1 1/λ ≤ Re ρ − ≤ 1, ≤ Im(s) < , 2 λ λ and observe that (for β ≥ 1/λ) eλ(β−1/2) λ
β−(1/2)
eλα dα,
1/2λ
we find that -(f ) l λ
m≥0
where
1 Im , (1 + m)l
m m + 1 λα e dα. n f, α, , Im = λ λ 1/2λ
1/2
Here we define n(f, α, t1 , t2 ) to be the number of zeros of L(f, 1/2 + s) in the region Re(s) ≥ α, t1 ≤ Im(s) ≤ t2 . We apply Hölder’s inequality to Im , whence
EXPONENTIAL DECAY OF ANALYTIC RANKS OF L-FUNCTIONS
Im2k
≤
1/2
e
−λα
2k−1
m m+1 n f, α, , λ λ 1/2λ
dα
1/2λ
1/2
2k e
(4k−1)λα
483
dα
≤ λ1−2k J (f, m, k), where
m m + 1 2k 4kλα n f, α, , e dα. λ λ 1/2λ
J (f, m, k) =
1/2
Thus, -(f )2k ≤ Clk λ
m≥0
2k−1 1 (1 + m)2
m≥0
1 J (f, m, k), (1 + m)2kl−(4k−2)
for some Cl depending only on l. We are now ready to apply Theorem 0.4. We take our space to be the set S2 (q)∗ , equipped with the uniform probability measure (µ(f ) = 1/|S2 (q)∗ |). The function hf (s) has the form hf (s) = L(f, 1/2 + s)M(f, 1/2 + s), where M(f, s) is a suitable “mollifier.” Such functions were constructed in [KM1], so that Hypothesis 0.3 is satisfied for M = q 1/5 (say) and for some absolute constants B, C (see [KM1, Proposition 4]). We observe that n(f, α, t1 , t2 ) ≤ N(f, α, t1 , t2 ), in the notation of Theorem 0.4. Thus, on recalling that k −1 log q λ k −1 log q and that M = q 1/5 , we deduce that |S2 (q)∗ |−1
J (f, m, k) C k (k!)2 (1 + m)B
1/2
q −α/10 e4kλα dα,
1/2λ
f
for some constant C. We may now choose l = B + 2, say, and λ = (40k)−1 log q, to obtain J (f, m, k) C k (k!)2 λ−1 |S2 (q)∗ | f
for some absolute constant C. It then follows that -(f )2k C k (k!)2 |S2 (q)∗ |,
(9)
f
with a new constant C. Finally, combining (7), (9), and (4), we obtain (5). References [F]
É. Fouvry, “Sur le comportement en moyenne du rang des courbes y 2 = x 3 + k” in Séminaire de Théorie des Nombres (Paris, 1990–91), Progr. Math. 108, Birkhäuser, Boston, 1993, 61–84.
484 [HB1] [HB2] [KM1] [KM2] [KMV]
[M] [S] [So]
HEATH-BROWN AND MICHEL D. R. Heath-Brown, The size of Selmer groups for the congruent number problem, Invent. Math. 111 (1993), 171–195. , The size of Selmer groups for the congruent number problem, II, Invent. Math. 118 (1994), 331–370. E. Kowalski and P. Michel, The analytic rank of J0 (q) and zeros of automorphic Lfunctions, Duke Math. J. 100 (1999), 503–542. , An explicit upper bound for the rank of J0 (q), to appear in Israel J. Math. E. Kowalski, P. Michel, and J. M. VanderKam, Non-vanishing of high derivatives of automorphic L-functions at the center of the critical strip, to appear in J. Reine Angew. Math. M. Ram Murty, “The analytic rank of J0 (N)(Q)” in Number Theory (Halifax, N.S., 1994), CMS Conf. Proc. 15, Amer. Math. Soc., Providence, 1995, 263–277. A. Selberg, Contributions to the theory of Dirichlet’s L-functions, Skr. Norske Vid.-Akad. Oslo I Mat.-Nat. Kl. 1946, no. 3. K. Soundararajan, Non-vanishing of quadratic Dirichlet L-functions at s = 1/2, preprint, 1999.
Heath-Brown: Mathematical Institute, 24-29 Saint Giles’, Oxford, OX1 3LB, England;
[email protected] Michel: Institut Universitaire de France et Université Montpellier II, CC 051, 34095 Montpellier Cedex 05, France;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES AND HAMILTONIAN DYNAMICS DANIEL GATIEN and FRANÇOIS LALONDE §1. Introduction. Given a closed Lagrangian submanifold L of a symplectic manifold (M, ω), we consider the moduli space of J -holomorphic cylinders u with boundary in L which belong to a given free homotopy class α ∈ [C, ∂C; M, L]. Actually, we are interested in the cases (M, L, α) for which the moduli space of Fredholm-regular unparametrised J -holomorphic cylinders reduces to an isolated family containing an odd number of elements (a single one, for instance). If, say, the manifold contains no holomorphic spheres and if the two boundary components of the cylinder are mapped to two disjoint components L0 , L1 of L, then an argument similar to the one used by Hofer and Viterbo in [HV2] (or by Gromov in [G]) shows the existence of a periodic orbit of any Hamiltonian that separates L0 from L1 (i.e., reaches on L1 a minimum value larger than its maximum value on L0 ), at least when the growth at infinity of H is controlled (in case the manifold M is noncompact) and when holomorphic discs with boundary in L can be avoided. The idea is to perturb the equation du + J ◦ du ◦ j = 0, where j is any of the conformal structures of the cylinder and where J belongs to the set (M, ω) of ω-compatible almost complex structures on M, by introducing a new term that depends on the given Hamiltonian H . The result then follows from a detailed study of the solutions of the perturbed equation that are in the class α. Essentially, this result tells us that the difference minL1 H − maxL0 H (or, more generally, 1 0 (minL1 Ht −maxL0 Ht ) dt in the nonautonomous case) is a kind of capacity whose positivity ensures the existence of periodic orbits. Note that, because the cylinder C = [0, 1] × S 1 is parallelizable, the CauchyRiemann equation for maps u : C → M has a direct interpretation in terms of vector fields on C, and perturbing this equation to introduce the Hamiltonian term is straightforward. Second, because the group of biholomorphisms of C, for any choice of complex structure, is just the compact space S 1 , the passage from solutions of the Cauchy-Riemann equation to unparametrised J -curves is simple. As we will see, an important property of the perturbed equation is naturally expressed in terms of the Received 8 April 1999. Revision received 7 July 1999. 1991 Mathematics Subject Classification. Primary 58F05; Secondary 53C15, 53C23. Lalonde’s work partially supported by Natural Sciences and Engineering Research Council of Canada grant number OGP 0092913 and by Fonds pour la Formation de Chercheurs et l’Aide à la Recherche grant number ER-1199. 485
486
GATIEN AND LALONDE
noncompactness of the space of conformal structures of C. We begin by describing this approach as it applies to cotangent bundles of manifolds V belonging to the following class. Consider a closed Riemannian manifold W with a flat metric g and the product Riemannian manifold ([0, 1] × W, dt 2 ⊕ g). Identify the ends {0} × W, {1} × W using an isometry ι of (W, g). Let p be a fixed point of ι, and assume that the geodesic γ0 = [0, 1] × {p}/ι of V = [0, 1] × W/ι is the only one in its free homotopy class. Note that this is automatically the case if ι is an involution whose fixed points are all isolated (we prove this assertion in §2.2). The unit tangent vector field along γ0 can be extended to the vector field ∂/∂t over all V . The flat metric identifies the tangent bundle of V to its cotangent bundle. Using this identification implicitly, we get two Lagrangian submanifolds, the zero section L0 and the Lagrangian submanifold L1 that is given as the graph of the vector field ∂/∂t. Because the geodesic property of γ0 in V translates into the holomorphic property of the cylinder Cγ0 = {s(dγ0 /dt)(t) : s, t ∈ [0, 1]} that γ0 generates in T ∗ V , we get a holomorphic cylinder with boundary in L0 ∪ L1 , which is unique at least in the class of vertical ones (i.e., those generated by curves in the base V ) since that geodesic is unique. It can actually be checked (see §4) that it is unique amongst all cylinders in the free homotopy class α ∈ [(C, ∂C), (T ∗ V , L0 ∪ L1 )] of Cγ0 . The first theorem of this paper states that any compactly supported Hamiltonian on T ∗ V that separates the Lagrangian submanifolds L0 and L1 must have a periodic orbit in the homotopy class of γ0 . Note that all the previous conditions about the triple (T ∗ V , L, α) are met in the cotangent of the Klein bottle (obtained by taking W = S 1 and ι as the reflection). Of course, any such cotangent arises whenever we are given a Lagrangian embedding of the manifold V in an arbitrary symplectic manifold. By Weinstein’s Lagrangian neighbourhood theorem, a neighbourhood of the embedded V is symplectomorphic to a neighbourhood of the zero section in T ∗ V . Thus, our first theorem describes a local property of a Hamiltonian defined near a Lagrangian submanifold whose topological type is similar to the manifolds V described above. Studying the ways by which solutions can escape from that Weinstein neighbourhood and using the control on the estimates obtained by taking L1 sufficiently close to L0 , we can prove a global version of the theorem. As we will see, this leads to a new condition on the existence of a Lagrangian embedding of V into Cn and to a notion of symplectic (self-) linking. The paper is essentially self-contained. This might be useful since we must work in a Fredholm setting where the conformal parameter of the cylinder plays a crucial role in some parts of the proof. Because our cotangent bundles are quotients of Cn by isometries, all our proofs ultimately rely on classical results of complex analysis such as those related to convexity or to elementary facts from the Bers-Vekua theory. Acknowledgments. The ideas presented in this paper were first developed during discussions between the second author and Leonid Polterovich. Both authors are grateful to him. We also thank Misha Bialy for useful suggestions and the referee for a careful reading of the paper, which helped clarify some aspects.
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
487
§2. Statement of the main results Theorem 2.1 (Cf. Hofer-Viterbo [HV1]; see Remark 4 below). Let W be a closed flat manifold and ι an isometric involution whose fixed points are all isolated. Let V be the flat manifold obtained by taking the quotient of R × W by the group G of isometries generated by T × ι, where T is the translation by 1 on R. Let L0 , L1 ⊂ T ∗ V be the zero section and the section corresponding to the 1-form dt, respectively. Denote by βp the free homotopy class of loops of T ∗ V that contains the loop (R × {p})/G = (R × {p})/Z, where p is any of the fixed points of ι. Consider any periodic Hamiltonian H = T ∗ V × [0, 1] → R, with period 1, satisfying 1 min Ht |L1 − max Ht |L0 dt > 0 0
and whose differential is compactly supported. Denote this quantity by h. Then, for each of the free homotopy classes of loops βp and −βp in T ∗ V , there is 0 < λ ≤ 1/ h such that the Hamiltonian flow of λHt has a closed periodic orbit (of period 1) in that class. In particular, an autonomous Hamiltonian that separates L0 from L1 and that has a compactly supported gradient must have a nonconstant closed orbit in each of these classes. Examples. The simplest example is W = {pt} and V = S 1 so that T ∗ V is the cylinder. Then the theorem simply states that any Hamiltonian that separates two circles (in the homotopy class of the S 1 -factor) and that is equal to constants C1 , C2 at infinity must have a periodic orbit in the class of the S 1 -factor. As another interesting example, let us mention the n-torus with the involution given by x → −x in the universal covering. It has 2n fixed points, and therefore one finds 2n+1 distinct periodic orbits in the cotangent T ∗ (T n × [0, 1]/ι) for any Hamiltonian satisfying the hypotheses of the theorem. One can easily classify the involutive isometries up to conjugations in the case of tori. The simplest of these examples (W = S 1 ) leads to the Klein bottle V = K 2 . Other examples include the two types, up to conjugation, of involutive isometries of W = K 2 . Remark. When the Hamiltonian is autonomous, it would be interesting to know whether the theorem could be deduced from a purely topological argument based on a Poincaré return map and a tangential version of the Lefschetz trace formula for fixed points of diffeomorphisms. Such an argument, which indeed works in the case of the geodesic flow, would have to be quite delicate. However, such an argument seems questionable when the Hamiltonian is not autonomous. Even for autonomous Hamiltonians, such an approach would likely fail in the next theorem. Theorem 2.2. Let L be a Lagrangian embedding in a symplectic manifold M (compact or not) of any of the manifolds V considered in Theorem 2.1. Suppose U is a Weinstein neighbourhood of L in M, and suppose L = Lε is a translate of L in
488
GATIEN AND LALONDE
the direction of the p1 -coordinate in U by a quantity ε. If ε is sufficiently small, then every smooth Hamiltonian H : M → R such that inf H > sup H, L
L
and satisfying an appropriate growth condition at infinity, has a periodic orbit in each of the free homotopy classes of loops βp , −βp in M. A similar statement applies as well to a periodic Hamiltonian. A sufficiently strong condition at infinity is the requirement that the support of the gradient of H be compact. However, when M = M × C for M compact, the theorem still holds if we only impose that, outside some compact K ⊂ M, H be a linear function of the form M × C → C → R. Remarks. (1) Note that we need no condition on the manifolds M or M (except the compactness of M ). (2) The number ε only depends on the size of the Weinstein neighbourhood. Choosing it small enough ensures that no other holomorphic cylinder exists in the prescribed homotopy class. As we show in the proof, it also prevents bubbling off at an interior or boundary point of the solutions. (3) The interesting cases here are M compact and M = Cn . In the first case, the only condition imposed on H is the separation property. In the second case, the theorem leads to a new obstruction on a Lagrangian embedding in Cn of any manifold of the type V described in Theorem 2.1. This is what we discuss in the next section. (4) The statement of Theorem 2.1 is closely related to the main theorem in Hofer and Viterbo’s paper [HV1]. Actually, the autonomous case of Theorem 2.1 can essentially be derived, in many cases, from their main result. However, the method of Hofer and Viterbo is quite different from the one used in this paper. Their method follows a cohomological argument on the space of loops in the cotangent bundle, the idea being to control the gap between a given Hamiltonian H and the standard Hamiltonian representing the geodesic flow. Their argument assumes that some compact level hypersurface of H encloses the zero section of T ∗ V , in addition to known results on the cohomology of the loop space which were developed in the study of closed geodesics. It is, however, unlikely that the nonautonomous case of Theorem 2.1 is a consequence of Hofer and Viterbo’s main theorem or that Theorem 2.2 is a consequence of their method because it makes essential use of the linear structure at infinity of the cotangent bundle. (Note that, even in the autonomous case, the level hypersurfaces in Theorem 2.2 might escape outside of any given Weinstein neigbourhood of the Lagrangian submanifold.) §2.1. Lagrangian embeddings in compact manifolds. In this paragraph, we show that the statement of the previous theorem is not empty in the compact case. That is to say, for each manifold V of the type described in Theorem 2.1, we give an example of a compact symplectic manifold M that contains V as a Lagrangian submanifold. View W as a quotient of Rn−1 by a discrete group GW of isometries of Rn−1 . Consider the quotient of M = Cn by the discrete group of Kähler isometries of Cn
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
489
generated by (q1 , qi , p1 , pi ) −→ (q1 + 1, −qi , p1 , −pi ), (q1 , qi , p1 , pi ) −→ q1 , ψ(qi ), p1 , pi , (q1 , qi , p1 , pi ) −→ q1 , qi , p1 , ψ(pi ) , (q1 , qi , p1 , pi ) −→ (q1 , qi , p1 + 1, pi ). Here we write qi for (q2 , . . . , qn ) and pi for (p2 , . . . , pn ), and ψ runs through a set of generators of GW . Because any cocompact group of isometries must necessarily contain translations, it is easy to check that the compactness of W implies the compactness of the above quotient. One can then check that the Lagrangian n-plane p1 = pi = 0 is sent to a manifold diffeomorphic to V .1 Remark. Let L0 , L1 ⊂ Cn be disjoint, closed Lagrangian submanifolds of Euclidean space. We say that they are symplectically linked if it is not possible to find a Hamiltonian diffeotopy of the ambient space that fixes L0 and sends L1 as far as we wish from L0 , that is to say, on either side of any real hyperplane of Cn . Let L be a Lagrangian submanifold of Cn . It is not symplectically self-linked if some disjoint C 1 -close Lagrangian copy L of L can be sent, by a Hamiltonian isotopy that avoids L, to a position as far as one wishes from its initial position (i.e., on either side of any hyperplane). More precisely, we require the existence of a small Lagrange isotopy Lt , t ∈ [0, 1], beginning at L0 = L, with Lt disjoint from L for all t > 0, such that for any t > 0, there is a Hamiltonian isotopy starting at Lt , avoiding L, and sending Lt on either side of any hyperplane. It is symplectically self-linked, otherwise. Note that the standard tori T n ⊂ Cn are not symplectically self-linked when n > 1. Indeed, the Lagrangian torus L = S(2π − ε) × S(2π) × · · · × S(2π), where S(c) is the standard circle of capacity c, can be sent to infinity while avoiding L = T n = (S(2π ))n , by translating the second factor to a position disjoint from itself and then by translating all other factors as far as one wishes. However, an immediate corollary of Theorem 2.2 is: If a Lagrangian embedding of a manifold V , as in Theorem 2.1, exists in Cn , then it must be symplectically self-linked. Indeed, L and φ(L ) cannot lie on opposite sides of a hyperplane P ⊂ Cn whenever φ is a compactly supported symplectomorphism of Cn with supp(φ) ∩ L = ∅. Suppose 1 It was claimed in [A] that K 2 admits a Lagrangian embedding into CP 2 . It was actually proved that, given a pair L, L ∼ = RP 2 of Lagrangian projective planes in CP 2 which intersect transversally in a single point, one can surgically replace a neighbourhood of the intersection point by an embedded Lagrangian annulus and thus obtain a Lagrangian surface homeomorphic to RP 2 # RP 2 = K 2 . However, the existence of such a pair L, L is not obvious. One can in fact show that for L, L , which are images of Lagrangian 3-planes in C3 under the standard quotient map ρ : C3 \ {0} → CP 2 , the above condition never holds. If L is transverse to L , then L meets L in exacly three points. Since every known example of a Lagrangian RP 2 in CP 2 is obtained in this way, the existence of a Lagrangian Klein bottle in CP 2 is still an open question.
490
GATIEN AND LALONDE
a separating hyperplane P exists for some such φ. Then P = f −1 (0) for some linear function f : Cn → R with f | L < 0 and f | φ(L ) > 0. The Hamiltonian system associated to f clearly has no periodic orbits (not even degenerate ones), and hence neither does the system associated to H = φ ∗ f = f ◦ φ. But since H satisfies the hypotheses of the above theorem, this is a contradiction. It would be interesting to generalise the results of this paper to other manifolds V (hyperbolic ones, for instance). The only delicate points to check are (1) the relation between geodesics in V and holomorphic cylinders with Lagrangian boundary conditions in T ∗ V , and (2) the regularity of the (non-null-cobordant) space of such cylinders. §2.2. Uniqueness of geodesics. Let p be a fixed point of an isometric involution ι of a flat closed manifold W whose fixed points are all isolated, and consider the geodesic γ0 = [0, 1] × {p}/ι of V = [0, 1] × W/ι. In this section, we prove the claim announced above that γ0 is the unique geodesic in its free homotopy class. A closed geodesic γ in the same homotopy class projects in W to a ι-equivariant closed geodesic, that is to say, a map α : R → W that is a geodesic in the flat manifold W and is such that ι(α(t + 1)) = α(t). Since ι is an involution, this is the same as a geodesic α : [0, 1] → W that joins x to y = ι(x) and is such that the differential of ι sends (dα/dt)(0) to (dα/dt)(1). A homotopy from γ0 to γ would then project to a homotopy made of paths αt : [0, 1] → W with ι(αt (0)) = αt (1) for all t. Note that, because ι is an involution, the union of each αt with its image by ι is a closed loop αt of W . There are two cases: either (1) α1 = α is nonconstant or (2) it is constant and therefore equal to a fixed point p of ι. In the first case, we get a homotopy between the loops α0 and α1 , which would imply that α1 is contractible. But this is a closed nonconstant geodesic of a flat manifold, a contradiction. In the second case, one can easily check that such a homotopy would imply the existence of a geodesic β in W joining the two fixed points p, p such that ι ◦ β would be homotopic to β with fixed endpoints. If β = ι ◦ β, we reach a contradiction since we get two homotopic geodesics with same endpoints, which is impossible in a flat manifold. If β = ι ◦ β, then the two geodesics agree pointwise. Thus, the entire image of β would consist of fixed points, a contradiction with the fact that they are isolated. Thus, γ0 is the only geodesic in its homotopy class, as claimed. §3. Beginning of the proof of Theorem 2.1. Let n − 1 be the dimension of W , so that V has real dimension n. Identify W with the quotient of Rn−1 by the action of a discrete cocompact group GW of isometries of Rn−1 . Then ι lifts to a GW equivariant isometric involution ι˜, which is a reflection about some affine subspace of R n−1 . Since its projection on W has only isolated fixed points, it must be a reflection through some point of Rn−1 that we may choose to be the origin. We therefore identify ι˜ with − id. Then view V as the quotient of Rn = R ⊕Rn−1 by the discrete cocompact group G whose generators are the isometries T × ι˜ = T × − id and id ×ψ, where ψ runs through a set of generators of GW and T is the translation by 1. Denoting
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
491
by qi the local coordinates of V coming from its universal covering, the standard symplectic form ω on T ∗ V is then given locally by ω = i dpi ∧dqi , where as usual pi (dqj ) = δij . Notice that dq1 is the coordinate expression of a globally defined closed 1-form on V . Viewed as a mapping from V to T ∗ V , this 1-form embeds V as a Lagrangian submanifold L1 of T ∗ V . Similarly, the (image of the) zero section V → T ∗ V is another embedded Lagrangian submanifold L0 in T ∗ V , and L0 and L1 are disjoint. (L1 is a translate of L0 in the direction of p1 .) Let γ0 be the closed curve γ0 (t) = (t, 0, . . . , 0) in L0 . Assume that H : T ∗ V × R/Z → R is a smooth Hamiltonian such that dH is compactly supported. Set 1 h= inf Ht − sup Ht dt, 0
L1
L0
which is strictly positive by hypothesis. We show that the Hamiltonian system x˙ = XλH (x) has a nonconstant 1-periodic solution with λ ≤ 1/ h and is freely homotopic to γ0 in T ∗ V . (In the autonomous case, this means that H has a periodic orbit with period T bounded above by 1/ h.) §4. Uniqueness of holomorphic cylinders. Every Riemann surface homeomorphic to [0, 1] × S 1 is biholomorphic to exactly one compact cylinder Cσ = [0, σ ] × S 1 = [0, σ ]×R/Z with σ > 0, where Cσ inherits its complex structure from [0, σ ]× R ⊂ C via the obvious Z-action on the second coordinate. We sometimes view Cσ as the unit cylinder C = [0, 1] × S 1 equipped with the almost complex structure ∂ ∂ 1 ∂ ∂ −→ σ , −→ − , ∂s ∂t ∂t σ ∂s where s, t are the usual coordinates on C. This almost complex structure, which we also denote σ , is simply the pullback via the horizontal rescaling C → Cσ , (s, t) → (σ s, t), of the almost complex structure on Cσ . We endow each cylinder Cσ with its standard conformal metric ds 2 + dt 2 . (Note that when Cσ is identified with (C, σ ), this is not the usual flat metric on C, unless σ = 1.) We begin by quickly reviewing the Kähler structure of the cotangent bundle T ∗ V . Let ρ : Rn → V be the Riemannian covering map. Consider the composition ∼ =
dρ
∼ =
π : T ∗ Rn −→ T Rn −−→ T V −−→ T ∗ V , where the isomorphisms are induced by the flat metrics on Rn and V . Note that ((dψ)−1 )t (pi ) = dψ(pi ) = ψ(pi ) for all ψ ∈ GW . Then π is a symplectic covering map whose automorphism group is Aut(π ) = (q1 , qi , p1 , pi ) −→ (q1 + 1, −qi , p1 , −pi ), (q1 , qi , p1 , pi ) −→ q1 , ψ(qi ), p1 , ψ(pi ) ,
492
GATIEN AND LALONDE
where as above we wrote qi for (q2 , . . . , qn ) and pi for (p2 , . . . , pn ). Better still, if we identify T ∗ Rn with Cn by setting zk = pk + iqk , then Aut(π : Cn −→ T ∗ V ) = (z1 , zi ) −→ (z1 + i, −zi ), (z1 , zi ) −→ z1 , ψC (zi ) is a group of Kähler isometries. Here, ψC ∈ U (n − 1, C) is the complexification of ψ ∈ O(n − 1, R), defined as the unique complex linear map of Cn−1 = {(z2 , . . . , zn )} whose restriction to the p2 , . . . , pn -plane is equal to ψ. It follows that T ∗ V inherits a Kähler structure from Cn and that π is holomorphic. Moreover, the resulting Kähler form on T ∗ V equals the symplectic form ω = i dpi ∧ dqi . We henceforth let i and ·, · denote the almost complex structure and Riemannian metric on T ∗ V induced by the standard structures on Cn via the covering map π : Cn → T ∗ V . Thus, ·, · = ω(·, i·) and the symplectic gradient XH of any smooth function H : T ∗ V → R is simply i times the ·, · gradient ∇H . Recalling the definitions of L0 , L1 ⊂ T ∗ V , we now observe that
π −1 (L0 ) = (z1 , zi ) : Re z1 = Re zi = 0 ,
π −1 (L1 ) = (z1 , zi ) : Re z1 = 1, Re zi = 0 . So, the inclusion v0 (z) = (z, 0, . . . , 0) of [0, 1] × R into Cn covers a holomorphic cylinder u0 : C1 → T ∗ V with boundary in L = L0 ∪ L1 . In what follows, we deal exclusively with maps u : (Cσ , ∂Cσ ) → (T ∗ V , L) that, after rescaling horizontally, are homotopic to u0 in [C, ∂C; T ∗ V , L], the set of homotopy classes of maps (C, ∂C) → (T ∗ V , L). There are two reasons for this. First, we require a uniform bound on the symplectic area; L is Lagrangian, every cylinder in this since ∗ class has symplectic area equal to u0 ω = 1. The second reason is the following proposition. Proposition 4.1. Suppose u : (C, ∂C) → (T ∗ V , L) is (σ, i)-holomorphic and homotopic to u0 in [C, ∂C; T ∗ V , L]. Then σ = 1 and u = u0 , up to a rotation of C. Proof. The inclusion v0 : [0, 1]×R → Cn , v0 (z) = (z, 0, . . . , 0), is a lift of u0 and satisfies v0 (z + i) = φ ◦ v0 (z), where φ ∈ Aut(π) is the automorphism φ(z1 , zi ) = (z1 +i, −zi ). Choose a homotopy from u0 to u, and use the homotopy lifting property of covering maps to lift it to a homotopy from v0 to a map v : [0, 1] × R → Cn . Then v is (σ, i)-holomorphic and satisfies v(z + i) = φ ◦ v(z), v {1} × R ⊂ π −1 (L1 ). v {0} × R ⊂ π −1 (L0 ), It is more convenient to view v as a map [0, σ ] × R → Cn , which is holomorphic in the usual sense. Writing v = (v1 , vi ), we then have v1 (z + i), vi (z + i) = v1 (z) + i, −vi (z) , Re v1 = Re vi = 0
on {0} × R,
Re v1 = 1, Re vi = 0
on {σ } × R,
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
493
and we must show that σ = 1 and v(z) = (z + ia, 0, . . . , 0), where a ∈ R. Consider first the holomorphic function vi , i > 1. Its real part Re vi is harmonic, vanishes on the boundary of [0, σ ]×R, and satisfies Re vi (z+i) = − Re vi (z). Hence, Re vi assumes an interior maximum value, and so it must be constant by the maximum principle for harmonic functions. Thus, Re vi = 0 and vi is constant. But since vi (z + i) = −vi (z), we must in fact have vi = 0. A similar argument shows that σ = 1 and v1 has the desired form. Consider the holomorphic function w(z) = v1 (z)−z/σ . As before, Re w is harmonic and vanishes on the boundary of [0, σ ] × R. Moreover, since v1 (z + i) = v1 (z) + i, we have 1 , w(z + i) = w(z) + i 1 − σ and in particular, Re w(z+i) = Re w(z). The maximum and minimum principles now imply that Re w = 0, so w must be an imaginary constant ia. Then 1 − 1/σ = 0, so σ = 1 and v1 (z) = z + ia. A geometric observation explains why one might expect u0 to be unique in its homotopy class (cf. [Le], [Sz]). Identify T V with T ∗ V in the usual way (so T V is a complex manifold). Introduce a complex structure on the tangent bundle T R of R by identifying (t, s(d/dt)) ∈ Tt R with s + it ∈ C. Then it is easy to check that for any geodesic γ : R → V , the differential dγ : T R → T V is a holomorphic map. In particular, every closed geodesic in V gives rise to a holomorphic (infinite) cylinder in T ∗ V = T V . Notice that the cylinder u0 arises in precisely this way. It is the differential of the closed geodesic γ0 (t) = u0 (0, t) in L0 . Now if two closed geodesics in V are homotopic, so are the resulting holomorphic cylinders. Since γ0 is in fact unique (among geodesics) in its homotopy class, one might well suspect the same to be true of the cylinder u0 , as Proposition 4.1 confirms. §5. The inhomogeneous Cauchy-Riemann equation. Central to the proof of Theorem 2.1 is a family of elliptic first-order equations of the form ∂u = g. After quickly reviewing the interpretation of such an equation when u is a map between manifolds, we introduce the relevant family of equations for maps of the cylinder into T ∗ V and derive some basic estimates. Let u be a smooth map from a Riemann surface (S, j ) to an almost complex manifold (M, J ). Recall that u is (j, J )-holomorphic if and only if ∂u = 0, where ∂u = du + J ◦ du ◦ j is (twice) the complex antilinear part of du. Here ∂u is a (0, 1)-form on S with values in the complex vector bundle u∗ T M, in other words, a section of the bundle Homj,J (T S, u∗ T M) → S. Alternately, ∂u can be viewed as a section over graph(u) of the vector bundle E = Homj,J (T S, T M) over S ×M whose fiber Ez,v is the space of complex antilinear maps Tz S → Tv M. Under this interpretation, the equation
494
GATIEN AND LALONDE
∂u = g has an obvious meaning whenever g is a global section of E → S × M: A solution is simply a map u : S → M such that ∂u = g | graph(u). The theory of pseudo-holomorphic curves applies equally well to solutions of this inhomogeneous equation. Indeed, as Gromov observed, a map u : S → V satisfies ∂u = g if and only if the associated graph map 1S ×u : S → S ×M is (j, Jf g)-holomorphic for the structure Jg on S × M defined by Jg (ξ, X) = j ξ, J X + gz,p (j ξ )
for ξ ∈ Tz S, X ∈ Tp M.
Now assume S is the cylinder Cσ = [0, σ ] × S 1 . Let pr 2 : Cσ × M → M be the projection on the second factor. Since a complex antilinear map on a real 2dimensional space is completely determined by its value on a single nonzero vector, the assignment A → A(∂/∂s) defines a bundle isomorphism E → pr ∗2 T M. Using this correspondence, the inhomogeneous Cauchy-Riemann equation can be written ∂u ∂u +J = g(z, u), ∂s ∂t
(1)
where g is now a section of pr ∗2 T M → Cσ × M. Specializing further to (M, J ) = (T ∗ V , i), we now let H : M × R/Z → R be a smooth Hamiltonian satisfying the hypotheses of Theorem 2.1. Since the gradient ∇H with respect to M can be pulled back to a section of pr ∗2 T M, we obtain, as a special case of (1), the family of equations ∂u ∂u + i + λ∇H (u) = 0, ∂s ∂t
(2)
where λ ∈ R and where we write H (u), ∇H (u) for H (u, t), ∇H (u, t) to simplify our notation. Our aim is to study C ∞ solutions u : (Cσ , ∂Cσ ) → (T ∗ V , L) of equation (2) for which λ ≥ 0 and u is homotopic to u0 in [C, ∂C; T ∗ V , L] (after rescaling). Let Ꮿ denote the set of all such triples (u, λ, σ ). We begin with a straightforward 1 estimate. (Recall that h = 0 (inf L1 Ht − supL0 Ht ) dt > 0.) Proposition 5.1 (Cf. [HV2]). For any solution (u, λ, σ ) ∈ Ꮿ, we have 1 ≤ σ
2 ∂u ≤ 1 − λh. Cσ ∂s
In particular, λ < 1/ h and σ ≥ 1. Moreover, each loop u(s, ·) : S 1 → T ∗ V has length at least 1. Proof. Using equation (2) and recalling that ·, · = ω(·, i·), we compute 2
∂u = ∂u , −i ∂u − λ ∂u , ∇H (u, t) = ω ∂u , ∂u − λ ∂ H (u, t). ∂s ∂s ∂t ∂s ∂s ∂t ∂s
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
495
Thus, 2 1 σ ∂u ∂ ∗ = H (u, t) ds dt u ω−λ ∂s ∂s 0 0 Cσ Cσ 1 H u(σ, t), t − H u(0, t), t dt = 1−λ
0
≤ 1 − λh, since u(σ, t) ∈ L1 and u(0, t) ∈ L0 for all t. This proves the second estimate. To prove the first, use the Cauchy-Schwarz inequality 2 1/2 σ 1/2 σ σ ∂u ∂u (s, t) ds ≤ (s, t) ds ds , ∂s ∂s 0 0 0 together with the estimate σ ∂u (s, t) ds = length u(·, t) ≥ distance(L0 , L1 ) = 1, ∂s 0 to show that
σ 0
2 ∂u ds ≥ 1 . ∂s σ
The result then follows by integrating over t. For the last assertion, note that since u is homotopic to u0 , each loop u(s, ·) is homotopic to γ0 = u0 (0, ·) in [S 1 ; T ∗ V ]. Thus, length(u(s, ·)) ≥ length(γ0 ) = 1, because γ0 has minimal length in its homotopy class. Proposition 5.1 yields local a priori bounds on the energy of solutions (u, λ, σ ) in
Ꮿ. Given any compact set K ⊂ R × S 1 , there is a constant c = c(K) > 0 such that
1 energy(u | K) = 2
Cσ ∩K
|du|2 ≤ c
for every (u, λ, σ ) ∈ Ꮿ. This is an immediate consequence of the estimate 2 2 2 2 ∂u ∂u ∂u ∂u 2 |du| = + = + i + iλ∇H (u) ∂s ∂t ∂s ∂s 2 2 2 ∂u ∂u ∂u 2 2 |iλ∇H ≤ + 2 i + 2 (u)| ≤ 3 + 2 $dH $2C 0 . ∂s ∂s ∂s h §6. C k estimates. We now show that every cylinder in Ꮿ is contained in a fixed compact subset of T ∗ V and has uniformly bounded derivatives of all orders. Derivative bounds are more or less standard. We use a well-known “bubbling-off” argument for the first-order bounds and standard elliptic bootstrapping techniques for the
496
GATIEN AND LALONDE
higher-order estimates. However, to establish C 0 boundedness, we exploit the special features of the situation, that is, the pseudo-convexity of some neighbourhood of the holomorphic cylinder. (However, one could also use versions of the monotonicity principle or of the maximum principle, as in the proof of Theorem 2.2 below.) Proposition 6.1. There exist constants ck > 0 such that $u$C k ≤ ck for every solution (u, λ, σ ) ∈ Ꮿ. follows from the observation that Proof. The existence of a uniform C 0 bound each regular level set τ −1 (a) of the function τ = i pi2 on T ∗ V is weakly pseudoconvex, and hence no holomorphic curve in T ∗ V can touch τ −1 (a). More explicitly, choose a ≥ 1 so that W = {w ∈ T ∗ V : τ (w) ≤ a} contains the support of ∇H ; then every solution has an image in W . Suppose instead that u(Cσ ) ⊂ W for some (u, λ, σ ) ∈ Ꮿ. Then the function τ ◦u assumes its maximum value at an interior point z0 of Cσ and is subharmonic in a neighbourhood of z0 , since u is holomorphic near z0 . This contradicts the maximum principle. To obtain a first-order bound, we argue by contradiction: Assume there exists a sequence of solutions (uk , λk , σk ) ∈ Ꮿ such that Mk = $duk $C 0 −→ ∞. For convenience, we view uk as maps from [0, σk ] × R ⊂ C to T ∗ V which are 1-periodic in t, and we use the obvious circle action on Ꮿ to assume that the function |duk | attains its maximum on the real axis. Thus, |duk (zk )| = Mk for some zk ∈ [0, σk ] × {0}. Set δk = distance zk , ∂([0, σk ] × R) = min{zk , σk − zk }. We show that, according as the sequence Mk δk is or is not bounded, one of the groups π2 (T ∗ V , L0 ) or π2 (T ∗ V ) must contain a nonzero element. Since both of these groups are in fact trivial, this is a contradiction. Suppose the sequence Mk δk is unbounded. By passing to a subsequence, we can assume Mk εk → ∞ for a suitably chosen sequence εk → 0 such that 0 < εk < δk . Let Dk ⊂ C be the open disk of radius Mk εk centered at the origin, and define vk : Dk → T ∗ V by z vk = uk zk + . Mk This rescaled map satisfies ∂vk ∂vk λk +i + ∇H (vk ) = 0, ∂s ∂t Mk
(3)
|dvk (0)| = 1,
(4)
$dvk $C 0 ≤ 1.
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
497
Moreover, since energy is a conformal invariant, the estimate at the end of §3 shows that the sequence vk has bounded energy. It follows from (3) and (4) that vk has a subsequence that converges uniformly with all derivatives on compact sets to a smooth map v : C → T ∗ V . (See [HZ, pp. 240–243]; since the vk give rise to a sequence of Jk -holomorphic disks in C × T ∗ V with uniformly bounded derivatives, this is also a consequence of the compactness theorem in [MS, Appendix B].) Clearly, v is holomorphic (since the Hamiltonian term in (3) tends to zero as k → ∞), has finite energy, and satisfies |dv(0)| = 1. The removable singularity theorem for holomorphic curves (see [S]) now implies that v extends to a (nonconstant) holomorphic map on the sphere S 2 = C ∪ {∞}. But since this holomorphic sphere has positive symplectic area, it represents a nontrivial element of π2 (T ∗ V ). This contradicts π2 (T ∗ V ) = 0. Now suppose the sequence Mk δk is bounded. Choose an upper bound M, and let εk > 0 be a sequence such that Mk εk > M,
εk −→ 0, Mk εk −→ ∞.
Since δk → 0 and σk ≥ 1, we can assume (after passing to a subsequence) that either δk = zk for all k or δk = σk − zk for all k. These two cases are handled similarly, so we only discuss the first one: δk = zk . Let Dk+ be the half-disk {z ∈ C : |z| < Mk εk , Re z ≥ 0}, and define vk : (Dk+ , ∂Dk+ ) → (T ∗ V , L0 ) by
vk (z) = uk
z . Mk
As before, the vk are solutions of equation (3) with uniformly bounded energy and $dvk $C 0 ≤ 1. Moreover |dvk (zk )| = 1, where zk = Mk zk ∈ Dk+ . Since the sequence zk is bounded (in fact, 0 ≤ zk ≤ M), we can assume that it converges to a point in the closed half-plane C+ = {z ∈ C : Re z ≥ 0}. The compactness theorem in z∞ [MS, Appendix B] now shows that a subsequence of vk converges uniformly with all derivatives on compact sets to a holomorphic map v : (C+ , ∂C+ ) → (T ∗ V , L0 ) )| = 1 and having finite energy. By removal of singularities—this time with |dv(z∞ for holomorphic curves with Lagrangian boundaries (see [O], [S])—we conclude that v extends to a holomorphic map on the disk C+ ∪ {∞} ⊂ S 2 . This holomorphic disk then represents a nontrivial element of the relative group π2 (T ∗ V , L0 ), which is again a contradiction. The above mentioned compactness theorem for sequences of J -holomorphic curves with uniformly bounded derivatives is proved by applying standard estimates for the Cauchy-Riemann operator to a suitable local formulation of the problem. Similar arguments yield uniform bounds on the higher order derivatives of solutions (u, λ, σ ) ∈ Ꮿ. We omit the details and refer the interested reader to [HZ] and [S]. §7. Fredholm setting. The next step in the proof of Theorem 2.1 is to show that there exist solutions in Ꮿ with σ arbitrarily large. This means that σ , when
498
GATIEN AND LALONDE
viewed as a complex structure on the unit cylinder C = [0, 1]×R/Z, tends to infinity in the moduli space of all such structures. Until further notice, all maps into T ∗ V are explicitly defined on the unit cylinder C with its usual flat metric and variable complex structure σ . Translating the results of the previous sections to this new setting, we find that the set Ꮿ of rescaled solutions consists of all (u, λ, σ ) with u : (C, ∂C) → (T ∗ V , L) homotopic to u0 and satisfying ∂ σ u + λ∇H (u) = 0,
(5)
where ∂σ =
∂ 1 ∂ +i σ ∂s ∂t
is the Cauchy-Riemann operator relative to the structures σ on C and i on T ∗ V . (Recall that we write H (u), ∇H (u) for H (u, t), ∇H (u, t).) This is a special case of the inhomogeneous Cauchy-Riemann equation ∂ σ u + g ◦ (1C × u) = 0,
(6)
where g is a section of pr ∗2 T M → C × M, M = T ∗ V , and 1C × u : C → C × M is the graph map of u (cf. equation (1)). Passing from Ꮿ to Ꮿ changes the content of Propositions 5.1 and 6.1 somewhat. For Proposition 5.1, replace Cσ by (1/σ ) C and note that λ < 1/ h and σ ≥ 1 continue to hold for (u, λ, σ ) ∈ Ꮿ . On the other hand, because the metric on C is kept fixed, Proposition 6.1 only ensures the existence of derivative bounds for subsets of Ꮿ with σ bounded. Our main goal in this section is to show that Ꮿ is noncompact in a natural topology. (More specifically, we exhibit a closed noncompact subset of Ꮿ .) The existence of a sequence of solutions with σ → ∞ then follows readily from this rescaled version of Proposition 6.1. We begin by introducing a global setting for the inhomogeneous equation (6). These ideas essentially date back to Gromov [G1] and were formalized, in the case of disks with Lagrangian boundaries, in [ALP]. Our approach is inspired from [ALP] (but we use parts of [MS, Chapter 3] to correct a number of errors). We give the details since the presence of the conformal parameter of the cylinder complicates the situation. First, let Q be the closed hypersurface in L0 covered by the q2 , . . . , qn -hyperplane . Fix α ∈ (0, 1), and let ᐄ1,α denote the Hölder space of C 1,α maps u : in Rn = V (C, ∂C, 0) → (M, L, Q) that are homotopic to u0 in [C, ∂C; M, L]. (Here “0” denotes the image in C of the origin in the universal cover [0, 1]×R. The condition u(0) ∈ Q is required to fix the parametrisation of the cylinders.) This is a C ∞ separable Banach manifold whose tangent space Tu ᐄ1,α at u is the real Banach space of C 1,α vector fields ξ(z) ∈ Tu(z) M along u which are tangent to L along ∂C and tangent to Q at 0 ∈ C. Local charts on ᐄ1,α are given by the exponential map of the metric ·, · on M, relative to which both L and Q are totally geodesic. Next, we choose an appropriate Banach space of sections of pr ∗2 T M → C × M. Endow C × M with the
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
499
product metric and pr ∗2 T M with the pulled-back metric and Levi-Civita connection. For any integer k > 1, let Ᏻk denote the completion with respect to the C k norm of the space of C ∞ compactly supported sections of pr ∗2 T M. Thus, Ᏻk is the separable real Banach space of C k sections of pr ∗2 T M whose covariant derivatives through order k vanish at infinity. (It is even a complex Banach space, but we do not need this.) Finally, define
ᏹk = (u, g, σ ) ∈ ᐄ1,α × Ᏻk × (0, ∞) : ∂ σ u + g ◦ (1C × u) = 0 , and let P : ᏹk → Ᏻk be the projection on the second factor. Note that for each (u, g, σ ) ∈ ᏹk , u is of class C k by elliptic regularity (see [MS, Appendix B] or [S]). In particular, every solution of equation (5) in ᏹk is smooth. Proposition 7.1. For every integer k > 1, ᏹk is a C k−1 Banach submanifold of and P is a C k−1 Fredholm map.
ᐄ1,α × Ᏻk × (0, ∞)
Proof. The map (u, g, σ ) → ∂ σ u + g ◦ (1C × u) defines a section of the C ∞ Banach space bundle Ᏹα → ᐄ1,α × Ᏻk × (0, ∞) whose fiber at (u, g, σ ) is the Hölder space C α (u∗ T M) of C α vector fields along u. In order for ᏹk to be a manifold, we must show that this section is transverse to the zero section. This means that for each (u, g, σ ) ∈ ᏹk , the linearisation ᏸ = D (u, g, σ ) : Tu ᐄ1,α ⊕ Ᏻk ⊕ R −→ C α (u∗ T M)
is surjective and has split kernel. (By definition, ᏸ is the composition of the differential d (u, g, σ ) with the projection T(u,0) Ᏹα → C α (u∗ T M) onto the fiber.) A calculation similar to the one in [M1, Proposition 4.1] shows that, for any fixed (u, g, σ ) ∈ ᏹk , ᏸ is given by ᏸ(ξ, h, τ ) = Du ξ + h ◦ (1C × u) −
τ ∂u , σ 2 ∂s
where Du : Tu ᐄ1,α → C α (u∗ T M) is the operator Du ξ =
1 ∇∂/∂s ξ + i∇∂/∂t ξ + ∇ξ g. σ
Here ∇ denotes covariant differentiation with respect to the induced connections on u∗ T M and pr ∗2 T M. To interpret ∇ξ g as a section of u∗ T M, view ξ as a vector field tangent to C × V along graph(u): ξ(z) ∈ Tu(z) M ⊂ T(z,u(z)) (C × M). Note that Du is an elliptic first-order partial differential operator whose first-order terms have C ∞ coefficients and whose lower-order term has C k−1 coefficients. Hence, Du is Fredholm, as is the operator Du,σ : Tu ᐄ1,α ⊕ R −→ C α (u∗ T M),
(ξ, τ ) −→ Du ξ −
τ ∂u . σ 2 ∂s
500
GATIEN AND LALONDE
Moreover, the section is itself of class C k−1 , being the sum of a C ∞ section and a C k−1 section. Since the image of ᏸ contains the image of the Fredholm operator Du , it has finite codimension in C α (u∗ T M) and so must be closed. (This is a consequence of the open mapping theorem: A finite-codimensional subspace of a Banach space is closed if it is the image of some Banach space by a continuous linear map.) To see that ᏸ is surjective, it therefore suffices to prove that this image is dense. In fact, it is even true that the restriction of ᏸ to Ᏻk has dense image. For since u is of class C k , ᏸ | Ᏻk is the composition of the surjective map R : Ᏻk −→ C k (u∗ T M),
h −→ h ◦ (1C × u),
with the inclusion C k (u∗ T M) → C α (u∗ T M), and since the latter has dense image, so does ᏸ | Ᏻk . To finish proving that ᏹk is a manifold, it remains to check that ᏸ has split kernel or, equivalently, that ᏸ has a right inverse with closed image. Since Du,σ is Fredholm, we can write C α (u∗ T M) = E1 ⊕ E2 ,
(7)
where E1 = image(Du,σ ) and E2 is finite-dimensional. Moreover, since C k (u∗ T M) is dense in C α (u∗ T M), we can assume E2 ⊂ C k (u∗ T M). Viewed as a map from Tu ᐄ1,α ⊕R to E1 , Du,σ has a right inverse T1 : E1 → Tu ᐄ1,α ⊕R with closed image. Choose a right inverse for the above map R : Ᏻk → C k (u∗ T M), and let T2 : E2 → Ᏻk be its restriction to E2 . Then, clearly, T1 ⊕ T2 : C α (u∗ T M) −→ Tu ᐄ1,α ⊕ Ᏻk ⊕ R is the desired right inverse for ᏸ. Turning now to the projection P : ᏹk → Ᏻk , note that the tangent space T(u,g,σ ) ᏹk consists of all triples (ξ, h, τ ) ∈ Tu ᐄ1,α ⊕ Ᏻk ⊕ R such that ᏸ(ξ, h, τ ) = Du,σ (ξ, τ ) + h ◦ (1C × u) = 0,
and the differential dP(u, g, σ ) : T(u,g,σ ) ᏹk −→ Ᏻk is simply the projection (ξ, h, τ ) → h. Hence, the kernel of dP(u, g, σ ) is isomorphic to the kernel of Du,σ , and so it must be finite-dimensional. On the other hand, its image is the closed subspace of Ᏻk consisting of all h such that h ◦ (1C × u) ∈ image(Du,σ ). Using the above decomposition (7) of C α (u∗ T M), it follows that the restriction ᏸ | Ᏻk : Ᏻk → C α (u∗ T M) induces an isomorphism of the quotient Ᏻk / image(dP(u, g, σ )) with E2 ⊂ C k (u∗ T M). Thus, image(dP(u, g, σ )) ⊂ Ᏻk has the same codimension as image(Du,σ ) ⊂ C α (u∗ T M), and hence dP(u, g, σ ) is a Fredholm operator (with the same index as Du,σ ).
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
501
As we will see in the next section, zero is a regular value of P . Since P −1 (0) = {(u0 , 0, 1)} by Proposition 4.1 and the definition of ᐄ1,α , it follows that the index of P must be zero, at least on the component of (u0 , 0, 1) in ᏹk . It can in fact be shown that index(P ) vanishes on all of ᏹk . See §10, where the index calculation is given. Restricting P to this component of ᏹk but keeping the same notation, let us now consider the preimage P −1 (@) of the closed line segment @ joining 0 to (1/ h)∇H in Ᏻk . We use a perturbation argument based on the Sard-Smale theorem [Sm] to show that this subfamily of Ꮿ is noncompact in ᏹk . Assume, if possible, that P −1 (@) is compact. Since P is Fredholm, it is locally proper; that is, each point of ᏹk has an open neighbourhood ᐁ such that P | ᐁ is proper. (If X, Y are metrisable spaces, we say that a map f : X → Y is proper if every sequence xn in X for which f (xn ) converges has a convergent subsequence. It then follows that compact subsets of Y have compact preimages, but not conversely.) Using the local properness of P , it is not difficult to find open neighbourhoods ᐁ of P −1 (@) and ᐂ of @ such that P −1 (K)∩ ᐁ is compact whenever K ⊂ ᐂ is compact. For example, choose ᐁ ⊃ P −1 (@) so that P | ᐁ is proper and then reason by contradiction to find ᐂ. Now observe that both endpoints of @ are regular values of P , since P −1 ((1/ h)∇H ) = ∅ by Proposition 5.1. It follows from the Sard-Smale theorem that @ can be perturbed in ᐂ, while keeping its endpoints fixed, to be transverse to P (this requires P to be of class C 2 at least, so take k ≥ 3). The preimage P −1 (@ ) of the resulting transverse curve @ is then a 1-dimensional submanifold of ᏹk with boundary ∂P −1 (@ ) = {(u0 , 0, 1)}. The same is of course true of P −1 (@ ) ∩ ᐁ. But since the latter is compact, it defines a cobordism between the singleton P −1 (0) and P −1 ((1/ h)∇H ) = ∅, and this is a contradiction. The existence of a sequence in Ꮿ (or in Ꮿ ) with σ → ∞ now follows readily from Proposition 6.1. Suppose instead that σ ≤ M for all (u, λ, σ ) ∈ Ꮿ . Then Proposition 6.1 holds for P −1 (@) ⊂ Ꮿ , and each (u, λ, σ ) ∈ P −1 (@) satisfies 0 ≤ λ < 1/ h and 1 ≤ σ ≤ M. Hence, P −1 (@) is precompact in ᐄ1,α × Ᏻk ×(0, ∞) by Ascoli’s theorem. Since it is also closed there, it must be compact, which is a contradiction. Notice that in the above proof, showing that P −1 (@) is noncompact requires both that P −1 (0) be noncobordant to zero and that index(P ) be nonnegative. This was the reason for only allowing maps with u(0) ∈ Q. Since Q has codimension 1 in V , introducing this restriction causes the index to decrease by exactly 1, that is, we pass from an operator of index 1 (the dimension of the relevant space of parametrised holomorphic cylinders) to an operator of index zero. Also notice that the Sard-Smale theorem requires that P be defined on a separable Banach manifold. This explains our choice of Ᏻk . The usual space of C k -bounded sections of pr ∗2 T M is not separable, because M is not compact. §8. Regularity. The following proposition closes the gap in the discussion of the previous section. We show that the regularity of P at (u0 , 0, 1) is equivalent to the solvability of a pair of Riemann-Hilbert problems, each defined on a closed annulus
502
GATIEN AND LALONDE
in C. We then solve these problems using elementary results from the Bers-Vekua theory of generalised analytic functions. Proposition 8.1. Zero is a regular value of P : ᏹk → Ᏻk . Proof. It suffices to show that dP(u0 , 0, 1) is surjective. By the proof of Proposition 5.1, this amounts to showing that the operator Du0 ,1 : Tu0 ᐄ1,α ⊕ R −→ C α u∗0 T M ,
∂u0 (8) ∂s is surjective. We begin by lifting Du0 ,1 to the universal covers of C and M. Recall that the inclusion v0 : [0, 1] × R → Cn , v0 (z) = (z, 0, . . . , 0), covers the cylinder u0 : C → M and satisfies v0 (z + i) = φ ◦ v0 (z), where φ ∈ Aut(π : Cn → M) is the automorphism φ(z1 , zi ) = (z1 + i, −zi ). Hence, to every (continuous) vector field ξ : C → T M along u0 , one can associate exactly one vector field η : [0, 1] × R → T Cn along v0 such that the following diagram commutes: (ξ, τ ) −→ ∇∂/∂s ξ + i∇∂/∂t ξ − τ
η
[0, 1] × R C
/ T Cn dπ
ξ
/ T M.
Then η(z + i) = dφ ◦ η(z), and conversely, any such section η of v0∗ T Cn gives rise to a unique section ξ of u∗0 T M. Moreover, if ξ belongs to Tu0 ᐄ1,α , the corresponding η satisfies two additional conditions: η | {0} × R, η | {1} × R are tangent to π −1 (L0 ), π −1 (L1 ), respectively, and η(0) is tangent to the q2 , . . . , qn -plane in Cn . Now a section of v0∗ T Cn can be viewed simply as a map [0, 1] × R → Cn . Thus, η = (v1 , . . . , vn ), where vi : [0, 1] × R → C, and the above conditions read v1 (z + i) = v1 (z), for an arbitrary section ξ of
vi (z + i) = −vi (z),
i > 1,
u∗0 T M.
In addition, Re v1 | ∂ [0, 1] × R = 0, v1 (0) = 0, Re vi | ∂ [0, 1] × R = 0, i > 1,
in case ξ ∈ Tu0 ᐄ1,α . To translate expression (8) for Du0 ,1 to this setting, we simply replace ∇∂/∂s , ∇∂/∂t by ∂/∂s, ∂/∂t (viewed as operators on maps [0, 1] × R → Cn ) and ∂u0 /∂s by ∂v0 /∂s = (0, 1). The resulting operator then assigns ∂v2 ∂vn ∂v1 (v1 , . . . , vn , τ ) −→ − τ, ,..., , ∂ z¯ ∂ z¯ ∂ z¯ where ∂ ∂ ∂ = +i ∂ z¯ ∂s ∂t
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
503
is (twice) the usual Cauchy-Riemann operator for maps [0, 1] × R → C. In order for Du0 ,1 to be surjective, we must therefore show that the following two problems always admit solutions. (1) Given a C α map g1 : [0, 1] × R → C such that g1 (z + i) = g1 (z), find τ ∈ R and v1 : [0, 1] × R → C of class C 1,α such that v1 (z + i) = v1 (z) and Re v1 | ∂ [0, 1] × R = 0,
∂v1 = g1 + τ, ∂ z¯
v1 (0) = 0.
(2) Given g2 ∈ C α such that g2 (z+i) = −g2 (z), find v2 ∈ C 1,α such that v2 (z+i) = −v2 (z) and Re v2 | ∂ [0, 1] × R = 0.
∂v2 = g2 , ∂ z¯
An obvious reformulation of these problems will allow us to work with maps defined on a closed annulus in C. For the i-periodic functions in problem (1), we can substitute maps A1 → C, where A1 = {z : 1 ≤ |z| ≤ e2π }. Thus g1 (z) = f1 (e2πz ), where f1 ∈ C α (A1 , C), and similarly for v1 . For problem (2), the periodicity condition is “twisted,” but each function is nevertheless 2i-periodic. Each therefore gives rise to a map f : A2 → C, A2 = {z : 1 ≤ |z| ≤ eπ }, such that f (−z) = −f (z). As is easily checked, these substitutions result in the following equivalent problems. (1 ) Given f1 ∈ C α (A1 , C), find τ ∈ R and u1 ∈ C 1,α (A1 , C) such that f1 (z) + τ ∂u1 (z) = , ∂ z¯ 2π z¯
Re u1 | ∂A1 = 0,
u1 (1) = 0.
(2 ) Given f2 ∈ C α (A2 , C) such that f2 (−z) = −f2 (z), find u2 ∈ C 1,α (A2 , C) such that u2 (−z) = −u2 (z) and ∂u2 f2 (z) (z) = , ∂ z¯ π z¯
Re u2 | ∂A2 = 0.
The existence (and uniqueness) of solutions to problems (1 ) and (2 ) is an immediate consequence of the following lemma. Lemma 8.2. Let A ⊂ C be the closed annulus 1 ≤ |z| ≤ a, and suppose f ∈ C α (A, C), where 0 < α < 1. Then there exists u ∈ C 1,α (A, C) such that ∂u = f, ∂ z¯ if and only if
Re
Re u | ∂A = 0
f (z) dx ∧ dy = 0. A z
(9)
(10)
In this case, any two solutions differ by an additive imaginary constant. In particular, there is exactly one solution u with u(1) = 0. Moreover, if f (−z) = f (z) for all z ∈ A, then (10) holds and there exists a unique solution u such that u(−z) = −u(z).
504
GATIEN AND LALONDE
Proof. We need the following generalisation to Banach spaces of the well-known fact that image(T ) = ker(T ∗ )⊥ whenever T is a Fredholm operator on a Hilbert space. Let X, Y be Banach spaces and suppose T , T ∗ : X → Y are Fredholm operators with index(T ∗ ) = − index(T ). Assume T and T ∗ are adjoint with respect to a pair of continuous nondegenerate bilinear forms ·, · : Y × X → R and ·, · : X × Y → R; that is, T x1 , x2 = x1 , T ∗ x2 for all x1 , x2 ∈ X. (Here, we mean nondegenerate in the weak sense; the induced maps X → Y ∗ , Y → X∗ are injective.) Then y ∈ image(T ) if and only if y, x = 0 for all x ∈ ker(T ∗ ). (See [W, Chapter 1], or the references therein.) Set X = C 1,α (A, C), Y = C α (A, C) ⊕ C 1,α (∂A, R), and consider the operators T , T ∗ : X → Y defined by ∂u ∂u ∗ Tu= , Re u | ∂A , T u= , Re χu | ∂A , ∂ z¯ ∂ z¯ where χ : ∂A → S 1 ⊂ C is the unit tangent vector associated to the standard orientation of ∂A. In [V, Chapter IV] Vekua shows that T and T ∗ are Fredholm operators of index 0 and that ker(T ) and ker(T ∗ ) both have (real) dimension 1. Moreover, using Stokes’s theorem, it is easy to check that T and T ∗ are adjoint with respect to the following continuous nondegenerate bilinear forms on Y × X and X × Y : (f, ψ), u = Re f (z)u(z) dx ∧ dy − ψ Im(χu) ds, A
u, (f, ψ) = − Re
∂A
A
f (z)u(z) dx ∧ dy +
∂A
ψ Im u ds,
where, in the rightmost terms, s denotes arclength along ∂A (see [W, Chapter 1] for a similar calculation). Now, since χ (z) = ±iz/|z|, ker(T ∗ ) is generated by the function 1/z ∈ X. Thus, given f ∈ C α (A, C), (f, 0) ∈ image(T ) if and only if (f, 0), 1/z = 0, which proves the first assertion. The second assertion is clear because ker(T ) is generated by the constant function i. To prove the third, assume f (−z) = f (z). Then (10) clearly holds, so (9) admits a solution u1 ∈ X and every other solution has the form u(z) = u1 (z) + ic, where c ∈ R. To choose c so that u(−z) = −u(z), consider the function v(z) = u1 (−z) + u1 (z). Since f (−z) = f (z), v satisfies ∂v = 0, ∂ z¯
Re v | ∂A = 0,
that is, v ∈ ker(T ). Thus v = ia for some a ∈ R, so u1 (−z) −
ia ia = −u1 (z) + , 2 2
which shows that u(z) = u1 (z) − ia/2 is the desired solution.
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
505
§9. Proof of Theorem 2.1. Returning now to maps defined on cylinders of variable length, we show that Theorem 1.1 is a straightforward consequence of the basic estimate in Proposition 5.1, the uniform C k bounds in Proposition 6.1, and the existence of a sequence (uk , λk , σk ) in Ꮿ with σk → ∞. Given any such sequence, we can assume λk → λ ∈ [0, 1/ h]. Using Proposition 5.1 and the mean value theorem, we then have 2 σk 1 ∂uk 2 ∂uk = −i dt ds ∇H (u , t) − λ 1≥ k k ∂s ∂t 0 Cσk 0 1 ∂uk 2 = σk −i ∂t (sk , t) − λk ∇H uk (sk , t), t dt, where 0 ≤ sk ≤ σk , 0 1 |−i x˙k − λk ∇H (xk , t)|2 dt, = σk 0
where xk = uk (sk , ·) : S 1 → T ∗ V . By Proposition 6.1 and Ascoli’s theorem, we can assume xk → x in C ∞ (S 1 , T ∗ V ). The above inequality then shows that −i x˙ − λ∇H (x, t) = 0, that is, x˙ = XλHt (x). Now, each xk is freely homotopic to the geodesic γ0 and has length at least 1, so x also has these properties. In particular, x is nonconstant, and so λ > 0. In the autonomous case, the reparametrised loop t → x(t/λ) is then the desired T -periodic orbit of H , where T ≤ λ ≤ 1/ h. This completes the proof of the theorem. §10. Calculation of the index formula. In this section, we use the doubling construction to compute the Fredholm index of our problem and show that it vanishes. For the sake of completeness, we give the details of the proof (but see also [HLS], where a similar argument is given). As before, let C = [0, 1] × R/Z be the standard cylinder that we may endow with any of its conformal structures σ ∈ (0, ∞). Let (M, J ) be an almost complex manifold of dimension 2n endowed with an oriented totally real distribution Ᏺ of maximal dimension n (for instance, the fibers of the cotangent bundle in case M = T ∗ V ). Let L be an oriented Lagrangian submanifolds of M. The Maslov index of loops on L can then be computed with respect to Ᏺ. Consider the maps u : (C, ∂C) → (M, L) in an appropriate Sobolev or Hölder space which send the boundary of the cylinder to L. Give the boundary E = E1 ∪E2 of u the orientation induced from u. Consider the index of the Fredholm projection PL , defined as in §6, except that we impose no condition that would fix the parametrisation. Theorem 10.1. The Fredholm index of the parametrised J -holomorphic cylinders with Lagrangian boundary, more precisely the real index of the Fredholm operator PL , is equal to the Maslov index of E plus 1; that is to say, it is equal to the sum of the Maslov indices of the two oriented components E1 , E2 , plus 1. In particular, if L is the union of two Lagrangian submanifolds L1 , L2 and if the component Ei is mapped to Li , then the index is equal to 1 if L1 is Lagrange isotopic to L2 in such a way that E1 is mapped to −E2 through the isotopy.
506
GATIEN AND LALONDE
It follows that the index of unparametrised J -holomorphic embedded cylinders with boundary in L = L0 ∪ L1 vanishes in Theorem 2.1. Proof. Recall that, in order to compute the Fredholm index, we may perturb the various structures of the problem inasmuch as the perturbation gives rise to a continuous perturbation of the corresponding Fredholm setting since in this case the index varies continuously and must therefore remain constant. We may assume that there is at least one (σ, J )-holomorphic cylinder u0 : (C, ∂C) → (M, L) for some conformal structure σ0 of the cylinder and some (possibly nongeneric) almost complex structure J0 on M. We must compute the index of the Fredholm map P locally given, near u0 , as
P = ∂¯J : ᐄ = (σ, u) : σ ∈ (0, ∞), u : (C, ∂C) −→ (M, L) −→ ᐅ, where u is assumed to belong to an appropriate Sobolev class H k , say, and ᐅ consists of H k−1 -sections of the bundle u∗0 T M → C. This is because one may first identify antiholomorphic maps with vector fields (because the tangent space of the cylinder is parallelizable) and then transport these vector fields to u∗0 T M → C using a natural connection. We wish to compare the index of this problem with the index of the linearised operator P with fixed conformal structure. The latter is defined in the following way. Let E → C be the pullback by u of the tangent bundle T M. It is a holomorphic bundle of complex rank n. The Lagrangian submanifold L induces on the pullback bundle E two totally real subbundles Fi → Ei , i = 1, 2, of real dimension n over the boundary components Ei of C. We consider the linear Fredholm operator P = ∂¯ defined on sections of E with boundary in F = F1 ∪F2 . Because the index of such a problem does not depend on the choice of the conformal structure, the index of the restriction dP(σ0 ,u0 ) : Ᏼτ −→ ᐅ of the differential dP(σ0 ,u0 ) to each hyperplane
Ᏼτ = (τ, ξ ) : ξ ∈ E k−1 u∗0 (T M, T L)
⊂ T(σ0 ,u0 ) ᐄ,
where E k−1 (u∗0 (T M, T L)) is the obvious space of sections, is constant (i.e., it does not depend on τ ). It is then an easy exercise of linear algebra to check that the index of dP(σ0 ,u0 ) is equal to 1+index(dP(σ0 ,u0 ) |Ᏼτ ). Therefore, it is equal to 1+index(P ). We must now compute the index of P . We assume that the index is nonnegative and that the generic bundle has a moduli space ᏹ1 of holomorphic sections with boundary in F whose dimension coincides with the index. Note that this is enough for the purposes of this paper. The proof in the general case, which makes use of both the operator and its adjoint, is similar, and we leave it to the reader. Note that F is integrable, and denote by the same letter F the corresponding totally real submanifold, whose components are diffeomorphic to R/Z × Rn .
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
507
Lemma 10.2. Given an end (−ε, 0) × R/Z of C, there is an extension of E to a collar neighbourhood (−ε, ε) × R/Z and an antiholomorphic involution θ of E|(−ε,ε)×R/Z which induces the map (x, y) → (−x, y) on the base (−ε, ε) × R/Z and which is the identity on Fi . Proof. We may assume that the J -structure is locally split near the boundary of u, that is, it coincides, in some small neighbourhood of the boundary with the structure induced by considering E as a trivial holomorphic bundle. We may also assume that, over each end Ei , the subbundle Fi varies in a real-analytic way along Ei so that the conjugation βFi : Ex → Ex , x ∈ Ei , with respect to Fi , is a real-analytic map H from Ei = (R/Z) to the space Ꮽ of linear anticomplex automorphisms of Cn that we equip with a complex structure coming from the embedding of Ꮽ in the n × n matrices EndC (Cn ) induced by composing with the standard complex conjugation. Extend H to an antiholomorphic map H : (−ε, ε) × R/Z → Ꮽ. Define the fibered map θ : (−ε, ε) × R/Z × Cn −→ (−ε, ε) × R/Z × Cn as a pair (φ, H), where φ(x, y) = (−x, y) on points (x, y) ∈ (−ε, ε) × R/Z and where H(x, y, v) is the conjugation H(x, y)(v). One easily checks that this map is antiholomorphic. Note that any holomorphic section σ of E, with boundary in F , can be extended over (−ε, ε) × R/Z to a θ -equivariant holomorphic section. The set σ ∪ θ(σ ) gives a J -invariant surface over the extended end which is necessarily smooth over Ei (and therefore everywhere). This is because the distribution Fi is integrable over Ei . Hence, the differential dθ (σ (x)), x ∈ Ei , fixes the line dσ (Tx Ei ) and therefore it fixes the 2-plane generated by dσ (Tx Ei ), J dσ (Tx Ei ), which is equal to the plane dσ (Tx C). Now take two copies of the pair (E, F ), and endow the second with the opposite complex structure. Extend their ends as above, and glue the corresponding ends together using the antiholomorphic map θ to get a holomorphic bundle E → I, where I is a surface of genus 1. By the lemma, each holomorphic section with boundary in F gives rise to a holomorphic section of E . It is easy to see that the latter is Fredholm regular if the first one is. The correspondence Ꮿ embeds the first moduli space ᏹ1 , of holomorphic sections of E with Lagrangian boundary, as the real part of the second space ᏹ2 , consisting of all holomorphic sections of E . To see this, note that image(Ꮿ) consists of those elements of ᏹ2 which are fixed under the involution I : ᏹ2 → ᏹ2 , defined by permuting the restrictions of a holomorphic section to the two copies of E used to construct E . Because the second copy is endowed with the opposite complex structure, I (J σ ) = −J I (σ ) for any element σ ∈ ᏹ2 . Thus I is an antiholomorphic involution of the space of sections, whose fixed point set is therefore a real part of ᏹ2 . Thus the real dimension of ᏹ1 is equal to the complex dimension of ᏹ2 . This gives the correspondence for the analytic side of the index formula.
508
GATIEN AND LALONDE
As for the topological side, the sum m(F ) of the oriented Maslov indices of F = {F1 , F2 } is equal to the Chern index of E . To see this, note that E is defined by the clutching functions βF1 over E1 and βF2 over E2 . But E1 + E2 is homological to a small null-homotopic loop γ ⊂ C ⊂ I. Thus, topologically, E is defined by the clutching function L : γ → Gl(n, C) equal to the sum A ◦ βF1 + A ◦ βF2 , where A is the standard complex conjugation and where the sum is the composition of loops, not the sum of matrices. Thus, its Chern number c1 (E ) is equal to the rotation number in C − {0} of the determinant of βF1 + βF2 , which is equal to the sum of the Maslov indices. Hence the index of our original problem is index = 1 + dimR ᏹ1 = 1 + dimC ᏹ2 = 1 + index2 = 1 + c1 (E ) = 1 + m(F ). §11. Proof of Theorem 2.2. We first prove the theorem in the case M = Cn . Let J be a ω0 -compatible almost complex structure on Cn such that J |U is the structure inherited from T ∗ V = T ∗ L and J is the standard structure outside some compact neighbourhood of U . Let p be a fixed point of ι, and let γ0 be the corresponding geodesic. Let Lε ⊂ U be the translate of L in the p1 coordinate by a quantity ε. Then u∗0 ω0 = ε, where u0 : Cε → U is the obvious J -holomorphic cylinder with boundary in L ∪ Lε with base γ0 . Lemma 11.1 (Uniqueness of holomorphic cylinders). If ε is small enough, u0 is the unique holomorphic cylinder in its homotopy class; that is, any other (σ, J )holomorphic map u : (C, ∂C) → (Cn , L = L0 ∪ Lε ) satisfies σ = ε and u = u0 up to rotation. 2 Proof. Let S be the hypersurface of U defined by i pi = a, and let Z = ∪x∈S Bx (δ) be the union of balls of capacity δ sufficiently small so that each ball is included in U and disjoint from some neighbourhood U of L0 . Because any complete J -holomorphic surface passing through a point x ∈ S must have an area bounded below by δ (monotonicity lemma; see [S]), taking ε < δ ensures that any holomorphic cylinder with boundary in L = L0 ∪ Lε must remain in U . The conclusion then follows by Proposition 4.1. The next proposition is a straightforward adaptation of Proposition 5.1. Proposition 11.2. Let H : Cn → R be as in the statement of Theorem 2.2, and denote by Ꮿ the set of triples (u, λ, σ ) satisfying the equation ∂u ∂u +J + λ∇H (u) = 0, ∂s ∂t
(11)
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES
509
where λ ≥ 0 and u : (Cσ , ∂Cσ ) → (Cn , L) is homotopic to u0 in [C, ∂C; Cn , L] (after rescaling). Recall that h = inf Lε H − supL0 H > 0. Then ε2 ≤ σ
2 ∂u ≤ ε − λh. Cσ ∂s
In particular, λ < ε/ h and σ ≥ ε. Moreover, we get the following a priori bound for the energy of the solutions: 2 ∂u 2ε 2 2 $du$ ≤ 3 + 2 $dH $2C 0 . ∂s h With this, the C k estimates, k ≥ 0, can be derived in much the same way as in §5. There are, however, two differences. The first difference is the C 0 estimate. The Hamiltonian is linear at infinity, and we must show that this is enough to keep our solutions in a bounded domain of Cn . Let B ⊂ Cn be a closed ball centered at the origin containing U and such that J is standard and H is linear outside B. We claim that every solution must remain in B. If not, then each component of u would be harmonic outside B, so |u|2 would be subharmonic. But |u|2 would assume its maximum at an interior point of Cσ , a contradiction. The second difference is the C 1 estimate. Since this time π2 (Cn , L) does not vanish, we must check that no J -holomorphic disc with boundary in L can appear as a limit of solutions uk . Note that such a disc would have to meet the hypersurface S and would then have area A > ε. The idea now is that, for large k, we would then have a cylinder uk : (Cσ , ∂Cσ ) → (Cn , L) with a domain D ⊂ Cσ over which the map uk is essentially holomorphic and has area greater than ε. But this is a contradiction since the area over which a cylinder in Ꮿ is holomorphic is always bounded above by ε by our basic estimate: ∂uk 2 ∂uk 2 ≤ ∂s ∂s ≤ ε − λh. D
Cσ
Now, to formalise this argument, one uses superior limits as in §5. With the notation of §5, write v : (C+ , ∂C+ ) → (Cn , L0 ) with finite energy, which we extend to a nontrivial element of π2 (Cn , L0 ) by removal of singularities. We then have 2 ∂v ∂vk 2 ∂uk 2 ∗ v ω0 = ≤ lim sup + ∂s = lim sup + ∂s ≤ ε, C+ C+ ∂s k→∞ Dk k→∞ Bk where Bk+ = {z ∈ C : |z| ≤ εk , Re(z) ≥ 0} and where the first inequality is established in the following way. Choose N so that ∂vk 2 ≤ N. Dk+ ∂s
510
GATIEN AND LALONDE
Then, for any compact K ⊂ C, 2 ∂v ∂vk 2 ∂vk 2 ≤ lim sup ≤ N. = lim + ∂s k→∞ K∩D + ∂s K ∂s D k→∞ k k The required inequality then follows. With these C k estimates, the rest of the proof of the theorem follows as for Theorem 2.1, except that we must get rid of the possibility that λ is zero in the limit. More precisely, let (uk , σk , λk ) ∈ Ꮿ be a sequence such that σk → ∞, λk → λ ∈ [0, ε/ h]. We must show that λ is strictly positive. With this, the rest of the proof proceeds as in §8. So, assume λ = 0. Then by Ascoli and the C k estimates, there is a subsequence, denoted in the same way, which converges uniformly with all derivatives on compact subsets to a J -holomorphic map u : (C∞ , ∂C∞ ) −→ (Cn , L0 ),
C∞ = [0, ∞) × R/Z.
Our basic estimate implies that
∂uk 2 ≤ε Cσn ∂s
for all k, and therefore for any compact K of C∞ , we get 2 ∂u ≤ ε. K ∂s Thus, the same integral over C∞ , which is the energy of u, is still bounded by ε. Therefore, u extends to a J -holomorphic map over the disc, with boundary in L0 . But this is a contradiction since that disc cannot escape from U because of the area estimate. The proof of Theorem 2.2 in the remaining cases is a straightforward adaptation of the above arguments when one keeps in mind the fact that for any compatible J on a compact symplectic manifold, there is a strictly positive lower bound for the areas of nonconstant J -holomorphic spheres or discs with Lagrangian boundary. References [A] [AL] [ALP]
[G1]
M. Audin, Quelques remarques sur les surfaces lagrangiennes de Givental , J. Geom. Phys. 7 (1990), 583–598. M. Audin and J. Lafontaine, eds., Holomorphic Curves in Symplectic Geometry, Progr. Math. 117, Birkhäuser, Basel, 1994. M. Audin, F. Lalonde, and L. Polterovich, “Symplectic rigidity: Lagrangian submanifolds” in Holomorphic Curves in Symplectic Geometry, Progr. Math. 117, Birkhäuser, Basel, 1994, 271–321. M. Gromov, Pseudoholomorphic curves in symplectic manifolds, Invent. Math. 82 (1985), 307–347.
HOLOMORPHIC CYLINDERS WITH LAGRANGIAN BOUNDARIES [G2] [HLS] [HV1] [HV2] [HZ] [L] [Le] [M1] [M2] [MS] [O] [S]
[Sm] [Sz] [V] [W]
511
, Partial Differential Relations, Ergeb. Math. Grenzgeb. (3) 9, Springer, Berlin, 1986. H. Hofer, V. Lizan, and J.-C. Sikorav, On genericity for holomorphic curves in fourdimensional almost-complex manifolds, J. Geom. Anal. 7 (1997), 149–159. H. Hofer and C. Viterbo, The Weinstein conjecture in cotangent bundles and related results, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 15 (1988), 411–445. , The Weinstein conjecture in the presence of holomorphic spheres, Comm. Pure Appl. Math. 45 (1992), 583–622. H. Hofer and E. Zehnder, Symplectic Invariants and Hamiltonian Dynamics, Birkhäuser Adv. Texts Basler Lehrbücher, Birkhäuser, Basel, 1994. F. Lalonde, Suppression lagrangienne de points doubles et rigidité symplectique, J. Differential Geom. 36 (1992), 747–764. L. Lempert, “Complex structures on the tangent bundle of Riemannian manifolds” in Complex Analysis and Geometry, Univ. Ser. Math., Plenum, New York, 1993, 235–251. D. McDuff, Examples of symplectic structures, Invent. Math. 89 (1987), 13–36. , Symplectic manifolds with contact type boundaries, Invent. Math. 103 (1991), 651–671. D. McDuff and D. Salamon, J-Holomorphic Curves and Quantum Cohomology, Univ. Lecture Ser. 6, Amer. Math. Soc., Providence, 1994. Y.-G. Oh, Removal of boundary singularities of pseudo-holomorphic curves with Lagrangian boundary conditions, Comm. Pure Appl. Math. 45 (1992), 121–139. J.-C. Sikorav, “Some properties of holomorphic curves in almost complex manifolds” in Holomorphic Curves in Symplectic Geometry, Progr. Math. 117, Birkhäuser, Basel, 1994, 165–189. S. Smale, An infinite dimensional version of Sard’s theorem, Amer. J. Math. 87 (1965), 861–866. R. Szöke, Complex structures on tangent bundles of Riemannian manifolds, Math. Ann. 291 (1991), 409–428. I. N. Vekua, Generalized Analytic Functions, Internat. Ser. Monogr. Pure Appl. Math. 25, Pergamon Press, Oxford, 1962. W. L. Wendland, Elliptic Systems in the Plane, Monogr. Stud. Math. 3, Pitman, Boston, 1979.
Gatien: Department of Mathematics and Statistics, McMaster University, Hamilton, Ontario L8S 4K1, Canada;
[email protected] Lalonde: Institut des Sciences Mathématiques, Université du Québec à Montréal, Case Postale 8888, Succursale Centre-Ville, Montréal, Québec H3C 3P8, Canada; fl
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS OSAMU FUJINO
Contents 0. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 513 1. Definitions and preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 515 2. Reduced boundaries of dlt n-folds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517 3. Finiteness of B-pluricanonical representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 520 4. The abundance theorem for slc threefolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 522 Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 530 0. Introduction. The main purpose of this paper is to prove the abundance theorem for semi log canonical threefolds. The abundance conjecture is a very important problem in the birational classification of algebraic varieties. The abundance theorem for semi log canonical surfaces was proved in [1] and [15] by D. Abramovich, L.-Y. Fong, S. Keel, J. Kollár, and J. Mckernan. The proof uses semiresolution, and so on, and has some combinatorial complexities. We simplify the proof and generalize the theorem to semi divisorial log terminal surfaces (see Corollary 4.10). By our method we can reduce the problem to the irreducible case and the finiteness of some groups. This shows that if the log Minimal Model Program (log MMP, for short), the log abundance conjecture for n-folds, and the finiteness of B-pluricanonical representations (see Section 3) hold for (n − 1)-folds, then the abundance conjecture for semi log canonical n-folds is true almost automatically (see Theorem A.1 in the appendix). But unfortunately the log MMP and the log abundance conjecture are only conjectures for n-folds with n ≥ 4. So we prove the following theorem. Theorem 0.1 (Abundance theorem for slc threefolds). Let (X, ) be a proper semi log canonical (slc, for short) threefold with KX + nef. Then KX + is semiample. This theorem is a generalization of the abundance theorem for log canonical threefolds proved by S. Keel, K. Matsuki, and J. McKernan (see [16]). According to the authors, the abundance theorem for log canonical threefolds is considered to be the first step towards a proof of the abundance conjecture in dimension 4. We believe that the abundance theorem for semi log canonical threefolds is the second step. Received 2 October 1998. Revision received 6 July 1999. 1991 Mathematics Subject Classification. Primary 14E30; Secondary 14E07. Research Fellow of the Japan Society for the Promotion of Science. 513
514
OSAMU FUJINO
The notion of semi log canonical singularity was first introduced in [22] for the problem of compactifying the moduli of surfaces. For the further development of this direction, we recommend the readers to see [10]. Let us see the scheme proposed in [2], [12], and [20]. The abundance conjecture states: Let X be a minimal n-fold with terminal singularities. Then for sufficiently divisible and large m ∈ N, the linear system |mKX | is basepoint-free. After the minimal model program (still conjectural in dimension ≥ 4) produces a minimal n-fold in the birational equivalence class, the abundance conjecture would provide the Iitaka fibration morphism |mKX | : X → Xcan onto its canonical model, which is absolutely crucial for the study of the birational properties of algebraic varieties. The cited authors proposed the following inductional scheme toward a proof of the abundance conjecture. (i) Show that a member D ∈ |mKX | exists for sufficiently divisible and large m ∈ N. (ii) Apply the log MMP to the pair (X, DX ) (the boundary DX is constructed from D in (i)) to obtain a log minimal model (Y, DY ). Observe that by the (generalized) adjunction KY + DY |DY = KDY + Diff, where Diff is the supplementary term for the equality to hold, and the pair (DY , Diff) is a minimal (n − 1)-fold with semi log canonical singularities. (iii) Apply induction on the pair (DY , Diff). Lift the global sections of m(KDY + Diff) to those of m(KY + DY ), which should then provide “enough” global sections of the original mKX to prove that the linear system |mKX | is basepoint-free. In order to complete the inductional circle of steps, we consider the abundance statement for log pairs. (iv) Based upon the abundance for minimal n-folds X with terminal singularities, prove the abundance for log pairs (X, D) with log canonical singularities. (v) Based upon the abundance for log pairs (X, D) with log canonical singularities, prove the abundance for log pairs with semi log canonical singularities. In [2], [12], and [20], the authors proved the abundance conjecture for threefolds along the line of argument described as above, establishing the inductional step (v) in dimension 2 through some combinatorial discussions. In this paper we capture the essential difficulty in carrying out step (v) in arbitrary dimension, as what we call the finiteness of B-pluricanonical representations. In dimension n = 2 or 3 where we can prove this finiteness in dimension n − 1 = 1 or 2, respectively, we establish the step (v) in one stroke without going through the complex combinatorial arguments. We sketch the contents of this paper. Section 1 sets up some basic definitions and facts. In Section 2, we treat the reduced boundaries of dlt n-folds. This is a reformulation of [1, 12.3.2]. Section 3 deals with B-pluricanonical representations (the precise definitions are given in Definition 3.1). We prove their finiteness for curves and surfaces; it plays an important role in our proof of the abundance theorem
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
515
for slc n-folds. In Section 4, the main section, we prove the abundance theorem for slc threefolds. In the appendix, we reformulate the main theorem under some assumptions such as log MMP for n-folds, and we collect some known results for the reader’s convenience. Acknowledgements. I would like to express my gratitude to Professor Shigefumi Mori for giving me much advice and encouraging me during the preparation of this paper. I am grateful to Professor Yoichi Miyaoka and Professor Noboru Nakayama for giving me many useful comments. I would also like to thank Professor Nobuyoshi Takahashi, who pointed out some mistakes. I should add gratefully that the referee’s comments helped me revise this paper. Notation. (1) The word scheme is used for schemes that are separated and of finite type over C; the term variety stands for a reduced and irreducible scheme. A normal scheme consists of the disjoint union of irreducible normal schemes. (2) We freely use terminology about singularities of the pair (X, ), such as Kawamata log terminal, log terminal, divisorial log terminal, log canonical (frequently abbreviated as klt, lt, dlt, and lc), and terminal. For the definition of this terminology, we refer the reader to [21, Section 2.3]. (See also [27].) In the definition in [21, Section 2.3], is not necessarily effective, but in this paper we assume is an effective Q-divisor. (3) Let f : X Y be a rational map. We say that a Q-divisor D is horizontal (resp., vertical) if every irreducible component of D is dominating (resp., not dominating) over Y . (4) The log MMP means the log MMP for Q-factorial dlt pairs. (5) ν denotes the numerical Kodaira dimension. (6) We will make use of the standard notation and definitions as in [21]. 1. Definitions and preliminaries. In this section, we present the basic notation and recall the necessary results. Definition 1.1. Let X be a reduced S2 scheme. We assume that it is pure ndimensional and normal crossing in codimension 1. Let be an effective Q-Weil divisor on X (cf. [5, 16.2]) such that KX + is Q-Cartier. Let X = ∪Xi be a decomposition into irreducible components, and let µ : X := Xi → X = ∪Xi be the normalization. A Q-divisor on X is defined by KX + := µ∗ (KX + ) and a Q-divisor i on Xi by i := |Xi . We say that (X, ) is a semi log canonical n-fold (an slc n-fold, for short) if (X , ) is lc. We say that (X, ) is a semi divisorial log terminal n-fold (an sdlt n-fold, for short) if Xi is normal; that is, Xi is isomorphic to Xi , and (X , ) is dlt. Remark 1.2. (1) The definition of slc above is equivalent to the one in [1] (see [17, 4.2]).
516
OSAMU FUJINO
(2) If (X, ) is an slc n-fold, then X is seminormal (see [1, 12.2.1(8)] and [3, Remark 4.7]). (3) If (X, ) is a dlt n-fold, then (, Diff( − )) is an sdlt (n − 1)-fold (see [18, 17.5] and [21, 5.52]). (4) Let (X, ) be lc. Then (, Diff( − )) is not necessarily slc (see [18, 17.5.2 Example]). The scheme is not necessarily S2 . Note that [5, (16.9.1)] is not correct. (5) Let (X, ) be lc. If (X, 0) is Q-factorial and lt, then the pair (, Diff( − )) is slc. Since X has only rational singularities, especially, X is Cohen-Macaulay, and is Q-Cartier, satisfies S2 condition. The following lemma plays an important role in Section 2. Lemma 1.3 (Connectedness Lemma) [26, 5.7], [18, 17.4], [13, 1.4]. Let X and Y be normal varieties, and let f : X → Y be a proper surjective morphism with connected fibers. Let = di i be a Q-divisor on X. Let g : Z → X be a log resolution (cf. [21, Notation 0.4(10)]) such that h := f ◦ g. Let e i Ei , and F := − ei E i . KZ = g ∗ (KX + ) + i:ei ≤−1
Assume that (1) if di < 0, then f (i ) has codimension at least 2 in Y ; (2) −(KX + ) is f -nef and f -big. Then Supp F = Supp F is connected in a neighborhood of any fiber of h. In particular, if (X, ) is lc and (X, − ) is klt, then ∩ f −1 (y) is connected for every point y ∈ Y . Lemma-Definition 1.4 (Q-factorial dlt model) (cf. [15, 8.2.2]). Let (X, ) be an lc n-fold with n ≤ 3. Then there is a projective birational morphism f : (Y, ) → (X, ) such that (Y, ) is Q-factorial dlt and KY + = f ∗ (KX +). Furthermore, if (X, ) is dlt, then we may take f a small projective morphism that induces an isomorphism at every generic point of a center of log canonical singularities for the pair (Y, ). (For the definition of a center of log canonical singularities, see Definition 4.8.) We say that (Y, ) is a Q-factorial dlt model of (X, ). Definition 1.5. Let (X, ) = ni=1 (Xi , i ) and (X , ) = ni=1 (Xi , i ) be normal schemes with Q-divisor, such that KX + and KX + are Q-Cartier Qdivisors. We say that f : (X, ) (X , ) is a B-birational map (resp., morphism) if f : X X is a proper birational map (resp., morphism) and there exists a common resolution α : T → X, β : T → X of f : X X such that α ∗ (KX + ) = β ∗ (KX + ). That is, there exists a permutation σ such that fi : Xi Xσ (i) is a proper birational map (resp., morphism) and there exists a common resolution αi : Ti → Xi , βi : Ti → Xσ (i) of fi such that αi∗ (KXi + i ) = βi∗ (KX + σ (i) ) on σ (i)
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
Ti for every i. The last condition means that if we write and KTi = βi∗ KX
KTi = αi∗ (KXi + i ) + F
σ (i)
517
+ σ (i) + E,
then F = E. If there is a B-birational map from (X, ) to (X , ), we say that (X, ) is Bbirationally equivalent to (X , ) and write (X, ) ∼B (X , ). Here the symbol B is the initial of boundary. Lemma 1.6. Let (X, ) and (Z, ) be normal varieties with Q-divisors such that KX + and KZ + are Q-Cartier. Let f : X → R and h : Z → R be proper surjective morphisms onto a normal variety R and p : X Z a birational map such that f = h ◦ p. Assume that (1) p −1 has no exceptional divisors; (2) = p∗ ; (3) KX + ≡f 0, KZ + ≡h 0. Then p is a B-birational map. Proof. Let β : W → X be a resolution such that the induced rational map α = p ◦ β : W → Z is a morphism. Let m > 0 be a sufficiently divisible integer. We have linear equivalences −mKW ∼ −α ∗ m(KZ + ) − F, mKW ∼ β ∗ m(KX + ) + E. Adding the two we obtain β ∗ (KX + ) − α ∗ (KZ + ) = F − E. By assumption, F − E is α-exceptional and numerically α-trivial. Then F = E, that is, α ∗ (KZ + ) = β ∗ (KX + ). 2. Reduced boundaries of dlt n-folds. The following is a reformulation of [1, 12.3.2], which fits better in our arguments. Proposition 2.1 (cf. [26, 6.9], [1, 12.3.2]). Let (X, ) be a Q-factorial dlt n-fold with n ≤ 3. Let f : X → R be a projective surjective morphism onto a normal variety R with connected fibers. Assume that KX + is numerically f -trivial. Then one of the following holds. (0) dim R = 0. (0.1) is connected. (0.2) has two connected components 1 and 2 , and there exists a rational map v : X (V , P ) onto a Q-factorial lc (n − 1)-fold (V , P ) with general fiber P1 . The pair (V , 0) is lt. Furthermore, there exists an irreducible component Di ⊂ i such that v|Di : (Di , Diff( − Di )) (V , P ) is a B-birational map for i = 1, 2.
518
OSAMU FUJINO
(1) dim R ≥ 1. (1.1) ∩ f −1 (r) is connected for every r ∈ R. (1.2) The number of connected components of ∩ f −1 (r) is at most two for every r ∈ R. There exists a rational map v : X (V , P ) onto a Q-factorial lc (n − 1)-fold (V , P ) with general fiber P1 . The pair (V , 0) is lt. The horizontal part h of is one of the following: (i) h = D1 , which is irreducible, and the mapping degree deg[D1 : V ] = 2; there is also a B-birational involution on (D1 , Diff(−D1 )) over V ; (ii) h = D1 + D2 such that Di is irreducible and v|Di : (Di , Diff( − Di )) (V , P ) is a B-birational map for i = 1, 2. Remark 2.2. (1) In Proposition 2.1 the assumption KX + ≡f 0 is equivalent to KX + ∼Q,f 0. It is because the relative log abundance theorem holds when dim X ≤ 3 (see Theorem A.2 in the appendix). (2) If the log MMP holds for n-folds, then Proposition 2.1 is also true for n-folds. First, we prove the following lemma. Lemma 2.3. Let (Z, ) be a Q-factorial lc n-fold with n ≥ 2 and = 0. Let h : Z → R be a projective surjective morphism onto a normal variety R with connected fibers. Assume the following conditions: (1) (Z, − ε) is klt, where ε is a small positive rational number; (2) KZ + ≡h 0; (3) there is a (KZ + − ε)-extremal Fano contraction u : Z → V over R such that dim V = n − 1. Then the horizontal part h of is one of the following: (a) h = D1 , which is irreducible, and deg[D1 : V ] = 2; (b) h = D1 , which is irreducible, and deg[D1 : V ] = 1; (c) h = D1 + D2 , such that Di is irreducible and deg[Di : V ] = 1 for i = 1, 2. In the cases (a) and (c), the number of connected components of ∩ h−1 (r) is at most two for every r ∈ R. In the case (b), ∩ h−1 (r) is connected for every r ∈ R. Furthermore, there is a Q-divisor P on V such that (V , P ) is a Q-factorial lc (n − 1)-fold and KDi + Diff( − Di ) = u|Di ∗ (KV + P ) for i = 1, 2. In the case (a), there is a B-birational involution ι over V ; that is, ι : (D1 , Diff(− D1 )) (D1 , Diff( − D1 )) over V is a B-birational map and ι2 = id. In the case (c), u|Di : Di → V is a B-birational morphism for i = 1, 2. In particular, (D1 , Diff( − D1 )) ∼B (D2 , Diff( − D2 )). Note that (V , 0) is lt. Proof. Since is u-ample by the assumptions (2) and (3), we have h = 0. So the general fiber of Z → V is P1 and deg[h : V ] ≤ 2, because KZ + is numerically h-trivial. Since u : Z → V is extremal, the vertical component v of
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
519
is a pullback of a Q-divisor on V and (V , 0) is a Q-factorial lt pair (see [7, Corollary 3.5] or [24, Appendix]). Therefore the first part is proved. Let H1 , H2 , . . . , Hn−2 be general hypersurfaces on V . Consider u−1 H1 ∩ H2 ∩ · · · ∩ Hn−2 −→ H1 ∩ H2 ∩ · · · ∩ Hn−2 .
By using [12, 3.5.1 and 3.5.2] and [1, 12.3.4] to the above morphism, we have a Qdivisor P on V satisfying KDi + Diff( − Di ) = u|Di ∗ (KV + P ). The normalization of (D1 , Diff( − D1 )) is lc and the normalization of (V , P ) in the function field C(D1 ) is lc, since KD1 + Diff( − D1 ) = u|D1 ∗ (KV + P ). Thus (V , P ) is lc and Q-factorial, since Z is Q-factorial and u is extremal. Proof of Proposition 2.1. If f is birational, then Connectedness Lemma 1.3 implies that we are in the case (0.1) or (1.1). Thus we may assume that dim R < dim X. We run the (KX + − ε)-MMP on X over R for 0 < ε 1. The end result is a birational map p : X Z over R. Let h : Z → R be the induced morphism. Since KX + ≡f 0, we obtain that KZ + p∗ ≡h 0. Then (Z, p∗ ) is a Q-factorial lc pair (see Lemma 1.6) and (Z, p∗ − ε p∗ ) is klt. Step 1. If (Z, p∗ − ε p∗ ) is a minimal model, then KZ + p∗ − εp∗ is h-nef and KZ + p∗ ≡h 0. So −p∗ is h-nef. If dim R = 0, then p∗ = 0. By Lemma 2.4 below, we have = 0. If 0 < dim R < dim X, then p∗ ∩ h−1 (r) is connected for every r ∈ R. By Lemma 2.4 we have the case (1.1). Step 2. If there exists an extremal Fano contraction u : Z → V over R, then −(KZ + p∗ − ε p∗ ) is u-ample. Let g : V → R be the induced morphism. We note h = g ◦ u. First, assume that dim R = 0. If p∗ is connected, we have the case (0.1) by Lemma 2.4. Thus we can assume that p∗ is not connected. Since p∗ is relatively ample, we have dim V = dim X − 1. Then by Lemma 2.3 we have the case (c) in Lemma 2.3 and u|Di : Di → V are finite, where Di are as in Lemma 2.3(c) for i = 1, 2. It is because D1 intersects D2 by the relative ampleness of D1 and D2 if u|D1 or u|D2 is not finite. We have Di V by Zariski’s main theorem, because V is normal. By Lemma 1.6, the adjunction, and Lemma 2.4, we have the case (0.2). Thus, we may assume dim R ≥ 1. If dim V = dim X − 1, we get case (1.2) by Lemmas 1.6, 2.3, and 2.4 below, and the adjunction. If dim R = 1, dim V = 1, and dim X = 3, then V R. Since u is extremal, p∗ is h-ample and ρ(Z/R) = 1. Then every horizontal irreducible component of p∗ is h-ample, and every vertical irreducible component of p∗ is a pullback of a Q-Cartier Q-divisor on R. Then p∗ ∩ h−1 (r) is connected for every r ∈ R. We have the case (1.1). The next lemma is used in the proof of Proposition 2.1. Lemma 2.4. If f : X → R, h : Z → R, p : X Z are as in the proof of Proposition 2.1, then the number of connected components of ∩ f −1 (r) is equal
520
OSAMU FUJINO
to the number of connected components of p∗ ∩ h−1 (r) for every r ∈ R. Proof. p : X Z is a composition of flips and divisorial contractions. X X 1 X 2 · · · X i · · · Z. Use Connectedness Lemma 1.3 in each step. Let i be the proper transform of on X i . Note that i is relatively ample for each flipping or divisorial contraction and KXi +i is numerically trivial over R. 3. Finiteness of B-pluricanonical representations. We consider the birational automorphism groups of pairs. Definition 3.1. Let (X, ) be a pair of a normal scheme and a Q-divisor such that KX + is Q-Cartier. We define Bir(X, ) := {σ : (X, ) (X, ) | σ is a B-birational map}, Aut(X, ) := {σ : X → X | σ is an automorphism and σ ∗ = }. Since Bir(X, ) acts on H 0 (X, ᏻX (m(KX + ))) for every integer m such that m(KX + ) is a Cartier divisor, we can define B-pluricanonical representation ρm : Bir(X, ) → Aut H 0 (X, m(KX + )). The following conjecture plays an important role when we reduce the problem to the irreducible case. Conjecture 3.2 (Finiteness of B-pluricanonical representations). Let (X, ) be an n-dimensional (not necessarily connected) proper lc pair. Assume that KX + is nef. Then there is a positive integer m0 such that ρm1 m0 (Bir(X, )) is finite for every m1 ∈ N. For Conjecture 3.2, it is obviously sufficient to prove it under the assumption that X is irreducible. In Theorems 3.3, 3.4, and 3.5, we prove the conjecture for curves and surfaces. Theorem 3.3 (Cf. [1, 12.2.11]). Let (C, ) be a proper lc curve. Then there is a positive integer m0 such that ρm1 m0 (Aut(C, )) is finite for every m1 ∈ N. Proof. If the genus g(C) ≥ 2, then it is trivial. If g(C) = 1 and = 0, then it is also true by [23, p. 60, Application 1]. If g(C) = 1 and = 0, then we put m0 = 12 and the theorem holds by [1, 12.2.9.1]. So we assume that C is a rational curve. If deg(KC +) < 0, then there is nothing to be proved. If | Supp | ≥ 3, then Aut(C, ) is a finite group. So we can reduce to the case where = = {two points}. In this case we can prove easily that ρm (Aut(C, )) is trivial if m is even. Theorem 3.4. Let (S, ) be a proper klt surface. Let m0 ≥ 2 be an integer such that m0 (KS + ) is a Cartier divisor. Then ρm1 m0 (Bir(S, )) is finite for every m1 ∈ N.
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
521
Proof. Let h : S → S be a terminal model, that is, (S , ) is terminal and KS + = h∗ (KS + ) (see [19, (6.9.4)]). Note that is effective and = 0 by the construction. The discrepancy of every exceptional divisor over (S , ) is positive and that of a nonexceptional divisor is nonpositive. The composite σ := h−1 ◦ σ ◦ h does not change discrepancies because σ is a B-birational map. Thus σ is a biregular morphism such that σ ∗ = . We may assume (S, ) is terminal such that = 0 and replace Bir(S, ) with Aut(S, ). Let f : S → S be a finite sequence of −1 ∗
blowings-up whose centers are over such that KS +f∗ = f (KS +)+ ai Ei and Supp(f∗−1 ∪ Ei ) is a simple normal crossing divisor. Let m 0 ≥ 2 be an integer such that m0 (KS + ) is a Cartier divisor. We put D := f∗−1 + (1/m0 )Ei . Then we obtain 1 ∗ ai + Ei . KS + D = f (KS + ) + m0 Note that D is effective, D = 0, and Supp D is a simple normal crossing divisor. Since S − D f S − , σ := f −1 ◦ σ ◦ f acts on H 0 (S , ᏻS (mKS + (m − 1)D )). (Cf. [25, Theorem 2.1 and Proposition 1.4], [9, §11.1].) Then the image of Aut(S, ) → Aut H 0 (S , ᏻS (mKS + (m − 1)D )) is a finite group for every m. It was proved by I. Nakamura, K. Ueno, P. Deligne, and F. Sakai. (See [25, Theorem 5.1] and [28, §14].) Since H 0 S, ᏻS m1 m0 (KS + ) = H 0 S , ᏻS m1 m0 KS + D ⊂ H 0 S , ᏻS mKS + (m − 1)D
for every positive integer m1 with m = m1 m0 , the image of ρm1 m0 : Aut(S, ) → Aut H 0 (S, m1 m0 (KS + )) is a finite group. Theorem 3.5. Let (S, ) be a projective lc surface. Assume that KS + is nef. Then there is a positive integer m0 such that ρm1 m0 (Bir(S, )) is finite for every m1 ∈ N. Proof. Let f := |k(KS +)| : S → R be a morphism with connected fibers for a sufficiently large and divisible integer k by the log abundance theorem. Case 1: ν(S, KS + ) = 2. Then f is a birational morphism and Bir(S, ) acts on R biregularly. We put 2 := f∗ . Then KS + = f ∗ (KR + 2) and 2 is Bir(S, )-invariant. Let h : S → R be the unique minimal resolution; so we have KS + = h∗ (KR +2), Bir(S, ) acts on S biregularly, and is Bir(S, )-invariant. Thus, we may reduce to the case where (S, ) is lc, S is smooth and Bir(S, ) = Aut(S, ) in Theorem 3.4. Since KS + is big, we obtain an effective Cartier divisor D such that am0 (KS + ) ∼ + D, where a is a sufficiently large integer and m0 is a sufficiently divisible integer so that m0 (KS + ) is a Cartier divisor. Observing 1 = m1 m0 (KS + ) + D, (m1 + a) m0 (KS + ) − m1 + a
522
OSAMU FUJINO
we have
H 0 S, ᏻS m1 m0 (KS + ) ⊂ H 0 S, ᏻS (m1 + a)m0 (KS + ) − .
By using Theorem 3.4 for the right-hand side, we obtain the result. Case 2: ν(S, KS + ) = 1. Let g := S → S be a minimal resolution and := f ∗ (KS + ) − KS > 0. We may replace (S, ) with (S , ). By contracting (−1)-curves in the fibers, we may reduce to the case where f : S → R is a P1 -bundle or a minimal elliptic surface. When the horizontal part h = 0, we take an irreducible component D of h . By elementary transformations we may assume D is smooth. Then H 0 S, ᏻS m1 m0 (KS + ) ⊂ H 0 D, ᏻD m1 m0 KD + Diff( − D) . So we have the result by Theorem 3.3. Next we may assume h = 0. When f : S → R is a P1 -bundle, = f ∗ pi for some pi ∈ R. When f : S → R is a minimal elliptic surface, we have KS ∼Q,f 0 and ≡f 0. Then = ai f ∗ pi for some pi ∈ R and positive rational numbers ai (see [4, VIII.3, VIII.4]). Bir(S, ) acts on f (). We define B := pi ∈f () pi . Let H be an ample Cartier divisor on R such that m0 (KS +) ∼ f ∗ H . Then we have bH ∼ B +D for some effective divisor D and some sufficiently large integer b. Then bm0 (KS + ) ∼ f ∗ (B + D). We put A := f ∗ B. Observing 1 (m1 + b) m0 (KS + ) − A = m1 m0 (KS + ) + f ∗ D, m1 + b we have
H 0 S, ᏻS m1 m0 (KS + ) ⊂ H 0 S, ᏻS (m1 + b)m0 (KS + ) − A .
By using Theorem 3.4 for the right-hand side, we obtain a result. Case 3: ν(S, KS + ) = 0. In this case we can show that Bir(S, ) acts on H 0 (S, m1 m0 (KS + )) trivially, using Lemma 4.9 below. First, we replace (S, ) with its Q-factorial dlt model (see Lemma-Definition 1.4). So we may assume that (S, ) is dlt. By Proposition 4.5, we can take a preadmissible section s (cf. Definition 4.1). Then g ∗ s = s for every g ∈ Bir(S, ) by Lemma 4.9. Since ν(S, KS + ) = 0, we have the result. 4. The abundance theorem for slc threefolds. We introduce the notion of preadmissible and admissible sections for the inductive proof of the abundance conjecture for slc n-folds. Definition 4.1. Let (X, ) be a proper sdlt n-fold, and let m be a sufficiently large and divisible integer. We define admissible and preadmissible sections inductively on dimension:
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
523
• s ∈ H 0 (X, ᏻX (m(KX + ))) is preadmissible if the restriction s|(i i ) ∈ H 0 (i i , ᏻ(i i ) (m(KX + )|(i i ) )) is admissible; • s ∈ H 0 (X, ᏻX (m(KX +))) is admissible if s is preadmissible and g ∗ (s|Xj ) = s|Xi for every B-birational map g : (Xi , i ) (Xj , j ) for every i, j . Note that if s ∈ H 0 (X, ᏻX (m(KX + ))) is admissible, then s|Xi is Bir(Xi , i )invariant for every i. We define linear subspaces of H 0 (X, ᏻX (m(KX + ))) as follows: PA X, m(KX + ) := {s is preadmissible} and
A X, m(KX + ) := {s is admissible}.
When dim X = 1, the preadmissible section is a slight generalization of the normalized section (see [1, 12.2.9]). But in the higher dimensional case, the admissible and preadmissible sections behave much better in the inductive proof of the abundance conjecture for slc n-folds. Lemma 4.2. Let (X, ) be a proper slc n-fold, µ : X → X the normalization, and KX + := µ∗ (KX +). Let f : (Y, 2) → (X , ) be a proper birational morphism such that (Y, 2) is dlt with KY +2 = f ∗ (KX +). Then PA(Y, m(KY +2)) descends to sections on (X, ). Proof. By the definition of slc, X is S2 and normal crossing in codimension 1. So, this lemma is obvious by the definition of preadmissible sections. Lemma 4.3. Let (X, ) be a proper Q-factorial dlt n-fold, KX + nef, and S = = 0. Assume that f = |k(KX +)| : X → R is a proper surjective morphism onto a normal variety R with connected fibers for a sufficiently large and divisible p integer k and f () R. If there exist sections {si }i=1 ⊂ H 0 (S, ᏻS (m(KX +)|S )) without common zeros, then there exist sections {ui }li=1 ⊂ H 0 (X, ᏻX (rm(KX + ))) for some integer r such that sir for 1 ≤ i ≤ p, (1) ui |S = 0 for p < i ≤ l; (2) {ui }li=1 have no common zeros. Proof. There is an ample Q-Cartier Q-divisor H on R such that KX + ∼Q f ∗ H . We consider the following commutative diagram: H 0 X, ᏻX rm(KX + ) O ∼ =
H 0 R, ᏻR (rmH )
/ H 0 S, ᏻS rm(KX + )|S O ∼ =
/ H 0 T , ᏻT (rmH |T ) .
524
OSAMU FUJINO
Let T := f (S) and IT the defining ideal of T . By Lemma 4.4 below, (f |S )∗ ᏻS = ᏻT . So the vertical arrows are isomorphisms. Since H is ample, the horizontal arrows are surjective and ᏻR (rmH ) ⊗ IT is generated by global sections for a sufficiently large integer r. So we can get sections {ui }li=1 with required properties. We use the next lemma in Proposition 4.5. Lemma 4.4. Let f : X → R be a proper surjective morphism between normal varieties with connected fibers. Assume that (1) (X, ) is a Q-factorial dlt n-fold; (2) f () R; (3) KX + ∼Q,f 0. Then there is an exact sequence 0 −→ f∗ ᏻX (−) −→ ᏻR −→ f∗ ᏻ −→ 0. Proof. There is a positive integer m and a Cartier divisor D on R such that m(KX + ) = f ∗ D. Observe that m(KX + ) − − (KX + {}) = (m − 1)(KX + ). Since KX + {} is klt and (m − 1)(KX + ) is f -semiample by the assumptions (1) and (3), R 1 f∗ ᏻX (m(KX + ) − ) is torsion-free (see [14, 1-2-7]). By the assumption (2), f∗ ᏻ (m(KX + )) is a torsion sheaf. Then we have an exact sequence 0 −→ f∗ ᏻX m(KX + ) − −→ f∗ ᏻX m(KX + ) −→ f∗ ᏻ m(KX + ) −→ 0. Tensoring ᏻR (−D) gives the result. The next proposition is the main part of the proof of the abundance theorem for slc n-folds. Proposition 4.5. Let (X, ) be a projective Q-factorial dlt pair with n ≤ 3. Let m be a sufficiently large and divisible integer, especially m ∈ 2Z. Assume that (1) KX + is nef; (2) A(, m(KX + )| ) generates ᏻ (m(K + Diff( − ))). Then PA X, m(KX + ) −→ A , m(KX + )| is surjective, and PA(X, m(KX + )) generates ᏻX (m(KX + )). Proof. It is sufficient to prove this proposition for each connected component. So we can assume that (X, ) is a projective Q-factorial irreducible dlt pair. Apply the log abundance theorem. We get f := |k(KX +)| : X → R for a
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
525
sufficiently large and divisible integer k. If = 0, then the proposition is trivial. Thus, we may assume = 0. We have the following possibilities by Proposition 2.1: (1) ν(X, KX + ) = 0 and is connected; (2) ν(X, KX + ) ≥ 1 and f −1 (r) ∩ is connected for every r ∈ R and f () = R; (3) ν(X, KX + ) ≥ 1 and f −1 (r) ∩ is connected for every r ∈ R and f () R; (4) f −1 (r) ∩ is not connected for some r ∈ R. Case 1. Consider the exact sequence 0 −→ H 0 X, ᏻX m(KX + ) − −→ H 0 X, ᏻX m(KX + ) −→ H 0 , ᏻ m(KX + )| −→ · · · . Since ν(X, KX + ) = 0, H 0 (X, ᏻX (m(KX + ) − )) = 0 and the second and third terms are one-dimensional. Thus, we get the result. Case 2. We construct a morphism ϕ : → C by the linear system corresponding to A(, m(KX + )| ). Since every curve in any fiber of f | goes to a point by ϕ, there exists a morphism ψ : R → C such that ψ ◦ (f | ) = ϕ. For s ∈ A(, m(KX + )| ), there exists a section t on C such that s = ϕ ∗ t. Thus we obtain u := f ∗ ψ ∗ t ∈ PA(X, m(KX + )) such that u| = s. Note that if we write KX + ∼Q f ∗ H as in Lemma 4.3, we have H 0 (X, ᏻX (m(KX + ))) H 0 (R, ᏻR (mH )) H 0 (, ᏻ (m(KX + )| )). Since A(, m(KX + )| ) generates ᏻ (m(K + Diff( − ))), PA(X, m(KX + )) generates ᏻX (m(KX + )). Case 3. We put T := f () R. By Lemma 4.4 we obtain ᏻT = f∗ ᏻ . Then T is seminormal (see [3, Proposition 4.5]). As in case 2, we construct ϕ : → C by A(, m(KX + )| ) and get ψ : T → C. By Lemma 4.3, we can pull back s ∈ A(, m(KX + )| ) to u ∈ PA(X, m(KX + )) if m is a sufficiently large and divisible integer. By Lemma 4.3 we can also check that PA(X, m(KX + )) generates ᏻX (m(KX + )) if m is a sufficiently large and divisible integer. Case 4. In this case, X is generically a P1 -bundle over (V , P ) by Proposition 2.1. Let f : (X, ) → R be Iitaka fiber space and u : (X , ) → (V , P ) be the last step of the log MMP over R as in the proof of Proposition 2.1. In this case u is a Fano contraction to an (n − 1)-dimensional lc pair (V , P ). By Lemma 1.6, we have that H 0 X, ᏻX m(KX + ) H 0 X , ᏻX m(KX + ) . Let α : (Y, ) → (X, ) and β : (Y, ) → (X , ) be a common log resolution of a B-birational map p : X X such that KY + = α ∗ (KX + ) and
526
OSAMU FUJINO
KY + = β ∗ (KX + ). We define B := di =1 i , where = di i (see Definition 4.8). Then H 0 , ᏻ m(KX + )| H 0 B , ᏻB m(KY + )|B H 0 , ᏻ m(KX + )| by Lemma 1.3 (see also the proof of Lemma 4.9). We note that and are seminormal (see Remark 1.2). So it is sufficient to treat (X , ) instead of (X, ). Let s be a section in A(, m(KX +)| ). By the above isomorphism, we can assume that the section s is in H 0 ( , ᏻ (m(KX + )| )). We have the decomposition = h ∪ v , where h (resp., v ) is the horizontal (resp., vertical) part of with respect to the morphism u. Since s| h is B-birational involution invariant over (V , P ), it descends to a section t on (V , P ). By the isomorphism H 0 (X, ᏻX (m(KX + ))) H 0 (X , ᏻX (m(KX + ))) H 0 (V , ᏻV (m(KV + P ))), we can pull the section t back to the section w ∈ H 0 (X, ᏻX (m(KX +))) H 0 (X , ᏻX (m(KX + ))). First, we prove that s| h = w| h . It is true because there is a small analytic open set U in V such that u−1 (U ) is biholomorphic to P1 ×U and u|u−1 (U ) : P1 ×U → U is a second projection. By the same argument as in [1, 12.3.4], the difference between s| h and w| h is at most (−1)m . Since we assume that m is even, we have that s| h = u| h . Next, we check that s| v = w| v . By Lemma 2.3, v = u∗ Di , where Di ⊂ P is an irreducible divisor. We define Ei := u∗ Di . It is sufficient to check that s|Ei = w|Ei for every i. Let 2i be an irreducible component of Ei ∩ h such that u|2i : 2i → Di is dominant. Since h ∩ v = ∅, we can always take such 2i . We consider the commutative diagram H 0 Ei , ᏻEi m(KX + )|Ei O
/ H 0 2i , ᏻ2 m(KX + )|2 i i O
∼ =
0
H Di , ᏻDi m(KV + P )|Di
ι
id
0
/ H Di , ᏻD m(KV + P )|D . i i
The map ι is injective since u|2i : 2i → Di is dominant and the left vertical map is an isomorphism since Di is seminormal (see [3, Proposition 3.2]) and u|Ei has connected fibers. As s|2i = w|2i , so we get s|Ei = w|Ei for every i. Note that 2i ⊂ h . Thus we have s = w| . Finally, since A(, m(KX +)| ) generates ᏻ (m(K +Diff(− ))) by the assumption, the restriction to h , which descends to sections on (V , P ), generates ᏻ h (m(KX + )| h ). Therefore PA(X, m(KX + )) generates ᏻX (m(KX + )). This completes the proof. Remark 4.6. In Proposition 4.5 the assumption n ≤ 3 is used for the log MMP and the log abundance theorem, which are used in Proposition 2.1, too.
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
527
Lemma 4.7. In Proposition 4.5, if dim X ≤ 2, then we can replace PA(X, m(KX + )) with A(X, m(KX + )). Proof. For every s ∈ A(, m(KX + )| ), we can take a sections ∈ PA(X, m(KX +)) such that s | = s by Proposition 4.5. If we put t := 1/|G| g∈G g ∗ s ∈ A(X, m(KX + )), where G = ρm (Bir(X, )), then t| = s by Lemma 4.9. Therefore, A X, m(KX + ) −→ A , m(KX + )| is surjective. Let k be a sufficiently large and divisible integer. Then Bir(X, ) acts on PA X, lk(KX + ) . l≥0
Let N := |ρk (Bir(X, ))| < ∞ by Section 3, and let σi be the ith elementary symmetric polynomial. We obtain {s = 0} ⊃
N
j =1
N
∗ ∗ s =0 . gj∗ s = 0 = σi g1 s, . . . , gN i=1
If s ∈ PA(X, k(KX + )), then N!/ i ∗ ∗ s ∈ A X, N !k(KX + ) . g1 s, . . . , gN σi The vector space PA(X, k(KX + )) generates ᏻX (k(KX + )) by Proposition 4.5. Thus we can prove that A(X, N!k(KX + )) generates ᏻX (N!k(KX + )) by using Proposition 4.5 and Lemma 4.9. In order to prove Lemma 4.9 we need the following definition. Definition 4.8. Assume that X is nonsingular, Supp is a simple normal crossing divisor, and = i di i is a Q-divisor such that di ≤ 1 (di may be negative) for every i. In this case we say that (X, ) is B-smooth. Let (X, ) be dlt or B-smooth. A subvariety W of X is said to be a center of log canonical singularities if there is a proper birational morphism from a normal variety µ : Y → X and a prime divisor E on Y with the discrepancy a(E, X, ) = −1 such that µ(E) = W (cf. [13, Definition 1.3]). Let (X, ) be dlt or B-smooth. We write = i di i such that i are distinct prime divisors. Then the B-part of is defined by B := di =1 i . If (X, ) is dlt or B-smooth, then a center of log canonical singularities is an irreducible component of an intersection of some B-part divisors. (See the Divisorial Log Terminal Theorem of [27] and [21, Section 2.3].) When we consider a center of log canonical singularities W , we always consider the pair (W, 2) such that KW +2 = (KX + )|W , where 2 is defined by using the adjunction repeatedly. Note that if (X, ) is dlt (resp., B-smooth), then (W, 2) is dlt (resp., B-smooth) by the adjunction. If (X, ) is dlt pair or B-smooth and W is a center of log canonical singularities, then we write W X.
528
OSAMU FUJINO
Lemma 4.9. Let (X, ) be a pure n-dimensional proper dlt pair with KX + nef and let m be a sufficiently large and divisible integer. We write G = ρm (Bir(X, )). If s ∈ PA(X, m(KX + )), then g ∗ s| = s| and g ∗ s ∈ PA(X, m(KX + )) for every g ∈ G. In particular if |G| is finite, then g ∗ s ∈ A X, m(KX + ) , g∈G
g ∗ s ∈ A X, m|G|(KX + ) ,
g∈G
and
g ∗ s| = (s| )|G| .
g∈G
Proof. In this proof we omit the restriction symbols such as |B when there is no confusion. Let α, β : (Y, ) → (X, ) be a common log resolution of a Bbirational map σ : X X such that α = σ ◦ β and ρm (σ ) = g. Since is seminormal and B → has connected fibers by Connectedness Lemma 1.3, we have α∗ ᏻB = ᏻ and β∗ ᏻB = ᏻ . Then α ∗ and β ∗ induce the isomorphisms between H 0 (, ᏻ (m(KX + )| )) and H 0 (B , ᏻB (m(KY + )|B )). So in order to prove g ∗ s| = s| it is sufficient to check that α ∗ (s| ) = β ∗ (s| ) in H 0 (B , ᏻB (m(KY + )|B )) for some common log resolution (Y, ). For this purpose we prove the next two claims. Claim (An ). Let (T , ) and (S, 2) be n-dimensional B-smooth pairs and h : S → T a B-birational morphism. If W T , then there is a W S such that W → W is a B-birational morphism. Claim (Bn ). Let (T , ) and (S, 2) be n-dimensional B-smooth pairs and h : S → T a B-birational morphism. Assume that W S. If W → h(W ) is not Bbirational, then there is a W W such that W → h(W ) is a B-birational morphism and the inclusion W → W induces the isomorphism H 0 (W, ᏻW (m(KS + 2)|W )) H 0 (W , ᏻW (m(KS + 2)|W )). Proof of Claims. We prove Claim (An ) and Claim (Bn ) by induction on n. If n = 1, then (A1 ) and (B1 ) are trivial. First, we treat (An ). If W is a divisor, then we can take the proper transform of W as W . Otherwise, take a divisor V T such that W V . We define U S as the proper transform of V . By using the (An−1 ) to the B-birational morphism U → V , we obtain W U such that W → W is a B-birational morphism. Thus, we have (An ). Next, we treat (Bn ) by using (Al ) with l < n. Let u : (S , 2 ) → S be the blowingup of S with center W . If we prove (Bn ) for the pair S and the exceptional divisor E S , then we can prove (Bn ) for S. It is because G E implies u(G) is the required center of log canonical singularities. So we may assume that W is a divisor. By
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
529
[21, 2.45] we have a sequence of blowings-up T0 → T1 → · · · → Tk = T whose centers are the centers associated to the valuation W such that the rational map S T0 is an isomorphism at the generic point of W . We can replace (S, 2) with the elimination of indeterminacy (S , 2 ) because W and its proper transform are B-birational. So we can assume that f0 : S → T0 is a B-birational morphism. We can apply (An−1 ) to the B-birational morphism f0 : W → f0 (W ). So we only have to prove (Bn ) to the B-birational morphism T0 → T . So we may assume that (S, 2) = (T0 , 0 ). We use the induction on the number k of the blowings-up. When k = 1, we take a divisor D T1 such that h(W ) D and its proper transform D T0 . By applying (Bn−1 ) to D ∩ W D and h(D ∩ W ) = h(W ) D, we have the result. Next we apply the induction hypothesis to a : T0 → Tk−1 . We have U W such that a : U → a(W ) is a B-birational morphism. Since b : Tk−1 → Tk is one blowing-up whose center is the center associated to the valuation W , we can take a divisor V on Tk−1 such that a(U ) = a(W ) V and b : V → b(V ) is B-birational. By using (Bn−1 ) to b : V → b(V ) we have a V a(U ) such that V → (b ◦ a)(W ) = h(W ) is B-birational. By using (Al ) with l < n to U → a(U ) we obtain a W U W such that W → V is B-birational. So W → h(W ) is B-birational. By the construction of W , the inclusion W → W induces the isomorphism H 0 (W, ᏻW (m(KS + 2)|W )) H 0 (W , ᏻW (m(KS + 2)|W )). This proves (Bn ). Proof of Lemma 4.9 continued. We return to the proof of the lemma. Let f : (X , ) → (X, ) be Szabó’s resolution such that KX + = f ∗ (KX + ), that is, a log resolution whose discrepancy is greater than −1 (see Resolution Lemma in [27]). Let α = αi and β = βi : (Y, ) = (Yi , i ) → (X, ) be a common log resolution of a B-birational map σ : X X passing through (X , ) such that α = σ ◦ β and ρm (σ ) = g. We write α , β : Y → X such that α = f ◦ α , β = f ◦ β . We take an irreducible divisor E ⊂ B . Apply (Bn ) to the αi : Yi → αi (Yi ) such that E Yi . Then we obtain an E E or E = E such that E → α (E) is Bbirational. Apply (Bn ) to the βi : Yi → βi (Yi ). Then we obtain an E
E or E
= E such that E
→ β (E ) is B-birational. By repeating the above construction, we obtain F E or F = E such that α : F → α (F ) and β : F → β (F ) are B-birational. The morphism f : α (F ) → α(F ) and f : β (F ) → β(F ) are Bbirational since f is Szabó’s resolution. Then α : F → α(F ) and β : F → β(F ) are B-birational. Since s ∈ PA(X, m(KX +)), α ∗ (s|α(F ) ) = β ∗ (s|β(F ) ) on F . Since H 0 (E, m(KY + )|E ) H 0 (F, m(KY + )|F ) by (Bn ), we have α ∗ s = β ∗ s on E. Since E is an arbitrary irreducible component of B , we have α ∗ (s| ) = β ∗ (s| ). Thus g ∗ s| = s| . The latter part is trivial. The next corollary is the main theorem of this paper, which includes Theorem 0.1 (for proper pairs, see Remark 4.11) and a generalization of [1, 12.1.1] and [15, 8.5]. Corollary 4.10. Let (X, ) be a projective slc n-fold such that KX + is nef, where n ≤ 3. Then |m(KX + )| is free for a sufficiently large and divisible integer
530
OSAMU FUJINO
m. In particular, if (X, ) is projective sdlt and n ≤ 2, then A(X, m(KX + )) generates ᏻX (m(KX + )). Proof. We prove this corollary inductively on dim X. We take the normalization of (X, ) and take a Q-factorial dlt model (Y, 2) = (Yi , 2i ) for each irreducible component by Lemma-Definition 1.4. By the assumption of induction, A(2, m(KY + 2)|2 ) generates ᏻ2 (m(K2 + Diff(2 − 2))). By Proposition 4.5, PA Y, m(KY + 2) −→ A 2, m(KY + 2)|2 is surjective, and PA(Y, m(KY + 2)) generates ᏻY (m(KY + 2)). So, by Lemma 4.2, |m(KX + )| is free. If dim X ≤ 2, then we have the finiteness of B-pluricanonical representation (see Section 3). Therefore, we can replace PA(Y, m(KY + 2)) with A(Y, m(KY + 2)) by Lemma 4.7. So we get the latter part of this corollary. We need the latter part for the inductional treatment. In order to prove the abundance for slc n-folds (X, ), we use Proposition 4.5, which demands that ᏻ2 (m(K2 +Diff(2− 2))) is generated by not only PA(2, m(KY + 2)|2 ) but also A(2, m(KY + 2)|2 ). Remark 4.11. Proposition 4.5, and hence Theorem 3.5 and Corollary 4.10, hold true even for proper pairs. This can be checked, as in [16, 7.1]. Indeed, let (X, D) be a proper dlt pair such that KX +D is nef. By the log MMP, we can find a Q-factorial projective dlt pair (X , D ) with nef KX +D . By [26, 1.5], (X , D ) is B-birationally equivalent to (X, D). By using the arguments similar to those used in Lemma 4.9 and in case 4 in the proof of Proposition 4.5, we can reduce Proposition 4.5 for (X, D) to that for the projective (X , D ). Therefore, we get Proposition 4.5 for proper dlt pairs. Thus, Theorem 3.5 and Corollary 4.10 hold true for proper pairs. Appendix We can state the results similar to Corollary 4.10 in arbitrary dimension, if we list all the necessary results (e.g., log MMP) yet to be established as the assumption. Theorem A.1. Assume the log MMP for dimension n. (1) If the abundance conjecture holds for lc n-folds and if the finiteness of Bpluricanonical representations (see Conjecture 3.2) is true for dimension (n − 1), then the abundance conjecture is true for slc n-folds. (2) If the abundance conjecture holds for klt n-folds and slc (n − 1)-folds, then the abundance conjecture is true for lc n-folds. Proof. For the proof of (1), see Remarks 2.2 and 4.6, and the proof of Corollary 4.10. One can prove (2) by using the same argument as in [16, Section 7]. We list the following two results for the reader’s convenience. Theorem A.2 (Relative log abundance theorem). Let (X, ) be lc and dim X ≤ 3. Let f : X → S be a proper surjective morphism onto a variety S. Assume that
ABUNDANCE THEOREM FOR SEMI LOG CANONICAL THREEFOLDS
531
KX + is f -nef. Then KX + is f -semiample. Proof. If dim S = 0, then this is nothing but the log abundance theorem (see [6], [8], and [16]). So we may assume dim S ≥ 1. If (X, ) is klt, the proof is given, for example, in [14, 6-1-11], [11]. When (X, ) is lc, we can use the arguments in [16, Section 7] in the relative setting. (See also [15, 8.5].) Corollary A.3 (Threefold log canonical flips) (cf. [15, 8.1]). Threefold log canonical flips exist. Proof. Let (X, ) be an lc pair and f : (X, ) → S a flipping contraction. We take a Q-factorial dlt model (X , ) (see Lemma-Definition 1.4) and run the log MMP over S. Then we obtain a relative minimal model (X
,
) over S. By using Theorem A.2 we have a relative canonical model (X + , + ). It is easy to check that (X+ , + ) is the flip of f . References [1]
[2]
[3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14]
[15]
D. Abramovich, L.-Y. Fong, J. Kollár, and J. McKernan, “Semi log canonical surface” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 139–154. D. Abramovich, L.-Y. Fong, and K. Matsuki, “Abundance for threefolds: ν = 2” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 159–163. F. Ambro, The locus of log canonical singularities, preprint, 1998. A. Beauville, Complex Algebraic Surfaces, London Math. Soc. Stud. Texts 34, Cambridge University Press, Cambridge, 1996. A. Corti, “Adjunction of log divisors” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 171–182. L.-Y. Fong and J. McKernan, “Abundance for log surfaces” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 127–137. O. Fujino, Applications of Kawamata’s positivity theorem, Proc. Japan Acad. Ser. A Math. Sci. 75 (1999), 75–79. T. Fujita, Fractionally logarithmic canonical rings of surfaces, J. Fac. Sci. Univ. Tokyo 30 (1984), 685–696. S. Iitaka, Algebraic Geometry: An Introduction to Birational Geometry of Algebraic Varieties, Grad. Texts in Math. 76, Springer, New York, 1982. K. Karu, Minimal models and boundedness of stable varieties, preliminary version, preprint, 1998. Y. Kawamata, Pluricanonical systems on minimal algebraic varieties, Invent. Math. 79 (1985), 567–588. , Abundance theorem for minimal threefolds, Invent. Math. 108 (1992), 229–246. , On Fujita’s freeness conjecture for 3-folds and 4-folds, Math. Ann. 308 (1997), 491– 505. Y. Kawamata, K. Matsuda, and K. Matsuki, “Introduction to the minimal model problem” in Algebraic Geometry (Sendai, 1985), Adv. Stud. Pure Math. 10, North-Holland, Amsterdam, 1987, 283–360. S. Keel and J. Kollár, “Log canonical flips” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 95–101.
532 [16] [17] [18] [19] [20]
[21] [22] [23] [24] [25] [26] [27] [28]
OSAMU FUJINO S. Keel, K. Matsuki, and J. McKernan, Log abundance theorem for threefolds, Duke Math. J. 75 (1994), 99–119. J. Kollár, Projectivity of complete moduli, J. Differential Geom. 32 (1990), 235–268. , “Adjunction and discrepancies” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 183–192. , “Crepant descent” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 75–87. J. Kollár, K. Matsuki, and J. McKernan, “Abundance for threefolds: ν = 1” in Flips and Abundance for Algebraic Threefolds, Astérisque 211, Soc. Math. France, Montrouge, 1992, 155–158. J. Kollár and S. Mori, Birational Geometry of Algebraic Varieties, Cambridge Tracts in Math. 134, Cambridge University Press, Cambridge, 1998. J. Kollár and N. Shepherd-Barron, Threefolds and deformations of surface singularities, Invent. Math. 91 (1988), 299–338. D. Mumford, Abelian Varieties, Oxford Univ. Press, London, 1970. N. Nakayama, Zariski-decomposition and abundance, RIMS-1142, preprint, 1997. F. Sakai, “Kodaira dimensions of complements of divisors” in Complex Analysis and Algebraic Geometry, Iwanami Shoten, Tokyo, 1977, 239–257. V. V. Shokurov, 3-fold log flips (in Russian), Izv. Ross. Akad. Nauk Ser. Mat. 56 (1992), 105–203; English transl. in Russian Acad. Sci. Izv. Math. 40 (1993), 95–202. E. Szabó, Divisorial log terminal singularities, J. Math. Sci. Univ. Tokyo 1 (1994), 631–639. K. Ueno, Classification theory of algebraic varieties and compact complex spaces, Lecture Notes in Math. 439, Springer, Berlin, 1975.
Research Institute for Mathematical Sciences, Kyoto University, Kyoto 606-8502, Japan;
[email protected]
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
BMO FOR NONDOUBLING MEASURES J. MATEU, P. MATTILA, A. NICOLAU, and J. OROBITG
1. Introduction. The Calderón-Zygmund theory of singular integrals has been traditionally considered with respect to a measure satisfying a doubling condition. Recently, Tolsa [T] and, independently, Nazarov, Treil, and Volberg [NTV] have shown that this standard doubling condition was not really necessary. Likewise, in the homogeneous spaces setting, functions of bounded mean oscillation, BMO, and its predual H 1 , the atomic Hardy space, play an important role in the theory of singular integrals. This note is an attempt to find good substitutes for the spaces BMO and H 1 when the underlying measure is nondoubling. Our hope was that we would have been able to prove some results of Tolsa, Nazarov, Treil, and Volberg, via BMO-H 1 interpolation, but in this respect we were unsuccessful. Let µ be a nonnegative Radon measure on Rn . A function f ∈ L1loc (µ) is said to belong to BMO(µ) if the inequality (1) |f (x) − fQ | dµ(x) ≤ Cµ(Q) Q
holds for all cubes Q with sides parallel to the coordinate axes; fQ = (µ(Q))−1 Q f dµ denotes the mean value of f over the cube Q. The smallest bound C for which (1) is satisfied is then taken to be the “norm” of f in this space, and it is denoted by f ∗ . One says that BMO(µ) has the John-Nirenberg property when there exist positive constants c1 and c2 so that whenever f ∈ BMO(µ), then for every λ > 0 and every cube Q with sides parallel to the coordinate axes, one has µ {x ∈ Q : |f (x) − fQ | > λ} ≤ c1 e−c2 λ/f ∗ µ(Q). It is well known that if a measure µ is doubling (i.e., there exists a constant C = C(µ) such that µ(2Q) ≤ Cµ(Q) for all cubes Q), then it satisfies the John-Nirenberg inequality. We give examples of nonnegative Radon measures on Rn which do not Received 8 January 1999. Revision received 3 August 1999. 1991 Mathematics Subject Classification. Primary 26B35, 42B25, 46E30. Mateu, Nicolau, and Orobitg’s work partially supported by grant numbers PB98-0872 and PB961183 of the Dirección General de Enseñanza Superior and by grant number 1998 SGR00052 of the Comissió Interdepartamental de Recerca i Innovació Tecnològica. Mattila gratefully acknowledges the hospitality of the Centre de Recerca Matemàtica at the Universitat Autònoma de Barcelona during the preparation of this work. 533
534
MATEU, MATTILA, NICOLAU, AND OROBITG
have the John-Nirenberg property. On the other hand, we show that for a large class of measures, the John-Nirenberg property holds. Theorem 1. Let µ be a nonnegative Radon measure on Rn . Assume that for every hyperplane L, orthogonal to one of the coordinate axes, µ(L) = 0. Suppose that f is in BMO(µ). Then there exist constants c1 and c2 , independent of f , so that for every λ > 0 and every cube Q with sides parallel to the coordinate axes, one has −c2 λ µ(Q). µ {x ∈ Q : |f (x) − fQ | > λ} ≤ c1 exp f ∗ When the measure µ is the Lebesgue measure (or any other doubling measure) on Rn , the John-Nirenberg theorem follows from a stopping time argument using dyadic cubes. The estimate that is needed is |f2Q − fQ | ≤ Cf ∗ , which follows from the fact that µ(2Q)/µ(Q) is bounded from above. When the measure µ is not doubling our approach, following an idea of Wik [W], it is based on the following covering lemma. Covering lemma. Let µ be a positive Radon measure in Rn such that for every hyperplane L, orthogonal to one of the coordinate axes, µ(L) = 0. Let E be a subset of Rn , and let ρ be a real number in (0, 1). Suppose that E is contained in a cube Q0 , with sides parallel to the coordinate axes, and suppose that µ(E) ≤ ρµ(Q0 ). Then there exists a sequence {Qj } of cubes with sides parallel to the coordinate axes and contained in Q0 such that (a) µ(Qj ∩ E) = ρµ(Qj ); (b) the family {Qj } is almost disjoint with constant B(n), that is, every point of Rn belongs to at most B(n) cubes Qj ; (c) E ⊂ j Qj , where E is the set of µ-density points of E. We say that x is a µ-density point of E when limr→0 µ(Q(x, r)∩E)µ(Q(x, r))−1 = 1, where Q(x, r) denotes the cube centered at x and sidelength r. The assumption on the measure µ means that, given a cube Q, with 2µ(Q) < µ(Rn ), there exists ˜ ⊃ Q such that µ(Q) ˜ = 2µ(Q). The proof of our covering lemma uses a a cube Q variant of the well-known Besicovitch covering theorem. At first sight, the assumption on the measure in the statement of Theorem 1 seems quite restrictive, but the next result disproves this feeling. Theorem 2. Let µ be a nonnegative Radon measure on Rn . Assume that for any point p ∈ Rn , µ({p}) = 0. Then there exists an orthonormal system {e1 , . . . , en } so that for every hyperplane L with normal vector ei (i ∈ {1, . . . , n}), µ(L) = 0. As in the case of the Lebesgue measure, the John-Nirenberg theorem gives the
BMO FOR NONDOUBLING MEASURES
535
duality H 1 (µ)−BMO(µ), where H 1 (µ) is a natural atomic Hardy space. Let L∞ c (µ) be the space of bounded functions with compact support. One can then prove the following interpolation result. Theorem 3. Let µ be a nonnegative Radon measure on Rn . Assume that for every hyperplane L, orthogonal to one of the coordinate axes, µ(L) = 0. Let T be 1 a sublinear operator that is bounded from L∞ c (µ) to BMO(µ) and from H (µ) to 1 p L (µ). Then T extends boundedly to every L (µ), 1 < p < ∞. As in the case of a doubling measure, the proof follows easily from a CalderónZygmund decomposition and from the Lp -estimates for the sharp maximal function f # (x) = sup
1 µ(Q)
Q
|f − fQ | dµ,
where the supremum is taken over all cubes centered at x ∈ Rn . However, as Joan Verdera pointed out to us, Theorem 3 is quite unsatisfactory. Roughly speaking, no interesting operator T maps H 1 (µ) to L1 (µ), when the measure µ is not doubling. We present an example to illustrate this phenomenon. Finally, we compare BMO(µ), defined as above with cubes whose sides are parallel to the coordinate axes, with BMO(µ) defined with balls: f ∈ BMOb (µ) if there exists C < ∞ such that for every ball B ⊂ Rn there is a ∈ R such that (2)
B
|f − a| dµ ≤ Cµ(B).
Recall that if µ is a doubling measure, the spaces BMO(µ) and BMOb (µ) coincide. In our setting of nondoubling measures, the situation is quite different. Precisely, we have the following theorems. Theorem 4. There exists an absolutely continuous measure µ in R2 and f ∈ such that f ∈ BMOb (µ) but for any choice of the coordinate axes f ∈ / BMO(µ). L1 (µ)
Theorem 5. There exists an absolutely continuous measure µ in R2 and f ∈ such that f ∈ BMO(µ) for all choices of the coordinate axes but f ∈ / BMOb (µ). L1 (µ)
In any case, the question that arises is: May the space BMOb be a good choice for dealing with functions of bounded mean oscillation? For 1 < p < ∞, we say that p f ∈ BMOb (µ) if |f − a|p dµ ≤ Cµ(B), B
p
with C, B, and a as above. It is clear that BMOb (µ) = BMOb (µ) if the John-Nirenberg
536
MATEU, MATTILA, NICOLAU, AND OROBITG
inequality holds. From this point of view, we realize that BMO(µ) is better than BMOb (µ). Theorem 6. There exists an absolutely continuous measure µ in R2 and f ∈ p such that f ∈ BMOb (µ) but f ∈ / BMOb (µ) for all p > 1. In particular, the John-Nirenberg inequality fails. L1 (µ)
This result may seem surprising. However, two elementary geometric facts, which are important in our analysis, distinguish between dealing with balls or cubes. First, a cube may be covered by a finite number of subcubes, while for balls a countable number of subballs is needed. Second, when intersecting two cubes, the sections have equal diameter, while for balls, the length of the sections may decay exponentially when the balls become nearly disjoint. The first fact provides the necessary covering properties that are used in the proof of Theorem 2. The decay mentioned in the second one gives some extra help to construct functions in BMOb , which do not fulfill the John-Nirenberg inequality. It is worth mentioning that if instead of cubes or balls, one considers regular polygons of N sides, in the definition of BMO, the analogue of Theorem 2 holds. However, the constants c1 , c2 depend on N. The paper is organized as follows. Section 2 contains the proof of Theorems 1 and 2, as well as an example of a Radon measure for which the John-Nirenberg estimate does not hold. Section 3 is devoted to the Calderón-Zygmund decomposition, the Lp estimates for the sharp maximal function, and the proof of Theorem 3. Section 4 contains the proof of Theorems 4, 5, and 6. Finally, we include an appendix with a proof of the John-Nirenberg theorem on spaces of homogeneous type, because it is not easy to find in the literature a proof for the Lebesgue measure which immediately generalizes to doubling measures. 2. John-Nirenberg inequality. The main goal of this section is to prove Theorem 1; that is, the John-Nirenberg property holds for a wide class of measures. We also give an example of a measure µ and a function f , for which f ∈ BMO(µ), but f doesn’t satisfy the John-Nirenberg inequality. In the case of the Lebesgue measure m in Rn , the John-Nirenberg estimate follows from a stopping time argument that uses dyadic cubes. A trivial but essential fact is that for any cube Q, one has m(2Q) ≤ Cm(Q), where C is a constant. Then if f ∈ BMO(m), it follows that |f2Q − fQ | ≤ C, which is the estimate that is needed in the stopping time argument. When the measure µ is supported in the real line and has no atoms, one can prove Theorem 1 along the same lines. The only modification that is needed consists of replacing the usual dyadic grid by a µ-dyadic grid, which is constructed in the following way. Given an interval I , the first generation G1 (I ) consists of the two
537
BMO FOR NONDOUBLING MEASURES
disjoint subintervals I+ , I− of I satisfying µ(I+ ) = µ(I− ) = µ(I )/2. The second generation G2 (I ) is G1 (I+ ) ∪ G1 (I− ). Next generations are defined recursively. One can now use the usual stopping time argument to prove the John-Nirenberg estimate for such measures. To prove Theorem 1 for n > 1 we need the following Besicovitch covering theorem. Besicovitch covering theorem. Let A be a subset of Rn , and let be a family of rectangles with sides parallel to the coordinate axes, such that each point of A is the center of some rectangle of . Assume that A is a bounded set or that the diameters of rectangles of are bounded. Moreover, suppose that the ratio of any two sidelengths of a rectangle of is bounded by 2. Then there is a finite or countable collection of rectangles Rj ∈ such that they cover A, and every point of Rn belongs to at most B(n) rectangles Ri , where B(n) is an integer depending only on n. For this result, see [G]; also the proof given in [M] for balls can be easily modified. Proof of covering lemma. For any x ∈ Q0 and for r > 0 satisfying r ≤ $(Q0 ), we ˜ define Q(x, r) as the unique cube (parallel to the coordinate axes) with sidelength r, containing x, contained in Q0 and with center y closest to x (see Figure 1).
Q0
Q0
˜ Q(x, r) x=y
r
˜ Q(x, r) y x
r
Figure 1
˜ ˜ If x ∈ Q0 ∩ E , we define the function hx (r) = µ(Q(x, r) ∩ E)/µ(Q(x, r)) for 0 < r ≤ $(Q0 ). Clearly, this is a continuous function satisfying hx ($(Q0 )) ≤ ρ by hypothesis and limr→0 hx (r) = 1, since x is a density point. Consequently, there exists a positive number rx such that hx (rx ) = ρ. Hence, for any x ∈ Q0 ∩ E we ˜ define Q(x) = Q(x, rx ). Now, we could not apply the Besicovitch covering theorem, because x may not be the center of Q(x). To circumvent this obstacle, for any cube Q(x) we define the rectangle R(x) in Rn as the unique rectangle in Rn centered on x such that R(x)∩Q0 = Q(x). Denote by this family of rectangles (see Figure 2). It is an easy computation to check that the ratio of any two sidelengths of a rectangle in is bounded by 2. So, by the Besicovitch covering theorem we have a countable
538
MATEU, MATTILA, NICOLAU, AND OROBITG
Q0
Q0
Q(x)
Q(x) x
x R(x)
R(x)
Figure 2
collection of rectangles Rj ∈ such that they cover E ∩ Q0 , and every point of Rn belongs at most to B(n) rectangles Ri . Then, if we take the family of cubes ᏽ = {Q(x) : R(x) = Rj for some j }, it is clear that ᏽ is a countable family of cubes satisfying properties (a), (b), and (c). Now, the covering lemma is one of the main ingredients we use to prove Theorem 1. Proof of Theorem 1. We may assume that f ∗ = 1. Let Q0 be an arbitrary cube, with sides parallel to the coordinate axes. Put, for any integer k ≥ 2, Ek = {x ∈ Q0 : f (x) − fQ0 ≥ k}, Sk = {x ∈ Q0 : |f (x) − fQ0 | ≥ k}. Clearly, µ(Ek ) ≤ µ(Sk ) ≤ µ(Q0 )/k. In particular, µ(Ek ) ≤ µ(Q0 )/2 and by the covering lemma we can cover the set Ek , k ≥ 2, with a sequence {Qk,j } of almost disjoint cubes with constant B(n), parallel to the coordinate axes, such that 1 µ Ek ∩ Qk,j = µ(Qk,j ). 2 Therefore, (3)
1 µ Ekc ∩ Qk,j = µ(Qk,j ), 2
and (4)
µ(Ek ) ∼ =
µ Qk,j ∩ Ek ∼ µ(Qk,j ). = j
j
BMO FOR NONDOUBLING MEASURES
539
On the other hand, for any $ > 0, one has
1 1 = f ∗ ≥ |f (y) − f (x)| dµ(x) dµ(y) 2 Qk,j Qk,j 2 µ(Qk,j )
1 ≥ dµ(y) |f (y) − f (x)| dµ(x) 2 Ek+$ ∩Qk,j Ekc ∩Qk,j µ(Qk,j ) µ Ek+$ ∩ Qk,j µ Ekc ∩ Qk,j µ Ek+$ ∩ Qk,j ≥$ , =$ 2 2µ(Qk,j ) µ(Qk,j ) where the last inequality comes from (3). So, one obtains µ(Ek+$ ∩ Qk,j ) ≤ (2/$)µ (Qk,j ). This inequality and (4) give µ(Ek+$ ) ≤
2 C µ Qk,j ∩ Ek+$ ≤ µ(Qk,j ) ≤ µ(Ek ). $ $ j
j
Similar arguments can be used to obtain the same estimate for the sets {x ∈ Q0 : f (x) − fQ ≤ k}, k < 0. The estimates combine to µ(Sk+$ ) ≤
A µ(Sk ), $
where A is an integer depending only on the dimension. We take $ = 2A and find 1 µ(Sk+2A ) ≤ µ(Sk ), 2 which, for any positive integer p implies: µ S2+2Ap ≤ 2−p µ(S2 ) ≤ 2−p µ(Q0 ), from which the conclusion of the theorem follows. Observe that the constants c1 and c2 , in the statement of Theorem 1, do not depend on µ; they only depend on n. Now, we give a Radon measure for which the John-Nirenberg inequality does not 2 hold. We consider the case n = 1. Let µ = n≥1 (1/2n )δ1/n , where δ1/n is a Dirac n mass in the point 1/n, and let f (1/n) = 2 . To show that f belongs to BMO(µ), it is enough to consider intervals I = [1/N2 , 1/N1 ], where N1 and N2 are positive integers and N2 can also be infinity. Obviously, µ(I ) =
N2 1 ∼ 1 . 2 = N2 2n 2 1 n=N1
540
MATEU, MATTILA, NICOLAU, AND OROBITG
Therefore, 1 µ(I )
I
N2 2 f − 2N1 dµ ∼ = 2N1 n=N1
N2 1 1 n 2n N1 N12 − 2 ≤ N . 2 ≤ 2 2 2 n n 2 1 2 2 +1 n=N +1 1
Consequently, f ∈ BMO(µ). In order to verify that f does not satisfy the JohnNirenberg inequality, one can see that for t = 2N , where N is a large positive integer and I = [0, 1],
1 1 ∼ 1 ∼ µ x ∈ I : |f (x) − fI | > t = µ 0, = . 2 = N2 N 2n 2 n≥N N
On the other hand, 2−tC µ(I ) is of order 2−C·2 . Hence, the John-Nirenberg inequality 2 N holds only if 1/2N ≤ B2−C·2 , for some constant B, but this inequality fails when N is big enough. The same construction can be repeated in the plane to get a continuous example. We take a family of segments Ln , with endpoints (1/n, 0) and (1/n, 1), and we define 2 a measure µ = n≥1 (1/2n )Ᏼ1 |Ln , where Ᏼ1 |Ln is the 1-dimensional Hausdorff measure on Ln . Taking the function f such that f |Ln = 2n , one can check that f ∈ BMO(µ), but the John-Nirenberg property is not satisfied. However, observe that in accordance with Theorem 2 rotating the coordinate axes, the corresponding BMO(µ), where µ is the above measure, has the JohnNirenberg property. Proof of Theorem 2. We now show that if µ is a continuous Radon measure on Rn , then we can choose the coordinate axes in such a way that µ(∂Q) = 0 for all cubes Q with sides parallel to the axes. We first give the very elementary argument in the plane. So let µ be a continuous Radon measure on R2 . Let ᏸ be the set of the lines L through the origin such that µ(L ) > 0 for some line L parallel to L. We claim that ᏸ is at most countable. Otherwise there exist , > 0, R < ∞, and distinct lines L1 , L2 , . . . ∈ ᏸ such that for some lines L i parallel to Li and Ai = L i ∩ B(0, R), we have µ(Ai ) ≥ , for all i. But for i = j , Ai ∩ A j is eitherempty or a singleton, whence µ(Ai ∩ Aj ) = 0. Thus µ(B(0, R)) ≥ µ( i Ai ) = µ(Ai ) = ∞, which is a contradiction. Since ᏸ is at most countable, so is the set ᏸ⊥ of the orthogonal complements L⊥ , L ∈ ᏸ. Choosing L ∈ / ᏸ ∪ ᏸ⊥ , the lines L and L⊥ give the desired coordinate axes. To prove our claim in Rn , it seems to be convenient (although not necessary) to use invariant measures on Grassmannians (see, e.g., [M, Chapter 3] for them). Let G(n, m) be the set of all m-dimensional linear subspaces of Rn , and let γn,m be the unique orthogonally invariant Radon probability measure on it. We also allow m = 0; then G(n, 0) = {0} and γn,0 = δ0 . For V ∈ G(n, m) and 0 ≤ k < m, we let G(V , k)
BMO FOR NONDOUBLING MEASURES
541
be the space of k-dimensional linear subspaces of V , and we let γV ,k be the natural measure on it. Let µ be a continuous Radon measure on Rn . Denote
Gm = V ∈ G(n, m) : µ(V + x) > 0 for some x ∈ Rn . We leave it to the reader to check that Gm is a Borel set. Lemma 1. Let γn,n−1 (Gn−1 ) = 0. Proof. We prove that if 0 < m < n and if γn,m (Gm ) > 0, then γn,m−1 (Gm−1 ) > 0. Thus if γn,n−1 (Gn−1 ) > 0, induction gives γn,0 (G0 ) > 0, which means that µ has atoms and gives a contradiction. So suppose γn,m (Gm ) > 0. We have that γn,m (Gm ) = γV ⊥ ,1 L ∈ G(V ⊥ , 1) : L + V ∈ Gm dγn,m−1 V . This follows from the uniqueness of γn,m since also the right-hand side defines an orthogonally invariant Radon probability measure on G(n, m). Thus the set of those V ∈ G(n, m − 1) for which γV ⊥ ,1 L ∈ G(V ⊥ , 1) : L + V ∈ Gm > 0 (5) has positive γn,m−1 measure. Hence it suffices to show that (5) implies that V ∈ Gm−1 . Let V ∈ G(n, m − 1) satisfy (5). For every L ∈ G(V ⊥ , 1) such that L + V ∈ Gm , there is x(L) ∈ Rn such that µ(L+V +x(L)) > 0. Since there are uncountably many such lines L, we must have for some L = L (cf. the argument for R2 above), µ L + V + x(L) ∩ L + V + x(L ) > 0. But (L + V + x(L)) ∩ (L + V + x(L )) ⊂ V + x for some x ∈ Rn by easy linear algebra. Consequently, V ∈ Gm−1 , and Lemma 1 is proved. Lemma 2. If G ⊂ G(n, n − 1) and if γn,n−1 (G) = 0, then there are coordinate axes in such a way that V ∈ / G for every coordinate hyperplane V . Proof. We use induction on n. If n = 2, then γ2,1 (G⊥ ) = 0 and L, L⊥ will do for L ∈ / G ∪ G⊥ . Suppose the lemma holds in Rn−1 . As above, by the uniqueness of γn,n−1 , we have γn,1 L ∈ G(n, 1) : L⊥ ∈ G = γn,n−1 (G) = 0 and
γL⊥ ,n−2 V ∈ G(L⊥ , n − 2) : L + V ∈ G dγn,1 L = γn,n−1 (G) = 0.
542
MATEU, MATTILA, NICOLAU, AND OROBITG
Thus, we can choose L ∈ G(n, 1) such that L⊥ ∈ / G and γL⊥ ,n−2 V ∈ G(L⊥ , n − 2) : L + V ∈ G = 0. By our induction hypothesis, we can choose coordinate axes L1 , . . . , Ln−1 for L⊥ such that for every coordinate (n − 2)-plane V , we have L + V ∈ / G. Then L1 , . . . , Ln−1 , L are the required axes in Rn . Combining Lemmas 1 and 2 we see that given a continuous µ there are axes such that µ(V ) = 0 for all hyperplanes parallel to the coordinate planes, as required. The last part of this section is devoted to the introduction of the predual of BMO(µ) : H 1 (µ). In the case of the Lebesgue measure m, BMO(m) can be viewed very naturally as the dual of an atomic space H 1,∞ (m) (see [J, 3.II]). In this section we claim that for the measures µ satisfying the hypothesis of Theorem 1, one can consider the atomic space H 1,∞ (µ), and its dual space is BMO(µ). A function a is called a p-atom, 1 < p ≤ ∞, if there exists a cube Q such that (i) a ∈ Lp (µ), aLp (µ) ≤ µ(Q)(1/p)−1 ; (ii) spt a ⊂ Q; (iii) Q a dµ = 0. In the case µ(Rn ) < ∞, the constant 1/(µ(Rn )) is also considered an atom. Thus, we define a Banach space H 1,p (µ) in the following way: f ∈ H 1,p (µ) if and only if there exist λi ∈ R and p-atoms ai such that |λi | < ∞ and f = i λi ai . For f ∈ H 1,p (µ), we define its norm f H 1,p (µ) to be inf |λj |, where the infimum is taken over all sequences (λi )i∈I occurring in such an atomic decomposition of f . When a is a p-atom one can check that aL1 (µ) ≤ 1, and so H 1,p (µ) is continuously embedded in L1 (µ). It is also an easy computation to verify that H 1,p2 (µ) is continuously embedded in H 1,p1 (µ) if 1 < p1 ≤ p2 < ∞. Now, we can state the following duality result. Theorem 7. Let µ be a nonnegative Radon measure in Rn . Assume that for every hyperplane L, orthogonal to one of the coordinate axes, µ(L) = 0. Then (H 1,p (µ))∗ = BMO(µ) if 1 < p ≤ ∞. In these notes we don’t prove this result because the main argument follows using the same ideas as in the case of the Lebesgue measure. For a good exposition of this result in the case of the Lebesgue measure, see [J, 3.II]. By the above theorem we have a family of Banach spaces H 1,p (µ), with the same dual. Hence, since they are continuously embedded, one in the other, we can conclude that they coincide, and so we can define H 1 (µ) = H 1,p (µ) for any p, 1 < p ≤ ∞. To finish this section we want to remark that the main ingredient in the proof of the above result is the John-Nirenberg inequality, and so Theorem 7 can be stated in a more general setting. That is, if we have a positive Radon measure µ in Rn (for this
BMO FOR NONDOUBLING MEASURES
543
measure we define BMO(µ), and BMO(µ) satisfies the John-Nirenberg property), then we can define H 1 (µ) and (H 1 (µ))∗ = BMO(µ). 3. Interpolation. This section is devoted to prove a Calderón-Zygmund decomposition for functions in L1 (µ) and a result on interpolation of operators. This follows from estimating the Lp (µ)-norm of a function by the Lp (µ)-norm of its sharp maximal function. Lemma 3 (Calderón-Zygmund decomposition). Let µ be a nonnegative Radon measure in Rn such that µ(L) = 0 for every hyperplane L orthogonal to one of the coordinate axes. Suppose we are given a function f ∈ L1 (µ) and a positive number λ, with λ > f 1 /(µ(Rn )). There exists a decomposition of f , f = g + b, and a sequence of cubes {Qj }, so that (i) |g(x)| ≤ Cλ for µ-a.e. x; (ii) b = bj , where each bj is supported in Qj , bj dµ = 0, and |bj | dµ ≤ 4λµ(Qj ); (iii) {Qj } is almost disjoint with constant B(n); (iv) j µ(Qj ) ≤ C/λ Qj |f | dµ. j
Proof. Let Mf be the centered Hardy-Littlewood maximal function Mf (x) = sup r>0
1 µ Q(x, r)
Q(x,r)
|f (y)| dµ(y),
where Q(x, r) is the cube with center x and sidelength r. For each x ∈ Eλ = {x : Mf (x) > λ} we consider a cube Q(x, rx ) such that Q(x,rx ) |f | dµ > λµ(Q(x, rx )). Then we proceed as in the proof of covering lemma. We define 1 D(r) := µ Q(x, r)
Q(x,r)
|f (y)| dµ(y).
We have D(rx ) > λ and limr→∞ D(r) = f 1 /(µ(Rn )) < λ. Therefore, because D(r) is a continuous function on (rx , ∞), we get a cube Qx centered at x such that (6)
λ=
1 µ(Qx )
Qx
|f | dµ.
Applying the Besicovitch covering theorem we have an almost disjoint sequence {Qj } of cubes such that Eλ ⊂ j Qj and such that (6) holds for each Qj . Consider functions χQj , ϕj = χQj
1 ≤ ϕj ≤ 1 B(n)
on Qj ,
544 and
MATEU, MATTILA, NICOLAU, AND OROBITG
ϕj ≡ 1 on
j
Qj , and define bj = f ϕj − (f ϕj )Qj χQj .
Clearly,
Qj
|bj | dµ ≤
Qj
|f | dµ +
Qj
|f ϕj | dµ ≤ 2
Qj
|f | dµ = 2λµ(Qj )
and bj dµ = 0. Finally, take g = f − j bj and b = j bj . Now, if x ∈ / Qj , then g(x) = f(x) and the differentiation theorem gives |g(x)| ≤ λ for µ-a.e. x ∈ / Qj . When x ∈ Qj , g(x) = (f ϕj )Qj χQj ≤ λB(n) j
because {Qj } is almost disjoint and (f ϕj )Qj ≤ λ. Theorem 8. Let µ be a nonnegative Radon measure in Rn such that µ(L) = 0 for every hyperplane L orthogonal to one of the coordinate axes. (a) Assume µ(Rn ) = ∞. Then, one has f p ≤ C(p, n)f # p , 1 < p < ∞, for any f ∈ L1 (µ). (b) Assume µ(Rn ) < ∞. Then, f − ≤ C(p, n)f # , f dµ p n R
p
1 < p < ∞,
for any f ∈ L1 (µ). Proof. We only prove (a) because (b) follows in a similar way. As in the case of the Lebesgue measure, one only has to prove the following good λ-inequality (7) µ x ∈ Rn : f (x) > aλ, f # (x) < γ λ ≤ m(a, γ , n)µ x ∈ Rn : f (x) > λ for λ > 0 sufficiently large, where a > 1, γ > 0 are positive constants and where m(a, γ , n) < 1. Actually, we get m(a, γ , n) = C(n)(a − 1 − 2γ )−1 γ .
BMO FOR NONDOUBLING MEASURES
545
Let Eλ = {x ∈ Rn : f (x) > λ}. For µ-a.e. x ∈ Eλ one has µ Q(x, r) ∩ Eλ = 1. lim r→0 µ Q(x, r) On the other hand, the Chebyshev inequality gives µ Q(x, r) ∩ Eλ f 1 ≤ = 0. lim r→∞ λµ(Rn ) µ Q(x, r) Let λ > 0. Since µ puts no mass to any hyperplane orthogonal to one of the coordinate axes, one can choose a cube Q(x) centered at x such that 1 µ Q(x) ∩ Eλ = µ Q(x) . 4
(8)
We first observe that if Q(x) is centered at a point x ∈ Eλ for which f # (x) < γ λ, one has λ < fQ(x) < λ(1 + 2γ ). 4
(9) Actually, fQ >
λ 1 λµ Q(x) ∩ Eλ = , 4 µ Q(x)
and since Q(x) is centered at a point x ∈ Rn , where f # (x) < γ λ, we have 1 µ x ∈ Q(x) : |f (x) − fQ(x) | ≥ 2γ λ ≤ µ Q(x) . 2 Since µ(Q(x) ∩ Eλc ) = (3/4)µ(Q(x)), there exists y ∈ Q(x) ∩ Eλc such that |f (y) − fQ(x) | < 2γ λ. Since f (y) ≤ λ, we deduce fQ(x) < λ(1 + 2γ ). We apply the Besicovitch covering theorem to the family of cubes {Q(x)}, x ∈ ˜ Eλ = {x ∈ Eλ : f # (x) < γ λ}. Then we obtain an almost disjoint family of cubes {Qj } with the following properties 1 µ Qj ∩ Eλ = µ(Qj ), 4
E˜ λ ⊂
Qj .
j
So the estimate (10)
µ Qj ∩ Eaλ ≤ (a − 1 − 2γ )−1 γ µ(Qj )
546
MATEU, MATTILA, NICOLAU, AND OROBITG
finishes the proof. Using again that Qj is centered at a point x ∈ Rn , where f # (x) < γ λ, we have f − fQ dµ < γ λµ(Qj ). j Qj
On Eaλ one has f > aλ. Thus, (9) gives f − fQ dµ ≥ µ Qj ∩ Eaλ λ(a − 1 − 2γ ). j Qj
Then
µ Qj ∩ Eaλ λ(a − 1 − 2γ ) < γ λµ(Qj ),
which gives (10). As in the case of the Lebesgue measure, the Lp -boundedness of the sharp function gives a result on interpolation of operators. Proof of Theorem 3. We first assume µ(Rn ) = ∞. The proof follows closely the p arguments in [J, p. 43]. Since functions in L∞ c (µ) with mean zero are dense in L (µ), 1 < p < ∞, one only has to prove Tf p ≤ Cf p for such functions. In fact, we show (Tf )# ≤ Cf p , (11) p
1 < p < ∞,
and apply the previous theorem. Observe that the corresponding hypothesis holds 1 1 because f ∈ L∞ c (µ) with mean zero implies f ∈ H (µ); hence Tf ∈ L (µ). By the Marcinkiewicz interpolation theorem (see [S]), the strong inequality (11) follows from the weak estimate (12)
Cf pp µ x ∈ Rn : (Tf )# (x) > λ ≤ , λp
1 < p < ∞,
where λ > 0. Let {Qj } be the collection of almost disjoint cubes associated with the CalderónZygmund decomposition of |f |p at the value λp . So, we write f ϕj − (f ϕj )Qj χQj + g, f = b+g = j
such that g∞ ≤ C(n)λ.
547
BMO FOR NONDOUBLING MEASURES
Set
bj = f ϕj − (f ϕj )Qj χQj ;
then sup(bj ) ⊂ Qj , bj has mean zero and |bj |p ≤ C(n)λp µ(Qj ). Qj
Remember
p
µ(Qj ) ≤
So, one has bH 1 (µ) ≤ C(n)λ
f p . λp p
µ(Qj ) ≤ C(n)λ1−p f p .
# Since T is bounded from L∞ c (µ) to BMO(µ), the function (T g) is bounded by a multiple of λ. Hence, if c0 is a sufficiently large constant, we have
x : (Tf )# (x) > 2c0 λ ⊆ x : (T b)# (x) > c0 λ .
Now,
µ x : (T b)# (x) > c0 λ ≤ µ
1 x : M(T b)(x) > c0 λ 2
≤C
T b1 , λ
where M is the Hardy-Littlewood maximal function. The boundedness of the operator T from H 1 (µ) to L1 (µ) gives T b1 ≤ CbH 1 (µ) , which gives (12) and finishes the proof. When µ(Rn ) < ∞, the proof follows the same lines using that constant functions are in H 1 (µ). Given a function f on R denote by Tµ f , the Hilbert transform of f with respect to µ, f (y) dµ(y). Tµ f (x) = p.v. x −y We give an example, due to Verdera [V], of a measure µ on R for which we know that the operator Tµ is bounded on Lp (µ), 1 < p < ∞, but it is not bounded from H 1 (µ) to L1 (µ). Before proceeding to define the measure µ and the function h ∈ H 1 (µ), we make some computations. √ √ Consider , > 0 sufficiently small, and let I = [,, 2,] and J = [ ,, , + ,]. Write f = (1/,)[χJ − χI ], and let ν be the Lebesgue measure restricted on [−1, 0] ∪ I ∪ J . A simple calculation gives, when x ∈ [−1, 0], Tν f (x) =
√ √ 1 log(2, − x) − log(, − x) − log , + , − x + log , − x ≥ 0, ,
548
MATEU, MATTILA, NICOLAU, AND OROBITG
and then we have
|Tν f | dν ≥
0 −1
|Tν f (x)| dx ≥ C| log ,|.
Observe that function f belongs to H 1 (ν) with norm bounded by 2, but with respect to the Lebesgue measure f is an atom with norm of order , −1/2 . Now, one repeats this construction at different scales. √ Let (,k ) be a sequence of positive numbers tending to zero and verifying ,k+1 + −2 ,k+1 < ,k (and so k≥1 k | log ,k | = ∞). Define intervals √ √ and Jk = ,k , ,k + ,k . Ik = [,k , 2,k ] Let µ be the Lebesgue measure restricted on [−1, 0] ∞ k=2 (Ik ∪ Jk ). Clearly, µ does not satisfy any doubling condition, and easily, Tµ is bounded in L2 (µ) because dµ = g dm, where g 2 = g. Observe that the first condition on the sequence (,k ) means that Jk+1 is at the left-hand side of Ik . So, the function ak = ,k−1 (χIk − χJk ) is a µ-atom. Define ∞ 1 h= ak . k2 k=2 Since k −2 < +∞ and ak are atoms, one has h ∈ H 1 (µ). On the other hand,
∞ ∞ 1 0 1 |log ,k | = +∞. |Tµ h(x)| dx = T a (x) dx ≥ C |Tµ h| dµ ≥ µ k 2 k −1 k2 −1 0
k=2
k=2
/ L1 (µ). That is, Tµ h ∈ 4. BMO with balls and cubes. Theorems 4, 5, and 6 are proved with a similar construction. To simplify slightly, we do not construct µ as an absolutely continuous measure but as a sum of weighted length measures on some circles. Since the oscillation of f on two neighboring circles is at most 1, it is clear from the proof that by replacing µ with a sum of weighted Lebesgue measures on very narrow annuli, we get the same conclusions. We choose nonincreasing sequences (εi ) and (λi ), i = 1, 2, . . . , of positive numbers such that for all i, 1 ε1 = , 2
(13) (14) (15)
1 λ1 = , 2
εi+1 ≤
0 < λi ≤ 2−i ,
εi , 10 λi+1 ≤ λi ,
√ √ εi+1 2−i−1 ≤ εi λi .
BMO FOR NONDOUBLING MEASURES
549
Set
Si = x ∈ R2 : |x| = 1 − εi ,
Ti = x ∈ R2 : |x| = 1 + εi , λi Ᏼ 1 | S i , µS = i
µT =
2−i Ᏼ1 | Ti ,
i
µ = µS + µT , iχSi ∪Ti . f= i
Clearly, f belongs to L1 (µ) with f dµ ≤ 8π
∞
i2−i = c0 .
i=1
We now fix a disc B with center x and radius r. If B ∩ Si = ∅ for some i, we let i0 be the smallest such i. Similarly, j0 is the smallest j such that B ∩ Tj = ∅ if such a j exists. Lemma 4. If B ∩ Tj = ∅ for some j , then |f − j0 | dµT ≤ c0 µT (B). B
Proof. As B ∩ Tj0 = ∅, we see by simple geometry that Ᏼ1 (B ∩ Tj ) ≤ 5Ᏼ1 (B ∩ Tj0 +1 )
for j > j0
(since Tj and Tj0 +1 are much closer to each other than to Tj0 ). Hence, (j − j0 )2−j Ᏼ1 (B ∩ Tj ) |f − j0 | dµT = B
j >j0
≤ 5Ᏼ1 (B ∩ Tj0 +1 )
(j − j0 )2−j
j >j0
≤ c0 2
−j0 −1
Ᏼ (B ∩ Tj0 +1 ) = c0 µ(B ∩ Tj0 +1 ) ≤ c0 µT (B). 1
Lemma 5. We have f ∈ BMOb (µ). More precisely, (2) holds with an absolute constant c. Proof. Let B be as above. We consider four cases.
550
MATEU, MATTILA, NICOLAU, AND OROBITG
Case 1: |x| ≤ 1/2. If B ∩ S2 = ∅, f is constant on B ∩ spt µ. Otherwise, r > 1/3, and using trivial geometry and the assumptions ε1 = λ1 = 1/2 and ε2 ≤ ε1 /10, we have Ᏼ1 (B ∩ S1 ) ≥ 2/5 and 1 µ(B) ≥ λ1 Ᏼ1 (B ∩ S1 ) > . 5 Thus, 1 µ(B)
B
f dµ ≤ 5
f dµ ≤ 5c0 ,
and (2) holds with a = 0, c = 5c0 . From now on we assume that |x| > 1/2 and, by Lemma 4, that B ∩ Si = ∅ for some i. Case 2: either B ⊂ {y : |y| ≤ 1} or j0 > i0 +2. We may assume that B ∩Si0 +2 = ∅, since otherwise f = i0 or i0 + 1 on B ∩ spt µ. As B meets Si0 and Si0 +2 , and it does not meet Ti0 +2 , one sees by simple geometry (see Figure 3) that √ µ(B) ≥ λi0 +1 Ᏼ1 (B ∩ Si0 +1 ) ≥ λi0 +1 rεi0 +1 , √ Ᏼ1 (B ∩ Si ) ≤ 2 rεi0 +2 for i > i0 + 1, √ Ᏼ1 (B ∩ Ti ) ≤ 2 rεi0 +2 for i > i0 + 1.
B
Ti0 +2
≈ √r ε i0 +1
Ti0 +3 Si0 +2
Si0 +1
Si0
Figure 3. Case 2
BMO FOR NONDOUBLING MEASURES
551
Thus, also using (15), |f − i0 | dµ ≤ (i − i0 )λi Ᏼ1 (B ∩ Si ) + (i − i0 )2−i Ᏼ1 (B ∩ Ti ) B
i>i0
√
≤ 3µ(B) + 4 rεi0 +2
i>i0 +2
(i − i0 )2−i
i>i0 +2
√
≤ 3µ(B) + c0 rεi0 +2 2−i0 −2 √ ≤ 3µ(B) + c0 rεi0 +1 λi0 +1 ≤ (3 + c0 )µ(B). Hence, (2) holds in this case with c = 3 + c0 . In the last two cases, we also assume that B ∩ Tj = ∅ for some j . Case 3: j0 = i0 , i0 + 1, or i0 + 2. Since εj0 +1 < εj0 /10 and B meets both Sj0 and Tj0 , one easily sees that for i > j0 , Ᏼ1 (B ∩ Si ) ≤ 2Ᏼ1 (B ∩ Tj0 +1 ).
Thus, B
|f − j0 | dµS ≤ 2µ(B ∩ Sj0 −2 ) + µ(B ∩ Sj0 −1 ) + ≤ 3µ(B) + 2Ᏼ (B ∩ Tj0 +1 ) 1
(i − j0 )λi Ᏼ1 (B ∩ Si )
i>j0
(i − j0 )2−i
i>j0
≤ 3µ(B) + c0 2
−j0 −1
Ᏼ (B ∩ Tj0 +1 ) 1
≤ (3 + c0 )µ(B). Combining this with Lemma 4, we have (2) with c = 3 + c0 in Case 3. Our final case is the following. Case 4: j0 < i0 . Then for i > j0 , Ᏼ1 (B ∩ Si ) ≤ Ᏼ1 (B ∩ Ti )
and
Ᏼ1 (B ∩ Ti ) ≤ 5Ᏼ1 (B ∩ Tj0 +1 ).
Thus, B
|f − j0 | dµ ≤
(i − j0 )(λi + 2−i )Ᏼ1 (B ∩ Ti )
i>j0
≤2
(i − j0 )2−i Ᏼ1 (B ∩ Ti ) ≤ 2c0 µ(B).
i>j0
Hence, the proof of Lemma 5 is complete.
552
MATEU, MATTILA, NICOLAU, AND OROBITG
To prove Theorem 4 we choose sequences (ik ) and (jk ) of positive integers and the sequences (εi ) and (λi ) in such a way that ik + k < jk < ik+1 , εi2k < εik +k ,
(16) (17) 1 λ1 = , 2
λi = λik
for i = ik , . . . , ik + k,
1 λi+1 ≤ λi 2
for i = ik + k, . . . , jk ,
for all i and k. We leave to the reader as an exercise to check that this choice, keeping also (13)–(15), is possible. Moreover, once the other numbers at the kth step are chosen, we can take jk as large as we wish. The choice of jk is determined in the proof of the following lemma. Lemma 6. Under the conditions (13)–(17), f ∈ / BMO(µ) for any choice of the coordinate axes. Proof. Let Qk be the square with center on the x1 -axes, sidelength 2εik , and the right-hand vertices on S 1 (see Figure 4).
Qk
Sik Sik −1
Sik +k
S1
Figure 4. Proof of Lemma 6
By (16) for i = ik , . . . , ik + k, Qk ∩ Si does not meet the vertical sides of Qk , whence it is an arc with εik ≤ Ᏼ1 (Qk ∩ Si ) ≤ 3εik . Thus, by (17),
BMO FOR NONDOUBLING MEASURES
µ(Qk ∩ Si ) = λik Ᏼ1 (Qk ∩ Si ) ≥ λik εik
553
for i = ik , . . . , ik + k.
We choose jk such that
Ᏼ1 Qk ∩ Si+1 ≤
1 1 Ᏼ (Qk ∩ Si ) for i ≥ jk . 2
By (13) and elementary geometry, this is possible. Then by (17), 1 µ Qk ∩ Si+1 ≤ µ(Qk ∩ Si ) for i ≥ ik + k. 2 Thus, µ(Qk ) =
i k +k
µ(Qk ∩ Si ) +
≤2
µ(Qk ∩ Si )
i>ik +k
i=ik i k +k
µ(Qk ∩ Si ) = 2λik
i=ik
i k +k
Ᏼ1 (Qk ∩ Si )
i=ik
≤ 12λik kεik . For any number a, there are at least k/3 values i ∈ {ik , . . . , ik + k} such that |f (x) − a| ≥ k/3 for x ∈ Si . Thus 1 µ(Qk )
Qk
|f − a| dµ ≥
1 kk λik εik = k. 12λik kεik 3 3 108 1
Hence, f ∈ / BMO(µ) and Lemma 6 is proved; consequently Theorem 4 is also proved. Proof of Theorem 6. The proof follows much the same lines as the one given for Theorem 4, but in the present case both the measure and the function are not constant on circles. Let m ≥ 9 be an integer. Let (18)
λ1 = 4−m ,
λ2 = · · · = λm = m−2 4−m .
Let λ > 0, and choose ε1 > ε2 > · · · > εm such that (13) and (15) are satisfied. We choose λ much smaller than λ2 , and then we choose the sequence (εi ), very quickly decreasing. Set Si and Ti as before. Divide the unit circle S 1 into disjoint consecutive arcs I1 , . . . , I8m of length 2π8−m . For each j divide Ij into disjoint consecutive arcs Ij,0 , . . . , Ij,2m of length Ᏼ1 (Ij,i ) = 2π8−m /(2m + 1) = $m . Set, with tA = {tx : x ∈ A},
554
MATEU, MATTILA, NICOLAU, AND OROBITG k Ij,i = (1 − εk )Ij,i ⊂ Sk ,
Sk,j =
2m i=0
k Ij,i , m
µS =
8 m k=1 j =1
Tk,j =
k Jj,i = (1 + εk )Jj,i ⊂ Tk , 2m i=0
k Jj,i ,
k λk Ᏼ1 | Ij,m +λ
i=m
k , Ᏼ1 | Ij,i
8m m m 2m k k 2−i−k Ᏼ1 | Jj,i + 2i−2m−k Ᏼ1 | Jj,i , µT = k=1 j =1
i=0
i=m+1
and µ = µ S + µT , 8m m m f= (i − m + k)χI k k=1 j =1
k j,i ∪Jj,i
i=m−k
+
m+k
(m + k − i)χI k
i=m+1
k j,i ∪Jj,i
.
Therefore, easy computations give µ(Tk,j ) ! 2−k $m µS !
and
µT ! 8m $m ! m−1 ! µ(T1 ),
4−m 4−m 1 + λ4m m2 ! ! µ(S1 ), whenever λ < 4−m m−2 , m m f dµ ! $m 2−m and f dµT ! 2−m , Tk,j
4−m 4−m 1 + λ4m m3 ! , whenever λ < 4−m m−3 . m m Then, clearly f ∈ L1 (µ) with f dµ ≤ c0 µ, where c0 is as before. Note that f = 0 on the extreme arcs of Tk,j and that it oscillates linearly to k in the middle. Since the measure decays exponentially in the middle, f is again nicely in BMOb (µT ), as the next lemma shows. Let B be a disc with center x. f dµS !
Lemma 7. There is a ∈ R such that |f − a| dµT ≤ CµT (B). B
Moreover, if for some k, B ∩ Tk,j = ∅ for at least two indices j , then we can take a = 0. Proof. Suppose that B ∩ Tk,j = ∅, and suppose that the indices i for which B ∩ = ∅ form a sequence i1 , i1 + 1, . . . , i2 such that i1 = 0 or i1 = 1 or i2 = 2m or i2 = 2m − 1 or form two sequences i1 , i1 + 1, . . . , i2 and i3 , i3 + 1, . . . , i4 such
k Jj,i
BMO FOR NONDOUBLING MEASURES
555
that i1 = 0 or i1 = 1 and i4 = 2m or i4 = 2m − 1. We say then that B intersects Tk,j noncentrally. Otherwise, we say that B intersects Tk,j centrally. We also say that B intersects Tk centrally or noncentrally if it intersects some Tk,j centrally or, respectively, noncentrally. We claim that if B intersects Tk,j noncentrally, then f dµ ≤ c0 µ B ∩ Tk,j . (19) B∩Tk,j
Suppose, for example, that there is only one sequence i1 , . . . , i2 as above with i1 = 0 or i1 = 1. The other cases are similar. If i2 = 0, 1, or 2, then f = 0, 1, or 2 on B ∩Tk,j , and (19) is clear. k ⊂ B, whence Otherwise, Jj,2 k ≥ 2−2−k $m µ B ∩ Tk,j ≥ 2−2−k Ᏼ1 Jj,2
and
B∩Tk,j
f dµ ≤
Tk,j
f dµ ! $m 2−m .
Hence, (19) follows. Assume now that for some k, B ∩Tk,j = ∅ for at least two values of j . We claim that (20) f dµT ≤ 16c0 µT (B). B
Let k1 , k1 + 1, . . . , k2 be those k for which B ∩ Tk = ∅. We examine three cases separately. First, suppose B intersects Tk1 centrally. Then B ∩ Tk1 = B ∩ Tk1 ,j for some j and (21) f dµ ≤ f dµ ! $m 2−m . B∩Tk1
Tk1 ,j
We see by simple geometry that if the εi ’s decrease sufficiently quickly, our assumption (that for some k, B ∩ Tk,j = ∅ for at least two j ) implies that µ B ∩ Tk1 +1 ≥ 2−2−k1 $m . (22) Moreover, k2 = m and B intersects the remaining Tk noncentrally. Combining (19), (21), and (22), we get (20) for this first case. On the other hand, if B intersects both Tk1 and Tk2 noncentrally, then B intersects every Tk noncentrally and again (20) follows from (19). Finally, when B intersects Tk1 noncentrally and Tk2 centrally, our assumption implies µ(B ∩ Tk1 ) ≥ 2−k1 $m . Then B intersects Tk centrally for k = k3 , . . . , k2 , and as in (21), k2 k3
B∩Tk
f dµ ≤ (m − k1 )2−m $m ≤ 2−k1 $m ≤ µT (B).
556
MATEU, MATTILA, NICOLAU, AND OROBITG
Combining this with (19), we also obtain (20). To finish the proof of Lemma 7, suppose that for every k there is at most one j such that B ∩ Tk,j = ∅. Then the values of k for which B ∩ Tk = ∅ form a sequence k1 , k1 +1, . . . , k2 , and the index j for which B ∩Tk,j = ∅ is the same for all k1 ≤ k ≤ k1 +1 = ∅ (if there are any) form a sequence k2 . Moreover, the indices i for which B ∩Jj,i k = ∅ for i1 , i1 + 1, . . . , i2 . If the sequence (εi ) decreases sufficiently quickly, B ∩ Jj,i k > k1 , i1 < i1 − 1, and i > i2 + 1. Thus µT (B) ≤ 4µ(B ∩ (Tk1 ,j ∪ Tk1 +1,j )). Then, by similar easy estimates as before, 2 |f − a| dµT ≤ C |f − a| dµ ≤ C µT (B), B
B∩ Tk1 ,j ∪Tk1 +1,j
where a = min{f (x) : x ∈ B ∩ (Tk1 ,j ∪ Tk1 +1,j )}. This completes the proof of Lemma 7. We want to show again that (2) holds for some a with an absolute constant c; that is, f ∈ BMOb (µ). Suppose first that |x| ≤ 1/2. We may assume that B ∩ S2 = ∅. Then B contains at 1 and so least 8m−1 arcs Ij,m µ(B) ≥ 8m−1 λ1 $m /2 = 2m $m /16 ! 4−m m−1 .
We have
B
f dµS ≤
f dµS ! 4−m m−1 µ(B)
provided we choose λ ≤ $m /(m2 2π). If B ∩ Tk,j = ∅ for at most one j for every k, then we get B
f dµT ≤
m k=1 Tk,j
f dµ ! m2−m $m ≤ 16m4−m µ(B) ≤ µ(B)
since m ≥ 9. If B ∩ Tk,j = ∅ for at least two indices j , for some k, we have by Lemma 7, f dµT ≤ Cµ(B). B
Combining these inequalities we obtain f dµ ≤ Cµ(B) B
in case |x| ≤ 1/2. Assume then that |x| > 1/2. As previously, we study different cases. By Lemma 7 we may assume that B ∩Sk = ∅ for some k. The case where this happens only for k = 1
BMO FOR NONDOUBLING MEASURES
557
is trivial, so suppose that B ∩Sk for some k > 1. If B ∩S2 = B ∩T4 = ∅, the diameter of B is at most 2ε2 . Choosing 2ε2 < $m , there are two pairs (i1 , j1 ) and (i2 , j2 ) such k ∪ J k ) = ∅ unless (i, j ) = (i , j ) or (i, j ) = (i , j ). Then that for every k, B ∩ (Ij,i 1 1 2 2 j,i we can use almost identical arguments as in the proof of Theorem 4, because in this scale both the measure and the function are almost constant on circles. Suppose then that B ∩ T4 = ∅. Then, choosing ε4 sufficiently small for all k ≥ 5, µ B ∩ (Sk ∪ Tk ) ≤ 26−k µ(B ∩ T5 ), whence, as f ≤ k on Sk ∪ Tk , B
f dµ ≤ 4µ(B) +
m
k26−k µ(B ∩ T5 ) ≤ (4 + 8)µ(B) = 12µ(B).
k=5
Finally, we are left with the case where B ∩ S2 = ∅ and B ∩ T4 = ∅. If B intersects only S2 and S3 , it is trivial. Therefore, assume B ∩ S4 = ∅. Let r be the radius of B. We then have √ µ(B ∩ S3 ) ≥ λᏴ1 (B ∩ S3 ) ≥ λ rε3 and for k ≥ 4, µ B ∩ (Sk ∪ Tk ) ≤ 2−k Ᏼ1 B ∩ (Sk ∪ Tk ) ≤ 22−k Ᏼ1 (B ∩ S4 ) ≤ 23−k r2ε4 . Choosing ε4 ≤ λ2 ε3 , we conclude f dµ ≤ 3µ(B) + ≤ 3µ(B) +
f dµ
B\S2 ∪S3 m 4−k √
k2
rε4
k=4
√ ≤ 3µ(B) + 6 rε4 √ ≤ 3µ(B) + 6λ rε3 ≤ 9µ(B). This completes the proof that the function f belongs to BMOb (µ) with the norm independent of m. We point out that if B is a disc not contained in {x : |x| < 3}, then (23) f dµ ≤ Cµ(B). B
Clearly, when µ(B ∩ S) > 0, we have µ(B) ≥ µT (B) ≥ µ/100, and thus (23) holds. If µ(B ∩ S) = 0, then either B ∩ T2 = ∅ or B ∩ T1,j = ∅ for at least two indices j . In both cases we get (23) from Lemma 7. To finish the proof of Theorem 6, we choose discs B1 , B2 , . . . such that the discs 3Bk are disjoint. Let {Mk } be an increasing sequence of integer numbers (for instance,
558
MATEU, MATTILA, NICOLAU, AND OROBITG
Mk = 9+k). Taking m = Mk , we use the above construction in each Bk appropriately scaled. Then we get in 3Bk a measure µk and fk ∈ BMO(µk ) with uniformly bounded BMOb -norms. Set µ= µk , f= fk . k
k
Let B be any disc. If B ⊂ 3Bk for some k, then µ|B = µk|B , and (2) holds. Otherwise, B is not contained in 3Bk for any k. Then by (23), f dµ = fk dµk ≤ C µk (B) = Cµ(B). B
k
B
k
So f ∈ BMOb (µ). Denote by rk the radius of Bk , and note that µk (Bk ) ! rk 4−Mk Mk−1 . Then for any a ∈ R and all k, we have
Mk 4−Mk Mk 1 4−Mk Mk µ x ∈ Bk : |f (x) − a| > ≥ · · $ 8 ≥ r ! µ(Bk ). M k k 2 2 3 3 Mk Mk 3Mk p
This implies trivially (by the Chebyshev inequality) that f ∈ / BMOb (µ) for all p > 1. Thus, finally, the proof of Theorem 6 is complete. Proof of Theorem 5. Let m > 3 be an integer; we have ε1 , . . . , εm such that ε1 < 1/2 and 0 < εi+1 < εi /10 for 1 ≤ i < m. Let Si = {x : |x| = 1 − εi } and Ti = {x : |x| = 1 + εi } as before. We choose the sequence (εi ) so quickly decreasing that the following hold: Ᏼ1 Q ∩ S(r) ≤ 2−m Ᏼ1 Q ∩ Si+1 (24) for 1−εi+2 ≤ r ≤ 1+εi+2 , 1 ≤ i ≤ m−2, and for any square Q such that Q∩Si = ∅ and Q ∩ Ti+2 = ∅; Ᏼ1 Q ∩ S(r) ≤ 2Ᏼ1 Q ∩ Ti+1 (25) for 1−εi+1 ≤ r ≤ 1, 1 ≤ i ≤ m−1, and for any square Q such that Q∩Ti = ∅. Here S(r) denotes {x : |x| = r}. We define µ and f as in the proof of Theorem 4 with λ1 = · · · = λm = λ = 2−m . Then f = i on Si ∪ Ti . First, we observe that the BMOb (µ)-norm of f is at least m/9, since for any a ∈ R and D = {x : |x| ≤ 1}, 3 |f − a| dµ ≥ µ x ∈ D : |f (x) − a| > m/3 ≥ µ(D)/3. m D
BMO FOR NONDOUBLING MEASURES
559
Second, we claim that (1) holds for any square Q with an absolute constant C. For this we can follow the argument in the proof of Theorem 4. Lemma 4 and its proof hold with B replaced by Q as they stand; i0 and j0 are defined in the same way with Q in place of B. We need not worry about where the center of Q lies, so our first case is the analogue of Case 2: Q ⊂ D or j0 > i0 + 2. In this case, (24) yields Ᏼ1 Q ∩ (Si ∪ Ti ) ≤ 2λᏴ1 (B ∩ Si0 +1 ) ≤ 2µ(B) for i > i0 + 1, and the inequalities in Case 2 can be repeated. In case j0 = i0 , i0 + 1, or i0 + 2 (see Case 3), we have by (25), Ᏼ1 (Q ∩ Si ) ≤ 2Ᏼ1 (Q ∩ Tj0 +1 )
for i > j0 , and the same argument works again. Finally, if j0 < i0 , the proof runs as it is with B replaced by Q. To complete the proof of Theorem 5, we use the same method as at the end of the proof of Theorem 6, defining f = m fm and µ = m µm . Then f ∈ BMO(µ) (for all choices of the coordinate axes), but f ∈ / BMOb (µ). Another variant of functions of bounded mean oscillation is the following one. Suppose that in the definition of the space BMO(µ) we only consider cubes (with sides parallel to the axes) centered at the support of the measure µ. We denote this new space of functions by BMOc (µ). Obviously, BMO(µ) is contained in BMOc (µ), and as the next example shows, this inclusion can be strict, even for the restriction of the Lebesgue measure to a cube. Example. The idea to construct our example is simple, but again the explanation becomes a little tedious. Take the square Q0 = [0, 1] × [−1/2, 1/2], and let µ be the planar Lebesgue measure restricted in Q0 . Now, we consider a collection of squares which are dyadic with respect to Q0 . For each positive integer k and for each j = 1, 2, . . . , 2k−1 , we define squares 1 j −1 j + , , Qk,j = 0, k × 2 2k 2k 1 −j 1 − j Q− = 0, , × . k,j 2k 2k 2k Let ϕ be a Lipschitz function satisfying (i) 0 ≤ ϕ ≤ 1; (ii) 1 1 1 1 ϕ(z) = 1 if z ∈ − , × − , , 2 2 2 2 3 3 3 3 ϕ(z) = 0 if z ∈ / − , × − , ; 4 4 4 4
560
MATEU, MATTILA, NICOLAU, AND OROBITG
(iii) ∇ϕ ≤ 4. Then define
+ + ϕk,j (z) = ϕ 2k z − ck,j
and
− − (z) = ϕ 2k z − ck,j ϕk,j ,
+ − − + + and ck,j are the centers of Q+ where ck,j k,j and Qk,j . Thus, ϕk,j is equal to 1 on Qk,j , + − k has support contained in (3/2)Q+ k,j , and ∇ϕk,j ≤ 8 · 2 . The function ϕk,j satisfies the analogous properties. Write k−1 ∞ 2 + − ϕk,j − ϕk,j . b= k=1 j =1
When Q is any square contained in Q0 , a standard computation (or an application of [GJ, Lemma 2.1]) gives 1 |b − bQ | dµ ≤ C. µ(Q) Q If Q is any square with center lying in Q0 , then there is a square P ⊂ Q0 so that Q ∩ Q0 ⊂ P and µ(Q) ≥ (1/4)µ(P ). So, |b − bP | dµ ≤ |b − bP | dµ ≤ Cµ(P ) ≤ Cµ(Q). Q
P
Consequently, b ∈ BMOc (µ). On the other hand, let Qk be a square suchthat Qk ∩ Q0 = [0, 1/2k ] × [−1/2, 1/2]. Then µ(Qk ) = 2−k . By symmetry of b, Qk b = 0. Observe that ∞ k k j |b − bQk | dµ = |b| ≥ ≥ 2−k = µ(Qk ). j +1 2 2 2 Qk Qk j =k
Therefore, b ∈ / BMO(µ). Now, we describe an example that shows that the John-Nirenberg inequality is false for BMOc (µ), even for absolutely continuous measures µ and cubes centered at points of the support of the measure. For simplicity, our measure again contains pieces on line segments, but obviously it can be fattened without destroying the desired properties. Let m > 1 be an integer. Set
Ij = (x, y) : (j − 1)/m < x ≤ j/m, y = 1 , j = 1, . . . , m,
Jj = (x, y) : (j − 1)/m < x ≤ j/m, y = 1 + 1/m , j = 1, . . . , m,
K = (x, y) : 0 ≤ x ≤ 1, y = 0 , I=
m j =1
Ij ,
J=
m j =1
Jj .
561
BMO FOR NONDOUBLING MEASURES
Then consider the measure µ=2
−m
Ᏼ | K +m 1
−1 −m
2
Ᏼ |I+ 1
m
2−j Ᏼ1 | Ij
j =1
and the function f=
m
j χIj ∪Jj
j =1
(see Figure 5).
1 + 1/m
1
2−iᏴ1
f ≡i
J I
m−1 2−m Ᏼ1
0
f ≡0
K
µ = 2−m Ᏼ1
1
Figure 5. The function f and the measure µ, when m = 4
It is easy to check that for every square Q with sides parallel to the coordinate axes and with center in I ∪ J ∪ K, there is a = a(Q) ∈ R such that (26) |f − a| dµ ≤ Cµ(Q). Q
Actually, if the center of Q is in K, one may take a = 0 because f dµ ≤ Cµ(I ) and f dµ ≤ Cµ(J ). I
J
If the center of Q is in I ∪ J , one may assume its sidelength $(Q) ≥ 1/m. Since µ(Ij ) ≤ µ(Jj ), one has |f − a| dµ ≤ 3 |f − a| dµ. Q
Q∩J
562
MATEU, MATTILA, NICOLAU, AND OROBITG
Now, since the measure µ decays exponentially in J , the function f | J looks like a logarithm. Hence, if a = min{j : Jj ∩ Q = ∅}, one has |f − a| dµ ≤ Cµ(Q ∩ J ). Q∩J
So, (26) is proved. Notice also that if one has a cube Q that contains either K, I , or J , then (26) holds with a = 0. Taking
Q = (x, y) : |x| ≤ 1 + 1/2m, |y| ≤ 1 + 1/2m , we have that for any a ∈ R, 2−m µ(Q) µ {x ∈ Q : |f (x) − a| > m/3} ≥ ≥ . 3m 4m As in the proofs of Theorems 5 and 6, we now apply this construction to a sequence of squares. Consider a collection {Qn : n = 1, 2, . . . } of disjoint squares whose left side is in the y-axis and that satisfies
dist(Qn , Qm ) ≥ 2 max $(Qn ), $(Qm ) , n = m. Let µn , fn be the measure and the function given by the construction above in the square Qn with m = n. Set µ= µn , f= fn . One has f ∈ BMOc (µ). To see this, observe that if a cube Q centered at a point in Qk intersects some other Qj , j = k, then Qk ⊂ Q and, moreover, Q contains the K or J piece of the cube Qj . So, as remarked above, one has fj dµj ≤ Cµj (Q) Q
and adding up,
Q
f dµ ≤ Cµ(Q).
So, f ∈ BMOc (µ). On the other hand, as before, for any a ∈ R, one has
µ(Qn ) µ x ∈ Qn : |f (x) − a| > n/3 ≥ , 4n and the John-Nirenberg inequality fails. Appendix: John-Nirenberg theorem on spaces of homogeneous type Let (X, d, µ) be a space of homogeneous type in the sense of Coifman and Weiss [CW, p. 587]. Thus, the quasidistance d satisfies d(x, y) ≤ K d(x, z) + d(z, y) ,
563
BMO FOR NONDOUBLING MEASURES
and the positive Borel measure µ is doubling µ B(x, 2r) ≤ Dµ B(x, r) , where B(x, r) = {y ∈ X : d(x, y) < r} is the ball centered at x and radius r > 0. Remark. There exists a constant α such that if x ∈ B(a, R) and 0 < ρ ≤ 12KR/5, then B(x, ρ) ⊂ B(a, αR). (Take α = K((12K/5) + 1).) Our goal is to check the following. Theorem A. There exist two positive constants β and b such that for any f ∈ BMO(X) and for any ball S ⊂ X, one has (27)
µ x ∈ S : |f − fS | > λ ≤ β exp −
bλ µ(S), Kf ∗
for all λ > 0.
Proof. We guess that the proof we present here is implicit in the work of Coifman and Weiss [CW, p. 594, footnote]. However, since they didn’t write it explicitly, there has been some confusion in the literature. We follow the standard stopping time argument; that is, we assume that λ is large enough and fix some λ1 . Then we study the sets {x ∈ S : |f (x) − fS | ≤ λ1 }, {x ∈ S : |f (x) − fS | ≤ 2λ1 }, up to {x ∈ S : |f (x) − fS | ≤ mλ1 ! λ}. In showing (27), we assume f ∗ ≤ 1 and fix S = B(a, R). We define a maximal operator associated to S (if we replace S by another ball, then the maximal operator changes),
1 MS f (x) = sup µ(B)
B
|f (y) − fS | dµ(y) : B ball, x ∈ B, B ⊂ B(a, αR) .
Using a Vitali-type covering lemma, one can prove that A µ {x : MS f (x) > t} ≤ µ(S), t where A is a constant that only depends on K and D but not on S. Take λ0 > A. Consider the open set U = {x : MS f (x) > λ0 }. We have µ(U ∩S) ≤ (A/λ0 )µ(S) < µ(S), and therefore S ∩ U c = ∅. Define r(x) = (1/(5K)) dist(x, U c ). If x, y ∈ S, then d(x, y) ≤ 2KR. Since U c ∩ S = ∅, if x ∈ S, we have r(x) ≤ 2KR/(5K) = 2R/5. Clearly, B x, r(x) ⊂ U. U ∩S ⊂ x∈U ∩S
Again by a Vitali-type covering lemma (e.g., see [CW, Theorem 3.1]), we can select
564
MATEU, MATTILA, NICOLAU, AND OROBITG
a finite or countable sequence of disjoint balls {B(xj , rj )} such that rj = rj (x) and U ∩S ⊂ B xj , 4Krj ⊂ U. j
On the other hand, and
B xj , 6Krj ∩ U c = ∅
12KR . B xj , 6Krj ⊂ B(a, αR) because 6Krj ≤ 5
Thus, we get 1 µ B xj , 6Krj
B(xj ,6Krj )
|f − fS | dµ ≤ λ0 ,
(1)
and consequently, if we write Sj = B(xj , 4Krj ), we obtain 1 |fS − fSj | ≤ |f − fS | ≤ c0 λ0 := λ1 , µ(Sj ) Sj because µ is a doubling measure. By the differentiation theorem, |f (x) − fS | ≤ λ0
for µ-a.e. x ∈ S
j
(1)
Sj .
Moreover, (1) A CA µ Sj ≤ C µ B xj , rj ≤ Cµ(U ) ≤ µ(S) = µ(S). λ0 λ0 j
j
(1)
Now, we do the same construction for each Sj . Again (2) (1) Si , f (x) − fS (1) ≤ λ0 for µ-a.e. x ∈ Sj j
i
and therefore for these points |f (x) − fS | ≤ f (x) − fS (1) + fS (1) − fS ≤ λ0 + c0 λ0 ≤ 2c0 λ0 := 2λ1 . j
j
It is clear (taking λ0 = 2A ) that (2) A (1) A 2 µ Si µ Sj ≤ µ(S) = 2−2 µ(S). ≤ λ0 λ0 i
j
By continuing this process, we would finish the proof.
BMO FOR NONDOUBLING MEASURES
565
Recently, in [B] and [MP], other correct proofs of the John-Nirenberg inequality for homogeneous-type spaces have been presented. References [B] [CW] [GJ] [G] [J] [MP] [M] [NTV] [S] [T] [V] [W]
S. M. Buckley, Inequalities of John-Nirenberg type in doubling spaces, preprint. R. R. Coifman and G. Weiss, Extensions of Hardy spaces and their use in analysis, Bull. Amer. Math. Soc. 83 (1977), 569–645. J. B. Garnett and P. Jones, The distance in BMO to L∞ , Ann. of Math. (2) 108 (1978), 373–393. M. de Guzmán, Differentiation of Integrals in Rn , Lecture Notes in Math. 481, Springer, Berlin, 1975. J.-L. Journé, Calderón-Zygmund Operators, Pseudodifferential Operators and the Cauchy Integral of Calderón, Lecture Notes in Math. 994, Springer, Berlin, 1983. P. MacManus and C. Perez, Trudinger inequalities without derivatives, preprint. P. Mattila, Geometry of Sets and Measures in Euclidean Spaces: Fractals and Rectifiability, Cambridge Stud. Adv. Math. 44, Cambridge Univ. Press, Cambridge, 1995. F. Nazarov, S. Treil, and A. Volberg, Cauchy integral and Calderón-Zygmund operators on nonhomogeneous spaces, Internat. Math. Res. Notices 1997, 703–726. E. M. Stein, Singular Integrals and Differentiability Properties of Functions, Princeton Math. Ser. 30, Princeton Univ. Press, Princeton, 1970. X. Tolsa, L2 -boundedness of the Cauchy integral operator for continuous measures, Duke Math. J. 98 (1999), 269–304. J. Verdera, On the T (1)-theorem for the Cauchy integral, to appear in Ark. Mat. I. Wik, On John and Nirenberg’s theorem, Ark. Mat. 28 (1990), 193–200.
Mateu, Nicolau, and Orobitg: Departament de Matemàtiques, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain;
[email protected];
[email protected];
[email protected] Mattila: Department of Mathematics, University of Jyväskylä, P.O. Box 35, SF-40351 Jyväskylä, Finland;
[email protected].fi
Vol. 102, No. 3
DUKE MATHEMATICAL JOURNAL
© 2000
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS DAN EDIDIN and WILLIAM GRAHAM 1. Introduction. The purpose of this paper is to prove an equivariant RiemannRoch theorem for schemes or algebraic spaces with an action of a linear algebraic group G. For a G-space X, this theorem gives an isomorphism G (X) − τ G : GG (X) −→ G → Q −
∞ i=0
CH iG (X)Q .
G (X) is the completion of the equivariant Grothendieck group of coherent Here G sheaves along the augmentation ideal of the representation ring R(G), and the groups CH iG (X) are the equivariant Chow groups defined in [EG2]. The map τ G has the same functorial properties as the nonequivariant Riemann-Roch map of [BFM] and [F, Theorem 18.3]. If G acts freely, then τ G can be identified with the nonequivariant Todd class map τX/G : G(X/G) → CH ∗ (X/G)Q . The key to proving this isomorphism is a geometric description of completions of the equivariant Grothendieck group (see Theorem 2.1). Aside from Riemann-Roch, this result has some purely K-theoretic applications. In particular, we prove (see Corollary 6.2) a conjecture of Köck (in the case of regular schemes over fields) and extend to arbitrary characteristic a result of Segal on representation rings (see Corollary 6.1). For actions with finite stabilizers, the equivariant Riemann-Roch theorem is more precise; it gives an isomorphism between a localization of GG (X)Q and ⊕ CH iG (X)Q (see Corollary 5.1). This formulation enables us to give a simple proof of a conjecture of Vistoli (see Corollary 5.2). If G is diagonalizable, then we can express GG (X) in terms of the equivariant Chow groups (an unpublished result of Vistoli; cf. [To] also). Actions with finite stabilizers are particularly important because quotients by these actions arise naturally in geometric invariant theory. In a subsequent paper, we will use these results to express the Todd class map for a quotient of such an action in terms of equivariant Todd class maps, generalizing Riemann-Roch formulas of Atiyah and Kawasaki. The main tool of this paper is the approximation of the total space of the classifying bundle EG by an open subset U of a representation V , where G acts freely on U and where V −U is a finite union of linear subspaces. Approximations to EG by open sets
Received 8 January 1999. Revision received 20 August 1999. 1991 Mathematics Subject Classification. Primary 14C40, 14L30; Secondary 19E15. Edidin’s work partially supported by National Security Agency grant number MDA904-97-1-0030 and the University of Missouri Research Board. Graham’s work partially supported by the National Science Foundation. 567
568
EDIDIN AND GRAHAM
in representations were introduced by Totaro in Chow theory (see [T]) and were used in [EG2] to define equivariant Chow groups. However, in these papers, V −U is only required to have large codimension. Because Chow groups are naturally graded, we i N i can identify ⊕N i=0 CH G (X) with ⊕i=0 CH G (X × U ) as long as codim(V − U ) > N. Since Grothendieck groups are not naturally graded, we need the stronger condition that V − U is a union of linear subspaces to compare GG (X) with GG (X × U ). Such V and U can be found for tori or for Borel subgroups of GLn . For these groups, we prove that the completion of GG (X) along the augmentation ideal has a geometric description. This fact directly implies the equivariant Riemann-Roch isomorphism for these groups. For general G, however, it seems unlikely that such V and U exist, so we must employ a less direct approach. The Riemann-Roch theorem for general G is deduced by embedding G into GLn and then reducing the case of GLn to that of a Borel subgroup. This strategy of proof is due to Atiyah and Segal for compact groups. The necessity of using completions of equivariant Grothendieck groups also goes back to Atiyah and Segal. In our setting it is motivated as follows. For smooth varieties, the Todd class map is defined by a power series that, in the nonequivariant case, terminates on any particular variety. However, the equivariant Chow groups of a fixed variety can be nonzero in arbitrarily highdegree, so the natural target of the equivariant i Todd class map is the infinite product ∞ i=0 CH G (X)Q . To obtain a Riemann-Roch isomorphism, it is natural to expect that the equivariant Grothendieck groups must also be completed, as is indeed the case. There is one essential difference between these completions. The completion map for Chow groups is injective, as it simply replaces a direct sum by a direct product. However, on the Grothendieck group side, information is definitely lost by completing (cf. Section 5). Acknowledgement. We thank the referee who read the paper very carefully and suggested many corrections and improvements. 1.1. Contents. In Section 2 we define the completions we need for the main results, and we give a geometric description of these completions for tori and for the subgroup of upper triangular matrices of GLn . In Section 3 we construct the equivariant Riemann-Roch map, and we prove that it has the same functorial properties as the nonequivariant Riemann-Roch map. Moreover, it behaves naturally with respect to restriction to a subgroup (see Section 3.4). In Section 3.3 we illustrate the use of this theorem by deriving the Weyl character formula for SL2 following [Bo]. In Section 4 we prove that the Riemann-Roch map induces an isomorphism on the completions. Section 5 contains results, mentioned above, on actions with finite stabilizers, and proves Vistoli’s conjecture. Finally, in Section 6 we prove that two naturally defined completions of equivariant K-theory are equal, and we apply this to prove Köck’s conjecture and the result about representation rings. 1.2. Conventions and notation. All groups in this paper are assumed to be linear algebraic groups over an arbitrary field k (i.e., closed subgroup-schemes of GLn (k)).
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
569
Note that in characteristic p such groups need not be smooth over k. Representations of G are assumed to be rational, that is, linear actions of G on finite-dimensional k-vector spaces. Free actions. By a free action of G on X, we mean an action that is schemetheoretically free; that is, the action map G×X → X ×X is a closed embedding. An action that is proper and set-theoretically free is free (see [EG2, Lemma 8]). Algebraic spaces. This paper is written in the language of algebraic spaces. Unless stated otherwise, a space is an equidimensional quasi-separated algebraic space of finite type over k. One reason to work in this category is that we need quotients of the form X ×G U , where U is an open set in a representation of G on which G acts freely. In the category of algebraic spaces, such quotients exist, by a result of Artin (see [EG2, Proposition 22]). It is possible to work entirely in the smaller category of schemes of finite type over k, but some mild technical hypotheses are required to ensure that quotients exist as schemes (see [EG2, Proposition 23]). Because free actions are proper, if X is separated, so is X ×G U (see [E, Corollary 2.2]). Equivariant Chow groups. We use AG k (X) to refer to the equivariant Chow groups of “dimension” k as defined in [EG2]. However, it is usually more convenient to index by codimension. We write CH iG (X) for the “codimension” i equivariant Chow group: i If X has pure dimension n, then CH iG (X) := AG n−i (X). The notation AG (X) refers to the degree i operational Chow group; when X is smooth, CH iG (X) AiG (X) (see i [EG2]). The operational Chow groups have a natural product, and ⊕∞ i=0 CH G (X) is i a positively graded module for the graded ring ⊕AG (X). The G-equivariant Chow ring of a point is denoted A∗G . Pullback from a point i ∗ makes ⊕∞ i CH G (X) a graded AG -module. Equivariant K-theory. We use K G (X) to refer to the Grothendieck group of G-equivariant vector bundles. The Grothendieck group of G-equivariant coherent sheaves is denoted GG (X). The tensor product gives a ring structure on K G (X), and GG (X) is a module for this ring. If X is regular and has the property that every coherent sheaf is the quotient of a locally free sheaf (this holds for regular schemes but is not known in general for regular algebraic spaces), then a result of Thomason [Tho3, Corollary 5.8] says that these groups are isomorphic. The representation ring R(G) can be identified with K G (pt), and consequently G G (X) has an R(G)-module structure. If A is a ring, then define GG (X)A = GG (X) ⊗Z A as the Grothendieck group with coefficients in A. We make the analogous definition for Chow groups. 2. Completions. In this section (except in Example 2.1), we work with a fixed coefficient ring A and write simply GG (X) for GG (X)A ; we do similarly for Chow groups. We use the following completions in the sequel. The graded ring ⊕AiG (X) com ∞ i i pletes to ∞ i=0 AG (X), and i=0 CH G (X) is naturally a module for this ring. We let ∗ (X) denote the completion J denote the augmentation ideal in A∗G , and we let CH G
570
EDIDIN AND GRAHAM
of the A∗G -module CH ∗G (X) along J . Completing equivariant K-theory requires more care since K G is not naturally graded. We complete as follows. Choose an embedding G → GLn . This induces a homomorphism R = R(GLn ) → R(G). Thus, GG (X) is an R-module. Let I be the augmentation ideal of R (the ideal of virtual representations of GLn of dimension zero), and let IG be the augmentation ideal of R(G). We denote the I -adic completion G (X). of GG (X) by G This definition is convenient for proving results about G by reducing to the case of GLn , but it is not obvious that it is independent of the choice of embedding of G into GLn . However, this follows from Corollary 6.1, which implies that I and IG give the same topology on R(G) whenever G → GLn ; hence, I -adic and IG -adic completions are isomorphic. A special case of this result is proved in Proposition 2.2. In characteristic zero, Corollary 6.1 is the same as [Seg, Corollary 3.9]. However, Segal’s methods do not extend to characteristic p. 2.1. Geometric completions. There is a way of completing equivariant K-groups and Chow groups which is more directly related to the definition of equivariant Chow groups. For any representation V of G, let V 0 denote the open subset of points of V on which G acts freely, and let U be an open subset of V 0 . A quotient (X × U )/G exists; such a quotient is usually written as X ×G U . Choose V and U such that the codimension of V − U is greater than k (this is always possible by [EG1]). Then, by definition, CH kG (X) = CH k (X ×G U ). With this as motivation, let ᐂ be a collection of pairs (V , U ) of G-modules and invariant open sets with the following properties. (i) G acts freely on U . (ii) If (V1 , U1 ) and (V2 , U2 ) are in ᐂ, then there is a pair (V1 ⊕ V2 , U ) in ᐂ such that U contains U1 × V2 and V1 × U2 . The elements of such a system can be partially ordered by requiring that the pair (W, UW ) < (V , UV ) if the G-module V can be written as a direct sum V = W ⊕ W with UV ⊃ UW × W . Suppose that (V , U ) is in ᐂ; by the homotopy property, we can identify GG (X) = GG (X × V ) (see [Tho3, Theorem 4.1]) and CH iG (X × V ) = CH iG (X) (see [EG2]). In this way, we obtain surjective maps kV : GG (X) −→ GG (X × U ) and rV : ⊕ CH iG (X) −→ ⊕ CH iG (X × U ). Lemma 2.1. If (W, UW ) < (V , UV ), then ker kW ⊇ ker kV and ker rW ⊇ ker rV . Proof. Write V = W ⊕ W . Then UV ⊃ UW × W , so we have surjective maps GG (X) −→ GG (X × UV ) −→ GG (X × UW × W ) ∼ = GG (X × UW ).
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
571
The first map is kV and the composition is kW , proving the first inclusion. A similar argument works for rV and rW . For (W, UW ) < (V , UV ), the lemma implies that there is a natural surjective map GG (X)/ ker kV −→ GG (X)/ ker kW making these groups into an inverse system indexed by ᐂ. A similar statement holds for the Chow groups. Taking the inverse limit, we obtain a completion of GG (X). For an arbitrary system ᐂ, it is difficult to describe this completion. However, there is one situation where we can understand it. Call a system ᐂ with above properties (i) and (ii) good if it has the following third property. (iii) If (V , U ) is in ᐂ, then V − U is contained in a finite union of invariant linear subspaces. Theorem 2.1. Let ᐂ be a good system of representations. Then, we have the following. (a) The topology on GG (X) induced by the subgroups ker kV coincides with the IG -adic topology. Hence, lim←(V ,U )∈ᐂ (GG (X)/ ker kV ) is isomorphic to the IG -adic completion of GG (X). (b) The topology on CH ∗G (X) induced by the subgroups ker rV coincides with the J -adic topology. Hence, lim←(V ,U )∈ᐂ CH ∗G (X)/ ker rV is isomorphic to the J -adic completion of CH ∗G (X). The following result shows that for certain classes of groups, good systems exist. Theorem 2.2. If G is a torus or, more generally, any subgroup of the group of upper triangular matrices in GLn , then there exists a system ᐂ of good pairs for G. In particular, if the ground field is algebraically closed, then any connected solvable group has a good system. The proofs of these theorems are given in the next two subsections. We close with two propositions that are used in the sequel. Proposition 2.1. Let G be a connected subgroup of the group of upper triangular matrices. Then ∞ ∗ (X) CH CH iG (X). G i=0
Proof. If G is a connected subgroup of the upper triangular matrices, then G = T U with T a torus and U unipotent (see [Bor, Theorem 10.6]). An argument similar to that used in Thomason [Tho4, proof of Theorem 1.11] implies that CH ∗G (X) = CH ∗T (X), so we assume G = T is a torus.
572
EDIDIN AND GRAHAM
i Let M = CH ∗T (X) and Mn = ⊕∞ i=n+1 CH T (X). We show that the filtrations of M n given by {J M} and {Mn } give the same topology. Step 1. For any n we must show that there exists l such that J l M ⊆ Mn . Since multiplication by J l maps CH iT (X) to Mi+l−1 , if l ≥ n + 1, then the desired inclusion holds. Step 2. For any n we must show that there exists l such that Ml ⊆ J n M. Brion [Br, Theorem 2.1] has shown that CH ∗T (X) is generated as an A∗T -module by fundamental classes of T -invariant subvarieties. Any such fundamental class lies in CH iT (X) for some i ≤ N, where N = dim X. Hence, if r > dim X, then i CH rT (X) ⊆ ⊕i≤N Ar−i T · CH G (X).
Since T is a torus, A∗T is generated as an algebra in degree 1 (see [EG2, Section 3.2]). r−i , and we can conclude that if l > n+N , then we have M ⊆ J n M. Thus, Ar−i l T ⊂J Proposition 2.2. If G = GLn and B (resp., T ) is the group of upper triangular (resp., diagonal) matrices, then we have the following. (a) The map R(B) → R(G) = R(B)W is finite. (b) IGLn generates the same topology on R(B) (resp., R(T )) as IB (resp., IT ). Proof. The Weyl group of GLn is Sn over any field, so by [Ser, Example 3.8] R(G) = R(B)Sn . In this case it is well known that R(B) Z[t1 , . . . , tn , t1−1 , . . . , tn−1 ] and R(G) = R(B)Sn Z[e1 , . . . , en , e1−1 , . . . , en−1 ], where ei is the ith elementary symmetric polynomial in the variables t1 , . . . , tn . This fact obviously implies (a). Part (b) follows from [Seg, Corollary 3.9] applied to the maximal compact subgroups Un ⊂ GLn (C) and (S 1 )n ⊂ B. 2.2. Proof of Theorem 2.2. Observe that if H ⊂ G is a closed subgroup and if
ᐂ is a good system for G, then it is also a good system for H . Thus, it suffices to
construct a good system for the group B of upper triangular matrices in GLn . We do this as follows. Let V1 be the vector space of upper triangular matrices. B acts on V1 by left matrix multiplication. Let U1 ⊂ V1 be the subset of invertible elements. Then V1 − U1 is a union of n hyperplanes. Set Vk = V1⊕k , and let Uk ⊂ Vk consist of the k-tuples of V1 such that at least one element of the k-tuple lies in U1 . Then Vk −Uk is a union of linear subspaces. By construction, the collection of pairs {(Vk , Uk )} satisfies properties (ii) and (iii) of a good system. The action of B on Uk has trivial stabilizers; but, to verify (i), we must check that the action map B × Uk → Uk × Uk is a closed embedding. A tuple of matrices (A1 , . . . , Ak , B1 , . . . , Bk ) ∈ Uk × Uk is in the image of B × Uk if and only if there is a matrix A ∈ B such that Bi = AAi for all i. Thus, the image of B × Uk in Uk × Uk is the closed subvariety defined by the equations Bi Adj(Ai )Aj = det(Ai )Bj . Remark 2.1. Let (V , U ) be a pair in our system ᐂ of good representations. In the proof of Theorem 2.1 below, we only use the fact that G acts with trivial stabilizers
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
573
(as opposed to freely) on the open set U ⊂ V . However, the freeness of the action is essential when we apply Theorem 2.1 to prove the Riemann-Roch isomorphism. This is because we need to know that if X is a separated algebraic space, then X ×G U is still separated. 2.3. Proof of Theorem 2.1 Lemma 2.2. Let X be a G-space that is a union of invariant irreducible components X1 , . . . , Xk . Then proper pushforward gives a surjection ⊕GG (Xi ) −→ GG (X). Proof. By induction it suffices to prove the lemma when X = X1 ∪ X2 . Let X˜ ˜ = GG (X1 ) ⊕ GG (X2 ). The finite be the disjoint union of X1 and X2 . Then GG (X) surjective map X˜ → X gives a map of localization exact sequences GG(X1 ∩ X2 )⊕2
/ GG(X ) ⊕ GG(X ) 1 2
/ GG(X)
GG(X1 ∩ X2 )
/ GG(X −X ∩ X ) ⊕ GG(X −X ∩ X ) 1 1 2 2 1 2
/0
/ GG(X −X ∩ X ) ⊕ GG(X −X ∩ X ) / 0. 1 1 2 2 1 2
A diagram chase shows that the map GG (X1 ) ⊕ GG (X2 ) → GG (X) is surjective. Remark 2.2. The analogous statement also holds for nonequivariant higher Ktheory. However, for higher K-theory, the only proof we know uses the Brown-Gersten spectral sequence. Since we do not know how to adapt that sequence to equivariant K-theory, we cannot state a result for higher equivariant K-theory. Lemma 2.3. Let L V be representations of G, and let i : X × L → X × V be the inclusion. Then i∗ GG (X × L) ⊂ IG GG (X × V ), where IG ⊂ R(G) is the augmentation ideal. Proof. Let πV : X × V → X and πL : X × V → X be the projections. By the homotopy property of equivariant G-theory, the smooth pullbacks πV∗ and πL∗ are isomorphisms of R(G)-modules. Since i is a regular embedding and πV ◦ i = πL , we have, by the compatibility of flat and local complete intersection morphism (l.c.i.) pullbacks (see [FL, VI.6]), that i ∗ πV∗ = πL∗ . Thus, the pullback i ∗ : GG (X × V ) → GG (X × L) is an isomorphism of R(G)-modules. Hence, it suffices to show that i∗ i ∗ GG (X × L) ⊂ IG GG (X × V ). By the projection formula, i∗ i ∗ acts by multiplication by i∗ (1) ∈ K G (X × V ). Since L V is an invariant linear subspace, i∗ (1) = λ−1 (V /L), which is in IG .
574
EDIDIN AND GRAHAM
Lemma 2.4. Let Z be a separated algebraic space, and let aZ = ker(K(Z) → Z) be the augmentation ideal. Then, if k > dim Z, akZ G(Z) = 0. If Z is not separated, then akZ G(Z) = 0 for k sufficiently large. Proof. Let Z be a separated algebraic space. Following [F, Definition 18.3], we p define a Chow envelope Z → Z to be a proper morphism from a quasi-projective scheme Z , such that for every integral subspace V ⊂ Z, there is a subvariety V of X such that p maps V birationally to V . Using Chow’s lemma for algebraic spaces (see [Kn, Theorem IV.3.1]), the argument of [F, Lemma 18.3] shows that every algebraic space has a Chow envelope (of the same dimension as Z) and that the proper pushforward p∗ : G(Z ) → G(Z) is surjective. Since Z is quasi-projective, FZk G(Z ) = 0 [FL, Corollary 3.10], where FZk is the kth level of the γ -filtration. Since p ∗ : K(Z) → K(Z ) preserves the rank of a locally free sheaf, p ∗ akZ ⊂ akZ . By definition, akZ ⊂ F k , so p ∗ akZ G(Z ) = 0. Hence, by the projection formula, akZ p∗ G(Z ) = 0. Since p∗ is surjective, the lemma follows. Even if Z is not separated, it has an open set W that is a separated scheme. Then ak GG (W ) = 0 for k > dim Z. Using the Noetherian induction and the localization sequence, we see that akW GG (Z) = 0 for k 0. Lemma 2.5. Suppose that G acts freely on an algebraic space Z. Then IGk GG (Z) = 0 for k 0. Proof. Let Y = Z/G be the quotient algebraic space. The proof of [MFK, Proposition 0.9] extends to algebraic spaces and shows that Z → Y is a G-principal bundle. Thus, there is an equivalence between the categories of coherent sheaves on Y and G-equivariant sheaves on X; that is, GG (Z) G(Y ). Under this isomorphism, IGl GG (Z) ⊂ al G(Y ). The lemma now follows from Lemma 2.4. Proof of Theorem 2.1. We only prove part (a). The corresponding result about Chow groups has essentially the same proof. There are two steps to show that the filtrations of GG (X) by the submodules ker kV and by the powers of the ideal IG generate the same topology. Step 1. We must show that given any pair (V , U ), there is an integer k such that IGk GG (X) ⊂ ker kV or, in other words, that IGk GG (X × U ) = 0. Since G acts freely on X × U , this follows from Lemma 2.5. Step 2. We must show that given a positive integer k, there is a pair (V , U ) such that ker kV ⊂ IGk GG (X). Let (V , U ) be any good pair and set C = V −U . Then, ker kV = im(GG (X × C) → GG (X)). By definition, C = Ci , where Ci is contained in an invariant linear subspace Li of V , and by Lemma 2.3, im(GG (X ×Ci )) ⊂ IG GG (X). By Lemma 2.2, im(GG (X × C)) is generated by the images of GG (X × Ci ), so im(GG (X × C)) ⊂ IG GG (X). The group G acts freely on the open set Uk = V ⊕k − C k , and by induction, im GG X × C k ⊂ IGk GG X × V ⊕k .
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
575
Thus, (V ⊕k , Uk ) is the desired good pair. Example 2.1. There is a natural map G G (X)Q / ker(kV )Q −→ lim ←(V ,U )∈ᐂ
lim
←(V ,U )∈ᐂ
GG (X)Z / ker(kV )Z ⊗ Q.
However, because inverse limits do not commute with tensor products, this map need not be an isomorphism. For example, if G = Gm is the 1-dimensional torus and X is a point, then GG (X)Z = K G (X) = R(G) = Z u, u−1 . Let ᐂ denote the system (V ⊕k , Uk ), where Uk = V ⊕k − 0. Write x = u − 1; then lim←(V ,U )∈ᐂ (GG (X)Z / ker(kV )Z ) Z[[x]], so
lim
←(V ,U )∈ᐂ
GG (X)Z / ker(kV )Z ⊗ Q Z[[x]] ⊗Z Q.
This is not isomorphic to lim
←(V ,U )∈ᐂ
GG (X)Q / ker(kV )Q Q[[x]].
3. Construction of an equivariant Riemann-Roch map. If Z is a separated algebraic space, then there is a Riemann-Roch map τZ : G(Z) → CH ∗ (Z)Q with the same properties as the Riemann-Roch map for schemes constructed (see [F, Theorem 18.3]). This fact (see [Gi]) can be deduced from the Riemann-Roch theorem for quasi-projective schemes and the existence of Chow envelopes for separated algebraic spaces. In this section we construct, for a separated G-space X, an equivariant RiemannRoch map ∞ τ G : GG (X) −→ CH iG (X)Q i=0
with the same functoriality as in the nonequivariant case (see [F, Chapter 17]). In G (X). addition, this map factors through the completion map GG (X) → G The results of the previous section are not needed to construct the map, but they are G (X) → used in the next section to show that the map induces an isomorphism G Q ∞ i i i i=0 CH G (X)Q . To simplify notation, we write CH G (X) for CH G (X)Q . All spaces in this section are assumed to be separated. 3.1. Equivariant Todd classes and Chern characters. If E → X is an equivariant vector bundle of rank r, then it has equivariant Chern classes c1G (E), . . . , crG (E), which are elements of the equivariant operational Chow ring A∗G (X) (see [EG2, Sections 2.4 and 2.6]). As in the nonequivariant case, an equivariant vector bundle E → X of rank r has Chern roots x1 , . . . , xr such that ciG (E) = ei (x1 , . . . , xr ), where ei (x1 , . . . , xr )
576
EDIDIN AND GRAHAM
refers to the ith elementary symmetric polynomial. This follows from the fact that the equivariant Chern classes can be calculated on a fixed mixed space X ×G U (see [EG2, Section 2.4, Definition 1]) and the nonequivariant splitting principle (see [F, Remark 3.2.3]). Definition 3.1. Let E be a vector bundle with Chern roots x1 , . . . , xr . Define the equivariant Chern character chG : K G (X) −→
∞ i=0
by the formula chG (E) =
r
AiG (X)
exi .
i=1
Likewise, define the equivariant Todd class by the formula G
Td (E) =
r i=1
xi . 1 − e−xi
G G Because ∞ i the leading coefficient of Td (E) is 1, Td (E) is an invertible element of i=0 AG (X).
3.2. Construction of an equivariant Riemann-Roch map. Recall that if G acts freely on a space Y , then we have identifications (1)
GG (Y ) = G(Y /G)
and
CH ∗G (Y ) = CH ∗ (Y/G).
In what follows, we use such identifications, often without further comment. When we compare τ G and τY/G , we tacitly use these identifications. i We want to define τ G : GG (X) → ∞ i=0 CH G (X) without assuming that the action G is free so that if the action is free, then τ coincides with the nonequivariant map τX/G . We define τ G as follows. Choose a representation V such that G acts freely on an open set U and codim(V −U ) > k (such pairs always exist by [EG1, remark after Lemma 3]). The action of G on U and, hence, on X × U is also free—in particular, proper—so the quotient X×G U is a separated algebraic space (see [E, Corollary 2.2]). Define ρU : GG (X × U ) −→ CH ∗G (X × U ) by ρU (β) =
τX×G U (β) TdG (V )
.
Under the identification CH ∗G (X × U ) = CH ∗ (X ×G U ), the action of TdG (V ) is that of the Todd class of the vector bundle X ×G (U × V ) → X ×G U .
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS j
577
j
If j < k, we identify CH G (X) with CH G (X ×G U ). If α ∈ GG (X), we define i ∈ ∞ i=0 CH G (X) to be the element whose j -component agrees with that of the image of α under the composition τ G (α)
ρU
GG (X) −→ GG (X × U ) −→ CH ∗G (X × U ), where the first map is a flat pullback. Proposition 3.1. The definition of τ G is independent of the choice of V and U . Proof. It suffices to show that, given V ⊃ U and V ⊃ U with codim(V −U ) and codim(V −U ) greater than k, the construction using V and U agrees with that using V = V × V and an open subset U of V . We can choose the open subset of V arbitrarily, provided the codimension is sufficiently large; so we take the open subset U = U × V . The constructions agree because the following diagram commutes: G X ×G (U × V ) O G(X ×G U )
ρU ×V
/ CH ∗ X ×G (U × V ) O
ρU
/ CH ∗ (X ×G U ).
Here the vertical arrows are flat pullback, and commutativity follows from [F, Theorem 18.3(4)], using the fact that the relative tangent bundle of the morphism (2) is the bundle
X ×G (U × V ) −→ X ×G U X ×G (U × V × V ) −→ X ×G (U × V ).
Remark 3.1. There is a variant construction of the map τ G , which is useful below. Let ᐂ → X be an equivariant vector bundle such that G acts freely on an open set ᐁ ⊂ ᐂ, which surjects onto X. If β ∈ GG (X × ᐁ), set ρᐁ (β) =
τᐁ/G (β) TdG ᐂ
∈ CH ∗G (X × ᐁ).
If codim(ᐂ − ᐁ) > k, then the homotopy property of equivariant Chow groups shows that we can identify CH kG (ᐁ) with CH kG (X) when k < j . Arguing as in the proof of the proposition, we can use pairs of the form (ᐂ, ᐁ) to obtain an equivalent definition of τ G . Theorem 3.1. Let X be a separated G-space. The equivariant Riemann-Roch map τ G : GG (X) −→
∞ i=0
has the following properties.
CH iG (X)
578
EDIDIN AND GRAHAM
G (X). τ G factors through the completion map GG (X) → G G τ is covariant for equivariant proper morphisms. If 3 ∈ K G (X) and α ∈ GG (X), then τ G (3α) = chG (3)τ G (α). Let f : X → Y be a morphism. Assume either (i) f is a smooth and equivariantly quasi-projective1 morphism, or (ii) f : X → Y is an equivariant l.c.i. morphism, X and Y can be equivariantly embedded in smooth G-schemes, and G is connected. Then τ G (f ∗ α) = TdG (Tf )f ∗ τ G (α), where Tf ∈ K G (X) is the relative tangent element of the morphism f . (e) If G acts freely on X, then the map τ G coincides with the nonequivariant map τX/G under the identifications GG (X) = G(X/G) and CH ∗G (X) = CH ∗ (X/G). Moreover, τ is uniquely determined by properties (d)(i) and (e).
(a) (b) (c) (d)
Proof. Properties (b) and (c) follow from the definition of τ G and the nonequivariant Riemann-Roch theorem of [F, Chapter 18]. Property (e) follows because, as in Proposition 3.1, the diagram G X ×G V 0 O G(X/G)
ρV 0
/ CH ∗ X ×G V 0 O
τX/G
/ CH ∗ (X/G)
commutes. i If IG is the augmentation ideal of R(G), then chG (IGN ) ⊂ ∞ N AG . Thus, by property (c), ∞ G N G τ IG G (X) ⊂ CH iG (X). i=N
Thus,
τG
restricts to a map N−1 GG (X) IGN GG (X) −→ CH iG (X). i=0
Taking the limit as N → ∞ gives the desired factorization G (X) −→ GG (X) −→ G
∞ i=0
CH iG (X),
proving (a). The proof of property (d)(i) uses [F, Theorem 18.3(4)] but requires some care, particularly in the category of algebraic spaces. Suppose that f : X → Y is quasiprojective. Let U be an open set in a representation on which G acts freely. Denote say that a morphism f : X → Y is equivariantly quasi-projective if there is a G-linearized, f -ample, line bundle on X. This is an equivariant version of [EGAII, Definition 5.3.1]. 1 We
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
579
the mixed spaces X ×G U and Y ×G U by XG and YG . Using descent (see [SGA, Sections 8.4–8.5]) as in [EG2, Proposition 2], we see that the induced map XG → YG p is smooth and quasi-projective. Let Y → YG be a Chow envelope for YG . Then we have a Cartesian diagram X
f
q
XG
/ Y p
f
/ YG ,
where, by base change, f is smooth and quasi-projective. Since Y is a quasiprojective scheme and f is a quasi-projective morphism, it follows from [EGAII, Corollary 5.3.3] that X embeds in projective space, as well. Hence, [F, Theorem 18.3(4)] applies to the morphism f . If α ∈ G(YG ), then α = p∗ α for some α ∈ G(Y ). Since flat pullback commutes with proper pushforward, f ∗ p∗ α = q∗ f ∗ α . Thus, τ (f ∗ α) = τ (f ∗ p∗ α )
= τ (q∗ f ∗ α )
(by [F, Theorem 18.3(2)]) = q∗ τ (f ∗ α ) ∗ (by [F, Theorem 18.3(4)]) = q∗ Td(Tf )f τ (α ) = q∗ Td(q ∗ Tf )f ∗ τ (α ) (since Tf = q ∗ Tf ) = Td(Tf )f ∗ τ (α)
(by the projection formula).
Taking the limit over all pairs (V , U ) in the construction of τ G gives (d)(i). To prove (d)(ii), argue as follows. Suppose that X ⊂ M and Y ⊂ Q, where M and Q are smooth G-schemes. The requirement that G is connected ensures that we can choose open sets U ⊂ V so that M ×G U and Q ×G U are smooth schemes (see [EG2, Proposition 23]). Thus, the mixed spaces XG and YG are embeddable in smooth schemes, and again by descent (see [EG2, Proposition 2]), the induced map XG → YG is l.c.i., so (d)(ii) follows again from [F, Theorem 18.3(4)]. Finally, suppose that τ is another map with properties (d)(i) and (e). Suppose α ∈ GG (X) is given, and denote by τ (α)j (resp., τ (α)j ) the term of τ (α) (resp., j τ (α)) in CH G (X). Let (V , U ) be a representation and open set on which G acts freely such that codim(V − U ) > j . Let πU : X × U → X × V → X be the composition of j projection with open inclusion. Since codim(V −U ) > j , we may identify CH G (X) = j j CH G (X × V ) = CH G (X × U ). The morphism πU is smooth and quasi-projective so by property (d)(i),
∗ j τ (πU α) τ (α)j = . TdG (V )
580
EDIDIN AND GRAHAM
Since G acts freely, τ and τ coincide on X × U . Thus,
τ (πU∗ α) j = τ (α)j . τ (α)j = TdG (V ) Let E → X be an equivariant vector bundle on a complete variety, and let π denote the morphism X → pt. Then π∗ (E) = (−1)i [H i (X, E)] ∈ R(G). Set ∞ χ G (E) = ch π∗ (E) ∈ AiG 0
and TdG (X) = τXG (ᏻX ). Applying the general Riemann-Roch theorem to the morπ phism X → pt yields the following corollary. Corollary 3.1 (Equivariant Hirzebruch-Riemann-Roch). We have χ G (E) = π∗ chG (E) TdG (X) i in ∞ 0 AG . 3.3. Example: The Weyl character formula. In this section we illustrate the equivariant Riemann-Roch theorem by using it to prove the Weyl character formula for SL2 . This is essentially a special case of a calculation done by Bott [Bo] using the Atiyah-Singer index theorem. This proof is different than some other proofs in that it does not use a localization theorem to reduce to a computation at the fixed-point locus of the action of a maximal torus on the flag variety. Let T = Gm acting on P1 with weights 1, −1. We calculate χ
T
∞ ᏻP1 (n) ∈ CH iT (pt).
i=0
This calculation is related to representations of SL2 , since we can view T as embedded i in SL2 as a maximal torus. Then (−1) H i (P1 , ᏻP1 (n)) is a virtual representation of SL2 , and χ T (ᏻP1 (n)) gives a formula for the restriction of this representation to T . We prove that e(n+1)t − e−(n+1)t χ T ᏻP1 (n) = et − e−t (the notation is explained below). For n ≥ 0 the higher cohomology groups of ᏻP1 (n) vanish, the resulting representation of SL2 is irreducible, and this gives the Weyl character formula. Here is the calculation. We have A∗T = Q[t] (see [EG2, Section 3]), and ∞ i=0
AiT ∼ =
∞ i=0
CH iT (pt) = Q[[t]].
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
The Chern character chT : R(T ) → is a representation of T , write
∞
i i=0 AG (X)
581
is given explicitly as follows. If V
V = kn1 ⊕ · · · ⊕ knr , where kni is the 1-dimensional representation of T with weight ni (the weights ni need not be distinct). Then chT (V ) = eni t . In [EG2, Section 3.3] we computed A∗T (P1 ) Q[t, h]/(t + h)(t − h) when T acts with weights ±1. Under this identification, c1T (ᏻP1 (n)) = nh and c1T (TP1 ) = 2h. Thus,
2h π∗ chT ᏻP1 (n) TdT P1 = π∗ enh . 1 − e−2h by
Lemma 3.1. If p(h) ∈ CH ∗T (P1 ) Q[t, h]/(t + h)(t − h), then π∗ (p(h)) is given
p(t) − p(−t) . π∗ p(h) = 2t ∞ i i 1 The same formula holds for π∗ : ∞ i=0 CH T (P ) → i=0 CH T (pt).
Proof. If BT is a model for calculating CH ∗T , then the model PT1 for calculating CH ∗T (P1 ) is a P1 -bundle over BT . Since π∗ (h) = 1, the projection formula implies i 1 that π∗ ( ai t i + h bj t j ) = bj t j . Now if p(h) = ai hi is in ∞ i=0 CH T (P ), 2 2 2i 2i then we can write (using the relation h = t ) x = a2i t + h a2i+1 t , so p(t) − p(−t) π∗ p(h) = , a2i+1 t 2i = 2t which is the desired formula. The lemma implies that χ T (ᏻP1 (n)) is equal to
2h e(n+1)h e(n+1)t − e−(n+1)t π∗ enh = π 2h , = ∗ eh − e−h et − e−t 1 − e−2h as desired. 3.4. Change of groups. Let H ⊂ G be an algebraic subgroup. In this section we discuss the relationship between G-equivariant and H -equivariant Grothendieck groups and the corresponding Riemann-Roch maps τ G and τ H . Lemma 3.2. Given an action of G × H on X, with H = 1 × H acting freely, there is a commutative diagram τ G×H / CH iG×H (X) GG×H (X) i
GG (X/H )
τG
/
i
CH iG (X/H ).
582
EDIDIN AND GRAHAM
Proof. Let V be a representation of G × H , and let U be an open set on which G × H acts freely. Then ᐂ = X ×H V is a G-equivariant vector bundle on X/H . The group G acts freely on the open set ᐁ = X ×H U , which surjects onto X/H . Identifying ᐁ/G with X ×G×H U , we see that the maps ρU and ρᐁ are the same. Using pairs of the form (V , U ), we define τ G×H as ρU , as before. On the other hand, by Remark 3.1 we can use vector bundles and open sets of the form (ᐂ, ᐁ) = G (X ×H V , X ×H U ) to define τX/H as ρᐁ . Hence, the maps coincide. In K-theory, part (a) of the next proposition is due to Thomason [Tho3], following ideas that go back to Atiyah and Segal [AS]. Part (c) has apparently been used implicitly by Thomason [Tho4], but we do not know of an explicit statement or proof. Proposition 3.2. Let H be a subgroup of G, and let X be an H -space. (a) There is a natural isomorphism of R(G)-modules GG (G×H X) GH (X) and a natural isomorphism of A∗G -modules ⊕ CH iG (G ×H X) ⊕ CH iH (X). (b) The isomorphisms are compatible with the τ maps, that is, the following diagram commutes: τG / CH iG (G ×H X) GG (G ×H X) i
∼ =
∼ =
GH (X)
τH
/
i
CH iH (X).
(c) If the H -action on X is the restriction of a G-action, then G ×H X ∼ = G/H × X. Under the isomorphism in (a), the “forgetful” map GG (X) → GH (X) corresponds to the flat pullback GG (X) → GG (G/H × X). A similar statement holds for Chow groups. Proof. We only prove parts (a) and (c) for Grothendieck groups; the arguments for Chow groups are similar. Let X be an H -space. Define an (H × G)-action on G × X by (h, g) · (g , x) = gg h−1 , h · x . (3) We also define an (H × G)-action on X by (4)
(h, g) · x = h · x.
The projection π : G × X → X is (H × G)-equivariant and, moreover, a G-principal bundle (G-torsor). The projection G × X → G ×H X is G-equivariant and an H torsor. Hence, we have equivalences of categories between the following: H -equivariant coherent sheaves on X, (G × H )-equivariant coherent sheaves on G × X, and G-equivariant coherent sheaves on G ×H X.
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
583
(This uses the general fact—see [Tho3]—that if M → N is a principal bundle, then there is an equivalence of categories between coherent sheaves on N and equivariant coherent sheaves on M.) Hence, GG (G ×H X) GH (X). Under these equivalences of categories, the G-equivariant vector bundle V ×(G×H X) → G×H X corresponds to the H -equivariant vector bundle V × X → X. This translates into the fact that the isomorphism is an isomorphism of R(G)-modules. This proves (a) in K-theory. Part (b) follows from Lemma 3.2. We prove (c). We now assume that the H -action on X is the restriction of a Gaction. We can then define another (H × G)-action on X, this time by (h, g) · x = g · x.
(5)
Let Ᏺ be a G-equivariant sheaf on X. Then Ᏺ is naturally an (H × G)-equivariant sheaf, with respect to either action of H × G. As was noted above, the projection π : G × X → X is (H × G)-equivariant with respect to action (4) on X, so π ∗ Ᏺ is an (H × G)-equivariant sheaf on G × X. Next, the action map a : G × X → X is (H ×G)-equivariant with respect to action (5) on X. Therefore, a ∗ Ᏺ is also (H ×G)equivariant. In particular, equations (4) and (5) give two (G × G)-actions on X. Since Ᏺ is a G-equivariant sheaf on X, by definition there is an isomorphism of coherent sheaves θ : π ∗ Ᏺ → a ∗ Ᏺ satisfying the cocycle condition (see [Tho3, Section 1.2]), which implies the following lemma. Lemma 3.3. If Ᏺ is a G-equivariant sheaf on X, then the map θ : π ∗ Ᏺ → a ∗ Ᏺ is an isomorphism of (G × G)-equivariant sheaves on G × X. Consider the diagram GG (X) GH (X)
π∗
π∗
/ GG×G (G × X)
/ GG (G/G × X)
/ GH ×G (G × X)
/ GG (G/H × X).
The left two vertical maps are the forgetful maps, and the left square commutes. The map GH ×G (G × X) → GG (G/H × X) comes from taking the quotient by the subgroup 1 × H and using the identification (G × X)/(1 × H ) with G/H × X taking (g, x) to (gH, gx). Likewise, we identify GG×G (G × X) with GG (G/G × X) = GG (X). Tracing through the definitions, the latter identification is given by (a ∗ )−1 . By Lemma 3.3 the composition along the top row is the identity. The right vertical arrow is flat pullback, and the reader can verify that the right square also commutes. This proves Proposition 3.2. 4. The Riemann-Roch isomorphism. All spaces considered here are again assumed to be separated so that we can apply the Riemann-Roch theorems of [F]. Let G ⊂ GLn and let I ⊂ R(GLn ) be the augmentation ideal. In this section we use
584
EDIDIN AND GRAHAM
G (X) to denote the I -adic completion of GG (X)⊗Q, and we continue the notation G i to denote CH G (X) ⊗ Q by CH iG (X). The purpose of this section is to prove the following theorem.
Theorem 4.1. Let X be a separated algebraic space. The map τ
G
G (X) −→ : G
∞ i=0
CH iG (X)
is an isomorphism. G (X) is also the completion of GG (X) with Remark 4.1. By Corollary 6.1, G respect to IG ⊂ R(G).
Proof of Theorem 4.1. By Proposition 3.2 we obtain a commutative diagram GGLn GLn ×G X
∼ =
G (X) G
τ GLn
∞
i i=0 CH GLn
GLn ×G X
∼ =
∞
τG
i i=0 CH G (X).
Thus, to prove the theorem it suffices to show that τ GLn is an isomorphism. We do this in two steps. 4.1. G = B is the group of upper triangular matrices. Let ᐂ = {Vn , Un } be the system of good representations and open subsets constructed in the proof of Theorem 2.2. Since B acts freely on X ×Un , Theorem 3.1(e) and the nonequivariant RiemannRoch theorem (see [F, Theorem 18.3], extended to separated algebraic spaces in [Gi]), imply that τ B : GB (X × Un ) −→ CH ∗B (X × Un ) is an isomorphism. By Theorem 3.1, the map τ B is compatible with the maps in the inverse system of representations constructed in Section 2. In this way we obtain an isomorphism B G (X)/ ker kn −→ lim CH ∗B (X)/ ker rn . τ B : lim ←(V ,U )∈ᐂ
←(V ,U )∈ᐂ
By Proposition 2.2, the IB -adic completion of GB (X) is isomorphic to the I -adic B (X), so by Theorem 2.1, completion G B B (X). G (X)/ ker kn G lim ←(V ,U )∈ᐂ
Also, by Theorem 2.1 and Proposition 2.1, lim
←(V ,U )∈ᐂ
∗ (X) CH ∗B (X)/ ker rn CH B
∞ i=0
CH iB (X).
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
585
The case G = B follows. 4.2. G = GLn . Let π : G/B × X → X be the projection to the second factor. By Proposition 3.2, we can identify GB (X) with GG (G/B ×X) so that the forgetful map i ! : GG (X) → GB (X) corresponds to the flat pullback π ∗ : GG (X) → GG (G/B × X). Since B is the group of upper triangular matrices, the quotient G/B is complete, and we can define a map i! : GB (X) → GG (X) by using the identification GB (X) ∼ = GG (G/B × X) and setting i! to be the proper pushforward π! : GG (G/B × X) → GG (X). Lemma 4.1. The composition i! i ! = id, so we can identify GG (X) as a summand in GB (X). Proof. This follows from the argument of [AS], using the fact that G/B is a tower of projective bundles and the projective bundle theorem of [Tho3, Theorem 3.1]. We now turn to Chow theory. We use the identification CH ∗B (X) CH ∗G (G/B ×X) to define a map i∗ :
∞ i=0
CH iG (X) −→
∞ i=0
CH iB (X)
∞ i=0
CH iG (G/B × X)
by the formula α " −→ TdG (TG/B )π ∗ (α). We also set i∗ : CH ∗B (X) −→ CH ∗G (X) to be the composition π∗
CH ∗B (X) CH ∗G (G/B × X) −−→ CH ∗G (X). Lemma 4.2. The composition i∗ i ∗ = id, so we can identify CH ∗G (X) as a summand in CH ∗B (X). Proof. In general, if f : M → N is a smooth proper morphism with f! [ᏻM ] = [ᏻN ], then the Riemann-Roch theorem implies that f∗ (Td(Tf )f ∗ β) = β for all β ∈ CH ∗ (N ). (Indeed, by the projection formula we may assume β = 1. Then by Riemann-Roch f∗ (Td(Tf )) = 1.) Applying this to the case where M and N are the mixed spaces (G/B × X) ×G U and X ×G U yields the result. Lemma 4.3. We have (i) i! τ G = τ B i∗ , (ii) τ B i ! = i ∗ τ G .
586
EDIDIN AND GRAHAM
Proof. In view of the previous lemmas, (i) follows from Theorem 3.1(b) and (ii) from Theorem 3.1(d)(i). Since τB is an isomorphism, the previous lemmas imply that τG is also an isomorphism when G = GLn . This proves the theorem. 5. Actions with finite stabilizers. In this section we assume that the group G acts on the space X with finite stabilizers and that rational coefficients are used throughout. In this situation, we can obtain more refined information about the completions in Section 2. In particular we prove the following theorem. Theorem 5.1. Suppose that an algebraic group G acts on a space X with finite stabilizers. Let IG be the augmentation ideal. Then there is an isomorphism G (X) −→ GG (X) . G IG
We begin with a commutative algebra lemma. Lemma 5.1. Let R be a ring and I a maximal ideal, and let M be an R-module. where “” denotes the I -adic completion. Suppose that I k MI = 0. Then MI = M, Proof. The rings R/I k are local, so (R/I k )I = R/I k . Thus, MI /I k MI := M ⊗ RI /I k RI = M ⊗ R/I k := M/I k M. Since I k MI = 0, MI is I -adically complete, proving the result. I ∼ Hence, M = M. Because of the lemma, Theorem 5.1 is an immediate consequence of the following result, whose proof uses Corollary 6.1. Proposition 5.1. If G acts on X with finite stabilizers, then for sufficiently large k, IGk GG (X)IG = 0. Proof. The proof proceeds as in previous proofs, by building up from a torus to GLn to a general group. Step 1: G = T is a torus. By Thomason’s generic slice theorem [Tho1, Proposition 4.10], there is a T -equivariant open subset U ⊂ X such that U is equivariantly isomorphic to S ×Y , where S = T /H for a diagonalizable subgroup H ⊂ T and where T acts trivially on Y . If W is any representation of T which is trivial on H , then the vector bundle U × W → U is equivariantly trivial, so ([W ] − dim W )GT (U ) = 0. Hence, under the map R(S) → R(T ), the ideal J = IS R(T ) annihilates GT (U ). Note that J ⊂ IT . Since the stabilizers are finite, H is finite. Let T be the character group of T , and let S be the character group of S. Then we can choose a basis e1 , . . . , en of T such that d1 e1 , . . . , dn en is a basis for S, where di are positive integers. Using this basis, we have R(T ) ∼ = Q[T] ∼ = Q[t1 , . . . , tn ]
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
and
587
J = t1d1 − 1, . . . , tndn − 1 ⊂ IT = (t1 − 1, . . . , tn − 1).
Hence, JIT = IT R(T )IT , so IT GT (U )IT = 0. By Noetherian induction, we may assume that ITk GT (X − U )IT = 0 for some sufficiently large k. Then by the localization exact sequence, ITk+1 GT (X)IT = 0. Note that the above discussion and Noetherian induction also show that there is a nonzero ideal JT ⊂ IT such that JT GT (X) = 0. Step 2: G = GLn . Let B be the group of upper triangular matrices in GLn , and let T be the group of diagonal matrices. Restriction from B to T induces isomorphisms R(B) ∼ = R(T ) and GB (X) ∼ = GT (X) (see [Tho4, proof of Theorem 1.11]). We claim that there is an ideal JG ⊂ IG with the support of R(G)/JG finite, such that JG GG (X) = 0. The reason is as follows: The map R(G) ∼ = R(B)W → R(B) is finite, so if JG is the inverse image of JB := JT under this map, then the support of R(G)/JG is finite. Since GG (X) is an R(G)-submodule of GB (X) (see the proof of Theorem 4.1), we have JG GG (X) = 0, proving the claim. Because R(G)/JG has finite support and JG is contained in the maximal ideal IG , it follows that JG R(G)IG is IG -primary, so it contains IGk R(G)IG for some k. (Note that R(GLn ) = Q[t1 , . . . , tn ]Sn is Noetherian.) Then IGk GG (X)IG = 0 as desired. Step 3: The general case. Embed G ⊂ GLn . In this case, GG (X) = GGLn (GLn ×G k GG (X) X). Thus, by step 2, IGL IGLn = 0 for some positive integer k. Equivalently, n k G given x ∈ G (X) and a ∈ IGLn , there exists b ∈ R(GLn ) − IGLn such that abx = 0. Now, the action of R(GLn ) on GG (X) factors through φ : R(GLn ) → R(G). Also, the augmentation map R(GLn ) → Q factors through φ, so if x ∈ R(GLn )−IGLn , then φ(x) ∈ R(G) − IG . Corollary 6.1 implies that for some d, IGd ⊂ φ(IGLn )R(G). This implies that IGdk GG (X)IG = 0. Indeed, if a ∈ IGdk and x ∈ GG (X), write a = φ(a) for k a ∈ IGL and choose b as in the preceding paragraph; then b = φ(b) ∈ R(G) − I (G) n and a b x = 0. This proves the result. Remark 5.1. The proof of the proposition implies that for G = T or G = GLn , there is an ideal J ⊂ IG ⊂ R(G) with the support of R(G)/J finite, such that J GG (X) = 0. If G is a diagonalizable group embedded in a torus T , then we know from the general theory of characters of diagonalizable groups (see [Bor, Section 8.12]), that the map R(T ) → R(G) is finite. Thus, an ideal J exists as above which annihilates GG (X). In characteristic zero, the map R(GLn ) → R(G) is finite for arbitrary G (see [Seg, Proposition 3.2]), and we can make the same conclusion about the support of GG (X). However, in characteristic p we don’t even know if R(G) is always Noetherian! Proposition 5.2. If G acts with finite stabilizers, then the rational equivariant Chow groups CH ∗G (X) are generated by invariant cycles on X. In particular CH iG (X) = 0 for i > dim X.
588
EDIDIN AND GRAHAM
Proof. By the generalization of [Se, Theorem 6.1] to algebraic spaces, there is a finite cover f : X → X on which G acts freely. Since G acts freely, CH ∗G (X ) is generated by invariant cycles (see [EG2, Proposition 8]). On the other hand, the proper pushforward f∗ : CH ∗G (X ) → CH ∗G (X) is surjective because f is finite and surjective, and we are using rational coefficients. Therefore, CH ∗G (X) is generated by invariant cycles. The preceding proposition implies that the equivariant Chow groups are complete. Using the fact that I and IG generate the same topology (see Corollary 6.1), we can restate the Riemann-Roch theorem. Corollary 5.1. Suppose that X is separated and G acts with finite stabilizers. There is a map ∼ τ : GG (X) −→ GG (X)IG −→ CH ∗G (X) satisfying properties (a)–(e) of Theorem 3.1. Remark 5.2. When G acts properly on a separated scheme X with reduced stabilizers, then Vistoli [Vi] asserts the existence of a map τX : GG (X) −→ CH ∗ [X/G] ; here [X/G] is the Deligne-Mumford quotient stack. By [EG2, Proposition 14], CH ∗ [X/G] = CH ∗G (X). Vistoli noted that his map need not be an isomorphism, and he made a conjecture about its kernel (see [Vi, Conjecture 2.4]). The conjecture states that if α ∈ ker τX , then ξ α = 0, where ξ is the class of some perfect complex with everywhere nonzero rank. We expect that his map is the same as ours in this case. Unfortunately, because he did not write his map down and did not state whether it satisfied properties (d)(i) and (e) of Theorem 3.1, we cannot positively assert this. However, for our map τX , Vistoli’s conjecture is true and his statement can be refined. Corollary 5.2 (Vistoli’s conjecture). We have α ∈ ker τXG : GG (X) −→ CH ∗ ([X/G]) if and only if there exists a virtual representation 3 ∈ R(G) of nonzero rank such that 3α = 0. Proof. The kernel is just the kernel of the localization map GG (X) → GG (X)IG . 5.1. The case G is diagonalizable. By Remark 5.1, GG (X) is supported as an R(G)-module at a finite number of primes, each of which is maximal. Denote these
589
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
ideals by P = P0 , P1 , . . . , Pk . Following [Seg], each prime Pi corresponds to a finite subgroup (called the support of Pi ) Hi ⊂ G. It is defined as the minimal element of the set of subgroups H ⊂ G such that Pi ∈ I m(Spec R(H ) → Spec R(G)). Note that different Pi ’s may have the same support. This definition makes sense for any group G, but Hi is only defined up to conjugation as a subgroup of G. In our case, G is abelian so the Hi ’s are uniquely determined. Following [Tho5, Lemma 1.1 and Proposition 1.2], we give an explicit construction of the support H of a prime ideal P ⊂ R(G). Since G is diagonalizable, R(G) = Z[N], where N is a finitely generated abelian group without p-torsion (where p = char k). Given a prime P ⊂ R(G), set KP = {n ∈ N | 1 − n ∈ P }. The equivalence of categories between finitely generated abelian groups (without ptorsion) and diagonalizable groups (see [Bor, Section 8]) means that quotient N/KP determines a unique subgroup H ⊂ G with the property that R(H ) = Z[N/KP ]. When P is maximal, KP has finite index in N and H is a finite group. The representation ring of the quotient G/H is the subring Z[KP ] of Z[N ] = R(G). If I is the augmentation ideal of R(G/H ), then this construction shows that I = P ∩ R(G/H ). Denote by X i the subscheme fixed by Hi . Theorem 5.2. Suppose that G is diagonalizable and acts on a separated space X with finite stabilizers. Then GG (X)Pi CH ∗G/Hi (X i ) ⊗R(G/Hi ) R(G)Pi . In particular, there is an isomorphism CH ∗G/Hi (X i ) ⊗R(G/Hi ) R(G)Pi . GG (X) i
Proof. By the localization theorem for diagonalizable group schemes, GG (X)Pi i Pi (see [Tho5, Theorem 2.1]). Since Hi ⊂ G acts trivially on X , we have
GG (X i )
GG (X i ) GG/Hi (X i ) ⊗R(G/Hi ) R(G) (see [Tho2, Lemma 5.6]). Let I i be the augmentation ideal of G/Hi . Since we have Pi ∩ R(G/Hi ) = I i , G/H i G/H i i (X ) ⊗ i (X ) ⊗ G R(G/Hi ) R(G) P = G I i R(G/Hi ) R(G) P . i
i
Thus, by the Riemann-Roch isomorphism of Corollary 5.1, we have G/H i ∗ i i (X ) ⊗ G I i R(G/Hi ) RG P CH G/Hi (X ) ⊗R(G/Hi ) RG P . i
i
590
EDIDIN AND GRAHAM
∗ ∗ i (Here the Chern character ch : R(G/Hi ) → ∞ i=0 AG/Hi makes CH G/Hi (X ) into an R(G/Hi )-module.) The first statement follows. As noted in Remark 5.1, there is an ideal J ⊂ R(G) such that R(G)/J is supported at a finite number of points and J GG (X) = 0. Hence, GG (X) GG (X) ⊗R(G) R(G)/J. Then J = Q1 ∩ Q2 ∩ · · · ∩ Qk with Qi a Pi -primary ideal. By the Chinese remainder theorem, r r R(G)/Qi = R(G)/J P R(G)/J i=1
so
GG (X)
i=1
i
GG (X)Pi
i
and the second statement follows. Remark 5.3. This result was first obtained by Angelo Vistoli (unpublished). It is also related to the Riemann-Roch theorem for algebraic stacks proved by Toen [To]. 6. More on completions. There are other natural completions of GG (X) and CH ∗G (X) besides those of Section 2. The purpose of this section is to prove that the different definitions give isomorphic completions. As an application of these results, we prove a special case of a conjecture of Köck. To begin, fix an embedding of G into GLn . Then GG (X) is an R(GLn )-module. G (X) was defined to be the completion of GG (X) along the augRecall that G ∗ (X) is the completion of CH ∗ (X) along the mentation ideal I of R(GLn ), and CH G G ∗ augmentation ideal of AG (pt). We refer to these as the “point completions” because they are defined using ideals in the equivariant groups of a point. Let IX ⊂ K G (X) denote the augmentation ideal, that is, the ideal of virtual vector G (X) denote the completion of GG (X) along I . Let bundles of rank zero, and let G X ∗ (X) denote the completion ∗ JX ⊂ AG (X) denote the augmentation ideal, and let CH G of CH ∗G (X) along JX . We refer to these as the “X-completions” because they are defined using ideals in the equivariant groups of X. In the proof below, it is necessary to distinguish between ideals corresponding to different groups. We use a subscript to indicate this, for example, IX,G ⊂ K G (X). Note that the X-completions only depend on G and X, while a priori, the point completions depend in addition on an embedding of G into GLn . Corollary 6.1 implies that the point completion is independent of the embedding. The main result of this section is that the point and X-completions are isomorphic. Theorem 6.1. (a) The I -adic and IX -adic topologies on GG (X) coincide. Hence, we have an isomorphism of completions G (X) G G (X). G
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
591
(b) The J -adic and JX -adic topologies on CH ∗G (X) coincide. Hence, we have an isomorphism of completions ∗ (X) CH ∗ (X). CH G G
Proof. We only prove (a); the proof of (b) is similar. To show that the filtrations induced by powers of the ideals I and IX induce the same topology, we must check two things. First, we must show that for any n, there exists an r such that I r GG (X) ⊆ IXn GG (X). This is clear because under the map R(GLn ) → K G (X), the image of I is contained in IX , so we can take r = n. Second, we must show that for any n, there exists an r such that IXr GG (X) ⊆ I n GG (X). As above, we do this in steps: first for G = B the group of upper triangular matrices, then for G = GLn , and finally for arbitrary G. Suppose then that B ⊂ GLn is the group of upper triangular matrices. We use the notation of the proof of Theorem 2.2. We know there exists m such that ker km ⊆ IBn GB (X). Since IB and I generate the same topology on R(B), we can assume ker km ⊆ I n GB (X). So we must show that there exists r such that IXr GB (X) ⊆ ker km , that is, such that IXr GB (X × Um ) = 0. Since B acts freely on X × Um , we have GB (X × Um ) G(X ×B Um ). Under this isomorphism, IXr GB (X × Um ) ⊆ ar G X ×B Um , where ar denotes the augmentation ideal of K(X ×B Um ). By Lemma 2.4, we have ar G(X ×B Um ) = 0 for r 0. The analogous statement for Chow groups, that JXr CH ∗B (X × Um ) = 0 for r > dim(X ×B Um ), also holds. Assume now that G = GLn . Then we have GB (X) GG (G/B × X)
o
/ GG (X),
where the two maps are i! and i ! . We have proved that there exists r such that r GB (X) ⊂ I n GB (X). IX,B
Hence,
r i! IX,B GB (X) ⊂ i! I n GB (X) = I n GG (X),
where the last equality follows by the projection formula. Now, we also have r r GG (X) ⊂ IX,B GB (X). i ! IX,G Combining these facts, we see that r GG (X) ⊂ I n GG (X). i! i ! IX,G
592
EDIDIN AND GRAHAM
r GG (X) ⊂ I n GG (X). Since i! i ! is the identity, IX,G Finally, consider the case where G → GLn is any subgroup. By Proposition 3.2, there is an isomorphism of R(G)-modules GG (X) GGLn GLn ×G X .
Under this isomorphism I n GG (X) corresponds to I n GGLn (GLn ×G X). Moreover, r GG (X) corresponds to I r IX,G GGLn (GLn ×G X). This follows from the GLn ×G X,GLn fact that the equivalence of categories obtained from descent of (GLn ×G)-equivariant sheaves on GLn ×X takes locally free sheaves to locally free sheaves of the same rank (see [EGAIV, Proposition 2.5.2]). Because of these correspondences, the theorem follows from the case where G = GLn . Corollary 6.1. If H ⊂ G, then the topology on R(H ) induced by the ideal IG R(H ) is the same as the IH -adic topology. Proof. By embedding H ⊂ G ⊂ GLn , it suffices to prove the result for G ⊂ GLn . The corollary now follows by applying the theorem when X = pt. Remark 6.1. In characteristic zero this is the same as [Seg, Corollary 3.9], but in characteristic p the result is new. For a large class of group schemes over perfect fields Thomason [Tho1, Corollary 3.3] showed that IG -adic and IGLn -adic topologies are the same on the mod l ν equivariant G-theory localized at the Bott element. G Observe that the higher equivariant K-groups GG i (X) (resp., Ki (X)) are also G modules over R(G) and K (X). Thus we can define completions of these groups with respect to the ideals IX and IG . If X is a regular scheme, then KiG (X) = GG i (X) and we can prove a corollary about higher K-theory as well. Part (b) proves a conjecture of Köck [Ko, Conjecture 5.6] for regular schemes over fields.
G G Corollary 6.2. (a) If X is a regular G-scheme, then K i (X) Ki (X). (b) Let f : Y → X be an equivariant proper morphism of regular G-schemes. Then the pushforward f∗ : KiG (Y ) → KiG (X) induces a map of completions G G f∗ : K i (Y ) −→ Ki (X).
Proof. The action of R(G) on KiG (X) factors through the map R(G) → K G (X). Since K G (X) = GG (X), Theorem 6.1 implies that the ideals IX ⊂ K G (X) and the G G IG K G (X) generate the same topology. Thus, K i (X) Ki (X), which proves (a). By the projection formula applied to the commutative triangle /X } } } }} }~ } pt
Y
RIEMANN-ROCH FOR EQUIVARIANT CHOW GROUPS
593
we have f∗ (I k KiG (Y )) = I k f∗ KiG (Y ) ⊂ I k KiG (X). Hence, f∗ is continuous with respect to the I -adic topology, proving (b). Remark 6.2. In its full form, Köck’s conjecture asserts that if G/S is a flat group scheme and if X → Y is any equivariant projective local complete intersection morphism, then there is a pushforward G G f∗ : K i (X) −→ Ki (Y )
of completions. This conjecture is quite subtle because (despite the suggestive notation) the completions are taken with respect to different ideals, and if X and Y are not regular, there is no obvious way of comparing the topologies. Remark 6.3. The K0 version of Köck’s conjecture has been proved [CEPT] for finite group schemes acting on regular projective varieties over rings of integers of number fields. References [AS] [BFM] [Bor] [Bo]
[Br] [CEPT]
[E]
[EG1] [EG2] [F] [FL] [Gi] [SGA]
[EGAII]
M. Atiyah and G. Segal, Equivariant K-theory and completion, J. Differential Geom. 3 (1969), 1–18. P. Baum, W. Fulton, and R. MacPherson, Riemann-Roch for singular varieties, Inst. Hautes Études Sci. Publ. Math. 45 (1975), 101–145. A. Borel, Linear Algebraic Groups, 2d ed., Grad. Texts in Math. 126, Springer, New York, 1991. R. Bott, “The index theorem for homogeneous differential operators” in Differential and Combinatorial Topology (A Symposium in Honor of Marston Morse), Princeton Math. Ser. 27, Princeton Univ. Press, Princeton, 1965, 167–186. M. Brion, Equivariant Chow groups for torus actions, Transform. Groups 2 (1997), 225–267. T. Chinburg, B. Erez, G. Pappas, and M. J. Taylor, Riemann-Roch type theorems for arithmetic schemes with a finite group action, J. Reine Angew. Math. 489 (1997), 151–187. D. Edidin, “Notes on the construction of the moduli space of curves” to appear in Recent Progress in Intersection Theory (Bologna, 1997), ed. A. Vistoli, Birkhäuser, Boston. D. Edidin and W. Graham, Characteristic classes in the Chow ring, J. Algebraic Geom. 6 (1997), 431–443. , Equivariant intersection theory, Invent. Math. 131 (1998), 595–634. W. Fulton, Intersection Theory, Ergeb. Math. Grenzgeb. (3) 2, Springer, Berlin, 1984. W. Fulton and S. Lang, Riemann-Roch Algebra, Grundlehren Math. Wiss. 277, Springer, New York, 1985. H. Gillet, Intersection theory on algebraic stacks and Q-varieties, J. Pure Appl. Algebra 34 (1984), 193–240. A. Grothendieck, Revêtements étales et groupe fondamental, Séminaire de Géométrie Algébrique du Bois-Marie (SGA1), Lecture Notes in Math. 224, Springer, Berlin, 1971. A. Grothendieck and J. Dieudonné, Éléments de géométrie algébrique, II: Étude globale élémentaire de quelques classes de morphismes, Inst. Hautes Études Sci. Publ. Math. 8 (1961).
594 [EGAIV] [Kn] [Ko] [MFK] [Seg] [Ser] [Se] [Tho1] [Tho2] [Tho3]
[Tho4] [Tho5] [To] [T] [Vi]
EDIDIN AND GRAHAM , Éléments de géométrie algébrique, IV: Étude locale des schémas et des morphismes de schémas, II, Inst. Hautes Études Sci. Publ. Math. 24 (1965). D. Knutson, Algebraic Spaces, Lecture Notes in Math. 203, Springer, Berlin, 1971. B. Köck, The Grothendieck-Riemann-Roch theorem for group scheme actions, Ann. Sci. École Norm. Sup. (4) 31 (1998), 415–458. D. Mumford, J. Fogarty, and F. Kirwan, Geometric Invariant Theory, 3d ed., Ergeb. Math. Grenzgeb. (2) 34, Springer, Berlin, 1994. G. Segal, The representation ring of a compact Lie group, Inst. Hautes Études Sci. Publ. Math. 34 (1968), 113–128. J.-P. Serre, Groupes de Grothendieck des schémas en groupes réductifs déployés, Inst. Hautes Études Sci. Publ. Math. 34 (1968), 37–52. C. S. Seshadri, Quotient spaces modulo reductive algebraic groups, Ann. of Math. (2) 95 (1972), 511–556; Errata, Ann. of Math. (2) 96 (1972), 599. R. Thomason, Comparison of equivariant and topological K-theory, Duke Math. J. 53 (1986), 795–825. , Lefschetz-Riemann-Roch theorem and coherent trace formula, Invent. Math. 85 (1986), 515–543. , “Algebraic K-theory of group scheme actions” in Algebraic Topology and Algebraic K-Theory (Princeton, 1983), Ann. of Math. Stud. 113, Princeton Univ. Press, Princeton, 1987, 539–563. , Equivariant algebraic vs. topological K-homology Atiyah-Segal-style, Duke Math. J. 56 (1988), 589–636. , Une formule de Lefschetz en K-théorie équivariante algébrique, Duke Math. J. 68 (1992), 447–462. B. Toen, Riemann-Roch theorems for Deligne-Mumford stacks, preprint, http://xxx.lanl. gov/abs/math.AG/9803076. B. Totaro, The Chow ring of the symmetric group, to appear in Contemp. Math. A. Vistoli, “Equivariant Grothendieck groups and equivariant Chow groups” in Classification of Irregular Varieties (Trento, 1990), Lecture Notes in Math. 1515, Springer, Berlin, 1992, 112–133.
Edidin: Department of Mathematics, University of Missouri, Columbia, Missouri 65211, USA;
[email protected] Graham: Department of Mathematics, University of Georgia, Boyd Graduate Studies Research Center, Athens, Georgia 30602, USA;
[email protected]