Number Theory: Paris 1992-3 (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor J.W.S. Cassels, Department of Pure Mathematic...

Author: Sinnou David

90 downloads 743 Views 3MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor J.W.S. Cassels, Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, 16 Mill Lane, Cambridge CB2 ISB, England. The titles below are available from booksellers, or, in case of difficulty, from Cambridge University Press. 34 46 50 59 66 69 76 77 83 86 87 88 89 90 92 93 94 95 96 97 98 99 100 104 105 107 108 109 110 113 114 115

116 118 119 121

122 125 126 128 129 130 131 132 133 134 135 136 137 138 139 140 141

144 145 146 147 148 149 150

Representation theory of Lie groups, M.F. ATIYAH et al p-adic analysis: a short course on recent work, N. KOBLITZ Commutator calculus and groups of homotopy classes, H.J. BAUES Applicable differential geometry, M. CRAMPIN & F.A.E. PIRANI Several complex variables and complex manifolds II, M.J. FIELD Representation theory, I.M. GELFAND et al Spectral theory of linear differential operators and comparison algebras, H.O. CORDES Isolated singular points on complete intersections, E.J.N. LOOUENGA Homogeneous structures on Riemannian manifolds, F. TRICERRI & L. VANHECKE Topological topics, I.M. JAMES (ed) Surveys in set theory, A.R.D. MATHIAS (ed) FPF ring theory, C. FAITH & S. PAGE An F-space sampler, N.J. KALTON, N.T. PECK & J.W. ROBERTS Polytopes and symmetry, S.A. ROBERTSON Representation of rings over skew fields, A.H. SCHOFIELD Aspects of topology, I.M. JAMES & E.H. KRONHEIMER (eds) Representations of general linear groups, G.D. JAMES Low-dimensional topology 1982, R.A. FENN (ed) Diophantine equations over function fields, R.C. MASON Varieties of constructive mathematics, D.S. BRIDGES & F. RICHMAN Localization in Noetherian rings, A.V. JATEGAONKAR Methods of differential geometry in algebraic topology, M. KAROUBI & C. LERUSTE Stopping time techniques for analysts and probahilists, L. EGGHE Elliptic structures on 3-manifolds, C.B. THOMAS A local spectral theory for closed operators, I. ERDELYI & WANG S14ENGWANG Compactification of Siegel moduli schemes, C-L. CHAI Some topics in graph theory, H.P. YAP Diophantine analysis, J. LOXTON & A. VAN DER POORTEN (eds) An introduction to surreal numbers, H. GONSHOR Lectures on the asymptotic theory of ideals, D. REES Lectures on Bochner-Riesz means, K.M. DAVIS & Y-C. CHANG An introduction to independence for analysts, H.G. DALES & W.H. WOODIN Representations of algebras, P.J. WEBB (ed) Skew linear groups, M. SHIRVANI & B. WEHRFRITZ Triangulated categories in the representation theory of finite-dimensional algebras, D. HAPPEL Proceedings of Groups - Si Andrews 1985, E. ROBERTSON & C. CAMPBELL (eds) Non-classical continuum mechanics, R.J. KNOPS & A.A. LACEY (eds) Commutator theory for congruence modular varieties, R. FREESE & R. MCKENZIE Vander Corpus's method of exponential sums, S.W. GRAHAM & G. KOLESNIK Descriptive set theory and the structure of sets of uniqueness, A.S. KECHRIS & A. LOUVEAU The subgroup structure of the finite classical groups, P.B. KLEIDMAN & M.W. LIEBECK Model theory and modules, M. PREST Algebraic, extremal & metric combinatorics, M-M. DEZA, P. FRANKL & I.G. ROSENBERG (eds) Whitehead groups of finite groups, ROBERT OLIVER Linear algebraic monoids, MOHAN S. PUTCHA Number theory and dynamical systems, M. DODSON & J. VICKERS (eds) Operator algebras and applications, 1, D. EVANS & M. TAKESAKI (eds) Operator algebras and applications, 2, D. EVANS & M. TAKESAKI (eds) Analysis at Urbana, I, E. BERKSON, T. PECK, & J. UHL (eds) Analysis at Urbana, II, E. BERKSON, T. PECK, & J. UHL (eds) Advances in homotopy theory, S. SALAMON, B. STEER & W. SUTHERLAND (eds) Geometric aspects of Banach spaces, E.M. PEINADOR & A. RODES (eds) Surveys in combinatorics 1989, J. SiEMONS (ed) Introduction to uniform spaces, I.M. JAMES Homological questions in local algebra, JAN R. STROOKER Cohen-Macaulay modules over Cohen-Macaulay rings, Y. YOSHINO Continuous and discrete modules, S.H. MOHAMED & B.J. MULLER Helices and vector bundles, AN. RUDAKOV et al Solitons, nonlinear evolution equations and inverse scattering, M. ABLOWITZ & P. CLARKSON Geometry of low-dimensional manifolds I, S. DONALDSON & C.B. THOMAS (eds)

151

152 153 154 155 156 157 158 159 160 161

162 163 164 165 166 167 168 169 170 171

172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191

192 193 194 195 196 197 198 199

200 201

202 203 204 205 206 207 208 209 210 211 212 215 216 221

Geometry of low-dimensional manifolds 2, S. DONALDSON & C.B. THOMAS (eds) Oligomorphic permutation groups, P. CAMERON L-functions and arithmetic, J. COATES & M.J. TAYLOR (eds) Number theory and cryptography, J. LOXTON (ed) Classification theories of polarized varieties, TAKAO FUJITA Twistors in mathematics and physics, T.N. BAILEY & R.J. BASTON (eds) Analytic pro-p groups, J.D. DIXON, M.P.F. DU SAUTOY, A. MANN & D. SEGAL Geometry of Banach spaces, P.F.X. MULLER & W. SCHACHERMAYER (eds) Groups St Andrews 1989 volume 1, C.M. CAMPBELL & E.F. ROBERTSON (eds) Groups St Andrews 1989 volume 2, C.M. CAMPBELL & E.F. ROBERTSON (eds) Lectures on block theory, BURKHARD KULSHAMMER Harmonic analysis and representation theory, A. FIGA-TALAMANCA & C. NEBBIA Topics in varieties of group representations, S.M. VOVSI Quasi-symmetric designs, M.S. SHRIKANDE & S.S. SANE Groups, combinatorics & geometry, M.W. LIEBECK & J. SAXL (eds) Surveys in combinatorics, 1991, A.D. KEEDWELL (ed) Stochastic analysis, M.T. BARLOW & N.H. BINGHAM (eds) Representations of algebras, H. TACHIKAWA & S. BRENNER (eds) Boolean function complexity, M.S. PATERSON (ed) Manifolds with singularities and the Adams-Novikov spectral sequence, B. BOTVINNIK Squares, A.R. RAJWADE Algebraic varieties, GEORGE R. KEMPF Discrete groups and geometry, W.J. HARVEY & C. MACLACHLAN (eds) Lectures on mechanics, J.E. MARSDEN Adams memorial symposium on algebraic topology 1, N. RAY & G. WALKER (eds) Adams memorial symposium on algebraic topology 2, N. RAY & G. WALKER (eds) Applications of categories in computer science, M. FOURMAN, P. JOHNSTONE, & A. PITTS (eds) Lower K- and L-theory, A. RANICKI Complex projective geometry, G. ELLINGSRUD et al Lectures on ergodic theory and Pesin theory on compact manifolds, M. POLLICOTT Geometric group theory I, G.A. NIBLO & M.A. ROLLER (eds) Geometric group theory II, G.A. NIBLO & M.A. ROLLER (eds) Shintani zeta functions, A. YUKIE Arithmetical functions, W. SCHWARZ & J. SPILKER Representations of solvable groups, O. MANZ & T.R. WOLF Complexity: knots, colourings and counting, D.J.A. WELSH Surveys in combinatorics, 1993, K. WALKER (ed) Local analysis for the odd order theorem, H. BENDER & G. GLAUBERMAN Locally presentable and accessible categories, J. ADAMEK & J. ROSICKY Polynomial invariants of finite groups, DJ. BENSON Finite geometry and combinatorics, F. DE CLERCK et al Symplectic geometry, D. SALAMON (ed) Computer algebra and differential equations, E. TOURNIER (ed) Independent random variables and rearrangement invariant spaces, M. BRAVERMAN Arithmetic of blowup algebras, WOLMER VASCONCELOS Microlocal analysis for differential operators, A. GRIGIS & J. SJOSTRAND Two-dimensional homotopy and combinatorial group theory, C. HOG-ANGELONI, W. METZLER & A.J. SIERADSKI (eds) The algebraic characterization of geometric 4-manifolds, LA. HILLMAN Invariant potential theory in the unit ball of Cn, MANFRED STOLL The Grothendieck theory of dessins d'enfant, L. SCHNEPS (ed) Singularities, JEAN-PAUL BRASSELET (ed) The technique of pseudodifferential operators, H.O. CORDES Hochschild cohomology of von Neumann algebras, A. SINCLAIR & R. SMITH Combinatorial and geometric group theory, A.J. DUNCAN, N.D. GILBERT & J. HOWIE (eds) Ergodic theory and its connections with harmonic analysis, K. PETERSEN & I. SALAMA (eds) Lectures on noncommutative geometry, J. MADORE Groups of Lie type and their geometries, W.M. KANTOR & L. DI MARTINO (eds) Vector bundles in algebraic geometry, N.J. HITCHIN, P. NEWSTEAD & W.M. OXBURY (eds) Arithmetic of diagonal hypersurfaces over finite fields, F.Q. GOUVEA & N. YUI Hilbert C*-modules, E.C. LANCE Groups 93 Galway / St Andrews I, C.R. CAMPBELL et al Groups 93 Galway / St Andrews II, C.R. CAMPBELL et al Number theory, S. DAVID (ed) Stochastic partial differential equations, A. ETHERIDGE (ed) Harmonic approximation, S. GARDINER

London Mathematical Society Lecture Note Series. 215

Number Theory Seminaire de Theorie des Nombres de Paris 1992-3

Edited by

Sinnou David Universite Pierre et Marie Curie, Paris

CAMBRIDGE UNIVERSITY PRESS

Published by the Press Syndicate of the University of Cambridge The Pitt Building, Trumpington Street, Cambridge C132 I RP 40 West 20th Street, New York, NY 10011-4211, USA 10 Stamford Road, Oakleigh, Melbourne 3166, Australia

© Cambridge University Press 1995 First published 1995 Library of Congress cataloging in publication data available British Library cataloguing in publication data available

ISBN 0 521 55911 1 paperback Transferred to digital printing 2004

Number Theory Paris 1992-93

Table des Matieres K. ALLADI

Decomposition of the integers as a direct sum of two subsets ..................... I Y. ANDRE

Theorie des motifs et interpretation geometrique des valeurs p-adiques de G functions (une introduction) .................................................................37 N. BOSTON

A refinement of the Faltings-Serre method ...............................................61 J. BOXALL

Sous-varietes algebriques de varietes semiabeliennes sur un corps fini..69 P. COHEN Proprietes transcendantes des fonctions automorphes .............................81 E. FOUVRY et M. Ram MURTY

Supersingular primes common to two elliptic curves .................................91 V GRITSENKO

Arithmetical lifting and its applications ..................................................103 G. HARMAN

Towards an arithmetical analysis of the continuum ............................... 127 H. HIDA

On A-adic forms of half integral weight for SL (2) /Q .............................139 J. MARTINET

Structures algebriques sur les reseaux .................................................. 167 H. OUKHABA

Construction of elliptic units in function fields ........................................187 I. PAYS

Arbres, ordres maximaux etformes quadratiques entieres ..................... 209 T.N. SHOREY

On a conjecture that a product of k consecutive positive integers is never equal to a product of mk consecutive positive integers except for8.9.10=6! .......................................................................................231 P. STEVENHAGEN

Redei-matrices and applications ...........................................................245

R. TIJDEMAN

Decomposition of the integers as a direct sum of two subsets .................261 Y.G. ZAHRIN

CM Abelian varieties with almost ordinary reduction .............................. 277

Number Theory Paris 1992-93

Liste des conf6renciers 5 octobre : H. CARAYOL. - Representations galoisiennes et congruences 12 octobre : E. PEYRE. - Points de hauteur donnee sur une surface de Del Pezzo

D. BROWNAWELL. - Transcendance on Drinfeld modules 19 octobre : I. PAYS. - Algebres de quaternions et formes quadratiques entieres

2 novembre : G. HARMAN. - Types of fractions near irrationals 9 novembre : P. RAMBOUR. - Proprietes galoisiennes d'un anneau d'entiers en caracteristique p

16 novembre : T. HALES. - Sphere packings 23 novembre : E. FOUVRY. - Nombres premiers supersinguliers

30 novembre : V. GRITSENKO. - Maass wave functions on four dimensional hyperbolic space

7 decembre : T.N. SHOREY. - Squares in products from a block of consecutive integers 14 decembre : R. TIJDEMAN. - Complementing sets of integers

4 janvier : M. HINDRY. - Hauteurs de Neron sur les varietes abeliennes

11 janvier : Q. LIU. - Modeles stables des courbes de genre deux 18 janvier : G. CHRISTOL. - Indice d'operateurs differentiels 25 janvier : R. COULANGEON. - Reseaux unimodulaires et quaternioniens lei fevrier : N. BOSTON. - Some applications of families of Galois repre-

sentations

8 fevrier : H. OUKHABA. - Unites elliptiques et unites cyclotomiques dons les corps defonctions lei mars : J. BOXALL. - Sous-varietes algebriques de varietes abeliennes sur un corps fini 8 mars : J. MARTINET. - Structures algebriques sur les reseaux 15 mars : E. ULLMO. - Geometrie d'Arakelov des courbes elliptiques

22 mars : J.-M. COUVEIGNES. - Calcul et rationalite desfonctions de Belyi

29 mars : R. MURTY. - The supersolvable reciprocity law

5 avril : M.L. BROWN. - Modules singuliers et modules supersinguliers de modules de Drinfeld 26 avril : L. SCHNEPS. - Groupe de Grothendieck-Teichmiiller et automorphismes de groupes de tresses

3 mai : J.-P. WINTENBERGER. - Relevements de representations adeliques associees awc motifs de modules de Drinfeld

10 mai : D. BLASIUS. - Good reduction of Shimura varieties 17 mai : S. SAITO. - Cohomological Hasse principle for a surface over a number field

24 mai : H. HIDA. - A-adic forms of half-integral weight 7 juin : W. RASKIND. - Applications d'Abel-Jacobi £-adiques superieures Y. ANDRE. - Pour une theorie inconditionnelle des motifs 14 juin : K. ALLADI. - The combinatorics of words and applications to partitions

21 juin : P. COHEN. - Proprietes transcendantes des fonctions automorphes

28 juin : M. KUWATA. - Points rationnels sur les tordues de courbes elliptiques

Y. ZARHIN. - Hodge and Tate cycles on simple abelian fourfolds

Les textes qui suivent sont pour la plupart des versions ecrites de conferences donnees pendant l'annee 1992-93 au Seminaire de Theorie des Nombres de Paris. Ce seminaire est financierement soutenu par le C.N.R.S. et regroupe des arithmeticiens de plusieurs universites et est dotee d'un conseil scientifique et editorial. Ont ete aussi adjoints certains textes

dont la mise a la disposition d'un large public nous a paru interessante. Les articles presentes ici exposent soit des resultats nouveaux, soft des syntheses originales de questions recentes ; ils ont en particulier tous fait l'objet d'un rapport. Ce recuefi doit bien sur beaucoup a tous les participants du seminaire et a ceux qui ont accepte d'en reviser les textes. Il doit surtout a Monique Le Bronnec qui s'est chargee du secretariat et de la mise au point definitive du manuscrit; son efficacite et sa tres agreable collaboration ont ete cruciales dans l'elaboration de ce livre.

Pour le Conseil editorial et scientifique S. DAVID

Number Theory Paris 1992-93

The method of weighted words and applications to partitions Krishnaswami Alladi

ABSTRACr. The study of identities of Rogers-Ramanujan type forms an important part of the theory of partitions and q-series. These identities relate partitions whose parts satisfy certain difference conditions to partitions whose parts satisfy congruence conditions. Lie Algebras have provided a natural setting in which many such identities have arisen. In this paper a new technique called "the method of weighted words" is discussed and various applications illustrated. The method is particularly useful in obtaining generalisations and refinements of various Rogers-Ramanujan type identities. In doing so, new companions to familiar identities emerge. Gordon and I introduced the method a few years ago to obtain generalisations and refinements of the celebrated 1926 partition theorem of Schur. The method has now been improved in collaboration with Andrews and Gordon thereby increasing its applicability. The improved method yielded a generalisation and a strong refinement of a recent partition conjecture of Capparelli which arose in a study of Lie Algebras. Another application is a refinement and generalisation of a deep partition theorem of Gollnitz. A unified approach to these partition identities is presented here by blending the ideas in four of my recent papers with Andrews and Gordon. Proofs of many of the results are given, but for those where the details are complicated, only the main ideas are sketched.

1. - Introduction Identities of Rogers-Ramanujan type form an important part of the theory of partitions and q-series. Generally, one side of these identities is in the form of an infinite series, while the other side is an infinite product.

Usually, the series is the generating function for partitions whose parts satisfy certain difference conditions whereas the product is the generating function for partitions whose parts satisfy congruence conditions. The literature on such identities is vast (see for instance Andrews [101, [11)).

2

K. ALLADI

Andrews' monograph [I I I gives a quick overview of several recent major advances and discusses applications to many areas within mathematics, and even to Physics. The generic name "Identities of Rogers-Ramanujan Type", stems from

the two celebrated identities due independently to L. J. Rogers and S. Ramanujan which are the prototype, namely, 00

CIO

1

q

n=O

(1 - q)(1 - q2)...(1 - qn)

M=0

(1 - q5-+l)(1 - q5m+4)

and 00

qn2+n

(1.2)

n=0

n = H (1-q)(1-q2 )...(1-q) M=0

1

(1 - q5m+2)(1 - q5m+3)'

Indeed, even today, (1.1) and (1.2) are unmatched in simplicity, elegance and depth. These two identities have nice combinatorial interpretations as observed by MacMahon and Schur : THEOREM R. - For i = 1, 2, the number of partitions of an integer n into parts with minimal difference 2 and each part > i, is equal to the number of partitions of n into parts =- ±i (mod 5).

However, no simple combinatorial proof of (1.1) and (1.2) is known. One reason for the difficulty is that we do not know how to refine (1.1) and (1.2) by introducing a free parameter whose power would represent an important statistic in the partitions being counted. The term refinement is explained below.

There are several examples of Rogers-Ramanujan type identities for which refinements are known, the most famous being the 1926 partition theorem of Schur [221 : SCHUR'S THEOREM. - Let S(n) denote the number of partitions of n into

distinct parts - ±1 (mod 3). Let Si (n) denote the number of partitions of n into parts with minimal difference 3, and such that no two consecutive multiples of 3 can occur as parts. Then S(n) = Si(n)-

The equality in Schur's theorem can be refined to (1.3)

S(n; k) = Si (n; k),

TILE METHOD OF WEIGHTED WORDS ANDAPPLICA77ONS TO PARTITIONS

3

for any positive integer k, where S(n; k) and Si (n; k) count partitions of the type counted by S(n) and Si (n), but with the added restriction that there are precisely k parts, and with the convention that parts - 0 (mod 3) are counted twice. Combinatorial proofs can usually be given for partition identities which permit refinements. This is also the case with Schur's theorem; for combinatorial proofs of Schur's theorem, see Bressoud [14] or Alladi-Gordon 111.

Schur had originally stated the equality of three partition functions T (n) = S(n) = S1 (n), where T (n) is the number of partitions of n into parts ± 1 (mod 6). We do not discuss T(n) in this paper because refinements of the type (1.3) are not possible with T(n). The equality T(n) = Si(n) can

be considered as the next case beyond the Rogers-Ramanujan identities because the minimal difference 2 is replaced by 3 and the modulus 5 is replaced by 6. But in doing so, Schur realised that he needed the extra condition that consecutive multiples of 3 should not occur as parts. The purpose of this paper is to discuss a new technique called the method o f weighted words originally due to Alladi and Gordon [ 1 ], [2], and which has recently been improved by Alladi, Andrews and Gordon [3], [4]. This method is particularly useful in the study of Rogers-Ramanujan identities which permit refinements. Indeed, the method provides substantial refinements and generalisations often involving several free parameters which keep track of the number of parts in various residue classes. In many instances these refinements lead to bijective proofs of the partition theorem in question, in a natural way. The basic idea of the method is to consider positive integers which occur in various colours denoted by a, b, c,.... We then form words whose letters are symbols a, b, c.... with subscripts, where the subscript denotes the integer or part of the partition, and the symbols a, b.... will denote the colour

of that part. Special types of words are considered by impossing certain order rules on the symbols and certain gap conditions on the subscripts. These will correspond to the gap conditions on the parts in the partition theorem being discussed. The usefulness in the method lies in the fact that the symbols a, b, c.... play a dual role. On the one hand they represent colours, and on the other, they are free parameters when computing generating functions. The upshot of all this is that the partition theorem in question becomes a special case under certain dilations and translations, and the free parameters a, b, c.... provide the necessary refinements of the partition theorem. By changing the order rules on the symbols a, b, c..... several new companion partition identities are generated. The method was introduced in [11 to obtain refinements and generalisations of Schur's theorem. Subsequently [2], it was noticed that by changing the order rules for the symbols several companions to Schur's

4

K. ALLADI

theorem would be generated. More precisely, the function Si (n) is only one of six partition functions S,, (n),µ = 1, 2, ... , 6, all equal. Generalisations, refinements and companions to Schur's theorem are discussed in § § 4, 5.

In recent years, Lie Algebras have provided a general setting where Rogers-Ramanujan type identities have been discovered (see Lepowsky and Wilson [19], [20], [211). Motivated by a study of Vertex Operators in Lie Algebras, Capparelli [ 151 [16] made the following conjecture : CAPPARELLI'S CONJECTURE. - Let C* (n) denote the number of partitions of

into parts = ±2, ±3 (mod 12).

Let D(n) denote the number of partitions of n into parts > 1 with minimal difference 2, where the difference is > 4 unless consecutive parts are multiples of 3 or add up to a multiple of 6. Then C*(n) = D(n). Capparelli's conjecture was proved by Andrews in 1992 [12] using generating functions. Since there are similarities between Capparelli's conjecture and Schur's theorem, it was natural to see whether the method of weighted words would apply. And indeed it did, but the key idea was to replace C* (n) by the equivalent function C(n) which denotes the number of partitions of n into distinct parts = 2,3,4 or 6 (mod 6). This is for the purpose of refinements. In otherwords, C* (n) is like T (n) in Schur's theorem for which refinements are not possible and C(n) is like S(n) in Schur's theorem which permits refinements. Andrews, Gordon and 1 [3] obtained a three parameter refinement and generalisation (see Theorem 14 in § 7) from which Capparelli's conjecture followed as a special case under suitable dilations and translations. The deepest application of the method of weighted words so far has been a proof of a substantial generalisation and refinement due to Andrews, Gordon and myself [4], of the following formidable theorem of GOllnitz [ 18] : GOLLNITZ'S THEOREM. - Let A(n) denote the number of partitions of n into

parts - 2, 5, 11 (mod 12). Let B(n) denote the number of partitions of n into distinct parts

2, 4,

or5 (mod 6). Let C(n) denote the number of partitions of n in the form ml + m2 + + m,, no part equal to 1 or 3, and such that m1-m1+1 > 6 with strict inequality if mi - 6, 7 or 9 (mod 6).

Then A(n) = B(n) = C(n).

The equality A(n) = B(n) is trivial, and so the real challenge is the equality B(n) = C(n). Moreover, the function A(n) is like C*(n) in Capparelli's conjecture and T(n) in Schur's Theorem, permitting no refinements. So we will not discuss A(n) in this paper.

7HE METHOD OF WEIGHTED WORDS AND APPLICA77ONS TO PAR7TITONS

5

Gbllnitz [18] actually established the refinement

B(n;v) =C(n;v),

(1.4)

where B(n; v) and C(n; v) denote the number of partitions counted by B(n) and C(n) respectively, with the additional restriction that there are precisely v parts, and the convention that parts - 6, 7 or 9(mod 6) are counted twice. His proof is complicated and the details are forbidding. Andrews [8] gave a proof using generating functions, and subsequently gave a second proof ([ 111, § 10) where he used computer algebra to simplify the calculations. He then asked for a proof which lends more insight into the equality (1.4). We believe that our proof by the method of weighted words does provide this insight (see § 9 for a sketch of the main ideas behind this proof).

The main idea is to consider integers in six colours, of which three colours a, b and c are primary, and three colours ab, ac and be are secondary.

We then impose certain order rules on the coloured integers. Gbllnitz' theorem is viewed as emerging from an incredible key identity (see (9. 11) in § 9). The proof of this key identity is deep and difficult and may be found in [4]. It requires not only Watson's q-analog of Whipple's theorem but also the 6q'6 summation of Bailey. This is substantially more than what is required to prove either Schur's theorem or the Rogers-Ramanujan identities.

One advantage in using primary and secondary colours is that this explains the requirement for strict inequalities in Gbllnitz' theorem when mI - 6, 7 or 9(mod 6). As will be seen in § 9, the residues 2, 4 and 5(mod 6) correspond to the primary colours a, b and c whereas the residues 6, 7 and 9(mod 6) correspond to the secondary colours ab, ac and be. Another advantage with our approach is that Schur's theorem falls out as a special case by setting c = 0. For then, colours c, ac and be disappear and we are left only with three colours a, b and ab, which corresponds to Schur's theorem. The paper concludes with § 10 where some further problems going beyond Gollnitz' theorem are described.

2. - Notations We adopt the standard notation n-1

(a)n = (a; q)n =

(1 - aqj) j=0

for any complex number a, and 00

(2.1)

(a),, = n-00 lim (a)n = fl(1 - aqj), for j=0

IqI < 1.

6

K. ALLADI

In fact (2.1) can be used to define (a),,, for all real n by means of the relation (a)n =

(2.2)

.<

((aq) a

)

For a positive integer t, the q-multinomial coefficient of order t is defined by

(2.3)

nl+n2+

+nt _ n1+n2+ +nt _

-

nl, n2, ... , nt , [ nl n2, ... , nt

(q)ni (q)n2 ... (q)nt

9

If the base for the multinomial coefficient is q, then as in (2.3) we sometimes

do not write it; if the base is anything other than q, it will be displayed. Multinomial coefficients of order t = 3 will be used in the proof of Schur's

theorem while those of order t = 6 will appear in

the discussion on ni + n2 I is the q-binomial Gollnitz's theorem. When t = 2, the expression

ni, n2 J

r

coefficient. This is often denoted by I nl + n2

1 I. .

Binomial coefficients to

I.

bases q and q2 will appear in the discussion of Capparelli's conjecture. Given a partition it, we let a(7r) denote the sum of its parts, and v(7r), the number of parts of it. Also A(ir) will denote the largest part of it. Sometimes the parts will be arranged according to a specific lexicographic ordering (not necessarily the standard ordering of the positive integers) and in such cases A(7r) will denote the largest part of it with respect to this ordering. Quite often we will be counting the number of parts of it of a specific type, and this will be denoted by a subscript, eg : va (ir) will be the number of a-parts of it (precise definition will be given when necessary). If 7r = bl+b2+b3+ and 7r' = b1l +b2+b3+ are two partitions whose parts, as is customary, are written in decreasing order, then 7r" = it + it' is defined to be the partition b' + b2 + b3 + ..., where b' = bj + b. for each j. If v(7r) v(7r'), then in place of b; or V (whichever is missing), the number 0 is substituted while computing b' . Thus v(7r + ir') = max(v(7r), v(7r')). We also adopt the convention that j - b(mod m) means j = b+lm, with

j>0and1>0.

Finally, whenever we refer to a minimal partition, we mean a partition for which o ,(7r) is minimal subject to the given conditions.

3. - Lower bound gap conditions For partitions it which are defined by imposing lower bound conditions

on the gaps or differences between consecutive parts, their generating function can be computed by considering first the minimal partitions of the type under discussion.

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

7

For instance, consider the generating function for partitions 7r into n distinct parts. The smallest partition into n distinct parts is n(n + 1)/2 =

n + (n - 1) +

+ 1. Hence the generating function is qn(n+l)/2/(q)n.

Similarly, in connection with the Rogers-Ramanujan identities, consider the generating function for partitions into n parts with minimal difference 2. The minimal partition in this case is n2 = (2n -1) + (2n - 3) + + 3 + 1. Hence the generating function is qn2/(q)n,

which is the n-th term in (1.1). More generally, consider the generating function (3.1)

G(q) = E

q'('r)

7r

for partitions 7r which are given by specifying v(7r) = m, the number of parts of ir, and by imposing certain lower bound gap conditions on the parts. Let (3.2)

H(q) _

ga(lr*)

be the generating function for all minimal partitions 7r* that can be constructed subject to these conditions. We then have LEMMA

1. - G(q) = H(q)/(q),,,,.

Proof : lemma 1 is an immediate consequence of the observation that the partitions 7r counted in (3.1) can all be realised in the form (3.3)

7r = 7r* + -7r',

where 7r' is a partition which satisfies v(7r') < v(ir*) = m. The best way to see (3.3) is to draw the Ferrers graphs of it, ir* and 7r'. Since the generating function of the graphs 7r' is 1/(q),,,,, Lemma 1 follows from (3.3). The lower bound gap conditions defining it can be quite complicated. Lemma 1 is extremely useful in such situations.

4. - The method of weighted words Let each integer j > 2 occur in three colours, red, blue and purple, and let the integer 1 occur only in two colours, red and blue. We introduce the

8

K. ALLADI

symbols aj, b; and cj to represent the integer (part) j in colours red, blue and purple respectively. Sometimes we refer to a3 as an a-part, b3 as a bpart and cj as a c-part. We think of the letters a, b, and c as free parameters. Under the transformation

(4.1) (dilation) q ' q3 and (translations) a H aq-2, b E-- bq-1, c --> cq-3, the powers of the parameters a, b and c will represent the number of parts in residue classes 1, 2 and 3(mod 3) in partitions counted by Si (n). In order to make the transition from Si (n) to S(n) we will need to choose c = ab. That is why we think of c as having colour purple=red plus blue. The gap between any two symbols representing coloured integers is defined to be the absolute value of the difference between their subscripts. The gap is a non-negative integer without colour. For example, the gap between a5 and c5 is 0, and between a5 and b7 is 2. The weight of a symbol is its subscript. For example the weight of b6 is 6, and b6 represents the integer 6 coloured blue. In order to discuss partitions using these symbols, we need a lexiographic ordering of the symbols. First we consider the following lexiographic ordering : (4.2)

al -<

b2-
We shall refer to this as Scheme 1. Now by a word 7r we mean a collection of

these symbols arranged in non-increasing order according to the specified scheme (lexicographic ordering), in the present case, Scheme 1. By Q(7r) = n

we mean that the sum of the weights of it is n. In this sense we may think of 7r as a partition of n into coloured integers and we use the terms word and partition interchangeably as convenient. For example, 7r = a7c7b5b5a4c4c2blala1 is a word with or(7r) = 37. To specify the partition

of 37 into coloured integers we sometimes write a7 + C7 + b5 + b5 + a4 + C4 + C2 + bi + al + al. Let va,(7r), vb(7r), vc(7r) denote the number of a-parts, b--parts and c-parts of 7r respectively. In this example v,.,(7r) = 4, vb(7r) = 3, and vv(7r) = 3. We now consider words e1e2 ... e,,, where the el are symbols from (4.2),

such that the gap between the symbols is > 1, with the added restriction that the gap between consecutive symbols e` and ej+1 is > 1 if (4.3)

el is red, e1+1 is blue

or ei is purple.

We refer to this as a Type 1 word or a Type 1 partition. At first glance condition (4.3) might seem artificial but it is a natural generalisation of the gap conditions defining Si(n). Also, there are interesting generating functions attached to Type 1 partitions. More precisely we have the following lemma which was established in (1) :

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTTTJONS

9

LEMMA 2. - Let G = G(i, j, k; q) be the generating function for all Type 1 partitions it having va(?r) = i, vb(7r) = j and v,(-7r) = k. Then

G

where T, = '('+

zi

qTi+i+k+Tk (q)i(q)j(q)k,

is the m-th triangular number.

Since all partitions 7r counted by G have v(7r) = i + j + k, it follows from

Lemma 1 that Lemma 2 is equivalent to the statment that (4.4)

H(i, j, k)

i+j+k

G(i, j, k

qTi+j+i.+T),.

Here H(i, j, k) is the generating function for all minimal Type 1 partitions 7r* with va(7r*) = i, vb(7r*) = j and vc(7r*) = k. Since the multinomial

coefficients satisfy a recurrence relation, it turns out that (4.4) can be proved by induction on i + j + k, and indeed this was how Lemma 2 was proved in (1).

For the purpose of the induction, it is convenient to consider the decomposition

H=Ha+Hb+Hc,

(4.5)

where Ha, Hb and HH are the generating functions for partitions counted by H, with the additional restriction that the smallest part is an a-part, b-part and c-part respectively. It can be showed by induction on i + j + k (see (1)) that : HH(i, j, k) = qTi+J+k+Tk

i+j+k-1

i-1,j,k [i++k_ 1 Hb(i, j, k) = qTi+i+k+Tk+i i,j-1,k

(4.6)

HH(i, j, k) =

qTi+i+k+Tk+i+7

i+j+k-1

i,j,k-1

Once this is done, (4.4) follows from (4.5) and (4.6) because of the recurrence

a+j+k _ [ (4.7)

i,j,k

k-1 ,-[ i +i j- +l,j,k ]+q

t

+qi+j

i+j+k-1 i,j-1,k

+

[i++k_11 i,j,k-1

10

K. ALLADI

satisfied by the multinomial coefficients. Next, let Bl (n; i, j, k) denote the number of Type 1 partitions it of having va(7r) = i, vb(ir) = j and v, (7r) = k. Then from Lemma 2 we see that

E Bi(n; i,j, k)atb3ckgn =

atyck G(i,j, k) _ i,j,k

i,j,k,n (4.8)

i

qTi+i+k+Tk

Nc k (q)i(q)j(q)k

The main result which leads to a refinement and generalisation of Schur's theorem is LEMMA 3. - Let r, s > 0 be given integers. Then qTr+T

qT +s-,,,.+T,,,.

G(r-m,s-m, m) = E

O<m<min(r,s (g)r-m(q)s-m(q)m

0<m< min(r,s)

(q)r(q)s

In [I I we give two proofs of Lemma 3 one of which is combinatorial.

Lemma 2 has a nice combinatorial interpretation from which Schur's theorem follows and we describe this now. Note that arbsgTr+Te (4.9) 's

(q)r(q)s

V (n; r, s)arb9q

=

,

n,r,s

where V(n; r, s) is the number of (vector) bi-partitions (7rl; 7r2) of n such that irl has r distinct red parts and 72 has s distinct blue parts. So Lemma 3 is the assertion that the generating functions in (4.8) and (4.9) are equal when c = ab. In this case with

i=r-m,j =s-m,k=m, we have (4.10)

i+j+2k=r+s.

So we get the following combinatorial result :

THE METHOD OF WEIGHTED WORDS AND APPLICA77ONS TO PARTITIONS

11

THEOREM 1. - Let t > 0 be given and V (n; t) = > V (n; r, s). Then r+s=t V (n; t) _

B1(n; i, j, k). i+j+2k=t

Theorem 1 gives a refinement of Schur's theorem under the transformations (4.1). More generally, under the transformations q F_F qM

a F-4

aq«-M

b F-, bqO-M

applied to (4.7) and (4.8) we get the following generalization and refinement of Schur's theorem :

THEOREM 2. - Let M>3and 0
a +,3(mod M) such that (i) the difference between any two parts is > M. (ii) the difference between parts =_ (a +,3) (mod M) is > M. (iii) the parts - a +,3 (mod M) are counted twice. Then

A(n; k) = B(n; k).

In [1] a more general form of Theorem 2 is stated by relaxing the condition 0 < a < Q < M < a +,3 to the simpler condition that a, Q and a +,3 are incongruent (mod M). The reason condition (4.9) enters into Theorem 1 is because, given a bi-partition of n into r red and s blue parts,

one takes m of the red parts and m of the blue parts to form the purple parts, leaving behind r - m red parts and s - m blue parts. Hence the purple parts have to be counted twice.

5. - Six companions to Schur In 1971, using a computer search, Andrews [9] found the following companion to Schur's theorem : THEOREM A. - Let S2 (n) denote the number of partitions of n into parts

+ e such that el - el+1 > 3, 2 or 5 according as eI - 1, 2 or 3 el + e2 + (mod 3). Then S(n) = S2(n). Andrews gave a proof of Theorem A using generating functions in a man-

ner similar to his proof of Schur's theorem [5]. But the exact connections

12

K. ALLADI

between Theorem A and Schur's theorem remained unclear. These connections will now become clear by means of the method of weighted words. What is more, this approach will show that Si (n) and S2 (n) are only two of six partition functions S,,, (n), p = 1, 2, ... , 6, all equal to S(n). Under the transformations (4.1), the symbols in Scheme 1 become (5.1)

a,,,, = 3m - 2, b,,,. = 3m - 1 and c,,,,+1 =

3m,

where c = ab. In this case, the lexicographic ordering (4.2) for Scheme 1 yields the natural ordering

1<2<3<4< for the positive integers. From now on we will refer to the transformations in (4.1) as standard transformations. Let x,,,, = x( l) denote the symbol occupying the m-th position in (4.2). That is xl = al, x2 = bi, x3 = c2, ... , and so on. Instead of defining Type 1 partitions by means of the gap conditions (4.3), we may define them as follows : Type 1 partitions are those of the form x,,,., + xm2 + + x,,,,, , where (5.2)

ml - ml+1 > 3 with strict inequality if xm, is a c-part.

This is a direct translation of the difference conditions defining Sl (n) to the more general situation involving weighted symbols. From now on we will refer to the inequalities (5.2) as standard gap conditions.

In order to understand the difference conditions defining Andrews' S2(n), consider another lexicographic ordering of the symbols, namely, Scheme 2, given by (5.3)

al -
Under the standard transformations and with c = ab, Scheme 2 in (5.3) yields the following different ordering of the positive integers : (5.4)

Next let x(2) denote the symbol occupying the m-th position in (5.4). Then the difference conditions defining Andrews' function S2 (n) are equivalent to the statement that we consider partitions of the form x,n' +x.a2,,2 + + x(,n) , where x,n, = satisfy the standard gap conditions. More generally, if denotes the symbol occupying the m-th position in (5.3), let a Type 2 partition be one of the form x,n; +x,n2 + +x,n , with x,nt = XLI satisfying (5.2). Also, let B2 (n; i, j, k) denote the number of Type 2 partitions 7r of n having va,(7r) = i, vb(7r) = j, v. (7r) = k. We then have :

13

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PAR7777ONS

THEOREM 3. - Let n > 0 and i, j, k > 0 be integers. Then Bl (n; i, j, k) = B2 (n; i, j, k)

Theorem A is a consequence of Theorem 3 and Theorem 1, under the standard transformations. Indeed, Theorem 3 yields a refinement of Theorem A. One way to prove Theorem 3 is to show that the generating functions of B1 (n; i, j, k) and B2 (n; i, j, k) are the same, that is show that (5.5)

B2 (n; i,j, i,,j,k,n

aiNck i,j,k

q

T;+3+k+Tk

(q)i(q).j(q)k

and compare with (4.8). The proof of (5.5) proceeds in exactly the same way as that of (4.8). The only difference is that we now use another recurrence for the multinomial coefficients instead of (4.7), namely, (5.6)

I

[i+j+k-1

i+j+k1=1

i, j- 1, k L i, j, k- 1 to establish the corresponding generating function formulae for minimal L

i, j, k

i- 1, j, k J

Type 2 partitions.

Scheme 1 was generated by the standard ordering al while Scheme 2 was generated by a different ordering al

-<

b1 - c2

-<

c2

-<

b1.

More generally, there are six schemes given by the six permutations of the symbols al, b1 and c2. They are

Scheme 1: a1 -' bl -< c2 < a2 b2 c3 Scheme2: a1-
Scheme 6: C2 < b1 < al - c3 -< b2 < a2 Actually, only three of these schemes are essentially different, because Schemes 4, 5 and 6 are obtained from Schemes 1, 2 and 3 by interchanging the roles of a and b. But there are certain advantages in discussing all six schemes as will be seen soon. Let x(,µ,,) denote the symbol occupying position m in Scheme it, µ = 1,2,... , 6. By a partition of Type p we mean an expression X(Y) satisfy the standard gap conditions (5.2). x,n), where the Let B,,, (n; i, j, k) denote the number of partitions 7r of n of Type µ such that va(7r) = i, vb(7r) = j and v, (7r) = k. Then, extending Theorems 3 and 1 we have

14

K. ALLADI

THEOREM 4. - Let n > 1 and i, j, k > 0 be given integers. Then

Bi(n; i,j, k) = B2 (n; i, j, k) =

= B6 (n; i,j, k).

THEOREM 5. - Let r, s > 0 be given integers. Then

B,,(n;r-m,s-m,m).

V(n;r,s)= O<m<min(r,s)

for v = 1, 2, ... , 6. Consequently, if r + s = t, then

V (n; t) = > B, (n; i, j, k). i+j+2k=t

Under the standard transformations, the gap conditions (5.2) defining

Type µ partitions, µ = 1,2,.. . , 6, become the following : we are now counting partitions of the form el + e2 + + e,,, where the el are (ordinary) positive integers satisfying the difference conditions given by : (5.7) (5.8)

Type 1 : el - ei+1 > 3,3 or 4, if el Type 2 : el - e1+1 > 3,2 or 5, if el

Type 3 : el - e1+1

1, 2 or 3 (mod 3). 1, 2 or 3 (mod 3).

= 1 or > 3, if e` - 1 (mod 3),

(5.9)

> 2 or 6, if el

2 or 3 (mod 3).

Type 4 : el - e1+1

> 2 or 4, if el

1 or 3 (mod 3),

Type 5 : el - ej+1

= 3 or > 5, if el - 2 (mod 3). > 1, if el - 1 (mod 3),

(5.10)

= 3or > 5, if ei

(5.11)

Type 6 : el - ej+1

2 (mod 3),

= 4 or > 6, if el 3 (mod 3). > 1 or 6, if el - 1 or 3 (mod 3),

(5.12)

I= 2, 3or > 5, if el - 2 (mod 3) . Note that the difference conditions in (5.7) are precisely those defining Si (n) in Schur's theorem, whereas the conditions in (5.8) are the same as those given by Andrews for S2 (n) in Theorem A. Next, let S, (n; i, j, k) denote the number of Type it partitions (those given by conditions (5.7)-(5.12) corresponding to Type p), having i parts - 1 (mod

3), j parts - 2 (mod 3) and k parts - 3 (mod 3). Then using Theorems 4 and 5 and the standard transformations, Schur's theorem and Theorem A can be improved to :

7HE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

15

THEOREM 6. - Given integers n > 1 and i, j, k > 0, Sl (n; i, j, k) = S2 (n; i, j, k) _

= Ss (n; i, j, k)

THEOREM 7. - Given integers n > 1 and r, s > 0, let S(n; r, s) denote the number of partitions of n into r distinct parts - 1 (mod 3) and s distinct parts 2 (mod 3). Then

L

S(n;r,s) =

S, (n;r-m,s-m,m),µ=1,2,...,6.

O<m< min(r,s)

The difference conditions (5.9)-(5.12) defining Sµ(n) for µ = 3, 4, 5 and 6 are more complicated than those defining Si (n) and S2 (n). Hence the partition functions SN,(n), it > 3, did not show up in Andrews' computer search [9]. This illustrates the usefulness of the weighted words approach. The partition functions B,, (n; i, j, k) deal with the base case without any dilations and translations and so the gap condition defining them, namely (5.2), is uniform and elegant for all six functions. It is the lexicographic orderings that distinguish these six functions. Since Theorems 5, 6, and 7 are consequences of Theorem 4 (and Theorem 1), we will now describe how to prove Theorem 4 utilising recurrence relations for the q-multinomial coefficients. i +. + k 1 satisfy a total of six basic The q-multinomial coefficients i, j, k

L L

J

recurrences depending on the order in which the letters i, j and k are reduced by one. Two recurrences have already been given, namely, (4.7) and (5.6). Yet another recurrence is (5.13)

1i+j+k i,j,k

H

i+j+k-1

k

i,j,k-1

i+j+k-1 ]qk+i i+j+k-1

i-1,j,k

i,j-l,k

There are three more recurrences which we will not write down. Next, let H(µ) (i, j, k), H01) (i, j, k), and H,(,µ) (i, j, k) denote the generating functions for all minimal Type it partitions using i a-parts, j b-parts and k c-parts respectively, such that the smallest part is al, bl and c2. Then these generating functions can be computed by induction on i + j + k in a manner similar to (4.6) as will be described presently. For Type 1 partitions, the starting triple for Scheme 1 ordering is Scheme 1

:

al -< bl -< C2.

16

K. ALLADI

Hence the generating functions Ha, = H(1), Hb = H61) and H. = H,1) in (4.6) are given in terms of q-multinomial coefficients by replacing i by i - 1,

j by j - 1, and k by k - 1, in that order. For Type 2 partitions, the starting triple in Scheme 2 ordering is Scheme 2 : a1 -< C2 -< b1.

So in this case we have H(2) (z, j, k) = (5.14)

qTi+i+k+Tk

z+j+k-1

i-l,j,k

z+j+k-1 i,3, k - 1

(i, j, k) = qTi+i+k+Tk+i

=qTs+i+k+Tk+i+k

Hc(i, j, k)

[z+3±k - 1

i,j-l,k

Equations (5.14) can be established by induction on i + j + k. In Scheme 2, since a1 occurs first followed by c2 and then by b1, the letters i, k and j are reduced by 1 and in that order in (5.14). From (5.14) and (5.6) it follows

that H(2) + H(2) + H(2) = H(2) = H = G (q)i+j+k with H and G as in (4.4). This yields Theorem 3, which is the first equality in Theorem 4. Similarly, since the starting triple for Scheme 3 ordering is (5.15)

Scheme 3: C2 - al - b1, we have H(3) (z, j,

(5.16)

k) =

qTi+i+k+Tk

i+i+k-1 i, 3, k - 1

H(3) (i, j, k) = qT,+i+k+Tk+k H(3) (z) j, k)

=

qTi+i+k+Tk+k+i

z+j+k-1

i-l,j,k

[i-i_±k_ 1

i,j-1,k

where in (5.16) the letters k, i and j are decreased by 1 in that order to correspond to the starting triple in Scheme 3. Formula (5.16) is easily established by induction on i + j + k. From (5.16) and (5.13) it follows that Ha3) + H(3) + H(3) = H(3) = H, and this yields the equality B1 = B2 = B3 in Theorem 4. The treatment of the generating functions H(µ), H(A) and H(µ) for

p = 4, 5 and 6 is similar. We summarize the ideas of this section in the form of

THE METHOD OF WEIGHTED WORDS AND APPLICA77ONS TO PART7T7ONS

17

THEOREM 8. - The six schemes correspond to the six ways in which the q-multinomial coefficient [ i + j+ k 1 can be expanded in terms of I

i, j, k I

i+j+k- 1 , and [i+j+k-1

i+j+k - 1

In particular under i - 1, j, k i, j - 1 , k i, j, k - 1 . the standard transformations, the six companion functions S,,, (n) = 1, 2, ... , 6, to Schur'spartitionfunction S(n), correspond to these six recurrences for the q-multinomial coefficients.

6. - Combinatorial proof of Theorems 4 and 1 The combinatorial proof given here is an extension of the method in [2] which itself is based on certain ideas due to Bressoud [ 13].

Let r, s be given. We start with a bi-partition (7ri; 7r2) counted by V (n; r, s). So v(7ri) = r and v(7r2) = s. In what follows several steps will be given and illustrated with

7r1 =a7+a6+a3+a2+a1 and7r2=b13+b12+b8+b7+b5+b4+b2. Step 1

decompose 72 into two partitions 7r4 and 7r5, where 7r4 has the parts of 72 which are < r and 7r5 has the remaining parts. Let v(7r4) = m. So v(7r5) = s - m. :

7r4 =b5+b4+b2,

75 =b13+b12+b8+b7.

Step 2 : consider the conjugate of the Ferrers graph of 7r4 and circle the bottom node in each column. Denote this graph by 7r4. Construct the Ferrers graph 76, where the number of nodes in each row of 7r6 is the sum of the number of nodes in the corresponding rows of 7r1 and 7r4. The parts of 7r6 ending in the circled nodes are coloured purple and the remaining parts of

7r6 are red. Thus 76 has r - m red parts and m purple parts. 76 = 71 +7rq = alo+c9+a5 +c4+c2.

Step 3 : write the parts of 75 in a column in descending order and below them write the parts of 7r6 in descending order. Draw a line to separate the parts of 7r5 and 7r6.

Step 4 : substract 0 from the bottom element, 1 from the element above that, 2 from the one above that etc.... and display the new values and the subtracted values in two adjacent columns Cl IC2. The elements of C2 have no colour while those of Cl retain the colour of the parts from which they were derived.

18

K. ALLADI

Step 5: (penultimate Step) Rearrange the entries of C1 in descending order according to Scheme /.c, to form a column CR.

Step 6: (final Step) Add the corresponding elements of CR and C2 to get a partition 13 counted by B,,, (n; r - m, s - m, m). Each of these steps is a one-to-one correspondence and so this combinatorial procedure has provided a proof of Theorem 1 with any B. in place of B1. Since any function BN,, p = 1, 2, ... , 6 can be used in Theorem 1, the statement of Theorem 4 is an immediate consequence with i = r - m,

j=s-m,k=m.

To simply get a bijection between B. (n; i, j, k) and B. (n; i, j, k), proceed from Step 6 to Step 5, and replace Scheme p ordering by Scheme w ordering

and return to Step 6. We illustrate the above steps for Schemes 1 and 2 below.

Step 4

Step 3 1r5/1r6

Cl

C2

b13

b5

8

b12

b5

7

b8

b2

6

b7

b2

5

alo

a6

4

Cg

C6

3

a5

a3

2

C4

C3

1

C2

C2

0

THE METHOD OF WEIGHTED WORDS AND APPLUCA77ONS TO PARTITIONS

Step 5 (Scheme 1)

Step 6

Step 5 (Scheme 2)

19

Step 6

CR

C2

73

CR

C2

73

a6

8

a14

a6

8

a14

C6

7

C13

b5

7

b12

b5

6

b11

b5

6

bll

b5

5

blo

C6

5

ell

a3

4

a7

a3

4

a7

C3

3

C6

b2

3

b5

b2

2

b4

b2

2

N

b2

1

b3

C3

1

C4

C2

0

C2

C2

0

C2

The combinatorial proof given above leads to several improvements of Theorems 1 and 4, a few of which will be described below. For a full description of these improvements and for details of proofs see [2]. Let .Fµ,, denote the bijection which is described above converting a Type p partition 7rµ to a partition 7r, of Type w. More precisely, this is the bijection which is the result of starting with a Type p partition in Step 6, then going back to Step 5 with Scheme p ordering, then rearranging the elements of column CR in Step 5 according to Scheme w ordering, and finally perform Step 6 to get the partition 7r, or Type w. So, (6.1)

µ, (1r L) = 7r--

The mappings .Fµ,, tell us a lot about the generating functions (6.2) Fµ(x(,,µ)) = F.(x((A); a, b, c, 9) of Type µ

The most striking result concerning these generating function is TxEOREM 9. - For all postive integers m Fi(xsl)) = F2(xs2)) _ ... = F6(x36) ).

20

K. ALLADI

The proof of Theorem 9 (see 121) makes use of several identities connecting for different values of m and p, these identities being direct consequences of the bijections Fµ,,. The principal reason for the equality of the six generating functions at all positions 3m is because the alphabet lists in all six schemes are identical when truncated at 3m. That is, for all m, the set of values

1 W ,...,x3m lxlW x2 3 (6.3)

_ {al,a2,...,a.,bl,b2i...,b,C27C37...,CM+1},µ = 1,2,...,6. In particular, under the standard transformations, the set of symbols

considered in (6.3) for each of the Schemes become the set of positive integers < 3m. That is, although the natural ordering of the positive integers is altered in Schemes a, for p > 2, the set of positive integers considered is the same in all Schemes when truncated at position 3m. So, if Kµ = K. (m; a, b, c, q) denotes the a - b - c - q generating function for all Type µ partitions (under standard transformations) such that all parts are < 3m, then Theorem 9 yields

THEOREM 10. - Kl(3m) = K2(3m) = ... = K6(3m). By comparing the coefficient of aib' Ckgn in the equalities of Theorem 10 we get the following improvement of Theorem 6 :

THEOREM 11. - Let Sµ i, j, k, 3m) denote the number of partitions counted by Sµ(n; i, j, k) with the additional restriction that all parts are < 3m. Then, for each integer m > 1, Sl (n; i, j, k, 3m) = S2 (n; i, j, k, 3m) =

= S6 (n; i, j, k, 3m) .

7. - Refinements of Capparelli's conjecture In recent years, a number of classical partition identities have been found to lie at the heart of the interaction of vertex operators and representations of affine Lie Algebras (see Lepowsky and Wilson [19], [20], [21]). In the course of a study of standard modules of level 3 for A22), Capparelli [15] was led to the partition conjecture stated in § 1. This conjecture is stated after his Theorem 21 in [ 16]. Andrews [12] recently proved Capparelli's conjecture using generating functions.

We now describe how to obtain substantial refinements as well as generalisations of this partition result by the method of weighted words.

THE METHOD OF WEIGHTED WORDS AND APPLlCA77ONS '10 PAR7TTIONS

21

From the point of view of refinements it is preferable to replace C* (n) by the function C(n) which denotes the number of partitions of n into distinct parts - 2, 3, 4 or 6 (mod 6). Clearly C(n) = C* (n) because 00

2__, C* (n) qn n=0

=

1

(q2; g12) 0o (g3; g12) 00 (g9; q12)

,0

(g10; q12) 00

(7.1)

C(n)gn .

= (-q2; q6).(-q4; q6).(-q3; q3) n=0

With the function C(n) the following three parameter refinement of Capparelli's conjecture can be proved using the method of weighted words : THEOREM 12. - Let C(n; i, j, k) denote the number of partitions counted

by C(n) with the additional restriction that there are precisely i parts = 4

(mod 6), j parts - 2 (mod 6), and of those - 0 (mod 3), exactly k are > 3(i + j). Let D(n; i, j, k) denote the number of partitions counted by D(n) with the additional restriction that there are precisely i parts - 1 (mod 3) and j parts - 2 (mod 3) and k parts - 0 (mod 3). Then

C(n; i, j, k) = D(n; i, j, k). It is possible to prove Theorem 12 combinatorially. Although the combinatorial proof is similar to the proof of Schur's theorem given in the previous

section, there are some important differences, and so we give this combinatorial proof in the next section. As a consequence of that proof we get a further improvement of Theorem 12, namely, THEOREM 13. - Let C(n; i, j, k, N) denote the number of partitions 7r of n

counted by C(n; i, j, k) such that L + 3k < 3N - 2, where L is the largest part 0 0 (mod 3) among the parts of 7r.

Let D(n; i, j, k, N) denote the number of partitions of n counted by D(n; i, j, k) such that the largest part is < 3N - 2. Then C(n; i, j, k, N) = D(n; i, j, k, N).

The method of weighted words yields the more general result stated as Theorem 14 below from which Theorem 12 follows as a special case under (7.2)

(dilation) q --> q3

and (translations) a H aq-2, b H bq-4.

22

K. ALLADI

In a certain sense Capparelli's conjecture represents the most interesting dilations and translations in Theorem 14, but there are other nice special cases. We do not discuss them here and refer the reader to [3]. For Capparelli's problem, we assume that the integer 1 occurs in two colours a and c, and that integers > 2 occur in three colours a, b and c. As before, the symbols aj, b; and cj represent the integer j in colours a, b and c respectively. The lexicographic ordering that we choose is (7.3)

al < b2 < cl < a2 < b3 < c2 < a3 < b4 < c3 <

.

The Capparelli problem corresponds to the transformations (7.4)

a., F-. 33 - 2,

b3 F--> 33 - 4,

cj

3j

in which case the inequalities (7.3) become

the natural ordering among the positive integers. Let K(n; i, j, k) denote the number of vector partitions of n in the form (irl,1r2, 1r3) such that Il has distinct even a-parts, 7r2 has distinct even

b-parts and 73 has distinct c-parts such that v(7rl) = i, v(7r2) = j and the number of parts of 73 which are > (i + j) is k. Let G(n; i, j, k) denote the number of partitions (words) of n into symbols aj, b3, cj such that each part is > al and the gap between consecutive symbols is given by the matrix below : a

b

a

2

2

1

b

0

2

0

c

2

3

1

c

.

We then have THEOREM 14. - K(n; i, j, k) = G(n; i, j, k).

Note : this matrix is to be read row-wise. For instance, if an a-part has weight j, then the next larger part, if it is a b-part, must have weight > j + 2; if the next larger part is a c-part, its weight is > j + 2. Theorem 14 is a generalisation of Theorem 12 which itself is a refinement of Capparelli's conjecture. From the point of view of generating functions, Theorem 14 is a consequence of

23

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

LEMMA 4. - Let Ti = i i2 1 denote the i-th Triangular number. Then

(a) V

'

i,j,k,n

j k)atb'ck '

'

ihjg2Ti+2Tj (-q)i+j (-cqt+j+1)00

/

(q

t,3

2

; q2 )i(q2 ;q2 )j

ibiCkg2T.+2Tj+Tk+(i+j)k

(b) > G(n; i, j, k)atb'ckgn = i, j,k,n

(q)i+j+k

t,3,

i+j+kl 2 + j, k

+jl q

1i i, ?

92

Remark : theorem 14 is the statement that the generating functions in (a) and (b) of Lemma 4 are equal. To see this, all one needs to do is to sum the function in (b) over k to get the function in (a). More precisely ibiCkg2Ti+2Tj+Tk+(i+j )k

t,k ,7

(q)i+j+k ai&ig2T+2Tj(-q)i+j

ckgTk+(t+i)k

(q2; g2)i(g2; q2)j

(q)k

ij

(q2, q2)i+j

(q)i+j+k

(q)i+j(q)k (g2;g2)i(q2;q2)j

aibjg2T,+2Tj(-q)i+j(_cgi+j+1) 00 (g2;g2)i(q2;q2)j

which is the generating function in Lemma 4 (a). If we take c = 1, then the generating function in Lemma 4 (a) becomes a product, because (7.5)

(-q).

a'bi 2T -F2Tj (-q)00(-aq2; g2)00(-bq2; g2)oo.

ij

(ql; q1) i (q2; q2) 7

In (7.5) replace q H q3, a H q-2, b H q-4 to get 00

00

m=1

n=0

fl (1 + q6m-2) (1 + q6m-4)(1 + q3m) =

C(n)q,,

the generating function in (7. 1). So, in order to prove Theorem 14, we need to prove Lemma 4. Part (a)

of the lemma is clear from the definition of K(n; i, j, k). It is the proof of part (b) which is deeper and we sketch the main ideas below.

24

K. ALLADI

Given i, j and k, we need to show that G(i, j, k) _ n

G(n; i, j, k)qn =

q

i+j+kl

i+j,k Jq li+jlq2 i,j

Since the partitions counted by G(i, j, k) have i + j + k parts and since these partitions are given by lower bound gap conditions as in the matrix, the generating function for all minimal partitions H(i, j, k), using i a-parts, j b--parts and k c-parts is given by Lemma 1 and (7.6) to be (7.7)

[i++k] [i+i]

H(2, j, k) = G(i, j, k) (q)i+j+k = q2T,+2T;+Tk+(%+j)k

qi,j

q2

The p roof of (7.7) is by induction on i + j + k, the length of the word, and makes use of the functions Ha(i, j, k), Hb(i, j, k) and HH(i, j, k). These

are the generating functions for partitions counted by H(i, j, k) but with additional restriction that the smallest part is a2, b2 and cl respectively. From the definition of Ha, Hb and HH it is clear that

Ha+Hb+Hc=H.

(7.8)

The q-binomial coefficients in (7.6) satisfy two recurences each, namely,

_ i+j+k-1

i+j+k _ i+j+k-1

i+j,k

k i+j+k-1 i+j,k-1 +q i+j-l,k

i+j-1,k +q i+j+k-1 i+j,k-1

(7.9)

%+j

and

(7.10)

[i+] [i+_1] q2

i-1,jg2

2%

{i-i-_1]

[i±.i_1] q2

i,j-lq2

2

[i+i_i] i-1,j q2

All the recurences in (7.9) and (7. 10) are necessary to establish the formulae for Ha, Hb and H, The details are a bit complicated and so we do not give them here; they may be found in [3]. Once these formulae for Ha, Hb and HH are established, (7.8) will yield (7.7) which in turn will yield (7.6).

Remarks : in the discussion of Schur's theorem and companions in § § 4,5, we made crucial use of the q-multinomial coefficient of order 3, namely z+j+k (q)%+j+k

i,j,k

(q)i(q)j(q)k.

THE METHOD OF WEIGHTED WORDS AND APPLICA77ONS TO PARTTTTONS

25

As is well known, the q-multinomial coefficient and the q-binomial coefficient are related by

i+j+k

(7.11)

i,j,k

_ ]

i+j+k

i+jl

- [ i+j,k Jq [ i,j J9

The main difference here is that instead of (7.11) we are making use of the product

ri+j+kl L i+j,k Jq L i,j jq2 in Lemma 4(b).

8. - Combinatorial proof of Theorem 14 It is convenient to introduce the concept of level for the symbols aj, bj and cj. More precisely, in the lexicographic ordering a1

level 1

level 2

level 3

level 4

we think of a1, b2, c1 as at level 1, a2, b3, c2 as at level 2, and so on. The symbol a1 is necessary in this list even though it is never counted by the functions K(n) and G(n). Combinatorial Proof : we begin with a partition (9r1i 72) counted by K(n), where irl has distinct even a-parts and j distinct even b-parts, and ire has distinct c-parts. Several constructions will now be given and illustrated with

xl =a14+b14+b12+b8+as+a4+b4+a2+b2 7r2 = C13+C12+C1o+C5+C4+C2+C1.

In this example i = 4, j = 5. Step 1 : split ire into 74 U 75 where 74 has the parts of ire with weights < i + j and 7r5 has the remaining parts.

in4=C5+C4+c2+C1,

in5 =C13+C12+C10, i+j=9,k=3.

26

K. ALLADI

Step 2: consider the Ferrers graph of 1r1 and to its side place the graph of the conjugate of 7r4 - call this 7r4.

7r4 = 4 + 3 + 2 + 2 + 1 (uncoloured).

Step 3 : consider the partition 76 obtained by adding the number of nodes in the corresponding rows of 7rl and in. That is 76 = 7rl + in. The parts of 7r4 have no colour, while the parts of 7r6 retain the colour of the parts of 7r1 from which they were derived.

7r6 =a18+b17+b14+blo+a7+a4+b4+a2+b2. Important Observation : the correspondence (7rli 7r4) <--> 7r6 is oneto-one. Observe that 76 could have both odd and even parts. To extract 7r4 out of 76, start from the lowest part of 7r6, move upward and note the position of the first odd weight. This corresponds to the length of the first column of in. Next, note the position of the first even weight beyond this point. This corresponds to the length of the second column of ir4. Proceeding

beyond this point note the position of the next odd weight, and so on, at each stage keeping track of the positions where there is a change in parity of the weights. These positions will give the columns of 7r4. Step 4 : write the parts of 7r5 in descending order and below them write the parts of 76 in descending order. Step 5 : subtract 0 from the bottom element of this column, 1 from the element above that, 2 from the next one above, and so on, and display the

new values and the amounts subtracted in two adjacent columns C1IC2. The elements of C2 have no colour while those of Cl retain the colour of the part from which they were derived. Step 6: rearrange the elements of Cl to form a column CR by inserting

the c-parts at the appropriate levels. That is, if cj is an element of C1, then all elements below cj are < cj and those above cj are > cj. Note that cj may repeat in Cl and CR, as may also the parts aj and b;. In CR the lexicographic ordering of aj and b; may not correspond to (7.3), but the worst that can happen is a switch within the same level. For example, although al < b2, we could have al above b2 in Cl and CR. But a symbol aj or b3 which is a level higher than ak or bk will always occur above ak or bk in Cl and CR. Note also that al can occur as an element of Cl or C.

Step 7 : add the corresponding elements of CR and C2 to form a partition 7r3 counted by G(n; i, j, k). The parts of 7r3 retain the colours of the elements of CR from which they were derived. Note that all parts of ir3

are > al.

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

Step 4

Ste p 5

Step 7

Ste p 6

7r5 /7r6

C1

C2

CR

C13

C2

11

C12

C2

C2

13

alo

11

a21

10

blo

10

b2o

C1o

Cl

9

b8

9

b17

a18

alo

8

b5

8

b13

b17

blo

7

a3

7

alo

b14

b8

6

C2

6

C8

blo

b5

5

C2

5

C7

a7

a3

4

Cl

4

C5

a4

al

3

al

3

a4

b4

b2

2

b2

2

b4

a2

al

1

al

1

a2

b2

b2

0

b2

0

b2

1

27

Each of these steps is a one-to-one correspondence and so this completes the proof of Theorem 14.

9. - Extensions of Gollnitz's theorem GOllnitz's theorem stated in § 1 is one of the deepest in the theory of partitions. Our approach to this theorem via the method of weighted words not only lends insight into the structure of the partition functions concerned, but also provides substantial refinements and generalisations as can be seen from Theorems 15 and 16 below. Owing to the intricacy of the proofs of these theorems, we provide here only a description of the main ideas and refer to [41 for details. We assume that the integer 1 occurs in three primary colours, red, blue and yellow, and that integers j > 2 occur in six colours, of which red, blue and yellow are primary and purple, orange and green are secondary. We use the symbols aj, b; , cj, d; , ej and fj to denote the integer j in colours red, blue, yellow, purple, orange and green respectively. As before, a, b, C, d, e, f are free parameters. However, in making the transition from C(n) to B(n)

in GOllnitz' theorem, we need to take d = ab, e = ac, f = be. That is why the subscripts of d, e and f are integers j > 2 since they are obtained by

28

K. ALLADI

combining two parts in primary colours. That is also why we think of d, e and f as secondary colours. The lexicographic ordering we use for Gollnitz's theorem is (9.1) al --< bi

We will call this Scheme 1. Under the transformations (dilation) q H q6, (translations) a H aq-4, b H bq-2, c'--f cq-1,

(9.2)

and (combinations) d = ab, e = ac, f = bc,

the symbols become

fa. =6j-4,b, =6j-2,c. =6j-1, 1d; =6j-6,ej =6j-5, f; =6j-3,

(9.3)

and so the lexicographic ordering in (9.1) becomes

2<4<5<6<7<8<9<10<11<12<13...,

(9.4)

the standard ordering of the positive integers. In view of this, the transformations in (9.2) will be called standard transformations. Observe that the integers 1 and 3 are absent in (9.4). This is because el = (ac)1 and f, = (be)1 are absent in (9.1). For reasons which will become clear towards

the end of this section, it is sometimes useful to consider the full list of symbols (9.5)

dl-<

f in (9.5) are underlined to indicate that they do not occur in (9.1). Observe that for symbols with the same weight, the colours occur in the following order : (9.6)

d-< e-< a-< f
Given two colours, we use (9.6) to determine which of the two is of lower order. For example between e and b, e is of lower order and b of higher order. Next, consider partitions it = m1 + m2 + , using symbols in Scheme

1 such that the gap between symbols is > 1 with the added restriction that the gap between consecutive symbols ml and mi+1 is > 2 if (9.7)

ml is of lower order and m1+1 of higher order

or if ml and m1+1 are of the same secondary colour.

We will refer to such a 7r as a Type 1 partition. The principal partition result that we get by this approach is Theorem 15, from which Gollnitz' theorem falls out as a special case.

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

29

THEOREM 15. - Let B(n; i, j, k) denote the number of vector partitions 7r' = (irl; 72; 13) of n such that in, 72 and 73 have distinct a -parts, b-parts and c -parts respectively, and also v(7rl) = i, v(7r2) = j, and v(7r3) = k. Let C (n; a, ,0, y, 6, e, ¢) denote the number of Type 1 partitions in of n such that va,(7r) = a, vb(ir) = /3, ... , vf(7r) = 0. Then

E

B(n; i, j, k) =

C(n; a,)3, y, 6, e, 0).

a+b+e=i, /9+6+q=7,'Y+e+¢=k

Note that under the standard transformations, Theorem 15 yields a strong refinement of Gollnitz' theorem. Under the standard transformations the gap conditions (9.7) defining Type 1 partitions become the difference conditions defining the function C(n) in Gbllnitz' theorem. Also, (9.7) provides a natural explanation as to why there is strict inequality in Gollnitz' theorem when ml - 6, 7 or 9(mod 6). This is because the residue classes 6, 7 and 9(mod 6) correspond to the secondary colours ab, ac and be respectively.

More generally, under the substitutions

(dilation) q H qM and (translations) a H aqr'-M, (9.8)

b ,- bgr2-M c r-r cgr3-M.

Theorem 15 yields the following result.

THEOREM 16. - Let M > 6 and rl, r2, r3 residues such that

0
Let C(n; v) denote the number of partitions of n into v distinct parts such that ml > m2 > (i) each part ml is = rl, r2, r3, rl + r2, rl + r3 or r2 + r3 (mod M). (ii) ml - ml+1 > M with strict inequality if ml - ri + r2, rl + r3, or r2 + r3 (mod M). (iii) the parts = r1 + r2, rl + r3 and r2 + r3 (mod M) are counted twice. Then B(n; v) = C(n; v).

It is to be noted that in Theorem 16 we have almost total freedom in choosing the three residues r1, r2, r3 (mod M). (Gdllnitz [ 181, Satz (4.8 and

4.10)) obtained two extensions of his basic theorem with the modulus 6

K. ALLADI

30

replaced by M + 4 for M > 2, but in each theorem he prescribes a fixed set of residues mod(M + 4). Gollnitz' Satz (4.8) and (4.10) follow as special cases of Theorem 16.

The combinatorial explanation for counting the parts of secondary colour twice in Theorem 15 is as follows : Given a partition counted by B(n; i, j, k), take b of the a-parts and b of the b-parts to form the ab-parts (d-parts). Similarly, e of the a-parts and e of the c-parts combine to yield the ac-parts. Finally 0 of the b--parts and 0 of the c-parts combine to form the bc-parts. Thus in partitions counted by the function C we have

a=i-b-e, 3=j-6-0, 'y=k-e-q5.

(9.9) Therefore

a+/3+ry+2S+2e+2g=i+j+k. Finally the number of parts in partitions counted by C(n; a, /3, y, b, e, 0) is

s=a+Q+'y+b+e+q.

(9.10)

In what follows we assume (9.9) and (9.10). From the point of view of the method of weighted words, Theorem 15 is seen as emerging from the following incredible Key Identity : qTs+Tb+T+T. _1(1

(9.11)

atPt'Yt6tct0

t,1,k i

- q«(1 - q'))

(q)a(a)a(0) ry(0)A((7) (a) E

x=n+btE,9=P+bt ¢, k=7t Et Ck qT;+T.; +Tk

_ (-aq)OO(-bq)OO(-cq)OO.

Clearly, the generating function of B(n; i, j, k) is B(n; i,j, k)qn n

T +Tj+Tk

(q)i(q)j(q)k'

which is the summand on the right hand side of (9.11). In order to prove Theorem 15 we need to do two things. Firstly, we need to show LEMMA 4. -

n

C(n; a, l3, Y, b, e, )q

n = qT +Tb+T +To_, (1 - qa(1 -

() () () () () (

q« g a q ry'i b q E q q' )

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

31

Secondly we need to prove the key identity (9.11). Both of these are quite difficult. The proof of Lemma 4 is along the lines of the proof of Lemma 2 given in

§4, but the details here are more difficult. First, we rewrite the expression on the right in Lemma 4 in the form qT s +T+T+TT- i

(9.12) (q)S 1 it follows that the generating function for all minimal Type 1 partitions having va(7r) = a, vb(7r) = Q, ... , vf(ir) = 0 is

s

(9.13)

The proof of (9.13) is by induction on s and makes use of recurrences for the multinomial coefficients of order t = 6. Details may be found in [4]. The proof of the key identity (9.1) is very difficult (see [41) and so we do not give it here. The depth of this identity becomes plain when we see that its proof requires not only Watson's q-analog of Whipple's theorem but also the 6T6 summation of Bailey (see Gasper and Rahman [17]). Note that if we set c = 0 in (9.11) and compare the coefficients on a' b' on both sides, we get Lemma 3. Thus one advantage in this approach to

Gollnitz' theorem is that Schur's theorem falls out as a special case by setting c = 0. We now discuss companions to the function C(n) in Theorem 15 (hereinafter denoted by C1(n)) obtained by considering lexicographic orderings other than Scheme 1. It is for this that the full list of symbols in (9.5) is useful. Before introducing these orderings we make an observation about Type 1 partitions. Let x,,, = x,,;) denote the symbol occupying position m in the complete list (9.5). That is xo1) = d1, x(11) = e1, x21) = a1, and so on. We set x(1) = d1

because under the standard transformations, d1 becomes the integer 0. With this notation the gap conditions defining Type 1 partitions in (9.7) can be recast in the following equivalent form : Type 1 partitions are those of the form x.,,,,, + XM2 + , where : (9.13)

f mj - ml+1 > 6 with strict inequality if is of secondary colour.

From now on we will refer to (9.13) as standard gap conditions.

32

K. ALLADI

In considering other lexicographic orderings we omit the symbol dl because d1 = 0 under the standard transformations. We therefore choose any ordering of the symbols (9.14)

el, a1, f1, b1, c1, d2 -

For instance, Scheme 1 is generated by the basic ordering (9.15)

Scheme i :

el -< al -< f 1 -< b1

cl -< d2

We get the full list of symbols in (9.5) by increasing the weights in (9.15) by one in succession. Similarly, we may consider another ordering of the symbols, namely, (9.16)

Scheme 2:

el

al -< f 1

b1 - d2 -< c1

Then the full list of symbols generated by Scheme 2 is (9.17)

el-
As in (9.5), the symbols el and f 1 are underlined in (9.17) to indicate that they will never appear in partitions we will be considering presently. More generally one may consider all 6 ! = 720 orderings of the symbols in (9.14) and the 720 Schemes thus generated. Let x,,,, = denote the symbol occupying position m in the full list of symbols in Scheme µ. We define a Type µ partition to be one of the form x,,,,, + x,,,,2 + , where 0, x7n, satisfy the standard gap conditions (9.13). Next let Cµ (n; a, _y, 6, E, 0) denote the number of Type µ partitions 7r of n with vo. (7r) = a, vb (7r)

v f (ir) = 0. Then we have

THEOREM 17. - Forp = 2,3,...,720 c,

6,

.E,

A combinatorial proof of Theorem 17 can be given in a manner identical to the bijective proof of the equality of the Schur companion functions given in § 6. So we do not repeat the ideas here. A combinatorial proof of Theorem 17 may be found in (4). Under the standard transformations the full list of symbols in (9.17) for Scheme 2 yields the following ordering of the positive integers : (9.18)

1--< 2-<3-<4-<6-< 5-<7-<8-< 9-< 10-<

This ordering leads to the following result which may be considered as a companion to Gbllnitz' theorem just as Andrews' theorem is a companion to Schur's theorem.

THE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTT ITONS

33

THEOREM 18. - Let C2 (n) denote the number of partitions of n in the form

, such that no ml = 1 or 3 and ml - ml+l > 7, 6, 7, 6, 5 or 8 f mi - 7, 2, 9, 4, 5 or 6(mod 6). Then

ml + m2 +

Cl (n) = C2 (n).

Remarks : the inequalities defining most of the other companions C,(n) (under standard transformations) are generally more complicated than the ones for Ci(n) and C2(n), and will be like (5.9)-(5.12). But here we have more than 700 such sets of inequalities and so we will not write them down ! There may be a few functions Cµ (n) which are as nice as Cl (n) and C2(n) and it may be worthwhile to determine them all. The combinatorial proof of Theorem 17 alluded to above involves bijec-

tions between partitions of Type p and those of Type w, for p, w = 1, 2, ... , 720; that is, bijections between partitions counted by C,, (n) and CL,(n). But it is an entirely different story concerning bijections between

partition counted by C,.(n) and B(n), and indeed no such bijection is known at present even though Theorem 15 is a refinement of the equality C,(n) = B(n) involving many parameters. This appears to be extremely difficult.

10. - Beyond Gollnitz' theorem The approach to Schur's theorem via the method of weighted words involved two primary colours a, b and one secondary colour ab. For Gollnitz' theorem we need three primary colours a, b, c and three secondary colours ab, ac and be. The principal reason for the substantial increase in the depth and difficulty when making the transition from Schur's theorem to Gollnitz's theorem is that the ternary colour abc is dropped ; that is, we are not dealing with the full non-empty alphabet of colours that can be generated using a, b, c, but only with a proper subset.

Andrews [61, [71 has obtained general partition theorems which extend Schur's theorem by choosing a set al, a2i ..., a, of distinct residues (mod M), with M > 21 - 1 and by considering all possible residue classes which are given as non-empty sums EEzai, Ej = 0 or 1. The main reason he was able to obtain such extension was because he was dealing with the complete set of 2' - 1 residues generated by al, a2, ..., a,. One way of extending Gollnitz's theorem would be to consider four primary colours a, b, c, d. We then have a choice of dropping either abed, or may be even some of the secondary and ternary colours. At the moment we do not know which of the choices (if any) would be connected with the expansion of

(-aq).(-bq).(-cq).(-dq)..

34

K. ALLADI

An attempt at this question might give some clues about the general situation with r primary colours a,, a2, ... , a,.. Once again the emphasis is that we should not deal with the full alphabet of colours generated by al, a2, ... , a, . Such a study was attempted computationally in 1971 by Andrews [9; p. 384-385] but his effort was limited by the amount of computer power and memory available at that time. With the availability of modern computer algebra systems, it may not be unreasonable to consider these questions now.

Manuscrit recu le 10 janvier 1994

7I-IE METHOD OF WEIGHTED WORDS AND APPLICATIONS TO PARTITIONS

35

References

[11 K. ALLADI and B. GORDON. - Generalizations of Schur's partition theorem,

Manuscripta Mathematics 79 (1993), 113-126. [2] K. ALLADi and B. GORDON. - Schur's partition theorem, companions, refine-

ments and generalisations, (to appear). [3] K. ALLADI, G.E. ANDREWS and B. GORDON. -Refinements and generalisations

of Capparelli's conjecture on partitions, J. Algebra (to appear). [4] K. ALLADI, G.E. ANDREWS and B. GoRDON. - Generalisations and refinements

of a partition theorem of Gollnitz, J. Reine and Angew. Math. (to appear). [5] G.E. ANDREWS. - On Schur's second partition theorem, Glasgow Math. J. 9 (1967), 127-132. [6] G.E. ANDREWS. - A new generalisation of Schur's second partition theorem, Acta. Arith. 4 (1968), 429-434. [7] G.E. ANDREWS. - A general partition theorem with difference conditions, Amer. J. Math. 191 (1969), 18-24. [8] G.E. ANDREWS. - On a partition theorem of Gollnitz and related formulae, J. Reine Angew. Math. 236 (1969), 37-42. [9] G.E. ANDREWS. - The use of computers in search of identities of Rogers-

Ramanujan type, in Computers in Number Theory (A.O.L. Atkin and B.J. Birch, Eds.), Academic Press (1971), 377-387. [10] G.E. ANDREWS. - The theory of partitions, Encyclopedia of Math., Vol. 2 Addison Wesley, Reading (1976). [111 G.E. ANDREWS. - q-series : their development and applications in Analysis, Number Theory, Combinatorics, Physics and Computer Algebra, NSF-CBMS Lectures, Vol. 66, Amer. Math. Soc., Providence (1985). [12] G.E. ANDREWS. - Schur's theorem, Capparelli's conjecture and q-trinomial coefficients, in Proc. Rademacher Centenary Conf. (1992), Contemp. Math., Amer. Math. Soc. (to appear). [13] D.M. BRESSOUD. - On a partition theorem of Gollnitz, J. Reine and Angew. Math. 305 (1979), 215-217. [14] D.M. BRESSOUD. - A combinatorial proof of Schur's 1926 partition theorem, Proc. Amer. Math. Soc. 79 (1980), 338-340.

36

K. AIIADI

[15] S. CAPPARELLI. - Vertex operators for affine algebras and combinatorial identities, Ph.D. Thesis, Rutgers Univ. (1988). [16] S. CAPPARELLI. - On some representations of twisted affine Lie Algebras and

Combinatorial identities, J. Algebra 154, (1993), 335-355. [17] G. GASPER and M. RAHMAN. - Basic hyper-geometric series, Encyclopedia of

Mathematics and its Applications, Vol. 35, Cambridge (1990). [18] H. GOLLNITZ. - Partitionen mit D ferenzenbedingungen, J. Reine Angew. Math. 225 (1967), 154-190. [19] J. LEPOwsKy and R.L. WILSON. - A new family of algebras underlying the Rogers-Ramanujan identities and generalisations, Proc. Nat. Acad. Sci. USA 78 (1981), 7254-7258. [20] J. LEPOwsKy and R.L. WILSON. - A Lie-theoretic interpretation and proof of the Rogers-Ramanujan identities, Adv. in Math. 45 (1982), 21-72. [21] J. LEPOwSKY and R.L. WILSON. - The structure of standard modules, L Universal algebras and the Rogers-Ramanujan identities, Invent. Math. 77 (1984), 199-290. [22] I. SCHUR. - ZurAddiven Zahlentheorie, Gessammelte Abhandlungen, Vol. 2, Springer, Berlin (1973), 43-50.

Krishnaswami ALLADI

Department of Mathematics University of Florida Gainesville, Florida 32611 U.S.A.

Number Theory Paris 1992-93

Theorie des motifs et interpretation geometrique des valeurs p-adiques de G-functions (une introduction) Yves Andre

1. - Introduction Partons d'une question concrete, dans 1'esprit de Hensel. Considerons une serie de puissances a coefficients rationnels, la plus simple pour commencer :

1+x+x2+...=(1-x)-1. Pour x = 2/3 < 1, cette serie converge. Mais 2/3 a deux facons d'etre < 1, l'usuelle et la dyadique; dans les deux cas bien entendu, la somme vaut 3 (dans R et dans Q2 respectivement). Prenons un exemple moans elementaire : 1 + x2 + 3/2x4 +... + 12n n

(1 -

2x2)-1/2

.

Pour x = 2/3, on trouve 3 en sommant dans R. Dans Q2, on devine que la somme est ±3 en notant que le carre de la serie de puissances est une fonction rationnelle. Pour determiner le signe, it suffit des lors de remarquer que cette somme est congrue a 1 modulo 4: c'est donc -3. Ainsi, aux places de Q pour lesquelles 2/3 est dans le disque de convergence, les sommes sont des nombres rationnels, mass distincts.

Plus generalement, considerons une serie de puissances y(x) = ao + E Q[[x]], eventuellement transcendante, et supposons alx + a2x2 + que pour une valeur x E Q, cette serie converge dans R vers un nombre rationnel ou algebrique. Soit p un nombre premier tel que la serie converge aussi p-adiquement.

Y.ANDR8

38

Converge-t-elle encore vers un nombre rationnel, resp. algebrique, dans Q ? Comme on peut s'y attendre, la reponse est non en general : d'ailleurs,

en s'inspirant de l'exemple precedent, on construit aussitot le contreexemple suivant :

y(x) = [(1 -

2x2)-1/2

- 3] exp(2x),

x = 2/3,

p = 2.

On notera toutefois ici que y(x) est solution d'une equation differentielle d'ordre 2, et le phenomene remarquable est que les evaluations en 2/3,

aussi bien reelles que dyadiques, de y(x) et de sa derivee y'(x) sont rationnellement proportionnelles. Ceci nous conduit a reformuler legerement la question en termes de

dependance lineaire ou algebrique sur Q d'evaluations de y(x) et de ses derivees - nous limitant aux solutions d'equations differentielles lineaires.

Voici un exemple du a F. Beukers [Be93), mettant en jeu la serie hypergeometrique de Gauss y(x) = 2Fl(1/12, 5/12,1/2;x). Pour x = 1323/1331, cette serie converge dans ]l8 vers le nombre algebrique 4 ' 11 (miracle!).

Comme 1323 = 33 72, elle converge aussi dans Q7. Beukers montre qu'elle converge en fait vers le nombre algebrique 4 ' 11 (second miracle!). L'idee que je vais defendre ici est que ce phenomene a lieu chaque fois qu'il est motive; plus precisement, je me propose de montrer comment la philosophic des motifs de A. Grothendieck conduit au principe heuristique (ou conjecture) suivant : PRINCIPE DES RELATIONS GLOBALES. - SOit (yl (x), ... , yn(x)) E Q[[x]]n

une base de solutions dune equation dferentielte lineaire a coefficients dans Q(x) , facteur d'une equation de Picard-lochs. Soient v, w deux places de Q, dont l'une v, est archimedienne, et soit un element de Q situe dans le disque de convergence v- et w-adique de yl (x),. . . , y,,, (x). _

S'il existe une relation de dependance algebrique sur Q entre les ... , y(n-1) des derivees de evaluations v-adiques yl Y (' )v, yl (x), ... , yn (x) en l;, ne provenant1 pas par specialisation x -- d'une relation de dependance algebrique sur U(x) entre les (x), alors it en est de meme pour les evaluations w-adiques.

Nous notons bien entendu ici Q une cloture algebrique de Q, et disons qu'une equation differentielle Ay = 0 est facteur d'une autre A'y = 0 pour exprimer que A divise A' dans Q(x) [d/dx] (a droite ou a gauche). Rappelons qu'une equation de Picard-Fuchs est une equation differentielle qui regit la variation de la cohomologie de De Rham d'une Q(x) variete algebrique en fonction du parametre x (connexion de Gauss-Manin).

THEORIE DES MOTIFS ET INTERPRETATION GkOME`7RIQUE

39

Ainsi, dans 1'exemple de Beukersl, 1'equation hypergeometrique satisfaite par 2F1 (1/12, 5/12, 1/2; 27t2/4) est aussi satisfaite par la classe de cohomologie de la differentielle dX/Y sur la courbe elliptique d'equation

Y2=X3-X-t.

Rappelons aussi [A891 que toute solution y(x) = ao + alx + a2x2 +

E

Q[[x]] d'une equation de Picard-Fuchs est ce que Siegel aappele, it y a 65 ans, une G fonction : elle definit pour toute place de Q une fonction analytique au voisinage de 0, et, de plus, le denominateur commun a ao, a1i ... , a,,,, croit au plus exponentiellement en m. Le principe des relations globales peut etre etendu heuristiquement a toutes les G-fonctions; on n'y gagnerait guere, attendu qu'une conjecture de Bombieri-Dwork predit que toute G-fonction "provient de la geometrie",

i.e. est solution d'une extension (multiple) de facteurs de connexions de Gauss-Manin (exemple : les polylogarithmes). D'autre part, lorsque la connexion est semi-simple (par exemple lorsqu'elle gouverne la variation de cohomologie d'une Q(x) variete propre et lisse), on peut omettre du "principe" la condition que v est archimedienne.

Le principe des relations globales est particulierement interessant en liaison avec le theoreme suivant (qui reprend les memes notations), du a Bombieri : PRINCIPE DE HASSE POUR LES VALEURS DE G-FONCTIONS. - Toute relation

de dependance algebrique de degre done 6 a coefficients dons Q entre les evaluations yl (), yi y(n-1) valide en toute place de convergence 2, provient necessairement, par specialiation, d'une relation de dependance algebrique entre les (x) a coefficients dans q(x), sauf si t; appartient a un certain sous-ensemble de U de hauteur borne (polynomialement en 6). Voir [Bo8l], et aussi [A891.

En resume, l'objet principal de cet article de synthese est de justifier le principe des relations globales (a partir de conjectures bien connues de Grothendieck dans le cadre de sa theorie des motifs), et d'en esquisser une application. Cette justification comporte deux volets : a) interpretation cohomologique des valeurs (archimediennes ou non) de G-fonctions; nous nous attarderons sur des exemples mettant en 1 cet exemple a d'ailleurs ete concu pour tester le principe ci-dessus, deJA esquisse dans [A89], cf. l'introduction de [Be93].

2 Le principe ci-dessus permet de construire de telles relations. en multipliant entre elles les relations "locales" (ce qui augmente 6 bien entendu).

40

Y. ANDRI;

jeu les varietes abeliennes, avant de developper le cas general (en vue duquel nous presentons brievement la theorie des motifs et certaine variante nonconjecturale). b) Elucider comment 1'existence de relations algebriques "excep-

tionnelles" entre valeurs de solutions d'une equation de Picard-Fuchs est liee a la presence de cycles algebriques "exceptionnels" sur les fibres correspondantes de la famille a un parametre de varietes sous-jacente.

Quant a 1'application, nous nous bornerons a indiquer succinctement comment, en prenant les deux principes ci-dessus comme guide, on parvient a demontrer, inconditionnellement, une serie de resultats du type suivant [A] :

THEOREME 1. - Soit f : A -> S une famille de varietes abeliennes parametree par une courbe afine S sur Q. Une fibre A3 sera dite excep-

tionnelle si l'une de ses puissances porte un cycle algebrique qui ne prouient pas par specialisation3 d'un cycle de Hodge absolu sur la puissance correspondante de la fibre generique geometrique Af. Supposons qu'il existe tine fonction rationnelle x sur S telle que les fibres de f au-dessus de x = oo soient de type CM. Alors pour tout 6 > 0, it n'y a qu'un hombre fini de nombres algebriques C de degre < 6 et p-entiers en tous les hombres premiers sauf au plus 6, tels que l'une des fibres de f au-dessus de x = l; soit exceptionnelle.

A noter, en particulier, qu'est exceptionnelle toute fibre A telle que l'homomorphisme de specialisation End 4q --f End A ne soit pas surjectif. On pourra comparer avec le premier enonce de ce type, obtenu dans [A861. En fait, on peut preciser le theoreme 1 : la hauteur (logarithmique) des Z; exceptionnels est bornee polynomialement en 6. Dans le cas d'une famille modulaire de courbes elliptiques, les fibres exceptionnelles correspondent aux moduli singuliers j, on peut poser x = j-1, et on obtient ainsi que la hauteur de l'invariant j d'une courbe elliptique a multiplication complexes par un ordre quadratique imaginaire 0 est bornee par un certain polynome en le nombre de classes de 0 et le nombre de premiers divisant la norme de j. Precisons toutefois que les methodes mises en oeuvre ici debordent largement le cadre des varietes abeliennes (cf. § 8d).

Plan de l'article : 2. Periodes 3. Une Q-structure dans la cohomologie cristalline? 3 it est commode de considerer ici les realisations de De Rham ou l-adiques, de sorte que la specialisation est immediate a definir.

T7IEORIE DES MOTIFS ET INTERPRETATION GEOMETRIQUE

41

4. Le cas des varietes abeliennes degenerantes 5. Le cas des courbes elliptiques avec bonne reduction 6. Motifs 7. Realisations de Betti, periodes p-adiques 8. Comment eviter les conjectures standard?

IL. - Periodes

Soit f : X -* S = P'\{(1, ... , (9 } une famille de varietes projectives lisses define sur un corps de nombres K. Quitte a retrancher d'autres points (ti, on peut supposer que les OS-modules de cohomologie de De Rham HHR(X/S) := ]R9 f.Ql Cls sont fibres. A 1'aide d'une base wi, ... , wn, et en termes d'une coordonnee globale x sur S, la connexion de Gauss-Manin V donne naissance a un systeme differentiel (*)

dx = r Y , ou r est une matrice n x n a coefficients dans K(x)

.

Supposons les (;, tous non nuls pour simplifier, et considerons la matrice Y(x) solution de (*) a coefficients dans K[[x]] normalisee par Y(O) = Id. Choisissons un plongement K y (C.

On a: HdR(Xan/San)v y (Rv fan * Q) ®Q C HdR(X/S) ®Os OS-n. Le choix d'une trivialisation locale de R9 fan *Q au voisinage de x = 0 permet d'exprimer l'isomorphisme HdR(X/S) ®Os OSan = (R9 f an *Q) ®QOSan sons

forme d'une matrice de periodes 52(x) = (1 w (x) = j wti(x))4, satisfaisant a 1'equation (*). Cette matrice est donc liee a Y(x) par la formule .

(**)

Yx) = 12(x)12(0)-1 ,

ce qui fournit une interpretation geometrique des evaluations complexes de Y(x). Si les evaluations en ( E K ((j4 (,) des coefficients de la matrice Y(x) sont algebriquement dependantes sur K, on en deduit que les coefficients des matrices 12(x) et 12(0) le sont aussi. Or une conjecture celebre de Grothendieck 1G661 affirme :

(P) toute relation polynomiale a coefficients dans K entre les periodes d'une K-variete projective (lisse) devrait "provenir" de l'existence d'un cycle algebrique sur une puissance de cette varietes. 4 sous cette forme traditionnelle, c'est en fait la transposee de la matrice de cet isomorphisme. 5 Un tel cycle algebrique donne en effet lieu a des relations polynomiales a coefficients algebriques entre les periodes, en ecrivant la compatibilite de ses composantes Betti et De Rham, et en utilisant la formule de Kenneth.

42

Y. ANDRE

Remarque : cette conjecture n'est connue que pour un petit nombre de varietes "tres simples"6 : surface cubique (Ia conjecture equivaut alors a la transcendance de 7r), courbe elliptique a multiplication complexe (Chudnovsky)... Mais si l'on borne a priori le degre des relations de dependance entre periodes (il en sera ainsi dans la situation consideree ci-dessus), alors le fait que ces relations proviennent de cycles algebriques peut s'etablir, a 1'aide de la theorie diophantienne des G-fonctions, dans des situations beaucoup plus generales. En voici un exemple, formule en termes de "cycles motives", legere variante des cycles algebriques developpee au § 8 : TI-IEOREME 2. - Soit g

:Z-S=

11 \ {(1, ... I(", } un K-morphisme

projectif et plat de dimension relative q, lisse en dehors de 0. On suppose que

la fibre Zo est un diviseur a croisements normaux simples dont toutes les strates d'intersection sont lisses, et tel que la cohomologie de chaque strate soit formee de cycles motives. Soient A 1, \2, ... des elements lineairement independants de Hq (ZT , Q) situes dans l'image de la puissance q-ieme du logarithme de la monodromie locale en 0, et soient 1j i 772.... des elements lineairement independants de HHR(Z ). Alors toute relation polynomiale homogene de degre 6 a coefficients dans K entre les periodes j>, qi provient

d'un cycle motive sur une puissance de la fibre Z j , pourvu que m soit suffisamment grand (par rapport a 6). C'est un cas particulier du theoreme principal de [A89] IX, sauf que dans l'hypothese et la conclusion, cycles motives remplacent cycles de Hodge; la preuve est identique. La borne pour m est en principe effective, du type exponentielle d'une puissance de 6 (une particularite de la theorie diophantienne des G-fonctions fait qu'on ne sait pas remplacer dans 1'enonce 1/m par "un rationnel proche de 0").

3. - Une Q-structure dans la cohomologie cristalline? Examinons maintenant la possibilite d'une interpretation geometrique similaire des evaluations p-adiques de Y(x). Choisissons donc un plongement de K dans CP, le complete p-adique d'une cloture algebrique de Q,. Considerons le morphisme Xan. -> San de CP varietes analytiques (au sens de Bourbaki, que nous qualifierons de "mou", par opposition a la theorie analytique rigide qui interviendra plus loin) associe a f. Le faisceau de germes horizontaux HdR(Xan/Sa")V HdR(X/S) ®Os OS- est localement constant (pour la topologie usuelle, "molle").

6 du point de vue motivique, cf. infra

THEORIE DES MOTIFS ET INTERPRI;`TATION GEOME`IRIQUE

43

Supposons d'abord que f ait bonne reduction en p, i.e. se prolonge en un morphisme projectif lisse au-dessus du localise en p de l'anneau des entiers de K (on fixe un tel prolongement). En particulier, la fibre Xo a bonne reduction Yo en p. Alors 1'espace des sections de HdR(Xan/San)° au voisinage de 0 s'identifie a 1'espace de cohomologie cristalline Hq is(X0/W(lFP)) ® C. d'apres [B083). (Si Xo n'a pas bonne reduction, on conjecture generalement que X0 a au moins reduction potentiellement semi-stable, et it convient de remplacer alors cohomologie cristalline par cohomologie cristalline logarithmique a la Hyodo-Kato). Le choix d'une base de cet espace de cohomologie cristalline donne lieu a une matrice de "periodes p-adiques" S2p(x), lice aux evaluationsp-adiques de Y(x) par la formule

Y(x) =

(*)p

QP(x)Qp(0)-1.

Toutefois pour donner un sens geometrique aux considerations de dependance sur Q entre periodes p-adiques, ou entre les evaluations des coefficients de Y(x) en un point E K p-adiquement proche de 0, it faudrait savoir dormer une signification geometrique au Q-espace engendre par la base choisie de la cohomologie cristalline, et aux Q-espaces analogues attaches aux puissances de Yo (Kanneth). On voudrait en particulier, en liaison avec la situation predite par la conjecture de Grothendieck, que la classe cristalline des reductions des cycles algebriques sur un produit de

puissances des fibres X et X0 soient dans ces Q-espaces. On est donc conduit au probleme suivant : PROBLEME. - Construire une Q-structure dans la cohomologie cristalline

(tensorisee avec Cp) des puissances de X0, de telle sorte que les cycles algebriques soient rationnels relativement a cette Q-structure.

Ce qui revient essentiellement a construire une cohomologie a coefficients dans 0 sur la categoric des varietes sommes disjointes de puissances de X0, et un isomorphisme de comparaison entre cette cohomologie et la cohomologie cristalline (tensorisees avec Cr). Les motifs pointent a l'horizon...

En attachant ensuite a toutefibre (ou tout produit fini de fibres) X", pour x E Cp assez proche de 0, le Q-espace de cohomologie (problematique)

de sa reduction, on obtiendrait un espace HB(Xy,Q) analogue a la cohomologie de Betti a coefficients dans Q dans la situation complexes en designant par U une petite boule autour de x, on aurait :

HB(Xx,Q) Q C =

Hdn(Xan xSa U/U)°

HdR(X/S) ®Os °U

44

Y. ANDRE

Remarque : une fausse piste, le foncteur mysterieux. Les travaux de J.-M. Fontaine, W. Messing et G. Faltings ont permis de construire ce que Grothendieck appelait le mysterieux foncteur reliant cohomologie etale et cohomologie cristalline. En fait, la construction vaut dans le cas relatif, du moins sous une hypothese de bonne reduction, et peut se presenter ainsi [F89] : i1 existe un anneau differentiel filtre BdR, et un isomorphisme de BdR-modules a connexion filtres : rHdR(X/S) ®FOs BdR

H t(XV,Qp) ®Q BdR

D'autre part, tout plongement de Q, dans C induit un isomorphisme equivariant sous la monodromie :

Ht (Xi, Q) = HB

sjn S, Q) ®QQ (§: revetement universel de Sr).

Une base de HB(Xp xs,.n S,Q) etant choisie, l'isomorphisme compose donnerait naissance a une matrice de "periodes" BdR liee a Y(x) par la relation Y(x) = QBdR C, ofi C est une matrice inversible a coefficients constants (au sens differentiel) dans BdR. Mais le fait que BdR contienne beaucoup trop de constantes differentielles (non reduites aux "scalaires") voue cette tentative d'interpretation geometrique des valeurs p-adiques de Y(x), proposee dans l'introduction de [A891, a 1'echec. Signalons encore que, dans le cas constant, l'analogue de la conjecture de Grothendieck pour la matrice de "periodes" 51BdR est faux [A90].

4. - Le cas des varietes abeliennes degenerantes [A901 C'est un cas qui deborde le cadre precedent - 0 est une singularite -, mass ofi l'analogue (partiel) du programme precedent peut etre accompli plus aisement. Une variete abelienne A sur une extension finie du corps complet K((x)) ou Q, sera dite degenerante si la composante neutre de la fibre speciale de son modele de Neron est un tore deploye T = G. Ces varietes sont bien connues en geometrie rigide, puisque la variete rigide associee A'r'9 est un tore analytique.

a) Rappelons que si A est une variete abelienne complexe de dimension g, alors A(C) = C9/L, ofi L est un reseau de rang 2g, et on a un accouplement non-degenere f,? : L ® HHR(A) -> C. A 1'aide de la fonction exp(2iir), on peut encore representer A(C) sous la forme T(IC)/M, ofi M est un reseau de rang g, et T un tore de dimension g (parametrisation de Jacobi). Soit M' le groupe Hom(Gm, T), M'V son dual. Alors L s'inscrit dans une suite exacte (***)

0 -> 2i7rM" ---+ L ---+ M

0,

THE`ORIE DES MOTIFS ET INTERPRETATION GEOME`7RIQUE

45

scindee par le choix d'une determination du logarithme.

Le morphisme M y T se decrit par une application bilineaire M x M' -> C*; toute polarisation de A donne lieu a une "isogenie" M -+ M', d'ou une application q : MOM -> C*. Il s'avere que - log Iql est un produit scalaire sur MR. b) Soit maintenant A une variete abelienne degenerante de dimen-

sion g sur C. Alors on peut representer A(Cp) sous la forme T(Cp)/M, avec M. T comme ci-dessus (parametrisation de Tate). Toute polarisation de A donne lieu comme precedemment a une application q : M ® M --+ C;, et it s'avere que - log I qI P est un produit scalaire. On a M = H1(Arig, Z), M' = H1((Aduat)rig Z) et M71 se plonge canoniquement dans le dual du module de Tate de A. On peut pousser plus loin l'analogie avec la situation complexe : dans [A90], on utilise le semi-endomorphisme de Frobenius sur la cohomologie cristalline d'un certain 1-motif fonctoriellement attache a A pour construire

un reseau L de rang 2g, sur lequel agissent les endomorphismes de A, et qui s'inscrit dans une suite exacte (***)p

0 -* (2ilr)pM"

L -> M - 0,

scindee par le choix d'une determination du logarithme p-adique; ici (2iir)p

designe un generateur du Zp module des racines p-primaires de l'unite dans C* (c'est l'analogue p-adique de 2iir). On construit en outre un accouplement canonique non-degenere : ?: L ® HdR(A) -i (Cp[(2i7r)p]

c) Soit f : A -> S\{so} un schema abelien de dimension relative g sur le complementaire d'un K-point lisse so dans une courbe affine7 S sur K, et soft x une coordonnee locale en so. On suppose que la composante neutre de la fibre en x = 0 du module de Neron est un tore deploye T sur K. Il en est alors de meme du schema abelien dual, avec un tore V. Posons M = Hom(G.m,, T'), M' = Hom(G,,,,, T).

Fixons K y C. Alors le sous-faisceau de R1 flan * Z constant au voisinage de so s'identifie a 2iirM"; sa fibre en tout s E S(C), s # so, s'identifie au reseau 2iirM," de rang g associe en a) a la variete abelienne A,,; de

meme M s'identifie au reseau M3 de rang g associe en a) a A, D'autre part, le choix d'une determination du logarithme, c'est-a-dire essentiellement d'un secteur U d'angle 2ir d'une petite boule autour de x = 0, identifie R1 ffan * Z I U a L = 2i7rM" (D M; sa fibre en touts E S(C), s # so, s'identifie au reseau L3 = H1(Xn, Z) de rang 2g. 7

Pas necessairement un ouvert de la droite, nous devions des notations du § 2.

46

Y. ANDRE`

THI;OREME 3. - Pour tout -y E 2i7rM'", la serie de Taylor yy(x) de aim fys w(s) est une G fonction. Pour tout y E L, - j w(s) s'ecrit zy(x(s))log a-' (s)n" + y-y(x(s)), oil z.y(x) et yy(x) sont des G-fonctions, a.y(=- K,nyEZ. Fixons d'autre part un premier p et K - Cr tels que le schema abelien f ait bonne reduction en p. Pour tout s E S(C) asset proche mais distinct de so, M, resp. M', s'identifie au reseau MS resp. Ms de rang g associe en b) a la variete abelienne degenerante As. Le choix d'une determination du logarithme p-adique (e.g. via loge p = 0) identifie L = (2iir)pM" (D M au reseau LS de rang 2g associe en b) a A,. De plus, l'accouplement p-adique f,? se prolonge en un accouplement horizontal

?

: L ® HdR(Au/U) -# OU[(2i7r)r] (U: petite boule autour de s)

.

On a alors, en parfaite analogie avec le theoreme 3 (et avec les memes notations) : THeOREME 4. - Pour tout y E (2i7r)pM'",

j

rye

w(s) est l'eualuation

p-adique de la G-fonction yy(x) en x(s). logv(2i Pour tout y E L, on a (2i,ir)p j w(s) = zy(x(s)) )y)n" + yy(x(s)). Remarques : i) Dans cc dernier theoreme, on peut aussi remplacer l'accouplement f,? par celui de Fontaine-Messing, a condition de remplacer le logarithme p-adique usuel par sa version BdR introduite par Fontaine. Cela fournit donc un "pont" entre les theories p-adiques de Dwork et de Fontaine. ii) Dans le cas ou la variete abelienne degenerante sur CP est la jacobienne d'une courbe de Mumford, le reseau L mentionne au point b) est etroitement lie a un reseau de rang 2g construit anterieurement, et par de tout autres methodes (fonctions theta), par L. Gerritzen [Ge86]. [Le reseau considers dans [A901, note 7Gµ4 ® ... ® 7Lµ'9 ® 7Gmi ® . ® 7Gmy, est lie au reseau 7Gdu1 /ul ®... ®Zdu9 /u9 ®Z/3i ®... ®7G/jy de [Ge86] par les formules :

,3i = m', dud/uj = (2iir)Pti,'9 + E(loggjj)mi, ou les qij : M x M _ C,

(j = 1,.. , g) sont les facteurs d'automorphie attaches a uti]. .

iii) On peut completer le theoreme 3 en montrant 1'existence d'un

entier N > 0 tel que pour tout -y

E

2iirM" (resp. M), yy(Nx)

(resp. exp(yy(Nx)/(Nx))) soit a coefficients entiers algebriques.

5. - Le cas des courbes elliptiques avec bonne reduction [A] Revenons a l'hypothese de bonne reduction et examinons le probleme du § 3 dans le cas non trivial le plus simple, celui d'une courbe elliptique X sur le corps de nombres K.

THEORIE DES MOTIFS ET INTERPRETATION GEOME`IRIQUE

47

a) Le cas oil la reduction k est ordinaire Dans ce cas, on sait que admet un releve canonique Xcan/U de type Xj;

P

CM. On a : HHR(X) ®CP = Hc!ras(X0/W(FP)) ®CP = HIR(Xcan) ®CP, et on montre que HB(X, Q) := HHR(X'") repond au probleme du §3.

b) Reduction supersinguliere

Ici D := End Xf P est un ordre maximal d'une algebre de quaternions sur Q, donc D-. = M2(Q). Soient E C DQ un corps quadratique (necessairement imaginaire et deployant), et v un vecteur propre dans Hcris (Xo/W (PP)) ® CP pour 1'action de E. Alors Diq D. v est un Q-espace de dimension 2, contenant un E-vecteur propre u lineairement independant de v sur (CP. Normalisant la base (v, u) par une homothetie de sorte que v A it soit le generateur canonique de Hc2ris (Xo/W (IFP)), on obtient une ,,i.(Y0/W(FP))0CP structure (elle ne depend pas des choix auxilliaires de E et -), qui repond au probleme

du §3.

Remarque : it est probable que l'isomorphisme HHR(X) ® CP HI (XcP, Q) ®C, est toujours transcendant. Du reste, pour X de type CM, on peut deduire de [0901 une expression des periodes p-adiques (i.e. des coefficients de la matrice de cet isomorphisme relativement a des bases de HdR(X) et H' (X(cP, 0) resp.) en termes de valeurs de la fonction FP en des rationnels. c) La fonctoriatite de HB (X1, Q) permet de demontrer simplement des relations entre valeurs p-adiques de la matrice Y(x) solution de 1'equation de Picard-Fuchs relative a une famille de courbes elliptiques XIS. Par exemple, supposons que la fibre en 0 ait multiplication complexe par Q( ), et la fibre en E K par Q( -d') # Q( ). Choisissons

une base symplectique wl, w2 de sections de HdR(X/S) de telle sorte que wl soit dans le cran 1 de la filtration de Hodge (i.e. la classe d'une forme differentielle relative reguliere), et que w2 (0) soit propre pour 1'action de e= E End Xo sur HHR(Xo) ; alors wi (0), resp. wl (e), est propre sons 1'action de e, resp. de e' = -d' E End Xe, et it y a un unique element or de K tel que w2(i;) +QW2(Z;) soit propre sous 1'action de e'.

Soit p un nombre premier non ramifie dans Q(v) et v une place de K divisant p (correspondant a un ensemble de plongements "equivalents" K - CP) tel que l; soit p-adiquement proche de 0 (de sorte que Xo et Xg ont meme reduction en p, necessairement supersinguliere). Examinons les images de e et e' dans End Xo = End Xg : ee' + e'e commute a e et e',

donc est un entier mv. On a (ee' - e'e)2 = my - 4dd'. Comme ee' - e'e

48

Y. ANDRE

anticommute a e, et nest donc pas un entier, on a Im, I < 2 dd'. [Remarque : dans le cas d'une place archimedienne plutot que p-adique,

un argument analogue vaut, en considerant les images de e et e' dans End HB (Xoc, Q) identifie a HB Q) par le prolongement analytique dans une petite boule de centre 01. De plus ee' - e'e agit trivialement sur les formes differentielles invariantes, en caracterisant p ; donc p divise my - 4dd'. Soit de nouveau (v, ii) Q) verifiant e v = la base symplectique de HB (Xo,p, Q) = HB

il. On a alors e'

v, e

v + d iv E Q u. Rela-

tivement aux bases (w1(0), w2 (0)) (resp. (w1 (0, w2 (f ))) et (i7, ii), la matrice

de periodes SZp(0) s'ecrit sous la forme 1

0

1/ I

Cul 0

(zu

0 I M avec M E M2(Q( ru1

0 1 ), ,

resp. SZp() s'ecrit

-d')) [dans le cas

d'une place v archimedienne, it conviendrait de multiplier les W, za' en premiere ligne par 2i7r]. L'evaluation v-adique a done pour coefficients Y1i (e) = M11 '/'a, Y12 M12zJ'zJ, Y21(S) = -Miivw'/w + M21/tv'u', Y22(0 = M22=/VU' - M120'W W. Parce que M diagonalise faction de e' dans HB(XEcp,Q), on obtient la relation

M11M22 + M12M21 = -mv/2 dd'; et comme d'autre part e' 6+ 2d v

'-d. est collineaire a u, on a Mil = 11 n'est pas difficile de conclure alors a la relation suivante8 entre evaluations v-adiques des coefficients de Y(x) en : (m,+2 ou my est un entier borne en valeur absolue par 2 dd' et tel que p(v) I m + 2 dd' si v est ultrametrique de caracteristique residuelle p(v). 11 y a en fait une telle relation pour chaque place v de K telle que Y(x) converge

v-adiquement en . Ces exemples en faveur de 1'existence des Q-structures de Betti nous encouragent a explorer les dimensions superieures.

6. - Motifs Grosso modo, la theorie des motifs est a la geometrie algebrique ce que la theorie de Galois est a l'algebre commutative.

Pour contourner 1'excessive complexite de la categorie des varietes projectives lisses sur un corps k fixe, it est souvent necessaire d'enrichir 8 Beukers a decouvert ces relations simultanement et independamment [Be931 (sans toutefois en donner la forme explicite), par une methode de relevement d'isogenies. 11 a en outre teste plusieurs exemples sur ordinateur.

TH9ORIE DES MOTIFS ET INTERPRETA77ON G8OMETRIQUE

49

ses morphismes. Dans les problemes de classification, par exemple, it est commun de considerer non seulement les applications regulieres, mais aussi les applications rationnelles. Il arrive d'avoir meme a considerer toutes les correspondances algebriques; A. Well en a eloquemment montre l'interet en prouvant "l'hypothese de Riemann" pour les courbes sur un corps fini. II y a une trentaine d'annees, Grothendieck a eu l'idee flamboyante que la categorie des varietes projectives lisses sur k C C, avec pour morphismes les correspondances algebriques a coefficients rationnels modulo 1'equivalence numerique, est "essentiellement"9 equivalente a la categorie des representations semisimples de dimension finie sur Q d'un Q-groupe pro-algebrique, - 1'equivalence transformant produit cartesien en produit tensoriel -; et que pour k de caracteristique non nulle, la situation est similaire, quitte a remplacer "groupe" par "gerbe".

Definissons un motif comme un triplet (Z, n, e) (note generalement eI1(Z)(n)), ou Zest un k-schema projectif lisse, nun entier, et e : Z E) ) Z une correspondance algebrique de degre 0 modulo l'equivalence numerique, verifiant e o e = e (idempotence). Un morphisme de motifs er3(Z)(n) --> e'(j(Z')(n') est une correspondance algebrique de degre n' - n modulo 1'equivalence numerique de la

formee'ofoe:Z-o-) Z'.

[Rappelons qu'une correspondance algebrique Z Z' de degre r (a coefficients rationnels) est une combinaison Q-lineaire de sous-schemas integres de Z x Z' de codimension r + dim Z, ou une classe d'equivalence d'une telle combinaison ; et que, pour toute equivalence "adequate", les correspondances algebriques se composent par la formule go f = pzZ * (p`az' f

pa'z"9)' les degres s'additionnant]. U. Jannsen a demontre que les motifs forment une categorie Q--lineaire abeiienne semisimple [J921. On la munit du produit tensoriel e(7(Z)(n) 0 e'I)(Z')(n') = (e x e')rj(Z x Z')(n + n'), et on la note Mk. Lorsque e = id, n = 0, on note le motif simplement b(Z); le foncteur contravariant de cohomologie motivique associe fj(Z) a Z. Faisons provisoirement l'hypothese suivante (c'est l'une des "conjectures standard" de Grothendieck) :

(N) pour chacune des theories de cohomologies classique H' (celles qui portent un nom), 1'equivalence homologique coincide avec 1'equivalence numerique.

Il en decoule que H' (en particulier, chaque cohomologie etale B-adique

car k) se factorise par la cohomologie motivique Q-lineaire 4, pour £ comme foncteur sur la categorie des k-schemas projectifs lisses. 9 stricto sense it faut ajouter formellent les noyaux des projecteurs.

50

Y. ANDRE

Il en decoule aussi que 1isomorphisme de Lefschetz fort" LZ i H2d-i(Z)(d - i) (donne par le cup-produit itere avec la classe Ht(Z) -+ d'une section hyperplane du k-schema projectif Z de dimension d [D80] [KM741) provient d'un isomorphisme de motifs; en particulier son inverse est algebrique, ce qui entraine notoirement : (C) les projecteurs de Kiinneth sont donnes par des correspondances algebriques. Avec (C), la ®-categorie des motifs Mk se trouve graduee1°, et la theorie "tannakienne", initiee par Grothendieck a ce propos, montre alors que Mk est ®-equivalente a la categoric des representations de dimension finie

d'une gerbe pro-algebrique sur Q (on dit que Mk est tannakienne sur Q). Tout foncteur fibre" sur Mk s'interprete comme une cohomologie; et reciproquement si 1'equivalence homologique coincide avec 1'equivalence numerique. Sous (N), et si de plus k C C, la cohomologie de Betti realise en fait une

®-equivalence entre Mk et la categorie des representations de dimension finie sur Q d'un Q-groupe pro-reductif G,,,,ot, appele groupe de Galois motivique absolu. L'image Gmot(Z) de G,,,,ot dans GL(H'(Z)) est le sousgroupe Zariski ferme qui fixe les classes algebriques parmi les tenseurs mixtes sur H(Z); G,,,ot s'exprime comme limite projective de ces groupes de Galois motiviques G,,,,ot(Z).

La conjecture (P) de Grothendieck sur les periodes exprime que l'isomorphisme canonique entre les foncteurs fibres HdR 0 C et HB 0 C est "generique" parmi tous les isomorphismes possibles.

7. - Realisation de Betti; periodes p-adiques [A] a) Reprenons les notations du § 3: f : X -* S est un morphisme projectif lisse ayant bonne_reduction en la place v de K induite par un plongement

fixe K y Cp, Xo est la reduction de la fibre Xo ... et considerons la categoric tensorielle de motifs engendree par 15(Xok) pour k = ]FP = cloture X011. Pour les k varietes, algebrique du corps residuel de K en v ; on la note 1'enonce (C) est vrai [KM741, done Xo est tannakienne sur Q [D901 [J92]. On en deduit : ThEOREME 5. - It existe un corps de nombres F C_ Cp, et une cohomologie

F) a coefficients dans F pour les schemas sommes disjointes de puissances de Xok (d'od un F-groupe de Galois motivique G,,,,ot(Xok)). En 10

on modifie alors la contrainte de commutativite evidente pour 0 par un signe

obeissant a la regle de Koszul. 11 =®_foncteur exact a valeurs espaces vectoriels.

THEORIE DES MOTIFS ET INTERPR TATION GEOM TRIQUE

51

outre, si l'equivalence homologique cristalline coincide avec l'equivalence numerique sur ces schemas, it existe un isomorphisme de cohomologies

C. = H'(., F) ® Ci,, bien defini a l'action pres de Gmot(Xok)(Cr)

Pour tout x E Cp assez proche de 0 (de sorte que X. = Xok), on obtient alors un foncteur fibre sur la categorie tensorielle X O engendree par 15 (X')

en posant HB(Xx ,F) := H'(Xok,F); nous 1'appellerons realisation de Betti. Il remplit le programme du § 3. La matrice de "periodes p-adiques" fl (x), transposee de la matrice de l'isomorphisme compose HHR(Xx) 0 cCv = H9is (Xok /W (k)) 0 (Cp = HB (XX, F) 0 Cr relativement a la base

w1 ... w de I,HdR(X/S) et a une base de HB(Xy, F) est solution de 1'equation de Picard-Fuchs (*) lorsque x vane.

Remarques : 1) II resulte des conjectures de Tate que G,,,,ot(Xok) devrait etre un groupe diagonalisable definissable sur Q, ce qui est connu pour les varietes abeliennes. On pourrait alors prendre pour F un corps deployant pour ses commutants dans diverses representations. 2) On peut oter un degre de liberte sur le choix de l'isomorphisme de comparaison t en normalisant le_determinant. Par exemple, pour les courbes supersingulieres, ou G,,,,ot (Xok) = G,,,,F, t normalise est unique : la Q-structure H' (Xok , F) sur Hcris (Xok /W (k)) 0 Cp est celle definie au § 5b.

3) Par analogie avec la situation complexe, j'aimerais suggerer la conjecture (P)P pour x = e Q (p-adiquement proche de 0), it existe un isomorphisme

de comparaison normalise t (comme ci-dessus), tel que toute relation de dependance algebrique sur Q entre les coefficients de la matrice de periodes provienne d'un cycle algebrique sur une puissance de Xg. p-adiques Op

Appliquee au produit Xg x Xo, cette conjecture permettrait d'abandonner 1'hypothese que la place v figurant dans le "principe" de l'introduction est archimedienne (on pourrait echanger les roles de v et w dans le raisonnement 7b). Toutefois, dans le cas de connexions de Gauss-Manin non semisimples (eventuellement extensions de connexions de Gauss-Manin semisimples), des relations non triviales entre evaluations p-adiques peuvent apparaitre

sans contrepartie archimedienne : c'est un "artefact" du au logarithme, illustre par 1'exemple suivant (B. Dwork) pour y(x) = log(1 - x), Z; = 1 - e2ir/7 on a y(1;)7 = 0 mais y(Z;), 0 (quoique les periodes i°g 1-C (1-E) 1 Q1,2 = 2i et 1 (S27)12 2 = ,og du 1-motif [Z -> G,,,,] soient 2iIr :

rationnelles).

52

Y. ANDR$

La conjecture ci-dessus rappelle celle de Leopoldt sur le regulateur padique. Soient d'ailleurs E un corps de nombres totalement reel, T le noyau de la norme NE/Q dans le tore rationnel attach& a E, et M un reseau facteur direct et d'indice fins dans le groupe des unites de E. Considerons le 1-motif associe au plongement canonique M - T. Alors pour un choix naturel des bases de cohomollogie, la matrice SZ resp.1 , attachee a cc 1-motif est de la

forme 12i7I

R

I , resp.

(2Z PI RP) , ou R, resp. Rp, est la matrice de

logarithmes dont le determinant donne le regulateur (resp.

p-adique).

b) Dans cette section plus descriptive que demonstrative, nous voici enfin en mesure de justifier le principe des relations globales enonce dans l'introduction. Supposons d'abord que 1'equation differentielle en question soit une equation de Picard-Fuchs (et non un quelconque facteur), attachee a un morphisme X -+ S comme au § 2. Il est alors equivalent de raisonner sur les yPl (x) ou sur les coefficients de la matrice Y(x). Considerons une relation exceptionnelle 0 de dependance algebrique sur K entre les evaluations v-adiques Yj () ("exceptionnelle" voulant dire : ne provenant pas par specialisation x - d'une relation de dependance algebrique a coefficients dans K(x) entre les YZj (x)). Nous allons montrer, en nous appuyant sur les conjectures (P) et (N) de Grothendieck, qu'il y a aussi une relation exceptionnelle Q,,, (' ()),,,) = 0 entre les evaluations w-adiques sauf peut-etre dans une situation

exceptionnelle decrite ci-dessous et peu plausible (qui ne se presente d'ailleurs pas dans le cas particulierement interessant ou la composante neutre du groupe de Galois motivique de X0 est un tore). D'apres la formule (**) et la conjecture de Grothendieck (P), une telle relation (archimedienne) Q (YYj 0 provient d'un cycle algebrique 9

sur un produit X' x X'; plus precisement, elle s'obtient par elimination a partir des relations entre les periodes 52,,,tij(0) que donne la compatibilite de 9 dans HdR(Xg)®"` ® HHR(XO)®"i et HB(Xgc,,,Q)®m 0 HB(Xo(c,,,Q)®m' respectivement.

Soit w une autre place de K ; si w est archimedienne, on supposera seulement que !; est dans le disque de convergence w-adique de Y(x) ; si w est p-adique, on va devoir supposer que X -> S ait bonne reduction en w, et que t;' soit suffisamment proche de 0 pour que Xg et Xo aient meme reduction Xo. En ecrivant derechef la compatibilite des classes de 9 dans et F)®" OHB(Xo(Cw, F) ®"t resp., on HdR(Xf)®'n (DHdR(Xo)®m'

obtient des relations entre les periodes Il,,ij (£), SZ,,,,zj (0) (quitte a remplacer

K par une extension finie, on peut supposer que le corps des coefficients F de chaque cohomologie de Betti "w-adique" intervenant - 11 y en a un

THEORIE DES MOTIFS ET INTERPRtrATION GE`OME`7RIQUE

53

nombre fini - est contenu dans K). Le probleme est de montrer qu'on peut, grace a ces relations, eliminer entre elles ces periodes suffisamment pour obtenir une relation entre les composantes de Y(e)w = Qw(0Q-(0)-1 :elimination effective s'avere tres difficile au-delft du cas des courbes elliptiques (§ 5c), et un simple decompte de degres de transcendance ne suffit pas. Il faut recourir a un argument geometrique d'espaces homogenes, comme suit. Groupes enjeu : posons VV = HdqR(Xe), Vo = HHR(Xo), n = dim Ve =

dim V0. Sur ces K-espaces vectoriels agissent lineairement le groupe de Galois motivique Ge,o := G,not(1jq(Xg) ED q(Xo)) (version De Rham), et le groupe Gxye := fixateur dans GL(VV ® Vo) des tenseurs mixtes qui proviennent par specialisation X --> de cycles algebriques sur les puissances de Xx x Xo (en notant x le point generique de S).

On a Ge,o C Gxye, et Gx,e est reductif (c'est d'ailleurs une "forme tordue" de G,not(ljq(Xx) Considerons l'application birationnelle d'espaces affines 0: Hom(Vf, Kn) ED Hom(Vo, Kn) -

Hom(VV, Vo) ED Hom(Vo, K')

donnee par (he, ho) H (ho 1 o hg, ho). Elle respecte 1'action de Gx-e (action

triviale sur K"`). Notons p1, p2, les projections (Gyre-equivariantes) sur chacun des facteurs Hom(VV, Vo), Hom(Vo, Ku). Alors p2(Gx,e) s'identifie au groupe de Galois motivique Go := G,not(Clq(Xo)), et pi(Gxye) contient un sous-groupe ferme isomorphe au groupe de Galois differentiel "pointe en c", i.e. au fixateur Gdi f,e C GL(VV) des tenseurs mixtes qui proviennent par

specialisation x - l de tenseurs horizontaux sous la connexion de GaussManin (action triviale sur Vo). Ce qui fait apparaitre Ge,o et Gdi f,e x 1 comme

sous-groupes Zariski-fermes de p1(Gx-e) x Go, et on a p2(Ge,o) = Go. [En fait, on peut montrer que la suite 1 - Gdif,e x 1-> Gx-e P2 , Go - 1 est exacte - cela decoule directement du th. 8 ci-dessous; en admettant ce resultat, on obtient une suite exacte de groupes reductifs

1 - Gdif,e

p1(Gx-,e) - Go p2 1,

ou Go est le quotient de Go par p2 (ker p1) ; en particulier Gdif,e p1(Ge,o) _

pi(G)1.

Espaces homogenes enjeu : soit 2,) C Hom(VV, Vo) la sous-variete define

par les relations polynomiales entre coefficients de tY(x) specialisees en x = (ceci prend un sens parce que n'est pas une singularite), avec

54

Y. ANDRE

l'identification Vo = (FHdR(X/S) ® K[[x]])V. D'apres la theorie de Galois A differentielle, 23 est un espace homogene principal a droite sous fortiori, 23 est contenue dans un espace homogene sous pi(Gx_.,g). Introduisons ensuiteT2, C_ Hom(VV, K')®Hom(Vo, K'): K-adherence de Zariski de (tS2v( ) , t5) (0)), avec l'identification K' = HI (Xgc,,, Q) K = HB (Xoc,, , Q) ® K donnee par le prolongement horizontal ; notation analogue: T,, (en remplacant Q par F). Alors 0 induit un isomorphisme de

rv ou w sur une sous-variete de 23 x p2it suit que pi est la K-adherence de Zariski de Y(i;),,,,,,. Quitte a remplacer K par une extension finie, on peut supposer que possede un point rationnel yv,w sur K, dont on note H,,,,,, le groupe d'isotropie dans p1(Gx-g).

La forme precise de la conjecture (P) stipule que T (et done q

,3v))

est l'adherence12 d'un espace homogene principal sous Gg,o. Ceci entraine

que p1 (0(T,)) est l'adherence de l'orbite de yv sous pi(Ge,o), avec pour groupe d'isotropie Hv n pi(Ge,o). [On deduit de la que 23 est stable sous puis que p2 induit un isomorphisme de Hv Gdi f,g pi pi sur G. En particulier Hv est reductif, de meme que H,,, qui lui est conjugue par un element de Gdif,g(K)]. Quant aq3w, it est clair qu'elle est contenue dans l'adherence de l'orbite de yw sous L'hypothese qu'il existe une relation de dependance algebrique sur K entre les evaluations v-adiques Yi7 ne provenant pas par specialisation x -* i; d'une relation de dependance algebrique a coefficients dans K(x)

entre les Yij(x), c'est-a-dire que p1(o(pv))

23, entraine done que

1'adherence de l'orbite de yv sous pi (GC,o) est distincte de 2 (c'est-a-dire que 1'adherence de Hv pi (Gg,o) dans pi (Gx..) ne contient pas Gdif,g) Alors de deux choses Tune : a) p1(0(pw)) 54 ou bien 0) l'orbite de yv, sous pi est dense dans 2), mats distincte de 23 (puisque l'orbite de yv dans 2) est de dimension moindre). Ce cas ne me parait pas plausible, mais je n'ai pas su 1'ecarter a priori [en utilisant (****), on voit tout de suite que le cas Q) signifie que Hw pi (Ge,o) contient un ouvert dense de pi mais est distinct de pi (G1yg) ; cela ne se produit pas si Hv, n p1(GC,o) est reductif, par exemple si la composante neutre de Go, et done celle de H,,,, est un tore]. Hors du cas /3), cette analyse justifie le princppe de l'introduction dans le cas des solutions de Gauss-Manin, ou plus generalement d'un facteur motive (i.e. decoupe par une correspondance algebrique relative), (sous 12

ce serait un espace principal homogene, et non seulement l'adherence d'un tel, si on avait pris la precaution de remplacer C)q(Xx) par Cl4(X,;) ® Q(1), mats peu importe.

THEORIE DES MOTIFS ET INTERPRETATION GEOME` RIQUE

55

reserve des conjectures de Grothendieck (P) et (N), et d'une condition de bonne reduction) ; pour un quelconque facteur de Gauss-Manin, on utilise le corollaire du theoreme 8 ci-dessous, d'ou it resulte que la composante isotypique de ce facteur est motivee, du moins si G,g est connexe (le cas non connexe n'est guere plus difficile). Enfin, en 1'absence de la condition de bonne reduction (ou dans le cas d'extensions non-triviales de facteurs de Gauss-Manin), on peut esperer qu'une theorie convenable des motifs mixtes - a construire - permettrait d'etendre nos arguments. Les resultats du § 4 vont dans ce sens...

8. - Comment eviter les conjectures standard? [A931 a) Revenons au cadre general du § 6. Malheureusement, 1'hypothese (N) ne semble pas d'acces facile, surtout en caracteristique positivel3 (ou elle ne parait pas meme connue pour les varietes abeliennes). On a vu que cette hypothese entraine l'algebricite de l'involution de Lefschetz, i.e. l'involution * de H'(Z) donnee14 en chaque degre i par l'isomorphisme de Lefschetz LZ z, 0 < i < 2d. L'idee directrice de [A931, pour eviter (N), est d'introduire formellement cette involution * parmi les morphisme motives. Soit V une sous-categorie pleine, stable par produits, sommes disjointes

et composantes connexes, de la categorie des k-schemas Z projectifs et lisses. DEFINITION. - Un cycle motive sur Z est un element de H' (Z) de la forme

pz. (a *Q), oet a et Q sont des Q-cycles algebriques sur Z x Z', avec (' arbitraire dans V, pz designant la projection sur Z, et ou. * est relative a une quelconque polarisation de Z x Z' de type "produit".

Les cycles motives forment un Q-espace, gradue par la graduation moite sur HP"'(Z), contenant les cycles algebriques. Ces espaces ne dependent pas de la cohomologie classique choisie H' (a isomorphisme canonique pres), du moins si k est de caracteristique nulle. On montre que les operations usuelles de la theorie des cycles algebriques s'etendent aux cycles motives. En particulier, les correspondances motivees se composent. On peut alors definir sur les cycles motives l'analogue de 1'equivalence numerique. L'analyse de cette equivalence conduit a introduire le sous-corps Q de F engendre par les valeurs fZ a *a, oil a decrit les cycles algebriques sur Z, et Z parcourt V, et a considerer les cycles et 13

Voir toutefols [A931 § 2 pour une tentative dans ce sens; signalons d'ailleurs que (N) entraine la semi-simplicite de Frobenius. 14 On neglige ici la torsion de Tate.

56

Y. ANDRE

correspondances motives a coefficients dans Q (si k est de caracteristique nulle, ou si * est algebrique, on a bien entendu Q = Q).

On definit alors la categorie des motifs modeles sur V, pour tout schema Z dans V, comme au § 6, mais en remplacant Q-correspondances algebriques par correspondances motivees a coefficients dans Q (modulo M. On la munit de 0 et la note M(V) 15 THEOREME 6. - La ®-categorie Ail (V) est tannakienne sur Q, graduee et semi-simple; en outre, si car k = 0, alors on a Q = Q, - est t'identite, et toute cohomologie classique H'' sefactorise a travers la cohomologie motivique 1j, donnant naissance a un foncteur fibre gradue sur M (V).

(On obtient alors, pour k C C, une definition non conjecturale du groupe de Galois motivique.)

b) Ce theoreme permet de specialiser les motifs en inegale caracteristique. Soient K un corps de nombres, v une place p-adique de K et k la cloture algebrique du corps residuel. Soient V une categorie stable par produits fibres, sommes disjointes et composantes connexes, de schemas Z projectifs et lisses sur 1'anneau des v-entiers de K, VK (resp. Vk) la categorie des fibres generiques (resp. speciales geometriques). Alors on a un foncteur de specialisation sp : M(VK) -> MQ?k). Pour associer a une cohomologie classique sur Vk, disons la cohomologie cristalline, un foncteur fibre sur M(Vk), it faut faire agir les correspondances motivees modulo - sur les espaces de cohomologie cristalline

en respectant les fonctorialites idoines, c'est-a-dire relever de maniere "coherente" les classes mod. - en de vraies correspondances motivees. C'est possible, du moins lorsque V est "engendree" par un seul schema X de fibre generique XK connexe [A93][A] :

TxfoRkME 7. - II existe un foncteur fibre gradue 4HHri8 sur M(X®) a

valeurs dans les W(k)Q-espaces vectoriels, tel que pour tout m, Hcris(X= /W(k))Q, et tel que l'isomorphisme de BerthelotOgus induise un isomorphisme de foncteurs fibres gradues HHR 0 CP Hcris ®Cp) o sp sur M(-K)

Dans le cas d'une famille projective lisse X -> S avec bonne reduction

en v comme au § 2, on peut appliquer ce resultat en prenant pour X un modele de Xe, pour chaque T; E S(K) p-adiquement proche de 0, separement. La compatibilite des constructions pour plusieurs est 0 etant fixe, j'ignore par exemple si la classe cristalline problematique : de la specialisation de tout dans H,,,,is(Xok) = Hcris(X= ) 15

Voir note (8).

THEORIE DES MOTIFS ET INTERPRtFATION GEOME7RIQUE

57

cycle motive sur une puissance de Xo coincide avec la classe en theorie H,,*ris ; c'est toutefois vrai pour les intersections de classes de diviseurs sans hypothese supplementaire sur Xk, et pour tout cycle motive si Xk est une variete abelienne. Cela permet d'appliquer la construction du § 7 dans de nombreuses situations en se passant de l'hypothese (N), du moans lorsque Q = Q.

c) Citons quelques applications du theoreme 6 dans le cas d'un corps de base k C_ C. Soient S un schema reduit connexe de type fini sur C, f : X -> S un morphisme projectif et lisse, et 0 une section globale du faisceau R2P f,,Q(p). Grothendieck a conjecture que si la fibre 03 est algebrique en un point

s c S(C), it en est de meme en tout point [G661. En s'appuyant sur le "theoreme de la partie fixe" de Deligne [D71 ] § 4, on montre :

Tii9OREME 8. - Si la fibre 0, est motivee en un point s E S(C), it en est de meme en tout point (pour un choix convenable de V). Avec les notations du § 7b, on en deduit : COROLLAIRE. - Lie Gdj f,g est un ideal de Lie G,g ; c'est aussi un.sousespace motive de End HdR(Xe).

Enfin, a l'aide d'un cas particulier du theoreme de deformation 8 et de [A92], une argumentation suivant le fil de celle de Deligne pour prouver que tout cycle de Hodge 1'est absolument sur les varietes abeliennes, permet de montrer : ThIoREME 9. - Tout cycle de Hodge sur une variete abelienne complexe est motive (pour V convenable).

d) Quelques mots sur la demonstration du theoreme 1 [A]

On se ramene a la situation du §2 define sur un certain corps de nombres de base K0, par le changement de variable x t--> 1/x, et en remplacant f par 1/x n o f : X = A --> S (a fibres non geometriquement connexes), n etant choisi de facon a tuer la ramification au-dessus de x = 0 (pour le nouveau choix de x). Les coefficients de la matrice Y(x) sont alors des series de puissances en x a coefficients dans Ko (en fait des G-fonctions). D'autre part, le nombre de places v de K = K0(Z;') divisant exceptionnel est borne par hypothese (par n52). Le principe de la preuve consiste a construire des relations polynomiales 0 de degre borne a coefficients dans une extension de degre borne de K entre les evaluations v-adiques en Z; des coefficients d'une

58

Y. ANDIZt

matrice Y(x) solution de 1'equation de Picard-Fuchs, a multiplier entre elles ces relations (il y en a au plus nb2), avant de conclure par le "principe de Hasse" de Bombieri cite dans l'introduction (compte-tenu de cc que les points de hauteur et degre bornes sont en nombre fini). L'hypothese que les varietes abeliennes dans la fibre x = 0 sont de type CM entraine que la composante neutre de Go = G,,,,ot(Xo) est un tore. Nos motifs seront modeles sur des varietes abeliennes ; nous utiliserons les consequences suivantes du fait d'avoir affaire a des varietes abeliennes : i) Q = Q, car sur toute variete abelienne, l'involution de Lefschetz est algebrique (Lieberman-Grothendieck) ; ii) X. a bonne reduction potentielle partout; de la resulte 1'existen-

ce d'un revetement etale S' -> S, et pour toute place finie w, d'un ouvert U,,, de Zariski de SK_ au-dessus duquel f 79 a bonne reduction (Ogus). 0 se fait en La construction des relations "exceptionnelles" Q(Y; adaptant l'argument d'espaces homogenes de 7b. Enfin, borner le degre de relations exceptionnelles revient a borner le degre de certains tenseurs "exceptionnels" invariants sous G,,,,ot (Xg) ; pour cc faire, on recourt au lemme suivant :

LEMME. - Sott G un groupe algebrique semi-simple complexe. Il n'y a qu'un nombre fini de classes de conjugaison de sous-groupes fermes connexes H C G contenant le centre de la composante neutre de leur normalisateur dans G.

Manuscrit recu le 22 janvier 1994

TH9ORIE DES MOTIFS ET INTERPRETATION GEOME`IRIQUE

59

BIBLIOGRAPHIE

[A86] Y. ANDRE. - Multiplication complexe daps un pinceau de vartetes abeliennes, Sem. de Theorie des Nombres de Paris, 1984-85, (C. Goldstein, ed.) Progress in Math. 63 Birkhauser Boston (1986), 1-22. [A891 Y. ANDRE. - G -Junctions and Geometry, Aspects of Math. vol. E13, Vieweg, Braunschweig/Wiesbaden (1989). [A901 Y. ANDRE. - p-adic Betti lattices, in "p -adic analysis" Proceedings of the Trento conference, (F. Baldassarri, S. Bosch, B. Dwork, eds.) Springer L.N.M. 1454 (1990). (A92] Y. ANDRE. - Une remarque a propos des cycles de Hodge de type CM, Sem. de Theorie des Nombres de Paris, 1989-90, (S. David, ed.) Progress in Math. 102 Birkhauser Boston (1992), 1-7. [A931 Y. ANDRE. - Pour une Teorie inconditionnelle des motifs, soumis a publication, (premiere version prepubliee a 1'Univ. Paris 6). [A] Y. ANDRE. - Realisation de Betti des motifs p-adiques, en preparation, (premiere partie prepubliee a l'I.H.E.S., Avril 1992). [BO831 P. BERTHELOT, A. OGUS. - F-isocrystals and the De Rham cohomology I, Inv. Math. 72 (1983), 159-199. 1Be931 F. BEUKERS. - Algebraic values ofG functions, J. reine angew. Math.

434 (1993), 45-65. [Bo8 1] E. BOMBIERI. - On G -functions, in Recent progress in analytic number theory, Durham 79, Academic Press (1981) vol. 2, 1-67. [D711 P. DELIGNE. - Theorie de Hodge II, Publ. Math. I.H.E.S. 40 (1971),

5-57. [D801 P. DELIGNE. - La conjecture de Weil II, Publ. Math. I.H.E.S. 52 (1980), 137-252. [D901 P.

DELIGNE. - Categories tannakiennes, in The Grothendieck

Festschrift, Birkhauser Boston (1990), vol II, 111-195. [F891 G. FALTINGS. - Crystalline cohomology and p-adic Galois representations, in Algebraic analysis, Geometry and number theory, J.I. Igusa ed., proc. of the JAMI inaug. conf. John Hopkins Univ. (1989) 25-80.

60

Y.ANDR8

[Ge86] L. GERRITZEN. - Periods and Gauss-Manin connection for families of p-adic Schottky groups, Math. Ann. 275 (1986) 425-453. [G661 A. GROTHENDIECK. - On the de Rham cohomology of algebraic varieties,

Publ. Math. I.H.E.S. 29 (1966), 93-103. [GM78] H. GILLET, W. MESSING. - Riemann-Roch and cycle classes in crystalline cohomology, Duke Math. J. 45 (1978), 193-211. [J921 U. JANNSEN. - Motives, numerical equivalence, and semi-simplicity, Inv. Math. 107 (1992), 447-452. [KM74] N. KATz, W. MESSING. - Some consequences of the Riemann hypothesis for varieties overfinitefields, Invent. Math. I.H.E.S. 23 (1974), 73-77. [090] A. OGUS. - A p-adic analogue of the Chowla-Selberg formula, in "padic analysis" Proceedings of the Trento conference, (F. Badassarri, S. Bosch, B. Dwork, eds.) Springer L.N.M. 1454 (1990).

Yves Andre UA 763 du C.N.R.S.

Universite Paris 6 College de France 3, rue d'Ulm 75231 PARIS 05

Number Theory Paris 1992-93

A refinement of the Faltings-Serre method Nigel BostonN

1. - Introduction In recent years the classification of elliptic curves over Q of various conductors has been attempted. Many results have shown that elliptic curves of a certain conductor do not exist. Later methods have concentrated

on small conductors, striving to find them all and hence to verify the Shimura-Taniyama-Weil conjecture for those conductors. A typical case is the conductor 11. In [1[, Agrawal, Coates, Hunt, and van der Poorten showed that every elliptic curve over Q of conductor 11 is Q-isogenous to y2 + y = x3 - x2. Their methods involved a lot of computation and the use of Baker's method. In [ 121, Serre subsequently applied Faltings' ideas to reprove this result in a much shorter way. He called this approach "the method of quartic fields". In this paper I first seek to refine this method and to make it possible to classify elliptic curves over Q of conductor N for a large number of N. These N are all prime and so this work is indeed superceded by the result of Wiles that every semistable elliptic curve over Q is modular (if fixed). The advantage of my method is that it provides a much simpler approach (when

it works). Like Wiles, I am using deformations of Galois representations

but in a more elementary way. The second half of the paper indicates how the Faltings-Serre method can be used to describe spaces of Galois representations and gives the first applications of the method to mod p representations with p 2. The main result of the first half is Theorem 1 below. Note that there are extensive tables of class numbers and units of cubic fields due to Angell [2) and that information on quartic fields is not required Partially supported by NSF grant DMS 90-14522. 1 thank God for leading me to these results. I thank J.-P.Serre for generously sending me copies of his unpublished work. 1

62

N. BOSTON

THEOREM (1.1). - Let N be a prime - 3 (mod 8) , such that 3 divides neither

h (Q (')) nor h (Q (,/-_N)) . Let M be one of the cubic subfields of the unique cubic cyclic extension K of Q() of conductor 2. Suppose that h(M) is odd and that the minimum polynomial modulo N of a fundamental unit of M has a quadratic residue and a quadratic non-residue root.

Then there is at most one Q-isogeny class of elliptic curves over Q of conductor N with given trace of Frobenius at 2, a2.

Remarks (1) There is a unique such field K because 2 is inert in Q(om) and 3 does not divide h(Q(v/---N)). (2) By Cohen-Lenstra heuristics, 47% of N should satisfy )). Apparently most (but not all, e.g. 571) of these 3 { h(Q(/))h(Q( N have h(M) odd. Some of these satisfy the condition on the fundamental unit (e.g. N = 11, 67,179,...); some don't (e.g. N = 19,43,163,...). (3) The prime N may satisfy the hypotheses of the theorem but there be no elliptic curve over Q of conductor N, (e.g. N = 227, 251, ... see 151).

2. - The Basic Set-up and Elementary Properties

_

Let E be an elliptic curve over Q of conductor N. Let p : Gal(Q/Q) -* GL2(F2) give the action of Galois on the 2-division points of E. The curve E has no rational points of order 2 since its conductor is neither 17 nor of the form u2 + 64 (i.e. it is not a Setzer-Neumann curve) [ 131.

Using work of Brumer and Kramer [5] based on work of Serre [ 11 ], we can deduce various properties of E and p. Firstly, plus or minus the discriminant of a semistable elliptic curve with no rational point of order 2 is never a perfect square. It follows that E has supersingular reduction

at 2, since Q(v) (0 being the discriminant of E) is Q(v) or Q(v) and so has no unramified cyclic cubic extensions by the hypotheses of the theorem. Secondly, since E is supersingular modulo 2, its 2-division field is a cyclic cubic extension of Q(v) unramified outside 2 and totally ramified

at 2, and moreover 2 is inert in

From this it follows that

Q(/) = Q(/), that p is surjective, and that the 2-division field of E (i.e. the fixed field of ker p) is K.

3. - The Faltings-Serre Method [12] Suppose that E' is another elliptic curve over Q with conductor N and the same trace of Frobenius at 2. Assume that E' is not isogenous to E. Let P I P ' : Gal(Q/Q) --> GL2(Z2) give the action of Galois on the Tate modules T2 (E), T2 (E') respectively. By Faltings [7], p and p' are not isomorphic. By section 2, their reductions modulo 2 are isomorphic.

Pick the largest a such that p and p' are isomorphic modulo 2a. Replacing p' by a conjugate if necessary, we can assume that they are equal

A REFINEMENT OF THE FAL77NGS-SERRE METHOD

63

modulo 21. Define o : Gal(Q/Q) -+ M2(IF2)°oGL2(IF2) by

a(x) = ((p'(x) - p(x))/2a(mod 2), P(x)), where M2(IF2)° denotes the 2 x 2 matrices over IF2 of trace zero (mapped to since det p equals det p'). Let k be the fixed field of ker or.

4. - The Proof of the Main Theorem The idea of the Faltings-Serre method is to use it to produce a representation a that can then be shown not to exist by methods of algebraic number theory (in particular tables of number fields). This then shows that there cannot be two non-isogenous curves with the properties stated in the main theorem. PROPOSITION (4.1). - The extension K/K is unramiiled outside N.

Proof : since E has supersingular reduction at 2, the theorem of HondaHill-Cartier [8[ implies that the characteristic polynomial of the formal group

associated to E at 2 is the same as the characteristic polynomial of the system of 2-adic representations at 2. This says that a2 determines the formal group at 2 of E, which determines the 2-adic representation of a decomposition group D2 at 2, i.e. pID2

p'1 D2 If x E D2 (so in particular if

x is in an inertia group at 2), then a(x) = (0, p(x)).

It remains to show that such an extension K/K cannot exist. The key idea is to use two results of Nicole Moser 1101. The first one is :

(1) h(K) = (ah(M)2h(Q(V-__N))/3 (a = 1 or 3) PROPOSITION (4.2). - The class number h(K) is odd.

Proof : this follows from the above formula (1), from our hypothesis that h(M) is odd, and from genus theory, which tells us that h(Q(vl--N)) is odd (since N is prime). Secondly, Moser showed [ 101 that K has a Minkowski unit, i.e. a single

generator of its unit group modulo torsion as a Z[Gal(K/Q)]-module. To apply this, consider by global class field theory the exact sequence of IF2 [Gal (K/Q)]-modules : 0 -+ B -4 U --* (DpjN U p _*

-+ 0,

64

N. BOSTON

where U is the global units of K modulo squares, U. is the local units of K. modulo squares, and P is the Galois group over K of a maximal elementary 2-abelian extension L unramified outside the primes of K above N. Now dimF2 U = 3, dimF2 Up = 1 implying that dimF2 P = dimF2 B. Since K C L, it remains to show that B = 0. The existence of a Minkowski unit implies that U - {±1} G V, where V is an irreducible 2-dimensional F2 [Gal(K/Q)]-module. Sowe just need an element of V which is not in one of the kernels from U - U. The image in V of the unit in the hypotheses of the theorem satisfies this.

5. - Examples (1) N = 11. There is an elliptic curve over Q of conductor 11, namely (11A) y2+y = x3-x2. Let E be another such. Then [51,[131 p is determined,

E has supersingular reduction at 2, and M has odd class number. In fact M is the cubic field of discriminant -44. By [6] a fundamental unit of M has minimum polynomial x3 + x2 + x - 1, which factors modulo 11 as (x + 3)2(x + 6). Since -3 is a quadratic non-residue modulo 11, theorem 1 shows that every elliptic curve over Q of conductor 11 with a2 = -2 is isogenous to (11A). As in Serre's original letter [ 121, this classifies up to isogeny every elliptic

curve over Q of conductor 11, because a similar argument to the above shows that an elliptic curve with good reduction outside 11 and a2 = 2 (respectively a2 = 0) is isogenous to (121A) (respectively (121D))-

(2) N = 67. There is an elliptic curve over Q of conductor 67, namely

(67A) y2 + y = x3 + x2 - 12x - 21. Let E be another such. Then [5], [ 13] p is determined, E has supersingular reduction at 2, and M has odd class number. In fact M is the cubic field of discriminant -268. By [6] a fundamental unit of M has minimum polynomial x3 - 7x2 + 13x - 1, which factors modulo 67 as (x+16)2(x+28). Since -16 is a quadratic non-residue modulo 67, theorem 1.1 shows that every elliptic curve over Q of conductor 67 with a2 = 2 is isogenous to (67A).

The same argument as for N = 11 now applies, because the twist of (67A) by the quadratic character associated to Q( -67) is an elliptic curve of conductor 672 with a2 = -2 and with the same p and the curve of CM type relative to Q( -67) is an elliptic curve of conductor 672 with a2 = 0 and the same T.

6. - Deformation Spaces of Galois Representations The homomorphisms p and p' are lifts of the same p to Z2. They therefore

lie in the deformation space of lifts of p [3]. The Faltings-Serre method constructs from them a third lift a to the dual numbers F2 [E] (E2 = 0). We consider below some applications of this idea.

A REFINEMENT OF THE FALTINGS-SERRE METHOD

65

Let p : Gal(Q/Q) -+ GL2(FP) be an absolutely irreducible representation. Let C denote the category of complete, noetherian local rings

with residue field IFP. Objects of this category are rings of the form Zp[[Ti, ..., T,.]]/I. If R is such a ring, then two representations P1, p2 : Gal(Q/Q) -> GL2 (R) will be called strictly equivalent if conjugate by an element of r2(R) := ker(GL2(R) -+ GL2(]FP)). A strict equivalence class of lifts of p is called a deformation of p. Fix a finite set of rational primes S containing the primes ramified in p. Define a functor 17: C - - --+ Sets by : F(R) = {deformations of p to R unramified outside S} Mazur [91 proved that .F is representable, i.e. that there exists a representation L; : Gal(Q/Q) -+ GL2 (R) (the universal deformation) lifting p and

parametrizing lifts of p to R in C unramified outside S up to strict equivalence via Hom(R, R). The set Hom(R, Z,,) will be called the deformation space of lifts of p.

At the joint AMS-LMS conference in Cambridge, England, in 1992, I suggested that deformation spaces of Galois representations should have some special properties. In particular, it appears that they are often coordinatized by their restrictions to various inertia subgroups It (e E S), namely (i) the restrictions to It V E S - {p}) should indicate which component the lift is on and (ii) the restriction to IP should indicate where the lift is on that component. This idea is now to use the Faltings-Serre method to prove some cases of (ii). The novelty of this approach lies in replacing the prime 2 by more general primes. See [31 for a further discussion of this. Example (1) Let E be the elliptic curve X0(49). This is an elliptic curve over Q of conductor 49. In [4), it is calculated that the universal deformation

ring of the Galois representation given by the 3-division points of E with S = {3, 7} is Z3[[T1, T2i T3, T4]]/((1 + T4)3 - 1). Thus its deformation space

splits into three explicitly given components {T4 = 0}, {T4 = w - 1}, {T4 = w2 - 1}, where w is a primitive 3rd root of unity, and as shown in 141 any lift to Z3 lies on the first component. Also, (i) above holds. In other words, the image of an inertia group at 7 determines on which component a representation over Z3 lies. Now let p and p' be two lifts Of T to Z3 (so lying on the first component). Suppose that they agree on inertia at 3. We shall show that they are actually strictly equivalent (so give the same point in the deformation space). Assume for now they are not strictly equivalent. Since p is absolutely irreducible, two lifts are strictly equivalent if and only if they are isomorphic. For suppose that p' = A-1 pA with A E GL2 (Z7,). Then A centralizes the image of p and so by Schur the image of A in GL2(]FP) is a scalar matrix,

66

N. BOSTON

i.e. A = BC where B is scalar and C E I'2(Zp). But then p' = C-1pC. Since p and p' are not isomorphic, or defines a homomorphism from Gal(Q/Q) into the semidirect product M2 (F3)oGL2 (1F3) (= GL2 (IF3 [e]), e2 =

0) unramified outside S = {3, 7}. Letting K and k denote, as before, the fixed fields of p and a respectively, we get that K/K is unramified outside 7. It is also unramified outside 3 because (i) holds, i.e. p and p' agree on inertia at 3. Such an everywhere unramified field extension of K does not exist, since its Galois group would be a quotient of the ideal class group killed by 3 with Gal(K/Q) acting via the adjoint action. This is excluded as explained in [41 by the work of Coates and Flach since 3 does not divide the numerator of a certain special value of the L-function of the symmetric square of E.

(2) Following [91, let p be a prime number of the form 27 + 4a3 and K be a splitting field over Q for x3 + ax + 1. Embedding Gal(K/Q) = S3 in GL2(Fp), we obtain a representation p : Gal(Q/Q) -+ GL2(1Fp) unramified outside p. Letting S = {p}, Mazur showed that R = Zp[[T1iT2,T3]]. Let p and p' be lifts of p to 7Gp unramified outside S. Suppose that they agree on inertia at p, but are not strictly equivalent. Then they produce, as in (1), an unramified p-extension of the fixed field of ker p, but as Mazur showed in (91, p-h(K), a contradiction thereby proving (ii) in this case. Manuscrit recu le 17 aout 1993

A REFINEMENT OF THE FALTINGS-SERRE METHOD

67

REFERENCES

[1] M. AGRAwAL, J. COATES, D. HUNT and A. VAN DER POORTEN. - Elliptic curves

of conductor 11 Math. Comp. 35 (1980), 991-1002. [2] I.O. ANGELL. - A table of complex cubic fields.

[3] N. BOSTON. - Deformations of Galois representations, (a monograph), in preparation . [4] N. BOSTON and S.V. ULLOM. - Representations related to CM elliptic curves,

Math. Proc. Camb. Phil. Soc. 113 (1993), 71-85. [5] A. BRUMER and K. KRAMER. - The rank of elliptic curves, Duke Math. J. 44,

no 4 (1977), 715-742. [6] B.N. DELONE and D.K. FADDEEV. - The theory of irrationalities of the third degree, AMS, Providence, RI, 1964.

[7] G. FALTINGS. - Endlichkeitssatze fur abelsche Varietaten fiber Zahlkorpern,

73 (1983), 349-366. [8] W. HILL. - Formal groups and zeta-functions of elliptic curves, Invent. Math. 12 (1971), 321-336.

[9] B. MAZUR. - Deforming Galois representations, Proceedings of the March 1987 Workshop on "Galois groups over Q" held at MSRI, Berkeley, California.

[10] N. MOSER. - Unites et nombre de classes d'une extension galoisienne diedrale de Q, Abh. Math. Sem. Univ. Hamburg 48 (1979), 54-75.

[I I I J.-P. SERRE. - Proprietes galoisiennes des points d'ordre fini des courbes elliptiques, Invent. Math. 15 (1972), 259-331. [ 121 J.-P. SERRE. - Letter to Tate, Oct. 26, 1984.

68

N. BOSTON

[ 131 B. SETZER. - Elliptic curves of prime conductor, J. London Math. Soc. 10 (1975), 367-378.

Nigel BOSTON

Department of Mathematics University of Illinois 273 Altgeld Hall, MC 382 1409 West Green street Urbana, IL 61801 U.S.A.

Number Theory Paris 1992-93

Sous-vari@tes algebriques de varibtes semi-abeliennes sur un corps fini John Boxall

RESUME : Dans cet article nous etendons les resultats deja obtenus dans [Boll concernant l'intersection dans une variete abelienne d'une sous-variete avec certains groupes de points de torsion aux varietes semi-abeliennes, le corps de base etant un corps fine.

SUMMARY : In this paper we extend results already proved in [Boll concerning intersections on abelian varieties of subvarities with certain groups of torsion points to semi-abelian varieties, the base field a finite field.

1. - Introduction

_

Soit k un corps fin!, soit k une cloture algebrique de k, soit 1 un nombre premier different de la caracteristique de k et soit G le groupe des racines de 1'unite dans k dont 1'ordre est une puissance de 1. Nous nous proposons

de montrer que 1'ensemble des ( E G tels que 1 - (E G est fini. Plus generalement, soient (a, b) E k2 avec ab 54 0 : nous allons montrer que I'ensemble : J((, 1q) E G2 I a( + br1=1 }

_

est fini.

Dans ce but, designons par r le groupe de Galois de k sur k et remarquons que si ((, 71) E G2 verifie (1)

a(+brj=1,

alors on a egalement a a( + b au = 1 pour tout or E I' et done (2)

a'(+ b'uj = 1 ,

ofi l'on a pose :

a =a ( et b'=b"'

.

70

J. BOXALL

Or, les equations (1) et (2) ont au plus une solution ((, rl) si (a(, ark) # ((, rl) ; it suffit donc pour conclure de montrer qu'il existe un sous-groupe fini Go de G ayant la propriete que pour tout ((, 77) E G2 avec ((,,q) Go, ii existe or E F tel que ((a - 1)(, (a - 1)11) soit un element de Go different de (1, 1). En effet, toute solution 7,7) n'appartenant pas a Go satisait a (1) et (2) avec (a'a-1, b'b-1) E Go \ (1, 1). 11 y a done au plus IGo12 + IGoI 1 solutions.

-

Or, it est aise de construire un tel sous-groupe Go. Posons l' = 4 ou 1' = l selon que 1 = 2 ou l est impair. Soit k' 1'extension de k dans k engendree par les racines l'-iemes de l'unite, soit le l'ordre de G fl k'* et soit e : F --4 Zt le caractere cyclotomique. On sait alors l'image de Gal(k/k') par e est (1 + 1eZ1)". Soit done (E G avec (V k' et soit ln, (n > e), l'ordre de (. Si l'on choisit a E Gal(k/k') de telle maniere que e(a) - 1 (mod ln) mais e(a) 0- 1 (mod In-1), alors (a - 1)( appartient a k' et est different de 1. On en tire que Go = G fl k'* convient. Voici une autre interpretation de ce resultat. Designons comme d'habitude par G.,,,, le groupe multiplicatif : le groupe G n'est autre chose que le groupe des points de torsion de Gvm, dont l'ordre est une puissance de 1. Soit X la courbe dans G2 definie par 1'equation ax+by = 1. Selon notre resultat X (k) fl G est un ensemble fini qui peut etre effectivement determine. Pour tout Q = (c, 71) E G2 (k), on designe par TQX le translate de X par le point

-Q =

Si donc P E X(k) fl Get si a E I', alors aP E X(k) et donc P = aP - (aP - P) E Top_ pX (k), d'ou P E X fl Tp_pX (k). (c-1

77-1)

Autrement dit, si P = ((,'rl), alors ((, rl) est une solution des equations (1) et (2) que, on le sait, n'ont qu'un nombre fini de solutions (eventuellement une au plus) lorsque aP P. Pour conclure it ne reste qu'a montrer que pour tout P E X (k) fl G en dehors d'un sous-groupe fini effectif Go on peut choisir a de telle maniere que aP P et aP - P E Go : d'apres 1'alinea precedent, le choix Go = G fl k'* convient. De ce point de vue, ce resultat est capable d'importantes generalisations(*). Soit E une variete semi-abelienne, c'est-a-dire une extension d'une variete abelienne par un tore (pour plus de details, le lecteur se reportera au debut du §2). On suppose que E est definie sur k et on designe par G un sous-groupe de E(k) (puisque k est un corps fini, tout element de E(k) est d'ordre fini). Soit X une sous-variete fermee de E que l'on suppose definie sur k : comme precedemment, on designe par TQX le translate de X par -Q E E(-k). Pour simplifier on supposera (_k) que X est k-irreductible. Nous nous interessons alors a determiner X f1 G. Bien sur, ceci ne sera possible que si G est d'une nature tres particuliere. Comme dans le resultat qui (*l Apres avoir termine ce texte, je me suss apercu que l'argument qui vient d'etre presente se trouve (pour une courbe plongee dans une variete abelienne) dans l'article de Raynaud ([R31, p3-4).

SODS-VARIETE AWEBRIQUES DE VARIETES SEMI-ABELIENNES SUR UN CORPS FINI

71

vient d'etre demontre, on pourrait prendre comme G le groupe des points de torsion dont l'ordre est une puissance de 1. Plus generalement, soit S un ensemble fini de nombres premiers et soit 1(S) le monoide multiplicatif engendre par S. Pour tout n c Q (S), on designe par E[n] le groupe de points de n-torsion de E(k) et l'on pose ES = UfEO(s)E[n]. Nous demontrerons alors au §2 le theoreme suivant : THEOREMS A. - Avec les notations et les hypotheses qui viennent d'etre introduites, it existe un ensemble fini de couples (Pi, Bi)iEI, Oft Pi E Es, Bi est une sous-variete semi-abelienne de E et Tp, Bi C X, tel que :

X (k) n Es = U Tp Bi. iEI

On en tire immediatement que, lorsque G = Es, on a X (l) n G C UiEI Tp Bi(k). Ce dernier resultat a deja ete demontre dans [Boll lorsque E est une variete abelienne. Le resultat analogue lorsque k est un corps de nombres a ete demontre par Bogomolov [Bgl], IBg2l. Ensuite les travaux de Raynaud [R1], [R2J, [R31, Hindry [HI et Faltings IF] ont etabli le meme resultat lorsque G est le groupe de tous les points de torsion sur k, ou le groupe E(k), ou meme lorsque G est 1'enveloppe divisible d'un sous-groupe de type fini de E(k). Le cas ou E est un tore et G un groupe de type fini (en caracteristique zero toujours) a ete traite par Laurent [Lal. Lorsque G est 1'enveloppe divisible d'un groupe de type fini et X est une courbe, Liardet [Lil a demontre que X (-k) n G est soit un ensemble fini soit un ensemble forme de racines de l'unite et que cette derniere possibilite ne se produit que dans certains cas bien precis. Le travail recent de Ruppert [Rul etudie les solutions en racines de 1'unite de systemes d'equations algebriques. Lorsque k est un corps fini tout element de E(k) est de torsion et it est done impossible d'etendte le theoreme A au cas ou G = E(k). La situation, lorsque k est un corps de fonctions de caracteristique positive, a ete etudiee par Voloch et Abramovich [Vol, [Ab-Vol.

Remarque 1

:

si en plus on suppose que 1'ensemble des Tp Bi du

theoreme soit choisi de facon minimale, alors les Tp Bi seront necessairement les composantes irreductibles de X (k) n Es. En particulier, ils sont uniquement determines par X et S.

Remarque 2 : notre demonstration montre que les sous-varietes semiabeliennes Bi du theoreme A peuvent etre realisees comme des stabilisateurs par translation de composantes irreductibles de fermes de la forme TQ1X n TQ2X n ... n TQ,,X. On peut se demander si les sous-varietes abeliennes apparaissant dans les travaux de Raynaud, Hindry et Faltings puissent etre construites de maniere analogue.

72

J. BOXALL

Le §3 est consacre a une etude de 1'effectivite dans la demonstration du theoreme A.

Je remercie vivement L. Moret-Bailly pour la lecture approfondie d'une version preliminaire de ce travail.

2. - Varietes semi-abeliennes Rappelons que par definition une variete semi-abelienne E est une extension d'une variete abelienne par un tore; elle est alors definie par une suite exacte

1-+T-4 E-->A->0, ou T est un tore et A une variete abelienne definis sur k. A 1'aide de la theorie de la structure des groupes algebriques, on voit aisement que tout sous-groupe algebrique connexe d'une variete semiabelienne est une variete semi-abelienne. Il s'ensuit que si V est une sousvariete (fermee) de E, alors le stabilisateur By de V par l'operation des elements de E(k) par translation est le produit d'une sous-variete semiabelienne et d'un groupe fini. On designe alors par By la composante neutre de By et le groupe fini par Hv. Pour toute sous-variete V de E, on designe par TQV le translate de V

par -Q E E(k). On peut alors ecrire : By = nQEV(k)TQV. On en tire que si V est irreductible, alors dim(Bv) < dim V et si dim(Bv)= dim V alors By = By et V est le translate de By par un element de E(-k). Soit a nouveau S un ensemble fini de nombre premiers, ft(S) le monoide

multiplicatif engendre par S. Pour tout n E f2(S), on designe par E[n] le groupe des points d'ordre n de E(k) et l'on pose Es = UfEQ(S)E[n]. Soit r s le groupe de Galois de k(Es) sur k. Pour tout 1 E S on designe par r(l) le rang du module de Tate T1(E) (on a alors r(l) = 2dimA+dimT n

si l est different de la caracteristique de k et 0 < r(l) < dim A si 1 est egal a la caracteristique de k). L'operation de r s sur ES induit une representation

de I'S dans le groupe des automorphismes continus Aut(Es) de Es. On fixe un choix des bases des TI(E), ce qui induit un isomorphisme continu entre Aut(Es) et DIES GLr(l) M); on obtient ainsi une representation p de rs dans ce dernier groupe. Posons L = 11IES l', oft l'on a ecrit 1' = 1 si l est impair et 1' = 4 si 1 = 2. Pour tout n E fl(S), on designe par k,, le corps k(E[n]) et par r,, le groupe de Galois de k(Es) sur kn. Soit N le plus grand element de Il(S) tel que kN = kL. On a alors, pour tout n divisible par L :

P(rn) S fJ

(I+l°rdi(fl)Mr(I)(ZI))

lES

(oft I designe la matrice identite et Mr(I) (Z1) designe 1'algebre des matrices carrees d'ordre r(l) a coefficients dans Z1). Soit 0 un generateur topologique

SOUS-VARI9'ItALGEBRIQUES DE VARIETE`S SEMI-ABE`LIENNES SUR UN CORPS FINI 73

de FN : k etant un corps fini tout element de rN s'ecrit de maniere unique dans la forme eb avec b E 7L = 1 im Z/nZ. Pour tout 1 E S designons par OI n

la l-composante de p(O). D'apres la definition de N, on a OI = I mod lord, (N) mais Ol o I (mod lord,(N)+1) pour tout l E S. Pour tout 1 E S, definissons la matrice 4)1 par OI = I+lord, (N) (DI. Comme 0 ne laisse stable qu'un nombre fini d'elements de ES, 4DI est inversible (i.e. un element de GLr(I) (QI)). Soit el (P) 1'exposant de l dans l'ordre de P E ES.

Il s'ensuit alors que pour tout l E S on a :

e` (P) - ordi (det 'I) - ordi (N) < el (9P - P) < el (P) - ordi (N). LEMME 1. - Soit b un entier strictement positif. Alors pour tout P E Es

on a: el(P) - ordl(det 4)1) - ordi(N) - ordi(b) < el (95P - P) < el (P) - ordi (N) - ordi (b) . Par consequent, si l'on pose p = PIES lord, (det'Dt) alors E[n] C Es(kn) C_ E[µn] pour tout n E Q(S) divisble par N, Es (kn) designant les elements de ES rationnels sur kn.

Demonstration : le cas b = 1 est deja acquis. Si b > 1, on utilise 1'equation b

OiP-P=(Oi-I)P=b(OIP-P)+rCb(OI-I)rP r=2 b

= b1ordi(N)4,1P+ 1:Cblrord,(N) P. r=2

Puisque lIN pour tout I E S et 41N si 2 E S on verifie que la puissance exacte de 1 divisant Cb lr ord, (N) pour 2 < r < b est strictement superieur a ordi (b) + ordi (N) . On en tire que el ((Oi p - P)) = ordi (b) + ordi (N) + el ((BP - P)) d'ou l'encadrement de el (ObP - P). Pour montrer la derniere assertion on pose b = N . Soit alors P E Es (k)

un point d'ordre ,una avec a E cl(S) et a > 1 et soft 1 un element de S divisant a. Alors el (P) - ordi (det I) - ordi (N) - ordi (b) = ordi (a) > 1 et donc eb ne laisse pas stable P, c'est-a-dire P 0 ES(kn). On conclut que Es(kn) C E[pn] et l'inclusion E[n] C Es(kn) est triviale.

Remarque i : soit k;n l'unique extension de kN de degre m. Pour tout n E fl(S) divisible par N, 0* est un generateur topologique de Gal(k/kn).

74

J. BOY-ALL

On tire de la demonstration du lemme 1 que OP I (mod lord,(n)) mais OR I (mod lordi(n)+1) 11 s'ensuit que 0* est egalement un generateur topologique de kn et donc que kn = k'' pour tout n E Q(S) divisible par N. Ce lemme nous permet d'aborder la demonstration du theoreme A. Soit

n E I(S) divisible par N et soit V une sous-variete de E define sur

_n

k, que l'on supposera kn-irreductible.Supposons d'abord que By = (0).

Si P E V(k) Es, alors o P E V (T) n Es pour tout or E I, et donc P E V n Tp_PV(k). Si P E[pn], le lemme 1 montre que P West pas definie sur kn et donc nest pas stable par 0* : on peut donc choisir une puissance a de 0* de telle facon que 0 j4 aP - P E E[,an]. On trouve ainsi que

V(k)nEs c E[un]uU(vnTQV(k)nEs), Q

ou Q parcourt E[,an] \ (0). Puisque By = (0) et V est kn-irreductible, on conclut que toutes les composantes irreductibles des V n TQV sont de dimension strictement inferieure a celle de V. En prenant l'adherence de Zariski, on peut ecrire : (4)

v (T) n Es = { partie finie de E[pn] } U w (T) n Es, w

ou {W } parcourt un ensemble de varietes definies sur k,,,n que l'on peut supposer k,,,,-irreductibles.

On peut etendre cet argument au cas ou By

(0) en passant a la

variete semi-abelienne quotient E' = E/Bv. Soient /Gy et NV les entiers u et N associes a E' et soit Hv un sous-groupe fini de E tel que By = By x Hv. On a alors Bv,s = Bv,s ® Hv,s Si e(Hv,s) designe 1'exposant de Hv,s on obtient, pour tout n E I(S) divisible par Nve(Hv,s) : (5)

V(k) n Es = U P

uU

n Es,

w

ou P parcourt une partie de ELavn] et W un ensemble fini de varietes definies et irreductibles sur kµvn. On peut alors appliquer le meme raisonnement a chacune des varietes W (en replacant k par ki,). Le theoreme A peut alors etre demontre par recurrence en la dimension de X.

En outre, comme les W sont des kn-composantes des V n TQV, on conclut que x (T) n Es est une reunion finie de translates de sousvarietes semi-abeliennes dont chacune est le stabilisateur d'une composante irreductible d'une intersection de translates de X (remarque 2 de l'introduction).

SOUS-VARIETE° ALGEBRIQUES DE VARIEIES SEMI-ABELIENNES SUR UN CORPS FINI 75

Remarque 2: lorsque E est une variete abelienne simple, on conclut que v (T) fl Es est une ensemble fini. Meme lorsque E est une variete abelienne quelconque on peut parfois demontrer que V (T) fl G est necessairement un ensemble fini pour certain sous-groupes G de E(T). Ceci est le cas par

exemple lorsque G est le groupe des points tues par une puissance d'un ideal premier p de degre un d'une sous-algebre commutative de rang 2 dim A de End A ® Q (voir [Bo2]). Rappelons que d'apres un theoreme bien connu de Tate [T], End A ® Q contient toujours une telle sous-algebre lorsque k est un corps fini.

Remarque 3 : 1'hypothese que k soit un corps fini West pas essentielle dans notre demonstration du theoreme 1. Si par exemple k est un corps de nombre (ou plus generalement un corps de type fini sur Q ou sur un corps fini) on peut verifier une version modifiee du lemme 1 qui permet de conclure

dans la meme maniere. On obtient ainsi une nouvelle demonstration des resultats originaux de Bogomolov ([Bgl], [Bg2]). Par contre, la methode de Bogomolov ne s'applique apparemment pas sur un corps fini : en effet it utilise le fait que sur un corps de nombres p(F) contient un sous-groupe d'indice fini du groupe des homotheties et cette propriete est fausse en general sur un corps fini.

3. - Un peu d'effectivite D'apres un resultat fondemental de Chow [Ch], toute variete semiabelienne E est quasi-projective. Fixons une fois pour toutes un plongement de E dans un espace projectif P". Le degre d'une sous-variete equidimensionnelle V de E relatif a ce plongement peut alors etre defini comme le cardinal de l'intersection de l'adherence de 1'image de V avec un sous-espace projectif generique de codimension egale a la dimension de V. Si V est un ferme de E, on definit son degre comme etant la Somme des degres de ses composantes irreductibles. Le degre jouit alors des proprietes suivantes : (a) Le degre d'une reunion finie de varietes est inferieure ou egale a la Somme de leurs degres ; en particulier, si W est une reunion de composantes irreductibles de V, alors deg W < deg V ; (b) Le degre d'une intersection finie de varietes est inferieure ou egale au produit de leurs degres; _ (c) Le degre est invariant par translation par un element de E(k) (car le morphisme defini par translation par un element de E(k) est plat).

(Pour plus de details sur la notion de degre, le lecteur pourra consulter [Fu], notamment le §8). Le but de ce paragraphe est de demontrer le theoreme suivant : THEOREMS 2. - Soit k un corps fini, soft g > 1 un entier soit E une variete semi-abelienne de dimension g et soit S un ensemble fini de premiers. Alors

76

J. BOXALL

it existe une constante effective K, dependant uniquement de k, S et de E, telle que pour toute sous-variete X de E definie sur k, on ait

deg(X(k) fl Es) <

K(degX)2d,mX(dimx+1)(29ISI+1)

Demonstration : pendant la demonstration, une constante signifira toujours un nombre ne dependant que de k, S et de E. Fixons d'abord k et S, un entier g > 1 et designons par S, 1'ensemble des varietes semiabeliennes de dimension au plus g et definies sur k. Soit E' E £y, soit T est le tore maximal de E' et soit A = E'/T, alors T et A sont definis sur k : si donc k' est une extension finie de k on a (d'apres les "conjectures" de Weil) : IT(k')I <- (Ik'I + 1)dimT IA(k')I <- ( Ik'I + i)2dimA d'ou 1)dimT)(IkI + 1)2 di-A Il s'ensuit que IE'(k')I IT(k')I IA(k')I (Ik'I + :

I Es (k') I est majore uniquement en fonction de k et de g et donc que µ et N peuvent etre majores effectivement en fonction de (k, g, S). Reprenons les notations utilisees lors de la demonstration du theoreme A. D'apres (5) et les proprietes du degre on obtient : (6)

deg (V(k) fl Es) < IE[ivnJI degBv + Edeg (W(k) fl Es), w

ou n est divisible par Nve(HV,s) et les W parcourent une partie des kµ1ncomposantes irreductibles des intersections v f1TQV avec Q E E[n] tel que dim (V fl TQ) < dim V - 1. Avant de continuer, nous aurons besoin d'un lemme. Rappelons que si V est une sous-variete de E, By designe le stabilisateur de V et By la composante connexe de By contenant l'identite. Dans ce lemme, qui generalise le lemme 4 de [Bo 1], k designe momentairement un corps quelconque. LEMME 3. - On a : deg By < (deg V )dim V+1

Demonstration : ecrivons B (resp. B) a la place de By (resp. By). On sait que B = fQEV(k)TQV. Soit {Va} 1'ensemble des composantes connexes de V de dimension maximale. Si dim B = dim V, alors U. V.

est une reunion finie de translates de B : comme le degre est stable par translation, on en tire que deg b < deg V estle resultat est vrai. Si dimB < dim V - 1, comme k est infini et donc V(k) ne peut pas etre une reunion finie de points de sous-varietes propres, it existe Q1, Q2 E V(k) tels que b C TQ1V flTQ2V et dim (TQ,V fl TQ2V) < dim V. Soit alors {CPI 1'ensemble des composantes irreductibles de TQ1 V fl TQ2V de dimension

dim V - 1. Si dim B = dim V - 1 on conclut que les Cp sont des reunions finies de translates de B et que Up Cp est une reunion de translates de

SOUS-VARI9TkALGEBRIQUES DE VARIETES SEMI-ABELIENNES SUR UN CORPS FINI

77

B : on en tire que deg B < deg (TQ,V fl TQ2V) _< (deg V)2. Si au contraire dim B < dim V - 2, alors on montre a nouveau qu'il existe Q3 E V (T) tel que B C TQ1V flTQ2V flTQ3V et dim (TQ,V fl TQ2V fl TQ3V) < dim V - 2.

En continuant ainsi, on montre que B peut etre ecrit comme une reunion de composantes de 1'intersection d'au plus dim V + 1 translates de V ce qui entraine le lemme. Reprenons la demonstration du theoreme et etudions d'abord le terme E[pvn]I deg By dans I'inegalite (6). Puisque By = By x Hv_et le degre est invariant par translation, on a deg By = (deg Bv) I Hv (k) I. D'apres le lemme 3 on a : deg By < (deg V)dim V+1. On conclut que deg By < (deg V)dim v+1 et egalement que e(HV,s) <- I Hv,s I 5 (deg V)dim v+1. D'apres ce qui precede, µv et NV sont bornes en fonction de (k, g, S). Comme IE'[a] I < a29 pour tout a > 1 et pour tout E' E E9 on en conclut qu'il y a une constante Ko telle que pour tout n E Q(S) divisible par Nve(Hv,s) on ait :

IE[pvn]I degBv < Kon29(degV)2(dimv+l)

(7)

De la meme maniere que (6), on peut ecrire

deg (W(k) fl Es) < E I E[pwn] I degBw + Ideg (Y(k) fl Es), w

W

Y

oil les Y sont de dimension < dim V - 2 et n E Q(S) est divisible par le ppcm des Nwe(Hws) et Nve(HV,s) et est independant de W et de V. En appliquant (7) (avec V remplace par W) on obtient :

E IE[µwn]I degBw < Kon2s E(deg W)2(dimw+l) w

w (deg W)2(dim V+1)

< Kon29

w deg

Kon29

W) 2(dim V+1)

w

Comme

>degW w

deg(V f1TQV) < IE[pvn]I(degV)2 < (IGVn)2g(degV)2, Q

on conclut que : degV(k) fl Es < Kon29(deg V)2(dim V+l) + (Kon2g)2dim V+3(deg V)4(dim V+1)

+

deg Y(k) fl Es,

y

78

J. BOXALL

Ko etant une constante. Apres r etapes on trouve alors Kr(n29)1+2''-'(aim V+l) (deg

deg (V(k) n Es) <

V)2r(dim V+1)+

+ j deg (Z(k) n Es), z ofi Kr est une constante et dim Z < dim V - r pour tout Z. En particulier, lorsque r = dim V + 1 la somme sur les Z devient vide, et l'on en tire : K'(n2g)1+2d'm`'(dim V+1) (deg

deg (V(k) n Es) <

V)2d;m V+'(dim V+1)

ou K' est une constante et n est divisible par le ppcm des NzE(Hz,s) pour toutes les varietes Z qui interviennent a chaque etape. Or, on a e(Hz,s) < (deg Z)diin z+l < (deg Z)dim V+1. Enfin Z est une composante

d'une intersection d'au plus deg Z < (deg

V)2d,m V-d,m Z

< (deg

2dimV-dimz V)2d,m

e(Hz,s) < (deg V)

translates de V : on a donc

V et enfin

:

dim V (dim V+1)

Le ppcm des NzE(Hz,s) sera alors majore par une constante foes V)2d;m V(dim V+1)ISI

On peut done choisir comme n tout element de a(S) plus grand qu'une constante fois (deg V)2d,m V (dim V+1)IsI. On obtient (deg

alors le theoreme 2 en commencant avec V = X. Il serait interressant d'ameliorer cette borne. Lorsque X est une courbe lisse et complete et E sa jacobienne on peut borner X(k) n Es en fonction du genre de X (voir [Boll). Remarque (et corrections a [Bo I], [Bo2l). Pour les varietes abeliennes, nous avons deja enonce dans [Boll (theoreme 1) une borne explicite pour deg X (k) n Es. Or la demonstration est basee sur le lemme 1 qui est faux en general et doit etre remplace par le lemme 1 du present article. Autres corrections a [Boll : (i) lignes 6 et 7 de la demonstration du theoreme 1 (p. 1065) ; it aurait fallu lire :... onmontre a 1'aide des lemmes 3

et 4 que, si P E C n As, it existe or c Gal(k/k) avec ... (ii) ligne 3 de la demonstration du lemme 5 (p. 1066) ; it aurait fallu lire : Pour tout E C(k)

on peut trouver y E C(k) tel que les diviseurs .... Corrections a [Bo2l : page 1 (en bas) : f ne s'etend pas en un morphisme X P1 mais seulement en une application rationnelle dont l'ouvert de definition contient tous les points de codimension un. Page 6 (en bas) : la courbe affine y2 = x5 - 11 possede 50 points sur IF41 (et non 64). Manuscrit recu le 3 novembre 1993 Version revisee revue le 7 janvier 1994

SOUS-VARIETE ALGEBRIQUES DE VARIETES SEMI-ABELIENNES SUR UN CORPS FINI

79

Bibliographie [Ab-Vo] D. ABRAMOVICH, J.-F. VOLOCH. - Towards a proof of the Mordell-Lang

conjecture in characteristic p. Intern. Math. Research Notes 5 (1992) 103-115. [Bgl] F. A. BOGOMOLOV. - Sur l'algebricite des representions l-adiques. C. R. Acad. Sci. Paris. 290 serie A (1980) 701-703. [Bg21 F. A. BoGoMOLOV. - Points of finite order on an abelian variety. Math. USSR. Izv. 17 (1981) 55-72. [Boll J. BOXALL. - Autour d'un probleme de Coleman. C. R. Acad. Sci. Paris.

315 serie A (1992) 1063-1066. [Bo2] J. BOXALL. - Valeurs speciales de fonctions abeliennes. Groupe de travail sur les problemes diophantiens, University de Paris VI, annee 1990/1. [Ch] W. L. CHOW. - On the projective embedding of homogeneous varieties. in : Symposium in honor of S Lefschetz, edite par R.H. Fox, D.C. Spencer et A.W. Tucker, Princeton University Press (1957). [F) G. FALTINGS. - Diophantine approximation on abelian varieties. Annals of Math. 133 (1991) 549-576. [Fu] W. FuLTON. - Intersection theory. Ergebnisse der Math. and ihrer Gru.n.zgebiete, 3. Folge, Band 2, Springer-Verlag (1984). [H] M. HINDRY. - Autour d'une conjecture de S Lang. Invent. Math. 94 (1988) 575-603. [La] M. LAuRENT. - Equations diophantiennes exponentielles. Invent. Math. 78 (1984) 299-327. [Li] P. LIARDET. - Sur une conjecture de Serge Lang. Asterisque 24-25 (1975) 187-210. [R11 M. RAYNAUD. - Courbes sur une variete abelienne et points de torsion.

Invent. Math. 71 (1983) 207-233. [R2) M. RAYNAUD. - Sous-varietes d'une variete abelienne et points de

torsion, dans Arithmetic and Geometry I, dedie a I Shafarevich, Birkhauser, (1983) 327-352. [R3) M. RAYNAUD. - Around the Mordell conjecture for Function Fields and

a Conjecture of Serge Lang, dans Algebraic Geometry, Lecture notes in Maths. 1016, Springer-Verlag (1982). [Ru] W. M. RuPPERT. - Solving algebraic equations in roots of unity. J. Crelle. 435 (1993) 119-156.

80

J. BOXALL

[T] J. TATE. - Endomorphisms of abelian varieties over finite fields. Invent. Math. 2 (1966) 134-144. [Vol J.-F. VOLOCH. - On the conjectures of Mordell and Lang in positive characteristic. Invent. Math. 104 (1990) 643-646.

John BOXALL

Departement de mathematiques et de mecanique Universite de Caen Esplanade de la Paix 14032 CAEN cedex FRANCE

e-mail : [email protected]

Number Theory Paris 1992-93

Propri@t@s transcendantes des fonctions automorphes Paula Beazley Cohen

Le sujet de cet article est un travail en commun avec J. Wolfart et H. Shiga dont les details sont donnes dans un manuscrit [CSW] intitule "Criteria for complex multiplication and transcendence properties of automorphic functions", preprint du Johann Wolfgang Goethe-Universitat, Frankfurt-am-Main. Le but du present article est de servir d'introduction a ce manuscrit. Le point de depart est le resultat suivant, demontre par Th. Schneider en 1937 [Sch] : THEOREME (Schneider). - Soit j = j (T) la fonction modulaire elliptique. Alors T et j (T) sont tous les dewy des nombres algebriques si et seulement si T est quadratique imaginaire.

Rappelons que la fonction modulaire elliptique j : 7-i -> C

est holomorphe sur le demi-plan superieure 7-1, meromorphe avec un developpement de Fourier d'ordre -1 a l'infini et automorphe par rapport au groupe modulaire r = PSL(2, Z) = SL(2, Z)/{±12}, c'est-a-dire : 7m1T 112 _ n1T+n2

m1, m2, n1, n2 E Z, min2 - m2n1 = 1.

La fonction j est normalisee de sorte que, dans son developpement en puissances de q = exp(2iirr), le coefficient de q-1 soit egal a 1 et le terme constant soit egal a 744. De plus, on a une bijection,

F\n

{ [£T]; T E 7-l},

£T = cC/(ZT + Z),

ou [£T] designe la classe d'isomorphisme de £T sur C. L'invariant modulaire de E E [£T] est donne par :

j(£) =j(T).

82

P. B. COHEN

Les valeurs de la fonction automorphe j parametrisent les classes d'isomorphisme de courbes elliptiques : j (E) = j (V) si et seulement si £ et £' sont

isomorphes sur C. On a meme : j = j (r) est algebrique si et seutement s'il existe une courbe ettiptique S E [Sr] telle que £ soit define sur 0. On ecrira £/Q lorsque £ est definie sur 0, c'est-a-dire lorsque les invariants g2 = 92(L) et g3 = 93(L) du reseau L ou £ = C/1 sont des nombres algebriques. Il y a deux possibilites pour l'algebre des endomorphismes End,(£T) = End(ET) ®z Q. On a soit End,(£T) = Q soit End,(£T) = Q(T) avec r quadratique imaginaire. Dans le deuxieme cas £T est done a multiplication complexe (CM) par Q(T). On peut reformuler le theoreme de Schneider de la facon suivante,

soit £/Q, alors £ E [£T] avec T E Q si et seulement si £ est a CM .

Le travail de Wolfart, Shiga et moi-meme donne une generalisation du resultat de Schneider aux dimensions superieures : c'est-a-dire aux domaines D, classifies par Shimura et Siegel, qui parametrisent certaines families {Az; z E D} de varietes abeliennes polarisees d'un type d'endomorphisme donne. Qualitativement, avec les notations analogues a celles du cas elliptique, notre resultat dit que : soit A/Q alors A E [AZ] avec z E D(Q) si et seulement si A est de type CM.

Rappelons l'origine des domaines D [Shi]. Soit (A, C) une variete abelienne simple avec polarisation C. Alors, End, (A) est une algebre de division L sur Q munie d'une involution positive p induite par C. De telles (L, p) ont ete classifiees par Albert. Soit K le centre de Let F = {x E K I xP = x} avec g = [F : Q], alors it y a quatre possibilites pour L : TYPE (I) TYPE (II)

L=F L est une algebre de quaternions totalement indefinie sur F : L ®Q 1R

M2(18)9

TYPE (III) L est une algebre de quaternions totalement definie sur F : L ®Q R - H9 (ou H est l'algebre des quaternions hamiltoniens) TYPE UV) L est une algebre centrale simple sur K ou K est un corps CM donne par une extension quadratique totalement imaginaire de F.

Soient, n un entier positif et 4) L - M,z(C) une representation complexe de L de dimension n (on aura toujours que [L Q] divise 2n) :

:

et soit S = S(L, 4), p) 1'ensemble des varietes abeliennes polarisees (A, C), dim A = n, (ici A n'est pas supposee simple) teller que

L C End,(A) C soit compatible avec p it existe un reseau A C C' et un isomorphisme A

Cn/A induisant 4).

PROPRII`TES T RANSCENDANTES DES FONCTIONS AUTOMORPHES

83

Une representation 4) avec S non vide est appelee une representation admissible. Dans le cas des types I, II, et III it n'y a qu'une seule F admissible

a isomorphisme pres. La construction de Shimura donne pour chaque L et chaque classe d'isomorphisme d'une 4) admissible un domaine D qui

parametrise des families E = {(A,zfC.);z E D} d'elements de S. Pour chaque E it y a un groupe modulaire r = r y agissant sur D tel que l'on ait une bijection : I \D ^ { [(A, C)], (A, C) E E}.

On dit que z E D est un point CM si A,z E E est de type CM. Rappelons qu'une variete abelienne est de type CM lorsqu'a isogenie pres elle se casse en produits de puissances de varietes abeliennes simples B ou les Endo(B) sont des corps CM de degre 2dim(B) sur Q. Le domaine D est suppose convenablement normalise, c'est-a-dire it est un des domaines donnes dans [Shi, p. 1621. Les points CM de D sont alors algebriques : en les ecrivant comme produits de matrices leurs coefficients sont tous des nombres algebriques. On designe par D(Q) les points algebriques de D. Par les travaux de Borovoi, Deligne, Milne [Mil et d'autres sur 1'existence de modeles canoniques, on sait que la variete de Shimura F\D a une structure de variete quasi-projective V definie sur Q. L'application canonique

J:D-->V est donc une generalisation de la fonction modulaire elliptique j. Wolfart, Shiga et moi-meme avons demontre la generalisation suivante du resultat de Schneider : THEOREME. - On a z E D(Q) et J(z) E V(Q) si et seulement si z est un point CM.

Pour simplifier les notations on va designer par une seule lettre A une variete abelienne polarisee. Le Theoreme est une consequence du resultat suivant de Wolfart, Shiga et moi-meme. RESILTAT PRINCIPAL. - Soit A une variete abelienne polarisee definie sur

. Alors it y a equivalence entre : (i) A est isomorphe d A,,, ou. A. E E et z E D(Q), (ii) A est de type CM.

L'existence de V permet de definir le corps K des fonctions Tautomorphes definies sur Q. Comme consequence evidente du Theoreme on a,

84

P. B. COHEN

COROLLAIRE. - Si z E D(Q) est un point non CM alors it existe un f c K telle que f (z) soit transcendant.

Avant de donner une We de la demonstration de ce resultat passons a des exemples :

L'espace et les fonctions modulaires de Siegel : dans cet exemple on a L = Q avec, pour chaque n > 1, la representation de Q a valeurs dans

Mn(C) donnee par 4 : a H aln, a E Q. Le domaine D est donne par 1'espace de Siegel

= ' H , , z={zEMn,,(C)I tz=z,

1 (z-z)>0}

sur lequel agit le groupe modulaire F = Sp(2n, Z) ou

Sp(2n,Z)={M=I C

)EM2(z)ltM( _0°

0) M=(-o°

Ici A, B, C, D sont dans Mn(Z). On a r = rE pour une famille E de representants des classes d'isomorphisme de varietes abeliennes de dimension n principalement polarisees. L'action de r sur D est donnee par z H (Az + B)(Cz + D)-1. Soit A une variete abelienne principalement polarisee isomorphe a Cn/A on A est un reseau dans Cn. On peut ecrire

A=Z(J w)®...®Z(J 71

Ou

w)

72n

f_t(fwfw)Ecn

pour une base W1, ... ,Wn de H°(A,1) sur C et une base yl, ... , yen de H1(A, 7L) sur Z. On peut choisir ces bases de H°(A, 52) et de H1(A, Z) de telle facon a ce que pour les matrices des periodes

Q2 = (J

Q1= ( 'yn

7n+1

0'...,J 72n CO)

on alt z = SZj 1522 dans Hn. On appelle z un module de A. Clairement, A est isomorphe a Az = Cn/(z.Zn + Zn) et Az est munie d'une polarisation principale C. On a une bijection

F\ln ^' {[(AzjC.)],z E -ln} L'hypothese que la polarisation soit principale nest pas cruciale : pour 1'enlever, it faudrait modifier le groupe r et a chaque variete abelienne polarisee de dimension n on peut associer un module (modulo F) daps 'Hn. Le Resultat principal donne donc :

PROPRIETES TRANSCENDANTES DES FONCTIONS AUFOMORPHES

85

PROPOSITION 1. - Soient A une variete abelienne polarisee define sur 0 et z E fl un module de A. Alors Z E 9{n, (0) si et settlement si A est de type CM.

Le Theoreme permet de deduire des resultats de transcendance sur les fonctions automorphes de Siegel definie sur 0. Soit K le corps de ces fonctions dont 1'existence est une consequence de celle de V, ou si l'on veut K est le corps des fonctions automorphes de Siegel qui sont des quotients de formes automorphes de Siegel a coefficients de Fourier algebriques. Alors le Corollaire donne : PROPOSITION 2. - Si z E l,,, (0) alors toute fonction automorphe de Siegel darts K et definie a z prend une valeur algebrique a z si et seulement si z est un point CM.

Un exemple de Freitag montre qu'en general les resultats donnes par le Corollaire, comme celui de la Proposition 2, sur la transcendance des valeurs des fonctions automorphes aux points algebriques non CM ne peuvent pas etre ameliores sans faire des hypotheses supplementaires sur le point non CM.

Exemple de Freitag [Fr] : prenons n = 2 dans 1'exemple des espaces de Siegel. Alors Freitag a montre que le corps K des fonctions sur 9-12 automorphe par rapport a r = Sp(4, Z) et definies sur Q est engendre par trois fonctions fl, ,((f2, f3 telles que sur

D'={z= l

0

0 T2

I

E912IT1,T2EN1

onait: fl (Z)

0,.f2(z)

9(71)+9(72),f3(z)

9(Tl)-1+9(T2)

avec G4 = G4(7-) et G6 = G6 (T), T E 9-l les series oil 9( T) = d'Eisenstein (normalisees) de poids 4 et de poids 6. On peut choisir z E V avec Ti un point CM et T2 un point algebrique mais non CM (et g(Tl), 2 T) GG4(T)

g(T2) 0 0). Le point z sera alors un point algebrique non CM auquel deux fonctions dans K algebriquement independantes prennent une valeur

algebrique. Seulement une troisieme fonction dans K algebriquement independante de ces deux fonctions et definie en z prendra une valeur transcendante en z.

Les fonctions modulaires de Hilbert : la demonstration du Resultat principal peut donner un corollaire plus fort. En effet, on n'a pas toujours

besoin de savoir que tous les coefficients du point non CM z sont des nombres algebriques, pour avoir un f E K tel que f (z) soft transcendant. Dans le cas des fonctions modulaires de Hilbert on a par exemple :

86

P. B. COHEN

PROPOSITION 3. - Soit F un corps totatement reel de degre g = [F : Q] et E) l'anneau des entiers de F. Le group r = PSL(2, O) agit sur )-l9 et on design par K le corps des fonctions I -automorphes definies sur Q. Alors, si z = (zl, ... , z9) E x9 n'est pas un point CM et si zi E 7-l(J) pour un seul i = 1, ... , g, it existe un f E K avec f (z) transcendant.

On va donner une demonstration (differente de celle qui figure dans ICSWI) de la Proposition 3 a la fin de cet article. La condition qu'il s'agit d'un seul i = 1, ... , g dans la Proposition 3 peut donner lieu a des points dans 1'espace de Siegel 7-ig dans l'image d'un plongement de 1-l9 dans 7-lg dont tous les coefficients sont transcendants mais auxquels les fonctions

modulaires de Siegel definies sur 0 ne prennent pas toutes des valeurs algebriques.

We de la demonstration du Resultat principal

:

les details de la

demonstration du Resultat principal sont donnes dans notre manuscrit [CSW]. Le fait que (ii) implique (i) est une consequence de la normalisation choisie pour D (d'ailleurs si B est une variete abelienne de type CM, alors par la theorie de la multiplication complexe, B est isomorphe sur C a une variete abelienne definie sur (0). Comme dans le cas elliptique, c'est la demonstration que (i) implique (ii), c'est-a-dire que si A,, nest pas de type CM alors z 0 D(Q) qui utilise la theorie des nombres transcendants. La demonstration de Schneider utilisait une fonction auxiliaire, polynome en certaines fonctions elliptiques de Weierstrass. Notre demonstration en dimension superieure utilise le Theoreme du sous-groupe analytique de G. Wiistholz [Wii] (donc une fonction auxiliaire donnee par des fonctions abeliennes est implicite) et des renseignements tres explicites tires des travaux de Shimura, notamment de [Shi]. Prenons le cas ou A est simple, A E [Az], Az E E, z c D = D(L, 4))

avec L = Endo(A). On suppose que Az n'est pas de type CM et donc que dim(D) > 0. On demontre ensuite que l'hypothese que z E D(Q) entraine une contradiction. Soit w1, ... , wi,, une base du Q-espace vectoriel HI (A, S2(O).. Alors it existe -ii, ... , rym E Hl (A, Z), m = i' avec, pour f.Yi w

t (f7, wl ... , f 'j

E (C-,

AQ=A®zQ=1: 4)(L)J w. 7=1

'Yi

Comme A est definie sur 0 on a 4)(L) C MA(O). Donc les elements de L induisent des relations de dependance lineaire sur 0 entre les periodes

fy w, ou w E H° (A,1

), y E Hl (A, Z).

D'autre part, dans la construction de Shimura le domaine D se decom-

pose en un produit de g = [F

:

Q] (avec les notations deja donnes)

PROPRIETES TRANSCENDANTES DES FONCTIONS AU7OMORPHES

87

domaines irreductibles D = D1 x

.

.

. x V9,

et donc le point z s'ecrit : z= v=1' Les z, sont des matrices a coefficients algebriques (par hypothese) qui sont des quotients de matrices

Les matrices Q', k = 1, 2 ont leurs coefficients des combinaisons Qlineaires des f wi, i = 1,.. , n, j = 1,... , m : c'est implicite dans la .

construction de Shimura. Si z est une matrice a coefficients algebriques it y a donc une relation de dependance (0-lineaire entre les periodes f3 wi, i = 1 ... , n, j = 1, ... , m et on peut montrer que cette relation est non triviale.

C'est ici qu'intervient l'argument de transcendance qui dit que les relations non triviales de dependance lineaire sur 0 entre les fj wi, i = 1,. .. , n, j = 1,. .. , m proviennent toutes de relations non triviales de dependance lineaire sur 4 ) (L) entre les fi w" , j = 1, ... , m. En effet, dans [CSW] nous demontrons un resultat que G. Wustholz a annonce sans demonstration dans [Wu,ICM] : LEMME. - Soit A une variete abelienne define sur 0 et isogene auproduit

direct Al' x ... x AN de uarietes abeliennes simples Aµ, dim(A,) = nµ, definies sur Q et deux-d-deux non isogenes. Alors le Q-espace vectoriel VA

engendre par toutes les periodes des d ferentielles dans H° (A, 1l) est de dimension : dimo(VA) =

Comme les If-,,. w, j = 1, ... , m sont independants sur 4)(L) l'hypothese

z E D(Q) entraine la contradiction voulue (en fait meme 1'hypothese z E D (Q) pour un seul v aurait ete suffisante : voir la Proposition 3 pour une application de cette remarque). Pourtant, lorsque L est strictement contenu dans Endo(A) on utilise l'hypothese plus forte sur z. Si L est strictement contenu dans Endo(A) et A est simple alors A est isomorphe a A., ou A,, appartient a une famille E' C E d'elements d'un S(L', 4', p') ou L' = Endo(Az-). Soit D' = D(L', I'). Si dim(D') = 0 alors A est de type CM. Si dim(D') > 0 alors la discussion precedante montre que le point z' ne peut pas etre algebrique. Pour deduire la transcendance de z de celle de z', it faut utiliser des proprietes de rationnalite de certains plongements modulaires, definis sur (0, de D' dans D. Un argument facile

88

P. B. COHEN

de plongement modulaire permet aussi de traiter le cas on A nest pas simple.

Afin d'illustrer l'idee de la demonstration du Resultat principal demontrons la Proposition 3.

Demonstration de la Proposition 3 : le quotient PSL(2, O)\'Hg est 1'espace des modules d'une famille E de varietes abeliennes polarisees de M9 (C) donnee par : dimension g dans S(F, p) avec D : F 4) : 0 H diag(aj(0),...,ag(6)),

ou al, ... , ag sont les plongements galoisiens de F dans T18. On est donc dans le cas du TYPE a) avec m = 2. Soit A une variete abelienne define sur Q et isomorphe a AZ E E on z E D. L'espace H° (A,1 ) se decompose en g sous-

espaces propres pour 1'action induite de F. Tous ces sous-espaces sont de dimension 1 sur F. Soient cal, ... , wg des generateurs correspondants et soient ryi, 72 des elements de Hl (A, Z) qui engendrent Hl (A, Q) sur F. On peut choisir cette base de telle sorte que : Wi

- J7i f72 Wi

zi =

soit dans 7-l et que z = (zi)i-i.....g soit le point qui correspond a A. Si zi E 0 alors on sait que zi n'est pas dans ai (F) et donc par le Lemme it doit y avoir un element M de Endo(A) qui n'est pas dans fi(F). Mais les elements

de 4)(F) commutent aux elements de Endo(A). Donc L4, = 4)(F)(M) est un sous-corps de Endo(A) totalement imaginaire de degre 2dim(A) et une extension CM de F, d'ou it vient que A est a multiplication complexe. Manuscrit recu le 7 fevrier 1994

PROPRIE7ES TRANSCENDANTES DES FONCTIONS AUIOMORPHES

89

REFERENCES

[CSW] P. B. COHEN, H. SHIGA, J. WOLFART. - Criteria for complex multiplica-

tion and transcendence properties of automorphic functions, preprint J. W. Goethe-Universitat, Frankfurt am Main (1993). [Mil J. S. MILNE. - Canonical models of (mixed) Shimura varieties and automorphic vector bundles, "Automorphic forms, Shimura varieties and L-functions", Vol. 1, ed. by L. Clozel, J. S. Milne, Ann Arbor 1988, Academic Press (1990), 283-414. [Sch] Th. SCHNEIDER. - Arithmetische Untersuchungen elliptischer Integrate, Math. Ann. 113 (1937), 1-13. [Shi] G. SHIMURA. - On analytic families of polarized abelian varieties and automorphicfunctions, Ann. Math. 78 (1963), 149-192. [Will G. WosTHOLZ. - Algebraische Punkte auf analytischen Untergruppen algebraische Gruppen, Ann. of Math. 129 (1989), 501-517. [Wii,ICM] G. WUsTHoLz. - Algebraic Groups, Hodge Theory and Transcendence, Proc. ICM Berkeley 1986, Vol. 1, AMS, (1987), 476-483.

Paula Beazley COHEN UA 747 CNRS

College de France 3 rue d'Ulm F-75005 Paris France

Number Theory Paris 1992-93

Supersingular primes common to two elliptic curves E. Fouvry and M. Ram Murty

1. - Introduction Let E be an elliptic curve over Q. Denote by 7ro(x, E) the cardinality of the set of the supersingular primes of E less than x. It has been conjectured by Lang and Trotter [L-T] that, when E has no complex multiplication, the following holds, when x -i 00 x (1.1) fro (x, E) ' CE tog x

where CE is a positive constant depending only on E, precisely defined in terms of Ga1(Q(Eto,.s), Q). The first significant step towards (1.1) is due to Elkies ([Ell]), who proved

that each elliptic curve over Q, has infinitely many supersingular primes. This result was improved by the authors, who proved THEOREM A ([F-M] Theoreme 1). - Let E be an elliptic curve over Q. Then, for every positive S, there exists xo(6, E) such that, for x > xo(6, E), the following holds : 1093 x . 70 (x, E) > (log4 x)'+6

Here logk is the k-fold iterated logarithm function. Note that the best upperbound for 7ro(x, E) is due to Elkies and Murty (1E121, [E131) and has the shape 7ro (x, E) = OE (x 4) , for any non CM-curve E, with the convention that CM means that the curve has complex multiplication.

In the direction of (1.1), we must also quote another result which asserts, very vaguely speaking, that the Lang-Trotter Conjecture is true on average. More precisely, let a, b be two integers with 4a3 + 27b2 let Ea,,b be the elliptic curve defined by the equation y2 = x3 + ax + b,

then we have :

0 and

92

E. FOUVRY and M. R. MURTY

THEOREM B ((F-M] Theoreme 6). - For every positive e, we have, for x - oo, the asymptotic relation 'ro(x,Ea.,b)

\3

IaI
).(4AB) fox-

uniformlyforA> xz+E B > x2+E, AB > xz+E A familiar way to write (1.1) is to say that the probability for a prime p to be supersingular for E is (1.2)

CE 2

f' 1

and we are led to the problem of the primes supersingular for two given elliptic curves E and E'. We say that two elliptic curves over Q are in general position when none of them has complex multiplication and when they are not isogenous over Q. Let us recall that, when E has complex multiplication,

we have 7ro (x, E) - 2 iog , and if E and E' are isogenous over (0, they have the same supersingular primes apart from the prime divisors of the conductors of these curves ; in other words, we have (1.3)

iro(x,E,E') =7ro(x,E) -OE(1) =iro(x,E') -OE,(1),

where 7ro(x, E, E') is the cardinality of the set of primes, less than x, supersingular for E and E'. Then, following (1.2), it is natural to think that if E and E' are in general

position, the probability for a prime p to be supersingular for E and E' should be CE E' P

where CE,E, is a positive constant depending only on E and E'. Such a probabilistic assumption of independence appears in IL-T] page 37, and leads to the conjecture (1.4)

7ro(x, E, E') - CE,E' loge x (x -- 00)-

when E and E' are supposed to be in general position. This conjecture seems extremely hard to prove - for the moment, nobody knows how to prove that fro (x, E, E') - oo when x -> oo - since the set of primes in question is very, very sparse (heuristically as sparse as the following set connected with the Mersenne conjecture : {p < x; 2" - 1 is a prime}). The bulk of this paper is to prove that (1.4) is true on average, in the same philosophy as Theorem B proves (1.1) on average. We will prove

SUPERSINGULAR PRIMES COMMON It) TWO ELLIPTIC CURVES

93

THEOREM 1. - For every positive E, we have for x -> oc, the asymptotic relation 0(xi Ea,b> r"'a',b') ..'

(1.5) IaI
35

to g2x(16AA 'BB')

IbI
holds uniformly for A, A' >

x2+E, B, B' >

x2+E, AB, A'B' > x-2+E

Let S(A, A', B, B') be the sum studied in (1.5). To be allowed to say that Theorem 1 proves (1.4) on average, we must check that the contribution to S(A, A', B, B') of the pairs of curves (Ea,b, Ea',b,) which are not in general position is negligible. So we denote respectively by SCM(A, A', B, B') and S1--(A, A', B, B') the contribution of those pairs with Ea,b having complex multiplication and with Ea,b and Ea',b' isogenous over (Q. The first contribution satisfies : SCM(A, A', B, B') < {(a, b); Ial < A, JbI < B, Ea,b is CM}1

E Y, 70(x,Ea',b') Ia'I
There are thirteen families of elliptic curves with complex multiplication, they can be written as E0,t; Et,o; Ea=t2,p,t3 (t E Z*, 1 < i < 11)

where the (ai, ,3z) are eleven pairs of integers. Theorem B implies under the conditions of Theorem 1 SCM (A, A', B, B') = O (max(A, B).A'B'

tog ) = O(AA'BB'),

which is clearly negligible, compared to the expected main term. The term Sls°(A, A', B, B') requires a more delicate treatment. We shall prove later the following : LEMMA 1. - Let E be an elliptic curve over Q. Then, for A and B tending to infinity, we have {(a, b); 0 < dal

A, 0 < IbI < B, Ea,b isogenous overQ to E

= O(min(A2 , B 3) logs (2AB)), where the "0" is independant of E.

94

E. FOUVRY and M. R. MURTY

This lemma and Theorem B directly imply

S's°(A A' B, B') < max I{(a, b); Iat < A, jbI < B, Ea,,b isogenous over Q to E}I E/Q

E E lro(x,Ea',b') Ia'I
= O(min(A2 , B 3 ).A'B' f 1og10(2AB)) = O(AA'BB') which is also negligible.

Proof of Lemma i : since Ea,b and E are isogenous over 0, the isogeny is defined on a number field K of relative degree at most 12 over Q ([MW] Lemma 6.1 for instance). Now, we use a strong theorem of Masser and Wustholz ([M-W] Theorem), asserting, that if there is an isogeny between two elliptic curves E and E' over a number field k of degree at most d over Q, then there exists between them a "simple" isogeny, i.e. with a degree less than c(w(E'))4, where c depends only on d and w(E') is the maximum of 1 and of the logarithmic Weil height of the curve E. In our situation, we deduce that, if E and Ea,b are isogenous, there is an isogeny of degree O(log4(2AB)). A simple trick of algebra ([M-W] Lemma 6.2) allows us to suppose that this isogeny is cyclic.

Let jE be the invariant of E. The modular polynomial of order n, 'D.n(X, jE) ([La] pages 55-59) detects the existence of a cyclic isogeny of degree n between Ea,b and E. More precisely, such an isogeny exists if and only if we have 6912a3 \ 4a3 + 27b2 ,

3E) = 0.

The degree in X of 4)n (X, jE) is n fl (1 + P) = O(n log 2n), which implies PIn

that the equation 4),,(X, jE) = 0 has at most O(nlog2n) solutions in Q. Let v be such a root, which we can suppose different from 0 and 1728 since ab j4 0. The equation in a and b 6912a3

u

4a3 + 27b2

v

has O(min(Ai, B)) roots in the rectangle [-A, A] x [-B, B]. Then gathering these solutions when v varies and when n varies, we complete the proof of Lemma 1. The above discussion gives another formulation of Theorem 1

SUPERSINGULAR PRIMES COMMONT0'IWO ELLIPTIC CURVES

95

THEOREM 2. - Let t indicates that Ea,b and Ea',b, are in general position. Then under the conditions of Theorem 1, we have

E E E 7ro(x, Ea,b, Ea',b' ) IaI
(96 .loge x)

E E 1:l 1.

IaI
A more difficult question seems to give an asymptotic formula similar to (1.6), but where Ea,b and Ea',b' takes only one value in each isogeny class.

2. - From supersingular primes to class numbers The starting point of our proof is LEMMA 2. - Let p > 5 be a prime. The number of isomorphism classes of elliptic curves over FP with p + 1 points is equal to H(-4p) (number of of isomorphism classes of positive quadratic forms, not necessary primitive, with discriminant -4p).

Such a statement appears at several places in the literature, for instance

in [Bil, page 58. Since the quadratic forms counted by H(-4p) are not necessarily primitive, we have the equality

H(-4p) = h(-p) + h(-4p)

(2.1)

with the convention that the h symbol is equal to zero when it is not defined.

We denote by y2 = x3 + aix + bi (1 < i < H(-4p)) the equations which define representatives of the above isomorphism classes of elliptic curves over Fp. We can suppose that these equations are minimal relative to p, for instance by imposing the conditions 0 < ai < p, 0 < bi < p. If the equation defining Ea,b is minimal for p we deduce that p is supersingular for Ea,b if and only if there exists t E F , and 1 < i < H(-4p), such that

a - ait4, b - bit6(mod p).

(2.2)

For aibi 0- 0(mod p), the image of the application t E IF , -> (ait4, bits) has cardinality P2 1 for p > 5. We gather the above observations into

LEMMA 3. - Let p > 5, then there exists a set £p of residue classes mod p x mod p with the following properties i)

I£PI=H 2Pp+0(p)

96

E. FOUVRY and M. R. MURTY

ii)

if (a, b) E 7L2 - { (0, 0) } and if k is the largest integer such that

p4kIa and p6kIb, then p is a supersingular prime for Ea,b, if and only if

(ap-4k, by-6k) (mod p) belongs to It is now easy to transform S(A, B, A', B') into SOP.

S(A,B,A',B')=>( 1: E 1: 1+O((ro4+1)(B+1)))x Pox (u,v)EEp a=u(mod p) b--(mod p) I.I
bj
1+0((4'+1)(B'+l)))+O(ABA'B'),

( ,b'I
where the errors terms come from the non-minimal equations and from the primes 2 and 3. Using the trivial equality 1 = 2A + 0(1), p

a=u(modp) 1.1:5 A

we get the equality

SABA'B' ( ) =

I P<x

(2.3)

EPI

22A2B 2A' 2B'+ p

p

p

p

+0 (x4 log2 x + (A + B + A' + B')x3 log2 x+

+(ABA' + -

+ BA'B')x log2 x + ABA'B'),

by using Lemma 3 ii) and by using the classical upperbound h(-d) _ 0([d log d) (note that some log-factors could be spared by using average bounds of that quantity by methods of the next Proposition). We postpone to paragraph IV an improvement of (2.3) by appealing to the theory of exponential sums. The equation i) of Lemma 3 splits (2.3) into main term and error term :

S = MT + ET

(2.4)

with

MT(A, B, A', B') = 4AA'BB' > H2(-4p) p2 P<x

and

(2.5)

ET(A, B, A', B') = O(ABA'B')

under the conditions (2.6)

A, A', B, B' > x log x.

3. - Class numbers on average To evaluate the main term in (2.4), we will prove the

97

SUPERSINGULAR PRIMES COMMON'10'IWO ELLIPTIC CURVES

PROPOSITION. - For x -> oo we have

H'(-4p)

35 241og2 x.

2

P

P<x

Such a formula, one more time illustrates the well known fact that on and we will treat it by the values of Laverage h(-d) behaves like functions at the point 1. Formula (2.1) transforms our sum into EP<x

h2( P + 2 EP<x

(3.1)

h -PP -4P +

EP<x

hz P 4P

= T1,1(x) + 2T1,4(x) + 7'4,4(x)

say. We will concentrate on the typical sum T1,1(x), for which we will prove

T1,lx) ,.

(3.2)

5

24 1092 x

but the proof of this formula can be obtained, after integrating by parts, from the following (3.3)

Ti,l(x) :_ P<x

h2 (-p) p

x

5

24 log x

The classical Dirichlet class number formula, coupled with the PolyaVinogradov formula gives an expression of h(-p) as a finite sum

h(-p) = ZP 1: X-P(n) 7r

n
n

rplogUpl

+0 `

'

for any U > 1 and X-P, the Kronecker symbol associated to -p for p - 3(mod 4). Squaring this equality, we obtain for V/7< U < x (3.4)

Ti,l(x) =

2

X-P(nln2) +O (x logx\
711112

`

U

J

To treat the triple sum in (3.4), we put 1712 = m12 with m squarefree and dU (n) the modified divisor function

du(n) :_ {(nl,n2); nl < U, 122 < U, nln2 = n}1,

E. FOUVRY and M. R. MUR7Y

98

the main term in (3.4) is/now p2(m) T2

X-p( m)X2 p(1) 2

du(ml2).

p<x l

m

The most important contribution comes from m = 1, it has the shape 1

d 1122)\ l
1) 1)

P<.,pf'

=

(

+o(1)

l T.-'X lox

.

p-3(mod 4)

But we have the equality °° 2k + 1

°O d(12)

E

12

- 11

p2 k

d(12)

(

1 + 1/p2

= 11 (1 - 1/p2)2

1=1

d12

+O(U-4)).

-

_ (3(2) _

57r2

12

Thus (3.4) may be written as 24 log x

(1 + O(1))+

µ2(m)X-P(m)X? PMdu(ml2) m p<x m#1

(3.5)

).

1, we hope cancellation from summation over p and m of the Since m terms X_p(m). We call W(x, 1) the double sum over these variables in the error term of (3.5). After cutting the ranges of summation, we have :

W(x,l) =O(log2x{sups ply IW(1,M,P)I +xE})

with

du(ml 21 and the error term xE coming from the p dividing 1. In W(1, M, P), m is never divisible by 4, so we put m' = m or m' = m/2 if m is odd or even; if we fix the congruence of p and m' modulo 8, we see that the Kronecker symbol is expressed in terms of the Jacobi symbol W(11 M, P) _ EP
M

E1 X-p(m) = (176 ) r2 where E1 and E2 are of absolute value 1, are independent of m and p. We now appeal to a general upperbound for a double sum over Jacobi symbols :

LEMMA 4. - Let (an,,) and (bn) be complex numbers. Then we have

)7,

E

M<m<2M N
a7Dbn(m) « 11a112 IIbII2(M7 + M"N"),

\n

99

SUPERSINGULAR PRIMES COMMON 717 TWO ELLIPTIC CURVES

where the sum is over odd square free integers m and n.

Remark that the reciprocity law for Jacobi symbols implies that, in the

above sum the variables m and n play a similar role and the following upperbound will be sufficient for our purpose : (3.6)

a,,,,bn (n) << 11a112 I bI I2(MN)' (min(M, N))-6, I

M<m<2M N
for some absolute positive S. Our proof is quite standard and mixes Cauchy-Schwarz inequality and

Burgess bound for character sum ([Bu] Theorem 2 with e = 1/32) . This proof is a slight generalisation of [H-B], Lemma 4. We have

Ea.j:bn(n) IIaII2{ ybn(n) in n m n m I

bn, 6n2

I a 112

n,

n2

71

nln2

m

IIaII2(IIbII2M+ E E n1

212

l

Ibn,bn2IMIN')71

n2 23

IIaII2IIbII2M2 + IIaII2IIbII2M9N

.

Note that the classical Polya-Vinogradov would be sufficient to prove (3.6), if the situation M1-61 < N < M1+61 (with 61 a very little positive constant) was excluded, Burgess bound is used to cover that case. This bound, which depends on a result of Weil, could be replaced by the much more accessible

(Corollary of IF-I]), if that last result was stated for general moduli, not necessarily prime moduli. We now use (3.6) to bound W(1, M, P) M-1 (E du (ml2)) 2 M' P(min(M, P))-6 w y, M, P) << 1

(3.7)

d(l2)P loge x (min(M, P))-6 << 12 x log-10 x

as soon as we have M > (log x)1 . In the case where M < (log x)

100 ,

(3.7) is a direct application of the famous Siegel-Walfisz Theorem on the distribution of primes in arithmetic progressions. Inserting this bound in (3.5), summing over 1, we obtain (3.3) and by the way (3.2). The other terms T1,4 (x) and T4,4(x) are evaluated along the same techniques. We write T1,4(x) =

2 2

X-P(nl)X-4p(n2) nin2 P<x n1
logx

+0(X! U

E. FOUVRY and M. R. MUR7Y

100

In the above/ sum, we may suppose that n2 is odd, this implies that X-4p(n2) = X-,(n2)If we write d* (n) = i{(nl, n2); nl < U, n2 < U, nln2 = n, n2 odd} I, we obtain the equality

i 2

T1,4(x)

l
di212)

(3.8)

1+1/p2

4

2

1)

p<x,p{l p-3(mod 4)

x

x

1

4'logx

7x23 P>3 (1 - 1/p2)2'2logx Similarly, we have the equality

T4,4 (x) =F4

X-4p(nl)X-4p(n2) nln2

3 2 + o x'2 log x

U

) and we may suppose that both nl and n2 are odd. We define dU (n) = I {(nl, n2); nl < U, n2 < U, nln2 = n, nl and n2 are odd and we obtain P<x

n1
dUl2 ** l2 )

4

/

T4,4 lx) ,. -2 (3.9)

i
1)

( p<x,p)1

4 7 1+1/p2 x 72 11 (1 - 1/p2)2'logx

3

x

4'logx'

Gathering (3.1), (3.2), (3.3), (3.8) and (3.9), we get

T(x)

5

(24

3) x _35 x

1

+

2

+ 4) logx

24logx'

which ends the proof of Theorem 1.

4. - Use of exponential sums The aim of this paragraph is to weaken the conditions (2.6) over A, A', B, B' down to the conditions appearing in Theorem 1. We improve the use of the formula

EE

(u,v)EEp a-u(mod p) b-v(mod p) Ial
Ib1
giving the number of Ea,b with Ial < A, Ibi < B for which a given prime p > 5 is supersingular. The idea, already appearing in IF-M], is to say that is also in £p) ; £p is not too chaotic, (i. e. if Ea,b belongs to £p, then it is now possible to appeal to the theory of exponential sums to detect the A, Ibu6I < B. We extract from [F-M], paragraph 7 the conditions I au4l following lemma

SUPERSINGULAR PRIMES COMMON TO TWO ELLIPTIC CURVES

101

LEMMA 5. - Let 0 < a, Q < p, such that the given prime p > 5 is supersingularforE,,g. then the number ofEa,,b, (lad < A, JbI < B) isomorphic to Ea,p with p4' a and p6 f b is

Let us denote by Fp a subset of representative classes of isomorphism of the form E,,,p with p f a/3. Note that the cardinality of that subset is H(-4p) - 0(1). By inserting the result of Lemma 5 in the formula (2.3), we now obtain

S(A, B, A', B') _

/I-Fp4pB.p p<x

(,EFI(4

l

1

2

+0(,/p log2p(1+A+

1+0(Jlog2p(1+

pB'.p

2

B)))

+A+B))x

p'+ p/)))+O(Ap +A'+B'))'

where the error terms come from the curves Ea,b and Ea',b' with plab and pIa'b'. Using now the relation I FpI = O(,/p- log p), we get

4B

S(A,B,A',B') _

p21 +O(plog3p(p+1)(p+1)))x

p<x

(IT 4A pB' p 2 = 16ABA'B'

(BP'

1

(y

+0(plog3p( p/ + 1)

p<x p

+ 1)))

O(ABA'B'),

under the conditions A, A' > x2+E, B, B'> x2+E, AB, A'B' > x2 +E. Since we have I-Fpl2 = H2(-4p) + O(,,fp-logp), the treatment of the main term is straightforward by the Proposition.

Manuscrit recu le 29 septembre 1993

102

E. FJUVRY and M. R. MUR7Y

REFERENCES [Bil B. BIRCH. - How the number of points of an elliptic curve over a fixed prime field varies, J. London Math Soc. 43 (1968), 57-60. [Bu] D.A. BURGESS. - On characters sums and L-series. II, Proc. London Math Soc.(3) 13, (1963), 524-536. [Ell] N. ELKIES. - The existence of infinitely many supersingular primes for every elliptic curve over Q, Inv. Math. 89 (1987), 561-567. [E121 N. ELKIES. - Supersingular primes of a given elliptic curve over a number field, Ph. D. Thesis, Harvard University, (1987).

[E131 N. ELKIES. - Distribution of Supersingular Primes, AsterisqueJournees Arithmetiques de Luminy 1989 198-199-200 (1991), 127132.

[F-M] E. FOUVRY and R. MURTY. - On the distribution of supersingular primes, (preprint). [F-11 J. FRIEDLANDER and H. IWANIEC. - A mean-value theorem for charac-

ter sums, Michigan Math J. 39 (1992), 153-159. [H-B1 D.R. HEATH-BROWN. - The size of Selmer groups for the congruent number problem, Inv. Math. 111 (1993), 171-195. [Lal S. LANG. - Elliptic functions, Addison-Wesley, (1973). [L-T] S. LANG and H. TROTTER. - Frobenius in GL2 extensions, Lecture Notes in Mathematics 504, Springer Verlag, (1976). [M W) D.W. MASSER and G. WusTHOLZ. - Estimating isogenies on elliptic curves, Inv. Math. 100 (1990), 1-24. [Mu] R. MURTY. - Recent developments in the theory of elliptic curves, Proceedings of the Ramanujan Centennial International Conference, (1987), 45-54. Etienne FOUVRY

Mathematique- Batiment 425 Universite de Paris-Sud F-91405 ORSAY Cedex Ram MURTY

Department of Mathematics Mc GILL University MONTREAL, PQ CANADA H3A 2K6

Number Theory Paris 1992-93

Arithmetical lifting and its applications Valeri GritsenkV

1. - Introduction and formulation of the main results Let F(Z) be a Siegel modular form of weight k with respect to Sp4(Z). By definition F is a holomorphic function on the Siegel upper half-plane

H,={z=I

z

)EM,(C),Irn(Z)>O

that satisfies the functional equation

(1) Flk g(Z) := J(g, Z)-"F(g < Z >) = F(Z), J(g, Z) = det(CZ + D), for any g = ( C

)

E Sp4(Z). The Fourier-Jacobi expansion of F is its

D with respect to the variable w Fourier development (2)

F(T, z, w) = fo (T) + L f,,,, (T, z) exp(2iri mw), m> 1

where ,r = u+iv (v > 0) belongs to the usual upper half-plane H1 and z E C. The Satake compactification of the quotient space Sp4 (Z) \ T I2 has two boundary components : the curve SL2 (7G) \H-H1 and the point oo. The function

fo(r) is equal to the restriction of the modular form to the boundary curve and the expansion (2) corresponds to the Fourier expansion with respect to the maximal parabolic subgroup defining the boundary curve (see [P-SI). The functions fm. (T, z) are examples of Jacobi modular forms of index m (see

[EZ]). In this paper we construct a lifting from the space of Jacobi modular

forms of index t in the space of modular forms on the Siegel upper-half space HH2 with respect to the so-called paramodular group r[t] : Lifting : {Jacobi forms ft : H1 x C --> C} --> (3)

{modular forms F : I' [t] \ H2 - C}, (*)Partly supported by Forschungsschwerpunkt "Arithmetik Mannheim-Heidelberg".

104

V GRrFSENKO

where by r[t] we denote the following subgroup of the rational symplectic group

r[t] _

(4)

E Sp2 (Q)

,

t*

where t is a natural number and all * denote integral numbers. This group appears in the following algebro-geometric context. Let S be an abelian variety of dimension two with polarization of type (1, t) (t E N). We may write S as a two dimensional complex torus c2/(Z,T)7L4

S

where (Z, T) is the period matrix, Z E 1H12 and T = ( 1

0). The polariza-

tion with respect to this basis is given by the bilinear form Jt =

0

-T

T 0

The integral symplectic group of this skew-symmetric form Sp(Jt,7L) = {g E M4 (Z) : gJttg = Jt}

is called the parasymplectic (or paramodular) group. It is easy to see, that this group is conjugated to the group rtb C Sp4(Q) and rtb = tr[t], more exactly, *

It 1Sp(Jt,Z)It - rtb =

*

* * *

*

t-1*

t*

* t* t* t* * t* *

E

tr[t],

*

where It = diag(1, 1, 1, t) and all * denote integral numbers. We shall also keep the name "paramodular group" for r[t]. The quotient space

At=rtb\j2

is the coarse moduli space of abelian surfaces with polarization of type (1, t) (see [I[, [HKW)). At has a structure of a quasi-projective algebraic variety. For p=1 the variety Al is the moduli space of abelian surfaces with principal

polarization and it is rational (igusa). For p=5 this variety is connected with the famous Horrocks-Mumford vector bundle (see IHKW)) and is also rational. It is known, that A2, A3, A7 are rational.

The first application of the lifting (3) is the following theorem about geometrical type of the variety At.

ARI7HME77CAL LIFTING AND ITS APPLICA77ONS

105

THEOREM 1. - Let j., be a non-singular model of a compactification of the moduli space Ap of abelian surfaces with polarization of type (1, p). The variety AP is not unirationalfor any prime p, greater than 11.

The lifting (3) has a purely arithmetical description. We shall construct

the lifted form F using a representation of a Hecke ring of a parabolic subgroup of Sp4(Z) on the graded space of Jacobi modular forms. This Hecke ring is defined in §2. We shall see, that the lifted form F is in a sense a generalization of the classical theta-function 6(-r) = E,nEZ exp (27rin2 r) (see §3). This analogy will give us the second important application : a new integral representation of the Spin L -function of Siegel modular forms. Let us recall the definition of the Spin L-function. The Hecke ring 'H(r) of the integral symplectic group r = Sp4(Z) is generated by the following elements

T(p) = rdiag(1, l,p,p)F, Ti,p = Fdiag(1,p,p2,p)r, (OP)±1 = (rpE41)l1, where p is a prime number. The local factors of the Spin (or Andrianov) Lfunction are connected with the following polynomials Qp(X) of degree four over the Hecke ring 7-1(F)

Qp(X) = 1 - T(p)X +p(Ti,p+ (p2 + 1)AP)X2 -p3ApT(p)X3 +p6A2X4. Let F(Z) be a Siegel modular form, which is an eigenfunction of all Hecke operators. Then one defines L-function ZF(s) of the modular form F ZF(S) =

fi

Qp,F(p

8)-1,

P - prime

where the polynomial QP,F(X) is obtained from the polynomial Qp(X) by exchanging the elements of Hecke ring in the coefficients of this polynomial with their corresponding eigenvalues. If we denote by ao, al, a2 the Satake parameters of the one dimensional representation of the local Hecke ring Wp(r) defined by eigenvalues of the function F(Z), then the local factor QP,F(X) has the following form

Qp,F(X) = (1 - aoX)(1 - aoa1X)(1 - aoa2X)(1 - aoala2X). ZF(s) is the Spin L-function in the Langlands classification (see [L]). The analytical continuation of this L-function was constructed in [A],

but the proof contains cumbersome calculations and takes 50 pages in the Russian Mathematical Survey. In §4 and §5 we construct new integral representations of ZF(s) as a Rankin-Selberg convolution of a given cusp form with the lifting of some of its Fourier-Jacobi coefficients. It shall give a new short proof of Andrianov's result together with additional information about poles of this function obtained in the papers of Evdokimov and Oda.

106

V.. GRITSENKO

TI-IEOREM 2 (See [A], [Ev], [021). - Let F be a cusp form of weight k with respect to Sp4 (Z) and F be an eigenfunction of all Hecke operators. Then the

function

ZF(s) = (21r)-2, (s)r(s - k + 2)ZF(S) can be continued meromorphically to the whole s-plane with only two possible

poles at s = k - 2, k and satisfies the functional equation

ZF(2k - 2 - s) _ (-1)kZF(s) Moreover ZF (s) is entire function except the case of F being a Maass modular form of even weight k. For such a Maass form the L :function ZF(s) has two

simple poles at s = k - 2, k with the residue 7r2-k< F, F >/< fl, fl > at s = k, where < F, F > is the scalar square of the Siegel modular form F and < fl, fn > the scalar square of its first Fourier-Jacobi coefficient.

2. - The Spin L-function and Dirichlet series The Fourier-Jacobi coefficients fm (T, z) of a modular form F (see (2)) are modular forms of weight k with respect to congruence subgroups of SL2 (Z)

for any fixed z. For fixed T they are Jacobi functions, that we usually use to construct embeddings of the elliptic curve cC/TZ + Z in the projective spaces. Taking these two properties together one may say, that f,,,, is a modular form with respect to the Jacobi group ),J = SL2 (Z) x H(Z), where H(Z) is the integral Heisenberg group, i.e., the following central extension

0-Z-4H(Z)-*ZxZ-+0. The Jacobi group is isomorphic to the following maximal parabolic subgroup of Sp4 (Z)

* 0**

(5)

r°°

11**** 0 0

*

0 0*

=

a 0

b

0

1

0

0

1

0

1

0

0

1

1

r

c

0do

-q 0

0

1

q

0

0

0

1

1(0

0

0

1

°`

where q,1, r E Z and I a d ) E SL2 (Z). Using this realization of the Jacobi group as the parabolic subgroup r we may give the following definition of Jacobi modular forms. DEFINITION. - A holomorphic function

q(T,z): ]H[1 XC ->C

is called a Jacobi form of index m and weight k if the function q(Z) _ ¢(T, z)exp(2iri mw) on the Siegel upper half-plane H2 is a modular form of weight k with respect to the parabolic group r,,,, i.e.,

ARI7TIMETICAL LIFTING AND ITS APPLICA77ONS

107

1. 01kM=¢foranyMEF,c,; 2. The function has the usual Fourier expansion

0(T, z) _ L f (n, l) exp (2iri (nT + lz)). n,IEZ, n>_0 4nm>I2

This definition is equivalent to the definition given in [EZ]. We call a function 0 a Jacobi cusp form if we have the strict inequality 4nm > 12 in the last summation. We shall denote the space of all Jacobi forms or all Jacobi cusp forms of index m and weight k by 9)T/ m or C5k The construction of the lifting will be described in terms of the Hecke ring of the parabolic

subgroup F. Note here, that F,, is not reductive! We shall consider this ring as a non-commutative extension of the Hecke ring of Sp4(Z). First of all let us recall the definition of an abstract Hecke ring.

Definition A pair (r, G), where r is a subgroup of a semigroup G, is called a Hecke pair if any double coset rgr (g E G) is the union of a finite number of left and right cosets relative to F. The Hecke ring 1-l (r, G) of the pair (r, G) is the r-invariant subspace of the Q-vector space consisting of

all formal finite linear combinations X = Ei ai rgi (ai E Q, gi E G), where a representation of the. group r on this space is defined by the right multiplication X --+ X 7 = Ei ai r(gi-y). For any two elements of this space X = Ei ai rhi and Y = E4 b3 rgi their product is defined by X Y = Ei aib4 r(hig4). The product is independent of the choice of representatives gi, h; and 7-l(F, G) is an associative ring.

The elements rgr = Li rgi (g E G) form a basis of the vector space l (r, G) and our definition is equivalent to the standard definition of the Hecke ring. Let us define two Hecke rings 'H (F) ='HQ(Sp4(Z),GSp4(Q))

and

x(r.) ='HQ(roo,Groo(Q)),

where GSp4(Q) = {g E M4(Q) : t9J19 = l4(9)JI,

µ(9)E Q+}

is the group of symplectic similitudes and Gr,, (Q) its parabolic subgroup of type rte. If X E 7-1(r), then according to the elementary divisor theorem one can represent X in the form X = Ei airgi, where gi E Gr,,,, (Q) and ai E Q. It easy to see that the map (6)

Im : X = > airgi -+ E airoo9i

108

V. GRITSENKO

is a homomorphic embedding of the Hecke ring H(I,) into 7-1(I'..) (see [G1]) and we shall identify the ring 7-1(r) with its image in 7-1(F ). We have the following representation of the ring 7-1(I ,,,) on the space of functions, which are invariant with respect to Ik -action (see (1)) of the

parabolic subgroup r,,,,

F ->FjkX=

ai /l(gi)2k_3J(gi,

Z)_kF(gi

< Z >),

(7)

aiI',,.g

(X =

If F is a Siegel modular form of weight k with respect to r and X E 7-1(I) C 7-1(I ,,,,) we obtain the representation of the ring 7-1(r) on the finite dimensional space of Siegel modular forms (Hecke operators). It is known that eigenfunctions of all Hecke operators form a basis of the space of all Siegel modular forms. We have identified the Hecke ring of the symplectic group with its image (see (6)) in the ring 7-1(F ), which also contains two subrings isomorphic to the Hecke ring 7{(SL2(Z)) : (8)

x(r)

7{(x(,)

"

7-l(SL2)

.

It is enough to define the embeddings j± for the generators

T(p) = SL2(Z)diag(1, p)SL2(Z) and T(p, p) = SL2(Z)diag(p, p)SL2(Z) of the ring 7-l(SL2(Z)). By definition we have

j_(T(p)) = T_(p) = I 00diag(l, p, p, 1)I 00,

j+(T (p)) = T+ (p) = r0diag(1,1, p, p)r,,

j-(T(p,p)) = A-(p) = r.diag(p,p2,p,1)r0, j+(T(p,p)) = A+ (p) = r.diag(p,1,p,p2)r00 The statement that the mapping j_ is a homomorphic embedding is clear, because there is a one-to-one correspondence between the left cosets in the decomposition of the double cosets T(p), T(p,p) and T_(p), A_(p) (see [G1] and [G5] where more general embeddings have been constructed). The mapping j+ is dual to the embedding j_ with respect to the involution * of

the Hecke ring l(r

)

*:

I -gr- -> r-µ(g)g-11F00

The next lemma is a special case of a general result proved in [G I].

ARJ7HMETICAL LIFTING AND ITS APPLICA77ONS

109

LEMMA 1. - The polynomial QP(X) splits over the ring -l(F,,,) :

QP(X) = j_ (QSL(X)) (1 + pzP(VP - p)X 2) j+(QsL (X )), where thefirst and the thirdfactors are the j f -images of the Heckepolynomial

QSL (X) = 1 - T(p)X + pT(p, p)X2 for the group SL2 (Z) and

VP rEP-1Z/Z

rEP 1Z/Z

1

0 1

0 0

0

0 0 0

0

1

0

0

1

0

r

Proof : using the elementary divisor theorem for the symplectic group we can calculate the images (6) of the generators of 'H (IF) in the Hecke ring

of the parabolic subgroup l (F

)

:

T(p)=T_(p)+T+(p), T1,=A_ (p)+A+(p)+r diag(1,p,p2,p)r,,,+LP(VP 1).

The coefficients T_(p) and A_(p) of the polynomial j_(QPL(t)) = 1 T_ (p)t + pA_ (p)t2 have the same decompositions as sums of left cosets as the elements T (p) and T (p, p) in the Hecke ring of SL2 (Z) and it is easy to verify that the following identities hold : A_(p)T+(p) = p20PT_ (p), T_ (p) (V P - p) = 0,

A_ (p) (VP

- p) = 0.

Using the antiautomorphism *, we have that T_ (p)A+(p) = p20PT+(p),

(OP - p)T+(p) = 0,

(OP - p)A+(p) = 0.

Taking into consideration the identity T_ (p)T+ (p) = pr. diag (1, p, p2 , p) r. + (P3 + p2) OP,

which one can easily check, we obtain the factorization of the lemma.

There are two representations of the Hecke ring f(F,,,) on the space of the Fourier-Jacobi coefficients of Siegel modular forms. The first one is the representation "Ik " on the space of all Jacobi forms of weight k (homogeneous modular forms with respect to Fj, defined in (7), and the second is the representation on the space of Fourier coefficients of r,,,invariant functions F(Z) f.. I I k X := the

mtrt Fourier-Jacobi coefficient of the function Fl k X X.

110

V. GRITSENKO

The following formulae are clear from the definitions (see [G1] for more general statements) fm(T,z)IIkT+(n)=fmnik T+(n)(Z)exp(-27rimw), (9)

fm(T,z)IIkT-(n)= llif-/nIk T_(n)(Z)exp(-27rimw),

otherwise, where by T± (n) we denote the j±-images of the standard Hecke element ,

SL2(Z)diag(a,b)SL2(Z).

TSL(n) a6=n alb

To make our notation shorter we set (fmnik T+(n))(Z)exp(-27rimw), fmnik T+(n) (10) (fmlk T_(n))(Z)exp(-27rimnw). milk T_(n) These are Jacobi forms of index m and mn respectively. COROLLARY 1. - Let

F(T, z, w) _

fm (T, z) exp(27ri mw) m>1

be a Siegel cusp form of weight k which is an eigenfunction of all Hecke operators. Then for any natural n and prime p the following identity holds in the ring of formal power series X6 = Qp,F(X) E fnp6l k T+ (P) 6>0

(fn+fnIkT-(p)X+pfP IkA-(p)X2)Ik(1+p(Vp-p)A X2), where

f

fmIk(1+p(V -p)A X2) = 1 (1 - p2k-4X2) fm,

if m - 0 mod p, otherwise.

Proof : taking the j+-image of the formal power Hecke series for SL2 (Z)

we have >6>0T+(p6)X6 = j+(QSL(X))-1. The function F(Z) is an eigenfunction, thus Qp,F (X) fn = fn I l k Qp(X) and we obtain with help of Lemma (11) 1 the following identities in the ring of the formal power series

Qp,F(X)Y,fnp6lkT+(p6)X6 =Qp,F(X)EfnllkT+(p6)X6 = 6>0

fnllkQp(X)(1-T+(p)X

6>0 +pA+(p)X2)-1 =

M l k (1 - T_ (p) X + pA- (p) X 2) (1 + p(Op - p) APX 2) .

To finish the proof we can use the formulae (9).

ARI7HMETICAL LIFTING AND 175 APPLICATIONS

111

COROLLARY 2. - Let F(Z) be the same as in the previous corollary. Let t be a natural number such that ft 0 0 and all Fourier-Jacobi coefficients ft1d of the modular form F, where d > 1 is a divisor of t, are identically equal to 0. Then the following identity holds for sufficiently large Re(s)

L(2s - 2k + 4, Xt) L ft,, (T, z)IkT+(n) n-s = ft (T, Z) ZF (S), n>1

whereL(2s-2k+4, Xt) is the Dirichlet LfunctionwiththeprincipalDirichlet character modulo t.

Proof : one has to apply the identities of previous corollary successively

for all primes p with X = p-' and to take into account the standard estimation

(T, z) I = O((v/m) _ 2 e2"'"`y2/v) of Fourier-Jacobi coefficients

(see [KSD.

A generalization of these results to the case of Spn can be found in [G1].

3. - Jacobi lifting. In full analogues with (1) one can define the space fmk(r[t]) of all modular forms of weight k with respect to the paramodular group r[t] (see (4)).

In this section we construct an injective map from the space of Jacobi forms of index t > 1 and weight k (i.e., from the space of modular forms on

the parabolic subgroup I'j into the space of modular forms with respect to the paramodular group of level t. THEOREM 3. - Let q5(T, z) be a Jacobi form of weight k and index t > 1 with the following Fourier expansion

O(T, z) = ) ,

f (n, 1) exp (2iri (nT + lz)).

n,LEZ t> 12

If the zeroth Fourier coefficient f (0, 0) of the Jacobi form 0 is not 0, we also suppose that the weight k > 4. Then the following function (see (10)) CO

m2-k

G,6 (7-, z, w) = f (0, 0)Ek(T) +

Ik T_ (m)) (T, z) exp (27ri tmw)

nt=1

is a modular form of weight k with respect to the paramodular group T[t], where Ek(T) sk + n>1 Qk-1(n)exp (27ri nr) is the Eisenstein series of weight k on SL2 (7G).

112

V. CRITSENKO

Let us make some remarks about this theorem. If index t = 1, the map -* G,6 coincides with well-known the Maass or the Saito-Kurokava lifting (see [EZ]). The theorem shows that the Maass lifting is only the first member in the infinite series of liftings connected with Jacobi forms. Thus for any Siegel modular form fm(T, z) exp (27rimw)

F(T, Z, W) = m>O

we can construct a infinite series of lifted functions

that defined a

"section" of the following infinite product

F -+ l `mk(r[m]). mEN

We may rewrite at least formally the definition of the form G,5 using multiplicative notations. Let f (0, 0) = 0 and 1(Z) = q(T, z)exp (27ri tw). Then Em2-kT

(11) Gm(Z)= I k

(1-T_(p)p2-k+T

(m) _q' Ik 11

m=1

(p,p)p3-2k)-1

P

where the p-factor in the infinite product is the j--image the Hecke polynomial for SL2(Z) (see Lemma 1).

j_(QPL(p2-k)) of

It is interesting, that we can rewrite the classical theta-function in the same terms. To this end let us define the Hecke rings l(SL2) = ,1-l(SL2(Z), SL2(Q)) and 7i(Fo) = 1-l((Fo, I'o(Q)) of the special linear group

and its parabolic subgroup r0 = {

(

1 si)' b E Z }. Like in the

case of Sp4(Z) (see (6)) we can define an embedding'1-l(SL2) -* R(IO)We may continue the comparison with (8) and define an embedding of the multiplicative semigroup N-1 or, more generaly, the polynomial ring Q[x-1] into l(r'0). (Q[x-1] is isomorphic to the Hecke ring?({ 1}, N-1) of the trivial group, consisting only of the unity!) By definition n-1

7-

--

[n-1]

ro

0

no 1

ro = ro

n

I E l(ro) .

0

We can interpret Z-periodic functions of the complex variable r as automorphic functions with respect to the parabolic subgroup r0 C SL2(Z) (compare with the definition of the Jacobi forms). If we take the representation of the Hecke ring 9-l(F0) on the space of Z-periodic functions (automorphic with respect to F o) we obtain, for instance, that exp (27ri r) I [n-1]

ARITHMETICAL LIFTING AND ITS APPLICA77ONS

113

= exp (27ri n2T). As a consequence, we can represent the classical thetafunction as a sum over a semigroup of the Hecke operators {[n-1], n E N} instead of as a sum over the lattice Z. Namely, 0(T) =

exp (27ri n2T) = 1 + 2 nEZ

exp (27ri T) I [n-1], [n 1]E7{({1}, N-1)

or using some formal notation 0(T) = 1 + 2 exp (27ri r) I fl(1 -

[p-1])-1

= 1 + 2 exp (27ri r)I j_(((1)).

P

From this point of view the lifting (11) is a generalization of the last formal identity.

Proof of Theorem 3 : the function G4,(Z) is the sum of Jacobi forms of indices mt for m > 0 (the Eisenstein series is a Jacobi form of index 0). Thus G4, is invariant with respect to the action of the subgroup I'". and, moreover, with respect to I [t] = F (Q) f r[t]. Let us calculate the Fourier expansion of Go :

1

Go (Z) =f(0,0)Ek(7)+

mk-1

rn> 1 ad=m

f (n, 1) exp (27ri (n

dk

b mod d

4tn>l2

aTd b

+ laz + tmw))

(f(o, 0)0'k-1 (m) exp (27ri mtw)

= f (0, 0)Ek(T) + m >1

ak-1 E f(dnl,l)exp(27ri(niaT+alz+adtw)) I

+ a d =m

/

4tdn1>l 2

"1; 0

= P0,0 ) (`- B2k + 1: Qk_ 1( m)(ex P( 27ri mT) + ex P(7ri mtw k

+

m>1

E

ak-1

f (a2 , 1) exp (21ri (n7- + lz + mtw)).

4tmn>l2 al(n,l,m)

m>l,n>1

This expansion shows us that GO (T, z, w) is invariant with respect to exchanging of the variables (T - tw, w -k t-1T). The element

Wt=I tot

Ut

),

where Ut=I

00

114

V. GRITSENKO

realizes this transformation. Hence Golk Wt = (-1)k Go.

(12)

Moreover we have Gm I k Jt = Gk, where Jt is the element from the definition of the parasymplectic group (see § 1), since

where I =

WtIWtI = Jt,

0

0

1

0

O1

0

0

0

0

0

0

1

E 1700.

It is easy to see that the element Jt and the group r,,,, [t] generate the paramodular group r[t]. The theorem is proved. From the definition of the function Go follows the following COROLLARY. - The lifting

J:

k,t - `mk(r[t]),

J(O):=Gk

is injective and satisfies the following commutative relation

J(O)IkT_(m) = J(OIkT (m)) We would like to compare the lifting of Theorem 3 with the analytical theta-lifting connected with dual reductive pairs (see, for example, [Ku]). The space of Jacobi forms fitk,,, is isomorphic to a subspace of modular forms of a half-integral weight with respect to an appropriate congruence subgroup of SL2 (Z) (see [EZI). There is an isogeny between the paramodular group F[t] and a special orthogonal group of type SO(2, 3) (see (G41). Let

us consider the theta-lifting for the pair (SL2, SO(2, 3)), i.e., the integral operator with a theta-function of an even quadratic form of signature (2, 3) as a kernel (see (01], [RS], [Ko]). It will give us a map from the modular forms of a half-integral weight into the space of modular forms with respect to a congruence subgroup of r[t]. We would get the full paramodular group r[t] only for an unimodular even quadratic form of signature (2, 3), which does not exist ! The next defect of the theta-lifting is non-existence of the theta-integral

for modular forms of small weights. We shall see below, that in order to prove Theorem 1 about the moduli spaces we need modular forms of weight 3. Moreover, it is not easy to construct the theta-lifting of Eisenstein series, but in context of the arithmetical lifting we can take not only the Eisenstein

115

ARITHMETICAL LIFTING AND ITS APPUCA77ONS

series, but we can also lift a constant function that gives us so-called singular modular forms (see [G4]).

To finish this short discussion we would like to add, that Theorem 3 is a particular example of a general lifting from the space of Jacobi forms defined on 1H11 x C' (see [G31 and [G41).

Now we shall prove Theorem 1.

Basis to the geometric theory of automorphic forms is the fact that automorphic forms of special weights correspond to sections of canonical line bundles on algebraic varieties. Let F E 03(r[t]) be a modular form of weight 3. The holomorphic differential form on the Siegel upper-half plane WF = F(Z) A dZ = F(T, z, w)d-r A dz A dw is F[t]-invariant and defines an element of the zeroth cohomology group H°(At, Q3 (At)), where Q3 (At) is the sheaf of canonical differential forms on At. The complex variety At is not compact and has a lot of singularities. Due to Freitag we have the following simple criterion about continuation of canonical differential froms on a singular variety to its non-singular model. LEMMA (Freitag). - The elementw c- H° (At, SZ3 (At)) could be extended to

a canonical dferentialform on a non-singular model it of a compactification of the variety At if and only if the differential form w is square integrable. See (F], Hilfsatz 3.2.1.

It is known, that WF is square-integrable for the cusp modular form F. Thus we have the following identity for the geometrical genus of the variety At

p9(At) = h3'°(At) = dims 63(rab[t]) (see §1). If F E 9JTk(F[t]), then FIk J1 E'!Jtk(rab[t]), since J1gJ1 = tg-1 for any g c r[t]. Theorem 3 gives us examples of modular forms with respect to r[t]. It is easy to show that the Satake compactification of F[p] \ 1112 has two one-dimensional components, which are isomorphic to SL2(7L) \ H1

(see [HKW]). Thus the restrictions of the lifting Go (0 E 93Z" P) to the boundary of r[p] \H2 are modular forms of weight k with respect to SL2(Z).

They are identically equal to 0 for odd k and we automatically get cusp forms. Consequently, using the lifting of Theorem 3 we have the following estimation for the geometrical genus of the moduli variety

2]

m-

p9 (At) ? dimc with

9Jt3,n

{2j+2}12- [

=

2

{2j + 10}12 = lI k j 12

-

1

if k if k

0(m),

2 mod 12,

2mod 12.

116

V.. GRITSENKO

The formula for the dimension of the space of Jacobi forms has been obtained in [EZ) (see also [SZ)). For a prime number p > 11 we have pg (AP) > 0, that proves Theorem 3. Corollary from Theorem 3 gives us even more. THEOREM 3bs. - The variety it is not unirational if t has a prime divisor greater than 11. Proof : let us take a Jacobi form 0 of weight 3 and index p. In accordance with (9) 0 Ik T_ (m) E fii3 p,,,, and the lifting J(0I k T_ (m)) = J(q) I k T_ (m) is a cusp form of weight 3 with respect to F[pm].

It is possible to prove, that the variety At could be unirational only for finite numbers of t. The maximal such t is 36. This subject will be developed in more detail in the separate publication [G6) (see also [G41).

4. -Analytical continuation of ZF (S) In this section we shall construct an analytical continuation of Lfunction ZF(s) using a variant of Rankin-Selberg convolution of two Siegel

modular forms, proposed in [KS]. The first function in this convolution will be the eigenfunction F(Z) and the second one will be a lifting of some Fourier-Jacobi coefficient ft. The lifting J(ft) has been defined (see (11)) as action of the "infinite product" on the Jacobi form ft, hence it is not a surprise that the Rankin-Selberg convolution of this form with an eigenfunction has an Euler product. In order to get the functional equation of ZF (s) we consider in §5 more "advanced" variant of this integral, in which we take a convolution with respect to the parabolic subgroup of the maximal normal extension of the paramodular group F[t]. LEMMA 2. - Let F (Z) be a cusp form which is an eigenfunction of all Hecke

operators and t be the same as in Corollary I of Lemma 1. Let Gt = J (ft) be the lifting of the Fourier-Jacobi coefficients ft of F. For Re s > 3 the following identity holds Irk-2t-(s+k-2)

< ft, ft > ZF(s + k - 2) = L*(2s, Xt) f

F(Z)G(Z)Eo(Z, s)I YIk-3dXdY, 00(t)\H12

where

< ft, ft >=

f

xC

ft (r, z)ft(T, z)v-3exp(-47rt y2 v-1)dudvdxdy k

117

ARITHMETICAL LIFTING AND ITS APPLICATIONS

is the scalar product of Jacobi forms, Eot (Z, s) is the Eisenstein series of the congruence subgroup roo (t) = Sp2 (Z) n r [t]

I Y(ry < Z >)Isv(ry < Z >)-s, (Z=X+iY=

Eot(Z, s)=

(ZT

Z

7E r, \ro0 (t)

T = u+iv, z = x+ iy, I YJ = detY and L*(2s, Xt) = 7r-8r(s)L(2s, Xt) (see Corollary 2).

Proof : the integral on the right hand side is the Rankin-Selberg convolution of two modular forms on roo(t). As usual in this method, we may pass to the integral over a fundamental domain of the parabolic subgroup r"' of roo (t)

r,,,\IH[2={0y2v-1}, whereT=u+iv, z=x+iy,W=ul+ivi. After taking the integral over this domain one obtains

f

=(47rt)-(s+k-2)r(s+k-2)j:
m2-kftIkT (m) >m (s+k-2)

,n>1

The operators T_ (m) and T+(m) are connected by the duality * (see §1), thus using the standard Hermitian consideration it is not difficult to prove

that < ftm, m2-kftlkT-(m) >_ < ft.IkT+(m), ft > (see [F], Chapter N, and [KS]). We can finish the proof using Corollary 2.

Proof of analytic continuation of ZF(s)

:

the Eisenstein series

Eot (Z, s) is reduced to a sum of so-called Epstein zeta-functions (see [K], [Kr]). Let us introduce the following positive definite quadratic form corresponding to the variable Z = X + iY E 1H[2

Pz

)

(Y0

=

Y-1

)]

[\ X E

(M[N] = tNMN).

Then

Pry=Pz[try] andY(ry )-1=16z[I tD I] for-y

=(A D )ESP4(R).

The quotient v(7 < Z >)/I Y(ry < Z >)I is equal to the (2,2)-entry of the matrix Y(ry < Z >) -1 and one can rewrite the series L(2s, Xt)Eot(Z, s) as a sum of Epstein zeta-functions of the quadratic form Pz : L(2s, Xt)Eot(Z, s) =

Pz[N]-s = 1` t-'%(s, g, 0, Pz),

N=t(nl,'2,n3,n4)EZ4 nl,n2,n3=Omodt, (n4,t)=1

9=(0,0,0,94)

g4Et-1Z\Z (t9q,t)=1

V.. GR175ENKO

118

where for g, h E ]R4 (see [Ep], [T))

exp(27ritNh)Pz[N+gj-s.

g,h,Pz)= NEZ4 N+gj4O

It is known (see (Ep)) that the function (* (s, g, h, Pz) = 7r`F(s)((s, g, h, Pz) has the meromorphic continuation to the s-plane, satisfies the functional equation (13)

(*(s,g,h,Pz) =exp(-27ri )(*(2-s,h,-g,Pz1)

where < g, h >= g1h1 + . - + g4h4, and has simple pole with residue 1 at -

s = 2, if h is integral, and simple pole with residue -1 at s = 0, if g is integral. As a corollary of the integral representation of Lemma 2, we have the meromorphic continuation of the function ZF(s) of Theorem 2. Moreover, if t > 1 then the vectors g are not integral and ZF(s) could have a pole only at s = k.

If t = 1 (that could be possible only for even weight k), then the Eisenstein series 7r-sF(s)((2s)Eol(Z,s) is equal to the Epstein zetafunction c* (s, 0, 0, Pz), satisfies the functional equation (13) and has two poles at s = 0, 2 with residues :1 respectively. It gives us the functional equation of Theorem 2 in the case t = 1. Moreover the residue Ress-_kLF.(s) is proportional to the scalar product < F, G1 > of the Siegel modular form F and the form G1 = J(fl) containing

in the Maass subspace, which is invariant with respect to the action of Hecke operators. This scalar product is zero if F orthogonal to this subspace, thus ZF(s) is entire for such F. Otherwise F = G1 and the residue of ZF(s) at s = k is equal to 7r2-k< F, F >/< fl, fi >. We consider the case t > 1 below.

5. -Functional equation of the Spin L-functions for Siegel modular forms with the first Fourier-Jacobi coefficient fl - 0 The integral representation of the Spin L-function obtained above gives

us its meromorphic continuation, but the Eisenstein series, which is the kernel of the integral, has no good functional equation. As was shown in the proof of Theorem 3, the lifting J(ft) is "nearly" invariant with respect to a normal extension F [t] of the paramodular group r[t] generated by the group I'[t] and the element Wt (see (12)). To get the functional equation for ZF(s) one has to construct the second variant of the Rankin-Selberg convolution for this new group, since in that case the Eisenstein series satisfies a good functional equation. Without loss of generality we may restrict ourselves to the case of a prime number t.

119

ARITHMETICAL LIFTING AND ITS APPLICATIONS

LEMMA 3. - Let F be a cusp form which is an eigenfunction of all Hecke

operators. Let us assume that its first Fourier-Jacobi coefficients f, (T, z)

vanishes. Then there exists a prime number p such that fP (7-, z) is not identically equal to zero.

Proof : let us consider the Fourier expansion F(Z) _ E a(N) N E'B

exp (27ri tr(NZ)), where the sum is taken over the set SB2 of all positive

definite semi-integral symmetric matrices N = (l72 1/2) . The Fourier coefficient a(N) depends only on the class of the quadratic form N : a(N) _ a(tXNX) for any X E SL2(Z). If there is a primitive N ((m,1, n) = 1) such that a(N) # 0, then we may take any prime p represented by the quadratic form N. For such prime numbers fp(T, z) 0 0. Let us suppose that a(N) = 0 for all primitive matrices N. Consider the Fourier-Jacobi expansion of F

F(T, z, w) = L fm (T, z)exp (2iri mw). nx>r> 1

The form F is an eigenfunction, therefore we have the following relation between the Fourier-Jacobi coefficients of F and FlkT(e) for any divisor e of the index r (ed = r) :

fd{FjkT(e)}(T,z) = (fr{F} 1kT+(e))(T,z) = AF(T(e))fd{F}(r,z) - 0,

where T(e) is the Sp4(Z)-Hecke operator with index e, )'F(T(e)) is its eigenvalue and T+(e) is the j+-image of the SL2(Z)-Hecke operator (see (7), (9), (10) and the proof of Lemma 1). In the Fourier expansion of the Jacobi form fr a(N)exp (27ri (tr(NZ))),

f, (r, z)exp (27ri rw) = N=

*

r

E'B2

0, where there are no a(N) with a primitive N. If a(l e1/2 eed2)) ed = r, e > 1 and (m,1, d) = 1, then it is easy to see, that the function (frlkT+(e))(T,z), that is identically equal to 0 in accordance with the previous considerations, has at least one non-zero Fourier coefficient? ! The lemma is proved. In the next lemma we consider two "trace" operators for Siegel modular forms on Sp4 (Z) which send them to modular forms on the paramodular group and on the group I * [t] respectively.

120

V. GRITSENKO

LEMMA 4. - Let

F(Z) = 1: f. (-r, z)exp (2iri mw) E Mk (SP4 (Z)), m> 1

then the functions

Fp = FIkVp + Flk Jp

Fp =Fr+FPIkWP,

and

(see (1), (12) and Lemma 1) are modular forms of weight k with respect to the paramodular group T[p] and its (maximal) normal extension r- [pi = r[p] U F[p]W9,

respectively. Moreover the function F; (Z) has the Fourier Jacobi expansion FP (Z) =

z)exp (2iri pmw)

where =Pfpm+p-(2k_6) fr

fpm

+p-(k-3)fmlkT_(p)

IkA-(p)

Proof : the first part of the lemma is evident. To calculate the FourierJacobi expansion we can rewrite the trace operator as follows

Fp = 1: FIko(x)+FlktJlJp xEp-17/Z 0

+xEp E 'Z/Z FI k

0 0

1

0

0

0

0(x)Wp + FIk

0

0

Ol

0

1

0

0 1

0

0

0

-1

01

0

0

0

1

0

JPWP.

The first sum gives us only coefficients with indices divisible byp, the second sum is equivalent to the action of the operators A_ (p)AP 1 and the last two summands coincide with the action of the Hecke operator (0,/P-)-1T_(p). LEMMA 5. - Let p be prime. Then the Eisenstein series

EP(Z, s) = 7r-'F(s)(1 +p s)((2s)

>2

ryEro

I Y(1 < Z >)I sv(ry < Z >)-s, [p1\r* [p]

ARrITIMETICAL LIFTING AND ITS APPLICA77ONS

121

has a meromorphic continuation to the whole s-plane and is invariant with respect to the transformation s -> 2 - s. Proof : as in the proof of Lemma 2, Et (Z, s) can be represented as a sum of Epstein zeta-functions. The last rows of representatives of r,, [p] \ r* [p]

form the set of all Z[p-1]-primitive (primitive outside p) vectors of the following types : (pa, pb, pc, d) with (p, d) = 1;

p(a, b, c, d) with (b, p) = 1; f (a, pb, c, d) without a - c - 0 mod p.

Therefore there is a representation of Ep* as a sum of Epstein zeta-functions. A simple computation shows that

EE(Z,s)=i 'r(s)p24(

Pz[t(pa,pb,pc,d)]-s+p-$Pz[t(a,pb,c,d)])

(a,b,c,d)EZ4\{0} C(s,2,0,Pz)+p_ 1 E

g=(0,92,0,0) 92 mod p

p

h=(0 0 O h4) h4 mod p

(*(s,0,

h p

,Pz)

Using the functional equation(13) and the identity PZ 1 = Pz [J1] we get the functional equation for E*.

In accordance with Lemma 2 and with the identity (12), the product

(Fp(Z) + (-1)kFpIk Wp(Z)) Gp(Z) (det y)k

(where Gp = J(fp)) is invariant with respect to the action of r* [p] and we can construct an integral analogue to the integral of Lemma 2 for the group

r*[p] LEMMA 6. - Let F(Z) be a cusp form of weight k on Sp4 (Z). Let F be an eigenfunction of all Hecke operators with the first FourierJacobi coefficient f 1(T, z) = 0 and let p be a prime number for which f p (rr, z) 0 identically. For Res > k + 1 the following identity holds p3-2k7rk-2 (p2 -k+1 + (_1)kp 2 )) < fp, fp > ZF(s) _

(FF(Z) + (-1)kFpIkWp (Z)) Gp(Z) E, (Z, s - k + 2)1 yak-3dXdy, r' [p] \H2

where < fp, fp ># 0 is the scalar square and Gp = J(fp) is the Ufting of the Jacobi form fp.

122

V. GRITSENKO

We note that the functional equation for L-function ZF(s) of Theorem 2 follows from the above representation and the functional equation for the Eisenstein series EP (Z, s) from Lemma 5. Proof of Lemma 6: applying the same unfolding arguments with EE as in Lemma 2 we find that the integral equals the following Dirichlet series

+2r(s)r(s - k + 2)(1 - P -(s-k+2))-1 L(2s - 2k + 4, Xp)

C

fmpIkT+(m), fr >

m>1

where

MS

are the Fourier-Jacobi coefficients of the function Fr+(-1)kFpl k

W. These coefficients contain three summands, as has been shown in Lemma 5. Thus we have

1: m>1 (14)

E

fmplkT+(m) = ms

[pfPr+p (2k-6)fTIkA-(p)+p (k-3)fTIkT-(p)] IkT+(m) MS

m> 1

which is reduced to three Dirichlet series. The first series one can calculate using Corollary 2. For the third sum one gets

(15) L(2s-2k+4, Xp) E fmI kT (p)T+(m) =p-s(fpI kT_(p)T+(p)) ZF(s) m> 1

To prove this we may use the identity (16)

)pfmp = f m p l l k T (p) = f m I k 7- (p) + f m p 2 I k T+ (p),

where AP is the F-eigenvalue of the Hecke operator T(p). Thus the series (15) is equal to L(2s - 2k + 4, Xp)

(Apfmp - fmp2 Ik T+(p)) I kT+(m) m-s. m> 1

The operators T+(p) and T+(m) commute, thus using Corollaries 1 and 2 of Lemma 1 we see that the last sum is equal to

(Af - (ff2 - fplkT-(p)ps)IkT+(p))ZF(s)

123

ARrIHMETICAL LIFTING AND ITS APPLICATIONS

Applying (16) again for mp = p2 and taking into account the assumption that f, - 0 we get (15). The calculation of the second sum in (14) can be done as follows. The standard identity between elements in the SL2 (Z)-Hecke ring T(p)T(M) P_ T (m) + pT (p, p)T (P) gives us after the "plus" j+-embedding

T+(p)T+(P) = T+ (m) + pA+(p)T+(). In Lemma 1 we have seen that A_(p)T+(p) = p20PT_(p) and A_(p)A+(p) = p40P. Therefore fmlk (A-(p)L 'T+(mp))

(mp)-3

m> 1

=p2-s E fmlkT-(P)T+(in)m-s _p5-2sY fvlIkT+(l)zP1-s m> 1

1>1

= (p2-sfpl k T-(p)T+(p) -

p2k-1-23)ZF(s)L(2s - 2k + 4, Xn)-1'

where we have used (15) and Corollary 2. In the second and third sums there is a summand of type f,IkT_(p)T+(p), which we shall calculate in the next lemma. LEMMA 7. - Let fP(T, z) be a Jacobi form of index p and weight k such that fP I k T+ (p) - 0. Then

fplkT-(p)T+(p) =p2k-6(p3 - (_l)kp2)f

.

Proof : one can prove the lemma almost entirely "inside" the formal Hecke ring 7-l(I ,,.). As in the proof of Lemma 1

T-(p)T+(p)=(pTj(p)+p3+p2)OP, where Tj(p)=I diag(p-1,1,p,1)I',. We note that the operator IkTj(p) coincides up to a constant with the Hecke-Jacobi operator Tj(p) defined in §4 of [EZ]. On the other side it is easy to check that (17)

T'+(p)T-(p) =

(T.I(p)VP

where

_ P

x,y,rEP-'Z/Z

I

+ pVP + 'P)AP, 0

0

y

1

y

r

0

0

1

x

0

0

0

1

1

-x

V GRITSENKO

124

Let us take the standard expansion of fP(T, z) with respect to the basic Jacobi functions 2

Op,,, (-r, z) = E exp (27rip(1 + 2

2p)

IEZ

T + 27ri(2pl +,u)z

(see IEZ], §5). If

ff(T, z) =

0li(7)BP,µ(T, z), µ mod 2p

then after obvious calculations with Gauss sums one gets ff(T, z)I k ..y = p2

E

Oµ(T)eP,-µ(T, z)

Ii mod 2p

(this is the only place in the proof of the lemma in which we have to use Jacobi forms themselves). The invariance of fP(T, z) with respect to the mi-

nus identity matrix -E4 is equivalent to the identity µ = (-1)_,(T, z), thus ffIk P = (-1)kp2 fP. Moreover, from (17) we get fP I k T(p) =

2p 2 fP,

0

if k is even, if k is odd,

since by our assumption fpl k T+ (p) = 0. This prove the lemma.

To finish the prove of Lemma 6 and to get the functional equation of Theorem 2 one has to collect the three sums in (14) together and to take into account the result of Lemma 7. We have proved in §4, that the function ZF(s) could have only one pole for t > 1. Together with the functional equation stated above it gives us that the Spin L-function is entire function on the whole s-plane if the first Fourier-Jacobi coefficients of the cusp form F vanishes. Thus Theorem 2 is proved completely.

Manuscrit recu le 26 octobre 1993

ARI7TIMETICAL LIFTING AND 175 APPLICA77ONS

125

REFERENCES

[A] A. N. ANDRIANOV. - Eulerproducts corresponding to Siegel modularforms of

genus 2, Russian Math. Survey 29 (1974), 45-116. [EZ] M. EICHLER, D. ZAGIER. - The theory of Jaccobi forms, Progress in Math. 55,

Birkhauser, Boston, Basel, Stuttgart, 1985. [Ep] P. EPSTEIN. - Zur Theorie allgemeiner Zetafu ctionen Math. Ann. 56, (1903), 614-644. [Ev] S. A. EvDOKIMOV. - A characterization of the Maass space of Siegel cusp forms of degree 2, Matem. Sbornik 112 (1980), 133-142 (Russian) ; English

transl. in Math. USSR Sbornik 40 (1981), 125-133. [F] E. FREITAG. - Siegelsche Modulfunktionen, Grundlehren der math. Wissensch., 254, Springer, Berlin, Heidelberg, New York, 1983. [G 1 ] V. A. GRITSENKO. - The action of modular operators on the Fourier-Jacobi co-

efficients of modular forms, Matem. Sbornik 119 1982, 248-277 (Russian) ; English transl. in Math. USSR Sbornik 47 (1984), 237-268. [G2] V. A. GRITSENKO. - Jacobi functions and Euler products for Hermitian modularforms, Zap. Nauk. Sem. LOMI 183 (1990), 77-123 (Russian) ; English transl. in J. Soviet Math. 62 (1992), 2883-2914. [G3] V. A. GRITSENKO. - Jacobi functions of n-variables,Zap. Nauk. Sem. LOMI 168 (1988), 32-45 (Russian); English transl. in J. Soviet Math. 53 (1991), 243-252. [G4] V. A. GRITSENKO. - Modular forms and moduli spaces of abelian and K3 surfaces, Mathematica Gottingensis Schrift. des SFB "Geometrie und Analysis", Helt 26, 1993, p. 32; appears in St.Petersburg Math. Jour. 5 (1994). [G5] V. A. GRITSENKO. - Induction in the theory of zeta-functions, Preprint 91097 University Bielefeld, 1991 p. 76; appears in St.Petersburg Math. Jour. 5 (1994). [G6] V. A. GRITSENKO. - Moduli spaces of abelian surfaces (in preparation). IKW] K. HULEK, C. KAHN, S. H. WEINTRAUB. - Theta functions and compactification

of moduli spaces of polarized abelian surfaces, 1993.

[I] J. IGVSA. - Theta function, Grundlehren der math. Wissensch., 254, Springer Verlag, Berlin, Heidelberg, New York, 1972.

126

V. GRITSENKO

[K] W. KOHNEN. - On character twists of certain Dirichlet series, Mem. of the Fac.

of Science Kyushu University, series A, Mathematics 47 (1993), 103-119. [KS] W. KOHNEN, N.-P. SKORUPPA. - A certain Dirichlet series attached to Siegel

modular forms of degree two, Invent. Math. 95 (1989), 449-476. [Ko] H. Ko,nvA. - On construction ofSiegel modularforms of degree two, J. Math. Soc. Japan 34 (1982), 393-411. [Kr] A. KRIEG. - A Dirichlet series for modular forms of degree n, Acta Arith. 59 (1991), 243-259. [Ku] S. KUDLA. - Seesaw dual reductive pairs, Automorphic forms of several variables, Progress in Math. 46, Birkhauser, Boston, Basel, Stuttgart, 1983, 244-268. [L] R.P. LANGLANDS. - Euler products, Yale Univ. Press, 1971.

[01] T. ODA. - On modular forms associated with indefinite quadratic forms of signature (2, n - 2), Math. Ann. 231 (1977), 97-144. [02] T. ODA. - On the poles of Andrianov L-functions, Math. Ann. 256 (1981), 323-340. [P-S] I. I. PYATETSKII-SHAPIRO. - Automorphic functions and the geometry of clas-

sical domains, Gordon and Breach, New York, 1969. [RS] S. RALLIS, G. SCHIFFMANN. - On a relation between SL2 cusp forms and cusp

forms on tube domain associated to orthogonal groups, Trans. Amer. Math. Soc 263 (1981), 1-58. ISZI N-P. SKORUPPA, D. ZAGIER. - A trace form for Jacobi forms, J. reine and angew. Math. 393 (1989), 168-198. [T] A. TERRAS. - Harmonic Analysis on Symmetric Spaces ans Applications, I, Springer Verlag, Berlin, Heidelberg, New York, 1983.

Valeri GRITSENKO

Department Steklov Mathematical Institute St. Petersburg FONTANKA 27

191011 ST. PETERSBURG RUSSIA

Number Theory Paris 1992-93

Towards an arithmetical analysis of the continuum Glyn Harman

1. - Introduction Since the set of real numbers is uncountable, almost all (in a variety of senses) real numbers are effectively indescribable. Our curiousity forces us, however, to attempt to describe the irrationals in terms of their relation to the "known" set of rationals. An elementary theorem given by Dirichlet (1842) provides the simplest such relation. For every real a, and any given N > 1, there exist coprime integers m, n

such that (1)

a

n <- n(N1+

1)

with 1 < n < N.

This result is best possible, even for almost all a (in the sense of Lebesgue measure), although if we are only interested in what is true for infinitely many N then better results are possible. Hurwitz (1891) : For every irrational a there are infinitely many fractions m/n in lowest terms such that

a--MIn < 52n2 1

(2)

Khintchine (1926) : Let f (n) be a positive function defined on the integers which decreases to zero monotonically with increasing n, and such that (3)

diverges. Then, for almost all a there are infinitely many solutions to (4)

a-

m < f () n n2

,

with (m, n) = 1.

128

G. HARMAN

These results are best possible since 57' cannot be replaced by a larger number in (2), while if (3) converges then there are only finitely many solutions to (4) for almost all a. Of course, each irrational a can be expressed

as a non-terminating continued fraction, and the approximations in (2) and (4) (as soon as f (n) < will come from convergents to the continued 2) fraction. These results naturally lead to a more general question : what types of fractions are near irrationals? Here, by "types", we mean fractions whose numerator and/or denominator are restricted in various ways, for example to prime values. By "near" we mean we would like to get as close as possible to

m
I a--n but if we obtained (5)

a- mn

-1-e

then the size of 0 could be regarded as a measure of our "success". We will henceforth consider the question from two different perspectives corresponding to (2) and (4) above : what is true for every irrational? or : what is true for almost all irrationals? One could ask the question for different subsets of the irrationals, for example algebraic numbers, but we shall not pursue such topics here. Neither shall we deal with inhomogeneous approximation. The answers we obtain form "une contribution a 1'analyse arithmetique du continu", which subject was begun in earnest in 1903 in Paris by E. Borel [4]. Before reviewing our current state of knowledge on these questions, it is amusing to observe that these, and related questions, are of interest in fields considerably removed from number theory. To make a clockwork model of

the solar system requires approximations m/n to a number a (usually the ratio of a planet's "year" to the earth's year) with m and n composed of numbers with small prime factors (to make the gears used feasible). The Dutch scientist Christiaan Huygens used continued fractions to make such a model in the 17th century, a project which we note was "partiellement execute a Paris" [22). For details of this problem see Chapter 4 of [26). In the theory of music, and in particular in the construction of musical instruments, it is often regarded as desirable to have n equally spaced notes per octave such that (6)

log(3/2) N ml log 2 n

and

log(5/4) N m2 log 2

n

where ml and m2 are integers. The first approximation in (6) is to make the interval of a fifth correct, the second gives a major third. It would be

TOWARDS AN ARITHMETICAL ANALYSIS OF THE CONTINUUM

129

useful to have other approximations with the same denominator also. The well-known value of 12 for n gives 27/12 = 1.4983... (very nearly 3/2), while 24/12 = 1.2599... (about 0.8% sharp). To improve on these approximations (and so possibly obtain sweeter music?) would require n = 53 (31/53, 17/53) or n = 118 (69/118, 38/118). The problem now becomes one of genetic engineering : breed people with lots of small fingers and specially developed brains to play the resulting instruments!

2. - Question One : restrict only one of numerator/denominator Without loss of generality we restrict the denominator only in the following. In 1941 Duffin and Schaeffer (8) generalized Khintchine's theorem (4) as follows.

Suppose that f (n) is a non-negative function such that 0(n)f(n) (7)

n n=1

f(n)

> c n=1

for all N and some positive constant c, and the right hand side of (7) tends to infinity with N. Then, for almost all real a there are infinitely many solutions to (8)

Ana - ml < f (n) with (m, n) = 1

(in (7) 0(n) denotes Euler's totient function).

As examples we may take f (n) = x(n)g(n) where x is the characteristic function of a set with number-theoretic interest and g(n) is suitably behaved. We thus obtain : (9)

spa - mI < l/ (2p) , (m, p) = 1, p a prime,

(10)

In2a - ml < 1/(n log n), (m, n) = 1,

(11)

110na - ml < 1/n, (m,10) = 1,

all have infinitely many solutions for almost all a. The first inequality (9) shows that almost all real numbers have infinitely many convergents in their continued fraction expansion having prime denominator. The final inequality (11) actually says something about the decimal expansion of

almost all a : for almost all a there are infinitely many n such that the n-th and [loglo n] following decimal places are all zero.

130

G. HARMAN

The major unsolved problem in this field is the Duffin-Schaeffer conjecture which states that in place of (7) we require only the divergence of the left hand side of (7). Although many cases of this conjecture have been settled (see [91, 1171, 1271 for example) it remains an open question whether

the conjecture is true in its full generality (although it is known to be true in higher dimensions [251, and is true if "almost all a" is replaced by "a set of dimension 1"). Settling this problem would be a major advance in the arithmetical analysis of the continuum. When we turn from what is almost always true to what is true for every irrational a we enter more difficult territory. Now (9), (10) and (11) are no longer always true. There are uncountably many a such that (12)

Ipa - MI >

loge 4000p log log p '

for all large primes p 1181. There are uncountably many a such that (13)

l10T'a-MI >1 if (m, 10) = 1.

where each aj is either 4 or 5, but the This is very easy. Take a = 0 a1 a2 decimal does not recur. The best approximations m/10n with (m, 10) = 1 must then have m ending in a 3 or a 7 which gives (13).

Inequalities like (9) were first investigated by Vinogradov [301 who showed that for every irrational a there are infinitely many solutions in primes p to (14)

l Ipa ll <

p-e+E

11 denotes distance to the nearest integer and 0 = 1/5. The value of 0 was improved to 1/4 by R.C. Vaughan [291 and subsequently improved

where 11

further to 3/10 by the present author [121 (the method presented there shows that (14) holds for a value of 0 between 3/10 and 1/3 : numerical calculation is required to arrive at the best value). The idea here is to approximate 11x11 by a Fourier series and then convert a sum over primes to

double sums, so that we require estimates for (15)

1: 11: a,,,, E bn e(amn$) £
.

n
(MN)0-1, am, bn << nE for any e > 0, and bn = 1 or logn if M is much smaller than N. The conversion to double sums may be done by Vaughan's identity or arise from a sieve method. Here L

TOWARDS AN ARITHMETICAL ANALYSIS OF THE CON77NUUM

131

Hardy and Littlewood were the first to investigate (10). They were only able to show that min Ilan211 -> 0 as N -> oo for all a [ 11]. Vinogradov gave

the first quantitative formulation of this result and this was improved to min an211 <

N-2+E

by Heilbronn 1201. This result was generalized to ank by Danicic [7], also improving Vinogradov's earlier work. Here estimates are required for the Weyl sums :

EI > e(anke)

(16)

£
The two problems above suggest the further problem of Ilapkll, for prime p. This requires information on double Weyl sums [2]. It would be very interesting to have a new means of attacking these problems. The current methods give not only one solution of the desired inequality, but the "correct number" with a smaller error (or lower bound with the expected order of magnitude). This sets a limit on how well the methods could be expected to work since we know, for example, that the sequence {an2} does not have small discrepancy [3].

3. - Question 2 : restrict both numerator and denominator, what happens for almost all a ? Let A and 13 be two sets of positive integers, and let p(n) denote the probability that n E B. That is, we suppose E 1 = E p(n)(1 + o(n))

(17)

,
n
where p(n) is continuous and non-increasing. For example we have : B :

p(n) =

prime numbers square-frees n a(mod q)

(log n)

n=r2+s2

K/(log n)2.

6/7r2

1/q

Now suppose that f (n) is a decreasing positive function and consider the sum (18)

p(n)f(n)nE.A

132

G. HARMAN

If this sum diverges it would be reasonable to suppose that there were infinitely many solutions to (19)

(na - mI
for almost all a > 0. On the other hand, if (18) converges then it is easy to show that there are only finitely many solutions to (19) for almost all a > 0. If 13 is just the set of positive integers (the question considered in the previous section) then it is known that a zero-one law operates (15] and (10]) : the inequality has infinitely many solutions either for almost all a or for a set with measure zero. In the current situation it is possible to produce counter-examples to show that this is no longer the case (15], although it is likely that such a law will operate in all cases of number-theoretic interest. As an example of the convergence of (18) we have the following result : For almost all a there are only a finite number of solutions to

1pa - qI < 1/p with p, q primes. It is possible to show that the exceptional set here has Hausdorff dimension one. To study the case where (18) diverges it is useful to impose the additional condition (compare (7))

ON

(20) nEE

n<x

n

>cI: l

for £=Aor13.

nE£

n<x

We then only count coprime solutions to (19). Some condition in addition to the divergence of (18) is necessary because if A and 13 both consist of numbers with many small prime factors then there will be a large number of solutions to m _ a with a E A and b E 13. n b

This means the fractions "fall on top of each other" rather than spread out evenly. The condition (20) is certainly satisfied for sets such as the primes, square-frees, integers in arithmetic progressions or sums of two squares. In these cases (and some others) the present author has shown that there are infinitely many solutions to (19) given the divergence of (18)

for almost all a > 0. It is possible to give explicit examples of numbers in the exceptional sets for these problems, even though no numbers are known in the set of almost all a. For example, although there are infinitely many solutions to Ima - nj < 1/(2n) with m, n both sums of two squares for almost all a > 0, the number 3 + 5-2 (= [3, 2, 4]) has no such

'IDWARDS ANARTT IMETICAL ANALYSIS OF THE CON77NUUM

133

approximation because each numerator in its continued fraction expansion is congruent to 3 (mod 4) (they are 3, 7/2, 31/9,131/38 . ). In like manner the number (45 - 10'1')/186 (= [0, 4, 2, 4, 9]) is an example for the set of measure zero for approximation by fractions with square-free denominator. The convergents to the continued fraction expansion of this number are 1/4,2/9,9/40,83/369,341/1516,3152/14013,... and the denominators are divisible alternately by 4 and 9. We now indicate how such results can be proved. The following lemma is fundamental to the start of the proof (see [161). The lemma itself is a consequence of Cauchy's inequality and the Lebesgue density theorem. LEMMA. - Let I be a sub-interval of R, and Dn a sequence of subsets of 1. For each open interval j C I write 13n = D,,, fl j and suppose that (21)

and (22)

limsupN

A(1n f18in))

A8n))2

> 6A (J) ,

m,n
n
where A() denotes Lebesgue measure, and 8 is a positive constant independent of J. Then almost all a E I belong to infinitely many D.

Now suppose we are dealing with (19) with A and 13 the set of primes. We then take

DP=In

U

(S-f(P)

S+f(P))

#y

p

p

e a prime

which leads us to require an upper bound for A(i3p fl Bq)

(22)

p,q
and a lower bound for (23)

E A(13p) p
It is easy to give a lower bound for (23). If I = (A, B) and J _ (a, b) then we obtain 1.f () (b - a) 2 p
p

G. HARMAN

134

for all large N. We therefore need to demonstrate that (22) is bounded by

K(b - a

(24)

r f(p)12 p
)ogp)

where K is independent of J. We note that for p < q we have A(Bp n 1q) < 2f (q)

r-qs'p

q

1.

0<jrp-sql <2qf (p)

Here r - q signifies r/q E J. The problem has now been reduced to giving an upper bound for the number of solutions to (25)

I rp - sql < A p, q, r, s all primes.

One can use the Brun-Titchmarsh theorem to tackle (25) for p and q in certain ranges. When this result is inadequate the author fixed p and applied the three dimensional sieve to give an upper bound for the number

of solutions to (25) in q, r and s. To deal with the remainder terms it is necessary to use exponential sums and the standard bound for the simplest incomplete Kloostermann sum. In this way (24) is established and the result proved.

4. - Question 3 : restrict both numerator and denominator, what happens for every irrational a? In general our answers to this question are very much worse than for the previous questions. The most successful theorem in this area is the following result given by Heath-Brown [ 191 : For every irrational a there are infinitely many fractions rn/n with square-

free numerator and denominator such that

Ian - ml < n-2/3+E

The proof uses a lattice argument for counting solutions to certain congruences. The problem of approximating any given irrational by a fraction whose numerator and denominator are both primes appears to be exceptionally difficult (like Goldbach's problem, another binary problem in primes). As yet no-one has shown that (26)

lap - ql < 1

TOWARDS AN ARITHMETICAL ANALYSIS OF THE CONTINUUM

135

is solvable in primes p and q. One can obtain p) on the right side of (26) where "almost all" intervals [x, x + xA) contain primes (so 1/12 + E is a possible value for A). This result is true for rational a as well though!

Just as progress has been made on Goldbach's problem by using almost-primes instead of primes so the same approach can be made here. The first such result was given by Vaughan in 1976 (28] :

Ipa-P4I
0). The present author improved this by replacing P4 with P3 [13] (also giving a less significant improvement on 6 to 1/300). This problem thus appears "harder" than Goldbach where a prime and a P2 suffices (6]. It has been shown by Iwaniec

[23] (see also [ 14]) that one can approximate irrationals in this way with fractions whose numerator and denominator are sums of two squares. We finish this article by giving an indication of how such results can be established. We replace a with a convergent a/q, with error 1/q2 and choose the size of our possible numerator and denominator in relation to q. To approximate with p/P3 we pick X = q8/5, and write

A={Qpa/q :pSX,Ilpalgll <X-6/2}. indicates nearest integer. We then want to show that A contains P3 numbers. To do this we need to consider how well A is distributed in arithmetic progressions. We write Here Q

Ad=#,{nEA:n-0(modd)}, Rd=Ad- al. We then wish to show that A IRdI = ° L1o X] d
(27)

gl

l

for as large a value of D as possible. Using a familiar argument (see Chapter 2 of [1]) we obtain

R-d«

X1

6-E

6 ll

d

eff [pq d] +d Y e
where L = L(d) = dXE+6 We can convert the sum over primes above into a double sum which leads us to seek estimates for mnfa e

n

ad

136

G. FIARMAN

Using the large sieve and other devices we obtain a suitable estimate when

D < X1/3. This establishes the result after employing Chen's role reversal technique [6). The reader can see from these results that there is much work to be done in this area to give a more satisfactory "contribution a 1'analyse arithmetique du continu". Additional note : since delivering the above talk (November 1992) there

has been further progress on two problems. The most important is the announcement by A. Zaharescu that the exponent 2 can be improved to 4/7 in Heilbronn's Theorem on IIan2 11, with a further improvement to 2/3 if one is only looking for infinitely many solutions. His proof uses character sums not exponential sums. As mentioned above, numerical calculation is required to obtain the best exponent p for I1apIl < p -P and Jia Chao-Hua Q. Number Theory 45 (1993), 241-253) has shown that one can take p = 4/13 (0.308...). The present author will show elsewhere how the method can be improved to yield 7/22 (0.318...).

The referee in his comments pertinently remarked that I should have mentioned in section 4 the important work of Margulis (Discrete subgroups and ergodic theory in Number Theory, trace formulas and discrete groups, Oslo 1987, pages 377-398, Academic Press, Boston 1989) whereby every irrational a has infinitely many approximations

a

712 + v2 < 162 + v2

for any e > 0.

I

Manuscrit recu le 2 septembre 1993

TOWARDS ANARI7HME77CAL ANALYSIS OF77IE CONTINUUM

137

References [11 R.C. BAKER. - Diophantine Inequalities, Clarendon Press, Oxford 1986. [2] R.C. BAKER and G. HARMAN. - On the distribution of app` modulo one,

Mathematika, 38 (1991), 170-184. [3] H. BEHNKE. - Uber die Verteilung von Irrationalitaten mod 1, Abh. Math. Sem. Hamburg, 1 (1922), 252-267. [4] E. BOREL. - Une contribution a I'analyse arithmetique du continu,

Journal de Mathematiques Pures et Appliquees, (5eme serie), 9 (1903), 329-375. [5] J.W.S. CASSELS. - Some metrical theorems in Diophantine approximation I, Proc. Cambridge Phil. Soc., 46 (1950), 209-218.

[6] J.-R. CHEN. - On the representation of a large even integer as the sum of a prime and the product of at most two primes, Sci. Sinica, 16 (1973), 157-176. [7] I. DANicic. - Contributions to Number Theory, Ph. D. Thesis, London 1957. [8] R.J. DUFFIN and A.C. SCHAEFFER. - Khintchine's problem in Metric Diophantine approximation, Duke Math. J., 8 (1941), 243-255. [9] P. ERDOs. - On the distribution of the convergents of almost all real numbers, J. Number Theory, 2 (1970), 425-441. [10] P.X. GALLAGHER. - Approximation by reduced fractions, J. Math. Soc.

of Japan, 13 (1961), 342-345. [11] G.H. HARDY and J.E. LrrrLEwoOD. - The fractional part of nJB, Acta

Math., 37 (1914), 155-191. [12] G. HARMAN. - On the distribution of ap modulo one, J. London Math. Soc., (2) 27 (1983), 9-18. [13] G. HARMAN. - Diophantine approximation with a prime and an almost

prime, J. London Math. Soc., (2) 29 (1984), 13-22. [14] G. HARMAN. - Diophantine approximation with almost primes and two

squares, Mathematika, 32 (1985), 301-310. [15] G. HARMAN. - Metric diophantine approximation with two restricted variables I, Math. Proc. Cambridge Phil. Soc., 103 (1988), 197-206. [16] G. HARMAN. - Metric diophantine approximation with two restricted variables III, J. Number Theory, 29 (1988), 364-375. [17] G. HARMAN. - Some cases of the Dufn and Schaeffer conjecture, Quart. J. Math. Oxford, (2) 41 (1990), 395-404. [18] G. HARMAN. - Numbers badly approximable by fractions with prime denominator, preprint Cardiff, 1993.

138

G. HARMAN

[19] D.R. HEATH-BROWN. - Diophantine approximation with square-free integers, Math. Zeit., 187 (1984), 335-344. [20] H. HEILBRONN. - On the distribution of the sequence n29 (mod 1), Quart. J. Math. Oxford, (1) 19 (1948), 249-256. [21] A. HuRwITZ. - Uber die angenaherte Darstellung der Irrationalzahlen durch rationaleBriiche, Math. Ann., 39 (1891), 279-284. [22] C. HUYGENS. - Projet de 1680-81, partiellement execute d Paris, d'un

planetaire tenant compte de la variation des vitesses des planetes dans leurs orbites supposees elliptiques ou circulaires, et consideration de diverses hypotheses sur cette variation, in CEuvres Completes de Christian Huygens, 21, 109-163, Martinus Nijhoff, La Haye (1944). [23] H. IwANIEC. - On indefinite quadratic forms in four variables, Acta Arithmetica, 33 (1977), 209-229. [24] A. KHINTCHINE. - Zdr metrischen Theorie der diophantischen Approximationen, Math. Zeit., 24 (1926), 706-714. [25] A.D. POLLINGTON and R.C. VAUGHAN. - The k-dimensional Duffin and

Schaeffer conjecture, Mathematika, 37 (1990), 190-200. [26] A.M. ROCKETF and P. SzUsz. - Continued Fractions, World Scientific, Singapore-New Jersey-London-Hong Kong 1992. [27] J.D. VAALER. - On the metric theory of Diophantine approximations,

Pacific J. Math., 76 (1978), 527-539. [28] R.C. VAUGHAN. - Diophantine approximation by prime numbers III, Proc. London Math. Soc., (3) 33 (1976), 177-192. [29] R.C. VAUGHAN. - On the distribution of ap modulo 1, Mathematika, 24 (1977), 135-141. [30] I.M. VINOGRADOV. - The method of trigonometric sums in the theory of numbers (translated from the Russian by K.F. Roth and A. Davenport), Wiley-Interscience, London 1954. Glyn HARMAN

School of Mathematics University of Wales College of Cardiff, 23 Senghenydd Road, P.O. Box 926, CARDIFF CF2 4YH

United Kingdom

Number Theory Paris 1992-93

On A-adic forms of half integral weight for SL(2)/Q Haruzo HIDA

1. - Let S be the two-fold metaplectic cover of S = SL(2)/z and fix a prime p > 5. In this short note", we want to describe a technique of lifting a family of complex automorphic representations of S(A) to a "Aadic automorphic" representation II of S(A(P°°)), where A is a one variable power series ring over an appropriate p-adically complete discrete valua-

tion ring, and A(P°°) is the adele ring A of Q the p and oo-components removed. Then we will have a A-adic version of a result of Waldspurger [Wa2]. We begin with the study of p-adic cusp forms of half integral weight

and prove in Section 3 that the classical cusp forms of weight k + . is dense in the space of p-adic cusp forms of half integral weight if k > 2 (Theorem 1). Then we study A-adic forms of half integral weight in Section 4 by combining the techniques of Wiles [Wi] (introduced for integral weights) and the representation theoretic technique of Waldspurger [Wal,2]. Taking

the limit shrinking the congruence subgroup, we get the desired A-adic representation of S(A(P°°)) (Proposition 1). Then we prove the weak multiplicity one theorem for p-ordinary A-adic automorphic representations (Theorem 2 in Section 4). Although our construction is just the combination of these two existing techniques, we get a fairly strong result on p-adic standard L-functions of G = GL(2)/Q. That is, a certain ratio of the restriction of 2-variable p-adic standard L-functions [K] to the line interpolating The author is partially supported by an NSF grant. The final touch to the paper was given while the author was visiting the Isaac Newton Institute for Mathematical

Sciences, Cambridge, England. The author acknowledges the support from the Institute for the month of April in 1993. Some part of the work presented in this note was actually done in 1988 in order to construct a p-adic standard L-functions for GL(2) restricted at the center critical line (Theorem 4). The construction of two variable p-adic standard L-functions was later done by K. Kitagawa [K] using a different method.

140

H. HIDA

the central critical values is shown to be square in the field of fractions of the Iwasawa algebra A (Theorems 3 and 4), which is the A-adic version of a result of Waldspurger ([Wa2] Corollary 2) we alluded to. A further scrutinizing of the representation we constructed might bring us a sharpening of this result giving a A-adic version of the result in M. However to make our presentation short, we will not touch this subject in the present account. Another interesting point which awaits further study is the behavior of the specialization 7rv,t=2 of irreducible factors it of 11 at weight 2. In [GS],

Greenberg and Stevens gave an interesting limit formula of the derivative

of the p-adic standard L-function at the center critical point, when the L-function has an exceptional zero at this point. This is the unique case where the specialized automorphic representation It t=2 of S(ZP°°)) (supplemented with the p-component) becomes super cuspidal at p although the integral image of 7rwt=2 under the Shimura correspondence is special and p-ordinary. Thus the study of the behavior of the other local components of Trwt=2 might cast some new insight upon the p-adic analog of the conjecture of Birch-Swinnerton Dyer formulated in [MTT]. Although I have only worked out here the result for SL(2) defined over Q, our idea works fine for SL(2) over general number fields. However, in the general case, the many variable standard p-adic L-functions defined on the spectrum of the p-adic Hecke algebra are not yet constructed.

2. - Let A be a congruence subgroup of level prime to p. When we consider modular forms of half integral weight, we assume that A is contained in ro(4). We write O1(pa) = A n r1(pa) and A(pa) = A1(p) n ro(pes). We use the same notation introduced in [H1] Sections 1 and 2 for classical modular forms. In particular, for each integer k and an algebra A, Pk+(1/2)(AI(pa); A)) stands for the space of A-integral cusp forms of half integral weight k + 2 with respect to Ai(pa), while for each integer SK,(AI (pa); A)) stands for the space of A-integral cusp forms of integral

weight ,c. Here the A-integrality is given by the q-expansion at the cusp oo. For each Dirichlet character X modulo Npa, Pk+(1/2)(ro(Npa); X; A) consists of cusp forms g in Pk+(1/2)(r1(Npa);A) with 9Ik+(1/2)a = X(d)9 for each or = I a d

)

E ro (N), where glk+(1/2)0'is the action of or defined in

[H 1J (2.2a) which is a little different from the normalization of [Sh 1] p. 447. Our normalization is :

9Ik+(1/2)0'(z) = g(a(z))j(a, z)-1J(a,

z)-k

for a = (c

b

d

J

ON A ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2)IQ

141

00

where J(o, z) = (cz + d) and j (a, z) = 0(a (z))/9(z) for 9(z) = >exp(27rin2z).

n--oo

By [HI] Theorem 2.2 or its proof, Pk+(1/2)(r1(Np°); A) is stable under this action of a E 1'o (Npa). We now give an interpretation in adelic language following [Wa2] III. We write S for the algebraic group SL(2)lz. We write S for the two fold metaplectic cover of SL(2) defined in [Wa2] II.4. Thus S(Qp), S(A) and S(R) have meaning, where A is the adele ring of Q. In other words, we have a non-splitting exact sequence of groups :

1-->{±1}--S(A)->S(A)--* 1, where A is either A, Qp or R. Now let us describe the 2-cocycle /3 giving the ), we put x (a) = d

extension S(Q,,) for a place v of Q. For each a = (a or c according as c = 0 or c # 0. We also put

sv(a) _

(c,d)v if cd

d

0, v is finite, vp(c) is odd, otherwise,

1

where (c, d)v is the Hilbert symbol at v (that is, Artin symbol (d, Qv) of d). Then we have

VC-(d'w)

= (c, d)v f for the

Qv(a> a') = (x(a) , x(aTv(-x(a)x(a'), x(aa'))vsv(a)sv(a')sv(ao') .

For a e S(Q), the product s(a) = IIvsv(av) is well defined. Similarly we may define /(a, a') = nv3(av, a',) for a, a' E S(A). Then we identify S(A) with S(A) x {±1} under the multiplication law given by (g, e)(h, e') = (gh, )0(g, h)ee'). By the product formula of the Hilbert symbol, ,Q(a, a') =

s(a)s(a')s(aa') for or, a' E S(Q). Thus a F--> (a, s(a)) gives a section : S(Q) -> S(A). We identify S(Q) with its image in S(A). We also identify the standard maximal compact subgroup S02(]R) with ][8/Z by 0 1--k cos 2x9 sin 27x9

sin 27r9 cos 27r9

Then the pull back image of S02(R) in S(R) can

be identified with ][8/2Z. We write the corresponding element r(9) in S(R) for an integer k and C, = {r(9) 10 E R/2Z}. Then r(9) - e((k + 2)9) and e(9) = exp(27ri9) is a character of C. Via (g, e) H g(i) E H, we have

S(R)/C,, = H for the upper half complex plane H. Let e : A/Q -> C be the standard additive character such that e(xo,,) = exp(27rix,,,,). We write ev for the restriction of e to Qv for each place v, and we define 'yv(t) to be the Weil's constant with respect to ev and the quadratic form tx2 on Qv [W] p. 161. We put, following [Wa2],'(t) = (t, t),,-y,, (t)-y (1)-1. Then we

142

H. HIDA

(t,t')v5'v(t)5'v(t'), 1 for arbitrary v, and ye(t) = 1 if t E Ze and s 54 2, ry52(1) = '2(5) = 1 and 5'2(3) = 5'2(7) = -i. Let Uo (N)

(a d)

E S(2) I c E Nz 5

(2 = IIC

and write Uo (N)Q for the f -component of Uo (N). Defining for or = ( a c

d)

E

S(Q2)

J 'Y2(d)(c,d)2s2(a) ry`2 (d)

if C 74 0,

if c = 0,

we can check that E extends to a character of Uo(4)2 x {±1} in S(Q2) non-

trivial on {+1}. Let x be a character of (Z/NZ)" with X(-1) = 1. For (a (u,E) E Uo(N) x {±1}, we define X(u) = X(d) if u = d). We then consider the space of functions f satisfying :

(m 1)

f(ax(u,E)r(B)) = e2(u2)EX(u)f(x)e((k + 2)6)

for a E S(Q), (u,E) E Uo(N) x {±1} and r(O) E C. We impose another condition at oo :

(m2) D f = (k(k 2- 2) ) f for k' = k+ 2 for the Casimir operator D at oo . We write Pk+(1/2) (N, X; C) for the space of functions satisfying (ml - 2)

which are cusp forms. Writing J(g, z) = cz + d for g = I a b ) E SL2 (R) and z E H, we can identify S(18) = { (g, t(g, z)) Ig E SL2 (R) , t(g, z) : holomorphic on H

with t(g, z)2 = J(g, z) }.

The product is then given by (g, t(g, z))(h, t(h, z)) = (gh, t(g, h(z))t(h, z)). We have a natural inclusion map S(l) , S(A) and S(Q) - S(A). We have

the theta series : 0(z) : E00 exp(27rin2z) defined on H. As is well known, n--oo

ON A-ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2)/Q

143

putting j(ry, z) = B(y(z))/O(z), j(y, z)2 = (dl) J(ry, z) if y E ro(4). Thus ry H (y, e2(y)j(y, z)) defines an inclusion of ro(\4) into S(R). It is known that the extension splits over U1 (4) = { I a b

i

E S(7G)

I

c E 42 and

a - d - 1 mod 47L}. Thus we have by the\strong g approximation theorem that S(A) = S(Q)U1(4)S(R). We can identify these two realizations by S(A)E) (g, t(g, z)) 1-- (g, t(g., z)J(g", z)-1/2) where the square root is taken so that -ir/2 < arg(cz + d)1/2 < 7r/2. For each cusp form f E Pk+(1/2)(N, X; Q, we define F : H - C by F(z) = f((g,1))J(g,i)k+(112) for z = g(i) (g E S(R)). Then as shown in [Wa2[ Proposition 3, f H F induces an isomorphism :

(2.1)

T'k+(1/2)(ro(N), X; C)

Pk+(1/2) (N, X, (C)

When f is cuspidal, the holomorphy of F follows from (ml - 2). Let us prove the above isomorphism. We have put F(z) = f ((g, 1))J(g, i)k+(1/2) for g E S(IR). Then

F(y(z)) = f (('Y.g,1))J(yg, i)k+(1/2) Suppose y c Fo(N). Then note that

(y.g, l) _ (ygyf 1,1) _ (y, S(-Y))(g'yf 1, 1)(1, S('Y-'),3(-Y,g'yf 1))

_ (y, S(y))(g, 1)(yf 1, 1)(1, S(y-1)0(g, yf 1)Q(y,gyf 1))

Since,3 is a 2-cocycle, /3(h, k)/3(g, hk) =,3(gh, k),3(g, h). This shows

(y, s(y))(g, 1)(yf

1

1)(l, S(y-1))3(7g,yf 1)0(y,g))

Thus :

F(y(g)) =f((y,s(y))(g,l)('Yf1,1)(1,s(_ 1))3(yg,yf1)/3(y,g)))J(yg,i)k+(1/2) =f((g, l)(yf 1, 1)(1,S(7-1)0(yg,yf 1)13(y,g)))J(yg, i)k+(1/2) =S(y-1)0(yg, yf 1)Q(y, g)E2(yf 1)X(y f 1) f ((g, 1))J(yg, i)k+(1/2)

144

H. HIDA

Since J(y9, i)1/2 = Q('Y, 9)J(y, z)1/2J(9, i)1/2 and

y

_

a

c

x(") = X(d)

if

b

d) ' we see

F(y(z)) = s(y-1)/3(79,yf 1)E2(yf 1)X(d)J(7,z)1+(1/2)f(9,

1)J(9,i)k+(1/2)

= s(y-1)0(79,yf 1)E2(yf 1)X(d)F(z)J(y,z)k+(1/2)

Thus we need to prove s(ry-1)0(ryg,yf 1)E2(yf 1) = (d)%y2(d). If c = 0, the

both sides are trivial. Thus we may assume that c # 0. The case c # 0 is treated in [Wa2] p. 388.

For any open subgroup U of Uo(4), we write Tu = S(Q) n US(IR). We write Pk+(1/2) (U; C) for the space of holomorphic cusp forms on S(A) satisfying (m2) and (m' 1)

f (ax(u, e)r(O)) = E2((u2i E)) f (x)e((k + 2)9) for u E U and a E S(Q) .

Then Pk+(1/2) (Fu; C) - Pk+(1/2) (U; C). Thus we can transfer the rational

structure from the classical side to the adelic side to have the spaces Pk+(112)(U; A) for any subalgebra A of C.

3. - In this section, we first prove the density theorem of low weight classical cusp forms in the space of p-adic cusp forms of half integral weight. Using this fact, we describe another way, much closer to Weil's original definition in [W] and due to Shimura [Sh2], to define S(A). By the strong approximation theorem, we have a bijection :

{congruence subgroups of S(Z) of level prime to p} <> Z = {open subgroups of S(Z(P))}

A=

fl s(z) -,& : the closure of A in S(Z(P)),

where Z(P) = fl Z e. We put t =/P

S.(0; A) =U SK.(AI(pa); A) and Pk+(1/2) (S; A) =U Pk+(1/2) (A, (pa); A) a

a

Let 0 be the ring of Witt vectors with coefficients in an algebraic closure FP of FP and K be the field of fractions of 0. Let 1l be the completion

ON A ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2) /Q

145

of an algebraic closure Q of Qp under its standard p-adic norm I Ip. We take an embedding : K - 1l. = QP and fix two embeddings Q -> C and Q , SlP for an algebraic closure Q of Q. Put A = 0 fl Q and S,, (A; 0) _ SK.(0; A)_®A 0, Pk+(1/2) (A; O) = Pk+(1/2) (,a;_A) ®A O. We write S(0; O)

(resp. P(0; 0)) for the p-adic completion of S, (0; 0) (resp. Pk+(1/2) (0; 0)), which is independent of ic (resp. k) if c > 2 (resp. k > 2). This fact is proven in [H2] and [H6] for integral weight and is conjectured in [H 1 ] for half integral weight. Now we can give a proof of this fact for half integral weight.

THEOREM 1. - If k > 2 and p > 3, we have an isomorphism preserving q-expansions : Pk+(1/2) (A; 0)

Pk+(3/2) (O; O) .

Proof : let U be an open subgroup of G(2) (G = GL(2) /z) and Y(U) be the corresponding open modular curve. Suppose that U D G(Zp) and we put

U(pa)=

sESIsP=(0

)modP}.

For each positive integer N, we put (N = exp( jet). Then Y., = Y(U(p-)) has a model over A = Z[1/6N, (N] for the level N of U which is the moduli space parametrizing an elliptic curve E with U-structure and a Drinfeld

style level structure at p; that is, a morphism ¢ : Z/p'Z -+ E of group schemes such that

E [O(P)] is of degree pa as a relative Cartier divisor PEZ/p'Z

(see [KM] Chapter 1 or [H71). Suppose that U C Uo(4). We can compactify Ys, adding cusps to get the proper curve X, which is regular proper over Zp [KM]. Let w/Y be the invertible sheaf corresponding to weight 1 modular

forms studied in [KM]. Let Ia be the Igusa curve containing the cusp 00 which is the irreducible component of Xq mod p'. If we consider the pordinary moduli problem 0 : pp. C E of generalized semi-stable elliptic curves, it gives an open subscheme Ua of X,, whose fiber at p is Ia-{super singular points}. Then there exists a unique invertible sheaf w1/2 on Ua such that wl/2 = Wand O E r(Uc/c, wl/2). By the q-expansion principle and p > 3, 9 is a section defined over Z P, We first suppose that U is contained in the principal congruence subgroup of level 24. Then the Dedekind ri function is a section of Ho(Uc.,w1/2). Writing w(2 + (k/2)) for w1®2 ®° IU., we

146

H. HIDA

consider the following commutative diagram : 0 1

H°(U.,w(k+

2))

H°(U.,w(k+ 2) 0 Z/p°Z)

0Z/p`xZ

177

l77

H°(U,,wl(k+ 1)) ®Z/paZ

H°(Ua,wl(k + 1) ® Z/paZ)

lp

1

H°(Ua,O(D)) 0 Z/p`Z

H°(U., O(D) 0 Z/paZ)

1 0,

E

where D is a cuspidal divisor given by div(a) =

(ords(ij))s

sEU.,orde (,j)>0

and the first horizontal maps are given by the multiplication by 77. Here we

regard D as a closed subscheme of Ua in a natural way, and O(D) is its structure sheaf. The first row is exact. When k > 2, deg (w (k + 2) ®A Q) > deg(1 x,/A ®A Q). Thus the Riemann-Roch theorem tells us the vanishing of H1(X,y, w (k+ 2)) ®AQ = H1(U.y, w (k+ 2)) ®AQ. Since w (k+ 2) is Aflat, this shows the vanishing of H1(Uy, w (k + 2 and the exactness of the second row. Since the vertical maps are injective, we have a commutative diagram whose rows are exact if k > 2 0

H°(Uy,w(k + 2)) 0 Z/ppZ

H°(U.y, w(k + 2)) 0 Z/ppZ

H°(U.y, O(D)) 0 Z/ppZ

0

E Ea

H°(U.y,w(k+2)) ®Z/ppZ

H°(Uy,w(k+ 1))O Z/p'Z

H°(Uy, O(D)) ® Z/p8Z

ONA ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2),,

147

where 0 < /3 < a < y and Ea is the modular form on U. of weight 1 with E,, - 1 modp«. Taking injective limit with respect to y, we write

H°(U,w(t)) = lim H°(Uy, w(t)) . 'Y

Then we have by the p-adic density theorem of integral weight modular forms, if k > 2 0

0

HO (U., w(k + )) ® Z/pOZ

E.

H°(U.,w(k+

®Z/p'Z 2))

2

H°(U.,w(k + 2)) 0 Z/pa7G

H°(U,,.,w(k + 1)) 0 Z/p'Z

H°(UU, O(D)) ®Z/p'7G

H°(Uc, O(D)) ®Z/p Z

.

This shows the p-adic density theorem for half integral weight if 24 1 N. If not, we just use restriction and transfer maps and recover the result in general if p > 3. Put

S,(A) = _U S,,(0; A) and P,,,(A) = _U PK,(0; A) DEZ

S(O)

= U S(0; O)

,

DEZ

and P(O) = U P(0; 0).

DEZ

AEZ

If f E S,,(A), one can find I' such that f E SK,(I;A). Then for each x E S(A(P°°)) (A(P°°) = {x E A xP = x,). = 0}), one can find I

u E f c S(A(°")) and y E S(Q) such that x = wy, where f is the closure of T in S(A(°°)) (A(°°) = {x E A I x = 0}). Some time ago, Shimura defined the action of x E S(A(P°°)) on f by f' = f I y [Sh2]. Then he showed that the action is a smooth action of S(A(P°°)) on Sk (Q( 6)), where Q(b) = Q[(N I (p, N) = 1] is the maximal abelian extension of Q unramified at p. Using Katz's theory of p-adic modular forms (see [H7] Chapter 2), it is easy to check that the action of S(A(P°°)) preserves S,, (A) and extends

148

H. HIDA

to S(O) by p-adic continuity. Note that the representation of S(A(P°°)) we obtained is smooth, but not of finite type. I like to call this representation the p-adic automorphic representation of S(A(P°°)). According to Shimura [Sh2l, we can give a definition of S(A(P°°)) as follows :

S(A(PO°))={(x,v)ES(A(P°°))xGL(P(O)) I (f")2 = (f2)x for all fEP(O)}. Then we have an exact sequence : 1-* {±1 } -> S(A(P°°)) - S(A(P°°)) -1.

It is basically shown in [Sh2l that any x E S(A(P°°)) is liftable to an automorphism v of Pk+(1/2) (Q( 6)). Since x preserves A-integrality, v keeps

A-integrality and hence gives an automorphism of P(O). This shows the surjectivity of 7r. There is an alternative way of showing the surjectivity of it. One can check that the action of S(Z(P)) is liftable to half integral weight by multiplying half integral weight cusp forms by 71 (or 0), because the action

of S(Z(P)) preserves A-integral structure of integral weight cusp forms. It is easy to check the liftability of the action of upper triangular matrices. Thus by the Iwasawa decomposition, every x E S(A(P°°)) is liftable. By definition, we have a smooth p-adic "automorphic" representation of S(A(P°°)) on P(O).

Although we do not have a good action of S(Q) on P(O), we can at least define an action of the maximal split torus T(7LP) = ZP < in S(7LP). Take a subgroup A corresponding to Z E Z. Thus its level N is prime

to p. We assume that ro(4) D A. When A is a Zr algebra, we can show multiplying by 0 as done in [H 1l §3 that Pk+(1/2) (O1 (pr); A) is stable under

the action ofZN= 7LP" x (Z/NZ)" for the level N of A, which is given for (A 1 (pr); A) by f E Pk+(1/2) (3.1)

fz= f kzP

o,,, for arz E SL2 (7G) with vz

= I\

z-1 0 ) 0

Npr.

z 11 mod

This action of ZP" extends by continuity to P(O).

4. - We put W = 1 + pZP in ZP" Z. Then W = ZP as topological groups, and ZP" = W x a for the subgroup p of (p-1)-th roots of unity. Simplifying the notation, we write Pk+(1/2)(Npa; A) for Pk+(1/2)(L'1(Npa); A). We put,

for O- D E Z and a character E of W modulo pa, Pk+(1/2)(A(pa);E; A)={fEPk+(1/2)(AI(pa);A)

If I z=E(z)4f for z E W},

where A(pa) = Ai(p) n Fo(pa) and A is a ring either in SZP or in C containing all the values of e on W. We now consider the action of the

ON A-ADIC FORMS OF HALF INFEGRAL WEIGHT FOR SL(2)/Q

149

Hecke operator T(q2) for each prime q on Pk+(1/2)(A1(pa); (C). As shown in [Shl] Theorem 1.7, we know

a(n,fIT(q2))=a(p2n,f)+q-1(2)a(n,flq)+q-'a(n/g2,flg2) ifg{Np°, (4.1)

a(n,flT(g2)) = a(p2n, f) if

glNp«,

where N is the level of A and q E ZN (= ZP < x (Z/NZ)") acts on f as in (3.1). This combined with [H 11 Theorem 2.2 shows that Pk+(1/2) (O 1(pa); 0)

is stable under T(q2). In particular, we can define the idempotent e in Endo(Pk+(1/2)(A1(p"); 0)) by taking the limit :

e = n-,oo lim T(p2)n!

(4.2)

We write M°'`1 for eM for any module M with an action of e. Hereafter we allow as a base ring a finite extension of the ring of Witt

vectors with coefficients in ]FP and write the ring as 0 and its field of fractions as K. All the definitions we have given for the ring of Witt vectors carry over to this slightly general situation by extending scalar to 0 from the ring of Witt vectors. Write A = 0[[W]] for the completed group algebra of W. Then A is isomorphic to the one variable power series ring O[[X]] via u 1--f 1 + X if we fix a generator u E W. We fix an algebraic closure 1L of the quotient field L of A and consider the algebraic closure of K in QP as a subfield of E. For each normal integral domain II in 1L finite over

A, let X(II) = Homo_alg(ll, S2p) be the space of all Qp valued points of Spec(II) and A(ll) be the subset of arithmetic points, that is, those 0-algebra homomorphisms P : II -+ QP such that P(ry) = -1k(P) for an integer k(P) > 0 on a neighborhood of the identity of W. Thus sp(ry) = P(-y)ry-k(P) defines a finite order character of W, whose order will be denoted by Pr(P)-1. We write A(II; 0) = {P E A(II) 10 D P(1)}. For each congruence subgroup A (with level N) associated with ,& E Z, let ]F(O;1) be the space of II-adic cusp forms. Thus f E IF(A; II) is a formal q-expansion : (n/N, f)qn/N E ] [[q1/N]] n=o

whose specialization f(P) = E P(a(n/N, f))gn/N E P(II)[[g'/N]] at PEA(II) n=o

is a classical cusp form in Pk(p)+(1/2) (A(pr(P) ), sp; Stp) for all P E A(ll) with

sufficiently large k(P) > 0. When A = r1(N) (4 I N), we write 1F(N;11) for lP(O; II). Since A is a regular local ring of dimension 2, 1 is A-free. Fixing a

150

H. HIDA

base {ij} of l over A, we can write formally that f = Ejfjij. Then it is easy to see that fj is a A-adic form. Thus P(0; II) = IP(A; A) OA II. There is another interpretation of the above space of A-adic forms. We first identify A with the measure algebra on W having values in O. Then to each f E P(0; A), we associate a p-adic measure 0 fW qdf on W having values in O[[g11N]] by (4.3)

fW Odf =

J

Oda(n/N, f)q N E O[[ql/N]].

n=1 W

Writing Xp(w) = Ep(w)wk(p) for each arithmetic point P (that is, the character of W corresponding to P), we have fW xpdf = f(P) E Pk(p)+(1/2) k(P) >> 0} (0(pr(p)), p; Q p) for sufficiently large k(P). Since {xp I

spans a dense subspace of continuous functions on W having values in K, as a measure, df has values in P(O). In particular, the new measure 0 H fW qdf s for s E S(A(P°°)) again comes from a A-adic form f s E IP(O3; A) for a suitable congruence subgroup O3 corresponding to I

Os E Z. Thus, we have a natural action of S(A(P°°)) on P(1) _ U IP(A; II). ZEZ

Similarly, we have an action of Hecke operators T(q2) and the group Z on IP(N; II). Writing c : w 1--> [w] for the tautological character of W into A,

we know that w E W acts on F(N; II) via t, that is, f I w= [w]f. Since the projector e naturally acts on Pk+(1/2) (O) and hence on P(O), e again acts on P(0; II) and P(1[). We note this fact as

PROPOSITION 1. - As long as q is prime to the level N of 0, we have Hecke operators T(q2) given by (4.1) and the ordinary projector e on IP(A; II), and the metaplectic group S(A(P°°)) naturally acts on P(II) through a smooth representation. Here the smoothness means that the stabilizer of each vector

in the representation space is open in S(A(P°°)).

We can think of the corresponding notion of II-adic cusp forms for integral weight modular forms (cf. [H5] Chapter 7). We briefly recall the definition. For o E Z, a formal q-expansion f c II[[g1/N]] is called an II-adic cusp form of integral weight if f(P) E Sk(p) (A(pr(p)), Ep; Q p) whenever P is arithmetic and k(P) is sufficiently large. We write S(A; II) for the space of II-adic cusp forms (of integral weight). Then similar to Proposition 1, we have Hecke operators T(n) (cf. [H5] Chapter 7) and the ordinary projector e on S(o; II). In this case, e is given on the space of p-adic cusp forms by

e = lim T(p)". The group S(A(P°°)) naturally acts on U S(0; II). We n-*oo DEZ

actually need to have G(A(P°°))-action (recall G = GL(2)lz). Note that

ON A-ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2)/Q

151

G(A) = G(Q)G(7G)G+(R) for the identity connected component G+(R) of G(R). For each open subgroup U of G(2), we consider cusp forms f : G(A) ---> C satisfying : (M 1)

f (axu) = f (x) det(u,,,) J(u,,., i)-k for u E UC,,.R" ;

(M2)

Df= rk(k2 -2)l f; u ) xf Idu=0forallx E G(A).

(M3) J(Q\A

1

We write Sk(U; (C) for the space of functions f satisfying (M 1-3). Choosing

a complete representative set R = R(U) for G(Q)\G(A)/UG+(R) in G(2), (Ftut-1 = S(Q) f1 tUt-1S(R)) for each we can define Ft E Sk(FtUt-1; cC)

t c R by Ft(z) = f(tg)det(g)-1J(g,i)k, where g E G+(TR) such that g(i) = z. Then it is easy to see Sk(U;C) = ®tERSk(FtUt-1;(C). We then define Sk(U; A) by the image of ®tERSk(FtUt-1; A). We can take R inside

R = { (0

0

/

1 a E 7L(P) }. We always choose R in this way. Then we have

e and T(p) we/ll defined on Sk(U; Slr). Let U = {U : open subgroup of G(Z(P))}.

Write Uo = U x GL2(ZP) for U E U. Taking R(Uo) in R so that R(Uo) D R(Vo) if V D U for all U, V E U, we define S(U; II) = ®t ER(U(,)S(rtUot-1; II)

and S(l[) _ U S(U; II). Using the stability of U S(0; )<) under S(A(PO°)), UEU

pES

it is easy to check that S(I) is stable under S(A(P°°)). Since Ca a E A(P°°) basically permutes the direct summands S(II) is stable under G(A(P°°)). We thus have

011

with

S(rtuot-1; II) of S(U; II),

PROPOSITION 2. - The space S(U; II) has, as II-linear endomorphisms, the

ordinary projector e and the Hecke operators T (q) for primes q prime to the level of U. The group G(A(P°°)) acts on S(II) smoothly.

5. - Before going into a hard work, we like to give a sketch of the theory. The first main result is

152

H. HIDA

THEOREM 2. - The automorphic representation of S(A(POO)) on pord (II) is

smooth and, after having extended scalar to the field of fractions of II, is a discrete direct sum of irreducible admissible representations with multiplicity at most 1.

Putting off all the details to the end of this paper for attentive readers,

we here give a sketch of the proof. It is well known that §ord(N; A) = Sord(rl(N); A) is free of finite rank over A (see [H5] Chapter 7), and if k(P) > 2 for P E A(A; 0), then (*)

Sord(A; A)/PSord(A;

A) -

Skrd(A(pr(P)), EP; 0)

.

This implies that there are only finitely many, bounded independently of weights, of complex irreducible automorphic representations of G(A) which is p-ordinary and of conductor dividing Np. On the other hand, one has the Shimura correspondence : Sh : {irreducible holomorphic automorphic representations of S(A) of weight k + 1 } --> {irreducible holomorphic automorphic

representation of G(A) of weight 2k}.

By a result of Waldspurger, there exists a bound M > 0 such that (i)

#Sh-1(7r) < M for all k, if C(7r) I Np,

where C(7r) is the conductor of 7r. If if is p-ordinary (that is, the eigenvalue of T(p2) in is a p-adic unit), Shff) is p-ordinary. Moreover, if we write V for the space of i, we have a positive bound M' independently of weights

(but depending on 0) such that (ii)

Then (i) + (ii)

dime H°(0(p), V) < M. ranko Pk+(1/2) (A(p); 0) < M"(0) independently of k

for a positive bound M"(o). Take a subset

in pord (A; A)

which is linearly independent over A. Then we can find m rational numbers nl,... , n,,,,, such that D = det(a(n1, Off)) # 0. Therefore for arithmetic P with k(P) sufficiently large and ep = id, gti(P) is and element of Pord (p); 0) and D(P) 0. In other words, {oi(P)}ti is linearly independent over 0. Therefore m < M". This implies that rankA pord (A; A) < M". As we will see later, Ford (A; A) is actually free of finite rank over A. Then all the assertion follows from the weak multiplicity one theorem of Waldspurger by reducing the A-adic reprensentation modulo P.

ON A ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2)

/Q

153

Thus we have the A-adic Shimura correspondence :

Sh : {irreducible A-adic ordinary automorphic representations of S(A)} -- {irreducible A-adic ordinary automorphic representations of G(A)}. Suppose II = Sh(II). We write 9rp = II mod P and 7rp = Sh(arp). Then 7rp for an arithmetic P is a scalar extension of classical representation if k(P) >

2. This means that one can supplement a (unique) local representation at p with irp to get a complex automorphic representation if k(P) > 2, which we again write 7rp. Similarly 9rp is associated with a complex automorphic representation of the metaplectic group if k(P) is sufficiently large, because we can only prove the metaplectic version of (*) under the assumption that k(P) is sufficiently large. Here note that 7rp # II mod P but 7rp = Sh(arp) = II mod p2, because representations of weight k + .

correspond to those of weight 2k. Here we used the group structure of Spec(A)(O) = Homgr(W,Ox) to define p2. The above fact characterizes the A-adic Shimura correspondence. By (*), the prime to p-part C(II) of the conductor of 7rp is independent of P. Moreover the central character of II can be written as L02 for a finite order even character V) modulo 4pC(II), where t is the tautological character of W into A" composed with the "norm" character : (A(P00))x 9 x --* IxI-lw-1(x) E ZPx for the Teichmuller character w. We put Op = cp2/)w k( ) for each arithmetic P. As a striking consequence of his theory, Waldspurger expressed the square of a certain ratio of two Fourier coefficients of a cusp form of half integral weight by a ratio of L-values attached to the image under the Shimura correspondence. Applying this result, we get a A-adic version of his result : I

THEOREM 3. - For each pair (m, n) of positive square free integers with m/n E HII4NPQ , we find two elements 4 and T in II such that if k(P) > 1 or 1/i2p # 1, we have : 4,(P)2 q,(P)2

L(2,7rp L(2,7rp

P1Xm)

as long as

L(2,EP ®0P'Xm) 0, where Xt is the quadratic character associated with Q(f ). 6. - We now start filling the details with the argument in Section 5. Fix a character V of (Z/NpZ) x . For each arithmetic point P E A(A), we define

154

H. HIDA

a character by of ZN by Op(z) = /i(z)Xp()z-k(P) _,0ePW-k(P)(z), where z 1--> < z > is the projection to W and w is the Teichmiiller character. We now prove

PROPOSITION 3. - The dimension of P%+1/2)(A0(pr(P))>V)P;1 )) is

bounded independent of P E A(A) if k(P) > 1 (the dimension depends on ,& E Z).

To prove the proposition, we prepare several lemmas. Let £ be a prime

and put

Ur=Ur,e={( a

d)esL2(z)Icomodr},

For each character x of Ze modulo £r and a Ur-module M, we write M(x)

for the x-eigenspace. That is, M(x) _ {m E M I (a Ca c

b d/

E

d

) m = x(d)m for

Ur}. When the reference to the level fr is necessary, we write

M(i'.r, x) in place of M(x).

LEMMA 1. - Let 7r be an irreducible admissible representation of the metaplectic covering group S(Qe) of SL2 (Q) and V denote its representation

space. Let x be a character of Qe modulo fr. Suppose that 7r appears as a local factor of a holomorphic automorphic representation of weight k + (1 /2) (k > 2). Then the dimension of V (.fir, x) is bounded independent of V and x (but it depends on r).

Proof : when 7r is special or principal, then we can realize it as a subquotient of the induced representation space 13,. = 13µ,eQ of a character A of the standard Borel subgroup, as in [Wal, 11.2] and [Wa2, II], for the .part ee of the standard additive character e of A/Q and a quasi character

u of the standard Borel subgroup of §(Q t). Since the left translation by the upper triangular matrices of S(Qe) is already prescribed on Bµ, any function in C3µ is determined by its restriction to SL2(Zt) x {±1}. Then for each given open compact subgroup U of SL2(Ze) x {±1}, the dimension of H°(U, Bµ) is bounded by the index 2(SL2(Z() : U). A more effective bound can be obtained using the explicit calculation of the space x)

done in [Wa2) Proposition 9 (p. 417) (see also Lemma 3 in the text). We then have dim(B,(Qr, x)) < 2(r + 1). This settles the problem in the case of non-super cuspidal representations. Let II be a holomorphic automorphic representation of S(A) of weight k + a (k > 2) having it as its factor

155

ON A ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2),Q

at e. Let W be the space of the e-component of the automorphic representation of GL2(A) corresponding to II by the Shimura correspondence. Using the notation of [Wall V.4 (p. 99), we mean by W the f-component of V'(e, V) ®x. We know from [Cl that dim W (er, x2) < r + 1. By [Wa2) V, Proposition 5 (p. 404), V (er, x) is a subspace of the space spanned by, with the notation in (Wa2l, i,,,e o j,,,e(w)(fr,,,) for r sufficiently large (if

r > max(2ve(2) + 1, ve(C(x))) for the conductor C(x) of x), where w E W(er,x) and v E (Qell"(V)/(Qell")2). Here fr,,, is a SchwartzBruhat function on He = {x E M2(Qt) I Tr(x) = Of determined by (r, v) as specified in [Wa2) Chapter V. The choice of v E Qell" is bounded by #(Qellx/(Qell>)2) which is 4 if e > 2 and 8 if e = 2. Thus we have, for general V,

dim(V (er, x)) < 8(r + 1) for r sufficiently large . This finishes the proof. LEMMA 2. - Let it be an irreducible admissible representation of S(Q1)

with representation space V. Suppose that it is super cuspidal. Then, for sufficiently large m, T(t) annihilates V (er, x) if r > 0. Proof : note that

Uo(er)(( 0 e0 and

((e0

1)Uo(er) =

U uE7Z. /1 "Ze

uf-M ((e0 f-M ),1) _

((e0

e--M.

)

eo )'i) ((0

1

,

i)Uo(er)

u) 1). )

Thus for v E H°(Uo(er), V), we define an operator T(f-) by

vIT(em)= UEZe /1"Z

((e0

ue-'n ) e-,n

,

1) v .

The operator T(fm) coincides with the Hecke operator (TT)m acting on V (f', x) defined in [Wa2) III.3, pp. 388-389. Then we have

vit(((0

eo )'1)) I_'_Zt it( ( 10

1

,1)vdu=0

for sufficiently large m by the definition of super cuspidality. As shown in (Wa2] Lemma 4, p. 389, we know that TI = e(3-2k)/2ye(ex(e)-1T(e2) for T(e2) defined in [Shl], where ye(t) = (t, t)e-Ye(t)rye(1) 1. Thus we know the lemma from the above result. Here we should note that the definition of our space of modular forms of half integral weight is different by the character k

(=1) from that of [Wa2l, and thus we do not replace x by Xo as was done in [Wa2l for these formulas.

156

H. HIDA

LEMMA 3 ([V)). - Suppose that f > 2. Let V = 13,. and let X be a continuous character of Q into Cx. We consider the Hecke operator Tyr) = Suppose that r > Sup(vt(C(X)), X(2)ry'e(2)-1f(2k-3)/2Te.

vt(C(fLX)C(ItX-1))), where C(X) is the conductor of X. Then we have the following assertions :

(i) If both µX and pX-1 are non-trivial on Zellx, then T(P) is nilpotent on V (fr; X) for r > 0; (ii) Suppose that pX-1 is unram flied but pX is ramified. Then we can decompose V(Fr; X) = N ® V(C(X); X) so that T(e2) is nilpotent on N and V (C(X); X) is one dimensional on which T(e2) acts by scalar multiplication of XV)Q

(iii) Suppose that uX is unramified but uX-1 is ramified. Then we can decompose V (Fr; X) = N ® V (C(X); X) so that T (f2) is nilpotent on N and V (C(X); X) is one dimensional on which T(2) acts by scalar multiplication

of

x()Q(2k-1)/2/l(e)

;

(iv) Suppose that both µX and

µX-1

are wuamified. Then we can de-

compose V (.fr; X) = N (D V(; X) so that (I) T(t2) is nilpotent on N, (ii) V (i?; X)

is two dimensional, and (iii) we have a base {vl, v2 } of V (2; X) such that vlIT(t2)=X(&)f(2k-l)/2µ(2-1)v1

with some constant c.

and v2 IT(j2)=x (t)e(2k-1)/2A(f)v2+cv1 l

Proof : write v(e) for the exponent of 2 in C(e) for any character of e of Z.11'. As shown in [Wa2) Proposition 9, p. 417, under the assumption v(µX-1) of r > Sup(v(X), 1), V(2'; X) 0 if and only if r > v(pX) +

As long as r > v(X) and r > 1, T(22) sends V(2r; X) to V(; X) (cf. [Wa2] Lemma 7 or [H2) (8.6)). This shows that for sufficiently large M. V (2r; X) IT(f2m) is contained in V (C(X); X) or V (f; X) if v(X) < 1.

Unless X is quadratic, v(X) = v(X2) since 2 > 2. Thus if X2

id,

then v(pX) + v(µX-1) > Max(v(µX), v(,X-1)) > v(X). If moreover both v(pX) and v(/4X-1) are positive, then v(pX) + v(pX-1) > v(X) and thus V (C(X); X) = 0. Therefore T(22) is nilpotent on V (jr; X) if X2 id and if both v(jX) and v(pX-1) are positive. Now suppose that X2 = id and both v(µX) and v(pX-1) are positive. Then if X id, then C(X) = 2

and V (C(X); X) = 0 because v(pX) + v(pX-1) > 1. If X = id, then again V(2; X) = 0 because v(µ) + v(µ) > 1. Thus T(22) is nilpotent if both v(pX) and v(µX-1) are positive. Now suppose that v(pX-1) = 0 but v(µX) > 0. Then X2 54 id because v(µX) = v(X2) > 0 (and hence v(X2) = v(X)), and V(C(X); X) is one dimensional by [Wa2] Proposition 9. Moreover

by [;a2] Proposition 10, (ii), we know that T(f2) acts on V(C(X); X) by X(f)2(k/2)-lµ(2-1) Thus we can decompose the scalar multiplication of V(fr;X) = N (3 V(C(X);X) such that on N, T(f2) is nilpotent, and on the one dimensional space V(C(X);X), T(f2) acts via the multiplication

ON A ADIc FORMS OF HALF INTEGRAL WEIGHT FOR SL(2)/Q

of

X(f)f(k/2)-1µ(f-1). Suppose v(µX-1)

157

= v(µX) = 0 and x # id. Then

x2 = id because v(ltX) = v(X2) = 0. By [Wa2] Proposition 2, V(f; x) is 2-dimensional, and there is a base {v1, v2} of V(f; x) such that v1 V2

I T(f2) =

x(f)f(k/2)-1µ(f-1)v1

I T(f2) =

and

X(f)&/2) -11L(f)V2 + f(k/2)-21'e(f)-1x(f)(f - 1)v1

.

Thus we can decompose V (P'; x) = N ® V (t; x) such that on N, T (j2) is nilpotent and on the 2-dimensional space V (f; x), it acts by the above formula. Next suppose that v(µx-1) = v(ax) = 0 and x = id. Then V (f; x) is 2-dimensional, and we can find a base {v1, V21 by [Wa2] Proposition 10 such that (v1 + v2) E V(1; x) and

vl IT (f2) =

x(f)f(k/2)-1µ(f-1)v1

and v2 I T(f2) =

X(f)f(k/2)-1µ(f)v2

+ cv1

with some constant c. The value of c is given by [Wa2] p. 420. This shows

that V (f'; x) = N ® V (f; x) such that on N, T(f2) is nilpotent, and on the 2-dimensional space V(f;x), T(f2) is an automorphism described

as above. Finally we assume that v(µx) = 0 but v(µx-1) > 0. Then v(µX-1) = v(X-2) > 0 and hence v(x2) = v(x). Thus again by [Wa2] Propositions 9 and 10, V(C(x); x) is one dimensional and T(f2) acts on X(f)f(k/2)-1µ(f). Therefore, we can decompose it by the multiplication of V (rr; x) into V (C(x)) X) ® N, where on N, T(f2) is nilpotent and on the one-dimensional space V (C(x); x), it acts by the scalar X(f)f(k/2)-1µ(f). LEMMA 4 ([Wa l ] Proposition 18, p. 68). - Let p* be an irreducible admissible representation of PGL2 (Qe) and let p be the corresponding

irreducible admissible representation of S(Q1) via Well representation with respect to the additive character ee (1; E Qell"). Then we have

Equivalence class of p* 7r(µ> µ-1) (µ2 a) 11 2 a, jI 7 a 1/2 ) a(µ, A -1 ) (11

Equivalence class of p r{lx{

u(a1/2 a-1/2)

Supercuspidal Supercuspidal

Supercuspidal

aµxe

where et is the standard additive character of Qt and ee (x) = et(t;x) and we have used the notation of [Wa 1 ] Propositions 1 and 2.

158

H. HIDA

Here note that irf,xe (resp. &µx4) with respect to ep is isomorphic to µxev (resp. vµx{,,) with respect to e,v, and hence the right-hand side is well defined independent of the additive character. A cusp form f E S,c(A,(pr);C) is called ordinary at p if f I T(p) = Af and IAIp = 1. An automorphic representation it of GL2(A) spanned by a holomorphic primitive form f is called ordinary at p if f is ordinary at p. LEMMA 5 (e.g. [H3) § 2). - Let 7r be a unitary holomorphic automorphic representation of GL2 (A). Suppose that 7r is irreducible and ordinary at p. Then the local component 1rp of 7r is either a principal series representation it (a, )3) with unramfied a or a special representation a(a,,0) with unramified a. Let f be the primitive form of weight k on GL2 (A) belonging to it and

write µ for the central character of it and A(T(p)) for the eigenvalue of T(p) on f. Then if 7r = 7r(a, p) and a and,3 are both unramified, then a(p) + /3(p) = p(1-k)I2A(T(p)), a(p)/3(p) = µ(p) and I A(T(p))I p = 1. If7rp = 7r(a, /)) and,3 is ramified, thena(p) = p(l-k)/2A(T(p)), a(p)O(p) = µ(p) and I A(T (p)) I p = 1. If itp = a(a,,3), then 7r,,. is of weight 2 and A(T (p)) = a(p).

LEMMA 6. - Let F be a number field of finite degree. Let p be a cuspidal automorphic representation of PGL2 (FA) and let R be the set of all cuspidal automorphic representations of S(FA). Define for each integral ideal N of F,

R(p; N) = {7r c R 17rv = pv for all v outside N}

,

where 7rv denotes the corresponding representation of PGL2(Fr) via Weil representations defined in [Wa 1, V.41 (where it is written as : T see Lemma 4). Then we have

V'(e, T) ;

#R(p; N) C #{H INF,
Proof : we know from Lemma 4 (or the remark after the lemma) that

if T is principal or special, then V'(ev,Tv) =

Moreover if

x/y E (Fti )2, then V'(ev,Tv) = V'(ev,T.) for all Tv by [Wal) Theorem 2, p. 80, Proposition 28, p. 98 and [Wa2) Assertion 3, p. 394. Thus the number of isomorphism classes in {V'(ev ,Tv) I x E Fvx} for all v outside N are at most #{l lNF
Proof of Proposition 3: we only prove the assertion when A = r1(N). The general case follows from this special case because any A contains a conjugate of F1(N) for a suitable N. We shall prove the boundedness

for P E A(A) with k(P) > 1. We write x = op for a given P E A(A)

ON A-ADIc FORMS OF HALF INTEGRAL WEIGHT FOR SL(2) /Q

159

and V) : (Z/NpZ) 1 --4 QP and consider x as an idele character so that

X(w) = x(.&) for a prime element zu at any prime f outside Np. Let V be the

subspace of functions on S(A) spanned by right translations of elements in

nord

(Npr(P), OP; (C)

under the Hecke algebra of S(A). We decompose V = ®PV(p) into the sum of irreducible subspaces V(p). Then by the weak multiplicity one theorem proven by [Wal) p. 131, each irreducible representation p occurs at most once. Decompose p = ®epe into the tensor product of local representations. Then by Lemma 2, pp is either aµ or rµ for a quasi character µ : Qpx - Cx.

By the Weil representation, aµ corresponds to Q(µ,µ-1) and 7r(p, p-1), which is a representation of PGL2 (A) (Lemma 4 and [Wa 1) Proposition 27 and Lemma 70). Then the Shimura correspondence is given locally by

aµ'' o,(Fex, p-1x)

and 7rN,

i)7r(px, p-1x)

and globally by p 1-- + p* ® x, where p H p* is given via the global Weil representation. The eigenvalue for T(p2) on V(pp)(pr; x) (r = r(P)) is given as follows (Lemma 3) : if µx is unramified, then it is

(k = k(P)); if µ-1x is unramified, then

µ'1x(p)p(2k-1)/2

px(p)p(2k-1)/2

and if both px

and p-1x are ramified, it vanishes. On the other hand, these values are the eigenvalue of T (p) on V (p p O x) (pr; x2) by Lemma 5. Note that even if both

µx and p-1x are unramified, at most one eigenvalue in can be a p-adic unit in Q. Thus p corresponds to and p-1x(p)p(2k-1)/2

px(p)p(2k-1)/2

the ordinary p* of character x2 and of level at most Npr(P). Then by [H5) Theorem 7.3.3, the number of such automorphic representations occurring in S2k(P)(Npr(P),x2) is bounded independent of P if k(P) > 1. Then by Lemmas 2, 4 and 6, we know the assertion of the proposition.

We say an element f c IP(A; II) is ordinary, if for all P E .A(II) with sufficiently large k(P), fp E space of all II-adic ordinary cusp forms as Ip°rd(A; II). Then p°rd(A; II) _ Pk(P)+(1/2)(A(pr(P)),Ep;Sl1). We denote the

eP(A; II).

PROPOSITION 4. - For each 0 E Z with A C I'o (4), p rd (A; II) is free of finite rank over II.

Proof : we prove the assertion for p°rd (N; II) applying the argument of Wiles [Wi). The other cases can be treated similarly. Let A = I'1(N). Let IK be the quotient field of II, which is a finite extension of L. We put pord (N; III) = pord (N; II) 02 K. Let f1 i f2,. .., fr be a finite set of linearly independent elements in P°rd (N; II) over II. Then we can find positive integers

160

H. HIDA

nl,... , n,. so that D = det(a(n2, fj)) # 0. We now choose P E A(II) so that for all i = 1,... , r, fz(P) E

Pk(P)+(1/2)\0(pr(P))+EP QP)

and D(P)

0.

Then 0 # D(P) = det(a(nz, fj(P)) and thus ff, (P) are linearly independent. Namely, we have r < dim Pk(P)+(1/2) p, (p' (p)), eP; lP)

,

which is bounded independently of P by Proposition 3. Thus there is a maximal set {f1i f2, ... , fr} of linearly independent elements in pord(N,1[). That is, dimK Ford (N; IIK) = r < oo. For any fin prd (N;1[), we can write r

f = > cj(f)fz and Dc;,(f) E L Thus D-1(IIf1 +

+ If,) D ll ord(N;1[) and

£=1

hence prd (N; II) is of finite type over II as II-module, because II is noetherian. Now we see by definition that ]EDOrd(N; II) = npJpord(N; IIp) where P runs

over all prime ideals of height 1, lip is the localizaton at prime P and lord (N; 1p) = pord (N; II) ®Q IIp. This shows that p rd (N; II) is II-reflexive and

hence if II = A, then prd (N; A) is A-free of finite rank. Since we already know that Ford (N; 1[) = pord (N; A) ®A II, we conclude that Ford (N; 11) is II-free

of finite rank. PROPOSITION 5. - Let PEA(II). Then each f EPk(P)+(1/2) (A(pr(P)), "FP; 0)

can be lifted to an ordinary A-adic form f E p

rd (A; II)

such that f(P) = f .

Proof : it is sufficient to prove the assertion for II = A. Let E(X) E A[[q]] be the A-adic Eisenstein series (cf. 1H51 § 7.1) such that for the generator

w=1+pofW

l E(Q)=(Q(w)-1) {LP(1-k(Q),EQw-k(Q))/2+y( Q()d-1)gn1 J 00

n=1 0
in Mk(Q) (A(pr(Q)), EQ; 0P) for all Q E A(A). Then we see for the point Po of

X (A) corresponding to the trivial character of W, E(Po) = (1 -p) log(w)/p, which is a p-adic unit. We then put F = E(Po)-1E and consider the pro(A(pr(P)), duct f F inside A[[q]]. Then F f (Q) = f F(Q) E Pk(P)+k(Q)+(1/2) EPEQ;1)). We define a formal q-expansion F * f (X) by F f (EP1(w)w-kX + (Ep1(w)w-k - 1), which is a A-adic cusp form in P(N; A) ([H5]

Lemma 7.1.1). Then we see thatF*f(P) = fF(Po) = f. Then e(F*f)(P) _ (F * f (P)) I e = f by Lemma 7 and the assertion of the theorem follows.

ON A-ADIC FORMS OF HALF INTEGRAL WEIGHT F OR SL(2)/Q

161

COROLLARY 1. - For P E A(l[; 0) with sufficiently large k(P) depending

on A, we have pk(P)+(1/2)00(ord

Pr(P)), EP; 0) = pord(A; )()/Ppord(A; 1).

Proof : Choose a base fl,... , fr of POrd(A; II). We can find a> 0 so that (i) fi(P) E Pk(P)+(1/2) Ep; O) for all i and all P with

k(P) > a, and (ii) there exist integersni such that det(a(ni/N,fj))(P)#0 ifk(P)>a. Then fi (P) are linearly independent over O. Thus pord (0; II)/ppord (0 1) injects into Pk(P)+(1/2) (A(pr(p)), EP; 0). Surjectivity of the morphism follows from Proposition 5. COROLLARY 2. - Let fl,... , fr be a base of pord(A; II). Then we can find

integers nl,... , nr so that det (a(ni/N, fj)) E F. Proof : let fl, ... , fr be a base of Pk(P)+(1/2) (A(pr(p)), Ep; O). let = be a prime element of O. If det(a(ni/N, f3) - 0 mod wO for all choice of integers fl, ... , nr, then { fi mod zo} are linearly dependent and hence we can find A, E 0 not all divisible by w such that EiAi fi = 0 mod ruO. Then E Pk(P)+(1/2)(A(Pr(P)) Ep;O)

but w-1 Ai are not all in O. This contradicts to the fact that {f} forms a base. Thus we can find the ni's so that det(a(ni, fj)) E Ox. Now applying this argument to a base {fi(P)} by choosing P with sufficiently large k(P), we find that det(a(ni/N, fj))(P) E Ox which implies that det(a(ni/N, fj)) E Ix

Analogs of all the assertion so far we proved in this paragraph holds for Sord (U; II) in an obvious sense (see [H5] Chapter 7). In particular, the statement corresponding to Corollary 1 for Sord(A; II) holds if k(P) > 2.

7. - We now restate Theorem 3 in the language of p-adic Hecke algebras. Let hord(N; 0) be the p-adic ordinary Hecke algebra defined in [H5] § 7.3. Let us recall the definition. The algebra hors (N; 0) is the Asubalgebra of EndA(Sord(N; A)) generated by T(n) for all n. There is another description of the algebra. Writing h,rd(Np'; 0) for the 0-subalgebra of Endo(Skrd(Npr; 0)) generated by T(n) for all n, we have a natural isomorphism : hord(N; 0) = l4im hk''d(Npa; 0) if k > 2, which takes T(n) to a

T(n) [H2]. Under the natural pairing < h, f >= a(1, f I h), we know HomA (hors (N; 0), A) = §ord (N; A) and (7.1)

HomA (Sord (N; A), A) - hord (N; O)

.

162

H. HIDA

We have a smooth representation of G(A(P°°)) on Sord (II) = U Sord (U, II) _ UEU

eS(l<) and S(II). Thus compactly supported smooth functions on G(A(P°°))

with values in II act on §(1). We fix an algebraic closure L of L. We then consider S°rd(A) = Sord(A) ®A A as a G(A(P°°))-module for any Asubalgebra A in E. Each irreducible factor of the representation on § rd(L) of G(A(P°°)) is admissible by the control theorem ([H5] Theorem 7.3.3, which is the integral weight counterpart of Corollary 1 and is valid for all arithmetic points of weight k > 2). Pick an arithmetic point P with k(P) > 2

and consider the localization Ap at P. Then by the control theorem, Sord(Ap) ®A K(P) for K(P) = Ap/P is a semi-simple GL2(A(P°°))module. Since there are Zariski dense arithmetic points in Spec(A) at which the control theorem holds, we see that Sord (L) is semi-simple as a G(A(P°°))-module. Thus Sord(L) is a sum of irreducible subspaces. The multiplicity is one by the control theorem combined with the multiplicity one theorem in classical situation. Since the proof of the factorization theorem

in [JL] § 9 is purely algebraic, it carries over to our situation, and each irreducible factor it of §ord(L) is factored into the tensor product of local representations : it = ®e P're. Let A; hord(C; 0) -> II be a primitive Aalgebra homomorphism. Then by the control theorem, we have a unique automorphic representation ir(P) = ®e7re(P) corresponding to A mod P for

P E A(II) with k(P) > 2. Thus A corresponds a unique factor it = ir(A) of S°rd (L) and 7re(P) = ire mod P. We write V (7r) for the subspace of §ord (j[) on which S(A(P°°)) acts via it. Thus for each arithmetic point P with k(P) > 2, Ap(T(n)) = A(T(n))(P) is an algebraic number. Then for each Dirichlet character cp, we can define the complex L-function : 00

L(s, Ap

E n=1

(p(n)Ap(T(n))n_s

.

Note that L(s, ir(P)) = L(s + k (p)-', Ap) is the standard L-function of ir(P). As is well known, the L-function L(s, Ap ®cp) has a motivic interpretation. Since II is an integral domain, we see that ZC = ZP" x (Z/CZ)" E) z F--, A() E II is a character, where is the operator induced by the central action of z E (Z(P)) X C G(A(P°°)) on S(II) (see (2.1)). In particular, it restriction to µp_1 x (Z/CZ)" gives a character Oo : µP_1 x (Z/CZ)" --> QP . We regard this character as a character of ZC composing the projection : ZN -> µP_1 X (Z/CZ)" = (Z/CpZ)" and call it the character of A. We now consider the following conditions on A :

(He). Writing xe = 7r (a, 0) when Ire is principal (f a(-1) = Q(-1) = 1;

p), we have

163

ON A-ADIC FORMS OF HALF INTEGRAL WEIGHT FOR SL(2) /Q

(Hp). Oo = Vi2 for an even character 0 modulo N for N divisible by C and 4.

Under this condition, the automorphic representation associated to A is in the image of the Shimura correspondence (see [Wa2] Proposition 2). Now we consider the automorphism of A which takes w to IDm (w E W) for m prime to p. This ring automorphism extends to an automorphism a,,, of II if II is sufficiently large. For each P E A(ll), we denote p2 for

P 0 0'2. Then k(P2) = 2k(P) and Ep2 = Ep. As constructed in [K] and [GS], for each character cp of (Z/NpZ) 1, there is a two variable p-adic Lfunction Gp(P, Q; A ®cp) defined on X(1) x X (A) interpolating the value L(k(Q), Ap ®EQlwk(Q)cp) for (P, Q) E A(II) x A(A) with 0 < k(Q) < k(P). Here is a result slightly stronger than Theorem 3 : THEOREM 4. - Let A : h°rd (C; 0) -> II be a primitive A-algebra homomor-

phism Suppose (He) and (Hr). Then for any pair (m, n) of two square free positive integers with m/n E fl Q2, there exists an element 1 in 1K such LINp

that for all P E A(1[) with k(P) > 1, ifCp(P2, P; A ® /-IX L)

4,(P)2 _ Gp(P2, P; A

0, we have

®V)-1Xn.)Op(n/m)(n/m)k(p)-(1/2)

1Cp(P2, P; A (D V)-1X'.,,)

where Xt is the quadratic character corresponding to Q(f ). Here note that under our assumption on (m, n), (m/n) is prime to Np. Proof : we take P E A(ll) with k(P) sufficiently large. Let cp be a Dirichlet

character. Then for the least common multiple N' of C and the conductor of cp, we find A 0 cp : h(N'; 0) --* 1[ such that A 0 o(T(n)) = cp(n)A(T(n)). Then the character of A ®cp is given by 02V2 . Taking even cp with sufficiently

large 2-power conductor, we may assume that the conductor C' of A (D V is divisible by 16. If we replace A by A ®cp, the role of V) will be replaced by the L-value appearing in the Vicp. Since A ®'-1X,,, = (A ®cp) ®(p assertion of the theorem is unchanged even if we replace A by A 0 . Thus we may assume that 16 1 C (hence Tr satisfies the condition (H2) in [Wa2] p. 378). Let f be the cusp form in Pi(p)+1/2(I o(N2pr(p)) gyp; flu) which is a linear combination of the base defined in [Wa2] Theorem 1 for 7r (p2). Let

us take f E p"rd(N2;1) such that f I T(q2) = a2 0 A(T(q))f for all prime q outside Np and f(P) = c f with 0 7 c E 0. Such f exists by Corollary 1. Then by [Wa2] Corollary 2, for any Q E A(ll) such that f(Q) is classical, we have : a(m, f)2(Q)L(k(Q), AQ2 0

1GQ'X.)OQ(n/rn)(n/m)k(Q)-(1/2)

= a(n, f)2 (Q)L(k(Q), AQ2 0 OQ1X,,,,)

164

H. HIDA

To get the p-adic interpolation, we need to remove certain Euler factor at p and divide the special value by a certain period. However the Euler factor and the period are the same for n and m under the condition of the theorem. Thus using two variable p-adic L-functions, the above identity can be stated as : a(m, f)(Q)2GP(Q2, Q; A ®V)-1xn.) I

Q(n/m)(n/m)k(Q)-(1/2)

= a(nf) (Q)2GP(Q2, Q; A ® W-'X.). If £P(Q2, Q;

A®V)-1xn) = 0 for all Q

as above, the p-adieG-function GP(A®

'-1xn) vanishes. Hence there is nothing to prove. If L ,(A ®/-1xn) #0, by the assumption of the theorem, GP(A ®O-1x,,,,) # 0. Then we may assume

that GP(P2, P; ® -1xm)Gp(P2 P; A ® 0 by moving around P. Then we may assume by Theorem 1 of [Wa2] that the m-th and n-th Fourier coefficients of f are both non-zero. Therefore -IX")

0. Thus we can take 4) = a(n, f)/a(m, f). Now we have the a(m; f)a(n; f) evaluation property of 4) described in the theorem for almost all P. Note that .CP(P, Q; A ®0-1xn) for a fixed n is a p-adic analytic function of (P, Q) (see

[K] and [GS]). Thus as long as the removed Euler factor does not vanish,

we get the result. The only case where the Euler factor vanishes is the case where k(P) = 1 and the character of ir(P2) is trivial. However this case is excluded because of the vanishing of the p-adic L-function in the denominator at (P2, P). = 0 4=* L(k(P), Apt ®1/Ip1xn) = 0 if either k(P) > 1 or 02 # 1, Theorem 3 follows from Theorem 4. Since GP(P2, P;

A®z0A00-1)(n)

Manuscrit recu le 20 juin 1993

ONA-ADIC FORMS OF HALFWIEGRAL WEIGHT FOR SL(2)/Q

165

References [C] W. CASSELMAN. - On some results of Atkin and Lehner, Math. Ann.

201 (1973), 301-314. [GS] R. GREENBERG and G. STEVENS. - p-adic L-functions and p-adic periods of modular forms, Inventiones Math. 111 (1993), 407-447. [H1] H. HIDA. - p-adic L functions for base change lifts of GL2 to GL3, Perspective in Math. 11 (1990), 93-142. [H2] H. HIDA. - On p-adic Hecke algebras for GL2 over totally real fields, Ann. of Math. 128 (1988), 295-384. [H3] H. HIDA. - Nearly ordinary Hecke algebras and Galois representations of several variables, JAMI inaugural conference proceedings, 1988 May, Supplement to Amer. J. Math. (1990), 115-134. [H4] H. HIDA. - A p-adic measure attached to the zeta functions associated with two elliptic modular forms H, Ann. 1'Institut Fourier 38 No 3 (1988), 1-83. [H5] H. HIDA. - Elementary theory of L -functions and Eisenstein series, LMS Student Texts tenbfbk 26, Cambridge University Press, 1993. [H6] H. HIDA. - On nearly ordinary Hecke algebras for GL(2) over totally real fields, Adv. Studies in Pure Math. 17 (1989), 139-169. [H7] H. HIDA. - Geometric modular forms, Proc. CIMPA Summer School at Nice, 1992. [JL] H. JAcQuET and R.P. LANGLANDS. - Automorphic forms on GL(2), Lecture notes in Math. 114, 1970. [KM] N.M. KATZ and B. MAZUR. - Arithmetic moduli of elliptic curves, Ann.

of Math. Studies 108, Princeton University Press, 1985. (K] K. KITAGAWA. - On standard p-adic L functions of families of elliptic

cusp forms, preprint. [MTT] B. MAZUR, J. TATE and J. TEITELBAUM. - On p-adic analogues of the conjectures of Birch and Swinnerton-Dyer, Inventiones Math. 81 (1986), 1-48. [Sh 1 ] G. SHIMURA. - On modularforms of half integral weight, Ann. of Math. 97 (1973), 440-481. [Sh2] G. SHIMURA. - On certain reciprocity laws for theta functions and modular forms, Acta Math. 141 (1978), 35-71.

166

H. HIDA

[V] M.-F. VIGNERAS. - Valeurs au centre de symetrie des fonctions L associees awcformes modulaires, Seminaire de Theorie des Nombres, Paris 1979-80, Progress in Math. 12, Birkhauser (1981), 331-356. [Wall J.-L. WALDSPURGER. - Correspondance de Shimura, J. Math. pures et appl. 59 (1980), 1-133. [Wa2] J.-L. WALDSPURGER. - Sur les coefficients de Fourier des formes modulaires de poids demi-entier, J. Math. pures et appl. 60 (1981), 375-484. [W] A. WEIL. - Sur certain groupes d'operateurs unitaires, Acta Math. 111, 143-211. [Wi] A. WILES. - On ordinary A-adic representations associated to modular forms, Inventiones Math. 94 (1988), 529-573. Haruzo HIDA

Department of Mathematics UCLA

Los Angeles, Ca 90024 U.S.A.

Number Theory Paris 1992-93

Structures

sur les reseaux

Jacques MartinetN

PREMIERE PARTIE : rappels sur les reseaux

1. - On note E un espace euclidien de dimension n, souvent identifie par le choix d'une base orthonormee de E. La norme d'un vecteur x E E est N(x) = x.x, le carre de la norme euclidienne 11x1j. Par reseau, on entend un sous-groupe discret A de E de rang n. La norme de A est N(A) = minxEA,x#o N(x). On pose S(A) = {x E A I N(x) = N(A)} et s(A) =

Le determinant de A est le determinant de la matrice de Gram d'une base de A (matrice des produits scalaires deux a deux des vecteurs !'IS(A)I.

de la base). L'inuariant d'Hermite d e A est y (A) = N(A). det(A)et la constante d'Hermite pour la dimension it est rye,, = SUPA -y (A).

On dit qu'un reseau A est entier si le product scalaire de E est a valeurs entieres sur A, et qu'il est pair si ses vecteurs sont de norme paire. Le reseau dual de A est A* = {x E E I Vy E A, x. y E Z}. Les reseaux entiers sont les reseaux qui sont contenus dans leur dual. Ceux qui sont egaux a leur dual sont dits unimodulaires; ce sont les reseaux entiers de determinant 1.

Les reseaux que nous rencontrerons seront tous proportionnels a des reseaux entiers. Dans ce cas, it existe une plus petite norme qui les rend

entiers. On definit l'invariant de Smith d'un reseau A en considerant le reseau entier A' qui lui est ainsi associe; le couple (A'*, A') de Zmodules libres de rang n possede lui-meme un invariant de Smith (suite des "facteurs invariants" ou "diviseurs elementaires"), qui est l'invariant

L

de Smith Smith(A) de A. Si Smith(A) = (al, ... , a,,), on a a,z = 1 et Smith(A*) = (a , _an-1 , ... , a). a, 2. - Soit E' un sous-espace de E de dimension r coupant A suivant un reseau A' de E'. Alors, E'1 coupe A* suivant un reseau A'1 de E'1, et *

Recherche effectuee au sein de I'unite mixte C.N.R.S.- Enseignement Superieur U.R.M. 9936

168

J. MARTINET

l'on a entre determinants la relation

det(A') = det(A). det(A") . Considerons le cas particulier dans lequel it existe une similitude u de A sur A*, que nous prenons egale a 1'identite dans le cas unimodulaire. On associe alors a tout reseau A' comme ci-dessus le reseau relatif A' = u(E')1 n A C A. En designant par le rapport de similitude, on obtient la formule

det(A' ) = A' det(A). det(A') .

3. - Soit A un reseau. Nous appellerons defaut de perfection de A la difference entre la dimension ('2 1) de 1'espace Ends(E) des endomorphismes symetriques de E et celle du sous-espace de Ends (E) engendre par les projections sur les directions des vecteurs minimaux de A; on appelle relation d'eutaxie toute expression de 1'identite de E comme combinaison lineaire de ces projections. On dit que A est parfait s'il est de defaut nul et qu'il est eutactique s'il possede une relation d'eutaxie a coefficients positifs. On sait (theoreme de Voronoi) que A est extreme (c'est-a-dire qu'il realise un

maximum local de son invariant d'Hermite) si et seulement s'il est parfait et eutactique. (En exprimant les endomorphismes de E dans un couple de bases (13,13*) ou 13 est une base de A, on transforme ces definitions geometriques issues de [Be-M1] en les definitions classiques de la theorle des formes quadratiques.)

Une condition suffisante de perfection, due a Barnes, est 1'existence d'une section hyperplane parfaite de meme norme et de n vecteurs minimaux independant en-dehors de cette section (perfection relative). Soit Ao un reseau de dimension no. Cette condition de perfection relative est verifiee par les reseaux de dimension no+ 1 dont le determinant est minimum parmi ceux possedant A0 comme section hyperplane de meme norme. Les reseaux

faiblement lamines au-dessus de Ao sont ceux que l'on obtient en iterant le procede ci-dessus, et l'on parle de reseauxfortement lamines dans le cas de ceux qui sont de determinant minimum dans chacune des dimensions no, no + 1, no + 2.... (ce vocabulaire est emprunte a Plesken et Pohst qui ont etudie les variantes des procedes de lamination dans lesquelle on considere des reseaux entiers de norme donnee). Les reseaux lamines sans autre precision sont ceux qui ont ete obtenus par Conway et Sloane par "laminations fortes" au-dessus de Ao = {0} auquel est attribue la norme 4 ([C-S], ch. 6); pour n < 8, ces reseaux, notes A,,,, sont les renormalisations a la norme 4 des reseaux {0}, Z, A2, A3, ID4,1Th5, E6, E7, ]E8, puisque la cons-

tante d'Hermite est atteinte dans ces dimensions sur les reseaux qui leurs sont semblables ("theoreme de Blichfeldt-Vetchinkin"; Korkine et Zolotareff

pour n < 5, Barnes pour n = 6).

STRUCTURES ALGEBRIQUES SUR LES RESEAUX

169

Certains resultats de perfection et d'eutaxie que nous presentons dans cette note ont ete obtenus en utilisant deux programmes de Batut, l'un calculant le rang des projections sur les vecteurs minimaux d'un reseau defini par une matrice de Gram et indiquant s'il existe une relation a coefficients d'eutaxie egaux, et l'autre dormant une base de 1'espace des relations qui existent entre ces projections et l'identite, ainsi que divers programmes disponibles dans le systeme PART.

L'inuariant dHermite dual de A, introduit dans [Be-M1], est ^1' (A) _ (N(A)N(A*))1/2 ; sa borne superieure sur les reseaux ("constante de BergeMartinet" de [C-S31) est notee y,,,. On dit que A est dual-extreme si son invariant y,,, est un maximum local. Pour qu'il en soit ainsi, it suffit ([BeM1], 3.20) que A soit extreme et que A* soit eutactique. 4. - Rappelons les definitions de quelques reseaux classiques (cf [C-S], ch. 4). Soit (Ei), 0 < i < n (resp. 1 < i < n) la base canonique de Zn+1 (resp. de 7Ln). On pose

An = {xEZn+1I

xi=0} et IIDn={xE7LnI>,xi-Omod 2}.

Ce sont des reseaux pairs de norme 2. Le dual de D. est le reseau cubique centre, de norme 1 lorsque n est > 4. 11 est isometrique au sous-reseau de Z muni de la forme 4 E xiyi defini par les n-1 congruences x1 = X2 xn mod 2. Pour n pair > 8, soit 1D ,+ = IIDn U (El + E2 +

+ En)Dn. On obtient un reseau isometrique en considerant le2double systeme de congruences

XI -x2-...-xnmod2 et sur Zn muni de la forme 4 E Xi yi. Sous cette forme, on voit que IIDn est isometrique a son dual, et meme qu'il est unimodulaire pour n - 0 mod 4, pair pour n = 0 mod 8. On pose B8 = D8 , et l'on definit E7 (resp. E6) comme l'orthogonal dans ]E8 d'un vecteur minimal (resp. d'un sous-reseau isometrique a A2), cf. No 2 (a isometrie pres, les choix faits ci-dessus sont sans importance). Les reseaux de racines sont les sommes orthogonales de reseaux de racines irreductibles, isometriques a Z, An (n > 1), IIDn (n > 4) ou En (n = 6,7,8). Ces derniers sont extremes, ont des duals eutactiques, et sont donc aussi dual-extremes. DEUXIEME PARTIE : autour du reseau de Coxeter-Todd

5. - Soit A 1'anneau des entiers d'un corps de nombres K totalement reel ou de type C.M., de degre q. On note x H x l'involution de K (1'identite

170

J. MARTINET

dans le premier cas), et l'on munit K de la forme bilineaire TrK/Q(\µ) (ou parfois d'une forme qui lui est proportionnelle), ce qui fait de A un reseau entier de I[8 0 K.

Soit a un ideal de A stable par l'involution de K. On considere sur A' les congruences suivantes : moda (CO Al-A2-..._A,,,,, (C2) (C'2)

=0 mod a =0 mod a2,

qui definissent des reseaux de dimension n = qm. La congruence C'2 n'interviendra qu'en meme temps que la congruence C' I et seulement lorsque m est un multiple de la norme de a. En notant d le discriminant du corps K (i.e. le determinant du reseau A), on trouve pour les determinants des reseaux definis par les congruences C l (resp. C2, resp. Cl et C'2) les valeurs IdImNK/Q(a)2(m-I) (resp. IdImNK/Q(a)2 resp. jdImNK/Q(a)2m).

On observe que, pour m E a, le reseau defini par Cl ou par Cl et C'2 est encore entier lorsqu'on munit A de la forme -LTr(A7) : on a en effet A77

=AlDµt=mA1µ1moda.

i

Les determinants donnes ci-dessus sont alors a diviser par mq"`.

6. - Dans les numeros 6 a 9, sauf dans la remarque 8.3, A est l'anneau Z[w] (w2 + w + 1 = 0) des entiers d'Eisenstein. Les congruences C 1, C2, C'2 ont ete considerees dans les annees cinquante par Coxeter, Coxeter et Todd, et Barnes ([Cox), [Cox-T], [Bar]).

Soit n = 2r + t. Le reseau Lr de Barnes est forme des elements de Ar+t qui verifient la congruence C2 et dont les t dernieres coordonnees

sont reelles. Les reseaux Ln sont parfaits pour n > 5 et r > 2 ([Bar] ; cela se voit par reduction a la dimension 5 en utilisant des arguments de perfection relative). Pour n = 2r > 6, ces reseaux sont extremes et dual extremes ([Bar], [Be-M1]). Dans le cas n = 6, r = 3, considere initialement par Coxeter ([Cox]), on trouve un reseau semblable a E6*, et l'on obtient donc E6 par la congruence C 1 avec la forme Tr. s est defini par les congruences C 1 Le reseau de Coxeter-Todd, note K12, et C3, avec la forme Tr. Cette definition par congruences, jointe au fait que

s a son dual, montre que x H lwx est un isomorA -- A2 est semblable phisme de K12 sur son dual, un resultat note par Conway et Sloane, qui 11

1'interpretent en faisant remarquer que K12 est Z[w]-unimodulaire ([C-S], ch. 4, § 9). Une variante de cette construction, analogue a la definition de 1, 1, 1 1, 1). Dn (cf. n° 3) consiste en 1'adjonction a L62 du vecteur 1

i (l

Sous cette forme, on voit immediatement que K12 est extreme et dualextreme (on a des resultats analogues dans toutes les dimensions multiples de 6 et

STRUCTURES ALGEBRIQUES SUR LES RRSEAUX

171

> 12).

Il est facile de verifier que le reseau A6 - IE6 se plonge dans K12. Comme les reseaux A,,, realisent la constante 7,n pour n < 6, ce sont les reseaux de plus petit determinant contenus dans K12 pour les dimensions comprises entre 0 et 6; c'est la serie K,. pour 0 < n < 6. La methode du no 2, appliquee

en prenant u = (x H 11 l.x), permet de construire une suite descendante D K7 D K6 de reseaux dont les determinants sont K12 D K11 . minimaux pour les dimensions comprises entre 12 et 6. On obtient une suite K, 0 < n < 12 en raccordant les deux suites en dimension 6. Cela est bien connu depuis Leech (et egalement entre les dimensions 12 et 24 que nous examinerons plus loin), cf. IC-S], ch. 6, § 1. Toutefois, comme on va le voir, ces plongements ne sont pas compatibles avec les Z[w]-structures qui existent naturellement sur D4 et sur 1E6 (on a rencontre une telle structure dans le cas de IE6, et l'on peut identifier ID4 a l'ordre de Hurwitz 931 des quaternions usuels sur Q, puts plonger A dans 931 par

w I. -1+i+9+k 2

7. - Nous nous interessons maintenant a des reseaux A pour lesquels le produit scalaire est de la forme Tr o h ou h : A -f A est une forme hermitienne (nous dirons simplement Z[w]-reseaux), et nous considerons les plongements qui sont des isometries pour les structures hermitiennes, ce qui est plus restrictif que Metre seulement une isometrie pour la structure euclidienne qui s'en deduit. Le theoreme suivant sera demontre au no 9. 7.1. THEOREME. - Soit A un Z[w]-reseau entier de norme 4. Si n = 4

et si det(A) est < 81 (resp. si n = 6 et si det(A) est < 243), alors A est Z[w]-semblable a D4 ou a L4 (resp. a E6 ou a E6) Ces reseaux ont des A-bases (el, e,2) (resp. (el, e2, e3)) formees de vecteurs minimaux,

et sont definis par les suites de produits scalaires (el.e2i e1.we2) (resp. (el.e2, el.we2 i el.e3, el.we3, e2.e3, e2.w63)); des choixpossibles pour ces quatre reseaux sont les suites (0, 2), (1, 1) (resp. (0, 2, 0, 2, 0, 0), (1,1,1, 1, 1, 1)). [Le th. 7.1 prouve en particulier l'unlcite a Z[w]-isometrie pres des reseaux ID4 et E6. Felt a demontre un resultat analogue par une formule de masse pour le reseau K12 dans son article [Fe] consacre aux reseaux Z[w]-unimodulaires. Des resultats d'unicite concernant en particulier IID4 sur Z[(8] et sur Z[(121 et ]E6 sur Z[(9] figurent dans [Be-M2], th. 4.3 et 4.6].

En examinant les produits scalaires entre vecteurs minimaux de K12, on s'apercoit qu'iI n'est pas possible de plonger A4 - IID4 dans K12 en tant que Z[w]-reseau, et donc non plus A6 - E6. En revanche, la definition de K12 montre que 1'on peut plonger L et L3 - E*. En utilisant la methode du no 2, on construit une suite croissante de Z[w]-reseaux Kn (n pair) plonges dans

172

J. MARTINET

K12, que l'on complete pour n impair en prenant le reseau de determinant minimum parmi ceux qui sont contenus dans Kn+1 et contiennent Kn_1. Ces reseaux Kn, comme les K, , sont bien definis a un automorphisme de K12 pres.

On voit tout de suite que 1'on a Ki = Al - Z, K2 = A2 - A2, K111 = K11, K12 = K12, et que, pour 4 < n < 8, Kn est isometrique a Lr avec r = L J . Le reseau K3, connu des cristallographes (cf. [C-S31) est caracterise a2similitude pres comme le reseau d'invariant ry3 minimum parmi ceux qui satisfont l'inegalite s > 5. Une verification informatique a partir d'une matrice de Gram de K12 montre :

7.2. PRoPosrrioN. - Les reseaux Kn (resp. Kn) sont parfaits sauf pour n = 7 et n = 8 (resp. n = 3 et n = 4) oft le defaut de perfection est egal a 1. Le tableau suivant decrit les principaux invariants des reseaux Kn :

K3 K4 Ke Ks K Ks

reseau det(Kn)

36

81

s(Kn)

5

9

Ks

Kio

162 243 486

729

972

972

36

54

81

135

15

Smith(K') i 12.3 i 9.32 6.33

27

35

18.33 I' 9.3 4

36.3 62.33

Signalons que les reseaux Kio et Kio* (pour lequel on a s = 120) sont extremes et donc dual-extremes. [Voici une construction explicite de ces deux reseaux. Par division par 1 - w, on transforme le vecteur minimal (0, 0, 0, 0, 1 - w, -(1 - w)) de K12 en le vecteur (0, 0, 0, 0, 1, -1) de K12, dont l'orthogonal permet de definir Kip par les congruences Al =- 1\2 =- A3 = 1\4 = A5 mod a et Al + A2 + A3 + 1\4 - A5 = 0 mod a2 sur Z(w)5 muni de la forme hermitienne AA + A2A2 + A3A3 + A4A4 + 2A5A5. On volt que les 135 couples de vecteurs minimaux de K10 sont representes par 34 = 81 vecteurs de composantes de la forme w' et 6.9 = 54 vecteurs obtenus par permutation des 4 premieres composantes de vecteurs de la forme (w'(1 w), -wj (1 - w), 0, 0, 0). On obtient le reseau dual en remplacant dans la forme hermitienne 2.A5A5 par 2A5A5 et en divisant par 1 - w, et les 120 couples de vecteurs minimaux proviennent de 81 vecteurs comme ci-dessus, de 4.9 = 36 vecteurs obtenus par permutation des 4 premieres composantes de vecteurs de la forme (w'(1 - w), 0, 0, 0, -wj (1 - w)) et des 3 vecteurs (0, 0, 0, 0, 3w')].

Grace a des programmes de Batut, on verifie qu'il existe dans les cas de K11 et de K9* une unique relation d'eutaxie. Pour une indexation convenable des directions de vecteurs minimaux, elles ont les formes respectives 41

12

Id

= d

i=2

pi

et

Id = p1 + d

- i=2

pi.

STRUCTURES ALGI;BRIQUES SUR LES RESEAUX

173

On montre que la section de K11 (resp. de KO) par 1'hyperplan orthogonal a la premiere direction minimale definit le reseau Kio (resp. K$), alors que

les autres directions minimales sont asociees a K1o (resp. a des reseaux K8 isometriques au reseau P8 de Barnes, note A(2) dans IC-S], ch. 8, § 6) [Ces proprietes d'eutaxie s'interpretent par 1'existence de deux orbites de plans hexagonaux engendres par des vecteurs minimaux dans K12 = K12 et dans Kio , signalons que K10 possede une section parfaite K9 de meme determinant (972) que

K9, mats avec s = 82 au-lieu de s = 81, les reseaux Kg et Kg ont ete trouves par Barnes ([Bar], II, p. 221)].

8. - En plongeant K12 dans le reseau de Leech A24 et en utilisant la methode du no 2, on complete la suite Kn jusqu'a la dimension 24. II est clair que Yon obtient une suite de sections de A24 qui sont de determinant minimum parmi les reseaux contenant ou contenus dans K12, et que l'on a la relation de symetrie det(Kn) = det(K24_n). On peut proceder de ]a meme facon avec la serie Kn. On commence par munir A24 d'une Z[w]-structure compatible avec le plongement K12 -f A24; on indiquera dans la quatrieme partie comment realiser un tel plongement sur un ordre maximal du corps de quatemion de centre Q ramp en {3, oo}, ce qui est un resultat plus precis. La methode du no 2 permet de prolonger

la suite K,, jusqu'a la dimension 24, les reseaux obtenus etant des Z[w]reseaux pour n pair; on a les relations de symetrie det(K,',) = det(K24_n) et les egalites K13 = K13 et K,, = An pour n = 22,23,24 (alors que la coincidence de Kn et de An a lieu des la dimension 18) ; on definit de meme un reseau K16 a partir de K$ .

Pour etudier ces reseaux K, au-dela de la dimension 12, on utilise la determination par Plesken et Pohst ([PI-PI) des reseaux faiblement lamines

pour la norme 4 au-dessus de K12. Ces auteurs ont trouve un reseau en dimension 13, qui est K13 = K13, deux en dimension 14 qui sont K14 et K14, puis, au-dessus de l'un d'eux, qui ne peut etre que K14, une suite de reseaux de determinants det(K'' ), uniques a isometrie pres, sauf en dimension 16 oit it y a deux reseaux, que l'on distingue par leurs invariants s, qui prennent les valeurs 1218 et 1224. 8.1. TrICOREME. - Le reseau K16 est le reseau d'invariant s = 1224.

Demonstration (1) ( H. NAPIAS). On repere le reseau K22 dans A24 a l'aide de matrices de Gram, on construit la suite descendante des K. jusqu'a la dimension 17, et l'on distingue les sections Kl6 et K6 par leurs orthogonaux. [Elle a egalement montre qu'un seulement parmt les 37 vecteurs minimaux de K17 a pour orthogonal Klg dans K17, resultat analogue a ceux que l'on a observes pour Kli et K9*1.

(2) Par adjonction a L2 du vecteur vi = 1 1 (1, 1, 1, 1, 1, 1, 0, 0, ...)

174

J. MARTINET

pour 2m > 12 puis du vecteur v2 -1, 0, 1, -1, 0, 1, -1, 0, 0, ...) pour 2m > 16, on obtient des reseaux de determinant 3'n puis 3ri-2, obtenant K12 pour m = 6, puis un reseau A de determinant 36 pour m = 8, qui contient visiblement K12 ainsi qu'une suite de sections en dimensions n = 15,14, 13 de determinants det(K,,). On a ainsi construit celui des deux reseaux de Plesken-Pohst qui est K16, et l'on verifie facilement que 1'on a s(A) = 1224. [On construct K18 par adjonetion a L98 de vi, v2 et v3 = ll

(0, 0, 0,1,1,1,1,1,1),

et l'on en deduct des constructions explicites de K17 et de Kist.

8.2. Remarque. - Les reseaux Kn et K;,, n > 12 et K16 sont parfaits. Cela se voit en controlant la perfection relative a partir de la dimension 12. Il est probable, mais non demontre, que les constructions par lamination pour

une norme donnee donnent dans ce cas particulier les reseaux faiblement

lamines au-dessus de K12, un resultat qui entrainerait directement la perfection, comme dans le cas des reseaux An consideres par Conway et Sloane.

8.3. Remarque. - Prenons A = Z[(9] et a = ((1 - (9)2), et soit R6m le reseau defini par la congruence E Ai - 0 mod a4 sur ((a2)-,9 T). On a R6 - E6 ([Cral), d'ou R12 ^' K12 et (R18, 1 1C9 (1, 1, 1)) -- K1'8; on trouvera dans [Bay-M] une construction de K12 comme module de rang 1 sur Q((21).

9. - Nous demontrons maintenant le th.7. 1. 9.1. LEMME. - Soit A un reseau pair de dimension n et de norme m. mny,-n. Alors, A possede n Supposons uerifiee l'inegalite det(A) < 1,+2 vecteurs minimaux independants.

Demonstration. L'inegalite de Minkowski sur les minima successifs d'un reseau montre qu'il existe des vecteurs el, C 2 ,- .. , en de A verifiant

l'inegalite N(el)N(e2) ... N(en) < y7 det(A), qui entraine que l'on a N(el)N(e2) ... N(en) < + n. On peut supposer ces vecteurs ranges par normes croissantes. On montre que ce sont des vecteurs minimaux en raisonnant par recurrence sur leur indice. On a en effet

N(el) ... < '"+2m''; si l'on suppose que el, ... , ei_1 sont de norme m, on trouve pour la norme de ei les inegalites N(ei) < N(ei_l)N(ei)n-i+l

m(

)1/(n-i+l) < m+2, donc N(ei) < m puisque A est pair, d'oU 1'egalite

N(ei) = m,

U

[Pour n = 2,3,. .. , 8, la borne du lemme est egale a 18,48,96, 192, 288, 324, 3241.

9.2. TiiEOREME. - SoitA un Z [w] -reseau entier de norme4, de dimension 4 (resp.6), et de determinant < 96 (resp. < 288). Alors, A est Z [w]-semblable

STRUCTURES ALGEBRIQUES SUR LES RI;SEAUX

175

d D4 ou a L2 (resp. a E6 ou a ]EE). En particulier, les Z[w]-structures sur les reseauxD4, L2 et E6 sont uniques a Z[w]-isometrie pres.

Demonstration. Le lemme montre que A contient n = 4 (resp. n = 6) vecteurs minimaux independants et donc, compte tenu de 1'action de Z[w], qu'il contient un sous-reseau A' possedant une base de la forme (x, wx, y, wy) (resp. (x, wx, y, wy, z, wz)). Le reseau A' est determine par la

donnee des produits scalaires a1 = x. y, bi = x.wy (resp. a1 = x.y, b1 = x.wy, a2 = x.z, b2 = x.wz, a3 = y.z, b3 = y.wz) qui sont majores par 2 en valeur absolue. On a det(A') < 2n det(A2)n/2 = 12n/2 et det(A') > 24 det(D4) (resp. det(A') > 26 det(E6), d'ou la majoration [A A'] < 3 avec

egalite seulement pour n = 6, A' _- A2 I A2 I A2 et [A A'] = 3 (car 2 est inerte dans Q[w]), cas dans lequel A est de determinant 26.3, donc semblable a E6, et oii 1'existence d'autres vecteurs minimaux que ceux des orbites de x, y, z permet encore de supposer que l'on a A' = A.

On peut supposer que x.y est > 0 et minimum parmi les valeurs absolues des produits scalaires des vecteurs minimaux de A' appartenant a deux orbites distinctes, et utiliser 1'automorphisme w'-4 w2 de Z[w] pour echanger x.wy et x.w2y. On voit tout de suite que, en dimension 4, A = A' est obtenu en prenant pour (al, b1) l'un des 4 couples (0, 0), (0, 1), (0, 2), (1, 1), conduisant a des reseaux de determinants respectifs 144, 121, 64, 81, d'ou le theoreme dans cc cas. Dans le cas de la dimension 6, nous avons d'abord prouve "a la main"

1'assertion d'unicite de la Z[w]-structure de Es, qui entraine le resultat analogue pour E6. On observe pour cela que le produit scalaire de deux vecteurs minimaux de EE nest jamais nul, cc qui permet de definir A' en prenant pour (al, bl, (12, b2i a3, b3) l'une des suites (1, 1, 1, 1,1, 1) ou (1, 1, 1, 1, 1, -2), et la seconde se ramene a la premiere en remplacant y par x + w2y. On acheve la demonstration en controlant sur ordinateur qu'il n'y a pas de determinant dans l'intervalle ] 192, 243[, et que la valeur 243 du determinant, lorsqu'elle ne provient pas d'une suite sans produit scalaire nul, correspond a un reseau de norme 2. [Variante pour l'assertion d'unicite concernant 1E6 : on considere un vecteur minimal x de E6 ; on verifie que le reseau 1E6 fl (Z [w]x) L, qui est de norme 2 et de determinant

9, est isometrique a A2 I A2 ; on en deduit que E6 s'identifie a un reseau de la forme (A2 I A2 I A2, 11. (x, y, z)), et l'on observe que x, y, z doivent etre des unites de Z[w] pour que le nombre de vecteurs minimaux soft superieur a celui de A2 I A2 I A2. Par multiplications a droite par des unites, on se ramene au cas

x = y = z = 11.

10. - Nous terminons cette premiere partie par quelques remarques sur la constante ryn. Sloane, dans une lettre a 1'auteur ([S!)), a donne pour

176

J. MARTINET

certaines dimensions < 24 des exemples de reseaux sur lesquels l'invariant

yn prend des valeurs relativement grandes. Pour n < 9, ce sont ceux de [Be-M1], 4.6 et 4.7. Pour n = 10, it indique la valeur 4 pour ryn2, atteinte sur deux reseaux semblables a leur dual (dont D+), cf. [C-S3]; le couple (K'0, Kio*) fournit la meme valeur. Le resultat propose est le meme pour "ii, atteint en particulier sur les reseaux Ail et K11, qui sont tous deux dual-extremes ([Be-M 11, § 4, (a) pour le premier, n° 7 ci-dessus pour le second).

H. Napias a montre que Yon -yn2(K18) = 8 et yn2(K21) = 9. Nous avons rencontre pour la premiere fois le reseau K18 dans un travail de Souvignier ([Soul) consacre aux sous-groupes maximaux de Gln,(Z), dont nous avons extrait le premier exemple d'un reseau L de dimension 21 avec ry21(L) > y'21(A21) (L et son dual sont extremes, et l'on a y21(A21) = 8 <

y21(L) = 8,4 < y21(K21) = 9). Le reseau K20 donne le meme resultat (y202

= 8) que A20 cite dans [Si].

Tous ces reseaux sont dual-extremes. TROIsiEME PARTIE : autour du reseau de Barnes Wall

11. - Soit H2 ou simplement H le corps de quaternions "usuels" de centre Q ramifie en 2 et a l'infini, muni de sa base (1, i, j, k) verifiant

les relations i2 = j2 = -1 et ij = -ji = k, d'ou l'on deduit les relations supplementaires k2 = -1,jk = -kj = i,ki = -ik = j, et soft 9J12 ou simplement 971 l'ordre maximal des quaternions de Hurwitz,

de base (1,i, j,w = -1+2'+k) sur Z; c'est l'unique ordre de H contenant strictement l'ordre 0 de base (1, i, j, k). Notons a l'ideal bilatere engendre

(a gauche ou a droite) par 1 + i; on a a = {x = a + bi + cj + dk E £ a + b + c + d - 0 mod 2}. Munis de la forme Trd(xy), a et 971 s'identifient respectivement aux reseaux D4 et ID*, et l'on2obtient E8 en considerant sur 971 x 931 la congruence >, - JI mod a (ou par adjonction a 9J1 x 971 muni de la forme Trd(xg) de l'element 1+ti (1, 1)). Soit m > 0 un entier et soit n = 4m. On on munit 971 de la forme Tr(Aji) et l'on pose J. = { (A,, , A.) E 9311T I Al + ... Am = 0 mod a}. On verifie

que J,, est un reseau de norme 4, primitif sauf pour n = 4 ou n = 8 ou l'on trouve une renormalisation de IlD4 et de E8, dont le dual s'identifie a .L.(9J1)m. En identifiant D4 a la derniere composante de J,J, et en coupant par les orthogonaux des sections de ID* semblables a {0}, A1, A2, A3, llD4. on

obtient des reseaux Jn, Jn_1, J,t_2i J,i,_3 et un reseau qui s'identifie a J,_4, ce qui definit Jn pour tout n. On a ainsi construit les analogues pour 992 des reseaux L Lnj2J construits par Barnes sur 1'anneau des entiers d'Eisenstein. 11.1. PROPOSITION. - Pour tout n > 1, J,, est un reseau entier de norme 4, qui est une section hyperplane de Jam,+1. It possede les invariants suivants :

STRUCTURES ALGEBRIQUES SUR LES RESEAUX

n = 4h n = 4h + 1 n = 4h + 2 n = 4h + 3

det(Jn) = 22h+4 det(Jn) = 22h+5 det(Jn) = 3.2(2h+4) det(Jn) = 22h+6

177

s = 12h(4h - 3) s = 4h(12h - 7) s = 12h(4h - 1) s = 3(16h2 + 4h + 1)

It est parfait quelque soit n, extreme pour n - 0 ou 1 mod 4, et est dualextreme lorsque n est divisible par 4. En outre, pour n < 12, Jn est un reseau [amine An, et l'on a plus precisement J12 ^-, A 2 et J11 - Aii". Enfin, pour n = 4h + 2 > 9 et 2 E {1, 2, 3}, Jn est de norme e+1 et la configuration S(J,n) est semblable a S(AQ).

Demonstration. Le calcul du determinant et du nombre de vecteurs minimaux ne presente pas de difficult. On verifie aussi facilement que Jn est relativement parfait par rapport a Jn_1, d'ofi 1'assertion de perfection.

On montre que Jn est extreme pour n - 0, 1 mod 4 en montrant qu'il contient un reseau de norme 4 semblable a D, ce qui assure qu'il est dualextreme pour n = 4m vu que (M)m est eutactique. La suite des valeurs des determinants pour 0 < n < 12 montre tout de suite qu'il s'agit de reseaux lamines, que l'invariant s permet d'identifier en dimensions 11 et 12. Enfin, on determine S(J,ry) par recurrence descendante en identifiant J,, a une projection de Jn+1

12. - L'analogue pour l'ordre de Hurwitz des reseaux D8 = E8 et K12 est le reseau A16 de Barnes-Wall (la notation A16 des reseaux lamines est justifiee ci-dessous), que l'on peut definir au choix par le double systeme de congruences Al = A2 = A3 = A4 mod a et Al + A2 + A3 + A4 = 0 mod a2

sur 9314 muni de la forme 2 >

1

Trd(AJi) ou par adjonction a J16 de

[Plus generalement, on definit de facon analogue un reseau J;, pour tout n > 16 divisible par 8, ayant le meme determinant (4'') que 931''1.

On demontre comme dans les cas de DI et de K12 le resultat suivant : 12.1. PROPOSITION. - L'application q'-4 q(1 + i) A16 stir son dual; en particulier A16 est de norme 2.

est une isometric de

Il resulte de cette proposition que, pour tout vecteur minimal x de Ai6, 931x est un reseau de dimension 4 isometrique a ID4. Les orthogonaux de ses sections minimales {0}, A1, A2, A3i )
sions 16,15,14,13,12 de determinants respectifs 256, 512, 768,1024,1024 dont le dernier est visiblement isometrique a J12 f-- A12 ", ce qui prouve bien qu'il s'agit du reseau lamine de dimension 16, et que les sections considerees sont A16, A15, A14, A'13 ATh 12

178

J. MARTINET

De meme qu'il y a 2 orbites de reseaux A2 dans K12, it y a 3 orbites de reseaux D4 dans A16, et I'on constate que les reseaux de dimension 12 que 1'on obtient par orthogonalite sont Ail X, Am12 id, Ail" On construit A g" a 1'aide des deux dernieres orbites (et Ami' A l'interieur de A12" et de Amid comme iI se doit), mais on verifie qu'il nest pas possible de plonger Amid 13

dans A16. Ce resultat se voit egalement en utilisant la construction des reseaux faiblement lamines jusqu'a la dimension 24 par Plesken et Pohst ([P-L]), qui ont en outre montre que le plongement de Amid 13 est possible dans A17-

13. - Considerons toujours la suite des reseaux lamines A, en nous limitant a ceux qui sont des9J1-modules, ce qui impose que n soit divisible par 4. Conway et Sloane ([C-S I]) ont construit la "serie principale" A0 C A4 -J(D4 C A8 - IE8 C A12 X C A16 C A20 C A24 C ... C A48

dont le terme de dimension 24 est le reseau de Leech et dont les termes de dimension superieure ne sont sans doute qu'une possibilite parmi d'autres. 11 s'agit de reseaux qui sont lamines au sens fort en tant que Z-reseaux et qui possedent une 9R-structure, celle de A,,_4 etant induite par celle de A. L'unicite de ces reseaux en tant que Z-reseaux a ete etablie par Conway et Sloane jusqu'a la dimension 24 sauf bien sur en dimension 12. La question de l'unicite en tant que 931-reseaux se pose, ainsi que celle de 1'existence pour les reseaux de dimension 12 autres que Al " 13.1. PROPOSITION. - Un 931-reseau A de dimension 12, de determinant 1024 et de norme 4 dont le dual est de norme 1 est semblable a Ail X ou a Ail"

les configurations respectives des vecteurs minimaux etant respectivement celles de de IID4 I IID4 I IID4 et de D4 .

Demonstration. Pour tout vecteur x minimal de A*, 9JZx est un reseau isometrique a D* ; les sections de A par les orthogonaux des reseaux semblables a {0}, Al, A2, A3, D4 qu'il contient constituent une suite

decroissante de sections de A de normes au moins 4 et de determinants 1024,1024, 768, 512, 256. Le dernier terme de la suite est d'invariant d'Hermite 2, et est donc de norme 4 et isometrique a A8 - E8. Il s'en suit que ces sections prises pour les dimensions croissantes de 8 a 12 sont des reseaux lamines. Comme les 931-reseaux ont un invariant s divisible par 12 (le nombre de couples fu d'unites de SJJi), on dolt exclure A12d. Enfin, la valeur des invariants s (respectivement 12 et 36) determine les structures des ensembles de vecteurs minimaux des duals. 13.2. PROPOSITION. - Le reseau A12"" possede tine structure de 931-reseau.

STRUCTURES ALGgBRIQUES SUR LES RkSEAUX

179

Demonstration. Sigrist([Sil) en utilisant une generalisation de l'algorithme de Voronoi pour les reseaux quaternioniens (cf. [Be-M-SI), puis Lai'hem (ILal) au cours d'une recherche de reseaux quaternioniens entiers de norme 4 et de dimension 12, ont trouve un reseau de dimension 12, de determinant 1024, de dual de norme 1 avec s = 312. La proposition 13.1 entraine que ce reseau est isometrique a A12i", qui se trouve ainsi muni d'une structure de fit-reseau. En ce qui concerne 1'unicite a isometrie hermitienne pres des structures

sur les reseaux lamines de petite dimension, elle est connue dans les cas suivants : A4 (parce qu'il y a une seule classe dans fit), A8 et A24 (traites par Quebbemann dans IQ] a l'aide d'une formule de masse), A16 (Quebbemann,

communication privee). Comme le sous-groupe unitaire de Aut(A16) est transitif sur S(A16), les inclusions A12 c A16 compatibles avec la 931structure A16 imposent que A12 soit en fait Z-isometrique a A 2 . On a encore dans ce cas un resultat d'unicite 13.3. PROPOSITION.- A EJ 1-isometrie pres, le reseau A 12 X porte Line unique

fit-structure.

Demonstration. Son dual est de norme 1 et de determinant 1024 et ses vecteurs minimaux engendrent un reseau de determinant 43 (prop. 13.1), donc d'indice 4, i.e. de "931-indice" a = (1 + i). Par homothetie, on obtient

le reseau A - 9R 1 fit 1 931 tandis que A12 "* devient un reseau A' engendre par adjonction a A d'un vecteur 1+i v ou v = (x, y, z), x, y, z, E 911. Comme on ne change pas A' en remplacant x, y, z par des elements qui leurs sont congrus modulo a, on peut les supposer dans {0, 1, w, w2 }, et 1'egalite

S(A') = S(A) impose que x, y, z soient non nuls. Par multiplications a droite par l'une des unites 1, w, w2, on se ramene au cas ou x = y = z = 1, ce qui montre immediatement que A' est egal au dual de Am' muni de la 911-structure qui a servi a le definir au n° 11. [Un raisonnement analogue permet de traiter le cas de 1E8 : on choisit un vecteur

minimal x de E8; en considerant l'orthogonal de 931x, on plonge 911 1 931 11)4 1 I1D4 dans E8, et l'on reconstruct E8 par adjonction a 1D) 1 1D) d'un vecteur de la forme i1i (x, y). On peut de meme traiter le cas de A16 en l'identifiant au reseau defini par l'adjonction a (1 + i)(931 1 931 1 931 1 931) des 4 vecteurs ( 1 , 1,0,0), (0, 1 , 1 , 0), (0, 0, 1, 1), (1, w, w2, 1), qui engendrent un code de poids 2 sur IF4, et aussi retrouver la prop. 13.3 en identifiant Ail' a ((1 + i)(931 1 931 1 931), (1, 1, 0), (0, 1, 1))l.

Les resultats que nous venons de dormer permettent de determiner pour l'essentiel la suite des reseaux lamines munis de 931-structures jusqu'a la dimension 24 : on a la serie principale de [C-Sll decrite plus haut, une bifurcation en dimension 8 vers le reseau A1" 2 qui est un cul-de-sac vu le resultat d'unicite pour la dimension 16 (et qui pourrait ne pas etre unique en

180

J. MARTINET

tant que 931-reseau, mais c'est peu probable), et peut-etre des bifurcations en dimension 16 vers des reseaux A20 munis de fit-strucures exotiques, et qui seraient alors des culs-de-sac vu le resultat d'unicite pour la dimension 24; cette eventualite est elle aussi peu probable.

14. - On definit de facon naturelle le procede de lamination (au sens faible comme au sens fort) au-dessus d'un 931-reseau A0 dans ]'ensemble des 931-reseaux : it s'agit de suites croissantes de 931-reseaux de meme norme que A0, le plongement d'un reseau dans le suivant etant compatible a ]'action de 931, les determinants verifiant les conditions de minimalite forte ou faible. On s'interesse ici aux laminations fortes dans le cas ou A0 est le reseau de dimension 0 auquel est attribuee la norme 4. Supposons demontre pour une certaine dimension n < 44 que les laminations au sens ci-dessus conduisent a un reseau An lamine au sens usuel, et considerons le terme A' = A' '+4 suivant. Posons m = N(A'*). Pour tout x E A*, le 931-reseau 931x est semblable a 1
semblables a ID4, A3, A2 et A1, qui sont les reseaux les plus denses dans les dimensions 4,3,2, 1; leurs determinants sont 4 m4, 2 m3, 4 m2 et m. Il en resulte que les sections de determinant minimum de A dans les dimensions n - 1, n - 2, n - 3, n - 4, que nous notons An+3, A'+2, An+1, A'' , ont pour determinants les determinants des orthogonaux des reseaux contenus dans 931x, c'est-a-dire m det(A), 4m2 det(A), 2m3 det(A) et 4m4 det(A). U caractere minimal de det(A;j montre que l'on a det(An) < det(An), done en fait det(An) = det(An), ce qui entraine 1'inegalite det(An+1) > det(An+l). Or, Conway et Sloane ont calcule jusqu'a la dimension 48 les determi-

nants des An. Le determinant do de An s'obtient a partir de sa valeur en dimensions < 4 (1,4, 12, 32, 64) par les formules do = 216-nd8_n pour 0 < n < 8, do = 216-ndn_8 pour 8 < n < 16, do = d24_n pour 12 < n < 24 et do = 224-ndn_24 pour 24 < n < 48 (cf. [C-S], ch. 6). II est alors facile de verifier que la minoration de det(An+i) par det(An+1) entraine 1'egalite de ces deux determinants, et que la valeur de m qui s'en deduit entraine det(A,t+4), i.e. que les reseaux lamines sur l'ordre de 1'egalite Hurwitz en dimension n + 4 sont des reseaux lamines au sens usuel. Vu que A8 -- 1E8 est le plus dense des reseaux de dimension 8, on deduit du raisonnement ci-dessus que les 931-reseaux lamines (en tant que 931-reseaux) de dimension 12 sont A12° et A12 a". Le resultat analogue est probablement vrai dans les dimensions 16, 20, 24, pour lesquelles it est generalement conjecture que A16, A20, A24 sont les reseaux les plus denses. Il semble possible de traiter le cas de la dimension 16 en verifiant a la facon

de [La] qu'il n'y a pas d'autres 931-reseaux de norme 4 en dimension 12 que A12° et A12 X. Nous conjecturons plus, a savoir que ces deux reseaux realisent le maximum de ]'invariant `Y12 sur les 932-reseaux.

181

STRUCTURES ALGEBRIgUES SUR LES RESEAUX

QUATRIEME PARTIE : au-deli de la dimension 16

15. - Soit H un corps de quaternions totalement defini. Cela signifie que le centre F de H est un corps de nombres totalement reel et que la norme reduite NrdHIF est positive a toutes les places infinies de F. Etant donne a > > 0 de F, la forme 7L-bilineaire (A, p) - TrF/Q (aTrdH/F (Aµ)) est definie positive. Les ordres maximaux de H deviennent ainsi des reseaux de R ® H, et l'on construit d'autres reseaux par congruences selon un procede deja utilise, cf. no 5 et no 11. Nous examinons dans cette quatrieme partie

des exemples dans lesquels F est soit le corps Q, le corps H n'etant plus le corps des quaternions usuels, soit un corps quadratique reel, le corps H n'etant ramifie qu'aux deux places reelles de F, renvoyant a [Bay-M] pour d'autres exemples. Soit 931 un ordre maximal de H. Pour un ideal premier p non nul de F, it y a seulement deux possibilites : CAS RAMIFIE. On a p931 = ' 32, 9J1/T est une extension quadratique de

ZF/p, et le complete en T de H est un corps gauche. CAS DECOMPOSE. On a 931/p931 ^ M2(ZF/p) et le complete de H en p931

est isomorphe a M2 (Fp). On utilisera surtout la variante suivante des doubles congruences qui

sont intervenues aux no 5 et 11 : on choisit deux ideaux a gauche T et T' dans 9931 au-dessus d'un ideal maximal p de F non ramifie dans H,

en se limitant au cas ou p est l'ideal au-dessus de (2) suppose inerte dans F/Q, et l'on considere pour un entier m > 0 le reseau de )R ® H'' muni du produit scalaire (a,µ) 2Trp/Q(aTrdH/F(Aµ)), de dimension n = 4[F: Q]m, defini sur Al

=A2=...=A,,,,mod T et

=0 mod T'.

Al

[S'il y a de la decomposition au-dessus de 2 dans F/Q, on peut choisir un couple T T' pour chaque Ideal de F au-dessus de 2].

15.1. MiEOREME. - Sous les hypotheses ci-dessus, le reseau defini par la condition (*) est entier et pair et de norme > 4 lorsque m est > 3.

Demonstration. La demonstration de l'integralite se fait par completion en p (notee par le symbole -), ce qui permet de ramener les calculs de norme

reduite a des calculs de determinants d'ordre 2. Par une identification convenable de l'algebre locale a une algebre de matrices, on peut faire en sorte que l'on ait K

K ZK/ \ZK

\2K p/

et

, - \p 2 K

182

J. MARTINET

Identifiant alors Ai E 931 a une matrice de la forme

que les congruences de la condition (*) deviennent

(xi \ zi

Yi

ti), on constate m

yi - y1 mod p et ti - t1 mod p (1 < i < m),

et

xi i=1

i=1

zi

0 mod

Comme la norme reduite dans une algebre de matrices n'est autre que le determinant, on a m

m

A Ai = T riti - yizi = (Exi)t] - (Ezi)J1 = Omodp, i=1

i=1

i=1

i=1

ce qui prouve qu'il s'agit d'un reseau entier pair. Pour minorer la norme de A _ (A1, A2, ... , A,n) suppose non nul, on distingue trois cas :

si les Ai ne sont pas dans T, on a N(A) > m min Nrd(A) > 3minNrd(Ai), donc N(A) > 4;

Si les A, sont dans T et si deux d'entre eux sont non nuls, on a N(A) > 4 puisque les produits A Ai sont dansT n ZF = 2ZF; si un seul des Ai est non nul, c'est un element de 'a3 l', 43' = p931, d'ol

encore le resultat dans ce cas.

16. - Nous prenons maintenant pour H 1'algebre H3 de centre Q ramifiee en 3 et a l'infini, munie de sa base (1, i, j, k) verifiant les relations

i2 = -1, j2 = -3,ij = -ji = k, et donc les relations supplementaires

k2 = -3,jk = -kj = 3i,ki = -ik = j, et pour ordre maximal l'ordre 9713 = 931 de base 1, i, w, iw sur Z ou w = - 2 jest une racine de

]'unite d'ordre 3. (Le choix de 931 importe peu, les ordres maximaux de H etant conjugues, comme dans le cas des quaternions de Hurwitz.) Les unites de 93Z sont {±1, ±w, ±w2, ±i, ±iw, ±iw2 } ; elles forment un groupe isomorphe au

groupe quaternionien d'ordre 12. Le theoreme ci-dessous donne une construction explicite d'une structure de 9313-reseau sur K12, dont ]'existence a ete prouvee it y a peu par Gross (IGro]) :

16.1. THEOREME. - Le reseau construit a l'aide de la condition (*) avec m = 3 sur L'ordre 9313 est isometrique au reseau de Coxeter-Todd.

Demonstration. Le theoreme 15.1 montre qu'il s'agit d'un reseau de norme au moins 4, dont on voit tout de suite qu'il est entier en tant que 9313-reseau et de determinant 36. 11 est donc unimodulaire en tant que 9313-reseau, et donc en particulier en taut que reseau sur 1'anneau des

STRUCTURES ALGEBRIQUES SUR LES RSEAUX

183

entiers d'Eisenstein. Le theoreme de Felt ([Fe)) cite au no 7 entraine qu'il est isometrique a K12. [Le theoreme de Felt montre qu'il s'agit meme d'une isometrie en tant que 7G[tWIreseau; Ch. Bachoc vient de demontrer que K12 est meme unique a 9723-isometrle presj

17. - Soit F un corps quadratique reel, de discriminant d impair. Nous considerons maintenant le corps de quaternions H ramifie exactement aux deux places infinies de F, et nous supposons que l'unite fondamentale a de F est de norme -1, ce qui equivaut au fait que la differente de F possede

un generateur totalement positif, en l'occurence a = ev, si bien qu'un ordre maximal 971 de H, muni de la forme TrK/Q(a-1Trd(Aµ)), definit un reseau Z-isometrique a E8. Un corps de quaternions Ho de centre Q peut etre plonge (d'une infinite de facon) dans un corps gauche H du type ci-dessus : it suffit de choisir un corps F dans lequel les nombres premiers ramifies dans Ho sont inertes ou ramifies dans F/Q et de prendre H = F ®Q Ho, les invariants locaux aux places finies de F etant alors tous nuls. Un ordre arbitraire 0 de Ho etant contenu dans l'ordre 7GF4.7 de H, lequel est a son tour contenu dans un ordre maximal 931 de H. on voit que E8 peut etre muni d'une structure de i:7-reseau sur n'importe quel ordre de quaternions totalement defini sur Z. On verra plus loin d'autres exemples du meme type; signalons simplement ici qu'un resultat analogue s'applique a A16. En appliquant a l'ordre 971 la construction par le double systeme de congruences (*), on obtient un reseau unimodulaire pair A de dimension n = 8m que nous notons simplement Un, sans mettre en evidence dans la notation sa dependance a priori des choix de H, 971,'3, T'. Il est fort possible que la classe d'isometrie de Un (en tant que Z-reseau) ne depende pas de ces choix. C'est ce qu'on constate en dimension 8 (resp. 24), puisque E8 (resp. A24) est alors l'unique reseau unimodulaire pair (resp. et de norme 4, theoreme de Conway). Le cas de la dimension 32 a ete resolu par Coulangeon ([Coul), qui a caracterise U32 comme le reseau unimodulaire pair d'invariant

de Venkov maximum qui est associe au code de Reed-Muller. Quant a la dimension 16, on trouve E8 I E8, comme on le volt en considerant 1'application (A, p) - (A +;t, A - µ). [Pour n > 40, les repartitions des normes redultes dans les suites (Al, A2,. . . , Am ) definissant des vecteurs minimaux sont des permutations de (1, 1, 0, . . . , 0) ; on en

deduct 1'egalite s(Un) = 15n(n - 7) pour n > 40. Les resultats pour n = 24 et n = 32 (et aussi pour n = 40) decoulent de la theorie des fonctions O ; on a s(U24) = 98280 et s(U32) = 73440. Pour n = 32, on dolt ajouter aux 15n(n - 7) = 12000 vecteurs ci-dessus 61440 vecteurs assoctes a la repartition

(1,1,1,1); pour n = 24, on ajoute a 15n(n - 7) = 6120 la contribution des

184

J. MARTINET

permutations de la repartition (2,1,1), soft 92160 vecteurs.

On connaitlsl en dimension 40 quelques autres reseaux unimodulaires pairs de norme 4. Pour celui de McKay (cf. [C-S], ch. 8, § 5), d'apres McKay, le groupe d'automorphismes ne serait pas transitif sur ]'ensemble de ses vecteurs minimaux, ce qui entrainerait que U40 ne lul est pas isometrique. Nous Ignorons st notre reseau U4o coincide avec 1'un des reseaux construits par Eva Bayer dans [Bay] ou par Ozeki dans [Oz]].

Revenons au double systeme de congruences (*) defini par deux ideaux

a gauche maximaux T et T' au-dessus de 2 d'un ordre maximal i3 d'un corps de quaternions Ho de centre Q. Si on choisit un corps F dans lequel 2 et les nombres premiers ramifies dans Ho sont inertes, on plonge comme cidessus Ho dans H et i7 dans un ordre maximal fit de H, et ces plongements transforment le reseau A0 associe au double systeme de congruences en un reseau defini de facon analogue sur 931 a ]'aide d'ideaux maximaux audessus de 2 contenant respectivement ¶ 3 et T'. Ceci s'applique en particulier au cas on 971o est l'ordre note'9J13 au no 16

en prenant F = Q(/5) ou p est n'importe quel nombre premier congru a 5 ou 11 modulo 24, par exemple p = 5. En prenant m = 3, on en deduit une construction du reseau de Leech A24 sur 9R3, utilisee par Tits ([Ti]) dans le

cas du corps F = Q(v), et un plongement de K12 dans A24 compatible avec la 9713-structure dont nous avons muni K12 au no 16, qui justifie la construction de la serie K i au-dela de la dimension 12 que nous avons faite au no 8.

On peut faire une remarque analogue avec ]'ordre 9312 de Hurwitz. On verifie que la construction par double congruence des reseaux J4,,n faite au debut du no 12 conduit au reseau A12 " lorsque l'on prend m = 3 (lorsque m est impair, le determinant calcule au no 12 doit etre multiplie par 24), et l'on en deduit le plongement connu de A12 " dans A24 en tant que reseaux sur l'ordre de Hurwitz.

Manuscrit recu le 8 mars 1993

Pl

Je remercie Eva Bayer pour les references concernant les reseaux de dimension 40

SMUCTTJRES ALGEBRIQUES SUR LES R$SEAUX

185

BIBLIOGRAPHIE

[Bar] E.S. BARNES. - The construction of perfect and extreme forms I, II, Acta

Arith. 5 (1959), 57-79, 461-506. [Bay] E. BAYER-FLUCKIGER. - Definite unimodular lattices having an automorphism of given characteristic polynomial, Comm. Math. Helvet. 59 (1984), 509-538. [Bay--M] E. BAYER-FLUCKIGER et J. MARTINET. - Formes quadratiques liees aux

algebres semi-simples, J. refine angew. Math. (1994), a paraitre. [Be-M 1 ] A.-M. BERGS et J. MARTINET. - Sur un probleme de dualite lie aux spheres

en geometrie des nombres, J. Number Theory 32 (1989), 14-42. [Be-M2] A.-M. BERGS et J. MARTINET. - Reseaux extremes pour un groupe d'automorphismes, Asterisque 198-200 (1992), 41-66. [Be-M-S] A.-M. BERGS, J. MARTINET et F. SIGRIST. - Une generalisation de l'algorithme de Voronoi pour les formes quadratiques, Asterisque 209 (1992), 137-158. [C-S] J.H. CONWAY et N.J.A. SLOANE. - Sphere Packings, Lattices and Groups,

Springer-Verlag, Grundlehren no 290, Heidelberg, 1988. [C-S11 J.H. CONWAY et N.J.A. SLOANE. - Complex and integral laminated lattices,

Trans. Amer. Math. Soc. 280 (1983), 463-490. [C-S2] J.H. CONWAY et N.J.A. SLOANE. - Low-dimensional lattices. III. Perfect forms, Proc. Royal Soc. London A, 418 (1988), 43-80. [C-S3] J.H. CONWAY et N.J.A. SLOANE. - On Lattices Equivalent to Their Duals,

a paraitre. [Cou] R. COULANGEON. - Expose au Sem. Th. Nombres de Paris, (]anvier 1993).

[Cox] H.S.M. COXETER. - Extreme forms, Canad. J. Math. 3 (1951), 391-441. [Cox-T] H.S.M. COXE'I'ER and J.A. TODD. - An extreme duodenary form, Canad.

J. Math. 5 (1953), 384-392. [Cra] M. CRAIG. - Extreme forms and cyclotomy, Mathematika 25 (1967), 4456.

[Fe] W. FELT. - Some Lattices over Q(/), J. Algebra 52 (1978), 248-263.

186

J. MARTINET

[Gro] B. GROSS. - Group representation and lattices, J. Amer. Math. Soc. 3 (1990), 929-960. [La] M. LAIHEM. - Communication privee. [Oz] M. OZEKI. - Examples of even unimodular extremal lattices of rank 40,

J. Number Theory 28 (1989), 119-131. [P1-P2] W. PLESKEN and M. POHSr. - Constructing Integral Lattices With Prescribed Minimum. 11, Math. Comp. 60 (1993), 817-825. [Q] H.-G. QUEBBEMANN. - An application of Siegel's formula over quaternion

orders, Mathematika 31 (1984), 12-16. [Si] F. SIGRIST. - Lettre electronique du 11 septembre 1990 a l'auteur. [S1] N.J.A. SLOANE. - Lettre a l'auteur du 11 mai 1992. [Soul B. SOUVIGNIER. - Diplomarbeit, Aachen,1991.

[Til J. TITS. - Quaternions overQ(f), Leech's lattice and the sporadic group of Hall-Janko, J. Algebra 63 (1980), 56-75.

Jacques Martinet Mathematiques, Universite Bordeaux I 351, cours de la Liberation 33405 TALENCE cedex

Number Theory Paris 1992-93

Construction of Elliptic Units in Function Fields Hassan Oukhaba

i. - Introduction Let k be a global function field and Fq be its field of constants. Fix a place o0 of k. Let Ok be the Dedekind ring of elements of k regular outside of oo, and k,, be the completion of k at oo. For each finite abelian extension F of k we let OF be the integral closure of Ok in F. We know that OF is a Dedekind ring with a finite ideal class

number h(OF). As usual we denote by OF the group of units of OF, and p(F) C OF the finite multiplicative group of non zero constants of F. We have p(k) = Ox = F9 , and in general the quotient group OF/µ(F) is a free abelian group of rank rF - 1, where rF is the exact number of places of F sitting over oo. Now suppose that F C k, which means that the place oo splits completely in F/k. Suppose in addition that one of the following two conditions holds.

1) The extension F/k is unramified. 2) One, and only one, prime divisor of k ramifies in F/k and deg (oo) = 1.

Then one knows that there exists a subgroup EF of OF called the group of elliptic units of F. It is a Galois module generated by the torsion points of certain Drinfeld Ok-modules. It's elements are also obtained as finite products of special values of elliptic functions. The group EF has finite index in OF, cf. ] 10]. Actually when only one prime does ramify in F/k we had succeed to construct subgroups of finite index in OF even if deg (00) > 1,

cf. 191. Unfortunately, the index formula obtained then contains a factor depending on deg (oo) and which is hard to control. When deg (00) = 1 this factor is equal to 1 also and the index formula is just what one can expect. But in general this factor increases proportionaly to deg (00). This means that the subgroups so constructed are not sufficiently large when deg (00) > 1. Hence, one could suppose that there is possibility to obtain larger subgroups of OF, in other words to obtain more units of OF, using

188

H. OUKHABA

new techniques of constructions. This is what we propose to do in the present paper. Our aim here is to define £F, the group of elliptic units of F. We shall expose some of its interesting properties, precise the nature of its elements and calculate its index in O. As we shall see the description of £F is rather easy and almost canonical. Moreover, the "exponential function", which we are going to redefine in the next section, is the only basic material of its construction. Finally we would like to draw the attention to the work of D. Kersey, cf. [7] chap. 12 and 13, which was one of our source of inspiration.

Some supplementary notations Let F C k,,, be a finite abelian extension of k such that the place 00 splits

completely in F/k. let b C Ok be an ideal of Ok prime to the conductor of F/k. Then we will write (b, F/k) for the automorphism of F/k associated to b by the Artin map. Moreover if q is a prime ideal of Ok then qF will denote

the product of the prime ideals of OF sitting over q. Finally, if m C Ok is an ideal of Ok then we know that there exists a maximal finite abelian extension of k whose conductor divides m and which is contained in k,,,,. It will be denoted by H,,,.

2. - Some preliminaries In this section we recall some definitions and results, necessary in the sequel. The reader is invited to consult [1], [4], [9] or [11], where are proved

all the results stated here. Let S2 be the completion at o0 of the algebraic closure of k, Then we call a lattice of l every finitely generated projective Ok-module, contained into Q. To such a lattice r C 0 one can associate its exponential function defined on 1 by :

er(z)`ifnzJJ(1- z). 7Er

ry

7#0

We know that er is defined everywhere and is entire and IFq-linear. It is also an epimorphism and we have er(z) = 0 if, and only if, z E r. Moreover the equation eyr(xz) = x er(z) holds for every x E S2" and z E Q.

When F is contained into a lattice r of SI such that r and r have the same rank as Ok-modules then er and er are related by the formula : (1)

er(z) = P(r/r; er(z))

,

where P(r/r; t) is a linear polynomial whose roots are all simples and

CONSTRUCTION OF ELLIP77C UNITS IN FUNCTION FIELDS

189

constitute the finite group er(r). Its leading coefficient is

6(r

-) arll

)-i

( pEn/r R er(p) p#0

where p describe a complete system of non zero representatives of r modulo F. Let K(oo) be the constant field of k,,.. It is a finite extension of ]Fq. We have [K(oo) 1Fq] = deg (oo). Let us choose s (once of all) a sign-function of :

k,,, i.e., a co-section of the inclusion map K(oo)" -4 k' such that s(z) = 1 if Iz - 11,, < 1. Then one can associate to each lattice r of S2 of rank 1 its s-discriminant OS(r) E SZX and ar an Fq-automorphism of K(oo) such

that X6 (r, x-1 r) = A, (r)

Nx-1 s(x)°r

for all x E Ok\{0}. In the above formula, w,,, is just the number of non zero elements of K(oo), i.e., w... = K(oo)". On the other hand Nx is by definition the exact number

of congruence classes of Ok modulo the ideal xOk. One can show that A3(zr) = z-w°°OS(r) for all z E V<. Moreover we have the equation

6(r, r)- =As (r)[r:r]/Os(r) whenever r C r are lattices of SZ which have rank 1 as Ok-modules. The invariants A (a) associated to fractional ideals a of Ok are used to express the special values of L-functions at s = 0 and hence are related to analytic class number formulas, cf. [31, [51 and [61. In other respects J. Yu has shown

that they are transcendental over k, cf. 1131. However, as in the classical theory, it is possible to construct elements of SZ which are algebraic over k using the above invariants. Indeed let 06 C k,,, be the abelian closure of k in k,,. Let H(1) be the maximal subextension of 06 such that Hlll/k is unramified. Then one knows that the quotient A,(a,)/O3(a2) E H(1) for all fractional ideals al and a2 of Ok. In fact this last quotient generates in OH(1) the ideal

(alai'OH(,))w-.

Now suppose that r is a lattice of SZ of rank 1. Then the lattice t r is well defined for all idele t of k. And if p is an element of the k-vector space

kr generated by r then one can check, using the strong approximation theorem, that there exists u E SZ such that U - tvp mod. tvrv for all the places v

oc of k, where tv is the component of t at v and IF, is the

completion of r at v. The elements u that verify the property above define

190

H. OUKHABA

the same class modulo tr. We shall write tp for any representative of this class. Let or E Gal (S2/k) and t be a idele of k such that the automorphism It, k] of kab/k, associated to t by the Artin map coincide with the restriction of a to kab, cf. [ 111, then there exists a non zero element AQ (t, r) E S2" which verifies the formula er(P)a = AQ(t,

(2)

for all p E kr. It is possible to describe the behavior of A, (t, r) as a, t or r varies, cf. [ 11 ].

In fact if we suppose that O3 (r) = 1 and if the component t of t at each place v # oo is integral, then we have

A,(t, r) = 6(r, t-lr)-1, provided that s(ty) = 1. Let us observe that the automorphism Is, k] of kab/k is equal to the identity map if, and only if, t E k" x kx. Therefore the formula (2) implies that the quotient er(pl)/er(p2) E kab, for all pl and P2 in U. Moreover we have :

= et-lrt-1P1)

(er(P1)1

J

(3)

er (P2)

'

for all idele t of k.

3. - The ramified part of the group of elliptic units Let F C koo be a finite abelian extension of k. Suppose that the conductor of F/k is qn, where q is a prime ideal of Ok and n is a positive integer. Then it is possible to construct elements of F" using the values

ea-iqn(1), where a is any ideal of Ok prime to q. These elements will constitute the ramified part of the group of elliptic units of F. PROPOSITION 1. - Let B be a finite set of ideals of Ok all prime to q. Then

the product (4)

11 eb-1gn(1)nb

bEB

belongs to Hqn provided that the rational integers nb, 6 E B verify the condition

nb=0.

(5)

bEB

This proposition is equivalent to the following one.

CONSTRUC77ON OF ELLIP77C UNrIS IN FUNCTION FIELDS

191

PROPOSITION 2. - The quotient

ea-1gn(1)/eb-1gn(1) E Hqn

for all ideals a and 6 of Ok which are prime to q.

Proof : let a E Gal (SZ/k) and t a idele of k, choosen so that or is equal to the automorphism [t, k] of kab/k. Then Corollary 3.3 of [11) (see also the previous section) implies that ea-1q^(1)° = A,(t,a-1gn)et-la-1gn(t-1 1) b-lgn)et-l6-1gn(t-1

e6-lgn(1)° =

11),

where AQ(t, a-lqn) and A0(t, b-lqn) are elements of Q': which are equal in this case, cf. [I I]. So that we get the formula ea-lgn(1)

(6) Ceb-lgn(1)

et-la

)

1gn(t-1.

1)

et-16-1gn(t-1.1)

Now suppose that a is the identity map on kab, then we know that t E k" x kx. In this case the quotient on the right of the above formula is equal to ea-1qn (1)/e6-1 qn (1). This means in particular that this last quo-

tient belongs to kab. In fact using class field theory we see also that it is in Hqn.

0

PROPOSITION 3. - Let a, 6 and -0 be integral ideals of Ok, all prime to q. Then we have ea-lgn(1) (7)

(0,Hgn/k)

e6-Ign(1))

- ea-11i-1gn(1)

ea-1a-1gn(1)

Proof : proposition 3 is easily derived from above formula (6) and the o well known properties of the Artin map. PROPOSITION 4. - Let B be a finite set of integral ideals of Ok, all prime to q. Let nb, b E B, be rational integers which verify (4). Then the product 11 eb-'gn(1)nb

bEB

generates in the integral closure OHgn of Ok in Hqn the ideal

(flbnb) bEB

192

H. OUKHABA

geneProof : all we have to prove is that the quotient rates in OHgn the ideal (a OHgn)-1. So, put a df n (a, H, -1k) and consider the element cp(a) of Q' defined as follows (8)

Bo (a)

df n OS(a-1qn) [ea-lqn (1)] --

It is obvious that cp(a) depends only on or (and not on the ideal a). The behavior of this invariant is well known, cf. [9) chap. IV. In particular cp(a) E OHgn and we have cp(a)'r = W(ar), for all a, -r c Gal (Hqn/k). Moreover it verifies the following norm formula NHgn/x(l) ( \P(a))

u'k

Os(a-1) O5

a-l ) am

Fq . which implies that p(a) generates in Oxgn the ideal qH, where Wk Indeed the quotient A,(a-1)/O3(a-1q) generates in OH(,) the ideal qH(1). On the other hand the extension Hqn /H(l) is totally ramified at each prime factor of qH(,). Now since we have the relation ov(a) C

egn(1)

Yo-

o(1)

A.(qn) O3(a-iqn)

and since OS(gn)/os(a-1qn) generates in OH(1) the ideal cf. [11) Proposition 3.7, we can deduce that the quotient ea-1 qn(1)/egn(1) generates in °Hqn the ideal (a-1 OH,,)DEFINITION 1. - Let F C k,,, be a finite abelian extension of k. Suppose that the conductor of F/k is equal to qn. Then we set SF to be the sub-group of F" generated by all the norms

NlignIF

egn(1) Cea-ign(1)

where a describes the set of integral ideals of Ok which are prime to q.

4. - The unramified part of the group of elliptic units We describe below the method of construction of non zero elements of those unramified abelian extensions of k which are included into k,,,. The connexion with the torsion points of certain Drinfeld 0k-modules is explained in [101, proof of Theorem 2.2.

193

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

Let r c r be lattices of 5l such that the index [r : r] of r in r is finite. Then one can define the function

(z

,

r r) aIn S(r, r) er(z)jr:r] er(z)

z E Il,

which is well defined on the complement of r\r in Q. It vanishes on IF and, in fact, is elliptic (i.e. periodic) with respect to F. Moreover, as a rational function of er(z) its divisor is [r

:

r](0)r

-

T, (P)r. PEI'/r

On the other hand, we have the homogeneity formula

T (Az ; Ar, Ar) = T (z; r, r), for all A E W.

Also if rl C r2 C r3 are lattices of l2 such that the index [r3

:

r1] of rl

into r3 is finite, then we have rl,r2)[r3:r21W(z;

W(z; rl,r3) = `1'(z;

(9)

r2,r3)

PROPOSITION 5. - Let M, M and r be lattices of Il such that M C M, M C r and r f1 M = M. Consider the lattice r L fn r -+- M and choose S a complete system of representatives of r modulo M. Then we have the distributivity formulas (10)

(z; r' r) _ f T(z+P; M, 79), pES

(11)

SWIM)

f W(P; M, M) PES P#o

Proof : this is a simple consequence of the formula (1).

o

Now suppose that r C r are lattices of fI of rank 1. Then the value T (p ; r, r) E kab for all p E kF. Indeed, %P (p; r, r) is a product of quotients

of the form er(Pl)/er(P2), P1, P2 E kr, which are elements of kab by Theorem 3.2 of [ 11) (see also § 2 above). Moreover, the formula (3) implies the following property (12)

for all idele t of k.

''(P; r,r)]t'k] = q(t-lp; t-1r t-ir)

194

H. OUKHABA

PROPOSITION 6. - Let m 54 0 be a proper ideal of Ok. Let a be a integral ideal of Ok prime to m. Then we have

'(1 ; m, a-1m) E Hm. Moreover if 6 is a ideal of Ok prime tom then the automorphism (6, Hm/k) of Hm/k applied to W(1; m, a-lm) gives W(1; m,

a lm)(b,Hm/k) = iy(1; b-lm a 16-1m)

Proof. this Proposition is a direct consequence of (12). See also the alternative proof given in [10].

o

The above Proposition 5 and Proposition 6 have the following remarkable consequence.

PROPOSITION 7. - Let p be a prime ideal of Ok and n > 0 be a positive integer. Let a be a integral ideal of Ok prime to p. Then we have (13)

NHpn+1 /Hpn

(14)

(,P(1;

pn+l a-lpn+1)) =

NH,/HC,) (`1'(1; p, a

1p))wk

T(1; pn, a-lpn),

= b(Ok, a-l) 6(p, a-lp) Na_1

Moreover the ideal of OHpn generated by the value W (1; pn a-lpn) is pHpn where wk = OIFy .

Proof : let X be a complete system of representatives in Ok of (Ok/p)" modulo the group F9 . Then the elements of Gal (Hp/H(l)) are precisely the automorphisms (x, Hp/k), x E X. On the other hand if we put M = p,

M = a-lp and r = Ok then we have r = I + M = a-'; moreover the set {lx, E IFq and x E X} is a complete system of non zero representatives of r modulo M. Therefore the formula (14) is just the identity (11) applied to this precise case. The formula (13) is obtained from (10) proceeding as above. Now let us observe that we have the equation '(1e ; pn a-lpn)w00 = (p(1)(Na-(a,Hvn/k))

which implies in particular that W(1; pn, a-1pn) generates in OH,n the ideal pHpn

since the ideal generated by cp(1) is p1 , cf. §3.

0

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

195

Now let L C k, be an unramified abelian extension of k. Then, for a given prime ideal p of Ok we shall write Rp,L for the subgroup of L" generated by µ(L) and by all the norms NH /L (T (1; pn,

a-lpn))

where n is any positive integer and a is any integral ideal of Ok prime to p. In fact n can be fixed according to the formula (13). On the other hand if p' is a prime ideal of Ok such that the automorphism (p', L/k) is equal to (p, L/k) then we have, cf. (10) Theorem 3.2,

Rp,L (1'L = Rp, L II

LL.

The group Rp,L n OL will be noted £L,, if a = (p, L/k). DEFINITION 2. - We define RL to be the subgroup of L" generated by all Rp,L, where p is any prime ideal of Ok Hence we have RL

df n

Rp,L p

Remark 1 : the group EL of the elements of RL which are units of OL is called the group of elliptic units of L. We have £L

dfn

RL f1 O =

fi

EL".

oEGaI (L/k)

The quotient group OL /£L is finite, cf. (101. We have the index formula (15)

[OX

:

£L] =

h(OL) [H( j)

Remark 2: the formula

T(1; b-'Pn a-lb-lpn) = W(1; pn, a-'b-lpn)/`W(1; pn, b-lpn)Na, verified for all prime ideal p of Ok and all integral ideals a and b of Ok not divisible by p, shows clearly that the group Rp,L is stable under the action of the Galois group of L/k. Thus the groups RL and £L are also stable under

196

H. OUKHABA

the action of Gal (L/k). Therefore, using formula (10), one can prove that p(L)RLk is generated by p(L) and by all the quotients 8(Ok b-1) NH( 1)/L

b(a, b-1 a)

where a and b are any integral ideals of Ok which are coprime. Hence, the group RLk"'°° is generated by all the elements of L" of the form Os(OIc)l Nb Os(b-la) Os(b-1)

NH(I)IL

\ Os(a) /

where a and b are as above.

Remark 3 : using the description of µ(L)RLk just given above and the fact that the order Wk of IFq is the g.c.d. of the integers Na - 1, a ideal of Ok, one can prove that we have NH(1)/L (11 b(Ok, b-1)1b ) E RL , bEB

where 13 is a finite set of integral ideals of Ok and nb, b E 13 are rational integers such that

Enb(Nb-1)=0. bEB

DEFINITION 3. - LetT E Gal (L/k).

i) If L = Hill, then we put h = h(Ok) and dfnWooA,(b)h aH(1)(T)

where b is any fractionnal ideal of Ok such that (6,H(l)/k) = -r-1 and bh = aOk, with a E k. ii) In general, we put : OL(T) dfn =

H

T E Gal (H(1)/k) TIL = T

OH(,)(T).

197

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

One can show, cf. [ 11 ] Lemma 3.5, that the quotient aL (T1) /aL (T2) is a unit of OL. Moreover if T E Gal (L/k) then we have the property T

,L(r1)

1L(T1T) 1L(T1T)

OL(T2)

which implies in particular that NH(1)/L

(OH(,)(T1)

49L (T1)

OH(,) (r2)

aL(T2)

where T1 and T2 are automorphisms of H(1) /k such that -Ti,, = Ti, i = 1, 2.

Finally we see that the group IZ,,, Oh is generated by all the elements of L" which have the form

8L(1) Na, OL(TT') (0L(7)) CUL(T'))x

wO°(1-Na,,)[H(1):L]

Gal (L/k) and aT, is any integral ideal of Ok such that (a,,, L/k) = T'. The element x of k" is such that there exists b a ideal

where T, T' e

of Ok which verifies (b, L/k) = T and bh = XOk.

In particular, for any A E k" take b = .Ok and x =

Ah

so that T dfn =

(b, L/k) = 1 and .hw-(Na-1)[H(l) : L]

a any ideal of Ok,

na(Na - 1) for a well suited finite set U of

belongs to RL'`" °°h. As wk = aEU

integral ideals of Ok, and convenient integers na, a E U, we get COROLLARY. - The group kxwkwooh[H(1) : L] is contained into

RLkw,oh

4. - The group of elliptic units Let F C k,, be a finite abelian extension of k such that the conductor of F/k is equal to qn, where q is a prime ideal of Ok and n is a positive integer. df° We put F(1) F fl H(1) and we define a subgroup RF of F" by setting (16)

DEFINITION 4. - Let £F be the intersection of RF with the group of units of OF, i.e., (17)

£

F

dfr`

RFfl OxF

H. OUKHABA

198

We call SF the group of elliptic units of F.

Our goal in the present section is to describe the group eFkw°°h in a manner which will allow us to calculate its index in OF. Therefore we have

first to introduce some "new" elements of OF defined as the norm from Hqn down to F of the invariants W(o ), o E Gal (Hqn /k), defined by the formula (8). DEFINITION 5. - Let T E Gal (F/k). Then we put WF(T) dfn

where z E Gal (Hq.. /k) is such that TIF = T.

Remark 5 : let M C k. be a finite abelian extension of k. Let a be a integral ideal of Ok prime to the conductor of M/k. Then we know that (a,M/k) = CNa for all £ E µ(M) .

In particular if (a', M/k) _ (a, M/k) then Na' - Na mod. wM where wM dfn

Op(M). This means that we have a well defined Dirichlet character OM : Gal (M/k) -+ (7G/wM7L)" T '--' AGM (T)

given by the condition AM (T) - Nb (mod. wM),

where b is any integral ideal of Ok such that T = (b, M/k). We have 'M(T) - 1(mod. wk) for any T E Gal (M/k) so that we can make the following construction : For IM the augmentation ideal of the group ring Z[Gal (M/k)], and each integer f > 1, we define

M : Imt -* Z/mZ, with m = wM/wk, to be the following surjective morphism XPM

nT1,T2...... e

((,l

- 1)(T2 - 1) ... (Te - 1)/

T1, 72 .....Te

dfn

E

' 'Tl ,T2 ,...,Te

T1,T2......

e

M(T1) - I" CXPM(T2) - 11... CWM(Te) - 11 wk

wk

Wk

J

These operators are well defined, as will be made clear in some further work; here we just need to have IF(') and 0M) to our disposition, and the following lemma relating them :

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

199

LEMMA 1. - For any element A of IM we have

M (wkA) _ ` (l (A)

mod.

wM 1

.

wk)

Proof : obvious. dfn

F fl H(1) and, for q" the conductor of F/k; dl [FH(1) F] = [H(1) F(1)] so that FH(1)] and d2

PROPOSITION 8. -Let F(1) dfn

: [Hqn : : put d1 Z/mZ and' F(), ) : IF(,) -> 7G/m7L, d1d2 = [Hqn : F]. Let WF(1) : IF(1) with m = wF(,) /wk, be the surjective morphisms defined as above. Also put W = wkw,,,h. Then the group SF is formed of all the products

(18)

LI

TT

oEGal (F/k)

8F(,) (T)n-

-rEGal (F(1)/k)

when the elements Emo(u) of Z[Gal (F/k)] and Emr(T) of Z[Gal (F(1)/k)] are such that 1) Emo(u) E IF; 2) En,(-r) E 1F(1), i.e., EnT(T) E IF(,) and fl T"T = 1 ;

3) consider the other element EMI(T) of IF(,) defined by Tqn dfn E mo T E Gal (F(1) /k) ; then request (q", F(1)/k) and MT

dfn

o EGal (F/k) 'n IF(,)

d1TFl1)

(Mr(r))+w(2) (>nT(T)) T

T

0 mod. _F(')

(

.

wk

Proof : let us see that any element a of F" satisfying the above conditions 1) to 3) belongs to EF : so we fix elements Emo(a) of Z[Gal (F/k)] and En, (T) of Z [Gal (F(1)/k)] satisfying 1) to 3); in particular we define MT for T E Gal (F(1) /k) by

M

T

dfn

T,

ma

EGal (F/k)

of F(,)=''qn

a) First by conditions 1) and 2) we have that Emo(a) and En, (T) are elements of the augmentation ideals IF and IF(,) respectively, so that we have :

aEOF

200

H. OUKHABA

and the only thing we have to prove is : w

aE

SFw

b) Arbitrarily, choose a finite set Z of integral ideals of Ok, all prime

to q, such that the Artin map

a'-' (a, F/k) define a bijection from Z to Gal (F/k) ; thereafter put dfn

Mb = m(b F/k).

Also, for each T E Gal (F(1) /k), choose an integral ideal aT of Ok and an element xT of k" such that (a,, F(1) 1k) = T and ah = XTOk. c) Then a can be written as the product ABC where A, B and C are as follows

A dfn = NIJq

T7 eb-lgn(1)""

IF

bEZ

B

wk

dfn

CF,1) ( T )

d1m°

xd1m°

a

iEGal (F/k)

vEGal (F/k)

)

where or and T are related by a,F(l) = TTq so that T = (bq-n, F(1)/k) and xo E k" is defined by (bq-n)h1 = x
fl

dfn

nr

TEGal (F(1)/k)

By condition 1) we have A E SF d) But we know that wk = .

na(Na - 1) where U is a finite set aEU

of integral ideals of Ok and na, a E U, are rational integers. So, just put Ta = (a, F(1) 1k), a E U, and observe that the product BC is also equal to the product B'C' with

B' dfn EGal

B

d /k)1m

and B' dfn

OF,

aE

1)

oF(1)

(T) )Na aF(1)(Ta)

'na

lx (Na-)wedz

OF,1) (TTa) /

1

201

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

where as above T = 7-gn1Q,F(1) = (bq-n, F(1)/k) and X Ok = (bq-,)h; for defining C' regroup all the m, with O]F(1) = TTgn so that C,, dfn

=

ri rEGal (F(1)/k)

na

11 ( aF (1) (1) a F(1)(T Ta ) aF(1) (T)

aEU

d 1 M.,

TT

aF (T)nr (1)

OF(,) (Ta)

rEGal (F(1)/k)

Nota Bene. We have that AB and AB' are units in OF so that C and C' are units in OF(,), as was obvious by their very definition for the quotients OF(1) (T1)/OF(1) (T2) are units.

Observe now that by the end of § 4 we have Bo E RF

1

Moreover, as

we have

Nwk 1)

(( 71)k

aEU

aEU

wF(1)/wk),TEGal (F(1) /k),

F(l) (-r

we deduce from the condition 3) the fact that (F1)

(>diMr(>fla(T - 1)(Ta - 1))+>nr(T))

0(mod.WF(1)/wk)

.

aEU

Yet, see [10), the condition F?i) (Ent(t))- 0(mod. wF(1)/wk) is a necest sary and sufficient condition for the product

JI

'F(1) (t)n,,

nt(t) E IF(1),

tECal (F(1)/k)

n

to belong to £FO = RF(1) now 1) ; whence C' E RF and a E RF(j) . SF . On the other hand, let us prove that any 1

a E I RF(1). SF )f1oF

satisfy the conditions 1) to 3) : by Definition 1 and the observation at the end of § 4, the unit a may be written as a product AB with dfn

dfn

B

H { 11 (a,a')EYxY f7 [ bEZ

Na' 9F(,) (TTTa') r hw-](1-Na')d2

( 0F(1) (1)

N"q"/F

aF(1) (T.') la

OF(1)

(

eb_1

eqQ(

Tnbl

1

))

l1 J

202

H. OUKHABA

where we have made the following conventions. Y is a finite set of integral ideals of Ok. Z is a finite set of integral ideals of Ok, all prime to q ; na,a', (a, a') E Y x Y, and mb, b E Z, are rational integers. Ta = (a, F(1) /k), Ub = (b, F/k) or its restriction to F(j). Finally [aw°°h] is the w.-power of any element x E k" such that ah = XOk. Now using formula (8) and Definition 3, we can write wk h

PF(Ub)

NHgIF(e'q(1)) eq^(1) )W= wkh

(OF(Ub)

( x

PF (1) In particular we get

1 8F(1) Qb ( Tg nl) J

aEY

fJ bEZ

O9(b-lqn)

wkdl

[b-hw°°]v'kdld2

F(1) Tqn1)

[ahw_]vad2

zs(q)

xNHa)/Fa>

1PF(1)

L

[bhwo]m(,wkdld2

=1

v

na,a' (Na' - 1) ; whence in terms of automorphisms

where Va = a' EY

11 Ta k

11 T6T2bd1 = 1.

aEY

bEZ

This equality is a necessary and sufficient condition for the sum

'a (1-Ta)+Cd1

M

aEY Wk

Tb)

bEZ

to be an element of IF(1) ; yet this sum itself (which belongs to wkIF(1)) is congruent modulo IF(1) to our n, (7) which here is

E na,a'(NQ -Ta')(1 - Ta)+wkdl a,a'EY

mbTgnl(1 -Tb) bEZ

hence condition 2) is proved. Condition 1) is trivial. On the other hand, by the Lemma 1 applied to the sum (*) we have 0(2

F(j 2 ))

Y' nT(T))l T

naa

(_) (O(T )-1/

a,a'EY

Mb )(Tgnl)

+ d1

bEZ

-d

b 4 (Tq^1)

bEZ

-dl

(1 - 4'(Tb) ) 1\

Y

(1 -

Wk

J

(Tb) )

(mod. wF(') wk

(mod. wF(1)

Wk

MT(T) I

Wk

(mod. wF(1) Wk

)

CONSTRUC77ON OF ELLIPTIC UNITS IN FTJNC77ON FIELDS

203

where as above we have put MT

E

df"

mb;

o6EGul (F/k)

o61 F.(1)=-qn

hence condition 3) is also satisfied. This concludes the proof of Proposition 8.

6. - The index formula Take F C koo to be, as in section 5, a finite abelian extension of k such that the conductor of F/k is equal to q", where q is a prime ideal of Ok. We want to calculate the index of the group £F in O. The technique we will use is well known, cf 191 or [121. Let a E Gal (F/k). Then for each rational integer a > 0 we put dfn

to F(a) =

OF(1) (a)aWF(a)

where Q E Gal(F(l)/k) is such that & = aIF(1). It is obvious that ta,F(a1)/ta,F(a2) E OF, for all al, a2 E Gal (F/k). Moreover we have the action

ta'F(a1) ° Cta,F(a2)

ta,F(aia) ta,F(a2a)

for all

Cr E

Gal (F/k).

Let us denote Ta,F the subgroup of OF generated by the quotients ta,F(a)/ta,F(a'), or, a' E Gal (F/k). We know that the group Ta,F has finite index in OF, cf. [91 or [ 121. We have (19)

[OF

:

Ta,F] = wkea(F) h(hF) (Wc)[F:k]-1

where ea(F) is a positive integer, equal to 1 if F n H(l) = k ; otherwise we have (20)

ea(F)

ST

11 (1 - X((q, F n H(1)/k)) + awkh[F : F n H(1)]) X541

where x runs through the set of all non trivial characters of Gal (FnH(l) /k).

The fact that ea(F) # 0 implies that the quotients ta,F(a)/ta,F(1), a E Gal (F/k) and a 1, constitute a maximal system of independant elements of O. In particular we have NF/F(1) (T4,F)= Ta,Fn H(1).

204

H. OUKHABA

In other words the group Ta,F n H(1) is generated by the quotients to F(T)/ta F(1),T E Gal (F(1) /k) where we have put F(1) = F n H(1) and

fj

to F(T) df"1

ta,F(T),

for all T E Gal (F(1)/k).

rE Gal (F/K) 4I F.(1) =r

This leads to the identity

(Ta,F n H(1))n= Ta F n H(1), for all n > 0.

(21)

Moreover the group Ta,FnH(1) has finite index in OF(1) given by the formula [OF(1) : Ta,F n H(1)] = wkea(F)

(22)

h(OF(u (w")[F(1):k]-1 h

Nota Bene. Let us recall also that the subgroup OF(1) of OF(1) formed of all the products 11 OF(1)(r)

j

such that

rE Gal (F(1)/k)

1

nr(T) E IF(1) has a finite index in OF(1), cf. [91 or

rE Gal (F(1)/k)

[ 121, given by the following formula (23)

[OX F(

: 0 F(1)= U)k(wkw

h)[F(1):kl-1

h(OF(1)) [H(1)

:

F(1)]

PROPOSITION 9. - Let a be a positive integer. Then the group dfn

Za,F = TaWkh F F(1) has finite index in OF, given by the formula (24)

[oF :

ZaF]=Wk(Wkw.h)[F:k]-1

h(OF) [H(1)

:

F'(1)

Proof : on one hand we have the isomorphism Za,F/T"F OF(1)/Ta F n OF(1). On the other hand one can check that Ta F n OF(1) Ta F n H(1). This leads to the following identity [OF : Za,F] [OF(1)

:

(Ta,F n H(1))wkhl = [OF : Ta,F ]

[OF(1)

:

_

OF(1)1

205

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

which allows us to conclude the proof using Formulas (19), (21) and (23).o One can notice the inclusion EFkw°°h C Za,F, for all a > 0. In fact when EFkw°°h

I a then the group group of the products w,,,,

fi

(25)

can be characterized as follows. It is the

fi

ta,F(a)ma

of Cal (F/k)

OF(,) (-r)"

TE Gal (F(1)/k)

such that of Gal (F/k)

ma E IF

E

2')

nT E IF (1)

TE Gal (F(1) /k)

3') We have the congruence

Y

d1 PF(,) (

nr(T))

TEGal (F(1)/k)

where

0(mod. ww()'\

,

TEGal (F(1) /k)

M,(7-) is the element of IF(1) such that MT =

>'

mo,

T E Gal (F(1) /k).

o EGal (F/k)

LEMMA 2. - We have a well defined morphism

Za,F -, (Z/mZ), M = WF(,) /wk,

which associate to the element (25) of Za,F the lefthand side of the above congruence 3'). This morphism is onto and its kernel isjust the group EF kwoo h

so that we have

[ZaF

:

wkwooh

EF

wF'(1) Wk

Proof : all we have to prove is that the congruence d1 ' F(j)

(

nr(T))

MI(T))+WF(i) ( TE Gal (F(1) /k)

( \

lWkh

fJ ta,F aE Gal (F/k)

Wk

TE Gal (F(,) /k)

occurs whenever the element

f, TEGal (F(1)/k)

0(mod . wF(,)

OF(1)

206

H. OUKHABA

of Za,F is equal to 1. But in this case one can see easily that the product

H uEGal (F/k)

E Ta,F n H(1),

ta,F(Q)m

which means that ma = ma', if 0 F(1) = 11'F(1); and then one can write using the definition of ta,F(a) for a E Gal (F/k) ta,F(a)'n'

OF(1)(T)'n')a[F:F(1)1

= (

aEGal (F/k)

[J

TEGal (F(1)/k)

fi

x

TEGal (F(1)/k)

(f

PF(a))"*

,

o EGal (F/k)

°I FM=

where we have put m' = ma if a E Gal (F/k) and T E Gal (F(1) /k) are such that a,F(,) = T. Now the formula OF(1)(T

VF(a)m')wkix=

)1

,TE

Gal(F(i)/k),

already proved in 191, chap. IV, leads to the equality :

11

awkh[F : F(1)]-"x-4]

8FU)

=1

rEGal (F(1)/k)

which is equivalent to the condition (26)

nT + m'T + mrawkh[F : F(1)] - m;.Tq = 0

for all T E Gal (F(1) /k). Now we have d14

M.(T)) =

[F

)/F(1) (

TEGal (F(1)/k)

F(1)J mTTgn (T))

TEGal (F(1)/k)

= [H q" : H(1)] 0(1 ( F(j) )

E

m'T(TTq^1))

TEGal (F(1) /k)

Nq

"-1(Nq-1) Wk

= -g1F(j) (

m, T

TEGal (F(1)/k)

E TEGal (F(1)/k)

,

1

)-1)

Wk

m7(1 -T)(1 - Tq1)).

CONSTRUCTION OF ELLIPTIC UNITS IN FUNCTION FIELDS

207

On the other hand the above condition (26) allows us to write ,j(F(1)

nT(T)

1

m'.)(r))

F21'

/TEGal (F(l)/k)

TEGal (F(1) /k) (2) l

mT(1 - T)(1 - Tq 1) I .

1

TEGal (Fhl /k)

The Lemma 2 is proved. PROPOSITION 10. - The quotient group OF 1,6F is finite. The exact number

of its elements is given by the index formula

[o : EF]J = L

h(OF) [H(1)

:

F(1)]

Manuscrit recu le 4 decembre 1993

208

H. OUKFIABA

References [ 1 ] V.G. DRINFELD. - Elliptic modules, Math. USSR-Sbornik, 23 (1974), 561-

592. [2] S. GALOVITCH, M. ROSEN.-Theclass number ofcyclotomicfunctionfields,

Journal of number theory, 13 (1981), 363-375. [3] S. GALOVITCH, M. RosEN. - Units and class groups in cyclotomic function

fields, Journal of number theory, 14 (1982), 156-184. [4] D.R. HAVES. - Explicit class field theory for global function fields, Studies

in algebra and number theory (Rota G.C (ed)), New-York, Academic Press, (1979), 173-217. [5] D.R. HAVES. - Elliptic units in function fields, In proc. of a conference on

modem developments related to Fermat's last theorem, D. Goldfeld ed. Birkhailser, Boston, 1982. [6] D.R. HAVES. - Stickelberger elements in function fields, Compositio. Math, 55 (1985), 209-239. [7] D. KUBERT, S. LANG. - Modular Units, Grundleh der Math. Wiss., 244 (1981), ed. Springer. [8] H. OuKHABA, G. ROBERT. - Etude d'un ideal particulier associe a un caractere de Dirichlet d'ungroupefini, Seminaire de Theorie des Nombres de Bordeaux 3 (1991), 117-127. [9] H. OUKHABA. - Fonctions discriminant, formules pour le nombre de classes et unites elliptiques; le cas des corps de fonctions (associes a des courbes sur des corps finis), These (Grenoble, Institut Fourier, Juin 1991).

[10] H. OUKHABA. - Groups of elliptic Units in Global function fields, in Proceedings of the Workshop at the Ohio State University, June 17-26, 1991. 1111 H. OUKHABA. - On discriminant functions associated to Drinfeld Modules

of rank 1, Journal of number theory, 47 (1994). [12] G. ROBERT. - Unites elliptiques, Bulletin Soc. Math. France, Supplement 36, Decembre 1973. [ 13] J. Yu. - Transcendence and Drinfeld modules, Invent. Math. 83 (1986), 507-517. Hassan OUKHABA Equipe de Mathematiques URA CNRS 741 16, Route de Gray France - 25030 Besancon Cedex

Number Theory Paris 1992-93

Arbres, ordres maximaux et formes quadratiques enti8res Isabelle Pays

On salt depuis Lagrange que tout entier naturel est une somme de quatre canes, et, d'apres Jacobi (1828), que le nombre de representations d'un entier en somme de quatre canes est

r4(m) = 8 Ed (m > 1), dim 4td

les representations obtenues en permutant 1'ordre ou en changeant le signe des composantes etant comptees separement. Les preuves connues de cette formule sont de nature analytique (analyse complexe, formes modulaires, fonctions elliptiques, ...). Parmi les nombreuses references, citons E. Landau [8, pp. 146-150] qui determine le nombre de representations d'un en-

tier en somme de quatre canes en utilisant les formules sur le nombre de decompositions d'entiers en somme de deux canes (qu'il a etablies auparavant de maniere tout a fait elementaire); G.H. Hardy et E.M. Wright [7, p. 314], J.V. Uspensky et M.A. Heaslet [15, pp. 450-458], ainsi que E. Grosswald [6, pp. 30-36] donnent des preuves basees sur des identites qui peuvent etre derivees des proprietes des fonctions elliptiques ou simplement verifiees "a la main"; dans [9, p. 3331, W. Scharlau exploite le fait que la somme de quatre canes est une forme quadratique avec un seul element dans son genre pour deduire la formule de Jacobi; A. Robert [ 111 et B. Gordon [4] etablissent la formule de Jacobi a partir de resultats sur les formes modulaires (ce qui necessite un peu d'analyse complexe). Toutes ces preuves utilisent ou bien des identites un peu "mysterieuses", ou alors du materiel assez sophistique. Pour des references concernant l'origine et les developpements historiques, nous renvoyons le lecteur au recueil de L.E. Dickson [3, Chap. VIII, p. 2851. Signalons aussi un article de G. Rousseau [ 121, oft l'auteur donne un moyen pour construire des representations d'un entier en somme de quatre canes a partir de fractions continues.

I. PAYS

210

Nous proposons ici une nouvelle preuve a caractere purement algebrique et geometrique de la formule de Jacobi. Cette preuve est tout a fait elementaire : les prerequis sont a peine un peu plus qu'un cours de premier cycle en algebre. La preuve que nous presentons decoule de resultats plus generaux sur le nombre de representations d'une puissance quelconque d'un nombre premier par certaines formes quadratiques a quatre variables, obtenus au moyen d'actions de "groupes de quaternions" sur "l'arbre de SL2 (Q p)". L'article se presente comme suit : Au § 1. on rappelle la definition d'une algebre de quaternions. Les ordres maximaux dans une algebre de quaternions rationnelle permettent de definir les formes quadratiques entieres que l'on examine plus loin. Au §2 on decrit la construction de 1'arbre a partir des ordres maximaux de M2(Qp). C'est au §3 que

l'on explique la relation entre l'action d'un certain groupe sur l'arbre et les representations d'une puissance d'un nombre premier par les formes quadratiques associees (au § 1) aux ordres maximaux. On montre au §4 que, lorsque l'ordre maximal est principal, on peut obtenir le nombre de representations d'un entier quelconque (et non plus uniquement d'une puissance d'un nombre premier). Cela conduit a une nouvelle preuve de la formule de Jacobi.

1. - Algebres de quaternions et ordres Nous renvoyons aux ouvrages 11 ], (101 et 1161 pour les preuves detaillees

des resultats mentionnes dans ce paragraphe.

Soit K un corps de caracteristique differente de 2 et soient a et b deux elements non nuls de K. L'algebre de quaternions (a, b)K est l'algebre

admettant une base de quatre elements sur K, notes 1, i, j, k, avec la multiplication definie par les relations i2 = a, j2 = b, k = i.j = -j.i. Le conjugue du quaternion q = Xi + x2i + x3j + x4k, note q, est defini par

q = xi - x2i - x3j - x4k. La norme reduite du quaternion q, notee n(q), est definie par n(q) = q.q = x1 - axe - bx3 + abx4. La trace reduite du quaternion q, notee t(q), est definie par t(q) = q + = 2x1. Il est bien connu qu'une algebre de quaternions est soit a division, soit isomorphe a 1'algebre de matrices M2(K).

Les corps consideres ici sont soit le corps (global) Q des nombres rationnels, soit un des corps (locaux) Qp des nombres p-adiques ou IR le corps des nombres reels. Sur un corps local (ici IR ou Qp), it y a une unique algebre de quaternions a division, a isomorphisme pres. Sur R, it s'agit de 1'algebre des quaternions de Hamilton, IEII = (-1, -1)a. Soit H = (a, b)q. Quitte a multiplier a et b par des carres convenables, on peut supposer que a et b sont dans Z. Pour reconnaitre si Hp = (a, b)Qp est a division, on utilise le symbole de Hilbert (a, b)p. L'algebre (a, b)Q est

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

211

a division si et settlement si (a, b) p = -1. Nous renvoyons le lecteur a 114, p.391 pour le calcul de ce symbole. Notons toutefois que (a, b)P = 1 pour presque tout p (c'est-a-dire pour tout p sauf un nombre fini d'entre eux).

Le discriminant de H est le produit des nombres premiers p pour lesquels 1'algebre de quaternions H ® Q, est a division

disc(H) =

11

p.

p premier (a,6)p=-1

Soit R un anneau principal de caracteristique differente de 2, K son corps de fractions et H une algebre de quaternions sur K (nous envisageons en particulier le cas on R = Z ou Z[P] = {ap' la, n E Z} avec K= Q ou alors R = Z P, l'anneau des entiers p-adiques, avec K = Q p). Nous designons par R" le groupe multiplicatif des elements inversibles de R.

Un ordre de H sur R est un sous-R-module de H de rang 4 qui est aussi un anneau. Les elements d'un ordre ont la propriete d'etre entiers sur R, c'est-a-dire que leur trace et leur norme appartiennent a R. Un ordre maximal est un ordre qui n'est contenu proprement dans aucun autre ordre. Voici deux exemples qui nous seront utiles.

Exemple IL. Dans H = (-1, -1)q le Z-module 0' de base (1, i, j, k) est un ordre de H sur Z. De meme, le Z-module 0 engendre par 1, i, j,

a= (1+i+j+k)/2 est un ordre de H. On note que t(a) = 1 etn(a) = 1. L'ordre 0' nest pas maximal car it est contenu dans l'ordre 0. Exemple 2. Soit R un anneau principal et K son corps de fractions. Alors M2 (R) est un ordre de M2 (K).

Les formes quadratiques que nous allons examiner sont les formes normes d'ordres sur Z d'une algebre de quaternions H sur Q. Soit 0 un tel ordre et soit (el, e2, e3, e4) une base de 0 sur Z. La forme norme de 0 par rapport d la base e est

n(x) = n(> Xei) _

XiXjt(eiej)

Xi n(ei) + i<j

ou x E 0. (Nous la notons parfois qo.) Le fait que 0 est un ordre assure que la forme quadratique obtenue est a coefficients entiers. Le choix d'une autre base de 0 conduit a une forme Z-equivalente. Par abus de langage nous appelonsforme norme de 0 un representant quelconque de la classe d'equivalence. On verifie que deux ordres 0 et 0' conjugues par un automorphisme interieur de H (c'est-a-dire 0' = hOh-1 pour un certain element inversible h de H) donnent lieu a des formes quadratiques 7L-equivalentes.

I. PAYS

212

Dans 1'exemple 1 ci-dessus, la forme quadratique associee a l'ordre 0, exprimee par rapport a la base (1, i, j, a), est go(X1, X2, X3, X4) = X1 + X2 + X3 + X4 + X1X4 + X2X4 + X3X4,

tandis que celle associee a l'ordre 0, exprimee par rapport a la base (1, i, j, k), est la somme de quatre carres : qo' (X1, X2, X3, X4) = Xi + X2 + X3 +X4 .

:etude de ces formes quadratiques necessite une connaissance plus approfondie des ordres maximaux sur Z d'une algebre de quaternions rationnelle. Pour commencer, nous allons expliquer comment on peut voir facile-

ment si un ordre est maximal. Rappelons d'abord que l'on peut munir H naturellement de la forme bilineaire bt induite par la trace en posant bt(x, y) = t(xy). On a aussi besoin des definitions suivantes. Un R-reseau d'une algebre de quaternions H sur K est un sous-R-module de H de rang 4. Le discriminant d'un R.-reseau M de H, note disc(M), est le determinant de la matrice de 1'application bilineaire bt dans une base de M. On voit, en examinant la formule de changement de base, que cet element de K"

est defini a un carre de R" pres. De plus, si L est un R-reseau contenu dans M, alors disc(L) = r2disc(M) pour un certain r E R et L = M si et seulement si r E R'. Notons aussi que le discriminant d'un ordre est un element non nul de R/R"2. Le reseau "standard" M de base (1, i, j, k) a pour discriminant -(4ab)2R"2. Des lors, a nouveau par changement de bases, on voit que le discriminant

d'un reseau de H est toujours l'oppose d'un carre de K" (modulo R"2). Notons que si R = Z, R"2 est reduit a l'unite. COROLLAIRE. - Soit R un anneau principal et K son corps de fractions. Alors : 1. M2 (R) est un ordre maximal de M2 (K),

2. M2 (R) est un anneau principal, 3. tous les ordres maximaux de M2 (K) sont conjugues a M2 (R).

Demonstration 1) 11 est clair que M2(R) est un ordre. Comme disc(M2(R)) = lmodR"2, on deduit du comportement du discriminant par rapport a l'inclusion des reseaux que M2 (R) est maximal. 2) Soit I un ideal a gauche de M2(R) et soit ((x1i x2), (yl, y2)) une base du R-reseau de R2 engendre par les lignes des matrices de I. On verifie :

aisement que la matrice A ayant pour premiere ligne (xl, x2) et pour seconde ligne (y1, y2) est dans I et que I = M2 (R) A.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

213

Si I est un ideal a droite de M2(R), on procede de maniere semblable en considerant cette fois le reseau engendre par les colonnes des matrices de I. 3) Notons 0 = M2(R) et soit O' un autre ordre maximal de M2(R). Alors, comme 0 est maximal. O'O est un ideal a droite de 0 et est donc principal.

On peut donc ecrire O'O = xO pour un certain x dans GL2(K). Par ailleurs, comme 0' est maximal, l'ordre de stabilisateurs a gauche de O'O

est egal a 0' U'ordre de stabilisateurs a gauche, 0 (I), d'un ideal I est

09(I) = {x E HjxI C I}). Comme 0,(0'0) = O9(xO) = xOx-1, on obtient 0' = xOx-1. Voici le critere qui permet de reconnaitre les ordres maximaux :

Critere. Un ordre 0 sur Z d'une algebre de quaternions H sur Q est maximal si et seulement si son discriminant est egal a I'oppose du carre du discriminant de H, c'est-d-dire si disc(o) = -(discH)2. 2. - L'arbre

Les arbres qui nous seront utiles pour etudier les nombres de representations sont les arbres associes aux groupes SL2 sur les corps locaux, qui sont des cas particuliers des immeubles de Bruhat-Tits. Nous les realisons ici a l'aide des ordres maximaux dans une algebre de quaternions deployee sur un corps local Qp (c'est-a-dire une algebre isomorphe a M2(Qp)).

Soit H une telle algebre et 0 un ordre maximal de H sur 1'anneau Z des entiers p-adiques. D'apres le corollaire ci-dessus, on sait que tous les ordres maximaux de H sont conjugues a 0, ce qui va permettre de definir une distance entre ces ordres maximaux. On introduit pour cela la valuation p-adique v : Qp -> Zp U oc normalisee par la condition v(p) = 1, et la fonction It : H --> Z U oo definie par :

µ(x)=max{nEZIxEp"O}

pourx

0

et

µ(0) = 00. Cette fonction satisfait les proprietes suivantes : PRopweTes. - Pour x, p E H et a E Q, , on a : 1. µ(x) = oo si et seulement six = 0. 2. µ(x + y) > min{µ(x), µ(J)}.

3. µ(xy) ? i(x) + µ(J)

4. µ(xa) = it(x) + v(a). 5. ;L(x) =,u(x). De plus, en designant respectivement par OX et par H" les groupes multiplicatifs des elements inversibles de 0 et de H,

a) Pourx E Hx, on ax E 0 " sietseulementsiµ(x) = µ(x-1) =0.

I. PAYS

214

b) Pour x E H', les conditions suivantes sont equivalentes : i) x E pa0" pour un certain a E Z. ii) it(x) + µ(x-1) = 0.

iii) x0x-1 = 0. c) Pour x, y E H, si x satisfait les conditions equivatentes de la propriete precedente, on a : p(xy) = µ(x) + µ(y) = µ(yx) et µ(xyx-1) = µ(y).

d) Pourx E H", µ(x-1) = µ(x) - v(n(x)) (oCl n design la norme de H).

Demonstration : les proprietes 1 a 4 sont toutes evidentes. La propriete 5 decoule immediatement du fait que tout ordre d'une algebre de quaternions est stable par la conjugaison quaternionienne (car y = t(x).1-

x). Si x E 0", alors µ(x) = 0 car les elements de p0 ne sont pas inversibles dans 0; on a alors de meme µ(x-1) = 0. Reciproquement, si /1(x) = µ(x-1) = 0, alors x et x-1 sont tous deux dans 0, done x E 0". Cela prouve la propriete (a). Si x E pa0" pour un certain a E Z, alors p(x) = a et /1(x-1) = -a, done µ(x) +EL(x-1) = 0. Inversement, six E H"

est tel que u(x) + u(x-1) = 0, soit x = pay pour a = µ(x) et pour un certain y E 0 N p0. On a alors a(y) = 0 et µ(x-1) _ -a + µ(y-1). La relation µ(x) + li(x-1) = 0 entraine alors : µ(y-1) = 0, done y E Ox par la propriete (a). Cela demontre 1'equivalence des conditions (i) et (ii) de (b). Par ailleurs, la condition (i) entraine evidemment (iii). Reciproquement, si

x satisfait la condition (iii), on ecrit encore x = pay pour a = µ(x) et pour un certain y E 0 N pO; comme 0 - M2 (Zp), it nest pas difficile de verifier qu'alors OyO = 0. Or, de la relation xOx-1 = 0, on deduit que yO = Oy; on a done

yO=Oy=OyO=O, ce qui montre que y E O" et x E paQx et acheve la demonstration de la propriete (b). Pour etablir la propriete (c), on observe que, d'apres la propriete 3, µ(xy) ? l2(x) + t1(y)

14) = µ(x-Ixy) a(x-1) + A(xy) Lorsque µ(x-1) = -µ(x), on en deduit immediatement que µ(xy) _ µ(x) + µ(y). La relation lz(yx) = µ(x) + µ(y) se demontre de maniere analogue, et la relation a(xyx-1) = µ(y) se deduit des deux precedentes. Enfin, la propriete (d) resulte des proprietes 4 et 5, car x-1 = x.n(x)-1. Soient maintenant 01 et 02 des ordres maximaux de H, et soient x1 et x2 E H" tels que :

01 =x10xi1

et

02

=x20x21.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

215

On pose d(01, 02) = -µ(xi 1x2) - µ(x2 1x1) E 7L.

Pour voir que la fonction d est bien definie, it faut verifier que le second

membre ne depend pas du choix de xl et x2. Si xi E H" est tel que Oi = xi0x'i 1 pour i = 1, 2, alors x'i 1xi0x%lx' = 0 pour i = 1, 2, donc, par la propriete (c) ci-dessus, on a

µ(x,l ix2) = µ(xti

I 1x2) = µ(x'1 XI) + µ(x 1x2) + µ(x2 1x'2)

1

et, de meme, 1L(x'2 1x1) = p(x'2 1x2) + p(x21xi) + ji(xi 1x'1). Des lors, µ(x1 1x2) + p(x'2 1x'1) = p(xi 1x2) + p(x21xi), ce qui prouve que d est bien definie. On a en fait d(01, 02) > 0, car, d'apres la propriete 3, µ(x1 1x2) + µ(x2 1x1) < 11(x1 1x2.x21xi) = 0.

PROPOSITION 2.1. - La fonction d est une distance sur l'ensemble des ordres maximaux de H. Cette distance est invariante par conjugaison, c'est d-dire que pour x E H" et pour 01, 02 des ordres maximaux de H,

d(x0lx-1, x02x-1) = d(01, 02)

Demonstration

:

it

est clair par definition que la fonction d est

symetrique. De la propriete (b), on deduit que d(01, 02) = 0 si et seulement Si 01 = 02. L'inegalite triangulaire decoule de la propriete 3 et l'invariance par conjugaison est evidente puisque (xx1)-1(xx2) = x1 1x2. On obtient alors l'arbre des ordres maximaux de H. TI-IEOREME 2.2. - Le graphe X dont les sommets sont les ordres maximaux de H et dont les aretes sont les couples (01i 02) d'ordres maximaux tels que d(01, 02) = 1 est un arbre, c'est-d-dire un graphe connexe et sans circuit. De plus, cet arbre est (p+ 1)-regulier, c'est d-dire que chaque sommet est l'origine de p + 1 aretes.

Demonstration : montrons d'abord que le graphe X est connexe. II suffit de montrer que tout ordre maximal 0' est lie a 0 par un chemin du graphe. On raisonne par induction sur la distance de 0 a 0. L'enonce est evident si cette distance est 1, puisqu'alors 0 et 0' sont lies par une arete. Il suffit donc de prouver que si la distance de 0 a 0' est n > 1, alors i1 existe un ordre 0" a distance n - 1 de 0 et a distance 1 de 0'. Soit 0' = xOx-1. Quitte a multiplier x par une puissance convenable

de p, on peut supposer p(x) = 0, c'est-a-dire que x E 0 N p0. Comme O/pO ^ M2 (1FP), la trace induit une forme bilineaire non degeneree sur

I. PAYS

216

O/pO; on peut donc trouver u E 0 N p0 tel que t(xu) ¢ pp, ce qui entrain bien sur que xu E 0 , pO. Soit alors y = xu + p- in (x). Comme d(O, 0') = n > 1, on a, par la propriete (d),

-IL(x-1) = v(n(x)) = n > 1, d'ou y E 0 N pO, c'est-a-dire, µ(y) = 0. Par ailleurs, x-1 y = u + p-lx,

donc µ(x-ly) _ -1, et de la relation n(x-1 y) = n(u) + p-lt(xu) + p-2n(x)

on tire : v(n(x-ly)) = -1. D'apres la propriete (d), on en deduit

-tc(x-ly) - IL(y-lx) = 1. Par ailleurs, comme v(n(x)) = it, on dolt avoir v(n(y)) = n - 1, donc

-µ(y) - 12(y-1) = n - 1. Des lors, l'ordre 0" = yOy-1 possede les proprietes requises.

Montrons ensuite que le graphe X ne contient pas de circuit. Soit 01i ... , On un chemin sans aller-retour, c'est-a-dire, (1)

f d(Oi,Oi+1) = 1 pouri = 1,...,n- 1 d(Oi, Oi+2) > 0 pour i = 1, ... , n - 2.

Pour prouver que ce chemin n'est pas un circuit, it suffit de montrer que d(Oi, On) = n - 1.

Ecrivons Oi = xiOx-1 pour i = 1 , ... , n et, pour i = 1, ... , n - 1 xi lxi+1 = pa`yi pour un certain yi E 0 N pO et ai = µ(xi lxi+1) E Z. D'apres la propriete (d), on a, pour i = 1, ... , n - 1,

p(xi+1xi) = ai - v(n(p'' yi)) = -ai - v(n(yi)) Des lors, d(Oi,Oi+1) = -N(x lxi+1) - µ(xi+lxi) = v(n(yi)), et les conditions (1) ci-dessus s'ecrivent :

v(n(yi)) = 1 pour i = 1, ... , n - 1 v(n(yiyi+l)) - 2,a(yiyi+l) > 0 pour i = 1, ... , n - 2.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

217

Vu la multiplicativite de la norme, la premiere condition entraine : pour v(n(yiyi+l)) = 2 pour i = 1,... , n-2; par ailleurs, comme yti E i = 1, ... , n - 1, on a yjy2+1 E 0 pour i = 1, ... , n - 2, done µ(yiyi,+1) > 0. Ces observations conduisent a reecrire les conditions ci-dessus sous la forme :

v(n(yti)) = 1 pour i = 1, ... , n - 1

(2)

I µ(ytiyti+l)

= 0 pour i = 1,...,n - 2.

Montrons alors, par induction sur m, que µ(y1... ym) = 0. C'est clair pour m = 2. Supposons donc µ(y1... yri.-1) = 0 et µ(y1 ... ym) > 0, c'est-a-dire,

yl...ym EPO. On a alors (3)

(yl

... y.-1 + pO).(ym + pO) = 0

dans O/p0.

Comme O/pO est isomorphe a une algebre de matrices carrees d'ordre 2 sur IF7,, on peut considerer yj + pO,... , ym + pO comme des operateurs lineaires sur un espace vectoriel de dimension 2 sur IFr. Ces operateurs sont non nuls puisque yj ¢ pO, et non inversibles puisque v(n(yi)) = 1; ils sont donc tous de rang 1. De meme, yl ... ym-1 + pO est de rang 1 puisque µ(y1 ... ym-1) = 0. ;equation (3) indique alors que : Im(ym + pO) = Ker(yl ... ym-1 + pO).

Par ailleurs, on a aussi Ker(yl

... y.-1 + p0) = Ker(ym-1 + pO),

donc

(ym-1 + PO) (Y. + PO) = 0

et par consequent µ(ym-lym) > 0, contrairement a l'hypothese. On a donc bien µ(y1 ... ym) = 0 pour tout m = 1, ... , n - 1. Un calcul direct donne alors d(Ol, On) = v(n(yl ... yn-1)) - 2µ(y1 ... yn-1) = n - 1, ce qui acheve de demontrer que le graphe X ne contient pas de circuit.

Pour prouver que X est (p + 1)-regulier, comme la conjugaison par tout element de H" induit un automorphisme de X et que tout ordre maximal est conjugue a 0, it suffit de montrer qu'il y a p + 1 ordres a

I. PAYS

218

distance 1 de 0. Or, it y a une correspondance bijective entre 1'ensemble des ordres a distance 1 de 0 et l'ensemble des ideaux a droite I de 0 tels

que 0 Q I Q p0, qui associe a tout ideal I son ordre de stabilisateurs a gauche :

O9(I)={xeHIxICI}

et a tout ordre 0' a distance 1 de 0 l'ideal pO'0 (pour etablir la bijectivite de cette correspondance, it est utile de remarquer que si x E 0 '. p0 et v(n(x)) = 1, alors OTTO est un ideal bilatere de 0 contenant proprement p0, donc 0770 = 0 puisque 01p0 f-- M2(]Fp) est simple. Si a present 0' est un ordre a distance 1 de 0, on peut ecrire :

0' = xOx-1 pour un certain x comme ci-dessus, d'oCi

p0'O = xOTO = xO. Reciproquement, 09(xO) = x0x-1).

Comme par ailleurs les ideaux a droite I tels que 0 12 p0 sont en bijection avec les ideaux a droite non triviaux de 0/p0 ^ M2 (IFp), qui sont au nombre de p + 1, it y a bien p + 1 ordres a distance 1 de 0.

3. - Actions de sous-groupes et representations d'entiers Soit H une algebre de quaternions sur Q et_ 0 un ordre maximal de H sur Z. La forme quadratique quaternaire a coefficients entiers que nous allons etudier est la forme norme sur 0, c'est-a-dire (voir aussi § 1) que si e = (el, e2i e3, e4) designe une base de 0, la forme s'ecrit 4

Xiei) _ i=1

Xi n(ei) + i

i<j

XiJCjt(eie )

Etant donne un nombre premier p, on se propose dans cette section d'etudier les representations des puissances de p par cette forme, c'esta-dire les solutions (XI, x2, x3, x4) E Z4 de

1'iei =

fpn

i=1

ou, ce qui revient au meme, les elements x de 0 tels que n(x) = fpn. L'ensemble de ces elements est note R(p) : R(pn) = {x E 0 1 n(x) = fpn}.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

219

On note aussi Rp(pn) 1'ensemble des solutions primitives, c'est-a-dire,

Rp(pn)=Ix EONpOIn(x)=fpn}. Les resultats sont tres differents suivant que p divise le discriminant de H ou non. Lorsque p ne divise pas le discriminant, 1'algebre Hp = H 0 Qp est isomorphe a M2(Qp), et 1'ordre Op = O 0 Z,, en est un ordre maximal.

Le groupe des inversibles de O[,], que l'on note O[P]x, s'identifie a un sous-groupe de Hp x.

Le theoreme suivant met en relation 1'action de ce sous-groupe sur I'arbre Xp des ordres maximaux de Hp (par conjugaison) et 1'ensemble des representations primitives de fpn par la forme norme de O. Soit On 1'ensemble des sommets de l'arbre Xp a distance n de Op qui sont dans la meme orbite que Op par l'action de O[P]". Soit - la relation d'equivalence sur 0 N {0} definie par x " y si x et y sont associes (a droite) dans 0 c'esta-dire si x-1 y est dans Ox. II est clair que si x E Rp(pn), alors tout element

y tel que y - x est aussi contenu dans Rp(pn). On peut donc considerer 1'ensemble quotient Rp(pn)/ -. On a le : THEOREME 3.1. - L'application qui a un element x de O fait correspondre t'ordre xOpx-1 de Hp definit une byection entre les ensembles Rp(pn)/ et An.

Demonstration : si x E R.p(pn), alors a(x) = 0 et v(n(x)) = n, donc, par la propriete (d) de la section precedente, µ(x-1) = -n. Des lors, d(Op, xOpx-1) = n. Montrons que l'application definie dans 1'enonce est surjective : soit 0' un element de A, it existe alors y E 0[-!]' tel que 0' = et d(Op, yOpy-1) = n. Comme les scalaires agissent trivialement, on peut yOpy-1

supposer, quitte a multiplier y par une puissance adequate de p, que µ(y) = 0, c'est-a-dire que y E O N pO. La condition d(Op, yOpy-1) = n se traduit alors par v(n(-y)) = n. Par ailleurs, comme -y E O[P]", on doit avoir n(y) E Z[Y]" = {±pk I k e Z}.

Les conditions precedentes entrainent : n(y) = fpn, donc l'ordre 0' est l'image de y par l'application decrite dans 1'enonce.

Montrons pour terminer que l'application est aussi injective. Deux elements x, y de Rp(pn) definissent le meme sommet si et seulement si XOpx- 1

= yOpy

1

I. PAYS

220

D'apres la propriete (b) de la section precedente, cette condition est equivalente a

X-1yEpaQp

pour un certain a E Z. Comme n(x-ly) = ±1, on en deduct que x-1y E H" n OP x. Par ailleurs, x-1y E O[ff] puisque x-1 = fx/pn et que T E 0. Donc,

Voici un cas ou le nombre d'elements de An est particulierement facile a calculer. TrIEOREME 3.2. - Si t'ordre maximal 0 est principal, alors transitivement sur les sommets de l'arbre Xp, et par consequent

agit

t& l=pn-1(p+1) pour tout n > 1.

Demonstration : soit 0' un ordre maximal de Hp. On sait que tous les ordres maximaux de Hp sont conjugues, donc 0' = xOpx-1 pour un certain x E Hp >'. Quitte a multiplier x par une puissance convenable de p, on peut choisir x E Op. Considerons alors l'ideal a droite I de 0 define par

I= n(OgnH)

n(x0p n H),

qi4P

c'est-a-dire l'ideal dont les localises sont Iq = °q pour q p et Ip = xOp. D'apres l'hypothese, cet ideal est principal, donc I = yO pour un certain

y E 0. Comme Iq = Oq pour q

p, on a y E Coq pour q 54 p, donc

y E CA[P]". Par ailleurs, de la relation Ip = XOP = Y°p,

on deduit que l'ordre des stabilisateurs a gauche de Ip est

'g(Ip) = X0pX-1 = JOPJ-1+

c'est-a-dire que 0' = yOpy-1. Cela prouve que 0[-!]' agit transitivement sur les sommets de 1'arbre Xp. Il en resulte en particulier que On est 1'ensemble de tous les sommets a distance n de Op. Cet ensemble contient

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

221

pn-1(p + 1) elements, puisque I'arbre Xp est (p + 1)-regulier, d'apres le theoreme 2.2. Remarque : plus generalement, Vigneras [ 16, p.147, Prop.3.3] a montre que le nombre d'orbites de sommets de Xp sous 1'action de O[P] est egal au nombre de classes de O. La demonstration n'est pas aussi elementaire que celle du theoreme precedent, car elle utilise un theoreme puissant d'Eichler.

On a choisi au debut de ce paragraphe un nombre premier p qui ne divisait pas le discriminant de H. Voici maintenant ce qui se produit lorsqu'au contraire p divise le discriminant de H, c'est-a-dire lorsque Hp = H 0 Qp est une algebre a division. Comme precedemment, on dit que deux elements x, y E 0 sont associes (a droite) s'il existe un element inversible u E Ox tel que x = yu; on note alors x - y. THEOREME 3.3. - Lorsquep divise le discriminant de H, alors les elements

de R(pn) sont tous associes, pour tout n > 0, de sorte que le quotient R(pn)/ - contient un seul element, si R(pn) nest pas vide. Si l'ordre 0 estprincipal, alors R(pn) est non vide, pour tout n > 1.

Demonstration : supposons que R(pn) est non vide. Soient x, y E R(pn) ; alors x et y sont inversibles dans Oq = 0 0 Z. pour q # p, et donc, xOq = YOq = Oq.

En p, l'algebre de quaternions Hp = H®Qp est isomorphe a l'unique algebre de quaternions a division sur Qp, et Op est l'anneau devaluation de Hp U10, § 12]). De plus, tout ideal a droite de Op est bilatere et principal et est donc de la forme irPOp, ou 7rp est une uniformisante de Or,. Comme la valuation p-adique de n(7rp) est 1, on a en particulier x0p = 7rnop = yOp.

Ainsi, xOp = yOp pour tout p, donc xO = yO et x - y. Cela prouve que tous les elements de R(pn) sont associes. Si 0 est principal, alors l'ideal I dont les localises sont Oq pour q 54 p et 7rpOp en p est principal; soit I = 7r0 pour un certain it E O. On a alors

0

7rO 2 pO,

donc p = 7rir' pour un certain 7r' E 0, et n(7r)n(ir') = p2.

Si n(-7r) = ±1, alors 7rO = 0; si n(7r) = ±p2, alors 7rO = pO. Comme ces deux egalites sont exclues, on doit avoir n(7r) = fp,

I. PAYS

222

donc R(p) est non vide. De plus, pour tout n > 1, on a n(7rn) = ±p', donc R(pn) est non vide pour tout n > 1. Dans le cas particulier ou la forme norme est definie positive, ce qui

revient a dire que l'algebre de quaternions H est telle que H 0 IR est isomorphe a 1'algebre (-1, -1)R des quaternions d'Hamilton, it est clair que les equations n(x) = -pn n'ont pas de solution et que les equations n(x) =

pn n'en ont qu'un nombre fini. Les resultats precedents permettent de denombrer ces solutions. L'ordre 0 de l'algebre de quaternions rationnelle H etant fixe, notons r(pn) (resp. rp(pn)) le nombre de solutions (resp. de solutions primitives) x E 0 de 1'equation n(x) = pn, c'est a dire le nombre d'elements de R(pn) (resp. Rp(pn)). COROLLAIRE 3.4. - Supposons que laforme norme soit definie positive. Si p est un nombre premier qui ne divise pas le discriminant de L'algebre H, alors pour tout n > 1, rp(a)n) = 10' I Ionl oil An est L'ensemble des sommets de I'arbre Xp qui sont dans la meme orbite que O. sous l'action de 0[-1]" et a distance n de CAP, et [n/21

r(pn) = IOX I

E k=0

.

oil [n/2] est le plus grand entier inferieur ou egal a n/2. Si p est un nombre premier qui divise le discriminant de H. alors pour tout n > 1,

r(pn) = 0 ouI0xI. Si de plus L'ordre 0 est principal, alors pour tout nombre premier p qui ne divise pas le discriminant de H,

rp(pn) =

I0XI. pn-i(p+ 1)

et

np+1

lox

p

-1

-1 r(pn) = et sip divise le discriminant de H,

pourtoutn> 1 pour tout n > 1,

I

r (pn) = I C X I

pour tout n > 1.

Demonstration : Si p ne divise pas le discriminant de H, les formules pour rp(pn) resultent directement des theoremes 3.1 et 3.2 ci-dessus, puisque rp(pn) = 0 < I ' I Rp(pn)/

I.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

223

Les formules pour r(p') s'en deduisent, car les solutions non primitives de n(x) = p' sont de la forme x = pkxk ou Xk est solution primitive de n(x) = p"-2k. De meme, si p divise le discriminant de H, les formules pour r(p') decoulent du theoreme 3.3. Pour completer l'information donnee dans ce corollaire, remarquons que la structure du groupe O" est connue pour les ordres maximaux des algebres de quaternions definies positives : PROPOSITION 3.5. - Si 0 est un ordre maximal d'une algebre de quater-

nions H sur Q telle que H 0 R est isomorphe a (-1, -1)R, alors O" /{f1} est cyclique d'ordre 1, 2 ou 3 sauf dans deux cas :

si H = (-1, -1)Q, alors 0'/ f ± 11 est isomorphe au groupe alterne A4; si H = (-1, -3)Q, alors O" /{±1} est isomorphe au groupe symetrique S3. Demonstration : voir [ 17, th. 5, p. 269]. Exemple : revenons a 1'exemple 1 avec l'ordre Ode base (1, i, j, a) dans

1'algebre H = (-1, -1)Q. Le discriminant de 0 vaut -4 et le discriminant de H est egal a 2. Des lors, 0 est maximal. Pour voir que c'est un anneau principal, on montre que les elements de 0 satisfont un algorithme de division euclidienne ([ 13, p. 98, lemme 3]). L'ordre du groupe O" des elements inversibles de 0 est 24, d'apres la proposition 3.5. Des lors, pour tout p 2, le nombre de representations primitives de p"` par la forme : q0 (Xl, X2, X3, X4) = X1 + X2 + X3 + X4 + X1X4 + X2X4 + X3X4.

est egal a rp(p') = 24(p +

1)p"-1

(p

2),

et le nombre de representations de 2 est donne par :

r(27z)=24

n>0.

4. - Ordres principaux Dans cette section, 0 designe un ordre maximal principal dans une algebre de quaternions rationnelle H dont la forme norme est definie positive. On se propose de montrer que, grace au fait que 0 est un anneau principal, it est possible de donner le nombre de representations d'un entier

positif quelconque (et non pas seulement des puissances d'un nombre premier) par la forme norme de O.

I. PAYS

224

LEMME 4.1. - Soient a et b des entiers positifs premiers entre eux. Si

x E 0 est tel que n(x) = ab, ators it existe y, z E 0 tels que x = yz et n(y) = a, n(z) = b.

Si y et z sont des elements de 0 tels que n(y) = a et n(z) = b, alors yz0 + a0 = yO et Oyz + 0b = Oz. Demonstration : soit x E 0 tel que n(x) = ab. On considere l'ideal x0 + a0 de 0. Comme 0 est principal, on a :

x0 + a0 = yO pour un certain y de 0. Soient x = yz et a = yy'. Alors n(x) = ab = n(y)n(z) et a2 = n(y)n(y'), donc n(y) est un commun diviseur de abet de a2. Comme

a et b sont premiers entre eux, n(y) divise a. Par ailleurs, de la relation xO + aO = yO, on tire aussi

xx'+aa' = y pour certains x', a' de 0. En prenant la norme des deux cotes, on deduit :

n(y) = n(xx') + n(aa') + t(xx'aa') = n(x) n(x') + a2n(a') + at(xx'a') = a(bn(x') + an(a') + t(xx'a')). Comme x, x' et a' sont dans 0, on a t(xx'a') E Z, donc le facteur de a dans le membre de droite est un entier. Il en resulte que a divise n(y). Comme a et n(y) sont tous les deux positifs, on a que n(y) = a et donc aussi n(z) = b. Par ailleurs, si y et z sont des elements de 0 tels que n(y) = a et n(z) = b alors, comme 0 est principal, on a :

yzO+aO=dO pour un certain d dans 0. Les arguments du debut montrent que n(d) = a. Par ailleurs, de yj = a, on deduit que aO C yO. Des lors, yzO + aO = dO C yO.

Ainsi i1 existe u E 0 tel que d = yu. Comme n(d) = n(y) = a, on a n E 0" et donc d0 = yO. Avec ce lemme, on peut donner le nombre de representations d'un entier quelconque par la forme norme de 0; comme precedemment, on note

R(m) =Ix E0I n(x)=m} et

r(m) = JR(m)I, ou m est un entier (positii) quelconque.

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

225

T1ii orEME 4.2. - 1. Lafonction r(m)/ IOx I est multiplicatioe, c'est-d-dire que si a et b sont des entiers positifs premiers entre eux, alors :

- lox I

r(ab)

r(a)

lox I

r(b)

lox I

2. Soit m un entier positif. On a

r(m) = I

I

(

d) dim pgcd(d,disc H)=1

Demonstration : 1. Soient a et b des entiers positifs premiers entre eux. La multiplication

dans 0 definit une application : R(a) x R(b) -i R(ab), qui est surjective d'apres le lemme 4.1. Pour demontrer la premiere partie de 1'enonce, it suffit de prouver que tout element de R(ab) est l'image de lox I elements de R(a) x R(b), puisqu'alors

r(ab) =

IR(a) x R(b)I

r(a)r(b)

IoxI

IOxI

.

Fixons x E R(ab). Si (y, z) et (y', z') sont des elements de R(a) x R(b) tels que

yz=x=y'z',

alors d'apres la seconde partie du lemme 4. 1 on a : xO + aO = yO = y'O.

Des lors, it existe un element inversible u E Ox tel que y' = yu (et donc z' = u-1z). Cela prouve que les elements de R(a) x R(b) qui ont x pour image sont les couples (yu, u-Iz), oli u E O. Le nombre de ces couples est bien egal au nombre d'elements de Ox. 2. On a deja calcule, dans le corollaire 3.4, le nombre de representations des puissances d'un nombre premier p : si p ne divise pas disc H, alors :

et si p divise disc H, r(Pn)

IoxI

226

I. PAYS

On voit ainsi que les fonctions

(r

et (T, d) prennent la mil-me valeur dIm

pgcd(d,disc H)=1

lorsque m est une puissance d'un nombre premier. Comme ces deux fonctions sont multiplicatives, elles doivent prendre la meme valeur pour tout M. Le theoreme 4.2 s'applique aux ordres principaux dans les algebres

de quaternions rationnelles definies positives, qui sont au nombre de cinq [16, p.1551. L'ordre principal est alors unique (a conjugaison pres) [16, p.26, Cor. 4.111. Voici la liste des cinq formes quadratiques (a Zequivalence pres) auxquelles le resultat s'applique ainsi que, pour chacune, la formule pour le nombre de representations d'un entier quelconque. - Le nombre de representations d'un entier positif n par la forme

Xi +X2+x3+X4 + X1X4 + X2X4 + X3X4 est

241: d, din 2{d

- Le nombre de representations d'un entier positif n par la forme

Xi

+X2+X3 +X4 +X1X4+X2X3

est

121: d, din 3$d

- Le nombre de representations d'un entier positif n par la forme

x2 + 2X2 + 5X2 + X4 + X1X2 + X1X4 + X2X4 + 5X2X3 est

6Ed, dIn 5{d

- Le nombre de representations d'un entier positif n par la forme X1 + X2 + 2X3 + 2X4 + X1X4 + X2X3 est

41: d, din 7{d

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

227

- Le nombre de representations d'un entier positif n par la forme X1 + 4X2 + 13X3 + 2X4 + X1X2 + X1X4 + X2X4 + 13X2X3 est

2> d. dln 13{d

Nous allons maintenant deduire la formule de Jacobi pour la somme de quatre carres a partir de la premiere de ces formules. La forme quadratique Xi +X2 +X3 +X4 est la forme norme de l'ordre 0' de base (1, i, j, k) dans l'algebre de quaternions H = (-1, -1)Q. Cet ordre n'est pas maximal : it est strictement contenu dans l'ordre maximal 0 de base (1, i, j, (1+i+j+k)/2). On ne peut donc pas lui appliquer la technique developpee ci-dessus. Cependant, les relations entre 0 et 0' sont telles qu'il est quand meme possible de deduire le nombre de representations d'un entier en Somme de quatre carres a partir du nombre de representations d'un entier par la forme norme de 0. Pour eviter la confusion, la notation r(m) designe, dans la fin de ce paragraphe, le nombre de representations de m par la forme norme de 0 tandis que r4(m) designe le nombre de representations de m en somme de quatre carres (qui est la forme norme de 0'). Commencons par indiquer ]a relation entre 0 et 0'. PROPOSITION 4.3. - Soit x un element non nul de 0. Si n(x) est paire, alors x E 0'; si n(x) est impaire, alors x est associe (a droite) a 8 elements de 0' (et a 16 elements de 0 N 0').

Demonstration : par rapport a la base (1, i, j, a) de 0, la forme norme s'exprime de la maniere suivante : n(xi + 2x2 + 353 + ax4) = xi + x2 + x3 + x4 + 51x4 + 52x4 + 53x4.

Des lors, pour x = x, + i52 + jX3 + ax4, n(x) -= (xl + x2 + x3)2 + (xl + x2 + 53)x4 + x4 mod 2.

Comme la forme quadratique X2 + XY + Y2 est anisotrope sur le corps a deux elements, on a donc n(x) E 2Z si et seulement si xl + X2 + 53 - 54 0 mod 2. En particulier, si n(x) est paire, alors 54 est pair, ce qui entraine :

xe0'.

Si n(x) est impaire, alors (x + 20)(5 + 20) = 1 + 20, donc T + 20 est l'inverse de x + 20 dans l'anneau quotient 0/20. Par ailleurs, on montre

I. PAYS

228

aisement (voir par exemple [13, p.1001) que x est associe a droite a un element de 0'; pour terminer la demonstration, on peut donc supposer x E 0'. Si U E 0' est tel que xu E 0', alors dans 0/20, on a u + 20 = a; (xu) + 20,

ce qui prouve que u E 0', puisque Y(xu) E 0' et 20 C 0'. On a donc xu E 0' si et seulement si u E 0", ce qui prouve que les associes a droite de x qui sont dans 0' sont en bijection avec 0", d'ou la proposition, car

10'x1=8. La formule de Jacobi se deduit alors aisement de la formule pour le nombre de representations par la forme norme de 0 : THEOREME 4.4. (Jacobi). - Le nombre de representations d'un entier positif m en somme de quatre carres est donne par

7'4(m) =8() 'd). dim 4{d

Demonstration : soit, comme precedemment :

R(m)={xE01n(x)=m}. Si m est impair, on deduit de la proposition precedente que chacune des classes d'elements associes a droite qui constituent R(m) contient 8 elements de 0' et 16 elements de 0 . 0'; donc r4(m) = 3r(m) =

d.

d1m

Si m est pair, alors d'apres ]a proposition precedente R(m) C 0', donc

r4 (m) = r(m) = 24 Ed. djm 2{d

On peut encore exprimer ce resultat comme suit :

r4(m) = 8(E d +

2d) =

dim

dim

dim

2{d

2{d

4{d

Dans les deux cas, on a donc bien le resultat annonce. manuscrit recu le 22 fevrier 1994

ARBRES, ORDRES MAXIMAUX ET FORMES QUADRATIQUES

229

Bibliographie

[1] A. BLANCHARD. - Les corps non commutatifs, Presses Universitaires de France, Paris, 1970. [2] J.W.S. CASSELS. - Rational Quadratic Forms, Academic Press, London, 1978. [3] L.E. DICKSON. - History of the Theory of Numbers, vol. II, Chelsea Publishing Co., New York, 1952. [4] B. GORDON. - An Application of Modular Forms to Quadratic Forms, BA-

thesis, 1975. [5] H. GROSS. - Darstellungsanzahlen von quaternaren quadratischen Stammformen mit quadratischer Diskriminante, Comment. Math. Helv 34, (1960) 198-221. [6] E. GROSSWALD. - Representations of Intergers as Sums of Squares, SpringerVerlag, New York Berlin Heidelberg Tokyo, 1985. [7] G.H. HARDY and E.M. WRIGHT. - An Introduction to the Theory of Numbers,

5th ed., Oxford University Press, 1979. [8] E. LANDAU. - Elementary Number Theory, Chelsea Publishing Co., New York, 1966.

[9] G. ORZECH ed. - Conference on Quadratic Forms 1976, Queen's Paper in Pure and Applied Math. 46, Kingston, Ontario, Canada 1977. [10] I. REINER. - Maximal Orders, Academic Press, London 1975. [11] A. ROBERT. - Introduction to Modular Forms, Queen's Paper in Pure and Applied Math. 45, Kingston, Ontario, Canada 1976. [12] G. ROUSSEAU. - On a construction for the representation of a positive integer

as the sum offour squares, L'Enseignement Math 33, (1987) 301-306. [13] P. SAMUEL. - Theorie algebrique des nombres, Hermann, Paris 1967. [14] J.-P. SERRE. - Cours d'Arithmetique, Coll. SUP, Presses Univ. France, Paris, 1970. [15] J.V. USPENSKY and M.A. HEASLET, Elementary Number Theory, McGraw-Hill,

New York and London, 1939.

230

1. PAYS

[16] M.-F. VIGNERAS. - Arithmetique des algebres de quaternions, Lecture Notes in Math 800, Springer-Verlag, Berlin Heidelberg New York, 1980. [17] M.-F. VIGNERAS. - Simplification pour les ordres des corps de quaternions

totalement definis, J. Reine Angew. Math. 286/287, (1976) 257-287. [18] A. WEIL. - Sur les sommes de trois et quatre carres, L'Enseignement Math 20 (1974) 215-222.

Isabelle PAYS

Universite de Mons-Hainaut Avenue Maistriau, 15 B-7000 MONS BELGIQUE

Number Theory Paris 1992-93

On a conjecture that a product of k consecutive positive integers is never equal to a product of mk consecutive positive integers except for 8.9.10 = 6! and related questions T.N. SHOREY

For an integer m > 2, we consider the equation

(1) (x+1) .

(x+k) _ (y+1) . . . (y+mk) in integers x > 0, y > 0, k > 2.

We replace x + 1 by x and y + 1 by y in (1) for observing that it is identical to considering equation

x(x+1) . . (x+k-1) = y(y+l) . . . (y+mk-1) in integers x> 0, y>0, k>2. If m = 2, equation (1) has a solution given by 8.9.10 = 6! (2)

x=7,y=0,k=3.

MacLeod and Barrodale [8) observed that this is the only solution of (1) with

m = 2 and k < 5. I give their proof for k = 2. We write equation (1) with m = 2 and k = 2 (3)

(x + 1)(x + 2) _ (y + 1)(y + 2)(y + 3)(y + 4).

By putting u = y2 + 5y, we have (x + 1) (x + 2) _ (u + 4) (u + 6).

Notice that

(x+32 _ 1)2<(x+1)(x+2)< (x+2)2 4

232

T.N. SHOREY

and

(u+5- 4)2 < (u+4)(u+6) < (u+5)2. We have [%11-(x

+ 1)(x + 2)]

=

(u + 4)(u + 6)]

[

which implies that i.e.

x=u+3.

(4)

By substituting (4) in (3), we have

(u + 4)(u + 5) = (u + 4)(u + 6) and this is a contradiction. Now, I give a few comments on the proof. We have written the right hand side of equation (1) as a product of translates of u and the translates

are independent of y. This is typical of the case m = 2. In the general case, we shall be extracting k-th roots in place of square roots. For this, it is necessary for the above argument that x and y are large as compared with k. Saradha and Shorey [ 111 proved that (2) is the only solution of equation

(1) with m = 2. Further, Saradha and Shorey [12] showed that equation (1) with m E {3, 4} has no solution. Recently, Mignotte and Shorey showed

that this is also the case when m E {5, 6}. For m > 2, it is proved in [13] that equation (1) implies that max (x, y, k) < C

where C is an effectively computable(i) number depending only on m. We have not been able to replace C by an absolute constant. It is likely that equation (1) with m > 2 has no solution. (5)

Now, we give a sketch of the proof that equation (1) with m > 2 implies that max(x, y, k) is bounded by a number depending only on m.

As pointed out earlier, we secure that x and y are large as compared with k. We re-write equation (1) as

(y + 1) ... (y + mk) (mk)! k! (1)

(mk)!

k!

All the constants appearing in this paper are effectively computable.

233

ONA CONJECTURE CONCERNING 8.9.10=6!

We count the powers of 2 on both the sides to obtain

k < ord2 ( k! ') < ord2

(x + 1

k! x + k )

< max ord2 (x + i) < - 1
log (x + k) log 2

which implies that :

x>2k-k.

(6)

By equation (1), we have

xk < (y + mk)k i.e.

x < (y + mk)'.

(7)

By (6) and (7), we observe that x and y are large as compared with k. Further, we combine (6) and (7) to write (8)

2k

- k < (y +

mk)'.

If y is bounded, we observe from (8) that k is bounded and equation (1) implies that x is bounded. Thus, we may always assume that y exceeds a sufficiently large number yo depending only on m. Further, by equation (1), we observe that x > y > yo.

For extracting k-th roots on both the sides of equation (1), we need to introduce some notation. We write ink

(9)

(z + 1) . . . (z + mk) =

Aj (m, k)zmk-j. j=0

Further, we determine rational numbers

Bj=Bj(m,k) withl<j<m such that : (10)

(zm + Biz'n-1 + ....+ Bm)k =

ink

Hj (m j=0

k)zmk-j

T.N. SHOREY

234

Hj (m, k) = Aj (m, k) for 0 < j < m.

k /Bl \= A, (m, k)

kB2 + 12 I Bi = A2 (m, k) .............../......................

Therefore B1i

,

B,,,, are determined recursively.

Let z be a sufficiently large positive number as compared with k and m. The relations (11) imply that the left hand side of (9) is close to the left hand side of (10). Therefore, the k-th root of the left hand side of (9) is close to the k-th root of the left hand side of (10). We can use this observation with z replaced by x or y, since x and y are large as compared with k and m. Therefore, the k-th root of the left hand side of (1) is close

to x +

k+ 1

and the k-th root of the right hand side of (1) is close to + B,,,.. Consequently, by equation (1), we derive that ym + + x + k1 is close to y"' + Blym-1 + + B. In fact, we show that Blym-1

(12)

Ix - (ym+Blym l +...+Bm_ k+1 )kT-1 2

where , den(B,))). T = (21cm (den(Bi), On the other hand, the left hand side of (12) is at least T-1 whenever it is not equal to zero. Hence, we conclude that

k+1

(13)

x=ym+Blym-1+...+Bm_

2

We substitute (13) in (1) to derive that (14)

HH (m, k) = Aj(m, k) for 0 < j < 2m

and (15)

H2,, (m, k) - A2m(m, k)

=

k(k + 1)(k - 1) 24

235

ONA CONJECTURE CONCERNING 8.9.10=6!

Thus, we have added m - 1 relations to the relations (11) with which we started. This is the basic idea of the proof. We calculate

Bl(2,k) = 2k+ 1 , B2(2,k) = (k+ 1)(2k+ 1)/3 and

(16)

H4(2, k) - A4(2, k) = (4k5 - 5k3 + k)/90.

By (15) and (16), we derive that m > 2. Then, we apply a result of Balasubramanian 113, Appendix] to conclude from (14) that k is bounded

by a number depending only on m. Thus, there are only finitely many possibilities for k and we fix k. We put

+1) ... (Y + mk) and

£(Y) = L(O(Y), Y)

where

k+1 O(Y) = Ym + BlYm-1 + ... + Brn 2

By equation (1) and (13), £(y) =

0,

which implies that either £(Y) is a zero polynomial or y is bounded by a number yo depending only on m and k. By taking yo > yo, we conclude that £(Y) = 0.

(17)

Now, I give two proofs to exclude the possibility (17).

We assume (17). Then (18)

L(X,Y) = (X - O(Y))

(Xk-1 + R1(Y)Xk-2 +

... + Rk-1(Y))

where Rj(Y) E Q[Y] for 1 < i < k. By equating the terms independent of X in the factorisation (18), we observe that the polynomial

(Y+1)...(Y+mk)-k!

236

T.N. SHOREY

is reducible over the field of rational numbers. Now, we apply a result of Brauer and Ehrlich 151 to conclude that k!

>k-lmk - 1)! 2(

[(mk - 2)/2]!

The right hand side is an increasing function of m and the inequality is not valid for m = 4. Therefore, we derive that m = 3 which implies that k = 2. Now, by looking at the constant term of £(Y), we observe that B3 (3, 2)

- 4 = W.

This is not possible, since B3 (3, 2) is a rational number. Next, we turn to the second proof to exclude the possibility (17). We as, i,,,,, jl, sume (17). Then, there exist pairwise distinct integers ii, , jm.

such that O(Y)+1 = and b(Y)+2-(Y+ji)...(Y+j,,,.)

Thus (19)

By putting Y = -ii in (19), we have

(ji - ii) ... U. - ii) = 1 which implies that m = 2. Then, we observe from (19) that

jl + j2 = it + i2 , 3132 = i1i2 + 1. Consequently (jl - j2)2 = (

2-4

which is not possible.

This completes our sketch of the proof of (5). Now, I mention some extensions of (5). Let f (X) be a monic polynomial of positive degree with rational coefficients. For an integer m > 2, we consider the equation

(20) f(x+1). . f(x+k) = f(y+1). . f(y+mk) in integers x, y, k > 2.

237

ONA CONJECTURE CONCERNING 8.9.10=6!

If f is a power of an irreducible polynomial, Balasubramanian and Shorey [3] proved that equation (20) with

f(x+j)#0 for 1<j
(21)

implies that max(I x 1, 1 y 1, k) is bounded by a number depending only on m and f. This is an extension of (5), since (21) is satisfied whenever f(X) = X and x is a non-negative integer. Further, it is easy to observe that the assumption (21) is necessary. The proof utilises an extension of the second proof for excluding the possibility (17). The second proof depends on the fact that f (X) = X is irreducible. For extending this proof, we need that f is a power of an irreducible polynomial. Furthermore, this is the only place in the proof where the hypothesis that f is a power of an irreducible polynomial is used.

For an integer d > 1, Saradha and Shorey [14] obtained another extension of (5) by proving that if x > 0, y > 0, k > 2 are integers satisfying (22)

(y+(mk- 1)d),

x(x + d) ... (x + (k - 1)d)

then max (x, y, k) is bounded by a number depending only on d and m. In fact, this is an immediate consequence of the following result ([ 14, Theorems 1,21): For e > 0, there exists a number Cl depending only on m and e such that equation (22) with max (x, y, k) > C1 implies that

d>

(23)

y(1-E)/(+n+1)

log d > (

M m(m + 1)

- e)K

where K and M are positive numbers given by K2 = k log k

(24)

,

M2 = m(m - 1)/2.

We observe from (23), (24), (25) and (22) that max (x, y, k) is bounded by a number depending only on d and m. Further, for positive integers d1, d2 and m > 2, Saradha and Shorey [ 151 considered a more general equation than (22) : x(x +

(x + (k - 1)d1) =y(y +

(mk - 1)d2)

(25)

in integers x > 0, y > 0, k > 2.

It was shown in [15] that equation (25) with m = 2 implies that either max(x, y, k) is bounded by a number depending only on d1, d2 or k = 2, d1 = 2d2, x = y2 + 3d2y. On the other hand, equation (25) with m = 2 is

238

T.N. SHOREY

satisfied whenever the latter possiblities hold. If m > 2, it was proved in [ 151

that there exist numbers C2 and C3 depending only on dl, d2, m such that equation (25) implies that k < C2 and moreover max (x, y) < C3 unless (*) d1 /c 2" is a product of m distinct positive integers composed of primes not exceeding m and m > a(k) where

a(k) =

14 for 2 < k < 7 50 for k = 8 I exp(klogk - (1.25475)k - logk + 1.56577) for k > 9.

This includes a result on the case dl = d2 = d mentioned above, since (*) is never satisfied. Further, we derive that equation (25) with 3 < m < 14 or 3 < m < 2568, k > 9 implies that max (x, y, k) is bounded by a number depending only on d1, d2, m. Finally, Saradha, Shorey and Tijdeman (17] showed that condition (*) is not necessary. Consequently, we conclude that equation (25) with m > 2 implies that max (x, y, k) is bounded by a number depending only on d1, d2, m. This is a consequence of a more general result which we describe now. For distinct positive integers £ and

m with gcd(f, m) = 1 and £ < m, Saradha, Shorey and Tijdeman [ 17] proved that there exists a number C4 depending only on d1, d2, m such that if x > 0, y > 0 and k > 2 are integers satisfying

x(x + di)

.

(x + (fk - 1)dl) = y(y + d2) ... (y + (mk - 1)d2),

then max(x, y, k) < C4

unless f = 1, m = 2 which corresponds to equation (25) with m = 2 and we refer to the result already stated in this case. By applying the theory of linear forms in logarithms, it is shown in [ 17] that the preceding assertion is also valid for k = 1 provided that f E {2, 4} and (f, m) = (3, 4). Now, we turn to equation (25) with m = 1. In this case, there is no loss of generality

in assuming that x > y and gcd(x, y, dl, d2) = 1. Then Saradha, Shorey and Tijdeman [ 161 proved that equation (25) with m = 1 implies that there exists a number C5 depending only on d2 such that either

x=k+l,y=2,d1=1,d2=4 or

max(x, y, k) < C5.

We observe that

(k+ 1). (2k)=2.6...(4k-1) fork= 2,3....

239

ON A CONJEC'TJRE CONCERNING 8.9.10=6!

since the right hand side is equal to

2k(2k)!/(2.4... (2k)) = (2k)!/k!.

Therefore, the above possibilities for the case d1 = 1, d2 = 4 cannot be excluded. Further Saradha, Shorey and Tijdeman [18] showed that equation (25) with m = dl = 1 implies that y < k2d2/12 and furthermore k < d2 - 2 unless y 2(mod 4) and d2 = 21 for some integer 2 > 2. In the case m = 2, dl = 1, Saradha, Shorey and Tijdeman [ 18] proved that equation (25) implies that y(y + (2k - 1)d2) < (0.44)k4d2 and furthermore

k < d2 - 2 unless k < 35 and d2 = 2e for some integer 2 _> 2. These results are applied in [181 to determine all the solutions of equation (25) with m = d1 = 1, d2 E 12,3,5,6,7,9, 10} and m = 2, dl = 1, d2 E {5, 6}. Let a and b be positive integers. We consider (26)

a(x + 1) ... (x + k) =b(y+1)...(y+k+2) in integers x > 0,y > 0,k > 2,2 > 0.

Equation (1) is a particular case of (26), namely, a = b = 1 and k + 2 is an integral multiple of k. ErdOs [71 conjectured that there are only finitely many integers x > 0, y > 0, k > 2, 2 > 0 with k + 2 > 3 and x > y + k + 2 satisfying

(26). This is a difficult problem. The assumption k + 2 > 3 is to exclude Pell's equations and the assumption x > y + k + 2 is to guarantee that the two blocks of consecutive integers in equation (26) are non-overlapping. Mordell 191 proved that equation (26) with a = b = 1, k = 2, 2 = 1 implies

that x = 1, y = 0 and x = 13, y = 4. Mordell's result initiated much of research in this direction. Avanesov [ 1] confirmed a conjecture of Sierpinski

by proving that x = 0, y = 0; x = 3, y = 2; x = 14, y = 7; x = 54, y = 19 and x = 118, y = 33 are the only solutions of equation (26) with a = 3, b = 1, k = 2, 2 = 1. Tzanakis and de Weger [20] determined all the solutions of equation (26) with a = 1, b = 2, k = 2, 2 = 1. Boyd and Kisilevsky [4] showed that x = 1, y = 0; x = 3, y = 1 and x = 54, y = 18 are the only solutions of equation (26) with a = b = 1, k = 3, 2 = 1. Cohn 161 proved that equation (26) with a = 1, b = 2, k = 4, 2 = 0 is satisfied only if x = 4, y = 3. Further, Ponnudurai [ 10] showed that x = 2, y = 1 and x = 6, y = 4 are the only solutions of equation (26) with a = 1, b = 3, k = 4, 2 = 0. Let us consider equation (26) with 2 = 0. We re-write equation (26) with

2=0 as (27) (axk-byk)+Al(axk-1-byk-1)+ +Ak-1(ax-by)+Ak(a-b) = 0 where A1,. .. , Ak are given by (28)

F(z)=(z+1)...(z+k)=zk+Alzk-1+

+Ak.

240

T.N. SHOREY

If a = b, we observe from (27) that (29)

(xk

- yk) + Al

(xk-1 _ yk-1) +

... + Ak-1(x - y) =0

which implies that x = y, since all the summmands in (29) are of the same sign. Now, we assume that a # b. Shorey [191 showed that there exists a number C6 depending only on a and b such that equation (26) with £ = 0, x > y+k and k > C6 implies that the first summand in (27) is positive and the second summand in (27) is negative (then all the summands in (27) following the second one will be negative). This is equivalent to saying that

equation (26) with e = 0 and x > y + k implies that either k < C6 or k = [a + 1] where

a =log

(a)/ log (y

We have not been able to exclude the latter possibility. This is the case if we allow C6 to depend also on P(x) and P(y).(2) In fact, Shorey [191 showed

that equation (26) with .£ = 0 and x > y + k implies that max(x, y, k) is bounded by a number depending only on a, b, P(x) and P(y). Saradha and Shorey [11) extended these results to equation (26) where $ is not necessarily equal to zero. Now, we give a sketch of the proof that equation (26) with x > y + .£ + k

implies that max(x, y, k, e) is bounded by a number depending only on a, b, P(x) and P(y). The proof depends on Gel'fond - Baker theory of linear forms in logarithms. We assume (26). By (26) and (28), we observe that

0 < aF(x) - bF(y)yl = Uk + AlUk_1 + ... + AkUO

(30)

where

Uj=ax'-by'+e for0
(31)

If Uk < 0, we observe from x > y that Uj < 0 for 0 < i < k which contradicts (30). Thus Uk > 0.

(32)

We write Cl, c2, ... , c12 for positive numbers depending only on a, b, P(x)

and P(y). We may assume that x > cl with cl sufficiently large. In view (2)

For an integer v > 1, we write P(v) for the greatest prime factor of v and we

put P(0) = P(1) = 1.

ONA CONJECTURE CONCERNING 8.9.10=6!

241

of the result of Shorey [ 191 mentioned above, we may suppose that f > 0 which implies that k + $ > 3. Then, it is proved in [ 11, Corollary 21 that

x - y< c2tx(log x)/k.

(33)

As in the proof of (5), we count the power of 2 on both the sides of (26) for deriving that

f
(34)

By (33) and (34),

x - y < c4x(log x)2/k.

(35)

On the other hand, we apply an estimate of Fel'dman (see Baker [2]) on linear forms in logarithms to obtain

x - y > x(log x)-`5

(36)

By (35) and (36), k < (log x)CB.

(37)

Now, we show that

y < (log x)".

(38)

We may assume that (39)

y > (k +

Q)4'

otherwise (38) follows from (34) and (37). Further, we derive from (26), (31), (32), (37) and (39) that (40)

0 < Uk < cs((k +

£)2yk+l-1 + k2x-1).

On the other hand, we apply again the estimate of Fel'dman on linear forms in logarithms for deriving that Uk > max(xk, yk+e) ((k + E) log

which, together with (34) and (37), implies that (41)

Uk > max(xk, yk+t) (log x)-°10

x)-C9

242

T.N. SHOREY

Finally, we combine (40), (41), (34) and (37) to conclude (38). By (37), we have

x > kP(x) > k-(x) where w(x) denotes the number of distinct prime divisors of x. Therefore, there exists a prime p dividing x such that pordp(x) >

(42)

k.

Now, we count the power of p on both the sides of (26) which we re-write as

a(x+l)...(x+k) _b(y+1)...(y+k+.2) (k +

k!

By (42), we obtain 1 < ordp (a (x +

P

1) - k! (x + k))

\

, ord,(a),

J

which sharpens (34) as : (43)

$ < Cu. l

By (26),

X < (y + k + f)1+(e/k) which, together with (37), (38) and (43), implies that x < c12. This completes the proof.

We refer to [111 for more results on equation (26). For example, it is proved in [ 11 ] that equation (26) with x > y + k + f implies that : x > C7k3(log k) -4 and

x-y>C8x2/3 where C7 and C8 are positive numbers depending only on a and b.

Manuscrit recu le 10 decembre 1993

243

ONA CONJEC7IJRE CONCERNING 8.9.10=6!

References [ 1 ] E.T. AvANEsov. - Solution of a problem on polygonal numbers (Russian),

Acta Arith. 12 (1967), 409-419. [2] A. BAKER. - The theory of linear forms in logarithms, Transcendence Theory: Advances and Applications, Academic Press (1977), 1-27. [3] R. BALASUBRAMANIAN and T.N. SHOREY. - On the equation f (x + 1)

f (x + k) = f (y + 1) . . . f (y + mk), Indag. Math. N.S. 4 (1993), 257-267. [4] D.W. BOYD and H.H. KISILEVSKY. - The diophantine equation u(u + 1)(u

+ 2) (u + 3) = v(v + 1)(v + 2), Pacific Jour. Math. 40 (1972), 23-32. [5] A. BRAUER and G. EHRLICH. - On the irreducibility of certain polynomials,

Bull. Amer. Math. Soc. 52 (1946), 844-856.

[6] J.H.E. COHN. - The diophantine equation Y(Y + 1)(Y + 2)(Y + 3) _ 2X (X + 1)(X + 2) (X + 3), Pacific Jour. Math. 37 (1971), 331-335.

[7] P. ERDOs. - Problems and results on number theoretic properties of consecutive integers and related questions, Proc. Fifth Manitoba Conf. Numerical Math. (Univ. Manitoba Winnipeg) (1975), 25-44. [8] R.A. Mac LEOD and I. BARRODALE. - On equal products of consecutive

integers, Canadian Math. Bull. 13 (1970), 255-259. [9] L.J. MORDELL. - On the integer solutions of y(y + 1) = x(x + 1) (x + 2), Pacific Jour. Math. 13 (1963), 1347-135 1. [ 101 T. PONNUDURAI. - I he diophantine equation Y (Y + 1) (Y + 2) (Y + 3) = 3X (X + 1) (X + 2) (X + 3), Jour. London Math. Soc. 10 (1975), 232-240. [11] N. SARADHA and T.N. SHOREY. - On the ratio of two blocks of consecutive

integers, Proc. Indian Acad. Sci. (Math. Sci.) 100 (1990), 107-132. [12] N. SARADHA and T.N. SHOREY. - On the equation (x + 1) . . . (x + k) _ (y+ 1) . . (y+mk) with m = 3, 4, Indag. Math., N.S. 2 (1991), 489-510. [13] N. SARADHA and T.N. SHOREY. - On the equation (x + 1) . . . (x + k) _ (y + 1) . . . (y + mk), Indag. Math., N.S. 3 (1992), 79-90. [14] N. SARADHA and T.N. SHOREY. - On the equation x(x + d)

(x + (k -

1)d) = y(y+d) . . . (y+(mk-1)d), Indag. Math., N.S. 3 (1992),237-242. [15] N. SARADHA and T.N. SHOREY. - On the equation x(x + d1)

(x + (k -

1)d1) = y(y + d2) . . . (y + (mk - 1)d2), Proc. Indian Acad. Sci. (Math. SO.), 104 (1994).

244

T.N. SHOREY

[16] N.

T.N. SHOREY and R. TIJDEMAN. - On arithmetic progressions

of equal lengths with equal products, Math. Proc. Camb. Phil. Soc. (1994), to appear. [17] N. SARADHA, T.N. SHOREY and R. TIJDEMAN. - On arithmetic progressions

with equal products, Acta Arith., to appear. [18] N. SARADHA, T.N. SHOREY and R. TIJDEMAN. - On the equation x(x +

(y + (mk - 1)d), m = 1, 2, Acta Arith., to appear. [ 19] T.N. SHOREY. - On the ratio of values of apolynomial , Proc. Indian Acad.

Sci. (Math. Sci.) 93 (1984), 109-116. [20] N. TzANAIUS and B.M.M. de WEGER. - On the practical solution of the Thue equation, Jour. Number Theory 31 (1989), 99-132. T.N. SHOREY

School of Mathematics Tata Institute of Fundamental Research Homi Bhabha Road Bombay 400 005, India

Number Theory Paris 1992-93

Redei-matrices and applications Peter Stevenhagen

1. - Introduction In this paper we describe an algebraic method to study the structure of (parts of) class groups of abelian number fields. The method goes back to the Hungarian mathematician L. Redei, who used it to study the 2-primary part of class groups of quadratic number fields in a series of papers [[ 18]-[24]] that appeared between 1934 and 1953. The case of the l-primary part of the class group of an arbitrary cyclic extension of prime degree l was studied by Inaba [[ 121, 19401, who realized that one should look at the class group as a module over the group ring. The matter was then taken up by FrOhlich 1[61, 19541, who generalized Inaba's results by extending Redei's quadratic method to the case of a cyclic field of prime power degree. In the seventies, generalizations in the line of Inaba were given by G. Gras 1110] ]. In all cases,

one studies 1-primary parts of the class group of an abelian extension for primes l that divide the degree. Recently, completely different methods have been developed by Kolyvagin and Rubin, showing that the structure of any l-primary part of the class group of an abelian field of degree coprime to l can be described 'algebraically'. For primes dividing the degree it is not yet clear whether the approach works. The Kolyvagin-Rubin methods can be seen as refinements of the analytic class number formula, and they are more general than the Redei-FrOhlich method as they work for most 1. On the other hand, they depend on the existence of infinite collections of auxiliary prime numbers, so effective versions of the (ebotarev density theorem are needed to yield deterministic algorithms. Because of this somewhat involved nature they can only be used in practice for abelian fields of very small degree. Moreover, the method does not give any clue as to the average behaviour that is to be expected when it is applied to a family of fields. For instance, it cannot be used to compute the class number h,+ of the maximal real subfield of the n-th cyclotomic field for any n that is not very small. Also, it does not tell us

P. STEVENHAGEN

246

whether the fact that h+ is either 1 or very small, which can be shown to be the case for all n < 200 having at least two prime divisors if one assumes the generalized Riemann hypothesis [[31]], should be seen as a common occurrence.

The Redei-Frbhlich method is based only on class field theory and therefore of a rather different nature. It can be used only to describe pprimary parts of the class group when p does divide the degree of the extension, which is exactly the case that had to be excluded before. We will see that it gives in this case rise to density statements telling us how many fields in a given infinite family will have some prescribed part in their class group. An example : the class number h3p is even for some explicit collection of primes p of Dirichlet density 1/16 and divisible by 3 for a collection of density 1/18.

An application of Redei's quadratic method that goes back to Redei himself concerns criteria for the solvability of the negative Pell equation x2 - Dy2 = -1. This is a question that is closely related to the behaviour of 2-class fields, as was made clear by Scholz [[25]]. We will discuss it in the two final sections of this paper.

2. - Redei-matrices In this section, K will denote a number field that is cyclic of prime degree

1 with Galois group G = Gal(K/Q). It is our intention to study the 1-part C of the class group of K. If l = 2, our convention will be that C is the 2part of the narrow class group of K. Correspondingly, we call an extension unramified if all finite primes are unramified. The difference between the narrow and the ordinary class group that may exist is of interest only when l = 2. It will be discussed in detail the following two sections. As C is a finite abelian 1-group with a natural G-action, it is a module over the group ring over the l-adic integers Zt [G]. The norm N = >geG 9 annihilates the class group, so we can study C as a module over ZI [G] IN. If (1 denotes a primitive l-th root of unity, we have an isomorphism Z1[G]/N -2- Zl[(l] Ors

> (l

showing that A = Z1 [G] IN is a discrete valuation ring whose maximal ideal is generated by a - 1, with or a generator of G. The residue class field of A is the finite field of l elements IF1. Every finite A-module M is isomorphic to a module of the form $

11 A/A(°-1)"i i=1

with n;, E Z>1 for i = 1, 2, ... , s. Thus, we can specify the isomorphism

REDEI-MATRICES AND APPLICATIONS

247

class of the A-module M by giving the sequence of integers dimF,(M(o-1)k-1/M(°-1)k).

rk = #{i: ni > k} =

Note that r1(M) = s and that {rk(M)}k is a decreasing sequence with rk(M) = 0 for k sufficiently large. The evaluation of r1 (C) amounts to doing genus theory for the field K. More precisely, class field theory associates to C an unramified extension H of K, called the 1-class field of K, for which the Galois group Gal(H/K) corresponds to an is canonically isomorphic to C. The quotient unramified extension H1 of K that is known as the genus field of K. It is the maximal unramified extension of K that is abelian over Q, and Gal(H1/Q) is isomorphic to the elementary abelian 1-group G x C/C°-1. If x denotes a Dirichlet character generating the character group X of G that corresponds to K, we can write x as a product x = flit= 1 xi, where t is the number of primes that ramifies in the extension K/Q and xi is a character of conductor a power of some ramifying prime pi and of order 1. The conductor of xi is equal to pi if and only if pi # 1. The field H1 corresponds to the group of Dirichlet characters C/Ca-1

Xi =

ftx.

i=1

It follows that Gal(H1/K) has order 1t-1, i.e. the (a-1)-rank r1 (C) is equal to t - 1, where t is the number of ramifying primes in K/Q. The subgroup Co = C[o, -1] of G-invariant ideal classes in C is known as the subgroup of _ambiguous ideal classes. As G is cyclic and C is finite, the order of CG = H° (G, C) equals the order of H1(G, C) = C/C°-1. It is not difficult to check that Cc is generated by the t classes [pi] of the ramified primes pi of K. The order of CG is 1t-1, so there is exactly one additional relation between these classes that is independent of the obvious relations [pti]=0. The Redei-Frohlich theorem gives a description of r2(C) by combining the two descriptions of ri (C). Note that as abelian groups, we have : C/C(o-1)2

C/C4 { C/Co-1 X C

if l = 2; 1/C(°-1)

2

if l> 2,

and that the 1-rank of C/C1 is equal to the sum EI-I rk. The theorem is based on the observation that r2 (C) can be obtained from an explicit description of the natural map

0 : C[v - 1]

C/Ca-1.

P. STEVENHAGEN

248

This is a homomorphism between elementary abelian 1-groups, so it can be viewed as a linear map between vector spaces over IF1. With this terminology, the (v - 1)2-rank of C is nothing but the Fl-dimension of the kernel of 0. This dimension can be given in terms of the rank of a certain matrix over F1, called the Redei matrix of K, as follows. 1. THEOREM (Redei-Friihlich). - Let K/Q be a cyclic extension of prime

degree l with group (v) and conductor f , and let X : (Z/ f Z) * -> IF1 be a generator of its character group, with values taken in the additive group IF1. Let pl, p2, ... , pt be the primefactors off , andX = Ei-1 Xi the corresponding decomposition of X. Then the 1-primary part C of the narrow class group of K has (Q - 1)2-rank r2 (C) = t - 1 - rankF, R, where the entries ai3 E IF1 of the Redei matrix R = (ai,j )i. j=1 are defined by : ai3 = Xi(pj)

if i

j;

t

Y. aid = 0. i=1

Proof : as C[v - 1] is generated by the classes of the ramified primes pi, we have a natural surjection p : Ft --f C[ai - 1] that maps the j-th basis vector ej to the class of p;. The group C/C°-1 is canonically isomorphic to the subgroup Gal(Hi/K) of Gal(Hi/Q) under the Artin map. We know H1 explicitly from genus theory : it is the compositum of the cyclic fields Q(Xi) of conductor a power of pi corresponding to the characters Xi. Each character Xi furnishes an isomorphism Gal(Q(Xi)/Q) -' F1, and they can be combined into an isomorphism EB 1Xi : Gal(Hi/Q) - lFi. The Redei map R : Fl -> Ff is defined as the composed map R : Fit

P . C[a - 1] - C/Ca-1 => Gal(Hi/K) C Gal(H1/Q) _

E)xi ------

Fit

of vector spaces over Fl. As the kernel of p is of dimension 1, one has (2)

r2 (C) = dime [ker 0] = dimF, [ker R] - 1 = t - 1 - rankF, R,

as desired. The image of a basis vector ej is the Artin symbol of p; in Gal(Hi/K). If i j, the restriction of this symbol to Gal(Q(Xi)/Q) is the Artin symbol of pj, and this is mapped to ai3 = Xi (pj) by Xi. For the diagonal entry aii the Artin symbol and Xi (pi) are not defined, but we can use the fact

that ®Xi maps Gal(Hi/K) to the hyperplane {(ai)i E IF' : Ei_1 ai = 0}. The desired identity F_i_1 ai3 = 0 follows immediately.

REDEI-MATRICES AND APPLICATIONS

249

The Redei matrix R is by definition a singular (t x t) -matrix since the sum of its rows is zero. It is said to have maximal rank if the rank equals t - 1.

Obviously, the rank is maximal if and only if r2 (C) = 0. We will meet this condition in the next section when investigating the solvability of the negative Pell equation.

The field H2 corresponding to the quotient C/C(°-1)2 is the central 1-class field of K, i.e. the largest unramified extension E of H1 that is normal over Q and for which the group extension

0 -p Gal(E/Hl) ---+ Gal(E/Q) -) Gal(Hi/Q) -40 is a central extension. In Frdhlich's terminology [[711, the central class field

H2 is a field of class two : its Galois group 11 = Gal(H2/Q) is not in general abelian but its lower central series has length at most two. This is equivalent to saying that the commutator subgroup [S2, S2] is contained in the center of S2, or that [ci, [SZ, Ii]] = 0. The Redei-FrOhlich method enables us to obtain Gal(H2/K) from very simple rational data. More precisely, we

can determine ri = ri(C) for i = 1, 2 in terms of the prime factors of the discriminant, and this leads to (A/A°-i)rl-r2 X (A/A(U-1)2).2 Gal(H2/K) =A f (Z/2Z)rl-r2 x (Z/4Z)r2 if l = 2; if 1 > 2. l (Z/JZ)rl+r2

The first isomorphism is an isomorphism of modules over the ring A = Z1 [G] IN, the second is an isomorphism of abelian groups.

3. - Applications As a first application, we will obtain divisibility results for the real cyclotomic class numbers hn of the type discussed in the introduction. Recall that h,+ is the class number of the maximal real subfield Fn = Q(Sn + (n-1) of the cyclotomic field of conductor n.

3. LEMMA. - Suppose that l > 2 and that the l -class group C of the field K in theorem 1 has (v - 1)2-rank r2(C) = r. Then jr divides h+, with n the conductor of K.

Proof : as K is real of conductor n, it is contained in Fn. The genus field H1 of K is equal to H n F, so H2Fn/Fn is an unramified abelian extension of Fn of degree [H2 : H1] = jr. This degree divides hn by class field theory, so we are done. This lemma provides us with an easy method of constructing infinitely many n for which hn is divisible by an arbitrarily high power of a prime number

250

P. STEVENHAGEN

1. One simply takes those n for which Fn contains a subfield K of degree l over Q for which the Redei-matrix is of rank much smaller than t - 1. By taking it equal to the zero matrix, the following result is obtained. 4. THEOREM. - Let n be divisible by t distinct primes congruent to 1 mod l

such that each of these primes is an l-th power modulo all others. Then divides h, .

It-1

For fixed t, there are infinitely many pairwise coprime n satisfying the hypothesis of the theorem. This follows easily from Dirichlet's theorem on primes in arithmetic progressions. If t -1 primes congruent to 1 mod l have

been chosen such that each is an l-th power modulo the others, the t-th prime that makes the hypothesis of the theorem hold true can be chosen 1)12(t-1)]-1 This follows from an infinite collection of Dirichlet density [(1from the Gebotarev density theorem, as the condition on the t-th prime is that for each of the t -1 previous primes p, it splits completely in the field Ep that is obtained by adjoining an l-th root of unity (l and to the subfield of degree l in the p-th cyclotomic field. The fields Ep are all of degree (l -1)12 over Q, and they are linearly disjoint over Q((l). As a very special case, we obtain the claim made in the introduction that h132 is divisible by 3 for a set of primes p of Dirichlet density 1/18. For odd 1, results similar to those in the preceding theorem have been proved by Cornell and Rosen [[3], 19841 using cohomological methods that

go back to Furuta [[9]]. For t = 2 and t = 3 their results are identical to those following from lemma 3, for large t their method is better. Neither method gives any result for the prime conductor case t = 1. For l = 2, the lemma and the arguments given above have to be adapted for several reasons. First of all one has to take care of the ramification of real primes. This leads one to consider only those quadratic fields K that have real genus fields, i.e. real quadratic fields for which all odd prime divisors of the discriminant are congruent to 1 modulo 4. One only obtains a divisibility result 2r-1 h ,which is weaker than lemma 3. In order to find 2T hn one needs to show that H2 is real. This can sometimes be done using a method of Scholz [[25]]. Secondly, one has to adapt the density computation given above as the fields Ey have smaller degree, a statement equivalent to the quadratic reciprocity law. Rather than working out all details here, we give a characteristic example. It is stronger than the cohomological result in [[3]].

5. THEOREM. - Let p and q be primes congruent to 1 modulo 4 that are mutual quadratic residues, and suppose that the fourth power residue symbols (q) 4 and (P) are equal Then hr9 is even. 4 It is not difficult to see that we obtain the claim in the introduction for q = 13. For n = pq = 5 29 = 145 it follows that hi45 is even. This is one

REDEI-MATRICES AND APPLICATIONS

251

of the two values n < 200 with n not a prime power for which h+ > 1. In fact, one can use Odlyzko's discriminant minorations to show [13111 that h145 = 2.

There are no results of a similar algebraic nature in the prime conductor case t = 1, and we cannot produce infinite families of primes p for which hP is even. See [12811 for a more complete discussion.

A second application of the technique of Redei matrices arises in the study of the solvability in integers of the negative Pell equation x2 - Dy2 = -1, where D > 1 is a squarefree integer. With ED a fundamental unit in Q(om) and N the norm to Q one has x2

- Dye = -1 is solvable in integers e= NED = -1.

Indeed, if the equation is solvable there are units of norm -1, so the fundamental unit cannot have norm + 1. Conversely, if NED = -1 it may be that ED is not in Z[/], but as its cube ED always is we still get an integral solution to the equation. As it is more natural to work with discriminants

than radicands, we will further take D to be a quadratic discriminant and say that the negative Pell equation is solvable for D if the equation x2 - Dy2 = -4 has integral solutions. If the equation is solvable for D, then D is positive and -1 is a quadratic residue modulo every prime divisor

of D, so D must be in the set D of real quadratic discriminants that are not divisible by any prime congruent to 3 mod 4. A question that has been studied by many people but that is still completely open is the following. 6. PROBLEM. - Let D(-1) C D be the set of real quadratic discriminants for which the negative Pell equation is solvable. Decide whether the limit lim X-

#{D E D(-1) : D < X} 00

#{DED:D<X}

exists and if so, determine it.

This is a very hard problem, and to my knowledge is is not even known whether the liminf and the limsup of this expression are in the open interval (0,1). The relation between the solvability of the negative Pell equation and the previous section is given by the following immediate consequence of class field theory.

7. LEMMA. - The negative Pell equation is solvable for a quadratic discriminant D if and only if the narrow 2-Hilbert class field of Q(%) is real.

P. STEVENHAGEN

252

Proof : both statements are equivalent to the fact that Q(v) is a real field for which the narrow Hilbert class field coincides with the ordinary Hilbert class field.

Let H be the narrow 2-Hilbert class field of K = Q(/D). This is the situation of the preceding section, with l equal to 2. From the lemma, we see that the negative Pell equation is solvable for D if and only Hk is real for all k > 1. The condition that the genus field H1 is real is equivalent to the requirement that D is in D, since H1 is obtained from K by adjoining a square root of (-1)(P-X)/2 for each odd prime divisor p of D. If H = H1 this condition is also sufficient for solvability of the negative Pell equation. 8. LEMMA. - The negative Pell equation is solvable for D E D if the Redei matrix of Q (v/D) has maximal rank

Proof : the condition implies an equality H1 = H2, so H = H1 is real and we are done by the previous lemma. If D E D has t distinct prime divisors, the corresponding Redei matrix R is by the quadratic reciprocity law a symmetric (t x t)-matrix over F2 whose

rows and columns add up to zero. Let R' be the (t - 1) x (t - 1)-minor obtained by leaving out the last row and column from R. If D ranges over the subset Dt of D consisting of those discriminants that have exactly t distinct prime divisors, it is intuitively clear that the corresponding Redei minor R'D behaves like a random symmetric (t - 1) x (t - 1)-matrix over ]F2, i.e. that 1imX-00

#{DEDt:D<X and R'D=S} #{DEDt:D<X}

exists and does not depend on the choice of the symmetric matrix S. The statement is a reformulation of the fact that the vector consisting of the (2) Legendre symbols (P) of an element D = p1p2 ... pt is randomly distributed as a function on Dt. The details of a correct proof are not trivial. Redei's original proof [[22]] proceeds by induction on t, and so does the proof of the rediscovery of the result in [[5]]. There is also an easy way out by adapting the notion of density [[17]].

Once one knows that a discriminant D E Dt gives rise to a random symmetric matrix, one can determine how likely it is that such a matrix is non-singular. We give a slightly more general result for future reference. 9. PROPOSITION. - Let n > 1 be an integer and q a prime power. Then there are An(q) = q(n21)

fl

1
(1 - q-k)

R$DEI-MATRICES AND APPLICATIONS

253

symmetric (n x n) -matrices over the field of q elements IFq that are nonsingular. The number of matrices of arbitrary rank r c {O, 1,... n} equals [T] gAr(q), where 1'r1 q denotes the number of r-dimensional subspaces of a vector space of dimension n over IFq.

Proof : the result for q = 2 occurs in rather cumbersome terminology and with a lengthy proof in 112211. A completely elementary proof by induc-

tion on n can be found in [[131]. In order to see that the statement given there is identical to ours one needs the explicit value 1(q` - 1)

fn

flr 1(qt - 1) flti 1 (qi

Lr] q

The first half of the proposition immediately implies the second half, as symmetric matrices correspond bijectively to symmetric bilinear forms and giving a symmetric bilinear form of rank r on V = IFq is equivalent to giving

a subspace W C V of dimension n - r and a non-degenerate symmetric bilinear form of the factor space V/W. This remark also shows that the numbers An (q) can be computed inductively from the relation n

rnl

r-o r q

AT(q) = q(n21),

so it suffices to check that the given expression satisfies this relation. An elegant way of doing this is given in [[511.

We will only use the preceding proposition for q = 2 and n = t - 1, so we write At_1(2) = 2(2)at with [t/2] at = fJ(1 -

21-2j)

j=1

Set Dt(-1) = Dt n D(-1). We now know that for fixed t, we have a lower density for Dt(-1) in Dt, since the two preceding lemmas imply liminf

#{D E Dt(-1) : D< X}

#{DEDt:D<X} >at>am

00

-2j). j=1

The numerical value a, = .4194224... is already in 1[2211. The density result has been reproved in [[ 1711, [[1 Ifl and [[5]]. The formulation given by

these authors is different from Redei's, as they interpret the Redei matrix as an incidence matrix of a graph on t points.

P. STEVENHAGEN

254

The equations D = Ut>1Dt and D(-1) = Ut>1Dt(-1) make it very plausible that the lower density of D(-1) in V is not smaller than a.. However, I do not know how to prove this. The problem is that each Vt is a subset of zero density in V, and it seems non-trivial to prove a density result for D from a density result for each of the subsets Dt. The preceding argument can be further refined in order to obtain a still higher value of the lower density of Dt (-1) in Dt. In particular, we will push the limit value for t -* oo over the value 1/2 that has been suggested as a possible value [[ 17]]. However, we need to pass to the next higher level, i.e. the field H3, in order to do this.

4. - Higher levels In principle, the Redei-Frohlich method for determining r2 (C) can be extended to determine inductively all values rk (C). Having defined the first Redei map

R = R1 : Ft --> C[v - 1] -- C/Co-1 -4 Fit, one can repeat the procedure and consider the higher Redei maps Rk :

kerRk_1 -> C[Q -1] n

C(o_1)k

Just as in the case k = 1, we obtain the (o -

1

can

C(0-1)k-1

/C(o-1)k.

1)k+1-rank from this map by

rk+1(C) = dimF, [ker Rk] - 1 = rk (C) - rankF, Rk,

which is the analogue of (2). Despite the close analogy, a serious complication arises for these higher levels. For k = 1, we were able to embed c(a-1)k-1 /C(a-1)k in a canonical way in a vector space of dimension t over ]Fl. This was due to the fact that we could describe the genus field H1 very explicitly in terms of Dirichlet characters. The fields Hk for k > 2 are no longer abelian over Q, and no general method is known to describe them explicitly. This is a serious drawback that accounts for the fact that there is no generalisation of theorem 1 to higher levels that is of a comparable simplicity. For the same reason, we do not have general density results for these levels that resemble those in the preceding section.

Only in the special case where k = 1 = 2, there is a more explicit version of the theory that goes back to Redei [[22]] and was further developed by Frohlich [[7]]. We can formulate it in modern terms as follows.

Let D be a quadratic discriminant, and D = jlt=i dg its factorization into prime power discriminants. The set V of discriminantal divisors of D is defined as the set of divisors d of D of the form d = llt=1 d7 with ei E 10, 1}. This is in a natural way a vector space of dimension t over F2 with a canonical basis consisting of the divisors d1. The natural surjection

REDEI-MATRICES AND APPLICATIONS

255

V -> C[2] maps dti to the class of the ramified prime ai of K that divides dti. As the genus field H1 of K = Q(v/D) is generated over Q by the square roots dti, Kummer theory tells us that the Galois group Gal(Hi/Q) can be seen as the dual space V* = Hom(V, IF2) of V. The kernel of the Redei map R1 : V -- V * consists of divisors d E V for which the associated Artin symbol oa E Gal(H/K) is the identity on H1. The kernel of the dual map Ri : V = V** -> V* consists of those d E V for which Vd- is left invariant by the Artin symbols of all ideals that have order 2 in the class group. Note that D itself is always in this kernel. A decomposition D = Dl D2 with D1 E ker Ri is called a decomposition of the second kind. These decompositions are characterized by the fact D1 is a square modulo all prime divisors of D2 and vice versa. The prime 2

needs special attention here. Given a decomposition D = Dl D2 that is of the second kind, Redei explicitly constructs a quadratic extension of Q( Dl D2) that is cyclic of degree 4 and unramified over K. This is possible since the equation x2 - D1y2 - D2 Z2 = 0 has non-zero rational solutions by Legendre's theorem and the assumption on the decomposition. For a primitive integral solution (x, y, z) with well-chosen 2-adic behavior, the extension E that is generated over K by a square root yD, of x + y D has the desired properties. The extension E/K depends on the choice of the solution, but the quadratic extension EH1 = H1(ryD1) of H1 does not. Every element v E Gal(H2/Hi) is determined by its action on the elements 'D, for D1 E kerRi, so we can view this Galois group as a subspace of Hom(kerRi,IF2) _ (kerRi)*. With these identifications, we can describe the second Redei map : R2 :

kerRi -> C[2] fl C2 -> C2/C4

Gal(H2/Hi) C (kerRi)*

explicitly as an F2-linear map between vector spaces of dimension 1+r2(C)The 8-rank r3(C) of the narrow class group of K is given by the formula r3 (C) = r2 (C) - rankF, R2,

which is non-negative as R2 is always singular. As soon as one chooses a basis for ker R1 and for the space ker R*1

of decompositions of the second kind, R2 is given by a matrix whose entries describe the action of the Artin symbols va coming from d E ker R1

on explicit elements 7d' with d' E ker Ri . Note the equality R1 = Ri in case D is in the set D of discriminants that are of interest for the negative Pell equation. The entries of R2 are quadratic symbols of quadratic

irrationals and can be computed rather easily. Redet's paper [[2211 has numerous identities that express these `new number theoretic symbols', as he calls them, in rational terms, and FrOhlich 11811 does the same in a

P. STEVENHAGEN

256

more systematic way. However, these expressions are usually given in terms of the chosen solution of x2 - D1y2 - D2 Z2 = 0, and this makes it difficult to obtain density results in terms of the prime divisors of D. Special cases have been dealt with by Morton [[141-[1611, and density statements for the

behavior of C/C8 have been proved by the author [[26]] in the case that t - 1 prime divisors of the discriminant are fixed and the last one varies. For t = 2 this yields results that had been known for some time. In the previous section, we showed that the negative Pell equation is solvable

for all D in a subset of density at of Dt since these D have r2 (C) = 0. Following an idea that goes back to Redei [[21]] and Scholz [[25]], we can use the 8-rank theory to enlarge this set even further by looking at those D that have r2 (C) = 1. We will indicate briefly how this is done.

The density Qt of the set of D E Dt having r2 (C) = 1 follows easily from proposition 9. One has ,Qt = at if t is even and pt = (1 - 21-t)at if t is odd. Note that limt_,00)3t = limt_,00 at = a... For D as above, there is exactly one non-trivial decomposition D = D1D2, and one can show that the higher Redei matrix R2 equals R2

-

D2 4 \ (D2) 4

(D I /4

which has to be interpreted in the obvious way as a matrix over ]F2. As the biquadratic residue symbols have value f1, there are 4 possible values for this matrix, and they each occur for a set of D that has density .114,Qt in Dt. If (D )4 and (D )4 are both equal to 1, the matrix R2 is the zero matrix and H2 is strictly smaller than the 2-Hilbert class field H. In all other cases,

its rank is one and H = H2. We can determine whether H2 is real by a generalization of the argument used in proving theorem 5. One has :

H2isreal

\Da)4=1. (:;-)

It follows that H = H2 is totally complex and the Pell equation is not solvable if the biquadratic residue symbols are not equal, which happens for a collection of discriminants of density,Qt/2 in Dt. If both symbols equal

-1, the Pell equation is solvable, and this happens for a set of density Qt/4. Taking together the two collections of D for which the Pell equation is solvable, we conclude that for each fixed t, the set Dt (-1) has lower density at + 1)3t and upper density 1 - 2 Qt inside V. For increasing t these values rapidly converge to : s 4

a,, = .52428.. .

1 - 2 a = .79029.. .

REDEI-MATRICES AND APPLICATIONS

257

It remains a challenging problem to deduce any non-trivial density result for D(-1) in D. Hardly anything is known about the distribution of the ranks rk(C) when

k > 4. Some numerical data are available in the quadratic case 1121, [27]], mainly for cyclic C. The best known example is probably that of the quadratic field Q(v/=p), with p a prime congruent to 1 mod 4. In this case

the discriminant D = -4p has two prime divisors and C is a non-trivial cyclic 2-group. It follows from theorem 1 that the order of C is divisible by 4 exactly when p - mod8, and the 8-rank results quoted above imply that the order is divisible by 8 if and only if p splits completely in Q((s, V/-1-+-i),

which is a non-abelian field of degree 8 over Q. All numerical evidence suggests strongly that the order of C is divisible by 16 for a set of primes of density 1/16, but the existing techniques do not even suffice to show that this happens infinitely often. The question is closely related to the 2-adic behavior of the fundamental unit e, in the field Q(J), see [[27]].

Note added in proof It is now conjectured that the limit value in problem 6 exists and equals 1 - a,, = .5805775582..., with a,,, as in section 3, see [[29]]. The heuristics

can be extended to the case of quadratic orders [[30]]. They have been confirmed by extensive computer calculations [[1]].

Manuscrit recu le 7 septembre 1994

P. STEVENHAGEN

258

References [1] W. BosMA, P. STEVENHAGEN. - Density computations for real quadratic

units, preprint (1994). [2] H. COHN, J.C. LAGARIAS. - On the existence of fields governing the 2-

invariants of the class group of Q(Vl p) as p varies, Math. Comp. 41, 711-730 (1983). [3] G. CORNELL, M.I. RosEN. - The $-rank of the real class group of cyclotomic

fields, Compositio Math. 53, 133-141 (1984). [5] J. E. CREMONA, R.W.K. ODONI. - Some density results for negative Pell

equations; an application of graph theory, J. London Math. Soc. (2) 39, 16-28 (1989). [6] A. FROHLICH. - The generalization of a theorem of L. Redei's, Quart. J. Math. Oxford (2) 5, 130-140 (1954). [7] A. FROHLICH. - On fields of class two, Proc. Lond. Math. Soc. (3) 4, 235-256 (1954). [8] A. FROHLICH. - A prime decomposition symbol for certain non Abelian numberfields, Acta Sci. Math. 21, 229-246 (1960). [9] Y. FURUTA. - On class field towers and the rank of ideal class groups, Nagoya Math. J. 48, 147-157 (1972).

[10] G. GRAs. - Sur les 1-classes d'ideaux dans les extensions cycliques relatives de degre premier 1, Ann. Inst. Fourier, Grenoble 23,3, 1-48 (1973).

[111 J. HURRELBRINK. - On the norm of the fundamental unit, preprint, Louisiana State University (1990). [12] E. INABA. - Uber die Struktur der i-Klassengruppe zyklischer Zahlkorper vom Primzahigrad 1, J. Fac. Sci. Imp. Univ. Tokyo, section I, vol. W 2, 61-115 (1940). [13] J. MACWILLIAMS. - Orthogonal matrices over finite fields, Amer. Math. Monthly 76, 152-164 (1969). [14] P. MORTON. - Density results for the 2-classgroups of imaginary quadratic fields, J. reine angew. Math. 332, 156-187 (1982).

[15] P. MORTON. - Density results for the 2-classgroups and fundamental units of real quadratic fields, Studia Scientiarum Math. Hungarica 17, 21-43 (1982). [16] P. MORTON. - The quadratic number fields with cyclic 2-class groups, Pac. J. Math. 108, 165-175 (1983).

REDEI-MATRICES AND APPLICATIONS

259

[17] R. V. PERLis. - On the density of fields with N(e) _ -1, preprint, Louisiana State University (1990). [18] L. REDEI, H. REICHARDT. - Die Anzahl der durch 4 teilbaren Invarianten

der Klassengruppe eines beliebigen quadratischen Zahlkorpers, J. reine angew. Math. 170, 69-74 (1934). 1191 L. REDE!. - Arithmetischer Beweis des Satzes fiber die Anzahl der durch vier teilbaren Invarianten der absoluten Klassengn.ippe im quadratischen Zahlkorper, J. refine angew. Math. 171, 55-60 (1935).

[20] L. REDEI. - Uber die Grundeinheit and die durch 8 teilbaren Invarianten der absoluten Klassengruppe im quadratischen Zahlkorper, J. reine angew. Math. 171, 131-148 (1935). [211 L. REDEI. - Uber einige Mittelwertfragen im quadratischen Z iilkorper, J. reine angew. Math. 174, 131-148 (1936). [22] L. REDE!. - Ein neues zahlentheoretisches Symbol mitAnwendungen auf die Theorie der quadratischen Zahikorper, J. reine angew. Math. 180, 143 (1939).

[231 L. REDEI. - Bedingtes Artinsches Symbol mit Anwendungen in der Klassenkorpertheorie, Acta Math. Acad. Sci. Hung. 4, 1-29 (1953). [241 L. REDEI. - Die 2-Ringklassengruppe des quadratischen Zahlkorpers and die Theorie derPellschen Gleichung, Acta Math. Acad. Sci. Hung. 4, 3187 (1953). [251 A. SCHOLZ. - Uber die Losbarkeit der Gleichung t2 - Due = -4, Math. Zeitschrift 39 [261 P. STEVENHAGEN. - Class groups and governing fields, Publ. Math. Fac.

Sci. Besancon, annee 1989/90, 1-94 (1990). [27] P. STEVENHAGEN. - On the 2-power divisibility of certain quadratic class

numbers, J. Number Theory 43 (1), 1-19 (1993). 1281 P. STEVENHAGEN. - Class number parity for the p-th cyclotomic field, Math. Comp. 63 no. 208 (to appear, 1994). [291 P. STEVENHAGEN. - The number of real quadratic fields having units of negative norm, Exp. Math. 2 (2), 121-136 (1993). [301 P. STEVENHAGEN. - Frobenius distributions for real quadratic orders, J. Theorie des Nombres Bordeaux (to appear, 1995). [31] F.VAN DER LINDEN. - Class number computations of real abelian number fields, Math. Comp. 39, 693-707 (1982). Peter Stevenhagen Faculteit Wiskunde en Informatica Plantage Muidergracht 24 1018 TV Amsterdam, Netherlands e-mail : psh@fwi . uva. n1

Number Theory Paris 1992-93

Decomposition of the integers as a direct sum of two subsets R. Tijdeman

1. - Introduction Two subsets A and B of a set C induce a decomposition of C if every

element of C has a unique representation a + b with a E A, b E B. Notation : C = A T B. We call A and B complementing C-pairs. A first study of such pairs arose in the forties from Hajos' proof of Minkowski's conjecture on systems of linear inequalities. Hajos reduced this conjecture to an equivalent statement on decompositions of finite abelian groups, which he was able to prove. A survey of the work on decompositions of finite abelian groups is given in Section 2. The question of characterising all complementing Z-pairs seems first to have been stated by de Bruijn in 1950. De Bruijn came to the problem while he studied bases for the integers. Let A be a finite set of integers including 0. A set of integers {bl, b2,. ..I is called an A-base whenever any integer x can be expressed uniquely in the form 00

x=

00

Eibi

i=1

(Ei E A,

IEiI < oo). i=1

if it can be rearranged in the form h2d3.... } where h denotes the cardinality of A and dl, d2, d3, .. .

An A-base is called simple {dl, hd2,

are integers. De Bruijn [21 considered the special case where the elements of A have no common factor and where h is a prime. He conjectured that under these assumptions A ® B = Z implies that B is the set of multiples of h. He remarked that a proof of his conjecture would imply that every A-base is simple. For later work on A-bases we refer to de Bruijn [61, Long and Woo [161, Swenson and Long [281. In 1974, Swenson 1271 showed that there is no effective characterisation of all complementing i-pairs. More precisely, he showed that any two finite

sets of integers A, B with the property that all sums a + b (a E A, b E B)

R. TIJDEMAN

262

are distinct, can be extended to two infinite complementing Z-pairs. For a similar construction, see Post [20]. In contrast, there is a particularly nice characterisation of all complementing Z>o-pairs. The result, which was implicit in the work of de Bruijn [5], was rediscovered by Vaidya [30]. It is obvious that A fl B = {0} and 1 E A U B. Suppose 1 E A. Then A and B are infinite complementing Z>opairs if and only if there exists an infinite sequence of integers {mi}i>1 with mi > 2 for all i, such that A and B are the sets of all finite sums of the form 00

CK)

a = E x2iM2i, b = E x2i+111'I2i+1 i-o

i-o

i

where 0<xi<mi+1fori>0andMO=1andMi= flmj fori> 1.If j=1

A or B is a finite set, a similar characterisation holds with the change that the sequence {mi} will be of finite length r and the only restriction on xr is that it be nonnegative. C.T. Long [ 151 gave a corresponding characterisation of all complementing C-pairs in case C = {0, 1, ... , n - 1}. He also showed

that in this case the number C(n) of complementing C-pairs is the same as the number of ordered nontrivial factorisations of n. The number C(n) is determined by

2C(n) = E C(d)

(n E Z>1).

din

Hansen [ 14] and Niven [ 181 generalised these results to a characterisation of the complementing pairs of the set Z>o x Z>o. Long [ 151 made the interesting

observation that it follows from the above characterisation that if A and B are infinite sets such that A ®B = Z>o then A ®(-B) = Z (see also Brown [1]). In particular, we can take for A the set of finite sums of odd powers of 2 and for B the set of finite sums of even powers of 2. Eigen and Hajian [31] showed that if A and B are infinite sets such that A ®B = Z>o, then there exists a continuum number of sets b such that A ® B = Z.

In Section 3 we formulate a conjecture which, if true, provides an inductive characterisation of all complementing sets A, B for which the cardinality of A is fixed integer n. It was already observed by Hajos [12] and de Bruijn [5] p. 240 that B is periodic if A is finite. In this way the problem is reduced to a finite problem which can be stated in terms of finite cyclic groups. If n is a prime number, then our conjecture coincides with de Bruijn's one stated above. This conjecture was proved by Sands [24] in 1957. We shall show how a proof of our conjecture can be derived from Sands' results if n is a prime power. The general case remains open.

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

263

We shall further show by a combinatorial argument that if m is coprime to

n and A ®B = Z, then mA ®B = Z (we define mA = {mala E A}). On using this result we give an alternative proof of de Bruijn's conjecture.

The problem of characterising all sets B such that A $ B = Z in the special case where A consists of the finite sums of odd powers of 2 was posed to me by Yu. Ito. He was interested in the problem because of his joint research with S. Eigen and A. Hajian on exhaustive weakly wandering sequences for ergodic measure preserving transformations [71, [91, [8]. Such a characterisation will be presented in Section 4.

2. - Decomposition of finite abelian groups About one century ago Minkowski [ 171 proved the following fundamental

result on the geometry of numbers : Let h1i ... , n be homogeneous linear forms in the variables x1, ... , xn with real coefficients and determinant 1. Then there exist integers x1,. .. , xn, not all zero, such that Ill

(1)

1,...,IU

1.

Since L;1i ... , n may have integer coefficients, the equality signs cannot be deleted. Minkowski conjectured that (1) can be replaced by IS1I < 1,...,ISnI < 1

(2)

unless at least one of the linear forms has integer coefficients. Minkowski proved the statement for n < 3. Several mathematicians worked on it and in 1940 it was known to be true for n < 9. In 1941 Hajos [11] established Minkowski's conjecture in the affirmative. His proof consists of three parts : (1) reduction to some equivalent geometric statement on k-multiple lattice tiling of the unit cube, (ii) further reduction to the equivalent group theoretic statement given below.

(iii) proof of this group theoretic statement.

Hajos' theorem is very fundamental and has various aspects. Fary [101 reformulated it as a result on the structure of commutative compact topological groups. Now we state Hajos' result in terms of group theory. Let G be a finite abelian group with unit element 1. A subset of G is called a simplex if it is of the form {1, a, a2, ... , ae-1 } where a E G has order > e. Notation [a]e, or briefly [a]. It is clear that [a] is a subgroup of G if and only if a has order e. We say that G is the free product of the sets Al, ... , A. if every element

of G has a unique representation a1 Hajos' theorem reads as follows.

an with aj E A; for j = 1, .

.

.

, n.

R. TIJDEMAN

264

If G is the free product of n simplices, then at least one of the simplices is

a subgroup of G. Hajos' proof has been simplified by Redei [22] and Szele [29]. Szele [29] p. 57 conjectured that Hajos' theorem would hold true for any decomposition of the finite abelian group G. A simple example (cf. [ 131 p. 185) shows

that this is false. Let G be the cyclic group defined by a8 = 1 and let A = {1, a2}, B = {1, a, a4, a5}. Then none of A and B are subgroups of G whereas G is the free product of A and B. Note, however, that a4B = B. A subset A of G is said to be periodic whenever there exists an element g E G, g 1, such that gA = A. De Bruijn [2] conjectured that if G is a finite abelian group of order > 1 and G is the free product of the sets A and B, then A or B is periodic. He observed that the assertion is not true if G is the infinite cyclic group generated by g. Szele (cf. [ 131 p. 185) made

the same observation. He took for A the product of the subsets {1,g2}, { 1, g32}.... and for B the product of the subsets 11, g-1 }, 11, g-4 }, {1, g-16}, .... Here G is the free product of A and B and none of A and B is periodic. { 1, g8 } ,

Some years earlier, however, Redei [21] had published two examples of Hajos which show that de Bruijn's conjecture is false. The simplest example refers to the abelian group generated by the elements a, b, c of orders 4, 4, 2 respectively. This group is the free product of the nonperiodic sets {1, a}, {1, b} and {1, a2, ab2, a3b2, c, a2bc, a2b3c, b2c}.

Later, Hajos [ 131 showed that any finite cyclic group the order of which is the product of three pairwise relatively prime numbers > 1, two of which are composite, can be represented as the free product of two nonperiodic subsets. He gave an explicit example of order 180 = 9 x 4 x 5, which is the smallest number satisfying the conditions. Let us follow de Bruijn in calling a group good if any factorisation of G as a free product of A and B implies that A or B is periodic and otherwise bad. De Bruijn [3] extended Hajos' result by showing that if n = dld2d3 with (d1, d2) = 1, d3 > 1 and both d1 and d2 are composite numbers, then the cyclic group of order n is bad. The smallest order of this type is 72. De Bruijn [4] gave the explicit example (g72 = 1) g18 926 g34

90 98, 916 B : g12, 917 918 921 924 941 945 948 954 960 965, 969 A :

,

On the other hand, cyclic groups of the following orders have been proved to be good (p, q, r, s are distinct primes) : p' ' (A > 1) (Hajos [12]), pq, pqr (Redei [231), p"q (A > 1) (De Bruijn [41), p2g2, p2qr and pqrs (Sands [241).

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

265

This covers all cyclic groups. Already in 1947 Redei [211 had shown that the non-cyclic group of order p2 is good. The problem was completely solved by Sands [25, 261 who determined all good finite abelian groups. Sands [241

further proved that if the finite cyclic group G is the free product of the subsets A and B and the cardinality of A is a prime power, then either A or B is periodic. This had been conjectured by de Bruijn ([3] p. 371) for the case that the number of elements of A is prime. Hajos 1 121 proposed the question whether every decomposition of a finite abelian group G is quasiperiodic. A factorisation of G as free product of A and B is called quasiperiodic if either A or B, B say, can be split into a number of parts B1, B2, ... , B,,,, (m > 1) such that ABi = giAB1 (i = 1,. .. , m) where the elements gl,... , gn form a subgroup of G. De Bruijn's example is quasiperiodic as we can take B1 = {g12 917 918 924 941 965} and gi = 1, 92 = g36. Hajos' example is quasiperiodic as we can take A = {1, a, b, ab}, B1 = {1, a2, ab2, a3b2}, B2 = {c, a2bc, a2b3c, b2c} and gi = 1, g2 = c.

De Bruijn [4] obtained some partial result on Hajos' question.

3. - A is finite Suppose A E B = Z where A is finite. Let A = {ao, a1,. .. , an_i }. Since,

for any integer x, we have (A - x) E B = Z, we may assume without loss of generality that 0 = ao < al < ... < an_i. If x = a + b with a E A, b E B, put (x)A = a, (x)B = b. The following result of Hajos [121 and de Bruijn 121 p. 240 reduces the problem of characterising all complementing Z-pairs to a problem on finite sets which can be stated in terms of finite cyclic groups. LEMMA 1. - The sequence {(x)A}XEZ is periodic. If the period length is L

then n divides L and B is periodic mod L.

Proof : put M = an_i. Consider the nM + 1 vectors ((i)A, (i + 1)A, ... , (i + M - 1)A) for i = 0,1, ... nM.

By the box principle at least two vectors are equal, for i = s and i = t with

s < t, say. Hence (x)A = (x+t-S)A forx = s,s+1,...,s+M- 1. Suppose k is the smallest integer with k > s + M and (k)A

(k + t - $)A

-

If (k)A # 0, then put k = a + b with a E A, b E B. We infer that (b)A = 0 and s < k - M < b < k. Hence (b + t - s)A = (b)A = 0 which implies b + t - s E B. Since k + t - s = a+ (b + t - s), we obtain (k + t - S)A = a = (k)A, a contradiction. If (k + t - s)A 0, a similar argument yields a contradiction. Thus (x)A = (x + t - s)A for all x > s. By

R. TIJDEMAN

266

symmetry we also have (x)A = (x+t-s)A for all x < s. Let the period length of {(x)A}xEZ be L. Since Z = U o 'jai + B}, all ai have the same density in the sequence {(x)A}xEZ. Therefore, they occur with the same frequency in one period. This implies n1 L. Since the elements of B are precisely the integers x with (x)A = 0, we have that B is periodic mod L. 0 By simple transformations each complementing Z-pair A, B with A

finite can be reduced to the standard situation that A is represented by < an_1 < L with gcd(ao, al, ... , an-1) = 1 and B is ao = 0 < a1 < a2 < < bn,._ 1 < L. Here L = nm. represented by U'- 1(bi +ZL) with 0 < b1 < Namely, 0 = a + b for some a E A, b E B. By taking A - a in place of A and B + a in place of B we have 0 E A n B. If gcd(ao, a1, ... , an_1) = d > 1, then 7G=

(3)

A

®Bjd

j forj=0,1,...,d-1

where Bj = {b E Bib - j(modd)}. The elements of A/d are coprime and we have obtained d complementing Z-pairs with coprime a's. It is obvious that B can be represented as indicated. Without any trouble we can add or subtract multiples of L from the elements of A to obtain the required structure. The problem is now reduced to the decomposition problem for the cyclic group of residue classes mod L (where we only know that L is a multiple of the cardinality of A).

It will be clear from the previous sections that a characterisation of all complementing i-pairs is not a simple matter, even if we assume one of the subsets to be finite. However, in the latter case, a kind of inductive characterisation would be possible, if we could prove the following

statement. CONJECTURE. - If A®B = Z, 0 E An B, gcdaEA a = 1 and A has exactly

n elements, then there exists a prime factor p of n such that all elements of B are divisible by p.

Suppose the statement is true. Then the elements of A are equally distributed among the residue classes mod p. We can make a splitting in

p complementing i-pairs as indicated in (3), with d = p and A and B interchanged. So the problem is reduced to the decomposition problem for

the cyclic group of residue classes mod(L/p) and the procedure can be repeated. The following examples show that p is not determined by L and n.

L=12, n=6, A=10,1,4,5,8,91, B = {0, 2(mod 12)}, p=2; L=12, n=6, A=10,1,2,6,7,81, B = {0, 3(mod l2)}, p=3.

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

267

If the conjecture is true, then every such decomposition is quasiperiodic

in accordance with Hajbs' conjecture stated at the end of the previous section. For we can split A into residue classes mod p, Ao, &... , A,-1, say, and Ai +B = pZ+i for i = 0, 1, ... , p-1. If the number n of elements of A is a prime, then the conjecture implies that A represents a complete residue system mod p and every element of B is divisible by p. This is precisely the conjecture of de Bruijn stated in the introduction. A proof of this conjecture can be obtained by combining results of de Bruijn and Sands. De Bruijn ([2)), p. 241) provided an argument which implies that his conjecture is true if the following statement is true : if the finite cyclic group G is the

free product of the subsets A and B and the cardinality of A is prime, then either A or B is periodic. As remarked in the previous section, the latter statement was proved by Sands. I shall present a completely different proof of de Bruijn's conjecture (Theorem 2). Subsequently I shall extend

de Bruijn's argument to the case where the cardinality of A is a prime power. By combining this with Sands' general result we obtain a proof of my conjecture, stated above, in case n is a prime power (Theorem 3). We start with a result without any restriction on n. THEOREM 1. - Let A ®B = Z with 0 E A fl B and cardinality n of A finite.

Then, for any integer h with gcd(h, n) = 1, we have hA ® B = Z.

We need some lemmas. Let A= {ao, a1, ...,an-, } with ao = 0. LEMMA 2. - For any integer x

{(x+ao)A,(x+al)A,...,(x+an_1)A}=A, {(x-ao)A,(x-a1)A,...,(x-an_1)A} =A. Proof : suppose (x + ai) A = (x + aj) A. Then x + ai - Q1 = x + aj - /32 for some 01, 32 E B. Hence ai + 32 = aj + ,31. Since such a representation is unique, we have i = j. The proof of the second statement is similar. 0 LEMMA 3. - Let q be a prime power with gcd(n, q) = 1. Then, for any integer x, {(x + gao)A, (x + ga1)A, ... , (x + qan-1)A} = A.

Proof : let q = pk, p prime. Put D = {(a, a, ... , a) E Agla E A}. Define ByLemma2 we have

f : A9 ->Aby(a1ia2i...,aq)'--1 n-1

U f (a1, a2, ... , aq-1, aj) = A. j=0

R. TIJDEMAN

268

Hence f (a) (a E A9) assumes each element of A exactly nq-1 times. Note that f (al, a2, ... , aq) does not change value if we permute al, a2, ... , aq. If (al, a2.... , aq) contains entry aj exactly h times (j = 0, 1, ... , n - 1), then it has precisely q!

lo! ii! .. In-I! permutations in Aq. This multinomial coefficient is divisible by p, unless all but one lo, equal zero, that is (al, a 2 ,--. , aq) E D. It follows that

f assumes on Aq\D each value of A a number of times which is divisible by p. Since p-n9-1, we infer that f assumes each value of A on D. Thus f (D) = A. 0

LEMMA 4. - Let q = -1 or a prime power with gcd(n, q) = 1. Then

qA®B=Z. Proof : we first show that all numbers {qa + b}aEA, bEB are distinct.

Suppose al, a2 E A, 31, /32 E B are such that qal +,31 = qa2 + 02. If q = -1, then the assertion follows from a2 + 01 = al + /j2. Otherwise qai - /j2 = qa2 - /3i. This number has a unique representation a +,3 with a c A,,3 E B. It follows that -,0 + qa1 = a + /32, -0 + qa2 = a +,31. Hence (-,3+gal)A = (-/3+ga2)A. By Lemma 3 we obtain al = a2 and therefore 13i =/32 By Lemma 1, B is periodic mod mn for some positive integer m. Hence B consists of m residue classes mod mn. Let {bo, b1, ... , bm_1 } be a set of representatives. Since by the first paragraph all numbers qal + bj (i =

0, 1,...,n- 1, j = 0,1,...,M- 1) are in distinct residue classes modmn, these mn numbers represent a complete residue system mod mn. Hence qAED B=Z. 0

Proof of Theorem i : since h can be written as the product of prime powers and factors -1, each coprime to n, we reach the conclusion by repeated application of Lemma 4.

0

COROLLARY. - Let h be an integer with gcd(h, n) = 1. Then h(aI - a2) _ 01 -,32 for some al, a2 E A, ,Ol, /j2 E B implies al = a2, 01 = Q2.

Proof : we have hal = (hal + /32)hA = (hat + /31)hA = hat.

0

Subsequently we show how a proof of de Bruijn's conjecture can be derived from Theorem 1.

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

269

ThEOREM 2. - Let A®B = Z with 0 E AnB, the elements of A are coprime

and the cardinality n of A is prime. Then every element of B is divisible by n. Proof : since B is periodic mod mn for some m, we assume without loss of generality that A consists of the nonnegative integers ao = 0, a1, ... , an_1 and that bo, . . . , b,,,,_1 are integers with 0 < bo < . . . < b.,,,,_1 < mn such that B = U oi(b; + mnZ). Put Bo(z) = 1 + zb1 + zb2 + + zb^-1 and (4)

B(z) _ E zb = BO(z) (1 + zmn + z2mn + ...) =

Bo(Z) nen

bEB b>O

Note that every pole of B is an mn-th root of unity. We shall show that it is an n-th root of unity. Set Ah(z) = 1 + zhal + Zha2 + + zhan 1 . Then for every h > 0 with gcd(h, n) = 1 we have, by Theorem 1, 00

Ah(z)B(z) = > zk - Ph(z) =

(5)

1 1

k=0

z

- Ph(z)

where Ph(z) E Z[z]. Hence every pole # 1 of B is a zero of Ah. Let ( be a pole of B with (# 1. Put Sk = E o (k-; for k E Z. Since Ah(() = 0 whenever gcd(h, n) = 1, we have sh = 0 whenever gcd(h, n) = 1. By the theorem on elementary symmetric functions (formulae of Newton-Girard) we obtain, since n is prime,

IIn-1 j-o (z-(a') =zn+cn where cn is some constant. Since ao = 0, we have cn = -1. Therefore, is the complete set of n-th roots of unity. Since (ao = 1 (a1, ... , the a's are coprime, there exist integers to, ti, ... , tn_1 such that 1 = toao + tiai + + tn_lan_1. Hence, putting (i = e2nti/n ( = ((ao)to ((a1)tl ... ((an-1)tn-1 = (t for some t E Z. (an-1

Thus (is an nth root of unity. From (4) we see that every pole of B is simple and that (zmn -1) /(zn -1) divides Bo(z). This implies, by the choice of the b's, Bo(z) = (1 + zn + z2n

+... + z(,,,.-i)n)

(1 + fiZ+... + fn_izn-1)

for some coefficients fi, ... , fn-1. Since Bo has only m nonzero coefficients, = fn-1 = 0. Thus Bo(z) = (1 - zmn)/(1 Zn ) and we see that fi =

-

B(z) = (1 - zn)-1 = 1 + zn + Z2n + multiples of n.

, in other words, B consists of the 0

We use Sands' result on finite cyclic groups [241 to obtain the following generalisation of Theorem 2.

R. TIJDEMAN

270

THEOREM 3. - Let A ® B = Z With 0 E A fl B, gcdaEA a = 1 and A has exactly pt elements with p prime, t E Z>1. Then all elements of B are divisible by p.

Proof : let n = pt. By Lemma 1, B is periodic. Let L be its minimal period. If G denotes the group of residue classes mod L, then Z = A ® B furnishes a decomposition G = A* ® B* where A* and B* consist of the residue classes mod L determined by the elements of A and B. respectively. Note that L is divisible by pt.

For t = 1 the statement is true by Theorem 2. So suppose t > 1. We apply induction on t. It follows from Theorem 2 of Sands [24] that A* or B* is periodic. B* cannot be periodic because of the minimality of L, so it has to be A*. Note that the elements g with g + A* = A* form a subgroup Go of G. We shall show that Go contains the residue class Ll p (mod L). If not, Go contains L/q for some prime q # p. Then a E A* if and only if a+ vL/q E A* for all v E Z. Hence A* splits into subsets of size q. Since A* has pt elements, this is impossible. Thus A* is periodic mod L/p. Let A** and B** consist of the residue classes mod L/p determined by the elements of A and B, respectively. By the previous paragraph A** has pt-i elements and A** ® B** = Z/(L/p)Z. Put r = pt-1. Let A = {ao = 0, al, ... , ar_1} be a set of integers representing A**. Since gcdaEA a = 1

and every element of A is of the formal + wL/p, there exists integers + Vr_lar_1 + vrL/p = 1. vo, Vl, ... , Vr_1i yr such that voao + v1a1 + W e infer that the greatest common divisor d of do, a1, ... , aris coprime to L/p. Let h be the inverse of d mod(L/p). Then 1_

1_

1_

L

hao, hat, ... , har_1 = dao, dal, ... dar_1 (mod -) . P

Therefore, by Theorem 1 , d A = G do ao = 0, dal , ... , ar_ 1 } is a set of pt-I relatively prime numbers with d dA ® B** = hA ® B** = Z. Hence, by the induction hypothesis, all elements of B** are divisible by p. Since B** = B + (L/p)Z and p is a divisor of L/p, all elements of B are divisible

by p

0

4. - A is the set of finite sums of distinct odd powers of 2 We use the following notation. If n = f Ek b32 with b; E {0, 1} is the binary notation of n, then we write n = ±bkbk_1 ... bo. We say that bk is the first bit and bo the last. The bit b; is said to be at place j, for j = 0, 1, ... , k. If j is even, then b; is at an even place, otherwise at an odd place. If b; is the last bit 1 in the binary expansion of n, then ord2(n) = j.

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

271

Let A be the set of finite sums of distinct odd powers of 2 and A the set of the finite sums of distinct even powers of 2. Yu. Ito asked me to characterise all sets B such that A ® B = Z. Obviously A ®A = Z>o, whence A ® (-A) = Z. THEOREM 4. - The above A satisfies A ® B = Z if and only if B is such

that (i) if b, b' E B with b b', then ord2 (b - b') is even, (ii) the set B is maximal with respect to (i),

(iii) -A c A+ B. The first condition says that there is an even number k such that 2k I b - b', but 2k+1 { b - b'. The second means that B cannot be enlarged without affecting (i). The third condition is equivalent to saying that for

every element a E A there is an a E A such that a+ a E -B. Still another interpretation is that any finite collection of bits at even places can be completed to some nonpositive number in B by inserting suitable bits at odd places, zeros at even places and putting a minus sign in front of the number. Recently it was proved by Eigen, Hajian and Kakutani (32) that if F is a finite set of integers, then F can be extended to a complementary set B of A if and only if (i) holds for F.

Proof : (this proof was shown to me by Yu. Ito. A simpler proof of (i) can be found in S. Eigen, A. Hajian and S. Kakutani (32), Lemma 1). (ii) Suppose b V B and ord2(b-b) is even for every b E B. Since b = a+ b for some a E A, b E B, we have b - b = a (z- A, whence ord2 (b - b) is odd. NO Obvious.

(i) Put An = 22nA and Bn = An ®B. Then the 2n sets n-1 Bn +

ej2 2i+1,

eo, E1,

,Cn_1 E {0,1},

j=0

are disjoint and their union is Z. We claim that Bn + k 22n = Bn for k E Z. For n = 0 it is clear. Suppose the claim is valid for n = m. Then Bm+1 + k 22m C B,n + k

22m

= Bm = Bm.+1 U (Bm+1 +

22m+1).

Since B,n+1 and Bm,+1 + 22m+1 are disjoint sets of the same cardinality, we conclude that adding or subtracting 22m+1 to an element from one set

yields an element from the other set. It follows by induction on III that Bm+i + 1. 22m+1 = Bm+i + 22m+1 for odd 1 and B,,,,+1 + 1. 22m+1 = Bm+1 for even 1. This proves the claim.

R. TIJDEMAN

272

Suppose B - B contains an element z with ord2 (z) is odd. Then

b - b' = z = (2k+1)2 21+1 = k . 221+2 +2 21+1 for some b, b' E B and k, 1 E Z. Since B C Bn for all n, we have

b=b'+k 221+2+221+1 E B1+1+221+1 Thus b E B1+, fl (B1+1 + 221+1) but these sets are disjoint.

The proof of the sufficiency part of Theorem 4 requires some lemmas. LEMMA 5. - If(i) holds, then every integer is represented at most once as

a+bwithaEA, bEB. Proof : suppose al + bi = a2 + b2 for some al, a2 E A and bl, b2 E B. Then al - a2 = b2 - bi. However, ord2(al - a2) is odd and ord2(b2 - bl) is even, unless al = a2, bl = b2. 0

LEMMA 6. - Let (i) hold. If b and b' are elements of B with bb' > 0 such that b = ±b2k-lb2k-2 ... bo, b' = ±b2k-lb2k-2 ... bo and b2j = b2i

forj=0,1,...,m-1.Then b3=b.forj=0,1,...,2m-1. Proof : clear.

LEMMA 7. - If (i) and (iii) hold, then every non-positive integer has a representation a + b with a E A and b E B.

Proof : we have 0 E A, whence 0 E A ® B. Consider n E Z 0. Put bo = no. Since bo E A, there exists some nonpositive element of B ending with bo by (iii). Let bi be the bit at place 1 of this element. Define a E {0, 1} such that, in binary notation, bibo - n1no + aiO(mod 4). Next consider n2nino + a10. Let b2 be the bit at place 2 of this sum. Then, by (iii), there is a nonpositive element in B such that the last two bits at even places are b2 and bo. Let b3 be the bit at place 3 of this element. Define a3 E {0,1 } such that b3b2bibo - n3n2nlno+a3OalO(mod 24). By considering n4n3n2n1no+a3OaiO and continuing the procedure, we eventually construct bits b2k+1, b2k,... , bl, bo and a2k+1, a2k-1, ... , a3i al such that (6)

b2k+lb2k ... bibo - -n + a2k+lOa2k-10 ... Oa1O (mod 22k+2)

and b2k+1 is the bit at place 2k + 1 of some nonpositive element b of B with b2k, b2k-2, ... , bo at the last k + 1 even places and zeros at all other even places.

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

273

Note that, by Lemma 6, the bit at place 1 of b is bi (apply it for m = 1), the bit at place 3 of 6 is b3 (apply Lemma 6 for m = 2), and so on. Thus b ends with the 2k + 2 bits b2k+lb2k bibo, whence -b can be written as bibo with t E A. Further, observe that on both sides of (6) the numbers are nonnegative and less than 22k+2, by -n < 22k, so that actually in (6) both sides are equal. Put a = n - b. Then, for some t E A, t 22k+2 +b2k+lb2k

a

b

with a E A and b E B.

0

LEMMA 8. - Let B E Z be a set satisfying (i) and (iii) and such that A ® B represents every nonpositive integer, but not 1. Then every element of B - 1 has its last nonzero bit at an even place.

Proof : suppose b is a negative element of B such that the last nonzero bit of b-1 is at an odd place, at place 2m- 1, say. By (iii) there is a nonpositive bo in B with bo = b2 = = b2,,,_2 = 1 and b2k = 0 element b* _ -bt b1_1

for k > m. Since the last bit 1 of b - 1 is at place 2m - 1 we have 2m-1

Hence, by Lemma 6, b* - b(mod

22m+1). Thus

2m-1

It follows that b* -1 has zeros at all even places. Hence a :_ - (b* -1) E A and 1 = a + b* E A ® B. a contradiction. Suppose b is a positive element of B such that the last nonzero bit of b - 1 is at an odd place, at place 2m - 1 say. By (iii) there exists a negative element b' = -b11 b11_1 ... b o in B such that bo = b2 = =b2 m_2 = 1 and b2k = 0 for k > m. Since we have proved in the previous paragraph that the last nonzero bit of b' - 1 is at an even place, we find that bo = bi = _ U b2,,,,_1 = 1. Thus ord2(b - b') = 2m - 1 which contradicts (i).

Proof of the sufficiency part of Theorem 4 : by Lemmas 5 and 7 it remains to prove that every positive integer has a representation a + b with

a E A and b E B. Let n be the smallest positive integer without such a representation. Then n V B, since 0 E A. Put B = B - (n - 1). Then A ® B represents every nonpositive integer, but not 1. By Lemma 8 every element of b - 1 = B - n has its last nonzero bit at an even place. Put B* = B U {n}. Then B* is larger than B and ord2(b - b') is even for every b, b' E B* with b # b'. Thus B is not maximal with respect to (I), in contradiction to (ii). Hence every positive integer is contained in A ® B. 0

274

R. TIJDEMAN

Theorem 4 induces a similar characterisation for A. COROLLARY. - A ® B = Z if and only if B satisfies (i') if b, b' E B with b # b', then ord2 (b - b') is odd, (ii') the set B is maximal with respect to (i'),

(iii') -ACA + B. Proof: note that A=2A andA=2A U(2A+1).

'='. By 2A®2B=2Z, we have A3(2BU(2B+1))=Z. Hence 2B U (2B + 1) satisfies the conditions (i), (ii), (iii) of Theorem 3. It follows immediately, that B satisfies (i'), (ii'), (iii'). / .'. By (i') all elements of B are even or all are odd. In the latter case

we replace B by B + 1. This involves no loss of generality. Let t be the set of numbers of B divided by 2. Then t satisfies conditions (i), (ii), (iii) of Theorem 1. Thus A + B = Z. Hence 2A + 2B = 2Z and

A®B = (2A U(2A+1))®2B = 2A®2B U (2A+1)®2B = 2ZU(2Z+1) = Z. 0

It is obvious that conditions (i) and (ii) of Theorem 3 are not enough to

guarantee A ® B = Z. The set q satisfies (i) and (ii), but A ®A = Z o. Yu. Ito asked for some set of type A for which the complementing sets are characterised by (i) and (ii) only. He wondered whether A' = { (- 1) a/2 ala E

Al is such a set. P. ten Pas [19) showed that there exist sets B' and B" which both satisfy (i) and (ii) such that A' ® B' = Z and A' (D B" =,/: Z. Acknowledgement. I am indebted to Yu. Ito and J. Urbanowicz for useful discussions and to S. Eigen and Yu. Ito for remarks on an earlier version of this paper.

Manuscrit recu le 3 decembre 1993

DECOMPOSITION OF THE INTEGERS AS A DIRECT SUM

275

References

[11 J.L. BROWN. - Generalized bases for the integers, Amer. Math. Monthly 71 (1964), 973-980. [2] N.G. de BRUIJN. - On bases for the set of integers, Publ. Math. Debrecen 1 (1950), 232-242. [3] N.G. de BRUIN. - On the factorisation offinite abelian groups, Indag. Math. 15 (1953), 258-264. [4] N.G. de BRUIJN. - On the factorisation of cyclic groups, Indag. Math. 17 (1955), 370-377. [5] N.G. de BRUIJN. - On number systems, Nieuw Arch. Wisk. (3) 4 (1956), 15-17. [6] N.G. de BRUIJN. - Some direct decomposition of the set of integers, Math. Comp. 18 (1964), 537-546. [7] S. EIGEN and A. HAJIAN. - A characterisation of exhaustive weakly wandering sequences for nonsingular transformations, Comment. Math. Univ. Sancti Pauli 36 (1987), 227-233. [8] S. EIGEN and A. HAJIAN. - Sequences of integers and ergodic transformations, Advances Math. 73 (1989), 256-262. [9] S.EIGEN, A. HAJIAN and Y. ITO. - Ergodic measure preserving transformations

of finite type, Tokyo J. Math. 11 (1988), 459-470. [10] I. FARY. - Die Aquivalente des Minkowski-Hajosschen Satzes in der Theorie der topologischen Gruppen, Comm. Math. Hely. 23 (1949), 283-287. [111 G. HAJ6s. - Uber einfache and mehrfache Bedeckung des n-dimensionalen Raumes mit einem Wurfelgitter, Math. Z. 47 (1941), 427-467.

[12] G. HAJ6s. - Sur la factorisation des groupes abelien, Casopis Pest. Mat. Fys. 74 (1950), 157-162. [13] G. HAJ6s. - Sur la probleme de factorisation des groupes cycliques, Acta. Math. Acad. Sci. Hungar. 1 (1950), 189-195. [14] R.T. HANSEN. - Complementing pairs of subsets in the plane, Duke Math. J. 36 (1969), 441-449.

[15] C.T. LONG. - Addition theorems for sets of integers, Pacific J. Math. 23 (1967), 107-112.

276

R. TIJDEMAN

[16] C.T. LONG and N. Woo. - On bases for the set of integers, Duke Math. J. 38 (1971), 583-590. [17] H. MINKOwsKI. - Geometrie der Zahlen, Leipzig, 1896. [18] I. NivEN. - A characterization of complementing sets of pairs of integers, Duke Math. J. 38 (1971), 193-203. [19] P. ten PAS. - Complementing sets for Z (in Dutch), Leiden, 1990. [20] K. Posr. - Problem 71, Nieuw Arch. Wisk. (3) 14 (1966), 274-275. [21] L. REDEI. - Zwei Liickensdtze caber Polynome in endlichen Primkorpern mit

Anwendung auf die endlichen Abelschen Gruppen and die Gaussischen Surnmen, Acta Math. 79 (1947), 273-290. [22] L. REDEI. - Kurzer Beweis des gruppentheoretischen Satzes von Hajbs, Comm. Math. Helv. 23 (1949), 272-282. [23] L. REDEI. - Ein Beitrag zum Problem der Faktorisation von endlichen Abelschen Gruppen, Acta Math. Acad. Sci. Hungar. 1 (1950), 197-207. [24] A.D. SANDS. - On thefactorisation offinite abelian groups, Acta Math. Acad. Sci. Hungar. 8 (1957), 65-86. [25] A.D. SANDS. - The factorisation of abelian groups, Quart. J. Math. Oxford (2) 10 (1959), 81-91. [26] A.D. SANDS. - On the factorisation of finite abelian groups II, Acta Math. Acad. Sci. Hungar. 13 (1962), 153-159. [27] C. SWENSON. - Direct sum subset decompositions of 7G, Pacific J. Math. 53 (1974), 629-633. [28] C. SWENSON and C. LONG. - Necessary and sufficient conditions for simple A-bases, Pacific J. Math. 126 (1987), 379-384.

[29] T. SzELE. - Neuer vereinfachter Beweis des gruppentheoretischen Satzes vonHajos, Publ. Math. Debrecen 1 (1949), 56-62. [30] A. M. VAIDYA. - On complementing sets of nonnegative integers, Math. Mag.

39 (1966), 43-44. [31] S. EIGEN and A. HAJIAN. - Sequences of integers and ergodic transformations,

Advances in Mathematics 73 (1989), 256-262. [32] S. EIGEN, A. HAIIAN and S. KAKUTANI. - Complementing sets of integers - A

result from ergodic theory, Japan J. Math. 18 (1992), 205-2 10.

R. TIJDEMAN

Mathematisch Instituut R.U. Postbus 9512 2300 RA Leiden

The Netherlands

Number Theory Paris 1992-93

CM Abelian varieties with almost ordinary reduction Yuri G. ZARHIN

In this note we discuss the Hodge group Hdg(X) of a simple Abelian variety X of CM-type. It is well-known that dimQ Hdg(X) _< dim(X). Assuming that X has somewhere good almost ordinary reduction, we prove

that dimQ Hdg(X) = dim(X) and give an explicit description of Hdg(X).

1. - Almost ordinary Abelian varieties Let A be an Abelian variety defined over a finite field k of characteristic p. We call A almost ordinary if dim(A) > 1 and it has the same Newton polygon as the product of (dim(A) - 1)-dimensional ordinary Abelian variety and a supersingular elliptic curve. This means that its set of slopes is {0, 1/2, 1}

and slope 1/2 has length 2. For example, an Abelian surface is almost ordinary if and only if it is neither ordinary nor supersingular. One may easily check that if g = dim(A) > 1 then A is almost ordinary if and only if its p-rank equals g - 1, i.e., the group of "physical" points of order p is isomorphic to (Z/pZ)9-1. Almost ordinary varieties were studied by Oort 113] in connection with the lifting problem of CM Abelian varieties to characteristic zero. In particular, he proved that each almost ordinary Abelian variety can be lifted to characteristic zero as CM Abelian variety (recall 126] that each Abelian variety over a finite field can be lifted to characteristic zero as CM Abelian variety up to an isogeny). Of course, if we start with an (absolutely) simple Abelian variety over a finite field, then its lifting will be also (absolutely) simple. It follows from ([5], Th.7; 112], Th.4. 1) that polarized almost ordinary Abelian varieties of given dimension constitute subvarieties of codimension 1 in the moduli spaces of Abelian varieties. See also [ 141.

A special case of a theorem of Lenstra and Oort [61 asserts that, for each positive integer g > 1 and for each prime number p there exists an absolutely simple almost ordinary g-dimensional Abelian variety defined Supported by C.N.R.S.

Y.G. ZARHIN

278

over a certain finite field field of characteristic p. It was proven by Oort [ 13] that the endomorphism algebra of simple almost ordinary Abelian variety

(over finite field) is a number field of degree 2 dim(A). Notice (see Sect. 6.6 below), that each simple almost ordinary Abelian variety is absolutely simple.

One may easily check that each non-simple almost ordinary Abelian variety is isogenous either to the product of ordinary Abelian variety and simple almost ordinary Abelian variety or to the product of ordinary Abelian variety and a supersingular elliptic curve. Let A be an Abelian variety over a finite field k of characteristic p. We write FA for the multiplicative subgroup of C* generated by the eigenvalues of the Frobenius endomorphism of A [29, 30, 31]. It is known (1301, Sect. 2.1; [33], Sect. 4.1), that the rank rk(I'A) of FA is a positive number which does not exceed dim(A) + 1. The non-negative integer rk(I'A) - 1 is called the rank of A and denoted by rk(A) [31]. One may easily check ([31], Sect.

2.0), that 0 < rk(A) < dim(A) and rk(A) = 0 if and only if A is supersingular. Now, assume that A is simple and almost ordinary. In that case it is known ([71, Th. 5.7) that either rk(A) = dim(A) or rk(A) = dim(A) - 1. In addition, if dim(A) is even then rk(A) = dim(A), i.e., rk(FA) = dim(A) + 1. If rk(TA) = dim(A) then the endomorphism algebra of A must contain an imaginary quadratic field; see [7], Th. 3.6. H.W. Lenstra (see [31], pp. 286288) has constructed an example of 3-dimensional simple almost ordinary Abelian variety A with rk(FA) = dim(A). His construction also gives an example of a 3-dimensional absolutely simple CM Abelian variety having an almost ordinary reduction. 2. - Q-adic Lie Algebras

Let X be an Abelian variety defined over a number field K. We assume that K is sufficiently large, i.e., all endomorphisms of X are defined over K. We will also fix an embedding of K into the field C of complex numbers and consider K as a certain subfield of C. We write K(s) for the algebraic closure of K in C. We write G(K) for the Galois group of K. We write g for the dimension of X. Let E be the endomorphism algebra of X ; it is a finite-dimensional semisimple Q-algebra. For a positive integer in, we denote by X,,,, the group

{x E X(K(s)) I mx = 0}. It is well known that X. is a free Z/mZ-module of rank 2g. Let us fix a prime number £. Then one may define the ZI Tate module T1 (X) as the projective

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

279

limit of the groups X,,,, where m runs through the set of all powers .£i and the transition maps are multiplication by 2. It is well known that T1(X) is a free Zi-module of rank 2g . Clearly, all X,,, are finite Galois submodules of X(K(s)), and the Galois actions for m = £ glue together to give rise to a continuous homomorphism

pi = pi,x : G(K)

Autz, Ti(X).

The image

Ge = Gi,x = Im(pi,x) C Autz Ti(X) is a compact £-adic Lie subgroup in Autze Ti (X ). Let us put VV (X) := TT(X) ®Ze Q

Clearly, Vi(X) is a Qi vector space of dimension 2g and one may

with a certain Zi-lattice of rank 2dim(X) in naturally identify Vi (X). In particular, AutZe Ti (X) becomes an open compact subgroup in AutQ, Vi(X). This allows us to regard pi as an .£-adic representation ([ 191):

pi = pi,x : G(K) - Autz, Te(X) C AutQ, Ve(X). We have

Gi C Autz, Ti(X) C AutQ, Vi(X). Clearly, Gi is a compact (and therefore) closed subgroup of AutQ, VV(X) and therefore is a closed 2--adic Lie subgroup. Let gi = gi,x C EndQ, Vi(X) be the Lie algebra of Gi [ 19]. A theorem of Faltings [4] asserts that of is a reductive Qt-Lie algebra, its natural representation in Vi (X) is completely reducible and the centralizer of this representation is E ®Q Qi. A theorem

of Bogomolov [1] asserts that gi is an algebraic Lie algebra containing homotheties Qtid. It follows that

gi,x = Qiid (Dg°,x. Here

9°,x := sl(Vi(X)) n gi,x is an algebraic reductive Qi-Lie algebra. Its natural representation in VV(X) is completely reducible and the centralizer of this representation is E ®Q Qi.

It is known that the rank of g° is a non-negative number which does not

Y.G. ZARHIN

280

exceed g. If the equality holds then the Lie algebra is "as large as possible" and one may give an "explicit" description of go in terms of E; see (32], Th. 3.2; [331.

Let v be a non-Archimedean place of K such that X has a good reduction X (v) at v. Then GQ contains a Frobenius element Frv E Ge C AutQe V1(X)

canonically defined up to conjugation in G1 1191. If we view Frv as a linear operator in Ve (X), then its eigenvalues are just eigenvalues of the Frobenius endomorphism of X(v). In particular, if r(Frv) is the multiplicative group generated by the eigenvalues of Frv, then

I (Frv) = rX(v)

Notice, that the rank of ge is greater or equal than rk(r(Frv)) (see 1331, Corollary 2.4.1). This implies that the rank of go is greater or equal than

rk(r(Frv)) - 1 = rk(FX(v)) - 1 = rk(X(v)). We have

0 < rk(X(v)) < rkge < g = dim(X(v)). In particular, if rk(X(v)) = dim(X(v)) then rk(ge) = 9.

For example, if X(v) is a simple almost ordinary Abelian variety and g is even then (see the end of Sect. 1) rk(g°) = g

(recall that g = dim(X) = dim(X(v))).

The aim of the present paper is to prove that if X (v) is an almost ordinary Abelian variety then rk(ge°)

= 9

under an additional assumption that X is an absolutely simple Abelian variety of CM-type. (Compare with the corresponding results for Abelian varieties having a reduction of K3 type ([32], Th. 3.0 and Sect. 7.1).

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

281

3. - Abelian varieties of CM-type Let X be an absolutely simple Abelian variety of CM-type. Then its endomorphism algebra E is a CM-field of degree 2g. We write a -4 a' for the complex conjugation on E. We write TE for the Well restriction RE/QGm of the multiplicative group Gm. Clearly, TE is a 2g-dimensional algebraic torus. Let UE be the g-dimensional algebraic subtorus of TE defined by the condition

UE(Q) = {a E TE(Q) = E* I aa' = 1}.

3.1. - The Hodge group We write V (X) for the first rational homology group H1 (X (C), Q) of X (C)

:

it is a 2g-dimensional Q--vector space. It also carries a natural

structure of 1-dimensional E-vector space. The choice of a polarization on X gives rise to a certain non-degenerate skew-symmetric bilinear form cp : V (X) X V (X) -+ Q

such that cp(ax, y) = cp(x, a'y)

for all x, y E V(X) and a E E. Let us choose a non-zero e E E with

E =-e. Then there exists a non-degenerate E-Hermitian sesquilinear form

0, : V(X) x V(X) -* E such that co(x, y) =

n'E/Q(e-i0E(x y))

where TrE/Q : E - Q is the trace map (see 121], [2], Sect. 4; [ 171, p. 531). If we change e by el then the form is multiplied by a non-zero totally real element el /e of E. The unitary group U(V (X ), 0) viewed as a Q-algebraic group does not depend on the choice of a and can be naturally identified with UE. In particular,

U(V(X), 0,) (Q) = {a E E* I aa' = 1}. Here we identify E with its image in EndQ V (X ). Its Lie algebra

uE := Lie(U(V(X), VE)) = Lie(UE) _ {a E E I a + a' = 0}.

Y.G. ZARHIN

282

Let Hdg(X) be the corresponding Hodge or as it sometimes called the special Mumford-Tate group of X (see (10, 15, 17, 18, 11]). It is a connected commutative reductive algebraic Q-group. It is well-known ([ 17], p. 531)

that

Hdg(X) C U. Let f1Dg = f1DgX be its Lie algebra. Clearly,

13DgCuE={aEEIa+a'=0}. It is known that it is a commutative Q-Lie algebra, i.e., its rank and dimension coincide, and rk(f1Dg) = dirngp 11Dg < dimQUE = 9;

the equality holds true if and only if Hdg(X) = UE.

For example, it is known that this equality holds true when g is a prime (a theorem of Tankeev-Ribet [ 17, 23]). For arbitrary dimensions there is a Ribet's inequality (118], p.87) loge (2g) < dimQ Hdg(X)

(see also [81). For further properties and examples of the Hodge groups of CM-Abelian varieties see 118, 3, 8, 281. There is a well-known natural isomorphism of Qe-vector spaces

Vt(X) =V(X) ®QQ . It is known that for Abelian varieties of CM-type the Qt-Lie algebra g° is a commutative Qt-Lie algebra, i.e., its rank and dimension coincide. A theorem of Pohlman [16] asserts that the isomorphism of the Qt-vector spaces mentioned above gives us an identification 17Dg®QQt =g°

of commutative Qe-Lie algebras.

Clearly, if rk(g°) = g, then it follows easily that dimQ 13Dg = g and, therefore,

Hdg(X) = UE.

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

283

3.2. - Remark. One may define the Hodge group Hdg(X) for any (complex) Abelian variety X not necessarily of CM-type [10]. It is a connected reductive algebraic (12-group which is commutative if and only if X is of CM-type. The Mumford-Tate conjecture [201 asserts that the 2--adic Lie algebra ge,x can be obtained from the Lie algebra of Hdg(X) by extensions of scalars from Q to Qt. The theorem of Pohlman cited above proves the MumfordTate conjecture for Abelian varieties of CM-type.

4. - Main result. The main result of the present paper is the following statement. MAIN THEOREM. - Let X be an absolutely simple g-dimensional Abelian variety of CM-type defined over a number field K and all endomorphisms of X are also defined over K. Let E be the endomorphism algebra of X. Assume that there exists a non-Archimedean place v of K such that X has a good reduction X (v) at v and X (v) is an almost ordinary Abelian variety X (v). Then Hdg(X) = UE.

In other words,

dime Hdg(X) = dim(X) = g.

4.2. - Remark. Assume that g is even and X (v) is a simple almost ordinary Abelian variety. Then dim(X(v)) = dim(X) = g is also even and, as we have already seen, g = rk(X(v)) < rk(g°) = dimQ Clag < dimQ UE = g and, therefore, dimQ ljag = dimQ UE = g.

This proves the Theorem under our additional assumptions.

4.3. - Remark. Assume that g is odd and X (v) is a simple almost ordinary Abelian variety. Then dim(X(v)) = dim(X) = g and, as we have already seen, g - 1 < rk(X(v)) < rk(ge) = dimQ fjag < dimQUE = 9 and, therefore, g - 1 < dimQ [jag = dimQ Hdg(X) < dimQ UE = 9-

4.4. - Combining the last two Remarks, we obtain that the Theorem follows from the next two lemmas.

Y.G. ZARHIN

284

4.5. LEMMA. - Let X be an absolutely simple g-dimensional Abelian variety of CM-type defined over a number field K and all endomorphisms of X are also defined over K. Let E be the endomorphism algebra of X. Assume that there exists a non-Archimedean place v of K such that X has a good reduction at v and this reduction is an almost ordinary Abelian variety X (v). Then X (v) is a simple Abelian variety.

4.6. LEMMA. - Let Y be an absolutely simple g-dimensional Abelian variety of CM-type. Let E be the endomorphism algebra of Y. Assume that

dimQ Hdg(Y) = g - 1 = dimQ UE - 1. Then g is even.

4.7. - Remark. It is well-known ([ 17], Th. 0, p. 524) that the equality

Hdg(X) = UE implies that all Hodge classes on all powers of X are linear combinations of the products of divisors classes. In particular, all these Hodge classes are algebraic, i.e., the Hodge conjecture holds true for all powers of X. Since the Mumford-Tate conjecture holds true for Abelian varieties of CM-type [ 161, we obtain that all Tate classes on all powers of X are linear combinations

of the products of divisors classes. Indeed, by a theorem of Faltings [4], each 2-dimensional Tate class on an Abelian variety over a number field is a linear combination of divisor classes. In particular, all these Tate classes are algebraic, i.e., the Tate conjecture [24, 25] holds true for all powers of X.

5. - Proof of the Lemma 4.6. We start this section with the explicit description of Q-algebraic subtori in UE of codimension 1. This description had tacitly appeared in [6] and, later, was explicitly formulated and proved in [9]. In our exposition we follow 191.

Suppose E contains an imaginary quadratic subfield k. Let us define the algebraic subtorus SUE/k of UE by the condition SUE/k(Q) = {a E UE(Q) I NormE/k(a) = 1}. One may easily check that SUE/k has codimension 1 in UE. Clearly, its Lie algebra SUE/k := Lie(SUE/k) = {a E UE I TrE/k(a) = 0}.

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

285

Here TrE/k : E --> k is the trace map. Notice, that this trace map commutes with the complex conjugation (if someone is unhappy with the definition of SUE/k by its Q-rational points then there is another description of SUE/k.

Namely, it is a Q-algebraic (connected) subtorus of UE such that its Lie algebra coincides with SUE/k). The following statement was proven in [9[, Sect. 7.3.

5.1. KEY LEMMA. - Let H be an algebraic subtorus of codimension I in UE. Then there exists an imaginary quadratic subfield k of E such that :

H = SUE/k.

5.2. - Since H := Hdg(X) is an algebraic subtorus of codimension 1 in UE, we obtain, applying the Key Lemma, that there exists an imaginary quadratic subfield k of E such that Hdg(Y) = SUE/k.

This means that CJ-0g = SUE/k.

Now, let us choose a non-zero e c k c E such that

Now, if we consider V (X) as a g-dimensional k -vector space, then the EHermitian form 0E gives rise to the k-Hermitian form '

E/k4'e : V(X) X V(X) --4k,

0(x,y) =TrE/k(0,(x,y))It follows easily that P(x, Y) = TrE/k(E-14(x, y))

for all x, y E V (X) and 0 is non-degenerate. Clearly, UE C u(V(X), ) :_

{aEEndkV(X) I 0(ax,y)+b(x,a'y)=0bx,yEV(X)} and

SUE/k C SUk(V(X),0) := {a E U(V(X),

)

1 TrV(X)/k(a) = 0}.

Y.G. ZARHIN

286

Here

TV(X)/k : Endk V(X) - k is the usual trace map on the algebra of k-linear operators of the k-vector space V(X) (notice, that the maps TrE/k and Trv(X)/k coincide on E). So, we obtained that 49 = SUE/k C SUk(V(X),V)) It turns out that the inclusion hag C SUk(V(X),')

can be rewritten in terms of the action of k on the tangent space of X (see Well [27]). Namely, if Lie(X(C)) is the tangent space of the complexAbelian variety X then the inclusion means that Lie(X(C)) is a free k ®Q C-module ([91, Lemma 2.8; see also [ 17], p. 525). Since Lie(X(C)) is a g-dimensional complex vector space and k ®Q C = C ® C, the dimension g must be even. This ends the proof.

6. - Proof of the Lemma 4.5. By functoriality of Neron models, there is a natural embedding

E = End(X) ® Q -> End(X(v)) ® Q

and 1 E E acts on X(v) as the identity map. Notice, that E is a number field and

[E : Q] = 2 dim(X) = 2 dim(X(v)). The following proposition will be proved at the end of this Section. 6.1. PROPOSmoN. - Let Y be an Abelian variety over an arbitrary field

K and assume that the semisimple Q-algebra End°(Y) = End(Y) ® Q contains a numberfield F of degree 2 dim(Y) such that 1 E E is the identity automorphism of Y. Then there exists a K-simple Abelian variety Z over k such that Z is )C-isogenous to the power Zr of Z with r = dim(Y)/ dim(Z).

6.2. - Applying the Proposition 6.1, we obtain that there exists a k(v)simple Abelian variety Z over k(v) such that X (v) is isogenous to Zr for a certain positive integer r. In order to prove the lemma 4.5, we have only to check that

r=1.

First, notice, that each slope of the Newton polygon of X (v) has length divisible by r. Since the slope 1/2 has length 2, either r = 1 or r = 2. If r = 2 then X(v) is isogenous to Z2 and, therefore, 1/2 is the slope of the Newton polygon of Z with length 1. But it cannot be true, since the length

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

287

of the slope 1/2 must always be even [30, 7], due to the fact that all the break-points of the Newton polygon are integral. This rules out the case r = 2. So, r = 1 and we are done.

6.3. - Proof of the Proposition 6.1. Assume that Y is not )C-isogenous to a power of a )C-simple Abelian variety. Then, using the Poincare reducibility theorem, one may easily check that there exist Abelian 1C-subvarieties Y1, Y2 C Y of positive dimensions, enjoying the following properties : a) the natural homomorphism Y1 X Y2 -> Y, (yl, y2) --' yl + y2

is an isogeny; b) Hom(Yi, Y2) = {0}, Hom(Y2, Yi) = {0}. This implies that

0 < dim(Yi) < dim(Y); 0 < dim(Y2) < dim(Y); End°(Y) = End°(Yi) ® End°(Y2). Let pri : End°(Y) --4End°(Y) be the corresponding projection homo-

morphisms. Clearly, if idy E End°(Y) is the identity automorphism of Y then pri (idy) E End° (Y) is the identity automorphism idy, of Y . This implies that Fi := pri(F) c End°(Yi) is a number field isomorphic to F; in particular, its degree equals 2 dim(Y) > 2 dim(Y) (i = 1, 2.) Now, in order to get a contradiction let us recall the following well-known fact (see [22], Sect. 5.1, Proposition 2). 6.4. SUBLEMMA. - If the endomorphism algebra of an m-dimensional Abelian variety contains a number field which, in turn, contains the identity automorphism, then the degree of this field divides 2m. In particular, it does not exceed 2m.

6.5. - Now, in order to finish the proof by coming to the contradiction, one has only to apply the Sublemma to the Abelian variety Yi of dimension m = dim(Y) and the number field Fi of degree 2 dim(Y) > dim(Y).

6.6. - Remark. Similar arguments prove that if k is a finite field and A is a gdimensional k-simple almost ordinary Abelian variety over k then A is absolutely simple. Indeed, for each extension k' of k the Abelian variety A' := A x k' is an almost ordinary Abelian variety and End° A' contains a number field End° A of degree 2g = 2 dim(A'), which, in turn, contains the

Y.G. ZARHIN

288

identity automorphism. By the Sublemma, A' must be k'-isogenous to the power Z' of k'-simple Abelian variety Z. Now, the same arguments with the Newton polygons as in Sect. 6.2, prove that r = 1, i.e., A' = Z is k/-simple.

7. - Acknowledgements I am deeply grateful to H.W. Lenstra and B. Moonen for helpful discussions. This paper is a result of my stay in Paris in June-July of 1993 and I am deeply grateful to the Groupe d'Etudes sur les Problemes Diophantiens (Universite de Paris VI) for the hospitality. The support of the Universite Paris Nord is also gratefully acknowledged. I am grateful to Frans Oort who had read the manuscript and made many valuable remarks. My special thanks go to Daniel Bertrand and Larry Breen, whose efforts made my trip to France possible.

Manuscrit recu le 21 janvier 1994

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

289

REFERENCES

111 F.A. BoGoMOLOV. - Sur l'algebricite des representations Q- adiques, C.R. Acad. Sci. Paris Ser. I Math. 290, 1980, 701-704. [2] P. DELIGNE. - (notes by J.S. Milne). Hodge cycles on abelian varieties,

Springer Lecture Notes in Math. 900, 1982, 9-100. [3] B. DODSON. - On the Mumford-Tate group of an abeliart variety with complex multiplication, J. Algebra 111,1987,49-73.

[4] G. FALTINGS. - Endlichkeitssatze fir abelsche Varietaten fiber Zahlkorpern, Invent. Math. 73,1983, 349-366. [5] N. KOBLITZ. - p-adic variation of the zeta-function over families of varieties defined over finite fields, Compositio Math. 31, 1975, 119218. [6] H.W. LENSTRA, Jr. and F. OoRT. - Simple Abelian varieties having a prescribedformal isogeny type, J. Pure Appl. Algebra 4, 1974, 47-53. [7] H.W. LENSTRA Jr. and Yu.G. ZARHIN. - The Tate conjecture for almost

ordinary abelian varieties over finite fields, Advances in Number Theory, Proc. of the Third Conf. of the CNTA, 1991 (F. Gouvea and N. Yui, eds.), 179-194. Clarendon Press, Oxford, 1993. [8] L. MAI. - Lower bounds for the rank of a CM-type, J. Number Theory 32, 1989, 192-202. [9] B.J.J. MOONEN and Yu.G. ZAR-IIN. - Hodge classes and Tate classes on simple abelianfourfolds, Duke Math. J., to appear. [10] D. MuMFORD. - A note of Shimura's paper Discontinous groups and abelian varieties, Math. Ann. 181, 1969, 345-351. [11] V.K. MuRTY. - Computing the Hodge group of an abelian variety, Seminaire de Theorie des Nombres, Paris 1988-89, (C. Goldstein ed.), Progress in Math., Birkhauser 91, 1990, 141-158. [12] P. NORMAN and F. OoRT. - Moduli of abelian varieties, Ann. of Math.

112, 1980, 413-439. [13] F. OORT. - CM-Dings ofAbelian varieties, J. Algebraic Geometry 1, 1992, 131-146.

290

Y.G. ZARHIN

[ 141 F. OORT. - Moduli of Abelian varieties and Newton polygons, C.R. Acad. Sci. Paris Ser. I Math. 312, 1991, 385-389. [ 151 1. 1. PIATETSKII-SHAPIRO. - Interrelations between the Tate and Hodge

conjectures for abelian varieties, Math. USSR Sbornik 14, 1971, 615625. [16] H. POHLMAN. - Algebraic cycles on abelian varieties of complex multi-

plication type, Ann. of Math. 88, 1968, 161-180. [17] K. RIBET. - Hodge classes on certain types of abelian varieties, Amer.

J. of Math. 105, 1983, 523-538. [18] K. RIBET. - Division fields of abelian varieties with complex multiplication, Memoires de la S.M.F., nouvelle serie 2, 1980, 75-94.

[19] J.-P. SERRE. - Abelian 1-adic representations and elliptic curves, Addison Wesley, second edition, 1989. [20] J.-P. SERRE. - Representations l-adiques, Kyoto International Symposium on Algebraic Number Theory, Japan Society for the Promotion of Science, Tokyo (1977), 177-193 (= CE 112). [21] G. SHIMURA. - On the field of definition for a field of automorphic functions, Ann. of Math. (2) 80, 1964, 160-189. [22] G. SHIMURA and Y. TANIYAMA. - Complex multiplication of abelian

varieties and its applications to number theory, Publ. Math. Soc. Japan 6, 1961. [23] S.G. TANKEEV. - Cycles on simple Abelian varieties of prime dimension, Izv. Akad. Nauk SSSR ser. matem. ; English translation in Math. USSR Izvestija 46, 1982, 155-170. [24) J. TATE. - Algebraic cycles and poles of zeta functions, Arithmetical Algebraic Geometry, Harper and Row, New York, 1965, 93-110.

[25] J. TATS. - Endomorphisms of Abelian varieties over finite fields, Invent. Math. 2, 1966, 134-144. [26] J. TATE. - Classes d'isogenie des varietes abeliennes sur un corps finL (d'apres T. Honda), Seminaire Bourbaki 352 (1968), Springer Lecture Notes in Mathematics 179 (1971), 95-110. [27] A. WEIL. - Abelian varieties and the Hodge ring, Collected papers, Springer-Verlag III, 1980, 421-429. [28) S.P. WHITE. - Sporadic cycles on CM abelian varieties, Compositio Math. 88, 1993, 123-142.

CM ABELIAN VARIETIES WITH ALMOST ORDINARY REDUCTION

291

[29) Yu.G. ZARHIN. - Abelian varieties of K3 type and £-adic representations. Algebraic Geometry and Analytic Geometry Tokyo 1990, ICM90 Satellite Conference Proceedings, Springer-Verlag, Tokyo (1991), 231-255. [30) Yu.G. ZARHIN. - Abelian varieties of K3 type, Seminaire de Theorie

des Nombres, Paris 1990-91, (S. David ed.), Progress in Math., Birkhauser ]L08, 1993, 263-279. [311 Yu.G. ZARHIN. - The Tate conjecture for non-simple Abelian varieties overfinitesfields. Algebra and Number Theory, Proceedings of a Con-

ference held at the Institute for Experimental Mathematics, University of Essen, Germany, December 2-4, 1992 (G. Frey and J. Ritter, eds.) de Gruyter, Berlin, (1994), 267-296. [321 Yu.G. ZARHIN. - Abelian varieties having a reduction of K3 type, Duke

Math. J. 65, 1992, 511-527. 1331 Yu.G. ZARHIN. - £-adic representations and Lie algebras. Elliptic Curves and Related Topics, (M. Ram Murty and H. Hisilevsky, eds.) CRM Proceedings & Lecture Notes 4 (1994), AMS, 183-195.

Yuri G. ZARHIN

The Pennsylvania State University, Department of Mathematics 325 McAllister Building, University Park, PA 16802, USA

e-mail address : [email protected] and Institute for Mathematical Problems in Biology Russian Academy of Sciences Pushchino, Moscow Region, 142292 Russia

Analytic Number Theory (London Mathematical Society Lecture Note Series 247)

Read more

Number Theory and Polynomials (London Mathematical Society Lecture Note Series)

Read more

Combinatorics (London Mathematical Society Lecture Note Series)

Read more

Solitons (London Mathematical Society Lecture Note Series)

Read more

Algebraic Set Theory (London Mathematical Society Lecture Note Series)

Read more

Lectures on Invariant Theory (London Mathematical Society Lecture Note Series)

Read more

Spectral Theory and Geometry (London Mathematical Society Lecture Note Series)

Read more

Handbook of Tilting Theory (London Mathematical Society Lecture Note Series)

Read more

Homological Group Theory (London Mathematical Society Lecture Note Series)

Read more

Spectral Theory and Geometry (London Mathematical Society Lecture Note Series)

Read more

2 - Homotopy Theory (London Mathematical Society Lecture Note Series)

Read more

Sheaf Theory (London Mathematical Society Lecture Note Series)

Read more

Recent Perspectives in Random Matrix Theory and Number Theory (London Mathematical Society Lecture Note Series)

Read more

Oligomorphic Permutation Groups (London Mathematical Society Lecture Note Series, 152)

Read more

Syzygies (London Mathematical Society Lecture Note Series 106)

Read more

Introduction to Subfactors (London Mathematical Society Lecture Note Series)

Read more

Higher Operads, Higher Categories (London Mathematical Society Lecture Note Series)

Read more

The Core Model (London Mathematical Society Lecture Note Series)

Read more

Advances in Linear Logic (London Mathematical Society Lecture Note Series)

Read more

Interaction Models (London Mathematical Society Lecture Note Series)

Read more

L-Functions and Arithmetic (London Mathematical Society Lecture Note Series)

Read more

Shintani Zeta Functions (London Mathematical Society Lecture Note Series)

Read more

Groups, Combinatorics and Geometry (London Mathematical Society Lecture Note Series)

Read more

Surveys in Combinatorics (London Mathematical Society Lecture Note Series)

Read more

St Andrews: Volume 1 (London Mathematical Society Lecture Note Series)

Read more

Surveys in Combinatorics, 1995 (London Mathematical Society Lecture Note Series)

Read more

Trends in Stochastic Analysis (London Mathematical Society Lecture Note Series)

Read more

Low Dimensional Topology (London Mathematical Society Lecture Note Series)

Read more

Solitons (London Mathematical Society Lecture Note Series 85)

Read more

Linear Algebraic Monoids (London Mathematical Society Lecture Note Series 133)

Read more

Recommend Documents

Analytic Number Theory (London Mathematical Society Lecture Note Series 247)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor J.W.S. Cassels, Department of Pure Mathemati...

Number Theory and Polynomials (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor N.J. Hitchin, Mathematical Institute, Univer...

Combinatorics (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor I.M.James, Mathematical Institute, 24-29 St...

Solitons (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor I.M. James, Mathematical Institute, 24-29 St ...

Algebraic Set Theory (London Mathematical Society Lecture Note Series)

...

Lectures on Invariant Theory (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor N.J. Hitchin, Mathematical Institute, Univers...

Spectral Theory and Geometry (London Mathematical Society Lecture Note Series)

Handbook of Tilting Theory (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor N.J. Hitchin, Mathematical Institute, Univer...

Homological Group Theory (London Mathematical Society Lecture Note Series)

...

Spectral Theory and Geometry (London Mathematical Society Lecture Note Series)

LONDON MATHEMATICAL SOCIETY LECTURE NOTE SERIES Managing Editor: Professor N.J. Hitchin, Mathematical Institute, Univers...