Lecture Notes in Mathematics Edited by A. Oold and B. Eckmann
326 Alain Robert
Elliptic Curves Notes from Postgraduate Lectures Given in Lausanne 1971/72
Springer-Verlag Berlin Heidelberg New York Tokyo
Author Alain Robert Universite de Neuchitel, Institut de Mathematiques Chantemerle 20, 2000 Neuchitel, Switzerland
1st Edition 1973 2nd Corrected Printing 1986
Mathematics Subject Classification (1980): 12835, 12B37, 14G 10, 14H 15, 14H45, 32G 15 ISBN 3-540-06309-9 Springer-Verlag Berlin Heidelberg New York Tokyo ISBN 0-387-06309-9 Springer-Verlag New York Heidelberg Berlin Tokyo This work is subject to copyright. All rights are reserved, whether the whole or Part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to -Verwertungsgesellschaft Wort", Munich. C by Springer-Verlag Serren Heidelberg 1973 Printed in Germany Printing and binding: Seltz Offsetdruck, Hemsbach/8ergstr.
2146/3140-543210
NOTATIONS AND
CON V E NT ION S
We have used the usual letters for the basic sets of numbers N (natural integers 0,1,2, ... ), ~
~
(ring of rational integers),
(field of rational numbers), R (field of real numbers), [ (field
of complex numbers), F q (finite field with q elements). As a rule, we denote by AX the multiplicative group of units (invertible elements) in a ring A. In formulas, the cypher 1 always represents the number one (except in log x ... so that in one occurence I have used log -1 to avoid ambiguities). Also ~(x) =e
2rrix
(normalized exponential).
In a theorem, I list properties under Latin letters a), b), ... keeping
i), ii), ... for equivalent properties, but the meaning is
always clear by the context. The following system has been adopted for cross-references. All theorems, propositions, corollaries, lemmas, remarks, definitions, formulas, errata, ... are numbered in one sequence. Such a cypher as (nL3.4) refers to the item (3.4) of chapter III, i.e. the fourth numbered in section 3. (This happens to be a lemma 3.J" From inside chapter III we would refer to (3.4) (in section 3, sometimes simply to lemma 3
this last system of numeration has not been used systema-
tically, but only when it can be more suggestive locally).
TABLE
OF
CONTENTS
I : COMPLEX ELLIPTIC CURVES
CHAPTER
1. Weierstrass theory
2
2. Theta functions (Jacobi)
19
3. Variation of the elliptic curve and modular fonns
35
4. Arithmetical properties of sane modular fonns
66
II : ELLIPTIC CURVES IN CHARACTERISTIC ZERO
CHAPTER
75
1. Algebraic varieties and curves
77
2. Plane cubic curves
98
3. Differential forms and elliptic integrals
124
4. Analytic p-adic functions
144
5. Tate's p-adic elliptic curves
160
CHAPTER
III : DIVISION POINTS
173
1. Division points in characteristic zero
174
2. t-adic representations
184
3. Integrality of singular invariants
197
4. Division points in characteristic p
215
CHAPTER
N: COMPLEMENTS
225
1. Hasse's invariant
226
2. Zeta function of an elliptic curve over a finite field
240
3. Reduction mod p of rational elliptic curves
246
REFERENCES
254
INDEX
262
I NT ROD UC T ION Elliptic curves are special cases of two theories, namely the theory of Riemann surfaces (or algebraic curves) and the theory of abelian varieties, so that any book concerned with these more general topics will cover elliptic curves as example. However, in a series of lectures, it seemed preferable to me to have a more limited scope and introduce students to both theories by giving them relevant theorems in their simplest case. I think that the recent recrudescence of popularity of elliptic curves amply justifies this point of view. I have not chosen the most concise style possible and sometimes have committed the tfcrime of lese-Bourbaki" by giving several proofs of one theorem, illustrating different methods or point of views. I shall not give here any idea of the topics covered by these notes, because each chapter has its own introduction for that purpose (prerequisites are also listed there). Let me just mention that I have omitted complex multiplication theory for lack of time (only integrality of singular invariants is proved in chapter III). In the short commented bibliography (given for each chapter at the end of the notes), I quote most of my sources and indicate some books and articles which should provide ample material for anyone looking for further reading. The origin of my interest in elliptic curves has to be traced to a series of lectures given by
~.
Demazure (Paris Orsay, oct.-dec.67)
on elliptic curves over [. Although the presentation I have adopted differs somewhat from his, I have been much influenced by the notes from these lectures (especially in the section on theta functions). I would also like to seize the opportunity of thanking here Y. Ihara,
VIII J.-P. Serre, G. Shimura for very helpful discussions, correspondence ••. Only at the end of the lectures did I learn through S. Lang that he had also written a book on elliptic curves. It seemed however that the material covered was sufficiently different to allow the publication of my notes, and I hope that they will still have some use. Finally it is a pleasure to thank the audience of the lectures whose interest stimulated me, my wife who gave me some hints on language and L.-O. Pochon who proof-read most of the notes, pointed out some mistakes and established an index. However, I take full responsability for remaining mistakes and would be grateful to anyone bothering to let me know about them !
September 1972
A. Robert Institut de Mathematiques Universite de Neuchatel CH-2000 NEUCHATEL
(Switzerland)
CHAPTER ONE COMPLEX ELLIPTIC CURVES
This chapter has as first aim the presentation of the classical theory of elliptic functions and curves, as first studied in the nineteenth century by Abel, Jacobi, Legendre, Weierstrass. In particular, the following mathematical objects will be shown to be equivalent i) Compact complex Lie group of dimension one. ii) Complex torus (tIL, with a lattice Le([ • iii) Riemann surface of genus one (with chosen base point). iv) Non-singular plane cubic of equation 2
Y =x
3
+
Ax
+
B
Here, we shall basically only assume that the reader is familiar with the theory of analytic functions of one complex variable. However, we shall mention briefly some "superior" interpretations which can be fully understood only with some knowledge of the basic definitions and properties of vector bundles and sheaves on Riemann surfaces.
- 1.2 -
1. Weiersnass' theory
A lattice in a (finite dimensional) real vector space V is by definition a subgroup L generated by a basis of V. Thus a lattice is isomorphic to a group ~d where d
dim(V) , and V/L is a compact group.
=
When we speak of lattice in a complex vector space, we always mean lattice for the real vector space of double dimension obtained by restricting the scalars from [ to R. Thus a lattice L in [ is a subgroup of the form (Jot /
L
=
Z w1 + Z ~1
wi th two complex numbers
lU· such that 1
is not real.
Wz.
Let L be a lattice in [ which we shall keep fixed. We say that a meromorphic function f on [ is an elliptic function (with respect to L, or L-elliptic) when it satisfies fez + tV)
=
fez)
for all GJE:L
(whenever one member is defined!). An elliptic function can be considered as a meromorphic function on the analytic space [/L. It is obvious that the set of elliptic functions (with respect to L) is a field with respect to pointwise addition and multiplication. It is nonetheless obvious that if f is elliptic, so is its derivative f' . Let frO be a non identically zero elliptic function. For any point a
~[
, we can use the Laurent expansion of f at a to define the
rational integer ord (f) (smallest index with non-zero Laurent coeffia
cient). This number is positive if f is regular at a, and strictly negative if f has a pole at a. By periodicity of f, we get ord (f) for any a
t.J E
orda+~(f)
L, hence we shall often consider a as an element of
[/L to speak of ord a (f). The formal sum of elements of [/L div(f) = ~orda(f).(a) , has only finitely many non-zero coefficients (because [/L is compact and the zeros and poles of f are isolated), hence can be considered as 2
- 1.3 -
element of Div([/L)
= ~[[/L]
(we consider this set as additive group,
without the convolution product). The first algebraic properties of elliptic functions are given by the foliowing (1.1) Theorem 1. Let f be a non-constant L-elliptic function, and div(f)
=
Ln. (a.) 1
.
1
Then we have: a) f is not holomorphic. b)
LRes a . (f) = 0
c)
Ln i =
d)
1
0
Ln.a. 1
in [/L
0
1
Proof: If an elliptic function f is holomorphic, it is bounded in a period parallelogram (this is a compact set), hence bounded in the whole complex plane by periodicity. It must then be constant by Liouville's "theorem, hence a). We choose now a basis 4)1 ' W
2
that f has no zero
of L and a E: [ such
or pole on the sides of the period parallelogram
Pa of vertices a, a +
~l
' a +
~2
' a +
~l
+ W2 · The integration of
f(z)dz on the (anticlockwise) oriented boundary dP 21Ci
.L Res a . (f) 1
=
a
of P
a
gives
0 because the opposite sides cancel their contributions
by periodicity of f. This proves b). Then c) follows from b) applied to the elliptic function f'lf • For d), we integrate the function zf'/f over the same boundary dP
a
. The contributions from parallel sides do
not cancel here, but e.g.
(Saa +
4Jof _
J
a +
4)1
+ W.z.
a + WI.
Since f admits the period wI ' (a
J
a
+ lUi
f'lf dz
)zf' (z)/f(z)dz f([a,a+~lJ)
a +
- 4)2
J a
4.11
f' I f d z •
is a closed curve C and
21Ci·Index(O;C)
Adding these two similar terms gives
3
Ln.a. 1
1
q.e.d.
- 1.4 -
We make a few comments on the meaning of this theorem. First, a) asserts that a non-constant elliptic function f has at least one pole. Moreover, if f and g are two elliptic functions with same divisor, the quotient fig has no pole, hence is a constant c : f = c.g . This shows that the divisor of an elliptic function determines the function up to a multiplicative constant. Then b) shows that an elliptic function f cannot have only one pole, this pole being simple. An elliptic function can have only one double pole with zero residue, or two simple poles with opposite residues. These are the simplest possibilities. Then c) brings out the fact that an elliptic function has as many zeros as poles. Thus if f is any non constant elliptic function and c any complex value, f and f - c having the same poles must have the same number of zeros. This explains that f takes all complex values the same number of times. This constant number of times f
(non-constan~
assumes every value may be called the valence (or the order) of f. For any divisor ~n.(a.) £Div(E), where we put E = ~/L to simplify the 1 1 notations, we define its degree as being the sum [n i • This gives a group homomorphism deg : Div(E)
~
~
With this definition, c) can be restated as deg(div(f))
=
0 for all
elliptic functions f ; O. Finally d) gives a further necessary condition on the divisor of an elliptic function (Abel's condition). We shall prove that the conditions c) and d) on a divisor of E are also sufficient to assure the existence of an elliptic function having precisely that divisor. We have to prove the existence of non-constant L-elliptic functions, and we shall follow Weierstrass' idea of constructing an elliptic function with only a double pole (on E), with zero residue. The construction is of transcendental nature. 4
- I.S -
For that purpose, we try an expression of the form L(z - w)-2, provided it converges. But for z fixed, and '4)/~ 00
,
(z - "') - 2
,-.J
tiJ-
2•
(1.2) Lemma. Let V be a (finite dimensional) real Euclidian vector space, and L a lattice in V. Then L IIwl- s converges absolutely for 0; tUE:L
Re(s)
>
dim(V).
Proof. We put d
= dim(V)
and we choose a basis
~l
' ... ,
~d
of L.
We observe first that it is sufficient to prove the convergence of the sum extended over the w=Ln.CJ. with positive n.~O, not all zero. We 1 1 1 group together those for which
Ln.1
n> 0 is a given strictly posi-
tive integer, and choose two constants A>O, B;>O such that 1
Aln ~ Iwl' Bin
The number of these w is asymptotically equal to the area of the simd plex in V with vertices n~. , hence is of the order of Cn - l (C~O). 1
(Actually the exact number can be computed, it is the binomial coefficient (n~~iz).) We find thus a contribution to the sum of the order of
nd-l/n s • The absolute convergence of this series occurs exactly when Re(s) - d
+
1
';>
1 , hence for Re(s»d as asserted.
Now, the series of general term (z
-~)
if we only subtract the asymptotic behavior (z - ~)-2 - w-
2
-2 does not converge, but
w- 2 ,
we get terms
which must be of the order c(z)~-3 (the incredulous
reader can check that by computing the difference of these
fractions~),
hence will give a convergent series. We define the Weierstrass' function (1. 3)
P(z)
=
p(z :
the sum being extended over
L)
=
z- Z
+
L:'{( z
- c.» - Z - w- Z
J
= L - {Of, which we indicate by a
w £ L'
prime in the summation (as a rule, we shall omit the prime for sums extended over all L, and keep it for sums over L'). This series is now absolutely convergent (for all z
L), and it is uniformly (and nor-
~
mally ) convergent when z stays in a bounded part of [ - L 5
- 1.6 -
p admits
It is not absolutely obvious that
L as lattice of periods,
but it is an even function, and we proceed as follows. The derivative of
~
can be computed by termwise differentiation
(1.4)
rl(Z)
=
=
fl(Z:L)
-Z['(z - 4»-3
(no convergence factors are needed here by the lemma, and they have disappeared automatically). This function is obviously L-periodic, hence L-elliptic, and odd. By integration, we get ~(z) + C'"
Since
~
is even,
=-
Z
gives
~/2
=
C~
(C~
a constant).
0 (we are in characteristic r2).
While we are at it, we give the Laurent expansion of
~
and
P'
at
the origin. By termwise differentiation, we see that ~
(2k)
(z)
=
~
(2k+l)! L-(z - ~)
-2k-2
so that the Taylor coefficients of the even function fez)
=
p(z) - z -2
are the f(Zk) (O)/(2k)!
=
(Zk+l) oG
.
where we have deflned G = G2k (L) = 2k get
,I -2k
+ Zk 2
(for k> 2). We eventually
L- lU
(0 < Izl < Min IwI)
(1. 5)
4J~L'
,
and by derivation (1.6)
~' (z)
-ZZ-3 +
~
(Zk+l)ZkG
k~l
+ z Zk Z
2k-l
It is easy to give explicitely the divisors of several elliptic
r
functions connected with being odd, it follows that -a
=a
~'
modL, i.e. 2a€L (and a
or its derivative. For instance,
p'
vanishes at the points a€([ such that
¢ L). A set of
points is 4Jl 12 , (J2 /2 , W3/2 wi th
W
3
l +
t.J
representatives of these 2 . Since
W
~'
has only a
triple pole on [/L, it can have only three zeros, and we must have ( 1 . 7)
d i v ( ~')
Similarly, the elliptic function
= -
3 ( 0) + (1-) + (~~) +
p- ~(a) 6
(for a
E
cY) ·
L) has only a
- 1.7 -
double pole on [/L, hence it has only the two zeros -a and a. At first,
p' (a)
this is true for a ;. -a mod L, but if Za eL, zero a of ~ -
p(a)
o shows
that the
is at least double, hence of order two, and
(1.8)
dive
(The two symmetric zeros of
r-
rea))
=
-Z(O)
+
(a)
+
(-a) ·
are not explicitely known.)
~
We are now able to derive the explicit structure of the field of all elliptic functions. (1.9) Theorem Z. The field of all L-elliptic functions is
([ (r
where
~
=X ,
and &2
=
60G
~'
'~')
(X) [y]/ (y2 - 4X 3 + &2 X + &3)
is identified with the image of Y in the quotient,
= 60l?w- 4 ,
4
';! ([
Ii
!!
=
&3
&~
-
140G
27&;
= 140I:'~-6
6
f
satisfy
0
Moreover, the subfield of eV61 elliptic functions is [( ~) . Proof. Let f be any elliptic function. We can write f as sum of an f = fl
even elliptic function and an odd elliptic function (indeed put fl(z) we can write f
=
=
£1
l(f(z) +
+
f(-z)) and f Z
p' (fZ/ P')
with fZ/
- f l )· Since
f
P'
+
fZ
r'
is odd,
even elliptic function.
This already shows that the field of all elliptic functions is a quadratic
extension of the subfield of even elliptic functions. We supVi = ord a . (f) if ai 1
pose now f is even. We let Y.
1.
~
-a i (mod L) and
= lord a (f) if Zai €,L. These are integers since f being even has i
a pole or a zero of even order at the points the product
&
=
TI(
r-
~. 1
. We consider then
tHai)))li extended over a set of representa-
tives of the classes {ai,-a i } with a i f/;. L . By construction, f and g have divisors with same coefficients, at least for all the points (a), where a
~
L, but the coefficient of (0) must also be the same in the
two divisors by condition c) of Theorem 1. It follows that f is a
p. To prove the P', we use the
constant multiple of g, hence a rational function in (quadratic) algebraic relation satisfied by 7
p and
- 1.8 -
Laurent expansions (l.S) and (1.6). Only the first few coefficients matter : z-2 + 3G Z2 4
f(Z)
pt (z) hence
= -2z- 3
+
SG Z4 6
+
6 (QCz )
+
6G z + 20G Z3 6 4
((}(zS)
+
4z -6 - 24G z -2 - 80G 6 + (D(z2) 4 z-6 + 9G z -2 + lSG + (!J(z2) ~(z) 3 6 4 -2 140G + CO(z2) -60G 4 z (z) 2 - 4 ~(Z)3 6
~' (z)-2
Thus,
P'
&3 + h(z)
-g2 P(z) where
,
h is L-e11iptic, holomorphic, and (f}(z2). This proves h =
To prove that the discriminant
does not vanish, we use a different
~
form of the algebraic relation between the divisor of
P'
o.
p and
r'.
by (1.7), we also know that of
all zeros of the cubic polynomial
4
Since we know
p,Z ,
p3 - g2 P - g3
hence we know
• They are the
w· for i = 1,Z,3, and are all double zeros. By comparing the e i = p.c~) leading terms, we get the equality 4(1' - el)(jJ - e 2 )([.J - e 3 ) (Note that this is also a special case of the explicit expression of p'2
any even elliptic function as rational function of By (1.8) we have ( 1. 1 0)
di v (
r-
e i) = - 2 (0) + 2 f~i. )
p as
derived aboveJ
( for i = 1, 2 , 3)
As the points Wi/Z are incongruent mod L , we conclude that these divisors are all distinct. In particular the functions
p-
e i must
all be distinct. This proves that the three complex numbers e i are
IT (e.-e.) r O. irj J
distinct, and the discriminant ~=
As is well known,
1
this discriminant can be expressed in function of the coefficients of the equation as g~ - 27g~ . Since the coefficient of cubic equation is 0, we also happen to see e
8
1
+
e
Z
+
p2
in the
e 3 = 0 , q.e.d.
- 1.9 -
Let Xoc[
2
2
be the curve of equation y = 4x
I claim that it is non-singular (when g~ - 27g~
3
r
- g2x - g3 .
0). Indeed, the
singular points would satisfy
1. (y2 ox
o o=
- 4x 3 + g2x + g3)
%;.(...)
2y
hence would be on the x-axis with x
= !(g2/l2)1 . But these points
3 2 are not on the curve because g2 - 27g 3 ' 0, as a short computation shows.
We can define a mapping (pez)
'P' (z.)).
fa :
Xo c 1E 2 by
IE - L -
f o (z)
=
It gives a bij ection between Eo and Xo where
Eo = E - {OJ (and E
=
f and P'
[/L as before). Indeed,
value exactly twice with
P(z) = P(-z),
assumes every complex separates z and -z
(all this mod L). This holomorphic map has a holomorphic inverse
f~l : Xo ~ Eo because its derivative z ~ (f'(z),~,,(z)) never vanishes (the only points z where simple zeros of
r',
hence
~'(z)
~"(Wi/ 2)
° are
the
~./2 1
which are
~ 0).
To go from the affine curve Xo to its natural "completion" (or "compactification") we embed [2 into the projective complex plane p2([) = [3 _ {oj /homoth., by (x,y) .....-.. (x,y,l) = (AX,AY, A ), and we
define XClP 2 (IE) by the homogeneous equation y2T = 4X3 - g2XT2 - g3T3 (I hope that there will be no confusion between the indeterminate X and the projective curve). Then we can extend fo to ep as follows.
(1.11) Corollary. Let
t---+-
p
(Z3 (Z) ,z3
p' (z) ,z3)
be the mapping
a: ~ Xcp2([) extending fo. Then ep gives an analytic isomorphism E ~ X , and this last curve is non-singular. All assertions have been proved, or can be checked directly. We only note that the point at infinity on X is (0,1,0) = (O,A,O), and that it 9
- 1.10 -
is non-singular (we shall see later that it is an ordinary flex on X). A small point deserves mention. Since [ is simply connected, and
~
is a local isomorphism, we can identify ep with the universal
covering map for X. This universal covering map is known explicitely
p and
by means of the
~'.functions.
2
y t = 4x
for the curve
3
· g2xt
2
This is an explicit uniformization
3 - g3t. (We shall see later, that for
3
2
any couple (g2'&3) of complex values such that g2 - 27g 3 I 0, there exists a lattice Lea: such that &2 = g2 (L) and g3 = g3 (L), so that the uniformization can be made explicitely for all those cubics. If, more generally, X is a non-singular projective curve of genus
~2,
the
theorem of uniformization of Poincare-Koebe asserts that the universal covering of X is the open unit disc in
a:,
hence X is isomorphic to the
quotient of the disc by a discrete subgroup of automorphisms of the disc. But the link between X - or its defining equation - and this discrete subgroup cannot be given explicitely (yet),
except in very
special cases.)
yZ = 4x 3 - gzx - g3
We can transform slightly the equation
of Xo . First, we replace y by y/2 and define new constants A
= -g2/4 and B = -g3/4 , so that we get the new equation
(1.12)
y2
=
x3
+
Ax
+
B
with
4A 3
+
27B 2 f 0
Moreover, we know the roots of the right-hand side, they are the e.1 (see proof of Th.2), so that this equation (1.12) can be rewritten
TT (e.-e.) ilj J
(1.12)'
1
I 0
It is still useful to make a further change of variables by putting /2 (e l -e 2) - 3 y, and" /\ = (e 3 -e 2) (e l -e 2) -1 , whence we get the new equation
.,r = (x-e 2) (el-e Z) -1, (1.13)
'? =
~(~ -1) (~ -
A)
with
A(A
-1)
I
0 •
This is the reduced form of the cubic given by Legendre. This equation has only one parameter, namely A (but different values of 10
A can give
the ttsamett - isomorphic - curves). By using the isomorphism between E and the curve X, we consider X as an abelian group. (1.14) Theorem 3 (Addition). a) Geometric form: three points on X have sum zero whenever they are colinear (they lie on a line). b) Analytic form: Let u f ±v (mod L) be two complex numbers. Then
p
= - ~(u)
(u + v)
-
fey)
+
r'(V)) 2
&'(u)
-
feu)
-~(v)
~ ~----4
A picture will help. We suppose that the three roots of the cubic polynomial 4x 3 - gzx - g3 are real. Then it is possible to choose a real hyperplane in [Z containing the curve Xo . The complex conjugation induces a symmetry with respect to a plane which we take horizontal :
y
t. t ~
,
I
X
~+~
complex curve X
real points of Xo
Proof. a) There is no loss in generality in supposing that the first two points have distinct first coordinates: PI with xl f
X
z ·
Let Y = mx
= (xl,y l ), Pz = (xz'YZ)
h be the equation of the straight line
+
through these two points. The elliptic function
~'
- mr - h has a
pole of order three (at the origin of E), hence has three zeros, two of which u, v are known, namely those u and v for which 'Cu) = PI and fey)
= Pz · We call P3 = few) where w is the third zero of that ellip-
tic function. The last property (Th.l, d) of divisors of elliptic functions shows that u
+
v
+
w
o 11
Cmod L), hence the sum of the P. 's 1
- I. 12 -
is zero. Conversely, if three points are on X and on a line, they must be the three zeros of an elliptic function (given by the equation of the line as above), hence have sum zero. b) We keep the notations of above, but add PI P2
(P(v)
,&>' (v)),
so that m
= Y2
-
&l'(U) -
Y1
x2 - xl
~(u) -
e'
(~(u)
,
~' (u)),
and
(v)
rev)
Also we can write for the coordinates of the intersection points of Xo and the line
y2 _ (mx
h)2
+
4x
3
2 2
- g2x -g3 - m x 4(x 3 - x 2m/4 + ••• ) 4 (x - ~(u)) (x -
This gives
~eu)
+
pev)
+
2
~eW) = m
(1.15)
~
p(v) ) (x - p(w) ) •
j4 hence the announced formula by
(mod L).
-2P(u)
+
4"1
which can be expressed rationally in
p"(u)
r, P'
2
P' (u) -2 (and the constants g2' &3)·
By induction, one can get rational expressions in ~(nu)
q.e.d.
u in the addition formula, one gets also
~(2U) =
and g3 for
2
=
pand w = -u-v
the formula for m, the parity of By letting v
- 2mhx - h
r
(and also for p'(nu)) where n is any positive
integer. The function
.~
of Weierstrass having zero residues at all its
poles, has a well-defined primitive over the simply connected space [. We can find one by integrating term by term the definition of
r
as infinite sum, adjusting the constants of integration of all terms so as to get a convergent series. Thus we define z -1 -
L'((z
(Classically, this primitive is
-~(z)
(1 • 16 )
~ ( z)
= -
- fA») -1
+
+
Z (a)-
2
)
with the notations of Weierstrass
but we refrain from using this zeta notation, because certain formulas use also Riemann's zeta function, hence create difficulties.) 12
- I. 13 -
The integration can be carried one step further, but the non-zero residues of ~
make its primitive logarithmically ramified at the
points of the lattice, so we define this primitive to be log(£(z)), with a certain function f, easily defined by a canonical Weierstrass' product. It is again a habit to define l/f = (1.17)
~
by
lTh - z/(o»)exp(~ + !(~) 2)
G"{z) '" z
By definition, the functions ~ and'~ are odd functions. ~ is holomorphic with simple zeroes at the points of the lattice (it is an entire function). Also by definition (1 . 18 )
~
=
, (z)
r(z)
The periodicity of
p gives
choose a basis
~
~,
and
=-
( tr' (z) / tr( z) )'
by integration
p(z)
~(z) = ~(z+W)
+1J(W). If we
of L as before, we put
(1 • 19)
~ ( z)
The two constants
~i
= ~ (z
(
+ Wi) + 1)i
1J i = - 2 ( ( Wi / 2)
are not arbitrary. An integration of
)
•
~(z)dz
on
the sides of a period parallelogram Pa gives (1.20) This is a relation discovered by Legendre. Moreover, the integration of (1.19) gives
~(z+~i)
=
Ci~(z)e
?)i Z
with
two constants Ci which can be computed by making z = -W /2 and using i the fact that a- is odd (also note that o-(Wi /2) f. 0 because the only zeros of
~
are the points of the lattice). We find
(1.21)
G"(z + Wi)
The function Indeed, let ~ tives
ak~[
=
~
=
-e
.,.(z+l~
lr(z)
is especially useful to study divisors on E.
[.dk(ak)E:.Div(E) be any divisor, and take representa-
of the points of E appearing in f
d
'"
Tfcr(z -
~.
Then define
ak)d k
This is a meromorphic function in
« which
depends not only on
~
but
also on the selected representatives a k of the points appearing in d a fact which we keep in mind, but not in notation! By (1.21) we have 13
- 1.14 -
fde z
+
(-1)
dk
IT(-l)·e
=
Wi)/fd(z)
Ed k
dklJi (z - a k +
~i(z + }Wi)Ld k
·e
·e
lUli )
-ryiLdkak
In this formula, we see that the two conditions deg(~)
imply that f div(f d )
=~
d
=
I:d k = 0,
l:dka k
=
0 ,
is an elliptic function. It is also clear that
because
~
has simple poles at the points of the lattice L
only. Finally, we observe that if
~
is any divisor satisfying the
two conditions c),d) of Th.l, say (k ~ 1) ,
then we can define representatives a
o
d'k
1
k dk
d
d
satisfying [.dka k = 0 and d
for k = 0 k
- 1 for k
d k for k
~
1
2
Ldk(a k ) = Ldk(a k ), so that we can k~1 ~o construct an elliptic function f d with divisor d. =
(1.22) Theorem 4. Let ~ =L:dk(a k ) EDiv(E) be a divisor on E. There exists an elliptic function with divisor
~
(unique up to a non-zero
multiplicative constant) if and only if the two following conditions are satisfied : deg(~)
=
0 and L:dka
k
€ L. If these conditions are
satisfied, it is possible to choose representatives such that
o , nat to
and for this choice, this elliptic function is proportio-
fdCz)
= TT~Cz - a k )
d
k
As application, we quote a formula. We know explicitely the
divisor of
f - rCa),
r
hence we get
Cz) - ~ Ca) = C en: z - a) O"'C z + a) O'C z)
-2
•
The constant C can be determined by looking at the asymptotic behavior when z --+ 0
()" is odd and a
ttt.
L implies
14
~(a)
;.: 0) whence
- I.lS -
C = -
~(a)
-2 . We have proved
(1.23)
~(u) - ~(v)
for u,V ; O(mod (1.23)'
LJ.
f'(Z)
-
~(u-v)~(u+v)~(u)
-2
~(v)
-2
Similarly, 2("( z) -
3
3lTer( z -1 Wi) a-O (.\j) -1
(z t!/;. L) ,
l=1
and this is also
- ()' (z) / tr( z ) ) It
by
(1.18) !
The Theorem 4 has an important Corollary which gives the structure of the abelian group of classes of divisors mod the divisors of elliptic functions. We define peE) to be the subgroup of Div(E) consisting of divisors of elliptic functions, also called principal divisors, or divisors linearly equivalent to zero. This is a subgroup of the group Divo(E) of divisors of degree zero. We have just seen, that in Divo(E), peE) is characterised by Ldka k = 0 ESE for ~ = Ldk(a k ) . The Picard group Pic (E) of E is by definition the quotient group Pic(E) = Divo(E)/P(E) (endowed with an analytic structure, as we shall show presently). (In the scheme terminology of Grothendieck, it would rather be the bigger quotient Div(E)/P(E), but we stick to the classical definitionJ Then we define a mapping ~ : E ~ Divo(E)/P(E)
a
~
(a) - (0)
mod peE)
This is a group homomorphism. Indeed (a) - (0) + (b) - (0) - (a+b) + (0)
= (a) + (b) - (a+b) - (0) is a principal divisor by the criterium given by Th.4. Then we can see that
p
when a ~
is bijective: =
~(a)
0 (in E), so that
= (a) - (0) is a principal divisor only
4?
is injective, and moreover, if
= !:dk(a k ) has degree zero, it can be written ~ = Ld k (Ca k ) - CO)) = L dktCa k ) = pCL,dka k ) , 15
- 1.16 -
which proves
ep
surjective.
(1.24) Corollary. The mapping
E ----.. Pic(E) = Divo(E)/P(E) a
~
(a) - (0) mod P(E)
is a group isomorphism. In particular, Pic (E) is canonically an analytic group. We mention a few exact sequences which connect now Div(E)/P(E) with the PieCE). Firstwe have the exact sequence of definition of Pic(E):
o --.
P(E) --. Divo(E) ~ Pic (E) ~ 0 •
On the other hand, we have also an obvious exact sequence
o
--+a
Pic (E) 21
-+-
Div (E) IP (E)
deg.
7l
--+
0
E
so that Div(E)jP(E) is an extension of
~
by a group isomorphic to E.
We can bunch together the above exact sequences in a longer exact sequence
o~
peE) ~ Divo(E) ~ Div(E)jP(E) ~ ~~ 0 .
The cohomological
interpretation of these sequences will only be
indicated in next section (on theta functions). Let us observe that the homomorphism deg : Div(E) is nothing but the augmentation homomorphism
~] ~
~
~,
~, and on the
other hand, the homomorphism [,dk(a k ) .....-. L:dkak£E is the canonical homomorphism ~~]~ E given by the (nonetheless canonical) ~-module structure of the abelian group E. The principal divisors are the elements of the intersection of the two kernels of these homomorphisms peE)
-$]=: ~
This shows that an analogue of P(E) can be defined for any abelian group G. An additive formula corresponding to the multiplicative (1.22) (Th.4) is due to Hermite. It can be explained quickly as follows. 16
- 1.17 -
Let f be an elliptic function with poles at points a
with principal
k
part ~
A 1 (z - a) + A (z - a) 2
(where a
a k and n
= n(k)).
-2
+
•••
+
An(z - a)
-n
We define the meromorphic function
-Al~(Z - a) + A2~z - a) + ••• ••• +
(-l)n (n-2)(Z - a)/(n-l)! Anp
Then we add all these explicit functions (corresponding to the principal parts of f at the poles a
L
g(z)
By construction
=
a ) k
Pa(z - a)
a=ak f - g has no pole, and is an elliptic function,
because the sum of the residues of f is zero. The announced formula follows
fez)
(1.25)
(1.26) Addendum. We look at phic mapping
p:
~
as function on E
= [/L.
It is a ho1omor-
a:u{oo} = §.2 (the complex sphere of Riemann). We
E -
claim that it is a double sheeted "covering" of §.2 with four (distinct) ramification points of order two of ~.
~
is two, and
(W/2)
el,e2,e3'~ £~
2
• Indeed, the valence
takes the values e.1. with multiplicity two because
= 0 (the case
00
is more obvious). This mapping shows that
E can be constructed by glueing two Riemann spheres with two cuts (interchanging the sheets), one joining e l to e and the other one 2 joining e 3 to 00. On the algebraic curve X, this mapping is explained as follows. It coincides with the first coordinate mapping (on Xo ) X --...
(x,y,t)
]pI ([) ~
S2
(x,t)
~
xlt
(On Xo ' we can take t = 1.) A natural generalization of this situation is to consider a projective curve X of homogeneous equation 2 2g-l = iT J I (x - e. t) (1 ~ i ~2g+l) Y t 1. 17
- 1.18 -
with the non-singularity condition
for t ; 0
1T(e. - e.) ; 0 1. J
i;j
The mapping X ~ ~2 defined again by (x,y,t) ~ (x,t) ~ xlt is still a double sheeted "covering" with the Zg+Z ramification points e l , •.• ,e Zg + l ' e Zg + Z =
~
.The involutive automorphism of
interchanging the sheets is (x,y,t)
~
(x,-y,t). This is an example
of what is called a hyperelliptic curve. (1.27) Open Problem. We have mentioned after (1.11) (p.l.lO), the general problem of explicit uniformization
of algebraic curves of
genus g>2. There is a still more general (unsolved!) problem for an explicit uniformization of curves with a given signature (we shall not define this notion) which gives an interesting problem for elliptic curves. In its simplest form, it can be stated as follows. Let Eo be a punctured elliptic curve, i.e. an elliptic curve X minus one point, which we take as the origin. The fundamental group of X o is isomorphic to a free (non-abelian) group over two generators, and the universal covering (as analytic and topological space) of Xo is known to be isomorphic to the upper complex line H ={z E.a::lm(z»
oj
of Poincare (or equivalently, the open unit disc in a:) .How can one find explicitely a uniformizing map
f:
H
~
Xo ? In particular,
Xo is isomorphic to a quotient of H by a discrete subgroup of automorphisms
r
(with finite volume quotient :
of first kind, isomorphic to How can one determine (1.Z8)
~
= g~
r
1t'l(Xo )
~
is a Fuchsian group
Z*71, free amalgamation).
in SL (2,R) by means of the equation of Xo ?
Erratum. p.I.8, bottom. If we define
- 27g~
r
n =
then ~ is proportional to D with
independent of L:
~
= -16·n •
18
IT (e.-e.) J
ifj
and
1.
a coefficient
- 1.19 -
Z. Theta functions (Jacobi)
Several proportional bilinear alternate forms appear in the theory, e.g. when one generalizes the relation (1.Z0) (Legendre's relation). So we start by showing that a lattice Le([ determines a canonical one B: ([><([ - - +
as follows. Let 4)1' we may suppose
~
R-bilinear ,
R
be a basis of L. By interchanging
Im(i1/ CJf)
~O
~
and
(we could also change the sign of
Z '
4J
~).
Such a basis is said to be direct or positively oriented. Then for
(Z.l)
BL(x,y) = det(~ X~)
B(x,y)
Y1
By definition, it is obvious that B(Wl,W )
Z
z)
Yla =
1. More generally,
!l (for W!1 E: L) whenever (w!) is a basis of L 1
B (Wi' W
and more precisely, 1 (ibid.) whenever (Wi) is a direct basis of L This shows that B is defined independently from the choice of the direct basis Wl,W . Thus B is an m-bi1inear alternate form on [. Z But (x,y) ~ Im(xy) = (xy - xy)/Zi is also such a form. Hence B must be proportional to this form, and exactly B(x,y)
=
for any direct basis Wl,W
Im(xy)/Im(WlWZ) . We can express slightly differently this
Z constant of proportionality. Indeed,
Im(~Wz) =
Im(w1W1WzlCAi) = /Wllm(W z/"1) = l~zArea(l.c."zIwl)
Area(Wl,WZ) = S ~ 0 . We define this constant S to be the mesh of the lattice L . Hence =
(Z • Z)
B(x,y)
=
1
-
-
1
-
~
B(x,iy). From (Z.Z), we
ZiS(xy - xy) = SIm(xy)
Let us look at the bilinear form (x,y) 19
- 1.20 -
see that B(x,iy) = S-lIm(ixy) = S-lRe (xy) is symmetric, m-bilinear and positive definite: B(x,ix»O for 0
~
xE:a:. To sum up, the
R-bilinear form B has the following properties (2.3)
a) B(y,x) = -B(x,y)
(B is integral on L)
b) B(L,L) C 7l
c) (x,y)
(B is alternate) , ,
B(x,iy) is symmetric positive definite. (2.4) Remark. When L is a lattice in a: n , the existence of an m-bili~
near form B : a:n~ a: n ---. R
satisfying the three conditions of (2.3)
gives a criterium for the existence of sufficiently (global) meromorphic functions on
r?L
to be able to separate the points. When this is
the case, a:n/L has a projective embedding and is called an abelian variety (over [). Any form B satisfying (2.3) is called Riemann form on a: n • We have just seen that for n = 1, this condition is always satisfied (more precisely
canonically satisfied).
Now we start with the subject of this section. It is impossible to find holomorphic (non-constant)
L-el1iptic functions, but we can
try to find holomorphic functions which satisfy equations of the form 9(z + ~)
=
eA(W)z.+ B(W)9(Z)
(W£L).
The quotient of two such holomorphic functions with same A and B will obviously give L-el1iptic functions. The
~
function of Weierstrass
has the above property, and the transformation formula (1.21) leads to another definition of the constants (also the relation of Legendre (1.20) shows that it might be useful to introduce a X in
~
to get a
more algebraic form of this relation). We adopt the following (2.5) Definition. A theta function 9 of type (h,a) with respect to L
is a meromorphic function on the complex line a: satisfying 9(z + w) = a(l.\J)e1rh (W) (z+1fc»9(z) (c.)€L). We note that a theta function 9 is characterized by the fact that (9'/9)' is an elliptic function. We shall later on be mainly interes20
- 1.21 -
ted in holomorphic theta functions, but it would be too much of a strain to restrict ourselves to them from the beginning. It follows immediately from the definition that if we have theta functions "."
,."
"",
9 of type (h,a) and 9 ; 0 of type (h,a), then 9~ is a theta function of type (h + h,aa)
i-I
"
"
"
"
"
(-11, i-I)
9/9
"
"
"
"
"
(h - h,a/a)
",
,."
Examples of theta functions abound. First comes to mind the
~
function
of Weierstrass, and all the products of its translates. In particular all elliptic functions are theta functions. Moreover, the exponentials z ....... e
C'(z 2 + (jz
+
r
are also theta functions, as is easily checked.
Because the multiplicator
a(~)exp(~h(w)(z
+
l~))
is an entire function
with no zero, we can define unambiguously the divisor of a theta function div(9) as element of Div(E) (the order of 9 at a point a is the same as the orders of 9 at all translates a + W periods
c.J Eo L).
of a by all
For example div(~)
(one point of E with multiplicity one) ,
= (0)
diV(z~eo(z2.+(jz+t) =
0
(neutral element of Div(E)).
For this reason, the exponentials of quadratic polynomials are called trivial theta functions. The type of a theta function 9 is a couple of a:L--'(f,~,
h:L--..a:
with interesting properties. First,
replacing
shows that h is a homomorphism, hence extended in R-linear homomorphism (f, coboundary
ah is defined by
is R-bilinear alternate (f,
~
map~ings
~-linear,
~
~h(x,y)
(f,
---+
u::.
W
by w+ w' in (2.5)
hence can be uniquely
(f, still denoted h. Its
h(x)y - xh(y) , so that
~h
It must be proportional to
the canonical B defined in (2.2), the constant being determined by the value
~h(~,W2)
for a direct basis 21
(~i)
of L. But
- 1.22 -
2nideg(div9) Hence we have proved dh(x,y) = 2i.deg(div9)·B(x,y)
(2.6)
=
S-ldeg(div9)·(xy - xy)
This is a generalization of Legendre's relation (1.20). (2.7) Definition. A theta function 9 of type (h,a) is said to be reduced when
9, 0 and la(w)1 = 1
r
Every theta function 9 9
such that 90 (0)
and
is real
h(w)w
(for anYWE:Ll.
0 can be uniquely written in the form
9 o ·9 re d where
e0
is a trivial theta function
1 (hence of the form 90 (z) =
exp(~z
2
+ ~z))
and
9 red a reduced theta function. It is indeed sufficient to adjust the two constants
0(
and
~
suitably. In particular div(9)
div(9 re d).
(2.8) Proposition. Let 9 be a reduced theta function of type (h,a) and put d = deg(div9). Then h(x) = dS
-1
x.
~
If moreover 9
.
~
another reduced theta function. a) 99 and 9/9 are reduced theta functions , b) div(9) = div.(9) implies 9 = c~ (c Proof. h(z)
€
a:)
We can write the R-linear extension of h in the form AI
+
Bz (with constants A and B). Because h(z)z = Alzl~ Bz 2
must be real for all z (as follows immediately from h(W)w
real for
all WE: L and real linearity), we see that B = 0 and A is real. Then (2.6) gives h(z)
= dS -1 I
Then a) is trivial, and to prove b) we use
it, and replace 9 by 9/8 . We have to prove div(9)
= 0 and 9 reduced implies 9 = constant.
By the first part of the proposition, div(9) = 0 implies d = 0, hence h
= O. Thus
9(z
+
w) = a(w)9(z) and because lal
that 9 is an entire bounded function 22
= 1, we see
(\9{ is periodic). Liouville's
- I.Z3 -
theorem shows then that 9 must be constant. Let now ~red be the set (a ~) of reduced theta functions. Then
P(~red)
~red mod. scalars
is a group (if 9 F 0 is reduced, so is 9- 1 ). (Z.9) Corollary. The canonical mapping div
W(~red) ~ Div(E)
e mod
homoth.
~
div9
is a group isomorphism. Proof. This map is a group homomorphism because div(9~) = div9 The map is surjective by using the reduced theta function corresponding to
ITcr;.ed (z
+
dive.
~red
- ak)d k
The map is injective by b) of Prop. above. It is thus equivalent to work with reduced non-zero theta functions (mod multiplicative constants) or with divisors on E. The proof of (l.ZZ) only used one facet of this general fact. (2.10) Theorem (Appel-Goursat). Let d~l,
be a divisor on E of degree
~
and choose a theta function 9 with div(9)
=~.
Then the
complex vector space of holomorphic theta functions having same type as 9 is of dimension
deg(~) =
d .
Proof. Let (h,a) be the type of 9, and define T(u)
=
a(~)-uexP(-l~h(Wl)WlUZ).~(Wlu)
for every holomorphic theta function
f
having type (h,a). We shall
show that such functions; exist! After a small computation, one finds T(u
(*)
Also, replacing u by u
+ 1)
= T(u)
WZ/Wl (where we suppose the basis (Wi) of L direct, hence 1m (1:) > 0), we see that T(u
+ ~)
=
+ ~
with
T(u)a(~)a(~l)
-1:
~=
expWu(h(WZ)Wl-h(Wl)WZ) ·
23
- 1.24 -
that is,
We define C
1:
and q
a(w2)/a(~I)
= exp(~i~),
so that the above
transformation formula is more simply T(u + ~) = Cq-dexp(-Z~idu)T(u)
(**) If
~,hence ~
exist, we see from (*) that there will exist a holomor-
phic function F in
« - {OJ
such that
T(u) = F(~) = F(e
Zniu
).
The Laurent expansion of F at the isolated singularity 0 £
a:
will have
the form -0.
But C**) implies certain relations for the coefficients at • Indeed exp(Z1t:'i(u+1:)) = eZxiueZn:i-c = qZC
and exp(-Z1t'idu) =
l:-d
gives
t
This in turn gives, by comparing the coefficients of ( , or
a,tqU
Hence
at C-k q Zt+d .q Z (t +d) +d ••• qz[t +(k-l) d] +d
But
Zt+d + Zt+3d + •.• + zt+(Zk-l)d = Ztk + d(l + 3 + ••. + (Zk-l)) --.,.--J
~
~
= Ztk
+
dk Z
Finally, we see that atC
a t + kd
-k Ztk dk Z q q
and by summing these coefficients together, we get F(t)
=
Lat'l/' eo
-..
=
.l-1
Z
L a {,(,LC-kqUk+dk ~kd IJ
I=o:t 24
lc,Z
- 1.25 -
with
F.t (~')
=
L.
C-kqdkZ+Ztk (dk
k£ 1l
Now all these series converge absolutely because Im(~.".O implies Iql = le 7Ci l:J= e-1rIm (1:)
Iqdktl = ,ql dJc1
and the term lkl ~ ~
t
=
decreases faster
<
1
than all the others for
· This shows that all F~(t) are well defined in [~ for
0.1, ... , d-l • We could have started by these functions, and in
reverse direction define
T and finally
f .
As the space of the F's
is d dimensional, so must be the space of the T, and also the space
+ , which
of the
proves the theorem.
It is interesting to look at the case of 9
=
d = (0) has degree precisely d proof, F (,)
o
with C
=~,
so that
1. With the notations of the above
2
= L:c-kqk ~k k
a(WZ)/a(~)~ T (u) = o
= -(-l)-~
L:
kE:ll
_e-~i~
• Substituting,
(_l)ke~~(kZ+k).eZ~ikU
Grouping the terms k and -k-l for k ~ 0 gives
To(u)
= l:
k)O
hence finally
(Z.ll)
To(u:L)
(_1)kqk(k+l){e2~ikU - e - 2n'iku - 2'1t'iU] ,
= Zie-XiUl:t~~k(k+l)sin(~(Zk+l)U) k~o
The dependence of To in the lattice L is through q exp(ni~/Wl)
= eXi~ =
. This function has a zero at u = 0 and since we have
taken d = 1 (and
~>O)
it must be the only theta function of its
type (up to a multiplicative constant) by the theorem. It must in particular be proportional to ( Z • 11) ,
en z)
= c-e - xi z1(,)• .L
Coming back to u = z/lAll we have ~~ k (k+1 ) •sin [ 'IC( Zk+ 1) z/
WI]
k~O
25
- 1.26 -
We may introduce an order on the group of divisors of E as follows
L d k (a k )
d ~ 0 for all k , k and if JIlore generally £ ,£'€Div(E) , d =
d >;: d' ?"-
);: 0
whenever
£ - £'
whenever
-
~
0
In particular if d is the divisor of a theta function 9, we see that div(9)
~
0
exactly when 9 is holomorphic. Although the divisor of
the zero function is not defined, we make the formal convention that div(O)
~
£ for all £ € Div(E).
Then we define L(~)
for any divisor £
~
=
{f L-elliptic: div(f) ~ -£
i
Div(E). Due to the preceding convention 0 E:L(£) ,
and it is easily seen that
L(~)
is a complex vector space (the nota-
tion L comes from the appellation
linear
system, sometimes used
for this space, or for the set of divisors of the form div(f) for f
E: L (£)
).
(2.12) Corollary (Riemann-Roch). If d is a divisor on E of degree then
dim[L(~)
I,
= deg(£) .
Let 9 be any fixed theta function with div(9) =
Proof
~
~
(we may
suppose 9 reduced, cf. (2.9) ) and let (h,a) be the type of 9. Define then
f
~
Because
~/9
f
on the space of theta functions of fixed type (h,a).
and 9 have same type, their quotient f,
=
~/9 is an
elliptic function, and div(f,) = div(f) - div(9)
div(4)) - d
We see that
~
cP........
f+ =
div(4=»
~
0 #0 div(f+)
~
-£
establishes an isomorphism (of inverse
f9 ) between the d-dimensional space of holomorphic theta
functions of type (h,a) (Th.(2.10)) and 26
L(~).
- 1.27 -
Let us now indicate how the theta functions are bound with the cohomology of the elliptic curve E, and in particular with the holomorphic line bundles on E. To any L-theta function 9 we associate a holomorphic line bundle F
~ E on E as follows. Let (h,a) be the type of S. We define S a holomorphic action on the trivial line bundle on E = [ , compatible
with the translations in the base by ( z + CAl, a (w) e 7[ h (w) (z +
(z , v)6)
The quotient
E ><
of
[
--+-
J w) v)
•
'E (first proj ection) under this action of
L gives the holomorphic line bundle Fe
~
E . By definition, we could
as well write F(h,a) instead of FS . It is obvious that z
~
(z,S(z))
defines an equivariant section of the trivial line bundle on 1(, hence can be identified with a section of the quotient Fe
~
E . (We say
section instead of meromorphic section.) If S' is another section of F
~ E, S' can be identified with an equivariant section of the S trivial line bundle on E, hence with a theta function of type (h,a).
Thus e'le is elliptic and S' = fS (with an elliptic function f). Thus FS = Fe'
· Moreover, if 8 is a trivial theta function, Fe --+- E
has a holomorphic never vanishing section 8, so that it is holomorphically trivial (hence the denomination of trivial theta function). If 8 and 8' are two theta functions with same divisor, their quotient must be a trivial theta function, so that Fe and F , are holomorphi8 cally equivalent line bundles on E. Again, we might as well note Fd
instead of FS ' and summing up, we see that Fd depends only on the class of the divisor ~ mod P(E). If we assume the basic existence (by no means trivial) theorem of existence of meromorphic sections for all (holomorphic) line bundles over a Riemann surface, it is easy to see that
d
~
Fd defines a bijection between Div(E)/P(E) and the 27
- 1.28 -
set of isomorphism classes of (holomorphic) line bundles over E. This set is naturally a group for the tensor product (corresponding to the usual product of cross sections 9) so that
Fd is a group isomorphism (the line bundles are often called invertible bundles ~ ~
for the reason that among the ho10morphic vector bundles with finite dimensional fibers, they are precisely those which are invertible for the tensor product). Exp1icite1y, the inverse bundle of F(h,a) is F(-h,l/a) as follows from the formula giving the type of 1/9 in function of the type of 9. The theorem of Appe1-Goursat (2.10) may be restated as follows : The holomorphic sections of the line bundle Fd ~ E , when deg(~) ~ 1, make up a complex space of dimension equal to deg(~). (When deg(~) = 0, this dimension is not always 0 indeed, the constants are sections of the trivial line bundle, and so we see that this dimension is one when d is principal. The correction to bring to the formula giving this dimension for small degrees is due to Roch. Riemann only gave the formula for divisors of degree sufficiently high, with an exact bound. With this in view, the Corollary (2.12) should be called Riemann's instead of RiemannRoch' s!) We come finally to the cohomological interpretation of the theta functions and classes of divisors, by introducing the structure sheaf
0
of E (sheaf of germs of holomorphic functions on E), and its
subsheaf of units
17~ (sheaf of germs of nowhere vanishing holomor-
phic functions). Let also
~
denote as usual the normalized exponential
(of period 1) considered as sheaf homomorphism kernel is the constant sheaf
~,
e:
0
~ ~~
.
and we have an exact sequence of
sheaves
28
Its
- 1.29 -
This exact sequence gives rise to the exact cohomology sequence HO (E, II
a:
({)) --+-Ho (E, eJ1--.H I (E ,I) --....H l (E, 0) -.H l (E ,(t'M) -+-H 2 (E ,ll)--..O e U,.."
--=--..
Because e : a:
,y
a: ~
~ a:~
is surjective, we may cut the beginning of the
above sequence and get the shorter exact sequence (2.13) We interpret each term. By Serre's duality HI(E,fP) is complex vector space
HO(E,.i): the
J1l(E) of global holomorphic differentials on E.
These differentials are all proportional to any
non-~ero
of them
(and all invariant under translations, as we shall see later) so that HI(E, 0) ~ Then Hl(E,I) ,.., is a lattice in HI(E,
([·CJ
(!))
~
a: •
(the lattice of "integral"
holomorphic differential forms) which can be identified with L
~
a:
under the preceding isomorphism. The sequence (2.13) starts
0 ~ L ~ a: ~ •••• The last term H2 (E,1l) is easily identified with H2 (L,Z) (group cohomology of L with trivial thus as follows:
action on
,.",
Z because L has no torsion, acts freely on the contractible
space a: with quotient precisely E : Hurewitz Theorem). This group can also be interpreted as space of
Z-bilinear alternate integral forms
on L. These forms are all proportional to the canonical one B : H 2 (L,Z)
~ Z.B· ~
7l
The space HI(E,~~) is the most interesting one. It can be computed by taking a covering of E with open discs U. in ([ (mod L), each
U~
being sufficiently small to avoid pairs z, z+ 4J (w E L - (OJ). Since all the Uo( and their finite intersections
U
~ ... ~
=
n .c, U
t,it'
are
contractible (by convexity), hence cohomologically trivial, the sheaf cohomology HI(E,~~) is already the (sheaf-)~ech cohomology of the nerve of this covering (with variable coefficients in the groups
- 1.30 -
5
on the couple of indices
A l-cocycle
so(fS E: (?><cu..p )
CoC, (3) ~
:
(d,P)
In other words,
(o(,p Soft
~s =
-1
'1) ~ s~1 so(l so(~ =
= se(t1"
s(31
f ¢
of the open intersecting discs U« .
is a l-cochain s such that ~s
when U"'~
on
0, i.e.
1 on
u"Pt = u.nu,..nu, ·
(when this triple intersection
uo(~y
is non-empty). This is a compatibility condition on the transition functions
s.~
defined on the non-empty
.
u~~
A l-cocycle is said to
be a l-coboundary, or more simply to be trivial, when there exists a map
G":
Cl( I--+-
cr(aI)€t1>CCU",) with
5
=~r
C"',13)
:
1---+
rC~)/cr(",) on U
olp
To each l-cocycle s one can associate a holomorphic line bundle Fs
~
E on E as follows. We glue together holomorphically the
a:
trivial line bundles on the Uo( , Pee : U-e x
--+-
U-<
by means of
the slip
::[
(z, v)
(z,s (z)v) 0«(3
u )(: ([
~o(l
Uf!> {\ Uo( Th~
_
Uo(
compatibility conditions on the
s~~
[
n U~ (cocycle condition) ensure
the coherence of these patchings. The line bundle Fs will precisely be holomorphically trivial when s is trivial (existence of a global holomorphic, nowhere vanishing cross-section), so we have an isomorphism
Hl(E,~) ~ s
~
group of holomorphic line bundles /E (Fs~E)
We have already indicated the fact that this group is also isomorphic to the group of classes of divisors Div(E)/P(E) . The co boundary homomorphism
30
- 1.31 -
is given on the line bundles precisely by ~
: F(h,a) ~
(1/2i)ah
d·B
(where d is the degree of any theta function of type (h,a), or equivalentl~ the
degree of any meromorphic section of
F(h,a)~E
).
This justifies in some sense the terminology adopted for "coboundary ah" of
the homomorphism h : L
[ . (More precisely,
~
we should have defined the coboundary of h by C) h (4)1
' lcJz)
=
( 1I
Cc1 h ( w2 ) )
2i) (h (WI ) 4)2 -
If we identify 7l·B with 7l , the coboundary
0 gives what is known
c : Hl(E,~~) ~
under the name of Chern character
class c(F) of a holomorphic line bundle
, but... ! )
F~E
7l : the Chern
is computed (in theory
at least) by taking the degree of the divisor of any meromorphic section of
F~E
. With these interpretations, the exact sequence
(2.13) gives the (isomorphic) exact sequence (2.13)'
o
---+
L -+- [
--+-
deg group of 1ine bundles IE --... Z '--
~
--+-
0
./
or HI (E ,~), or Di v (E) IP (E) The subgroup of H1(E,eJJ consisting of line bundles of Chern class zero (or the subgroup of classes of divisors of degree zero) is thus isomorphic to E = [/L . This is the previously defined (1.24) isomorphism. These line bundles are also called flat line bundles on E and are precisely those which are topologically trivial. So we restate the definition : Pic (E)
"connected component" of Hl(E,~>c) group of holomorphic line bundles over E which are topologically trivial (trivializable!) .
It is important and interesting to note that Hl(E,~~ is also the group of invertible sheaves over E (~odules or O-modules, locally free of rank one), by associating to the bundle
F~E
the sheaf
of germs of its holomorphic cross-sections. The point is that this 31
- 1.32 -
sheaf can be defined without speaking of the line bundle (a real advantage in characteristic, 0
!) as follows. If s EHl(E,~~) is
a coherent system of transition functions, the sheaf by
~s(U)
s is defined
~
= r(U,fs ) = group (or ring) of holomorphic functions on U.
The restrictions are twisted as follows:
f
---+
s~oc
·restr(f) to Uoc(3
Equivalent formulations are, when starting with a divisor d = Ldk(a ), k to put { f:U-..([
d
fez) = (z-a k ) kh(i) in neighbd.
of a k if in U, wi th h hOlomorphiC} , and when starting with a theta function 9 of type (h,a), to put ta(U) = {f:U+L~[ holomorphic, satisfying
f(z+~)=a(~)enh(~)(z+l~)f(z) for Z€U+L} • The sheaf restrictions in these two definitions are the usual restrictions of functions. (2.14) Complement 1. We show why it is sufficient to consider multiplicators of the form Let indeed
F
~
B(W))
exp(A(~)z +
when treating theta functions.
E be any holomorphic line bundle over the elliptic
curve E = ([/L. We use again the basic existence theorem of Riemann surface theory which gives the existence of a meromorphic section s of this bundle. We choose a lifting of this section to the trivial line bundle over the universal covering has the property that (s' Is) (z
+ IU)
-
E = ([ of E. This section 5
(s' Is) (z) is holomorphic
everywhere (entire function) in z for a fixed element w of L. But ~'/s is a section of the canonical (cotangent) line bundle over E.
As the tangent, as well as the cotangent line bundles over E, are trivial, this proves that (s'/s)(z
+
32
~)
-
(s'/s)(z) is constant in z,
- 1.33 -
hence (5' Is)' is an elliptic function. This is what we wanted to prove. (2.15) Complement 2.
It is possible to give more explicitely than
has been done, the possible types of theta functions. Let 9 be a theta function of type (h,a). We have seen that a) h : L
~ ~
is an additive homomorphism,
= 2id·B where d
b) ~h
degdiv(9) ,
and a simple computation shows that a : L ~ ~~satisfies c) a (t.o + w') = (-1) dB(W ,t.t)') a (w) a ( W r) for t..), lAJ' E: L. In particular, fal
: L -+-
TIt.:
is a homomorphism, and this is what
was needed to decompose B = Bo·B red as after (Z.7). As the proof of (2.8) shows, we can write h(z) = (d/S)z + «·z with a complex constant ~.
Moreover, if
win ~L (for
f:
L
new)
denotes the biggest positive integer n such that
tV £L), we see that (_l)dafl(W) a(tJ)
~ [~
~ t - + Re(~tU)
is a homomorphism
the absolute value of which must be of the form for a suitable complex constant
~.
Summing up, the three
above properties a),b), and c) imply that
z + o(·z
h(z)
(diS)
a (c.))
(-1) d-n (c.)) eRe (~c.)) (w)
X
with complex constants
(U = [1 = unit
oC,
(5, and a unitary character ~: L~ V
circle in [). Conversely, all such functions satisfy
the three conditions a),b),c) and hence form the type of a theta function B (with divisor of degree d). (Z .16) In Jacobi t s theory, the variable q
plays as important
.a
= e 1Ci"t where -c =WZllU l
role as ~ itself. If we put t = q2, the
normalized exponential! gives an isomorphism ~
: [I
Z
"-".
[~,
whence also
~ : [I
1l + Z
--r """"'.
[)C I t'll
,
where t~ denotes the multiplicative subgroup of [~ generated by t 33
- 1.34 -
t
Obviously» 1m(1:) > 0
71
=
~t n
: nE
1
~
is equivalent to Itl
defined an elliptic curve as being a quotient of the multiplicative group
Itt < 1
[~
by a discrete subgroup generated by an element t of modulus
(or Itl>l, but not
It(
= 1).
We shall see that this point of
view is more suited to the study of p-adic elliptic curves.
34
- 1.35 3. Variation of the elliptic curve and modular forms
To compare different elliptic curves, we consider two lattices Land L' in [. Then we put E
= [/L , E' = [/L' and compare them by
means of holomorphic maps.
f:
(3.1) Proposition. Let
E ---+- E' be any holomorphic map. If
Cf
is not constant, it is induced by an affine linear transformation Z
...-.. 0( Z
C
: E
Il
+ (3
--+-
z ..-.. z +
and is surj ective. In particular, if we define
E' Q.r Z ....-.
f3
(cons tant map) and T~
~ (translation by
0/ is a homomorphism E ~
p in E'), then E' · (Ii f is not
ep
=
t
= Tp
+ C(3
fez
+ w) -
fez)
0
t
and
constant, it is surjective~
Proof: We choose a lifting, to the universal coverings and note that
.!?L
E' ---... E'
E = E'= [ ,
must be an element of L' for every
W€L. By continuity in z of this expression, we see that this diffe-
rence
is independent of z and by derivation,
L-elliptic function. By (l.l.a) we see that consequently ~(z) Obviously
0(
=
«z +
~
f'
must be an entire
~'must
be constant, and
with two complex constants
must send L into L'
O(·L eL'. When
0<
«and ~ .
f 0, this is a
serious restriction on Land L'. All assertions of the proposition follow from that. (3.2) Corollary. and such that
Ii
is a holomorphic map E
~
~
E' , is injective
0, then it is a group isomorphism, and the
~(O)
lattices L , L' are homothetic. Another proof of this corollary would be as follows. Let a', b' and c'= a'+b'€E'. By (1.22) the divisor (O)+(c')-(a')-(b') is principal, hence of the form div(f') for a function f' on E'. Let f
=
f'of . Its divisor is div(f)
=
(0) + (c) - (a) - (b)
35
- 1.36 -
= ,-l(d), •••. This principal divisor must also
where we have put a
satisfy Abel's condition (apply (I.Z2.d) again), which gives c
=a
+
b. This proves that
f
,-1 is a homomorphism, hence
is an
isomorphism (and is biholomorphic). For
1:
the notation
in the upper half-plane H = {1:' ([: Im(t') >' for the lattice
L~
has a direct basis
7l
L~
+
oj
we introduce
7l·'t'C([ • Every lattice L
homothetic to (at least) one Lt' I ,I4)Z hence is (take 1: = ~Z/ t..)1) • But if we choose a direct basis such that 'flJll is W
minimum among the modules of the non-zero elements then
WE:L
- {OJ , and
CA)Z of module minimum among the elements of L - 7lo41 (so as to
have a direct basis), then we can say much more on by construction, This leads to
J-rJ )
\1: ±
I};;'
(x
imaginary parts,
\xJ
means
= J Rett'»)
Jwz
1. Then also
11:1
and if
~ 1) 2
'1·
+
t
~IJ ~
IWzi
~/~ = ~
by choice of
•
First, <..)2.
1: = x + iy is decomposed in real and
x2
yZ ~
+
Replacing £.)2 by
y2, hence
Wz - tA\
Jx
t
I) ~
Jxl.
This
if necessary, we may
assume (3.3) Again, replacing Re(1:)
'0
(~l' ~Z)
when '--=1 = 1
by
(~2'-~)
if necessary, we may assume
(cf. picture below). We denote by D the subset of the upper half-plane H defined by (3.3) and Re (-r)
H
'0
when 11:1 = 1. Then the
lattices L1:' for
"t'6D are a system
of representants for the classes of homothetic lattices in ([ . It can be checked directly that -f
-.t 2
o
i•
L~
has only
two direct bases satisfying the above conditions when namely
36
(l,~)
and
~~ ~(t/3),i
(-1,-~).
In the
,
- 1.37 case of the square lattice
~ =
i, four direct bases of that kind are
available: (l,i), (i,-l), (-l,-i) and (-i,l). Similarly in the case 1:
= ~(1/3) =
S
(primitive 3~ root of 1), there are six possible
choices of such bases (hexagonal lattice). When we refer to the fundamental region D defined by (3.3), we always implicitely assume that
Re('t)~O
11:1= 1, as indicated just after (3.3).
when
Most functions encountered in Weierstrass' theory (first section) had homogeneity properties in the lattice. For example ~(~Z:AL)
= A,-2 P(Z:L) =
G
2k (A L)
1\-2k 1\
G
2k
and
(L)
= A-12~(L)
IJA ("L)
We introduce slightly different notations. When ~(z:~)
=
0, we put
P(z:LT )
G2k ( ~) = G2k (L~ ) ,
always with Lor =
Im(~)~
A (1: ) = ~ ( L~ )
7l + 7l1: cO: • Since
~
,
never vanishes, we can
define the homogeneous function of degree zero
J(~) = g~(T)/a(~) is homogeneous of degree -4 in the lattice, and we 4 look at this function as function in Im(t') > 0). It is important to (remember g2
60G
note that by uniform convergence of G (T) Zk
for 1:
Im(~)~
£
> 0,
=
L'(m-r + n) -Zk
(k
> 1)
we can evaluate the limit of this expression for
= x + iy (x fixed)
, and y
~ 00,
by taking the limit of each
term in the summation. All terms with m ; 0 tend to zero, so that n -2k
2 ~(2k)
These constants can be computed explicitely (by comparing two expansions of cotg(z), and they can also be linked with the Bernoulli numbers: in particular, they can be seen to be
37
~
0). The homogeneity
- 1.38 -
properties of these functions as function of the lattice give very remarkable properties in the variable (3.4) Proposition. Let
~,~/€H
't'EH
= {z 6(1;
Im(z)
:
7
oj.
and LT,L v be the corresponding
lattices. Then a) L"t' = L"t: if and only if b) L1: / ''''' L-r if and onl y i with a matrix
c(
or + n wi th an integer n ,
t"'
(a 1:
f~'
= (~
~)
E: SL
2
b) / (c 1:
+
(~)
+
d) = 0(' (or)
(integral coefficients
and determinant +1). In this case, the factor of proportionality is
C1: +
d :
(CL +
d)L1:'
= L~
Proof. The first part is easy. For the second, we take any AE[~ such that ALTJ
= L~
· Since
and A must belong to Lt we can write
aT + b
{ wi th 0< = ( ca
A~'
C1:: +
d
shows that there exists another integral matrix of the form
=
~-l A L-r
~.~
and
db) ' an integral matrix. But conversely L1;"'
(1 0
n) 1
~
with
«o~
(n an integer) by the first part. In particular,
must be an invertible integer. This proves det(<<) = tl .
det(~)
A simple calculation shows that
Im(-r') hence det(oc)
=
>0
(ad - bc) 'c~
+
by hypothesis
dl- z lm(1:)
=
det(o<) lct:
+
dl-zlm(t:)
H. Summing up, O(E:SL (Zl) as Z asserted. If conversely 1:' = O«T) with «E:SL ( 7l), the above computaZ tions show also that (C1: + d)L't'1 = L1;: q.e.d. 'r,~' €
(3.5) Corollary. The holomorphic functions
for any
0(=
(ca
b)
G2k ( ex ('1:'))
= ( c 1:
GZk(~)
satisfy
+ d) 2kG 2k ( 1:) , ( k >' l)
d e:SLZ(Zl), and tend to a finite limit when
1:
tends
to infinity on the imaginary axis. This is just a reformulation of the homogeneity properties, in view of the proposition.
38
- 1.39 -
We introduce the following notation : Jacobian of (t'f---.+oO«('t')) at
J (oC , z.)
Thus, if
0(
= (ca ~), J (0<, "t')
(
a1:' +
C'r +
b)' _
d
(ac1:'
+
ad -
~ =7.
ac~
-
,
(DCESL ( 7l)). 2
cb)/(c~ +
d)
2
(C1: + d)-2
hope that this function will never be confused with the modular
~e
invariant J
= g~/ f:.
,especially since they do not depend on the
same variables.) The corollary can be reformulated as
for
't= ~
Hand
0(
E:
J(~(~))
When
0(
't~ ~ +
=
-k
G2k (T) SL 2 ( 7l). Also for example
G2k(<«~))
J(<<,~)
= J(L)
is of the form (~
(k
> 1)
(by homogeneity of degree 0).
~) (nE:Z), it gives an integral translation
n in H, having Jacobian 1, hence all the preceding functions
are periodic of period one (even when the degree of homogeneity is
1
0) and thus can be expanded in a Fourier series of the form
~
L- ane
Z1Cin~
-00
Since the absolute value of e 2'1Cin't
for
t: =
iy purely imaginary, is
e -Z1t'ny , we see that the G
can have no terms with index n negative Zk (they are~ounded when ~ tends to ~ on the imaginary axis). The
same happens for
~(~),
and as consequence,
J(~)
will have only
finitely many non-zero Fourier coefficients of negative index n. The behavior of any holomorphic function with a Fourier expansion (fini tely many an 1 0 for n < 0) for
of the form
z = iy purely imaginary and y with absolute value
land
~ ~
e2~dY
is given by the first term,
if no
= -Inol is negative, and
is the index of the first non-vanishing Fourier coefficient. These properties lead us to a general definition of modular forms. (3.6) Definition. Let k be any integer. A (meromorphic) modular form of weight k is a meromorphic function f on the upper half39
- 1.40 -
plane H, satisfying : J(o<,z) -k fez)
for OCE:SL Z( 7l) (whenever one member is defined!)
a) f(OCCz))
b)
/f(
Z
)1-<
/\y for some (positive) real constant /\,
when y = Im(z)
~
00
•
The second property means that there exists a constant C';)o 0 such that \f(
z)1 <: ceAy
for all y greater than some large yo(depending on
A,
c
and of course f). This condition ensures that the Fourier expansion of f (which exists by a) ) has finitely many non-zero coefficients of negative index. The smallest index n corresponding to a non-zero Fourier coefficient, is called the order of f at infinity, and denoted by ordoo(f) = neo . The orders of f at all points P E: D (fundamental region of H defined by (3.3) ) or at all points P E: H are defined as usual by considering the Laurent (or Taylor) expansion of f around P. ~
For convenience, we put D = DU{oo} , so that we can speak of ordp(f) ~
when PE.D , and f ;
o.
The extent to which our examples GZk exhaust the possibilities of holomorphic modular forms is tackled with a detailed study of the poles and zeros of general modular forms. We have indeed (3.7) Proposition. Let f f 0 be a modular form of weight k. Then ordoo(f) where
~
=
~(1/3)
+
iordi(f)
+
tord~(f)
+
L.ordp(f) = k/6
(primitive third root of 1) and the summation is
extended over the points P E D - {i ,~J
(P f i, t) .
This proposition suggests strongly that we should adopt a new definition for the orders as follows : viCf)
= }ordiCf),
v~Cf)
= ord CCf)/3 and vpCf) = ordpCf)
for all other PE~. The formula could indeed be written then more simply
40
- 1.41 -
As for theta functions, we could show that the modular forms come from meromorphic cross-sections of suitable line bundles over (space of orbits r(z), with t~
r= SL Z( 7l)
r\H
compactified by adding
)
one point at infinity. These line bundles would be the
~-powers
of the canonical one (cotangent bundle), so that a modular form of
u(-J.
weight k comes from a k-differential on r\H
Proposition (3.7)
expresses the degree of their divisors. Proof of (3.7): By hypothesis, n. = ord.(f) zero nor pole when Im(z»
> - co
,
so that f has no
Rand R is large enough. Let DR be the
intersection of the fundamental domain D with is to integrate f'lf on the boundary
~DR
Im(z)~
R. The idea
of DR. In case f has zeros
or poles on the sides, f'lf will have simple poles on the sides, and the usual modification of the contour has to be made. By hypothesis f'lf has no pole on the top horizontal part of
~DR
and if the modified
contour is as shown on the picture (we suppose there that only one pole occurs on the vertical sides, and only one pole occurs on the portion Jzl = 1, -l
(in this direction), we use the
Fourier expansion of f (certainly valid for Im(z)
~
R, because this
region contains no pole of f): fez) = a
n.,
eZ~in_z(l
+
~(e2~iz)),
so that f' (z)/f(z)
G
o
or
If' (z)/f(z)
'/2.
.
27rlnoo + (!J( e
for 41
Im(z)~ ~
· 2 7t:ln 00
27Ciz ) ,
I= 0 ( e - 21r I m( z ) )
. This shows that
- 1.42 -
j[
(f'/f)dz
M
= -2nin~
+
O(e- 2xR )
~ -2xi~
R~~
On the semi-circle (of radius t >0 sufficiently small so as to enclose no pole of f'/f other than i), we get
r
JDE
(f' /f)dz _
-} (21tiord (f))
for
i
£~O.
The arcs BC and PG can be treated similarly and give 2 + (f' I f) (z) dz --+ --627t'i· ord (f) when lBC JpG ~
(r
r )
£,.... 0.
On CD, we suppose that the semi-circle chosen to avoid the pole of f'/f is the transform of that on EF (one of them having radius !), under the mapping z
~
-liz (this mapping transforms EF in DC).
We compute the integral over EF by making the change of variable ~ = -liz, so that z ~~
(f'/f)(z)dz
applies EF --+- DC. Thus (f'/f)(-I/{)d(-I/~) =
(f'/f)(-l/l)t-2d~ = (g'/g)(')d~ with
g(~) =
f(-1/0
= f((~ -~)~) = Z;2kf(~).
because f is supposed
to be a modular form of weight k. Hence (g' I g) (() d~
= [2k/~
+
(f' I f)
(l~d~
Now we gather the two integrals over CD and EF of f'/f :
(r
JCD
1
+
EF
)Cf' /f)Cz)dz
=
(r J
CD
+
r )Cf' /f)dz JrDC 2kd;~ +
i DC
Only the integral of 2k dl over DC remains to be computed, or at 1; least its limit when £~O. For the limit, we can 'rectify" the small semi-circle (of radius £) to put CD on Izi
=
I (this does not change
the value of the integral a bit !) and integrate from i to ~ . This is only the variation of the argument between i and' hence
(r
JeD
+
r ) (f' /f)dz
JEF
-
2k(2n:i/12)
21ri'~
The residue theorem gives on the other hand, the global result ;
(f' /f)dz
=
21[i
L
ordp(f) 42
- 1.43 -
with a summation over the poles P of f'lf enclosed by the contour, i.e. all the poles of f'lf in D, distinct from i and
~.
The compari-
son between the explicit computation and the global result gives the announced formula q.e.d. In the course of the proof, we have used the following result of function theory. Let g (g = f'lf in our case) be an analytic function (in a neighborhood of the origin) having a simple pole at the origin, and let CI ' Cz be two continuously differentiable arcs starting at the origin, having respective tangents d l ' d Z with angle ~. Then the integral of g(z)dz on circles of radii t between CI ' Cz and on circles of radii t between d l ' d Z (both in the positive direction if 0( is positive) have same limit
We are mainly interested in holomorphic modular
forms, so
we define Mk to be the complex vector space of holomorphic modular forms f of weight k with ordoo(f) ":1 0 (thus all vp(f) for p~1} are ~O). We are going to show that all M are finite dimensional, and defining k dk dim[(M k ), we shall show that we have the explicit formula L. d Tk (1 - TZ) -1 (1 - T3) -1 E:. 7l[T]] k k This formal series is called the Poincare series of the graded module M =
ek Mk
·
(3.8) Consequences of (3.7). Let fc:::Mo . Since the constants are in Mo ' f - f(i) will be in Mo . By construction, this function has a zero in i, so that the formula given by the proposition cannot hold. This proves f - f(i)
=
0, hence Mo
= ~
is of dimension 1.
Now if f, fE:M k have all same orders ordp(f) = ordp(r) (PE))), their quotient flf will be in Mo ' hence constant. This shows that a function of M is determined up to a multiplicative constant by its k ~
orders at the P6i}. Obviously Mk = W!when the integer k is negative. 43
- 1.44 -
For k
= 1, there is no possibility to satisfy (3.7) with integral
orders, so that M = {Ol, d = l l lity in (3.7):
ord~(f)
O.
For k = 2, there is only one possibi-
= 1, all other orders being o. This proves
= 1, and all functions in M2 vanish at 4. We know that g2 or G4 is Z a holomorphic modular form of weight two, so that M2 = [g2 = [G 4 and
d
g2 vanishes at { with order one (simple zero)! Similarly for k
1 all other
we have only one possibility to satisfy (3.7) : ord. (f) 1. orders being zero. Hence d
= 1, M3 = [g3 = [G 6 and g3 (or
3
= i.
one simple zero located at z on~
Ag~in
for k
=
3,
G ) has only 6
4, there is only
possibility to satisfy (3.7): ord{(f) = 2, all other orders being 2
so that d 4 1, M4 = [g2 = [G S and in particular, GS ' propor2 tional to G ,has a double zero at ~ (and no other zero). In full: 4 ~ero,
L.'(mz
=
+ n) - S
c(
r:: (m z I
4 + n) - ) 2
( 1m ( z)
> 0) ,
a dream of youth, realized up to a constant (which can be determined explicitely). Finally there is only one possibility to satisfy (3.7) when k
= 5: ordi(f) = ordt(f) = 1 so that d S = 1 and MS = [g2 g 3 = [G 10 .
For larger k's, there are several possibilities. Since vanishes in the upper half-plane, necessarily ord00 ~) 3
tions
~€M6
=
never
1. Other func-
2
in M are g2 and g3. We can quote £ E:M with any prescribed 6 6 simple zero P ; i, ~ in D • It is sufficient to construct Z 332 3 2 g3(P)·gZ - g2(P)·g3 ag 2 + bg 3 3
2
The space M6 is thus of dimension d 6 = 2, spanned by g2 and g3 . But more generall~ we have (3.9) Lemma. Let £:M f
k
~
1-+
[ be the linear map defined by =
~(f)
lim f(iy) y+eo
and 1:1'- : Mk ---... M + be defined by multiplication by the function 6. k 6 Then we have an exact sequence of complex vect~spaces
o
A-
£
--+- Mk
~
M +6 ~ [
and in particular d k + 6
= dk
+
k
1, for k
-. 0 , ~O
•
- 1.45 -
Proof: Because
0, it follows that multiplication by
~;
= Ker(£)CM k +6
tive map. Put M~+6 the composite f
=
£(f)
o
. Since ordoocA)
=
~
is an injec-
1, we see that
{6·) is zero. Conversely, if fE:M k +6 is such that
0, i.e. f vanishes at infinity, it can be divided by
~
(this
function will have the same orders at all P E:D and order one less at 00). Finally, GZ (k+6) does not vanish at infinity (we could also take g;g~ of weight Zoe
3[1 , with
+
C(
and ~ chosen so that ZO( + 3(3 = k+6,
possible since Z and 3 are relatively prime). The exactness gives d k - d k + 6 + 1 = O. From this lemma, and the explicit description of the M and d k
for
0
k
(k < 6, we can check that (1 -
r 2 ) (1
-
r 3)
LdkT
k
k~O
=1
We can also say the essentially equivalent result
1
[k/6 (integral part of k/6) if k51 mod 6 dk
= dima:(M k ) =
l(k/6] +
1 if k ~ 1 mod 6
Here, since these formulas are true for 0 ~ k < 6, they will be true by induction for any k because d k +6 = d k + 1 · We can state the general result. (3.10) Theorem 1. Let M
= E9 Mk be the
~raded algebra, sum of the
spaces M of holomorphic modular forms of weight k (bounded when
k
z
=
iy and y ~ ~). Let on the other hand
a:(x,y] be the algebra over
0:, of polynomials in two indeterminates X,Y with the degree d defined
by the two conditions d(X)
ep : (t[X, YJ is an isomorphism of
= Z and
dey)
= 3. Then
--+ M defined by X ~raded
modular form, bounded when z
t-+
&Z and Y ~ g3
algebras. In other words, every holomorphic
= iy is purely imaginary and
y
~~
is a polynomial in gz and g3 (or a polynomial in G4 and G6 ). Here the degree d has been defined ad hoc, but if a:
[x' ,y ~
is
a polynomiahalgebra in two indeterminates, and if we look at the 4S
- 1.46 -
polynomials subalgebra generated by X
=
X' 2 and y
=
will inheri t the degree d from the natural degree of case, d(Xnym) = 2n
+
y' 3, this subalgebra
a: [X' ,Y ,].
In any
3m by definition. Another way of saying the same
thing would be to consider the graded algebras [[X], d' being defined as the double of the usual degree, and
a: [y] ,
d" being defined as the
triple of the usual degree, and then [[X,Y] = [[X]0 [[Y]with the degree d = d'~ d". The Poincare series of ([ [X] ,d') is obviously 1 + T2 + r 4 + r 6 + = 1/(1 - r 2) because this algebra has one generator Xk in degree d' = 2k and no non-zero element of odd degree. Similarly the Poincare series of (a: [Y] ,d tt ) is (1 - T3 ) -1. I t is then well known (and easy to prove) that the Poicare series of the tensor product of two graded algebras is the product of their Poincare series, so that the Poincare series of ([ [X, Y] ,d) is peT)
= (1 -
T 2 )1(l - T 3 f1.
This proves the theorem completely, but we give the classical proof as well. Proof of Theorem 1. The elements of M for 0 ~ k < 6 have been checked k to be polynomials in gz and g3 Since ~ = g~ - 27g~ is also a polynomial in g2 ' and g3 ' induction applies by the lemma, and shows that any f €M k (any integer k) is a polynomial in g2 and g3 ' so ~ is surjective. We have to emphasize the fact that g2 and g3 satisfy no polynomial relation P(g2,g3) = 0 with 0 t: PE:[[X,Y]. Decomposing P into homogeneous components (for the degree d) shows that we may assume P d-homogeneous to start with. If we had \ ij Lc. ·g2 g 3 = 0 2i+3j=a 1J we could solve e.g. for g2 and get \ , i j L- c .. g2 g 3 j t: 0 1J
(after suitable division by g~g~)
which is impossible, because all monomials in the right hand side contain a positive power of &3 vanishing at z = i, and the left member 46
- 1.47 vanishes only at z = ~ (wi th order i o ). This concludes the proof. Now we call modular function a (meromorphic) modular form of weight
o.
Our example is the modular invariant J
ord.(A) = 1, we have
ord~(J)
3
= g2/~
. Because
= -1, but J is holomorphic in H.
(3.11) Theorem 2. The modular invariant J defines a holomorphic bijection of the fundamental domain D defined in (3.3)(and after) onto the complex line J : D ~~ [ , which is conformal except at z = i (ramification index 2) and at z This function is real on the sides of with the normalizing values
J(~)
=
~ D
(ramification index 3).
and on the imaginary axis,
0, J(i)
= 1, lim J(iy) = ~ . y+oo
Finall~
every modular function is a rational function of J. A
Proof: J has a simple pole at P = 00 E: D, so that the same will be true of any of the functions J - A
any complex value). In the formula
(~
of (3.7), only one compensation is possible, namely vp(J - A) for one (and only one) point P€D. According to
A the
1
following
possibilities occur : ordp(J
- A)
1 with p ~ i, {
A= J(i)
ordi(J -
~)
2 if
-
~)
3 if A= J (~)
ord~(J
(because &3 (i)
0)
0 (because &2({)
0)
1
This proves that J takes once and only once each complex value in the fundamental region
D
(not counting multiplicities). The second part
of the theorem follows from
- = G (z) - = Lr ' (mz- + n) - 2=kG- -(z) G2k (-z) 2k 2k hence J(-z) = J(z). In particular J(z) will be real for -z for z purely imaginary. When Re(z)
=
1 , we have
-z
= z, i.e.
= z - 1 , so that
the periodicity of J shows that it must also be real on the lines Re(z)
= tl. For w = (10 -10) and for z on. the unit circle, we have
-z = w(z) , so that J(z)
=
J(-Z)
J(w(z))
47
=
J(z) must also be real.
- 1.48 -
Since J preserves the orientations, it must apply the part Re(z)
~O
of the fundamental region D onto the closed upper half-plane. From that (and the normalizing conditions), it can be reconstituted globally by means of the symmetry principle of Schwarz. Finally, let f be any (meromorphic) modular function. The conditions on f show that there must exist a meromorphic factorization F f HV{iot) ~ a: u( 00) J ~ ,"~F a: u( 00)
with [u(~)= Wl(a:)
= S2 Riemann sphere. Since any meromorphic function
on the Riemann sphere is a rational function, it follows that F is rational and that f
=
F(J) is a rational function of J.
q.e.d.
The second part of the theorem shows that J gives a conformal representation of D/\ (Re(z) (0) deleted of i , ' still with
J(~)
=
0 , J(i)
=
onto (Im(z) ~ 0) - {O,l}
I (but no conformity at these points).
Then using the symmetry principle of Schwarz on Re(z) and then
J
-1 , +} , •••
Izi = I
0
will give the periodic function J in D + Z.
As the symmetry principle also applies along arcs of be applied to
= -} , ][),
it can
, and inductively, it will eventually lead to
a definition of J (by analytic extension) on H. Although this definition of J may be considered as more elementary than the one we have given (because it does not refer to elliptic curves in any way), it seems far from easy to derive from it the arithmetical properties of
J which we are going to consider later. Also we note that the action of SL 2 (R) on H has a natural extension to JPI (JR) = lRu(ioo) (considered as boundary of H) by fractional linear transformations. The transforms under SL ( 7l) are the rational points ~ em on the real 2 axis, so that a must vanish at all these points (tend to 0 when we
of ioo
approach these points vertically from above)
48
and
the real axis is
t
- 1.49 -
the natural boundary for the analytic extension of
~
(and of J).
(3.lZ) Corollary 1 (Little Picard Theorem). Let f : [
Proof: We may assume a = tion points of J
° and
[ be an
f ((I:)n {a, b} = ¢ , then
entire function. If a f b ~[ are avoided by f f must be constant.
~
b = 1. Let AcH be the set of ramifica-
-1
A
J{O,l}. Then J defines a topological analytical
covering J : H - A --+ [ - {O,l)
(local analytic isomorphism). By the
monodromy principle (G: is simply connected) f : [ be lifted to the covering I F : [
~
~
[ - {O,l} can
H - A. But Liouville's theorem
(applied to exp(iF)) shows that F must be constant (by continuity), and so must f be. More important for us is the following (3.13) Corollary Z. Every non-singular plane (projective) cubic curve yZt = x 3 + Axt Z + Bt 3 , is isomorphic (as analytical group) to an elliptic curve E = [/L , wi th a lattice L C[. Proof. The group law on this cubic is given by the condition that three points have sum zero if and only if they are on a line. If we make the transformation of variable y
~
y =
y/Z , we get an equation
for x,y in the Weierstrass form with gz = -4A and g3 = -4B. For this new equation, the non-singularity condition is gi - Z7g~ ~ O. By the theorem, there exists a z £H such that J(z) define a first lattice L
= gi/~i - z7g~)and we
Lz in ~ . Because J is homogeneous of degree 0 in the lattice, any homothetic lattice L would give the =
same value for J. We choose thus a complex constant ~~Osuch that moreover gZ(AL) = A-4 gZ(L) = gz = -4A. This is possible if A 1 0
A4 = -gZ(L)/(4A)
, and determines ~ up to a power of i
=1=1.
In this
case (A 1 0 implies J ; 0) 2
g3 ().L) = so that
g3(~L)
3
g2(~L) (J(~L)
-
1)/(27JC~L))
=
2
g3 '
= ±g3 · By homogeneity of g3 of degree six in the 49
- 1.50 -
lattice, changing A to iA if necessary, will give the equality. For this lattice L', (1.11) gives an isomorphism E
=
plane cubic curve X : y2 t
=
[/L'
~
X with the
4x 3 - g2xt2 - g3t3. In the case A
= g2
0,
we note that the non-singularity condition implies g3 1 0, so that one (at least) of the lattices
AL~
~E[~)
will do. It is indeed
sufficient to take for A any sixth root of g3(L~)/g3 . (3.14) Corollary 3. Two elliptic curves E
=
[/L and E' = [/L' are
isomorphic (as analytic spaces, or equivalently as analytic groups), if and only if J(E) • J(L) = J(L')
= J(E').
Proof: Indeed, two elliptic curves are isomorphic if and only if their lattices are homothetic. Now there are unique region D with L'VL't and
L'~
L"
~,~'
in the fundamental
. The result follows from the injec-
tivity of J on D. The corollary 2 is the basis of the algebraic study of elliptic curves. For example (3.15) Application. Let X be the plane cubic of equation (g~-27g~rO) 2
Y t = 4x
3
- g2xt
2
g3t
continuous). We look at
3
and take any frE:Aut([) (not necessarily (f
as mapping [2 ----. [2 defined by
(x,y) ~ (x~,yr). Then the image X~ of X X fl [2 is the affine o 0 r part of a plane cubic X , and the two curves X and X~ are isomorphic 3 3 exactly when G" leaves J = g2/(g2 - 27g~) fixed. tr
It is obvious that X
has the equation y2 t
4x 3 - gzxt tr
2
3 - g3CS"'t (non-
singular because X is assumed non-singular). Hence the invariant r of X is Jr where J is the invariant of X. This proves the assertion. r Note that if X is isomorphic to [/L and X to [/L r ' we have in general no simple way of determining Lr in function of L (we mean, no algebraic way of determining Lr in function of L: the connection is given by the transcendental function J).
50
- 1.51 -
This application suggests that the field be chosen to be equal to of E (because
~(J)
~(g2,g3)=>~(J)
might
for a special "model" of the equation
~(J)
is the fixed field of the group of automorphisms
~cAut(~) satisfying Jr = J). It is easy to show that this is true.
g'2 = g'3 = 27J/(J - 1) 323 4x - gxt - &t
E:~(J).
The curve
is isomorphic to E (it has the invariant J = J(E) ). If &3 = 0, any non-zero &2 will do, for instance g2 1, hence an equation y2 t = 4x 3 - xt 2 = 4x(x - }t)(x + }t) for E with coefficients in
=
~
(J = 1 in the case &3 = 0).
~(J)
(3.16) Remark 1. It is easy to check elementarily that G6 vanishes at i and that G vanishes at , . By definition 4 G (i) = (mi + n)-6 = l:'i 6 (-m + ni)-6 6
L.
(m~O
= (-1)3 G6 (i). Hence 2G (i) = O. Similarly since ~3 6 polynomial for 4), we see G (0 = 4
L.
(m/n)'0
(m~
n) -4
L' (2(m I:
Hence (1 - ,2)G4(~)
+
J
+
= 1 and ~2
o
+
~
+
L'z:8 (m{3
+
n(2)-4
n4 2)-4 = L'(2(m
1
+
(minimal
n(-1-4))-4
,2(m-n - n,)-4 = (2G4(~)'
= 0 and G4(~) = O. This method does not give
the orders of these zeros however, nor does it show that these "trivial" zeros are the only ones. (3.17) Remark 2. Let q
=
e 7tit: be Jacobi's variable (cf. proof of (2.10]
and t = q2 = ~('t'). Then ~ ~ t = ~(-r) maps the region -l~Re(~)0 (one - one) onto the punctured unit circle O
~ ~
t.
The condition defining modular forms (3.6.b) means that the corres51
- 1.52 -
ponding function on the punctured circle has at most a pole at the origin. The Laurent expansion of this function will thus be valid in a small punctured circle O
F(t)
=
=
L
a tn
L a e2xinz "~ ... n
=
"~'\>__n
A priori, the Fourier expansions of f are valid in regions r l
t
=
0, as soon as we suppose that tF(t)I
tends to a limit, finite or infinite, for t~O). The variable t
= q2
is better that
~.e:H
for
some
then the lattices
if
t
~,~'£H,
= t
in the sense that if L~
and
t L~,
= ~(1:)
and t'
= ~(--r')
are the same if and only
t •
Automorphic forms have another interpretation in group theory. Let G be the Lie group SL 2 ( m). Then the group of analytic automorphisms of the upper half-plane H is naturally identified with G/(!l), acting by fractional linear transformations. The stabilizer of i is the subgroup of matrices g i
ai ci
+ b + d
d
= (: :) such that
(ai + bled - cil , c i + dl
whence the conditions 2 2 c + = +
lei
'
dl 2 =
1
E:
and
=
(i
ac
+
+
ac
bdJlci
+
+
dl- 2
bd - O.
In G, they characterize the special orthogonal group M = SO(2JR): M ={(
c~s' cos4 sin.-). '
-sln-l
°'''<21t'' J (stabilizer of i)
•
We introduce also the subgroups of G: A
= {(~ ~-1)
: a€ R: },
52
N
= {(~
~):
u€R
J.
H
- 1.53 -
We have -then A.N
b) G
={(ca
dE:: c =
o}
and also obviously G = M·A·N (= N·A·M = G- 1 ). This shows that the action of G on H is transitive : G(i) = NAM(i)
= NA(i)
N(iR»C)
= lR
gM
gM(i)
+
+ iR»C
H.
+
Thus we have an identification (3.18)
G/M ~ H
defined by
t-+
r=
In consequence we can also identify (with r\G/M ~~
r\H by
rgM
= g(i)
SL (1l) )
2
r(z) where z
~
z •
= g(i).
r\G/M appears as set of the lattices of the form Lz Although it is true that through the first identification every In this way,
function f on H can be considered as function over G invariant under M, this is not useful for modular forms. We proceed differently. If £ is a modular form of weight k (holomorphic on H, but no condition
at i 00) , we define the corresponding function F (3.19) and J(g,i)
F(g) = J{g,i)kf(z) where z (ci + d) -2 for g =
(~
for m =( c~s~ Sin'-) .., -sIn'" cos~
(3.20) F(gm~)
:) E: G
,
(cos~-isin~)-2
and then that
= J(gm~,i)kf(gm,(i)) = J(g,i)kJ(m"i)kf(g(l)) =
because
E: M
Ff on the group:
= g(i)E,H ,
With this definition, we see first that J(m"i)
= e2i~
=
gm~(i)
F(g)e
= g(i)
2ik~ =
= F(g)X2k(m~)
,
z, with the even character X2k of M (this
character is the 2k-th power of the basic odd character X of M defined by x(m",J = e i 8-). For a matrix (E: r we have k k-k (3.21) F(¥g) = J(~g,i) · f(t(z)) = J{~g,i) J(r,z) fez) J(g,i)kf{z)
= F(g)
•
This shows that all the lifted functions Ff are functions over
r\G (irrespective of the weight of f), the weight k appearing now S3
- 1.54 -
in the character according to which F transforms along M. If we suppose that f
o
k ' i. e. f is a holomorphic modular form of weight k vanishing at ica ,then the Fourier expansion of f will start E:M
with a term eZ~iz , hence will have the order of magnitude of e
-27Cy
(y = Im(z)) at most If(z))
= O(e- Z1CY )
for y
= Im(z)-"06 ,
and this will in turn give strong decreasing properties of F
=
Ff
along A, which ensure that F will be for example square summable on
r\G (for the invariant measure coming from a Haar measure on G : because G is semi-simple, every Haar measure on G is bi-invariant). 2 Thus the spaces M~ can be embedded in L (r\G), each of them being embedded in an isotypical component for the restriction to M of the right regular representation of G in LZ(r\G). The holomorphy condition on f gives a condition on F which can also be interpreted group theoretically, but this would lead us too far here. Let us treat an example. We take f = G (holomorphic of weight k), and we construct Zk its corresponding function F on the group, which we still denote GZk abusively · First, we note that L'{mz + n) -2k = L.
L
!~l
(clz
+ dt)
L
= .L.!-Zk. = '(2k) Thus, the problem is to lift (cz
(cz
L..
(c,d)=l
(cz
(3.22)
Ek(z)
=
=!
+
d) -Zk
d) -2k .
+
d)-Zk to the group. To simplify,
+
we work with the normalized function Ek(z) = y+~
=
( c , d) = 1
l,~l
satisfies lim Ek(iy)
-2k
(c ,d) =1
GZk(z)/(Z~(Zk))
(which
1). ~
L-
(c,d)=l
(cz
Then we observe that all matrices
+
d)-2
k
1 ~ (~ ~) E
54
r
SL ( 7l) have Z
- 1.55 -
coefficients c,d prime to each other (the determinant condition gives Bezout's condition), and conversely, for each couple (c,d) of integers, prime to each other, Bezout's condition shows the existence of a couple
of integers, say (a,b) such that the matrix (~ Now, it is easy to check that two matrices of
lines if and only if "they are congruent mod if
J and
T'
r
~) has determinant one. have same second
C. = "t (~ J'
have same second line c, d, then
=
~). More precisely
y.. 'r
with a matrix
~ of the form (~ ~) (n a rational integer) or of the formt~ _~) (n as before). This gives in fact a bijection between the couples (c,d)
roO Y € roo\r
of relati vely prime integers, and the classes
.
Using this,
we can write (3.23)
Ek(z)
1 }
L.
(c,d)=l
(cz
+
d) -2k
1
2
L. J(y,Z)k J;,\r
On the group G, we have by definition Ek(g)
J(g,i)kEk(g(i)) =
! If we put
L:
r;.\r
!
J(g,i)kJ(r,g(i))k
L
,:\r
J(l g ,i)k
L(g) = J(g,i) we find the very simple and suggestive formula
Ek(g) = !
(3.24)
L
':o\r
L(1 g )k
showing that on the group, E is obtained in the cheapest way from L k (except for the factor Because L(ng)
!)
so as to be invariant on left under
r.
L(g) for any gE:G and n E:(!I)N, it is indeed sufficient
to take a sum of L (yg) over
6
mod (left)
r fl (±I)N
=
r:
to obtain
a (left-)f -invariant expression. The fact that the hexagonal and square lattices are the only ones admitting more than
two ("canonical") bases as constructed
after (3.3) is equivalent to the fact that
r/(~l)
has only two finite
subgroups of order 3 and 2 respectively, generated by 55
(:1
~) and
- 1.56 -
(~ -~) fixing ,
and i resp. From this and a careful study of the
tesselation of H given by the transforms of the fundamental region D under
r/(tl) one can conclude that this group is the free product
of these two cyclic subgroups SL ( 2
(3.23)
and also that
r
~)/(±l)
= C
2
* C 3
(C
n
= ~/~),
itself is amalgamated product of the subgroups of
order 6 (generated by
e ~))
and of order 4 (generated by l over their intersection of order two (generated by(~l_~)) :
(~ ~l})
(3.23)' (3.24) Proposition. For y
Im(z)
are valid (uniformly in x
Re(z)):
~(az ~(b
+
b : Lz ) ~ -w 2/3 1
2
: L ) .-,.
TC (
z
for a not integer
, (a,b real ),
1 - -) for b not integer
2
sin Kb
( 2 7l") -12A{ z ) ~ ( - z) --+
the following limit formulas
~~
3
,(b real),
1
J(z)~(z) ~ 1/1728 = (12)-3
2
-6
·3
-3
Proof. By definition, we have
pCaz
+
b : L z ) = (az
+
b)-2
+
l::'«az m,n
+
b
+
mz
+
~-2 -(mz
+
n)-2).
The convergence of this series is uniform with respect to Im(z);, t:"7 0, so we compute the limit by taking the limit of each term. When a is not an integer, the terms ((a m. Only remain the terms (mz
+ +
m)z
+
(b
+
n))-2 tend to 0 for every
n)-2 with m = 0 (and n 1 0). The
formula ~ n- 2 = ~2/6 easily gives the first formula. When a = 0, n>O but b is not an integer, only the terms m = 0 give a contribution,
feb : L z ) _b- 2
L
+
L'(b
+
n)-2 - "E.'n- 2
=
(b - n) -2 - 2{(2) = x2 sin -2'J(b - 7(2/ 3
56
- 1.57 -
With the values
~(4)
= 7r4/90 and
'(6) =
6
-n:
/945, we find
= ( 21f) 6/216 •E3
&Z
(2lt) 4/12· E2
!:l
Z (21f) 12/1728 • (E~ _ E ) 3
&3
J
= E3Z/(E 3Z _ E3Z)
But the first coefficients of the t-expansion (Fourier expansion) of EZ and E3 may be computed easily by starting with the equalities ltcot<Z
=
= z-l
ni(t + l)/(t - 1)
+
r='~z - n)-l - n- 1].
and differentiating with respect to z three (resp. five) times both sides. We see then that E2 ( z)
=1
+ 24 0t +
C1 ( t 2 )
• E3 (z)
=
1 - 504 t +
O( t 2 )
(a much stronger result, giving all Fourier coefficients explicitely will be derived in detail later, in (4.1) ). This shows that E3Z - E3Z (3·Z40 + Z·504) t + O(t Z)
This finishes the proof of the proposition. We turn to a closer sudy of the function A of Legendre, and its dependence on the elliptic curve. By definition, if E is an elliptic curve, it has an equation of the form E : y2
and
~
= 4x
3
- &2 X
-
&3
3
4
IT (x i.1
- h( 1w.
a-
1.
))
= (e 3 - eZ)/(e l - e Z) with e i =
possible equation for E of the form : yZ
x(x - l)(x - A)
Ccf. Cl.13) ) with
A'
0 ,1 .
To bring a dependence in z eH, we define explicitely el(z)
P(1
ezCz)
PC1z: Lz )
e 3 (z)
P(lCz+l): Lz )
:
Lz )
and "C z)
Ce 3 Cz) - eZCz))/CelCz) - eZCz)) 57
- 1.58 -
If we take a direct basis all
tc)z
az + b
=
we shall obviously have
=0
, b :: c
a :: d :: 1
~
, tUz of Lz of the form , tell
cz + d
mod Lz , as soon as We define the subgroup f(Z) c r = 5L (1l) Z
4l =
mod Z
=
and ~WZ =
~z
0) mod Z. In other words, (1 (a c ~) - 0 1 kernel of the natural homomorphism of reduction mod Z
by the condition
5 L Z(Z)
---+- SL Z(II ZZ) ,
hence in particular is a normal subgroup of
r
r(Z). is the
with quotient isomorphic
to the non-abelian group of order 6 (necessarily isomorphic to When
~
E'3).
is in f(Z), we can thus write
Hence eZ(z) = J(o< ,z)eZ(O«z)) , and similarly for e l and e • Coming back to 3 property
=
A(O«Z))
We say that
A
for any
A(Z)
0(
A ,
this gives the
E:r(Z) .
is a modular function for the group
f(Z). To have a
full description of the behavior of this function under the fractional linear transformations coming from the full look at a system of representatives of observed that w
o
(1
=
-1 1 0) and n = (0
Z and nw of order 3 in
r
r,
it is sufficient to
mod f(Z). We have already
1 1) are such that w is of order
f/(±l); they will generate such a system of
representatives. Consequently, it is sufficient to connect A(-l/z) and ~
A(z+l)
under
r.
with
A(Z) to have the description of the behavior of
The first matrix corresponds to the change of basis Wz
= -1 ,
or 1~ = z I Z , 1lUZ This gives ~
-1
=z ,
~l
~
~
, so that e 1 and e Z are interchanged. 1 - A(Z)
(-liz) 58
•
- 1.59 -
Under the second transformation, "2 = z
+
1
~
hence e 2 and e 3 are interchanged ~(z + 1) = (e Z - e 3 )/(e l - e 3 )(z) To sum up
(3.25) Proposition. The function
~(Z)/(A(Z)
- 1)
,
•
of Legendre is a modular function
~
r
f(Z) (normal subgroup of
for the group
(z + 1)
of index 6 formed of all
matrices congruent elementwise to the unit matrix mod 2). Under a fractional linear transformation ~
the s ix fun c t ions
t - " , 1/(1
1/'). ,
t
of
r,
A is transformed in one of
- A), 1 - 1/').. = (~- 1)/ A, A/ (A - 1) ,
;\(-l/z) = 1 - A(Z) , A{Z + 1) =
and explicitely
A/(~ -1)
(z) .
To give an explicit relation with J, it is necessary to study the limit of
A
the matrix (~
when Im(z) -+
i)
in some detail. Since
00
contains
acting by a translation of 2 in H, the function
is periodic of period 2. As before we put q = e~iz expansion of"
r(2)
around q
sarily valid for Im(z)
"7
= 0
and the Laurent
gives the Fourier expansion of
0 because
~
(neces-
~
A has no pole in the upper half-
plane H). Using (3.24), we see that ~ 2 1t£ ( 1 - 1/ 3 ) + 'It' / 3
( e 1 - e 2) (z) ~
7(2
for 1m (z)
--+-
00
but we need more. (e 3 - e 2 ) (z) --+ 0 for Im(z) -+- 00 Coming back to the defini tion of e 2 and e by means of the ~ function,
Also
3
and grouping together the corresponding terms in the expansion of
p,
(this cancels the convergence factors and gives an absolutely convergent series), we find
~ (e 3 - eZ)(z) = L-m,n
(~+ ~z
[
~ ~
((m+~)z
m,n
'7(2
[
L
m
+ mz - n)
+ a - n)
-2
-2 -
-
(az + mz - n)
((m+a)z - n)
[cos -2 (m+ D'7(z - sin-2 (m+ D1CZ ]
59
-2]
-2] =
- 1.60 -
where we have used the classical expansion formula
L
1(Z /sinZ'lI'z ..
(z - n)-Z
"€Z Using the duplication formulas for the trigonometric functions, we
l/cosZ~ - 1/sin2~ = -4cosZ~ /sinZZ«
find that the difference
•
We have thus obtained -47tZ
L. cos (Zm+1)7tz/sin Z(2m+1)1t"z m
-81tZL. cos(2m+l)1rz/sinZ(2m+l)"Jrz m~O
Furthermore
I (q +
cosnz
161(Z
q
-1
1= fi
) and sinltz
r. (qZm+1
+ q-2m-1)/(q2m+1 _ q-Zm-l)2
m~O
r. ...
~ qZrn+1(1 + q4rn+2)/(1 _ q4m+2)2
m~O
m~O
q(l + qZ)/(l - q2)Z +
L..
q + O(qZ)
m>l
This gives the desired formula (e
3
- e ) (z) .. 16xZq + O(qZ) , Z
and so
A(z) q-1
(3.26)
=
A{z) e -xiz -.. 16
From this and the relation A(-l/z) )(iy)
~
1 when
gives A(l + iy)
y~O ~ ~
=1
(y real), and when
y~O
~(z
=
24
for Im(z)
-+-
00 •
- A(Z), we infer that + 1)
=
~(Z)/(A(Z)
- 1)
(ibid.).
We come to the explicit relation between
A and
J. Any symmetric
combination of the six functions appearing in (3.25) will be a modular function for
r
(the growth condition for Im{z) -+00 is satisfied by
what we have just seen). We construct the symmetric product of the translates by 1 of these functions : f = (1 + ")(1 + l/~)(l + (l-A))(l + 1/(1-~))(2 - 1/~)(2A-l)/(~-l).
By (3.11), f is a rational function of J. Since f has no pole in H, it must be a polynomial in J, and more precisely since f has a simple 60
- 1.61 -
pole at i
is a linear function in J : f = A
+
B·J • We determine these constants.
The limits q-2 f
e- 2niz f(z) ~ -4.16
q-2 J
e-2~izJ(z) ~ 1/1728
show that B
=
g3(i)
=
=-
_33
=
=
0, e 2 (i)
= 0, and quite generally e 1
-e 3 (i) gives
-1, and finally if e (i) 3
A(i) 0, A(i)
and since J(i) = 1 this proves
= 27 - 27J or
f
= 2- 6 3- 3 ,
27. To determine A, we look at the special z
0 implies e l e 2e 3 (i)
If el(i) A(i)
=
= _2 6
=
+
e2
+
=
e 3 = O.
2. If e 2 (i) = 0, similarly
= 1.
In all cases f(A(i)
A
27. We have obtained
J
1 - f/27.
=
A small computation gives now
=.!. (1
J
(3.27)
-
A2 (1
27
A
e
l
+
e
2
+
e
3
=
0
,
+ ),2)3
_ ~)2.
The relation between J and
th~
e.' s is easily found now. From 1
J
e.2 + e 2 + 2e e {1,2,3j, j k for li J j , k = k J e2 + e2 -2(e 1 e + e 2e + e 3 e l ). Then (3.27) 3 3 2 2
we get e~ 1
hence by summation e 12
+
gives immediately (3.28)
Since g2
-4(e l e 2
+
e e3 2
+
e 3 e 1 ), this expression shows that
A u. = l6(e -e ) 2 (e -e ) 2 (e -e ) 2 l 2 2 3 3 l
IT (e i -e j )
= -16.
J,..
lrJ
(it could,of course, have been derived more simply from it).
61
i.
•
0
- 1.62 -
We draw two possible fundamental regions for f(2) in H.
D(2)
D(2)'
(3.29)
" / /
,,---- " .....
'~.1
\ \1
o
-1
\
The behavior of a near
i~
r(2)-invariant (say meromorphic) function in H, is made by introducing as usual the variable q = e ?tiz . In
the neighbourhood of 0 (in D(2) or D(2)'), we introduce the variable 7Ci z q' e = e -Jrij z' (z' = -l/z near 0 means z near iot ). When z" is in a neighbourhood of 1 (in D(2) or D(2)'), z' = z"-l is near 0, so we take the variable q" = e 1tiz = e- 1Ciz ' = e- 1ti /(z"-l). (3.30) Definition. A modular function for
f(2) is a meromorphic
function f in H satisfying a)
f(~(z))
= fez) for all
~ ~
f(2) (when one member is defined),
b) f has poles at most at the points ioO,O,l with reference to the Laurent expansion in the variables The function
A has
these properties. Indeed,
(uniformly in Re(z)),
'A(z)
~
q,q~q"
A(Z)
~
respectively. 0 when
Im(z).~
1 when z + 0 in D(2), so only the
point 1 deserves verification. But there, we can use ~(-l/(z-l))
= 1/(1 -
A(Z))
We could also look at (3.27) because J has simple poles (in the 62
- 1.63 -
mentioned points) with respect to the variables t = q2 , t' = q,2 and ttl = qtl 2 The orders of a f(2)-modular function are thus · d at a 1 1 · we 11 d e f lne pOlnts
0
f ~ D(2)
D(2)U{ioc),0,1} (if the function
does not vanish identically) , by means of the corresponding Laurent expansions. As in (3.7), integrating f'lf on the boundary
~D(2),
making small semi-circles to avoid the zeros and poles of f and the points
° and 1,
we find
p~
(3.31)
ordp(f)
=
0 •
(This formula is simpler to establish than the corresponding one in
~.7)
because we have no ramification points like we had there in
7;, i , and we take here only the case k=O.) Because
and nowhere that
° in H,
~
is holomorphic
and because A has a simple zero at iaO, we see
A and all the ,,- c (c £
have a simple pole at 1. These ............... functions must have one and only one zero in D(2) to compensate in (3.31). This proves that "
~
gives a bijection
: D(2) ---+-
'6(2) ---.
I IP (
a:
U
{oo}.
Consequently (as in (3.11)), every modular function for rational function in
A.
f(2) is a
We formulate an analogue of (3.11).
(3.32) Proposition. The function
A of
Legendre defines a conformal
bijection of the fundamental region D(2) defined in (3.29) onto
{a, I} . This function is real on the imaginary axis and on the sides
of D( 2) '. The rig ht hal f D( 2)' () (Re ( z)
"7
0)
ismappedon tot he uppe r
half-plane with a continuous-extension to the boundaries. Every r(2)modular function is a rational function in ,.. A couple of diagrams may help to clarify the situation.
63
- 1.64 -
H
1
(3.33)
topological analytical covering ~
f(2)\H
1
ramified covering at t, i
r\H
The fibers of -1 ~ (0)
~
=
!pI (([)
~
xl •
J
!pI (([)
- {O, 1 ,co} 6-sheeted ramified covering
- too}
have 6 elements except
{A(~),
A(W({»
= ~({+l~ has only two elements, each
corresponding to a ramification index 3,
1r:l
(1)
l~(i), ~((i-l)/i), ~(-l/(i-l)~ has only three elements,
each corresponding to a ramification index 2. In fact,
~
is the universal (maximal) covering of pl(([) with
respect to the three ramification conditions index 3 above 0 index 2 above 1 index 2 above
00
•
On the function fields of modular functions for rand f(2) (both have been proved to be purely transcendental fields of degree one), gives the injection
G:(J)
--+ ([(~). The group
r/r(2) ~
~
S3 acts
naturally in ([(A) by field automorphisms over ([ (the group ~3 acts in the purely transcendental extension k(X) by k-automorphisms for any field k). The fixed field is precisely ([(J) (by Luroth's theorem the fixed field in k(X) is purely transcendental over a generator which can be taken X- 2 (1-X)-Z(I-X+X 2)3 of degree 6: compare with (3.27) ). These facts show that
([(~)/([(J)
of group isomorphic to ~3 and we can say that f(2)\H ~
J\H
is a Galois covering of group
r/r(2) ~ ~3
64
is a Galois extension
- 1.65 -
Let us conclude this section by a few formulas, given without proof. (3.34)
A(z)
16 llr(1 q
1
f=
n~l
(3.35)
~(z)
+ q +
2
n
)'
q2n-1
,Cft+tl 1+ 2.L. 1'"
16 n:,O
=
(L.
"EZ
L
'1
(II+-jl' "I-
,,~Z 1
"~1
Also the three singularities {o,l,oo}
of
~
make it possible to
derive an inversion formula z = iF(!,l:l:l-l) F(l,!:l:" ) where F
2Fl is the usual (Gauss') hypergeometric function.
65
- 1.66 -
4. Arithmetical properties of some (elliptic) modular forms
We are going to give explicitely the Fourier coefficients of the modular forms G of weight k, or more precisely of the normaZk lized Ek = GZk/(Z~(Zk)). (4.1) Proposition. The modular form E
k
of weight k (k>l) has
th~
following Fourier expansion Ek(z) where the constant
tk
1 +
L. N
~(z)
)
~l
=
C-I)k C21r )2k/ CZ';C2k)C2k-I)!)
(B k the Bernouilli number of index k), and
br
e Z7tiz
(t
'fk is given by
'fk = C-I)k 4k / Bk
function defined
N
=
GiCN)
~(N)
is the arithmetical
sum of the t-th Eowers d
t of all the
positive divisors d'7 0 of N. Proof ., We start with the classical formula of function theory -1
L
-II
-1
+ (z + n) {(z - n) n )1 (note that the terms nand -n have to be grouped together to secure
1t'cotg7t"Z
z
+
absolute and normal convergence on bounded sets of [ -
~).
By term-
wise derivation, we get C7Ccotgn:z)
I
=
_z-2 -
L
L
{CZ - nf2 + Cz + n f2 )
n~l
-(z + n)-Z
n£71
and by induction (TCcotg-n:Z) (k)
(-l)k.k!
L:
k l (z + nf -
(k
nE:71
•
~l)
But the Fourier expansion of these functions is easily determined " h q W1t
=
e xiz ,an d t = q Z
7C cotg1l"Z
~cos~z
/
"
S1n~z
we get
~(z),
= ~1"( q
+
q-l)/(q _ q-l)
'Jt'i ( t
+
1) / ( t
whence 7t'cotgll'Z
'7l'i - Z1C i / (1 - t )
~i - 2~i(1 66
+ t
+ t
2
+
••• )
•
- 1 ),
- 1.67 -
For the derivation, we note that dt/dz Hence (1t co tg'Jrz) (k)
= - ( Z1tl.) k+ 1 L'\ n ktn n;Jl
, for
k~l
Comparison gives with mz instead of z (t m instead of t)
L.
(mz + n) -k-l
=
(-1) k+l (Zon-i) k+l /k!
n" Zl
nkt mn
(m
f 0) •
n~l
Also, replacing k by Zk - 1
l::
L.
(mz + n)-Zk = (Z~i)Zk/(Zk-l)!
n£Zl
r= n)l
nZk-ltmn
Summing now over m (and treating the case m = 0 separately), we find
Z~(Zk) + (Z7Ci)Zk Z/(Zk-l)!
L L. ... m)l
n~l
because the sums with m and -m give the same result (cf. left handside). The double sum is
L..
nZk-ltmn and its terms may be rearran-
m,n~l
ged at will. We group together the terms with mn = N, a fixed integer N ~ Zk-l N ~ 1, so that the coefficient of t is the sum L- n = o-Zk-l (N) niN with the notations of the proposition. Summing up, we have obtained
with a coefficient
Now we recall that the Bernouilli numbers may be defined by the expansion cotgz = z
-1
1:
Bk22kz2k-1j(2k)!
k~l
The classical Weierstrass product formula sinz = z
IT (1 - L) 2
n~l
n 7[2
= z
IT (n 2 n2
n~l
gives by logarithmic derivation "' cotgz = z -1 + Z L-
z/(z Z-n Z1t Z)
n~l
67
-
z2)j(n
2
i)
- 1.68 -
Comparison gives then
2~(2k)/~2k = Bk 2 2k /(2k)l and hence
4k(-1)k/ Bk as claimed.
'(k
We give the first few values of these numbers B1
'(2) =
r1
B2 = 1/30
1/6 2/6
,
'(4) = 7[4/ 90
7t
= -24 ,
t2
= 240 ,
= 1/42 ,
((6) = ,f /945
B4 = 1/30 x6/3 3 ·S·7
r3
14 = 480
B3
= -504 ,
, ...
, ... , ...
2 (4.2) Application. The relation E2 = E4 (both functions are normalized at iao , hence the proportionality constant is 1) gives the arithmetical relation
of 0"3 and
· Indeed 1 + 240L G"3(n)t n n)l
EZ E
1 - 504
3
E4
r7
=
L
n)l
G"'s (n) t
n
1 + 480 L 0"'7 Cn) t n n~l
hence the explicit relation n-l
~7(n) = f (n) + l20L6"3(m)~3(n-m)
3
m=l
l:
-2k
~
Using (3.23) we may write Ek = (mz + n) with a summation extended over the couple of integers m,n satisfying (m ,n) = 1 , m> 0 , n
>0
if m = 0 ,
hence the relation
(~ valid for Im(z»
(mz + n)
-4) 2 = 1: (mz
+ n)-8
0 (this relation was alluded to in (3.8) ).
Similarly E3E2 = ES gives a relation between r3 ' ~5 ' ~9 · By Theorem (3.10) every Ek is a polynomial in E2 and E3 hence can be computed by means of 0-3 and a"S. 68
r2k - 1
- 1.69 -
(4.3) Formulas.
For reference we give explicite1y some proportionality
constants. With 1
240 L. 0""3 (n) t
+
n
(t
~(z))J
n~l
1 - S04
r:. ()S (n) t n
n~l
we have
60 G = (2~)4·2-2·3-1.E
&2
4
2
g 140 G = (2~)6.2-3.3-3.E 3 6 3
A = (2n)1202-6o3-3(E~
- E;)
J = E3/(E 3 - E2 ) = 22.3-3A-2(1_A)-2(1_~+~a)3
223
and we introduce j
(2 6 .3 3
26 ·3 3 .J
=
(12)3
= 1728)
28A-2(1_~)-2(1_~+Al)3 ( j (t)
=
0
J
j (i)
= 1728 ,
j (ioo)
=
00 .)
(4.4) Theorem 1. The Fourier expansions of (2x)-12~ j
and of
= l728·J have rational integral coefficients (21r) -12
6 (z )
1728J(z)
=
=
L.
n)l
1ft
1:( n) t n
L
+
c(n)t
1:(n)E:ll,
n
(t =
~(z)),
~(l)
= 1,
c(n)E:ll,
n~O
and in particular j is normalized by the condition of having residue +1 at ioo
•
Proof. Let us put E
2
=1
+
240U
=1
+
2 40 3 oS oU and E
= 1 - 23 ·3 2 ·7·Y , with two power series
3
=
1 - S04V
U, Y in t and integral
coefficients. By the above formulas, it is sufficient to show that 4 3 (1 + 2 • 3 • S • U)
= (1
322 6 3 - 2 • 3 • 7 •V) mod 2 •3
in the sense that all coefficients of these series in t satisfy that congruence condition. Hence we have to check that
69
- 1.70 -
or 24 3 2 s.U + 24 3 2 7.V
mod 2 6 3 3
0
=
0 mod 12 (= 22 3). But we observe that for any integer n~l, the number (n-l)n 2 (n+l) = n 4 - n 2 is divisible by 3 and 4 hence by 12, hence the congruence n 4 = n 2 (mod 12) and a
or still
fortiori n
SU + 7V
=
S
=n3
(mod 12). This gives
proves the desired congruence sU + 7V
~3(n)
=0
= ~S(n)
(mod 12) and
(mod 12). Since
~(l)
=
1,
the integrality property of j follows at once. (4.5) Theorem 2. If we introduce the function of Dedekind
",(z)
=
e wiz/12
IT (1
ql/12
_ e21rinz)
n~l
IT (1
_ q2n)
n~l
then we have the following formulas a) (Euler)
,(z) = e7tiz/12
L.
(_1)n q n(3n+l)
nE. 7l
b) (Jacobi) (27() -12A(z)
= ,(z) 24 = t
TT (1
_ tn) 24 ,
n~l
. h th · W1t e usuaI notat1ons q
=
e wi z an d t
=
q2
=
~
() z .
Proof. The basis for this kind of theorem is (3.8) which shows that the dimension of the vector space Mo of modular forms of weight 6 6 vanishing at ioo is dim(M~) = 1, hence M~ is spanned by ~: M~ = ([.~ • We put g ( z)
e lCi z/ 12
L
(-1) nen ( 3n+1) 7l: i z
n€71
= e7tiz/12
L
n£71
exp(lfi3zn 2+ 2nin(!z + 1))
and moreover 8 (z : T) = Hence g{z)
=
L..
nE:ll
exp (7ri zn 2 + 2n'in-c)
(Jacobi's
8 3 ).
e~iz/12 8(3z:1 + lz). We use now the classical summation
formula of Poisson for which we recall the hypothesis. If f is a continuous function on R ,
r=
n E.71 70
f(x
+
n) converges normally
- 1.71 -
on every compact subset of
L
Rand
'fen)
n£ll
Fourier transform of f), then we have For f(x) --
e~izxZ
~
(f denotes the
00
~f(x + n) = L:t(n)eZ~inx .
O ° (Oth ° WI Z = ly, Y ~ 0 a rea 1 parameter ) , t h IS gIves
L. exp (7tiz (x+n) Z) = By analytic continuation for z
L
I <
L. exp (- (xi/ z) n Z
Vi/ z E:
and x = T
H
E
H
+ Z7rinx) .
this gives
exp(lriz(l." + n)Z) = Vi/z 9(-1/z:T) .
Coming back to g : g(-l/z) = e-~i/1Zz 9(-3/z : I - lz) e-7Ci/1Zzr;:;"3i
(Z731 IZ/3i
L. exp('Jt'iz/3(n
+ I -lz)z)
L:: exq(~iz/12Km2 - 2m/z~ = L exp(7rim Zz/1Z + 1t"im/6) m odd m=Zn+l
= fZ/3i y--
m'oad
exp(rrimZz/1Z).cos(rrm/6).
When m is of the form 3m' (m' odd), cos (lfm/6) = cos (mnYZ) = only the terms m m
~
~
o.
Thus
±l (mod 6) contribute in the summation, and since
-m interchanges these two classes, it is sufficient to take
twice the sum over the class m g(-l/z) But cos (7t"k
+
i)
g(-I/z)
YZ/3i·Z =
(-rf €
(z/i
=1
(mod 6)
L. exp(xi(6k+l)Zz/lZ) ·cos(1tk
k€ll
+
L
exp(Jri(36k Z + lZk + l)z/lZ)
r= exp(~i3znZ
+ ~izn) = YZ/i g(z)
We prove now that a similar functional equation holds for Because H is simply connected and choose a branch of 10g'rJ
=
6
/2 • so that
{iii e~iz/1Z
(log,) (z)
!)
7fiz/12 +
~iz/1Z +
~
never vanishes in H, we can
in H :
L
n;,l
10g(1 _ e27rinz)
L: 1:
(_l/m)eZ~inmz
n~l m~l
71
~(z).
- 1.72 -
Hence replacing z by -lIz : log
"1 (-l/z) =
I claim that log ~(-l/z)
=
,(-l/z)
L..
-lli./12z -
=
n ,m;;,l
(l/m)e-ZJrinm/z
!log(z/i)
+
log,(z)
(this will prove
VZ/i ~(z) ), with the principal determination of log in
~ - ilR+ (hence the determination of VZ/i which takes the value +1 at
z
=
i). We have to prove that -1l"i/12z - 'It'iz/12
L.
+
n,m~l
(l/m) (e 2'Jfimnz _ e -21rimn/z)
11 og (/.) Z 1
= 2
By summing first on n (the series is absolutely convergent), using the formula for the sum of a geometric series, we are led to a term e
2Jrimz
e -27tim/z 1 - e -2xim!z
= (i/2)(cotg(xmz)
+
cotg(~m/z))
Remains to prove that the sum -1ti / 1 2z - 1[i z/ 12
+
(
i / 4)
L
n,O
(c 0 t g (7[ nz)
+
co t g (TC n/ z) ) / n
is equal to llog(z/i). To compute the sum, and prove this, we use the following trick of Siegel's. We consider the auxiliary function f(s) = cotg(s).cotg(s/z) It has a double pole at s = 0 and simple poles for s = nx , nnz when 0 !: nE:ll. Since cotg(s) = s-l - s/3 cotg(s/z)
=
zs-l - s/3z
Ress=o ,(s)/(8s)
+
=
•••
•••
,
and also
we get
,
(-~/3
+
- 1/(3z))/8
=
-z/24 - 1/24z
Ress=nlC!:O tfCs)/(8s) = (1/8n1t)cotg(n1C/z) Res s=nXZr~O .(s)/(8s) = T
(1/8n~z)cotg(nxz)·z
This shows that the desiredsum is precisely 2'J{i LResepCs)/8s , with a summation extended over the whole s-p1ane. Let V be a positive number, not an integer and P the parallelogram with vertices Then
.A(-nrp)
f(s)/(Bs)ds
=~p
f(lIn:t)/(Bt)dt
72
±l,~z.
- 1.73 -
tends to the desired sum when bounded on
)p for
."
y~~.
Since t
~ ~(v~t)
is uniformly
and since this function has the limits
~oo
1 on two opposite sides, -Ion the other opposite sides (vertices not included), the limit is easily estimated lim [, \I~eo
ip
If ()/ 5) /
( 85) d 5
=H
JZ JZ +
1
-1
) (d s /
s)
= l(logz - 10g1 + logz - 10g-1) = !log(z/i) with the principal determination of log in
~
- iR+ .
The conclusion of the proof of the theorem is now easy. Indeed, g 24 as well as ~ 24 have Fourier series starting with t = q 2 and are modular forms of weight 6 (indeed SL2(~) is generated by 1 ). This characterizes 6. completely because M0 = (1o -10) and (01 1) 6 More precisely, (g - ~)/g is a modular function vanishing at i~
a:.~.
Since it is holomorphic everywhere on H, it must be identically zero, this proves g =
~
, and finishes the proof of the theorem.
This theorem gives a relatively easy way of computing the first coefficients ~(l)
=
1,
~(n)
L(2) = -24,
~(3)
= 252,
~(4)
= -1472,
Ramanujan conjectured and Mordel1 proved (1920) that moreover (4.6)
L'C(n)n -s n~l
for Re(s»
=
~
(1 _ ~(p)p-s + pll-2s)-1
p prime
13/2. The proof of this has now become a standard applica-
tion of the theory of Hecke operators in modular forms. An equivalent way of saying (4.6) is (4.6) ,
't(mn) = 'C(m) 'ten) when (m,n)
= I
,
m.) 1, n;, 1,
't(pk+l) = 'rep) 't'(pk) - pll,((pk-l) , P prime, k ~ I These Ramanujan coefficients satisfy many famous congruences, e.g. except for a finite number of n, they are all divisible by 691 (although Ramanujan knew that for
n~5000
73
the..:(n) are never divisible
- 1.74 -
by 691, he was lead to conjecture the preceeding result, which was later proved by Watson). Actually, Walfisz proved that the
~(n)
are
2 ·7·691 for nearly all n. Some conjectures divisible by 2 5 ·32·5 remain open. For example, Ramanujan conjectured that 1~(p)I<2pll/2 Deligne could recently show (1969) that this would indeed be the case if the general conjectures of A. Weil on the form of the zeta function of algebraic varieties over finite fields were proved to be true. Lehmer conjectures that
~(n)
~
0 for all n : this has been
proved at least for n <10~. (For all these questions, we refer to J.-P. Serre: Une interpretation des congruences relatives 1 la fonction
T
de Ramanujan, Seminaire Delange-Pisot-Poitou, 1967/68,
n 0 14.)
The Fourier coefficients c(n) of j have also some interesting (and intriguing) arithmetical properties. Their growth is wild c(O) = 744, c(l)
= 196884, c(2) = 21493760,
For example : if n is divisible by 2a , then c(n) is multiple of 2 3a +8 , 3 2b +3 , 3b ,
"
"
"
"
5c ,
"
"
"
"
Sc+l
,
CHAPTER TWO
ELLIPTIC CURVES IN CHARACTERISTIC ZERO
Many properties of (complex) elliptic curves are purely algebraic in the sense that they could also be derived in the field generated over the rationals by the two numbers g2 and g3 ' or perhaps in its alge.raic closure. It is thus sufficient to work in a fixed field of characteristic zero (algebraically closed if needed). Our first aim will be to show that any non-singular (projective) plane cubic can be given in Weierstrass' normal form. The study of the differentials over the curve (and their classification) will also be made purely in algebraic terms. It is true that by Lefschetz' principle the case of the complex field is crucial, and we can always reduce algebraic geometrical problems in characteristic zero to the same problems over [. However, it is not always useful to do that. In particular, when the coefficients &2 ' g3 are rational, it may be very interesting to look at the points of the curve yZ = 4x 3 - gzx - &3 with coordinates in the p-adic fields
~p
(or algebraic extensions of these). It is very strik-
ing how Jacobi's theory of theta functions can be made here if the absolute invariant j is not a p-integer (i.e. j
~ ~
p
). Thus we
conclude the chapter with a brief description of Tate's elliptic curves. We shall assume that the reader is acquainted with the following facts of algebra :
75
- 11.2 -
A[X]
a) If A is a noetherian ring, so is
(Hilbert's basis
theorem), and if A is a unique factorization domain, so is also
A[X] (X
is an indeterminate).
b) Notion of integral element over a ring and simple properties (sum and product of integers are integers, transitivity). c) Notion of valuation (with value group contained in R), and existence of extensions of a given valuation, corresponding to a field extension:)
A certain familiarity with the p-adic fields
~
p
(p a prime) and metric
spaces with an ultra-metric distance, might also help in the last sections, although this is not absolutely required. In the section on analytic p-adic functions, some propositions are proved for their interest and are not used after. Such are ( 4 . 2), (4. 3) and (5. 11) •
v: K)( -+- R = R+ whereas absolute values are considered multiplicatively 1•••l v : K~ ....
*) Valuations are considered additively
R: .
76
- 11.3 -
1. Algebraic varieties and curves
Let k be any field and (Pi) a family of polynomials of k[Xl, ... ,x n ] (where n is a strictly positive fixed integer). For any extension field K of k, we define the set VKCKn as set of common zeros x = (xl' · · · ,x n ) E: Kn of the family (P. ) P.1 (x) = 0 for 1 all indices i. Ifx€V , it is obvious that P(x) = 0 for every K polynomial P of the ideal I generated by the Pi in k[Xl, ..• ,x n ] . By Hilbert's basis theorem (the ring k[Xl, ... ,x n] is noetherian),
the ideal I is generated by a finite number of polynomials, so we may suppose that the family (Pi) is finite to start with. We may then replace k by the field generated over the prime field by the finite number of coefficients occuring in the finite number of polynomials P. , and thus suppose that k is finitely generated over 1
the prime field. If k is of characteristic 0, we are thus reduced to the case of a field k finitely generated over
~,
and all such
fields can be embedded in the field [ of complex numbers ([ is algebraically closed, and its transcendance degree over
~
is not even
denumerable). Thus if the characteristic is 0, we could also have started with k
=[ :
this is Lefschetz t principle. We come back to
the general case, and note that the set V (or V ) may very well k K be empty (e.g. when the ideal I is the whole ring k(Xl, ... ,xn] in which case we say that the ideal is not strict). But when K is algebraically closed
we have
(1.1) Theorem 1 (Hilbert's Nullstellensatz). Let I be a strict ideal of the ring k[X1, .•.
,xn]
• Then there is a common zero x = (Xi)
in a finite algebraic extension K of k. In other words VK is not empty if K is an algebraic closure of k.
77
- 11.4 -
Proof. The proof is made by using essentially the notion of integral element over a ring. First part : we show that if I is a maximal (strict) ideal of k[XI, ...
,xn]
, then the field K
k [X I ' • • • , Xn ] / I
k[xl, ... ,X ] is algebraic over k. This is done by induction over n, n the case n = I being trivial, because k[x] is not a field if x is transcendental over k. We note then that k(x )[X 2 ,· .. ,x n] = k[xl,···,x n ] , l because the right member is a field. By induction hypothesis, the n - I elements x , ... ,x n are algebraic over the field k(x I ) k', 2 hence there is a polynomial P(x l ) E: k[x I ] so that all P(xI)x i for i = 2, ... ,n , are integral over k[x 1]
(this P is a common denominator
for the Xi'S). If now f£.k(xl)Ck[xl, .•. ,x n], a suitable positive integer N
=
N(f) will be such that
pN(XI)·f~k[XJ [P(xI)xZ,· •• ,P(xI)X n ] hence integral over k[x I ] (by transitivity of the notion of integrality) This implies pN f 4::k [xl] , or f
~k [Xl ,P(x I ) -1] and
k(x ) C k [xl' P(x ) I I
-1] ,
Hence the result by transitivity of
and so Xl is algebraic over k the algebraicity. Second part
finally
We
s~lect
a maximal ideal I' containing
I and we look at the homomorphism k [Xl' · · · ,X n] ----... k [Xl' · · · ,X n] /1' = k [Xl' · · · ,x n] = K n By construction x = (xl' ... ,x n ) E:K and P(x) = 0 for all P E: I ' , in particular P(x) = 0 for all PE;IcI' (1.2) Corollary 1. Let PiE:k[XI, •..
,x n]
andKbeanextensionofksuchthat n Pi (x) = 0 for a "point" x = (xl' .•. ,x n ) £K . Then there is a point
y = (Yl' •.• 'Yn) with coordinates in a finite algebraic extension of k such that Pi(y) = 0 (for all indices i). be the ideal of K[XI, •.• ,x n ] generated by I. By hypotheK does not contain I, hence a fortiori I fa I elK. Thus I is strict.
Proof. Let I sis I
K
78
- 11.5 -
(1.3) Corollary 2. Let I be an ideal of k[X , •.• ,xn]and K an algebraic
l n VK(I) cK • Then P E k [Xl' ... ,X n] vanishes on V(I) if and only if a power of P is in I. In particular if I is a closure of k. Put V(I)
prime ideal, P £1 if and only if P vanishes on V(I). Proof (Rabinowitsch). We add an indeterminate Xo and consider the ideal I of k[Xo, ... ,xnJ generated by I and 1 - XoP where P is the polynomial supposed to vanish on V(I). By construction, I has no zero on V(I) hence no zero at all. By the Nullstellensatz, I is not strict and we can write 1 = A (1 - X P)
o
0
+
L
a fini te sum wi th some polynomials P.1
A.P. 1
E:
1
I. Making X0
l/.P in this
formal relation leads to a relation '\
-1
, XI ' · · • , Xn ) Pi (X 1 ' · • · , Xn ) = 1 , and multiplying by the highest power pt of P appearing in the denoL-
Ai (P
minators of the Ai (P
-1
t
,Xl' . · · ,X n ) , we get P
E:
I.
n Let us fix an algebraic closure K of k. An affine variety in K defined over k is by definition a set of the form V(I) for an ideal I of k[Xl, •.. ,xnJ. This variety is said to be irreducible when the corresponding ideal of k[XI, .•. ,Xn]formed of all polynomials vanishing on it is a prime ideal (by definition, a prime ideal is an ideal whose complement is stable under multiplication and contains 1, or equivalently an ideal giving by quotient an integral domain). By the above corollary 2, irreducible (affine) varieties defined over k and prime ideals of polynomial rings over k correspond one-one to each other. This notion is relative to the field k, and we should say k-irreducible instead of irreducible, to be quite precise. When K is an algebraic closure of k and the ideal I K generated by I in K[Xl, ... ,x n ] is still prime, we say that VK(I) = VK(IK)CK n is absolutely irreducible (it is irreducible over any field, containing K or not). 79
- 11.6 -
As before, let k be any field, and let K be a field extension of k. The set of common zeros of a family (F ) of homogeneous polynoi mials (we shall say: forms) of k[Xo' ••• 'X n ] in Kn + l can be identified with a subset of pn(K), the space of lines through the origin in Kn + l (or equivalently the quotient of Kn + l - to} under the equivalence relation given by homotheties). Again this set of zeros depends only on the ideal I generated by the Fi ' and we denote it by VK
=
VK(I)
c: lP n (K)
This is what we call a projective (sub-)variety of Wn(K) defined over the field k. We say that an ideal of a polynomial ring is homogeneous when it can be generated by a family of forms. One sees at once that an ideal of polynomials is homogeneous exactly when the homogeneous components of its members are still in it. Thus, a homogeneous ideal has a set of generators constituted by a finite number of forms (and Lefschetz' principle is still valid for projective varieties in characteristic zero). A hypersurface in pn(K) defined over k is the set of zeros of a single form F with coefficients in k, or equivalently of a principal homogeneous ideal (F) of k[X o , .•• ,X] n ). We shall need (a special case of) a theorem asserting that when K is algebraically closed, a homogeneous ideal wi th m, n genera tors has a zero in pn(K) (such an ideal has always a zero in Kn + 1 , namely the origin, but this trivial zero does not correspond to a point of pn (K) ). In other words, the intersection of m, n hypersurfaces in Wn(K) is not empty if K is algebraically closed. This is a corollary of the dimension theorem, or of elimination theory, but we prove it directly. (1.4) Theorem 2. Let k· be a field, l"m,n two positive integers, and
(F.) .-1 1 1- ,
•••
,m
a set of m forms of k[Xo' ... 'X n ]. Then there is a
fini te algebraic extension K of k and a point 0 , x = (x o 80
' •••
,x n )
E:
Kn+1
- 11.7 -
such that Fi(X) = 0 for i = l, .•• ,m
(if none of the F.1 is constant).
Proof. Let k' = k(Fl(X), ••. ,Fm(X))ck(X0 , ... ,X) n with Fl. (X) = =
Fi(Xo~
..• ,Xn) · The transcendence degree of k' over k is smaller or
equal to m, hence one X.1 at least is still transcendent over k'. Let p = deg k(X o ' ••• ,Xn)/k' has been chosen so that
~
and assume that the numbering
1
Xl' .•. ,X p is a transcendence basis of
k(Xo' ... 'X n ) over k'. We put L = k'(Xl, •.. ,Xp ) and make a diagram. We define a discrete valuation on L
I
k(Xo'···'X n )
trivial on k' by v(X l ) = -1 , v(X Z) • •• = v(X ) = 0 • This defip
finite algebraic
k'(Xl, ... ,X p ) = L purely transcendental of degree p
I
k'
keF.
1
(X)).-l 1-
~
nes v uniquely because if P/Q is a representation of an element of L as
1
rational element over k', then ••• m
v(P/Q) = deg 1 (Q) - degl(P) where deg l denotes the degree in Xl (this
k
is the valuation at infinity associated to Xl). This valuation has an extension to k(Xo' ... 'X n ) which we still denote by v. Let i be such that v(X i ) is minimal, hence v(Xi)~-l. Thus
.
for j = O, ••• ,n v(X·/X.) = v(X j ) - v(X i ) ~ 0 J l. Now the valuation ring Rv V(F)~OJ has a unique { F E: k (X 0' • • • , Xn ) (*)
V(F)~O} , so that Jl= Rv/M v is a field
maximal ideal Mv ={F€R v containing k' (:> k). Also v(F (X IX.
t
01
= - dt
dol
, ... ,X n IX.)) = v(F." (X , ••• ,X )/X. ) l '0 n 1
v ( Xi) ~ d-t
= deg (F.t)
~
1
Let yJo be the class of X./X. in Jl (this is meaningful by (*)), so that J
of Y
=
1
(y.) ~ J1?+1 (remember y. J
1
=
1) is a common zero of all
F,
~
by (**0. The theorem follows by Cor. 1 of Th. 1 applied to the ideal 81
- 11.8 -
generated by I and I - Xi (which has still y as zero). (Alternately, one could also say that if 0 were the only zero of I in Kn + l , with an algebraic closure K of k, then I would contain all X~ , j=O, ... ,n), J
for a sufficiently large integer N by Cor.Z of Th.l, and thus could not vanish on 0 ;: y
E
.n.n+l .)
It is convenient to fix an algebraic closure K of k. As in the affine case, we say that a projective variety (in pn(K)) defined over k is k-irreducible, when its corresponding homogeneous ideal of k[Xo' ... 'X n] is prime, and we say that it is absolutely irreducible when its corresponding homogeneous ideal of K[Xo' ... 'X n] is still prime. Let k[X , .•. ,X] = ffi Ad be the decomposition of this graded o n d)O ring in homogeneous components (the Ad are finite dimensional k-vector spaces). A homogeneous ideal I of this graded ring is precisely an ideal I such that 1= $I d = $(InAd ). By Cor.Z of Th.l, I has 0 as only zero if and only if I d = Ad for some d ~ 0 (hence 1m = Am for all m ~ d). We shall say that a homogeneous ideal I is strict when Id
t:
Ad for all d:) 0 (or equivalently when the corresponding subvarie-
tyof pn(K) is not empty). Th.Z says that an ideal of k[Xo' ... 'X n] generated by m ~n non-constant forms is strict. The only prime homogeneous ideal which is not strict is
d
$
Ad . Let I be a.ny strict,
~l
homogeneous, prime ideal of A = ~Ad ' and let V be the corresponding projective variety in
pn(K). We define the field key) of k-rational
functions on V to be the field of classes of elements F/G with F and Ge:A d for some with the equivalence relation
d~O,
and G¢I d
whenever FIG Z - FZG I e I Because I is a prime ideal, the quotient A/I is an integral domain FI/GI~FZ/GZ
and
82
- 11.9 -
k(Y) c field of fractions of A/I
+
All this is done without reference to a k-basis of Al (only the gradation of A is used), or as we say, without coordinate system (i.e. without the special linear forms X.). But if I is as before, one linear l.
form Yo ¢ II and we may even suppose that Yo is selected among the forms
x.l.
if we want. After renumbering the indeterminates, we may thus
suppose
Xo It. I. We fix now Xo and classify the strict, homogeneous,
prime ideals I which do not contain Xo • With that purpose in mind, we introduce two operations (inverse to each other in a sense), the first one being the homogeneization operation which associates to each polynomial GEo k [Xl' ... ,X n ] the homogeneous polynomial (form) G* in the n + 1 indeterminates Xo, ... ,X n defined by * _ deg(G) (1.5) G (Xo'···'X n ) - Xo G(Xl/Xo' ... 'Xn/X o ) The second one is the dehomogeneization which associates to each form FE: k[X o ' ... 'Xn] the polynomial in the n indeterminates F0 defined by (1.6)
Making Xo = 1 in the definition of G*, it is obvious that (G*)o
G
for every polynomial G. Moreover it is easily seen that (F o )* = F for every form F not divisible by Xo (from this one can see in general that (F )* = X-m·F where m is the largest integer such that F is o 0 divisible by X~ ). These operations are extended to ideals and give a one-one correspondance between (e.g.) strict, prime ideals of k[Xl, ... ,Xn]and strict, prime, homogeneous ideals of k[Xo' ... 'X n ] which do not contain Xo . If we fix again a strict, homogeneous, prime ideal I not containing Xo ' the homomorphism key) cfield of fractions of A/I G
has for kernel precisely the set of G such that G* 83
E.
I and so consists
- 11.10 -
of those G
= (G*) o E: I 0 . In other words, we have an isomorphism
field of fractions of k[XI, .•. ,Xn]/I o ~ key) given by the preceding map. In particular, we could have defined the field of k-rational functions key) on the k-irreducible projective variety V corresponding to the ideal I as being the field of k-rational functions on the k-irreducible affine variety Vo corresponding to the ideal 1 0 (this last field being defined as field of fractions of the integral domain k[Xl, •.. ,Xn]/I o ). The drawback of this method is that it depends a priori on the choice of the hyperplane at infinity Xo = 0 (where Xo t I) · When we work with a finite number of projective varieties, it is always possible to select a linear form Yo (if k is infinite) such that all ideals defining the varieties do not contain Yo (note that it may not be possible to choose Yo among the X.1 here). In other words, it is possible to choose a hyperplane at infinity not containing any member of the finite family. Thus many (local) problems are reduced to affine ones after a suitable choice of hyperplane at infinity. We also note that by Cor.2 of Th.i, the elements of key) have an interpretation as functions defined over non-empty subsets of V
=
VI ' because if FIG is one representation of an element of key), Gt I implies that the set of points of V where G does not vanish is not empty, and on this part of V, F/G defines a function, because F and G are forms of the same degree. Moreover, if Fl/G l is another representation of the same element of k(V) , it will determine the same function in the intersection of the domains where these functions are defined. Conversely, if FI/G1 and FZ/G Z determine the same function on the subset of V where G1G Z does not vanish, they determine the
84
- 11.11 -
same class in key). We do not prove this in general, hence shall refrain from using it (in this general form). We say that an element f
E:
key) is defined at the point P of
V (= V and K is an algebraic closure of k) if f admits a representaK tion FIG with either F(P) 1 0 or G(P) 1 0 (in fact F(P) has no meaning because P is a set of points of the form ~P
o with P0 € Kl1+~\O} and F(P) = (K~)deg(F)F(P ), so that we should write F(P ) 1 0 or o
0
F(P) 1 to}). Then f is said to be defined and finite at the point P when moreover, FIG can be selected (in the class of f) so that G(P) 1
o.
The set of functions f
at P is a ring Rp • For f unambiguously by
FIG
(P)
E:
E:
key) which are defined and finite
Rp c k(V), the value f(P) is defined
= F(Po)/G(P ) o
for a PoE:P. If we assume that
P does not lie on the hyperplane of equation Xo = 0 (which we may after renumbering the indeterminates), and if we identify key) with the field of fractions of R = k[Xl, .•. ,Xn]/I o ' we see that Rp is the subring consisting of fractions FIG with G(P o ) 1 0, where Po (1, ... ) is in the class of P. The set of f £ R such that f(P 0) = 0 is a prime ideal Rlp
r
of R (even a maximal ideal if k
= K) because the quotient
c K. Then Rp is nothing else than the localized of R at
Rp = R = R[(~]-l C field of fractions of R
r
= key)
l'
•
(1.7) Definition. The subring Rp of k(V) consisting of functions defined and finite at PeV K is called local ring of V at P. This definition is justified by the fact that Rp is a local ring in the sense of commutative algebra
Rp
-
R; = {f
= FIG:
F,GER and F(P)
= o}
=1'~
is an ideal (hence the only maximal ideal of Rp ). (1.8) Proposition. The local ring Rp is a noetherian ring.
85
- II.lZ -
Proof. We show that every ideal J p of Rp is finitely generated. Let J = J p n R. Then J is an ideal of the noetherian ring R (a quotient k[Xl, ... ,XnJ/I
of a noetherian ring is noetherian) and so we can o choose a finite family fl, ... ,f N of R-generators of J. I claim that generate J p (over Rp ). Indeed, if fEJ p , we can write f = g/h i with h(P) f 0 and g = L a.f. (a. E:R). Then 111 the f
f
=
r:
(ai/h)f i
with ai/h£R p by definition ·
We turn now to the study of curves. (1.9) Definition. An irreducible variety VClPn(K) (defined over k) is an algebraic curve (over k) if the transcendence degree of key) over k is one. We shall mainly be interested in irreducible plane curves, i.e. curves in lP Z(K) defined by an irreducible form F E: k[X ,Xl' XZ]. o
Plane curves are hypersurfaces inWZ(K), so that we can apply Th.Z. (1.10) Proposition. Let K be an algebraic closure of k and F and G two forms of k[Xo'XI,x ] determining two curves V(F), V(G)ClPZ(K). Z
If F
~
G have no common factor, the intersection V(F)n V(G) is
finite (and not empty). Proof. We assume that neither F nor G is divisible by Xo ,and we prove the assertion of the proposition in the affine piece Xo 1. Thus Fo and Go have no common factor in R = k[Xl,X Z] (notations of (1.6) ) and also
no common factor in k(X l ) [Xz] (Gauss' lemma). As this ring is now principal, we can write aF 0
+
AF o
+
bG o = 1
wi th sui table a, b E: k (Xl )[X ] z and consequently there exists a polynomial 0 t- C(X l ) ~ k[X l ] so that BG o = C f 0 with polynomials A,B
E:k[X l ,X ] 2 The common zeros of Fo and Go must have a first coordinate among the finite set of roots of the polynomial C. Similarly for the second
86
- 11.13 -
coordinates of the intersection points (with another polynomial ~). This proves that the intersection is finite, because V(F) (and V(G) ) have only finitely many points on the line at infinity Xo
O.
(1.11) Corollary 1. Let FE: k [X o ' Xl ' Xz] be a prime form, and V(F) be the corresponding irreducible plane curve. If f
E:
k(V), the set of
points P E: V(F) where f is not defined and fini te f ¢ Rp is fini te. Proof. Let us select a representation AlB of f, with B¢(F). Because F is irreducible, Band F have no common factor and we can apply the proposition, because the set of points where f is not defined and finite is contained in V(B)nV(F). (l.lZ) Definition. Let FEk[X o 'X 1 ,x Z] be an irreducible form, and P £. V(F) c pZ (K). The point P is called regular (or simple, or non-singular) on V(F) when the formal derivatives
(~F/~Xi)(P)
(i = O,l,Z)
do not vanish simultaneously at P (same convention as before regarding the meaning of this). A point which is not regular is called singular. A projective curve is called regular if it has no singular point, and singular if it has at least one singular point. (We shall also say that an affine curve is regular, or non-singular, when the projective curve defined canonically by homogenizing is regular.) Euler's relation gives LXi(dFldXi)
= deg(F)·F
and if P E: V(F) is not on the line at infini ty Xo = 0, we normalize PoE:P by Xo(P o ) = 1 (i.e. Po = (1,·,·) ) and we deduce
But obviously
aFldX.1 = (oF 0 IdX.)* for i = 1,Z , and so, the conditions 1
(dF I~X.)(P )
=
(dF IdX.)*(P ) not both 0 for i
01001
0
= 1,2
are already sufficient for the non-singu1ari ty of a point P €. V(F) not on the line at infinity Xo
=
0 .
87
- 11.14 -
(1.13) Corollary 2. Suppose that the field k is perfect. Then the set of singular points on an irreducible plane curve defined over k is finite. Proof. If F and the
dF/aX i have infinitely many common zeros, they
must have a common factor, and hence (by irreducibility of
must be a multiple of F
dF/~X. 1
F). Because the dF/dX.1 are forms of degree,
deg(F) - 1 , it follows that they vanish identically, and by Euler's relation deg(F)·F =
L.
X.(dF/dX.) -= 0 . 1 1
This is sufficient in characteristic 0, because F defining a curve. In characteristic p
~
=0
is not a form
0, we see that deg(F) must be
a multiple of p, and more precisely, the vanishing of the partial derivatives shows that F depends only of the X~1 . Because k is perfect F = (F')P by taking the pth roots of all coefficients of F, and so F could not be irreducible. We have obtained a contradiction in both cases. (1.14) Examples. We consider the following affine curves.
o
X3
+
y 3 - Xy
=
0
They are irreducible, and have all the origin as singular point. The class f of Y/X is not defined at the origin for any of these curves, nor is l/f defined at the origin for any of them. On the curve of equation Y - X3 = 0 , both £ and l/f are defined at the origin, only f being finite at the origin
3
fe:R(O,O)(Y-X) 88
- 11.15 -
(1.15) Convention. For the rest of the section, K a fixed algebraic closure of k. and V
=
=
k will denote
V (F) Cp 2(K) will be a KK
irreducible (i.e. absolutely irreducible) projective plane curve defined over k by a form F E: k [X o ' Xl' X2 ] . Let X = X denote the set of (discrete) normalized valuations K v of K(V) (i.e. such that vK(V)~ =~) which are trivial on K. This set is going to play the role of "spectrum" of the field K(V) and will be very close to the set of points of V (cf. Cor.l below). We say that a valuation ve:X is centered at a point P = Pv £V if for f E:R p v(f»O is equivalent to f€M p Obviously v is centered at most at one point P because if P ; P' £V there are functions f in RpO Rp ' with
f €.M p but f ¢M p ' (take for f
the quotient of two linear forms gjh where h does not vanish at P,P' and g vanishes at P but not at P'). (1.16) Lemma. Every valuation v E:X is centered at a (unique) point p = PV E.,V , so that we have a well defined mapping VK defined by v ~ ~(v) = Pv Proof. Let X,Y,Z be a permutation of the X.1 such that v(X/Z)
n
X
~
~
° and
v(Y/Z):> 0. Denote by x (resp. y) the class of XjZ (resp. Y/Z) in K(V). If vex - ()
=
° for
all
~ E:
K, we would infer that for every
polynomial P(X)€ K[X] we would also have v(P(x)) = 0, because every polynomial is a product of linear factors in K(algebraically close~. Also v(R(x)) = numerator
° for
every rational function R(X)£K(X) because both
and denominator have valuation 0. But as K(V) is algebraic
over K(x), it would follow that v is trivial: if teK(VJwrite its minimal polynomial t N + alt N-
which implies vet) This
~
=
l
+ •••
=
O. hence Nv(t)
= v(alt N- l + ••• )
0. Thus there is a {E: K such that vex - {) >0.
is unique, because if
~1
were another element of K with 89
- 11.16 -
the same property, we would derive Inf(v(x-~)
,v(x- f'))
>
° contrary to
v(~/_~) = v((x-~)
-
(x-~')) ~
the fact that v is assumed trivial
on K. Now write the equation of the affine part of V in the form
=
Fo(X,Y) and factor Fo(~ ,Y)
L. v(y
c
Fo(~'Y)
11 (Y
- '1i)
- (X -
=
~)A(X,Y)
°,
- ~i) in linear terms. We have
= vex - {)
+ v(A(x,y))
~ vex - 0 > 0
Because x and y have been chosen with yositive valuation, every polynomial in x and y will have positive valuation, and the above relation can only hold if for one i at least v(y -
~i)
>
0. But for
the same reason as above, this strict inequality can hold at most for one
~i
• By construction, the point P
=
( ~, 'Ji ,1)
Pv
is on
the affine curve Va of equation Fo ' and defines a point (still denoted by P) on V. After a translation, we may suppose that this point is at the origin Pv
=
(0,0,1) of the affine coordinate system chosen.
If fE:.R p , then f admits a representation AlB
with B(O,O,l) ;
Because now vex) >0 and v(y) >0, this implies v(Bo(x,y)) = v(f) = v(Ao(x,y)/Bo(x,y)) = v(Ao(x,y)) Ao(O,O) =
° or
>
° is
O.
° and
equivalent to
to f£M p as was to be shown.
We show a kind of converse to this lemma. (1.17) Theorem 3. Let P be a regular point of V. Then there is a
unique valuation v p valuation, f
E:R p
ring and Mp =
ord p E: X which is centered at P. For that
is equivalent to ordp(f)
(~p)
~ 0.
Thus
Rp
is a valuation
is principal with generator any element such that
n M~
ordpC't'p)
1. Also
determined
br ordp(7t~R;)
r)O
lo},
so that the valuation in guestion is
ordp(M~ - Mr 1 ) = r. (K = k, cL end of proaL)
If S = S(V) denotes the finite set (K is perfect,(1.13),(1.15)) of singular points of V, we see that ord : V - S p
~
ord p is inverse of
7t
(rc-v and
on the sets where they are defined). 90
V-1t
~
X defined by
are the identi ty mapping
- 11.17 -
Proof. Let again X,Y,Z be a permutation of the X.1 such that Z(P) , 0 and (dF/dY)(P) ,
° . Normalize Poe:P by Z(Po )
= 1 ,
and identify Po
with a point (~,~) of the affine plane K 2 • After a translation (replacing x -
~
by x' and y -
by y') we may assume that Po = (0,0)
ry
is at the origin of the affine coordinate system defined by X,Y • Thus the affine part Vo of V is defined by the polynomial Fo with no constant term. Supposing that Fo is not proportional to Y (trivial case), we may assume Fo(X,Y) with A(O)
=
YA(Y) - B(X,Y)X
1 (this is no restriction) and
B(X,Y) = Bo(X)
° (because
YBl(X)
+
+ •••
Fo is not divisible by V). We denote also by x and y the classes of X/Z and Y/Z resp. in K(V).
with Bo(X) ,
First step. Obviously Mp = (x,y) is generated (over Rp ) by x and y. We show (under the preceding hypotheses) that Mp is principal and generated by
x , or what amounts to the same, that y is a
rr p
multiple of x in Rp
By (*) we have yA(y)
•
y = xB(x,y)/A(y) e:xR p because A(O) ,
B(x,y)x, hence
=
° implies
l/A(y) E:R p •
Second step. We show more precisely that there is an integer r such that y€xrR;
= M~
-
If on the contrary B(O,O)
Mr
1
= 0,
• This is so with r
=
~
1
1 if B(O,O) "f O.
we proceed as follows: multiplying
by A(y), A(y)B(x,y) = A(y)Bo(x) and writing Bo(X) =
+
Bl(x)A(y)y
+
•••
XB~(X),
A(y)B(x,y) =
= x(B~
x(B~(x)A(y)
+
B1 (x)B(x,y)
+
y( ••• ))
B1Bo + y( ••• )) = x(~o + y( •.• )) with a new polynomial Ero(X) = B~(X) + Bo(X)Bl(X) containing exactly one power of X less than B (X). If Xr - l is the highest power of X +
o
dividing Bo ' we shall get after r-l steps 91
- 11.18 -
A(y)
r-l
B(x,y)
x
=
r-l
~
(Bo(x)
+
y( ••• ))
with fo(O) ~ 0 • This shows that y
= xrC(x,y)/A(y)r
°,
with C(O,O) !
r
hence the assertion (C(x,y) / A(y) E:R p ). )C
Third step. We show now that
n
~ = {O} mlO (this is Krull's theorem valid quite generally for local noetherian rings, but we indicate a direct proof in our particular case). Equivalently we show Rp -
{OJ
=
U
m)O
JCRR"
p p
=
U (~p _ ~+1) p
m~O
If fe:R p - {OJ we choose a representation C/D of f with D(O,O) ! 0. If C(O,O) ! 0 , fER; and we are done. If on the contrary C(O,O)
0 , it is rather obvious that a method similar to that m )( used in the second step will lead to f £ x Rp • =
The conclusion of the proof is now easy. Any normalized valuation (trivial on K) centered at P is such that vex)
=
1 and trivial on
R~ • This shows that there is at most one such valuation. Conversely
we can define for f = xDlg(x,y) with mE.7L ordp(f) = m. As every element f of
K(V)~
and g(x,y)€R; is of this form this gives
the existence of the required valuation, and concludes the proof. We note however, that we have only used the fact that k = K is algebraically closed to be able to suppose that the point under consideration was at the origin. In other words, to have x -
~Ek(V)
and Y -"]Ek(V).
Thus, the proof shows that the result is true if we replace K by the subfield k'
k(~,~) =
k(P) generated by the coordinates of Po
over any field of definition of V (i.e. containing the coefficients of the polynomial F0 (X, Y) defining V). This field k(P) is a fini te algebraic extension of k (independent of the choice of the line at
92
- 11.19 -
infinity, as long as we take the representant Po with one coordinate equal to 1). (1.18) Corollary 1.
Ii
V = VK is an absolutely irreducible non-singular plane curve, n: X ~ V is a bijection. (One could show in general that finite.
7t
is surjective and ~(S{V)) is
In fact X is in one-one correspondence with the normalized
- or any non-singular model - of the curve V.) One should be careful about the fact that even if element v, v{f) >
° (for f E:K{V))
-1 ~(P)
has only one
does not imply f G:M p • Take for example as in (1.14) the curve y2 =x 3 , p at the origin, and f = y/x.
Then there is a unique valuation v centered at P, and for that valuation v{f) Recall now that a subring A of a field F is called valuation ring if for any
°;: x
E:
F, either x or x-I belongs to A. The rings
associated to valuations (by v(f»O) as before, are obviously valuation rings in this sense. But conversely (1.19) Corollary 2. Rp is a valuation ring of K{V) if and only if the point P is regular (and then Rp is the ring of ord p ). Proof. Remains to show that if Rp is a valuation ring, then P is regular. So we suppose that ylx By definition we can write y/x
E:
=
Rp (otherwise interchange y and x) • A(x,y)/B(x,y) with formal polyno-
mials A(X,Y), B(X,Y)E:K[X,Y] and say
B(O~O)
= 1 (we suppose P at
the origin). This gives YB(X,Y) - XA(X,Y) £1 o (F o ). A degree consideration shows that up to a constant factor Fo = YB(X,Y) - XA(X,Y) so that (dFo/3Y)(0,0) = 1 ;:
° and P
(0,0) is regular.
We note explicitely that all f£K(V) are defined at all regular points PEV (possibly infinite at some of them). If f is defined and finite at the regular point P, f(P) is the unique 93
- 11.20 -
element aoof K such that ordp(f f
= ~mod
aJ >0. Equivalently we may write
Mp . From this it follows that f can be expanded in a
formal Taylor series 2
T(fj = a o + alxp + a 2np + ••• -1 where a l is determined by a l = (f - ao)~p mod Mp , and so on inductively. This series converges to f in Rp (or its completed~p) for the (1.20)
topology defined by taking the powers of the maximal ideal Mp as neighbourhoods of 0 (hence all ideals of Rp are neighbourhoods of 0 because Rp is principal). We can also look at this expansion as formal series T(f)
E: K[[1rp]]
• More generally, every f
E:
K(V) can be
expanded in a Laurent series at P, which will be an element of the field of fractions K((np )) of the former ring K[T~] gives the usual
Then ord p
notion of order of a Laurent series.
Suppose now that V is non-singular. Let Div(V) be the free abelian group generated over the set of points of V. Then we can define the divisor of f e: K(V)
by the formal expression L.ordp(f}(P) •
By (1.11) applied to f and l/f , only finitely ordp(f) will be different from 0 which assures div(f)
=r:
ordp(f). (P) £ Div(V) . PE-V (If V were not assumed to be non-singular, we could define the (1.21)
divisor of f on X using the fact that ~(S(V)) is finite, by div(f) =
L
veX
v(f)· (v)
94
E:
Div(X)
.)
- 11.21 -
Another consequence of Th.3 is the possibility of defining purely algebraically the order of contact of a line and the curve V at a regular point PE:V. Let o<X
+ ~Y +
l =
0 be the equation of
an affine line. We can define the order of contact of the curve and this line at the regular point P £V to be the positive integer ordp(<<x
r).
+ ~y +
This order will be 0 exactly when P is not on
the line under consideration, it will be through P. When this order is
~
if the line passes
~l
2 we say that the line is tangent
to the curve at P, and when this order is
~
3 we say that this line
is an inflexion tangent at P, and that P is a flex of V. Every regular point has a unique tangent. To see this, we suppose the point at the origin of the affine coordinate system chosen, and we write the defining equation of the curve in the form Fo(X,Y) = aX terms of degree is
~
bY
+
~2.
2, so that aX
+
G(X,Y) with a polynomial G containing only
+
Then ordp(ax bY
=
+
by) = ordp(-G(x,y)) = ordp(G(x,y))
0 is the equation of a tangent to p (note
that (a,b) f (0,0) because P is regular). In this case ~p
bx - ay is a uniformizing variable (i.e. generator of Mp )' and
if a'X
b'Y
+
get a'x
+
a'X
b'y +
0 was the equation of another tangent line, we would
=
2
= 7t p u
(UE:R p )
b'Y - b 2X2
+
and thus 2abXY - a 2y 2
= 0 mod (F o )
,
hence, by degree consideration, the left hand-side would be equal to Fo (up to a constant in K), and this shows (a' ,b') proportional to (a,b). To find a convenient criterium for the point P to be a flex, we make the further reduction where a we can take
~p
=
0 (so that b
r
0 and
= x as uniformizing variable, as in the proof of
Th.3), and we write more exp1icitely the defining polynomial (1.22) Fo (X,Y) = Y + rX 2 + sXY + ty 2 + G'(X,Y)
95
- 11.22 -
with a polynomial G' containing only terms of degree higher than 3. Then Y = 0 is the equation of the tangent, and ordp(Y)
ordp(rx
2
+
sxy
+
equals 20rd p (x) = 2 as soon as r is a flex precisely when r
=
o.
ty ~
2
+
G'(x,y))
0 (because ordp(Y) ~2). Thus P
We shall transform this condition in
a projective form. Let F be the form defining V and let H(X ,X 1 ,X 2) = det(~2F/aX.OX.) o 1 J This is a form of degree ~ 3(n - 2) if n = deg(F), called the (1.23)
Hessian of F. The plane curve V(H) defined by H(X o 'X 1 ,X 2 ) = 0 is also called the Hessian of V. If the characteristic of the field K is 0, the Hessian form is of degree 3(n - 2) exactly, and we have the following criterium. (1.24) Proposition. Suppose that K is of characteristic 0, and let V(H) be the Hessian of V. Then a regular point P of V is a flex precis,ely when P f:: V ()V(H) is an intersection point of. the Hessian (of V) with V. Proof. We take Z(P) ; 0, is at
~nce mor~
(~F/dY)(P)
the~origin
a permutation X,Y,Z of the Xi such that
; 0, and assume that Po (normalized by Z(Po)=l)
of the coordinate system defined by X,Y. Moreover
we may assume that the tangent line to P is the line Y = 0 and so Fo will be given by (1.22) of which we take the notations. We compute ZH(X,Y,Z) by multiplying by Z the "last" column of H and we add to this column X times the first one and Y times the second one ( this does not change the value of the determinant). Euler's relation in the last column, will give the three terms (n-l)~F/aX
, (n-l)dF/dY , (n-I)()F/dZ ,
because these partial derivatives are forms of degree n-l (we are in
characte~istic
0). We do the same trick with the lines, adding
96
- 11.23 -
X times the first one, Y times the second one, to Z times the last one. We get
F" XX
F"
XY
(n-l)F~
F"
F" YY
(n-l)F y
YX
I
(n-l)F~ (n-l)F~ n(n-l)F
(with notations for the partial derivatives which are better suited to typewri ters! ). If P ~ V we get thus 2r
s
s
2t
a
b
a
b (n-l)2
o
with the notations of (1.22) where we have supposed
a
= 0, b = 1.
This shows that the point Po will be on the Hessian precisely when r = 0 in the case of (1.22), which is a necessary and sufficient condition for P to be a flex. We have used the characteristic assumption to be assured that the partial derivatives are forms of degree n-1 and to conclude r = 0 from 2(n-l)2 r = O. Thus the characteristic assumption can be relaxed Il.'
in special cases. For example, if F is a form of degree 3 (defining a cubic V), the result of the proposition will obviously hold in characteristic p ; 2, 3. (1.25) Corollary. In characteristic 0, a non-singular plane curve of degree > 2 has at least one flex. Proof. Indeed, the Hessian is of degree 3(n-2»0 in that case, and it is sufficient to apply Th.2 (1.4).
97
- 11.24 -
2. Plane cubic curves
Let k be any field, K
=
-
k an algebraic closure of k. A plane
cubic curve defined over k is given by a cubic form (2.1)
3
aoX o
+
(a l X1
+
2
a 2X2)X o
+ •••
+
3
a 6X1
+
2
a 7Xl X2
+
2
a 8 X1 X2
3
+ a9~
with ten coefficients a e:k. Two families (a ) determine the same i i cubic curve in p2(K) whenever they are proportional (with non-zero factor of proportionality). Thus we can identify cubics in
p2(K)
defined over k with elements of p9(k). Some of these cubics are reducible (degenerate in a line plus a conic, or three lines, distinct or not), and some irreducible cubics have singularities. We shall show first, that with mild assumptions, a K-irreducible non-singular such cubic has a very simple equation in a suitable coordinate system. Equivalently, we could say that after a suitable projective transformation of p2(K), the curve of equation (2.1) can be brought on a curve with a much simpler equation. (2.2) Theorem 1. Let V be an irreducible plane cubic defined over the algebraically closed field K of characteristic p f 2. Suppose that V has a flex P. Then there is a coordinate system in which V has an equation of the form y 2Z - F(X,Z) = 0 with a form F of degree three. Conversely, if F is not divisible by Z, this equation defines an irreducible cubic with a flex on the line Z
= o.
Proof. Choose a basis X , Y, Z of linear forms such that the flex has (X,Y,Z) coordinates respectively equal to (0,1,0). Suppose moreover that Z
=
0 is the tangent to V at the flex P. In the
affine coordinate system Y form (1.22) with r
=
=
1, the equation of V will have the
0
Z + sXZ + tZ 2 + A3 (X,Z)
98
0
- II.2S -
with a homogeneous polynomial
~of
degree 3. In projective form, this
equation will be
Zy Z
sXYZ
+
tYZ Z
+
+
A3 (X,Z) = 0 .
Because we assume that the characteristic is not 2 we may define a new form Y'
=
Y
+
Zy,Z
}(sX +
+
tZ) whence an equation
A~(X,Z) = 0
with another homogeneous polynomial A; of degree 3. This is the required form. Conversely, if F is not divisible by Z, the irreducibi1ity amounts to the irreducibility of the polynomial y2 _ F(X) in K[X,y] or in K(X)[Y]
(~auss' lemma)
where F is a polynomial of degree 3. This is obvious because this degree being odd, F cannot be a square. Thus y2 Z - F{X,Z) =
°
is the equation of an irreducible cubic (if F is not divisible by Z), and this cubic has a flex at (X,y,Z) = (0,1,0). We emphasize that if V is K-irreducible and non-singular, and if the characteristic p is not 2 or 3, the existence of a flex follows from (1.25) (and the remark just before it), hence (2.3) Corollary. Every irreducible non-singular cubic in characteristic p r 2,3 has a Weierstrass equation y 2Z = X3 + aXZ 2 + bZ 3 with 4a
5
p
r
+
27b 2
r
0. An irreducible non-singular cubic in characteristic
2 having a flex, has an equation in Legendre form y2 Z = X{X - Z) (X - A Z)
~
r
°, 1
(over the algebraically closed field K). Proof. The first result follows from the remark above combined with the possibility of completing the cube in characteristic r 3 and thus killing the factor X2 in the cubic polynomial in X. The Legendre form follows from a normalization of the three roots of the cubic polynomial. (2.4) Remark. The Weierstrass form in characteristic p 99
r
2,3 can
- 11.26 -
be
attained over any field of definition k of V containing the
coordinates of the flex P. The same remark obviously does not apply to the Legendre form. But we see at once on the Legendre form, that the only irreducible singular cubics having a flex, in characteristic p ; 2 have equations of the form X2 (X - Z) flex at (0,1,0), double point at (0,0,1) X3 flex at (0,1,0), cusp at (0,0,1) Now we turn to the existence of group laws on cubics. Let V be an irreducible plane cubic over the algebraically closed field K of characteristic p ; 2 having a flex P. Let S = S(V) be the set of singular points on V and R(V) = V - S(V) the set of regular points of V. For any two points Q, Q'
E:
R(V), let Q1' be the third
intersection point of V and the line through Q, Q' (this line is the tangent to V if Q we define
Q +
P
and thus Q" = Q if Q = Q'
Q'
=P
). Then
Q' to be the third point of intersection of V and
the line through P and Q" (the symmetric of Q" with respect to
Pl.
Obviously all intersection points involved are regular on V (look at the two singular cubics above, and note that lines through the singular point of V cut V in only one point, and are not tangent to that point). Not less obvious are Q + Q'
P
= Q'
Q
+ Q
P
+
P
P =Q
and if QP denotes the symmetric of Q on V (with respect to P), Q + QP
P
=P
We study this law in the singular cases first. We start with the case of an irreducible cubic with a double point (and p ; 2), hence an equation of the form a) above. The two tangents at the origin (0,0,1) are Y = X and Y
=
-x.
100
In a new system of coordinates
- 11.27 -
admitting these tangents as axis, the equation will have the form X3 - y3 _ XYZ
= 0 • We take the affine coordinate system Y = 1
which contains all regular points on V, and make a picture. y
z
(2.5)
x
x 3 _ y 3 - xy
o
(Z
xz
1)
Z
X3 - 1 (Y = x 2 - l/x
1)
In the last drawing, we see that a line avoiding the singular point is a non-vertical line of equation z = mx
+
h , and this line cuts
the curve in the points of abscissa (mx
+
h)x
1
x
3
- mx
2
- hx - 1
=
0 ,
hence in three points (xi,zi) (i = 1,2,3) with x l x 2x 3 proves that the law + ,where P is the flex x = 1 P
is isomorphic to the multiplicative group projection (x,z)
~
(R(V),
K~under
This
1 Z
=
0 (y
1)
the first
x
p) ---+
K
This result does not use the fact that K is algebraically closed, and is true as soon as the field k contains the coordinates of the flex and of the double point. The picture shows the case K = R. Secondly, we take the singular case corresponding to the equation b) above (case of a cusp). Again, we take the affine coordinate system for which Y
=
1, containing all regular points
of V. The flex is then at the origin of this affine system.
101
- 11.28 -
z
y (2.6)
x y
x Z = X3 (Y = 1)
2 = X3 (Z = 1)
As before, a line avoiding the singular point of V is not vertical in this system, hence has an equation of the form z = m x
+
h ,
cutting the curve in three points of abscissa satisfying x 3 - mx - h = O. The three roots of this equation satisfy xl
x2
+
+
x 3 = 0 (because the equation does not contain a term in x 2 )
hence the first projection (x,z) ---- x gives here an isomorphism
R(V),+
K+
P
where P is the flex x
(additive group of
0, z = 0 (and y
=
K)
,
1). The fact that K is
algebraically closed is not more important here than in the last case of the double point, and the picture illustrates the case K
R.
Finally we come to the case where V is non-singular and the characteristic is neither 2 nor 3. With the choice of flex P, we show that V is a group for the law
+
P
•
Only remains to check the
associativity of this "addition". We choose a coordinate system in which the curve has a Weierstrass' equation, and which puts P on the line at infinity with Z = 0 as tangent. Thus we suppose that the affine equation of the curve is
y2 = X3
+
aX
+
b •
If a line does not go through the chosen flex it is not vertical in this system and has an equation y = mx
+
h , and the intersection
. h ave x-coor d·lnates satls . fylng · pOlnts x 3 - m2x 2 If P.
1
(2.7)
+(a
(xi'Yi) are these three points, we have x3
=
-xl - x 2
+
((Y2 - Yl)/(x 2 - xl))
102
2
- 2rn h) x
+
b
-
0 h 2 =.
- 11.29 -
where (Y2 - Yl)/(x Z - xl) has to be replaced by the slope of the tangent if xl = X z (compare with (1.1.14)). Thus the coordinates of PI
p P2
are rational functions of the coordinates of PI and Pz
'
with coefficients in the prime field. Thus the associativity amounts to the formal verification of the equality of two rational expressions with coefficients in the prime field, or still to the verification that the two polynomials obtained by multiplying the numerator of the first and the denominator of the second on one hand, the denominator of the first and the numerator of the second on the other hand, coincide. Hence the associativity for the sum of three points (distinct from P = (0,1,0); this is sufficient, because this point acts as unit) is reduced to a formal identity between polynomials. In particular, it is sufficient to check it in characteristic
°
(polynomials with rational integral coefficients). But in this case, it follows from the transcendental theory of chapter
I. (Of course
we could have avoided this use of Lefschetz principle had we done more algebraic geometry of curves. The interested reader should read the proof given in Fulton, cf. reference given for this chapterJ The group law has been defined in such a way as to give sum to three points of V on a line. We sum up
103
°
(=
P)
- 11.30 -
(2.8) Theorem 2. Let V be a non-singular, absolutely irreducible plane cubic defined over the field k of characteristic p ; 2,3 , and having a flex P with coordinates in k. Then there is a group law on the set Vk of points of V having coordinates in k, such that P is unit and that three points have zero sum P whenever they lie on a line. (2.9) Corollary 1. The flexes on a non-singular irreducible plane cubic defined over the algebraically closed field K of characteristic p ; 2,3 (once one of them P is fixed) are the points Q£V = V K such that Q Q Q = P , or more briefly 3Q = O.
p p
Proof. Indeed the condition given is equivalent to Q
+
P
Q = QP
because this symmetric QP of Q with respect to P is the inverse of Q for the group law deduced from P. This means that the third intersection point of V and the tangent at Q is Q = (QP)p itself. (2.l0Y Corollary 2. Let V be an irreducible non-singular plane cubic over the algebraically closed field K of characteristic
o.
Then V has nine flexes which make up a subgroup of V (when one of them is chosen as unit element) isomorphic to (71/371)">C: (Z/3Z) . Proof. Had we proved Bezout's theorem, the first part would follow immediately, because the Hessian of a cubi.c is of degree 3 hence has 3x3
=
9 intersection points with the cubic. We proceed differen-
tly (and prove a more precise result: the nine flexes are always distinct). We use Lefschetz' principle and thus suppose that the cubic is defined over k c: ([, and we assume that the algebraic closure
k of k in K is itself embedded in ([. Because the group law is given by rational functions (we suppose that P has coordinates in k, or that the cubic is given in Weierstrass' form) with coefficients in k, the solutions of 3Q = 0 have coordinates in kc([. We are reduced
104
- 11.31 -
to the case K = [, where the situation is easy by the transcendental theory of the first chapter : points of order dividing three on the elliptic curve [/L
are the points (1/3)L/L
~
L/3L , which
form a subgroup of order 9 isomorphic to ~/~)Z as was to be proved. (Z.ll) Remark. The nine flexes on such a cubic curve have a special configuration. Indeed, if PI ' Pz are two distinct flexes, we have 3P l
= 0 , 3P Z = 0 hence 3(-Pl -P Z) = 0 , so that -PI -P Z is a third
flex colinear with PI and Pz
•
To sum up, any line joining two
distinct flexes contains a third (distinct) flex. However, the nine flexes are not on a line, because a line cannot cut the cubic in more than three points. In a suitable coordinate system, these nine points will be contained in an affine portion KZ• We cannot make a picture in R Z of this configuration, because any finite set of points of R Z such that a line joining two distinct of them contains a third point of the set is necessarily on a line (proof?).
(Z.lZ) Example. Let yZ = 4x 3 - gzx - g3 be the (affine) equation of a non-singular plane cubic with gz ' g3
~ ~
. Then the set of
points (x,y) £ ~Z on this curve, form an abelian group with the point at infinity as neutral element (for the group law described before (Z.8) ). It can be shown that this abelian group is finitely generated (Mordell-Weil theorem), hence isomorphic to the direct sum of a finite group and a free group
Zr of rank r ~O. It is still
an open problem to determine this rank r in terms of the cubic.
105
- 11.32 -
Let k be an algebraically closed field, L a function field of one variable over k, i.e. a 0eparable) finitely generated extension of k of transcendental degree one. Then it is well known that there exists a transcendental element xE:L such that L is a finite separable algebraic extension of k. Such an extension has a generator y, so that we can write L = k(x,y). Let F be the minimal polynomial of y over k(x) and multiply it by a suitable polynomial in x so as to get an irreducible polynomial F€k[X,y]. Then, L if V
key)
=
V(F) is the (affine) irreducible variety of zeros of F.
=
We denote by X the set of normalized valuations of L which are trivial on k (hoping that this X will not be confused with the former indeterminate X !). We have a mapping (1.16) 1t
X
:
~
xCv)
V
This mapping is bijective outside the set S(V) of singular points -1
of V. We assume that rr (S(V)) is finite (this could be proved, but we know at least that it is true if V is non-singular). Thus every fE'.L
has a well defined divisor div(f)
=
L.
v(f)· (v)
E:
Div(X)
VE:.X
For every divisor
~ E:
L(~)
=
Di v (X) we define the k-subspace L (~) of L by {fE.L : div(f) ?; -~ }
(remember the convention
o Ei L (~)
div(O)~ -~
for any divisor
~,
so that
).
(2.13) Definition. We say that L is of genus g over k preceding conditions are satisfied and dimkL(~)
Observe that
= deg(~)
h
~
fh
+
1 - g
when the
when
for all
~
E:Div(X) of
deg(~)
defines a k-linear isomorphism of
> 2g
- 2.
L(~)
onto L(i - div(f)). In particular, these spaces have the same dimension and this proves that the degree of the divisors of the form div(f)
106
- 11.33 -
are always zero in a field of genus g.
o.
(2.14) Example 1. The rational field k(x) is of genus
Indeed, in
this case, the set X is in one to one correspondence with the projective line over k, and divisors d over X are thus expressions of the form -d = d00 (~)
+
~ d a (a) with rational integers d a vanishing for
ae:
all but a fini te number of a we observe that f
J---+-
f
E:
k. To compute the dimension of
TT (x - a) d a
L(~),
is a k-linear isomorphism of
a£k L(i) into the vector space of polynomials k[x] , having as image the subspace of polynomials F such that -deg(F)
= ord~(F) ~
or equivalently, such that
-d_ -
~
ae:k
deg(F)~d
polynomials is of dimension d
+
=
da
= -deg(~)
deg(~).
I
This subspace of
1 , so that the conditions of the
definition (2.13) are satisfied with g =
o.
(2.15) Example 2. The function field key) over a singular plane cubic curve V is of genus O. It is sufficient to look at the two special curves defined in (2.4). If the singular cubic has an . . db y ratlona . I functIons, . equatIon y 2 = x 3, . I t can b e parametrIze 3 e.g. y = t and x = t 2 • If the singular cubic has an equation
y2
x 2 (x - 1) , it can also be parametrized by rational functions,
e.g. x
=
1
+
t2 , y
=
t(l
+
t 2 ) • In both cases, key)
k(x,y) ck(t)
is a rational field (LUroth's theorem). It has genus 0 by the example 1 above. (2.16) Example 3. We have proved in (1.2.12) that if L is a lattice in [, then the field of L-elliptic functions is a field of genus one over the complex field.
107
- 11.34 -
0,1 in
The above examples exhaust the possibilities for g the following more precise sense.
(2.17) Theorem 3. Let k be an algebraically closed field, L a function field of one variable over k. If L is of genus 0 over k, k(x) (L is a rational field).
then L is purely transcendental L
If L is of genus one over k, then L is the field of rational functions over a non-singular irreducible plane cubic with a flex. In this case (g = 1), if the characteristic p of k is different from 2, the cubic can be taken in Legendre's form y2 = x ex - 1) (x - A) , and if the characteristic p of k is different from 2 and 3, the cubic can be taken in Weierstrass' form y2 = x 3
+
ax
b . In these
+
forms, the non-singularity amounts to AI: 0,1 and respectively 3 Z 4a + 27h I 0 . Proof. Observe once for all that X is not empty: L contains a rational subfie1d and all valuations from this rational subfield (trivial on k) can be extended to L (this shows more precisely that X is infinite because k is infinite). First part. Assume g = 0, and take any v£X, putting
= (v) for
~
the corresponding divisor on X of degree one. By hypothesis L(Q) is of dimension 1, hence equal to k, and contained in dimension 2. Select any
- L(Q)cL, so that necessarily
xk . By construction vex) = -1 because x ~ L(Q), and consequently v(x n ) = -n for all integers n. By induction on n, one
L(~)
=k
xe:L(~)
of
L(~)
$
sees that L(n~) = k • xk e ... e xnk . I claim that the rational subfield k(x) of L is the whole field L. Take any f in L - k and write its divisor div(f) where
(f)~ ~
div(f)
~
=
(£)0 -
(f)~
-f
-f
(symbolically f (0) - f (-) ),
0 is the polar part of the divisor of f. In particular
- (f)oo and so f E::L( (£)00). This shows that the 2 (n
108
+
1)
- 11.35 -
functions
1 , x , x2
,
...
, x
n
f , xf, x 2 f, are in
L(n~ +
n
+
+
d oO
By hypothesis this last space is of dimension
(f)~).
1 where doo = deg (f)eo . As soon as n ~ d oo
inequali ty n
dIDO
+
1
+
<
2(n
+
there is a strict
'
1). Hence there must be a linear
dependance relation over k between the former functions, and this leads to an equation
L b i xi
L a i xi. f
( ai' b i
E: k ,
0
~i
E; d-o) •
As the xi are linearly independant over k • the coefficients a i i) cannot vanish simultaneously. This proves f = (l:boxi)/{L:aox 1 1 is in k(x) as was to be proved.
=1
Second part. Assume now g Because select L(3~)
L(3~)
L(~)
and
take'~
= (v) with v£X as before.
is of dimension 1 now, we must have -
x£L(2~)
L(~)
This element must have vex)
L(~).
=
= k. So we -2. Since
is of dimension 3 over k we can select YE:L such that k
~
xk
$
yk , and this implies v(y) = -3. From this we
derive
(because v(x 2)
-4)
-S)
$
x 2k $ xyk (v(xy) 2 3 yk ~ x k • xyk • x k
$
yk e x 2k e xyk
L(4~)
k e xk ~ yk e x 2k
L(S~)
k
L(6~)
k e xk xk
k because both v(x 3 )
xk $ yk
$
$
$
$
y2 k
v(y2) = -6 . This shows the existence of a
linear relation over k between the powers in question, that is, a cubic relation between x and y of the form F(x,y) = y2
(2.18)
+
with coefficients in k. replace y
+
! (etX
+~)
equat10n y 2 = JX 3 o
(o(x
+
~)y+ ~x3
+
~x2
+ ex +
"1
0
If the characteristic is not two, we can
by Y and thus kill the term in y, getting the
r:2
+ oX
-
+ Ex +
TJ-
• Th e Legen d re 109
f orm
- 11.36 -
is obtained from there by normalizing x and y in order to get a unitary polynomial in x, and by normalizing its roots as indicated. We have repeatedly said that then A1 0,1 is the non-singulari ty condition, a fact which can be checked in any characteristic p 1 2 (the only possible singular points occur with ordinate y = 0, and their abscissa must be simultaneous roots of the cubic polynomial in x and its derivative). In characteristic p 1 3 , the possibility of killing the term in x 2 is open by completing the cube (and non-singularity condition as claimed if p 1 2,3). Remains to show that the subfield k(x,y) of L is equal to L. As in the first part, we take any f functions
2
L - k , and consider the 3 (n
...
,
x
y
,
xy, x 2y,
, x nY
f
,
xf, x 2f,
,
x
+ 3)~ +
,
,
x
+
1)
n
1
They are all in L( (2n
,
E:
xnf
(f)oo) of dimension 2n
+
3
+
doo (notations
as in the first part). As soon as n is sufficiently large (n :)doo + 1 will do), they will be linearly dependant over k, hence a relation i i i La.x + y Lb.x = f Lc.x I I I
i (otherwise, y€,k(x) : a sheer LC.X 1
with a non-zero polynomial
nonsense in view of the fact that vex) is even and v(y) odd). This gives eventually f = rex)
+
s(x)y E:k(x,y) with r(x), sex) E:k(x)
(and gives once more the fact that y is quadratic over k(x) by taking f = y). Let us just recall that the non-singularity of the curve follows from example 2 above.
q.e.d.
No doubt the reader will have recognized the analogy between the two functions x and y over the curve and the Weierstrass' functions
p and
P'
(the transcendental possibility of taking for y the
derivative of x calls for the factor 4 in yZ 110
= 4x 3
- gzx - g3 ).
- 11.37 -
With the notations of the proof (case g = 1), the divisor called very ample, because Lover k. The divisor
L(3~)
3~
is
contains a set of generators of
itself is called ample because one of its
~
multiples is very ample. This terminology however, adds little to the understanding of the present simple situation. We shall eventually show that conversely, all function fields over non-singular cubics are of genus one. For that, we have to give a purely algebraic proof replacing that of the first chapter using theta functions. The first steps consist in finding substitutes for Liouville's theorem and Abel's condition (I.l.la,c,d). Also, the whole idea behind the use of the subspaces
L(~)
of a function
field L is that they give quite generally a filtration of the k-vector space L wi th fini te dimensional subspaces L(~) cL(~')
: this is why
L(~)
is defined by
implies
(~~ ~'
div(f)~ -~
with a minus
sign !). (2.18) Proposition. Let L be a function field over k. a) L(Q)
=
k , or i'n other words : a function f £L without pole
is constant, f E:k. b) The spaces
L(~)
are finite dimensional for all divisors
and more precisely, if dimkL (~) c)
.!i
~
d
dimkL(~)
+
~~O
~£Div(X),
is a positive divisor of degree d
1.
=
deg(~) +
1 for one positive divisor d f 0, then
L is rational (of genus 0 over k). Proof. a) If f ¢k, define a valuation of kef) by v(f)
=
-1. If w
denotes a normal i zed extens ion of v to L, we have w(f) <: 0 so that f has at least the pole
WE:
X.
The same argument applied to 1/ (f - a)
shows that a non-constant function f £L takes all values a e:k. b) We prove the more precise assertion concerning positive divisors.
111
- 11.38 -
Let
~~O
and
and let us show that
VE:X,
We suppose that
dimL(~ +
(v))~dimL(~)
+
1.
Pv €V is a non-singular point (cf. note after proof) and let Xp be a uniformizing variable at P = Pv ~(v)
(1tp) = Mp CR p , that is v('fp ) = l .
L(~
+
We consider the linear map d +1
(P)) ~ k defined by f ~ (~pp
multiplicity of P in
f)(P) where d p is the
By definition its kernel is precisely
~.
L(~),
and this proves the desired inequality. Using a), induction applies and proves b). c) If dim
L(~)
then dim L((P))
=d
+
1 for one positive divisor
~
, 0 of degree d,.
= 2 by b) for all points P occuring in d (with
multiplicity dp>O). Hence for one such point P L((P)) = k
$
ordp(x) = -1 .
xk,
Consequently L(n(P)) is of maximal dimension n
+
1 for all positive
integers n. Take any y£L, and multiply it by a suitable polynomial in x ih order to make it integral over k[xJ. Then y will have poles only where x has poles, and y
E:
L(N (P)) for a sui tably big integer
N. This shows Y £k(x) and L = k(x) is rational. It is true that we have performed the proof in the non-singular case. More precisely, we have used only the following fact : if v E: X, there exists a plane curve V such that L = key) (a model of L)
and that v is centered at a non-singular point Pv E:. V.
However, we shall only use the results of the proposition in the non-singular case. (2.19) Corollary. If the irreducible plane curve V admits one rational function f€k(V) with only one pole, and a simple one, at the regular point PE;.V, then V is isomorphic to a projective line: key)
=
kef) is a rational field (of genus 0 over k).
This is a particular case of part c) of the above proposition. In particular, f has only one zero and div(f) 112
=
(PI) - (P2) ·
- 11.39 -
(Some readers will probably have noted the analogy between this corollary and the characterization of the sphere 52 = pl([) - e.g. among real compact surfaces - by the existence of a Morse function with only two critical points.) We come back to the case of cubics for Abel's condition (analogous to (I.l.l.d)). Let thus V be an irreducible non-singular plane cubic over
the (algebraically closed) field k of characte-
ristic p f Z. We suppose that the coordinate system has been chosen in such a way that V has the (affine) equation (Z.ZO)
Vo :
Z - (X-e )(X-e )(X-e ) = 0 l 3 Z
Y
with distinct e. eke We could normalize the e. 's so as to have 1
1
e l = 0, e 2 = 1, e 3 = A f 0,1 and work with this equation in Legendre form. However, we prefer to· keep more freedom, because
r
if moreover p
3, we might prefer to work with the Weierstrass'
form (corresponding to the other normalization e l + e + e 3 Z
=
0).
(Z.Zl) Lemma 1. If we denote by ord. the normalized valuation corresponding to the flex P at infinity, we have ord.(x) ord.(y)
=
=
-Z and
-3 .
Proof. We have to remember that x is the class of X/Z (y that of Y/Z) in the function field L
=
key) in our projective definition
of this field. By construction of the coordinate system, P is at the origin of the affine coordinate system defined by Y the line Z
=
=
3 , ordp(class of X/Y)
because the line of equation X gent (distinct from Z
and
1
=
1 and
0 is tangent at the flex P, so that
ordp(class of Z/Y)
ord~(y)
=
=
=
=
1 ,
0 goes through P but is not tan-
0). This shows
ordp(class of Y/Z)
ordp(class of X/Y)
=
-3
= ord~(x)
113
as asserted, -
ord~(y)
= ord~(x)
+
3 .
- 11.40 -
(2.22) Lemma 2. L
=
Proof. The automorphism defined by y
~
o.
k(V) is not of genus ~
of the quadratic extension L of k(x)
-y acts on the set of normalized valuations X
of L and also on the set of points Q £V. Obviously Qr is the point with coordinates x(Q),-y(Q) if Q
r
P and P~ = P is fixed
under this action. Let us take now a function f except at P (f is defined and finite at all Q have the same property and if f We show that f£k[x,y]
= a(x)
€.
r
L which is regular
P). Then f
b(x)y , f
+
~
r
will
= a(x) -b(x)y .
k[V o] the ring of regular functions on
the affine part Vo of V. It is sufficient to show that a(x) and b(x) £ k[x] . We prove it for example for a, by decomposing a in linear terms (k is algebraically closed) : a(x) If one integer
n~
=
TT(x _ ~)n~. I
were strictly negative, a would have
at the intersection points of Vo with the vertical X = apply this to the determination of L( (P))
C
poles ~
. We
k [x ,y]. But a non-constant
polynomial in x has order at infini ty E; -2 (by lemma 1) and a polynomial in x and y containing y has order at infinity This proves L((P))
=
~-3.
o.
k is of dimension 1 so L cannot be of genus
r Q e:.V)
(2.23) Lemma 3. For Q eV o (i.e. P
div(x - x(Q)) = -2(P) d i v (Y - mx - h)
= -
+
3 ( P)
(Q) +
and m,h E:,k, we have +
(Q 1)
(-Q) +
(Q 2)
+
(Q 3)
where the points Qi t:: V are the intersection points of V0 wi th the line of equation Y - mX - h = O. Proof. We have already determined the coefficients of (P) in these divisors in the first lemma (the pole of order 2 of x at P cannot compensate the pole of order 3 of y at the same point). It is tr
obvious tha t x - x (Q) vanishes only a t the two points Q and Q
-Q
=
=
(x(Q),-y(Q)). The multiplicities of these zeros are given by
the order of contact of the vertical X - x(Q) 114
=
0 and V at Q.
P
Q =
- 11.41 -
Because (~/~y)(yZ - (X-e l )(X-e )(X-e 3 )) can vanish at Q only if Z
y(Q)
0, we find that the vertical X -x(Q) = 0 is not tangent to
V at Q (hence the multiplicity is 1) unless y(Q) = 0, i.e. Q
-Q.
In this last case the multiplicity is 2 because these points are not flexes on V. The first formula is thus completely proved. The second one is derived similarly. We note that lemma 3 gives the divisors of all linear functions on V and that their degrees vanish. Also, they satisfy Abel's condi-
p on
tion (when p , 3 so as to have the group law
V). We give
explicitely the special case m = h = 0 with a consequence, in the following lemma. (2.24) Lemma 4. Let Ei(i = 1,2,3) be the three intersection points of Vo with the X-axis of equation Y = 0, so that E.1
-
div(x - e i ) = -2(P) div(y) = -3(P)
+
+
(E l )
(e.1 ,0). Then -
2(E i ) +
(E 2)
+
(E 3 )
The subgroup of V (when moreover p , 2,3) of points Q such that 2Q = 0 is fE o =P= 0, El , E2 , E3 } isomorphic to 71../27i x 71../ 271.. The assumption that the characteristic p is not 3 is not necessary, as we shall presently see. But in characteristic p = 2, we shall see much later (in the third chapter) that there can at most be one non-zero point of order two, and that the case where Eo = P is the only point of order dividing 2 also occurs (supersingular curves). Now we come to a closer study of the subgroup P(X) of principal divisors (of the form div(f) for some f d
= [dQ(Q)
Q~ = [dQ.Q
bL~)
E:Div(X) is any divisor, we put d
of Div(X). If
= deg(~) =
(sum in the group V with respect to
p).
LdQ
Then the
principal divisors d are exactly those for which d = 0 and Qd
115
and
0
- 11.42 -
(2.25) Theorem 4 (Abel). The group of principal divisors on X is a subgroup of Divo(X) , the group of divisors of degree
!.:
V --+ Divo(X)/P(X)
Q
o.
Moreover,
is an isomorphism
(Q) - (P) mod P(X)
~
In particular, the group law on V is independent - up to isomorphismof the choice of the flex P as neutral element. (As the law
+
P
has been shown associative only if p , 2,3 , we have
to make that assumption in the above formulation. However, a more careful analysis shows that we only use the two properties Q
+
p
QP = P
and (Ql
p Q2)
~ Q3
P (Q2 P Q3)
= Ql
= P whenever the
Q. 's are on a line. Thus this theorem gives another proof for the 1
associativity, which works for the characteristics p , 2.) Proof. All linear functions in k [x ,y] have principal divisors in Divo(X) by lemma 3, and they satisfy also the condition Q d Take now any principal divisor dive£)
= L.dQ(Q)
=
o.
E:P(X). Modulo
divisors of linear functions, we have (d - 1) (P) + (Qd)
so that it is sufficient to prove d of the form div(f) = (d - 1) (P)
=
0 and
Qd = 0 for divisors
(Qd) · But Qd ' P implies that f has only one simple zero at Qd (because f having a pole, d-l < 0) which is impossible by (2.19) applied to l/f, L not being of +
genus 0 (lemma 2). Then div(f) = d(P) is possible only with d
0
(loc. cit.) and with fE:k. Now we may consider DivoCX)/PCX) and define
f
as indicated. This map is a homomorphism in virtue of
lemma 3 which shows that !(-Q)
- f(Q)
~(Q 1) +
It is injective because
+
ep (Q 3)
pCQ 1)
=
=
0
f o r Q1
~(Q2)
116
+
Q2 + Q3 = 0 ·
implies that CQ2) - (Ql) is
- 11.43 -
a principal divisor, and this can happen as above (L is not of genus 0) only when Ql = Q2 · Let us commit the crime of "lese Bourbaki" and show that ~ is surjective. If d is a divisor of degree zero, d
= L.. d Q( (Q)
LdQ(Q)
- (P))
and this gives as in (1.1.24). The theorem is completely proved, and we are able to turn to the converse of the theorem 3 and show that cubics, when non-singular, have function fields of genus 1. (2.26) Theorem 5 (Riemann-Roch). Let V be an irreducible non-singular plane cubic over the (algebraically closed) field k of characteristic p
~
2. Then its rational function field L = key) is
of genus 1. Proof. We start by showing that the space L(n(P)), where n is a strictly positive integer, is of dimension n. We know that it is contained in k[x,y] (proof of lemma 2). The functions xi, xjy for 0 ' i
3,n, i.e. 0 <:j
+
<: (n-3)/2 make up a basis of
L(n(P)) because they are linearly independent (functions of the form xi have a pole of even order 2i at P and xjy have a pole of odd order at P), and there are exactly n such functions (only a function with a simple pole at P is missing). Then we take an arbitrary divisor ~
- d(P)
Hence
~ = =
L. dQ(Q)
of degree d ~ 0 . We have already seen that
(Qd) - (P)
L(~) =
+
L( (d - 1) (P)
div(f) with a suitable f +
(Q)
+
div(f))
~
E
L>< (Th.4).
L( (d - 1) (P)
the last isomorphism being given by multiplication
: h
+
(Q)), ~
h·f .
Remains to show that this last space is of dimension d, or that it contains one function not in L((d-l)(P)) (cf.(2.l8.b) proof) for d>l.
117
- 11.44 -
We have to construct a function in L((P) We observe that if Q
(Q)), not constant.
+
-Q, the divisor -(Q) - (P)
~
+
(-Q)
+
(2Q)
is principal by Th.4, so the function of which it is the divisor will do in this case. If Q = -Q, then Q is one Ei (i=1,2,3). We suppose Q = E (for example), and replace the latter divisor by l -(P) - (E l ) + (E 2) + (E 3 ) which is also principal (by the same criterium of Th.4 or more directly using lemma 4, we see that it is the divisor of y/(x - ell ). This concludes the proof. Let us turn to a full classification of function fields of one variable over k (algebraically closed and of characteristic p
~
2,3) of genus 1. As before, let X be the set of normalized
valuations of L trivial on k, and fix one
VE:X.
For a choice of
generators x,y of L such that (2.27)
x E:L(2(v)) - k , Y e:L(3(v)) - L(2(v)) , y2 = x 3 + ax + b
we define (2.28) This constant is well defined, independently of the choice of the generators x,y for if x' ,y' are other generators such as in (2.27), and satisfy y,2 x' = 0( x
x,3
+ (3
+
a'x'
, y' =
+
b' , we have
ry + rx
+
e.
wit h «1; 0 , P, 6, e
E:
k .
Substituting, we find immediately; = 0 (no cross term xy in (2.27)) t = 0 (no term in y) ~ = 0 (no term in x 2 ), so that finally x' = 0< x , y' = y and "(2 = 0( 3 E: k . 1f we put t = rio( E: k , then t 2 = ~, t 3 = and x' = t 2x , y' = t 3y . This shows that the
r
r
invariant
j~
deduced from the choice of the generators x' ,y' is
equal to j v . More important is the observation that j v is actually independent of the choice of v E:X •
118
- 11.45 -
(2.29) Lemma 5. There is a field automorphi sm
't' =
1:'(v', v)
Lover k which brings any VE::X on any (other) v'£X :
of
~L(n(v))
=L(n(v'))
for all positive integers n. Proof. Fix
v~X,
and let V be the non-singular cubic defined by a
choice of generators
x,y of L with vex) = -2, v(y)
=
-3 as before.
n(v) = P is the flex at infinity on V. We consider the v translation mappings
Then
x ----.,...
X defined by
w
t---+-
w - v'
defined by
Q
1---+
Q - Q'
V ---.. V
defined by the group law
+
=
+
p
on V. Because the addition is defined
rationally, the coordinates of Q - Q' are rational expressions of the coordinates of Q (with coefficients in k depending on Q'). Call
"t
= rl(x,y)
1:
= r 2 (x,y) these two rational expressions xT(Q) = rl(x,y)(Q) = rl(x(Q),y(Q)) = x(Q - Q') x
and similarly
y
y~(Q) = y(Q - Q')
. Now the mapping
(2.30)
is a k-isomorphism (it is surjective because its inverse is given by the consideration of the inverse translation Q
~
(Note that every f £L has a unique representation f By definition,
fT (Q
+
L(n(p))T
=
Q
+
a(x)
Q' ) . +
b(x)y.)
Q') = f(Q). In particular =
L(n(Q'))
If x,y are two generators as above, x~ ,yT will be two other generators with v'
(x~) =
-2 and V'(yT)
=
-3, and they will satisfy b ( t
is trivial on
k). We can then compute iv' by means of these two generators and get jv' = jv · (2.31) Definition. A function field of one variable of genus one
(over k) is called an elliptic field (over k). We have just seen that an elliptic field has a well defined invariant 119
- 11.46 -
j
~
k if the characteristic p is neither 2 nor 3. But the reader will
check easily that more generally, if p ; 2, one could use Legendre's equation and define j by (1.4.3). The invariant j is called the absolute invariant of the elliptic field. (2.32) Theorem 6. The mapping L
j
~
=
j(L)
k which associates
£
to every elliptic field L (over the algebraically closed field k of characteristic p ; 2) its absolute invariant, defines a bijection between the set of k-isomorphism classes of elliptic fields and k. Proof. We give a proof for the case p ; 2,3 (and leave the case p = 3 as exercise). First, if two elliptic fields Land "L' have the same invariant j = j', they will be the function fields of the non-singular cubics of respective (affine) equations y 2 = x3
+
ax
+
band
y,2 = x' 3
+
a"x
+
b,
If both a and a' vanish, the fields are certainly isomorphic because then h f 0, b'f 0 and we can choose a sixth root t of b'/b so that the mapping x ~ x'= t 2x , Y ~ y'=t 3y defines an isomorphism from V to V' and from L to L'. If a f 0, take any square root t of a' fa in k so that a,3 = t 6a 3 . Then j
j' implies b'Z = t 6b 2 and b'= ±t 3 b. Repla-
cing t by -t if necessary, we may assume b' = t 3b and define as before an isomorphism L ~ L'. Secondly, we have to see that any element j (in k) is the invariant of an elliptic field L/k. If j = 0 take for example for L the function field of the non-singular cubic y2 = x 3 If j f 0, replace the preceeding curve by y2 = x 3 root of
l7~8 = 4/(4
+
+
x
+
+ 1.
b where b is any
27b 2 ) (there are many other possible choices!).
(2.33) Remark 1. A law
Qwith
Q as neutral element can be introduced
on a non-singular cubic V as before (2.7) without the assumption that
Q is a flex (but p f 2). Because Q' morphism
V~V
~
(i.e. an isomorphism
Q'pQ defines a birational iso-
L~L),
lation gives an isomorphism between the laws 120
we see that this trans-
p and Q.
In particular,
- 11.47 this last law is associative (a group law). (2.34) Definitions for arbitrary field k. When k is not algebraically closed, we say that L is a function field of one variable over k if a) k is algebraically closed in L b) L is separable over k and of finite type
,
c) L is of transcendental degree one over k Then
k (an algebraic closure of k) is linearly disjoint from Lover k.
We say that L is an elliptic function field over k when moreover d) Lk is of genus one over k. An elliptic function field has an invariant (we have defined it only when p f 2)
J
~
k
which will be invariant under all automorphisms of
k/k hence purely inseparable over k : j
€
kP-~ .
An algebraic curve E is called an elliptic curve over k if it has a point rational over k and if its function field k(E) is an elliptic field over k. Such a curve is always k-isomorphic to a plane curve in Wei er s t r ass form: - L(2(P)),
then Cf
if P =
E:
E(k), t a ke (E L( 2 (P)) - L( (P) ) ,
'? E. L( 3 ( P))
-
(~'1J) : E --+p2 is rational over k,
Weierstrass equation and f(P) (point at infinity on feE) by construction) is a flex on ,(E). If E c p2 is itself a plane curve and P ~ E(k) is a flex, then there is a coordinate system of p2 in which E has a Weierstrass equation. (2.35) An example. Let us show that a field of genus one does not satisfy necessarily the conditions of definition (2.13) if k , Let k be a non-perfect field of characteristic 3 and a that ~
=
al / 3
£
E:
k.
k - k 3 , so
k l / 3 is not in k. Let L be the function field key)
where V is the cubic of (affine) equation y2 = x 3 -a, so that L = k(x ,(x
3
1
- a) 2) is the separable quadratic extension of k(x) obtained by adjoining a square root of x 3 - a. The cubic y2 = x 3 - a has a =
121
- 11.48 -
singular point (0<,0) over k,
so that its group law (having for
neutral element the flex at infinity) is defined at all points of Vk = V(k) (points having coordinates in k in one representation). This enables to show that for all divisors then
dimkL(~) =
~ ~
Div(V k ) of degree d > 0,
d exactly as in the proof of Th.5 (use (2.6)).
However, if k' is the extension k(o<) (or k l / 3 , or k) of k,
t' = L· k' = k' (x, (x - oc) 3/2) = k' ((x - ()() 1) is a rational field, hence of genus 0 over k'. This example shows that some care must be taken with non-perfect base fields k and this is why we define the genus after the constant field extension
k
instead of using (2.13) directly over k. We also require E(k) not
empty for an elliptic curve E over k in order to have a group structure over E(k) (and not only over E = E(k)). (2.36) Remark 2. Let k be a not necessarily algebraically closed field .of characteristic p
r
2,3 and let
v
x 3 ·+ ax
V'
x3
+
a' x
+
b + b'
be two absolutely irreducible cubics defined over k (a,a' ,b,b'
~
k)
and having the same absolute invariant j = j'. Then these curves are isomorphic over
k.
If neither a nor b vanish, the equality j =j'
implies (a'/a)3= (b'/b)2E:k hence the existence of a a' /a = t 2 , b'
/U
-
t 3 and the isomorphism V ~V' given by x t-+-tx,
3 Y ~ t / 2y is already defined over k(t 1). When a = we have a' =
° and
t e:k with
hence bb'
~
0. Take for
~
a sixth
° (case roo~
j = j'=O)
of b'/b
and define the isomorphism V ~V' by x ......... t-'-2 x , y ........ J.t3 y . It is rational over the extension b
=
b'
=
° (case
j = j'
k(~)
of degree six of k. Similarly if
= 1728), there is an isomorphism V -==4- V'
rational over an extension of degree four of k.
122
- 11.49 -
(2.37) Complement. We have used the result asserting that a function field of one variable over an algebraically closed field k has a separating base (consisting of one element x), to show that such a field is always isomorphic to the rational function field over an irreducible plane curve (cf. before (2.13) ) . This is well-known (e.g. Bourbaki, A1gebre, Chap.V,§9 n 0 3). We indicate here a simple proof avoiding derivations. More precisely, we prove Let k be a perfect field, L a finitely generated extension of k of transcendental degree one. Then there exists an element x£L such that L/k(x) is algebraic separable. Let us suppose first that L = k(x,y) is generated by two elements, and let P(x,y)
=
0 with
P~k[X,YJ be an algebraic dependence relation between x and y. Let
f and t
be the maximal positive integers such that t
f
P(X,Y) = Q(XP ,Yp ) . We show that L/k(x) is algebraic separable if algebraic separable if f
,.e ).
P(X,Y) = Q' (XP and since P(x,y)
f-
l~'f
(and L/k(y)
Because k = kP , we can write
t
.t
,Y)p
0, we shall have
,
Q' (x p
£-l
is separable for y wi th coefficients in k(xP
,y)
f-(
=
0 . This equation
) c k(x). Then the
general case is treated by induction. If L = k(Y1' ... Yn+l)' assume that Y1' ... 'Yn are separable over k(x). Let P as above express an th algebraic relation between x and Yn+l. By extraction of p roots, it will furnish a separable relation for x over k(Yn+l) or for Yn+l over k(x). The result follows by the transitivity property of separability. The reader will note that this property fails in general if we drop the assumption of finite generation: take k algebraically closed of characteristic prO and an indeterminate X; then L = k(XP has no element x with L/k(x) algebraic and separable. 123
-f
)
f~O
- II.SO -
3. Differential forms and elliptic integrals
In this section, we work throughout with an algebraically closed field k of characteristic p = 0 (in the second part, where we consider elliptic integrals and differential equations, we shall even assume k
= [).
We start by giving an algebraic definition of differentials on curves (globally, without speaking of the line bundles over the curve over which they can be interpreted as sections). (3.1) Definition. Let K be a function field of one variable over k and E
~
K-vector space. Then we denote by Derk(K,E) the K-vector
space of derivations D : K
~
E with values in E and trivial on k.
Such derivations are mappings satisfying
o
a) D(a) b) D(f
l
+
for all a Ek
f 2)
= D(f l )
+
D(f 2) for all f 1 ,f 2 E: K ,
c) D(f l f 2) = f 1D(f 2 ) + f 2D(f l ) for all f l ,f 2 EK · In particular they are linear mappings over k because a) and c) imply that D(af)
=
aD(f) for a €k and f E:K.
(3.2) Example. If K = k(x) is a rational field, the mapping D
~
D(x) is a bijection Derk(K,E)
~
E. This follows simply
from the fact that the formal rules of derivations for quotients are true and so D(f) is determined,as soon as D(x) is given arbitrarily in E , for all rational expressions f
=
f(x) in x.
(3.3) Proposition. If K' is a finite algebraic extension of the function field of one variable K over k, then the restriction operation Derk(K' ,E) ~
~
Derk(K,E) is bijective. (In other words,
k-derivation D of K has a unique extension as k-derivation of K'J
124
- 11.51 -
Proof. By the primitive element theorem K' = K(y) for an element y£K' (we are in characteristic 0). Let P(y) = 0 be the minimal polynomial of y over K. For all extensions pD(y) + p'(y)·B(y)
=
D of
D we must have
0 where pD is the polynomial obtained from P
by replacing its coefficients by their derivatives, and P' is the usual derivative of P. Since y is simple root of P (separability), we have P'(y) I 0 and so D(y) is necessarily given by _pD(y)/p,(y) This proves ~he uniqueness of the extension fi . The existence follows from the definition D(y) = _p D(y)/p 1 (y). This certainly defines a derivation
of the polynomials ring K[Y] which will be trivial
on the principal ideal generated by P, which is the ideal of algebraic relations satisfied by y (incredulous readers may have to check it pen in hand, or go back to Bourkaki, Algebre, Chap. V, §9 nOI Prop.5), and hence a derivation of K' = K(Y]/(P(Y)). (3.4) Corollary 1. For every function field K (of one variable over k) and any K-vector space E, Derk(K,E) is isomorphic to E over K. Proof. Let x be any element of K - k. Then K is a finite algebraic extension of the rational field k(x) . An isomorphism is then given by Der(K,E) D
~
~
Der(k(x) ,E)
restriction of D
~
E
~
D(x) .
Note that the composite of these two bijections is indeed K-linear (in Der(k(x) ,E) we consider E as k(x)-vector space by restriction of the scalars). Also note that there is no canonical isomorphism between the spaces in question. (3.5) Corollary 2. If f,g E:K and g ¢k, then D(f)/D(g) €K is independent of the non-zero derivation D€Der(K) Proof. Observe that if
g~k,
D(g)
=
= Derk(K,K).
0 only for D
=
0 (this is the
only extension of the zero derivation of keg) to K). Because Der(K)
125
- 11.52 -
is of dimension one over K, the quotients have a well-defined meaning. Then, if D' is another non-zero derivation, D'
hD for some h E: K
proves that D'(f)/D'(g) = D(f)/D(g). As in the proof of this corollary we shall denote Derk(K,K) simply by Derk(K) or by Der(K) if there is no risk of confusion about which field k we are using. Also, if g E: K - k we denote by Dg £Der(K) the unique derivation defined by Dg (g) = 1. The above corollary can be reformulated as D(f)/D(g) = Dg (f)£K is independent of the derivation D 1 0 chosen in Der(K). (3.6) Definition. Let K and E be as in (3.1). We define the space of E-valued differentials of K, in notation Diffk(K,E), to be the K-dual of the space Derk(K,E). Thus, an E-valued differential form of K
w: Derk(K,E) --.. K .
is a linear form
By Cor.l above, Diffk(K)
=
Diffk(K,K) is a vector space of dimension
one over K (with no canonical basis, so that we cannot multiply its elements, but we can divide any element by a non-zero one, finding a quotient in K).For example, each f€K defines a differential form df : D ~ (D,df) (3.7)
d:
K
=
D(f). This gives a k-linear mapping
~
Diffk(K) , f
~
df
One can check easily that this mapping is in fact a derivation D(fg) = fDg
(n,d(fg)
(D, fdg One has df
=
+
+
gDf = f
f~k.
g(D,df)
gdf> for every D ~Der (K)
0 exactly when D(f)
precisely when
+
=
0 for all derivations D, hence
This shows that Ker(d) = kcK. Although K and
Diffk(K) are one dimensional K-vector spaces, d is not surjective (as we shall see) because it is only k-linear (and not K-linear). A differen-tial of the form df (for some f £K) is said to be exact, so that d(K) is the k-subspace of exact differentials in Diffk(K).
126
- 11.53 -
There is no canonical derivation D in Derk(K), but d is a canonical derivation of K with values in Diffk(K). More precisely (3.8) Proposition. The canonical derivation d : K
~
Diffk(K) is
universal in the sense that every E-valued derivation D : K
~
E
can be factored uniquely through d , with a K-linear mapping AD : Diffk(K)
~
E , i.e. so that the following diagram is commuta-
tive
Proof. The uniqueness of the factorization is clear because d(K) , 0 generates the K-vector space Diffk(K) of dimension one. For the existence, take any XE:K - k. There exists a unique K-linear map AD : Diffk(K)
~ E
such that AD(dx)
=
D(x) (note that dx
f
0
generates Diffk(K) over K, and that AD depends - a priori - on x). Then it is clear that ADed is a derivation (over k). The derivation D - ADed.-:Derk(K,E) is zero on x by definition, hence is identically zero (unique extension of the zero derivation k(x)
~
E (3.3) ).
This proves that AD gives a factorization of D (and is independent of the special choice of x by the uniqueness already proved). Let f and ge:K with g not constant (g¢k). Then df and dg must be proportional in Diffk(K) with a coefficient in K. This coefficient may be computed by comparing one (any) value of these linear forms df/dg
=.
(D,df) / (D,dg) = D(f)/D(g) = Dg(f)
(0, De:Der(K))
.
In particular if we fix a transcendental element x E: K (over k) , df/dx = Dx (f) for every f E:K , so that the differential quotient df/dx coincides with the derivation (3.9)
with respect to x : d/dx
D is the formal derivative with respect x
to x.
127
- 11.54 -
By choosing generators x,y of K, we can assume that K = k(x,y) is the field k(V) of k-rational functions over a plane curve V defined over k. (3.10) Lemma. Let PE:V be a regular point and
lr= 1rp
a generator of
the maximal ideal Mp of the local ring Rp at P : ordp(n)
= D7( (f) E: Rp
df / d1t
for all f
=
1. Then
E. Rp •
Proof. The expansion of functions fE:K in Laurent series at P gives an embedding K
~
keen)) of the field K = k(V) into the field of
all formal Laurent series in
~
with coefficients in k (this is the
field of fractions of the ring k[(rrJ] of integral formal series in n and coefficients in k). The derivation term by term with respect to ~ of formal Laurent series gives a derivation
coincides with DE:Derk(K) on the element coincide with
D~
Tt"E
JD~€ Derk(k((n))) which
K. By (3.3),
on the algebraic subextension K of
k(~)
D7t must in keen)).
Since elements feR p are characterized by the fact that they have an image in k[l~]1 their derivative Dn(f)
k[[re]]. Also, one sees from D7['
=
IrK
that if
D~(f) will also be in 1t'
is another uniformizing
variable at P, that d1r' /drr = 1)7[(7t') e: R; is a unit of the local ring Rp : (dTt'/dTC)(P) Eik)C .
This lemma shows that d/d1t
=
D7t is necessarily continuous
on Rp for the topology defined by taking the powers M~ of the maximal ideal Mp as basis of neighbourhoods of 0 in Rp . Consequently, Laurent series may be derived term by term. (3.11) Defini tion. Let P be a regular point of V, and 1r such that
ord p (rr)
=
1.
order at P
.!i
~
tV =
ordp(U»
fd T( =
Diff (K) is a differential, we define its k ordp(f) (when defined : f f 0 or c..> f 0). E:
This is a meaningful definition because if
n' is another uniformizing
variable at P, dlt/drr' is a unit in Rp and thus and ordp(f)
= ordp(f.d~/drr')
. 128
tA)=
fd'fr
=
f(dn/drr'dlt t
,
- 11.55 -
Properties which follow immediately from the definition are (3.12)
a) ordp(lA>
+
w t ) ';p Inf(ordp(w) ,ordp(w t ) )
(with equality if these orders are distinct) , b) ordp(fw)
=
ordp(f)
+
ordp(W) for f t:K, WE:Diff(K) .
From the proof of (3.10) we can also see that (3.13)
c) ordp(df)
=
ordp(f) - 1 if ordp(f)
The points P e: V where a differential
W
r
0
(f
E::
K).
has order <0 are called poles
of wand the points P where it has order,> 0 are called zeros of
W.
(3.14) Proposition. Let K be the function field of k-rational functions over a non-singular plane curve V defined over k, and be a differential form of K. Then div(lU)
L
=
PE:V
ordp(w) (P)
E:
tA)
I: 0
Div(V) .
In other words, a non-zero differential of K has only finitely many zeros and poles. Proof. By choice, x is transcendant over k, so that V is not a vertical line in the (X,Y) coordinate system and dx rential of a uniformizing variable
d (x - x (P)) is the diffe-
at all P with non-vertical
tangent (this lasr set is finite by (1.10)). Thus if
fdx, we
W=
shall have ordp(w) = ordp(f) at nearly all points P and this is zero except if P appears in the divisor of f. (3.15) Definition. A differential form W€Diffk(K) is said to be abelian, or of first kind when it has no pole : ordp(w)
~
0 for
all points P ~ V (which is supposed non-singular). The abelian differentials make up a vector space differential'df can be abelian only when f
€
~t
k and df
over k. An exact =
0 (3.13).
An example of non-zero abelian differential is given by the following (3.16) Proposition. The dimension (over k) of the space
~1
of
abelian differentials of K is 0 if K is of genus 0 over k (K rational over k) and 1 if K is of genus lover k. Proof. Take first the case K = k(x) is rational over k. Then V is the 129
11.56 -
projective line
ku{~J
(identified with the X-axis). Because x - x(p)
is a uniformizing variable at all finite points P and dx = d(x - x(P)), we see that the condition ordp(fdx)
~o
for all finite points P
implies that fl::.k[x] is a polynomial in x. At the point at infinity, we have 'TtoO
ord~(x)
= -1 so that we can take the uniformizing variable
= 1/x. The n x = 1/rr 00
has order ordco(c.J)
~
,
dx =
-2 d Tr
-It06
oO
and
£J
-2 if frO. This shows
= f dx = - f ( l/TL) / 7t002d Tr
oO
=
~1
o.
We turn to the
case of genus one and suppose K = k(x,y) is the function field over the cubic of equation y2 = x 3 + ax + b (for some a, b
Eo
k). I claim
that w = dx/y = (l/y)dx has order 0 at all points P of the cubic V. This is obvious if P is not among the E. (i = 0, ... ,3 , points of 1
order 1 or 2) because then x -x(P) is a uniformizing variable at P and
0
r dP (dx / y ) =
0
r d P ( d (x - x ( P) ) J -
0
r dP ( y) =
0
r dP (x - x (P)) - 1 - 0 = 0
(because yep) 1: 0 at these points). At the points E. = (e. ,0) (i=1,2,3) 1
we ha v e
0
rd. (y) 1
1 and
0
rd. (dx) =
0 rd. (d (x - e. )) 1 1 1
1
= 2 - 1 = 1 whi c h
gives ord.1 (dx/y) = O. At the flex at infinity E0 (neutral element), we choose the uniformizing parameter ~00 = y/x 2 (2.21) (we could also choose x/y). Small computations give
dn~
3 (1/x 2 )dy -2(y/x 3 )dx = (2ydy)/2x 2y - 2(x 3+ax+b)/x .dx/y (-1/2 - 3a/(2x 2) - 2b/x 3 ).dx/y .
We can write this in the form wi th f
(3.17) This proves in particular
E
ord~(dx/y)
X
R00
and f (00) =
o.
=
-l .
Had we only needed this
order, we could have proceeded more expediently as follows (using (2.21) and (3.13)) ordco(dx/y)
ord~(dx)
-
ord~(y)
=
ord~(x)
- 1 + 3 = 0
but we shall use the more precise form given in (3.17). The conclusion of the proof is now straightforward : if ~'= f(dx/y) is an abelian differential form, we must have div(f) = 0, so that f 130
E:
k.
- 11.57 We are now able to give a purely algebraic definition of the residue of a differential form at a regular point P. (3 18) Proposition. Let K be the function field over the plane curve
V, and let P be a regular point on V. Then there exists a unique linear form
res p :
Diffk(K)
~
k
having the following properties a) resp(€A»)
=
0 if w£Diffk(K) wi th ord p (4))
o
b) resp(df) df
n
=
~p
for all exact differentials (f
be a uniformizing variable at P :
and wri te the Laurent series in f
= ~ a i 7ti
1<-1 The differential
0 E:
K)
ordp(f) for all f E::K
c) res p (T)
Proof. Let
~
+
a_lilt
~ =
fd~
+ f
1'['
=
ordp(~)
1,
of a function f £K as follows:
o
can then be written
L
+ f d1t a o1Li d1! o i<-l 1 The conditions b),a),c) in this order give then respectively eN
i
res p (7{ d rr)
res p d(1(i+1 I (i+l))
res p (f o d7'C)
o
=
0
(i< -1)
,
a_I (k-linearity)
resp(a_ l dn17t)
Any k-linear form having the three required properties will thus give
resp(fd~)
= a_I' and this shows the uniqueness. Conversely,
the existence is proved by defining of uniformizing variable
~)
resp(fd~)
= a_I (with a choice
and by checking that the three properties
are indeed satisfied. For the last one, write f (hence n
=
=
nng with g£R;
ordp(f)) and take the logarithmic derivative! This
definition is independent of the special uniformizing parameter selected by the previous proof of uniqueness. By definition, the residue of a differential form the regular point P €. V is the constant res-p (U))
131
E:.
k.
w of K at
- 11.58 -
(3.19) Definition. Let V be a non-singular (irreducible, plane) curve and K its function field k(V). A differential form
~€Diffk(K)
is said to be of second kind when resp(w) = 0 for all points P E: V, and of third kind when We shall denote by kind and by
ordp(c.»~-1
for all points P£V.
~2 the k-subspace of Diffk(K) of forms of second
~3 the k-subspace of Diffk(K) of forms of the third
kind. These classes are by no means mutually disjoint. For example a differential of first kind is also of second and third kind. Also note that by definition, exact differentials are of second kind. The following formulas hold: (3.20)
d (K)
c ~ 2 ' d (K) (\ ~l
=
{o} , !J 1 = 1;2 () t>3
(3.20) Examples. 1) We consider a non-singular cubic of (affine) equation y2 = x 3 W =
+
ax
+
b (4a 3
+
27b 2 'f 0) and we take the differential
xdx/y . Only the point Eo at infinity deserves special considera-
tion. We take there the uniformizing variable that x Tt2 = y2/ x 3 = 1 x
-2
~ 7t
+
+
a/x 2
2
M.o has no term in
xdx/y = (-2:n:- 2
+
+
'](
b/x 3 = 1 -1
+
1(4 U
E:
= y/x 2 , so
7t:
=
7(..
1
+
M~ . Also
. Using now (3.17)
v)d7t with some regular v€R-o'
which shows that xdx/y has no residue at
~
hence no residue at all.
This is consequently a differential of second kind on the elliptic curve V. 2) The differential form dx/ (x - x o ) (wi th any
X
o E: k) on the same
elliptic curve as before is a third kind differential. Indeed, ord oO ( dx/ (x - x o ))
ordoo(y/ (x - x o )) -3
+
2
+
+
ordoo(dx/y)
0 = -1
At a point P e: V where x (P) = X o ' x - X o has a simple zero if P 'f -P and a double zero if P = -P in which case y has a simple zero. This shows that the differential in question has at most simple poles and is therefore of third kind. More precisely we have seen that its 132
- 11.59 -
divisor is d i v ( dx / (x - x (P) )) = - (00) - (P ) - (- p )
+
( EI) +
(E 2) + (E 3)
·
The interested reader can check that its residues are given by res p ( dx / (x - x (P) )) = res _p ( . . .) = I , res 00(
• • .)
= -2 .
Another differential of the third kind is given by (x -x(p))-ldx/y when PrE 1. . Its divisor is indeed equal to the divisor of (x -x(P)) which has only simple poles when P is not on the X-axis. Its residues are given by
resp(~)
= l/y(P) and
res_p(~)
= -1/y(P) .
Now we restrict ourselves to the consideration of non-singular elliptic curves. (3.21) Lemma. Let V be a non-singular cubic of equation y 2 = x 3 + ax and let K be its function field key) (a,b e:k and 4a 3
L: resp(W)
PE:,V
+
27b 2
r
+
b
0). Then
= 0 for all differential forms of K .
Proof. Every f 4::K = k(x,y) can be written uniquely in the form f
= a(x)
+
b(x)y with rational functions a(x) ,b(x) e:k(x). We observe
then that for any point P£V, resp(a(x)dx) = res_p(a(x)dx) and resp(b(x)ydx) =-res_p(b(x)ydx), so that grouping these terms together resp(fdx) + res_p(fdx) = resp(f + f
f1'
)~x
= 2res p (a(x)dx) where we
have written fr = a(x) -b(x)y (~is the non-trivial automorphism of the quadratic extension K of k(x), corresponding to the symmetry p
~
-p of V). Then we can write
L. resp(fdx)
PE-V
But resp(a(x)dx) tion on the line Y
+
L. res_p(fdx)
-PE.V
2
L
PE:V
resp(a(x)dx)
resx(p) (a(x)dx) if we consider now a(x) as func0 (X-axis). We are reduced to proving the
lemma for a rational field. There we use the canonical decomposition of a(x) in principal parts at its poles
r~
and a polynomial (this
is an additive decomposition). As the lemma is satisfied for all these principal parts (only two opposite residues occur, with one of them at infinity) and for a polynomial (no residue at all), it is proved. 133
- 11.60 -
This proof of the lemma can in fact be generalized for any function field (over a non-singular plane curve with our definitions). Because div(dx/y) = Q , we see that div(fdx/y) = div(f) and so, all divisors of differential forms are principal divisors. They are
o.
in particular of degree
This is a special feature of elliptic
curves. One can check easily that the degree of the divisor of any differential form of the rational field k(x) is -2. In general this degree is 2g - 2 on a curve of genus g : it vanishes precisely for elliptic curves. We give now Abel's classification of differential forms on elliptic curves. (3.22) Theorem (Abel). Let K be a function field of one variable over k (algebraically closed and of characteristic 0) of genus one, and choose generators x,YE:K so that K
=
k(x,y) with y2=x 3 +ax+b.
Then every differential form w ~Diffk(K) can be written uniquely in the form df +
with fE.K,
o('~'
0(
x dx / y +
Yp e:k
(The components in the
A
,-
\ dx / y + '---
V' Y + Y (P) 0 PEV 0 P x - x (P)
dx y
•
~. 's are not uniquely determined, but this 1
gives a possible decomposition.) Proof. Consider the differential fA)
2
=
~-
L '(P
Pc::V
o
y + y (P) dx x - x(PJ --.-y
(with a summation on the affine part Vo v - {EJ of V). I claim that ~2 is of second kind. For this we observe that the zeros of x -x(P) occur at P and -P with corresponding y(-P) = -yep) : (y
+
y(P)) / (x - x (P)) has a simple pole at P and no pole at -P (if
-P ~ P). The function y-l(y +y(P)) takes the value 2 at P and the value 0 at -P (if -P f P) with multiplicity one. This shows that
134
- 11.61 -
the residue at P is 2 (no pole at -P if distinct from P). By (3.21) we see that the residue at infinity must be -2 (one could also compute it directly using (3.17) ). Now by definition,
w2 has no
residue at all finite points PEV o and consequently no residue at infinity by lemma (3.21). We are reduced to proving the theorem for second kind differentials. Then we wri te
w = "'2 = fdx/y wi th
f = a(x) + b(x)y, and by additive decomposition of a(x) and b(x) according to the principal parts of their poles (with polynomials corresponding to the
pol~
at infinity), we get a linear combination
of differentials xndx/y , xndx
n+l
d (xn+1)
, (x - ~)
-n
dx / y , (x - ~)
-n
dx
with positive integers n. The last term is an exact differential except for n = 1 when it is a third kind differential. The term before, (x - ~) -ndx / y, is of second kind for n ~ 1 (because x is an even function •.. ) and for n = 1 it is of third kind if E f e
i
(i=1,2,3)
and of second kind otherwise. If we start with a second kind differential, we may disregard the terms which are not of second kind in the decomposition, because they will eventually compensate each other. Let us consider the more typical term xndx/y (which is of second kind with a pole of order 2n at infinity Eo). To reduce it to the case n n
d(x y)
=
1, we make the following computation: nx n-l ydx + x n dy nx n - l (x 3 + ax + b)dx/y
~xn(3x2 + a)dx/y n+2 n n-l (n + f)x + a(n + ~)x + bx dx/y This formula shows how one can compute x n + 2dx/y in terms of n n-l n n-l x dx/y ,x dx/y, d(x y) . If n = 0 , the coefficient of x dx/y +
is 0 so that this term disappears. By descending induction, all these terms are brought down to combinations of exact differentials, and of dx/y , xdx/y . This gives the full treatment for the pole 135
at
- 11.62 -
infinity of Wz
·
Other poles can be treated similarly
or brought at
infinity after a change of variables (a translation of the curve). The uniqueness of this special decomposition comes from the fact
Ip
that the
are uniquely determined (they are the half residues of
and then an equal i ty
df
=
0(
xdx/y +
~dx/y
~)
would imply that f has only
one pole, a simple one at infinity, which is impossible in a field of genu s
~
0, un Ie s s d f = 0,
(3.23) Corollary.
rJ.
= 0, (3 = o.
~2/(~1 + d(K))
is of dimension one over k.
Proof. Indeed the differential xdx/y generates a supplement from
~l
+
d(K) in
mz ·
This classification theorem describes exactly the deviation from exactness for differential forms over a field of genus one. In particular, the k-codimension of ~2 in Diffk(K) is infinite. Let us now turn to the integration of differentials over an elliptjc curve defined over k
= [.
According to the preceding
classification, there are three kinds of elliptic integrals. (3.24) Elliptic integral of the first kind. Let us take an elliptic curve y 2 coefficients g2,g3 satisfying g23 tive complex curve V
= Va:
-
=
4x 3
- g2 x - g3 with complex
2 27 g3 ~ O. We consider the projec-
in the projective plane p2([), and we use
its parametrization by means of the functions
~ and ~' of Weierstrass
(3.25)
o and we consider differentiable paths (called simply paths for short)
136
- 11.63 -
on V which are images of pa ths in ([. Let , : [0,1]
---+-
a: be any pa th
starting at the origin 4(0) = 0 with end point z = 4(1), and let Z denote its image Z
= ~.,
as path in V. This path starts at the
flex Eo at infini ty on V and ends at a certain point P = Z(1) E:. V. Then we can compute the line integral
J:
dx/y
of the first kind
differential dx/y by pull back by means of Cf: (3.26)
L
dx/y
Z
Let
=
1
dP(u)/ P'(u)
{
=
i
~(1)
du
~
- Z;(O)
=
Z
•
WI ' W2 be a direct basis of the lattice of periods of these
functions of Weierstrass (corresponding to g2 and g3) and lt t ~ tWi (tE [0,1] ). The image of .fii under circuit Zi in V. The corresponding integrals linear path
JZi
(3.27)
dx/y
f
i
be the is a
= ~i
are the periods of the differential dx/y along the circuit Z. . 1 Thus the integral
JP
dx/y = Z + n 1Al + n (,)2 1 1 2 Eo is defined up to an integral combination of the periods (depending
on the chosen path Z from Eo to P on V. The mapping (3.28)
Z
•
J/x/
y
=
V --+
Z
a:: rJ
gives thus an isomorphism from the universal covering V of V to [. In this sense, the integral of the first kind on V uniformizes V. These formulas can also be written in the form (3.29)
or (3.29)'
iX(4t3_g2t-g3)-ldt
= (Arcp)(x)
00
JP(Z) (4t 3 - g2 t
- g3)-ldt
=
Z
00
f:
which are to be compared with (1 - t
2
) -ldt
= Arcsin(x)
137
J
Sin~
o
2
1
( l - t )-~dt
- 11.64 -
(3.30) Elliptic integral of the second kind. Since the differential form xdx/y has a pole at Eo ' we have to fix a starting point Po for our paths on V distinct from Eo We take P
f(Zo) with any point zoe:([ not in the lattice L
o
Then we take pa ths , If we call Z
in
a: -
L starting at z
=, (EJ -1
and ending at z.
o
= ~ot the images of these paths as before, then for
anyone of them
fZxdx/y
(3.31)
with the primitive
Ip(U)dU = ~
~ of ~
~(z)
-
~(zo)
defined in
(1
,
.1.18), and this is indepen-
dent of the chosen path' linking Zo to z (avoiding the lattice points). The periods of this second kind differential are easily computed by taking the translated paths n~
=
z
+
fi.
101
(we suppose
that Zo is so chosen that these paths avoid the lattice points) and their images in V, the circuits
.
z~
1
(3.32)
By (1.1.19) we get
fZ xdx/y ~
= ~ (z
0
+ 4J.) 1
~ (z
0
)
1
We can also give the less precise form to (3.31) corresponding to (3.29) ~(z) 3 _1 (3.33) J (4 t - g 2t - g 3) 2 td t ~(z) - e(z o ) p(zo) which is determined up to an integral multiple of the periods
r
~i
(depending on the chosen path connecting Po to P on V). (3.34) Elliptic integral of the third kind.
l.y+y(P) dx has two simple 2 x - x (P) y with respective residues +1,-1. We make a cut
The third kind differential poles, in P and E 0
S on V starting at Eo and ending at P. We consider paths' in
a: - Us~
=
~1 (V - S) and their images
Z
=
,·Z
in V - S. Then the
line integral of the considered third kind differential along Z
138
- 11.65 -
depends only on the end points zl
and Zz
= ~(l)
{(Z) of '(because
=
the period of the differential around the cut on V vanishes). Let
PI and P z be the images of zl resp. Zz in V.
L1:.. feu) -
L1.
(3. 35)
y + yep) dx = Z 2 x - x(P) Y
( 2
cfJ'(u) + p'(z) du p(z)
We have to go back to (1.l.Z3) where we find an expression for P(u) - P(z) which we derive logarithmically with respect to u and symmetrize. After replacing z by -z we find the expression 1
(3.36)
P'(u)
+
P'(z)
- ~ (u
Z p(u) - p(z)
+
~( z)
z) + ; (u) -
0-( u + z ) _ ~ ( ) 1 og a{u)cr(z) ~ z u
of which a primitive is
of this function can be determined in u
£
a: -
A uniform branch
U
and from there, an
Sw
"explicit" expression for the third kind integral can be given. However, we shall not pursue the computations any further, in particular, we do not compute the periods of this third kind integral. (3.37) Remark. It is interesting to write the bilinear relations in int egr a 1 form ( c f .
(I . 1 • Z0) ); '?1 t.UZ
~,dX/y ~ ,xdx/y 1
-
Z
1,
Zz
dx/y
- 'Iz CUI
i
=
Z1t'i
,xdx/y
g i ve she r e
2 IT i
Zl
If we decide to associate the intersection number +1 to the two cycles Z~ , Z; (in this order) on V (they have only one intersection point if they are chosen as in the figure (3.3Z) ) we see that we can ~xtend this definition to any couple Z,Z' of cycles on V (both are integral linear combinations of circuits) avoiding Eo by the formula
, Z" Z
~~I dx/y 21tl
P'z
1,xdx/y
~
fZ
,dx/y
i
XdX/ y } ·
Z
Thus, in some sense, the canonical bilinear form (I.z.l) has an interpretation
in terms of algebraic intersections: B(w, 4")
139
is the
- 11.66 -
intersection number of the two cycles on V corresponding to wand w' (or nearby cycles avoiding Eo ). (3.38) Elliptic differential equation. There is a way of defining elliptic functions by means of differential equations, which is essentially equivalent to the consideration of elliptic integrals of the first kind. The computations below will also show how an integral of the form
!p(x)-ldX with a polynomial P of degree 4 can be reduced
to a similar integral with another polynomial of degree of the type
~
3 (hence
(3.24) at worst). We replace the integral by the diffe-
rential equation satisfied by the inverse function and study thus (3.39)
(dx/dz)2 = P(x) , P polynomial of degree 4 •
We look for meromorphic solutions of this equation, with a complex variable z£[. Let us see what happens when we make homographic changes of function. For g ~SL2([)' define x' = xg = g-l(x) , x = g(x') = cx ax'f
+
+
b
d
if g
(a c
b) d
•
We can then compute dx / dz = ( cx'
+ d) -
2(dx' / d z) = J (g ,x ' ) (dx' / d z)
with the notation introduced in (1.3) for the Jacobian J(g,x') (applied here to complex unimodular matrices g). Then (3.39) gives the modified equation J(g,x') 2 (dx'/dz) 2 (3.40)
(dx'/dz)
2
=
(dx/dz) 2
= P(x)
P(g(x'))
-2
= J(g,x') P(g(x'))
It is natural to introduce the notation pg(x) = J(g ,xf2 p (g(x)) = (cx for the new polynomial (of degree
~
+
d) 4p(~~ : ~)
4) obtained. 1 t is then an easy
computation to check pgg' = (Pg)g' using the chain rule for derivation J(gg' ,x) = J(g,g' (x)) .J(g' ,x) pgg'(x)
=
J(gg',x)-2 P(gg'(x))
140
J(g',x)-2 pg(g' (x))
- 11.67 -
and this is (Pg)g' (x) by definition. Thus, if we put Tg (P) = pg
-f
for P polynomial of degree ~ 4 (with complex coefficients)
we define a representation
g
Tg of the group SL 2 ([) in the complex vector space of polynomials of degree less or equal to ~
four. This representation admits two classical invariants described as follows. If we write the polynomial P in the form 3+ 4 (3.42) P(x) = a o + 4a l x + 6a x 2 + 4a x a 4x , 3 2 we define 2 a o a 4 - 4a l a 3 + 3a 2 g2 (P) a
(3.43) g3(P)
0
a1
a
al
a2
a3
2
a
a 2 a3 4 (These two quantities play an important role in the algebraic determination of the roots of the quartic P(x) = 0, as in Euler's method, cf. Burnside and Panton, Theory of equations, II p.113.) A direct verification of gi(P)
= gi(pg) is rather tedious, and
some gain is obtained by checking it for the generators
(~-~),
(3.44)
(~ ~1),
(~~)
(tE:a:)C and a E:a: ).
If we write a~ for the coefficients of pg (defined with the factors I
4,6 as in (3.42), we see easily that (0 -1) implies a~ = a 4- i 1 1 0 so that pg has the same coefficients as P in opposite order. This g
leaves obviously g2 and g3 invariant. In the second case, - 1-les a g = t 2i-4 a Imp i i and again this substitution leaves g2 and g3 invariant. For the last g
= ( 0t
O~
~v
type of generators, we show the invariance under the Lie algebra of the tangent subgroup. In other words, if g
141
(~
~) we show that
- 11.68 -
the differentiable functions g2(pg) , g3(pg) of a have a zero derivative (at the origin or everywhere). For this we write the Taylor expansion of pg (x) = P(x P(x
+
a)
=
pea)
P' (a)x
+
+
a) :
+
p"(a)x 2/2
+ •••
and find ag o
pea) , a g
P , (a) / 4 , a ~
1
=
P" ( a) I 12 ,
p( 3 )ea)/24, a~
a~
=
From this we get g 2 (P g ) = 1.- (P ( a) p(4)( a) - P' (a) p(3)( a) 24 whence
p( 4 )ea)/24
+
1P" ( a) 2)
-!... (P' (a)p(4.>(a) - P"(a)p(J)(a) - P' (a)pCf>(a) 24
+
(3 )(
p" ( a ) p
a) )
+
0
The assiduous reader will check it for g3 ! Then we say that the polynomial pg is in normal form when it is given by pg(x)
=
4x 3
+
ax
+
b
With the notations introduced in (3.42), this gives a 4 = 0 , a 3 = 1 , a = 0 , a l = a/4 , a o = b Z and if we compute the invariants defined in (3.43) we get -b This shows that pg is in normal form precisely when pgex) = 4x
3
- g2 ep )x - g3 ep )
The important point is that if P has no multiple roots, it has a transform pg (for some unimodular complex matrix g) in normal form and g2 ep )3 - 27g 3 ep )2 phy of the form x
~
f
0 • To see this, we first make a homogra-
II (x·- a) which has the effect of reversing
the coefficients of P(x - a) (x -a)4 P(1/(x -a))
P(a)x 4
142
+
O
+
•••
- 11.69 -
Thus i~ we choose a root a of P(x), the term in x 4 will disappear in the transformed polynomial (similarly, a double root will give a transformed polynomial of degree 2 and a triple root will give a polynomial of degree I : in particular, if P has a root of order g2(P)
~3
= g3(P) = 0). Combining the latter homography with a suitable
linear transformation
x
~
a'x
+
b' will bring the polynomial in
normal form. Finally, we may consider substitutions of the form z ...--... z'
Az ,
X
t---+-
x'
= ) - 2x
They transform the initial differential equation
supposed in
normal form, in .
( dx' / d z')
2
=
4x t
3
-
hence we get a new polynomial &2
=
g2(P)
= ,,-4 g2 (p)
-4
-6
A g 2 (P) x ' - A g3 (P) ,
P
with invariants
• &3
=
g3(P)
=
6 A- g 3 (p)
This proves that if we can solve one differential equation (3.39), then we can solve all differential equations of the same type having the same invariant
(this invariant is finite if the differential equation cannot be solved by using the elementary trigonometric functions, i.e. when P has no multiple root). Differential equations (3.39) lvith same invariant j are essentially equivalent, and it is sufficient to solve one equation in each class , determined by a given j
E:
a: •
Coming back to the integral form of the differential equation, it can be seen that the meromorphic solutions admit necessarily two independent periods (we refer to standard textbooks for this point, e.g. Valiron, Theorie des Fonctions, p.422).
143
- 11.70 -
4. Analytic p-adic functions
We gather here a few facts on p-adic numbers which will be used in later sections. Let p be a prime number and
Zp
~ Z/pnz
be the ring of p-adic integers. This is a compact totally disconnected (additive) abelian group, and its elements admit unique representations a=Lapn
with
n~O n
O~an~p-l
Addition and multiplication of two such series is done naturally (these laws are completely determined on the finite such series and extend uniquely by continuity, but note that they are not the formal series laws in the indeterminate p because for example, the sum of two an's with sum
will contribute a higher power of pl. Because
~p
every integer mEll has a non-zero image in a
Z/pl'lz (for n sufficien-
tly large) we deduce that there is a canonical embedding (integers n )} 0 corresponding to polynomials in p),
o.
ring of characteristic ring. We denote by
~p
canonical embeddings
~
~
Zp
and tl p is a
It is easy to see that
~p
is an integral
its field of quotients so that we have two
Zp
~
Qp and
~ ~ ~p.
We identify
zp
with their images in Q . Elements of Qp admit representations p as Laurent series in p of the form and
~
a =
L
n;>n o If we denote by ord
anpn p
with
O~an<.p-l
the order of the Laurent series : ord (a) is p
the smallest index in the Laurent series of a such that an ; 0, the topology of
=
p
is also given by the norm
= p-ordp(a) ~R: p Zp is the closed unit ball in Qp lal
Then
~
la/
defined by lal ~l (it is also
open and defined by laf
- 11.71 -
We could also have defined equivalently of
~
~p
, resp. zp , as completed
, resp. Z , for the topology defined by the p-adic absolute
value lal p = p-vp(a) where vp(a) is the power of p in the decomposition of a as product of prime powers (and a sign). One sees easily that
~ fl Zp is the localized of
Z(p)
= {min: m,neZ and (n,p) = I}
Z
at the prime ideal (p) , and thus
also the completed of the local ring
~(p)
Zp is
topologized by taking
the powers (p)k = (pk) of the maximal ideal (p) = pZ(p) as basis of neighbourhoods of 0
€~(p)
.
Let K be any finite algebraic extension of absolute value of
~
p
~
p
• The p-adic
has an extension to K (because the additive
valuation v p ord has an extension) and in fact a unique one p because K is a finite-dimensional normed space over the complete field
~
p
,and any two norms on such a vector space are equivalent.
The existence and uniqueness of an extended absolute value lead to the following expression. Let a€K and L the normal extension generated by a (in an algebraic closure ijpof Qp). Then for every absolute value 1... 1 on L and every automorphism x ~
Ixrl
cr of L over ~p ,
is also an absolute value on L, so that Ixl=
uniqueness, and this shows that if we denote by N
NL/~
Ix, p
by
the norm
of L, and by n = [L : ~J the degree of L, then the extension of
I... '
to the element a is given by the explicit formula
lal = IN(a)/l/n In particular, we see that the extended absolute value is invariant under all automorphisms over
~p
, and this proves that these auto-
morphisms are continuous for this norm. Because each element of
ijp is contained in a finite extension of ~p , it follows that ijp itself is a valued field. 145
- 11.72 -
But note that Q is of infinite degree over Q : by Eisenstein's p p n criterium, the polynomial X - p is irreducible over ~ p hence any of its roots will give an extension of degree n (arbitrarily large) of
be any root of the preceeding equation. Its absolute value must satisfy Inl n = Ipl = lip, hence ITt I = p -lin · If K ~p
. Let
1t'
denotes the field obtained by adj oining
1T
to (Qp : K =
~p
(Tr), this
shows that the value group \K~I in R: contains the subgroup p(l/n)Z
=
{pm/n : me: Z
lC
R:
Letting n tend to infinity, we conclude that (4.1)
It would be easy to show that this value group is exactly pQ , but we only need to know that it is dense in val of
R:
R:
(any non-empty open inter-
contains infinitely many absolute values Ixl with x €~;
).
Being of infinite dimension over ~p , ~p is no more locally compact and not complete for the uniform structure determined by its absolute value (it is a metric space). We denote by J1 p the completed field of
~
p
. By construction (or definition) the absolute value of ~
J1p
has a unique extension to tant to know that
J1 p
(and
111;1
p
= 1~;lcR:). It is impor-
is algebraically closed, hence is a "universal
field" (corresponding to [ which is also complete and algebraically closed), and it will play the role of the complex field for a transcendental study of p-adic elliptic curves. This result follows from the general lemma : (4.2) Lemma. Let K be an ultrametric valued field and
'K'
its completed
field (this is still an ultrametric valued field). If K is algebraically closed, so is Proof. Let P
---
of
'K'[x]
=
P(X)
l' . =
I:a.X i be a non-constant unitary polynomial 1
(a i e:I( ), and look at the fini te extension L of K defined
by adding a root of P to I( : 146
- 11.73 -
with the canonical image
~ of X in the quotient ring (a field).
The absolute value of ~ has a unique extension to L (again because L is a finite dimensional vector space over the complete valued field ~ ). Now fix
f >0 and take a uni tary polynomial P E: K[X] of
the same degree as P and wi th coefficients
a.1
E::
K close to the
r·
corresponding coefficients a i of P : lai ail < Because K is supposed to be algebraically closed, all its roots ~.1 lie in K and we can write P (X) =
IT (X
Then on one hand P(~) =
P(~) =
-
~i )
n(~ - ~
(P - P)
(~i E: K)
•
i) and on the other
(~) = L (ai
- a ) i
~i
so that we get the following estimate for the absolute values
This implies that for one i at least
I~ - ~il < as soon as
o
(o,l/n < e
is chosen according to
argument shows that ~
(;i e:K)
,
d < f.n/M1. The preceding
belongs to the closure of K : ~
E
i< .
Consequently J( is algebraically closed. As we shall use several ultrametric properties of
Jlp ,
we
recall some of them briefly here. An ultrametric space in general, is a metric space equipped with a distance d satisfying the stronger inequality d(x,z) < Max(d(x,y),d(y,z))
for all x,y,z .
Let us introduce the following convenient notations and terminology. The closed ball B'(a) of radius r and center a is defined by r d(x,a) ~r. The open ball Br(a) of radius r and center a is defined by d(x,a)
=
r. (Note the
perversity of the language, because all these sets are both open and 147
- 11.74 -
closed in an u1trametric space, as will follow from the next observations
~)
If b£Br(a), then Br(b)
= Br(a)
because
d(x,a)
=
d(x,a) can be thought of as the distance of x
and the ball Br(a) (again, we can replace Br(a) by B;(a) if x ~B;(a)). Consequently two disjoint balls Br(a) and Bs(b) are at a well defined distance from each other : d(x,y)
=
d(a,b)
is independent from x€Br(a), yE:Bs(b) .
If two balls have a common point, the smallest one is contained in the biggest one, and in particular, two balls of the same radius having a non-empty intersection coincide. We note also that if rand s < r, then B' (x) is contained in the sphere of radius s r : B'(b)c.U (a). These spheres are thus open sets and B'(a) being s r r
d (x, a)
the union Br(a) U Urea) is open. Similarly Br(a) = B~(a) - Urea) is closed. The situation concerning Cauchy sequences is also particu1arly simple : a sequence (x n ) in an ultrametric space is a Cauchy sequence exactly when d(x,x 1) ~ 0 (a dream of youth), and in n
n+
a commutative complete ultrametric group, a family (x n ) is absolutely summable exactly when x n ~ 0 These properties are in particular true for the complete algebraically closed valued field
~
, and they enable to make a
very simple theory of analytic functions on 11
p
. The Cauchy integral
cannot be defined in this context, and all proofs have to be derived by direct reasoning on power sequences (the handling of which is fortunately much simpler than in the complex case).
148
- 11.75 -
First we prove a p-adic analogue of the classical theorem of Rouche (for polynomials).
(4.3) Theorem 1. Let K be a finite algebraic extension of f,gE:K[X] be two polynomials with f(O) has n (~1) roots on the unit sphere and that If(x) - g(xll< £n for all
g(O)
U
= 1.
~p
,
Suppose that f
of K = ijp (or of .1\» , l with ,xJ~l (i.e. xe:Bi(O) ), U (0)
X6K
where 0 < (. < 1. Then g has same number of roots as f .2..!! U and they are moreove~ equally partitioned among the closed balls B~(a) of radii
£
contained in U (a
£ U) •
Proof. Let us number the roots a.1 of f by taking those on U first. Then decompose f as product of linear terms n
IT
f(X)
(1 - X/ail
i=l and similarly for g
m
IT
g(X)
TT
(1 - X/ail
,
IT
(1 - X/b]_)
•
la.1~ " 1 (1 - X/be) J
j =1
Ib jl J' 1
Now observe that for x on the unit sphere U, i.e. lxl
={
1 if lail
lx/ail
=
=
1 , we have
>1
'ail
-1
>1
if
Thus for x E: U one has
m
B Tf
j=l
wi th two constants
11 -
A ~l, B ~ 1 •
x/b-I J
We now look at the balls B; (a i )
(disj oint if distinct) centered at the roots of f in U (1 and choose d> t
close to
(,
~
i , n)
so that the balls BS (a i ) are distinct
(or disjoint) at the same time as the corresponding smaller B~(ai). We let x vary in the region (annulus) D1 = B cr ( a 1)
- B; ( a 1 ) c U (c
149
K 0 r .ft p )
,
- 11.76 -
e
· Because B1(a i ) is disjoint from BS- (a l ) if B~ (a i ) -; B~ (a l ) , we infer that 'x - a i J '> S' ~ (, for
defined equivalently by
those i and we get
n
IT Ix
If(x)l = A
i=l
- ail
all< f
>
>
A~n
gn
for x E:D I •
But If(x) - g(x)/< £n for the same x, hence by the ultrametric property, necessarily
If
(x) I = Jg (x)1 for x
€ D
I
• More precisely, still
for x£D I ' if we denote by n l the number of roots of f in ITlx - a. I
1"=1
1
IT
=
If(x) I
This shows that
n'pe -
Ix - a. I 1, =/x -all
a"~B'(a ). 1~ ( I
,
~
= Allx - al(l
a. v
b j e:.
I
=(a i -all for x
e: DI
' and similarly if
I
B~
,
1,
ml denotes the number of roots of g in B£(a l ), we get ml g(x) = Bllx - all for x €D I (note that Ix - bjl = Ix - all
I
B~(al)
if
(a l ) ). Since we know that f and g have same modulus on DI '
we get Ix - all
n - m l
l
independent of x in DI ·
Taking x,x'E:D I with distinct distances in ]f,cf[ from a l (this is possible because the set of absolute values is dense in ]~,E[ proves n l = ml · This shows more generally that f and g have same number of roots in all closed balls
B~(ai).
For any other ball
B~
n
contained in U, the inequality If(x)l>c will still be valid in B£ hence )g (x)1
=
If(x)' in B~
and nei ther f nor g have roots in B£
This proves the statement completely. We can now study power series on a formal series f(X)
= L. anx n~O
If
n
, n Lanx converges for some x
np .
Let
f
E:
.12p [[X]] be
(an E: IIp) n
Eo .1L
X
p
, this means anx
n
---+-
0 , or
J Ixl n ~ 0 and consequently la )Iyl n --.. 0 for every y l:: 12 in n n " p the closed ball Iyf ~ lxl and the series~will converge normally on
la
that closed ball (absolutely at each point and uniformly on the ball). 150
- 11.77 -
(This attractive situation is well-known to fail with ordinary complex power series!) Under these circumstances, f defines a mapping B', x I (0)
11p · If all coefficients a n lie in a finite
~
sub-extension K of
~p
, and if y is also in K with
'Y/~lx(,
the
value fey) will also be in K because this field is complete (and all partial sums are in K). The whole theory is based on the fact that the modulus If(x)f can be given more or less explicitely, at least as far as x is not on certain exceptional spheres. We have to remember that, in a finite (or infinite) sum, a term alone of maximal size (absolute value) carries all responsability for the modulus of
I
the whole sum ! For instance, if a o to, necessarily If (x) = lao I in a neighbourhood (an open disc) of x o. More generally, if n is the order of f
~
0 at the origin, i.e. the smallest index i
with a i ; 0, we shall have )f(x)I = lanllxl n in a neighbourhood of m x = o. If however f(X) r a Xn , the size of a term a x (m >n) with n m n am.r 0 will necessarily overtake the size of anx for lx' large enough. This leads, to the notion of cri tical radius. (4.4) Definition. Suppose that the power series defined by f converges in an open ball B = BR(O) with some positive R>O. Then a R radius 0
Obviously, ·a cri tical radius occurs in l.n.~ I because wi th the p
notations of the definition, r
=
Ia / a 1 1 / (n -m) m n
Cri tical radii can be ordered in increasing sequence 0 < r l
There is a very useful notion connected with critical radii. (4.5) Definition. Let r be a critical radius for the power series defined by f. Then the corresponding critical index n r 151
~
- 11.78 -
definition the biggest index i with lail r
i
M~ J
j !ajlr .
If rl,r Z' .•• denotes the increasing sequence of critical radii of f, we denote by n1,n Z' ••. the corresponding (increasing) sequence of critical indices : n. = n r · 1 1 the definition, it is obvious that n· (4.6) 'f(X)1 = la n .( r 1
. By
the remarks preceeding
ri
I
critical (exceptional) spheres. On a critical sphere, we can only assert an inequality
(4.7) This has as obvious consequence the following proposition. (4.8) Proposition. If f is given by a convergent power series (in some open disc) with f(O)
~
0, then the zeros of f can only occur
on the critical spheres (having a critical radius). More striking will be the result that f has indeed zeros on all critical spheres (and in fact only finitely many on each sphere). The p-adic analogue of Liouville's theorem is also a direct consequence of (4.6). An entire function f : !lp ~ Jl p is a function given by a power series which converges at all points x~Jlp • Thus entire functions are defined by those power series having n an satisfying lanl = o(r- ) for every positive number r. (4.9) Proposition. If f :
n
--+
coefficien~
Jl is an entire function with
P P some majoration of the form If(x)l.,; M IxlN for all x€
Jlp
(with a
positive constant M and a positive integer N), then f is a polynomial of degree smaller or equal to N. Proof. If If(x)l.,; M Ixl
N
is true throughout
Jl p
»
then (4.6) shows
that there is no critical index ni>N. This implies that all an wi th index n > n i vanish and proves the proposi tion.
152
- 11.79 -
We have to consider a slightly more general situation, namely
=
that of Laurent series. Let thus f(X)
00
•
2:c.X 1 be a formal series
-00
1.
with coefficients c. = c.(f) in a fixed finite extension K of 1.
1.
~
P
(the reader will observe that K could be replaced by the universal domain ..Q.p if wedid not want to prove rationality conditions) . We suppose that f(x) is convergent in a non-empty open annulus and we denote by
... < r_ 1 <
ro
<
< ...
rl
the sequence of critical radii of f (this sequence could be empty, but if f converges for all x
€~ ,
it can be empty only if f is reduced
to a monomial : more precisely, if there are infinitely many nonzero coefficients c. with positive indices i, the sequence of critical 1.
radii will be unlimited at the right, whereas if there are infinitely non-zero coefficients c i with negative indices, the sequence will be unlimited at the left). Let also
... <
n_ l
< no < n l < ...
be the corresponding sequence of critical indices. By definition, n i is the biggest integer n with lan'r~ maximal. Similarly, if we define m.1. to be the smallest integer n with fa n 'r~1. maximal, we have obviously mi = n - l · We shall fix our attention to one particular i critical radius which we take to be r o = 1 (changing of variable X if necessary). Multiplying f by a suitable monomial we may assume
(4.10)
n
c
o
1 , lC i
>0
I~
1
(strict inequa1i ty for i
We have then
(4.11)
1 If(x)1 = {\X , nifl..
<0
and for i
>
n) .
x r_ l
To fix ideas, we suppose that there are infinitely many coefficients -1
c i ; 0 with negative indices i (otherwise we would change X to X
153
,
- 11.80 -
the case of a polynomial in X and X-I would be trivial in the considerations below), and we take truncated sums
f~(X)
L
=
c.X i
wi th M~ 0 < n ~ N
1
M~i~N
,
c
Mf 0
Let us introduce the roots a = ai(M,N) of the polynomial f Nj cMX M M i
f~(X) '" c MxM/T(1
(4.12)
Because the coefficients of f
o~ ri
=
N
M
- X/a i )
are the same as those of f for
i ~ n , the critical radii ri of f~ satisfy r~l '" r_ 1 ' r~ '" r o r l ' and in particular, the roots a i not on the unit sphere,
satisfy either 'ail ~ r_ 1
(4.13)
or
(ail ~ r l
With (4.12) we can give the following expressions for the modulus of the function f :
If~(X)1 '"
(4.14)
I M/I I c
{
M
n
I xl a 1· / if
x la /
r_l
i
IcMllx l M IT
lx/ail i f
lail ~1 But in these two regions, the leading terms are c = 1 and c o n respectively
(as for f) and (4.11) is valid for f~ ,fN(x)1
=11 n
if
/xl
M
r_l
if
1
< Ix 1< r 1
Comparing these two estimates, we get immediately the following
-M = 1M I = (4.15)
n
Ic M,
lail
lail
(ail
<1
<: 1
number of roots (ail = 1 (independent of M and N)
n
Now we let -M
number of roots
=
IMI and N tend to infinity and consider the varia-
tion of the roots ai(M,N). I claim that if they are properly numbered (considering i as fixed) they make up Cauchy sequences. Take indeed
another bigger truncated f~: with M' that lf~(x) - f~:(x)1
<
£n
<
M and N'
>
N , and suppose
for lxl '" 1 . Call simply ai the roots of the polynomial corresponding to f N', . Using (4.15) for this M 154
- 11.81 -
new truncated, together with (4.12), we get
If~: (x) I = 1 l}
11 - x/aj
I
=
IT Ix -
J Substituting the root a of f N for x, we get i M ~J ~ n
TT
1
la i - aj I =
~J<:n
I
for Ixl
1
I
lf~: (a i ) - f~(ai) < £n ,
I
hence for one j at least la i - aj <: the limit roots a~ = lim a. (M,N) E:.Q 1
aj
1
e: • P
Because the roots a i
E:
K.
. Let us introduce a notation
for the normalized polynomials having these roots:
g~(X) =
11 (1
- X/a (M,N)) i
= IT (1
, g(X)
- x/a"i)
•
These are polynomials of degree n. First step. The limit roots a~ are roots of f. Indeed we have 1
I
N = ) (f - f ) (a. (M, N) ) I ~ Max Ic . (f) J · M 1 1 i(M 1 i)N Letting M-+ -00, and N-+ oo , and using the continuity of f (remember , f ( a . (M, N) )
that ICi'~ 0
for
\il~oo) we get f(a i )
o.
=
Second step. For every automorphism ~~Ga1(K/K) and every root a i
of f~ , ar
is still a root of f~ (which has coefficients in K)
and moreover
la~1
=
Jail .
This proves that the set of roots a i
on the unit sphere is invariant under
g~ €
K[X]
will have coefficients in
~,
K.
and by Galois theory,
Because the roots of g~
make up Cauchy sequences, the coefficients of g~ (which are symmetric combinations of roots) will also make up Cauchy sequences in the complete field K. This proves g e: K[X] and consequently the limit roots a~ 1
(roots of g) are in the algebraic closure K of K. QQ
Third step. Then f = g·h where heX) = L:c.(h)X - -
-00
1
•
1
has no root on the
unit sphere (r = 1 is no more a critical radius for h) has coef-
ficients ci(h) e K and converges for all x N
Because gM(X) = I
+
•••
±
(TT ai)x n -1
£11; . Write f~
has all its coefficients in
the ring of integers R of K defined by K
155
Ixl
~
I (and so does f),
- 11.82 -
the coefficients Ci(h~) £ R will also be integral (Gauss' lemma). K For each fixed i, the coefficients Ci(f~) and Ci(g~) converge in RK · Thus the coefficients Ci(h~) converge in RK ' and this proves the existence of the formal series h, with integral coefficients. Using reduction mod the maximal ideal MK of R defined by Ixl<:l, we get K AI,v"'" ,.., the equality f = gah • But f and g are polynomIals of degree n
,...,
and constant term unity. Hence h
•
and h has only one coefficient
= I
of absolute value one : r = I is not a critical radius for h. In fact, more precisely, we have the inequalities Ici(h)1 ~ ~up lCi+j(f)I
IC i (h)l
i~O
J~n
~ lei (f) 1
i ~ 0
because one can check these inequalities for the truncated polynomials (g: has two extreme coefficients of unit modulus) and they will remain valid at the limit. We can then state the main result of this section. (4.16) Theorem (L. Schnirelmann). Let f(X) --
. = -00 Lec.X I be a formal I ~
Laurent series with coefficients c. in a finite extension K of Q 1
We suppose that f(x) converges for all x
£12; . Then,
--
P
.
f(X) can be
written in the form f(X) = cX
k
IT
'0<\< 1
IT
(1 - «IX)
(1 - X/O()
Icfl~l
wi th fini te non-empty seOts of roots
0( £
K occuring on the cri tical
spheres of f. Gathering these roots of given modulus together, we get a representation f(X) = cX wi th polynomials gi (X)
E:
k
IT
i
K[X]
t· (X) IT
i;,o
1
or
g. (X) 1
1 i (X) E: K[X-Ij
having the same roots
as f on the critical spheres of radii r i ' c £ K , k£ ~ • Proof. As has been seen, we can remove the critical radii of f one after the other, by dividing by polynomials (in X or X-I).
156
- 11.83 -
Then we observe that the infinite products
TT
10<1<0
(l - «/X)
=
IT
i
<0
g. (X). 1
IT
10(1 ~O
IT
(1 - X/«)
.0 1>
XE:.n; , and hence, going to = heX) TTg. (X) IT g. (X) with a convergent
are convergent for all write f(X)
g.(X) 1
the limit, we can
power series i
of the form cX k with a certain rational integer k
£
Z and a constant q.e.d.
ce:K.
If f is as above, and L is any finite extension of K, then for every x
K is
~ L~,
f(x) will belong to the complete field L. Because
the union of such finite (complete) subextensions L we get
(4.17) But the theorem shows that conversely if a £ f(x)
= a are also in K : consider L
=
K the
solutions of
K(a) and the formal series
f - a which has all its coefficients in L. All its zeros are in
K.
(4.18) Corollary 1. If f is as above, f determines mappings f :
rc --..
K
,
j()C ---+
K,
..n.)( ~..n
p is contained in K (f can Then the inverse image -;. (K) of K in p take algebraic values only at algebraic points).
n)(
p
(4.19) Corollary 2. If f has an essential singularity at the origin (infinitely many Laurent coefficients ci(f) ; 0 for i
~
0), then f
takes all values a £.K (infinitely many times) in all punctured balls Bt (0) - {OJ of K (where £,.0 is any positive number) (p-adic analogue of the big Picard theorem). Proof. With the assumptions of this corollary, f has an infinite and sequence (ri)i
infinitely many times in all punctured neighbourhoods of f by f - a (where a
£
o.
Replacing
K is arbitrary) gives the announced result. 157
- 11.84 -
= L:c.x i
(4.20) Corollary 3. Let f(X) be a formal Taylor series f(X) with coefficients c.1
~
i~O 1
Co 1 0 , converging in an open
K , f(O)
ball Br (O)cJl p . If f does not vanish in this ball, the formal power series for l/f will also converge in Br(O). Proof. By the above theorem, f has no critical radius in Jxl Hence, if we assume Co ICil t
i
1,
<1
But the inequalities Icil
as one checks
If(x)l
for
~ r-
t i
= 1
for Ixl
< rand
.
determine a subgroup of
immediatel~ using
< r.
1
+
X'K[[XJ1 ,
the ultrametric property of the
absolute value. For reference we formulate explicitely the following (4.21) Corollary 4. in the whole of
Ii
x l lp ,
f(X) is a formal Laurent series converging k and if f has no .zero, then f(X) = cX is
a monomial in X . In particular, there is no holomorphic function in the whole of
Jl p
playing the role of the exponential of the classical function theory. In fact it is possible to define an exponential
r xi/., e X -_ L1. i,;,o x+y with the formal property e
. e X· e Y . As· consequence, e x IS
inversible (with inverse e- x ) for every element x
€12p
in the domain
of convergence of the series. This proves that the exponential series has no zero, no critical radius (and in particular cannot converge in the whole of
11 p ).
We introduce the following notations )(
HK : ring of p-adic holomorphic functions on 12p
defined by convergent Laurent series with coef(4.22)
ficients in K field of p-adic meromorphic functions on
Jlxp
defined over K (field of fractions of H ) . K 158
- 11.85 -
By Schnirelmann' s theorem, a meromorphic function f E::M
K
has only
finitely many zeros and poles in a closed annulus
o
<: r
=' Ixl
~ r'
<
00
and their multiplicities are well defined. Still by the same theorem, we see that if f has a zero of order d
a
at the algebraic element
a ei(x , then it will have zeros of the same orders at all conjugates a~ of a over K (and similarly for poles). Conversely, given any set
of elements aE.K)C with multiplicities d a e 71 satisfying the preceding two conditions, it is possible to construct a canonical convergent Weierstrass product, defining a meromorphic function f eMK having precisely the zeros (or poles) d a · These data
a~K
with respective multiplicities
determine f up to a multiplicative factor of the
form eX k (c € K, k e: 7l). This is what we shall use systematically in the next section. (4.23) Remark. Only for simplicity have we assumed that the characteristic of K is 0, in our study of p-adic analytic functions. The results are true in general (i.e. also for formal power series fields F ((t)) over finite fields of characteristic p) as one checks q
easily. Only at one point does one have to be a little bit more careful in the proof of Schnirelmann's theorem, namely in the second step on p.IT.81. The Galois invariance property only implies that the coefficients of g: are purely inseparable over K. But if a root « of f: has order of inseparability pr over K (i.e. if r is r the smallest .integer such that O(P is separable over K), its multiplicity will be a multiple mp
r
N
N
of this order in both f M and gM · This shows that the coefficients of g: will be separable over K. N Hence gM £
K[X]
as in characteristic
159
o.
- 11.86 -
s.
Tate's p-adic elliptic curves
In a first attempt to define p-adic elliptic curves, one is .1l p /L (instead of ([/L) wi th discrete
led to consider quotients
subgroups L in the additive group of extension K of
~
p
11p (or in a finite algebraic
). However, this theory is soon recognized to
collapse through lack of candidates! If tV
E:
L (a subgroup of a p-adic
field), then pn and
W + W + lU +
W
I pnwl
implies
1"
W
•••
E: L
= lteJl/pn --.. 0 (n-+oo)
= 0 if L is discrete. Thus, the only discrete additive
subgroup of a p-adic field is the trivial one L
=
{o}.
In the classical case, the normalized exponential transforms a lattice L subgroup
= 7l + 1: 7l
q71 = ~(~71) (wi th q
we should define q2 = ~(~)
~
: a:
~ ([~
(1m (1:) > 0) into the mul tiplica ti ve
~(T)
E ([)(,
\q I < 1). (Strictly speaking
to keep perfect coherence with the first
chapter. However, this would lead to using throughout even powers of q, and the reader will easily convince himself that this new normalization is more natural.) But now, there are plenty of discrete subgroups (of rank one) in each
O~~'
11;
(or in KX , with K as above). In fact,
generates such a subgroup qZ, and we shall concentrate
. . 11.)(/ our attentlon to t h e quotlent p q7l
(or to K~/q~). This means that
we shall work with Jacobi's point of view viz. p-adic theta functions. p-adic case
Complex case
no p-adic analogue
([/L
~
1
no exponential available
normalized exp
([>C
I q71
.n;/q71 : p-adic elliptic curve.
160
- 11.87 -
We repeat the definitions (5.1)
n;
HK : space of holomorphic functions on wi th coefficients n c n E: K (f(X) =\c L- n X converging for O<\xl
MK
space of meromorphic functions on ~ given by quotient
of two functions f , g ., 0 in HK . Thus, by section 4, a meromorphic function fIg ~ M has a canonical K Weierstrass expansion a f/g(X) = cX k (1 - a/X) d a (l _ X/a) d
IT
lal~l
IT
\al>l (for a € K) satisfying
with rational integers d a ~ ~ a) only finitely many multiplicities d a are not 0 in (5.2) any annulus of the form 0 < r'~ lal~r"< ~ b) for a 11 a
iM' and
cr E: Gal (K/K)
d
= d (f a a Then for q E: r= and )ql < 1 , we define the subfield LK = LqK of MK as being the field of functions satisfying f(qX) = f(X) : E:
,
LK = Li = LK(q) : f ~MK and f(q-1 X) = f(X) (X € RX ) . This field will be shown to be an elliptic function field over K. (5.3)
Every function f ~
Li
(we shall say q-elliptic function or simply
elliptic function) has a well-defined divisor over KX/q~ satisfying the condition (5.2.b) above. The group of all such divisors will be denoted by (5.4)
Iq7l )
-)C
DivK(K
Iq~ ) ,
-x
c Div(K
and the divisors of this subgroup (satisfying (5.2.b)) will be called rational over K . Purely formally first, we put Eq = Eq(K) = K)(/q71 We want to show that this set can be identified with the set of (K (5.5)
points on a non-singular cubic (elliptic curve) defined over K . As we have already pointed out, the whole study is based on theta functions.
161
-)
- 11.88 -
(5.6) Definition. A theta function S (relative to the subgroup q~) is -w
a meromorphic function S € MK ~ K having a q-periodic divisor div 8 -d E: DivK(E q ) (this means that the set of zeros and poles of 8, together with the multiplicities, is invariant under q~, and thus defines a divisor on I
Eq ). If S is a theta function, and S (X) a theta function, and div S' theorem, S'(X)
=
S(q
=
-1
X), then S
I
is also
div S. Consequently by Schnirrelmann's
=
c-1(-X)dS(X) , which means (c ~ K~ , d €~) .
e(q-lx) = c-l(-X)dS(X)
If we define e " (X) = c , Xk S(X), we see immediately that e"(q-lX)
=
(cqk)-lC_X)dSIt(X)
and since SIt is the most general expression of a theta function of divisor equal to that of S,
this proves that the class of c in K'J
is well defined by the divisor div 9
= ~ ,
independently of the theta
functiQn chosen with this divisor. Because every divisor -d € DivK(E q ) is the divisor of a theta function, we can define two homomorphisms dq
DivK(E q )
-+
d
(5.7)
~
DivK(E q )
K)t:/q'l
~
d
~
d
dq (d) -
)
4>q (~)
=
c mod q71
(5.8) Proposition. The mapping d
q is the degree homomorphism, and ~q is the Abel-Jacobi homomorphism which associates to the divisor da ~ = da(a) , the element c = TTa in K~/q~ (if the group law on
r=
KX/q~ was noted additively, this element c would be Abel's sum I:daa). Proof. Let us start with the basic theta function
eo (X) = ~ (1
- qnX) lrli(l - qnx-l) , n>O (this is the theta function 8 with the notations of Jacobi). Its 4 divisor is the only point 1 (mod q~) with multiplicity 1, representing (5.9)
n~O
a simple zero of this function at all points of q~ : div 8 0 = (1) . 162
- 11.89 -
One checks easily 8 (q-lX)
-X8 0 (X) whence
o
~qCCl)) =
dqCCl)) = l , For any a e: K
X
Then
,
1 mod qZ •
let us put SaCX) = Soca:-lx), so that div Sa =Ca).
SaCq-lX) =
and so
~qCCa)) =
dqCCa)) = 1 ,
a mod qZ .
~ d a (a) € DivK(E q ) is any divisor on Eq (rational over K), the theta fun'ction 8 da has divisor ~, hence one gets a
If -d
=
L-
IT
L d a · dq (8 a ) = L~ da IT
d q (d) -
=
deg_d
mod qZ
as asserted. (S.lO) Corollary 1. A divisor form ~ = div f
~ (d)
€
e DivK(E q )
is principal (i.e. of the
Li ) precisely when
d q C~) = deg ~ =
0 and
1 mod qZ .
=
'f'q -
for some f
~
(S.ll) Corollary 2. If a £ KX and b is not conjugate to a mod qZ (~ K), there exists a principal divisor and db
with multiplicities d a
=
1
0 .
=
Proof. Consider first the case a ~
~
6
KXand take for d the divisor
= (a) - (au) - ((v) - (vu)) with u,V
€
KX chosen so that all a, au,
v, vu, b are distinct mod qZ (K~/qZ is infinite). This divisor is principal because it satisfies the two conditions of corollary 1. When on the contrary a ¢: K)C , let d = [K(a) Then -a d
=
K] ~ 2 and N = NK(a) /K(a)
~ (a~) is rational over K and of degree d. Moreover
~qC~a) = N .
If now u,v
e: KX are chosen such that N = ud-lv and
u, v, b, ar are all distinct mod qZ , then d
=
d -a
satisfies the conditions of corollary 1, so is principal. As usual, for LK(~)
=
~ ~
DivK(E q ), we define the K-subspace
Li(~) eLi by div(f) ):: -~ (with the convention div(O) ~-~
163
€
K)(.
- 11.90 -
for all divisors
~).
These are finite dimensional vector spaces over
K and the fact that Li is an elliptic function field will follow from the following more precise result. (5.12) Theorem (Riemann-Roch). For every divisor positive degree d '70,
~
€ DivK(E
Q) of
= d .
dimKLK(~)
Proof. If d = 1, a reduction which we have used repeatedly brings us 1 1 o. Then back to the case d = (a) has only one multiplicity d a corollary 1 shows that LK«a)) = K is of dimension one. Now, we can use induction on d = deg _d
> 1. Put a = ~ '1'q (d) _ and select b
E: K~
(db=O)
distinct from 1 and a mod q~. Define -d' = -d - (b) £ DivK(E q ) of degree d' = d - 1 > O. By induction hypothesis L(~') is of dimension d' (over K). As it is the kernel of the linear mapping f ~
feb) ,
L(~)
~
K
J
it is sufficient to show the surjectivity of this mapping. But the divisor (a)
o at
+
(d-l)(l) -
b : feb) 1
o.
~
= div(f) is principal and has multiplicity
This proves the theorem.
Note that if K' is a finite algebraic extension of K, then "K' = HK·K' and so MK, = MK·K' (if fIg E MKt is a quotient of elements of "K' with g ~ 0, amplify this fraction by the conjugates of the denominator so as to be reduced to the case g £H K : then use f € "K' = HK·I ' ). Choosing a basis (e i ) of K' over K, write any f E: Li' C MKt in the form f f(q-lX)
=
L
fie i with some f i Eo MK . Because £(X) and the linear independence of the e. (K' is linearly 1
disjoint from MK over K, e.g. because we may assume K'normal over K), x'. we infer fi(q-1X) = fi(X), i.e. f i ~ This proves =
Li .
Li'
Li·
Then the theorem above applied for K' instead of K shows that dimI,Li, (~) = deg ~ This proves that one over
K).
Li
if
deg ~ /" 0 .
is of genus one over K
(Li· K = \t,Li'
In particular, Li is (isomorphic to) 164
is of genus
the field of
- 11.91 -
K-rational functions over a non-singular (absolutely irreducible) cubic defined over K. 1 contend that moreover Eq is the set of normalized valuations of Li·K trivial over K : (5.13)
Eq ~ XCV) ~ VCR) a ~ ord a ~ Pa = center of ord a where orda(f) is defined transcendentally with Schnirelmann's theorem as the multiplicity of the zero (or pole) of f at a. That these mappings are. injective is obvious (V is non singular). The surjectivity of the first one (or the composite) is easily seen : if P ~ VCR) was not in the image, then take a finite extension K'
= K{P)
of K
such that P € V(K'), and f £ L ,(2(P)) not constant. This leads to a K contradiction because Schnirelmann's theorem shows that ordaf < 0 for some a
€
Eq • Moreover, because V(K) is the set of points fixed
by Ga1(K/K), XK(V) is the set of valuations which are invariant under all
(f'"E:
Gal(R/K) and in the isomorphism (5.13) we see that the inverse
image of V(K) is precisely (5.14)
K)C/q'll.
=
Eq(K) C E q
Eq{i()
= i..>c/cl .
From now on, we shAll identify E with the corresponding non-singular q
cubic curve V and write for example E (K') instead of V(K l q
)
for any
extension K' of K . The main problem left over is that of determining the invariant of the elliptic function field Li over K, as a function of q in the (punctured) uni t disc
Iql < 1
at the Weierstrass functions
of K)(. This will be done now by looking ~
and
P'
as functions of the p-adic
variable q (instead of the usual variable z).
165
- 11.92 -
To find the p-adic analogue, let us come back to some classical computations regarding the lattice L
By defini tion
1l + 1: 1l.
f(z:-r)
=
function of Weierstrass, relative to the
~
z-2 +
L.
f
(z-m-n )-2_(m+n )-2}.
(m,n)r(O,O')
with a normal convergence on all bounded sets not containing lattice points. We sum this series by keeping first n fixed and using the classical formula
(Tt/sin1tz) 2
all rational integers m
I:
~~).
L.
=
(z - m) -2 (summation extended over
° we
For n 1
L.
{(z-m-n-c) -2 - (m+n-c) -2) =
get
L
(z-n-r -m) -2 -
m m because both series converge, and the sum is then
(n-r +m) -2 •
m
(1t/sinn:(z-n~))2 _ (1t/sin1tl11:) 2
(5.15)
The terms corresponding to n
z- 2
+
L {( z -m) - 2 miD
o
give similarly
\ -2 m-2} =L(z-m)
0/)
-2 [-2 m
m
1
(TC/sin7tz) 2 - .; /3 Gathering this expression with (5.15) for all n 1 0, we get (5.16)
~(z :-c)
=
L . (TC/ sin K( z+n-r)) 2
- rr.2 / 3
L (tt/ sinlUl'c) 2 n,O
-
Now we introduce the new variables
x = ~(z)
(5.17)
= e
2niz
and
e
q
2 Tti 1:
We see that and thus P(z:1:)
(21ti) 2 =
(
P(X)
+
1/12 -
L qn/(1_qn)2]
ni0
2rci) 2 p ( X)
with (5.18)
[Ln qnX/(l - qn X) 2
= P(X:q) =
L
00
qn X/(1- q n X)2 + 1/12 - 2Lq n/(1- q n)2 I
We can still express the last term in this expression slightly differently. Using the binomial formula, we get indeed
166
- 11.93 -
n -2 = 1 (1 - q)
+
qn(l _ qn)-2
qn
2q n +
3q 2n
+
2q2n
+
+
•••
3q3n
+
so that
=
where we use the (5.19)
and
5
=
(q)
k
mo~e
L
L--mqm/(1 - qm) m)l general notation
~
rnkqrn j (1 - qrn)
=
L..
lTk(N)qN
(kE::N),
N>l th ~k(N) denotes the sum of the k powers of all divisors of N . m~l
Thus we have (5.20) To express the differential equation satisfied by this function P we note that ( 21ti) -1 d / d z
and
50
p' =
=
=
X d / dX
D
(say)
(2~i)2(2~i)DP = (2~i)3DP
. 6
(21tl) (DP)
Vl' 2
2
r
=4
r3
g2P - g3
and finally (DP) 2' = 4p 3 _ g'P - g3 with 2 , -6 -4 &'2 = (27[i) g2' g3 = (2lti) &3 From (5.20) we also get (5.21)
(5.22) and from the classical expressions given in the first chapter (1.4.2-3) (5.23)
,
240s 3 (q))/12 , g; =-( 1 - 504s S (q))/216 j = (12)3 g 3/ 6 (12)3(g~)3j~1 = q -1 + c(n)q n . 2 n)O q TI (1 _ qn) 24 1:1' = (21r)-12b. g2
=
(1
+
L
,
n~l
(look at (1.4.5.b) for the last formula). The advantage of these expressions is that the factors
~
have been
cleaned out, that their coefficients (integers divided by some powers of 2 and 3 : 216 = 6 3 ) have a universal meaning, and in particular
167
- 11.94 -
make sense over p-adic fields. Returning now to the p-adic case with
o
of
q
E.
K , Iql <:1, we define two q-elliptic functions by (5.20) and
(5.22) respectively noting that these series converge due to the fact
that their general terms tend to O. The first one has a pole of order 2 at 1 (mod q~) whereas the second one has a pole of order 3 at the
same point (and they are regular elsewhere). These functions must generate Li over K (by the Riemann-Roch theorem). In fact all their coefficients are in
~(q)
c K. The complex identity (5.21) gives a
formal identity in X and q with coefficients in Z[1/6Jc~ cK (5.23), hence will also be satisfied p-adically. This proves in particular that the absolute invariant j = j(E q ) is given by the convergent series (5.24)
j(q)
= l/q
+
744
+
~c(n)qn
(c(n)
E: Z)
n~l
having the same coefficients as the classical one. Because the c(n) are integers, their p-adic absolute value is smaller or equal to 1, and
[:c(n)qn will itself be a p-adic integer (the subring of integers
n;,O in K, defined by
lxl <; 1, is compact, hence complete) and we infer
from that, that j(E q ) cannot be a p-adic integer: Ij(E )( q
(5.25)
In fact, for any j
E:
:>
K with
I
tjl > 1,
there exists a unique q E: K with
Iql < 1 and j = j (q). This follows from the next lemma which uses essentially the ultrametric property of the absolute value of K. (5.26) Lemma. Let f(X)
cn
E: K
satisfying Jcnl
~
= I:cnx n be a formal series with coefficients
n)O 1. Then the mapping x
~
l/x
+
f(x) defines
a continuous bij ection between the punctured uni t disc 0 < Ixl < 1 and the exterior of that disc Ixl
>1
.
Proof. First we observe that the series converges for
Ixl < 1
by hypo-
thesis of integrality of the coefficients c n • Hence x ~ x-I + f(x) is well-defined and continuous in 0 < Ixl < 1. Let us check now the 168
- 11.95 -
injectivity of the mapping. If x-I + f(x) = y-1 + fey) we derive x-I - Y-1 = f (y) - f (x) =
(y - x) (xy) -1
L
c (yn - x n ) n
n~l
and hence because we assume
Ixl
L
c (y - x) (yn-l +... +x n - l ) n < 1 ,Iy' < 1 all terms yn-1+ ... +x n - l =
n~l
are in the unit disc and so is their sum:
This implies indeed Jy - xl = 0, y = x. To show the surjectivity, we fix any ye: K with \yl>l and we solve y = l/x + f(x) for x by iteration. This equation is equivalent to y
(1
=
+ xf(x))/x and also to
= y-1(1 + xf(x)). Thus we define inductively a sequence (x.). 0 by 1 1~
x
-1
0, x i +l = Y (1 + xif(x i )) · By induction one checks (and more precisely Ixil = Iy -11 for i >1). Then
X
o
~J
<1
-1
x i +l - xi = Y (xif(x i ) - xi_lf(x i _ l )) y
n n '-- c (x. -x. l)(x. +... +x. 1) n 1 11 1-
-1 \
n~O
hence by iteration IX i • l -
xJ'IY-~~l
- xJ
= rilxll
showing that (xi) is a Cauchy sequence (we are in an ultrametric space ! ) . The limit is the required solution. We know that two elliptic curves over K are isomorphic over
K
if and only if they have the same absolute invariant j. We may ask questions about K-isomorphism classes of elliptic curves. More precisely,
,
we may ask : when are the two elliptic curves (gi ' gi £ K) y2 4~3 - g2x - g3 • y2 = 4x 3 - g;x - g; with j = j' isomorphic over K (When do they give rise to K-isomorphic function fields?). The answer is given by (2.36). When the &i'S are not 0, a necessary and sufficient condition is that " g 3 = t 2 (g2/ g 3) with some t g2/
,-4
€
,
(indeed this is equivalent to &2 = t g2 and &3 169
K t
-6
&3).
- 11.96 -
For the curves defined by a Weierstrass equation with gZ g3 f 0 we define thus the relative invariant
1 =
(s.Z7)
>l:Z
mod K
-~(gZ/g3)
·
This is a well-defined element of K~/K~Z, independent of the particular Weierstrass equation chosen with coefficients in K which (together with the absolute invariant j) characterizes completely the K-isomorphism class of the elliptic curve. The reason for the choice of the factor
-~
will be apparent below. To compute the relative invariant
of Tate's curve Eq , we need a lemma. (5.28) Lemma. Let K be a p-adic field, x 1 + 4x is a square of K : 1 + 4x
E
E:
K with \xl < 1. Then
>eZ
K
Proof. We start with the formal series expansion (1+4X)1 = 1 + !(4X) + !(-I)(4X)Z/Z! + with coefficient of Xk (up to the sign) given by 1.3-s ... (Zk-3)Zk/k!
=
z.
1·3·s ... (Zk-3)·Z·4·6 ... (Zk-Z)/(k!(k-l}!)
~ (2(k-l))
(Zk-Z)! = Z k!(k-l)! But (2;)
eC:_-l1)).
k
k-l
Z (k-l)) = (Zk-l)-k (2 k-l
is an integer
and k is prime to Zk-l (e.g. because Zk - (Zk-l) = 1) so that k must divide
2
This proves that the coefficients of (1
all integers: (1 + 4X)1
=
power series expression Because both 1
+
l:-a Xn (a n~O
n
L.a n x n
n
+
4X) 1 are
€~) which gives a convergent for the square root of 1 + 4x .
Z40 s 3 and 1 - 504 s 5 are of the form 1 + 4x wi th
Ixl
~
=
+1 2i~(1
+ 2405 )(1 -
3
5045 )-1 5
=
9y 2 ,
and is a square. We have chosen the factor -1 precisely to trivialize
170
- 11.97 -
the relative invariant of Tate's curve. We sum up our results in the main theorem of this section. (5.29) Theorem (Tate). Let q be such that 0
< Iql <
1,
~
K (p-adic field of characteristic 0)
and Eq be Tate's elliEtic curve
r:.>
Then
a) Eq is defined over K with set of K-rational points Eq(K) isomorphic to K>C/q71 . b) The absolute invariant of Eq is given by the convergent series
HE q ) =
q -1 + 744 +
L
c(n)qn
n~l
(with the coefficients c(n) of the classical q-expansion of j), so that in particular
Ij (E q )/ > 1.
c) The relative invariant .. 1 : · Eq over K 1S tr1v1a
Conversely, for every \ j
1 E:
y=
I> 1
-!g2/ g3
of a Weierstrass equation of
Kx2 •
in K, there is a unique curve Eq wi th
invariant j. Observe tha t because \ j
I > 1,
j
is never 0 nor 12 3 for Tate's
curves, and consequently neither g2 nor g3 vanishes. Then, our definition of the relative invariant is meaningful. For characteristics p ; 2,3 this theorem is valid without modification (the only denominators appearing in (5.23) are powers of 2 and 3) because Schnirelmann's theorem is valid in all characteristics. In characteristic 2 (and 3), some other normalizations have to be adopted (as is explained in the book by P. Roquette on Analytic Theory of Elliptic Functions over Local Fields). (5.30) Remark. The expression (5.20) for P(X) could essentially have been derived from the basic theta function 8
o
defined in (5.9) in
the usual way (this is the method chosen by Roquette). First, the logarithmic derivative gives the analogue to the ~ function of Weierstrass. Taking the derivative once more gives substantially the P function : 171
- 11.98 -
P(X) - 1/12
+
2s (q) l
The fact that we do not get exactly the P-function comes from the definition of 9 0 without convergence factors (they are needed in the classical theory in the () -function, but are superfluous in the p-adic case).
172
CHAPTER
THREE
DIVISION
POINTS
Division points on a complex torus play a role analogous to roots of unity on the circle. In particular, there are plenty of them, and if the torus is given as cubic with rational coefficients, their coordinates are algebraic numbers, hence their algebraic interest. By taking suitable limits of groups of division points, some canonical p-adic spaces are attached to the curve, which, formally at least, are similar to the tangent space at the origin Lie(E) Tp (E)
= Hom{R,E)
Hom (~p/llp ,E)
is of
dimensio~
two
free Zp-module of rank two
two p p Apart from the basic definitions and properties, we have also indicaVp(E)
Tp(E) ~. ~p
is of
dimension~
ted some applications. In particular, if L is a sublattice of an imaginary quadratic field, we have proved in two ways that its invariant j(L) is an algebraic integer. The first one is the classical analytical one, whereas the second one (Tate) uses division points through
t-adic representations of Tate's curves.
Prerequisites for this chapter are still quite limited. For the sake of simplicity we have only treated Hecke correspondences of prime level, so that we have to use the fact that a quadratic imaginary field always contains an element of prime norm.
173
- 111.2 -
1. Division points in characteristic zero
Let us start with some p-adic preliminaries. Every p-adic number x (0 ~ q
<
Qp has a well-defined polar part (x) of the form q/p pn) defined by the property
~
x E <x)
+ ~p
= (x)
(equivalently x
mod
~p
n
) .
La pm (0 ~ a < p), then (x) is the finite sum L.a pm . m m m
=
is locally constant on Qp (it is constant on the additive cosets of Zp ) . This proves that
x
~
(x)
x ...-.-.. ~p (x) = ~<x> = exp (27ti <x») is a (non-trivial continuous) character of Qp. This character is called Tate's canonical character of
~p
In particular, -p e identifies ~p/Zp with the subgroup ~(p-NZ) C[1 C [~ generated by the ~p.
roots of 1 having as order a power pn of the prime p. (1.1) Proposition 1. For x -
£I y
~ ~p(xy)
. Then x
E:
~
Q , let p
--
x defines
x be
the character of
~
p
defined
an isomorphism of Qp onto its
(Pontryagin) topological dual. Proof. The homomorphism x ~p(xy)
1 for all y
E: ~p
~
x is
injective because
x=
0 means
or xy €Zp for all y, hence x = O. Its
image is dense in the character group by duality, because if y £Q p is orthogonal to all characters x, it implies as above y = O. Let us see that it is bicontinuous. The continuity of x Conversely, let us show that
x~
0 implies x
~
x
is clear.
O. Let Vr(R) be the
~
neighbourhood of 0 in the dual of Qp defined by li(y)
-11
= I~p(xy)
1
-11 < r
for
IYl p ~
R
11
Fix u E: ~ p wi th -p e (u) ; and choose r ~~ Ie so that if x E. Vr(R) -p (u) then x-lu = y cannot be in the ball of radius R (by the definition of
174
- 111.3 -
Vr(R) just given) when R -+
and
00
the image of
~p
x
Ix -lulp > R
Ix Ip
and
x -+ 0 . As consequence,
e: Vr (R), necessarily
under x
x is
~
. This shows that
a locally compact subspace of this
dual and must be closed (because it is a subgroup). This proves the surjectivity of the mapping. (1.2) Corollary. If we identify
~p
with its topological dual,
I p is
its own orthogonal, and the dual of this compact group is (isomorphic to) the discrete group
~p/Ip.
o --+ I p --..
~p
In other words the exact sequence --. ~p/Zp --+- 0
is autodual, i.e. gives by duality the same sequence
t
o
+ - ~p/~p
....- ~p +- I p
0
-+-
Proof. The assertion on the orthogonal of Z
p
follows directly from the
fact that the kernel of e is ~p. For the remaining assertion, just -p use the general fact that if H is a closed subgroup of the locally compact abelian group G, then the dual
to the quotient G/H general result
i
co~ld
Hof
H is canonically isomorphic
(of course in our case G
= ~p' H = Zp and this
be derived more directly!).
(1.3) Proposition 2. There is a canonical isomorphism In other words, the p-primary component of ~
~/Z
~/~
--+ $ p
~ /~
p
p
.
is (isomorphic to)
II . P P
Proof. The composite homomorphism Q hence a homomorphism
~/~
~ ~p/~p
homomorphism ~/~ ~ TT~ /~ p
p
p
~ ~
p
~ ~ ~
P
P
is trivial on I,
. The product of these gives a
whose image is contained in the direct
$ ~ I~' (the subgroup of families having only fini tely nonp p p zero components) because a rational number has only finitely many
sum
primes dividing its denominator. hence belongs to
~
p
for nearly all
primes p. The injectivity follows from the fact that for x x
£
~p
(when considered in
175
~p)
for all p
~
x
~ ~
€~
•
- 111.4 -
Finally the elements q/pn€ ~/~ hence generate the component
~
p
t;
have images integral at all
p
IZ p .
~.4)Proposition 3. The automorphism group of (~/~)n is canonically
isomorphic to
IT Gl (~ ) p n p
.
Proof. Because automorphisms respect the decomposition in p-primary components, we are reduced by Prop.2 above, to determining the automorphism groups of (~ II )n. But by transposition, the automorp p phism group of (~ I~ )n is isomorphic to the automorphism group of p
p
= GI n (Z p ) because every automorphism p p of the group ~n is ~-linear, hence also Z -linear by continuity. In the dual
~n . But now Aut~n) p
p
fact, every matrix in Gl n with p-adic integral entries determines a ~p-linear mapping ~n --. ~n leaving stable the subgroup Zn, p
p
p
hence
an automorphism of the quotient. This gives explicitely the isomorphism of Prop. 3 . Now we go back to our business and let k be a field of characteristic 0, E = E be an elliptic curve over k given in Weierstrass k form (or simply a non-singular projective plane cubic over k with one selected rational point over k). We denote by teE) the torsion subgroup of E, and for any integer N by NE)
~
1, by tN(E) (or more simply
its subgroup of elements x having an order divisible by N :
N·x = 0 € E. For example we have already seen that t (E) ~ (71/Zll.) 2 set of ramification points of ~: E 2
-+- pI
,
t 3 (E) ~ (Z/3Z) 2 set of flexes of E if in Weierstrass normal form (cf. (1.1.26) and (11.2.9-10) ) . We are going to prove a more general result. (1.5) Proposition 4. For any integer N of division points satisfying N·x
=
1, the subgroup tN(E) C E
0 is isomorphic to (~/~)2
(N-l~/~)2. The full torsion subgroup is
176
~
- 111.S -
Proof. We may assume k is of finite type over the prime field
~,
hence we may choose an embedding k ~ [ and with this choice E c E[ • By the transcendental theory, E[ is isomorphic to a complex torus
[/L with a certain lattice L c [ and in particular
It remains to see t(Ea:) integers N
> 1.
C
E = Ei( , or equivalently tN(Ea:) c E for all
But N·x
=
0 is the equation for an algebraic variety
V over k (which might be reducible), and we have seen that V[ fl Ea:
is finite. For any automorphism under
~
~£
Aut([/k), V and E are invariant
(because defined over k), hence
(V[ n E[)
r
= V[ n Ea:
This proves that these intersection points can only have finitely many conjugates under the automorphism group in question. These coordinates must thus be algebraic : tNCE([) = V[ () E[ c Ei( and this completes the proof. Observe that the proof shows that the field generated over k by the coordinates of the points of tN(E) is a Galois extension of k, and the same is true if we consider only the first (or second) coordinates of the points of tN(E) (in such a statement, we consider the affine coordinates of the points not at infinity if the curve E is given in Weierstrass normal form; we could also consider the projective coordinates, normalizing one of them to 1). If E and E' are two elliptic curves as above, and if h is a homomorphism E
~
E', then it induces a homomorphism, still denoted
by h, between the corresponding torsion subgroups
177
h : teE) ~ teE').
- 111.6 -
For example, this gives a representation of the endomorphism ring of E : End(E) --.. End(t(E))
= TTMZ(1 )
p
p
(cf. Prop.3 above). In practice,
it is more conceptual to let the endomorphisms of teE) act on a vector space over
~
p
, and for that purpose, to introduce the action
on the topological dual of teE) (or any isomorphic space) and to extend the scalars to
~p.
There is a canonical construction due to
Tate which we explain briefly. pG
G.
x n and p x n (xn)n>O € Tp(G) means that pX n + l for all integers n ~ o. This is a Z -module because t neG) is a
o
Let, for a moment, G be any additive p-divisible group We define the Tate module (1.6)
T (G) p
By definition, x
= l~
t neG) p
n
=
module over Z/p~ = ~ p Ip~ p a canonical isomorphism
p and lim ~ Ip~
Hom(p-~~,G)
+-p
p
p = ~ p . Because there is
=t
(G) pn another possible definition for Tate's module would have been -n (1.7) l~ Horn (p ll/ll, G) Tp(G) Horn ( 1im --..,.. p -Ilz IZ , G)
Horn (~ I'll , G)
P P
We could still replace G by its subgroup G = t (G) of elements p pOD having a power of p as order (p-primary component of G) in this last formula. Now we consider the projective sequence of homomorphisms of multiplication by p ... ~ G ~ G ~ G p p p We put VP (G) = Vp(Gp ) = lim(G ~ G ) +-p p and call it the extended Tate module of G (or G , relative to the (1.8)
p
prime p). By definition Tp(G) C Vp(G) is the submodule consisting of sequences x
= (x n ) with Xo = 0, and the projection x
178
~
Xo onto the
- 111.7 -
o~ component gives an exact sequence
o
Gp ~ 0 . Because each x ~Vp(G) is such that pmx has O~ component pmxo ~
Tp(G)
~
Vp(G)
~
0
for m sufficiently large, we see that
U
V (G) = p -m T (G) p m~O p
and because Tp(G) has no torsion (multiplication by p is the shift operator), we have (1.9)
Vp(G) = Tp(G)
~ ~p
II
p
In particular, this extended module is a vector space over
~
p
.
Let us take in particular for G a torus OR/~)n ~ ([l)n. Then (1.10)
Tp(G) ~ HOm(~p/7lp,R/71)n ';;;' HOm(~p/zp,a:1)n =
(top.dual of~ II )n ~ ll n p p p (and this is also isomorphic to Hom((~ II )n,a: 1 ) ~ top.dual of t 1lt
P
P
~
(G)).
This is a free Zp -module of rank n and consequently Vp (G) is a vector space of dimension n over ~ p in this case. These spaces can be looked at as p-adic analogues of the tangent space (elements of order p are "closer to the origin" than elements of order p2, in the
= ~,~ p =
algebraic sense). For p Voo(G)
=
Hom(R,G)
=
~~
= R and we could put
Lie(G)
To come back to our case, we let G
,
E
=
E be the group of
k
k-rational points on our elliptic curve. There are two canonical representations attached to the space Vp(E) (2-dimensional over
~p).
The first one is a representation of the ring of endomorphisms of E (1.11)
End(E)
~
End(Vp(E))
and the second one is a representation of the Galois group of the algebraic (separable) closure of k over k (1.12)
Gal (k/k) --.. Aut(V (E)) . P
These two representations are the main reason for introducing the
179
- 111.8 -
p-adic modules Tp(E) and Vp(E).
In particular, the Galois module
Vp(E) is isomorphic to the vector dual (over
~p)
of the etale cohomo-
logy group Hl(E) defined by Artin-Grothendieck. It could thus be p
called first p-adic homology group of E. In a particular case (transcendental invariant j), the image of Gal(k/k) in Aut(Vp(E)) will be determined explicitely in the next section. From a somewhat different point of view, let us add a few general considerations on rational division points over an elliptic curve E defined over a number field k, i.e. those with coordinates in k. More precisely, we can show that the k-rational torsion subgroup t(E k ) of E is finite (this would follow from the Mordell-Weil theorem asserting that Ek is finitely generated, but we prove this corollary directly by a local method). Let (10) be a prime ideal in the
1
ring of integers
~k
of k, which we assume prime to (2) for the sake
of simplicity. We denote by K
=
k?
its
r
-adic completed field, by
R the ring of integers of K and by P the maximal ideal of the local ring R. The announced result will follow from the following local uniformization theorem for
~
-adic elliptic curves.
(1.13) Proposition. Let E be an elliptic curve defined over the p-adic field K, say by a Weierstrass equation y2 = 4x 3 + ax + b with integral coefficients a,b
£
R. Then there is an open subgroup
U of EK,isomorphic both algebraically and topologically to the (additive) group of integers R of K. Proof. The neighbourhood in question on the elliptic curve E will be defined by x large, or l/x small, and because infinity is a ramification point of index two for x we shall take t = l/x l as local uniformizing variable. To be able to do that, we have to check that l/x is indeed a square in our p-adic field K. But
180
- 111.9 -
and this is a square in K as soon as x is big, say x -1 ~ the formal series for (1
+
~,
because
X)l is convergent in ~ (the denominators
of its coefficients have only powers of 2 in their denominators, hence are in R : we use the fact that
1
is prime to (2) here, otherwise
the convergence radius of that series would be smaller). Thus let us put x = t- 2 , whence y 2 = 4t- 6 + at- 2 + b = t- 6 (4 + at 4 + bt 6 ) and l/y = :!:t 3 (4
+
at 4
+
bt 6 )-1 = ±(t 3 /2
+
higher order terms)
is given by a power series with coefficients in R. More precisely, the coefficients of this series are in
~[l,a,b]~ R
(polynomials in
a and b with rational coefficients having only powers of two in their denominators). Let us now define by a formal term by term integration z(t) = d; (t 3/Z + ••• )(-Zt- 3 )dt = -t + •••
J
J
th coefficient of that series, nan Now if we call an the n-
E
7l[l,a,b] ,
because the integration has introduced the denominator n. This series has same convergence radius as that giving l/y because n ord x-n = nord x - ord n ~ 00 and so Ixn/nl =lxl n /n --. 0 if lxl< will find t = -z + bzz Z + b 3 z 3 + b i recursively as polynomials in the
if ord x > 0 1 . If we solve now for t, we and the equations giving the
b j (j < i) and a k (k ~ i) (compare with the expansions of the
show that n!b n e ~[l,a,b] logarithm and the exponential functions). Thus the series giving t = t(z) will converge in the same disc as the exponential series t(z)
=
I:
n>l
£~ zn n.
Pn{a,b) ~ Z[l,a,b] .
But the exponential series has a non-zero convergence radius as follows from the well-known formula ord (n!) = n - Sen) p
p - 1
if Sen) = [no1 for n = [n.pi 1
181
- 111.10 -
I cIa im now t hat z
t-+
(x ( t ( z) ) , y ( t ( z) ))
= (( t
( z) - 2, - 2t ( z) - 3 , ...)
is a group isomorphism in the domain of convergence of these series. But this assertion amounts to a lot of identities between the coefficients Pn(a,b) of t= t(z). To check these identities, we choose an embedding a
~
A,b
B of
~
then we just observe that
~[!,a,b]
into the complex field [ •
they resul t from the classical (complex)
theory of elliptic curves, where they are true formally (i.e. when A and B are indeterminates) because we could choose A and B transcendental, independent. This proves the polynomial identities and the isomorphism in the domain of convergence of the series t = t(z), where all p-adic series have a meaning. (1.14) Corollary 1. Let j
€
K be fixed. Then there is a constant M.
such that the order Card t(E K) defined over K of invariant j.
J
Mj for every elliptic curve E
~
Proof. The projective space p2(K) is compact because it can be covered by the three compact charts, images of the compact sets
Jlj
=
{ex i ) i = 0 , 1 ,2
: xi E: R , x j = I} ·
This implies that the closed subset EK is also compact. On the other hand EK has an open subgroup U isomorphic to R,hence without torsion. Consequently is finite Now the K-isomorphism classes of elliptic curves of invariant j are parametrized by the finite sets K)'C/(K~)2
if j
~
0,1728
(relative invariant
K)&/(K~)4 if j
1728
K~/(K>c)6 if j
0
(l[. 5 . 27) )
,
This gives the uniformity of the bounds for the orders of t(E ) for K fixed invariant j.
182
- 111.11 -
(1.15) Corollary 2. Let E be an elliptic curve defined over the number field k. Then the order of the rational torsion subgroup t(E k ) is finite. Proof. Observe that with the above notations t(E ) k
C
t(E K )
(1.16) Remark. It has been conjectured for some time that the finite number of rational torsion points Card t(E k ) on elliptic curves defined over k is bounded by a constant Mk depending only on the number field k and not the elliptic curve E over k (with
Mk ~ ~ for increasing k c k = ~). Manin first proved a weak form of that conjecture, showing that for any prime number p, the th component of that order is bounded (uniformly in E defined over pk). Recently, the conjecture in its strong form has been proved by Demjanenko.
183
- 111.12 -
2. An t-adic representation of a Galois group
Let E be an elliptic curve defined over a field k of characteristic ~(j)
o.
If j = j(E) is the absolute invariant of E, necessarily
c k. We suppose that k c [ is embedded in the complex field.
For any integer N
~
1, we denote by tN(E)
t of E having an order dividing N : N·t module of rank 2. There is a canonical
=
NE the subgroup of points
O. This is a free
~-bilinear
~/~
-
form over this
module, with values in the group ~(N-~~)of N~ roots of 1 in [ which can be defined as follows. For t E:NE, the divisor N((t) -(0)) over E is of degree 0 and satisfies Abel's condition, hence is principal. Take a rational function over E, f t £[(E) with this divisor (f t is determined up to a multiplicative constant by this condition) div(f t ) = N((t) - (0)). Select t'
£
E with N·t' = t (noting that two
such p.oints t' and t" must differ by a point in NE: t" = t' +u with u
E:
NE). The divisor
~t =
L..
[(t' +u) - CU)] €. Div(E)
u~E
depends only on t and not on the choice of t' with N·t'
t, has
degree 0, and satisfies Abel's condition: (t'+u-u) = t' = N2t' = N·t = 0 . UENE UENE This proves that ~t is also principal, and we can find a rational
L
L..
function Ft £ [(E) on E with divisor ~t (and this condition determines Ft up to a multiplicative constant). Then
diV(F~)
=
N~t
=
L
E N
N( (t' +u) - (u))
is the divisor of the rational function fteN : v there exists a constant c € Ft ) with get
F~ (v
~
ft(N.v) so that
(depending on t, the choices of f t and Ft(v) = cft(N.v), and replacing v by v + s (for s e: NE), we [~
N
+
s)
= F~ (v)
• Hence there exis ts a well-defined Nt_h root
- 111.13 eN(t,s) with Ft(V + s)
eN(t,s)Ft(v) (for all v
=
E:
E). This mapping
eN has the following properties. (2.1) Proposition. The mapping eN
and satisfies
~-bilinear
a) eN is antisymmetrical : eN(s,t)
= eN(t,s) -1
b) eN is non-degenerate : eN(t,s) = 1 for all s
=
t
£
NE implies
0 E:E
c) for any automorphism
F
of [ trivial on k (or on any field
of definition of E) eN(t,s)~ = eN(tr,sr) Proof. For brevity we write e = eN in this proof, hoping that no confusion will arise! By definition, it is clear that e(t,s
= e(t,s)e(t,st). (t)
=
+
s ')
Let us prove that also e(t +t' ,s) = e(t,s)e(t' ,s).
For that, put ttt = t + t' and take a rational function F div(F)
+
E:
a: (E) wi th
(t') - (t") - (0). Then
N diV(F ) = N(t) + N(t') - N(t") - N(O)
=
div(ftft,/f t ")
with certain choices of functions ft' f t , and f t " corresponding to the points t, tt, ttl. Thus
(F~F~,/F~II)(V) = and hence
(cftc'ft,/C"ft")(NoV)
(FtFt,/Fttt)(v)
=
C'F(N.v)
is invariant under the substi tution v e(t,s)e(t' ,sJ/e(t",s) = I hence the proof of a). For n ft,n(v)
=
=
C·F N (N·v)
~
v + s (s
E::
~-bilinearity.
NE). This proves We turn to the
1, ... ,N define the translate ft,n of f t by
ft(v -nt), and compute the divisor of the product
N N div(TI f t n) = N [(t n=l' n=l N Hence f C is constant and n=l t,n
L.
+
nt) - (nt)]
TIft,n
=0 ·
IT
IT ft,n (N ov) = IT f t (Nv - nt) IT f t (N(v - nt')) definition to TT CnF~(V -nt') and must be constant.
ITft,n (v) is equal by
185
- 111.14 N
IT
Extracting the N1J1 root, we see that
n=l
and replacing v by v + t' we get
N
IT
n=l
Ft (v - nt ') must be constant
N
Ft(v-nt') = N-1
IT
n n=l
Ft(v+t' -nt')
Ft (v - nt' )
n=O
and after simplification by the common factors, Ft(v) = Ft(v -Nt') Ft(v - t) is invariant by translation of t
: e(t,t) = I
(for any
t E:: NE). From there, replacing t by t + s and using the bi1ineari ty of the symbol e = eN ' we derive 1
=
e(t + s,t + s) = e(t,s)e(s,t)
For the proof of b), note that if e(t,s) Ft (v + s)
=
. for all s
I
~NE,
we have
= Ft (v) for all these s and so Ft (v) = ep(Nv) implying
F~ = (~.N)N = ftoN so that div(~)
=
div(~oN) = N-ldiV(fiN) and
div(ft)/N
=
(t) - (0) .
Abel's condition gives t = 0 so that e
cr
Finally, for c), we observe that
=
eN is non-degenerate.
acts on E and on the set X of
normalized valuations of [(E) (trivial on [) according to r
f"
(ord p €oX or p 6.E).
(ordpJ (f ) = ordp(f)
In particular, if x and yare coordinate functions on E regular at P (and defined over k) r
r
~
~
(ord p ) (x - x (P )) = (ordpJ (x - x (P) )
and similarly (ord p { (y -y(pll"))), 0, centered at P
r
r
: (ord p )
=
r
that if f ~ [(E), div(f )
ordpr , ordpr (f )
=
=
e(tr ,sr)Ftcr(v r
=
ordp(f) .
t
, F
t
=
r
= Ftr(vG""+sr) =
F~((V + s)O"'")
Ft (v)eT" = e ( t , s)cr Ft cr (vcr)
e(t~,s~)
~ on
are chosen as before, we see that
r
)
is
This shows
(div f f (with the natural action of
(f ) , F tr = (F ) so that t t
= e ( t , s)O""
This proves e(t,s)r
ord p (x - x (P)) >- 0 ,
which proves that (ordpt cr
divisors). If t,s £NE and f we can choose f tr
=
as asserted in c).
186
- III.lS To be able to go to the inverse limit in the symbols eN (with N
= tn) we have to give a connection between two of them.
(2.2) Proposition. Let M,N be two (strictly) positive integers, t
tN(E)
£
C
tMN(E) , s E: tMN(E). Then one has eMN(t,s) = eN(t,Ms)
( Ms E:tNCE) ).
Proof. We simplify the notations for the proof, denoting by e = eN ' e'
= e MN
(putting primes' to all notions relative to MN). We have
used the notations div f t = N(t) From there we deduce
cf oN
and
N(O)
t
(F oM)MN = (F No M)M t
t
Hence we may choose f' F
,
(F t'MN )
MN(t) - MN(O) as it should be),
c " f t MN . Now by definition of e ,
=
F~(v + s) giving
div f t
, e' = eM
FtoM
t
so that
,
(implying
t
Ft(Mv+Ms)
e' (t,S)F~(V)
=
e'(t,s)Ft(Mv). But by definition of e(t,s)
=
Ft (w + Ms)
=
e (t ,Ms) Ft (w) ,
hence the result. Taking now t
~
tMN(E), s
~tMN(E),
using the bilinearity, we
get immediately eMN(t,s)
M
=
eMN(Mt,s)
=
eN(Mt,Ms).
Now we take for M and N powers of the prime t . Remembering that an element of tn
E::
t
~n
T~(E)
can be identified with a sequence
(E) and .t·t
e(t,s) (for t
=
(t n ) and s
tive limit
=
n
t
n-l
(tn)n~O
with
,we define a bilinear symbol
(t,s) = (e~n(tn,sn))n~o =
(sn) ) and we consider the result in the projec-
~ -tnt-'- ~ ~ 7l/tn 71. = 7l-t, • Thus <.,.)
187
defines a non-
- 111.16 degenerate antisymmetrica1 (2.3)
~~-bi1inear
T.t (E) x Tl, (E) ~
7lJ.,
,
pairing
(t ,s) ~
We shall need the following easy lemmas. (2.4) Lemma 1. Let D be a domain in [ (not empty open connected subset) and (f i ) an at most denumerable family of meromorphic functions in D. If k is a demumerable subfield of [, then there is a point zo£ D such that the mapping £
~
f(zo) (is well defined and) gives an
embedding of the field k(fi)i generated over k by the f i into the complex field. Proof. The field of meromorphic functions k(f.). is denumerable, and 1 1 each meromorphic function has at most denumerably many poles. Since D is open (hence not denumerable), there exists a point Zo which is pole for no function f ek(f i ). The mapping f ~ f(zo) is then welldefined, not zero, hence gives an isomorphism k(f i ) ~ [ (into). A mapping of the form f ~ f(zo) (defined on a subring of a certain function field) is called a specializatiQn. When moreover the homomorphism f
~
f(zo) is injective (hence gives an isomorphism into [)
the specialization (or the point Zo itself) is said to be generic. With this terminology, lemma 1 asserts that every denumerable function field of meromorphic functions on a domain D c [ has a generic specialization. (Lemma 1 is also true for a denumerable function field of meromorphic functions on a domain D in [n, using Baire's theorem - D is locally compact - because the pole sets of meromorphic functions have no interior points.) (2.5) Lemma 2. Let o<.e.G1 2 (7l/Nll) be a transformation with the property 0« t) = :!: t for every t E: (1L/ N1l) 2. Then 0( = ±1 . Proof. Call t ,t the canonical basis of (ll!NZ) 2 and define the signs l 2 E.1 = ±1 by «(ti) = £i t i · Then o«t l + t 2) = E.1 t 1 + f,zt z must be t(t 1 + t 2 ) by hypothesis. Hence £1 = £2 = {, and 0( = £·1 188
- 111.17 (2.6) Lemma 3. Reduction modulo N (a positive integer) defines a 51 2 (7l/N1L). In other words, the
surjective homomorphism following sequence is exact : (1) (We denote by
rN
-+-
rN
--+-
r
= 51 2 (Ll)
~ 51 (7lINll) --... (1) 2
the principal congruence subgroup of
r of level
N. By defini tion, it is a normal subgroup of
r wi th fini te index.)
Proof. We have to show that every matrix (~
~) with integral coef-
ficients and determinant ad - bc
=1
mod N is congruent mod N (term-
wise) to a matrix with determinant 1. The elementary divisors theorem shows that there exists
~,
r
€
SlZ(7l) with
r(~ ~) d = (~ ~) in
diagonal form. Hence we are reduced to proving the theorem for diagona1 matrices only. But if mn • 1 mod N we have the congruence 0) (m o n
= (m 1-mn
mn-l) n{2-mn)
mo
d N
·
The matrix in the right-hand side is easily verified to have determinant 1. (In this lemma, 51 2 could be replaced by 5l n , the proof being made by induction on n with a similar method.) Now we introduce Weber's functions. They are associated to division points on elliptic curves in the following way. Let Im(T) > 0 and consider the complex torus [/L T • Its division points of order dividing N are the images of the points
a~/N +
bIN with integers
a and b. Introducing the line vector i = (a/N,b/N) e: (N-7;ll) 2 we can denote them by i(i) (product of a line vector by a column vector, resulting in a scalar). Or, if we prefer to introduce the integral line vector i' = Ni = (a,b) sion points in the form by
f i (t:)
=
g2:
€
(7lINll) 2, we can also write these divi-
., "tIN
1
(liN). Then Weber's functions are defined
3 (1:)
P(i(i) : Lt:)
if 0 ~ i E: (N-7;?Z) Z
(we discard an integral multiplicative constant which would play no
189
- 111.18 -
role here). We also put f o = j (the modular function). In more algebraic terms, these Weber functions are normalized first projection of division points : Cf : G:/L"( - - E c p2 (G:) defined by y2 = 4x
3
- g2 (t:)x - g3 ('r)
~~x
pI (([)
is a commutative diagram, and if we put ~ = gZg3(-c)x for the norrna-
a
lized first projection of E, then
(Z.7) where t l = ~(~/N)
, tz
=
f(l/N)
£
E ·
We observe that the rational function x on E being defined over k =
(field of definition of E) : x ek(E), the nor-
~(gZ(t),g3(~))
malized first projection
~
is also rational over k :
~£k(E).
Had we taken any other Weierstrassian model E' of ([ILL
= ~4g2(L)
coefficients g;
~:
E
~
,
with other
g~ = ~6g3(L)' the isomorphism
E' , (xo'Yo) ..-..+ (tL2xo'p:'yo)
,
shows that the normalized first projection ~' of E would take the same values as
(Z.8)
~
~ ~'(t')
on corresponding points :
=
~(t) if
o/(t)
t'
(or
t
=
t
,
, =f (z)).
In particular, we can choose for E' a model defined over the smallest field possible: k'
=
~(j(r)).
(Z.9) Proposition. The field ~(f.:i 1
€
(N-1z~)Z) of rational functions
over the upper half-plane is a Galois extension of the field
~(fo)
= ~(j).
Proof. By lemma I, there is a point giving a generic specialization f
't"
~
in the upper half-plane, f(~)
on this field, and con-
sequently we have to prove the corresponding assertion for the extension
190
=
- 111.19 -
,
Choose then a model E ,over the field *)
[/L~ •
~(j(~)),
of the complex torus
t
By (2.7) and (2.8), noting that i'(t~) describes the set
of points t' ~ E' annihilated by N , we get (Z.lO)
(Q(f i ("t))i
= ~(j ('t'),
e" (t
):0
~
,€
t
,
tN(E )).
This is a Galois extension of ~(j(~)) because ~' is rational over this field : any automorphism j C't) will transform E ~' (t,er)
,
a- of the complex field [, trivial over
,
in itself, and tN(E ) in itself with (J"
= ~' (t,)tr (as well as x'(t'CT') = x'(t') ).
This proves the proposition. We shall determine later the Galois group of the extension But knowing that this extension is algebraic, we con-
~(fi)i/~(fo).
clude that any
t
(in the upper half-plane) such that j(r) is trans-
cendental is generic: the subring of
~(fi)i
over which f
~
f(~)
is well-defined (finite) must contain (Q(j) and the f i hence coincide with
(Q(j) [f.]. 1: 1
(Z.ll) Corollary. Every
~
(in the upper half-plane) such that j(r)
is transcendental, gives a generic specialization f
~
f(~)
of the
field (Q(fi)i into the complex field [. It is easy to construct certain automorphisms of the field ~(fi)i
write
(over
r = (~
~(j)).
If
~€SlZ(7l),
f
fOI
~
~) and observe that the lattice
generated by the two vectors at'
+
band ct'
+
gives one. Indeed, (Cl: +
d)L t(t:) is
d hence is equal to Lr .
Then (Z .1Z)
g~3 (-C) gHi (~~ :~) : (c"t
+
d) L t ("r:))
= f i/t:) ·
Thus composition with the fractional linear transformation associated to
~
permutes the indices (for i 1 0 and obviously leaves f o = j
fixed) through right mul tiplication of the line index i by *) Observe that j c-r) must be transcendental, hence gZg3 (1:') ; 0 · 191
y.
- 111.20 -
P-function
Because the
of Weierstrass (or the first projection of
1 E:. f N will
E) is even, we also see that phism of
~(f.). 1
1
produce the trivial automor-
, or equivalently the permutation i
rN(±l)
t {i
indices, exactly when
~
±i of the
(cf. lemma Z above). Hence we
get an injective homomorphism
r / rN( :!: 1) 't S1 Z (ll/ N1l}/(ti) -+-
(2.13)
±y
t-+
Gal (
1
t-+
£.-l=f. ) 1
11
(cf.lemma 3). Let E be an elliptic curve . of invariant j 0 £ [ defined over the minimal field k =
K
K'
I
e: t N (E))
= k(x(t): t e: tN(E)) }
I
G
G'
k
Take a basis
1 ,t Z of the ~N1l-modu1e·tN(E) and define for every automorphism r e: Gal (K/k) the matrix peer) by ( 2 • 14 )
Then
p is
t
(t 1 t2
Jr
= (~) = f (0-) (t 1 )
r : G --+-
(2.15)
G1 (7l/Nll) Z
If r6 Ga1(K/K'), then x(t r ) hence
f(~)
tz
t1
an injective homomorphism
=
= x(t) implies t r = ±t for all t
!1 by lemma Z. Indeed t
~
€ tNCE)
-t defines an automorphism
of Kover K' and this automorphism is not trivial if N > 2 which we shall suppose from now on. Then (Z.15) gives also an injective homomorphism (2.16)
p' :
G' -
It is easy to see that if *)
p'
p
Gl zOZlN1Z)/(:!:l) *)
•
is surjective precisely when
, . p IS
is surjective, then either (~ -;) or (~l ~) is in the image of
If N = 2, K = K' , G = G' and (tl) = 1 so that (2.16) is also trivially true. 192
- 111.21 -
p , so that by taking the square, we find that -1 is in the image of
p
and so
p is
surjective. Let us show that K' contains a primitive
NtJl root of 1. Let
'N = eN(tl,t Z)· Because the bilinear symbol eN . .tlve . Nth is non-degenerate, this is a prlml - root of 1 (if (~ = 1 , we
see that eN(Mtl,t Z) = 1 hence eN(Mtl,t) = 1 for every t £tN(E), and this implies Mt l = 0, i.e. M multiple of N). On the other hand, if
r
E:. Aut (([/k), then
eN(tl't Z{
=
eN(tr,t~)
eN(tl'tz)detfCcr) ,
=
with the matrix p(er) of the restriction of on K', then we have just seen that and
peer) =
(j
to K. If 0- is trivial
~~ = (N. By Galois theory, this proves (N
€
precisely, the action of an automorphism ere G (or
p
given explicitely in the representation
40"N
(Z .17)
=
y
det
"»N
p( 0") = ~det N
= 1
±l so that det p(er)
K'. But more CT"E
~ N is
G') on
(or p') by p'(o-)
Because every automorphism of k({N) C K' over k extends to an automorphism of K' over k, we conclude that the image of
p'
(and
f)
contains elements of all determinants in ~N~)~. We have not used fully the assumption on the invariant jo "So far. But when jo is transcendent, we shall show that
p
and
f'
are surjective (hence isomor-
phisms). Obviously, it only remains to show that in this case, the image of
p'
contains SlZ~N~)/(±l). But the function j takes all
complex values (once in the fundamental domain for the modular group), so that there exists a
~
in the upper half-plane with
Because jo is transcendent, the specialization f for the field
~(fi)i
~
(corollary (Z.ll)). We denote by
j(~)
= jo ·
f(T) is generic ~(r)
the
automorphism of the specialized field corresponding to the automorphism of (Q(f i ) over (Q(j) given by (Z.13), for tf5:. SlZ~N~)/(!l). Let E' be an algebraic model of ([/L~ defined over jeT) = jo . Then E' is isomorphic to E, and we can even take E' = E. By (Z.7), the
193
- 111.22 -
field
~Cf·C~)). 1
1
is the same as the field
~Cj
0
,x(t) :Oft£tNCE)) =
=kCxCt):Ort£tNCE)) = K'. Let us determine the representative matrix for the automorphism
~Co)
ott) = fi.~
f i (-.)
('C)
of this field. We have for i
r0
= fi~ (T) = ~(i'~ (~~))
if the basis t 1 ,t is chosen suitably (as in (Z.7)). On the other Z hand,
tr(~)
C)
~ Ci ' (t 1) ) 0-( t ) = ~ (i ' (t I(
f. C"'C)ef ~ 1
t
ti')))
z
= ~ (i ' P(0- CV) ) (t 1) ) D
t
z
Comparison gives i
'~
¥=
This implies of
p
(or
= :!:i'po-(~)
p')
poo-(~)
for all i €
r
€.
(?Z/N7Z) 2
SlZ(Ll/Nll)/(±l) and shows that the image
contains every element of determinant 1.
To sum up, we have proved (Z.18) Theorem. Let E be an elliptic curve of transcendental invariant jo
~
[ defined over the minimal field k
integer N
~
= ~(jo).
For every positive
1, the Galois representation
p=
pCN) : Gal(k/k) --+ Aut(tN(E))
is surjective. If t
is any prime number, the
Pt :
Gal (K/k)
-+-
~-adic
representation
Aut CT~ CE)) ~ GI Z(Zl.t)
is surjective. Observe that the second assertion is deduced from the first one by taking N = t n and letting n ~ 00 • The projective limit of these surjective homomorphisms is still surjective, because the image of Gal(k/k) must both be dense and compact (rt
is continuous by defini-
tion). Put I = (~/~)Z (line vectors) and consider the field (of rational functions on the upper half-plane) generated by all f.1 (i € 1, remembering that f o = j is the modular function) K1 = ~(fi)I This field is a union of Galois extensions of ~(j) = ~Cfo) hence is a Galois extension itself and the Galois group of KI/
194
- 111;23 -
the projective limit of the Galois groups of the subextensions K~
=
~(fi)tN(I)
, hence projective limit of the groups G1Z~N~)/(~1).
Hence we have proved
Gal(KI/~(j))
(Z.19)
=
nGlz(~p)
If we identify (by (1.3) )
~/ll = (l) ~p/~p we can say that the
acti~n
/ (:!:l) •
P
of an element
¥ ~ TlIG1ZOZ p ) p
(mod
~l)
is
given on the generators f. through right multiplication of the index 1
by the matrix :
fr:1 )= 1¥ ' the fact that f.
This follows from
(2.16) is an isomorphism (onto) for
every positive integer N, because if we choose a generic specialization f
f(L) for the field K (any "C with j (-c) transcendental I will do), the action of a matrix on the index set I (at right) is t-+
transformed in the action on division points on an algebraic model defined over
The field KI contains all roots of unity (identified with constant functions on the upper ~(j)
of the complex torus
[/L~.
half-plane). This follows by generic specialization from what has been seen on page 111.18 . Write cyclotomic extension of
(Z.ZO)
~.
~ab = ~(!(l/N))N
for the maximal
Then we also have
Gal(~(fi)I/~ab(j)) = lITS1ZOZ p ) / (~l) , P
and by (2.12)
(Z.Zl)
Gal(~(fi)I/~(j)) = ~SlZOZp) / (~l)
P The field [(j) is the field of all modular function (for
r)
by
(1.3.11) and it is also easy to see that the field ~ (f i ) tN(I) = ~ (f i) i e: (N-~/ll) Z
is the field of all automorphic functions of level N (with respect to the principal congruence
subgroup
195
r N of r).
- I I I .24 --
Let us write
n
~ =
1l (ring of supernatural integers!) p p picture of the embedding of these fields.
_________
t
lQ(f.)= lQ 1
G1 2 (1l)/(±1)
ab
(f.) 1
I
rl
1 ([(f·)I
12 (1l)/(±1)
a:(j)
~ lQab(j)
196
and make a
- 111.25 -
3. Integrality of singular invariants
The main goal of this section is to prove that if
is a
~
quadratic imaginary number (with positive imaginary part), then the
= 123g~/~
value jeT) of the modular invariant j
is an algebraic
integer. We shall give two proofs of that fact : the classical proof which is transcendental and has some interesting features in itself, and a proof based on Tate's p-adic elliptic curves and their division points~(which justifies the introduction of this section in this
chapter! ) . (3.1) Definition. Let L
be a lattice. We define its ring of
c[
endomorphisrns by End (L) = {a e.
a: :
aL c L} :::> 1l
and we say that L is singular when End(L) If we also define from
~
to
~
L~ = ~·L ~
L
e Z
~
.
~ ~
by extension of the scalars
in the abelian group L, we also define the ring End(LQ2) = {a
a: :
£
aL~ C L~} ::> Q
In fact, if 0 f a ~ End(L~), then aL~ and this ring
End(L~)
= L~, so that a-I
€
End(L~) ,
is a field (of characteristic 0). By definition End(L) and that End(L)
n
R = Zl. This
shows that L is singular precisely when End(L ) ;
~
is a strict
it is clear that
End(L~) ~
Q
extension of the rational field. The following lemmas are all immediate consequences of the definitions. (3.2) Lemma 1. For a normalized lattice L = lattice, it is necessary and sufficient that
L~ ~
to be a singular be imaginary quadra-
tic. Proof. If
L~
is singular, take a € End(L) not an integer (hence a €R).
Then aLt:cLt: implies a
=
m
+
n1:6.L-r(with some integers m, n; 0) and
*) communicated to me by J.-P. Serre. 197
- I I I .26 -
also aL
L~
£
satisfied by
hence (m ~
q which gives a quadratic equation
+ n~)~ = p~ +
(we suppose
Im(~)
>
Conversely, if ~ is imaginary quadratic, ~(~L~) C ~L~
r is imaginary quadratic).
0 hence
(equality in fact), so that
t
2
Eo
End(~L~)
(3.3) Lemma 2. L is a singular lattice whenever imaginary quadratic field, and K = Proof. We may suppose L = singular
~ £
End(~L't')'
if L =
;
~.
K is an
End(L~) L~
.
and then we have seen that if
L~
hence K
proves that more precisely
~(L)
~L't' shows that
= ~(~)
End(~L~)
C
End(~L~).
But
is
L~
~L~= ~(~)
= K.
(3.4) Lemma 3. L is a singular lattice whenever End(L) is an order ~
in the ring of integers of the quadratic field K =
End(L~)
(then
L is homothetic to an ideal of this order). Proof. This is obvious because the ring of integers is the maximal order of K (remember that an order is a free abelian group of maximal rank 2 which is also a ring with 1). Then L is isomorphic to a normal lattice
L~
contained in K, and a.multiple of this lattice will be
contained in the order End(L). Thus L is homothetic to an ideal of the order End(L) having precisely the order End(L) as ring of stabilizers (not an ideal of the ring of integers itself in general). As to the structure of the orders in an imaginary quadratic field, we have the following lemma. (3.5) Lemma 4. Let K be a quadratic field. The suborders ring of integers
~
of K are the rings
integer f given by f =
[~K:
ot]
~f
=
~
(finite because
+ f~
at
~
of the
with a positive
has rank 2).
Proof. Let l,w be a basis of the free abelian group (I-module) and let a
+ fw
be an element of ot with minimal positive component
of w . Because 1 Of.,
= 7l
conductor
= Z
E.. C1 ,
f.o-K of the order
+ f7lw
~K
+
f(.J
€ ot
~f. (X,
and we shall have precisely
The index f of at in
~K
(i t characterizes completely
198
is called the Of.,
in our case).
- 111.27 We have
=
~l
~K
and an elementary computation shows that the discri-
minant Df of ~f is given by Df nant of
= f2 D (where D = Dl is the discrimi-
or of K/Q ) •
~K
Recalling that we say that two lattices are isomorphic if they are homothetic, we get the following result. (3.6) Proposition. The classes of isomorphic singular lattices with fixed endomorphism ring End(L)
=
~f
(order of conductor f in a
given imaginary quadratic field K) are in one to one correspondence with the ideal classes of
~f
(with respect to the equivalence given
by multiplication by principal ideals of
~f)
ring (or ring of stabilizers) given exactly by In particular, if f
having endomorphism ~f.
= 1 and h is the class number of K (order
of the ideal class group of
~K)'
there are precisely h isomorphism
classes of lattices L c ([ with End(L)
=
denotes the number of ideal classes of
K . If more generally h f
.6~f
having precisely
~f
as
endomorphism ring, there will be h f isomorphism classes of lattices L with End(L) I flP f '
=
,(}f.
This ideal class group could be identified with
the ideal class group (mod f) of fractional ideals of {)oK
prime to f with respect to the subgroup of principal ideals (<<) with
~
=I
= ~~
mod f. This is the classical ideal class group mod f of
classfield theory, and in particular h f < 00 (for all f
~
1). However,
we do not need this (in particular, we shall not even need the finiteness of h
= hI).
(3.7) Theorem 1. If L is a singular lattice, j
j(L) is an algebraic
number. Proof. This is the only point where we need elliptic curves for the main theorem of the section (in the transcendental method). Let E
= E[
~
be the elliptic curve given by the equation
[/L
2
Y
4x
3
- gzx - g3
with gi
199
= gi(L) ·
- I I I • 28 -
Then End(L)
~
End(E) (ring of holomorphic endomorphisms of E keeping
the origin - point at infinity - fixed: cf.(! .3.1) which shows that they are automatically homomorphisms) and any endomorphism h : E
-+
E
is ipso facto algebraic, i.e. given by rational expressions (if Pi . E d enotes t h e 1· th - proJ. ectl.on
~
pI ,
l'
=
1,Z ,
. f rom t h e com1ng
embedding E cp2, then each p .• h : E ~ pI is an elliptic function 1
on E, hence a rational function in x = PI ( = If now r
£
fJ)
and y = Pz (=
t:>' ) ).
Aut([) is any field automorphism and E~ is the elliptic
curve of equation y2
= 4x 3
- g~x - g~
the mapping h ~ hr gives an isomorphism End(E) ~ End(E~) (note that
h~
is again rational hence analytic, and thus a homomorphism by
(1.3.1) because it leaves the point at infinity (0,1,0) e.EG'" fixed,
as did h for (O,l,O)£E). This implies Er~[/Lr satisfying K =
t
End(L~)
End(L~).
= End(L),
with a lattice La-
hence isomorphic to a sublattice of
Since there are only denumerably many such lattices, this
implies that the set of isomorphism classes of Eer (
Aut (G:) ) of all conjugates of j is at most denumerable in [.
This proves that j is 'algebraic (otherwise it would have as many conjugates as the cardinal of a transcendental basis of G: over
~
which has the power of the continuum!). A posteriori, we see that j can have only finitely many conjugates. If one wants to use the fact that h f < 00 , one deduces that the degree [Q(j):Q] of j over ~ is smaller or equal to h f (f : conductor of End(E) ) : [~(j(~f)):~] ~ h f (3.8) Definition. A lattice L' is called isogenous to a lattice L
when there exists a
£
([><
wi th aL'
C
L .
Isogeny is an equivalence relation (if aL' c L, then a multiple nL
200
- 111.29 -
C L' ). If L' is isogenous
of L will be contained in aL', hence (n/a)L to L, then by definition L~.
Recall (1.3.4) that Ltl
matrix
~£(~ ~)
= End(L~)
End(L~)
L~,~ L~ when~' = ~(L)
and that
:
is homothetic to
with some unimodular
group of inversible integral matrices
(3.9) Lemma S. Two lattices LT and
= «(L) with some matrix
L~
= L'[ when -c' = 1:+ n = 1J(t:) with some
matrix ex e: SL 2 (ll) (= GL (ll) + Z with positive determinant).
~,
because
L~,
«£GL2(~)+
are isogenous exactly when (rational matrix with positive
determinant). Proof. Take a complex number t , 0 such that tL rl CL r . Then t"t'
a't + band t
,
= c't
+ d with
~ = (~
~) e: MZ(:l)
.
«(~). Because ~ and ~' have both positive imaginary parts
Hence
't
det(~)
> 0 (as in the proof of (1.3.4) we have more precisely
1m (t')
1m ('e) Ic 1:
dl- 2det (0<) ). Conversely, if 1:' = 0( (l:') with
+
DC
as
in the assertion, we can multiply it by an integer (without changing 't') and suppose
(c 'r + d) L't'
C
oc
Eo
M (:l) to s tart with. If Z
0(
= (~
~),
then
L -c •
As a consequence, the set of lattices with fixed
End(L~) =
K
(quadratic imaginary field), is a full equivalence class of isogenic lattices (L r two distinct t'
is in this class whenever K = "t,
or'
E..
~(T),
i.e.
~ £
K, and
K with positive imaginary parts are linked by
= (~ ~)('t) with a> O,b
E.
~
)
•
(3.10) Theorem 2. If L' is isogenous to L and j then j' is integral over
= j(L), j' = j(L'),
ll[j] . (Note that we do not suppose L to
be singular.) The proof will be carried out in five steps. First step: reduction to ~ = (~ We can suppose L =
L~
, L
,
~) (p a prime number).
L«(r)
201
and by the elementary divisor
- 111.30 -
t,S
theorem, there are unimodular matrices
m,n, m divisible by n, so that
IX
SL2(~)' and integers
£
= t (: ~) ~ . Using the invariance
of the modular function j, we can say that integral over
je~
j.
r' (:
~).b
~)o
is equivalent to
E
~ [j] =
integral over
&(-r)) in turn equivalent to j
~ [j] . But j. (~
over
0(:
j
and (replacing -r by
~[j]
~) ("r) = j (~t:)
0 (:
1l [jeb
1
~) integral
wi th an integer min. By succes-
sive multiplications by prime numbers p we see that we are reduced to showing that j (pT) L
is integral over 1l [j (-r)] for all primes p (and
in the upper half plane). Let jp be the function on the upper half
plane defined by jp(T) =
j(p~)
. We shall prove that the function
ll[j]
jp is integral over the ring of functions
(transcendental point
of view). Second step: LpT is a sublattice of L~.. of index p. Our guess (!) is that the conjugate algebraic numbers to jp(T) = j(p~) are the invariants of the other sublattices of index p in L = LT
•
There is
a lemma.
~z = <el,e Z) ~ (~ ~) for 0.,. v< P
(3.11) Lemma 6. a) The lattices of index p in
Ly
<e l +veZ,pe Z)
>
=
~Z'o(lI with
o{y=
(pel,e Z = ~~«p with «p = (~ Lp are p + 1 lattices of index p in ~2. and
b) Let
r
=
~). In particular, there
SL Z (~), and define the matrices
DC y
(0
~ y ~
p) as in a).
Then we have the disjoint coset decomposition
(11 :
disjoint union) .
These are just two ways of expressing the same result as we shall see. Let
r
operate at right in 71 2 (we consider elements of
Z2 as line
vectors, and we multiply them at right by square matrices, getting again line vectors). The mapping
0<"""""
202
7l 2 .0<
gives a one-to-one
- 111.31-
r 0(
correspondence between left cosets
and sublattices L = 7l 2 • 0(
of
2
7l • If we restrict this correspondence to matrices of determinant p,
we shall get exactly those lattices L which are of index p in
Z2.
By elementary divisor theorem, every matrix of determinant p is in
rePo
the double coset
o)r 1
. This explains the equivalence of the
two assertions a) and b) of the lemma. Let us prove a). Let L be an arbitrary lattice of index p in lated by p, so that pe
2 2 7l • Every element of 7l /L is annihi-
L for every e eZ 2 . This proves that L lies
£
between 72 2 and p 7l 2 : p 72 2
L C 72 2 , and more precisely, the lattices of index p correspond to the subgroups of index p in 72 2/p·Z2. o
o
C
Because p_7l 2 is of index p2, these subgroups are of order p and are the lines (over F p Z/p-72) through the origin in F p ~ F p . These are the points of ]pI (F ) and there are p + 1 points on this proj ective p
line. Coming back to the lattices,
~e
see that the (O~ V ~
p-l)
correspond to the non-vertical lines (through the origin) in F and
Lp
= (pe l' e) = 72 2 .(P0 2
0,
v ~
~n
F
P
n
. OC., () -- J. ("t' + y) = Je T -p-
p-l) and form the so called modular equation of degree p
(3.12)
The
~
corresponds to the vertical one.
0) 1
. proves t h Id Th IS e emma, an so we put .J. v () 1: (for
P
Fp(j ,X) ~n(jo,···,jp)
=
(X - j,,)
p 1 X + +
~
are the elementary symmetric functions in the
functions j" . Third step:
The functions
Fp(j ,X) e:: ~
n
a:[j] [X] = a:[j ,Xl
are indeed holomorphic functions over the upper
half-plane and are invariant right multiplication by
r
~n·1
-<
(1er) because by lemma 6b)
~n
permutes the left cosets
permutes the jy • Because the jv )jy (z))
=
r
O(aI
and thus
have a reasonable growth, say
exp(A"Im(z)) for Im(z) ~
203
00
,
the same will be true for
- 111.32 the functions
· These functions
~n
~n
are thus holomorphic modular
functions, and by (1.3.11) are polynomials in j :
cr n = L.
Fourth step : Let Pn (j) = of
~n
(with q =
~(L)
~n
£
[[j] ·
akqk be the Fourier expansion
k)-N
and N = N(n) equal to the degree of Pn ). Then
= ak(n) are rational integers. k have Fourier expansions in powers of q1/p = e(T/p).
we show that the coefficients a
The functions jy
Explicitely, if we introduce the primitive p~ root of one ~ = ~(l/p), we have j(~) = e(_"C+Y) p p
(3.13)
= ~-"'q-l/p j(p-r)
L
+
n:,O
= q-P
+
+
>c(n)e(n"C+ v ) = p
~
c(n)l;lIn qn/ p
lO~
y
< p-1)
,
Lc(n)qpn n~O
with integers c(n)
£~
(the coefficients of the q-expansion of j are
integers (1.4.4) ) . This shows that the coefficients a k
in the ring 7l [q . But if f integer r, 1 , r
~
EO:
=
ak(n) are
Gal (ll (~) Ill), then ~f = t r for some
p-l and applying the automorphism
wise to the q-expansions permutes the jv (1<
v, p-1)
e coefficientand leaves jo
as well as jp fixed. The symmetric combinations will thus have invariant coefficients under all automorphisms in question and be in ~ f'\ 71. [~J = 7L
(simply because 1l [']
is contained in - in fact equal to -
the ring of integers of Q(') ) . Fifth (final) step : It is given by the following q-expansions principIe, which will imply in particular Pn (j)
(3.14) Lemma 7. Let P(j)
=
L:
akqk
E: 71.
[j],
hence Fp (j ,X) E: 71. B,X].
be the q-expansion of a po1yno-
k~-N
mial in the j function (with complex coefficients). Then the subgroup of [ generated by the coefficients of P is equal to the group generated by the Fourier coefficients a k (k ~ -N). This lemma is trivial if N = 0 (P is a constant), so we use induction on N and suppose that it is already proved for N-l instead of N.
204
- 111.33 -
Because j has a simple pole with residue one (in q = 0), N must be the degree of P and the coefficient of jN in P must be a_ · Consider N -1 .N k + c(k)qk)N P(j) - a_NJ a_N(q akq
L
L
k~O
k~-N
L.
bkqk
k~-(N-l)
By induction hypothesis, the coefficients of P(j) - a_NjN generate
E&
llb
k~-(N-l)
rate the group
(b ) k
+
so the coefficients of P (j) gene-
,
k
lla_ N
But this group is (ak> k>-N (because it contains a_ N and the c(k) are integers). Thus theorem 2 is completely proved, and if w~ write j* instead
of the indeterminate X, Fp(j,j*) ~~[j,j*] and the roots of Fp{j,j*) = 0 are precisely the j* =}y (O~ v.,;p). Because these roots are permuted transitively (lemma 6b) under the automorphisms
~ ~~(~)
of the upper half-plane, Fp is irreducible as polynomial in j* with coefficients in 1l [jJ (or 1l (j) by Gauss' lemma, or still a: (j) ) . We have identically Fp(j(~),j(p~))
= 0
(Im(-r)
>
0)
,
hence, dividing by p, also identically Fp(j(T/p),j(~))
=0
This shows that the two polynomials Fp(j,j*), Fp(j* ,j) have j*
£ [(j){j*]
jo as common root, and by irreducibility of the polynomial
Fp(j,j*)
= (j*)P+I + ~p{j*)p + ••.
there must exist a polynomial P(j,j*) £ll[j,j*] such that
Fp(j*,j)
= P(j,j*)Fp(j,j*)
Iterating this procedure of inversion will give
Fp(j,j *) = PU* ,j)Fp(j * ,j) = PU* ,j)P(j,j *)FpU,j *) , and hence
P(j*,j)P(j,j*) = lin
ll[j,j*] : P(j*,j) = P(j,j*)
205
!l.
- 111.34 -
If P(j,j*) = -1, we have Fp (0J * , j ) = -F P (j , j *) and giving the value .* j , we would get identically F (j,j) = 0, so that Fp(jJj*) would J p
have the root J. * = j and be divisible by j * the
irreducibility of Fp(j,j*) in
modular polynomial F
p
j , a contradiction to
7l[jl[j*] • This proves that the
is symmetrical in j and j*. -
These polynomials can be very difficult to determine exp1icitely as the case p
= 2 already shows (Bateman t.3,p.25) j 3 + j * 3 - ( j j *) 2 + 3 4 5 3 4027 j j * + 2 4 3 • 31 j j * (j + j *)
F 2 (j , j *)
_ 24 3 4 5 3 (j2
+
j*2)
28 3 7 56 (j
+
j*) _ 212 3 9 5 9
+
It can be shown however, that F (j,j*) :: (j - j*p)(jP - j*) P
mod p
(3.15) Theorem 3. When L is a singular lattice, jell is an algebraic integer. Proof. We observe that L is isogenic to the ring of integers in the quadratic field
End(L~) =
K, so by transitivity of the notion of
integrality, it is sufficient to prove that
itself is an
j(~K)
integer. There exists an element a € K having a prime norm N(a) = p. For example, take a prime p which is not inert in K (i.e. which does not generate a prime ideal of ideals
f'
c- K),
P'&K =
PCf
with principal
q , necessarily of norm p. Then N(r) I
element of prime norm p. This shows that
~K
Inf N(a) gives an
=
ae.p
is isomorphic to a
sub1attice (namely a 1)-K) of index P in C"K : p
N(a) = Card (-e}K/ a ~K)
=
[-6-K: a -()..K]
By definition of the modular polynomial, we have
Fp(j(~K),j(~K))
o.
But the polynomial Fp(j,j) ~~[j] has leading term -1 as the q-expansions of the j -jy show: (j - i,,) tr)
q
-1
-q-P
higher order terms
+ +
(for
0, y
~
p-l) ,
higher order terms
(these expansions are to be regarded as power series in q1/p), and so
206
- 111.35 -
p _q-2p
+
terms
higher order terms
=
_j2p
+
•••
This proves that-Fp(j,j) is a unitary polynomial of degree 2p in j giving an integral equation of dependance of
j(~K)
over
~.
Now we turn to the p-adic proof of Th.3, not using Th.2 (but using Th.l which is comparatively very simple). For that, we shall prove (3.16) Theorem 4. Let L be a finite algebraic extension of the p-adic field (Q p ,q E: J:C such that lql < 1 -and Eq = Eq (L) be the corresponding elliptic curve of Tate. Then End(E ) = ~ .
--
--
q
Proof. (We are using L instead of K for the p-adic field, keeping
K for End(E q ) 0 (Q .) First step
For every algebraic extension L'
of L we have a parametrization
'f :
--+- Eq(L ' ) C
L'" X
p2(L ' )
~ (P(x),DP(x),l)
if x ¢ q~
giving isomorphisms
.. /
q'll
.........,.~ Eq (L I
)
(observe that the theorem is certainly true for finite algebraic extensions L' of L by our treatment of Tate's curves in chapter
rr ,
hence also for every algebraic extension of L, because such an extension is union of finite algebraic ones). Now
'f being composite of
two homomorphisms is a homomorphism itself, and if we denote by
+
the group law of Eq we have ~(xy) = ~(x)
+
Let us take now an L-automorphism
fCY)
(x,Y E.L'><)
•
~ of an algebraic closure
This automorphism is automatically continuous on
L for
L of L .
the topology
deduced by the unique extension of the absolute value of L (L is
207
- 111.36 -
complete). In particular
transforms a sequence tending to 0 in
~
another sequence tending to 0, hence a convergent series in another convergent series (the conjugate of the sum can be computed by taking the sum of the conjugates). Because the p-adic functions P and DP have their coefficients in the field L (q
~
Land P(X) = P(X:q) is
the analogue of the q-expansion of the Weierstrass function) , we shall have P(xG") hence also
P (x)fT'
, DP (xG") = DP (x)G"
This proves that the parametrization
~(x~)
is compatible with the action of Gal(L/L) on L~ on one hand, and on
f
Eq(L) CPZ(L) on the other (in other words, automorphisms a-
E:
commutes with the
Gal (L/L) ) .
Second step: Let T.(. = T.t (E q ) denote Tate's module for the prime ,f... We shall show that if
the image
.t
does not divide ord (q) (= -ordLj (E q ) ) , then L
of Gal (tiL) in Aut (T-t /tT.t,)
unipotent operator with matrix (~
Aut (t-t (E q )) contains a
~) in a suitable basis (~F~)
of this space. We denote by L' the cyclotomic extension of L obtained by adjoining a primitive t~ root of I to L (hence L' contains all roots of I of order ~). Because [L': L] is a divisor of l - I , hence prime to .(., we shall still have
~ prime to ordL,(q)
(= e.ordLq with
the ramification index e = e(L'/L) dividing the degree of L'/L).
t~ root in L'. Choose an {t~ root
In particular, q has still no
ql/t
L and an automorphism
€.
G'"( q l/-l)
"i q lIt . Necessarily
(f' E:.
Gal (L/L')
er( ql/.f) I ql/t
C
Gal (L/L) such that
=,
is a primitive
{~ root of I (hence in L'). Put now e l = ,((), e Z = ~(ql/~). This is a basis of ttCEq) and in this basis, the action of the automorphism ~
is described by a unipotent matrix as asserted
erC')
e~
=
fCC)r
e~
=
,(ql/t{ = '((J"(ql/t)) =
=
C«({r) =
= el
f( l;.ql/.t)
208
- 111.37 -
Third step : There is a finite algebraic extension L" of L so that every endomorphism
0(
e: End(E q ) is defined over L". There is nothing
to prove if End(E q ) = 7l , so we suppose that End(E ) = .D- = 7l + 7l 0( q f is an order in an imaginary quadratic field. The assertion will be proved if we only prove that the endomorphism
~
is defined ·over a
finite algebraic extension of L (multiplication by an integer in E
is
q
defined over L). But we can embed L into the complex field [ (algebraically closed and having a transcendence basis over the prime field phism
~
having the power of the continuum). Then for every automorAut([/L), ~r is still an automorphism of E
~ £
that the set of conjugates that
«
q
i«rJ
. This proves
is at most denumerable. This proves
is algebraic, defined over a finite algebraic extension of
L. As consequence, if t
t(E ) is a division point on Eq , if q 0( € End(E ) is defined over L" and 0- E: Gal(L"/L), we shall have q ~(tr) = ~(t)r . This shows that the t-adic representations of End(E ) £
q
and Gal(L/L") in T.( (E q ) commute. Fourth step. We conclude the proof by contradiction. Let us replace L by a finite algebraic extension (still denoted by L) over which every
~
integers
£
End(E ) is defined. We have proved that for all prime q
l
not dividing ordL(q), there is an element ~€ Gal(L/L)
having representing matrix (~
i)
for its action on t~(Eq)
= T~/tTl ·
Since all elements of End(E ) must have representations in this q
space commuting with this unipotent matrix, they must themselves be in upper triangular form with two equal characteristic values (a c implies
a+b) c+d
(a c
c = 0
b)(l d 0
i)
(1 0
l)(a 1 c
~)
b+d) = (a+c c d
and a = d
Hence all elements of the image of End(E q ) in End(tt(E q )) would have equal eigenvalues. This would lead to a contradiction if End(E q )
209
- 111.38 -
generated a quadratic field K
(Q (R), as soon as
t
would be taken
prime to d . is imaginary quadratic (with Im(-r) > 0), then
(3.17) Corollary. If T j(~)
is an algebraic integer.
Proof. By Th.l,
j(~)
is algebraic, so that
[/L~
is isomorphic to a
non-singular cubic curve defined over a number field k. If not an integer, there would be a prime ideal gers
~k
~
~
were
of the ring of inte-
of k appearing with a negative exponent in the decomposition
of the principal fractional ideal generated by j ideal
j(~)
wi th ord j
f
< O.
would exist a unique q
E:.
hence a prime
= j(~),
In the completed p-adic field k k; with JqJ
<1
t
of k, there
and j = j (q). Then our
algebraic cubic would be isomorphic to Tate's curve Eq over k~ (or already over a quadratic extension of k,). But we have seen that End(E q ) = ~ for all Tate's curves, contradicting our hypothesis that ~
being imaginary quadratic,
End([/L~)
,
~
.
The third step in the proof of Th.4 above can be made much more explicit: if E is an elliptic curve defined over a field k (of characteristic 0), then every endomorphism of E is defined over kK, where K is the field generated by End(E). This is trivial if End(E)
=~.
The other case will be treated now by using first kind differential forms on E. We have to review some notions. Let V be an absolutely irreducible (projective plane) algebraic curve defined over the field k where Jl.
characteristic zero. Thus V
=
VJt
is an algebraically closed field containing k. If
automorphism of
n
over k, then G'" acts on the space Der(V)
of derivations D : Jl(V) V (trivial over (3.18)
of
~
=
Der.n(V)
Jl(V) of the rational function field of
JL). In fact one defines Dr £ Der(V) by the formula for f
210
-= Jl(V) .
- 111.39 -
(Here, as usual, we denote by f~ the rational function on V having conjugate coefficients under
~
in a representation of f as quotient
P/Q of two polynomial functions on V : this definition of fr is independent of the choice of the representation P/Q because the ideal of definition of V is generated by polynomials with coefficients in k hence is invariant under
If we recall that the derivation Df with respect to a rational function is uniquely defined by the property ~.)
1 (we suppose that f is not constant), we see that D~ ( fO'") = Df ( f)r = 1 = Dfer ( f G') , hence (3.19)
Now if w € Diff(V) is a differential form on V, i.e. an
~(V)-linear
form over Der(V) (these two spaces have dimension lover Jl(V) ), we also define (C4>"",D r
(3.20)
>=
In particular if w = df (f ( (df) G"" , Dr)
=
'"
(W,D)tr €
=
(for D
J2.(V)), we have D( f) cr = Dr( f (f" )
E:
Der(V)) .
,
hence (df)cr = d(ff') and more generally (3.21)
Let now P
€
V be a regular point,
at P. Because
~E
Jl(V) a uniformizing variable
(dnp,DTt:) = 1 every differential form p
GIl
on V can be
written (3.22)
'd
=
(tV ,D ) dTrp
TCp r The proof of (2.l.c) shows that ordpr= (ord p ) hence that we can
choose
rr; = ~pr
as uniformizing variable at pre fn particular,
(I()~D1(pr) is regular at P (in Rp ) whenever
(W,D 7rp ) is regular at
(the local ring R,.. at p'" is R~). This shows that if p
w€.al
is a
first kind differential form on V, its conjugate w~ will also be a first kind differential form on V.
211
P
- 111.40 -
If
CA)
= Wr for all r
form on V (take f function g
6
Eo
E:
Aut (ll./k), then
is a k-rational differential
key) not constant, and write r
w =~
Jl.(V)
Co\)
(.&)=
gdf with some
gives immediately gr = g for all k-auto-
morphisms cr of Jl. ) . Let now
~
: V
~
W be a rational mapping (when expressed in a
coordinate system in W,the coordinates of «
must be rational func-
tions on V) between two curves (satisfying the same assumptions as the curve V of the last paragraph). Thus« defines a homomorphism 0(.
:
.Q.(V) ~
n(w)
,
•
We suppose ex not constant, so that
0(
0 will be inj ective.
;
Certain derivations D €Der(V) have a direct image example if D
= Df
eO(
(f
under~.
For
is a derivation wi th respect to a
€ Jl.(W))
rational function in the image of
~. we can define «.(D)
€
Der(W)
by the following procedure. Noting that Df 10( sends f.« on 1 (by definition), hence leaves stable ..Q.(W). exc n(V), we have h
and
«.(D f
.0(
) is defined by g ~ (D f • 0(
(3 • 23)
)
(g) ..
~
0(
E
Jl(W) ,
h. In a formula, this reads
= Df •
0(
(g • ot)
•
Then by "duality" we can define the inverse image (under O() of any differential form w £ Diff(W) by putting (3.24)
(we put indices V or W with the bilinear symbol (.,.)
to make more
precise the curve with respect to which it is defined). This formula characterizes
~.(~) completely with a single non-constant fE Jl(W)
because then Df.O( In particular if
; 0 and Diff(V) is one-dimensional over .fi(V) · W
df (f E:Jl(W)), we get
W .« 00(
whence (3.25)
c(
=
0(.
(D f
.0( )
(f) •
0(
= Df 00(
(f.«)
• (df) = d(f • o() and more generally it follows • 0( (gdf) = (g 00( )d(f. 0( ) (f,g E .Jl(W)) . 212
1,
- 111.41 -
This formula justifies another notation for If
(f'
..n.,
is a k-automorphism of
and f
£.
* (w), namely
6).0(.
0(
Jl(W) a rational function on
W, it is obvious that (f. O()r = fr.o(r and from there using (3.25),
that
*
(3.26) Let P
DC. ( 4) ) ~
r
=
r
(4) • O()
and
~P
• 0(
r
... = ( 0( r) * (4))
V be a regular point, «(P) = Q be a well-defined regular
* (R ) = R • 0( C R ) . p Q Q are uniformizing variables at P and Q respectively,
point on W (thus we suppose If
= £() r
~Q
we shall have
0(
regular at P :
0(
.olE:Rp (and this implies D (7t "OC) E: Rp by (H.3.10) ). Q 1tP Q Then if the differential form w € Diff(W) is regular at Q, « * (w) 7t
will be regular at P. By (3.22) the hypothesis means that <~)D~>
Q
is regular at Q. Let us compute
W
because D~/D ltQ.o(
Hence
D~ (7t'Q • oc ) /D
1tQ
" 0( (JrQ 0
=
c( )
Dw: (1t • Cl( p Q
)
=
Dl» (lrQ " 0(.) [
<W,DT[~W 00<]R411Q'
The first term in this product lar at P. The second term regular at Q and
~
D~(TIQ"o()
0<] ·
is in Rp because
0<
(tV ,D 7t )w • 0(
is also in Rp because Q regular at P. The last term is
0« d
o(*D.,Vol)W •
W
=
<0{* d"Q ,DJrQ.o(
is regufA)
is
>
This proves the assertion. In consequence, if
V = 1 ~
is regular on the
whole of V (non-singular) and W ~~1 is a first kind differential on W, then
*
~ (~)
is a first kind differential on V.
(3.27) Proposition. Let E be an elliptic curve in characteristic 0 admitting complex multiplications: End(E)
;~.
Let k ( c [) be a
field of definition of E and K ( C [) be the field End(E) 0 ~
213
~
genera-
- 111.42 -
ted by the complex multiplications of E. Then every endomorphism of E is defined over kK. Proof. We use the representation «
~ ~
* of the ring of endomorphisms
of E in the space of first kind differential forms over E. Because this space is of dimension one over [ (ll.3.l6) ~* must be a homothety by a well-defined complex number
a~
End(E) ~ End(n~) ~ [
Let now ~
I
WE~k
(canonical isomorphism)
be a k-rational first kind differential form on E, and
an automorphism of [ over kK. We have *
0(
r
r
(L())
={
( w. 0<) = c.J r
(a0( CJ,)) =
0
era·~ c(
0(
r
=
a",r · U)
= a 0( . W
hence a«r= a« . But the above isomorphism extends to a (normalized) embedding End(E) @ ~ injective. From 0(
71.
~
[ (with image K), and in particular is
aO(r = aO( we derive thus 0("=
0(
and this shows that
is defined over kK. (The representation of End(E) in !lJ ~ is
simpler to explain in transcendental terms : if [/L the isomorphism End(L)
~
of first kind applied onto dz
~
~
E induces
End(E) and dz is the differential form
w, then a
d(az) = a dz .)
214
€ End(L) induces the homothety
- 111.43 -
4. Division points in characteristic p
This section is descriptive in character, its goal being to indicate a possible approach to division points in characteristic
p f 0 (we shall also assume p ; Z,3 to be able to work with Weierstrass normal forms, although results are true quite generally). Let E be an elliptic curve of (affine) equation qZ
4p3 - gzp - g3
(~= g~
-
Z7g~ f
0). We know that i t is unifor-
mized by the Weierstrass functions p = p(z) = ~(z) and q = p' = P'(z). We are trying to express Pn = p(nz) and qn = p'(nz) as rational functions of p and q. It is easy to do so for n addition formula (1.1.14) for u ~ v
=
Z noting that the
z leads to the duplication
=
formula (1.1.15) (4.1)
Pz
=
-Zp + Ip"Z/p'Z
=
By derivation of the identity p'Z Zp'p"
=
1ZpZp' - gzp'
4p3 - gzp - g3 we get
p"
=
6pZ -lgz
To find an expression for qz ' we have to find the equation q
=
mp
+
h of the line tangent to E at (p,q). After some elementary
computations, we find m
p"/p', h = (p'Z - pp")/p' so that
(4.Z)
3pp"/p' _lp,,3/ p ,3 - p'
qz
=
-mpZ - h
To find an expression for P3 we use the addition formula P3
=
-PZ - P
+
Z
1 (qz - q) / (p Z =
By (4. Z)
qZ - q
q Z - p'
and
PZ";p
(p"Z -1Zpp,Z)/(4p'Z)
(4.3)
qZ - q
1
pt
Pz -p
Z
p) ( 1Z pp , Zp" - p" 3 - 8P , 4) / ( 4P , 3)
so that
lZ pp , Zp" - p" 3 - 8P, 4 p"Z -lZpp'Z
From this, it is easy to find (4.4)
P3
=
P
-1-1- 8p,4(Z4pp'Zp" - Zp,,3 - Bp,4)/(p"Z _lZpp,Z)2 p'Z
215
- 111.44 -
p _ 4P , 2( 12 pp , 2p " _ p" 3 _ 4P , 4) / (p" 2 _ 12 pp , l) 2 To simplify the computations somewhat, put g2 = -4a, g3 = -4b , so that p"
(4.S)
= 6p2
+ 2a
= 2( 3p 2
+ a) •
Then we see after some more computations (4. 6)
P3
=
P -
p'?( 2p 6 + 10ap4 + 40bp3 -10a 2p 2 - 8abp - 2a 3 -16b 2) ( 3p 4 + 6ap2 + 12bp - a 2) 2
The equation of the curve E is now q2 = 4p3 + 4ap + 4b so we put x = p and y = lq = Ip' so as to have the equation
(4.7) Let us also define (x 'Yn) = n· (x,y) so that xl n and Y Then (4.1) gives Y = Ip' Yn = !qn 1 4 2 2 2 (4.8) xl = x - ( 3x + 6 ax + 12 bx - a ) / ( 4Y ) and (4.6) gives (4.9)
x
3
=
x -
=x
P
PI ,x n
8y 2 (x 6 + Sax 4 + 20bx 3 - Sa 2x 2 - 4abx - 8b Z - a 3 ) ( 3x 4 + 6ax 2 + 12 bx - a 2) 2
Now it can be proved by induction that 2
(4.10)
xn
Yn with polynomials
= (~n+2~~-1 - ~n-2~~+1)/(4Y~~)
(4.11)
~n
~2n ~2n+l
=,X
-
~n-l~n+1/~n
defined recursively by 2 2
~n(~n+2~n-1 -~n-2~n+l)/(2y) 3 3 ~n+2~n - ~n-l~n+1
To start the induction in (4.11) we need the first-values given by (4.8) and (4. 9) (4.12)
1
2y 3x 4 + 6ax 2 + 12bx - a 2 4y(x 6 + Sax 4 + 20bx 3 - Sa 2x 2 - 4abx - 8b 2 - a 3 )
216
Pn
- 111.45 -
These polynomials are universal polynomials in x, a, b tJJ 2n/Y ,
111 2n+ 1 E::
7l
[x , a, b] .
with integral coefficients and have the following properties (which can be verified by induction ... ) : (4.13) a) XW~ - Wn-1W n + l is a polynomial in x of degree n 2 and leading coefficient 1 (we have used y2 x 3 + ax + b) , b) ~2 is a polynomial in x of degree n 2 -l n 2 . t 0 eac h c ) x~n2 - ~n-1tJJn+l and 11.~n are prIme
0 th er
( conSl. d ere d
as polynomials in x) and stay so even after reduction mod p (p being a prime; 2,3). (For this last property, cf. J.W.S. Cassels: A note on the division values of
~(u),
Proc. Cambridge Phil. Soc.,.1i, (1949) ,pp .167-172 .)
These properties (which are classical in characteristic 0) are purely algebraic and have the following consequences: (4.14) Theorem. Let E be the elliptic curve given by (4.7) with coefficients a,b in an algebraically closed field k of characteristic p, and denote by (x,y)
~
n(x,y)
~E
=
n1
E
: E
~
E the algebraic homomorphism
(xn'Yn). Then n
2
Proof (for p ; 2,3). We start by proving [k(x) : k(x n )] = n 2 , and for that purpose we use LUroth's theorem giving the degree of the left-hand side as the degree of the rational fraction x n 2 2 deg(x ) deg (x~n - tJJn-1~n+l)/~n n 2
2
n - ~ n- 1~ n +l),deg(~ n )) by property c) above (we consider the reduced polynomials mod p as Max(deg(x~
polynomials in x and coefficients in k). From property b) we infer that deg(tJJ2) ~ n 2 - 1 (reduction mod p can lower the degree!) and n thus finally [k(x) : k(x n )] = deg x (x n ) = n 2 , by property a) above.
217
- 111.46 -
Then we consider the following diagram of fields
n2
k(x) --- k(x,y)
k(E)
k(x n ) -- k(xn'Yn)
=
I
Now the automorphism (x,y)
I
~
k(E)on E
(x,-y) of k(E) over k(x) induces
the non-trivial automorphism (xn'Yn)
~
(xn'-Yn) of k(E)on E over
k(x n ) (use (4.10) to check this, considering separately the cases n odd and n even). This proves that k(x) and k(xn'Yn) are linearly disjoint over k(x n ) and consequently
[k(x,y) : k(xn,y n )] = [k(x) : k(X n )] = n 2 (There is fortunately a proof of this basic theorem avoiding
completely the computations relative to the polynomials
~n'
but it
uses intersection theory and is less elementary.) If
~
: E
E
~
is a homomorphism, it is useful to define the
degree (resp. separable, inseparable degree) of « by (4.15)
deg(o<)
=
[k(E) : k(E).cx] = deg (oc) .deg i (oc) , s deg s (<<)
(k(E)
k(E)oo(]s
[k(E) and to say that
Of
is separable if k(E) is separable over k(E).O( .
(4.16) Corollarr 1. The degree of n E is n 2 and if n is prime to p, 2 nE is separable (n,p) = 1 ~ deg(n E) degs(n E) = n Proof. If (n,p) 1, n 2 is not divisible by p and degi(n E) which must be a power of p (and divide n 2 ) must be 1. (4.17) Corollarr 2. ~ n is prime to p, tn(E) ~ f;Z/~)2 and so for any prime
,f; p, T.t, (E) is a free
llt -module of rank two and Vt (E) is
a vector space of dimension two over
~t
.
Proof. We look at the following diagram of algebraic curves and morphisms
218
- 111.47 -
x
pl....-
xn(x)
1'1 ~
I
(4.18)
!Xn
E
(x ,y)
E
(xn'Yn)
!nE
r
where the horizontal maps are the first projections (double sheeted coverings with ramification points of index two at division points of order two) and the left vertical map is the rational function xn
= xn(x). It is obvious that if a ~pl is fixed, xn(x) = a has
at most n'solutions x £1'1. Let us prove that it has indeed ~distinct solutions except for finitely many exceptional values of a. Let us write xn(x)
=
u(x)/v(x) with the polynomials given by (4.10), (u,v)
If the polynomial u(x) - av(x) has a multiple root
=
1.
;, then
u (x) - av (x) = (x - ~) 2r (x) u (x) I v (x) = a
(u(x)/v(x))'
+
(x - () 2r (x) I v (x)
(x -
f) s (x)
with a rational function sex) (having v 2 as denominator, and v does not have the root
f), so that (u/v) , still vanishes at
discard the finite number of values where vex) that
E is
=
~.
If we
0, this implies
a root of u'v-uv'. This proves the assertion for all
values of a except those a = u(A)/v(A) where
A is a root of u'v-uv'.
(Observe that u'v - uv' is not identically 0 because u'v = uv' would imply that u'v is divisible by u hence u' divisible by u because (u,v) = 1, hence u' = 0, deg(u) divisible by p, which is not the case.) To contlude the proof of the corollary, it is sufficient to prove that the kernel of n E consists of m = n 2 distinct points: if n = t is a prime number p, ttt (E) = 0 impl ies t.(, (E) ';t 'llit'll >< 'll/.t'll and
r
from there we deduce t r (E) ~ (~/tr71) for r ) 1 and the result of the t corollary in general. To prove that this kernel has m distinct points, it is sufficient to show that one fiber n -1 (a,b) consists E of m distinct points of E. But take for a precisely one value a £1'1 219
- 111.48 -
-1
2
such that the fiber x n (a) has n = m distinct solutions al, ... ,am . For each point a k , there are two points on E above it, say (ak,b k ), (ak,-b k ) (not necessarily distinct). If for one k, b k = -b k , then b k = 0 for this k (we assume that p ; 2) and the compatibility of n E with the automorphism (x,y)
(x,-y) shows similarly that
~
in nE(ak,b k ) = (a,b) we must also .have b = 0. This implies that all b k = 0 (there must at most be m couples (ak,±b k ) above (a,O) and that the fiber above (a,b) consists of m distinct points in all cases. (4.19) Corollary 3. The group of points E
Ek is a divisible group
(when k is algebraically closed). f ~.
P
. 1 mapp1ng . Because t h e rat10na x : pI
n
the diagram (4.18) shows that n E : E if n is divisible by p)
~
~
p l .1S surJect1ve, ..
E must be surjective (even
Now we go to the other extreme and examine the case n = p. For that purpose we have to give a few prerequisites on the Lie algebra of E. Quite generally, if
~:
V --. W is a regular morphism
of algebraic varieties (defined over the algebraically closed field k), both of them having no singular point, then fis compatible with the local rings attached to these varieties, by definition. More precisely V and Q =
W, and if Rp c k(V), R C k(W) denote respecQ tively the rings of regular functions at P and Q, then RQ.~ c Rp . if P
E:
€
We denote also by Mp the maximal ideal in Rp consisting of functions vanishing at P, so that Mp is the kernel of the evaluation mapping f
~
f(P) on Rp , giving an isomorphism Rp/M p = k. We define the
algebraic tangent space of V at P by (4.20)
as k-vector space. Then the regular mapping
220
~
gives by composition
- 111.49 -
a k-linear map T(fJ p
T(V)p
= ~p
TCW)Q defined by D
~
~ D.~
*
(4.Zl)
When V
=
G is an algebraic group, the tangent space at the origin is
(by definition) the Lie algebra of the group T(G)e = Lie(G). The canonical injections jl,jZ: G maps
Lie(G)
Lie(G
~
--+a
G >eG
give rise to two linear
G), and if we use them, it is easy to see
~
that we can make the identification
Lie(G
~
G)
G
G gives rise to a k-linear
The multiplication morphism
m : G
map
Lie(G), and because
~:
Lie(G)
Lie(G)
$
is the identity on G,
~
~
~
~
Lie(G).
must be the
to be the sum map of the k-vector
space Lie(G). The diagonal morphism (A)e : Lie(G)
$
m·jl = mojz = lG
1'4.(jl)e = JA-e(iZ)e = lLie(G)
identity on Lie(G). This forces
has
~
Lie(G)
~
~
G --+ G
>co
G ,g .....-. (g,g)
(D,D)
Lie(G) • Lie(G) , D
as tangent map
at the oTigin. Putting these facts together, we see that the homomorphism of taking the square G ~ G, g ~ gZ composite
g
4
~
(g,g)
""
~
g2
has D
~
(D,D)
~
which is the
ZD as tangent map
at the origin. By induction one concludes that the tangent map at the origin of
g ~ gn
n· : Lie(G) --...Lie(G)
(n ~ 0) is the multiplication by n, in the Lie algebra of G. If n is a multiple
of the characteristic of k, the tangent map at the origin of g ~ gn is thus the zero map in Lie(G). A translation argument shows that the tangent map of
g ~ gn
at any point x
£
G is the zero map
of T(G)x if n is divisible by the characteristic p of k. E is an elliptic curve over k
We use these results when G
(hence E is a commutative algebraic group), and we still suppose that
221
- 111.50 -
the groundfield k is algebraically closed (hence perfect). (4.22) Proposition. If the positive integer n is divisible by the characteristic p of k, n
: E
E
E is inseparable. In particular,
~
degs(PE) = 1 or p · Proof. Write the rational function field k(E) = k(x,y) as quadratic extension of a purely transcendental extension k(x) of k, so that k(x,y) is separable over k(x) (we still assume p 1 2). Then if P
~
E,
x p = x -x(P) is defined and a uniformizing parameter at P for all but finitely many P's, and D = Dx = Dx for these P's. Now if p f: E ~ E , P ~ Q has tangent map identically 0 everywhere
= n E when p divides n), and if we denote D ~T(E)p = Derk(Rp,k) the derivation Rp ~ Rp ~ k, we shall
(which is the case for by
have
~
(')e(D) = O. Thus, for every f
€
RQ ' f.~ eR p
and
D(fof) = 0 implies D(f-f) ~ Mp Let us fix f € k(E). Then Dx(f·f) E taken over all P
£
(I Mp where the intersection is
E such that x p is a uniformizing variable and
f is regular at Q =
~(P).
As nearly all P
we see that the intersection in question
~E
n Mp
satisfy these conditions, =
{OJ is reduced to 0
and Dx(fof) = O. As Derk(k(E)) is a vector space over k(E) of dimension one, we conclude that D(fof)
0 for every derivation D of k(E)
(trivial on k), and that f0'fE::k(E)P. Hence k(E).
C
k(E)P C k(E) , , purely inseparable ,
inseparable (A more general proof could be given using results from the beginning of section (lV.I).) (4.23) Corollary 1. The group of points t (E) of order dividing p p
~
E is isomorphic ei ther to {I} or to
7llp7l.
Proof. The proof of (4.17) above shows that only the separable degree
222
- 111.51 -
counts for the order of the group Ker (n ) : Card Ker (n ) = deg s (n ) . E E E (4.24) Corollary 2. The Tate module Tp(E) is a free ~p-modu1e of rank 0 or 1 and the vector space Vp(E)
~ ~p
is of dimension 0
or 1. Proof. Observe that for two morphisms deg(o<.(3)
deg(~).deg(~), degs(~.~)
degs(p~)
degS(PE)
r
= 1 or p
r
E
~
E, we have
degs(<<)·degs(~).
~
= {I} or tl p
q.e.d.
The two cases occur indeed, but the case tpCE) valently T (E)
=
p
2
=
{I} (or equi-
{l\) only occurs when PE is purely inseparable, 2
2
k(E)P, and this implies (rather easily) E ~ EP , j = jP ,
'k(E)ePE j EF
2
In particular,
and
T (E) = {l.! or lim 7l/pr71 p
~,~:
•
Thus, this completely degenerate case only occurs for fini-
P
tely many k-isomorphism classes of curves (which can be defined over the finite field F 2). P
Using classfield theory (or rather Brauer's theory) and the natural representations of End(E) (or EndQ(E) = End(E)
.e -adic End~(E)
spaces
V
t
~
Q ) in the
(E,) (for all primes .(), it can be shown that
is of the following types a)
~
field of rational numbers
b) K a quadratic imaginary extension of
q in which
the ideal (p) splits completely , c) Hp,oo a definite quaternion algebra over ~ wi th discriminant p (ramified only at p and oa). In the case cl, End(E) must be a maximal order in
P,OO and this case occurs precisely when there are no points of order p on E. The finite number of invariants j
€
~
F 2 such that an elliptic curve p
with invariant j has no point of order p is thus the classnumber hp of the quaternion algebra
~
p ,00
. (For further indications on this
223
- 111.52 -
topic, look at the chapter on complements, especially (1~1.16) and (lV.l.23) ) •
224
CHAPTER
FOUR
COMPLEMENTS
As its title indicates, this last chapter provides some complements which could be considered as an introduction to the arithmetic theory of elliptic curves. In particular, the first two sections deal with properties of elliptic curves in characteristic p ; 0 (or defined over finite fields) which have no conterpart in characteristic
o.
Then the last section gives the first
elementary results on the reduction mod p theory which associates to an elliptic curve over
~
a family of elliptic curves over finite
fields. As usual, I have not been able to refrain from mentioning some more advanced results (without proof) and some standard open conjectures.
225
- IV.2 -
1. Hasse's invariant
Let k be a field of characteristic p
~
0 and K a finitely
generated extension of k of transcendence degree one, in other words, K is a function field of one variable over k. We have shown (TI.2.37) that if k is perfect (k P X €
=
k), then there is a transcendental element
Kover k such that K is separable over k(x). Such elements x will
be called separating elements for the considered extension. Quite generally, a function field K/k is called separably generated if it admits a separating element (and this is always the case if k is perfect). (1.1) Proposition. Let K be a separably generated function field
(~
one variable) over k. Then K = KP.k(x) = (KPk)(x) for every separating element x of K. In particular, if k is perfect, K
=
KP(x) for every
separating element x of K. Proof. Let us fix a separating element x
~
K so that K/k(x) is a
finite algebraic (separable) extension. By the primitive element theorem, there is an element y
£
K, separable over k(x), such that
K = k(x,y). We have the following diagram of field extensions
Now k(x) is (purely inseparable) of degree p over k(x P ) as is immediately seen (or follows from Luroth's theorem). If k is perfect,it follows then that the degree of Kover KPk
226
=
KP is also p, a prime
- IV.3 -
number, hence K is generated over KP by any element not in KP, e.g. x. The proposition is thus proved in the case k is perfect. In general, we have k(x)]
=
[K
and we shall show that [K : k(x)] P and the proposition will follow as
which will imply [K : KPk]
above. Let n -be the degree of y over k(x) : n = [K : k(x)] . Then yP is also separable of degree n over kP(x P )
=
k(x)P. Because k(x P )
is purely inseparable over kP(x P ), it is linearly disjoint from the separable extension kP(xP,yp) and this implies that the degree of yP over k(x P ) is still n which was to be proved.
(1.2) Corollary 1. Let D k. If D
r
Derk(K) be a derivation of K trivial over
£
= KPk.
0, then Ker(D)
Proof. If we select a separating element x proposition that every element y ~
L-
Y =
O'-i~p-l
~
£
K, it follows from the
K can be uniquely written as
i u.x 1
Then because D must be trivial on KPk, ~
D(y) If D
r
0, also D(x)
and D(y)
=
r
0 implies u
i
=
iuix
i-I
·D(x) l<:i
0 for l~i(p-l, hence y
= U
r
0)
o e:KPk.
This justifies in some sense the following definition. (1.3) Definition. The elements of KPk cK are called the p-constants of K whereas the other elements (i.e. elements of K - KPk) are called p-variables of K. (1.4) Corollary 2. Every p-variable of K is a separating element. Proof. If x is any p-variable, then K = KPk(x) and the k-derivations of K are consequently fully determined by their value D(x) on x.
227
- IV.4 Because K is separably generated of transcendence degree one (and finitely generated) the space Derk(K) is of dimension lover K, hence D(x)
~
K is arbitrary. This proves that every k-derivation
D €Derk(k(x)) has a unique extension to K, hence K is separable over k(x). (l.S) Lemma. Let K be separably generated (as before), and take two p-variables x,y of K. Then in the decomposition i (~) / (I) (u i E: KPk) , u.x dx x 1.
L
O~i~p-l
the constant term u is given by 0
(~)P/(I)P e: KP c KPk
uo
dx
x
Proof. Recall that the differential quotient
£i is equal
to ~ for
any non-zero derivation D £ Derk(K) (observe that Dx , 0 by Cor.l). Let y = a o + alx + + anx n wi th an- , 0, 1 ~ n ~ p - 1 be the decomposition of y relatively to x. Let K' be a decomposition field for this polynomial. As it is composite of fields of degrees dividing n
< p,
its degree must be prime to p and K' must
b~
separable over
KPk. The derivation D has a unique extension as derivation D' of K'(x) over k (and D' must be 0, the only extension, on K'). Then we have y
= a
IT (x
n l~i(n
-
(~i
E·) 1.
6 K')
•
Taking the logarithmic derivative, we get
hence
(~)/(l.) dx
x
L
l(i
(1 -
1
l-xP~iP
(l+x/~.+ .•. +xP-l/~~-l)) 1.
The p-constant term (coefficient of xo) is u
o
=
L.
l
(1 -
II (1 - xP~?))
228
L
~ ll,~n
1.
p
(1 -
1/(1-X~-1·l))1
- IV.S -
(1.6) Remark. If K is separably generated over k, so is KPk ~ k. Indeed, if x is a separating element for Kover k, i.e. K separable over k(x), then KP is separable over kPex P ), and a fortiori after the purely inseparable extension k(x P ) of the base field kP(x P ), we see that KPk is separable over k(x P ). This proves that x P is a separating element of KPk/k if x is one for K/k. If D is a k-derivation of K, we denote by DP the k~derivation of KPk defined by DP(x P ) = (D(x))P. Also, if d :.K
Diffk(K) denotes the canonical derivation of K,
~
and dP : KPk ~ Diffk(KPk) the corresponding derivation of KPk,
nP
then dPx P is the linear form
~ (nP,dPx P) = DP(x P )
= D(x)P.
Because D(x) =(D,dx), we see that (DP,dPx P) = (D,dx)P which we could also write dPx P = (dx)P. In any case, if x and yare p-variables of K
(~)p = dPyP/dPx P is the differential quotient of the two p-variables x P and yP of KPk. On the other hand, if x is a p-variable of K, and
Dx denotes as usual the derivation with respect to x of K (characterized by Dx(x) = 1), then DP(x P) = Dx(x)P = lP = 1 x
D (x P ) and DP = D E: Derk(KPk) x xP xP Now we turn to a more detailed study of differentials of K. By Prop. (1.1), every differential W of K can be uniquely written d d P-ldx, u.1 ~ ~ KPk o x + ulx x + ••• + up_Ix (once the p-variable x of K is fixed). However, it is more convenient W
=
U
to write it
(1.7)
CU
= f dx dx x = f ox
with p-constants f.1
£
+
f 1 dx
+
•••
+
f
P-
IX
p-2
dx
KPk. Then, all terms
f.xi-ldx = f.d(xi/i) 1
1
are exact differentials (1 form
c.J
can be wri tten
and an element F
€
CJ
~
i
~
p-l). Consequently, the differential
= fo dx + dF wi th a p-constant f x· 0
E:.
KPk
K (the element f o being uniquely determined by x).
229
- IV.6 -
This shows in particular that the space Diffk(K)/dK
~
k-differentials
of K mod exact differentials of K is of dimension one over k (with dx basis x mod dK ) · (1.8) Proposition. The operator Sx : Diffk(K) ~ Diffk(KPk) defined
by S (w) = f dPx P/x p for w = f dx/x + dF is well-defined independently x 0 0 of the choice of the p-variable x chosen to explicit it. This is a KPk-linear operator S called Cartier operator. By definition, it vanishes on exact differentials. Proof. Only the independence of the choice of the p-variable x deserves consideration, so let x and y be two p-variables of K. We use lemma (1.5) to compute S (~) = S (~/r.dx) = (~)P/(~)P.S (dx) x y x dx x x dx x x x and by the above remark (1.6) this is equal to dPyP / ~
dPx P
xP
• dPx P
dPyP
xP
This implies obviously Sx
=
one over k. Also note that
yP
=
S (~) Y Y
Sy , because Diffk(K)/dK is of dimension S(~)
=
0 is equivalent to
w
exact.
When the base field k is perfect, we can introduce the modified Cartier operator S' : Diffk(K) (1.9)
SI(f/: + dF)
~
Diffk(K) by
= f~/P:x
(fo~
KP.
f~/P
E:K).
This operator is not linear over KP, it is additive and satisfies the condition S'(fP~) = fS'(W) for fP ~KP (sometimes called p- l _ linearity).
w of K is called logarithmic (or logarithmically exact) if it is of the form -dx for some 0 , x E:K. (1.10) Definition. A differential
x
(1.11) Proposition. Let k be a perfect field and S' the modified
Cartier operator on differentials of the function field of one variable Kover k. Then a)
s' (w)
=
0 is equivalent to co exact
230
- IV.7 -
b)
is equivalent to
= ~
S'(~)
c) S' (fPw
+
gPlJ) = f S' (w)
+
logarithmic
~
(f,g e: K,
g S' (v)
W,Y E:
Diffk(K)).
These conditions characterize uniquely the operator S'. Proof. Everything is obvious except perhaps b). But if we have
S'(~)
=w,
flip, hence f €:IF c k. Take the integer o 0 0 P a congruent to f o mod P and such that 0 ~ a ~ p-l. Then W =
f dx wi th f
tV
Ox
= f dx = a dx = o x
x
ili2 a
is logarithmic
x
It would not be difficult to see that the operator S' leaves invariant the subspace of first kind differentials of K, but we give a more precise result in our case of interest, namely the case of elliptic curves. (1.12) Theorem 1. Let k be a perfect field, 1
1 Ae
k~,
K~
the
elliptic field of k-rational functions over the elliptic curve E
A
y2 = x(x -1) (x - A). Then the image
given in Legendre normal form
of the first kind differential dx over E y
under the modified Cartier
operator is given by
S'(~) = H (A)l/p dx y
Y
p
with Deuring's polynomial
HpC'-)
C-1/-
t (~)\i i=O 1
(t
=
!(p-l)) .
Proof. We write dx with
t
y
= y - py P-1 dx = y - px t (x-I).t (x-A).L dx ,
= 1 (p-l). Then S' (dx/y) =
(l/y) S' (xl (x_l)l (x-A)l dx) .
We use the binomial expansions (x -1 ).e
L
Cx-~)'2
L C~)xj c-"l-j
(~) x i 1
( -I)! - i
and note that it is sufficient to compute the coefficient of xp-1dx in xl (X-l)t (x-Af dx because all other terms give exact differentials.
231
- IV.8 -
Thus we compute the coefficient of x
L
.
.
,
(~)(~)(_l)i+j
1+J=~
J
1
t in (X-l)-L (x_')..)l. It is p CA)
;..t-j
H
whence S'(dx/y) = (I/Y)S'(Hp(~)XP ~)
,
!. H (,,) IIp dx . This gives x
p
y
the assertion of the theorem. The roots of the polynomial H
P
Indeed, 4 ( -1 ) H ( 0) p
=
1 "
0 nor .A = 1.
are nei ther.A
(~)
~
0 , (-1 ) Hp ( 1)
t 2 L (~) i=O 1
=
t: having all
and (Zf) is the quotient of (P-l)(P-Z) ••. (P;I) by
their prime factors (p, hence not congruent to 0 mod p. This shows the existence of certain elliptic curves E
A
with S'(dx/y)
=
0,
hence with exact first kind differentials. (1.13) Definition. An elliptic curve is called supersingular when its
first kind differentials are exact. Hence we have shown the existence of finitely many supersingular elliptic curves Eft in all odd characteristics. Because the notion of first kind differential (and of exactness) only depends on the rational function field K
A
k(E~)
two values of
~
leading to
isomorphic curves, i.e. to the same invariant j(A) (as given in (I.4.3)) give simultaneously supersingular curves or not: supersingularity is a property of the invariant j of the curve. In particular, since A and I - A
lead to the same value for the invariant j,
the two polynomials HpeA) and Hp(l - A)
(of the same degree) must
have the same roots. This implies that they are proportional
Hp (A) = cpH p (1 - A) with a constant c p such that c~ = 1, hence c p = !l ~Fp (note that this gives another proof of the fact that Hp(l) fOE:: lF p ) . Just for fun, let us compute the exact value of c p . By definition, we have (p-l) !
(! (p-l)) 232
!
2
-
and
IV.9 -
n-l n-l n-3 (p-l)! = 1.2·3· ... · ( Z ) ( p - Z ) ( p - Z ) ... (p-l)
is congruent mod p to
(P;1)l2(_1{ This proves c
(_l)t
p
(1. 14 )
(and also to -1 by Wilson's theorem). and leads to
H (1 - ).)
p
= (-
II Hp (). )
.
It is obvious that (1.15) (1.16) Theorem 2. The Hasse invariant of the family
Hpo.)
E~
is given by
= (-llFCLl:1:A) E:Fp[[).JJ
with Gauss' hypergeometric function F = 2F1 mod p. Moreover, the roots of Hp(A)
=
0 are all simple, and the number h of supersingular
invariants j (corresponding to the roots of Hp ) is given by the formulas h
1 12
(p - 1)
1 TI
if P
-
1 mod 12
(p - 5) + 1
p
-
5
(p - 7) + 1 1 . n(p - 11) + 2
P
-
7
1 12
P :: 11
if P
1
= 2
or 3
In other wor.ds, if we use the quadratic residue symbol of Legendre
h = ~(1 - (-;)) + t(l - Cp4~) + /2 (p-1) for all primes p. Proof (for
p~2).
Let us first check that we have the following
congruence of binomial symbols mod p, when The congruence has to be understood in
~
p
p~2,
k~!(p-l).
, so that mod p really
means mod p~ . The left-hand side is (p-1)(p-3) ... (p-2k+1)/Z kkl while the right-hand side is (-1) (-3) ... (-2k+l)/Z kk! . Because the
233
- IV.lO denominators 2k k! are units in ~ p for p amounts to the trivial congruence
r
2, the asserted congruence
(p-l) (p-3) ... (p-2k+l) - (-1) (-3) ... (-2k+l)
iii
0
(p)
In fact the binomial coefficients (-~) are rational numbers with powers of 2 only in their denominators (look at the proof of ([.5.28)) _1
(~)
k
>
€
~rI]
c
~p (p prime ~ 2), and these numbers are in p~ when
I(p-l) so that we have
t
00
(-l)!H (A) =L.(t)2 Ak =L(-I)2>.k = F(},}:l:,A) €F [fA]] p k=O k k=O k P But this hypergeometric function satisfies the differential equation (hypergeometric differential equation with a
" + (1 -
A(1 - A ) F
2~)
F
,
- 1F
b = I, c = 1) 0,
Where we consider this equation as differential equation for formal series. This proves that the roots of this function are simple ( F(A) = F'(A) = 0 implies F"(A)
o because
we have already checked
that the roots are neither 1 nor 0, and differentiating, we would get F
" , (A) = 0, ... , hence finally F = 0 as formal series). It only
remains to prove the formula for the number of supersingular j's . I
recall that (1.4.3) j = 28 X- 2 (1 - A) -2(1 - A +tX 2 )3
(and this formula defines j in characteristic p = 3), and thus the affine
X-line is a covering of the affine j-line (Luroth's theorem)
of degree 6, with ramified points over j
o of
ramification indices
3 corresponding to the roots A.1 (i = 1,2) of 1 - .A + .A 2 = 0, and over j = 12 3 with ramification indices 2 at the points A= -1,},2 . Here we have implicitely assumed p ; 3 (and p ~ 2) because if p = 3, 12 3 • 0 mod 3 shows that all ramification points collapse over j = 0 3 and give an index of ramification 6 at A = -lover j = 0 = 12 E: F 3 • Before giving the general discussion, let us treat the case of charac-
234
- IV.Il -
teristic p
=
3. Then H is of degree one, has only one root and 3
there is only one supersingular invariant j. But all values of
A
corresponding to this value of j must also be roots, so that the
A= -1 of ramification index 6 above
only possibility is
Indeed, H3 (A) = - 1 - A case p
>3
such that
H
=
= 0 = 12 3 .
j
A = -1 ! Consider now the'
0 also gives
along similar lines. Suppose that there is a prime p p
=
0
does not have roots corresponding to j =
(I do not claim that such p exist,
0
or 12 3
but it will be a consequence of
the discussion that they do exist!). Then, for each value of supersingular j, there will be six values of A above j, and we shall have the formula
~
= l(p-l) = 6h, or p
l2h
+
1 and this shows that
P would necessarily be congruent to 1 mod 12 in this case. Suppose now that p is such that j = 0 is supersingular but j = 12 3 is not supersingu1ar in characteristic p. Then the h-l supersingular values of j distinct from 0 must come from six roots
Aof
there will be only two corresponding values for
t
1 (p-l)
6(h-l)
=
~
Hp and above j
=0
:
+ 2
In this case, we would thus have p
l2(h-l)
+
5
Similarly, if p is such that j
-
5 mod 12. 0 and 12
3
are supersingular in
characteristic p, we get
l = 1(p-l)
6(h-2)
hence p
=
l2(h-2)
+
11
+
= -1
2
+
3
mod 12 .
Finally suppose that j = 12 3 is supersingular but j
o
is not, in
characteristic p, then
leads to
t
l(p-l)
p
l2(h-l)
6(h-l)
= +
7
+
= -5
3 mod 12 .
Because these different conditions for p are mutually exclusive, they
235
- IV.12 -
must characterize the different cases and all occur (if we use Dirichlet's theorem about primes in arithmetic progressions, we see that all these cases occur infinitely many times - with the same density). Moreover, we have found the following formulas (where
E
p
denotes the finite set of supersingular invariants j in characteristic p) h
Card(r ) p
1 1 2 (p-1) if P _ 1 (12) case
I:
~
0,12
1 1 2 (p+7) if P _
0 €I:
, 12
5
IJ
l2(P+13) if p :-1
II
11 (p+5) if P 2
JJ
-5
;0
p
p
0,12 0
~I:p
3
3
3
~
I:
p
,
E:I: p
3 , 12 E:I:
p
This concludes the proof of the theorem, giving more precisely the cases when 0 or 12 3 are supersingular (in characteristic 2, there is only one supersingular curve, namely y2
+
Y
= x3,
and with a sui table
definition for its invariant - given by Deuring - it gives
E
Z=
{a} )
Although we have already used the term "Hasse's invariant" of the curve
E~
as being the value Hp(A) of Deuring's polynomial, let
us give a formal definition now for this terminology. First, we observe that by definition, S'(dx) y
=
H (A)l/p dx p
y
where S' denotes
the modified Cartier operator, hence also
S(dx) = H (A)(dx)p
(1.17)
y
p
y
In general, if E is an elliptic curve given with a first kind differential
W;
0, we define the Hasse invariant
by
S(w) = H(E,W) wP
(1.18)
If we replace
H(E,~)
W
by any other first kind differential, say
fAJ'
= aw
with ae k~, then obviously H(E,aw) = al-PH(E,~), so that we can say that the Hasse invariant is defined up to multiplication by elements of the form a l - p independently of the choice of the basic first
236
- IV.l3 -
kind differential. When the elliptic curve is given by an equation yZ
f(x) with a cubic polynomial f having all its roots distinct,
the choice of first kind differential
W=
dx is sometimes implicitely y
made, and we have dx p-1 (1.19) H(E,)') = H(E) = coefficient of x in f(x)}(p-l) as in the proof of Th.l (l.lZ). Let us take now E in Weierstrass form y2
4x 3 - &2 X
-
&3 and
let us compute its Hasse invariant (with respect to d;) in function of &2 and &3. We have to find the coefficient of x 2t before) in (4x
3
1,
- gzx - g3)
(t = !(p-l) as
· Using twice the binomial expansion
formula in a suitable way we get t -t-m 3 t ~ ~ t! 4 t-m-n m()n J.l-2rn-31\ (4x - gzx -g3) = l - L- m!n!(t-m-n)! (-gZ) -&3 x m=O n=O and because 3n = t - Zm implies n ~ ~ -m, the required coefficient is
L
1 m+n 4 t-m-n t! mn (- ) m! n! (.t-m-n)! gZ&3 Zm+3n=.e This is an isobaric polynomial of degree in gz and g3 if these (l.ZO) H(E)
e
elements are given the respective weights Z and 3. On the other hand, the Eisenstein series
E~
can also be expressed as isobaric
polynomial of the same form in g2 and g3 ' but coefficients in characteristic 0 (1.3.10). The result is that (1.21) L~t
(Deligne) .
us just sketch a proof of this formula. We compare the q-expan-
sions of both sides. By Ek(z)
=1
+
with
(1
.4.1)
¥k ~ 02k_l(n)qn
(q
e 2xiz
~(z)
)
n~l
(B I
= 1/6, B2 =
1/30, B 3
= 1/42, ... ).
But the denominator of Bk is the product of the primes m such that m-l divides 2k (von Staudt), so that~ is always divisible by p : The q-expansion of (E
t mod p) is identically 1. We show that the
237
- IV.14 -
same is true for the q-expansion of H(g2,g3). To find this q-expansion, we evaluate the value of the Hasse invariant of Tate's curve K>CI q1l
=
with the local field (of characteristic p) Fp((q)) valuation of this field is defined by ord(q)
=
K (the additive
1, hence Iql
!p
<1)
The differential dx takes the form y
dx _ dP )( - DP
dP (l/X)(d/dX)(P)
dX
Jf
Formally (this has to be justified), this is a logarithmically exact
= (dx)p and H y
differential, hence S(dx) y
=
1 also. This shows that
the two q-expansions in question coincide and (1.21) follows from this. We mention a few related results without proof. First let
E be an elliptic curve defined over the finite field F q of characteristic p (say q
=
pf). Then the number N of rational points of E
(including the point at infinity) with coordinates in F
q
satisfies
the following congruence (1.22)
N
=1
- H(q-l)/(p-l)
*)
mod p
Here again, H denotes the Hasse invariant of E, with respect to any
F q -rational first kind differential: if a (aI-PH) (q-l)/(p-l)
~
F qX
H(q-l)/(p-l)
F
€
P
Suppose now that E is supersingular, i.e. H
= o. Then the above
congruence, applied to all finite extensions F , of F in a fixed q q
Fp
algebraic closure
shows that
EOF p )
has no p-torsion. A consequence
of (1.22) is thus that the supersingular elliptic curves have no point of division of order p. Conversely, if H ; 0, the norm of H in an extension F
qrn
of F
p
being the mth - power of the norm of H
relatively to the extension F q IF p
will be 1 for large m, and
this will show that E has division points of order p (rational over (
*) Observe that H q-l
) / (p-l)
= HPf-l + ••• +1 = N0InF IF (H) e:. lF q P
238
p
- IV.lS -
the algebraic closure of F p ). In fact one can prove the following equivalences : i) The Hasse invariant H(E) is zero ,
ii) EOF ) has no point of order p p
iii) The ring of endomorphisms End(E) is
(1.23)
not commutative, iv) A power nn of the Frobenius (x ~ x q ) of E is equal to an integer (necessarily pn/z. 1E ) . Thus anyone of these properties characterizes supersingular elliptic curves. Moreover, if E is supersingular, multiplication by p gives an isomorphism
purely inseparable of degree p2, hence E E is isomorphic to its p2 power E ~EP2 Consequently j = j(E) jP
2
p.1
and this shows that all supersingular invariants are in F 2 • P
(1.24) Remark. As in characteristic 0, we say that an elliptic
curve in characteristic p is singular, or that its invariant is singular (to avoid confusion with singular points) when its ring of endomorphisms is different from
~
: End(E)
invariants are exactly the elements j ~F
p
.
~~.
The singular
When moreover End(E)
is not commutative, the corresponding singular invariant is supersingular. This explains the terminology. (1.25) Curiosity. We may observe the following formulas using the classical polynomials of Legendre
Also (1 - A)l p
,f.
(~) 1 - A
t
=
L.. i=O
~
. First we have
(~)2. Ai 1
so that the Hasse invariant of Legendre's family is given by H (A)
P
=
(A-I)l p
t
(~) mod p I -
A
239
(.t = j(p-I) )
- IV.16 -
2. Zeta function of an elliptic curve over a finite field
Let V be a non-singular, irreducible, projective plane curve defined over the finite field k = F q of characteristic p , 0 (q = pf) and let K be the function field key) of k-rational functions on V. (Actually, we could start with any function field K of one variable over F q , but it may be more suggestive to speak of points of the curve V than of places of the field K.) As usual V denotes the set of points in an algebraic closure k of k : V
V (k)
c
p2 (k)
and Div(V) is the free abelian group generated by the points of V (finite formal combinations of points of V with integral coefficients). The subgroup of rational divisors Divk(V) C Div(V) consists of divisors ~ = L:dp(P) having equal multiplicities on conjugate points dp
d pr (for every automorphism tr
of a divisor
~
E:
Gal (k/k) ) . We define the norm
by the formula N(~)
= qdeg(~)
and the zeta function of V by the formal Dirichlet series l;V(s) =
(2.1)
L
~~O
N(~)-s
We shall see that this series converges absolutely for Re(s)
> 1, but
we treat it formally now. A positive divisor
~ £
Divk(V) is called prime (rational over k)
if it cannot be expressed as the sum of two positive non-zero divisors. Obviously prime rational divisors are of the form
1
=
L
(per)
sum over a class of conjugate elements of V.
The degree of such a prime rational divisor is the number of conjugates of any point occurring in it, and if k(P) is the field generated by the coordinates of P (one of them being normalized to 1) over k, this degree must be the number of automorphisms of k(P)/k, hence
240
- IV.17 -
de g (1)
=
[k(P) : k] • Because every positive divisor can be expressed
uniquely as sum of prime divisors (rational over k), we see that (2.1) is equivalent to
11
~ Y (s)
(2 • 2)
r pr1me
(1 - N ('P) - s) -1
product extended to the (positive) prime divisors over V, rational
n
over k. If we take u = q-S as variable, this formula reads (2.3)
= 'v(s) =
Zy(u)
1> pr1me
udeg~))-l
(1 -
•
Let us take the logarithms of both sides (I recall that we are making formal computations so far) 10gZ y (u)
=
L.
Pmd L1> L u eg('}» m)l
=
L
But if
l'
= L(p
or that
k(P)
then that
r
deg(~)
j
n>l ),
cF
10g(1 - u
('jl))-l =
1m = Ldeg('b)
L
ldeg('P)jn
the condition
deg
'1'
T
fn>1
m degq:» u mdeg(1))
deg(~)l un/n
deg(~) In
f
means that [lFq(P) : lF q]
In,
,and is equivalent to P ~ YOF ). If we recall qn qn is precisely the number of distinct conjugates of P,
we obtain the expression (2.4) with
log Zy(u) N n
=
=
L.
n)l
Card Y (IF) qn
N unln n
L.
deg(~)/n
deg (r)
Two equivalent formulas are (2.5)
0)
(2.6) Because N
n
Card Y(lF n) q
~ Card p2 (IF n) q
q 2n + qn + 1 ,
we see that the series (2.4-6) converge absolutely for lui
<
q
-2
and thus for Re(s) > 2, thereby proving the convergence of (2.1) for large Re(s) and the legitimacy of the formal computations in that domain. Also, if V is a straight line, we see from (2.6) that 241
- IV.18 1
(l-u)(l-qu) because Card pI (IF) q
qn
n
1
+
Thus the zeta function of a line has a meromorphic extension to the complex s-plane as rational function in q-s. This fact is general, and one can prove that the zeta function of a curve of genus g has the form P
(2.7)
z
(u)
= (l-U~(l
Zy(u)
-qu)
with a polynomial Pzg(u) E:. Z [uJ of degree Zg in u, satisfying P2g(O) = 1,
P
2g
(1) = N , 1
P
2g
(u)
qgu2gP2g(1/ qu)
This implies the functional equation 'y(l - s)
=
q (1 - 2s) (1 - g) 'yes)
A. Weil has also proved that the zeros of these zeta functions all lie on the critical line Re(s) = ! (for g = 1, this was due to H. Hasse). The main interest of the explicit knowledge of the zeta function of a curve as rational function in u resides in the fact that it is equivalent to the knowledge of all number of points Nn of V (over the extension of degree n of k
= F
q
). Actually, using the
functional equation, we see that the determination of the g coefficients of u, •.• , u g in the polynomial P
suffices to determine
2g
completely the zeta function (and hence all Nn for n
~
1). For
an elliptic curve, the rationality of Zv is very easy to show. (Z.8) Theorem. Let E be an elliptic curve defined over k
=
Fq
(i.e. an absolutely non-singular projective plane cubic over F q with one rational point chosen as neutral element). The zeta function 'E(s) extends as meromorphic function of s in the whole complex plane and satisfies
4E (1
- s)
=
(E(s). This function is rational in u 1 - (1 +q - N1 )
U +
(1 -u)(l -qu)
242
qu
Z
=
q-S
- IV.19 -
Proof. The function field keEl of k-rational functions over E is of genus one (IT.2.26) (when p f 2, which we will suppose for this proof; because we assume that there is a rational point P
~E(Fq)'
the proof of the reference given is still valid although k is not algebraically closed). Let DiV~(E) denote the set of k-rational divisors on E of degree n ~ 0, and Div~(E)/P(E) the set of classes of divisors of degree n mod principal divisors (this has a meaning because the principal divisors have degree zero). From Abel-Jacobi's condition, we see that this set is parametrized by the rational points of E (the sum of the affixes of the points in a rational divisor over k is a point of E which must be equal to all its EQf ) ), and in particular has a finite conjugates, hence be in E k q number N of elements equal to the number of rational points NI on E (and in particular independent of the degree n in Div n ). On the
other hand, it is easy to compute the number of positive divisors in a given class (of degree n ~ 0). For ~ £DiV~(E) and f ~ +
div(f)
~
0
~
~ ~
-div(f)
f
E:
€
k(E),
L (~)
and thus the number of positive divisors in the class of d is the number of principal divisors div(f) with f
~ L(~),
f f
o.
Since a
function is determined up to a multiplicative constant by its divisor, this number is qdim L (~)
_ I
q - I
Because the field keEl is of genus one, dim deg(~)
> o.
This shows that ~
d=deg
d
q
L-
~~1
- ds .9-...:...l N q - I
243
L(~)
deg(~)
if
- IV.20 -
(I-S/(1 -q 1-5) -q -5/(1 -q -5)) . = -N- q q - 1
q-S we get (adding the term 1 corresponding to
Hence putting u d
0) 1 + q
~ 1 (qui (1
- qu) - ul (1 - u))
1 - (9 + 1 - N)u + (1 -u)(l -qu)
qu 2
which is the desired expression. One checks immediately the functional equation on this "explicit" formula. On this explicit formula we see
< l/q Re(s) > 1
that the zeta function ZE converges for luI
Iq-sJ < l/q
, qRe(s) =
IqSj > q ,
or for
(we had only checked the convergence for luI
> 2
after (2.6) ) • There is a formula for the number of rational points for an elliptic curve over F p given in Legendre form y2 = x (x - l) (x -
A)
(A E: F ) • p
For each finite x £F p ' there will be no point, one point, two points on E (F p ) wi th first coordina te x if x (x - I) (x - A) is resp. #
not a square, zero, a square of F p . Using the quadratic residue symbol, this number is (x(x-l) (X-A)) + l , and if we add the point at p
infinity, we find (in this case) ( 2 • 9)
N = 1 +
L.. { (x (x -1 ) (x - ).))
xEF
p
(We use the convention (~)
+
1}
P
= 0 when a = 0 for the quadratic residue
symbol.) (2.10) Remark. If we write the numerator of the zeta function of
the elliptic curve E in the form Z (2.I ) I - cu + qu Hasse has shown first that the inverse roots
~. 1
have absolute value
ql . This proves that the zeros of this zeta function are allan the
244
- IV.21 -
critical line Re(s)
= }.
The corresponding general property for the
zeta functions of curves (any genus) over finite fields has been established by A. Weil (see the references concerning this chapter). Finally there is result connecting the zeta functions of two isogenous curves:
if E and E' are two elliptic curves defined over
F q , and if there exists a non-zero homomorphism E
--+
E' (defined
over some extension of Fq ), then the zeta functions of E and of E' over Fq are the same.
245
- IV.22 -
3. Reduction mod p of rational elliptic curves
Let k be a number field (finite algebraic extension of ~
~)
and
a (non-zero) prime ideal of the ring of integers of k. By elliptic
curve over k, we always mean a non-singular (over
k) projective
plane cubic defined by a polynomial with coefficients in k,with one rational point over k chosen as neutral element for the group law (usually, this neutral rational element will be on the line at infinity with a suitable coodinate system in the projective plane). (3.1) Definition. The elliptic curve E over k is said to have good reduction mod
~
if there exists a suitable coordinate system in
the projective plane p2 in which E is given by an equation with
coefficients in the ring of 1-integers ,
of k, this equation mod
~
still defining an elliptic curve (non-singularity condition) over
the residual finite field :IF q
=
-<'-k/1 = ~/'P~ (q
We shall always assume that the prime ideal
~
=
Nk/(Q (~) ) ·
is not a divisor of
6 (hence q prime to 2 and 3), so as to be able to work with Weierstrass equations. If E has good reduction mod
~
, the reduced curve
has thus a Weierstrass equation, and the original curve can be
chosen in Weierstrass form with a discriminant
~ €~~
with
ord~6
=
(because the non-singularity condition for the reduced Weierstrass
equation is precisely (A mod~) equation for
E
= 6 f 0). If we take any Weierstrass
over k, its discriminant will be in k with
ord~A
=
0
for nearly all (i.e. all but a finite number including the divisors of 2 and 3) prime ideals of
~k
' hence :
(3.2) Proposition 1. Every elliptic curve over k has good reduction mod~
for nearly all prime ideals (not dividing 2 and 3) of the
ring of integers of k.
246
0
- IV.23 -
(3.3) Proposition 2. Let
~
be a prime ideal (not dividing 2 or 3) of
the number field k, and E an elliptic curve over k. reduction mod
1'
its absolute invariant j
has good
~E
j(E) satisfies the
=
following conditions a) or~j ~ 0, ord j
=0
b) ord (j - 12 3 )
mod 2
Conversely, if j
E
1
1 =0
mod 3
k satisfies these conditions, there exists an
elliptic curve over k with invariant j having good reduction mod Proof. If E has good reduction mod y2
4x
3
- g2 x - g3 with
its invariant j
ord~gi ~
l23g~/~
=
=
20rd~(2
, it has a Weierstrass equation
~
ord~L11 o. This implies that
0 and
satisfies ord j
sible by 3), and similarly for j - 12
1.
3
=
16
30rd~(l2g2) ~ 0 (divi-
=
6
2
2 3 g3/11
.
3
,0rd'P(J - 12 )
=
3 3
3 g3) must be even (and positive). Conversely, we assume
that j satisfies the enounced conditions. The cases j =
0
and j = 12
3
are easily treated separately, so we assume that j does not take these two special values. Define then g = 27j/(j-12 3 ) (r 27, r 0 because j
r
We have g - 27
2v (g) + v (g - 27)
=
ord
1
.
Then v(g2(g - 27))
2v ( j) - 3v (j - 123 ) mus t be a mu 1tip 1e
hypothesis. Choose a e: k such that v(a 6g 2 (g - 27)) 2
3
g2 = a g, g3 = a g. Then the elliptic curve y has still the invariant j, has a discriminant and 0
~
v(j)" = 3v(12g ) implies v(g2) 2
and using v(gZ)
j.
27.12 3 / (j - 12 3 ), and the discriminant of this curve
is g2(g - 27). For simplicity write v =
= 4x 3 - gx - g has invariant
0), so that the elliptic curve y2
~
0, we get v(g3)
~
~
o.
2
8
= 0
0
f 6 by
and put
= 4x 3 - g2 x - g3 satisfying
Finally, since
0
v(~)
v(~)
=
0
o.
Observe that this proposition does not mean that any elliptic curve over k with invariant j satisfying the conditions has good reduction mod ~, but it is isomorphic over k to an elliptic curve over k with good reduction (another k-form of the given elliptic curve). 247
- IV. 24 -
The preceding proposition shows that the condition sufficient to imply the good reduction mod
Ord~j ~
0 is not
of an elliptic curve
~
over k of invariant j. However: (3.4) Corollary. Let E be an elliptic curve over k with invariant j,
ord,j
~
o.
Then there exists a finite algebraic extension k'/k such
that when considered as elliptic curve over k', E has good reduction mod
p for
1.
all prime ideals (of 6- ,) dividing (This is the potential k good reduction of E at as defined by J.-P. Serre and J. Tate.)
1
Proof. It is sufficient to take for k' a normal extension ramified at ~
with indices e(,ll) (independent of ~
dividing~) multiple
of 6. Then all conditions of the proposition will be satisfied. Let S be a finite set of prime ideals of the ring of integers of k. We denote by
~(S)
elements such that ord,x
= ~[S]-l ~
the subring of k consisting of
0 for all prime ideals
not belonging to
~
S (fractions having denominators in S). If S is given, we can always enlarge it in a fini te set S'
:J
S such that
-OCS') is a principal
ideal ring. This follows from the fact that the ideal class group of k is finite combined with the characterization of ideals in a ring of fractions (if
~l'
...
'~
are representing ideals for the
class group of k, it is sufficient to take for S' any finite set of ideals containing S and the prime divisors of the
«.1 's).
(3.5) Proposition 3. Let S be a finite set of prime ideals of
such that
~(S)
is principal, and let E be an elliptic curve over k.
E has good reduction mod ~
for all prime
1 ~ S if
has a Weierstrass equation with coefficients gi€
6.
3
2
~k
and only if E
~S)
and
><
g2 - 27g 3 € ~S) · Proof. That the conditions are sufficient is obvious. Conversely, =
we have to establish the existence of one Weierstrass equation with simultaneous good reduction outside S. Let y2
248
=
4x 3 - gzx - g3 be
- IV. 25 -
1
any Weierstrass equation for E over k. If
is a prime not in S,
there exists by assumption, a Weierstrass equation for E over k, of the form yZ
= 4x 3 - gZ('!»x - g3('I»
Necessarily
gZ(~)
with some
a(~)
gi·a4(~),
with
gi(~h::: -&1'
and
~~)E. ~
= g;.a6(~) and ACt»
g3('P)
•
= A'.aIZ(~)
€ k. Because the first model (with the primes) has
good reduction at nearly all primes (not in S) of k (Prop.l), we may choose gi(~) = gi (i=2,3) and a(~)
n ~or~a('P)
Now the ideal generated by
for nearly all'P~S.
= 1
in .ocS) must be principal,
11£S
and this proves the existence of an element a ord1'a . Th e trans f ormat10n
0
=
ord~ a (1) for all ~
f coor d·1nates x
3
4x
4, .A. g2 = a g2 € 'v(S)
6, A. a g3 E:. v(S) and
g3
k with
s ·
a -2 x,
~
give the equation yZ
fi:.
~
y
~
- gzx - g3 for E over k, with A L.\
=
a
l2
A,
,.c
u E ~S) as desired.
(3.6) Theorem (~afarevi~). Let S be any finite set of prime ideals of ~k. Then the set of k-isomorphism classes of elliptic curves with good reduction outside S is finite. Proof (modulo a theorem of Siegel). Let us enlarge S until it contains all divisors of 2 and 3, and
~(S)
is principal. By proposition 3,
any elliptic curve over k with good reduction outside S has a Weierstrass equation yZ
4x
3
- gzx - g3 with gi
€
1s)
and
L1 E.~;)
Now A is well-defined mod (~(~))IZ, and by Dirichlet's unit theorem,
~~) is finitely generated - more precisely isomorphic to the product of a finite abelian group by a free abelian group of rank Card(S) - 1, -where
S denotes
A,,c / places 0 f k - so that v(S)
the union of S with all the archimedian
A.~ 12·1S (~(S))
f inite. It 1S . t hus su ff·· lC1ent
to prove the theorem when the discriminant
~3 _ 27,2 = ~
l:i is fixed. But
is the equation of an elliptic curve
~Z = (~/3)3 - ~/Z7
(invariant j
249
=
0).
- IV.26 We are looking for the points on this elliptic curve, having coordinates
E= g2
and,? = g3 in the ring -<'-(S). Siegel's theorem asserts
that there are only finitely many such points. (3.7) Remark. Siegel originally proved that on any affine curve of genus g
~
1, there are only finitely many points with integral coordinates.
Mahler and Lang proved an improved version of this theorem replacing the ring of integers
~k
by rings of the form
~S)
(S as above).
The full proof of Siegel's theorem uses the Mordell-Weil theorem through Jacobians, hence is quite difficult. However, (3.6) only uses the result of Siegel in the case of an elliptic curve (g of absolute invariant j
o
= 1)
(essentially, only Mordell's part,for
this curve, is needed). Let E be an elliptic curve over k, S a finite set of prime ideals of
(containing the divisors of 2 and 3) such that E has
~k
good reduction outside S. If
1¢ S,
tic curve over the finite field function of E
1
the reduced curve
E~
is an ellip-
and the numerator of the zeta
~/f
is
1 - a~ N (1')
-s
+ N (f)
l-2s
with a
r=1
+ N (1') - Card E"e~/'P) •
The Hasse-Weil zeta function of E is defined (up to a finite number of factors) by the infinite product (3.8)
IT
l' ¢s
(1 - a N( )-s + N( )1-2s)-1
l'
l'
l'
This infinite product is known to converge absolutely for large Re(s) and defines a holomorphic function in a right half-plane. There are several standard conjectures (verified for special curves and for elliptic curves admitting complex multiplications) concerning the meromorphic extension of this zeta function to the whole plane, and a functional equation under the transformation s
~
2 - s. It is
also conjectured (Birch and Swinnerton-Dyer) that the order of the zero at s = 1 of this function is precisely the rank of the group 250
- IV.27 -
E(k) of rational points over k. Remember that by the Mordell-Weil theorem, this abelian group is finitely generated, hence product of a finite group (torsion subgroup) by a free abelian group Zr wi th a rank r defined by r
dim E(k) ~ (Q • Q (3.9) Remark. To obtain the simplest functional equation possible, =
it is necessary to define all factors of the zeta function of E (not only those for
l' tt
S where E has good reduction). For
dividing 6),. we take a Weierstrass equation of E over k ~-integral
coefficients gi(P) and discriminant
possible order. Thus 0
:r;;
precisely when ord" 1:1 C'P)
< 12.
ord'P A (1')
>
1
~(P)
'P
(not
with
of minimal
Bad reduction occurs
0 and then the reduced curve mod p
is
an irreducible cubic (in Weierstrass form) with either a node (two roots only of the cubic polynomial mod
p
coincide, leading to a
double point) or a cusp (the three roots of the cubic polynomial coincide). The addition law of E reduces in the first case
mod~
in a multiplicative group on the set of regular points, and in the second case, it reduces to an additive group on the set of regular points (cf. (H.2.5) and CTI.2.6)). Thus the nodal case is said to correspond to multiplicative reduction, and the cuspidal case to additive reduction. The Euler factor attached to such a place with bad reduction has then the form ( 3 • 1 0) with
(1 -
~
=
d1' N(~) - s) - 1
±1 in the nodal case (the +1 being chosen when the tangents
at the node are rational over
~k/~
' i.e. when the two points in the
normalized curve sitting above the node are rational over that field), and
~ = 0 in the cuspida1 case. A prime
~
giving cuspidal reduction is more degenerate than
a point with nodal reduction, and in fact the cuspidal reduction can
251
- IV.28 -
be avoided by enlarging the groudfield k (which cannot be done with the nodal or mUltiplicative reduction: this is why this reduction is also called semi-stable). (3.11) Proposition. Let E be an elliptic curve defined over the
number field k, and let
1
be a non-zero prime ideal (not dividing 6)
of the ring of integers
~k
of k. Then there exists a finite extension
k' of k such that additive reduction never occurs for ideals ~' in k'
rp.
dividing
Proof. Let y2 • 4x 3 - g2x - g3 be a Weierstrass equation of E over k. Let also e i denote the three distinct roots of the right-hand side (in an algebraic closure
k
of k) and define k' = k(~) -=1 2 3 · 1
1
"
Over k', we can write the equation of E in the form y with e l
+
and use v
eZ
4(x - ell (x - e Z) (x - e 3 ) o. Choose (and fix) a prime divisor ~' of
e3
+
= or~,
mation x.....-+
we are
2
r in k',
for the corresponding valuation. Using a transfor-
).2x • y
.>.\
t-+
with )..= l/-Jei (for a suitable i).
led to the case where ei = 1
t
v(e j )
~
If the e j were all congruent mod
o
=
L ej
ii
3a mod
1"
0 (j=l,2,3) •
r'
hence
to a E: 1>"k,IIJ' , we would have a = 0 E. ~,/ll'
,
which leads to a contradiction to the fact that 1 = e i = 1 ; 0 mod Thus at least two of the reduced Eoots mod are distinct and the
l'
cuspidal case cannot occur. The definition (and this proposition) shows that the zeta function of an elliptic curve
E
depends on the field k over which
E is considered to be defined.
252
~'
.
- IV.29 -
(3.12) Further results. Two elliptic curves defined over some algebraic number fields are called isogenous when their corresponding lattices in [ are isogenous (][.3.8) or equivalently, when there exists a non-zero homomorphism from one to the other (this homomorphism will then necessarily be surjective with finite kernel, and be defined over an algebraic closure of the number fields in question). Then it can be proved that two isogenous elliptic curves have the same (Hasse-Weil) zeta functions.* Conversely, it is conjectured that two elliptic curves (defined over number fields) with the same zeta function are isogenous. This converse has been proved for curves with non-integral invariants j by J.-P. Serre.
*) Over a common number field of definition.
253
(look at Th.a[.3.l0))
REF ERE NeE S FOR CHAPTER I Let us start with some representative classics: BRIOT C., BOUQUET C. : Theorie des fonctions elliptiques, 2e ed. Paris, Gauthiers-Villars 1875, 2 vol. FRICKE R. : Die elliptischen Funktionen und ihre Anwendungen, Leipzig-Berlin, Teubner 1922. APPEL P., LACOUR E. : Principes de la theorie des fonctions elliptiques et applications, 2e ed. Paris, Gauthier -Villars 1922. This last book should especially be consulted for applications such as spherical pendulum, plane elastic curve, heat theory, ... Among more recent books on function theory containing a chapter on elliptic functions, we quote only WHITTAKER E.T., WATSON G.N. : (A course of) Modern analysis, Cambridge at the University Press, 4th ed. reprinted 1965. But a reader interested in the most simple properties of elliptic functions only should start with d AHLFORS L.V. : Complex analysis, 2 ed. New-York, McGraw-Hill, 1966. For the theory of theta functions, we have adopted notations close to those of Weil in WElL A,. : Introduction a l'etude des varietes kahleriennes, Act. Sc. et Industrielles 1267, Paris, Hermann 1958. Let us also mention WElL A. : Theoremes fondamentaux de la theorie des fonctions theta, Seminaire Bourbaki Mai 1949.
254
- V. 2 -
SIEGEL C.L. : Vorlesungen Uber gewahlte Kapitel der Funktionentheorie (Notes by Gottschling E., Klingen H.), Mathematische Institut der Universitat, Gottingen, 1964. For automorphic functions and forms, FORD L.R.
Automorphic functions, 2d ed. New-York, Chelsea Publ. 1951.
LEHNER J.
Discontinuous groups and automorphic functions,
(Math. Survey 8) Amer. Math. Soc. Providence, 1964. Shorter introductions are provided by GUNNING R.C. : Lectures on modular forms (Notes by Brumer A.), Princeton University Press, Princeton N.J., 1962. SERRE J.-P. : Cours d'arithrnetique (Collection "Le Mathematicien"), Presses Universitaires de France, Paris 1970. The formula
SL2(~)/(±1) = ~/(2)
*
~/(3)
and interesting generaliza-
tions can be found in SERRE J. -P. : Arbres, arnalgames et SL 2 ' (notes redigees avec la collaboration de Bass H.), to appear in the Springer-Verlag lecture notes in mathematics series, Berlin. I have selected Siegel's method for the derivation of the infinite product of SIEGEL C.L.
~
(Jacobi's theorem) : A simple proof of
~(-l/L) = ~(T) V~/i
Mathematika 1,1954, p.4 (Complete works, vol 2, p.188). SIEGEL C.L. : Analytische Zahlentheorie (Notes by KUrten K.F., Kohler G.), Mathematische Institut der Universitat, Gottingen, 1964. For more formulas connecting elliptic functions to theta functions ERDELYI-MAGNUS-OBERHETTINGER-TRICOMI : Higher Transcendental Functions, (Bateman Manuscript Project), New-York, McGraw-Hill, 1953,vol.2.
255
- V.3 -
FOR CHAPTER
II
Prerequisites on noetherian rings, integers, extension of valuations (and the "Nullstellensatz") are all contained in LANG S. : Algebra, Addison-Wesley Publ.Co., Reading Mass. Third printing 1971 (World Student Series ed.). For a good introduction to the theory of algebraic curves (Bezout and Riemann-Roch's theorem) on a relatively elementary level, a student could start with FULTON
w. :
Algebraic curves, Benjamin Inc., New-York, 1969.
One can also look at CHEVALLEY C. : Introduction to the theory of algebraic functions of one variable (Survey 6) Amer. Math. Soc. Providence, 1951. EICHLER M. : Einfilhrung in die Theorie der algebraischen Zahlen und !unktionen, Basel u. Stuttgart, Birkhauser, 1963. These books contain also an introduction to the algebraic theory of elliptic functions. For the classification of elliptic differential forms and integrals, one can consult COURANT R. HUREWITZ A. : Funktionentheorie, (Die Grundlehren ... Bd.3) Berlin, Springer-Verlag, 1922. For analytic p-adic function theory, the basic material is contained in DWORK B. : On the zeta function of a hypersurface, I.H.E.S. Publ. Math. No 12 , Bures-sur-Yvette, 1962. GUNTZER A. : Zur Funktionentheorie einer
ver~nderlichen
fiber einem
vollstandigen nichtarchimedischen Grundkorper, Archiv d. Math., 17, 1966, pp.4l5-43l. Note however that in this last article, the groundfield is always supposed to be algebraically closed and complete, so that a little
256
- V.4 -
bit more work has to be done to get Schnire1mann's theorem in its strongest form CIT.4.16), or its corollary CIT.4.18). Schnire1mann's original reference is SCHNIRELMANN L. : Sur 1es fonctions dans 1es corps normes et a1gebriquement fermes, Bull. Acad. Sci. USSR. Math, vo1.s, pp.487-497. Tate's p-adic elliptic curves are treated in ROQUETTE P. : Analytic theory of elliptic functions over local fields, Gottingen, Vandenhoeck and Ruprecht, 1970. A sketch of this theory is also given in SERRE J.-P. : Abelian !-adic representations and elliptic curves, New-York, Benjamin Inc., 1968.
257
- V.5 -
FOR CHAPTER
III
Division points on abelian varieties are treated in LANG S. : Abelian varieties, Interscience Pub1. Inc., New-York, 1959. MUMFORD D. : Abelian varieties, Oxford University Press (for the Tata Institute of fundamental Research, Bombay), Oxford, 1970. A more elementary point of view is adopted by SHIMURA G. : Arithmetic theory of automorphic functions (Introduction to the ... ), Iwanami Shoten, Pub1., and Princeton University Press, 1971, who gives the classical proof for the integrality of singular invariants. We have also followed this book for the presentation of the second section. More on t-adic representations will be found in SERRE J.-P. : Abelian t-adic representations and elliptic curves, New-York, Benjamin Inc., 1968, already referred to for chapter II. An introduction to the basic ideas of eta1e cohomology (for pedestrians ... ) is given in the talk MUMFORD D. : Arithmetical algebraic geometry, Proceedings of a Conference held at Purdue University, ed. by O.F.G. Schilling, Harper & Row, Pub1., New-York, 1965, pp. 33-81. For the finiteness theorem for the rational torsion, we have given Wei1's proof as indicated by J.W.S. Cassels in the excellent survey article CASSELS J.W.S. : Diophantine equations with special reference to elliptic curves, Journal London Math. Soc.,41, 1966, pp.193-291. For the original proof, see LUTZ E. : Les solutions de l'equation y2
258
x 3 -Ax -B dans 1es corps
- V.6 -
p-adiques, C.R.Acad.Sc. Paris, 203,1936,pp.20-22. WElL A. : Sur 1es fonctions elliptiques p-adiques, loc.cit. pp.22-24. Let us also quote SERRE J.-P. : p-torsion des courbes elliptiques, Sem. Bourbaki, vol. 1969-70, expose 380, pp. 281-294. Plenty of references on elliptic curves are given in the survey article by Cassels, and in the book on t-adic representations by Serre. For more recent references, one can look at the bibliography of the book by Shimura and that of SERRE J.-P.
Proprietes ga10isiennes des points d'ordre fini des
courbes elliptiques, Inv. Math., 15,1972, pp. 259-331.
259
- V.7 -
FOR CHAPTER IV The Cartier operator is defined and studied in Eichler's book (reference given for Chapter II). A standard reference for the Hasse invariant of elliptic curves is DEURING M. : Die Typen der Mu1tip1ikatorenringe el1iptischer Funktionenkorper, Abh. Math. Sem. Univ. Hamburg, 14,1941, pp.197-272. We have chosen Igusa's method to prove that the roots of Hp(A)
=
0
are simple and to determine the number of supersingu1ar invariants. IGUSA J.I. : Class number of a definite quaternion with prime discriminant, Proc. Nat. Acad. Sc. USA, 44,1958, pp.312-314 . Another way of defining the zeta function of a curve over a finite field, using an ade1ic integral, is given in WElL A. : Basic Number Theory, (Die Grund1ehren ... Bd.144) , Berlin, Springer-Verlag, 1967 . Observe that he gives the functional equation with the :xp1icit rational form of this function (Th.4, p.130), but omits to prove that the coefficients of the polynomial in the numerator are integers. For a proof that there exists only finitely many integral solutions on the curve y2 = x 3 + k, look at MORDELL L.J. : Diophantine equations, London-New-York, Academic Press, 1969. Since I have omitted to define the absolute invariant j of an elliptic curve in characteristic 2, the interested reader will have to go back to the appendix of Roquette's book on p-adic elliptic functions (reference given for Chapter II), where he will find reduced forms for all characteristics. The original article is
260
- V.8 -
DEURING M. : Invarianten und Normalformen e11iptischer Funktionenkorper, Hath. Zeitschrift, 47,1941, pp. 47-56. For finer reduction properties of elliptic curves, we refer to the third chapter of NERON A. : Modeles minimaux des varietes abe1iennes sur 1es corps locaux et globaux, I.H.E.S. Pub1. Math. 21, Bures-sur-Yvette, 1964, . and for applications of minimal models to SERRE J.-P., TATE J. : Good reduction of abelian varieties, Ann. of Math., 88,1968, pp. 492-517. The Riemann hypothesis for the zeta function of elliptic curves over finite fields is proved in the book by S. Lang on abelian varieties. One can also come back to WElL A. : Sur 1es courbes a1gebriques et 1es varietes qui s'en deduisent, Act. Se. et Industrie1les 1041, Paris, Hermann 1948.
261
- V.9 -
ND E X
*)
A Abel's conditions -
theorem
Differential form 11.52 11.55 abelian (or 1st kind)
1.4 11.42 11.89
11.60
exact - 11.52 second kind 11.58
Abelian variety 1.20 Absolute invariant 1.69 11.46 11.94
third kind 11.58 Division points 11.4 Divisor 1.16 ample - (very ample -) - of a differential form - of an elliptic function
Addition on cubics 1.11 1.25 Affine variety 11.5 irreducible 11.5 Algebraic curve 11.12 Appel-Goursat theorem 1.23 Automorphic form 1.39 1.47 1.52 1.58 1.62
IV.16 rational - over K
pr~e
c
11.87
IV.16
- of a rational function 11.20 11.32 - of a theta function 1.21 11.88
Cartier operator IV.6 ~ech cohomology 1.29 Chern character 1.31
Double point
Complex torus 1.1 Conductor 111.26 Critical index ( - radius) Cubic curve I I . 24 Cusp 11.27
11.26
E
Cohomological interpretation of theta functions 1.28 Congruence subgroup 111.17
Elliptic curve (see complex torus) -
over:IFq
IV.18
p-adic - 11.86 in general: a curve with elliptic function field + point over k 11.47 Elliptic differential equation 11.66 - integral 11.62 11.64 (function) field 11.45 - function 1.2 11.87 Endomorphism (of a lattice) 111.25 Entire function (p-adic) 11.78
11.77
D Degree (of a divisor) 1.4 (of a homomorphism) 111.46 Derivation 11.50 canonical 11.53 Deuring's polynomial
11.37 11.55 1.2
IV.7
*) References are to pages:
11.27 refers to page 27 of chapter II . 262
- V.lO -
L
F Flex
Legendre's relation
11.21
Linear system
Fonn 11.6 Fundamental region Function field
11.32 11.47 IV.2
M
Genus 11.32 Group law on cubics
1.11 11.26
MOdular equation 111.31 - form 1.39 1.43 - function 1.47 1.58 1.62
H IV.9
IV.12
Hasse-Weil zeta function
- invariant 1.47 111.25 Mbrdell-Weil theorem 111.8
IV.26
1.16
N
Hessian (of a form) 11.22 Hilbert's Nulstellensatz 11.3 Homomorphism of elliptic curves Hyperelliptic curve
1.18
Inflexion (tangent)
11.21
Norm (of a divisor) 1.35
J 1.51
o Order of contact 11.21 - of differential forms (at a point) 11.54 - of elliptic functions (ibid.) - at infinity 1.40 - in the group of divisors
Lattice 1.2 111.25 endomorphism of mesh of a 1.19 singular -
111.25
Laurent series 11.20 Legendre's A-function -
1.57 equation of a cubic 1.10 11.25 11.34
1.26
p p-adic numbers
L
IV.16
Node (see double point)
Isogeny 111.28 IV.29 Isomorphic elliptic curves 1.35 1.50 - lattices 1.35 111.27
Jacobi variable
1.26
function) 1.27 Liouville's theorem (p-adic) 11.78 Local ring 11.11 IV.6 Logarithmic differential
G
Hermite formula
1.13 1.22
Line bundle (associated to a theta
1.36 1.62
elliptic 11.45 separably generated -
Hasse invariant
(contd .)
11.70
p-constant IV.3 Picard big theorem (p-adic) -
group i.15 little theorem
1.49
Plane curve 11.12 Poincare-Koebe theorem 1.10 p-variable IV.3 Projective variety 11.6 11.8 irreducible -
263
11.83
1.4
- V.ll -
Q q-expansions principle
111.32
Taylor series Theta function holamorphic p-adic reduced trivial -
R Ramanujan coefficients 1.73 Rational function 11.8 Reduction mod p IV.22 good IV.22 potential good IV.24 Regular curve 11.13 - point 11.13 Relative invariant 11.96 Residue 11.57 Riemann fonn on [n 1.20 Riemann-Roch theorem 1.26 11.43 11.90 Rouche theorem (p-adic) 11.75
s Safarevie' theorem IV.2S Schnire~an's theorem 11.82 Separating element IV.2 Simple point 11.13 Singular curve 11.13 IV.15 lattice 111.25 - point 11.13 Specialization 111.16 generic 111.16 Strict homogeneous ideal 11.8 Supersingular curve IV.8 IV.lS
11.20 1.20 - 1.23 11.88 1.21 1.21
u Ultrametric space 11.73 Uniformization 1.10 1.18 local - theorem 111.8
v Valence (of an elliptic function) 1.4 Variety 1.20 11.15 11.6 Valuation (centered at a point) II.IS - ring 11.19
w Weber functions 111.17 Weierstrass canonical expansion (p-adic) 11.85 11.87 - equation of a cubic 11.25 11.34 ~ -function
I .5
z Zeta function IV.16 - of an elliptic curve Hasse-Weil IV.26
T Tangent (line) 11.21 algebraic - space 111.48 Tate's character 111.2 - elliptic curve 11.86 - module (extended module) 111.6 - theorem. 11.97
264
IV.18
ERRATA Pages nwnbers refer to bottom pages numbers. p.73
p.74 p.102
p.183 p.23l p.234
line 2 from bottom. It is not true that ~(n) is divisible by 691 for nearly all integers. For example, when p is prime, -c(p) :: 1 + pil -mod 691 and the set of primes p such that t(p) is divisible by 691 has density 1/690 according to Dirichlet's theorem. But ~(n) = Oil(n) mod 691 and Oil(n) is divisible by 691 for "nearly all n" in the sense of the naive density on the set of integers ••• cf. J.-P. Serre, C.R.Acad.Sc.Paris 279 (1974), pp.679-682. Since Weil's conjectures have been proved by P. Deligne, Ramanujan's conjecture is true. In the case of a double point, the isomorphism of the sMOoth part of the singular cubic with the multiplicative group is defined over an extension of degree ,2 of the base field : it is enough to take the extension generated by the two slopes at the singular point (these are quadratic numbers). Der.tjanenko's claim of proof has however remained unverified so that the strong conjecture is still considered as open. line 4. The proof of part b is incorrect. A correct argument is to be found in S~minaire Chevalley 1958/59: Vari6t~s de Picard p.6.06-6.08 • line 6. the nlUllbers (-kl ) are -not in Iflp when k" I (p - 1) and the hypergeometric series F(I,I:l:A) l1as to be truncated. The reduction mod p of the series of F(I,I:l:~)/F(l,l:l:~P) is a finite series congruent to Hasse's polynomial (up to sign). This still makes it clear that the Hasse polynomial is a solution of Ule hypergeometric differential equation. cf. B. Dwork, Publ.Math. I.H.E.S 31 (1969) pp.47-49.
A.R.
P.S. Correct spelling:
Bernoulli (pp.66-67) Hurwitz (p.256)
Feb. t 85