This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
To summarize, the right half-plane of guaranteed convergence is 1/2 to the right of the right half-plane of convergence for the Riemann zeta-function, because we have a term of absolute value fj, in the Euler product which was absent in the case of ((s). We now discuss the additive form of 1,(E., s) in more detail. Using Proposition 2, we can rewrite (5.2) in terms of the map 2„ defined in (3.3)— (3.4):
-3- .
L(E„, s) =
n (i
p4.2n
zi(P)) 1 ,
(Nns
(5.5)
where we have used 2.(P) to denote its value at any generator of the ideal P. Notice that, since 2„ is a multiplicative map taking the value 1 at all four units +1, + i, we may regard it equally well as a map on elements x of Z[i] or on ideals I. We can now expand the product (5.5) in the same way one does for the Riemann zeta-function and for Dirichlet L-series (see Problem 1(a) in the last section). We use the two facts: (1) every ideal /has a unique factorization
81
§5. The Hasse—Weil L-function and its functional equation
as a product of prime power ideals; and (2) both Li and N are multiplicative: 2(11 '2) = in(4)2„(4), N (11 /2) = N /1 • N/2 . Then, by multiplying out the geometric series, we obtain:
L(E„, s) =
E2„(/)(N/Ys„ 1
(5.6)
where the sum is over all nonzero ideals of Z[i]. A series of the form (5.6) is called a "Hecke L-series", and the map 2 is an example of a "Hecke character". In a Hecke L-series, the sum on the right of (5.6) is taken over all nonzero ideals in some number ring. A multiplicative map x on the ideals in that ring is said to be a Hecke character if the following condition holds. There is some fixed ideal f and a fixed set of integer's na , one for each imbedding a of the number field into Oalg cl , such that if / is a principal ideal generated by an element x which is congruent to 1 modulo the ideal f, then x(/) = 11,a(x)no. In our example, the number ring is Z[i] ; there are two imbeddings al = identity, a2 = complex conju1, na2 = 0; and we take f = (n) gation in Gal(OUP11); we take n (n' = (2 + 2i)n if n is odd, 2n if n is even). Then the condition simply states that 2.((x)) = x if x :4- 1 (mod n'). It is very useful when the Hasse-Weil L-series of an elliptic curve turns out to be a Hecke L-series. In that case one can work with it much as with Dirichlet L-series, for example, proving analytic continuation and a functional equation. It can be shown that the Hasse-Weil L-series of an elliptic curve with complex multiplication (see Problem 8 of §I.8) is always a Hecke
L-series. The relation between the additive form (5.6) and the additive form (5.3) is quite simple. We obtain (5.3) by collecting all terms corresponding to ideals I with the same norm, i.e.,
E
b,„,„ =
2„(1).
I with Ni=m
Notice that, since 2„(/) = 21 (I) . (4) by (3.3), we have
n bitt ,=-- (—) if
M i with
(:)bn„ rktl=m
where we have denoted b,„ = b„,, 1 . Thus, if for fixed n we let x. denote the multiplicative map on Z given by mF-- (A) (for m prime to 2n), we have CO
L(E„, s) =
E
m=1
x„(m)b„,m -s
2 = 1 — 2 05 -s — 30 9 -s + 6 (-L1) 13' + 2 (2-2) 17' + • • • 5 3 13 17 (5.7) (note: (1)2 is 1 if 34'n and 0 if 31n); one says that Li(E, s) is a "twisting" .ofL(Ei , s) = E b,„Tn by the character x„. One can, verify that for n squarefrçei the conductor of x „ is n when n-:-.=- 1 (mod 4) and is 4n when n a-.- .- 2 or 3n
82
II. The Hasse—Weil L-Function of an Elliptic Curve
mod 4 (this follows from quadratic reciprocity). In other words, x„ is a primitive Dirichlet character modulo n or 4n. To keep the notation clear in our minds, let us review the meaning of xn, xn' , and Z,. First, xn is a map from Z to { + 1, 0} which is defined by the Legendre symbol on integers prime to 2n. Second, xn' is a map from Z[i] to { + 1, + i, 0} which takes elements x prime to 2n to the unique power of i such that xn (x)x -a- x(ix) modulo 2 + 2i (see (3.3)—(3.4)). Thirdly, )-( n is a map from Z[i] to Z[i] which takes an element x to xxn'(x); also, in can be regarded as a map from ideals of 1 [i] to elements of Z[i] which takes an ideal I prime to 2n to the unique generator of / which is congruent to x,i(rN I) modulo 2 ± 2 1 . The character x„ is intimately connected with the quadratic field Namely, if m = p 0 2 is a prime number, then the value of xn (p) = (11,) shows whether p splits into a product of two prime ideals (p) = P1 P2 in e(iii) (this happens if (11,1) = 1), remains prime (if (l,)-±.- — 1), or ramifies (p):-.---- P2 (if CI» = 0, i.e., pin). (See [Borevich and Shafarevich 1966].) We say that x,, is the quadratic character associated to the field Q It is not surprising that the character corresponding to the field 0(fri) appears in the formula (5.7) which links L(E,„ s) with L(E i , s). In fact, if we allow ourselves to make a linear change of variables with coefficients in 0(j7), then we can transform En : y2 = x' — n2 x to El : y' 2 = x'3 — x' by setting y = n Jo,' , x = nx' . One says that En and E1 are isomorphic "over the field G (fii)•" Returning now to the expression (5.6) for 1,(E„; s), we see that it can also be written as a sum over elements of Z [i] rather than ideals. We simply note that every nonzero ideal has four generators, and so appears four times if we list elements instead of ideals. Thus,
(J . )
1 A bm,n = — 't
a+bi with a2 +b 2 =m
and
—41 ,,EEz. 2,,(x)(Nbcys I
E (a + bi)(a + bi) 02 2 + b2 y 9 4 a+biEl[i]
(5.8)
where xn' was defined in (3.3)—(3.4). (The sums are over nonzero x, a + bi.) Notice the analogy between the sum (5.8) and Dirichlet L-series. The only differences are that the number ring is Z[i] rather than Z, and our Hecke character in (x) includes an ordinary character x„i(x) (with values in the roots of unity) multiplied by x. We now proceed to show that L(E,„ s) can be analytically continued to the left of Re s = i, in fact, to an entire function on the whole complex plane; and that it satisfies a functional equation relating L(E,„ 4 to L(E„, 2 — s).
83
§5. The Hasse—Weil L-function and its functional equation
Since L(E„, s) is a "to-dimensional" sum over Z[i] ,/-:.-,' z 2 , i.e., over pairs of 'integers rather than integers, it follows that we "shall need to look at Fourier transforms, the Poisson summation formula, and theta-functions in two variables. We shall give the necessary ingredients as a sequence of propositions whose proofs are no harder than the analogous results we proved in the last section for the case of one variable. Since the definitions and properties we need in two dimensions are just as easy to state and prove in n dimensions, we-shall consider functions on For now, n will denote the number of variables (not to be confused with our use of n when writing En : y2 = x 3 — n 2x, y„, etc.). We will use x = (x 1 , . . . , x n) and y .= (r 1 , . . . , yn) to denote vectors in R. As usual, we let x • y = x1 XrrYn , 1 x 1 = \I X . X' We shall also use the dot-product notation when the vectors are in C"; for example, if n = 2 we have x • (1, i) = X i ± x2 i. Let 99 be the vector space of functions f: Ill n —4 C which are bounded, smooth (i.e., all partial derivatives exist and are continuous), and rapidly decreasing (i.e., ixrf(x) approaches zero whenever ixi approaches infinity for any N). For fe Y we define the Fourier tranSform f : ilr --+ C as follows (where dx denotes dx 1 dx 2 - . - dx,):
y1 + ' ' ' +
f(y) --=
f
(5.9)
e''f(x)dx. Œr
This integral converges for all y E ER", and fe Y. Proposition 9. Let f: R" --o C, g: R" —0 C be functions in Y .
(1) (2) (3) (4)
If a e R" and g(x) = f(x + a), then 0(y) = If a E gr and g(x) = e21 ' 0 f(x), then 4(y) = Ay — a). If b e [R, b > 0, and g(x) := f(bx), then 0(y) = b7r(yib). Iff(x) = e, then f = f
Proposition 10 (Poisson Summation Formula). If g e 9, then
y me?"
g(m) =
E
0-(M).
m el"
The proofs of Propositions 9 and 10 are completely similar to those of properties (1)—(3) of the Fourier transform in one variable and Propositions 5 and 6 of the last section. One simply has to proceed one variable at a time. If we C" and Je Y, we let w • Afd=e, w, g + w20a--4 + • • . + wn xn -
1"—f
Proposition 11. Iffe Y and g = w 'di, then
6(y) = 27ri1'. y f(y).
both sides of the equality are linear in w, it suffices to prove the proposition when w is the j-th standard bais vector, i.e., to prove that the Fourier transform of 4f(x) is aiyi Ay). This is easily done by subPROOF. Since
84
IL The Hasse—Weil L-Function of an Elliptic Curve
stituting t---,',--,f(x) in place off(x) in (5.9) and integrating by parts with respect 0 to the j- th jvariable (see Problem 5(a) in the last section). For the rest of this section, we take n = 2 in Propositions 9-11, and we return to our earlier use of the letter n in En , xn , etc. Theorem. The Hasse-Weil L-function L(E n , s) for the elliptic curve En : y 2 = x3 _ n2x, which for Re s> i is defined by (5.1), extends analytically
to an entire function on the whole complex s-plane. In addition, let
n odd; n even.
(5.10)
A (s) c-i=e- f- (-Z-) I" (s)L(En , s).
(5.11)
32n 2 , 1 N = OW = 16n 2 ,
Let
Then L(E„, s) satisfies the functional equation
A(s) = ± A(2 — s),
(5.12)
where the "root number" +1 is equal to 1 if n.--z---. 1, 2, 3 (mod 8) and is equal to —1 if n a 5, 6, 7 (mod 8). PROOF. The proof is closely parallel to the proof of analytic continuation and the functional equation for Dirichlet L-series with odd character, which was outlined in Problem 5 of the last section. Namely, we express L(E„, s), written in the form (5.8), in terms of the Mellin transform of a two-dimensional version of the theta-function On(t) defined in Problem 5(c). We shall use the letter u rather than a to avoid confusion with the use of a in (5.8). Thus, let u := (u1 , u2) e R2 , where u 0 Z 2 , and let t e gl be positive. Let w be the fixed vector (1, 0 e C 2 , so that, for example, m- w — m 1 + m2 i for m E Z 2 . We define:
0) =
E
(m +
u) • w e - non-Ful2 ;
(5.13)
m e /2
P(t) = E in • ,,,, , 2,.•„,_mt2 1 i .
(5.14)
m e Z2
Regarding u and t as fixed, we find a functional equation for On (t) by means of the Poisson summation formula (Proposition 10); to obtain 0„(t) on the left side of Proposition 10, we choose
g(x) = (x + u) - w e -vix+u1 2 .
(5.15)
To find the Fourier transform ofg(x), and hence the right side of the Poisson summation formula, we proceed in several steps, writing f(x) = e'lxi 2 , g 1 (x) = f(fix), g 2 (x) = w hg 1 (x), and finally g(x) = ilg 2 (x + u). We have:
85
§5. The Hasse—Weil L-function and its functional eqUation
f(y) = e -nlY12 by Proposition 9, part- (4); t'e 01012 by Proposition 9, part (3); g 2 (y) = 2nit -1 w • y e - frit"12 by Proposition 11; 4(y) = —i t -2 w - y e 2niu'Ye -( lr/i ) IY 12 by Proposition 9, part (1). If we now evaluate
6(m)
for me V, and sum over all in, we obtain the func-
tional'equation --i „ ( I
0.(t) = --it 0
-t).
(5.16)
We now consider the Mellin transform of Mt): fcc.° ts0.(t)I, and show that the integral converges to an entire function of s. First, .for large t it is easy to bound the integrand by something of the form e- c t, using the fact that 1m + ur is bounded away from zero, since u is not in Z2 . Next, for t near zero one uses the functional equation (5.16) and a bound for Ou ) of the form e-`1' , where we use the fact that the only term in (5.14) with 1m1 2 = 0 vanishes because of the factor m- w. These bounds make it a routine matter to show that the integral converges for all s, and that the Mellin transform is analytic in s. If we now take Re s >1-, we can evaluate the Mellin transform integral terra by term, obtaining a sum that begins to look like our L-function:
(+
e
f CID * hau f,\ dt _ E on _}..u).w co t se -Irtim±u1 2 dt I S U ki )— —
t
„,, z2
t
J0
=n- sr(s) E On + u) ' w
(see (4.6)).
MEZ2 l in + U l ”
Now for Re s > i, we can rewrite L(E„, s) as a linear combination of these sums with various u. We now suppose that n is odd. The case n = 2n0 even is completely similar, and will be left as an exercise below. We take w = (1, i). If we use (5.8) and recall that z n'(x) depends only on x modulo n' = (2 + 2i)n, and hence, a fortiori, only on x modulo 4n, we obtain : 4 0.5.a,b<4n
xaa+bi) E a + bi + 4nm • w me z2
1(a, b) + 4nmi 2s
'
,
= 1 (4n) 1_2s 4
1:
jf a ± bi) X0:5_a,b<4n
E (m-1-(t9t -,)).w 12s . ,,,t0 (* mE VI M 4-
,
Thus
1 _1 ir-s f(s)L(En , s) = -4- (4n) - -s 1
co
%--,
2,
05a,b<4n (a,b)*(0,0)
,7(a • ;(1
bi) f 0
t s0a14n bl4n(t) -6-1± . ' i
(5.17)
Since the integral inside the finite sum is an entire function of s, as are the
86
II. The Masse—Weil L-Function of an Elliptic Curve
functions (4n) 1-25 and it/F(s), we conclude that 1(E,, s) has an analytic continuation to an entire function of s. Moreover, we can transform this integral using the functional equation (5.16) and replacing t by +: ,
00
(A dt _ = _i f c° 1 "al4n.b14nk' ) r t sf)
0
t s- 2 0,214n,b14n 1 .c_i_t = _ i r r
co
dt . o,(o_
t 2-sal4nb14n
r
0
In the entire function (5.17) we now suppose that Re 2 — s > 4 (i.e., Re s -so that we can evaluate this last integral as an infinite sum. Using (4.6) again, inserting the definition (5.14), and interchanging summation and integration, we obtain
r
J
= 70-2 r(2-s)
E m- we ( 2ni/4n)m.(a,b)imi -2(2-s) mel2
Thus, for Re 2 — s> 4 , the right side of (5.17) is equal to
_ iony -2sn s- 2r(2 _ s\ ___1 v m • w sm 1 4 mez21m12(2-s) Ls where for ni E Z 2
E Li/ (a + bi)e ( 2 nii4 n)m sm d-t-f 0
- (a,b)
.
Lemma. If m 1 + m2 i is not in the ideal (1 + i), then Sm = 0; whereas if m 1 + n1 2 1 = (1 + i)x with xEZ[i], then Sm = 24(x)g(), where g(x ) is the Gauss sum defined in Proposition 4 of§II.3 (see (3.9)).
Before proving the lemma, we show how the functional equation in the theorem follows immediately from it. Namely, if we make the substitution m • w:---- m 1 + m2 i = (1 + i)x in the sum in (5.18), the lemma gives us
E
E
ni • w
mez2 I m 12(2-s) Sm
1 + Ox 2(
xez[i] 1(1 ± Oxi 2(2-0
= (1 + 02s' (
Xax)g(Xfn)
—2 )(2 + 2i)n n xegi]
by Proposition 4. But this last sum is 4L(En , 2 — s), by (5.8). Bringing this all together, we conclude that for Re 2 — s> 4 the right side of (5.17) is equal to _ 44/7)1 - 2sn s-2
1- ( 2
— s)(1 + Or' — 2 (2 + 20nL(E„, 2 — s) n (5.20)
-___1(2 ( -172) ns -21— s)(80 1-si,(E„, 2 — s). On the other hand, if we bring the term (1/2)s over to the right in the functional equation (5.11)-(5.12) in the theorem, we find that what we want
s 87
§5. The Hasse—Weil L-function and its functional equation
to prove is: n - sr(s)L(E„, s) = (
—2 ) (N/2)s(27) 5-2 ( \/N) 2-sr(2 n
(--n2)
(N/4 )' - sns -2 1-(2
s)L(En , 2 — s)
s)L(En , 2 — s).
And this is precisely (5.20). Thus, to finish the proof of the theorem for odd n, it remains to prove the lemma. PROOF OF LEMMA. First suppose that in + m2 i is not divisible by I + i. This is equivalent to saying that m i and m 2 have opposite parity, i.e., their sum is odd. Now as a, b range from 0 to 4n, the Gaussian integer a + bi runs through each residue class modulo (2 + 2i)n exactly twice. Each time gives the same value of x,i'(a + bi), since Li' (a + bi) depends only on what a + bi is modulo n' = (2 + 2i)n. But meanwhile, the exponential terms in the two summands have opposite sign, causing the two summands to cancel. To see this, we observe that if a l + b 1 i and a2 + b2 i are the two Gaussian integers in different residue classes modulo 4n but the same residue class modulo (2 + 2i)n, thei, a l + b 1 (a2 + b2 (2 + 2i)n (mod 4n), and so (2nil4n)m.ya1,b1)-(a2,b2)) = e(27cil4rOm.(2n,2n) = e rri(nt 1 +m 2 ) = _1 . e This proves the first part of the lemma. Now suppose that m 1 + m 2 i = (1 + i)x. Note that in (a, b) = m l a + m2 b = Re((m, tn 2 i)(a + bi)) = Re((1 i).5(a + hi)). Hence, the exponential term in the summand in Sm is tli(ic(a + bi)), where •
i/f(x) def --=-- e211'" (with n' = (2 + 2i)n). Since x„' is a primitive character modulo (2 + 2i)n (see Proposition 3), we can apply Problem 9(a)—(b) of §11.2. Since the summation in (5.19) goes through each residue class modulo (2 + 2i)n twice, we have
S„, 2
E
x (a + bi)111(.t(a + hi))
a+biEgil/( 2 +2 0,8
2 k/r1(:)g(Z;1) = 2 Xax)9(4). This proves the lemma, and hence the theorem (except for some slight modifications in the case of even n, which will be left as an exercise).
In the problems we shall outline a proof of the analogous theorem in the case of an elliptic curve, namely y2 = x 3 + 16, which has complex -multiplication by another quadratic imaginary integer ring, namely Z[w], where co + -PO. There is one additional feature which is needed because we end up summing not over [i], which can, be thought of as 1 2 , but rather over a lattice which is the image of 1 2 under a certain 2 x 2-matrix.
II. The Hasse-Weil L-Function of an Elliptic Curve
88
So we have to apply Poisson summation to a function much like the function in this section, but involving this matrix. We conclude this section by mentioning two references for a more general treatment of the theory of which we have only treated a few special cases. First, in C. L. Siegel's Tata notes [Siegel 1961] (see especially pp. 60-72) one finds L-functions whose summand has the form 1 e
27r im- u
P(m + y) (Q[ni + Y]) s+g12 '
where nier, u, ye Er, Q is the matrix of a positive definite quadratic form, and P is a "spherical polynomial with respect to Q of degree g". The case we needed for L(Eii, s) was: n = 2, Q = ( ?), P(x 1 , x 2) = x l + ix 2 , g = 1. In Problem 8 below we have the case Q = ( 1,2 if), P(x 11 x2) =4 (a) + 1 /2)x1 ± (w12 + 1) x2 (where w = —1/2 ± i012), g = 1. In [Lang 1970, Chapters XIII and XIV], two approaches are given to this topic. In Ch. XIII, the approach we have used (originally due to Hecke) is applied to obtain the functional equation for the Dedekind zeta-function of an arbitrary number field. This is a generalization. of Problems 2 and 6 below. However, the case of more general Hecke L-series is not included in that chapter. A quite different approach due to J. Tate—using Fourier analysis on p-adic fields—is given in Ch. XIV of Lang's book. PROBLEMS 1. Finish the proof of the theorem for n even. 2. (a) Find a functional equation for 0(1) = E„„ z2e - orn 12, t > O. (b) The Dedekind zeta-function of a nucigiber field K is defined as follows: CK(S) SZ E(Ni)_s,
where the sum is over all nonzero ideals I of the ring of integers of K. This sum converges for Re s> I (see [Borevich and Shafarevich 1966, Ch. 5, §1]). Let K = 04 Prove that CI< (s) is an entire function except for a simple pole at s = 1 with residue n/4, and find a functional equation relating CK (s) to CK (1 — s). 3. For u and y in R 2, let
0:(t) . y def
e2itim-ve-ntim+u12
I-a
1
t > 0.
mEZ 2
Find a functional equation relating 0:(t) to 0!..,(f). 4. (a) In the situation of Proposition 11, express the Fourier transform of (w in terms off(y) for any nonnegative integer k. (b) Suppose that k is a nonnegative integer, ue R 2 is fixed with u it V,, t > 0 is fixed, and w = (1, 0 E C 2. What is the Fourier transform of g(x) = ((x + u) • oke -ntix+u1 2 ? (c) With k, u, t, w as in part (b), define:
-
89
§5. The Hasse-Weil L-function-and its functional equation
(On
Ou, k(t) =
+
. ok e -ntIm+ui
2 5
m e 22
o u,k (o=
E (m . oke2nim•ue -ntimi2
•
m € 22
r Find a functional equation relating 0 k (t) to 0"(+). (d) Suppose that / is a fixed ideal of Z[i], and x: (Z[i]//)*
C* is a nontrivial
character. Outline (without computing the details) how one would prove that the function Exe zro xkx(x)(Nx)_s extends to an entire function of s (if x were trivial, there would be a simple pole at s = 1), and satisfies a functional equation relating its value at s to its value at k + 1 s. (e) 'Explain why any Hecke L-series for Z[i] is essentially of the form in part (d).
5. Let f
C, fe
(a) For MeGL.(R), let Mt denote the transpose matrix, and let M* = (M - V. Let g(x) = f(Mx). Express 4(y) in terms of f(y). (b) Let L be a lattice in fi??; equivalently, L is of the form L = MZ", where M E GL(), i.e., L is obtained by applying some matrix M to the elements in the standard basis lattice Z. Let L' be the "dual lattice" defined by: L' = {ye Rnix • ye Z for all xe L}. First show that L' is a lattice. Then prove a functional equation relating ExeL f(x) to EyE d(y).
f.
6. Let w= - 1+ (a) Let M ((., 3 2 ). Show that the lattice L = Zkoj C = R2 is L = MZ 2 . What are,M* and L'? Show that L', considered as a lattice in C, is twice the "different" of Z[co], which is defined as {xee(co)ITr(xy) E Z for ally E Z [C0] 1. Here Tr x =
/x+.t- =2Rex. (1)) Prove that E xelfroj
e'"2
t- e -(40t)ixt2 2r_ 2s IN/ 3 xEzz(0,1
(c) Let K = Q(co). Prove that CK(s) extends meromorphically onto the complex sPlane with its only pole a simple pole at s = 1. Find the residue at the pole, and derive a functional equation for C K(S). (d) With K as in part (c), prove the identity: CK(s) = C(s)L(, s), where x is the nontrivial character of (Z0Z)*. (e) Use part (d), the theorem in §11.4, and Problem 5(e) of §11.4 to give another proof of the functional equation in part (c).
7. Let E be the elliptic curve y2 = x3 + 16. Let w = 4- 10. (a) Show that the reduction mod p of y2 = x3 + 16 is an elliptic curve over Fp if and only if p # 2, 3. For such p, recall that the Euler fador at p is (1 — 2a p p + p 2s)— where (1 — 2ap T + pr) is the numerator of Z(EIEF;; T). Show that for p # 2, 3 this factor is cxdpeg P N p- -1 )
where the product -is over all (one or two) prime ideals P of Z[co] dividing (p) and ot`iteg P is the unique generator of P such that acileg P I (mod 3). (I)) When p = 3, define the Euler factor at 3 to be simply 1, as we did for En : y2 = x3 — 112 .x when pl2n. When p = 2, at first glance it looks like we again have
90
IL The Hasse—Weil L-Function of an Elliptic Curve
y2 =--- x3 , and so we might be tempted to take 1 for the Euler factor at 2 as well. However, this is wrong. When one defines L(E, s), if there exists a 0-linear change of variables that takes our equation E: y2 = x3 + 16 in FD4 to a curve C whose reduction mod p is smooth, then we are obliged to say that E has good reduction at p and to form the corresponding Euler factor from the zeta-function of C over Fir In Problem 22 of §II.2, we saw that y2 = x3 + 16 is equivalent over Q to y 2 + y = x 3, a smooth curve over F2 whose zeta-function we computed. Show that the Euler factor at p = 2 is given by the same formula as in part (a). (c) For x E 7L[(.1)] prime to 3, let x(x) = — ce)i be the unique sixth root of I such that x(x) 1 (mod 3). Let x(x) = 0 for x in the ideal (,./ — 3). Show that L(Es)_ !
xEli
i (Nx)
(d) Let 11/(X) =
e(27c43)Tr(x/W3)
def
where Tr x = x + ï = 2 Re x. Verify that tif(x) is an additive character of Z[w]13 satisfying the condition in Problem 9 of §II.2 (i.e., that it is nontrivial on = IxEztco)/3 X(X)11/(X), where any larger ideal than (3)). Find the value of g(x, x is as in part (b).
8. (a) Let w = (1, co), u e — Z 2 , and t > 0 be fixed. What is the Fourier transform of g(x) = (x + u) • w e -ntl(x+to.wI 2 ? (b) Let 6,i(t) = E2 On), where g(x) is as in part (a). Find the Mellin transform 4)(s) of E x(a + bco)0.(i). tr-. a13,b/3) 0
(c) Prove that L(E, s) is an entire function, where E: y2 = x3 I 16 is the elliptic - -
curve in Problem 7. (d) Prove the functional equation A(s) = A(2 — s) with A(s) = ( def
,127
2n
)S
r(s)L(E,
s).
§6. The critical value The value at s = 1 of the Hasse—Weil L-function L(E, s) of an elliptic curve E is called the "critical value". When we have a functional equation relating L(E, s) to L(E, 2 — s), the point s = I is the "center" of the functional equation, in the sense that it is the fixed point of the correspondence s+ -42 s. The importance of this critical value comes from the following famous conjecture. Conjecture (B. J. Birch and H. P. F. Swinnerton-Dyer). L'(E, 1) = 0 if and only i f E has infinitely many rational points. In this conjecture E is any elliptic curve defined over Q. In the general case it has not even been proved that it makes sense to speak of L(E, 1),
91
§6. The critical value
because no one has been able to prove analytic continuation of L(E, s) to the left of the line Re s = 4. How-ever, analytic continuation and'a functional equation have been proved for any elliptic curve with Complex multiplication (see Problem 8 or 0.8), of which our En are special cases, and for a broader class of elliptic curves with a so-called "Weil parametrization" by modular curves. (It has been conjectured by Weil and Taniyama that the latter class actually consists of all elliptic curves defined over OE) We shall call the above conjecture the - weak Birch-Swinnerton-Dyer conjecture", because Birch and Swinnert on-Dyer made a much more detailed conjecture about the behavior of L(E, s) at s = I. Namely, they conjectured that the order of zero is equal to the rank r of the group of rational points on E (see the beginning of §I.9). Moreover they gave a conjectural description of the coefficient of the first nonvanishin.Q term in the Taylor expansion at s = 1 in terms of various subtle arithmetic properties of E. For a more detailed discussion of the conjecture of Birch and Swinnerton-Dyer, see [Birch 1963 ], [Birch and Swinnerton-Dyer 1963, 1965], [Cassels 1966], [Swinnerton-Dyer 1967 ], [Tate 1974 ]. There is a simple heuristic argument— far from a proof—which shows why the weak Birch-Swinnerton-Dyer conjecture might be true. Let us pretend that the Euler product for 4E, s) (see (5.1) for the case E = En) is a convergent infinite product when s = 1 (which it isn't). In that case we would have:
L(E, 1) = J
1
2aE,pp - s + P1-2s
= np p 1 —P 2a
= n Np ' p
p + 1 — 2aE,p is the number of F,-points on the elliptic where Np N1 curve E considered modulo p. Now as p varies, the N p "straddle" p at a distance bounded by 2,/iy. This is because 2aE, p ap ap , and the reciprocal roots ap have absolute value j3 (see (2.7) for E = En , and the discussion of the Weil conjectures in §1 for the general case). Thus, roughly speaking, /Vp p + If Np spent an equal time on both sides of p as p varies, one could expect the infinite product of the pl Np to converge to a nonzero limit. (See Problem I below.) If, on the other hand, the Np had a tendency to be Ilp pf on the large side: Ni, p \rp, then we would obtain L(E, 1) (P + -f/j) = rip 1 /( 1 + P -1/2) = O. To conclude this heuristic argument, we point out that, if there are infinitely many rational points, one would expect that by reducing them modulo p (as in the proof of Proposition 17 in §1.9) we would obtain a large guaranteed contribution to Np for all p, thereby ensuring this lopsided behavior Np PO.Zjd p + Nrp. On the other hand, if there are only finitely many rational points, then their contribution to Np would be negligible for large p, so that Np would have the "random" behavior Np p + Needless to say, this heuristic argument is not of much help in trying to prove the weak Birch--8winnerton- Dyer conjecture. But there is considerable evidence_ both computational and theoretical to support the conjecture of Birch alid Swinnerton-Dyer. The most drama t
92
II. The Hasse—Weil L-Function of an Elliptic Curve
partial result so far was the proof in 1977 by John Coates and Andrew Wiles that for a large class of elliptic curves, an infinite number of rational points implies that L(E, 1) = O. Other major advances have been obtained in [Greenberg 1983] and [Gross and Zagier 1983]. Recall from Problem 8 of §1.8 that an elliptic curve is said to have complex multiplication if its lattice is taken to itself under multiplication by some complex numbers other than integers. Theorem (J. Coates and A. Wiles). Let E be an elliptic curve defined over 0 and having complex multiplication. If E has infinitely many G-points, then
L(E, 1) = O. The proof of this theorem is rather difficult (see [Coates and Wiles 1977]), and it will not be given here. (The original proof further assumed that the quadratic imaginary field of complex multiplication has class number 1, but this turned out not to be necessary.) Since our curves En have complex multiplication, the Coates-Wiles theorem applies, and, in view of Proposition 18 of Chapter I, tells us that 0, then n is not a congruent number. Conversely, if we allow if L(E„, 1) ourselves the weak Birch-Swinnerton-Dyer conjecture, then it follows that L(En , 1) = 0 implies that n is a congruent number. There is one situation in which it is easy to know that L(En , 1) = O. Recall that the "root number"—the plus or minus sign in the functional equation for L(E„, s)—is equal to (--,--,2) for n odd, and (-,...) for n = 2n0 even (see the theorem in §5). Proposition 12. If n -m 5, 6 or 7 (mod 8), and if the weak Birch-SwinnertonDyer conjecture holds for En , then n is a congruent number.
According to the theorem in §5, if n -=.- 5, 6, 7 (mod 8), then A(s) = - A(2 - s), where A(s) is given by (5.11). Substituting s = 1, we conclude that A(1) = -A(1), i.e., A(1) = O. But by (5.11), A(1) differs from L(En , 1) by a nonzero factor (namely, ,IN 12n). Thus, L(E„, 1) = 0, and the weak Birch-Swinnerton-Dyer conjecture then tells us that En has infinitely many Q-points, i.e., by Proposition 18 of Chapter I, n is a congruent number. 0 PROOF.
In certain cases, the claim that all n--=- 5, 6, 7 (mod 8) are congruent numbers has been verified without assuming the weak Birch-Swinnerton-Dyer conjecture. A method due to Heegner (see [Birch 1975]) for constructing points on E„ enàbles one to prove this claim for n equal to a prime or twice a prime. That is, if n is a prime congruent to 5 or 7 modulo 8, or twice a prime congruent to 3 mod 4, then n is known to be a congruent number. It is interesting to note that even in the cases when Heegner's method ensures us that n is a congruent. number, the method does not give us an effective algorithm for constructing a nontrivial rational point on E„, or equivalently, finding a right -triangle with rational sides and area n.
§62 The critical value
93
_
Very recently, Gross and Zagier were able to improve greatly upon Heegner's method. As a special case of their results, they showed that for n1-_-=_ 5, 6, 7 (mod 8) the elliptic curve En has infinitely many rational points provided that L(E„, s) has only a simple zero at s = 1, i.e., I,' (E„, 1) 0 O. This result represents substantial progress in making Proposition 12 unconditional. Moreover, their method is constructive, i.e., it gives you a rational point on the curve (equivalently, a right triangle with area n) when L' (E, 1) 0 O. See [Gross and Zagier 1983 ]. In the cases when the root number is +1, we cannot be sure in advance whether L(E„, 1) is zero or nonzero. So in those cases it is useful to have an efficient algorithm for computing L(E„, 1), at least to enough accuracy to know for certain that it is nonzero. (It is harder to be sure of ourselves in cases when the critical value seems to be zero.) We cannot use the series (5.3) or (5.8) to evaluate L(E„, 1), since they only converge when Re s> 1. So we now turn our attention to finding a rapidly convergent expression for L(E„, 1). Let us return to the functional equation for L.,(E „, s), and give a slightly different, more efficient proof. Recall that
L(E„, s) =
4 xeZ[i]
-
with 2n (x) = x4(x), where. x„' was defined in (3.3)—(3.4). Suppose we ask the question, "What function F(E„, t) has it - sl" (s)L(E, s) as its Mellin transform?" By our usual method using (4.6), we see that the answer is
L----f 74,c.zul 1F(E„,0 E in (x)e -ntix12 .
(6,1)
We now proceed to find a functional equation for F(E„, t), which will then immediately give us once again our functional equation for the Mellin transform 1,(E„, s). The only difference with our earlier derivation is whether we take the character sum before or after applying the functional equation (compare with the two derivations in Problems 3 and 5 of §II.4 and in Problems 4 and 6 for the functional equation for a Dirichlet L-series). Recall that xn' (x) is a primitive character of (EMI n')* , where n' ::.-- (2 + 2i) n for n odd, n' = 2n for n even. Let a + bi run through some set of coset representatives of Z[i] modulo n', and for each a + bi define a corresponding pair (u 1 , u2) of rational numbers by: u1 + u2 i ---i. (a ± bi)In' . Replacing x by a + bi + n'(ni . (1, i)) for m e Z 2 , and setting
(6.2)
NI = 10 2 = -IY4
(see (5.10)), we can rewrite F(E, t) as follows: !IA:-i
E
t. a+bieZ[i]in'
(
- nN 'ti m+u 1 2 . a+ bi) E on + 0•0, Oe , meZ2 ,
,
If we , replace ! by 11i-tri , the summand in the inner sum- becomes On(1) in he ,
H. The Hasse-Weil
94
L-Function of an Elliptic Curve
notation of (5.13). We then use' the functional equation (5.16) for As a result we obtain:
LLA E x;,(a
ni
E in • (1, ocirtimi2 E
4
oe2n1m•ue -ntimi2
bi)(- it 2) z m
a+bi
=
O(-».
zja +
a+bi
Now m • u = Re((m i m 2 i)(u 1 + u2 1)) = Re((m i m 2 i)(a bi)In'). We now use Problem 2 of §11.2 to rewrite the last inner sum as knl (M1
m2i)
xn /
(a
/n1 bi)e'd Re((a+bi) •
a+ bi
But 4(m 1 - rn 2 i) = x(m i + m 2 i), and the sum here is the Gauss sum g(4) evaluated in Proposition 4 (see (3.9)). We finally obtain:
1 F (E= --!it2n'(en') E L ( n, + m2 i)e- ntim 12 4 ma2 where the e is () for n odd, i(i) for n = 2n 0 even. Replacing m 1 + m 2 i by xE Z [i] l we see that the sum is precisely 4F(En , t). Thus, we have
F
-2 ) n
1
N' t 2F(En , t), n odd; (6.3)
(-no1)N't2F(En, t), n = 2n0 even. We can easily derive the functional equation for L(En , s) from (6.3). We for n odd, (7-d) for n = 2n0 even. We have shall write + to denote
co
1 )di , dt 1 ' ,, PRE„, t) — = +— f t .1' .t, — n' t - N' o -- Jo by (6.3). Making the change of variables u = 41 -r„ we obtain : n-s1"(s)1,(E„, s) =
co u 2-sF(En , u)-du il s 1. N = + n -sf(s)L(E n , s) 0 u = +Nil -sirs-2 f (2 - s)L(E„, 2 - s). Finally, replacing N''s by (fg/2) 2-23 and multiplying both sides by (sTI-V12)s, we obtain the functional equation of (5.11)-(5.12).' We now use our function F(E„, t) in the case when the root number is +1, i.e., nE.---_-. 1, 2, 3 (mod 8), in order to find a convenient expression for L(E„, 1). Thus, suppose that
F (En ,
We
1
= N't2F(E„, t).
(6.4)
use this functional equation to break up the meltin transform of RE„, t)
95
§6. The critical value
is the "center" of the functo oo. The point into two integrals from tional equation, i.e., the fixed point of the correspondence t We have : CO
n- sf(s)L(E n , s) =
di F tsF(E„, t)— =
+f
di
tsF(En , t) T .
0
1/,/N'
0
iN/N'
In the second integral we replace t by, and then use (6.4) to write F(En , in terms of F(En , t). The result is: co dt
n- s1"(s)L(E n , s) =
1 N't1
(tsF(En , t) + N' i-st'sF(En ,
11-iw" Now set s = 1. Multiplying both sides by n, we immediately obtain : CO
F(En , t)dt. (6.5) L(En, 1) = 2irf 1/ 1,/vir Recall that the Hasse-Weil L-function can be written as a Dirichlet series 00
L(En , s) = E
where
b,
41 xEEztil 2n(x).
(6.6)
rn=1 fki
x=m
Comparing with the definition (6.1) of F(En , t), we see that 00 R E„, t) = E b,n , n e'rn
(6.7)
m=1
We can now substitute the series (6.7) into (6.5) and integrate term by term. (Notice that the procedure below will work only because we have a positive lower limit of integration in (6.5); if we tried directly to use the Mellin transform, in which the lower limit of integration is 0, we would not have convergence.) Using the formula fa' e'dt = with a = N'112 , c = nin, we immediately obtain the following rapidly convergent infinite series for
L(En , 1). Proposition 13. The critical value of the Hasse-Weil L-function of the elliptic curve En : y 2 = x3 - n2 x for squarefree n 1, 2, 3 (mod 8) is given by:
L(En , 1) = 2
, co
2.,
m=1
n ni
e
f2n 2, 2n,
where
1
n odd; n even.
(6.8)
Here the coefficients b n,,n are the Dirichlet series coefficients obtained by expanding 00
L(En , =
(1
2ain,pp-s
p 1 2S) -
bm , nm'. m=1
In addition, the absolute value of the coefficient b m ,e is bounded by ao (m) \ m, where o-0 (m) denotes the number of divisors of m.
96
IL The Hasse—Weil L-Furiction of an Elliptic Curve
have already proved all except for the bound on b„,.„. If we write the Euler factor in the form (1 — app -s) -1 (1 1 , expand each factor in a geometric series, and collect coefficients ofp -es for each positive integer -dpe. e, we find that the Coefficient of p - es is oe; oepe-1 (3t-p ocie, – 2 -5-cp2 This means that, if m has prime factorization m = pp • • • pr, then PROOF. We
b„,,„ =
+ cr ej—i-c—t Pj
• •
Pj
j=1
Since icxp l = 1;1 = ,Fp for all p, we immediately obtain the bound
I bm,ni
11(ei OP7i12 = .J=1
0.0
(m)Ji,
where we have used the easy fact from elementary number theory that o-0 (m) is the product of the (ei + 1). This completes the proof.
The bound for b„, n is useful in estimating the remainder after we compute the series in (6.8) out ' to the M-th place. In particular, if we find that the remainder is less than the value of that partial sum, we may conclude that L(E„, 1) 0 O. As an example, we treat the case n = 1. The first few Dirichlet series coefficients bni for L(Ei , s) are given in (5.4). By (6.8), we have L(E I , 1) = 2( eoe/r12f1"
3 ç-
5c
2 –1 7/02..rf i3 e ++ 177e + • • •)
= 0.6555143... + R25, where we have denoted Rm = 2 Z„, 14 bze. A very crude estimate for o0 (m) is 2.1rTi (see the problems). Thus,
Rm I
4 E e-nm/2.1-2- = on>M
-1 —
4 g/2„rfe–n14/2‘r2. . e
5.2 x 10 -12 . Actually, the convergence is so fast that we only needed to evaluate the first term to show that L(E 1 , 1) 0 0: SO R25 is bounded by
L(E i , 1) = 0.6586. . . + R5 1 with
1R5 1 0.023.
This computation, together with the Coates-Wiles theorem, then tells us that 1 is not a congruent number. In fact, this argument undoubtedly qualifies as the world's most roundabout proof of that fact, which was proved by Fermat more than three hundred years ago. (See [Weil 1973, p. 270 of Vol. III of Collected Papers]; see also Problem 3 of §I.1.) The next topic we take up is die systematic study of functions such as theta-series which have certain types of functional equations under t and similar changes of variable. Such functions are caILd "modular forms". Actually, modular forms are functions of the form Z bni e'r iz rather than
97
§6. The critical value -
E bm e- ', but the simple substitution t =-- —21z will transform our theta-series from this chapter into what turns out to be modular forms. - In studying modular forms, we will at the same time be approaching elliptic curves from another perspective. But these two aspects of elliptic curves—the congruence zeta-function and Hasse-Weil L.:series, and the theory of modular forms—have combined in recent years to form a richly interlocking picture. PROBLEMS
1. In the heuristic argument for the weak Birch—Swinnerton-Dyer conjecture, make the following ridiculous assumptions: (1) 12a E, p — 11= V-i; and (ii) (2aE, p — 1)/03 = ± I is, "evenly" distributed, and happens to coincide with the value at p of a fixed quadratic Dirichlet character x(p) = (g) for some fixed N. (One of the reasons why these assumptions are ridiculous is that 2aE,,, is an integer.) Show that then L(E, 1) is equal to the value L(x, D of the Dirichlet L-function at the center of symmetry of its functional equation. 2. Prove that if the root number in the functional equation for L(E„, s) is 1, then L(E„, s) has either a nonzero value or else an even-order zero at s = 1; and if the root number is —1, then L(E„, s) has an odd-order zero at s = 1. 3. In the notation of Proposition 13 (here we abbreviate b,„ = ki. „, ap = aEn.p), prove that: (a) bp = 2ap if p41 2n ; bp = 0 if pi 2n ; (b) b„,,„22 = b,,,i b,,,2 if m 1 and m 2 are relatively prime; (c) bpe+1 = 2apbpe — pbpe-i for e _.. 0 (here take b 11p = 0 when e = 0). 4. Prove that 0-0 (m) <
2J for all m, and that m -4 0-0(m) --+ 0 as m --,
co for any positive
IC.
5. Compute L(E2 , 1) and L(E3 , 1) to about three decimal places of accuracy, verifying • that they are nonzero. 6. Prove that L(Ei 0 , 1) # O. , 7. Suppose you knew a lower bound c for the absolute value of all nonzero L(E„, 1), ne= 1, 2, 3, . . . squarefree. (No such c is known.) For n very large, what is the order of magnitude of M such that you could determine from the first M terms in (6.8) whether or not L(E„, 1) = 0? 8. (a) Write a flow chart for a computer program that evaluates L(E,„ 1) through the M-th term of (6.8) and estimates the remainder. (b) If you have a computer handy, use part (a) to find L(Etz , 1) to three decimal places.
CHAPTER III
Modular Forms
Our treatment of the introductory material will be similar to that in Serre's A Course in Arithmetic (Chapter VII), except that we shall bring in the "level" from the very beginning.
§1. SL2 (1) and its congruence subgroups For any commutative ring R, the "general linear group" GL 2 (R) is defined to be the set of matrices g = ( a ,;) such that det g = ad — bc is in R* (the multiplicative group of invertible elements of R). It is easy to see that GL 2 (R) is a group. The "special linear group" SLAR) is defined to be the subgroup of GL 2 (R) consisting of matrices of determinant 1. In this section we shall be concerned with the cases R = ill (the real numbers), R = Z, R = ZINZ (for a positive integer N). Let E defiote C L.) { oo} (i.e., the complex plane with a point at infinity, or equivalently the complex projective line Pt, also known as the "Riemann sphere"). Given an element g =(," i)e SL-2 (R) and a point z e C, we define
az + b . gz d=ef CZ + d'
goo del = ale = lim gz. z-oco %
(Thus, g(—d1c)= co, and if c= 0, then goo = co.) These maps Z 1--+ gz are called "fractional linear transformations" of the Riemann sphere C . It is easy to check that (1.1) defines a group action on the set E. , in other words: g1 (g2 z)=(g1 g2)z for all g1 ,g2 e SL2 (R), z E E. Notice that for g `=--- —I = (1 ._?)e S L 2 (R), (1.1) gives the identity map.
99
§1. SL2 (Z) and its congruence subgroups
But + I are the only matrices which act trivially on C, as we see by supposing that z = (az + b)1(ez + d) for all z, i.e., cz 2 + (d — a)z — b = 0 for all z, which implies that b = c = 0 and a = d; but the only scalar matrices - ( , 2( ) of determinant 1 are ±1. Thus, the quotient group SL2 (R)/ + f, which is sometimes denoted PSL 2 (R), acts "faithfully" on C, in other words, each element other than the identity acts nontrivially. Let H c C denote the upper half-plane H = {z e C I Im z > 0}. It is important to note that any g e SL2 (ER) preserves H, i.e., Im z > 0 implies lm gz> O. This is because
Im gz = Im
(az + 0(c-if + d) = az + b icz + df -2 Im(adz + bcZ). = Im Icz + dI2 . cz + d
But Im(adz + bd) = (ad — bc)Im z = Im z, since det g = 1. Thus,
lm gz = icz + d1 -2 Im z
a b for g= c d e SL2(R).
/( 1.2)
Thus, the group SL2 (R) acts on the set H by the transformations (1.1). The subgroup of SL2 (I1) consisting of matrices with integer entries is, by definition, SL2 (Z). It is called the "full modular group", and is sometimes denoted F. We shall denote f d=ef r/+./.. (Whenever we have a subgroup G . denote G1+1 if G contains — ./; if —I0G, we set of SL 2 (R), we shall let 6 6= G.) This group f = s12 (7L)/-Fi acts faithfully on H. It is one of the basic groups arising in number theory and other branches of mathematics. Besides F = SL 2 (7L), certain of its subgroups have special significance. Let N be a positive integer. We define
f(N)7 {(ac b) d ESL 2 (Z)la -=. d -74. 1 (mod N), b :÷-_. c E-2 0 (mod MI . (
1.3
)
This is a subgroup of r = SL 2 (Z), actually a normal subgroup, because it is the kernel of the group homomorphism from F to SL 2 (7L/NZ) obtained by reducing entries modulo N. In other words, r(N) consists of 2 x 2 integer matrices of determinant 1 which are congruent modulo N to the identity matrix. Note that F(1) = F. We shall later see that F(N) is analogous to the subsemigroup 1 + NZ c: Z consisting of integers congruent to 1 modulo N. 1"(N) is called the "principal congruence subgroup of level N". - Notice that F (2) = F(2)/ + I, whereas for N > '2 we have f(N) = because —1 # 1 (mod N), and so — I is not in r(N). A subgroup of F (or of r) is called a "congruence subgroup of level N" if it -contains F(N) (or F(N), if we are considering matrices modulo + I). Notice that a congruence subgroup of level N also has level N' for any multiple N' of N. This is.because F(N) = r(N/). We say that a subgroup of r (or of f) is a "congruence subgroup" if there exists N such that it is a congruence subgroup of level N. Not all subgroups of F are congruence
III. Modular Forms
100
subgroups, but we shall never have occasion to deal with non-congruence subgroups. For our purposes the most important congruence subgroups of F are:
)er ic 0 (mod N)};
170 (N) Fi (N)
c d
a {(c ci)e Fo (N)ia —= 1 (mod MI.
(1.4) (1.5)
bd )e It is easy to check that these are, in fact, subgroups. Note that if F1. (N), then since ad — be = 1 and Nic, it follows that ad 1 (mod N), and hence d 1 (mod N). We sometimes abbreviate the definitions (1.3)—(1.5) as follows:
T(N) =()) mod NI; 01
ro(N) =
0*
1* ) mod N}; I', (N) = {( 01
) mod N},
where * indicates the absence of any congruence condition modulo N. Whenever a group acts on a set, it divides the set into equivalence classes, where two points are said to be in the same equivalence class if there is an element of the group which takes one to the other. In particular, if G is a subgroup of F, we say that two points z 1 , z2 e H are "G-equivalent" if there exists g e G such that z2 = gz 1 . Let F be a closed region in H. (Usually, F will also be simply connected.) We say that F is a "fundamental domain" for the subgroup G of F if every z eH is G-equivalent to a point in F, but no two distinct points z 1 , z2 in the interior of F are G-equivalent (two boundary points are permitted to be
G-equivalent). The most famous example of a fundamental domain is shown in Fig.
:
F def = {z e —1 :5_ Re z
and Izi 1 } .
(1.6)
Proposition 1. The region F defined in (1.6) is a fundamental domain for F.
PROOF. The group F = SL2 (1) contains two fractional linear transformations which act as building blocks for the entire group:
T
1 1) : G1 0 —1
S
i 0
kf(
z + 1; (1.7)
: z —11z.
§1. SL 2 (Z) and its congruence subgroups
101
1 2
Figure III.!
To prove that every zeH is F-eqiiivalent to a point in F, the idea is to lf it use translations Ti to move a point z inside the strip —1 < Re z lands outside the unit circle, it is in F. Otherwise use S to throw the point to bring it inside the strip, outside the unit circle, then use a translation and continue in this way until you get a point inside the strip and outside the unit circle. We now give a precise proof. Let zeH be fixed. Let r be the subgroup of F generated by S and T (we shall soon see that actually F = + F'). If y = ( db ) e F', then Im yz = Im ziicz + dI 2 by (1.2). Since c and d are integers, the numbers Ia . + di are bounded away from zero. (Geometrically, as c and d vary through all integers, the complex numbers cz + d run through the lattice generated by 1 and z; and there is a disc around 0 which contains no nonzero lattice point.) Thus, there is some y = (c' bd) E r such that Im yz is maximal. Replacing y by Ty for some suitable j, without loss of generality we may suppose that yz is in the strip —I< Re yz < But then, if yz were not in F, i.e., if we had lyzi <1, then, by (1.2), we would have:
1m Syz = Im yzilyzr > Im yz, which contradicts our choice of y e F' so that Im yz is maximal. Thus, there exists y e such that yz E F. We now prove that no two points in the interior of F are F-equivalent. We shall actually prove a more precise result. Suppose that z 1 , z 2 e F are F-equivalent. Here we are not supposing that z 1 and z2 are necessarily distinct or that they are necessarily in the interior of F. Without loss of generality, suppose that Im z2 Im z1 . Let y = er, e r be sticil that z2 = 1. Since z 1 is in yz i . Since Im z2 Im z 1 , by (1.2) we must have icz i + F and d is real (in fact, an integer), it is easy to see fromzFig. III.1 that this 2. This leaves the cases :i) c 7-7-- 0, d = +1; inequality cannot hold if (ii) c = +1, d = 0, and z 1 is on the unit circle; (iii) c d = .+1 and z 1 = + In casel(i), either y or —y ; (iV)C = —d= +1 and z, +
102
in. Modular Forms
is a translation V; but such a y can take a point in F to another point in F only if it is the identity or if j = + I and the points are on the two vertical boundary lines Re z = +1. In case (ii), it is easy to see that y = + TS with a = 0 and 2 1 and z2 on the unit circle (and symmetrically located with respect . In case to the imaginary axis) or with a = ± 1 and 2 1 = z 2 = ± + 1), and if such a map takes z 1 €F to 22 eF (Hi), y can be written as + we must have a = 0 and z2 = 2 1 = — + `ffs or else a = 1 and z2 = 2 1 + 1 = + Ç3. Case (iv) is handled in the same way as case (Hi). We conclude that in no case can 2 1 or z2 belong to the interior of F, unless +y is the identity and 22 = 2 1 . This proves the proposition. In the course of the proof of Proposition 1, we have established two other facts. Proposition 2. Two distinct points 2 1 , z 2 on the boundary of Fare F-equivalent only if Re 2 1 = ±4 and 22 = 21 + 1, or if 2 1 is on the unit circle and 22 =
In the next proposition, we use the notation Gz. for the "isotropy subgroup" of an element z in a set on which the group G acts: by definition, Gz =
{g eGigz = z}. Proposition 3. If z e F, then
rz +Iexcept
in the following three cases .•
(i) r= ± {/, S } if z = ; (ii) F = ± {I, ST, (ST) 2 } if z = co = + (Hi) F= {I, TS, (TS) 2 } if z = —63 = + Both Propositions 2 and 3 follow from the second part of the proof of Proposition 1. (ST) 3 = —I, and (TV = — I. Thus, in F = Notice that S 2 = SL 2 (7L)/+ I the elements S, ST, TS generate cyclic subgroups of order 2, 3, 3, respectively; and these subgroups are the isotropy subgroups of the elements i, (1), — el-3, respectively. These points in H with nontrivial isotropy subgroups are called "elliptic points". Another by-product of the proof of Proposition 1 is the following useful fact. Proposition 4. The group = SL,(Z)/+I is generated by the two elements S and T (see (1.7)). In other words, any fractional linear transformation can be written as a "word" in S (the negative-reciprocal map) and T (translation by 1) and their inverses. PROOF. As in the proof of Proposition 1, let F' denote the subgroup of f generated by S and T. Let z be any point in the interior of F (e.g. z = 21). Let g be an element of F. Consider the point gz EH. In the first part of thé
103
§1. SL 2 (Z) and its congruence subgroups
proof of Proposition 1, we showed that there exists y e I"' such that y(gz) e F. But since z is in the interior of F, it follows by Propositions 1 and 3 that yg = + I, i.e., g = e r. This shows that any g e r is actually (up to a sign) in r. The proposition is proved. Thus, any element in Î can be written in the form Sal T b, s a, T b2 Sairi, where all of the a bi are nonzero integers, except that we allow a l and/or b1 to be zero; and, since S 2 = —I, we may suppose that all of the ai equal 1, except that a l = 0 or 1. We may also use the identity (ST) 3 —1 to achieve a further simplification in some cases. We now return to the fundamental domain F for F = SL 2 (Z). Recall that in Chapter I §4 we also had a fundamental domain, in that case a parallelogram H c c for the lattice L. In that case the group was L, the action of g EL on a point z E C was simply g(z) = g + z. Every z e C is L-equivalent to a point in H, and no two points in the interior of fl are L-equivalent. In that situation we found it useful to glue together the boundary of fl by identifying L-equivalent points. We obtained a torus, and then we found that the map zi--+ (0(4 p'(z)) gives an analytic isomorphism from the torus CIL to the elliptic curve y2 = 4.r 3 g2 (L)x g 3 (L) (see §1.6). In our present situation, with the group F acting on the set H with fundamental domain F, it is also useful to identify r-equivalent points on the boundary of F. Visually, we fold F around the imaginary axis, gluing the 23 and + iy and the point eti° to e 2 i(112-0) for y > ..rpoint + iy to < O < I-. The resulting set F with its sides glued is in one-to-one correspondence with the set of f-equivalence classes in H, which we denote FVf. (The notation F\l/ rather than H/F is customarily used because the group f acts on the set H on the left.) In Chapter I we saw how useful it is to "complete" the picture by including a point or points at infinity. The same is true when we work with rvi. Let ii denote the set H {co} u G. That is, we add to H a point at infinity (which should be visualized far up the positive imaginary axis ; for this reason we sometimes denote it ico) and also all of the rational numbers on the real axis. These points {co} u Q are called "cusps". It is easy to see that r permutes the cusps transitively. Namely, any fraction a/c in lowest terms can be completed to a matrix (ca e r by solving ad — bc = 1 for d and b; 'this matrix takes co to a/c. Hence all rational numbers are in the same F-equivalence class as co. If r/ is a subgroup of r, then F' permutes the cusps, but in general not transitively. That is, there is usually more than one r' equivalence class among the cusps {co} u O. We shall see examples later. By a "cusp of F'" we mean a F'-equivalence class of cusps. We may choose any convenient representative of the equivalence class to denote the cusp. Thus, we say that "f has a single cusp at oo", where oo can be replaced by any rational number ,) a/c in this statement. We extend the usual topology on H to the set 11 as follows. First, a funda,
-
-
III. Modular Forms
104 00
iC
a Nc
0 a lc
Figure 111.2
mental system of open neighborhoods of co is Nc = {zEHIIM Z >C} L.) {CO} for any C > 0. Note that if we map H to the punctured open unit disc by sending z ,-.4 q d= ef e27tiz
,
(1.8)
and if we agree to take the point co E,171 to the origin under this map, then Nc is the inverse image of the open disc of radius e'Rc centered afthe origin, and we have defined our topology on I/ v {co } so as to Make this map (1.8) continuous. The change of variables (1.8) from z to q plays a basic role in the theory of modular functions. We use (1.8) to define an analytic structure on Hu {po } . In other words, given a function on H of period 1, we say that it is meromorphic at oo if it can be expressed as a power series in the variable q having at most finitely many negative terms, i.e. it has a Fourier expânsion of the form
f(z) =
E an e 2n1nz = E anqn, neZ
(1.9)
neZ
in which an = 0 for n << 0. We say that f(z) is holomorphic at co if a„ = 0 for all negative n; and we say that _Az), vanishes at co iff(z) is ,holomorphic at co and a() = 0. More generally, iff(z) has period N, then we use the map z F._, q Nd=ef e2lriz/N to map H u {co} to the open unit disc. We then express f(z) as a series in qN , and say that it is meromorphic (is holomorphic, vanishes) at co if an = 0 for n « O (respectively, for n < 0, for n ..._ 0). Next, near a cusp alceQ c ii we define a fundamental system of open neighborhoods by completing a, c to a matrix a = (ca db)e r and using a to transport the N, to discs which are tangent to the real axis at a/c (see Fig. 111.2). In other words, with this topology, to say that a sequence zi approaches a/c means that cC i zi approaches /co, i.e., that Im a-i zi approaches infinity in the usual sense. Notice, by the way, that the topology near the rational numbers a/c does not agree with the usual topology on the real line, i.e., a
§1. SL2(Z) and its congruence subgroups
•
105
sequence of rational numbers which approaches a/c as real numbers will not approach a/c in our topology. Let F denote the fundamental domain F with F-equivalent boundary points identified and with the cusp co thrown in. Thus, the points of F are in- one-to-one correspondence with F-equivalence classes in H. We take the topology on Fwhich comes from the topology on H. That is, by a small disc around an interior point of F we mean a disc in the usual sense ; by a small disc 'around co we mean all points lying about the line Im z = C, where C is + iy we mean the halflà.rge; by a small disc around a boundary point disc contained in F together with the half-disc of the same radius around + iy which is contained in F; and so on. Thus, P- has an analytic structure coming from the usual structure on H, except at cc, where it comes from the usual structure at 0 after the change of variable (1.8). In Chapter I we found that the Weierstrass 0-function gave us an analytic isomorphism of CIL with an elliptic curve in PI. in our present situation, we shall later see that a certain function (called the "j-invariant") gives an analytic isomorphism between F = IV' and the projective line (Riemann sphere) PL We now look at fundamental domains for subgroups r' c r. Suppose n < oo, so that F can be written as a disjoint union of n that [r: cosets r = u1= 1 ai r .I claim that F' = U7=1 ŒT 1 Fwi1l serve as a fundamental domain for F'. Let us verify that every ZEH is F'-equivalent to a point in F' ; the verification that no two interior points of F' are F'-equivalent will be left as an exercise (see below). Let z E H. Since F is a fundamental domain for F, we can find 'y e F such that yz e F. Then for some i we have y = at i y' with y' E F', and hence y' ze cci -l F a F', as desired. Roughly speaking, F' is n times as big as F because there are one n-th as many elements y' E r which one can use to move z around. There are many possible choices of the ai in the coset decomposition in the previous paragraph. In practice, we shall always try to choose the a, so that the resulting F' is simply connected. As an example, let us find a fundamental - domain F(2) for r = F(2). (We shall use the notation F(N) to mean any fundamental domain for F(»), Fo (N) to mean any fundamental domain for Fo (N), etc.) Since r(2) is the kernel of the surjective map SL2 (Z) SL2 (7L/27L), and since SL2 (Z/2Z) = GL2 (Z/2Z) is isomorphic to S3 (see Problem 6 of §I.8), it follows that [F: r(2)] = 6. As coset representatives of r modulo F(2) let us choose:
106
III. Modular Forms
c1 2
F
-
3
- /
A...
4 3"
-
-
I
I —
1 i
i
1 2-
-
Figure 111.3. A fundamental Domain F(2) for F(2).
The resulting fundamental domain F(2) = U---1 aT i F is depicted in Fig. 111.3. Because any fractional linear transformation takes a circle or line to a circle or line and preserves symmetry about the real axis, it follows that the boundary of any fundamental domain_ constructed in this way consists of vertical lines and arcs of circles centered at rational numbers on the real axis. The boundary of the fundamental domain in Fig. 111.3 is made up from the vertical lines Re z = — i and Re z = 1, the circles of radius 1 centered at 0 and —1, and the circles of radius I centered at 1 and —1. We see that r(2) has three cusps: oo, 0, — 1. That is, there are three f(2)-equivalence classes of cusps with those three points as representatives. Now the etymology of the word "cusp" is clear: geometrically, F(2) has the appearance we usually associate with the word "cusp" at the points 0 and —L PROBLEMS
1. Prove that F1 (N) is a normal subgroup of Fo (N) but not of F. Is ro (N ) a normal subgroup of F?
107
§1. SL2 (Z) and its congruence subgroups ."
2. (a) Prove that for any N, the map SL 2(7L) -0 SL 2 (Z/NZ) obtained by reducing the
_ matrix entries modulo N is a surjective group homomorphism. (b) Prove that for any- positive integers M and N, the maps ("reduction modulo ' N") from SL 2 (Z/MNZ) to SL 2 (7LIN7L) and from GL 2 (Z/MNZ) to GL 2 (ZINZ) are surjective group homomorphisms. 3. What is the order of the group (a) GL2 (Fq)? (b) SL 2 (Fq)?
_
4. (a) What is the kernel of the homomorphism GLAZIpeZ)-+ GL 2 (ZIpZ)? (13) What is the order of the group GL 2 (ZIpeZ)? (c) What is the order of the group SLAZIpeZ)? 5. Let N = pTI • - • Kr be the prime factorization of the positive integer N. Show that the reductions modulo pp, j = 1, . .. , r, give isomorphisms
GL 2 (1INZ)Z fl iGL 2 (ZIp7.'7L)
and
SL 2 (ZINZ) 2; fl;SL 2 (ZIp7jZ).
6. What is the order of the group SL 2 (Z/ATZ)? 7. Find the indices [r: [' (N)], [F l (N): r(N)],[ro(N):Fl (N)],
[ro(N): rorn, and
[r: r„(N)]. 8. Find a simple isomorphism from F(N) to a subgroup of ro(N 2) having index 4)(N) in F0 (N 2). In particular, F(2) and F0(4) are isomorphic. 9. Suppose that F1 and
r, are two subgroups of finite index in r, and F1 = ocF2 a' for
some a. E GL2 (0). If F2 is a fundamental domain for F2 , prove that cf.F2 is a fundamental domain for F1 • 10. Using the previous problem, draw a fundamental domain for F0 (4). 11. Suppose that N--- U.'=1 J ai r'' 9 where r is a subgroup of index n in r. Let F' ..-U7= 1 ar I F. Show that no two distinct points in the interior of F' are F'-equivalent. 12. Describe all congruence subgroups F' of level 2, i.e., all groups contained between F and F(2): F(2) c F' c F. For each such F', find a simply connected fundamental domain by taking a suitable part of the fundamental domain F(2) for r(2) that was given in the text. 13: (a) Prove that ±S and T 2, with S and T as in (1.7), generate one of the subgroups of level 2 in the previous problem. That group was denoted 0(2) by Hecke. mod 2} ; and that T and ST 2 S = (b) Prove that TSTand T 2 generate r°(2) clef ). generate ro(2 (c) Prove that the elements T 2 and ST -2 S generate r(2). (d) Prove that T and ST'S generate ro(4). . {
(:
(:,)
14. (a) Prove that the following is a complete set of Coset representatives lai for F0 (,) in r, i.e., r = Uceiro(pe) is a disjoint union: 1;
T -k S, k = 0, 1,
' - 1;
ST'PS, k= 1, 29 ... , p e-i -, 1.
-(1)) Use part (a) to draw a fundarr , -ntal domain for rom. nental domain for F0 (p). Draw the fundamen(c) Using part (a), describe a fu tal domain for F0(3). 15. Suppose Fl and r2 are subgroup
're index in F having fundamental domains
108
Ill. Modular Forms
F1 and F2, respectively. Does F1 c F2 imply that I-, counterexample.)
r2 ? (Give
a proof or
16. (a) The assertion at the end of the section that r(2) has 3 cusps (i.e., that there are three f(2)-equivalence classes of cusps) does not immediately follow from the appearance of the fundamental domain, because boundary points of the fundamental domain may be 1-(2)-equivalent. But prove directly that oo, 0, —1 are f(2)-inequivalent to one another. (13) How many cusps does each of the congruence subgroups in Problem 12 have?
17. Let {ai} be a complete set of coset representatives for 1--- / in F, where r is a subgroup of finite index in F. Show that the cusps of f' are among the set {ay' oo }, but that c:57' oo and ak-1 œ are f'-equivalent if and only if there exists ne1 such that cck- 1 rn je F. 18. Prove that fo (p) has two cusps 00 and 0; and that r0(p2) has p + 1 cusps: oo, 0, and — 1 ilep for k = 1, . . . , p 1.
§2. Modular forms for SE2 (7L) Definition. Let f(z) be a meromorphic function on the upper half-plane H, and let k be an integer. Suppose that f(z) satisfies the relation f(yz) = (cz + Of(z)
for all
y=
fa b d)
e SL 2 (7).
(2.1)
In particular, for the elements y =-- T = (V) and y = S =1-,), (2.1) gives f(z + 1) = f(z);
(2.2)
f(— 11z) = (— z) kf(z).
(2.3)
Further suppose that f(z) is "meromorphic at infinity". Recall that this means that the Fourier series
f(z) =
E an qn,
where
q
e 2niz
(2.4)
neZ
has at most finitely many nonzero a„ with n < O. Then f(z) is called a "modular function of weight k for f = SL 2 (Z)". If, in addition, f(z) is actually holomorphic on Hand at infinity (i.e., an =0 for all n < 0), then f(z) is called a "modular form of weight k for r = SL 2 (Z)". The set of such functions is denoted Afk (f). If we further have ao = 0, i.e., the modular form vanishes at infinity, then f(z) is called a "cusp-form of weight k for f". The set of such functions is denoted Sk (F). (The use of the letter S is traditional, and comes from the German "Spitzenform" for "cusp-form". Cusp-forms are sometimes also called "parabolic forms".) _
-
109
§2. Modular forms for SL 2 (Z)
Finally, the expansion (2.4) for a modular function f(z) is called its
"q-expansion". We first make some easily verified remarks about these definitions.
'
Remarks. 1. If k is odd, there are no nonzero modulai functions of weight
k for r. We see this by substituting y = (1 _?) in (2.1). So in this section we shall always suppose that k is even. ddY: = 4-((az + b)I(cz + d)) = (cz + d) -2 , we can rewrite (2.1) in " 2. Since — the-form: cilyof(yz) = f(z), i.e., f(z)(dz) 42 is invariant when z is replaced by yz. From this we see that if (2.1) holds for yl and y2 , then it holds for Vi y2 . Since all of f is generated by S and T, this means that (2.2)-(2.3) imply (2.1). 3_, The conditions are preserved under addition and scalar multiplication, i.e., the sets of modular functions, forms, and cusp-forms of some fixed weight are complex vector spaces. In addition, the product of a modular function (or form) of weight k 1 and a modular function (respectively, form) of weight k2 is a modular function (respectively, form) of weight k 1 + k 2 ; and the quotient of a modular function of weight k 1 by a nonzero modular function of weight k2 is a modular function of weight k 1 — k 2 . In particular, the set of modular functions of weight zero is a field. 4. If k = 0, then the condition (2.1) says that f(z) is invariant under r = sL2(z), i.e., it may be considered as a function on iv/. If k = 2, then the differential form f(z)dz on H is invariant under f, i.e.,flyz)dyz = f(z)dz, since dyzldz = (cz + d) -2 .
(Eisenstein series). Let k be an even integer greater than 2. For z e H we define EXAMPLE
Gk(z)= Et def m, n
1
(2.5)
(mz + n)"
where the summation is over pairs of integers m, n not both zero. If we let L, denote the lattice in C spanned by 1 and z, then this is a familiar definition from Chapter I (see (6.3) of Chapter I). That is, the function Gk (z) in (2.5) is what we then denoted Gk (L). The new point of view in this chapter is to think of Gk (z) = GAO as functions of z, not merely as coefficients in the Laurent expansion of the Weierstrass p-function. (The letter z was used in a different way in §6 of Chapter I.) Because k is at least 4, the double sum (2.5) is absolutely convergent, and uniformly convergent iri any compact subset of H. Hence Gk (z) is a holomorphic function on H. It is also obvious that Gk (z) = Gk (z + I), and that the Fourier expansion (2.4) has no negative terms, because Gk (z) approaches a finite limit as z --4 ix:
iim Eionz + nyk = E
z-+1 Ç) M,11
n*0
12 — k ,=
Mk).
110
_
111. Modular Forms
Finally, we easily check that z -k Gk (— 1/z) =---
+ nz) -k = Gk (z),
(
,
i.e., Gk (z) satisfies (2.3). We have proved Proposition 5.
We now compute the q-expansion coefficients for Gk. We shall find that these coefficients are essentially the arithmetic functions of n _fEdk-i. (2.6) uk _ i (n) cw din
Proposition 6. Let k be an even integer greater than 2, and let z H. Then the modular form Gk (z) defined in (2.5) has g-expansion Gk (z) = X(k) (1 —
2k '
where q = ec iz , and the Bernoulli numbers x ex — 1
—
Bk
(2.7)
are defined by setting
G° D X k ...0 k — . k! k=0
(2.8)
E
(Note. Our notation is slightly different from Serre's in A Course in Arithmetic. Basically, he uses 2k where we use k.) PROOF.
The logarithmic derivative of the product formula for sine is a
L d a+n a—n n=1
a e H.
,
(2.9)
If we-write the left side as iri(e + e - R ia)1(enia — e - a) = ni + 2761(e 27" — 1), multiply both sides by a, replace 2nia by x, and expand both sides in powers of x (using (2.8)), we obtain the well-known formula for C(k): C(k) = —(2ni) k Bk/2k!
for k> 0 even.
(2.10)
Next, if we successively differentiate both sides of (2.9) with respect to a (see Problem 8(d)-(e) of §11.4) and then replace a by mz, we obtain: co
1
_.. (2n _
)
k
iG Di
+ n)k (k — 1) 1• n=1 '3
co n k–l e 2ninmz = : 2k c(k\ E dk
Bk
. . iern
) (1=1
(where we have used (2.10) and replaced n by d and e2lriz by g). Thus, Gk (z) = 2C(k) + 2
" E E
1 m=i n = _co onz +
yok
= n(k)(1 — -2-k i d k-1 en) . Bk m,d=1
111
§2. Modular forms for SL 2 (7L)
Collecting coefficients of a fixed power qn in the last double sum, we obtain the sum in (2.6) as the coefficient of qn . This completes the proof. Because of Proposition 6, it is useful to define the "normalized Eisenstein series', obtained by dividing Gk (z) by the constant 2C(k) in (2.7):
k (z) = 2
2Bkk ak
Gk (z) = 1
(n) q„. E
(2.11)
Thus, Ek (z) is defined so as to have rational q-expansion coefficients. The First few Ek are:
E4(z) = 1 + 240
E6 (z)
=
(n)qn ; c I — 504 icr5 n=1
E8 (z)
480 )i cr7 (n)q" ;
=1+
n=1 GO
— 264
E
o 9 (n)q;
n=1 -
± 65520 'it un 691 E14 (z) = 1 — 24
;
cr13 (n)q. n=1
An alternate way of defining the normalized Eisenstein series is to sum only over relatively prime pairs m, n in (2.5):
Ek(z) =1 E 2 ,,,, z
(m,n)=1
1 +
(2.12)
where (m, n) denotes the greatest common divisor. The equivalence of (2.12) and (2.11) will be left as an exercise (see below). (The discriminant modular form A(z)). Recall from our study of the Weierstrass p-function of a lattice L that one defines g2 (L) = 60G4(L), g3 (L) = 140G6 (L) to be the coefficients occurring iiI the differential equation satisfied by the 0-function (i.e., in the equation of the elliptic curve corresponding to L; see (6.8)—(6.9) in §I.6). Now let us define g2 (z) = g2 (L2), g3 (z) = g3 (L) for z E H. That is, EXAMPLE
g2 (z) = 60G4 (z) ;
g3 (z) = 140G6 (z).
(2.13)
Then g2 (z) and g3 (z) are modular forms for I" of weigh 4 and 6, respectively. Since CO) = rc4/90 and C(6) = n 6/945, we can exPress y, and g3 in terms of the normalized Eisenstein series E4 and E6 as follows :
112
III. Modular Forms
g3 (z) = 287 1c6E6 (z)•
g2 (z) =Irc 4E4 (z);
(2.14)
The discriminant of the elliptic curve corresponding to Le —a function of the lattice which is nonvanishing for all nontrivial lattices, i.e., in all cases when the cubic polynomial 4x 3 g 2 (4)x g 3 (L 2) has distinct roots—is given by (see Problem 2 of §I.6) (270 1 2
(z) = g 2 (z) 3 — 27g 3 (Z) 2 =
(.E (z) 3 172 8 -
E6 (Z) 2).
(2.15)
By Remark 3 following the definition of a modular form, we see that A(z) is a modular form of weight 12 for F. Moreover, because both E4 (z) and E6 (z) have constant term act =4.- I in their q-expansions, we see that the constant term in (2.15) is zero, i.e., A(z) is a cusp-form. It is the first example of a cusp-form that we have seen. We shall later see that it is the cusp-form of lowest possible weight for F. The q-expansion of (270 -12A(z) has rational coefficients, and the first coefficient is easily computed to be 1 (use: E4 = 1 + 240q + ••• , E6 = 1 — 504q + • • • ). We shall later prove a remarkable product formula, due to Jacobi, for this discriminant function. The example of Eisenstein series gives us one modular form for every even weight starting with 4. It might seem unfortunate to have to pass up 2. Is there any Eisenstein series that can be salvaged in the case k = 2? It turns out that there is, but the normalized Eisenstein series E2 is not a modular form. We use the same definition as for the other Ek , except that the double sum when k = 2 is not absolutely convergent, and we must take care with the order of summation. Note that in the proof of Proposition 5, when showing that z -kGk (— l/z) = Gk (z), we needed to reverse the order of summation over m and n. Because this change of order of summation is no longer justified when we only have conditional convergence, it turns out that E2 fails to transform "correctly" under zi--+ —11z. E2 satisfies a slightly more complicated transformation rule, a rule that will be used in important ways later. Thus, we define (here ' means that n 0 if m = 0)
2, onz + 102
E2 (Z) = 2c(2) m
3 —1 +
E
co
1 ( rn7
n
)
(2.16),
6 1 = 1 ±-2-E „=„„ (mz + d)2 9 The inner sums clearly converge for any zeH and any m; and, once we obtain a different expression for,the inner sums, we shall see that the outer sum over m then converges absolutely. As in the proof of Proposition 6,
113
§2. 'Modular forms for SL 2 (7L) -
we find that for any fixed m = 1, 2, 3, . . . co 4 1 = — -L.- C(2) i de m , (mz + n) 2 LI 2 d=1
where
q=
e2.
This gives us 00 00
E2 = 1 — 24 E E
den.
m=1 d=1
Since 11 < 1, it is easy to see that the double sum over m and d is absolutely convergent. Collecting coefficients of qn by summing over all divisors d of n, we obtain: OD
E2 = 1 — 24 E cr 1 (n) qn .
(2.17)
n=1 ,
As in the case of the higher Ek, E2 is a holomorphic function on H which is periodic of period 1 and holomorphic at infinity. In order for it to be a modular form of weight 2 for 1-, all we would need in addition is for z7 2E2 ( — liz) to equal E2 (z). From (2.16) we find that _ l z- 2E2 (—
\
Ito
E Ef 2c(2),-.° „= _ co
=1+4 i p
(
-
rn + nz
E m*0 (riz
2
(2.18)
1 + 17) 2.
Thus, the extent to which E2 fails to satisfy, the "right" rule is a reflection of the alteration produced by reversing the order of summation. We now compute this "error term".
Proposition 7. z -2E2 (— 1/z) = E2(Z) ±
12 21riz
(2.19)
PROOF. The proposition says that 1 212niz is the difference between the double
sum (2.18) and the double sum (2.16). Suppose we introduce a "correction term" am ,, inside both double sums which causes the double sums to be absolutely convergent. In that case, the order of summation of the "corrected" double sums does not make any differénce,.i.e., the two "corrected" double sums are equal. It then follows that the difference between (2.18) and (2.16) is equal to the difference between E. E n am,„ and En En, am, ,,. The idea is to choose am,„ to be a term close to Om + n) -2. but which is easier to sum. - Let us take
am,„ = a,,,,,,(z) =
1 ___ , (rnz + n — 1) (mz + n) (mz , + n — 1)
Om + n) •
114
III. Modular Forms ,
Since the difference between (mz + n) -2 and a„,,„ is 1/(mz + n) 2(mz + n it follows -that the following "corrected" E2 is absolutely convergent:
-
1),
1
P2(z) =-- 1 + — 732 mE , 0 n=cii_ 00 (mz + 7, ) 2 31 ---- 1 + —2i
n
co
3
1
I
\ 2 + it 2 E E m#0 n= - Go 111Z + n mz + n - 1 •
m*On -= -co ktnZ + n 1
But since the last inner sum telescopes to zero, we have: P2 (z) = E2 (z). Because the double sum for E2 is absolutely convergent, we have 3 j9, v 1 Z4 (OnZ + n) R.- co m#o 00 3
E2 (Z) =" 1 + ---7 n
Zia
(2.20)
am „ (z). ,
So it remains to evaluate this' last double sum. Now this double sum differs from the sum in (2.18) by an absolutely convergent series. As in the derivation of (2.17), we find that for n> 0 1 v _ 1 E ' ,,,o (_ rnz _ ny z2 „,z*0 -' (
1 -
_
c° de - 2nidniz 47r2 ,72
1 n2
nlz + m) 2 —
(1 = 1
4.,
•
Since - l/z is a fixed element of H, we see that the outer sum over n in (2.18) converges absolutely. Thus, the same is true for E. s„, a„,,„(z). This justifies writing N
co
E E
EE
a „„, „(z) a „ , ,„(z) = lim 12--- - - co In4:o N -400 n= -N+1 rn*0 N
=
lim E E
1111-"° m*0 n= -N+1
The last inner sum telescopes to 1/(mz - N) we have
I
v
„,4:0 (mz
-
1
-
am , (z).
11(mz + N), and by (2.9)
)=2 z „,z=1 -s -Nlz +m
N mz + N
2
+
I -Niz -
.)
7r.N) , z z N
=. - (r cot (--- -1- --)•
z
We conclude that the double sum is equal to 2: N1 c LI on0 ot(
2n .
-
. e -2niNiz + 1
R./ V 1z) = — lirn i -21TiNlz _ 1 Z N.-0 e
=
2ni z•
Substituting this into (2.20) gives: 6i nz
_..-,
E2 (Z) = Z -2 E2 (-1/Z) + — = Z hE2( - 11Z)
This completes the proof of Proposition 7.
2niz
0
115
§2. Modular forms for SL 2 (7L)
—1/2
0
1/2
Figure 111.4. Contour for the Proof of Proposition 8.
The next result will play a basic role in determining the spaces Mk (V) and Sk (f) of modular forms and cusp-forms of given weight for r, and it will also be useful in proving that two modular forms defined in different ways are actually the same in certain cases. Proposition 8. Let f(z) be a nonzero modular function of weight k for r. For Pe H, let v(f) denote the order of zero (or minus the order of pole) of f(z) at the point P. Let voo (f) denote the index of the first nonvanishing term in the q-expansion off(z). Then 1 Pe FAH,P*i.o.)
vp(f)
---
T-2- -
(2.21)
(Note. It is easy to check that v(f) does not change if P is replaced by yP for y E F.) PROOF. The idea of the proof is to count the zeros and poles in F\II by integrating the logarithmic derivative of f(z) around the boundary of the fundamental domain F. More precisely, let C be the contour in Fig. 111.4. The top of C is a horizontal line from H = + iTto A = iT, where T is taken larger than the imaginary part of any of the zeros or poles of f(z).
116
III. Modular Forms
(Note. That this can be done, i.e., that f(z) does not have poles or zeros
with arbitrarily large imaginary part, follows from the fact that the change of variables q = e 22 makesf(z) into a meromorphic function of g in a disc around q = O.) The rest of the contour follows around the boundary of F, except that it detours around any zero or pole on the boundary along circular arcs of small radius E. This is done in such a way as to include every riequivalence class of zero or pole exactly once inside C, except that i and co (and Sco = 63) are kept outside of C if they are zeros or poles. In Fig. 111.4 we have illustrated the case when the zeros and poles on a boundary of F consist of i, co (and hence also SO, one point P on the vertical boundary line (and hence also the F-equivalent point on the opposite line), and one point Q on the unit circle part of the boundary (and hence also SQ). —
According to the residue theorem, we have ,
1f
r(z)dz=
2ri j c f(z)
E Perl11,13
vp(f).
(2.22)
#i,o)
On the other hand, we evaluate the integral in (2.22) section by section. First of all, the integral from A to B (see Fig. 111.4) cancels the integral from G to H, becausef(z + I.)= f(z), and the lines go in opposite directions. Next, we evaluate the integral over HA. To do this we make the change of variables q = e2"iz . Let f(q) = f(z) = E anqn be the q-expansion. Since f(z) = -4f(q) -c4( -1, we find that this section of the integral in (2.22) is equal to the following integral over the circle of radius e'T centered at zero:
i f dfidqdq. -2-Trij f(q) Since the circle is traversed in a clockwise direction as z goes from H to A, it follows that this integral is minus the order of zero or pole of AO at 0, and this is what we mean by — vœ (f). To evaluate the integral over the arcs BC, DE, and FG, recall the derivation of the residue formula. Iff(z) has Laurent expansion cm (z — a)'" + • • • near a, with c.,,, 0 0, then f(z)/f(z) := z Ta + g(z), with g(z) holomorphic at a. If we integrate f(z)/f(z) counterclockwise around a circular arc of angle 0 centered at a with small radius e, then as c —> 0 this integral approaches mi0 (the usual residue formula results when 0 = 2n). We apply this to the section of (2.22) between B and C, letting c --+ O. The angle approaches 43, and so we obtain --iir (v.(f)in13) = —v„,(f)16. (The minus sign is because the arc BC goes clockwise.) In the same way, we find that as s ---+ 0 the part of (2.22) from D to E becomes — vi (f)12, and- the part between Fand G becomes
= — v(f )/6. What remains is the integral from C to D and from E to F. Combining the above calculations, we find from (2.22) that the left side of (2.21) is equal to that remaining section of the left side of (2.22). Thus, to prove Proposition 8, it remains to show that in the limit as e-- 0
-
117
§2. - Modular forms for SL 2 (7L)
_..* k f (z) dz + 1 f f (z) d -1f 2ni EF f(z) z 12 . 2ni j cp f(z)
(2.23)
To compute the sum of these two integrals, we note that the transformation S: zi-0 — liz takes CD to EF, or more precisely, to FE, i.e., Sz goes from F to E along the contour as z goes from C to D along the contour. The desired formula (2.23) will follow from the following more general lemma. e F, with c 0 0, and let - f(z) be a merornorphic Lemma. Let y = function on H with no zeros or poles on a contour C c H. Suppose that f(yz) = (cz + d)Y(z). Then
‘c -f)((zz)) dz L ffl ((zz)) dz =
k ic
J
dz z + (d1c)*
(224) .
The required equality (2.23) follows immediately from the lemma, where we set y = S = a 1), note that S(CD) = FE, and compute that as g —> 0
1 dz — 276 fCD z
,
1/4
1 dO = -12 1/3
(where z = e 2n10).
PROOF OF THE LEMMA. Differentiating
(2.25)‘ ,
f(y z) = (cz + d) kf(z), we obtain
,
f (yz) -cILYI = (cz + d)kf'(z) + kc(cz + d) k- V(z). dz
(2.26)
We now divide (2.26) by (2.25): f (7z)dy — f (z) ciz + k edz
f0z) • z —f(z)
cz + d •
Thus, the left side of (2.24) is equal to
I,
r(z)dz f (Yz) dyz =
f(z)
f(yz)
k
Çc cdz
cz + d •
This completes the proof of the lemma, and of Proposition 8.
D
We now derive several very useful consequences of Proposition 8.
Proposition 9. Let k be an even integer, F = SL 2 (7L). (a) The only modular forms of weight 0 for r are constants, i.e., 1110 (F) = C. (b) iii/k (F) = 0 if k is negative or k = 2. (d). Mk (F) is one-dimensional, generated by Ek 1 if k = 4, 6, 8, 10 or 14; in other words, Mk (f) =_CEk for those values of k. , (d) Sk(r) = 0 ifk< 12 ork= 14; S12(r) = CA ; and for k> 14 sk(r) =
118
III. Modular Forms
AMk-12(r
) (i.e., the cusp forms of weight k are obtained by multiplying
modular forms of weight k 12`by the function A(z)). (e) Mk (f) = Sk (F) CEk for k > 2. PROOF. Note that for a modular form all terms on the left in (2.21) are nonnegative.
(a) Letfe Mo (F), and let c be any value taken by f(z). Then f(z) c eMo (f) has a zero, i.e., one of the terms on the left in (2.21) is strictly positive. Since the right side is 0, this can only happen if f(z) — c is the zero function. (b) If k < 0 or k 2, there is no way that the sum of nonnegative terms on the left in (2.21) could equal k/12. (c) When k = 4, 6, 8, 10, or 14 we note that there is only one possible way of choosing the v(f) so that (2.21) holds: for k = 4, we must have v,,,(f ) = 1, all other vp(f)= 0; for k = 6, we must have vi(f)= I, all other vp(f)= 0; for k = 8, we must have v.(f) = 2, all other vp(f)= 0; for k = 10, we must have v0(f ) = vi(f)= 1, all other vp(f)= 0; for k = 14, we must have v »(f ) = 2, vi (f ) = 1, all other v(f) = O. Let f1 (z), f2 (z) be nonzero elements of Mk (z). Sincefi (z) and f2 (z) have the same zeros, the weight zero modular function f1 (z)/f2 (z) is actually a modular form. By part (a), f1 and f2 are proportional. Choosing f2 (z) E k (z) gives part (c). (d) For fe Si (r) we have v,„(f ) > 0, and all other terms on the left in (2.21) are nonnegative. Notice that when k 12 and f = A, (2.21) implies that the only zero of A(z) is at infinity. Hence, for any k and any Je Sk (F), the modular functionfl A is actually a modular form, i.e.,f/A E Mk _ 2(F). This gives us all of the assertions in part (d). (e) Since Ek does not vanish at infinity, given fe Mk (f) we can always subtract a suitable multiple of Ek so that the resulting f cEk eMk (f) vanishes at infinity, i.e., f cEk e sk (r). We now prove that any modular form for I" is a polynomial in E4, E6 (see Problem 4 of §I.6 for a different proof of this fact). Proposition 10. Any feAl k (r) can be written in the form
f(z) =
ci,jE4 (2) iE6' (Z) i .
(2.27)
4i+ 6j=k
PROOF. We use induction on k. For k = 4, 6, 8, 10, 14 we note that E4, E6, E4 E6 , E4 E6, respectively, is an element of Mk (r), and so, by Proposition 9(c), must span Mk (f). Now suppose that k = 12 or k> 14. It is clearly possible to find i and j such that 41 4- 6/ = k, in which case E‘it Ei Emk(r). Given fe Mk(r), by the same argument as in the proof of Proposition 9(e)
119
§2. Modular forms. for SL 2 (1)
we can find ceC such that f we can write f in the form
—
cE14ie Sk (F). By part (d) of Proposition 9, -
. f= cElEi + AA . = cELE-4 +
(270 1 2 1728
(E43 — E 62 )fl , s
where fi e Mk-12( 1") , Applying (2.27) to fi by the induction assumption a (with k replaced byjc — 12), we obtain the desired polynomial forf.
The kinvariant.We now define a very important modular function of weight zero:
E4 (Z) 3 2 1728 1728g2 (z) 3 Jkz litf A (z) — E4(z) 3 E6 (Z) •,
(by (2.14)-(2.15))
(2.28)
-
Proposition 11. The function j gives a bijection from FV-1- (the fundamental
dom qin with r-eguivakni sides identified and the point at infinity included) and the Riemann sphere N = C u lool. In the proof of Proposition 9(d) we saw that A(z) has a simple zero at infinity and no other zero. Since g 2 does not vanish at infinity, this means that j(z) has a simple pole at infinity and is holomorphic on H. For any C E C the modular form 1728g3 — cA eM 12 (F) must vanish at exactly one point P El"\H, because when k = 12 exactly one of the terms on the left in (2.21) is strictly positive. Dividing by A, we see that this means that j(z) — c ---z-- 0 for exactly one value of ze F\l/. Thus, j takes co to oo and on o F\li is a bijection with C. PROOF.
Proposition 12. The modular functions of weight zero for F are precisely the
rational functions off. PROOF. A rational function of j(z) is a modular function of weight zero (see Remark 3 at the beginning of this section). Conversely, suppose that f(z) is a modular function of weight zero for F. If zi are the poles of f(z) in rvy, counted with multiplicity, then f(z)•11 i (j(z) j(zi)) is a modular function of weight 0 with no poles in H, and it suffices to show that such a function is a rational function of f. So, without loss of generality we may assume thatf(z) has no poles in H. We can next multiply by a suitable power of A to cancel the pole off(z) at oo. Thus, for some k we will have A(z)Y(z) e M/2k (F). By Proposition 10, we can write f(z) as a linear combination of modular functions of the form EE6ilY (where 4i + 6j = 12k), so it suffices to show that such a modular function is a rational expression in j. Since 4i + 6j is divisible by 12, we must have i = 3i0 divisible by 3 and j :----- 2j0 divisible by 2. But it is easy to check that EPA and EPA are each of the form aj + b, by (2.28); and Ero.Eljo/A k is a product of such factors. This o proves the proposition. —
120
III. Modular Forms
The definition (2.28) is not the only way one could have defined the invariant. It is not hard to see that any ratio of two non-proportional modular forms of weight 12 would have satisfied. Propositions 11 and 12. But j(z) has the additional convenient properties that: its pole is at infinity, i.e., it is holomorphic on H; and its residue at the pole is 1, as we easily compute from (2.28) that the q-expansion off starts out !-1 + • • . Recall that in Chapter I we used the Weierstrass p-function and its derivative to identify the analytic manifold C/L with an elliptic curve in Pt. Similarly, in our present context we have used the j-invariant to identify 1"\ii as an analytic manifold with the Riemann sphere PL Proposition 12 then amounts to saying that the only meromorphic functions on the Riemann sphere are the rational functions. Thus, Proposition 12 is analogous to Proposition 8 in §1.5, which characterized the field of elliptic functions as the rational functions of x = p(z), y =
A loose end from Chapter I. In Chapter I we defined an elliptic curve over C to be a curve given by an equation of the form y2 = f(x), where f(x) is a cubic with distinct roots. We then worked with curves whose equations were written in the form
= 4x 3 — g2 (L)x — g 3 (L)
(2.29)
for some lattice L. It is not hard to see that a linear change of variables will bring an equation y2 = f(x), where f(x) has distinct roots, into the form
y 2 = 4x3 Ax B
with
A 3 — 27B2 0 0.
(2.30)
But in order to write such a curve in the form (2.29) we need to show that a lattice L can always be found such that g2 (L)= A, g 3 (L) z--- B. This was not proved in Chapter I, but it is easy to prove now using modular forms. In what follows we shall write a lattice L= {mo t + nco2 }, where z = c1) 1 1 o2 e H, as follows: L = 2Lz = {nviz n2 }. Here 2 = (0 2 . Thus, any lattice L is a complex multiple of (i.e., a rotation plus expansion of) a lattice of the form L. Proposition 13. For any A, BE C such that A 3 0 27B 2 there exists L =AL z
such that
g 2 (L)
A;
(2.31) (2.32)
PROOF. It follows immediately from the definition 1- 4 g 2 (Lz), and similarly g 3 (2L) = 1 6 g 3 (L3. -
of g 2 that g2 (2Lz)=
By (2.14), we can restate the proposition: there exist 2 and z such that E4 (z) = 24(3/4n4)A and E6 (z) = 26(27/80)B. Let a = 3A14E4,b = 27B/8e. Then the condition A 3 0 27B2 becomes a3 b2 . Suppose we find a value of z such that E6 (z) 2/E4(z) 3 = b 2/a3 . Then choose 2 so that E4 (z) = 24a,
121
§2. Modular forms for SL 2 (7Z)
in which case we must have
E6(Z) 2 = b2E4 (z) 31a3 = 212 b 2,
i.e.,
E6 (z) = +16b.
If we have + in the last equation, then our values of z and 2 have the required properties (2.31)42.34 If we have —, then we need only replace 1 by So it remains to find a value of z which gives the right ratio E6 (z) 2/E4(z) 3 . Now EVE43. = I — 1728/i, by (2.28). Since j(z) takes all finite values on H, and b2/a3 0 I, we can find a value of z such that b 2 la' = 1 — 17281j(z) = E6 (z) 21E4 (z) 3 . This completes the proof. Thus, there was no loss of generality in Chapter I in taking our elliptic curves to be in the form (2.29) for somelattice L.
The Dedekind eta function and the product formula for A(z). We conclude this section by proving the functional equation for the Dedekind a-function, from which Jacobi's product formula for A(z) will follow. The function q(z), z e H, is defined by the product -
ri(z) = e 2 ' 24
(1 11.- kJ. n=1
e2ninz).
(2.33)
(In Problem 7 of §II.4 we gave another definition, and another proof of the functional equation. In the problems below we shall see the equivalence of the two definitions.) Proposition 14. Let .1- denote the branch of the square root having nonnegative
realpart. Then
n(— 1/z) = fzTig(z).
(2.34)
PROOF. The product (2.33) clearly converges to a nonzero value for any ze H, and defines a holomorphic function on H. Suppose we show that the logarithmic derivatives of the left and right sides of (2.34) are equal. Then (2.34) must hold up to a multiplicative constant; but substituting z --= i shows that the constant must be 1. Now the logarithmic derivative of (2.33) is
ty(z) 2ni t n(z) — 24
co
24
E
ne 2. ninz
n= 1 1 — e 2irin )
If we expand each term in the sum as a geometric series in qn (0? = e27'), and then collect terms with a given power of q, we find that til(z)
_27ri
?Az) — 24
— 24
o-1 (n)qn) =
(2.35)
Meanwhile, the logarithmrc derivative of the 'relation (2.34) that we want is:
122
III. Modular Forms --,
rii (— 1 I z) z _2 ti(— 11z)
(2.36)
2z
Using (2.35), we reduce (2.36) to showing that E2 ( — liz)z -2 =
12 + E2(Z), 2niz 0
and this is precisely Proposition 7. Proposition 15.
(2/0 -12 A(z) -z--- q fl (1 — q") 2 4 ,
q = elniz
(2.37)
.
n=1
PROOF. Let f(z) be the product on the right of (2.37). The function f is holomorphic on H, periodic of period 1, and vanishes at infinity. Moreover, f(z) is the 24-th power of q(z), by definition; and, raising both sides of (2.34) to the 24-th power, we find that f(-11z) = z i2f(z). Thus, f(z) is a cusp form of weight 12 for r . By Proposition 9(d), f(z) must be a constant multiple of d(z). Comparing the coefficient of q in their q-expansions, we conclude that o f(z) equals (270' 2 A(z). Notice the role of the Eisenstein series E2 (z) in proving the functional equation for ri(z), and then (2.37). The 17-function turns up quite often in the study of modular forms. Some useful examples of modular forms, especially for congruence subgroups, can be built up from ri(z) and functions of the form ri(Mz). The q-expansion in (2.37) is one of the famous series in number theory. Its coefficients are denoted T (n ) and called the Ramanujan function of n, since it was Ramanujan who proved or conjectured many of their properties: Go co
E
efq t(n) q n d=
n=1
fi (1 — qn)24•
n=1
Among the properties which will be shown later: (1) t(n) is multiplicative (t(nm) = t(n)t(m) if n and m are relatively prime); (2) t(n)-.=- 0-11 (n) (mod 691) (see Problem 4 below); (3) t(n)/n 6 is bounded. Ramanujan conjectured a stronger bound than (3), namely: F rom < n i 1/20_o (n) (where o (n) is the number of divisors of n). The Ramanujan conjecture was finally proved ten years ago by Deligne as a consequence of his proof of the Weil conjectures. For more discussion of t(n) and references for the proofs, see, for example, [Serre 1977] and [Katz 1976a]. -0
PROBLEMS
1. Prove that for k > 4: Ek (z) = 4
E rn,nER (rn,n)=1
On: + nr k .
123
§2. Modular forms for SL 2 (7L) 2. What is E2 (0?
3. (a) Show that El = E8 E4E6 = E10, and E6E8 = E14 (I)) Derive relations expressing a 7 in terms of a3 ; a9 in terms of a 3 and as ; and a 13 in terms of as and u7 . 4. (a) Show that E12 — El = di, with c = (270- 12 26 35 7 2/691. (b) Derive an expression for t(n) in terms of a„ and 475 . (c) Prove that t(n) a 1 1 (n) (mod 691). 5. Prove that E4 and E6 are algebraically independent; in particular, the polynomial in Proposition 10 is unique. 6. (a) Prove the identity
oo
n aX n
cic
E
JO X n
n1 n= 1 (b) Let a 1 (mod 4) be an integer greater than L Show that Ea 4. 1 (i) = 0, and prove the following sequence of summation formulas: co
E1 e
n=
/1/504, no
B +1 = 1/264,
t, —
2(a + 1) a
1/24,
u = 5;
= 9; a = 13, etc.
a
7. Lef(z) be a modular form of weight k for F. Let
1 .f (z) —k E2 (z) f(z). g (z) =12 2n Prove that q(z) is a modular form of weight k if and only iff(z) is a cusp form.
2 for F, and that it is a cusp form
. 8. (a) Prove that E6 = E4 E2 and E8 = E6E2 — (b) Derive relations expressing a5 in terms of al and a3 , and o-7 in terms of a l and C5 .
9. Recall the following definitions and relations from Problems 4 and 7 in §11.4. Let x be a nontrivial even primitive Dirichlet character mod N. Define GO
0 (X, 1 ) =
E An)
2
Re 1>0.
n= 1
Then OCX, = (N 2 0 -1129(X) 0 ( 0-5 1/N 2 t). Now let x be the character mod 12 such that x(+ 1) = 1, x( + 5) = —1, and define (z) =-- 0(x, — I 12) for z H. Then Pi( — 1/z) = Vz/i (a) - Prove that Ti(z + 1) = e 224 - ) and that ri 24 e Si 2 (F). (b) Prove that i)(z) = (c) Write the equality in part (b) as an identity between formal power series in q . (Note. This identity is essentially Euler's "pentagonal number theorem." For a discussion of its combinatoric meaning and two more proofs of the identity, see [Andrews 1976, Corollaries 1.7 and 2.9].) 10. Find the j-invariant of the elliptic curve in Problem 3(b) of §1.2, which comes from the generalized congruent number problem. What is j in the classical case = 1?
124
In. Modular Forms Prove that there is no other value of 2 e15 for which the corresponding j is an integer. It is known that an elliptic curve with nonintegralf cannot have complex multiplication.
§3. Modular forms for congruence subgroups Let y = ( ) e f = SL 2 (1), let f(z) be a function on ii =1/uQu {co} with values in C u {co } , and let k e Z . We introduce the notation . f J[y]k to denote the function whose value at z is (cz_+ d) -1R(az + b)I(cz + d)). We denote the value off f [y] k at z by f(z)l[y] k :
f(Z)I Mk d:7--:f (CZ
+
dr kf(YZ) for
y
= 7 a b) ) e r. c dj
(3.1)
More generally, let GL(Q) denote the subgroup of GL 2 (Q) consisting of matrices with positive determinant. Then we define
f(z)l[y] k d=ef (det y)142 (cz ± d) -if(yz)
for
y = (ac bd e G LI" (0). )
(3.2) For example, if y = (f; °a ) is a scalar matrix, we have f i[y] k .--.--- f unless a < 0 and k is odd, in which case f j[yi k = f. Care should be taken with this notation. For example, by definition f(2z)i[y] k means f l[y] k evaluated at 2z, i.e., (2cz + d)Y(Paz + NI (2cz + d)). This is not the same as g(z)I[y] k for g the function defined by g (z) = f(2z); namely, g (z)l[y] k = (cz + d)- k g (yz) = (cz + d)_"f(2(az + b)I (cz + d )) . With this notation, any modular function of weight k for f satisfies fl[y]k =-- f for all y e I". Some functions are invariant under DI for other y e GL (Q). For example, recall the theta-function defined in §4 of Chapter II: 0(t) = EnE2 e'n2 for Re t > O. We saw that it satisfies the functional equation 0(1) = r 112 0(11t) (where \r- is the branch such that .-If = 1 ) . We define a function of z e H, also called the theta-function, by setting 0(z) = 0(— 2iz), i.e., —
0(z) = E
e2
_E
ne2
Substituting
—
qn2 for
ZEI-19
q = e2niz.
(3.3)
neZ
2iz for t in the functional equation fox : 0, we have 0(z) = (2z/0 -112 0( — 1/4z).
(3.4)
Squaring both sides and using the notation (3.2), we can write
° III= _ ie 2
for
_. = I E GLI(Q). Y (— 4 0 )
(3.5)
125
§3. Modular forms for congruence subgroups
_Notice that
fiEv1v21,=(fiEv1l,)1[Y2]k
for y 19 y2 e GLI(G).
(3.6)
To see this, write (det y) k12 (cz + d ) k in the form (dyzIdz) k12 , and use the chain rule. We are now ready to define modular functions, modular forms, and cuspforms for a congruence subgroup r c F. Let qN denote e 2z1z .
Definition. Let f(z) be a meromorphic function on H, and let I' r be a congruence subgroup of level N, i.e., r' r(N). Let k EL. We call f(z) a modular function of weight k for r if
fi[y] h =1 for all
y e r',
(3.7)
and if, for any y0 eJT = SL2 (1), f(z)l[yo]k has the form Ea,A
with
an = 0 for n << O. ( 1 8)
We call such an f(z) a modular form of weight k for r if it is holomorphic on I1 and if for all yo E r we have an = 0 for all n < 6 in (3.8). We call a modular form a cusp form if in addition at) = 0 in (3.8) for all yo e F. Thus, as in the case r' = r treated in §2, a modular "function" is allowed to have poles of finite order, a "form" must be holomorphic at all points including the cusps, and a "cusp-form" must vanish at all cusps. This interpretation of (3.8) as a condition "at the cusps" will be explained below. The first condition (3.7) is the obvious analog of the first condition (2.1) for modular functions for F.:The second condition (3.8) is called "meromorphicity" at the cusps ("holomorphicity" if an = 0 for n < 0, "vanishing" if a= 0 for n 0). We now explain this further. Let g = f !Doi, for some fixed yo e GLI(0). If f is invariant under r', i.e., if f 1[A I = 1 for ye r', then it follows from (3.6) that g is invariant under the group yo-l ryo : for all yo-l yyo e yo-i r'yo we have 91 ND- YYdk = (f I [Yo]k)1 [Y0-1 YY0]k = f [no]k= (ff[Y]k)iEvolk= fiEYdk=- - g. In particular, if 'yo e IT and r' F(N), then yo'r'yo also contains r(N) (since r(N) is normal in r), and so g = fi[ya is invariant under r(N). Because TN = a e r(N), we have g(z + N) = g(z), and so g = f I [yo]h has a Fourier series expansion in powers of qN = e 2niziN The content of condition (3.8) is that this expansion has only finitely many negative powers of qN (no negative powers for holomorphicity, only positive powers for vanishing). It may happen that g = f f[ydk is invariant under a smaller translation -
-
,
Th, where hIN, i.e., g (z + h) = g(z). In that case the only powers of q N that appear in the Fourier series are powers of qh = qr. For example, if yo = I and r' = ro (N), then Te r' and g(z + 1) g(z). In that case g =f has an expansion in powers of q = q 1 = q. On the other hand, for r = row) and yo = S = (7 1) we have yo-l r'yo = r° (N) ={(: '0 mod N}; thus, if f is a modular function for ro (N), in general we only have g(z + h) = g(z) for h = N, where g = fj[S]h.
M. Modular Forms
126
Any cusp sea u { a)} can be written in the form s = a' oo for some a = die in lowest terms and finding any solution a bd )6 F (by writing s= and b to the equation ad bc = 1). If we set yo = a' in (3.8), the behavior of g =f 1[Œ-']k near a) (i.e., near qN = 0) is a reflection of the behavior cz co -A.Mrlz b)I(—cz + a)). off near s, since g(z)-= —
—
Proposition 16. The condition (3.8) depends only on the F'-equivalence class of s =--- yo cc. More precisely, ify, co = y/ y2 co for some y' E F', then the smallest
power of qN that occurs in the Fourier expansion of fl[yi k and fl[y 2 ]k is the same. Moreover, if this smallest power is the constant term, then the value at qN = 0 is the same for fi[yj k and fl[ya f k is even; if k is- odd, this value may at most change sign.
Y' Y2 cc, then the element yi-1 y/y2 e F keeps co fixed, in which case it must be of the form + Ti. (Note that (ea bd)oo = oo is equivalent to c = 0, and the elements of r with c = 0 are + Ti.) Thus, we have yiœ l y'y2 = Ti. Let g(z) = .fiz)1[T1]k = I an d . Since f — ± Ti, so that y2 = ± (—okf, and fl[y' -1 ], = f (because y' E F'), it follows that fID1 2, = PROOF. If y1 oo =
1
)VI [h ]k ! [P]k = (±1)kg1[71-1. Thus, f(z) [Y2]k
(± o kg(z . + j) =_. (± o k Eane 2virwAid.
In other words, the qN-expansion coefficients corresponding to y2 differ from those corresponding to y1 only by roots of unity. The proposition now 0 follows immediately. If / is a meromorphic function on H which is invariant under [yli, for y' and if se Q k..) {co} with s = yo co, yo e F, then we say that f is meromorphic (is holomorphic, vanishes) at the cusp s if fi [y 0] k has a Fourier expansion with only finitely many negative terms (respectively, with no negative terms, with no negative terms or constant term). Proposition 16 says that meromorphicity, holomorphicity, vanishing at s does not depend on the choice of yo for which s = yo cc, and in fact only depends on the F'-equivalence class of s. Thus, the condition (3.8) is really a set of conditions, one corresponding j to each cusp s of F'. (Recall that "cusp of 1' means "r-equivalence class of cusps") For example, if F' = Fo (p) for p a prime, we saw (Problem 1 8 in §111.1) that there are only two cusps co, O. Thus, the condition (3.8)' amounts to the two conditions f(z)= E an q n q = e2 , with a.= 0 for n«0; (3.9) z -kf( — 1/z) = Eb,74,
qp =
e2p,
with
b„= 0 for n « O. (3.10)
We call the Fourier series in (3.9) the q-expansion of f at co, and we call (3.10) the q,,-expansion of f at the cusp O. If f is holomorphic, we write f(co) for a 0 and f(0) fc:q. b o . Note that f(0) is not the limit off(z) as z O.
§3. Modular forms for congruence subgroups
127
,
.,
We let Mk (F') and Sk(F') denote the set of modular forms of weight' k for F' and the set of cusp-forms of weight k for F', respectively. As in the case F' = F treated in the last section, it is easy to see that these are C-vector spaces, that fe /ilk , (f) and g e Mk2 (F) implies fg e Mki .f.k2(r), and that the vector space of weight zero modular functions for F' is a field. Also note that if —Je F', then there are no nonzero modular functions for F' of odd weight k, since then fI[—I] k = —f it is immediate from the definition that if r" c F', then a modular function/modular form/cusp-form for I' is also a modular function/modular form/cusp-form for r". There are more interesting ways to get modular forms for a congruence subgroup F" from forms for another subgroup F'. For example, if f(z) = E anqn e Mk(F), then f(Nz) = E an q' and .f(z) = E a(n)q' (the "twist" of f by a Dirichlet character x) turn out to be modular forms, although for a smaller congruence subgroup than F. The next proposition gives two important classes of constructions of this type. In part (1)) of the proposition we use the notation Mk(N, x) with x a Dirichlet character mod N to denote the subspace of Mk (Fl (N)) (see (1.5)) consisting off(z) for whichfl[y] k = x(d)f whenever y = (ca ',i/ ) e Fo (N). In particular, for x the trivial character Mk(N, Xtriv) = Mk(Fo(N)). Proposition 17. (a) Let F' be a congruence subgroup of F, let aeGLI (CO, and set F" = a -1 F'a n F. Then F" is a congruence subgroup of F, and the map fi ->f PI takes Mk (F) to Mk (F"), and takes Sk (r) to Sk (F"). In particular, iffe Mk (f) and g(z) = f(Nz), then g e Mk(ro (N)) and one has g(co) = f(oo), g(0) = N - kf(0). (b) Let x and x i be Dirichlet characters modulo M and N, respectively. If f(z) = I';,°=0 anqn E Maiii, x) and fx, (z) cr-A,E,T=o an x i (n)qn, then fxi G mk (mN2, xx). Iff is a cusp form, then so isjx' i . In particular, iff E Mk (F0 (M)) and x i is quadratic (i.e., takes values +1), then fxi e Aik (ro wN2)). PROOF.
(a) We need two lemmas.
Lemma 1. Let a e GL (Q) have integer entries, and let D = det Œ. If r D
r(N), then a'F'oe PROOF OF LEMMA
F(ND). 1. Suppose y e r(ND), i.e., y = 1 + ND/3 for some
2- x 2-matrix fl with integer entries and det y = I. We must show that 'y e cc-1 rya, i.e., that F' 9 art' = a(1 + ND/3)a-1 = 1 + NDafla- 1 . But cc' = Do:' is an integer matrix. Since det aya-1 = det y = I and not' =1 + WOE', we have ŒyŒ 1 e r(N) c F', as claimed. 0
Lemma 2. Suppose that f(z) has the property (3.8)for all yo E F, i.e., f(z)1 [yo], = In°1„. an qg, (where n o = 0 if f(z) is holomorphic at the cusps, no = 1 if f(z) vanishes at the cusps) Then f(z) has the same property for all aeGL1- (0), = E,",',_ a„ 0 bii in Nj(rm. some positive integers a and D which depend on a). _
128
III. Modular Forms
PROOF OF LEMMA 2. Since a can be multiplied by a positive scalar without affecting [a]k , without loss of generality we may suppose that a has integer entries. It is an easy exercise in linear algebra to show that there exists yo e F = SL 2 (7) such that 70-1 OE = t,),), where a and d are positive integers. Then
f(z) I [Cd k =
(f(Z)
f [YO]k)
I [(a0
db
k
2nin((az+ b)1(1)/N = (ad)"'2d—k gi> a ne n= no
= (a/d)'42
E a e 27ribnAIN nan Nd n = no
n
briggrd, = nE = ano
where
10
bn
if ci,11 n;
(ald)ka e 2 ninbladN ania
if a
in.
This proves the lemma.
Li
We now turn to the proof of the proposition. The first assertion in part (c) follows from Lemma 1. Now suppose that fe Mk (r). Then for a y'a e r" (where y' e I') we have (f Mk)! [a' y' = (fi [yl k)i = f [a]k ; • in addition, for yo e F we have (f [Œ]k)1 [YO] k = fi [ayo ] k , and the condition (3.8) holds for f [a] k , by Lemma 2. Thus,f1 e Mk (F"). Iff vanishqs at all of the cusps, then so doesfl[a] k , by Lemma 2. To obtain the last assertion in Proposition 17(a), we write a = ?), g = N - k2fl[a]k , and note that a-l Fa n F = Fo (N). The values at the two cusps co and 0 come from the above formula for bn with n = 0 and a replaced by ( ?) at the cusp co and by (i:4,1 Sa Iv) ) at the cusp O. This completes the proof of Proposition 17(a). (1)) Let = e2 , and let g =EN x i (j)tj be the Gauss sum. Then co (1 Nil
N —1
f 1 (z) =
E
n=0
1=0
N v=0 co
N —1 = Ar IT
anqn
(v) X1 (1V)
1,v= 0
iv
E ane2 irtn(z—vIN) n= 0
N —1 = n
N
E0
(v)fiz — vIN).
Now let y = bd ) e F0 (MN 2). We want to examine fxa (yz). 'Let yv denote k0 1 . Then for each y and y', 0 < y y' < N, we compute ,
yory;; 1 =
— cvIN b + (v'a vd)IN cvv'IN 2) d + cv'IN
•
129
§3. Modular forms for congruence subgroups r
If y' is chosen for each y so that v'a -=-- vd mod IV (such a choice is unique, because a and d are prime to N), then we have yvyy,7, 1 E ro (M and so ),
fy
4 1 (yz)=--A ,.
N-1
E -ii(v)ftyvvy
1 1, z)
n N-1
=
N v=o
y i (v)x(d)(cy,,z + d + cv'IN) kAy,,z) a
=
N-1
X(d)(ez + d) k --- E -ii(of(z N v. 0
- v' IN).
But )71 (v) = -ii (036. (a)h-Ci (d) = X1(c1) 2 i1(V). Thus, a
xi fti (yz) = xxi(d)(cz + d) k ----'IV
N-1
E i-c-- ,(0ftz - v' IN) = xxi(d)(cz + d)kfx, (z),
V=0
and so 4, i has the right transformation formula to be in Mk (MN 2 , XXI)The cusp conditions are verified by the same method as in the proof of part (a) of the proposition. Namely, for all y E F,fx ,(z)1 [y] k is a linear combination a off(z)l [m ]k, and so the cusp conditions follow from Lemma 2. The next proposition generalizes Proposition 9(a) in the last section. Like Proposition 9(a), it is useful in proving equality of two modular forms from information about their zeros. Proposition 18. Mo (r) = C for any congruence subgroup
r c F. That is,
there are no non-constant modular forms of weight zero. Letfe Mo (r), and let a = f(z 0) for some fixed zo e H. Let r = U air be a disjoint union of cosets, and consider g d=ef ri (f 1[000 — a), i.e., PROOF.
g(z) =
ri (f(Œ
-1 z)
— a).
(3. 1 1)
Then g(z) is holomorphic on H, and it satisfies (3.8), because f does. M oreover; given y f - we have gl [y-1 ] 0 = II (f i [(yai) 1 0 — a). But since f 1[(4-10 does not change if a is replaced by another element ay' E ar, and since left multiplication by y permutes the cosets ai r, it follows that { f I[(y air1 ] 0 } is merely a rearrangement of { f i[oci 9 0 } . Thus, g l[y- 1 0 = g, and we conclude that g e Mo (T). By Proposition 9(a), g is a constant. Since the term in (3.11) corresponding to the coset ir isf(z) — a, it follows that for z = z0 the product (3.11) includes a zero factor. Thus, g = O. Then one of th`e factors in (3.11) must be the zero function (since the meromorphic functions on H form a field). That is, f(cci l z) — a = 0 for some j and for all z e IL Replacing z by aj z, we have: f(z) = a for all ze H, as claimed. o As an example of the applications of Proposition 18, we show that for any positive integers N and k (with k even) such that k(N + 1) =--- 24, a
-
130
HI. Modular Forms
cusp-form of weight k for ro(N) must be a multiple of (n(z)ri(Nz)) k . Here n(z) = e 2riz/24 nn (1 _ qn) , q = e2, as in §2.
Proposition 19. Let f(z) be a nonzero element of sk (ro(N)), where N = 2, 3, 5, or 11 and k = 8, 6, 4, or 2, respectively, so that k(N + 1) = 24. Then f(z) is a constant multiple of g(z) aTf (ti(z)ri(Nz)) k . By Proposition 17(a), g(z) N+1 ---= A(z)A(Nz) is an element of S24( r.0 (N)). In addition, g(z) N+1 is nonzero on H, since A(z) 0 0 on H. At infinity, g(z)' = q N+1 n (1 _ q n)24(1 _ en\) 24 has a zero of order N + 1 in its q-expansion. At the cusp zero, we write -24z12A (z)(zIN) 12A(zIN) oz , , N+1 I[S]24 = z -246,(-1/z),A(—Nlz) = z ) = N -12 q r1 ii 0 _ 4)240 _ q 11 1/1)2 4 , PROOF.
which has a zero of order N + 1 in its qN-expansion. On the other hand, since fesk (ro (N)), its q-expansion at oo is divisible by q, and the 'isexpansion of fl[S] k is divisible by qAi. Now (PO', as a ratio of two elements of S24 (f0 (N)), is a modular function of weight zero for Fo (N). Since g(z) 0 0 on H, this ratio is holomorphic on H. Moreover, the qexpansion off ' is divisible at least by q N+1 , and at zero the qN-expansion in g' and the 4 +1 in is divisible at least by q"- . Hence, the g N+1 1[S] 24 are 'canceled, and the ratio is holomorphic at the cusps, i.e., ( .fig ) v +1 e Mo (fo (N)). By Proposition 18, ( flg) N+1 is a constant, and hence fig is also a constant. This completes the proof. 0
ei +1
Notice that we did not actually prove that g(z) ---= (ri(z)ri(Nz)) k is in sk (ro(N)), unless we can be assured that there exists a nonzero element feS k (ro (N)). The same proof, for example, would tell us that any nonzero Je S3 (r0(7)) must be a constant multiple of (n(z)v(7z)) 3 ; but s3(r0(7)) = 0, since 3 is odd and — /e I-10 (7). However, for the values of N and k in Proposition 19 it can be shown that dim sk(ro(N)) = 1 (see Theorem 2.24 and Proposition 1.43 in Chapter 2 of [Shimura 1971]). Thus, Proposition 19 is not vacuous. For N = 2, k = 8 we can see this directly, since we know that T and ST 2S ------ ( 1 _?) generate r0 (2) (see Problem 13(b) of §III.1).
Proposition
20. (q(z)ri(2z)) 8 E S8(1-0(2)).
Clearly, g(z) = (ri(z)q(2z)) 8 is holomorphic on H. Its q- expansion at oo is: ( e 2iriz/24±2ni2424)8 n 0 ...... qn)8 (1 _ q2.)8 = q ri 0 _ qn)8 (1 _ q 2n) s . Using the relation ?A-1/z) =--- V zlig(z), we easily see that g(z) vanishes at the cusp 0 as well. It remains to show invariance under [(1 _?)] 8 . Set a = (2 1). Then (1 _?) = lane, and PROOF.
g (z)l[a] 8 = 24 (20 -8 0/( — 1/2z)q( — 1/z)) 8 = (2z2) -4(V2z/i n (2z) / ti(z)) 8 = (r/(z)t/(2z)) 8 =-- g(z).
131
§3. Modular forms for congruence subgroups
Since g[T]8 = g, and Ha is always trivial, it follows that g is invariant under la Ta, as desired.
Eisenstein series. Let N be a positive integer. Let a = (al , a2) be a pair of integers modulo N. We shall either consider the ai as elements of ZINZ or as 'integers in the range 0 < ai
1
E
f
niev
on,z+
(3.12) M 2 )k
ma- a mod N
If a = (0, 0), we delete the pair in = (0, 0) in the sum (3.12). But there is no point in considering the case when a = 0, since, setting in 1 = Nm, M 2 = Nn, we have
G(z) = N -k
onz+nyk.N - kGk (z), m,neZ
•
which is the Eisenstein series for F which we already studied in §2. In what follows we shall suppose that a 0 (0, 0). Notice that we are allowing k > 3 to be either odd or even. Proposition 21. Gkcj-mod
N
mk(r(N .
)) and Gr a 2 ) mod N e mk (r i (N) ).
PROOF. First, the series (3.12) is absolutely and uniformly convergent for z in any compact subset of H, because k > 3 (see, for example, Problem 3 in §1.5). Hence G(z) is holomorphic in H. Now let y = (ca bd) e F. Then G(z)I [y] k =
E
(cz + dyk
mr, a mocIN
E
k ( az + b mi cz-+ d + m2)
t(p"
mz-Eamod N
W" 1"" + m2 c)z + ()nib + rn2c1))k
Let m' = (m i a + m2 c, mb + m 2 d) (m i m2 )( ca bd) =m1). Note that modulo N we have m' ay. Let a' = ay (reduced modulo N). Then the maps mi--m/ = my, and m' = m'y' give a one-to-one correspondence between pairs m e V with m a mod N and pairs m' e Z 2 with m' a' mod N. This means that the last sum above is equal to Em' Ea' mod NOnfiz + (z). Thus, = Gkay mod N
Gg-m°dN
for y e re
(3.13)
If 'y e F(N), then by definition y I mod N, and so ay a- a mod N. Thus, (3.13) shows that G: is invariant under [y] k for iy e F(N). Similarly, if y (N) and a l = 0, we have (O a2)y (0, a2) mod N, and so Gra2) m0d N is ,
132
HI. Modular Forms
invariant under [y]k for y e f 1 (N). It remains to check the holomorphicity condition at the cusps. But [yoik permutes the G" dN for yo e F, by (3.13). Hence it suffices to show that each G: is finite at infinity. But
E
lim GAz) =
z-4.iao
if a l 0 0;
0
E
mEarnedN,m1=0
n"
if a 1 = O.
n-rta 2 mod N
The sum / n" over n L--s a2 mod N converges because k >_ 3. It is essentially a "partial" zeta-function. More precisely,
± (_ l) CCk.` - a2( ) 9 where Ca(k)i-Q. c Gr,a 2)(co ) _ a,(k) ts1 n.s.---amodN
This completes the proof of the proposition. As a special case of (3.13), if we set y = —! we have
GI-(2 = (— Div. This can also be seen directly from the definition (3.12). Thus, for example, G: = 0 if k is odd and 2a--= 0 mod N. It is now not hard to construct modular forms for any congruence subgroup F', f = I"' D f(N), out of the Eisenstein series G:d rno N . For fixed a, the elements y E F' permute the Eisenstein series G:, where a' ranges over the orbit of a under the action of r, i.e.,
g' Gel ra foiverl. Let r = #(41') be the number of elements in the orbit. If F(Xi , .. . , X,.) is any homogeneous symmetric polynomial in r variables with total degree d, and we set Xi = Gri (where ayi rubs through the orbit ar'), then F(Gr', . . . , Ge ') is easily seen to be a modular form of weight kd for F'. For example, taking F to be X1 + - • - + Xr or X1 . . . Xr , we have E Ge
,„2 , mod
N(z )
€ mk (ro(N)) ;
a2 e (7L /NZ)* Gi(c 0 , a2) mod AT( 2.) e mo(N),,
n
(3.16) ( Fo(N)).
a2 e(ZINZ)*
We now compute the qN-expansion for G:. We shall be especially interested in the cases when a l = 0 or a2 =---- O. Recall the formula used in deriving the q-expansion for Gk in the last section (see the proof of Proposition 6):
1 k = (..._ ok--12c(k).___ k 'E i• e2nijz nE ez (z + n) Bk j = 1
for
k > 2,
z e H.
(3.17) Let
133
§3. Modular forms for congruence subgroups
c
cf
a
=
NkBk
{0 k d7f Ca2(k) (— 1) k r i2 (k)
if a / 0 0; if a / = O.
(3.18) Then
G:(z) =
14,
E + ni 1 :4a 1 mod E N,m i *0 m2 Ea 2 mod N
(
E
1)"
-k
N + n)
m i Ea, mod N,m i >0 neZ
+ N -k
in2) –k
m iz + a2
E
=b+N
iz +
E
(m i z - a2
m 1 E–a 1 modN,m i >0 nEZ
=
00
E
ck(
ik_ie2Rifi(rniz+a2)/N)
m i Ea ' mod N j=1 m i >0 03
E
(—ok
m1=--- – al mod m >0
e2iriAmiz–a2)1N)
E N j=1
In these computations we had to split up the sum into two parts, with m i replaced by m 1 for m i negative, because in applying (3.17) with z replaced by (m / z ± a2)I N we need (rn i z + a 2)IN E II, i.e., m 1 > O. Now let -
qN = e2nizIN
crf e2mmi
(3.19)
Then G(z) =
E
b + ck
00 E i k„ va2q.km 1
m 1 E-a l mod N j=1
(3.20)
( _ ok
co
E
E
m i s--. –a i mod N j=1 m >0
To find the coefficient 91; of q?k, it remains to gather together terms with = n. We shall only do this in the cases when a l = 0 and a2 = O. As a result, we have the following proposition. Proposition 22. Le t ck ,bt k ,,q N be as in (3.18) (3.19). For k > 3 let Gklm°dN (z) -
a mod be the Eisenstein series (3.12). Then the qN- expansion of G N
G:(z) = bt k + E
(3.21)
tfirT07;
n=1
Can be computed from (3.20). If a = (a 1 , 0), then for n > 1 = Ck(
E iln n/jE-..-`. a l mod N
j k, +( _ ok
E jjn
nijE –a l mod N
(3.22)
134
HI. Modular Forms
If a = (0, a2), then for n __ I
bNa n,k = ck E ik—i (Va 2 ± ( ..._1)k—i(2 2) . (3.23)
er, k = 0 if N )(n;
jln
Thus, for a = (0, a 2) co
q = e2niz .
Ge ,a 2)(z) = b(00:ka2 ) ± ck 1 E i k, (Va 2 ± (____ 01,: ja 2) q n 1 n=1i
(3.24)
Proposition 23. If 2a a- (0, 0) mod N and k is odd, then G laT md N = O. Other-
wise, Ge'a2) is nonzero at co, and Ge" 9) has a zero of order min(a i , N — a 1 ), where we are taking al in the range 0
> 0 if a2 < N/2; i ( I if a2 > N/2. n=c, ( a2 + nN) k (N — a 2 + nN)k)t< 0 I
Finally, we look for the first possible value of n in (3.22) for which either sum in (3.22) is nonzero. That value is n = min(a i , N — a l ), where we have the possible value/ = 1 in one of the two sums. Thus, b' °) = +ck for n = 0 min(a / , N — a 1 ). This completes the proof.
- "dN can be As an application, we show that a certain product of G31" expressed in terms of the n-function. We shall use this result in the next section, where we give Hecke's proof of the transformation formula for 0(z). Recall the Weierstrass p-function and its derivatives from Chapter I: So(z; w 1
,
(0
1 1 1 • 2) = —2z + „iI ,„ ez l (z + nu°, + nw 2) 2 (n2C01 + no) 2 ) 2 '
sg (z co 1 , co 2) = —2 E '
(z
;
+ m, u) + nco2)- 3
;
m,neR
(0 1 9
(0 2) =
E
(— l )"(k — 1)!
(z
+ imo, + nw 2) -k ,
k > 3.
m,neZ
If a 0 (0, 0), we can express Gk'llmd N (Z) in terms of gp (k -2) as follows: G: mod N (z)
—k a i z + a., .`+ Ira + n) N m,nEZ
= N _k z
( -- Ok = Nk (k — l)!
(k_ 2 )
al z + a2 ;
(3.25)
Z,
1) . This is the value of p (k-2) for the lattice L, = linz + n1 at a point of order
135
§3. Modular forms for congruence subgroups
N modulo the lattice, namely (a1 z + a2)/N. Thus, the Eisenstein series for r(N) are related to the values of the derivatives of p at division points. Now suppose that k = 3. By (3.25), G(z) can vanish only if 0 1 ((a1 z + a2)IN ; z, 1) = O. But pi vanishes only at half-lattice points (see the end of §1.4). If 2a ET: (0, 0) mod N, i.e., if 2(a 1 z + a2)IN is a point in L z , then G39- is the zero function, by Proposition 23. Otherwise, oVa l z + a2)IN ; z, 1) is nonzero. We have proved Proposition 24. If 2a # 0 mod N, then Grd N (z) V: lr ) for z-e H.
Let p be an odd prime. We now define 12- 1
h(z) =-- n Gl o,a2 ,modp(z).
(3.26)
a 2 =1
Proposition 25. h(z)e M3(p ... 1)(1- 0( p)), its only zero is a (p 2 — 1)/4 —fold zero at 0, and it is a constant multiple of (TIP(z)lri(pz)) 6 . The first part is the special case of (3.16) when N = p, k = 3. h(z) is nonzero on H by Proposition 24, and at infinity by Proposition 23. To find its order of zero at 0, we examine the qp-expansion of PROOF.
hl[S]3 (p _i ) =
P— 1 P-1 [I Gr 2 ,0)m04 p n Gl o 2,modp i[s h= ,a
a 2 =1
a2=1.
by (3.13). According to Proposition 23, the first power of qp which appears in Gr2 ' 0)rmdP is qpinin("2 4'- '2 ). Thus, the order of zero of h(z) at 0 is (1) - 1)12
12 - 1
E min(a 2 , p --: a2) = 2 a 2 =1
E
a2 = (p2 — 1)/4.
a2 =1
Now set 172(z) == (re(z)li(pa. Then î(z) 4= A(z)P I A(pz) is holomorphic and nonzero on H, since A(z) is holomorphic and nonzero on H. Because A(z) e S12 (r) c S12 (1-0 (p)) and A(pz) e Si2 (ro(p)) by Proposition 17(a), it follows that h(z) 4 is a modular function of weight 12(p — 1) for Fo (p). Its q-expansion at infinity is q p no ...... q n) /q p no _ e p) 24 _ n „, _ q n) p,o 24" hence 1i(z) 4 is holomorphic and nonzero at co. At the cusp 0 we have I 2(p-1) • f — , at 11z)P-IA(—plz) kz)4 1[S] 1 2(p-1) = z— = z-12(p-1)Z12PA(z)P1((zIp)12A(zip)) = p'2 A(z)P I A(z Ip) = p 12 q p no .....
q .) 24p/qpn(1
qnp)24 1
which has leading term p 1242-1 . Thus, both h4 and ii 4 are elements of M12(p_ 0(1"0 (p)) with no zero except for a (p2 — 1)-ordér zero at O. Hence their ratio is a constant by Proposition 18. But (///h )4 = const implies that hiii = const. This concludes the proof of Proposition 25. 0
136
III. Modular Forms ,
Notice that, by (3.15), we have some duplication in the definition (3.26) of 12( 2). That is, if we define ( p— 1)12 f(z) = fi , Gr,a2) mod p(z ) ,
(3.27)
a2 =1
we find, by (3.15), that
h(z) = (
0
( p-1 )/2f(z)2.
'
—
(3.28)
Proposition 25 then tells us that the square of the ratio off(z) to 07 P(z)11(pz)) 3 is a constant. Hence, the ratio of those two functions must itself be a constant, and we have proved Proposition 26. The function f(z) defined in (3.27) is a constant multiple of
(r1P(z)M(pz)) 3 . Because each of the GV'a2) in (3.27) is in M3 (1", (p)) by Proposition 21, it follows that fe M3(p _ 0/2 (1"1 (p)). However, unlike 144 f(z) is not, strictly speaking, a modular form for the larger group Fo (p). Proposition 27. Let f(z) be defined by (3.27), and let y = Ca D e Fo (p). Then
f l[y] 3( P — 1)/2 = (1,)f, where (i) is the Legendre symbol (which is +1, depending on whether or not d is a square modulo p). PROOF.
Since (0, a 2 ) (° )2L---_ (0, da2) mod p, it follows by (3.13) that
f I [y] 3,p - 1 „ 2 =
(p-1.)/2
fi Gr ,da 2
)
mod p .
a 2 =1
But by (3.15), the terms in this product are a rearrangement of (3.27), except that a minus sign is introduced every time the least positive residue of da2 modulo p falls in the range (p + 1)/2, (p + 3)/2, . . . , p 1. Let nil be the number of times this occurs. Thus, fl[Y]3(p012 = ( 1)ndf. According to Gauss's lemma, which is an easily proved fact from elementary number theory (see, for example, p. 74 of [Hardy and Wright 1960 ]) , we have (— 1)d = (1). This proves the proposition. 0 —
—
The transformation formula in Proposition 27 is an example of a general relationship between modular forms for r1 (N) and "twisted-modular" forms for Fo (N), called "modular forms with character". We now discuss this relationship. We start with some very general observations. Suppose, that F" is a subgroup of F', and f(z) is a modular form of weight k for the smaller group F" but not necessarily for the bigger one. Then for y e I' we can at least say thatf i[y - l k only depends on the coset of y modulo F". That is, if y" e r", then fl [(yy") - 1 = fi[y- lk . Now suppose that the subgroup F" is normal in F'. To every coset yF" associate the linear map fF--> f l[y- lk which takes an element in Mk(F") to Mk (F") by Proposition 17 (with F" and y -1 in place of F' and a; note that yF"y -1 n F = r", since y e F' and F" is normal in F'). This gives us a group
137
,
§3. Modular forms for congruence subgroups
-
1
homomorphism p from F'/F" to the linear automorphisms of the vector space Mk(F"), because for yi , y2 e F' , P(Yi Y2): fE.-fl{(yiy2) - 9k = (fl[Ynk)i[Yilk = POO (p(y2)f).
In other words, we have what is called a "representation" of the group
r'/r" in the vector space mk(r"). If : F'/F" -- C* is a character of the quotient group, then define Mk(F', x) to be the subspace of Mk(F") consisting of modular forms on which the representation p acts by scalar multiplication by x, i.e.,
mk(r, x) a tfe Afk(r")1p(y)f= lLe ---,
x(y)f for all ye
r}.
If F'/]T" happens to be abelian, then according to a basic fact about representations of finite abelian groups, Mk(F") decomposes into a direct sum of Mk(F', x) over the various characters x of FYI'. We shall soon recall the simple proof of this fact in the special case that will interest us. We apply these observations to the case F" = ri (N), F' = Fo (N). Since ri (N) is the kernel of the surjective homomorphism from FO (N ) to (11 N 1)* that takes (ca db to d, it follows that Fi (N) is a normal subgroup of Fo (N) with abelian quotient group isomorphic to (ZINZ)* (see Problems 1-2 of §III.1). Let x be any Dirichlet character modulo N, i.e., any character of (Z/NZ)*. In this context the subspace Mk(ro(N), x) Mk(ri (N)) is usually abbreviated Mk(N, x). That is, )
Mk (N, x) ,1---3 ffe Mk (Fi (N))I f l[y] k = x(d)f for y = (
a 11 ) e ro c_ d
(N4 (3.29)
In particular, if x = 1 is the trivial character, then Mk (N, 1) =
mk(ro(N)).
Proposition 28. Mk (f i (N)) = 10Mk (N, x), where the sum is over all Dirichlet /characters modulo N. As mentioned before, this proposition is actually a special case of the basic fact from representation theory that any representation of a finite abelian group decomposes into a direct sum of characters. However, we shall give an explicit proof anyway. First, any function f that satisfies f 1[y] k = x(d)f for two distinct characters x must clearly be zero; hence, it suffices to show that anyfe Afk (Fi (N)) can be written as a sum of functions fx E x). Let , PROOF.
Mk(N,
1,
fx = 0(:AT)
d
E
k(d)f I[Ydlo
where yd is any element of Fo (N) with lower-right entry congruent to d mod N. We check that for y = (` S)e fo (N)
fx I Hit =--- 4)(1N) d Zz)* TCfrOf f Fy 1 which, if we replace dd' by d as the variable of summation, is easily seen
138
HI. Modular Forms
to equal x(d)f, i.e.,ft e Mk (N, x). Finally, we sum theft over all characters x modulo N, reverse the order of summation over d and x, and obtain:
Efx =
1
_
Ex(d) E N) x d. ( zINE) * o(
fiETL,
which is equal to f, because the inner sum is 1 if d = 1 and 0 otherwise. o Thus, f can be written as a sum of functions in Mk (N, x), as claimed.
Notice that Mk (N, x) = 0 if x has a different parity from k, i.e., if (— 1) 0 (-1)k . This follows by taking y = —I in the definition (3.29) and recalling that f i[ nk = ( 1)k. f For example, as an immediate corollary of Proposition 28 and the pre7 ceding remark, we have —
—
Proposition 29.
AO , 1), k even; 4 AM r 1( )) = I ./IA(4 , x), k odd, where 1 denotes the trivial character and x the unique nontrivial character , modulo 4. Notice that the relationship in (3.29) is multiplicative in y; that is, if it holds for y l and y2 , then it holds for their product. Thus, as in the case of modular forms without character, to show that f(z) is in Mk (N, x) it suffices to check the transformation rule on a set of elements that generate F o (N). As another example, we look at 0 2 (z) = (Inez qn2)2 , whose n-th qexpansion coefficient is the number of ways n can be written as a sum of two squares. Proposition 30. 0 2 e M1 (F 1 (4)) ---.--- Aft (4,
x), where x(d) = (
-
1)("w 2 .
the transformation rule for —I, T, and ST 4S = (-1 _7), which generate To (4) (see Problem 13 of §1117.1). This is immediate for T, since 02 has period F. Next, the relation fl[ n i = f= x( 1)f holds for anyf, by definition. So it remains to treat the case ST4S. Let PROOF. It suffices to verify
—
—
—
(0 —1 CtIV d=ef V
and
aN
aN-1 = —laN , N
so that
o) '
(a b) -1 c d ccN
(
=
—c/N\ — Nb a ). d
(3.30)
We write ST4S = —a4 Tcc4-1 = ia4 Tec4 , and use the relationship 0 2 1 [cc4j i _j® 2 (see (3.5)) to obtain 021[574511 z..= 021[Œ4T,,4]1 =_._ — 021[Ta4]1 = —i®21 [°44]1 [Œ4]i = (Recall that the scalar matrix il acts trivially on all functions, i.e., [1/4], identity.)
6111•••18 ■
-
139
§3. Modular forms for congruence subgroups
To finish the proof of the proposition, we must show the cusp condition, i.e., that 0 2 1[701 1 is finite at infinity for all yo e T. But the square of 0 2 1[y0] 1 is 04 I [To] 2 , and it will be shown in Problem 11 below that 04"e M2 (ro (4)) ; in particular, this means that CO'f[yo ] 2 , and hence also Col [yo]i , is finite at o infinity. This completes the proof. The spaces Mk (N, x) include many of the most important examples of modular forms, and will be our basic object of study in several of the sections the follow. We also introduce the notation Sk (N, x) to denote the subspace of cusp forms: Sk (N, x) j=f2Mk (N, x) n S k (r i (N)).
The Mel/in transform of a modular form. Suppose that f(z) = E an 4 (where q Iv ......_ e 2niz1N ) is a modular form of weight k for a congruence subgroup F' of level N. Further suppose that Ia n ' = 0(n') for some constant c ER, i.e., that an/ne is bounded as n --> co. It is not hard to see that the qN-expansion coefficients for the Eisenstein series G:m" N have this property with c = k — 1 + e for any r, > O. For example, in the case F' = F, the coefficients are a constant multiple of o-k _ i (n), and it is not hard to show that ak _ 1 (n)I rl k—i+e 0 as n ---÷ cc . We shall later show that, if f is a cusp form, we can take c = lc/2 + E. It has been shown (as a consequence of Deligne's proof of the Weil conjectures) that one can actually do better, and take c = (k — 1)/2 + E. In Chapter II we saw that the Mellin transform of OW = E e -'2 and certain generalizations are useful in investigating some important Dirichlet series, such as the Riemann zeta-function, Dirichlet L-functions, and the Hasse-Weil L-function of the elliptic curves En : y2 = x 3 — n2x. We now look at the Mellin transform for modular forms. Because we use a variable z in the upper half-plane rather than t (e.g., t = — 2iz), we define the Mellin transform by integrating along the positive imaginary axis rather than the positive real axis. The most important case is F' = F1 (N). For now we shall also assume that f(co) z.-- O. Thus, let f(z) = E,7=1 an qn e Mk (Fi (N)). (Recall that since Te F1 (N), we have an expansion in powers of q = e27riz rather than qN .) We set ioo f(z)zs' dz. (3.31) def —0
We now show that if f(z) = E,T, i anq" with lan i = 0(d), then the integral g(s) defined in (3.31) converges for Re s> c 4- 1: \ ico
.
JO
oo
it°
f(z)zs -i dz = E an n =1
J0
2 dz zse ninz ______ z
— E0° an (— 27r1 1n y) n=1
foe
dt tse-r 7
(where t = —27rinz)
0
00
(-270 -sr(s) E an n -s
(see (4.6) of Ch. II)
n=1
(3.32)
140
111. Modular Forms
(where the use of I" in the gamma-function F(s) has no relation to its use in the notation for congruence subgroups; but in practice the use of the same letter F should not cause any confusion). Since iann-si = o(nc-Res\), this last sum is absolutely convergent (and the interchanging of the order of integration and summation was justified). - Inœ=0 42Ane mk (r,(N)) has la c) 0 0, we replace f(z) by f(z) — ac, If f(z) =----in (3.31). In either case, we then obtain g(s) = (-2ni)_sr(s)LAs), where co co ;le Lf (s) d=ef and for Re s > c + 1, if f(z) =
E
E
n=1
n=0
(3.33)
with Ian ' = 0(d).
In addition to their invariance under [y]k for y el",(N), many modular forms also transform nicely under [acia, where ŒN = ( ) as in (3.30). It will be shown in the exercises that, for example, any function in Mk(N, x) for z a real character (i.e., its values are +1) can be written as a sum of two functions satisfying f1[aN]k= Ci-kf
C 7--- 1 or —1,
(3.34)
where one of the functions satisfies (3.34) with C = 1 and the other with C = —1. An example we already know of a function satisfying (3.34) is 02 : the relation (3.5) is a special case of (3.34) with k = 1, C = 1, N = 4. We now show that if (3.34) holds, then we have a functional equation for the corresponding Mellin transform g (s) which relates g (s) to g (lc — s). For simplicity, we shall again suppose that f(z)= I an sq" with ao = 0. We can write (3.34) explicitly as follows, by the definition of [aid', : f(-1INz) = CN -42 (—iNz) kf(z).
(3.35)
In (3.31), we break up the integral into the part from 0 to il\FN and the part from i//i to ix. We choose ifIKT because it is the fixed point in H of ŒN : zF.--+ —1INz. We have
ico g(s) =
fib.rsT
= •••■•••■■•
••••••■••
dz
.fiz)z s--z-
—f
ioo
111Vz)
f(— 1 INz)(— 11Nz) 3d( — 1INz iw-st
ioo (f(z)zs ± f(-1INz)(-11Nz)s)—dz z fiwN ico ) —dz (f(z)zs ± i k CN - wf(z) ( — 1 /Nz)s—k
. f ibliq
Z
,
because of (3.35). In the first place, this integral converges to an entire function of s, because f(z) decreases exponentially as z -4 ioo. That is, because the lower limit of integration has been moved away from zero, we no longer have to worry about the behavior of the integrand near O. (Compare with the proof of Proposition 13 in Chapter II, where we used a similar technique to find a
141
§3. Modular forms for congruence subgroups
rapidly convergent series for the critical value of the Hasse-Weil L-function ; see-the remark following equation (6.7) in §II.6.) Moreover, if we replace s by k s in the last integral and factor out ik CN -42 (- NY , we obtain:
(i'C'N k/2 (- N)sf(z)z's
g(k s) = i k CN -k2 (-
= eCN -k12 (-
f(z)zs)
dz
ioo I (Pc CAT -kl2f(z)(- 11Nz)s -k +f(z )e)
= i k CA T -142 (- N)sg(s), because the last integral is the same as our earlier integral for g(s). This equality can be written in the form
(- i\ N) 5g(s) = C(- iffV)- k- sg(k s). Thus, by (3.32)-(3.33), if we define A(s) for Re s> c + 1 by
(3.36)
A(s) = (- i Kr)g(s) = ( N 12n)sr(s)L f (s),
we have shown that A(s) extends to an entire function of s, and satisfies the functional equation
(3.37)
A(s) = CA(k s).
An,
As an example of this result, we can take f(z) = A(z)e S 1 which satisfies (3.35) with N = 1, k = 12, C = 1. Then A(z) = Z t(n)q", 6.(s) = z,f_ -c(n)n -- , and A(s) = (2n) -sr(s)LA(s) satisfies the relation : A(s) =
A(12 The derivation of (3.37) from (3.34) indicates a close connection between Dirichlet series with a functional equation and modular forms. We came across Dirichlet series with a functional equation in a very different context in Chapter II. Namely, the Hasse-Weil L-function of the elliptic curve : y2 = .x3 - n 2 x satisfies (3.36)-(3.37) with k = 2, N = 32n' for n odd and 16n2 for 'n even, C = for n odd and ( ) for n even (see (5.10)(5.12) in Ch. II). We also saw that the Hasse-Weil L-function of the elliptic curve y2 = x3 + 16 satisfies (3.36)-(3.37) with k = 2, N = 27, C = I (see Problem 8(d) of 11.5). So the question naturally arises: Can one go the other way? Does every Dirichlet series with the right type of functional equation come from some modular form, i.e., is it of the form Lf(s) for some modular form f? In particular, can the Hasse-Weil L-functions we studied in Chapter II be ,obtained by taking the Mellin transform of a suitable modular form of weight 2? That is, if we write L(E„, s) in the form E:=1 bm ars (see (5.3) in Ch. II), is E b niqm the q-expansion of a weight two modular form? Hecke [1936] and Weil [1967] showed that the answer to these questions is basically yes, but with some qualifications. We shall not give the details,
142
W. Modular Forms ,
which are available in [Ogg 1969 ], but shall only outline the situation and state Weil's fundamental theorem on the subject. Suppose that L(s)= I anti' satisfies (3.36)7(3.37) (and a suitable hypothesis about convergence). Using the "inverse Mellin transform", one can reverse the steps that led to (3.36)-(3.37), and find that f(z)= E anqn satisfies (3.34). For now, let us suppose that N = A2 is a perfect square, and that C = ik (k even). Then, iff(z) satisfies (3.35), it follows that ft (z) cre--f f(z I A) = 1 anqr,1 satisfies: f1 (— 11 z) = z kfi (z). Thus, fi is invariant under [S] k and [T] k , and hence is invariant under the group generated by S and V. Hecke denoted that group 0(2). We have encountered the group 0(2) before. In this way one can show, for example, that L(Eino , s) corresponds to a modular form (actually, a cusp form) of weight 2 for 0(8n 0). Unfortunately, however, Hecke's groups 0(A) turn out not to be large enough to work with satisfactorily. In general, they are not congruence r(2) is an exception.) subgroups. (6(2) But one can do much better. Weil showed, roughly speaking, that if one has functional equations analogous to (3.36)-(3.37) for enough "twists" E x(n)ann -s of the Dirichlet series Z anti', then the corresponding qexpansion is in mk(ro(N)). we now give a more precise statement of Weil's theorem. Let xo be-a fixed Dirichlet character modulo N (x o is allowed to be the trivial character). Let x be a variable Dirichlet character of conductor m, where m is either an odd prime not dividing N, or else 4 (we allow m = 4 only if N is odd). By a "large" set of values of m we shall mean that the set ' _iv} jez , contains at least one m in any given arithmetic progression {u+ where u and y are relatively prime. According to Dirichlet's theorem, any such arithmetic progression contains a prime; thus, a "large" set of primes is one which satisfies (this weak form of) Dirichlet's theorem. By a "large" set of characters x we shall mean the set of all nontrivial x modulo m for a "large" set of m. Let C = ±1, and for any x of conductor m set
C, --z--• CX0(m)( — N)g(X)igW,
(3.38)
where g(x) = E.7,_, x(j)e 27'iiiN is the Gauss sum. Given a q-expansion f(z)= Ef=o anqn, q = e'iz, for which lan i = 0(d), we define Lf (s) by (3.33) and A(s) by (3.36), and we further define CO
Lf (x, s) =
E x(n)anirs ;
A(y, s) = (m \ I-A 1 72n)r (s)Lf (x, s).
(3.39)
n=1
Weil's Theorem. Suppose that f(z)=E„œ_ o an qn, q = e2', has the property that Ian ' =.-- 0(nc), ce R. Suppose that for C =1 or —1 the function A(s) defined by (3.36) has the property that A(s)-1- a o (lls --I 1 (k — 4) extends
143
§3 Modular forms for congruence subgroups
to an entire function which is bounded in any vertical strip of the complex plane, and satisfies the functional equation A(s) CA(k s). Further suppose that for a "large" set of characters x of conductor m (in the sense explained above), the function A(z, s) defined by (3.39) extends to an entire function which is bounded in any vertical strip, and satisfies the functional equation A(z, s) = Cx A(, k s), with C, defined in (3.38). Then feM k (N, x 0), and f satisfies (3.34). lf, in addition, L1 (s) converges absolutely for Re s> k s for some e> 0, then fis a cusp form. -
One can show that the Hasse—Weil L-functions of the elliptic curves in Chapter II satisfy the hypotheses of Weil's theorem (with zo = 1). The same techniques as in the proof OC the theorem in §I1.5 can be used io show this. However, one must consider the Hecke L-series obtained in (5.6) of Ch. II by replacing 2,7(I) by the character jc",(.0z(Ni1) with z any Dirichlet character modulo ni as in Weil's theKm. For example, if we do this for L(E, , s), where E1 is the elliptic curve, 1=- x 3 x, we can conclude by Weirs theorem that
fE i (z) = q
2q5 3 (i) 6q' 3 + 2q' 7
E
bni e
(3.40)
rn>25
(see (5.4) of Cu. 10 is a cusp form of . weight two for 1'0 (32). If we form the q-expansion corresponding to the L-series of E : y 2 = n 2x, namely, fEn (z) = z n(m)k nqin, it turns out that fEn e M2 (F0(32n 2)) for n odd and fEn eM2 (F0 (16n 2)) for n even. Note that when n 1 mod 4, so that zn is a character of conductor n, this is an immediate consequence of the fact that fE , M2 (f0 (32)), by Proposition 17(b). More generally, it can be shown that the Hasse—Weil L- function for any elliptic curve with complex rrfultiplication satisfies the hypotheses of Weirs theorem with k = 2, and so corresponds to a weight two modular form (actually, a cusp form) tor FAN). (N is the so-called "conductor" of the elliptic curve.) -Many elliptic curves without complex multiplication are also known to have this property. In fact, it idtS conjectured (by Taniyama and Weil) that every elliptic curve defined over Vivo rational numbers has L-function which satisfies Weil's theorem for some N. Geometrically, the cusp forms of weight two can be regarded as holornorphic differential forms on the Riemann surface F0(N)V-1 (i.e., the fundamental domain with FAN} equivalent boundary sides identified aï: the cusps included). The TaniyamaWeil conjecture then can be shown to take the form: every elliptic curve over G can be obtained as a quotient of the Jacobian of some such Riemann surface. For more information about the correspoK dence between modular forms and Dirichlet series, see [Hecke 1981] ,teil 1967], [Ogg 1969 ], and
[Shimura 197f].
144
III. Modular Forms
PROBLEMS
L Let a e GL1. (0), and let g(z) ---= f(az). Let y = (cz bd )e F. Notice ihat f(az)l[y] h was defined to be (caz + d) kf(yaz), which is not the same as g(z)l[y]k = (cz ± d)' f(ayz). Show that if a = (g ?), i.e., if az =-- nz, then g(z)1[T]k -----.ficaaccrx - lk = f(nz)IEGin "Mk. be a congruence subgroup of r of level N, and denote r; = {y E FITS = . 5 } for sEQ v {co}. Let s= Œ 1 CC, 0: E F. (a) Prove that ars'OE -1 z-_- (Œria -1 )co . (b) Show that there exists a unique positive integer h (called the "ramification index" of r at s) such that
2. Let
r'
(1) in the case —.ter
• r; = + aoe t
trinEz a;
(ii) in the case —1.0r either r; = 0,-1 {T hrilf ne za; Or
r; =
nela•
Show that h is a divisor of N. (c) Show that the integer h and the type (I, Ha, or III)) of s does not depend on the choice of a E r with s = a' co ; and they only depend on the F'-equivalence class of s. (d) Show that if a -1 oc is of type I or Ha andfe /cr.), thenfl[a -l ] k has a Fourier expansion in powers of qh . A cusp of r is called "regular" if it is of type I or Ha; it is called "irregular" if it is of type lib. (e) Show that if a-1 co is an irregular cusp, and femk (r), then fl[a- i]k has a Fourier expansion in powers of q2,, in which only odd powers appear if k is odd and only even powers appear if k is even. If k is odd, note that this means that to show that fE Mk (r) is a cusp form one need only check the q-expansions at . the regular cusps.
3. Let h be any positive integer, and suppose 2hIN, N 4. Let F' be the following level N congruence subgroup: F' = f(a, ,1!1 ) --a-. (1 -4)i mod N for some j}. Show that co is a cusp of type Jib. 4. (a) Show that F1 (N) has the same cusps as ro(N) for N = 3, 4. (b) Note that —.10 ri (N) for N> 2. Which of the cusps of r'1 (3) and 1T1 (4), if any, are irregular?
5. Find the ramification indices of F' at all of its cusps when: (a) F' = Fo(p) (p a prime); (b) F' = Fo(p2); (c) F' = F(2). 6. Prove that if r a F is a normal subgroup, then all cusps have the same ramification index, namely
[Toe : ±r].
7. (a) Show that any weight zero modular function for F' c r satisfies a polynomial of degree [F: r ] over the field C( j) of weight zero modular functions for T.
145
§3. Modular forms for congruence subgroups
(1)) Show that, if r is a normal subgroup, and if f(z) is F'-invariant, then so is - f(Œz) for any °ter. Then show that the field of weight zero modular functions for r' is a galois field extension of C( j) whose galois group is a quotient of In practice (e.g., if r is a congruence subgroup), it can be shown that the galois group is equal to f/F'.
8. Prove the following identities, which will be useful in the problems that follow and in the next section, by manipulation of power series and products in q = e2 1ni :
(a) 0(z) + e(z + = 20(4z); E2 (z) = 48Z odd n> 0 61 (n)q n ; (b) E2 (z + 2k p4 n crk-i(n)q n (c) Ek(Z) — ( 1 + Pk-1 )Ek(PZ) ± Pk-1 Ek(P 2 Z) = Bk (k 2, p prime); (d) E2 (z) 3E2(2z) + 2E2 (4z) =-1-(E2 (z) E2(z +1)); (e) q(z + -21) = e21ti/4 8 7/ 3(2z)/ri (z)ri(4z). 9. Prove that if k is even and f(z) has period one and satisfies f(— 114z) = (— 4z 2)42 then fl[y]k = f for all ye To (4). f(z),
10. (a) Prove that 18 (4z)/I 4 (2z)e M2 (FM)), and find its value at each cusp. 67 + 1). (13) For a E Z prove that E2(ST'Sz) = (az + 1)2 E2 (z) — 4(az -214-(E2 (z) — 3E2 (2z) + 2E2(4z)) = Zoddn> 0 ai (n)qn by Problem (c) Let F(z) 8(c). Prove that F(z)E m2(1T0(4)), and find its value at each cusp. (d) Prove that F(z) = ri8 (4z)M 4(2z). Then derive the identify q
(1 n;=.- 1
q4n)4 (
1+
q 2 r74.
E odd
0.
1
(n) q .
n> 0
(e) Give a different proof that —24F(z) = RE2 (z) — E2(z + 1/2)) is in M2 (1-0(4)) by proving that, more generally, E2 (z) — 141,7E7:01 E2 (z + j/N) is in M2 (r0 (N 2)).
11. (a) Prove that 0(z)4 em2 (r0 (4)), and find its value at each cusp. (13) Show that 0(z)4 and F(z) (see preceding problem) are linearly independent. (c) Prove that te 0 (2z)/q 8 (z)r/8 (4z)em 2 (r0 on, and find its value at each cusp. (d) Prove that 0(z) = (e) Prove that 40(z) = 2n1124172 (z +. D/r/(2z). 12. Let N = 7 or 23, and let k = 24/(N + 1). Let x be the Legendre symbol x(n) Prove that any nonzero element of Sk(N, x) must be a constant multiple of (ri(z)ri(Nz)) k 13. Using Propositions 25-27, prove that (r/(z)r/(3z)) 6 eS6 (fo (3)) and (r/(z)n(7z)) 3 E S3 (7, x) where x(n) = 14. Let 4 (z) = En , z e izn2 = 0(z/2). Let x be the unique nontrivial character of 0(2)/ F(2) (which has 2 elements). Show that 04 e M2 (6(2), x). 15. Let fe Mk (N, x), and set ŒN =( 1). (a) Prove that f i[ŒN]k E Mk(N1 3(), and that the mapfl-4.f I kNI is an isomorphism of vector spaces from Mk (N, x) to Aik (N, 7). Prove that the square of this map (i.e., where one uses it to go from M/Or, x) to Mk(N, je) and then again to go , from ./4k (N, k) back to Mk(N7 X)) is the map (-1)k on Mk(Ar7 x)(b) 1f x = X, i.e., if x takes only the values +1, then prove that Mk(N, = Mk+ (N, X) 0 MC (N, X), where Mk ± (N, X) crf ffe,Mk(N, X)Ifi[ŒN]k = In other words, any modular form in Mk(Ne, x) can be written as a sum of one
146
-
Ill. Modular
FOrms
which is fixed under ik [aN]k and one which is taken to its negative by i rL.aN_,k• (c) Let N = 4. In Problem 17(d) below, we shall see that 04 and Fspan M2 (r0 (4)) = M2(4, 1) (where F = En odd ci(n)q n as in Problem 10, and 1 denotes the trivial character in M2 (4, 1)). Assuming this, find the matrix of [a4 ] 2 in the basis 04, F; show that M2+ (4, 1) and M2- (4, 1) are each one-dimensional; and find a basis for M2 (r0 (4)) consisting of eigenforms for r04.1 2 If you normalize these eigenvectors by requiring the coefficient of q in their q-expansions to be 1, then they are uniquely determined. 8 2 : + Enœ, i o-k . . i (n)qn. Express L1(s) (see = — 16. (a) For k 2 even, let f(z) the definition in (3.33)) in terms of the Riemann zeta-function. (1)) Write an Euler product for Lf(s). (c) Let fx(z)= E;f)= 1 (4_ 1 (n)x(n)q" for a Dirichlet character x. Write an Euler product for Li(s).
17. Let F(2) be the fundamental domain for F(2) constructed in §1 (see Fig. 111.3). Then
F' = ŒF(2), where a = (2) 1), is a fundamental domain for F0 (4) = al- (2)a-1 (see Problem 10 in §III.1). The boundary of F' consists of: two vertical lines extending from ( 3 + i0)/4 and from (1 + LA/4 to infinity ; two arcs of circles of radius 1, one centered at and one centered at 0; the arc of the circle of radius -4- and center -4- which extends from 0 to (9 + i.A/28; and the arc of the circle of radius and center A which extends from (9 + LA/28 to 1. Consider F0 (4)-equivalent points on the boundary of F' to be identified. (a) Find all elliptic points in F' (i.e., points which are F-equivalent to i or co = (-1 + 0)14 Which are on the boundary and which are in the interior of F' ? (b) Let f(z) be a nonzero modular function of weight k (keZ even) for 1-0 (4). Let v(f) denote the order of zero or pole of f(z) at the point P. At a cusp P = Œ 1 co, we define v(f) to be the first power of qh with nonzero coefficient in the Fourier expansion of fl[a - lk (where h is the ramification index; see' , Problem 2 above). Prove that: Ep€F, v(f) = k/2, where the summation is over all points in the fundamental domain F', including the three cusps, but taking only one point in a set of F0 (4)-equivalent boundary points (e.g., { + iy, 14- iy} or { + + + (c) Describe the zeros of 0(z)4 and F(z) (see Problems 10-11 above). (d) Prove that 04 and F span Af2 (r0 (4)). (e) Prove that Mk(F0 (4)) = 0 if k < 0, and it contains only the constants if k = 0. (f) Prove that for k = 2,k o a nonnegative even integer, any fEMk (F0 (4)) can be written as a homogeneous polynomial of degree k0 in F and O. (g) Prove that S6 (F0 (4)) is one-dimensional and is spanned by 0 8 F 1604F 2 . (h) Prove that ,j 12(2z) S6 (F0(2)), but that 2 (2z) E S6 (F0(4)). Then conclude that q 12 (22.) = 08 F— 1604F 2 . (i) Prove that for k = 2k0 6, any fe Sk (F0(4)) can be written as a homogeneous polynomial of degree k 0 in F and 04 that is divisible by 04F(04 — 16F). 18. (a) IffE Mk , (N, x i ) and g E Mk2 (N, X2), show that fg e Mk, ÷k2 (N, Xi X2)• (b) Let x be the unique nontrivial character of (/4)*. Show that any element of M1 (4, x) is a constant multiple of 0 2 . (c) With x as in part (b), find a formula for dim Sk (FI (4)) = dim Sk(4, (d) Let f(z) = (ii(z)q(2z)) 8 , and let g(z) = f(2z). Show that S8 (1"1 (4)) is spanned by f and g.
§4. Transformation formula for the theta-function
,
j 47
19. Let I' c r be a congruence subgroup, with r _-= Uai r, so that F' = Ucc.T' F is a fundamental domain for rt. Let j; ---- f I [ocr ' ],, forfe Mk (r). Suppose thatfe Sk (r), the ramification index of r' at the cusp so that fi (z) = Z__, anA, where h si = aj i oo. In particular,f(z) = I nc°,_ i anqii: at the cusp co. (a) Show that there exists a constant C independent off and x such that Ifi(x +
(b) (c) (d) (e) (f)
i y)I Lç CC-2 "mi for y> e.
Let g(z) = (Im z) k12 1f(z)f. Show that g(yz) = Om z)"12 1f(z)l[y] k i for y E F. Show that g(z) ref Om zr 14(2)1 is bounded on F. Show that g(z) is bounded on F'. Show that g(z) is bounded on H. Show that for any fixed y: 1fh im-27rirox+iyvh dx. an = - fi x ±
h o
(g) Show that there exists a constant C1 such that for all y:
Ian ! < ciy-ki2 e21oty/h . (h) Choosing y = 1/n in part (g), show that for C2 = C1 e2irm : lan i < C2 n k12 . (i) Show that la1n -42 is similarly bounded for each j.
§4. Transformation formula for the theta-function We first define some notation. Let d be an odd integer, and let c be any integer. The quadratic residue symbol () is defined in the usual way when d is a (positive) prime number, i.e., it equals 0 if dic, 1 if c is a nonzero quadratic residue modulo d, and — 1 otherwise. We extend this definition to arbitrary odd d as follows. First, if g.c.d.(c, d) > 1, then always (i) = O. Next, if d is positive, we write d as a product of primes d=rlipi (not necessarily distinct), and define (-) = II i (j-i). If d = 4 1 and c = 0, we adopt the i convention that (K) = 1. Finally, if d is negative, then we define (f-i ) = ( t) if c > 0 and () = —(1`6) if c < O. It is easy to check that this quadratic residue symbol is bimultiplicative, i.e., it is multiplicative in c if d is held fixed and multiplicative in d if c is held fixed. It is also periodic with period d.when d is positive : (cP) = (-) if d> O. However, one\ must be careful, because periodicity fails when d is negative and c + d and c have different signs: (q4-) = —() if e> O> c + d. This is because of our convention that ( -s) = — (t'il) when both c and d are negative. On the other hand, this convention ensures that the usual formula (2-) = (— 1) (" )/2 holds whether d is positive or negative. Next, we adopt the convention that „ii: for zE C always denotes the branch whose argument is in the interval ( — n/2, n/2]. We next define Ed for d odd by: -
ser = -Ni(l), i.e.,
148
HI. Modular Forms
11
if d d
1 mod 4; 3 mod 4. •
(4.1)
With these definitions we finally define the "automorphy factor" j(y, z), which depends on y = db) e F0 (4) and z e H:
AY,
crlf
c; 1 NiCZ + d
for
y
(
a b c d)er°(4),
z e H.
(4.2)
Inez q n2 = Inez e Irtizn 2 . Recall our definition of the theta-function 0(z) = The purpose of this section is to prove the following theorem, following Hecke [1944 ].
Theorem. For y eF0(4) and z e H 0 (yz) =j(y, z)0 (z),
(4.3)
where Ay, z) is defined by (4.2). Notice that the square of Ay, z) is ( 1-)(cz d), and so the square of the equality is precisely what we proved in Proposition 30. Thus, for fixed y e F0 (4) the ratio of the two sides of (4.3) is a holomorphic function of z e H whose square is identically 1. Thus, the ratio itself is + 1. The content 1, i.e., that j(y, z) has the right sign. of the theorem is that this ratio is Simple as that sounds, the theorem is by no means trivial to prove. At first, it might seem sensible to proceed as in the proof of Proposition 30, proving that (4.3) holds for generators of F0 (4). However, then we would have to show that the expression j(y, z) in (4.2) has a certain multiplicative property which ensures that, if (4.3) holds for y1 and y2 , then it must hold for Vi y2 . But that is a mess to try to show directly. We shall, in fact, conclude such a property for j(y, z) as a consequence of the theorem (see Problem 3 at the end of this section). In proving the theorem, it turns out to be easier to work with the function 4 (z) def = 0(42) = Ine z ein2z , which we encountered in Problem 14 of §111.3. This function satisfies : 4(T 2z) = 4)(z) (obvious from the definition) and izO(z) (immediate from (3.4)). Hence, 4)(yz) has a transforma0(Sz) = tion rule for any y in the group 0(2) generated by + T 2 , S. It is because 0(2) is such a large group—having only index 3 in F—that it is sometimes easier to work with 4). The corresponding group under which 0(z) = a= ?), has a transformation rule is a- '0(2)a (see Problem 1 of §III.3). But cc -1 4:5(2)crt is not contained in I" = SL2 (7L); its intersection with f is the subgroup F0 (4) of index six in F. Thus, we can work with a "larger" subgroup of F (i.e., its index is smaller) if we work with 4)(z) rather than 0(z). The next lemma gives an equivalent form of the theorem in terms of 4)(z).
Lemma 1. The theorem follows if we prove the following transformation formula for 4)(z):
149
§4. Transformation formula for the theta-function
ck(yz) = i" -c)12- 6 \I — i(cz for y r such that y =(
)
+ d)(1)(z)
(4.4)
(mod 2) with d 0 O.
Suppose that we havé the transformation rule for O(z). We must show that 0 satisfies (4.3) for all y = (ac ' ) e F0 (4). We first note that if c = 0, then (4.3) holds trivially, since in that case 0(yz) = 0(z), while = 1 (since d = +1). So in what follows we suppose that j(y, z) = c O. )E F0 (4) with c Owe write For any y = PROOF.
,
0 (7z) =
n az
a) = (y`( — 1 /2z)), 1/2z) — c/2
+.1 _ /4, (2b(— 112z) —
where y' = (2,7 .42) and y' m- (_? (13) (mod 2), because 41c. We apply (4.4) = (1)(), with y' in place of y and — 1/2z in place of z. Using the fact that we obtain
en
0
(yz) = i( -d)/2 d2) (
— i(d( —
112z)
c12)4)( - 112z).
2i0(z) by (3.4). The product Next, we have 4)( — 1/2z) = 0( — 1/4z) = of the two square root terms is + Vcz + d (note that, because of our convention on the branch of the square root, we have .fx,./j; =± xy; for example, — 1 = — Nib. But since the three functions — i(d( — 112z) — c12), — 2iz, and Vcz + d are all holomorphic on H, the + must be the same for all z; so it suffices to check for any one value of z, say z = i. But in that case „I— 2iz = „/2, and .\/.7-c.f; = +\41)cy always holds when y is positive real. Thus, the product of the two square root terms is Vcz + d, and we have C) (yz) = i( 1 -d)/2
2
_c d d
/c.z +
N
d0(z).
1 --d)/2(-- 2) -= To complete the proof of Lemma 1, it remains to check that i( k d which we easily do by considering the cases d 1, 3, 5, 7 (mod 8). —
t'd
The remainder of this section is devoted to proving (4.4). For a fixed odd prime p, let us denote
= n '(z)/(pz),
(4.5)
where n (z) is the Dedekind eta-function, as in §2 and §3. According to i.e., tfr3 is a modular Propositions 26-27, we have 1/.3 Em3,p_ iofp, form for ro (p) with character x(d) = (9,-). This is the basic tool which will be used to prove (4.4). But it will take several lemmas to relate tir 3 and 0.
Lemma 2. (P(pz)/(z) = 131/(z)/0 2 (z + 2 'I) •
(4.6)
150
III. Modular Forms
By Problem 11(e) in the preceding section, with z replaced by we find that the left side of (4.6) is equal to
PROOF.
by
e -2/ri/24 1,1 2 (Pz2+1 )/n ,
nz
,hfz)
,
V)IrKz)y = e(CP -
(e - 27ri/24 1,1 2(z -
11/24)2irt
-42
and
(p.z2f-1) .:_
02(z12-1) n 2 09 .12-1 ) -
But since q(z + 1) = e224q(z), and p((z + 1)/2) = (pz + 1)/2 + 91, it follows that the last term on the right is e-( P -1)2ni124. This proves (4.6).
Lama 3. Le t - p be an odd prirne, let y = 0 3(2 1 1). Then fl[y] 3(p - 1)12 =
cl D E 0(2) n fo(p), and let f(z)=
().!
Let a =( ), so that Iii((z -I- 1)/2) = tfr(az). Then 0((yz + 1)/2) = kayz) = ig(aya-i )ocz). Now for y as in the lemma, we have PROOF.
ŒyŒ 1 =
+ c + d — a — 012) 2c d c
(Note that b+d a c is divisible by 2 because y e 6(2)) Hence, by Propositions 26-27, we have —
—
yz + 1) = (d_ c)
3 (
2
(2caz + d
—
c) 3( P-1)J2tp 3(otz) z+1 2 )'
.( 4)(cz + d)3(P-1)1203(
because d
—
c
d (mod p ) . This is the relation asserted in the lemma.
Lemma 4. Let p be an odd prime, let y = 4(pz)I0P(z). Then gl[y](l _p)/2
bd)
e 0(2) nTo(P), and let g(z)=
=
We first claim that g8 transforms trivially under y. Let a = (g ?), and let y' = yo -1 . Then both y and y' are in 6(2), and we can use Problem 14 of the preceding section to compute PROOF.
g8(z)i [y1441 p) =
(CZ
± d)4 P-1 )0 8(y'ocz)14) 8P(yz)
_ (Œz ± d)-40 8('Y'az) (cz + d)-41'08 P(yz) _ 0 8(Œz) —
8
8P (Z) = g (z),
as claimed. Meanwhile, the ninth power of g transforms under y by (11,), as we see by raising both sides of (4.6) to the 9th power and using Propositions 26-27 and Lemma 3 (here f(z) is as in Lemma 3):
99 1 [7] 9(1 -p)/2 = (0 91f6 )1[7]9(1-p)/2 = (IA 31 [11(13-1)/2)3 — 4-415)1fr 3)3 — (d)a9 • i) ( V) 6 ( J. I [7] 3( p-1)/2) 6
151
§4 . \Transformation formula for the theta-function
faking the quotient of these two relationships gives
gibli-p)/2- := g9 ibil 9( 1 - p)/2/g 8 I [T] 4( 1 -p)
=
0
p)g .
(
r
The next lemma generalizes Lemma 4 by replacing the prime p by an arbitrary positive odd number n. Lemma 5. Let n be
a positive odd integer, let y ,:--- (a, bd)e 0(2) n ro (n), and let
g(z) = 4)(nz)10n(z). Then gibli-no= ()g PROOF. We write n =pi - • . p, as a product of primes (not necessarily distinct), and we use induction on the number r of prime factors. Lemma 4 is the case r = 1. Now suppose we know Lemma 5 for n; we shall prove the corresponding equality for a product n'rz---- np of r + I primes. We write
0(n'z) _ cknaz) N(pz)Y çbn'(z) — O n(OLZ)0 P(Z) ) 1
(4.7)
where a = (6 ?). For y = (a, be') e Fo (n) n 0(2) we have y' = aya-1 = fan) n 0(2), and so, by the induction assumption,
4)(nocyz)10"(Œyz) = 4)(ny'az)10"(y'az) = (1:7-1 ) (az + d)
(
c 7p bdi E
0(nocz)b4?(ocz)
= (-nd)(cz + d)" -n)I2 0 onezve(ŒZ). In addition, by Lemma 4, we have
d 4)(pyz)10P(yz) =(- )(cz -I- d)(1-P)12 4)(pz)10P(z). P Combining these two relations, we see that replacing z by yz in (4.7) has the effect of multiplying by
. ,Gd)
(CZ
+ d)" -n)12 (()(CZ + d) (1-P)12) n = (1)( 4)(CZ ± n p P
d , (n ')
d) (1- np)I2
(cz + d)"-n1)12.
This completes the induction step, and the proof of the lemma.
0
We are now ready to prove (4.4). We first note that.both sides of (4.4) remain unchanged if y is replaced by —y (see Problem 2 below). Hence, without loss of generality we may suppose that y = (:. bd ) F... (_? D mod 2 with c > O. We now apply Lemma 5 with n replaced by the positive odd integer c,
obtainir
152
III. Modular Forms
(1)0(cC(7YzZ) =
()(CZ
± d ) 1 -c)/2 O(CZ) CV(Z)
(4.8) •
The ratio that ultimately interests us is 4)(yz)10(z). Solving for this in (4.8) gives
(0
(cczz)) . Yzz) )c = ( d c ) ( c z 4_ d) c- 1 0 04)(Y
: (
(4.9)
)
On the other hand, we have (using: ad
1 = bc)
—
cyz . c az + b ,... a cz+d
I cz±d i
Since a is even and 4) has period 2, this means that
Ccyz) = 0 ( cz + 1 d) = V —i(cz + d)(k(cz + d) (4.10) = \I —i(cz + d)0(cz). Combining (4.9) and (4.10) gives 4)(Yz)
0(z)
)c = (--1)(cz + d) — 1 ) /2.\/ — i(cz + d). c
(4.11)
(
Meanwhile, we saw in Problem 14 of the preceding section that 0 8 is invariant under Ed4 for y E (5(2), i.e.,
8k = 0)(v/
(CZ
+
VI'
for any k e Z.
, (4.12)
We now raise both sides of (4.11) to the c-th power and divide by (4.12), where k is chosen so that c2 = 8k + 1. (Since c is odd, of course c2 :1-=._ 1 mod 8.) The result is: (KYz) = (-1-)(cz + d)cc -112-41c(— i(cz + d))(` -l v2V — i(cz + d).
0(z)
c
But c(c — 1)/2 — 4k + (c
—
1)/2 = (c2
—
1)/2 — 4k = O. Hence,
0(yz) = (y_),_ i.,,c_ i ,i2 , . y It v — i(cz c‘ 0(z)
+ d),
which is the transformation formula (4.4) that we wanted to prove. This concludes the proof of the main theorem as well. 0
The transformation formula for the theta-function is similar to the transformation formula for a modular form of weight k if we take k =1, i.e., except for a power of i the "automorphy factor" is (cz + d) 1/2 . In the next chapter we shall see that there is a general theory of modular forms whose weight is a half-integer, and the transformation formula for 0(z) plays a fundamental role in describing such functions.
153 -
§5., The modular interpretation, and Hecke operators
PROBLEMS
,
1. Prove that the generalized quadratic residue symbol (i) as defined in this section satisfies the following form of quadratic reciprocity: if c and d are both odd integers, then
( _ occ-tud-io()
if c or d is positive; (f) = _(...... o(c-i)(d- 1W4() if c and d are negative.
2. Show directly that both sides of (4.4) remain unchanged if y is replaced by — y. 3. (a) Show directly (using (3.4)) that the theorem (4.3) holds for the generators T, and ST'S of 1-0 (4). (b) Show that the theorem would follow from part (a) if one could show that
Ace13, z) = Act, &M g, z) for all a, /3e r0(4).
—
I,
(4.13)
(c) Conversely, show that the theorem proved in this section implies the relation
(4.13).
§5. The modular interpretation, and Hecke operators A basic feature of modular forms is their interpretation as functions on lattices. More precisely, we consider the most important cases of a congruence subgroup F': F' = F, F(N), F 0 (N) or F(N). (Of course, F = fl (1) = F0 (1) = F(1), so everything we say about the cases F 1 (N), F0(N) or F(N) will apply to F if we set N = 1.) By a "modular point" for F' we mean : (j ) for F' = f: a lattice L c C; (ii) for F' = F1 (N): a pair (L, t), where L is a lattice in C, and t e CIL, is a point of exact order N; (iii) for F' = F0(N): a pair (L,- S), where L is a lattice in C, and S c C1L is a cyclic subgroup of order N, i.e., S = Zt for some point te CIL, of exact order N. _ (iv) for f' = F(N): a pair (L, { t 1 , t2 }), where 1 1 , 1 2 e CIL have the property that every tekLIL is of the form t = mi l ± nt 2 , i.e., t i , t 2 form a basis for the Points of order N (in particular, t i and t must each have exact, , order N). Given a lattice L, in general there will be several modular poir.. s of the form (L, t), (L, S), or (L, {l i , 1 2 )). However, when N = 1, there is only one modular point corresponding to each L, and we identify it with the modular point L for F. Let k e Z. In each case (i)-(iv), we consider complex-valued functions F on the set of modular points which are of "weight k" in the following sense. If we scale a modular point by a nonzero complex number A, then the value of F changes by a factor of A - k . That is, for 2 e C* we consider AL =
154
III.
P.111 e LI, /it e C , S = Pik ES} c weight k if for all E C *
Modular Forms
AL. Then F is defined to be of
case (i) F(2L) = A'F(L) for all modular points L; = ATY(L, t) for all modular points (L, 1); case (ii) case (hi) F(2L, /IS) = /1-1`F(L, S) for all modular points (L, S); ) t2 }) = .1-kF(L, t 1 , t2 }) for all modular points case (iv) F(AL, (L, { t 1 , t2}). An example of such a function of weight k is Gk (L) =
E
(k > 2 even).
(5.1)
0*IeL
Notice that any function F in case (0, such as Gk, automatically gives a function for the other groups; for example, by setting F(L, t) = F(L). Given a function F of weight k, we define two corresponding functions P and f as follows. fl(co) is a complex-valued function on column vectors = ( T) such that co 1 ko 2 E H; f(z) is a function on the upper, half-plane H. Let 2L. be the lattice spanned by co l and co2 , and let L, be the lattice spanned by -z and 1. Given F as above, we define case (i) P(c )) = F(L); case (ii) P(w) = F(L., (o 2IN); case (iii) P(co) = F(L., Zco 21 N); case (iv) P(co) = F(L., {w 1 /N, w 2/N}). In all cases we define f(z) = P(f). Thus, for example, the function f(z) that corresponds to Gk (L) (see (5.1)) is the Eisenstein series we denoted Gk (z) in §2 (see (2.5)). For y = (ca e F = SL 2 (Z), we define the action of y on functions of co Rya)), where yoi is the usual multiplication of a column by the rule 'AO vector by a matrix.
r, r,
(N), FAN) or f(N). The above Proposition 31. Let k E Z, and let r = association of F with P and f gives a one-to-one correspondence between the following sets of complex-valued functions: (1) F on modular points which
have weight k; (2) P on column vectors co which are invariant under y for y E r and satisfy 1-7(.1co)= .1.-k .P(w); (3) f on H which are invariant under [Y]k for y e
PROOF. We shall treat case (ii), and leave the other cases as exercises. Suppose y E 1"1 (N) and F is a weight k function of modular points (L, We first compute: P4- (yco) = F (1,
cco ± cico 2 -
because Law1+bc02.cw1+do)2 = L ) mod L (since y E F1 (N)). We also have
= F(L., w2/N) Aco), y e F) and (co), + dco 2)/.N
co2/N
§5._ The modular interpretation, and Hecke operators
155
-
-
\
P(co) =. F(LAw , Aco 2 IN) = /1 -4 F(L., co 2IN) = Next, we have ^
f(yz)= F (L (az+b)1(cz+d) , ,
I
Tv
)
= (ez ±
clY c F (Laz+b,cz+d,
ez + d N
)1
because F has weight k. But the lattice spanned by az + b and cz + d is Lz ; and (cz + d)I N.=..--- k mod Lz ; hence
f(yz) =--- (cz + d)k F (L z ,
NI) = (cz ± d)kf(z).
-
Thus, the P and feorresponding to F have the properties claimed. To show the correspondence in the other direction, given f(z) we define P(co) to be coi kfico //co 2); and given P we define F(L, t) to be Aco), where co is chosen to be any basis of L such that co2/N a t mod L. One must first check that the definition of F makes sense (i.e., that such a basis co exists), and that the definition of F is independent of the choice of such a basis co. The first point is routine, using the fact that t has exact order N in C/L, and the second point follows immediately because any other such basis must be of the form yco with y e F1 (N). It is also easy to check that, oncefis invariant under [y]k for y e F1 (N), it follows that F is invariant under y and has weight k; and that, ifFhas weight k, then so does the corresponding F. The construction going from F to P to f and the construction going from f to P to F are clearly inverse to one another. This concludes the proof. o We say that F is a modular function/ modular form/ cusp form if the corresponding f is a modular function/ modular form/ cusp form as defined in §3. We now discuss the Hecke operators acting on modular forms of weight k for r1 (N). We could define them directly on f(z) E Mk (1"1 (N)). However, the definition appears more natural when given in terms of the corresponding functions F on modular points. Let ..r denote the 0-vector space of formal finite linear combinations of modular points, i.e., 2' = OCIeL is the direct sum of infinitely many one-dimensional spaces, one for each pair (L, t), where L is any lattice in C and t E C1L is any point of exact order N. A linear map T: 2' --0 2' can be given by describing the image TeL,, = E. an ep.n of each basis element; here {P.} are a finite set of modular points. For each positive integer n we define a linear map Tn : 2) --* ..r by the following formula giving the image of the basis vector eL , t : 1 Tri (eL, r) = — 4'5' ev,t, n
(5.2)
where the summation is over all lattices L' containing L with index n such that (L', t) is a modular point. (Here for t E Ca, we still use the letter t to /
156
Ill. Modular Forms
denote the image of t modulo the larger lattice L'.) In other words, L' IL C CIL is a subgroup of order n, and t must have exact order N modulo the larger lattice L' as well as modulo L. The latter condition means that the only multiples Zt which are in L' IL are the multiples INt which are in L. = I", this condition disappears, and we sum In the case N = 1, i.e., = n. The condition on t is also empty if over all lattices L' with [L' : n and N have no common factor. To see this, suppose that g.c.d.(n, N)= 1, and suppose that N't c L'. Then the order of N't in L'IL divides N (because N'Nt E L) and divides n (because # L'IL=n), and so divides g.c.d.(N,n)=1. Thus N't E L. Notice that the sum in (5.2) is finite, since any lattice L' in the sum must because each element of L' IL has order be contained in L = dividing n = # L' IL. Thus, each L' in the sum corresponds to a subgroup of order n in iLIL ,-z-!,(11nZ) 2 . Note that T1 = 1 = the identity map. Next, for any positive integer n prime to N we define another linear map T7, „ :2—*Y by
1 Tn,n (eL,t ) = --ye( 1/n)L,t•
(5.3)
Note that t has exact order N modulo L because g.c.d.(N, n) = 1. Again we are using the same letter t to denote an element in C/L, and the corresponding element in CAL. It is easy to check the commutativity of the operators ,
Tni , ni Tn2,n2
Tnin2,nin2 = T,g2,„2 Tni ,ni
Tn,n Tm = Tni Tn, n .
(5.4)
It is also true, but not quite so trivial to prove, that the T.'s commute with one another for different m's. This will follow from the next proposition. Proposition 32. (a) If g.c.d.(m, n) = 1, then T„,„= T.T, 7 ; in particular, 2",,, and Ti, commute. (to) Ifp is a prime dividing N, then Tpt = 7. (c) If p is a prime not dividing N, then for 1> 2
Ti = Tpi--1Tp — pTp1- 2l
,
.
(5.5)
PROOF. (a) In the sum (5.2) for Tmn , the L' correspond to certain subgroups S' of order mn in namely, those which have trivial intersection . with the subgroup It CIL. Since g.c.d.(m, n)= 1, it follows that any such S' has a unique subgroup S" of order n; if L" L is the lattice corresponding to S", then S'/S" gives a subgroup of order m in ,-1,-,- L"/L". Both S" and S'/S" have nontrivial intersection with Zt. Conversely, given S"=L"ILc -1-iLIL of order n and a subgroup S' =L'IL",c kL"/L" of order m, where both subgroups have trivial intersection with It, we have a unique subgroup L'IL c ALIL of order mn with nontrivial intersection with Zt. This shows
157
§5. The modular interpretation, and Hecke operators
thatthe modular points that occur in Tm7,(eL ,,)= ï E eL ,,, and in Tm (Tn(e L. )) =-1-ITm (er ,,,) are the same. (b) By induction, it suffices to show that Tpi-i Tp = Tpt for I> 2. Let t' =It. Then Tp /(eL , t) = p -' lev ,„ where the summation is over all L' L LIL has order p i and does not contain t'. Notice that such that L' IL c L' IL must be cyclic, since otherwise it would contain a (p, p)-subgroup of pT iLIL. There is only one such (p, p)-subgroup, namely f,L1L, and t' E since pt' = Ni e L. Once we know that L' IL must be cyclic, we can use the same argument as in part (a). Namely, for each L' that occurs in the sum for Tp r(e L ,,) there is a unique cyclic subgroup of order pin L' IL; the corresponding lattice L" occurs in the sum for Tp (e"), and L' is one of the lattices that occur in 7;i -i(eL „,,). This shows the equality in part (b). (c) Since p,t N, the condition about the order of t in C/L' is always fulfilled. We have 1,i -iTp (eL,,)= p -1 IL ,, EL, e L,.„ where the first summation is over all lattices LI' such that S" = L" IL has order p, and the second summation is over all L' such that S' = L' IL" has orderp'. On the other hand, Tpt (eL. ,) = poe 1 EL, eL .,,, where the summation is over all L' such that L' IL has order IA Clearly, every L' in the inner sum for 1p r--1 Tp is an L' of the form in the sum for Tpi, and every L' in the latter sum is an L' of the form in the former sum. But we must count how many different pairs L", L' in the double sum lead to the same L'. First, if L' IL is cyclic, then there is only one possible L". But if L'1L is not cyclic, i.e., if L'IL D g1L, then L" can be an arbitrary lattice such that L" IL has order p. Since there are p 1 such lattices (for example, they are in one-to-one correspondence with the points on the projective line over the field of p elements), it follows that there are p extra times that eL ,,, occurs in the double sum for Tpt-iTp . Thus,
Tpl(eL , t) = 7p l-i 7p (e 1 , 1)
p
,t• L'(1/p)L
[L':( lip)Li=pI- 2
But
1 T 1-2T (e )= — 2 TP I-2e (1/p)L, t =p 14,t P, P
[L':( 1/P)Li
e
t
This concludes the proof of part (c). If n = prŒr is the prime factorization of the positive integer n, then Proposition 32(a) says that 71 = Tpf, Tpft . Then parts (b)-(c) show that each Tpcp is a polynomial in Tpi and Tpi , pi . It is easy to see from this and (5.4) that all of the Yin's commute with each other. Thus, the operators T,,, (n a positive integer prime to N) and Tm (m Any positive integer) generate a commutative algebra AP of linear maps from 2 to 2; actually, eit9 is generated by the Tp , p (p,I1 N a prime) and the Tp (p any prime). There is an elegant way to summarize the relations in Proposition 32 as formal power series identities, where the coefficients of the power series
III. Modular Forms
158 J
are elements in follows:
Ye.
First, for p
v.:el
IN,
Tp 1X 1 =--
we can restate Proposition 32(b) as
1 I — TX
PIN'
(5.6)
i.e., (I TpIX 1)(1 — Tp X) = 1 in le[[X]]. This follows from Proposition 32(b) because, equating coefficients, we see that the coefficient of X' is — Tri-iTp . Similarly, for p,I 'N, part (c) of Proposition 32 is equivalent to the identity 1 pN, (5.7) i TplX 1 = Tp X + pTp, pil: ' 1 i.--, 0
7;,X + pTp, p X 2 and equate i.e., if we multiply both sides of (5.7) by 1 coefficients of powers of X, we see that (5.7) is equivalent to the equalities —
Tp — Tp = 0,
Ti = 1,
Tpt — Tpr-iTp + pTpi-2 T„,p for / > 2.
To incorporate part (a) of Proposition 32, we introduce a new variable s by putting X = p - s for each p in (5.6) or (5.7). We then take the product of (5.6) over p with pIN and (5.7) over all p with p,11 N: co I 7 1-2s I -H fI E p1P -Is = fl piN 1 — TpP s p-f1■1 1 — TpP -s + Tp,pP all p 1 ,-- 0 But, by part (a) of the proposition, when we multiply together the sums on the left in this equality, we obtain / To', where the sum is over all positive integers n. The proof is exactly like the proof of the Euler product for the Riemann zeta-function. We use the factorization n = p71 • • • A.'', and the « p r-ars). We hence conclude that relation: T„n -3 = (Tpli p -l Œts) • - • (T Pr' oo
To' = n=1
ri piN 1 —
1 11 I E 12s• - • T ,PP TpP s pp' 1 — TpP - +P
(5.8)
For d an integer prime to N, let [d]: Y -÷ 2 be the linear map defined on basis elements by Me ', , =-- eL di • Note that di has exact order N in C/L, because g.c.d.(d, N) =--- 1. A lso nc;te that [d] depends only on d modulo N, i.e., we have an action of the group (Z/ NZ)* We now consider function's F on modular points and the corresponding functions f(z) on H. Again we suppose that we are in the case I' = 1"1 (N). If T: 21 ---* 21 is a linear map given on basis elements by equations of the form T(e La ) = 1 an e pa , then we have a corresponding linear map (which we also denote T) on the vector space of complex-valued functions on modular points: TAL, t) =EaF(P). For example,
[d]F(L, t) ------ F(L, dt) T„,„F(L, t) = n -20, t) n T„F(L, t) = 1 E F(L1 , t), n L,
(here g.c.d.(d, N) = 1);
(g.c.d.(n, N) = 1);
(5.9)
159
§5. The modular interpretation, and Hecke operators
where the last summation is over all modular points (L', t) such that [L' = n, as in (5.2). Proposition 33. Suppose that F(L, t) corresponds to a function f(z) on H which is in Mk (Ti (N)). Then [d]F, TF, and TF also correspond to functions (denoted [d]f, Tf, and Tnf) in Mk (ri (N)). If f is a cusp form, then so are [d]f, Tn.nf, and Tnf. Thus, [d], 7, and Tn may be regarded as linear maps on mk(ri (N)) or on sk(ri (N)). In this situation, let x be a Dirichlet character modulo N. Then Je MAN, x) if and only if [d]F = x(d)F, i.e., if and only if
F(L, dt) = x(d)F(L, t) for de (ZI NZ)* .
(5.10)
PROOF. To show that [d]f, T„, „And Tjare invariant under [y] k for y e F1 (N), by Proposition 31 it suffices to show that [d]F(L, t), TF(L, t), and T„F(L, t) have weight k, i.e., [d]F(AL, At) =--- A -k [d]F(L, 1), TF(AL, At) =--A -k T„,nF(L, t), and TnF(AL, At) = A-k TnF(L, t); but this all follows trivially from the definitions. We next check the condition at the cusps. Note that if a = e GL (Q) and iff(z) corresponds to F(L, t), then f(z)l[a] k = (det Œ) 2 (CZ d) -k F (L„z , -110
= (det 00'12 F (1-,..z+b,cz+d, cz In particular, if ŒE F, then this equals F(Lz , (cz d)/N). Next, for each de(ZINZ)*, choose a fixed ad e T such that ad (cr) )mod N. (This is possible, because g.c.d.(d, N) = 1, and the map F SL 2 (7L/N7L) is surjective, by Problem 2 of §1II.1.) We then have .f(z) I [act]k
F (45 / c-v )= [d] F
(L
Z
[d]f(z),
i.e., [d]f = fif ad_ik• 1 Thus, for 70 E F we check (3.8) for [d]f as follows: [d],f i[Ydk = f I [6dYdk , which has a q-expansion of the required type, i.e., [d]fsatisfies (3.8) iffdoes. Similarly, we find that 7,J(z) = n-2 F(L,L,, h) = nk -2 F(Lz , ;v2-) = n k- 2 [I ] f(z) , so this case has already been covered. We next consider the cusp condition for T„f(z), which is a sum of functions of the form 1,7), where L' is a lattice containing L 2 with index n. We take such an L' and let (co l c02) be a basis for L'. Since Lz L' with index n, there is a matrix t with integral entries and determinant n such that (7) = TO) (co denotes the column vector with entries co l and c02). We can choose a set T of Such t (independently of z) such that
Tf(z) =
E F(L., ( z ) n t€ , 9 N
We now consider each F(L., h), where (I) = t(: 21). We find a yer such that yr 71 ,1,(g ',7,) with zero lower-left entry and a, b, d integers with ad = n
III. Modular Forms
160 -
(it is an easy exercise to see that this can be done). The lattice spanned by CO = T -1 (7) is the same as that spanned by y() = -,vg ",)(7). Thus, a
F(Lto I ) = F (I,( az+ bjln,d In , -9 N = ci c F (L(az + b )1d , 1 , -N'— N = ak [a] f((az + 1))1d). But if f(z) satisfies (3.8), then so does f((az + 12)1d), and [a] also preserves (3.8), as shown above. This proves the cusp condition for Tnf. Finally, let y d E To (N) be any element with lower-right entry d. As shown above, f(z)l[yj k = F(Lz , kJ) = [d]F(Lz , Ntf). Thus, f INk corresponds to [diF, and so ,f,ifL.,vd_iI k T his completes 0 the proof of the proposition. We saw before (Proposition 28) that a function Je Mk(Ti (N)) can be written as a sum of functions in Mk (N, x) for different Dirichlet characters x. Thus, using the one-to-one correspondence in Proposition 31, we can write a modular form F(L, t) as a direct sum of F's which satisfy (5.10) for various x. Proposition 34. The operators 7,, and 7 commute with [d], and preserve the space of F(L, t) of weight k which satisfy (5.10). If F(L, t) has weight k and
satisfies-(5.10), then T„F = nk-2 x(n)F.
PROOF. That the operators commute follows directly from the definitions.
Next, if [d]F = x(d)F, it follows that [d] TF = T[d] F = x(d)TF and [d] TF = T[d]F = x(d)7,F. This is just the simple fact from linear algebra that the eigenspace for an operator [d] with a given eigenvalue is preserved under any operator which commutes with [d]. Finally, if F(L, t) satisfies (5.10), then 1;,,nF(L, t) = n-2 F(L, t) = nk-2 F(L, nt) =
0
nk-2 [n]F(L, 0 = nk-2 X(n)F(1-1 1).
If we translate the action of Tn , T. „ and [d] from functions F(L, t) to functions f(z) on H, then Proposition 34 becomes Proposition 35. Tn and T„„ preserve Alk(N, X), and also Sk (N, x). For 4 f€ Mk (N, X) the action of T„,:, is given by T„,„f = nk-2 x(n)fs Proposition 36. The operators 7
Mk (N, x) satisfy the formal power series
identity a,.
E nr,-- 1
Tun - s = jj 0 - Tpp -s + x(p)p k-i — 2s‘)—1 .
(5.11)
all p
PROOF. We simply use (5.8) and observe that for
p)( N we have Tp, pf = p k-2 x(p)f, while for .1)1 N the term on the right in (5.11) becomes (1 — Tpp -s)l because x(p) = 0. 0
161
§5. The modular interpretation, and Hecke operators
As a special case of these propositions, suppose we take x to be the trivial character of (ZIN1)*. Then Mk (N, -x) = Mk(Fo (N)), and the modular forms fe Mk(N, x) correspond to F(L, 1) on which [d] acts trivially, i.e., F(L, = F(L, dt) for all de (ZINZ)*. Such F(L, 1) are in one-to-one correspondence with functions F(L, S), where S is a cyclic subgroup of order N in C/L. Nainely, choose t to be any generator of S, and set F(L, S) = F(L, t). The fact that F(L, t) = F(L, dt) means that it makes no difference which generator of S is chosen. Conversely, given F(L, S), define F(L, t) = F(L, S1), where = Zt is the subgroup of C/L generated by t. So we have just verified that functions of modular points in the sense of case (iii) at the beginning of this section correspond to modular forms for ro (N). We now examine the effect of the Hecke operator Tm on the q-expansion at oo of a modular form f(z) E Mk(N, x). That is, if we write f(z) = E an q" and Tmf(z) = Zbq, q = e'r iz, we want to express b„ in terms of the an . We first introduce some notation. IffE C [Pin f Za n e, we define ;
ben
;
Um f=
(5.12)
an gnim,
where the latter summation is only over n divisible by m. Note that U1 = identity, and Um o J' the identity, while V„,o Un, is the map on power series which deletes all terms with n not divisible by ni. Suppose that f(z) = E anqn, q = e2niz, converges for ze H. Then we clearly have:
Vmf(z) = f(mz);
Umf(z) =
Proposition 37. Let f(z) = E,T=0 anqn , q = e 2 E,T_.. 0 kg". Then
bn= apn
X(P)P
m -1
m i=0
f
+j m
E).
(5.13)
, fe Mk (N, x), and let 7,f(z)
k-1. a
nip9
(5.14)
where we take x(p) = 0 i piN and we take anip = O fn is not divisible by p. In other words,
Tp =
x(p)p k-1 Vp on
Mk (N, x).
(5.15)
We have Tpf(z) = F(L' , k), where F is the function on modular points which corresponds to f and the sum is over all lattices L' containing Lz with index p such that ki has order N modulo L'. Such L' are contained in the lattice -,15Lz generated by 1, and +„ and the lattices of index p are in one-toone correspondence with the projective line Np over the field of p elements Fp = ZIpl. Namely, the point in 1131-p with homogeneous coordinates (a 1 , a2 ) corresponds to the lattice generated by Lz and (a l z a 2)/p. Thus, there are p + 1 possible L' corresponding to (1, j) for j = 0, 1, . . . , p — 1 and (0, 1). If ON, then all p ± 1 of these lattices L' are included; if piN, then the last lattice (generated by Lz and must be omitted, since k has order 1, in that case. Note that the lattice generated by Lz and (z Alp is L(z+i)/p . Thus, if pi N we have 7,f(z) = t, h) = Upf(z). If PROOF.
162
III. Modular Forms
ON, then we have the same sum plus one additional term corresponding
to the lattice generated by
s pz . Thus, in that case 4 and 1,-; this lattice is f,L
p:N) 1) = Upf(z) + plc' F (Lpz, N
Tpf(z) = Upf(z) + ;I'01 , pz,
= Upf(z) + p k-1 x(p)F (L pz , Tv1 -) = Upf(z) + pk-1 x(p)f(pz),
which is (5.15). The expression (5.14) for the bn follows directly from (5.15) 0 if we use the expressions (5.12) for the operators Vp and U. As a consequence of Proposition 37 we have the factorization
1 — 7X+ x(p)p k-1 X 2 = (1 — UpX)(1 — x(p)p k-1 VpX),
(5.16)
where both sides are regarded as polynomials in the variable X whose coefficients are in the algebra of operators on the subspace of C [[q]] formed by the q-expansions of elements f(z) E MOT, X). To see (5.16), note that the equality of coefficients of X is precisely (5.15), while the coefficients of X' agree because U po Vp = 1. Proposition
38.
We have the following formal identity : co
co
E
ru n — s
n=1
= E x (n)nk —i
i Un).
n=1
or, equivalently,
Tn =
(5.17)
n=1
E x(d)d' Vd
(5.18)
0 Unid.
din
PROOF. By (5.11) and (5.16) we find that the left side of (5.17) is equal to
Fl ((1 — upp — s)(1 — x(p)pk -1 VpP —s
)) -1 •
P
Since the Up and Vp do not commute, we must be careful about the order of the factors. Moving the inverse operation inside the outer parentheses, we reverse the order, obtaining
ri (0 -
x(p)pk - i vpp - syi (1
- Up').
P
Note that Up , and Vp2 do commute for p 1 0 p2 , as follows immediately from (5.12). This enables us to move all of the (1 — U p - s)_l terms to the right past any (1 — X(P2)Pr i Vp,P2-s) -1 term for /32 0 p. This gives us separate products with the Vp's and with the Up's :
1
1
Il l - x(p)pk -1 Vpp - s n 1 - Upp - s: p
p
We now expand each term in a geometric series and use the fact that V. = Vn, o Vn and U. = Una o U. The result is (5.17). 0
163
§5. the modular interpretation, and Hecke operators
Proposition 39. Under the conditions of Proposition 37, if Tmf(z) = E(:,°_, 0 b,,qn,
then
x(d)dk-i
Z
bn =
a mnid 2
(5.19)
'
dig-c.c1-(m,n)
PROOF. According to
(5.18), we have
Tm Z (lair = n= 0
,
E x(d)d k-1 Vd o Umid E an q n n= 0
dim
= E x(d)d" 1 E dim
mldfn
a qd2 n
ron .
If we set r = d2 n1m, the inner sum becomes I arm1d2gr with the sum taken over all r divisible by d. Replacing r by n and gathering together coefficients o of q' , we obtain the expression (5.19) for the n-th coefficient. Most of the most important examples of modular forms turn out to be eigenvectors ("eigenforms") for the action of all of the Tm on the given space of modular forms. Iff E Mk(N, z) is such an eigenform, then we can conclude a lot of information about its q-expansion coefficients.
is an eigenform for all of the operators Tm with eigenvalues Am , m = 1, 2, . . . : Tmf = Anil'. Let am be the q-expansion coefficients: f(z) =Em",, o am qm. Then am = Amai for m = 1, 2, . ... In addition, al 0 0 unless k = 0 and f is a constant function. Finally, if ao 0 0, then A m is given by the formula Proposition 40. Suppose that f(z) E Alk (N,
A m ------
)()
E x(d)d'.
(5.20)
dim
Using (5.19) with n = 1, we find that the coefficient of the fi rst power of q in Tmf is am . If Tmf = Am!, then this coefficient is also equal to A m a 1 . This proves the first assertion. If we had a l = 0, then it would follow that all am = 0, and f would be a constant. Finally, suppose that a() 0 O. If we compare the constant terms in Tmf = 2.f and use (5.19) with n = 0, we obtain: A m ao = 1)0 = Eon x(d)d' i ao . Dividing by a() gives the formula for PROOF.
0
Am .
Iff is an eigenform as in Proposition 40 (with k 0), then we can multiply it by a suitable constant to get the coefficient of g‘ equal to 1: a l = 1. Such an eigenform is called "normalized". In that casé, Proposition 40 tells us that am = Am is simply the eigenvalue of Tm . If we then apply the operator identity (5.11) to the eigenform f, we obtain identities for the g-expansion coefficients of f. Namely, applying both sides of (5.11) to a normalized eigenform fE Mk(N, x), we have: CO
E an n=1
= fio _ app_s +x(p)?,,,sy i
•
III. Modular -Forms
164
1. Let N = 1, and let k > 4 be an even integer. Suppose that is an eigenform for all of the T., and suppose that a o 0 0 f= auir G Mk and f is normalized (i.e., a l = 1). Then Proposition 40 says that f= (10 + Eo-k _ i (n)qn. But recall the Eisenstein series Ek = 1 — Lik-1(1)qn (see (2.11)). Thus, f and --IkEk are two elements of Mk (r) which differ by a constant. Since there are no nonzero constants in Mk (F), they must be equal, i.e., a o =— Bk/2k. Therefore, up to a constant factor, any weight k eigenform for r which is not a cusp form (i.e., a o 0 0) must be Ek. Conversely, it is not hard to show that Ek is actually an eigenform for all of the Tm . This can be done using the q-expansion and (5.14) (it suffices to show that it is an eigenform for the Tp , since any Tm is a polynomial in the operators Tp for p prime). Another method is to use the original definition of Tm on modular points, applied to Gk (L) = - o*rel.r k . 2. Let N = 1, k = 12. Since S12 (1") is one-dimensional, spanned by (2n) -12 A(z) = E;;°=1 t(n)q" (see Propositions 9(d) and 15), and since the L preserve Sk (F), it follows that f= Et(n)q" is an eigenform for L. It is normalized, since t(1) = 1. Then Proposition 40 says that Tmf = t(m)f for all in. Thus, the relation (5.11) applied to f gives co 1 (5.21) -c(n)n - s = sp11—2s• EXAMPLES.
(r)
E
allp 1 — t(P)P—
n=1
This Euler product for the Mellin transform off = E t(n)q' is equivalent to the sequence of identities (see Proposition 32):
t(mn) = t(m)t(n)
for in, n relatively prime;
T(P i) = t(p 1 )c(p) P 1 'T(P 1-2 )Ramanujan conjectured that in the denominator of (5.21) the quadratic 1 — t(p)X p' X 2 (where X = p -s) has complex conjugate reciprocal roots ap and a-14p ; equivalently, 1 — t(p)X pll X 2 = (1 — ap X)(1 — FipX), i.e., t(p) = ap + p with lap ! = p l 112 From this it is easy to conclude that IT(p)i 2/3' 1/2 , and, more generally, IT(n)! < 0- 0 (n)n 1112 (see the proof of Proposition 13 in §II.6). The Ramanujan conjecture and its generalization, the Ramanujan-Petersson conjecture for cusp forms which are eigenforms for the Hecke operators, were proved by Deligne as a consequence of the Weil conjectures (see [Katz 1976a]). The Euler product (5.21) is reminiscent of the Hasse-Weil L-series for elliptic curves. For more information on such connections, see [S4imura 1971]. 3. Let N = 4, z(n) = (4) = (— 1)(11-1)12 for n odd. We saw that Mi (N, x) is one-dimensional and is spanned by .0 2 . If we apply Proposition 40 to e2 = + q + • - • + 21,qm + • • • 9 we find that Am = (-L -;1 ), where the sum is over odd d dividing m. For example,
Ap 1
,—1
-r
—
p
)
10 ifp 3 (mod 4); 2 if pE:-. 1 (mod 4).
(5.22)
165
§5. The modular interpretation, and Hecke operators
Since 2„, is one-fourth the number of ways m can be written as a sum of two squares, we have recovered the well-known fact that p can be so expressed in eight ways if p 1 (mod 4) and in no way if p E-: 3 (mod 4). That fact is usually proved using factorization in the Gaussian integers Z[i]. The Gaussian field 0(i) has discriminant 4, and corresponds to the character x of Z having conductor 4: x (n) = 6-,1). Let 1 denote the trivial character of (Z/41)*, and let p denote the two-dimensional representation ( ) of (Z/4Z)*. Then we can write (5.22) in the form A p = Tr p(p). This turns out to be a very special case of a general fact: a normalized weight-one eigenform in M l (N, x) has its q-expansion coefficients determined by the trace of a certain two-dimensional representation of a certain galois group. For more information about this, see [Deligne and Serre 1974].
Another approach to Hecke operators. Let F1 and some group G.
F2
be two subgroups of
Definition. F1 and F2 are said to be "commensurable" if their intersection has finite index in each group: [F1 F/ n F2] < 00 , [F2 : F1 n F2] < 00 . Let r be a congruence subgroup of F SL 2 (1), and let a e G = G.L4 - (0). Then r and a' F'a are commensurable. To see this, suppose r F (N ). By Lemma 1 in the proof of Proposition 17, we have r(ND) for some D. Let F" = a-l r'a F(ND) and F' n ara F' n cc Pa. Then F" r(ND), otr"oc— i r (ND). Thus, [r F"] [F: ['WD)] < cc, and also [a' F'a : F"] = [F' : aF"a -1 ] [F: F(ND)]
BASIC EXAMPLE.
union of d right cosets, then r =
U1=1 r"Yj •
-
Given an element Vi ay2 with yi , y2 e F', we can write y2 = y"y5 with y" F" for some j. Since y" a -1 1'a, we can write y" = c( 1 y'a, so that yi ai/2 = yl a(a -l y'a)y; = (Vi y')ay; e ray'. We must show that the right cosets F'ay; are distinct for different j. Suppose y.l ay; = y2 ayk'. Then yIyk' -1 = e F'a. Since y5yk1-1 e r, it follows that Wyk' E F", i.e., y; e F"yk', and so j = k. The converse assertion is also routine, so we shall omit the details. PROOF.
166
iii.
Modular Forms
We shall define Hecke operators in terms of double cosets for a large class of congruence subgroups of f, a class which includes 1"1 (N), fo (N) and r(v) as special cases. Then we shall show that in the case of ri (N) this definition coincides with our earlier definition of the operators Tn acting on mk(ri (N)). ' Let S + be a nonzero additive subgroup of the integers, i.e., S + = MZ for some positive integer M. Let Sx be a subgroup of (ZINZ)* . We shall also use S" to denote the subset of Z whose image modulo N is in 51 " . (If N = 1, then we agree to take Sx = Z.) For example, if Sx = {1 } , then we also use S' to denote 1 + NZ c Z. Let n be any positive integer. We define
An(N, S<, S +) a-:;--f {integer matrices
e v Nit., a e Sx , b e S + , det (
) = n}.
(5.23) If N = 1 and V = S + = Z, then A" is simply the set of all 2 x 2-matrices with determinant n. It is easy to check that Am (N, S>, S +) • A"(N, V , S +)\c A"(N, V , S +),
(5.24)
and that A l (N, S' , S +) is a group. A l (N, 5' , S +) is clearly a congruence subgroup of I", since it contains r(N/), where N' is the least common multiple of M and N (recall S+ = MZ). Here are our familiar examples:
ro (N ) = A l (N, (7L1 NZ)* , Z);
ri (N) = A l (N , {1 , Z); }
I" (N) = A 1 (N, {1} , NZ). Definition. Let r be a congruence subgroup of r, and let a e GL I (0). Let 1"" = r n a -1 1-"a, and let d z---.- [f': r"], I"' = U1=1 1""yj. Let f(z) be a function on H which is invariant under [y]k for y e r. Then , f(Z)
Proposition
d
I Crlarik ,-;-,. E f(z)f[aynk.
(5.25)
i---1
42. f(z)l[rall k does not change if a is replaced by any other
representative a' of the same double coset : Fair = Far. Nor does it depend on the choice of representatives y.; of r/ modulo r". If Je mk (r), then fi[railk e
mk(n.
PROOF. We first prove the second assertion, that is, that
(5.25) is unchanged
if y.; is replaced by ylyi, where 7; e r". Since r" a-l ra, we have 7; = a-l .pia for some -,i e r. Then f l[ay; y:]k = flP jayAk =f[ŒY5]k, because f IDA, =f. Next,. we observe that, by Proposition 41, the sum on the right in (5.25) can be written Ef(z)I[ai] k , where- the ai are any elements such that rar = U rai . It is then immediate that the definition depends only on the double coset Far and not on the choice of representative a. Finally, suppose that fe Mk (r). If y e r', then (f f[rocrlk)([y]k = Ef lEotyAk = f [Par ], since right multiplication by y just rearranges the right cosets ray]. If f
I
167
§5. The modular interpretation, and Hecke operators
satisfies the cusp conditions, then so does fl[ayA k for each j, by Lemma 2 in the proof of Proposition 17. This completes the proof of Proposition 42. CI
Definition. Let r = A 1 (N, S", S+ ), and let n be a positive integer. Let fe Mk (r). Then
7f
(k12)-1
Ef I [rank,
(5.26)
where the sum is over all double cosets of F' in An(N, Sx , S t ). By Proposition 42, we have Tnfe mk (r). _ Equivalently, we can define Tn f
= n(k/2)-1 E
.,...k, fi[al where Pai runs through the right cosets of ri in An(N, S x , S). Proposition 43. In the case
r = ri ( N), the definition
(5.27)
(5.26) agrees with our
earlier definition of the Hecke operators T n . PROOF. Let A" = An (N , {1}, z). For each a e (Z1 NZ)* we fix a
GF
such that
Lemma. (5.28)
A" = U fi (N)øa (" disjoint
0 dt
where the disjoint union is taken over all positive a dividing n and prime to N, and for each such a we set d = nla and take b = 0, 1, .. . , d — 1. The terms on the right in (5.28) are clearly contained in A. Suppose the union were not disjoint. Then for some a', b', d' = nid we would have Yi aa (g :11) = y2 o-„,q; ct;:), and so F would contain the matrix (g byl; ':,:y1= (a/81' 4,); then a' = a, d' = d, and so (g ,'0,) = ab('; db :) for some j, i.e., b = b' + jd. If 0 < b, b' < d, this means that b = b'. To prove the lemma, it remains to show that any a = (ae: n e A" is in one of the terms on the right in (5.28). Choose g, h relatively prime so that go' + hc' =- 0, and complete the row g, h to a matrix y = ge il; ) e F. Then ya has determinant n and is of the form (8' :). Replacing y by +(13,. by if necessary, without loss of generality we may suppose that ya = (g ) with a> 0, ad = n, 0 b < d. Then, considered modulo N, the equality a = y -1 (g bd ) gives (1) ,7*) r_.=_- ( 1,!; :)(g 1,;), i.e., y-1 is of the form Yi aa for some yi e 1"1 (M. Thus, a = y1 a,,(0` bd) e r-i1 (N)cra(8 !,), as desired. This completes the proof of the lemma. o PROOF OF LEMMA.
(
,We now prove the proposition. Let Tr be the linear map, on Mk (Fi (N)) defined by (5.26) (or (5.27)), and let Tir d be the earlier definition. Since Aik(Fi (N)) = C i x Mit(N , x), it suffices to show that T:ew = T:k1 on Mk (N, z).
168
Iii.
Modular Forms
Let Je Mk (N, x). By the lemma, we have T: ewf = w k/2)-1 Ef i pia (g bA k. Since f Mk (N, x) we have f I [aa ],, = x(a)f. Also, for each a> 0 and d ad = n we have d-1
E f(z)IN
b=0
bdnk
with
= 11 nki2 cl-kf(az d± 1 b=0 = n ki2 ai-k Va 0 dUdf(z),
by (5.13). Thus, Tnnewf = (k/2)- 1
Ex(a)n ka d i
—k va 0 udf.
an
(The sum is over a > 0 dividing n and prime to N; but the term x(a) ensures . that the terms with g.c.d.(a, N) > 1 will drop out.) That is,
T:ew z---
E x (a)ak - i va o u„,a =
Told ,
all;
by (5.18). This proves Proposition 43.
to
In many situations, the definition of Tn in terms of double cosets is more convenient than the definition in terms of modular points. For example, we shall use this definition below to show that Tu is a Hermitian operator with respect to the Petersson scalar product. Moreover, the double coset approach can be generalized to situations where no good interpretation in terms of modular points is known. We shall see an example of this in the next chapter, where we shall define Hecke operators on forms of half-integral weight. For a more detailed treatment of the double coset approach in a more general context, see [Shimura 1971].
The Petersson scalar product. Suppose we have an integral over some region in the upper half-plane and make the change of variable z F.+ az = where a = (ca 1:1) E GL I (0). If we have a term dxdyly 2 in the integrand, this does not change under such a change of variable. To see this, we compute:
, daz ....._ = det a dz (cz + d) 2 '
az
Im _ det a Im z — Icz + dr .
(5.29)
In general, if we make a differentiable change of complex variable z 1 = u(z), then the area element dxdy near z is multiplied by lut(z)1 2 (see Fig. 111.5). Thus, interpreting the first equality in (5.29) in terms of the real variables x = Re z and y = Im z, we see that an element of area near z is expanded by a factor idazIdzi 2 = (det a) 2/Icz + dr. Thus, dxdyly 2 is invariant under the change of variables, by (5.29).
Proposition 44. Let F' c F be a congruence Subgroup, let F' c H be any fundamentardomain for F', and define
169
§5._ The modular interpretation, and Hecke operators
Figure 111.5
I .7v
dxdy
11(r1)
cfff
F"
(5.30)
2 '
Then 0 (a) The integral (5.30) converges and is independent of the choice of Fi . (b) [t.- : i-4 1 = griht(r). (e) If a e GLI (0) and cc-1 ra c F, then [r: rf] = [f: a-1 f' al PROOF. First let [r: 11 = d, let f = U1=1 (04'9 and take F' = U ai l FNotice that the integral for p(F) converges, because
f dxdy f 1/2 JF
< I Y2 .1
y'ydx d :=
2 Vi
—1/2J srS/2
If for each j we make the change of variables zi--+ aj z, we find that ja7 1 F dXdY1Y2 = IF dxdyIy 2 ; hence, for our choice of F' we find that the 1 integral in (5.30) converges to dp(F). Suppose we chose a different fundamental domain F1 for F'. Then we divide F1 into regions R for which there exists a e F' with aR c F', and again we use the invariance of dxdyly 2 under zl---+az to show that the integral over R c F1 and the integral over the corresponding region aR in F' are equal. Finally, to show part (c), we note that a' F' is a fundamental domain for a-i ra. Since
f
dxdy _ f dxdy
Gt_ir y2
r y2
,
it follows that p(a - ir a) = p(r), and so part (c) follows from part (b).
a
mk(r).
Now suppose that f(z) and g(z) are two functions in We consider the function f(z)g(z)y k , where the bar denotes Complex conjugation and y = Im z. If we replace the variable z by az for a e GL I (0), we obtain: f(az)g(az)y k (det allez + di 2 )k, by (5.29). But this is just (f(z)l[a] k) (g(z)i[a] k)y k . Thus, the effect of the change of variable is to replace f by f[Œ]k and g by g I [a]k .
rf = r
Definition. Let be a congruence subgroup, let F' be a fundamental domain for F', and let f, g e Mk (F'), with at least one of the two functions f,
170
in Modular Forms
g a cusp form. Then we define
f
o dyxd2y . (5.31) i;g> (if [f : fl < p i" It is immediate from this definition that
...z .
75 g yk
Proposition 45. The integral in (531) is absolutely convergent, and does not depend on the choice of F'. If F" is another congruence subgroup such that f, g emk (r"), then the definition of
n
Remark. The term lir : in (5.31) is needed in order to have the second assertion in the proposition. The requirement that f or g be a cusp form is needed to get convergence of the integral, as we shall see. First take F' = H ail F, where f = U Œif-/- In each region cT i F make the change of variables which replaces z by cc1 1 z, thereby transforming the integral into (z)i[ail]kg(zacc-i]. , dxdy
PROOF.
Efj f)
)
F
K
•T
y
2 •
Sincef, g G mk (r), we can writef l[ct,7 ' ],, = Es13.--.0 and, 91[01.r 1 ]k = Ec:= 0 bnd, with either an = 0 or bn = 0, since f or g is a cusp form. Because IqN I = le2 niziNi= e -21rY1N and y > 1 in F, and because an and b„ have only polyno. mial growth (see Problem 19 in §3), it is not hard to see that the integral is absolutely convergent. Next, if F1 is another fundamental domain, we proceed as in the proof of Proposition 44, comparing SR f(z)g(z)y'dxdy with SŒR f(z)g(z)? -2 dxdy, where R is a subregion of F1 and al?, with a e F', is the corresponding subregion of F'. Making the change of variable z 1- az and using the fact that f I NI, = f, giNk = g, we find that the two integrals are equal. Finally, suppose f, ge Amr"). First suppose that F" c F'. Writing f' = f ", F" =U67'F', we have
U1&
1
0-4 : f 1 fr,
— k dxdy f(z)g(z)y 2 Y
.
=
1
-
1
[I':
E ri"] s
l f(z)1[6i-l ]kg(z)1[6i-l ]Yk k dxdy y F'
[f : ft] But all of the summands on the right are equal, sincefi[bi] k = 1 gi[61- 1k = g; and there are [r': i---"] values of!. We thus obtain the right side of (5.31). If F" s* rt, we simply set r"f = F" r' F', and show that (5.31) and (5.31) with F".in place of F' are each equal to (5.31) ü7' - "I in place of F'. This completes the proof of Proposition 45. 0
171
§5. The modular interpretation, and Hecke operators
Proposition 46. Let f, g e Mk (r) with f or g a cusp form. Let a e GL (Q). Then
(5.32)
<j" g> =
PROOF. Let F" = F' n ar'a'. Then f, g e Mk(f"), and fi[a]k , gl[a]k E Mk(GC 1 F"a) by Proposition 17(a). Here a -1 1""a = a' Fla n F'. Let F" be a fundamental domain for r", and take o( 1 F" as a fundamental domain for a -l F"a. Then the right side of (5.32) is
1
,c_ iF„ [Ï: .4 : a - r"a] f
kdxdy 1 f(z)1[ 0 ]9(z) I [Œ]k.Y
r-
I
-
fr,f(Z)g (z)
„kdxdy . y2
-,/
--Since [r: a-1 1-- "a] = [r: t- "] by Proposition 44(c), we obtain the left side 0 of (5.32). Now for a E GL I (0) we note that a can be multiplied by a positive scalar without affecting [a]k. So without loss of generality we shall assume in what follows that a = ( 'c'i ) has integer entries. Let D = ad — bc = det a, and set a' = Da- .1 _tic -ab‘) Then [aœl ]k = [alk . (
.
Proposition 47. With f, g, a as in Proposition 46,
(5.33)
In addition,
o
yi , 7;' e r'.
Proposition 48. Let r = IVN), A" = dn(N, {1}, Z) (see (5.23)). Let f, g e Mk (r) with for g a cusp form. For each de (ZINZ)* , fix some ord er such that ad ---- ( '' (2, ) mod N. Let n be a positive integer prime to N. Then
=
If a = C JD e A" then a .7-_-- (1) bn) mod N, CL' (!)- -bnin Mod N. Thus, o-n a' e An. By definition,
(g ---i b) mod N, and
)
,,f= 77 k/2)-1 Efl[ r, r /], 7 (
where the sum is over the distinct double cosets of F' = A' in A". Let dOE denote the number of right cosets in F'ar. Then dOE = [r: r n a -1 ra] by Proposition 41. By ( ',), (5.25) and the second assertion in Proposition 47,
we have
-
172
111. Modular Forms
g>,
i.e., oc i = y2 Œ2 (7 y1 c). Since o-,,-1 1"1 (N)an = rj. (N), we have ; e and so ; and cx2 are in the same double coset. Next, we observe that c4, = dOE, because
F'
0.00 -1 I"' an oc' = F'
awn-1 rance- ' = rt r ara -1
= oc(oC i ra r noc -i and so
dOE = [1"' F' tm oc -1 1"'cd = [F': F' (an ap -l ran al = by Proposition 44(c). Thus,
A basis of eigenforms. We first remark that for any congruence subgroup F' and any integer k, the C-vector space Mk(r) is finite dimensional. One way to show this would be to take a fundamental domain F' for r of the form Ff = H ail F and integrate f'(z)lf(z) around the boundary for nonzero fe mk(r), as in the proof of Proposition 8 of §111.2, thereby obtaining a bound on the total number of zeros off in a fundamental domain. But if Mk (f) were infinite dimensional, by taking suitable linear combinations one could obtain nonzero fe Mk(r) which vanish at any given finite set of points in F'. Alternately, one could prove finite dimensionality, and even obtain formulas for the dimension, using the Riemann—Roch theorem (see [Shimura 1971, §2.6]). - The Petersson scalar product (5.31) gives a hermitian scalar product on the finite dimensional C-vector space Sk (F'). That is,
§5. The modular interpretation, and Hecke operators
1 73
nice properties: the We saw before that eigenforms for the I coefficients can be expressed in terms of the eigenvalues for the Tp , and the corresponding Dirichlet series have Euler produCts. Because of Proposition 50, it is possible to find a basis of such forms. Proposition 51. There exists a basis of the C-vector space Sk (N, x) whose elements are eigenforms for all of the T„ for which g.c.d. (n, N) = 1. For any fixed Tn with g.c.d.(n, N) = 1 and any subspace S c Sk (N, x) which is preserved by T„, there exists a basis of S consisting of eigenforms of Tn . To see this, we apply the following basic fact from linear algebra (see [Lang 1984, §XIV.12]) to the hermitian operator en Trz : Given any hermitian operator T on a finite dimensional C-vector space S, there exists a basis of Ss consisting of eigenvectors for T. We further note that any eigenspace for is preserved by all T„,, as follows from the fact that T, and Tn, commute: if Trif = .1„.f, then T„(T„, f) = T„,Tf= An T„, f. (This remark does not require n or n' to be prime to N.) Thus, to prove the proposition, we list-the Tn for n prime to N (actually, it suffices to work with the Tp for primes p,11 N, since any eigenform for them is an eigenform for the Tn with n prime to N). We write Sk (N, x) as a direct sum of eigenspaces S for the first Tn in the list. Then we write each S as a direct sum of eigenspaces for the next Tn , then we write each one of those spaces (which is an eigenspace for the first two Tn's) as a sum of eigenspaces for the third Tn ; and so on. Because Sk (N, z) is finite dimensional, after finitely many steps this process must stop giving us any new smaller spaces. At that point Sk (N, x) is expressed as a direct sum of subspaces on each of which the Tn for n prime to N act as a scalar. Any basis consisting of forms in these subspaces will satisfy the requirements of the proposition. PROOF.
Proposition 51 does not quite give us what we want, because of the restriction that n be prime to N. In order to have an Euler product (as, - for example, in (5.21)), we would also want our basis forms to be eigenforms for Tp for pi N. One way we could ensure this is if we found that the eigenspaces for all of the Tn with n prime to N are each one-dimensional, because we know that the Tp for pi N commute with the Tn and so preserve each eigenspace. Such an assertion is called "multiplicity one", i.e., each set of eigenvalues for the Tn with n prime to N corresponds to only one eigenform in the basis that was constructed in Proposition 51. Multiplicity one does not hold in general; however, it does hold if we restrict our attention to forms which do not come from lower level. We now explain what this means. If di d2 = N and fe (d1 )), then we also have fe Mk (ri (N)) and also g(z) d=e1f(C12 e Mk (fl (N)) (see Proposition 17). The subspace of sk(r, (N)) spanned by the forms obtained in these two ways from forms fe sari (d)) for proper divisors di N is called the space of "old forms". It is not hard to show that all Hecke operators preserve this space. The orthogonal comple-
mk(ri
174
in. Modular Forms
ment to the space of old forms with respect to the Petersson scalar product is called the space of "new forms". In other words, a formfesk (F,(N)) is a new form' if and only if
1. Prove Proposition 31 in cases (iii)—(iv),
I"' = Fo (N), F(N).
2. (a) Let ŒN = 1). We saw in Problem 15 of §III.3 that [ŒN]k preserves Mk (Fo(N)). Prove that the Hecke operator T. commutes with ŒN for n prime to N. (b) Let F =Enodd cr (n)q", 04 = En anqn M2 (F0(4)) (where a the number of ways n can be written as a sum of four squares). Write the matrix of T2 in the basis 04, F, and find a basis of normalized eigenforms for T2. (c) Show that 1 2 does not commute with a4 (see Problem 15(c) in §III.3). (d) Suppose n is odd. Since T. commutes with cx4 and T2, it preserves each eigenspace in part (b) and each eigenspace in Problem 15(c) of §III.3. Show that the operator T. on the two-dimensional space M2 (F0(4)) is simply multiplication by ai (n). (e) Derive the following famous formula (see, for example, Chapter 20 of [Hardy and Wright 1960 ]) for the number of ways n can be written as a sum of four squares:
an
=
8a, (n) 12401 (n0)
for n odd; for n= 2rn0 even, 2,1'n0 .
3. If JE Mk (N, x), show that f(0o) = 0 implies 7„f(00) = 0, but that f(s) = necessarily imply 7"„f(s) = 0 for other cusps s (give an example).
0 does not
4. (a) Show that if feMk (N, x) and g(z) = f(Mz), then gE Mk (MN, x), where 2( is defined by xi(n) = x(n mod N) for ne(ZIMNZ)*. Let T. be the Hecke operator
§5. The modular interpretation, and Hecke operators
175
considered on Mk(N, x), and let Ts' be the Hecke operator considered on Mk (MN, x'). Suppose n is prime tor M. Show that forfeMk (N, x) c: Mk (MN, x') and g(z) = f(Mz)E Mk (MN, X') we have: 7 f= 7;,f ; '
g(z) = (Tf)(Mz).
(b) Let f(z) = (q(z)q(2z)) 8 , g(z) = f(2z) E S8 (1-0 (4)) (see Problem 18(d) of §111.3). Show that for odd n, T„ acts on the two-dimensional space S8 (F0(4)) by multiplication by a scalar. (c) Find the matrix of T2 acting on s8 (r0(4)) in the basis f, g ; then find the basis of normalized eigenforms for all of the Tn .
5. Let f, g E Mk (N, i) with f or g a cusp form. Let AN. Note that Tp = Up . Show that
In this problem one encounters fairly large numbers; for example, the coefficients in part (c) are in Q(V144169). While Proposition 51 guarantees the existence of a basis of normalized eigenforms, it does not guarantee that it will always be easy to find it computationally.
7. Let (L, t) be a modular point for Fi (N). Let a e W(N, {1}, z), and let y e (N). For each a and y, consider the lattice L' = !-ay.L. Show that L' is a lattice which contains L with index n, that L' depends only on the right coset of F'1 (N) in An(N, {1 } , Z) which contains ay, and that t has order N in C/L'. Show that the L' in (5.2) are in one-to-one correspondence with the right cosets of Fi on in An(N, { 1 } , Z). Use this to give another proof of Proposition 43 by comparing (5.9) and (5.27). 8. Let fe Mk (Fo(p)). (a) Show that yo = 1 and yi = (1). -1.;), 0 <j < p, are right coset representatives for F modulo Fo (p). (b) Let Tr( f) be defined by: Tr(f) = liP, 0fl[y j k . Show that Tr(f)eMk(F). (c) If f happens to be in man, then show that Tr(f) = (p + 1)f and Trtff [(p) l )] k) = ki2 7;f
CHAPTER IV
Modular Forms of Half Integer Weight
Let k be a positive odd integer, and let A = (k 1)/2. In this chapter we shall look at modular forms of weight k/2 = A + 1/2, which is not an integer but rather half way between two integers. Roughly speaking, such a modular form f should satisfy Maz + 13)1(cz + d)) = (cz + d)+112f(z) for (ca ,i) in f = SL2 (7L) or some congruence subgroup I' c T. However, such a simple—
minded functional equation leads to inconsistencies (see below), basically because of the possible choice of two branches for the square root. A subtler definition is needed in order to handle the square root properly. One must introduce a quadratic character, corresponding to some quadratic extension of Q. Roughly speaking, because of this required "twist" by a quadratic character, the resulting forms turn out to have interesting relationships to the arithmetic of quadratic fi elds (such as L-series and class numbers). Moreover, recall that the Hasse—Weil L-series for our family of elliptic curves E„ : y2 = x 3 — tex in the congruent number problem involved "twists" by quadratic characters as n varies (see Chapter II). It turns out that the critical values L(E„, 1) for this family of L-series are closely related to certain modular forms of half-integral weight. One classical reason why modular forms were studied is their use in investigating the number of ways of representing an integer by a quadratic form: m =41,_. 1 A fi nin, = [nir A[n], where A = [Ail] is a giiren symmetric matrix, [n] is a column vector and [n]' the corresponding row vector. For example, the number of ways m can be represented as a sum of k squares is equal to the coefficient am in the q-expansion of O": k
q EnJ =7-. Eam es i ek = j=11 i gni2 = 1 n i= n 1 , - - • , nk= — —
co
co
In Chapter III we saw that for k even Ok e Mk/2 (4,Xic—Ii), where x_.. / is the
177
§1. Definitions and examples , /
character modulo 4 defined by x_ 1 (n) = (-741) = (-1)(n -1)12 . More generally, for k even one can use In q tn"Ini to construct modular forms of weight k/2. The properties of modular forms are then useful in studying the number of representations m = [n]fil[n]. For example, in Problem 2 of §III.5 we used the action of the Hecke operators on M2 (1"0 (4)) to derive a simple formula for the_number of ways an integer can be written as a sum of four squares. For more information about the connection between quadratic forms in k variables and modular forms of weight k/2, see, for example, [Gunning 1962] and [Ogg 1969]. It is natural to ask whether a similar theory exists for quadratic forms in an odd number of variables. This would be a theory of modular forms of weight k/2 where k is an odd integer. One would want ek for k odd to be an example of such a form. Early investigators of representability of an integer as a sum of an odd number of squares—Eisenstein, and later G. H. Hardy understood the desirability of such a theory of modular forms of halfintegral weight. But it was only recently—starting with Shimura's 1973 Annals paper—that major advances have been made toward a systematic theory of such forms which rivals in elegance and beauty the much older theory of forms of integral weight. In this chapter we shall first present the basic definitions and elementary properties of forms of half-integral weight, largely following [Shimura 1973a, 1973b], but with slightly different notation. We shall discuss examples in some detail. But when we come to the fundamental theorems of Shimura and Waldspurger, we shall not give the proofs, which go beyond the scope of an introductory text. Rather, we shall motivate those theorems using the examples. Finally, we shall conclude by returning to the congruent number problem and Tunnell's theorem, which we used in Chapter I to motivate our study of elliptic curves.
§1. Definitions and examples We shall always take the branch of the square root having argument in (-7r/2, ir/2]. Thus, .sii. is a holomorphic function on the complex plane with the negative real axis ( — co, 0] removed. It takes positive reals to positive reals, complex numbers in the upper half-plane to the first quadrant, and complex numbers in the lower half-plane to the fourth quadrant. For any integer k, we define zk12 to mean Whenever we have a transformation rule such as f(yz) = (cz + d) kf(z) (where y = (ca jt ), yz = (az + b)I(cz + d)), we call the term (cz + d)k_ the "automorphy factor". It depends on y and on z. That is, an automorphy factor J(y, z) for a nonzero- functionf has the property thatf(yz) = Ay z)f(z) 4 for-z e //And y in some matrix gr-oup. Because ,
178
iv. Modular Forms of Half Integer Weight
f(aliz) _ f(aliz) . Agz) .f(z) f(13z) f(z) it follows that any automorphy factor Ay, z) must satisfy
J(oefl, z) = f(Œ, /3z) J(fl, z).
For example, Ay, z) = cz + d for y = ( ,171)e F = SL 2 (7L) satisfies (1.1). If — 1 = G1 Pi ) is in our matrix group, then J(y, z) must also satisfx: J(— 1, z) = 1. Another example of an automorphy factor is Ay, = j(y, z) = (ff)ed-1 \ cz + d, y = bd )e F0 (4), which is the automorphy factor for 0(z) = qn2 , q = e2iriz (see §1II.4). Suppose that, in defining modular forms of weight k/2 for k an odd integer, we try the most obvious thing: we look for functions f satisfying f(yz) = (cz + d) kl2f(z). However, Ay, = (cz + d ) ci2 cannot be an automorphy factor of a nonzero function for any congruence subgroup. F' c F and any odd integer k. To see this, suppose r f(N), N> 2. Let a = iN_N), ?). Then a, fl E F', and (1.1) would require the k-th power of the following equality of holomorphic functions on H:
AN 2 2N)z +1 — N = \41--Nzl(— Nz + 1) + 1 — N • \I —Nz + I.
N
(1.2)
Clearly, the square of (1.2) holds; thus (1.2) itself holds up to a sign. Since both expressions in the radicals on the right are in the lower half-plane, the right side is the product of two complex numbers in the fourth quadrant; Ne H. but the left side is in the first quadrant, since (N 2 — 2N)z + 1 Hence, (1.2) is off by a factor of — 1, and (1.1), which is the k-th power of (1.2), is also off by —1 if k is odd. Thus, we cannot have Ay, z) = (cz + 0 42 . The natural way out of this difficulty is to force (1.1) to hold by simply defining the automorphy factor J(y, z) to be the k-th power of
j(y, , z) cr:f 0 (y z) I 0 (z) = (-c-
d
1 1 \I cz- + d
for
r
y = (a hi\ c d)
E
o
(4). (1.3)
That is, for a congruence subgroup F' c F0 (4), one defines a modular form of weight k/2 to be a holomorphic function on H which transforms like the k-th power of 0(z) under any fractional linear transformation in I"' (and which is holomorphic at the cusps, in a sense to be explained below). Recall that (fi) is the Legendre symbol, defined for a positive odd prime d as the usual quadratic residue symbol, then for all positive odd d by multiplicativity, and finally for negative odd d as (-) if c> 0 and — (-4-) if c < O. (Also, (4---) = 1.) Further recall that we define ed = 1 if d -a- 1.(mod 4) and Ed = i if dE--_-: — 1 (mod 4). Thus, e j . X - i (d) = (— 1)(d-1)12 (note that this holds for negative as well as positive d). Thus, if we define f(z)i[y] k/2 = j(y, z) kflyz) for y e F0 (4) and k odd, we shall require a modular form of weight k/2 for F' to be fixed by [y] k/2 for y e f'. As in the case of integer weight, we shall want to define [Œ]k/2 for arbitrary matrices a e GL I (0) with rational entries and positive determinant.
179
§1. Definitions and examples \
But since the automorphy factor j(y, z) is defined only for ye r0 (4), we have no preferred branch of the square root for arbitrary a e GLI(0). This circumstance requires us to work with a bigger group than GOO), a group G that contains two "copies" of each a EGLI (G)., one corresponding to each branch of the square root of cz + d. The example of the autornorphy factor Ay, z), which is a square root of (-7i)(cz + d), shows that we also should allow square roots of —(cz + d). Thus, we shall actually define G to be a four-sheeted covering of GOO). We now give the precise definition of the group G. Let Tc C denote the group of fourth roots of unity: T :.--- {,± 1, + i} . We now define G to be the set of all ordered pairs (a, 0(z)), where a = (a, 1:) e GL(Q) and 4)(z) is a holomorphic function on H such that
2 qt)(z) =
i
cz + d
Met a
\
for some t e T2 = { +1 } . Tliat is, for each a e GL() and for each fixe'd t = +1, there are two possible elements (a, +0(z))EG. We define the product of two elements of G as follows:
(1.4)
(al 0(z))(fil P(z)) = (Œ)6, 0(13z)0(z))-
Remark. We could define G in the same way but with T defined to be the group of all roots of unity or even (as in [Shimura 19734) the group of all complex numbers of absolute value 1. However, for our purposes we only need fourth roots of unity. Proposition 1. G
is a group under the operation (1.4).
PROOF. We first check closure, i.e., that the right side of (1.4) belongs to G. If a = (a, :11), fi = (eg f,), we have
(0(f3z)tP(z)) 2 = t i (ctiz + d)t 2 (gz + h)t\i'det a det /3
-=.-- t 1 t2 (c(ez +f) + d(gz + h))/Vdet afl = 1 1 12 ((ce +- dg)z + cf + dh)IVdet a/3,
■
which shows that (4, (P(Az)/i(z))e G. Associativity is immediate. It is also obvious that ( ( ), 1)EG serves as the identity element. Finally, to find an inverse of (a, 4)(z)), where a = (, bd ), we write D. = det a, a' = Da - I = ( cr, -ab), and look for (a( 1 , lit(z)) such that: OW 1 z)0(z) = ,1. That is, we set 0(z) = 110(or'z), and then must check that (a', 1//(z)) e G. But
tit(z) 2 = 1 1 ( ( dzb + d)/) = —1 f:6(—cz + a)/(ad — bc) a t —
.
1 t
4.
a )/„Met a -1
D D
which is of the reqUirVI
,
form. This completes the proof.
o
180
Iv. Modular Forms of Half Integer Weight
We have a homomorphism
P: G which simply projects onto the first part of the pair: (a, c/)(z)))--> a. The kernel of P is the set of all (1, 4)(z)) E G. Since 0(z) 2 = t for (1, 0(2))€G, it follows that 0(z) must be a constant function equal to a fourth root of unity. Thus, if we map T to G by I (1, t),_ we have an exact sequence 1
in other words, the kernal of P is the image of T under t (1, t). Similarly, if a = aa for a E 0, the inverse image of a under P is the set of all pairs (a, t) for le T, since in that case we again find that 4)(z) 2 must» be +1. We let G' = P --- 1(r) be the set of all pairs (a, (z)) e G such that Œ € r = SL 2 (Z). G' is clearly a subgroup. If = (a, 4)(z)) e G, we shall sometimes use the notation z to mean the same as az for z e H. For = (a, 0(z)) e G and any integer k, we define an operator []k/2 on functions J on the upper half-plane H by the rule f(z)
k12 f(ŒZ) 0(Z)
k•
(1.5)
This gives an action of the group G on the space of such functions, i.e., we have (f(z)1F1_Ik12., 6 L.- , 2_11 10 = f(Z) I [1 2]k/2 because of (1.4). L.,, A Now let ry be a subgroup of F0(4). Then j(y, , z) is defined for elements y e (see (1.3)). We define d--ff
r'
(Y, AY, Z ))1Y Er/.
(1.6)
Clearly is a subgroup of G (because of (4.13) in §III.4) which is isomorphic to rt under the projection P. Let L denote the map from T0 (4) to G which takes y to 5./ cfff (71./(79 Z)) e G.
Then P and L are mutually inverse isomorphisms from r0 (4) to r0 (4). We call L a "lifting" or "section" of the projection P. In our notation we are using tildes to denote this lifting from 1T0 (4) to G:
L:
(Y1 .1(Y1 z));
P: 6' .1-(Y z ) i---° Y. Suppose that f(z) is a meromorphic function on H which is invariant under where I"' is a subgroup of finite index in F0 (4). We now []ki2 for Y e describe what it means for f(z) to be meromorphic, holomorphic, or vanish at a cusp seQu tool. We first treat the cusp co. Since r has finite index in F0 (4) and hence in f, its intersection with
± fi
j)1 \O 1/jjz
181
§1. Definitions and examples
'Mil must, be of the form f+ a 'Di} if — 1 e I"' and either {a ini} or 1( — z) if — I rf. (We always take h > O.) Since [ -2-'1 ] 42 is the identity, and j((1) = 1, in all cases f is invariant under h, and so has an expansion in powers of gh e2h. As in the case of integer weight, we say that f is meromorphic at oo if only finitely many negative powers of qh occur, we say thatfis holomorphic at oo if no negative power of qh occurs, and we define f(co) to be the constant term when f is holomorphic at co. , Now suppose that se Q L.) fool. Let s = a co, Œ E F, and let =--- (a, 0(z)) be any element of G 1 which projects to a: P( ) = a. Define rsi = e I ys = sl, and let Gal = It/ e G cx) = co} . We have
i1 j)' {(± ( 0 because Foe ={ +(,1) 1)}, and the 4)(z) for ( ) must be a contant function I, as noted above. Now ir is contained in G' and fixes co, i.e., G. Moreover, P gives an isomorphism from Ç-1-rs' to a' F;a Ç. Since F' has finite index in F, it follows that a-i Fs'a is of one of the forms { + nj}, {(t) 1)-11, or ((— Mil. Thus, for some t E T we have Gsp
1 , = {(-±(i h)
iE z
tY} •
h> 0 is determined by requiring a n to be a generator of Given s ana a-1 1a modulo + 1; then t is determined, since P is one-to-one on e I. i.e., we can find t by applying the lifting map L to + c Proposition 2. The element ( ( ';), t) e Gcol depends only on the r-equivalence class of the cusp s. First suppose that c is replaced by another element (a l 4 (z)) e G 1 such that s = oo. Then e G' fixes co, and so it is of the form t i ). Then (±(,!,
PROOF.
Ofgi (M) -1 (± -1 rg)(M) ) t
(
(-±-(j)
hiM i RG
—1 1)
But (() f), t) commutes with ((4 t), so conjugating by affect(( 1), t). Now suppose that s l = ys = ('2c)oo, where y e F', = (y, that r;1 = yr;y — ', and so f;1 = ,pr;53-1 . Thus,
± GO -
=± and weagain obtain the same (0
)g
does not
z)) E f". Note
±
t). This completes the proof.
182
iv.
Modular Forms of Half Integer Weight
We are now ready to define meromorphicity/holomorphicity/vanishing at Let s = the cusp s of r". Suppose f is invariant under DI/2 for .2. E we have e and set g = fiN142 . Then for any element +
fiEgL i2= f1R1q 2= g..
g [
t)] k/2 :
That is, g is invariant under [(a) I
g(z) = g(z)
h
01
t-kg(z + h). t
Icf2
where r = 0, 14 , 1, or Then e -2nirzlh g(z) is invariant under z 1 * z + h, and so we obtain a Fourier series expansion e'i"filg(z) = Eanq'hi, Write i k = —
nign-1-79/11. g(z) = Eane2
We say that f is meromorphic at s if an = 0 for all but finitely many n < 0, and that f is holomorphic at s if an = 0 for all n < O. Iff is holomorphic at s, g(z). Automatically f(s) = 0 if r 0, since in that we define f(s) case the first term is ao e'z'fit. If r = 0, then f(s) = ao . It is easy to see that these definitions depend only on the F'-equivalence class of s, i.e., g = kI2, where y E f' and c oo = s (see flMic /2 may be replaced by g = f the proof of Proposition 2, and also the proof of Proposition 16 in §III.3). It should be noted, however, that there is some indeterminacy in the value f(s) of a modular form at a cusp. Namely, if we replace e G l by ((4 7), t ig, with t t e T, then the effect is to multiply g = fiM k/2 by CI k . If k/2 is a halfinteger, this may alter the value by a power of i; if kl2 is an odd integer, then f(s) may be defined only up to + I (compare with the proof of Proposition 16); and if k/2 is an even integer, then f(s) is well defined in all cases. Given a cusp s of I"' and an integer k, we say that s is k irregular if r 0, and we say that s is k regular if r = 0, i.e., if t k = I. Thus, iff is holomorphic at the cusps, it automatically vanishes at all k-irregular cusps. Note that whether a given cusp s of F' is k-regular or k-irregular depends only on k modulo 4. That is, if t = ± i, then the cusp is k-irregular unless k/2 is an even integer; if t = 1, then it is k-irregular unless k/2 is an integer; and if t = I, then it is always k-regular. In the case when /02 is an odd integer, the terminology agrees with the definition of regular and irregular cusps in Problem 2 of §1.11.3. -
-
—
Definitions. Let k be any integer, and let F' c F0(4) be a subgroup of finite index. Let f(z) be a meromorphic function on the upper half-plane H which is invariant under [nk/2 for all j) E ff. We say that f(z) is a /nodular function of weight 1(12 for r/ if f is meromorphic at every cusp of I'. We say that such an f(z) is a modular form and write Je Mk/2 (r/), if it is holomorphie on H and at every cusp. We say that a modular form f is a cusp form and write fe Slog"), if it vanishes at every cusp.
183
§1. Definitions and examples
rouso c To (4).
Let x be x) denote the subspace of
Now let N be a positive multiple of 4, so that
a character of (ZPVZ)*. We let Mk/2 (ro (N), -44/2(r (N)) consisting off such that for y = (c" bd) e FAN)
(1.7) fiEflka = x(cOf. We define Ski2 (ro (N), x) = Sql2 (f i (N)) n 402 (f" 0 (N), x). The exact same argument as in the integer weight case shows that
Mk/2 (f'l (N)) = Ox Mk/g0 (MI x). Furthermore, it follows immediately from the definitions that if x i and X2 are Dirichlet characters modulo N, then E Mki/2 ( Ir0 (N),
xi)
(i = 1
5
2)
implies
f1 f2
J J
-
(IC
j+k2)12(f 0( N ) , Z1Z2) .
(1.8)
Notice that for any k e Z, we have Mk/2(fro (N), x) = 0 if x is an odd character, as we see by substituting — 1 for y in (1.7): f[((1 Wko f = x( Of For example, since the trivial character x= 1 is the only even character of (Z/41)*, this means that
Mk/2 (fAl (4)) = Mk/2 (to (4), 1) = mk,2(t0(4))If 4IN, recall that x_i denotes the character modulo N defined by x_ i (n) = ( 1) n-0/2 for n (ZI NZ)* . Notice that in our definitions k is any integer, not necessarily odd. For k even, let us compare the above definitions of Mk/2 (0, Sic/2(t '), Mk/2 (f-0 (N) X), Ski2(to(N), x) with the definitions of Mk/2(F /), S12 (r'), Mk/2 (N, )01 Sk12 (N, x) in Chapter III. First, for any = (a, (z)) e G, cc = (a, ',;), 4i(z) 2 = t(CZ j SO d)/.,./det cc, note that f(z)IM k/2 = 4)(Z)- kfoŒ) = rk/2f(z)i [Œ1k12 that Mk/2 for kl2 e Z differs only by a root of unity from the operator [a]k/2 defined in the last chapter. It immediately follows that for kl2e Z the cusp —
condition defined in the last chapter is equivalent to the cusp condition defined above. There is a slight difference, however, between the condition that f be invariant under [y]k/2 for y = eic r" and the condition that f be invariant under []k/2. Namely, firin-111/2 =i ( y, z) - kf(yz) = (fri)(cz dn -ki2f0z)) (7-61 1 )1q2f [Y]k/2 . Thus, if 102 is odd, then M k12 differs from [y],02 by x_, (d). From this discussion we conclude Proposition 3. Let 41N, kl2e Z. Then Mk 1 2 (f‘O(N) ,
= mk12(1V, eix),
sk1 2 r-0(N), X) = Ski2(N (
It is now not hard to describe the structure of Mki2 (ro(4)). If we have a polynomial P(X1 , , X„,)e C[X, , . . . , X.] and assign "weights" wi to Xi, then we say that a monomial ri Xi has weight w = Z ni wi , and we say that a polynomial P has pure weight w if every monomial which occurs in P with nonzero coefficient has weight w.
184
iv. Modular Forms of Half Integer Weight
Proposition 4. Let 00(z) = , qn2 , F(z) = Z n>0 odd at OW q = e2ni2 Assign weight 1/2 to O and weight 2 to F. Then Mk/2(f0(4)) is the space of all polynomials in C[0, F] having pure weight kl2. PROOF. In Chapter III we found that for k/2 e Z, the space Mko (N, consists of polynomials in 0 and F having pure weight k/2 (see Problems 17-18 of §III.3). This gives Proposition 4 when k/2 E Z. Next, by the definition of [13] k/2 , we have OID1 1/2 = Co for y e f0 (4). 0 is holomorphic at the cusps, because we checked holomorphicity of 0 2 at the cusps, and if O had a singularity at s, so would 0 2 . Thus, Cs € M112 (f0 (4)). It now follows by (1.8) that any polynomial in O and F of pure weight kl2 is in Mk/2 (f0(4)). Conversely, suppose that fe Mk/2 (Î o (4)), where k is odd. Then f 0 is in aPk+1)/4 e2P(0 , F) , M(k + 1)/2 ( r0(4)), and so can be written in the formf where a E C is a constant which equals O if 4)(1 k + 1 and P(0, F) has pure weight (k — 1)/2. Then f OP(0, F) = aF(')1410 e Mk/2 (t0 (4)). But because 0 vanishes at the cusp —1, while F does not, this form satisfies the holomorphicity condition at --12- only if a = O. Thus, f= OP(0, F), and the proof is complete. Corollary. Dim Mk12(f210( 4 )) = 1 + [k/4]. PROBLEMS
1. Let y = '1)e ro (4), and let p = ( ( ), m -14). Check that p e G. (a) Compute pp'. (la) If ye f0 (4m), then y/ = 7)y ?) -1 is in r0(4). Compare piip -1 with .ç)i . Show that they are always the same if m is a perfect square, but that otherwise they differ by a sign for certain y e F0(4m). (
2. Let y =
'De r0(4), and let p = N '14,1i). Check that p E G. (a) Compute (b) Suppose that 41N and y e fo(N). Then yl = (g, 1)y( ,( ,, 1) -1 E ro (N). Compare pijp - i with 55 f .
3. Find h and t for each cusp of r0(4). 4. Let r c f0 (4) be a subgroup of finite index. Let /02 e Z. Show that if c I-, (4), then Mk/2 (f") = Mk/2 (r). Otherwise, let x be the unique nontrivial character of n r 1 (4); show that m„12(r) Mk/2(r', e2 ) in the notation of 011.3 (see the discussion following Proposition 27). 5. (a) Describe Ski2 (r0 (4)), and find its dimension. (b) Show that for k 5, the codimension of Ski2 (t0 (4)) in Mk/2 (r0(4)) is equal to the number of k-regular cusps. (c) Find an element E anqn in S1 312 (r0 (4)) such that a l = I and a2 = 0 (there is only one).
6. Let 4IN, and define xiv by kv (d) = (1) if g.c.d.(N, d) = 1, xN(d) = 0 if g.c.d.(N, d) > 1. (a) Show that xN is a Dirichlet character modulo N.
. §2. Eisenstein series of half integer weight for r0(4)
185
(b) Let p be as in Problem 2 above, let x be an arbitrary Dirichlet character modulo N, and suppose that fe Mkigo(N), z). Show that fj[P]ko e Mk12(f-0 (N), i.x ki ) •
ro (4 ).
7. Find the value of 0 G Milgo (4)) at the three cusps of
§2.
Eisenstein series of half integer weight for
r0(4)
Recall the functions Gk (z) and Ek (z) in Mk (F) that we defined in §I11.2 for k > 4 an even integer. We set Gk (z) = E(niz + n) -k , where the sum is over m, n e Z not both zero; and then Ek (z) was obtained by dividing by Gk (ioo) = Mk). Alternately, we can obtain Ek (z) by the same type of summation Z (mz + n) -k if we sum only over pairs m, n which have no common factor (see Problem 1 of 011.2): -
Ek (z) -_-_-_ E
i
(2.1)
Om + n)k'
where the sum is over m, n e Z with g.c.d.(m, n) = 1 and only one pair taken from (m, n), ( in, n). In §111.2 we obtained the g-expansion —
—
Ek (z) = I —
2k '
1 =....... e 2itiZ
4
k n=1
We now want to give an analogous construction, with k replaced by a half-integer k/2 and F replaced by F0(4). To do this, we give a more intrinsic description of the sum (2.1). Notice that if we complete the row (m, n) to form a matrix y = (4,1 bn) e F, which can be done because g.c.d.(m, n) = 1, then the summand in (2.1) is the reciprocal of the automorphy factor Jk (y,9 z) for functions in Afk (F). That is, such functions f satisfy : f(yz) = .1k (y, z)f(z) for .4(y, z) = (mz + n)". In (2.1) we are summing litik(y, z) over all equivalence classes of y e F, where Yi ?--, y2 if y1 and y2 have the same bottom row up to a sign, i.e., if y2 = + a. Dy1 for some +G; D E For, . In other words, we may regard ik (y, z) as a function on the set Fcc\F of right cosets, and then rewrite (2.1) as follows: E k(Z) =
E Jk (y, zy l 9 y E rcAr
k > 4 even.
(2.2)
It is now possible to give a proof of the invariance of Ek (z) under [yjk for Yi e F which easily generalizes to other such sums of automorphy factors. Namely, note that, by the definition of [Y1]k, we have
Ek(z)I [Yi lc ------- .4 0'19 zY
= Jkoi,
l Ek(vi z)
zri
E Jk(y,
y e roAr
yizr
= E Jktyvi, zr, ye
foAr
because of the general relation (1.1) satisfied by any automorphy factor.
iv. Modular Forms of Half Integer Weight
186
But right multiplication by yl merely rearranges the right cosets F.y. Thus, the last sum is just a rearrangement of the sum in (2.2) for Ek. Since the sum is absolutely convergent for k > 2, we conclude that Ek I [yj k = Ek. We now mimic this procedure for half integral weight k/2 and the group f0 (4). In order to have absolute convergence, we must assume k/2 > 2, i.e.,
k > 5. Note that we obviously have F0 (4). = F. = { + ( - 1)}. Definition. Let
k
be an odd integer,
k > 5.
E
Eko (Z) =
Ay,
z)_ ".
(2.3)
7 e roD\ro( 4)
) e F0 (4) for each As representatives y of F.VE0 (4) we take one matrix (in, n) with 41m, g.c.d. (m , n) = 1 (in particular, n is odd), and we may specify n> 0 so that we get only one of the pairs + (m, n). Thus (a m
b n
a b Ek/2 (Z) = , .i E m,n e Z, 4Im,n> 0, g.c.d.(m,n)= 1 (( m n) zY k Substituting the definition of j(y, z) and noting that (-7:) -k = (v), since k is odd, we obtain Elo (z)
=
E
(—m)Einc(inz
41m, n> 0 g.c.d.(m,n)=1
n
+ n)
2,
k > 5 odd.
(2.4)
mk/2 (r0(4)) now follows by the exact same argument The fact that Ek/2 - — . as in the case of the Eisenstein series of integral weight for 1" which we considered above. Namely, Ek/2(z)i [MI0 = j(yi , z)_ kEk12 (y i z), which just gives a rearrangement of the sum (2.3). Finally, the cusp condition is routine to check, so we shall leave that as an exercise (see below). Unlike the case of 1", where there is only one Eisenstein series of each weight, Ek/2 (z) has a companion Eisenstein series Fki2 (z), defined by Fk/2 = Ek/2
That there are two basic Eisenstein series for each half-integer k/2 is related to the fact that F0 (4) has two regular cusps (see Problem 3 of §1 and the discussion at the very end of §III.5). To see that Fkl2 e Mk/2 (f0(4)), we apply Problem 6 of §1 in the case N = 4, x = 1. To write Fk12 more explicitly, we compute
Fk/2 (z)= (2z) -4/2 Ek/2( — 1/4z ) =
242
(4z)kI2
m)r_k
4Im1 . n> 0 g .c.d .(nz, n)= 1
I
n - n (— ml4z + n) k/2
1 = 2k/2 "S" (M) ek 4-4 n " (— m + 4nz)k12'
187
§2. Eisenstein series of half integer weight for fo (4)
where we have used the fact that n > 0 to write ,i4z \i—m/4z + n = —m + 4nz. We now replace m by —4m, so that the sum becomes a summation over all me/ with g.c.d.(4,n, n) = 1, i.e., n odd and g.c.d.(m, n) = 1. We obtain
1 n J F(nz + m)kI2 • IC k
E m,ne 7
Fic/2(Z) =
n>Oodd
(2.5)
g.c.d.(m,n)=1
We next compute the g-expansions of Ei02 and Fki2 . We proceed much as in the computation of the q-expansion of Eit in §111.2, except that there are added complications. The derivation of the g-expansion for Fie/2 (z) turns out to be slightly less tedious than for Ek12 (z). Hence, we shall give that derivation, and omit the analogous proof of the g-expansion formula for Ek/2 (z). To compute the g-expansion of Fie/2 (z), we start with the following relation, which was proved in Problem 8(c) of §11.4 for any a> 1, Z E H (where we have replaced a, s by z, a here): co 00 6,2nit2. E e—nial2r(a)-1 (2.6) E (z + h) ° (Note: for z e H we define z" = ea log z, where log z denotes the branch having imaginary part in (0, n) for z e H.) Since the sum (2.5) for Fk12 (z) is absolutely convergent, we may order the n I and write m = —j + nh, terms as we please. If we let j =-- 0, 1, . h eZ, we obtain Fk I2(z)
n>0 odd
= 2 —k/2
(nz
E
2-142
0 j
E 472 —k/2
nh) lc/2
h= — co co
E (:)(27tycae—nikliti-r 0 Sj
n> 0 odd
j
n
E
2 ()_1
by (2.6) with z replaced by z fin and a replaced by k/2. Ecit , q = e 2giz where Thus, Fit/2(Z) ir k/2
=
r(
... 1)e
rak14 1(kj2)—
e - 2 nitlin E ,nk n —kj2 E • n> 0 odd -0 <j
(2.7)
We now simplify the expression (2.7) in the important special case when is squarefree. Let m e Z be squarefree. For j > 0 odd we let 26.(f) denote (7). Thus, if j is a prime, it is L(J) which tells us whether j splits, remains prime, orj.amifies in CP(jn) (i.e., this depends on whether xpi(j) equals 1, — 1, or 0, respectively). There is a unique character, also denoted x„„ which has 1 mod 4 and conductor 14ml if m 2 or 3 mod 4 and conductor iml if m which agrees with zni (j) = C9 for j > 0 odd. We can express x„, in terms of the usual Legendre symbol as follows:
188
IV. Modular Forms of Half Integer Weight
if m
E---
1 mod 4;
if m -a-- —1 mod 4; if m = 2m0 za-- 2 mod 4, where we define (51--) = ( —1)( i-1)12 for j odd and (il) = 0 for j even, and (4) = (— 0°2-1)/8 for ] odd and () = 0 for j even. i Note that Z.( — 1) = 1
if m > 0,
and ;Cm( — 1) = —1
if m < 0. (2.8)
Recall that we de fi ne A = (k — 1)/2.
has q-expansion coefficient bl which for I squarefree is equal to L(4_ 1)).1 , I — A) times a factor that depends only on A and on 1 mod 8, but not on I itself. This factor is given in (2.16) below. Proposition 5. The Eisenstein series
Fic/2
We first prove a lemma.
Lemma. Let n = no n?, where no is squarefree. Then for I squarefree, the inner sum in (2.7) vanishes unless n 1 11, and f n i ll it equals
1) ---an(no ,Ino p(ndn i ,
(2.9)
wherits to 0 if n i is not squarefree and equal to Al5bius function (equal (-1)" if n 1 is the product of r distinct primes). First, if n 1 = 1, then (Tia) has conductor n = no . If / has a common factor d with n, then, combining terms which differ by multiples of nld, we see that the inner sum in (2.7) is zero. On the other hand, if lealnlr, then making the change of variables f = —/j leads to (-7,-,E)E (De'afin . This Gauss sum is well known to equal snjti (see, e.g., §5.2 and §5.4 of [Borevich and Shafarevich 1966]). This proves the lemma when n 1 = 1. We next show that the inner sum is zero if g.c.d.(no , n 1 ) = d> 1. Note that PROOF OF LEMMA.
1 (4)
if g.c.d.( j, n) = I,
[0
otherwise;
and it follows that (1,) --z-- (--/--d. 2 ) • Combining terms with j which differ by multiples of nId2 and usingnthe fact that c/2 4'/ (since 1 is squarefree), so that If , e -27ti1( j+j'n1d2 )/n = 0, we find that the inner sum is zero in this case.
189
to(4)
§2. Eisenstein series of half integer weight for
Now suppose that n 1 > 1 is prime to n o . Let p, run through all primes dividing n,: n, = 11/4v. Set qv = pv; and let bv(j) = 0 if pd./ and (5,,( j) = I if pv )fr f. Then (-,4) = (54) II,, (VD. Any JE ZinZ can be written uniquely in the form
j =ion?. + >in/q,
0 -...ily
0 --.io*< no,
Then e
-2niifin
= e — 2nilj olno n e --2ni1Jviciti .
v
Further note that
(- n/-)—- (--no9 n (5,,u) . (-1(2) H Thus, the inner sum in (2.7) becomes
(
E is) e --
2 niljolno n x bv(iv)e
-2niii„m;,.
v j,
Jo no
(2.10)
The first sum is en0 ( ),/n 0 , by the first part of the proof (the case n--= no). The sum inside the product is q.;,—
I
Ee
(4/11 ,1 -1
-27tiliv fte, ___
---, x i.„
iv = o
j„=o
e-27rilp v jv114, .
Here the first sum is zero, because q 4 1. The second sum is also zero, unless ellp' „; this can happen only if q v = p, and p,11. Thus, the expression (2.10) is zero unless each q, = p,, i.e., n i is squarefree, and n 1 11. If n l i/, then each sum inside the product in (2.10) is equal to p„ and we obtain the following expression for the desired sum: —
s„
en
1NInoll( — pv). . v
(Note that eno = sn .) The latter product is u(n 1 )n 1 . This completes the proof 0 of the lemma. We are now ready to compute bi for / squarefree. We replace the inner sum in (2.7) by its value given in the lemma, and sum over all n with a fixed n o . According to the lemma, we obtain
ak/2t ik/2-1
bi
: -= r ( D e nik/4
V k+1
n i l/ and squarefree n i odd n o >0 odd
0
1
-1 (noni) -ki2 fno ,I no11 (n On 1 .
Since enoni = en., the expression inside the summation over no is equal to
snk+1 '
(-
1-1/2 ‘—' no(0 no odd n i ll 1)
,
The sum over n l il is equal to Fledd 0(1 — p -k), since it does not depend on
rv.
190
Modular Forms of Half Integer Weight
no , we can pull it outside the summation over n o . In addition, we use the relation (recall: A = (k 1)/2) E k+i
—
1
•■•••■•
( no
0
— 1 (1' 1)12 — 11 no no no
••■••
no
—1 ) 2
(no I
As a result, we obtain Ir k/2 1 102-1
b=" 1
we
nikI4
n
2,t)
(i
n0 >0 odd and squarefree
p odd
( — 1 ) A ( / no- A. no 0)
(2.11)
The expression G-701)A(54,--,) is what we denoted k_ 02 1 00). As remarked above, these are the values at odd squarefree positive n o of a primitive character x( _ 0,11 of conductor N = 1 if (— 1)A/ 1 mod 4 and conductor N = 41 if (-1) A/ -a- 2 or 3 mod 4. Thus, the sum in (2.11) looks very much like the value at s = ). of the Dirichlet L-series co
s) ci=e, E
Re s > 1.
(2.12)
n=1
The difference between the two sums is that the sum in (2.11) only ranges over all odd squarefree n. But if we write an arbitrary n in the form n = noni with n o squarefree, we see that x( _ 1)),/ (n) = 4_ 0,1. 1 (no) if n 1 is prime to N, and 4- 1),1 1(n)= O if g.c.d.(n, N)> 1, where N = 1 or 4/ is the conductor. First suppose that N = 41. Then, if we were to multiply the sum in (2.11) by n i >0 prime toN
=
n (1 - p - 2A)-1 =
p
N
MA) H
PIN
—
we would obtain the L-series (2.12) with s= A. In other words, we have ,k/2/ k/2 —1 =r " (1)eniki4 LOC(_. 1 )A1 1 A) H (1 — p 22) . b, (2A) [I (1 — AN p#
2
The products cancel, except for p =21N on the left, so we obtain (here we replace k by 22 + 1): L(4-1)Ai,
(2 2)
for
/I)
/ A-1/2
1—
2 -21
it
F( 2 + 4) elti(A/2+ "
(2.13)
(-1) 2/ a-- 2 or 3 mod 4.
If, on the other hand, N = 1, i.e., ( —1 )' /=-_=- 1 mod 4, then to obtain L(4_, ),11 , A) we must multiply the sum in (2.11) by
11 I — 4_1}A/(2)2 —A pf2/
A
p
211
where the first term (1 — x( _ /) ).,(2)2 -2`) -1 is necessary in order to get the terms in (2.12) with n even. Since 1 — 2 -2 ' = (1 + x(w1),2,1(2)2-A)(1
§2. Eisenstein series of half integer weight for
ro (4)
191
we can rewrite the last product as
(1 + x(_..1)A1(2)2") FT 1 (
p . 22) . 1 . -
This leads to the equality b1 =-for
1,(x
a it)
(-1) l ' ((22)
/2-112 1 + 4_1)21 (2)2
7r 2+1/2
f (A +
(2.14)
(— 1)21 ---= 1 mod 4.
The formulas (2.13) and (2.14) express the /-th q-expansion coefficient of Fki2 in terms of the values at positive integers of an L-function and the Riemann zeta-function. Although these formulas include complicatedlooking factors, these factors largely disappear if we use the functional equations for L(4_ 0 2 1 , s) and ((is) to express bi in terms of values at the negative integers. Namely, by the theorem in §11.4 we have (( 22) = ((1 — 22) 702-1/2“12. _ do/1,w. Next, Problems 3(e) and 5(e) of §11.4 give us the functional equation for L(4_ 0 2 / , s), where the functional equation is slightly different for even and odd characters. By (2.8) it follows that x( _ 1)21 is an even character if 2 is even and an odd character if A is odd. We thus have 7.c A-(1/2)
L(X(-1) 111 2) = N )
L(4_ 1)21 , 1 — A)
(2.15) •
r(1.--1-A)/r(12)
if 2 is even; if il is odd.
tr( 1 — 12)/r ( + IA) Substituting these equalities in (2.14) and using the relations r(,1) = raii,m-i- ± 1.1)2À - 1/jr (see (4.4) of §II.4) and r(x)r(1 — x) = 7r/sin 7rx (see (4.3) of 11.4), one obtains (the details will be left to the reader): 2A-1/2 if (— 1)2 1 Er: 1 mod 4; 1 + 4-101(2)2' if (— 1)21 2 or 3 mod 4.
(2.16) Notice that the L-function value depends on the precise value of 1, but all of the other factors are either independent of 1 or else only depend on the value of / modulo 8. This completes the proof of Proposition 5. 0 For fixed A even,as lranges through squarefree positive integers, the even characters x( ... 1)). 1 run through the characters of all real quadratic fields 0(/i). The discriminant D of Q(fi) is D =---. 1 if I ±- :- 1 mod 4 and D = 41
192
Iv.
Modular Forms of Half Integer Weight
if I 2 or 3 mod 4. For fixed A odd, the odd characters 4_ 021 run through the characters corresponding to all imaginary quadratic fields Q(.\/ — 1), whose discriminant is D = —1 if — 1 1 mod 4 and D = —41 if —1 2 or 3 mod 4. Thus, for (— 1) 2/ 1 mod 4, the formula (2.16) expresses the IDJ -th g-expansion coefficient of F24.112 in terms of the value at 1 — 2 of the Lfunction of the quadratic character of conductor ID. By a similar computation, it is not hard to show that for (— 1) 21 2 or 3 mod 4, IDI = 41, the IDI-th g-expansion coefficient is given by titnsi
=
u4i
= 122-1b. =
L(xc_ 1) 4,
1. — A) • 22-1/2 (1 + (— 1) 2i)(1 —22) 1 — 2 -22.
(2.17)
We now look at the q-expansion coefficients of the other Eisenstein series E2+112. By an argument similar to the derivation of (2.16)—(2.17), one shows that for / squarefree the 1-th coefficient at in the g-expansion of Eki2 = is given by al
, t_ 1 + xe._ 021 (2)2 1 -2 if( — 1) 2 / 1 mod 4; = + ( _ 10.4 02 -1/2- „bi if (— 1) 2 / a 2 or 3 mod 4 (2.18) = L(xt_ iy2 1 , 1 — C(1 — 22)
I . 1 + x" ),42)2' if(— 1)21 1 'mod 4;
2 1 + 4_ 1 00)2 -2 1
1
if(— 1)21
1 — 222
2 or 3 mod 4; (2.19)
and that when ( — 1) 2/ 2 or 3 mod 4, so that D = (— 1)241 is the discriminant of a quadratic field, we have
+ (— 1) A 1)2_2(1
2'22)b4,
L(X(
—
1 C(1 — 2.1) 5
-
2f -22
21 _ 2 1-2,1.• (2.20)
Thus, we have found that both Ek/2 and Fic/2 have ID! -th g-expansion coefficient equal to L(x D, 1 — A) times a factor which depends only on and, in the case D = (— a.- I mod 4, on xD(2), which equals 1 if(— 1) 21 1 mod 8 and —1 if(— 1) 2/ 5 mod 8. For now, let us abbreviate x = XD (2). The same is true of any linear combination of Ek/2 and Fk/2 1 SO it should be possible to find a linear combination whose IDI-th q-expansion coefficient is exactly L(x D , I — A). More precisely, we look for coefficients a and fl such that C(I
22)(cxEk/2 + M0)
has }DI-th g-expansion coefficient () DI equal to L(x D , 1 — A) for all discriminants D of real quadratic fields if A is even and of imaginary quadratic
§2. Eisenstein series of half integer weight for
r0 (4)
193
fields if A is odd. The term CO — 22) is inserted for simplicity, in order to cancel the term CO — 22) in the denominators of all q-expansion coefficients of Ek/2 and fi /2 . Thus, we want: for
D = (-1)21 a-- 1 mod 4,
L(xD, 1 — it) = cipi (1 1 ± 21-Âx 1 2' -( "2) (1 = L(XD, 1 — 2) 2 1 + 2 -1 x a -f- 1 + (— 1)ai . 1 for
D = (-1)241,
(-1) 21 -=-:_: 2 or 3 mod 4,
L(XD, 1 — A) = elm - 22
1 22- "/2) 11 = 1-4(XD, 1 — 2)( 21 ' 11 1 221-2A cc + 1 4_ (__ 1)2i . 1 _ 2-2217 . Dividing by L(x 0 , 1 — 2), we obtain a 2 x 2 system of equations in the unknowns a, fl. Because x depends on / modulo 8, it is not a priori clear that the a and f3 for which aalDi + Pip' = L(x D, 1 — 2)/C(1 — 22) will be independent of I. However, when we solve the equations for a and fi, we find that the solution does not actually contain x, and so is independent of I. Namely, we obtain a = 1,
fi = (1 + (-1)202 -2-"12).
We conclude Proposition 6. Let 2 = (k — 1)/2 _,._ 2, and let Ek12, Fki2 e Mi c1 2 (t0 (4) ) be defined by (2.4)-(2.5), Then
ik) 2-102 1-9 2 ‘ 4- (1 ± rk 1 ) e Mk12( t 0 (4)) has the following property: if D = (-1)21 or (-1) 241, 1> 0, is the discriminant of a quadratic field and h) is the corresponding character, then the ID1-th g-expansion coefficient of 1412 is equal to L(x D , 1 — 2). IAl2 ra C( 1 — 22)(Eko
This proposition is due to H. Cohen [1975 ]. (His notation is slightly different from ours.) It can be viewed as a prototype for the theorem of Waldspurger-Tunnell which we shall discuss later. The WaldspurgerTunnel! theorem exhibits a modular form of weight 3- (think of A as being equal to 1 in Proposition 6) for F0(128) such that the square of its n-th g-expansion coefficient is a certain nonzero factor times L(E„, 1), where L(E, s) is the Hasse-Weil L-function of an elliptic curve E (see §I1.5) and E. is the elliptic curve y2 = x3 — n 2 x. Thus, the g-expansion coefficients of this particular form of half integral weight are closely related to all of the critical values 4E., 1) which we encountered in Chapter IL If we were allowed to take 2 = 1 in Proposition 6, then the ID1-th qexpansion coefficient would be L(XD, 0), which differs by a simple nonzero
194
IV. Modular Forms of Half Integer Weight
factor from L(x D , 1) (because of the functional equation), and is essentially equal to the class number of the quadratic imaginary field CP(NA (here = 1 or — 40. We will say more about the case A = 1 below. For another proof of Proposition 6, using so-called "Jacobi forms," see [Eichler and Zagier 1984]. We now discuss some interesting consequences of Proposition 6. Since = any element 1-4/2 e -M - /c/2(T0(4)) can be expressed as a polynomial in - 00 qn2 and F = En> 0 odd al OW (see Proposition 4), it follows that there are formulas expressing L(x D , 1 — .1.) in terms of i (n) and the number of ways n can be expressed as a sum of r squares. We illustrate with the simple case k = 5, /1 = 2. in this case M512 (t0 (4)) is spanned by CO and OF, and a comparison of the constant coefficients and the coefficients of the first power of q shows that H5/2 — 1 100 5 — *OF (see the exercises below). This yields the following proposition. —
Proposition 7. Let sr(n) denote the number of ways n can be written as a sum of r squares thus 0 5 =--- E s5 (n)q". Let D be the discriminant of a real quadratic field, and let XD be the corresponding character. Then 1
L(xD, 20s5(D) 1 = 17l)
D- j2 odd
Many other relations of this sort can be derived, in much the same way as we used the structure of Mk (F) in the exercises in §111.2 to derive identities between various ar (n) . For details, see [Cohen 1975 ]. These relations can be viewed as a generalization to A > 1 of the so-called "class number relations" of Kronecker and Hurwitz. (For a statement and proof of the class number relations, see, for example, Zagier's appendix to [Lang 1976] and his correction following the article [Zagier 1977 ]. ) The class number relations correspond to the case À. = 1, i.e., L(x D , 1 — A) = L(XD, 0). But A = 1 falls outside the range of applicability of Proposition b. To get at the class number relations from the point of view of forms of half integral weight, one must study a more complicated function H3/2 which transforms under F0(4) like a form of weight but which is not analytic. The IDI-th coefficient of H3/2 is essentially the class number of the imaginary quadratic field O(Ji75). The study of 113/2 is due to Zagier [19754 In some sense, the situation with 1/312 is analogous to the situation with E2 in §111.2. In both cases, when we lower the weight just below the minimum weight necessary for absolute convergence, problems arise and we no longer have a true modular form. The study of E2 and H312 is much subtler than that of Ek and 14/2 for k> 2 and k/2 > 2, respectively. But in both cases the Eisenstein series, though they fail to be modular forms, have a special importance: we saw that the functional equation for E2 led to the functional equation for the discriminant form A(z) e S1 2(n ; and H312 has as its coefficients the class numbers of all imaginary quadratic fields.
§2. tisenstein series of half integer weight for
r0 (4)
195
We next examine the quite different question of how the nonsquarefree q=expansion coefficients of Ek12, Flo, and 14/2 can be determined. Again we shall give the details only for Fk12 and shall omit the similar computations
for- Ek12 .We proceed one prime at a time. That is, for each prime p we express bi in terms of bio , where 1 = el° and p2 4". Finally, combining the different primes p, we shall be able to express 1)1 in terms of bio when / = 44 and 10 is squarefree. This information can be conveniently summarized in the form of an Euler product for Er, bioliti- s, as we shall see later. The easiest case is p = 2. Suppose that / = 4v 1o , 44'10 . We return to our earlier computation of 1)1 in (2.7). The general formula (2.7) is valid for all /, not necessarily squarefree. If we replace 1 by 4/ in (2.7), the double sum does not change, because replacing I by 4/ is equivalent to replacing j in the inner sum by 4f, which runs through as j does, since n is odd. Thus, 410-1 9 and so for 1 = 4v/0 we have ba, = = 2(k--2)v bio. (2.21) This case p = 2 can be summarized as follows: for any 10 , co
E
v=o
1
4v2-sv = /71 0
(2.22)
0 1 — 2k -2- s
Comparing coefficients of 2-" on both sides of (2.22) shows that (2.22) is equivaleni to (2.21). (Note that we do not actually need to assume that 44(49 .) Now suppose that p is odd. Let 1 p=2v/o, where p2 V ° . We again use (2.7) and divide into two cases. Case (i ) pi 10 . Let 10 =pio • We write the double sum in (2.7) in the form n> Ø odd Sri, where Sn is the term in (2.7) corresponding to n. Let n = nop', where p2 41 no . We fix n o and look at the sum of the Sn over all n = n0p 21 , h = ct, u 1, 2, . . . . Note that k/2 = no-kap -hk . we 6, = 8,. and nnow evaluate the inner sum overje ilni. First suppose that Ono . As representatives j of i mod ni we choose iop 2h ji no as j0 ranges from 0 to no — 1 and j1 ranges from 0 top 21 — 1. With p fixed we introduce the 'notation
=
if pij;
(2.23)
I if p Then for h > I we have p 2h
no (11) =
5(j) (it
)
;;
6(/1) (k) •
Also, e -21rign =
p I + 2(v-h)
Thus, the term Sn in (2.7) corresponding to n =-- nop
is the product of
Iv.
196
E
k – 142 e no no
(-/0 nn-
—
0
Modular Forms of Half Integer Weight
e –2 7rtiojino — —
Sno-
and -hk
uUl)e-21ciiiroP1+2(v–h).
j 1
1 , e2nii1 pi+2(v-h) _
E
e 2.ii2p
0<j2
0
(where ] 1 = pj 2). The first sum is zero if h> y and it isp 2h if h < v; the second sum is zero if h> y + 1 and it is p 2 11 -1 if h < v ± 1. We conclude that all of the terms in the outer sum in (2.7) corresponding
to n =
no p 21' for fixed n o contribute Sno
v ( 1 + E p –hk( p 2h _ p 2h-1) ..... p
–(
v+1)kp2(v+1)-1
h=1,
= sno (1 _ p i-k) E p V
h(2–k)
h=0
We next suppose that pino , no = pti o . As representatives j of Z mod nZ we choose j
0 ....Ç i i
0 .,../0 < O'
=AP 2h+1 +iirio ',
We have = 11) (i o P)
en) = GI)G1-0)
(
P
Flo
Then S„ is the product of k –k/2 enn
E ,
joP e --2/tijoifiro
o:sio
and
E
0,<,2h-fi
.ii. e-2,,iiiTo p2(vœh) (p
But it is easy to see that for any h the latter sum is zero. Namely, if h < y, then we obtain E li. (k) P = 0; and if h > y, then the sum over any given residue class mod p, i.e., j i = j2 + pj 3 for fixed j2 , is equal to /2 e- 2/rii,ro p2(v -0 E e - 27rii3T0 p 21 \ PJ 0
and the latter sum is zero, because 2(v — h) ± 1. if n is divisible by an odd power of p.
,
< O and p,110 . Thus, Sn = 0
ro (4)
§2. Eisenstein series of half integer weight for
197
We' conclude that for p1/0 we have
bi=
irk/2
r (k) &Eno.
pki2)-1
— p I -k\) L p h( 2-k)
s no > 0 odd, pf no °
h= 0
Here Sn o depends only on 10 and not on / = /0 p2v, because for pit no (.2t_ 2v) 2
E (i) e - 27rilfino o
no
05j
0 jp2vino •
no
0.si<no
(i) -2 ne
E
0
ito Sno.
no
Thus,
bdb,o (00) (k/2)-1
E h(2 — k) = E
p h(k -2)
h=
h=
This relation between bl and 40 can be summarized in the following identity, which is valid for any 10 with pilo , p2 ,11 /0 : co
E
bi0p 2vp
-sv
=
y=o
1 p -s)(1
0 (1
p k- 2-sy
(2.24)
To. see the equivalence of this identity to the formula derived above for b,/b, 0 ; it suffices to expand (1 — p -s) 1 and (1 — pk- 2 -s%) in geometric series and in that way conclude that the coefficient of p the right in (2.24) is p 1(k-2) h=0
Case (ii) 010 . Again we fix no with p2 )(n0 and consider all terms Sn in = op2h5 h = 0, 1, 2, . . the outer sum in (2.7) for which nn First suppose that Ono . We take] = jo p' + 1 n0 as before, define 6(j) by (2.23), and find that (5( o e- 2 7t i j i lop 2(v h)
Snop-hk
Sn
05:7 1
As before, this sum can be rewritten in the form
>2
0<j i < p 2h
e
2 R ii p 2(v — h)
E
."M■
(3,i2
which equals zero if h> y and p2h p2h-1 if h
E h=0
sn0p 2h =
2(v —h)
E p -hk ( p 2h
v. Thus, p 2h-1 ■ )
) .
(2.25)
h=1
Now suppose that pino , no =pri o . At this point we change our notation, replacing ti o by no . That is, we shall determine E h Sn0p 2h+1, where now no is not divisible by p. As before, we set j =i0p2h+1 +j1 n0 , and find that S„, = 0p2k÷i, is equal to the product of where nn
Di. Modular Forms of Half Integer Weight
198 E
( joil
0 .5i0<no ..
n
p – k/2 spkno n 0– ki2
e - 2 nijollno
(2.26)
o
and P
-hk (no)
I0 p
1.) e 2 n —
p
(2.27)
P
0
v h) —1
The latter sum is easily seen to vanish if either h
p –vk (—
10)
(5)
E 0
P
(after the change of variables j = ---10j1 ); and this sum is p2 v times the Gauss sum ep jy, Meanwhile, (2.26) is equal to ppno icn o)k
Hence f r n = n 0 p 2 P +1 we have co
(p_.) sn no ° •
, o
\k (P 10) 2 v irv pp –vk (— Snop 2h+i = pe n Noy —o) –142ikspnoi P
E
(7)
S
• .
h= 0
We check that sp(cpnoien)k(k)(51) = p, no F-." + 1 mod 4. Hence, co
s
E h=0
n oP
2h+1 =
r i by considering the four cases
p v(2–k)– A„ 4(
v
( –
1
)
Li o k-'no i.')' •
Thus, combining this with (2.25), we find that the total contribution to the outer sum in (2.7) of all terms corresponding to n of the form nop 2h or n0 p 2h +1 (for fixed no not divisible by p) is equal to S0 i +
pv(2
2 X( -1
v E p –hk( p 2h _ p 2h-1)
z0(P) +
.
(2.28)
h=1
In the case I = lo , y = 0, this contribution is simply
Sno ( I ± p-Ax(-1),140(P)). Let us temporarily abbreviate x= x( - 1),110 (p). Then we compute that p v(k– 2) l ± p v(2 –k)–A x
E
p –hk( p 2h
p 2h-1)
h=1
no > 0 odd, On o
(1 -1-p - Ax)
Ep h=0
h(k– 2)
v-1
E n0 >0 odd,p4 no
E p h(k- 2)_I ± p –Ax
/1=0
E
Sno
st,
o
199
§2. Eisenstein series of half integer weight for ta(4) ,
By a simple algebraic computation, using x 2 == 1, we find that this last ratio is equal to v-1
...= v n h(k-2) _ zi9 ,1-1 E pha . . 2) bi .. bil =01-- •0hz.• V
(2.29)
h=0
The relations (2.29) for y = 0, co
E v=.0
biop 2vp -sv
1, ... , can be expressed by the identity 7 1 — Ze-i)Ai o (P)P 2-1-s _ p
= 0,00
(2.30)
- si
on the right. as we see in the usual manner by gathering coefficients of p Note that the relation (2.24) that we derived in case (i ) is the same as (2.30), because x( .._ 02 10 (p) = 0 when p110 . Thus, in all cases we have (2.30). _ The next proposition combines the identities (2.30) for all p odd and (2.22) for p = 2.
Proposition 8. Let 10 be a squarefree positive integer, let bi denote the qexpansion coefficient of F2, and set 2= (k — 1)/2. Then co
— E ,, i t, ,-s_ ,, ,,,,,
7 1 =1
b io
2 k-2-s
I
— X(-1)Aio (P)P A-1-s k-2-s) 11 (l _ p -s)(1 oddp
_
(2.31)
PROOF. Let /1 = p vi i • • • p rv r. Let fp (s) denote the factor in the product cotre= (1 _ 2 k-2. -s) --1 ) ,. Then we must show that 401? sponding to p (here f2 (s) equals tol0 times the coefficient of /1- S = p -i svi - • - pr in 4,1 • • • fp r . We use induction on r. For r = 1, this is precisely what (2.22) and (2.30) say. If r> 1 and 12 --= pi l • • • prv_r_.-1 1 , we assume the result for r 1, and then we have (by (2.22) or (2.30) with /0 replaced by /04): —
boipp r
= 4.13 - (coefficient ofp:-"r in fpr ) = blo • (coeff of /2- s in fp , • - • fpr _d(coeff ofp,-"r info)
by the induction assumption. But the product of these two coefficients is This completes the proof. (Compare with the coefficient of /1- s in f,„ • • •fpr. • o the identity (5.8) for Hecke operators in §III.5.) In a similar manner, one can derive identities for the q-expansion coefficients al for E4/2 as / = /0 q varies over integers with fixed squarefree part 10 . We shall only give the result. One again proceeds one prime at a time. For odd p, the same identity holds as for F1112: OD
E ai.,2 v=o
SV
a vp=
I-
1 0 (1
— p - s)(1
—
p k-2- s)
(2.32)
for fixed 10 , p 2 ,1'/0 . But for p = 2 the identity_ turns out to be a little more complicated than for F142 . / The most interesting Eisenstein series for F0 (4) is the linear combination
IV. Modular Forms of Half Integer Weight
200
1/42 = CO — 22) (42 + (1 + ik)2-42F42) in Proposition 6. Let cl be the /-th , an odd prime p, q-expansion coefficient of H42. If I ,....... /02 Vl p2 /If I10for then, because of (2.30) and (2.32), we have the same identity for the ci : co
,.. 1 — x,-iytio (p)/3 2-1.-5 _ -sv E c,o p2vp — .-qo u _ p -s)(1 _ p k-2-s) v=0
(2.33)
In the case p = 2, one has b104v = 2'_ 2 'b10 , and for a104v one can derive formulas expressing a104v in terms of k04v (these formulas are given in [Cohen 1973]). Combining these formulas for the case p = 2, one obtains the following result: if / 0 is a positive integer such that either ( — 1) 2/0 1 mod 4 or else ( — 1)1 0 is four times an integer congruent to 2 or 3 mod 4, then the coefficients cv v, y = 0, 1, 2, . . . , satisfy the same identity (2.33) with p = 2. Thus, under these circumstances the identity (2.33) holds for all primes p, including 2, in the case of Hfc/2. The identities (2.33) for all primes p lead to a relation similar to (2.31), where either /0 is squarefree and (— 1)2/0 =.7..- 1 mod 4, or else /0/4 is squarefree and ( — 1 )A /0/4 a.- 2 or 3 mod 4. It can also be shown (see the problems below) that cl = 0 for I which are not square multiples of such 10 . Since the coefficients el° in these cases are precisely the values L(x N0 A io , 1 — 2), we obtain the following proposition. Proposition 9. The element lik2 e Alk2 (r0 (4)) given by Hki2 = CO — 22) (42 + (1 + ik)2 -1(12F42) has q- expansion coeffkients c l which can be de-
termined by the identity co
1,.1
1 — x,,,A,,,(p)p ' s n (1 _ p—s ---=-- L( x ,_,,,,o , 1 — ,1.) mil, ___ pk-2—sy
)('
(2.34)
where either 10 is squarefree and (— 1) I0 :=.- 1 mod 4, or else 1014 is squarefree and (— 00/4F.--- 2 or 3 mod 4. If I is not a square multiple of such an 10 , then ci = O. Finally, co = ((1 — 2.1.). Thus, there are two very different elegant properties satisfied by the q-expansion coefficients of Hko. First, the coefficients corresponding to discriminants of quadratic fields are the values of the L-function for the corresponding quadratic character at the fixed negative integer I. — 2 = (3 — k)/2. Second, for a fixed discriminant, the coefficients ci with la square multiple of that discriminant satisfy an Euler product identity. Both properties are closely analogous to the results about forms of half integral weight which we shall need later to describe Tunnell's work on the congruent number problem. The first property of N2 is parallel to Waldspurger's theorem, asserting the existence of a modular form of half integral weight whose / 0-th q-expansion coefficients are closely related to L(Elo , 1). The second property is an example of a very general correspon-
r0(4)
§2. Eisenstein series of half integer weight for
201
dense due to Shimura [1973a] between forms of half integral weight and forms of integral weight. Let us look again at the Euler product (2.34) for 142 . The only part that depends on 10 is the numerator 1 — 4_ 1)210 (p)p 2-1- s. The remaining part of the Euler product
o ___ p - s)-1 n
p k-2-sy1
depends only on 1412 and not on the choice of 10 . Observe that this is an Euler product we encountered before: it is the Euler product relating all the coefficients of the normalized Eisenstein series
Bk-1 E 2(k — 1) k-1
Bk-1 2(k — 1) +
co
n=1
ak-2(11)qn
(see Problem 16 in § 11 1.3). We say that 14/2 e Mk/2 (f'0 (4))' corresponds to 1)))Ek _ l e Mk_i (F) under the Shimura map. (—B k _ i /(2(k We shall discuss the Shimura map in more detail later. The Shimura map applies to forms of half integer weight which have Euler products similar to (2.34). As in the case of forms of integral weight, Euler products arise from the property of being an eigenform for Hecke operators. But when we work with half integral weight, it turns out that we only have Hecke operators Tn2 whose index is a perfect square; all other T„ are identically zero. This is why, in the case of half integral weight, we get a weaker type of Euler product, which only connects coefficients whose indices differ by a perfect square multiple. Hecke operators on forms of half integral weight are the subject of the next section. PROBLEMS
1. Verify the cusp condition for 42, and find the value of E42 and Fk/2 at all three cusps of F0 (4). 2. Derive (2.16) from (2.15). 3. Derive (2.17). 4. Check that the constants a and fl given in the proof of Proposition 6 are the solutions to the two equations giving () Di =-- L(x D , 1 — 2) in all cases. 5. Suppose that we looked for a linear combination ŒE 2 /3F42 whose 1-th q-expansion coefficient for / squarefree is equal to L(x( . . 0.1./ , 1 — .1). (That is, when ( 2 or 3 mod 4, this L-function value is cl rather than c41 .) Show that no linear combination has this property for all such I. —
6. Show that if(— 1)A 1 2 or 3 mod 4, then c1 7---- 0 in the q-expansion of 14/2 • (Noie. 1f (f0 (4)) denotes the subspace of Mk2(r0(4)) consisting of forms whose q-expan:, sion coefficients satisfy this property, then it has been shown: [Kohnen 1980] that the Shimura map gives an isomorphism of Aik";-2 (f-0(4)) with 114'k _ I (r).)
It.
202
Modular Forms of Half Integer Weight
7. What is the constant term in 14/2 ? What is the coefficient of the first power of q in 14/2 ? When is this latter coefficient zero? 8. Show that H5/2 = The' — *OF, and show how this gives Proposition 7.. 9. Show that for k > 5 odd and for suitable a and b, we have e k — aE1.i2 eSk12 (1 0 (4)). Find a and b. Express 0 5 and 07 in terms of 42 and Fic/2.
—
bFki2
§3. Hecke operators on forms of half integer weight We first recall how the Hecke operator Ti, can be defined on forms of integral weight by means of double cosets. For simplicity, we review the case of the full modular group F. For n a positive integer and Je Mk (F) we can define Tif as follows. Lei A" be the set of all 2 x 2—matrices with integer entries and determinant n. For any double coset Far c A", where a e A", we define fl[FoeF]k = f I Pryillk ,çwhere the sum is over all right cosets Fayi c Fa r; equivalently, yi runs through a complete set of right coset representatives of F modulo a-l ra n F (see Proposition 41 in §III.5). Then Trift:f n(42 "
Ef 1 [rank )
where the sum is over all double cosets of F in A". There are not many double cosets of F in A"; in fact, if n is squarefree there is only one. More precisely, we have the following proposition. ,
Proposition 10. A complete set of double coset representatives of F in A" is {(n4 :no» , where n o , n 1 run through at/positive integers such that n = non. In, particular, if n is squarefree, -then An = Fa °O F. On the other hand, if
n = p 2 is the square of a prime, then AP2 = F(01 p°2)F u pi" (where p( (; 1 Pp bc)) . PROOF. Consider the abelian group Z 2 with standard basis generators e l = (i), e 2 = (?). Any matrix a e A" gives a subgroup of index n in Z 2 , denoted Œ1 2 , namely, the span of ow l and ae2 , the columns of a. Conversely, any subgroup of index n in Z 2 can be obtained as aZ2 for some Œ. Now fix any a e A. By the elementary divisor theorem (see, e.g., [Van der Waerden 1970, Vol. 2, p. 4]) , there exists a basis e'1 , e2' of 1 2 such that the subgroup a 1 2 is the span of n 1 e'1 and n 1 n0 e'2 for some positive integers n o , n 1 with n = non?. Let y1 E F be the change of basis matrix from e l , e 2 to e'1 , e'2 , and let y2-1 cf be the change of basis matrix from ae l , ae2 to n i el, n 1 n0 e'2 . Thus, acy2-1 = [n 1 e'1 , n 1 n0 e] = h(''d ni°no), i.e.1 / = Yi( nd n0)Y2 E F(nd ,7 °,0)F. Since aced" is arbitrary, this proves that the indicated double cosets exhaust A. Conversely, it is easy to see that these double cosets are disjoint: in fact, a e F(nj ,,°„)F = n 1 F( if and only if n 1 is the greatest common divisor
,vr
§3. Hecke operators on forms of half integer weight
203
of all entries in the matrix Œ. This is because, if a = Ti a'72 with Ti., y2 G F, then dial! entries of a' implies dial! entries of a; co'nversely, because a = yi-l ay2-1 , it follows that diall entries of a, implie diall entries of a'. This 0 completes the proof. In §III.5 we discussed the Hecke operators on forms of integral weight Z) for the congruence subgroup r 1 (N). In that case one lets A" = An(N, (see (5.23) in §III.5), i.e., An is the set of matrices of determinant n which are congruent to ( ) modulo N. For fe Mk (Fl (N)) one defines Tn f d= ef n(42)-1
y
[ri (Mari (N)ik,
where the sum is over all double cosets of f1 (N)in A". Ifg.c.d.(n, N) = 1, then a complete set of double coset representatives of 1" i (N) in A" = A"(N, {1 } , Z) is an 1 (:Go ni°01, where n_:=01._ ( n1 Ion1 1are n0 ) such that an, as in Proposition 10 and ani is a fixed element of F modulo N.
\ Proposition 11.
PROOF. If a e A", we know by Proposition 10 that a = n iY
) Y2' where Ti ' y2 e r and n is the greatest common divisor of the entries of Œ. We must show that a can be written in the form n 1 y 1 ( n00 »21 with y, y2/ e ri (N), = 'y'o e F i (N). We or equivalently, in the form n l o-,,y 1( 01- Z)y2' with suppose that n 1 = 1, i.e., n = no ; the general case is completely similar. Lemma. If g.c.d.(n, N) = 1, then a set {ti } of right coset representatives for F modulo (N) can be chosen so that -E.; E Fo(n).
r,
F be any coset representative. We look for e Fi (N) such that T1 = ay 1)t E To (n), in which case we merely replace -r by f. It is easy to see that there exist u and y relatively prime such that n divides au + cv. Let y = u + (j1 + j2 N)n, ô = y + In, where . ./ 1 and I are chosen so that u + ji nE--- 0 mod N, y + ln -a- 1 mod N (this is possible because g.c.d.(n, N) = 1). If we show , that j2 can be chosen so that y and 45 F1 (N) and Cy ID:r G are relatively prime, then we can find a, fi such that ( yt Fo (n), because ya + k au + cv 0 mod n. But if u + jl n + j2 Nn and + ln have a common factor, we first note that such a divisor must be prime to N (since y + ln 1 mod N) and alsdi prime to n (since g.c.d.(u, y) Then if P is the product of all prime divisors of y + in not dividing Nn, 1 mod P. In this way we can fi nd we can find j2 such that u + ji n + j2 Nn the required f2 . We now return to the proof of the proposition. We have ja = n° )T2 with Vi, y2 e F. Using the lemma, we write Ti e F in the form y = where e F0 (n) and )4 e r1 (N). Since e Fo (n), it follows that a. ony l y;:( on) c f. We write PROOF OF LEMMA . Let
T = Ga E
ni.
204
Modular Forms of Half Integer Weight
,
a = Yi ( 10
n) (01 °,1 )-1
o
n) Y2 = Yi (10 O
CO n0 ) Y '
where y2' = ( )_1 y'( ?i)y2 € I". It remains to show that y2' E r1 (N). But since a E A" is congruent to ( ) modulo N, we have modulo N _a (1 zir) (1 0) y,2 (1 *) y2, V)
\O n)
I)0 nj
from which it immediately follows that y'2 E ri (N). This completes the proof. 0 We now look at the analogous construction for forms of half integer weight. Recall that in that case we must work with the groups to(4) cz G' c G, where
G = {(Œ, 4)(z))1 0( = 41 1)\ dr GLI(Q) ' ct•(z) 2 = t(cz ± d)[\/det a for some t = +11
G 1 = f(a, 0(z))EG la ell;
T0(4) = loc,./(a, z))I(x =-' (ac bd)e fo(4), j(Œ, z) = ( c )ed-l \lcz + df. Suppose that 41N and fe M42 (r1 (N)). Let n be any positive integer prime to N. In the case of integer weight we defined T„ by considering double cosets of the form I", (Mani ( . „(1)Fi (N), where n= non?. So for half integer weight one considers double cosets of the form rl (N)c n 1 6,norl (N), where ), tnn for some t = +1, +i. keG is any lifting of (1,- Z), i.e., t20 = Since t would turn out only to affect our definition by the constant multiple tic, for simplicity we agree always to take t = 1. Also for simplicity we consider the case n 1 = 1, no = n; the general case is completely similar. So we now examine the action of the double coset f",(N) n ri (N) onf, where
(a) n°0
fl=
WO
nl' ).
-
That is, we compute
,- f E fiR:iOW, fir , usign ri (N)],„2 ;t-
(3.1)
i
where the sum is over all distinct right cosets of tl (N) in our double coset, i.e., over a set N of right coset representatives of ti (N) modulo r" = Writ (Ngn n r' 1 (N) (see Proposition 41 in §1II.5).
Proposition 12. If n is a positive integer prime to square, then fia" i (Ngn i; l (Nnki2 = O.
N which is not
a perfect
205
§3. Hecke operators on forms of half integer weight
Given a e GL(0) and = (a, 0)e G, we construct a map from + il in the following way. Given a-1 f1 (N)a n F1 (N) to T = {±1, r= clef y = a'y l a with y, y/ e fl (N), observe that •p. and -..1 'pi e Gl both have the same projection in f, and so they differ by an element of the form (a) ?), 01 i.e., PROOF.
For fixed a and , we consider the map that associates to y the number t. One verifies (see the exercises below) that t(y) is a group homomorphism from I"' to T, that it depends only on a and not on the choice of 0(z) in = (a, 0), and that in the case a = ( ) we have
n
for y =
a
(
b)
c d
E I' = a-l ri (Ma nr ,(N).
(3.2)
Let K c rf denote the kernel of this map t. Recall the definition t" =-- VP, (Ng, n t l (N). We claim that k = i.e., if Ile rI (N) is of the form W:)).1 „ with y1 E fr(N), then y e K, and conversely. To see this, first suppose that y, yi E ri(N) and 55 = ç. Applying the projection P: G -- GL (Q) gives y = a ?Ph ( °„), i.e., ye F' = Œ'f 1 (N)Œ r r, (N). Since V•P", ,,, = 5) =. Al, t), it follows that y e Ker t. Conversely, if yeK c r/, so that y = a-i- yi a and ,;-" l i)j, ,,, =---- 5i(1, t) with t = 1, it immediately follows that 5-le Thus, in general, "t" = 1 ? is smaller than tt, i.e., the intersection of V r i (Ngn with ri (N) is a subgroup of the lifting of 1' = a -1 F1 (N)an ri (N). This subgroup t" is all of t- - / if and only if the map t is trivial. In our case t(y) = (0, so that t is trivial if and only if n is a perfect square. (We are always assuming here that n is prime to N; the case n = PIN is treated in Problem 3 below.) If n is not a perfect square, then t" is a subgroup of index 2 in ft. In that case let ft = f" u fA"I be a right coset decomposition; thus, 't = a-l t i a and i- = tT i ti t, • (1, 1). Let r1 (N) =Ui r")); be a right coset decomposition of I-, (N) modulo r. Then —
= U it" 'Tli u u r"-t-55 is a right coset decomposition of t1 (N) modulo r" = cif-, (Ng, n By the definition and Proposition 41 of §III.3, we have fiCr i(N)nr i(N)ika=
r, (N).
E fl Rnaci2 ± E fl Ri :qiiki2ft
But for each ./ we have ,
fiRn:q j1142 = f 1Rnt •-7- 1 111-01c/2 = flEti( 1 , — 1 )nM1q2 = fl[(', —ign .Piik121
because f is invariant under {t1L2 for t i E ri (N). Since [ (1 ,
—
1
)]k/2
-
Iv. Modular Forms of Half Integer Weight
206 (-1)k = —1 by definition, we have
fiR.Mk12 + fiRit-Mki2 = fiMiki2 — f I[fl:P i]ki2 = O.
o
This completes the proof of Proposition 12.
The same argument will show that flag' 1(N)5-7,1„0J1 ,,. 7 )142 = 0 if no is a positive nonsquare integer prime to N. Because of Proposition 12, we can only work with a Hecke operator on Mk/2 (r i (N)) of index prime to N if that index is a perfect square n2 ; otherwise, the Hecke operator is identically zero. Recall that the building blocks for the Hecke operators in the integer weight case are the Tp for p a prime (see Proposition 32 of §III.5). Similarly, in the half integer weight case, the bu- ilding blocks for the T2 , g.c.d.(n, N)= 1, are the T2, ON. So let us examine in detail the action of Tp2 on Mk/2 (r 1 (N)), where Tp 2 is defined on fe Mki2 (ri (N)) in the following way: Tp2 ffi.-7;.-f p(ki 2) - 2f 1 [r i (N)
p2 ri (Nnk/2 , where
p2 = ((ol :2) 5
VP ' ) (3 . 3) This definition is not quite the immediate analog of Tp2 acting on fe Mk (Fi (N)), which is given by pk- 2 (f 1 [ri (N)(01 :2 )1”i (N)] k + x(p)f) for fe Mk (N, x), because P2 (N, {1},Z) = F1 (N)( 2)F1 (N) u F1 (N)o-p(g p° ) • r 1 (N) by Proposition 11, and fi{o -p(g °p) ri (N)] k 7--- x(p)f. Besides replacing k by /c/2 and F1 (N) and (01 p°2) by their liftings ti (N) and 42 , we also drop the trivial double coset r l(N ) ap(g 7))1"1(N). In the case of integer weight, when Tp and not Tp 2 is the building block, T,,2 involves two double cosets; but for half integer weight we shall agree to take only the nontrivial double
coset. As in the case of integer weight, in studying Mk12 ( r1 (N)) it is convenient to decompose it into x .-components. If x is any Dirichlet character modulo N, recall from §1 that
Mk/2(ro(N), X) d-Ti tfe Mico (ri (N))1 f1D1, 12 = x(d)f for all
y=(
a b ) G f 0 001; c cl
and that
Mg" i(N)) = 0,(Mka(r4 (An x), where the sum is over all even Dirichlet characters modulo N (of course, Mk/2 (to (N), x) = 0 if x is an odd character, since f = fi[-1],c12 = x( — 1)f). By an argument which is completely analogous to that in the integer weight case, one can show that Tp2 takes Mk/2 (r1 (N)) to itself; moreover (see Problem 5 below), it preserves the x-component. We now fix a Dirichlet character x modulo N, suppose thatfE Mki2(f" o(N),
207
§3.,,Hecke operators on forms of half integer weight
z), and evaluate (3.3) explicitly. /Our purpose is to express the Fourier coefficients for Tp 2 f in terms of those forf By the lemma in the proof of Proposition 43 in §111.5, we know that AP 2 (N, {1}, Z) is the disjoint union of the right cosets of 1'1 (N) with representatives
(1 b 0 132 (p h)
flh
=
6p
0 < b < p2 ;
for
p
2
for
0 < h < p;
and
T= a
2
(P
01).
0
Here for n prime to N, un denotes a fixed element of r such that ccn ( 1 ," mod N. The trivial double coset is the right coset corresponding to flo ; thus, we have the disjoint union
p2 -1
1 0 r i (N)( ) o 132
b=0
p-1
r AN)Œbu
(N)sh u r i ( N)T. U ri h=1
In order to evaluate (3.3) using the definition (3.1), we need the right coset decomposition of r1 (N)42r 1 (N) in the form Ui (N)4255; . If we could write each ab , A„ T in the form ( 01 ;2)y, where y E (N), then the y would be right coset representatives of 1"1 (N) modulo (01 :2) - 'r1 (N)(0 :2) n r, (N), and so the corresponding liftings )-; would be right coset representatives of t1 (N) modulo ;3-21 Pi (N)2 n f,(N) (see Problem 1(b) below). In that case, (3.3) is equal to Ey fiR p2A k12 . Here we may adjust ab , filh , or T by multiplying on the left by any c f1 (N)—thismerely replaces one right coset representative of f 1 (N)in f1 (N)(0' 2 )f 1 (N) by another representative of the same coset. In other words, it suffices to write ab , flh , in the form y'(ol :2)y, where y, y' E f (N), and then compute Tp 2 f(z) _(k/2)-2 E f(2 (3.4) ,
.) 1 [42 ,n ki2
Y
The result will be the following proposition. We shall go through the detailed computation of Tp 2 f after the statement of the proposition.
Proposition 13. Suppose that 41N, x is a Dirichlet character modulo N, pN aone 2nit E is a prime, and k = 22 + l is a positive odd integer. Let f(z)= 42
(t0 (N), x). Then CO
Tp 2 f(z)
=
E be 2ginz n=0
where bn
z
ap 2„ + Xk
(here we take an/p2 = 0 if p2 ,1'n).
1) 1111) p -1 ,7n
ri2)pk- 2 a n ip 2 ,e 1 It‘lu
(3.5)
208
Iv.
Modular Forms of Half Integer Weight
As explained above, we must write each ab , flu , T in the form y'( p°2)7 with y, )1' E ri (M. First, ab = Q) :2) = (4) :2) (11:■ 7.); i.e., for ab we can take y' = 1, y = ( ). Thus, the contribution to (3.4) from all of the ab is equal to p2 --i oi po2 ,,tœp 01 bi 5 1 PROOF.
f(z) I R
p042)-2 E
b= 0
= p ( Ic/2 ) - 2 p2E-1
f (Z
P
b=0 1 P2-1 /Z
= -Pi 1)=0
+ 2
1
ki2
)
(jo -k
+b P
2)'
Since p2
E
1
elny in(z+b p2
b=0
0
2
p 2e 2ninz/p
if pqn ; if p 2
we see that this contribution is E n ap2„e2'. Thus, the ab give the first term on the right in (3.5). We next evaluate the contribution of T. Since g.c.d.(p 2 , N) = 1, we can find two integers u, y such that up2 vN = 1. Then we have = T = crp2( 0
1)
2 (p 2 — [1 0 p 2u P if u o p 2)—N
t)
)
i.e., we take y' = Gp2y" with y" =( 7), andy = (ifki We have
7) ; then y', y E r, (N).
f(Z) f [42 142 = f(Z) f [ap2 j)" 42 n142
z ))
= X(P 2 )AZ)I
al -
0\ p2) 5 VP)
(Y5 AY ,
A simple computation of the product inside [ ] 42 gives ((P02 7), p -1/2). Thus, the contribution to (3.4) from t is equal to
p ac12)- 2x ( p 2)AZ)
KPO2
°) 5 - 1/2] 1 P
1q2
—P =P
)P
k-2
AP 2 )
ane
z) 2ninp 2 z
•
Hence, T gives the third term on the right in (3.5). Finally, we evaluate the contribution of the Ph , 0 < h
I-1)(a br (1 0\_ 1 = p Ac c 1)
p2
h)( d —bIp 2\ er;ow). alp2
Clearly pia, so we write a = pa', c = Nc'. Thus, we need —blp
half, e Z.
209
§3. Hecke operators on forms of half integer weight
So first choose any integer a' prime to p such that a = pa' -_=-:. 1 (mod N). Then any C. D e I" with a = pa', c = Nc' will be in r, (N). Next choose any b prime - to a' such that bla---. ha' (mod p) (this is clearly possible). Since g.c.d.(Nb, pa') = 1, we can find c' and d such that dpa' — c'Nb = 1. Thus, C D t--- (rvac. j) e Ti (N), and
- 0 li _ (pd — hc (a' h — NIla (I 0) (pa' O p) — —pc a' )V1 p 2) c dP which we denote 'y"( )y. As in the previous paragraph, where we evaluated the contribution of T, we obtain
f(z) f [42 A kf2 = x(p)f(z) I by", AY", z)) ' (( 01 ° p2 )' Jo) ' ( y, AY, z))] kl 2 We compute
((pd — he (a' h — b)/) a
— Pc
„
(—
pc)
_i N./ —pcz +
a')
2) , f-p) . ((p c:' b d) , 0 8 ; 1 \Ic z + d) . ((1° : Upd — he (a'h — b)p) (—pc) _ i a ,p2 , \\ —pc =
Z
± a)
. ((pa' b\ t c . . _ cd )' O gdi 'lcz+cl) . ((p h ,
a, Op) (c)(— Pc )
Since dpa' — c' Nb = (h) = GO. Thus,
d (1
1 1)
1, we check that Zes i8d = 8p(*)(— 1)(a'-1)(p-1)/49 and that
a ' )/8as8d= ( -19c
(
( 7;1'1 ) ( ;1() (cPVEP ( —all)
—
..= ___ i b hc , 6p 6p I P P since b F:=- ha' (mod p); but be -z--: — 1 (mod p), and so we finally obtain ..„.
((po p) h , s;1 (—h p )) Thus, the contribution of all of the Ph to ( 1 4) is equal to P—1
19(k/2)— 2 X( P)
f(z) E R(1:`, " hP ' ciP h= 1
= P (k/2)-2 X(P)
)
(----; )4 i131 .1 (p11) f
( --„h))]
(;
F
±
142
IV. Modular Forms of Half Integer Weight
210 Now
Q° _n ane 2ninz , c ane 2ninzY (1_1) e 2ninhip = ep \fr) E 1.1 ( 111) f (z + II) = i n=0(P) h=1 \P) h=1 \Pi \ Pi n=0 where we have made the usual change of variables (replacing nh by h) and used the value Ep,Fp for the Gauss sum. Thus, we obtain p(142)- 2x (p\
(''''.
1 ) c k+1 _if) i
n an e2ninz ......
n=O P = pl .-11(p) ey i ( L1 ) ane2ninz . /
P
P
P
n=0
-,
P
So the contribution to the n-th coefficient of Tp 2 f(z) is p' -1 x(p)(fl,' Ân)an . This is the middle term on the right in (3.5). The proof of Proposition 13 0 is complete. We note that it can be shown (see Problem 3 below) that the formula in Proposition 13 also holds when p(N, in which case x(p) = x(p 2) =0, so we have simply ton = ap2n . We further note that, as in the case of integer weight, one can show that the different Hecke operators 1n 2 commute; that T„2.2 = 7 n 2 Tm 2 when g.c.d.(m, n ) = I; and that Tp 2v is a polynomial in T2. Thus, the operators T1 2 for different p generate the algebra of operators C[{7.21,7= 1 ]. In the case of integer weight, we applied a formula analogous to (3.5) in the case when we have a modular form which happens to be an eigenfunction for all of the Hecke operators. The result was a formula for the ratio of an to a l , which can be written in the form E an = a l . (Euler product). (See Propositions 36 and 40 in §III.5.) In the case of half integer weight, we can consider modular forms which happen to be eigenfunctions for all of the Hecke operators. But since only the 7,i2 are nontrivial, we only obtain a formula for the ratio of avi to ale i.e., we can relate coefficients whose indices differ by a perfect square _factor. As in the integer weight case (see the end of §III.5), the spaces Alki2(r0(N), x have a basis of eigenforms for all Hecke operators of index prime to N; and certain important subspaces have a basis of eigenforms for all of the Hecke operators, i.e., for Tn2 when g.c.d.(n, N) = I and for Tp when piN. Thus, let us now suppose that fe A1142 (1T o (N), x) is an eigenform for all of , the Hecke operators 7 n2. )
Proposition 14. Let f(z) =E n _, o ane2 z e Mki2 (r0 (N), x) be an eigenform for all of the Hecke operators Tp 2. Let Ap be the corresponding eigenvalue, i.e., Tp 2 f = Apf. Suppose that 10 is not divisible by any square prime to N (i.e., p 2 1/0 only ifpiN). Then
211
§3. Hecke operators on forms of half integer weight
1
co
E avi i i- s = al° 1 1 =1
_ x(p) ((— 1 0) )- -5 i P' P )
ri i _ Apir
1
s +z(p 2 ) „k_2_2s.
(3.6)
all p I
,
of the letter A to denote (k — 1)/2 and its use with a subscript to denote an eigenvalue should not cause confusion.) (Note: The use
PROOF. If Tp 2 f .-----
Ap f with
p,' N, then (3.5) gives for any /1 prime to p:
A paioli = ate1 2 p 2 + p'lx(p)
(1)10) (— A
A p aioli p 2v = a1ot2 p 2(v+1) + pA-1 x(p) /-•
P ((-
(3.7)
ai. 01 2; 1
!Y'0
aio ti p2v
P )
+ P16-2 X(P 2 )a1oq p 2(v-1), (3.8)
V = 1, 2, . . . . Ifp1N, then we need not assume that 1 1 is prime to we have the same relations (3.7)—(3.8) in all cases, with only the first term on the right nonzero when pIN. On the other hand, if we look at the terms in (3.6) corresponding to all I I which differ by a power of p, i.e., if we consider atolip 2v(i l pys for fixed 1 / prime to p (if p,tN) and variable y = 0, 1, 2, . . . , and if we set X = p - s, we find that (3.6) is formally equivalent to the following set of identities for all p and all li prime to p (if p)(N): co
1 — X(P)( -1Y1° )PA-1 X
...... ,, 1011 1 _ ilpix,+ E aio,ip.,.n.yv — v=0
L2)pk-2x2
-
(3.9)
But when we multiply both sides of (3.9) by the denominator 1 — /lpX ± z(p2)pk - 2 A.., and 2 compare coefficients of Xv+', we obtain (3.7) for y = 0 and (3.8) for y = 1, 2, . . . . Thus, (3.9) holds, and we have established (3.6). o Proposition 14 explains the appearance of Euler products of the type we found in the last section (compare (3.6) with (2.31)). In the next section, we start by formulating Shimura's theorem, which gives a deeper significance to the Euler product in Proposition 14. The Euler product (3.6) turns out to be closely related to a Euler product for a modular form of integral weight k — 1. PROBLEMS
1. Let c = (a, 0(z))e G, let F' c ro (4 ) be a congruence subgroup, and let F" = F' n ( 1 F'ac. For y e r" define 1(y) by the relation 5-C 1 = "Iji ' (1, 1(Y)), where Vi = ŒYŒ -1 (a) Show that the map t depends only on a and not on the 4)(z) in . Prove that t is from f" to T. a group homomorphism , ... (b) As usual, let denote the lifting of an element or subgroup of F0 (4) to G, i.e.,
. w. Modular Forms of Half Integer Weight
212
1) = (y, j(y, z)) for y E ro(4), t-4/ = {5ly E r} for r c r0(4). Let K c r" be the kernel of t. Show that ie = f/r\r'fAl. Thus, if t is trivial, then y'--q gives an f-e. isomorphism from f" to f " = re (-1 (c) Prove that for fEM142 (r"), if t is nontrivial, then f irlz ' elko = O. -Recall that f l[fy ] w = Eifi[Oi] 2 , where the sum is over all yi such that fAle't = fe0; (disjoint union). .
LI/
a-
2. Prove that /(y) = (i) for a = (), y = (a, :L1) E ro (4)
F0 (4)a.
3. (a) For 1- '---4-- ri(N), 41N, and a = a ?), show that t is trivial if and only if 81N. (13) For 1-' = I", (N), 4p1N, and a = ( °p) , show that t is trivial. (c) For 1-', p, and a as in part (11), show that Tpv f(z) = E ap v„e 2'inz for f(z) = E an e'nz E Mk/2 041 (N )), 1; = 1, 2, . . . . If p = 2 and 8,i'N, show that this is still true for y even. In particular, Proposition 14 holds for pIN. 4. Compare the formula for the q-expansion coefficients k, for '42 f when JE Mica (fo(N), x) with the corresponding formula for 7 ,2 f when JE Mk(N, X). 5. We noted in the text that 7p 2 takes MkR ( zi (N)) to itself. Show that 7;2 preserves Mk/2 (f0 (N), x) for any Dirichlet character x modulo N.
§4. The theorems of Shimura, Waldspurger, Tunnel!,
and the congruent number problem We now state Shimura's fundamental theorem giving a correspondence from forms of half integer weight k 1 2 to forms of (even) integer weight k — 1. Theorem ([Shimura 1973a]). Let k > 3 be an odd integer, A = (k — 1)/2, 4IN, x be a Dirichlet character modulo N. Let f(z) = 4°_, a ne' e Sic12( f0(N), x) be an eigenform for 7p 2 for all primes p with corresponding eigenvalue A p : 1,2f = Apf. Define a function g(z) = 4'1_ 1 be 2 " by the formal identity co 1 E b a n -s = 11 (4.1) + x(p)2pk-2— 2s • n=1
alip 1 — A pp —s
Then g e Mk-i(N I , x2) for some integer N' which is divisible by the conductor of x2 . If k > 5, then g is a cusp form.
Notice that the definition (4.1) of the b„ is equivalent to the following: (1) b 1 = 1; (2) bp = 2, for all primes p; (3) bpv = 2bp v-i — x(p) 2pk-2 bp v-2 for y >. 2; (4) b,,,,, = bn,b, if m and n are relatively prime. In Shimura's original theorem, the determination of the level N' of g was a little complicated. However, it has since been shown ([Niwa 1975]) that
-
§4. The theorems of Shimura, Waldspurger, Tunnel!, and the congruent number problem 213 one can always take N' ----z NI2. It should also be noted that Shimura actually - proved a somewhat more general theorem, applying to f which are not necessarily eigenforms for all of the Tp2. As a simple numerical ex-ample, suppose N =--- 4, x is trivial. The first nonzero cusp form of half integer weight occurs when k = 9, 11 = 4 (see Problem 5 of §IV.1). Up to a constant multiple, it is f = ORO' — 16F) =--E awl", where we have chosen f so that a l ---7. 1. (By Problem 17(h) of §I11.3 this f is also equal to q 12 (2z)/0 3 (z).) Clearly, f is an eigenform for all 7p 2, because S912 (r0 (4)) is one-dimensional. Then Shimura's theorem holds with N' = 2. Now Ss (f0 (2)) is one-dimensional and spanned by the normalized form g(z) = (n(z),1(2z)) 8 = ql-In"=1 (1 — q") 8 (1 — err ), 8 = E bnqn (see Propositions 19 and 20 in §III.3) ; hence this g(z) must be the g(z) in the theorem. It is now easy to relate the coefficients bin of g to the coefficients an off: Namely, using (3.7) with /0 = / 1 =-- 1, = 4, and noting that a1 = 1, x' = x (this is the trivial character mod 2, which equals 1 on odd numbers and 0 on even numbers), we obtain: bA =
=..- a P 2 + p3
if
p > 2;
b2 = a,.
(4.2)
While (4.2) follows immediately from Shimura's theorem, it is nevertheless quite a remarkable numerical identity : the p-th coefficient in q n (1 — q")8 (1 — q 2") 8 is equal to p 3 plus the p 2 -th coefficient in q 11 (1 — 47 2n) 1 2/(E q)3! Like many numerical relations that follow from the theory of modular forms, this fact looks rather outlandish when stated in this elementary form without the theoretical context. If we have a fixed set of linearly independent forms f in Sko (r0 (N), x) which satisfy the hypotheses of Shimura's theorem, then we can extend the Shimura map by linearity to the subspace of Skj2 (r0 (N), x) spanned by them. Note that the image gi off is always a normalized eigenform in Sk_ 1 (N/2, x 2). If we take another set {f'} of forms which satisfy Shimura's theorem and are also a basis for the same space as thef (for example, if we multiply each f by a scalar), the Shimura map is clearly affected. When we refer to the image of a single eigenform under the Shimura map, we shall always mean the normalized eigenform g in Shimura's theorem. But if we have a space of modular forms with fixed basis f of eigenforms with Shimura(f) = gi , then we define Shimura(/ aifi) = I aigi , which is not necessarily a normalized eigenform. So our meaning of "image" or "preimage" under the Shimura map depends upon the context. In general, it is possible for several different fe Ski2 (r0 (N), x) to go to the same g e Mk_ , (ro (N), x 2). For instance, there might be anf' e Sk12(fO(N) , x') which corresponds to g, where x' is a character different from x which has the same square: x 2 = x'2 . However, when N = 4, Kohnen [1980] described the situation more precisely. We now describe his main result. Let Mk+12 (r0 (4)) denote the subspace of Mk/2 (r0 (4)) consisting of f(z) = Eane for which an = 0 whenever (— 1)'n - -. 2 or 3 modulo 4. It is certainly a priori possible that there are no nonzero forms f with this property, which
214
IV. Modular Forms of Half Integer Weight i
requires that "half" its coefficients vanish. However, we know from Proposition 9 that 14/2 is such a form. Also, it is easy to check (using Proposition 17(a) of §I11.3 and (1,.8) of §IV.1) that 0(z) f(4z) e /1/42 (r0 (4)) for any fE M(k-1)12(SLi(Z)). It turns out that Aik2 (ro (4)) is the direct sum of the one-dimensional space spanned by the Eisenstein series 14/2 and the subspace Sk+12 (4)) of cusp forms:
(ro
,
/
sk412(r0(4)) = {f=
E anqn E Skigo( 4))1an = 0 if (-1)'n -m 2 or 3 mod 41. (4.3)
It is not hard to show that Mk+/2 (f0(4)) is preserved by all of the Hecke operators Tp2 except for T4. According to Shimura's definition of T4, one has T4 I ant? = E a4„q". If we look back at §IV.2, we see that Flo is an eigenform for T4 (with eigenvalue 2 2 , see (2.21)), but Ilk/2 is not. In fact, it is easy to see that T4Hk/20 Alk+i 2 (ro (4)). For this reason, Kohnen modifies' T4, and defines a slightly different operator TI so that the m-th coefficient of T: 1 anqn for (-1)2m -a 2 or 3 mod 4 is zero and for (— l )m a 0 or 1 mod 4 is equal to
a4„, + x(_..1)Am(2)2'ia,, + 2k-211"m14• With Shimura's definition, since we have x(p) = 0 when pIN even if x is the trivial character on (ZINZ)*, the second and third terms vanish in
a4m + XX(-1)Ani( 2) 22-1 ain + X(4) 2k -2am/4. Thus, Kohnen's modification is to replace the trivial character x by the map which takes the value 1 (and never 0) on all numbers including those not prime to N. It is T-4,- rather than T4 which preserves M 2 (F0(4)) and Sk4i2 (f0 (4)). Note that Hk/2 is an eigenform for T: with eigenvalue 2 2 . Kohnen further shows that M4-2 (r0 (4)) has a basis of eigenforms for all of the Tp2 (p 0 2) and for n which is unique up to permutation of the elements and scalar multiplication. There is no obvious way to normalize an eigenform f= E anqn ; for example, we cannot necessarily multiply by a scalar to get a l = 1, since a l = 0 for all fe 47/2 (f0 (4)) if .3. is odd. But one can require that the coefficients all lie in as small a field extension of 0 as possible. It turns out that the images g of these eigen-basis forms Je Sk4i2(r0(4)) under the correspondence in Shimura's theorem are all contained in Sk_i(F), r = 51,2 (Z) (the results of Shimura-Niwa only guarantee that they are in Sk-1( r0(2))) ; they are all distinct; and they form a basis for Sk_i(r) consisting of normalized eigenforms for all of the Hecke operators T„ acting on Sk _ 1 (F). (In particular, dim Sk41-2(r0(4)) = dim Sk_i (F).) Thus, if we take each of our basis elements for S4-2(r0(4)) to its image under the Shimura map—and take - p I/42 to the normalized Eisenstein-series -i---1-1 —and then extend by linearity to all of Afk4i2 (r0 (4)), we obtain an isomorphism from M4-2 (r0 (4)) to Mk_i (F) (and from Sk,2 (r0 (4)) to Sk _i(fl). This isomorphism commutes
§4. The theorems of Shimura, Waidspurger, Tunnell, and the congruent number problem
215
with the Hecke operators, in the sense that:
Mk4i2(r0 (4)) TP2
f
g E Mk -1(r)
I I
TP
and
nI
g
I
T2
T44-fi--+ T2 g
Ti,g
The basic method of Shimura's proof of his theorem was to use Weil's theorem which we discussed briefly at the end of §III.3. Weil's theorem says that if E bnn and its "twists" E bn O(n)n' for certain Dirichlet characters each satisfy the right type of functional equation relating the value at s I s, then g =. E bn iqn e Mic _i(Fo(N /), x 2). But the proof to the value at k that all of these functional equations are satisfied is not easy ; about twenty pages of [Shimura 1973a] are devoted to an investigation of delicate analytic properties of the Dirichlet series corresponding to g and its twists. Shimura's correspondence seems rather roundabout. It says : take the q-expansion of a suitable Je Sk/2 (r0 (N), x); look at the q-expansion coefficients a„ as n varies over integers with fixed squarefree part /0 , and form a Dirichlet series from them which turns out to have an Euler product ; then take the part of this Euler product which is independent of /0 , and expand it and finally, go from this new Dirichlet into a new Dirichlet series Eb n series to the q-expansion E bn qn, which will be your modular form of integral weight. After Shimura's paper appeared, people started looking for a more conceptual, less roundabout construction of the Shimura correspondence. Certain more direct, analytic constructions were given by Shintani [1975] and Niwa [1975 ]. In addition, group representation theory was found to provide a conceptual explanation of this correspondence (see [Gelbart 1976], [Flicker 1980]). Moreover, the use of representation theory has led to striking new results about forms of half integer weight, especially in the work of J.-L. Waldspurger. Using representation theory, Waldspurger [1980, 1981] proved a remarkable theorem establishing a close connection between critical values of 1 e 21 and the coefficients in 4-series for a modular form g of weight k the q-expansion of a form f of half integer weight 10 which corresponds to g under the Shimura map. Roughly speaking, the theorem' says that the critical value is equal to the square of a corresponding q-expansion coefficient times a nonzero factor which can be explicitly described. Waldspurger's general result is complicated to state, so we shall \ only describe what it says in two particular situations. As mentioned before, Kohnen [1980] showed that the Shimura map gives an isomorphism —
—
—
ra s2 (f(4)) Shimu --° Sk-i (n.
(4.4)
Here S 2 (r0 (4)) is defined in (4.3). Let g(z)=-- E bonqn e Sk _ i (f) be a normalized eigenform for all of the Hecke operators, and let xi, be the character corresponding to the quadratic field of discriminant D. Suppose that
216
IV. Modular Forms of Half Integer Weight
(— 1)'D > 0, i.e., the quadratic field is real if A = (k — 1)/2 is even and it is imaginary if 2 is odd. Recall that Lg (ID, s) denotes the analytic continuation of Enap - i XD(n)bnn -8 (which can be shown to converge absolutely if Re s> k12). Let f(z) = Zan g?' e S4-2 (r0 (4)) be the unique preimage of g under the Shimura map (4.4), i.e., g - 7---- Shimura(f ). Let
Theorem ([Kohnen—Zagier 1981]). With the above notation and hypotheses, n
y VIDI
‹g , g>)122Di.
Lig (XD, /1.) = (11)1) (k — 1)!
on
(4.5)
The basis of eigenforms fe ,s 2(r0 can be chosen so that the q-expansion coefficients are all in some totally real number field. However, there is no natural way to normalize them: we can multiply each f by an arbitrary constant c in that field. But note that the right side of (4.5) remains unchanged when f is multiplied by c, since ai2D1 and
§4. The theorems of Shimura, Waldspurger, Tunnell, and the congruent number problem
217
terms of 0 and E4). For more computational details of this example. see
[Kohnen-Zagier 1981]. Our second example of Waldspurger's theorem is the one studied by Tunnel l for application to the congruent number problem. In §111.3 we explained that the Hasse-Weil L-function L(E l , s) = E kin for the elliptic curve E1 : y 2 = x' — x corresponds to a cusp form E bqn of weight two. In turns out that g(z) --=--- E b„q" is in S2 (r0(32)); g is a normalized eigenform for all of the Hecke operators, and, in fact, is the.unique such "new-form", i.e., form which does not come from any lower level N < 32. For n squarefree, let D = —n if —n -_--- _ 1 mod 4, D = — 417 if —n --..._ 2 or 3 mod 4, that is, D is the discriminant of the imaginary quadratic field WV —n). In Chapter II we saw that L(E„, s) = E XD(m)bnim -s is a twisting of L(E 1 , s). (Actually, in (5.7) of §11.5 we wrote x(m) rather than x p(in): --,- - ( il) but b„,= 0 unless m--= 1 mod 4, in which case xp (m) --t-- ( ,;7)---zso one can equally well use either x i) or x„. Because we will be looking at the critical value at 2 =-- 1, we want to work with quadratic characters of imaginary quadratic fields, i.e., ( —1 ) D = —D > O.) Thus, we can write
= E b„,in- s ; L(E„, s) = Lg (XD, s) = E XD(m)bin. L(E 1
,
S')
=
Iag (S)
We saw that the critical value L(E„, 1) = Lg (x D , 1) vanishes if and only if 11 is a congruent number ("only if" here is conditional upon the BirchSWinnerton-Dyer conjecture). It is this critical value which Waldspurger's theorem provides a means of describing. Let fi denote the "real period" of E1 : y 2 = x3 — x, which is obtained by integrating dxly over the segment [1, co) where y is real: I
' dx 6 .,:fT-f ji .\/x.3 _ x = 2.622- • -.
Theorem ([Tuntieil 1983]). There exist a form f = E an, e e S312 (ro (128)) and a form f' =--- E a471" E S312 ( to (1 28), x 2) such that Shimura( f ) = Shimura(f) = g = Eke and p2 for n odd,'
L(E„, 1)
4\1-ii
(4.6)
'2 for n even. , fi a„12
In particular, L(E,„ 1) =--- 0 if and only i f an = 0 (n odd) or a'n12 = 0 (n even). Hefore discussing Tunnell's explicit construction of the forms f and f', we shall discuss how this theorem can be viewed as an analog of the results about Hka which we proved in §IV.2 (see Proposition 6).
218
IV. Modular Forms of Half Integer Weight
We saw that under the Shimura map Hici2, = I c 4" e Alki2 (ro" (4)) corresponds to
I
Bk-1
= — ( 2 — k) 'g= +
2
E 61.. . z (n)qne Mk_ i (I").
In Problem 16 of §III.3, we saw that Lg (s) = “s) • “s — (k — 2)) and that Lg (x, s) = L(x, s)L(x, s — (k — 2)), where the L-functions on the right are Dirichlet L-functions. In particular, setting x = xr, and s = A - 7--- (k — 1) 12, we obtain (4.7) Lg(XD, = L(XD, A)1i(xD, I — A).
2)
But, by the functional equation for L(x D , s), we can rewrite L(x D , A) in terms of L(xD , 1 — A) as we did in (2.15). We obtain an expression for L(xD, A) as a product -of the form * • L(x D , 1 — A), where * denotes a nonzero factor involving the gamma-function and powers of n. Substituting in (4.7) and using Proposition 6, we obtain
L9(X», 2) = *. cjoi-
(4.8)
When reformulated in this way, the results of 1V.2 are very similar to the previous two examples of Waldspurger's theorem. As in the Kohnen—Zagier formula, on the left in (4.8) we have the value of the L-function for some g E Mk — i (r), twisted by XD' at the center of its critical strip ; on the right we have the square of the corresponding q-expansion coefficient of the form in Mi2(f0(4)) which goes to g under the Shimura map. However, the KohnenZagier theorem does not include this case, because g and f= Hk/2 are not cusp forms (in their formula, one cannot even define
- §4. The theorems of Shimura, Waidspurger, Tunnel', and the congruent number problem
2 19
Tunnell's first ta. sk is to find all fe S3/2 (ro (N), ,) whose image under the Shimura map is the modular form g e S2 (F0(32)) corresponding to L(Ei , s). According to [Niwa 1975], any such f has Shim(f)es2 (ro (vo), x2). So one might try taking N =-- 64. However, there is no guarantee that Shimura(f ) is not in S2 (1T0 (N), z 2) for N' a proper divisor of N/2. For example, we know that any fe Sk4i2 (r0 (4)) has Shimura(f) in Sic _ i (f ) = Sk_ i (I-0 ( 1)). So in fact thefwe want could bein S312 (r0(N), x) for N a multiple of 64. Tunnel! computed that, in fact, no fof level 64 maps to g, but that all preimages of g under the Shimura map have level 128. The character x must be even, since Sk/2 (f0 (N), x) = 0 for odd x ; it niust have conductor dividing 128; and Z 2 must be trivial, i.e., x must be quadratic. There are two such k: the trivial character x = 1 and the character z2 defined by x 2 (f) = (i) for j odd. Tunnel! determined that Shimura -1 (g) consists of two forms in S312 (r0 (128)) and two forms in S312 (r0 (128), x 2). Moreover, he found that these four forms of weight i can be constructed in a particularly simple way: by multiplying a certain form f, of weight 1 by forms of weight 1 of the type 0(mz). Up to a constant multiple, 0(mz) is equal to 01[("0' ?), m 114] 112 , and so it easily follows from Problem I in §IV.1 that 0(mz)e M112 (r0(4111), Z rn) (see also Proposition 17 in §III.3). If we multiply this by a formfi (z) E M1 (F0(128), x), then the product f1 (z)0(mz) is contained in M3/2 (r0 (128), xx-.) when 4mI128. (Recall that Mi (F0 (128), x) = M212 (r0 (128), x - x_ i ) by Proposition 3 of §IV.1, and so the character for fi (z)0(mz) is x • x... i The form fi is chosen to be fi
(z) =
E (
.._ 1) nq(4n+1)2+8n2
tn,ne2
It is an easy exercise to show that
fi (z) = (0(z) — 0(4z)) (0(32z) — and therefore fi e M1 1 2 ( '40( 1 6)) • M1/2 (r 0( 128), )(2) c m 1 (1-0 024 x-2). Actually,fi also vanishes at the cusps of ro (128), and so we havefi e S, (ro (128), _ X-2). Thus, for mi 32 we have
f1 (z) ® ( m ) e S3/2 (rO( 128 ), X2rn)It can be shown, by the way, that fi can also be written as a product f1 (z) = 17(8z)q (16z) = (in ( 1
q8 ) (l ____ tir 1 6n) .
For a short proof using the Jacobi triple product formula, see [Moreno
1980]. Proposition ([Tunnel! 1983D. The modular forms f1 (z)0(2z), f 1 (z)0(8z)
. G
and fi (.00(4z), fi (z)0(16z) e S312 (fo (128), x 2) are a maximal set of linearly independent eigenforms for all of the Tp 2 whose image under the Shimura map is the modular form g =-- E bing'n e s2 (r0 (32)) corresponding to L(E l , s).
S312 (r0 (128))
220
iv.
Modular Forms of Half Integer Weight
Tunnell then proves his theorem by a close examination of wht. Waldspurger says in the very special circumstances of the congruent number problem. In our situation, Waldspurger's theorem boils down to thé assertions that: (1) there is a linear combination f = E a,sq" of the preimages in S312(r0(128)) such that for all odd squarefree 10 Lg (x... i., 1) = ca7. for some constant c; (2) there is a linear combination f' = Ian' qn of the preimages in S31 (r0 (128), z 2) such that for all odd squarefree /0 Lg(X-210,
1) = c'di02
for some constant c'. Tunnell computes that one can take f(z) = fl (z) 0(2z), c = fl/4NA-3„ and f' (z) = f1 (z)0 (4z), c' = 13/2„/210 . Since Lg(x...„, s) = L(E„, s), this gives his theorem with f(z) = (0(z) — 0(4z))(0(32z) — 10(8.0)0(2z), f' (z) =. (0(z) — 0(4z)) (0(32z) — i.0(8z))0(4z). The numerical identity (4.6) in Tunnell's theorem is quite bazaar. By the formula (6.8) in §11.6, it says that for n odd and squarefree : co dx ,/;-7 E8 ( —n bni e-nminN4 = a'n c° , nt1
I
in
1 \ X3 —
X
where L(Ei , s) = E b„,m' and a. is the n-th q-expansion coefficient in (4.9) below! Note that in (4.6) we are interested only in the odd q-expansion coefficients off and f' . But for n odd, the n-th coefficient is the same as the n-th coefficient in
1 0(z) (0(32z) — — 0(8z) 0(2z) 2
1 = E q 2x2,y2,32z2 _ -, E
x,ye zEZ
2x2 +y2 +8z2 4?
x,y,zeZ
(4.9) and
1 0(z) (0(32z) — — 0(8z)) 0(4z) = 2
E ex 2 +y2 +32z 2_ 1 x,y,zez
E q4x2,2+8z2
2 x,y,zeZ
,
(4.10) respectively. (Notice that the coefficient of q'e, in (4.9) and (4.10) is obviously zero' if 1 0 1--- 5 or 7 mod 8 for (4.9), 10 EF. 3 mod 4 for (4.10); but this tells us nothing new, since in Proposition 12 of §11.6 we saw that L(E, 1) = 0 if n = 10 = ._.:- 5 or 7 mod 8 or n = 210 a 6 mod 8.) Collecting coefficients of qto in (4.9) and (4.10), we conclude the version of Tunnell's theorem that we cited at the beginning of Chapter I. -
§4. The theorems of Shimura, Waldspurger, Tunnell, and the Congruent number problem
221
X n is a congruent number "elementary" (proved in Chapter 1) En y 2
= x3
has
n 2x
infinitely many rational points Birch-Swinnerton-Dyer conjecture
Coates-Wiles- -
L(E n , 1) = 0
Theorems of Shimura, Waldspurger, Tunnell
the n-th q-qcpansion coefficient in Tunnell's product of theta-functions is zero
Figure IV.!
Theorem ([Tunnell 1983]). If n is a squarefree and odd (respectively, even) positive integer and n is the area of a right triangle with rational sides, then ± 32z 2} = 1# {x , y, ze if ?? = 2x 2 + y2 4_ 8z2} # {x, y, zelin = 2x2
+ y2
(respectively,
# x,y,ze7
= 4x2 ± y 2 7-1- 32z 2}
{
{x,y, ze7
= 4x2 + y2 + 8 z 2}) .
If the weak Birch—Swinnerton-Dyer conjecture is true for the elliptic curves En: y 2 = x3 n2x, then, conversely, these equalities imply that n is .a congruent number.
In Fig. IV. I we recall the logical structure of the argument.
222
IV. Modular 'Forms of Half Integer Weight
We note that very recently B. H. Gross and D. Zagier [1983] have been able to show that the weak Birch—Swinnerton-Dyer conjecture is true for En for a large class of n-. For such n, Tunnell's theorem becomes an unconditional equivalence between the congruent number property for n and the equalities in the theorem involving the number of representations of n (or S) by some simple ternary quadratic forms. As mentioned in Chapter I, Tunnell's theorem has the practical value of leading to an effective and rapid algorithm for determiningwhether n is a congruent number. In addition, one can give quick new proofs of certain conditions for n not to be a congruent number. For example, if n is a prime congruent to 3 modulo 8, Tunnel! shows that an -=7- 2 mod 4, and therefore n is not a congruent number. For this and other corollaries, see Tunnell's
paper. The only sense in which Tunnell's theorem is not yet a completely satisfactory solution to the ancient congruent number problem is that in one direction it is conditional upon the weak Birch—Swinnerton-Dyer conjecture for certain elliptic curves. But lately, significant progress has been made toward a proof of that conjecture in enough generality to include the curves En . In addition to the work of Gross and Zagier mentioned above, R. Greenberg [1983] was able to prove that, if the conjecture were to fail for an elliptic curve such as En which has complex multiplication, then that would imply a highly improbable combination of consequences for the TateShafarevich group of the elliptic curve. It is remarkable that the nearly complete solution that we now have to such an old and naive question as the congruent number problem, has required some of the most powerful and sophisticated tools from diverse branches of twentieth century mathematics.
Answers, Hints, and References for Selected Exercises
0.1 1. See [Hardy and Wright 1960, pp. 190-191]. 3. (b) Follow the proof that + y4 = u2 is unsolvable on pp. 191-192 of Hardy and Wright. 4. For fixed n and fixed x (so that Z is fixed), the triples that correspond to x come from the intersection of the two conic sections X 2 + 1 2 = Z 2 and XY = 2n in the XY-plane. Given one point of intersection (X, Y), the other three are (— X, — Y), (Y, X), and (— Y, X), and so do not give a distinct triple. 5. (a) 1681/144; (b) 25/4; (c) 841/4, 1369/4. 7. Since x2 , y2 O, 1 or 4 (mod 8), it follows that 2x2 + y2 + 8z2 can never equal an integer n 5 or 7 (mod 8). The first congruent number n 1 or 3 (mod 8) is 41, which is the area of the right triangle with sides 6-k, 131, 14A.
§1.2 1. replace y by yln 2 and x by xln 2. (e) x = (e) X 3 4 —3 —4
Y 4 3 —4 —3
Z 5 5 5 5
x
y
—3 2 12 18
9 8 36 72
nY I(X Z), y = 2n2 /(X + Z)
X Y Z 3 4-5 3—,5 4 —3 —4 —5 —4 —3 —5
12 —36 18 —72 —3 —9 —2 —8
3. (a) If Z is the side opposite 0 and X, Y are the other two sides, the law of cosines gives X 2 + Y 2 — 2AXY = Z 2 . Then the point u = (X — AY)/Z,v = YIZ is on the ellipse u2 + B 2 y2 = 1. Again use the slope of the line joining (-1, 0) to (u, v) to parametrize this ellipse. Show that u = (1 — B2 t 2)1(1 + 13 2 1 2), y = 2t/(1 + B2 /2 ). Now the area of the triangle is I.XY sin 0 = 11XY, so we obtain: n = -1-BZ 2 (YIZ)(XIZ) = 12-BZ 2 V(ii Av) BZ 2 41 + 2At B 2 t 2)/(1 + B 2 12)2 . Finally, set x = Bt , y = (1 +' B 2 12)1Z, so that ny 2 = x(x2 + 2 14x — 1). (b) Since
Answers, Hints, and References for Selected Exercises
224
A = (1 — A 2)I(1 + A2) and B = 24(1 + A2), the right side of the cubic equation in 22 x 1) = x(x part (a) becomes x (x2 + + 71).
§1.3 1. Counterexample if K = Ft) ZIpZ: F(x, y, z) = xP x. 3. 1; 0; 2 5. (b) Here's a counterexample if char K = p. Let d be a multiple ofp, F(x, y, z) = x' + + Z d + x 4y bz`, where 0 < a, b, c, a + b+ c = d. Then all partial derivatives vanish at (0, 0, 1), (0, I, 0), (1, 0, 0), but these points are not on the curve. In fact, if p = d= 3, a = b = c = 1, then the curve is smooth at all of its points, even though there are three points of Pi where all three partial derivatives vanish. (d) With K = ill or C, think of F as a map F: K3 K, so that the smoothness condition becomes nonvanishing of the gradient. Then apply the chain rule to the
xyz-space K. 6. (b) Reduce to the case composite function: x'y'z'-space z1 = z2 = 1, set Ax = — x 1 ), so that f(xi )Ax yi ). Then 0 = P(x l + Ax, Yi +f '(x l )Ax + a,„Axm + • • • , 1) = P(x + Ax, y i +nx i)Ax, 1) +
-1 6 (x1 Ax, Yi +f(xi )Ax, 1)a,n Axm + • • • = ± 1) -deg 1.' 1*(X 1 + X21, Yi 2 1 + z 2 1) + (nonzero constant)tm + higher terms.
y2 t,
§1.5 2. (a) f(z) = (e 216a, e21''') for z = act), + 110)2 , a, b e R. (b) N 2. (c) In part (a), let a = flp,b= klp; then for j, k E Fp not both zero we have (j, k)4-4 subgp gen by (e2nibl e21 4P) gives the required one-to-one correspondence ;there are p + 1 subgroups. 3. (a) If s = 2, reduce to the case when f(m, n) = 1 if m, n m l , n 1 (mod N), f(m, n) = —1 if m, n m 2 , n2 (mod N), f(m, n) = 0 otherwise (where m i , ni are fixed pairs); then pair together the jN + m l , kN + n 1 term and the jN + m2 , (—N)' kN + n2 term. (b) n)P(s-2)0(0 1 N11 (02; (0 11 (02)(s — 1)! 05rn,n
E
§1.6 2. g; — 27gI.
3. g"(z) = 6p(z) 2 — ig 2 . 6. “8) = n8 /9450. 7. 4/3, 8/27 10. Set v(z) -1 (0(z)), find (dvick) 2, and show that dvldz = +1. 14. (a) Substitute t = sin2 0 and integrate by parts. (b) Use Problem 12(b), and substitute x = (t — e 1 )I(e 2 — e l) to get the expression under the radical in the form x(x — 1)(x — A). (Note: An elliptic curve written in the form y2 = x(x — 1)(x — A) is said to be in "Legendre form") Then make the substitution t = 11x. (c) Expand (1 — .1.0 -2/2 in a binomial series in pare (b), and use part (a). §1.7 2. (a) ((x2 + n2)/2y) 2 . (c) let ord 2 denote the power of 2 dividing the numerator minus the power dividing the denominator; dividing into the cases ord 2 x < ord 2 n, ord 2 x = ord 2 n, ord 2x > ord 2n, determine ord 2 (x 2 + n2) and use y2 (x2 n 2 )x to find ord 2y, in order to conclude that ord 2y ord2 (x2 + n 2). (Of course, ord 2n =
Answers, Hints, and References for Selected Exercises
225
or 1, since n is squarefree.) 3. (a) 0 together with 3 x-intercepts. (b) points of inflection. (c) from each of the 3 x-intercepts there are 4 lines which are tangent to the curve at points of order 4: (d) if we have a configuration of three line's crossing three other lines, and if the elliptic curve passes through 8 of the 9 points where they cross, then it passes through the ninth. 4. (a) eight; setting y" ---- O after twice differentiating y2 =f(x) implicitly and then multiplying both sides by 2y2 gives: (2yy1) 2 = 2y2f "(x), and hence f ' (x) 2 — 2f(x)f"(x) = O. (13) x = ±n-‘,/ 1 ± 2013. 5. four; draw lines from — Q which are tangent to the curve. 7. 4; 3; 8. The four points of order 4 not of order 2 are found by drawing lines from (n, 0) which are tangent to the curve. 8. two points of order 2 (at infinity and (a 1/3, 0)); three points of order 3 (at infinity and ((4a) 1/3 , + (30 1/2) if a is positive, (0, + (-0 1/2) if a is negative); four points of order 4, namely, the two of order 2 and the two points (a 1/3 (1 + 0), + (3a(3 + 20)) 1/2) (if a is positive; change the two + 's to — if a is negative).
§1.8 16x4 + 8g2x2 + 32g3x + g3 with x = 16(4x 3 g 2x g 3) 3g 3x 11-16,gi, with x = (z). 3. Look at zeros and poles 2. f3 (z) = 3x4 of go (Nz) (z); determine the constant —1 by comparing coefficients of z- 2 • (See [Lang 1978b, pp. 34-35 ]. ) 4. Considered on the points kw, + k-co2 in the z-plane, a must take every such point either to itself or its negative (modulo L). When finding the matrix entries by looking at the cases (j, k) = (1, 0) or (0, 1), we fi rst obtain ( ±1i ±?), but consideration of other j, k shows that both signs must be the same. The analogous situation in cyclotomic fields is to set ŒNF = Wan 2F-11) (i.e., adjoin just the x-coordinate of the point of order N). Then Gal(ON/07:1) is the subgroup { +1 in (Z/]/)* . 5 • The image is conjugated by the change of basis matrix. 6. (a) a subgroup of order 2, (b) the trivial subgroup, (c) the entire group, (d) a subgroup of order 2. (Note: In this and the next problem, these subgroups are only defined up to conjugation; see Problem 5.) 7. (a) 48, (b) Using Problem 4(b) of §1.7, we see that K3 is generated by + + 20/3 and by the solutions y of = x12 2 (2013) = ± NI 3 ± 2V3. (c) By part (a), [K3 : 0] divides 48. Since it is obtained by successive extraction of square roots, K3 has degree in fact dividing 16. On the other hand, by part (b), it is easy to see that K3 contains i and also the fourth root of 4n 2 (3 + 20), which satisfies x' 24n 2x4 48/74 , which is irreducible (by Eisenstein's criterion for the prime 3, if 3)(n; if 31n, a generalization of Eisenstein's criterion can be used). Thus, the field F obtained by adjoining this root has degree 8. But F IR, while K3 31, so that [K3 : 0] 16. Hence, the degree is 16, and the image of the galois group is a 2-Sylow subgroup of G1,2 (1134 Here are two alternate ways of showing that [K3 : is at least 16: (i) K3 has at least degree 2 over 41(s/2nV3 + 20-) c ER, so it suffices to show irreducibility over Q of the polynomial 1. p(2z) =
,
}
11 8 choices of ± 2 nV3 ± 2A• But otherwise a product of 4 of the roots would be in Z, and this can be ruled out directly. (ii) First show that [Q(N/3 + Q] = 4, after which it suffices to show that Q(./2n,/3 + 2 \73) # ON-s4/3 + 20). Otherwise we would obtain 2n-\/3 + = (a + + 2./)2,
226
Answers, Hints, and References for Selected Exercises
a, be WA ; since ,s,/3 + 200 0 (/ ), this gives 0 = a2 + b2 (3 + 20), a contradiction, since a, bE R. (d) 1), i.e., z 1 = co 1 /3, z 2 = (co i + co2)/3. 8. (c) Show that a is an eigenvalue of a 2 x 2 matrix with integer entries. 9. (a) 3, (b) 4, (c) 6, (d) 7, (e) 8, (f) 5, (g) 7.
§1.9 1. Possible conditions on p and f: 81f; or else 21f and p —1 (mod 12). 2. (a)
7 11 3 5 type (2, 2) (2, 4) (2, 4) (2, 2, 3)
p
13 17 19 23 4) (4, 4) (2, 2, 5) (2, 4, 3)
(b) (4, 4) (c) (2, 2, 7), (d) (2, 4, 9), (e) (2, 2, 9, 37). For q = 17 and 9, you have to check that the x-coordinates of the doubles of all 12 points not of order 2 are 0, 1 or —1, i.e., all points are of order 4. Note that those cases also follow from Problem 11 below. 3. Same as Problem 2 except for: p = 13, (2, 2, 5); p = 17, (2, 2, 5). 4. Notice that the right side x3 — a runs through Fq as x runs through IFq , so the number of points is the same as on y2 = x (plus the point at infinity). 5. See Problem 10 below. 6. p type
5 7 11 13 17 19 23 (2, 3) (2, 2) (4, 3) (2, 2, 3) (2, 9) (2, 2, 7) (8, 3)
7. Show that for all but finitely many primes p —1 (mod 6) the group homomorphism from the subgroup of points of order m in E(0) to E(Fp) is an imbedding. Show that this implies that points of finite order can only come from 2 points of order 2 and/or 3 points of order 3. Then find whether the points of order 2 or 3 can have rational coordinates. 9. (a) Show that a point of order N is taken to another point of order N by the complex multiplication automorphism; but if both (x, y) and (—x, .‘f —1 )') have coordinates in IFq , then V-1 E F. 10. Proceed as in 9(a), using the complex multiplication (x, y)f-+ gx, y), where C is a nontrivial cube root of unity in F42. 11. Suppose that cc.< )6. Let G be the quotient group of the group of points of order modulo the subgroup of points of order P. Then G P.-, 7LI1P -Œ7L. Show that the complex multiplication used in 9(a) gives an automorphism of the group G whose square is the automorphism —1 (which takes every element in Z//a-ŒZ to its negative). Show that there is no such automorphism if I a 3 (mod 4), because — 1 is not a square in (zyz)*. If / = 2, use the fact that —1 is not a square in (Z/4Z)*.
§H.1 4. Z(T) = (1 — T 2) - '12 ; Z(T) = (1 — T 2) - ' ; take the equation x2 — a = 0 for a any quadratic nonresidue mod p. 5. Z(T) = 1/(1 — T)(1 — T2 ) • • - (1 — 7' 1 ). 6. (a) Z(T) = 1/[(1 — T)'(1 — Tr 1 (1 — r 2)' 1 • • • (1 — T 17 -z )1-1 ] (b) The limit of the Z(T) in part (a) as M approaches infinity, an infinite product which is not a rational function.
7. (b)-(c) Z(P/Pq /Fq ; T) = 1/(1 — T)(1 qT)(1 q 2 T) • • (1 — qm T). 9. Write V as a disjoint union of affine varieties, and use Problem 1. To reduce to the case of a single equation, use induction and the observation that the number of simultaneous
Answers, Hints, and References for Selected Exercises
227
zeros off = g = 0 is equal to the sum of the number for f = 0 and the number for g = 0 minus the number for fg = O. 10. 1/ (1 T)(1 qT).
T). T)(1 11. (1 qT)I(1 T). 13. Following Problem 13 12. 1 1 (1 — 7)(1 — qT)(1 — q 2 T) 2 (1 — 473 T)(1 of §I.9, you quickly see that the field extension generated by the points of order 1' is contained in Fe. It remains to show that only the /m-order points, and none of exact order /m+1 , have coordinates in V -1. Prove this by computing the exact power of! that divides Ni m-: . Consider separately the cases (i) / remains prime in the quadratic extension Q(Œ); (ii) the ideal (I) = Li: splits into a product of two prime ideals. Show that, for example, if LC is the highest power of L dividing a — 1, then — 1. In this way, use the fact that N1 is is the highest power of L dividing exactly divisible by / 2 to prove that NM-1 is exactly divisible by 1" . 16. Write Z(T) 1 + c i T + c 2 T 2 + - • • = P(T)/Q(T), Q(T) = b 0 + a l T + • •, where P(T) and Q(T) have no common factors and all coefficients are integers. Check that the Polynomial Q(T) is primitive. Use the Euclidean algorithm to write PU + QV = m, where Ul V E Z[1] 1 M eZ, m O. Write mIQ = U(P1(2) + V = do + di T + . Since in = Q(T)(d o + d 1 T + • ) with Q(T) primitive, it follows by the proof of Gauss's lemma that midi . In particular, mid° , and this means that the constant term 100 is +1. It immediately follows that P(T) also has constant term +1.
§11.2 1. For example, to prove (3), in the double sum for J (x1 , x2)g(x i z2), replace x by xfy and then replace y by x + y. For property (2), see 2(a) below. 4. i.%J, NrS, iNfl, 3. 5: 1 + 2i, 3, —3 + 2 1, —1 ± 41. 6. x(4)J(x, x) = x(4x 4x2) = E(1 — (2x — 1) 2) = x(1 — x2) (replacing 2x — 1 by x) = + X2(Y))X( 1 Y) (where y = x2 if 2 (y) = 1), and this equals J(x21 X). 7. (1 ± 3W-3)/2, 8. (a) Use part (13) with m a square root of 1/n in Fq2 (b) Replace (-5 + X by x/m 2 and y by yftn 3 in y2 = x3 — n2x. 9. (a) Replace x by x (b) Choose J to be the ideal of elements x for which ax E I, and take the sum over a fixed coset in (R11)* of the subgroup consisting of elements congruent to I modulo J ; show that each such sum vanishes. -(Look at the example R = Z, I = (N ) to get used to the argument.) (c) Check that g(x, = g(X, 11-i) = x(— 1)g(X, 1,11). Then we have fg(x, = g(x, ifr)g07, IT) = Exe mgrE ye Rbr X(x))7(y)0(x — y) (here it makes no difference whether we sum over R/I or (RI I)*). Replace y by xy in the inner sum, thereby changing the summand to 2(y)igx(1 — y)). For fixed x not prime to I, if we let J denote the ideal of elements whose product with x is in I (thus, J is strictly larger than I), we see that the inner sum vanishes by the argument in part (b). So we can replace the outer sum ExE(Rto* by Exa m. We then interchange the order of summation to obtain: ig(X, IP)12 = Eye Rg 30) Ixe Riill/(X(1 .y)). When y = 1, the inner sum is I. But when y # 1 it vanishes, since, by assumption, is nontrivial on the subgroup (1 — y)R + 1/I in RI I. 10-17. See Weil's paper [Weil 1949] or else [Ireland and Rosen 1982, §4 in Chapter 11]. 20. (b) By part (a), modulo 3 we
have J(x3 , x3) =7._- (Ex3 (x)0(x)) 3 EA(x)0 3 (x) = ExEr, = — 1. (c) An, x3) is an element of Z [co] of norm p which is congruent to — 1 Imod 3. (e) Since J(x3, X3) has norm p and the right congruence, it suffices to show that it is in the ideal (a + bco) (rather than the other possibility (a + bCo)). Working mod a + bco, replace x3 (x) by P-1)/3 and x3 (1 — x) by (1 — x ) P-1)/3 in the definition of Ax3 , 2(3), and use the fact
Answers, Hints, and References for Selected Exercises
228
that Ex-i = 0 unless p — 1 divides/ 21. (a) Use Problem 6 and the HasseDavenport relation. 22. (a) a # 0 and char K # 3. (b) Use: the map xi-4 x3 is one-to-one from K to itself, and so x' = u always has exactly one solution. (c) There is one point at infinity. For any fixed y, the number of x such that x3 = y2 + ay is I + X3CY2 ± ay) + Z3(Y2 ± ay). Sum this over yEK, making the change of variable y = ax, so that x3 (y2 + ay) = y(* 3 (a) 3(x — x2) = 7 3 (a)x3 (x)7443 (1 — x) (recall that —1 = 1 in K). (d) Compute the case r = 1 directly; then use the Hasse-Davenport relation. Z(C/F2 ; T) = (1 +7 2)1(1 — T)(1 — 2T). (e) Completing the square in the equation y2 + y = x3 and substituting y' = y + 4, x' = x gives 3/ 2 = x' 3 + i, i.e., (8y') 2 = (4x') 3 + 16. Then set y -4---- 8y', x = 4x'. , 23. (C) The following tables give the number of points in factored form. The type of the group follows from Problem 10 of §ll. 1 unless that number is divisible by the square of a prime 1 m 1 (mod 4). Those cases are marked with an asterisk and discussed below.
Isl,!" r 23 1 23 . 13 3 2 3 - 401 5 7 229337
N7 22 2 2 .37 22 . 761 22 .29.673
r 2 4 6 8
Nr
25 27 - 5 2 5 - 13 - 37 29 -3 2 .5 .17
r 10 12* 14*
Nr
25 .401.761 27 - 5 . 13 2 - 37 - 61 25 • 292 .337.673
,
(d) Nr N" r 22 .5 23 1 22 .5.109 3 23 .277 5 2 3 -101-461 2 2 .5 2 .3701*
r N. 2 25 .5 4* 27 .3 2 -5 2 6 25 -5.109-277
r N. 8* 29 .3_2.52 .73.97 10* 2 5 -5 2 -101.461.3701
To handle the asterisked cases; suppose that En(Fq) contains exactly I points of order _ /: jP, 0 ..,._j < I; and that En (Fqr) contains exactly /2 points of order /2 of the formV, 0 .j < 12. Let Fqr# Fqr be the extension generated by the coordinates of Q. Following Problem 12 of §II.1, use the map a i—> a(Q) — Q on Ga1(F4F4) to show that r' = I. Use this to show that the / part of the type is (I, I) in all cases except NT and N10 in part (d). In those two cases, show the type is (/ 2) as follows. If there were /2 Frpoints of order I, let Fe' be the extension of Fq they generate. By suitably choosing a basis {11, Q} for the 2-dimensional F, - vector space of points of order l, get an injection of Gal(Fge/iFq) into matrices of the form a) *°),,=.-, IF*. Thus, r'il — 1.
§11.3 4. By (3.11), we have g((i-,)) = Za,b=o,1,.. -.,p-1( a2 ;b2 )e2nialP = p - 1 + 4:1 Egl (a22 )e2xio. In the inner sum replace b by ab to obtain g((7,)) =
V))
= p — 1 — E1,71(1-±,--'2) — p — 1 + (Eri -71 e 2n NEIT-1 ( i 2p— 1 — 43(1 + ( 1V)) = 2p — 1— # {a, b E Fpla2 = 1 + 1)2 } = 2p — I — # {x, yE Fp fxy = 1} (after the change of variables x = a + b, y = a — b; a = (x + y)/2, b = (x — y)/2). Hence, g(7,)) = 2p — 1 — (p — 1) = p.
§11.4 2. (b) Note that L(x, s) =- EbEGX(b)fs(b), and use part (a) together with Problem 9(a), (b) in §11.2. (c) L ( cs , 1) = — kg(x)Z aeG k(a) log(1 — --1. (d) L(x, 2) =
229
Answers, Hints, and References fix . Selected Exercises
3. (a) t- 1/2e2niay-ny2ft. (e) si2 r(i)Nsii(x, S') =rn –(1– 1 — s). (f) Set s = -1- in part (e). (h) L'(, —2k) = D kk ! n-2k--1/2N2kg()r(k + -1-)L,(Z, 2k + 1). 5. (a) Integrate by parts in the definition of the Fourier transform off ' (x). (b) — iyi-3/2e2giay-ny2/t. x i 2 -( "(e) 2-2= -3)121-)L(, 1 s). (h) For x(n) (i) we )Ns L(x, s) — ig()n obtain: L'(x, —1) = ,*(1(e-r21613) — 1(e 27613)), where /(x) is the function in Problem 2(d). 7. Use part (a)(iii) of Problem 4. 8. (a) Add the expression for 1(a, 1 — s) + 1(1 — a, 1 — s) in Problem 3(c) to the expression for t obtain: l o rle(so)5(:( a,o 1(a, 1 — s) 1(1 — a, 1 s) 7irnsp+rbm) -
1(a, 1 —
21- 41
s)/2)
in -s+1/2 {VS ± 0/2) g(a, ± 21- (1 — 42)
a, s))
—
CO — a, s)).
§11.5
dri + floL ts(00) +)÷. As in §II.4, 2. (a) 0(t) = 10(11t). (b) Let 4)(s) = ts(0(t) 1)— ++ show that 10(s) is entire, and for Re s> 1 is equal to rot' ts(0(t) — = +1+...s._ n-s r(s)zotmEe im i-2s ± i i s n -s f(s)44(s). By substituting lit for t in the integrals for 45(s), show that 0(1 s) = 0(s), and hence ir-si" (s)r,K (s) = '7Es-1 F(1 — s)4(1 s). Finally, show that 44(s) = ÷.) (4)(s) — — -T,!--s) is analytic IF(s) = 7/. 3. Since the (s — 1)44(s) = except at s = I, and that Fourier transform of g(x) = e2niv-x e-ntlx+ul 2 is -0(y) = +e 2ltit (y- e ) e-( n/l)1Y-'12 , one .. yi2 = Te- 2ffiu . 1et v(+). 4. (a) (2niw y) k Ay). ( 3) (v )ki,e21ia ye _otio1 obtains: (c) Ou,k (t) = 1-k-1 0" (+). (d) Let a + bi run through a set of coset representatives modulo 1= (n), and write x in the sum in the form(a+ +.).w)4 bi) + n'(m + m 2 i), so that ((pfl _m e zz2 7 the sum becomes n'k + bi) m+ui 2s (where u l + u2 i = +bi X(ifi (a + bi)In'). The inner sum is essentially the Mellin transform of Ou, k (t)• The functional equation in part (c) will then give a linear combination 6f terms of the form f; t k+1 0" (0 —dand this linear combination can be expressed in terms of a Gauss sum and our original sum with s replaced by k + 1 s. (e) Suppose the (a0 , al ) for our Hecke character 2 is (k 1 , k 2) (this pair of integers is called the "infinity type" of). Consider 2 as a function on elements as well as ideals of Z[i] by defining T((x) = £((x)). Let k = 1k 1 — k21. If k 1 > k2 , then the mapping is easily seen to be a character of (Z[i]/D*; if k 2 > k 1 , then def X(X) =f i(OXk (NX)k l is a character of (Z[i]/j)*. In, for example, the case 1( 1 > k 2 , the Irecke L-series is then .1 "E 0 x e ZVI 2(X)(N1 xrs E xk x(x)(NV2 - s. 5. (a) Make the change of variables x' = Mx to obtain 4(y) = Sure-216M 2 Y f(e) idde :`,;41 — -frfiFifif(M *y), since Ats i x' • y = x' • M*y. (b) L' M*Z" ; let g(x) =f(Mx). Then rr g(m) = I mEz nj(rt) ide!mi EyL,./(y) by part (a). Exez.f(x) = 6. (a) M* = -A-(1 ?); considered in C, L' = Note that Tr xy = 2x - y, where on the left x and y are considered as elements of C, and on the right as elements of R2 . (b) In Problem 5(b), let f(x) = e - '11'12 , g(x) = f(Mx) with M as in part (a); then apply Poisson summation. (c) Let 0(t) be the sum on the left in part (b), and let 4)(s) = 1`2,,rst 5(0(t) 1) 1L + 0(0(0 —')1'-. For Re s > I show th a t
Answers, Hints, and References for Selected Exercise
230
4)(s) = (20)sa + 1 10 + n -s1"(s)6CK(s). The residue at s == 1 is n/3f."3. Finally, replacing t by 4/3t in the integrals for ck(s) leads to the relation: 0(s) = (2/j) 21 4(1 s), and this leads to A(s) = A(1 s) for A(s)rf (07270s1"(s)4(s). (d) Use the Euler product form K(S) fl p(1 (INP) -s) -1 . For ea prime ideal P of norm p 1 (mod 3), the contribution from P and P is (1 - sr 2 ; for P = (p), where p 2 (mod 3), the contribution is (1 - p -2T' = (1 (l - X(P)PT"' ; and for P (NI -3) the contribution is (1 - X(3) 3-T"' (since x(3) =-- 0). So the Euler product is the Euler product for “s) times the Euler product for L(x, s). (e) Multiplying the functional equations for C(s) and for L ( c, s) gives A(s) = A(1 s) for A(s) = n.-42 r()C(s)(3/7 ) 42 r((s + 1)/2)L(x, = ("jinn K(s)r OW as + 1)/2) = const -0/27r)sr(s)C K(s) by (4.4). 7. See Problems 20-22 in §II.2; (d) g(x, tfr) = 3. 8. (a) By part (1) of Proposition 9, j(y) equals e'Y times the Fourier transform of (x w)e - nox'12 ; note that x w = Mx • (1, i) with M as in Problem 6(a); then proceed De zniu- y e--(47rooly•(-1„), (I)) Use as in Problem 6(b) to obtain 4" (y) = 34t y . n-sr (s)6L(E, s). (d) Replacing t by IF in the Problem 7(c) to obtain 0(s) = eh where the integrals in 4 (s), one obtains 4)(s) = (-1)s -1 Ex(a + ba))14T 2t summation is over 0 a, b < 3 ; u = (a13, b13); and ) e2g1"•rne-4tInl'( '' 1) I 2 . Then for Re (2 s) > 3/2 use (4.6), OU(t) = E„, z2 tn • ( Probrm 2 of §11.2 (note that u • in = iTr((a + bco)(- &m 1 m 2)1i0)), and the evaluation of the Gauss sum in Problem 7(d). The result is: 0(s) = (4/3)5-1 370 -2 1-(2 s)6L(E, 2 s). Equating this with the expression in part (b) and t 12
1/
collecting terms gives the desired result.
01.6 2. The function f(s) = A(1 + s) is even in the first case (so that its Taylor expansion at s 0 has only even powers of s) and odd in the second case. 3. (c) Use: •(3-1, and 2ap = ap 'ercp .
5-6. n
first few nonzero kn,„ 2 b 1 = 1, b 5 = 2, b9 = -3 3 b 1 = 1, b5 = 2, b 13 = 6, b17 = -2 10 = 1, b9 = -3, b 13 = 6, , l7 = -2
1,(E„, 1)
0.92707 1.5138 1.65
7. You want 14 4. 1 1 to be less than c/2, i.e., c/2 > 4(1---
remainder estimate
0.00027 1R25 1 0.00123 1R29 1.ç.. 0.289 e m+iy,fir ; for 1R131 ...ç.
N' e - gml`51- , so choose M> N ' 1og(8,1Thre) -71-NTAT; log n, i.e. (2n N/2/7r)log n for n odd, (2n/n)log n for n even. 8. (a) Use Problem 3 to find the b„,,,,. (b) See Problem 7 of §I.1. large n the right side is asympotic to
1. Te 1-1 (N) c ro (N ), but STS -1 F0 (N); hence, neither I-, (N) nor Fo (N) is normal in r. 2. (a) Given Ac SL 2(ZI1VZ), let A be any matrix with A = A mod N. Find B, CESL 2 (Z) such that BAC is diagonal: BAC = (g ?,), where ad =' det A 1 (mod N). It suffices to find A= re 'SI) with determinant 1, since then SL 2 (Z), and B -1 B -1 (.4 1)0C -1 = A 71(mod N). To find A so that 1 = det 1 = 1 + ((ad - 1)/N + xd)N - zN 2 , first find x so that xd -(ad
1)/N (mod N), and then choose z so as to make det /i= 1.
231
_Answers, Hints, and References for Selected Exercises
3. (a) (q 2 — 1)(q 2 — q); (b) q(q 2 — 1). 4. (b) Since the kernel in part (a) has p4(`'' ) elements and #GL2 (7L/pZ) = (p 2 — 1)(p2 — p), it follows that #GL 2 (11peZ)= p4e-3(p2 1)(p — 1). (c) Divide the answer in (1)) by (Ape) to get p 3e-2( p2 _ 1) - 5. Use the Chinese remainder theorem. 6. N 3 rIppia — P -2 ). 7. N 3 flpiN (1 — p -2); N; N11,0,(1 — p -1 ); N 2 1-1piN (1 — 1) -1 ); Nri piN (1 + if') 8. The image of F(N) under conjugation by (/?; 1) is the subgroup of F0 (N 2) consisting of matrices whose upper-left entry is r--- 1 (mod N). 10. and 14(b). See Figures A.1—A.2. 12. Besides F and F(2), there are four, namely, the preimages of the three subgroups of order 2 in S3 and the one subgroup of order 3 under the map 1" —0 SL2 (Z/2Z)•:-., S3. (1) r0(2) = {(t, :) mod 2), F0(2) = the right half of F(2); (ii) r(2) = {(: :) mod 2 } , F°(2) = Fu T -1 Fu SF; (iii) 0(2)-.----. {(ccf bd) E..- I or S mod 2 } , fundamental domain = Fu T' Fu T' SF; (iv) {C. bdr-1- I, ST or (ST) 2 mod 21, fundamental domain = Fu T' F. 13. (a) This can be proved using the fundamental domain, as in the proof of Proposition 4 in the text. Here is another method. Let G denote the subgroup of 0(2) generated by +S, T2 . Clearly G C 0(2). Conversely, write g E (g1(2) as a word of the form +Sal T 1'IST b2 • • • STN, where al = 0 or 1 and bi # 0, j = 1, . . . , / — 1. Use induction on / to show that g E G. We work mod +1, so that S 2 = (ST) 3 = 1. Without loss of generality we may suppose a l = 0, bi # 0, since we can always multiply g on the left or right by S without affecting whether g e G. For the same reason we may suppose that b 1 ------- 4 = 1, since T 2 e G. Note that I= I or 2 is impossible, since T,TSTO 0(2). If / > 2, write g = TSP2 - - • . Since (STS)(TST) = 1, we have TST = (STS) - ' = ST' S, and so T 2 Sg = TSTb2 -1 S • • • , which is just like g but with b2 replaced by b2 — 1. If b2 > 0, use induction on b2 to finish the proof. If b2 < O, write ST'g = ST -1 ST b2 - • • = TSr2 +1 - - - , and again use induction on 'bd. (b) Use: r°(2) = F0(2) = STO(2)(ST) -1 . (c) Let G be the subgroup of generated by T2 and ST'S. Since G c= r(2), it suffices to show that r: G]= 6, e.g., that any "word" in S and T can be multiplied on the left by elements of G to obtain one of the elements cx; used in the text for coset representatives. Use S 2 = (ST) 3 = 1 and induction as in part (a). (d) Use part (c) and the isomorphism in Problem 8. 14. (e) Fo( p) is bounded by the vertical lines above l and —1; arcs of circles with diameter [0, 2] and [— 2/(2p — 1), 0]; and arcs of circles with diameter [ —1/(k — 2), —1/4
r
[
(The centers of the boundary arcs are at
—1 /2, 0, 1/6, 4/10.) %
1/2 Figure A.1
232
Answers, Hints, and References for Selected Exercises
I
1 ....Pw
...
■ ......
.......
4....
•as, .....
1
0
1/2
Figure A.2. Two possible fundamental domains for F0 (4):
I. Let F(2) be the fundamental domain for F(2) in the text. Then ŒF(2), where a = (3 1), is a fundamental domain for F0(4) = Œr(2)a -1 (see Fig. A-1). IL U__ / ocT'F is a fundamental domain for F0(4), where F is the fundamental domain for r = SL 2 (Z) given in the text, and ai are coset representatives for r modulo F0 (4). Here we have taken: a l = 1, a2 = S, a 3 = T'S, a4 = 7-2 S, a5 = 7-3 S, a 6 = ST'S. (In Fig. A.2, the number j labels Œ' F; the solid boundary arcs are centered at 1, —4, —i; the dotted arcs are centered at 0, —1, —4, --i, —4; the point P is (-7 + i0)/26.)
k = 3, . . . ,p. For example, F0 (3) is the union of the regions marked 1, 2, 3 and 4 in Figure A.2. 15. See Problems 12 and 14(c) for counterexamples. 16. (b) (i) 2; (ii) 2; (iii) 2; (iv) 1.
§111.2 1. In the sum for Gk, group together indices m, n with given g.c.d. 2. Substitute z = i in Proposition 7 to obtain: E2 (i) = 3/7r. 3. (a) Since both sides of - each equality are in a one-dimensional ilik (F), by Proposition 9(c), it suffices to check equality of constant terms. (b) Equate coefficients of q" on both sides of the ' equalities in part (a) to obtain: o-7 (n) = o.3 (n) + 120 ET.1 o-3 (j)o-3 (n — j); 110-9 (n) = —100.3 (n) + 21a5 (n) + 5040 ri',::,1 o-3 (j)o-5 (n — j); 613 (n) = 21a5 (n) — 20a7 (n) + 10080E7:4 a.5 ( j)a7 (n — j). 4 • (a) Since the left side is in S1 2 (f), by Proposition 9(d) it suffices to check equality of coefficients of q. (b)t(n) = 76556 o-1 / (n) + -a5 (n) — 0.5 ( j)o.5 (n —j). (c) Consider part (b) modulo 691. 5. Let F(X, Y) be an irreducible polynomial satisfied by X ----- E4 , Y = E6. Substituting z = i in E4 (z), E6 (z) leads to a contradiction, since E6 (i) = 0, E4 (i) * 0. 6. (b) Use part (a) with x = e- 2tt = e2ni(i), along with the relation 0 = 2(2, /)/3„, / E„.,. / (i) — 2(a1+ i)Ba+ 1 — Efz/ sria (n)q n with q = e2gi(i). '7. Use Proposition 7 and the derivative of the identity f(— 1 fz) = z kf(z). 8. (a) By Problem 7, the right sides are in m6(r) and m8(r), respectively. Now proceed as in Problem 3(a). (b) 210-5 (n) = I0(3n — 1)(73 (n) + ai (n) + 240E7:1 al (j)a3 (n — j); 20a7 (n) = (42n — 21)a 5 (n) — ai (n) + 504 E.7:1 ai (j)a5 (n — h. 9. (a) Use the fact
233
Answers, Hints, and References for Selected Exercises
that n2 1 (mod 24) if n is prime to 12. (b) Use Proposition 9(d) to show that their 24 ±1 mod 12 q — q n) = 24-th powers are equal. (c) Ens 15 mod 12 q (n2 -1)/24 .
10. j= 256(.1 -2 + I + 12 ) 3/(.1. + .1 - 1) 2 ;j= 1728 if A = I; if A = alb in lowest terms (with a and b positive) and jeZ, that means that a4 b4(a2 + 132) 2 divides 256(a4 a2 b2 + b 4) 3, but since a, b and a2 + 112 each have no common factor with a4 + a2b2 + b 4, that means that a2 b2 (a2 + b2) divides 16, and now it's easy to eliminate all possibilities except a = b = 1. 1
§111.3 4. (a) Since F0 (N) = +F/ (N) in those cases, there is no difference between F0 (N)-equivalence and fl (N)-equivalence. (b) —1/2 is an irregular cusp for F0 (4).
5. Group
ro(p)
Cusp Index
o
co
f(2)
ro(P2)
—1Ikp, k =1, 1
p
,p 1
0
oo 2
2
—
1 2
7. (a) See the proof of Proposition 18; replace a by X in (3.11). 8. (a) Replacing z by z + 1/2 in Z e21riz"2 gives Z(-1)"e'" 2 ; meanwhile, the right-hand side is 2 E e2R1z(2n)2. (b) works the same way. (c) On the left side, the constant term is clearly zero; the coefficient of qn for p ,t n is (-2/c/Bk)ak_ 1 (n); if pin but p21e n, then the coefficient is —2k/B k times 0-k _ 1 (n)— (1 + p k-i )o- (n1p) = 0; if n = pmn o with and Ono , then the coefficient of q" is —2k/B k times (pmn o) —(1 + pk-i )o-k _ i (pm -1 n o) p ic-1 0.k_, ( pm— 2no)
>1
( 1 +Pk-i )crk i(Pm-1 ) +Pk-i crk 1(P m-2)) = O. (d) Use parts (b) and (c) with p = k = 2. (e) Rearrange the infinite product of (1 _ e2icicz -1/2)n) = 1 Ong') by writing 1 + qn = (1 — qn), getting
ak---1010)(0i 1(Pm1) -
—
-
-
(where 11 denotes 11(1 — qn)). But this equals nn2even even Mall n twice an even 11 4in • 9. See the proof of Proposition 30. 10. (a) Since n8 (z)ri8 (2z)ES8(F0 (2)) by Proposition 20, to show invariance of ri 8 (4z)/n 4 (2z) under [y] 2 for y E F0(4) it suffices to show invariance of (4z)e(z)71 4(2z) under [y] 10 for y e r0(4) ; now use Problem 9 to show this. At the cusp 00 5 we see that r/8 (4z)/q4(2z) = q11(1 q4") 8/(1 q2")4 has a first order zero; to lin even • rintwice an oddinnodd
find q(0) we apply [5 ]2 : Z -2 q 8 ( - 41z)17/ 4(— 21z) = z -2 (z/4) 4 n8 (z 14)1( —(z/2) 2 114(42)) = —411(1 — q) 8/( 1 q , which approaches --1-4 as z cc. To find the value at the cusp —1, which is equivalent to the cusp = T(— 1/2), we can apply [(I ?)] 2 , as follows: —
/78 (2z + 1)-2
(
4z
(2z + 1) 4
2z + 1) = (2z + 1)' ( 2z ) 114 2z + 1
4z (2z +
64z 2
( 1 4z
1\ 2,14
2z ) ti
=
8
8
1 77 2° (2z) 16 ri 8 (z)1 8 (4z)'
1)
1 __
2z
)
24
2Z1 ) (-
e —26
)4
—1\ 4z )1 8
1
by Problem
z1) e-2ni/6 4 (
I 2z
8(e)
234
Answers, Hints, and References for Selected Exercises
which approaches as z oo
(b) E2 (ST -aSz) = E2 (S(- a -))= (a + E2( - a - 12") + Ir(- a - +), but E2 (--a - = E2 (-1), so this equals (a + 71) 2 (z 2E2(z) + + TN- a - = (az + 1) 2122 (z) - 6-7- (az + 1). (c) Obviously F1[7] 2 = F. Using Problems 1 and 10(b), we have -24F(z)1[ST -4 S] 2 = E2 (z)1[ST -4 S] 2 3E2(2z)l[5T -2 S] 2 + (E2 (2z) + (42+10 + 2(E2 (4z) 2E2(4z)I[ST-1 S12 = E2 (Z) + (42+1) (42+1)) -24F(z). We now look at the cusps. First, we clearly have F(co) ----= O. In evaluating FI[S] 2 , ignore terms which approach zero as z co ; then, using Proposition 7, obtain F(z)1[S] 2 "rd — 24E2(Z) — 3 (1E2 (Z/ ) 2(- E2(44)) -) = + we similarly ignore terms which approach zero: This is F(0). To find F( + 2(2z ± 1 ) -2 E2(1÷– 24F(z)i[ST -2 S] 2 = E2(z)I[ST -2 Sh 7 3E2 (2z)I[ST f 2) by Problem 1 above, and, by Proposition 7 and part (b), neglecting terms which = D2 E2(-ii approach zero, this becomes E2 (z) - 3E2(2z) + 2(2z + 1) -2 ( E2 (Z) — 3E2(2z) + z 2 E2 — ± 6E2 — "1i) --- 4E2( I)) by Problem 8(d); again applying Proposition 7 and neglecting small terms, we obtain E2(z) - 3E2 (2z) + -13- ( - 16E2 (4z) + 24E2 (2z) - 4E2 (z)) - t - 3 + -I ( -16 + 24 - 4) = (d) Since the only zero of ri 8 (4z)f 4(2z) is a -4, so that F( 1) = M 1) = simple zero at co, it follows by Proposition 18 that F(z)101 8 (4z)M 4(2z))€1110(F0 (4)) is en) = 1 + t )•(e)- • First show a constant. To get the identity, write (1 - 17 4n)/( that A(z + ilN)eS12m(ro(N 2 )), using the same type of argument as in the proof of Proposition 17(b). Then set f(z) = 6.(z) N 111A(z + jIN), and take the logarithmic derivative of the equality f(yz) = f(z) for y eFo (N2). 11. (a) Prove invariance under [A 2 for y E r0(4) as in Proposition 30. At the cusps, clearly 04(co) = 1; at 0 we have 04 (z)1[5 ] 2 = 2 2 04 ( - 1/(4z/4)) = z -2 (- z 2 /4)04(z/4), which approaches -I as z oo ; at .1- we have 04 (z)1[ST -2 S] 2 = (2z + 1) 2 04(z/(2z + 1)) = (2z + 1)2®4(_ 1/(4( - 1/4z - 1/2))) = (2z) 2 ® 4(- 1/4z - 1/2) by (3.4), but by Problem 8(a) this equals (2z) -2 (20( 1/z) - E(- 1/4z))4 = -1(2(20 -112 0(z/4) 0(z))4 = (0(z/4) - 0(z)) 4, which approaches zero as z co. (b) Use: 04(oo) 0 = F(co). (c) One can proceed directly, as in Problem 10(a). Alternately, write it as re (2z) and use Propositions 17 and 20 and Problem 10(a). An even 6.(2z) —
-
(
-
-
-
-
en\
.-
(n(z)g(2z))8778(4z ) ,
easier method is as follows. Letfi (z) 714(2z)M 8 (4z), and let f2(z) re ° (2z)117 8 (z)e (4z). In the solution to Problem 10(a), we saw that for cc = ST -2 S = ?) we havef2 = 16f1 1[4 2 . Sincefi is invariant under [y] 2 for y e F0 (4), it follows thatf2 is invariant under a -1 1-0 (4)a, which is easily checked to be equal to F0 (4); finally, since a keeps 0 fixed and interchanges the'equivalence classes of cusps oo and -1, we see thatf2(0) = 1 6f1(0) = - 1,f2( 16f, ( co) = o, f2(œ) = 145fi(—) = 1. (d) Same procedure as 10(d). (e) Use Problem 8(e). 12. Follow the proof of Proposition 19. 13. Write (q(z)11(3z)) 6 = . (z)(n(3z)17/ 3 (z)) 6 , (1(z)1(7z)) 3 = (z)(q(7 z)1,7 7 (z)) 3 . 14. Check the generators T2 and S, using (3.4) to get 0(Sz) = firk/Az); to check the cusp -1 = (TST)' cc, write 04(z)I[TST] 2 = 04(z/2)1[(1 ?)] 2 (see Problem 1). 15. (a) For y = e 170 (N) note that ocNyaV = (- `kb -`,,1N)e Fo (N), and so (f I [aN]01[Y]k = (f 1 [OENYŒV ]k)I1ŒNik = x(a)f [ŒN]k. But X(a) = TO). (b) For fe Mk (N, x) write f f+ ' + f , where f = ± efl[aN]k). (c) By 10(d), +.1q + - = + F; matrix is F(z)I[Œ4]2 = -- *4 8 (z)M 4 (2z) = (1 116); M; -(4, 1)' = C-10 4, M;(4, 1) = -12-404): 16. (a) E ak_ i (n)n -s = Zfrii _l_„1 -1 (fin)7 = Z Ent 5 = C(S 7 k)(s).
235
Answers, Hints, and References for Selected Exercises
I: 1/2
0 Figure A.3
Sc
SB
Figure A.4
s 4. pk-1-2s)l . (b) L1(s) = n[(1 — p)(1 — Pk-1-5)r =14( 1 — ak-i(P)P(c) L1 x (s) = 11 19[0 — X( P)P - s)( 1 —
= np( 1 — X(P)ak-i(P)P -s + X(P) 21)4-1-25) -1 . 17. (a) Any point r'-equivalent to i must be F0 (4)-equivalent to one of the points aj-1 i, where 1" = Uï... 1 airo (4). In this way, find that i, (2 + i)15 e Int F' are F-equivalent to i; co and (5 + i.11)/14E Int F' are F-equivalent to co ;. the two F0 (4)-equivalent boundary points (— I + 012 and (3 + 0/10 are F-equivalent to i; and no boundary points are F-equivalent to co. (b) Follow the proof of Proposition 8, but with slightly more involved computations. Note that the three "corners" (-3 + i. 1 ) /4, (1 + i0)/4, (9 + i0)/28 are all F0 (4)-equivalent, and the sum of the angles at the three corners is 360 0 . To illustrate the elements in the integration around the cusps and along the circular arcs, let us compute (27ri)-1 f f ' (z)dz If(z) over the contour ABCDEF pictured in Fig. A-3, where we suppose that f(z) has no zeros or poles on the contour (but may have a zero or pole at the cusp 0), and we ultimately want the limit as E --+ O. Here BC is a circular arc of radius e centered at O. The element a l = (1 ?) E F0 (4) takes the arc from 'A to O to the arc from D to O. The image of B is very close to C, and so we can use the integral from D to a l B to approximate the integral from D to C. First, we use S to take the arc BC to a horizontal line between Re z = —3 and Re z = 1. (see Fig. A.4). By (2.24), we have
f : fj
.
z z) )
dz =--- is: f l i dz — k f c dz . B
Z
But the latter integral approaches 0 as E -4 0 (since the angle of the are BC approaches zero), while the first integral on the right'approaches —v o(f ) (as in the proof of Proposition 8 , but we use the map 44 = e21644 into the unit disc). Next, by
236
Answers, Hints, and References for Selected Exercises \
(2.24), we have
= _ k f B dz j A z+ 1/4
fBf/(z)dz +fDr(z)dz,, l'Brkl(tz fœlitLikIdZ
J 4 f(z)
c f(z)
JA f(z)
--, _k J /
0
j Go f(z)
dz z+ 1/4
= -k
I,
1/4
dz
.
— 2 -1-isrS)/4 Z
Similarly, using a2 = a 1) e r0(4) to take the arc DE to the arc FE, we find that the sum of the integrals over those two arcs is equal to - k times the integral of l• from (2 + i0)/28 to i. These two integrals are evaluated by taking in z between the limits of integration, where the branch is determined by following the contour. As a result, we find that the sum of the two integrals is - k times the logarithm of
1/4
1/4 --1 0)/4 (2 + i.0)/28 (-2+
•
where, if we keep track of the contour, we see that we must take ln( - 1) = - ni. Thus, (20 -1 times the sum of these two integrals is equal to k/2. For a systematic treatment of formulas for the sum of orders of zero of a modular form for a congruence subgroup, see [Shimura 1971, Chapter 2]. (c) The only zero of 04 is a simple zero at the cusp -1; the only zero of F is a simple zero at co. (d) If fE M2 (ro(4)), apply part (b) to the element f - f(cc)0 4 - 16f( - - -1) Fe M2 (ro (4)), which is zero at -1 and co, and hence is the zero function. (e)-(f) See the proof of Proposition 9. (g) Look at the value at each cusp of a012 + b08 F + c04 F2 + dF3 : f(cc) = a, f(-1) = dI163 , so a = d= 0; then f= 1(0) = (--4 - ) (-4)( -I), so c ------ -16b. (h) Let f(z) = ql 2 (2z). Since f(z) 2 = A(2z)e M12( 1-0( 2 )), we immediately have vanishing at the cusps. Let Œ = (_.1 ?). By writing 2az = -1/(1 - 1/2z), show that f i[a]6 = -f, and so fit M6 (r'0(2)). However, r0(4) is generated by T and a2. Next, since s6(r0 (4)) =C(08 F- 1604F2), to check the last equality in part (h), it suffices to check equality of the coefficient of q. 18. (b) Iffe M 1 (4, X), by subtracting off a multiple of ®2 we may suppose that f(co) = 0; then M2 (4, 1)f 2 1.--- a?q2 + - • - . But a multiple zero at co Would contradict Problem 17(b), unless f is the zero function. (c) First show that, for k _._. 5 odd, Sk (4, x) = 0 -2 Sk+I (4, 1); then, by Problem 17( 1 ), dim Sk (4, X k) --= [(k - 3)12] (where [ ] is the greatest integer function). (d) Use Proposition 20, Problem 17 (with F' = F0 (2), a = CI 7)), and part (c).
§111.4 2. Show that V- i(cz + d)/V - 4- cz - d) = rsg" (it suffices to check for z = 0), and then divide into four cases, depending on the sign of c and d. 3. (b)-(c) Use the (. 0 a/3z) 0(43z) 0(i3z) relation 0(z) - 0(/3z) 0(z) .
§111.5 2. (a) By (5.27) and (5.28), Tn(f i[ŒN]k) = d k/2" Ea,bf 1[ŒNcra(c; ba) ]k . Let , Œa ,b = anaN ajg riDoc-IV ENV, tl}, Z), and show that the Œ,b are in distinct cosets of r;(N) in An(N, {1}, Z); and hence form a set of representatives; so, by (5.27),
Answers, Hints, and References for Selected Exercises
237
(Tnf)i[aidic = n(42 " Ea,bfl[cca,baN]k = n (W2)-1 Ef i[cr naN0-.(8 !)I beCaUSe fl[Crac = f. (b) TF = 0, T204 = 04 + 16F, so {F, F + A.Ø 4} is a normalized eigenbasis. (c) (7'2 F)i[Œ4] 2 = 0 # T2(FI[Œ4]2)- (d) Any eigenvector of T2 or [M k must be an eigenvector of T„ ; this includes F,iF +2-40 4 , C04, and so all of M2(r0( 4 )); by Proposition 40 applied to F, we have An = al (n) (n odd). (e) Let 04 (z) = Eanqn. For n odd, a, = (n)a i = 8u1 (n) by Proposition 40. For n = 2no twice an odd number, compare coefficients of qn0 in T2 04 = 04 + 16F to get an = a 16a1 (n0) = 24d1 (n0). For n = 2n 1 divisible by 4, compare coefficients of qi_'1 in T2 04 = 04 + 16F to get a„ = an, +13 = 24o-1 (n0) by induction. 3. The first part follows from (5.19) with n = 0; a counterexample for other cusps is in Problem 2(b), since Co4 + 16F does not vanish at s = —1/2. 4. (a) Both 1,, f and Tnf are equal to n(142 ". Eflk(401 M k over the same a, b, d, because g.c.d. (n, M) = 1. Note that g = M -42 f 1[01 M k . Thus, AP/2 T g = n(42)-1 E f 1[( ?)o-jg 'la, where ?,) mod MN. Set aa in the sum for T,, f equal to (411 ?)aj ig -?)'. Then = ((k/2I Ef f[aa(1 ?)(sSI !)O' ?)--I]ol oof ?)],= ( 7nf)1[0o1 ?ilk, because in (f; bia4) = (11 ?)(iO4 ?)--1 the entry bM runs through a complete set of residues mod d. (b) Since s8(r0 (2)) = Cf, f is an eigenform for Ta ;.then by part (a), g is an eigenform with the same eigenvalue. (c) T2 9 = U2 V2 f = f; T2 f = —8f by a comparison of coefficients. Hence {f, 8g +f} is the normalized eigenbasis. 5. <7, f, g> = < p0/ 2)-1f1[( )] kl g> = Ef:
(11■1
[
12 = q
24. 43q 2 + 4.9.49.139e + 64.171337e + • ; T2f1 = q + 1080q 2 + • • • =f2 + 2112f; T2 f2 = —24 . 43q +(64 .171337 + 2 23)q2 + - - = —1032f2 + 29367211(0 Tr T2 = 1080, Det T2 = — 2 ". 3 2 • 2221; eigenvalues = 540 + 12V144169; normalized eigenforms aref2 + (131 ± J144169) 12f1. (d) 2f2 + 3144fi . (e) From part (d): 2(4 . 9 -49 139) — 48 . 3144 = 89 5-23 41. Alternately, compute T3f1 = —48f2 + 36 - 2619f1 5 T3 f2 = 36 . 681112 *f (we don't care about *), so Tr 7'3 = 36(2619 + 6811) = 8 .9 .5 . 23 - 41. 8. In case of difficulty, see §3.1(c) of [Serre 1973 ].
NV.
= «73 ?), n2-114) ((: ()cil " d) ((141n) m 114) = ((;„, ()V czlm + d) by (1.4). (b) yi = (J„, mdb), so "j), = 1), ()E;i 1 Niczim - + d)= ?), (5)). Thus, they are equal if m is a square mod d. For m not a perfect square, to find y for which = 135" p -1 (1, —1), it suffices to find d prime to 4m such that (7) = —1, since one can then find y of the form Lam 'De ro (4m). To do this, first let m = 24mm 0i, where s = 0 or 1 and m o is squ-arefree. Choose d Ez- 1 mod 8 and d do mod mo , where do is chosen so that
1. (a)
(t) = —1. Then one easily checks that (7) = (ia) = (4) = — 1, as desired. 2. Replacing y by —y, if necessary, without loss of generality we may suppose that b> 0 or else b == 0 and a> 0. (a) = (ev N 114Vinf 1:1), (DE» cz + d)-
((-1 1r), —iNi/4,/-z-)=((z ;-,),N14-,iect:::-.:(5)evN cz + d)((_? le) Note that \I(az + b)I(cz + d) - cz + d = q i Vaz + b with 1 1 = —1 if a .< 0' and c 0, th = 1 otherwise. Thus,
238
.
Answers, Hints, and References for Selected Exercises
pi'P-1 z--- (( 2Nb - r), ti 1N '14( E V 1/ a ( — 11Nz)+ b(—iN 114\li))= , 0 4vb 'IN), q i (D8 d-- ' — Nbz + a). (b) We have .•j3 1 = ((-daNb - `,;IN ), (411)--)e c-1- 1 \I — Nbz + a). Since ad — bc = 1 and 41c, we have a 2_ ---: d mod 4, and so ea = ed. Thus, it remains to compare ( 1%16 ) with Ili a We suppose c > 0 (an analogous argument gives the same result if c < 0). Then
r). Now
- fivb ) = ( 04)(7) = (--&) — 1, and (4-) = (÷), so that (-=-(N (N fr (1b )( ) = )( 1 71 — ) sgn a =--- (Dth. Thus, yi = pj3p -1 ( ( ?), (1)). Again ,-, and p,- p -1 are equal if N is a perfect square; otherwise there exist y for which the two differ by (1, — 1). 3. At oo, p = 1, h = 1, t = I. At 0, take p = ((? I), N[Z-), so p" = ((_? 16), —i N/i), and we need f 0 (4) p(( in, Op = 7), -iiv-tiz+ h . VZ) = ((rh1 ), 1 ,I z + h) (( 4 L), -il)= ( (( .7), —it,lhz — I). This is in fo(4) if /i = 4, 1= 1. Finally, at the cusp —I take a = (_1 ?), p = ((A ?), V —2z + 1), so p" = (0, V2z + 1). Since aa Da' er0 (4), we can take h = 1. To find t, compute p(( 1), 0 p' — ((_,1 4), :J-4z — 1), which is j((_ _1), z) provided that ! = i. S. (a) Ski2(f.0(4)) = 0 if k < 9• For k ,.._ 9, it consists of elements of the form OF(I6F — 04)P(0, F), where P is a polynomial of pure weight (k — 9)/2. Thus, for k .... 5, dim Sk2 (r0 (4)) = [(k — 5)14]. (b) Since ! = 1 at the cusps co and 0, there are always those two regular cusps. Since 1 = i at the cusp —4, that cusp is k-regular if and only if ik = 1, i.e., 41k. But I + [k/4] — [(k — 5) 14] = 2 if k ,... 5, 4,1'k and = 3 if k ._._ 5, 41k, as call be verified by checking for k = 5, 6, 7, 8 and then using induction to go from k to k + 4. (c) ®F(® 4 — 16F)(0 4 — 2F). 6. (a) If d'E_-- d mod N and Nj4 is odd, then (1) = (11..) = ( 1 r//4 - 1)(d-I W2( Ndi4 ) = N If N/4 is even, then d' -2E d mod 8, and the proof works (_._ 1)(NI4-1)(d' -1)12(4) = .ur) the same way, with the additional observation that (i) = (-,÷). (b) The proof that the h
(
.. .
cusp condition holds for f 1[4,12 is just like the analogous part of the proof of Proposition 17 in §I11.3. Now let y = C. 'Del-004 and let yl = ( -dbN -eir). By 4. Problem 2 above, we have p53 p-1 = (1, xpl(d))i, 1 . Thus, (f i [P]42) i Vilk/2 "4"--(f 1 [ei'P -1 ]ki2)1[P]k/2 = A(d)(fi Di ik/2)1[P]ko z--- X ki (d);((a)f 1[P]ka. But since ad ÷ .- --_-. 1 mod N, we have x(a) = .k(d). Thus, (f i [P]k2)1[A42 = )---X il v(d)f I [P],42, as desired. 7. 0(co)- = 1, ®(— 1/2) = 0, c(0) = (1 - 012. §IV.2 1. E, 2 (co) = 1, Eki2( 0) = Ekt2 (— 1/2) = 0; Fic/2(0) = (if)_k ; 12(00) = F142(— 1/2) = O. 5. If one uses (2.16) and (2.19) rather than (2.17) and (2.20), then the solutions a and , fl to the resulting 2 x 2 equations involve x, which depends on 1. 6. By Proposition 8, it suffices to show this for squarefree. In that case use (2.16) and . (2.19) to evaluate the /-th coefficient of Ekg + (1 ± ik)2 -k12 F142. 7. lika = ((I — 22) + ( I — 2)q + - . . ; C(1 — A) = 0 if and only if 2 .._ 3 is odd. 9. Use Problem 1 above and Problem 7 of §1V.1 to find a and b so that Co" — aEk12 — bFkg vanishes at the cusps. a = 1, b = (1 + 0'2-k12. e5 = £512 _ (1 + i)/N/2 F512, 0 7 = E712 + (1 — i)/V2 F7/2 .
t
§IV.3 2. The computation is almost identical to that in Problem I in §IV.1. 3. (c) By the lemma in Proposition 43 in §111.5, right coset representatives for ri (N) modulo fl (N) n cC i r'1 (N)Œ, where a = (01 p, are ah = (01 pb,), 0 -._ b < pv, By Problems 1(b)
239 -,
Answers, Hints, and References for Selected Exercises
and 3(b), we use the ab to get representatives for r,(N) modulo ri (N) n r-v i ri (N)ÇV. Since ab = (01 :Oa ), we have Tp v f(z) = p“k14- ' ) • Ebfl[(( (i) :r), Pvil • ( ( i), 1 )L2 z ---- p f( z,-?) ------ E pin ane2xinziPv. The same argument works for Ti v, since the corresponding t is trivial even when 8,f'N. 4. (3.5) gives bn for half integer weight 102; in the case of integer weight k, the formula (5.19) of
§111.5 with m .„.. p 2 gives b„ ...._ ap,n+ x(p)pk-i an+ x ( p 2)p 2k- 2 a nfp 2 . If k is
formally replaced by 1c/2 here, the middle term on the right differs by )( NI ).1„(p)j) from the middle term on the right in (3.5). 5. Let y = (ca 1:1)e ro (N) be such that p2 11). Then for yi e 1", (N) we have .1g2:iiij A'i.. = 1:p2 'g p 21 42 :1) I 53.i : By Problem 2, we have i,2).)„.-2 1 = with Vi = ( p4j2c bliiii 2) e ro(N). Let t y' yiy E1"1 (N). Then D
(fi [f1'1 (M4411001/2)1[342 = Efl[42'Plii)/ki2 i
= EA Dl
pl2;(jk/2
-----
X(d)
Zil Rp2 Mk/2 •
But one checks that the correspondence ri (N)421`)i i-- f1 (N) p2ti permutes the right cosets in r, 0\042 f-, (N), and so this equals x(d)fl[f-i (N),2 fi (N)L2 . (Verify that if ti and Tx are in the same right coset of 1-1 (N) n a - -1 r 1 (N)a in T1 (N), where a =. (01 :2), then so are y; = rciy and yi, = yt.r y'.) Thus, Tp 2 .T1[342 = X(d)72f it remains to note that Fo (N) is generated by 1"1 (N) and y = (ea e fo (N) for which p2 1b. Hence 1p2 fl[fl ki2 = x(d)Tp2 f for all y = (ca dI' ) e ro(N).
Bibliography
R. Alter, The congruent number problem, Amer. Math. Monthly 87 (1980), 48-45. G. E. Andrews, The Theory of Partitions, Addison-Wesley, 1976. N. Arthaud, On Birch and Swinnerton-Dyer's conjecture for elliptic curves with complex multiplication, Compositio Math. 37 (1978), 209-232. E. Artin, The Gamma Function, Holt, Rinehart & Winston, 1964. E. Artin, Collected Papers, Addison-Wesley, 1965. A. G. van Asch, Modular forms of half integral weight, some explicit arithmetic, Math. Anna/en 262 (1983), 77-89: A. O. L. Atkin and J. Lehner, Hecke operators on ro (m), Math. Anna/en 185 (1970),
. 134-16a. R. Bellman, A Brief Introduction to Theta Functions, Holt, Rinehart & Winston, 1961. B. J. Birch, Conjectures on elliptic curves, Amer. Math. Soc. Proc. Symp. Pure ,Math. 8 (1963), 106-112. B. J. Birch, Heegner points on elliptic curves, Symp. Math., 1st. d. Alta Mat. 15 (1975), 441 445. B. J. Birch and H. P. F. Swinnerton-Dyer, Notes on elliptic curves 1 and 11, J. Reine Angew. Math. 212 (1963), 7-25 and 218 (1965), 79 - 108. Z. Borevich and I. Shafarevich, Number Theory, Academic Press, 1966. A. Brumer and K. Kramer, The rank of elliptic curves, Duke Math. J. 44 (1977), 715-742. J. W. S. Cassels, Diophantine equations with special reference to elliptic curves, J. London Math. Soc. 41 (1966), 193-291. J. W. S. Cassels and A. Fr6hlich, eds., Algebraic Number Theory, Academic Press, 1967. J. Coates, The arithmetic of elliptic curves with complex multiplication, Proc. Int. Cong. Math. Helsinki (1978), 351-355. J. Coates and A. Wiles, On the conjecture of Birch and Swinnerton Dyer, Inventiones , Math. 39 (1977j, 223-251. H. Cohen, Sommes de carrés, fontions L et formes modulaires, C- R. Acad. Sc. Paris 277 (1973), 827-830. , H. Cohen, Sums involving the values at negative integers of L-functions of quadratic characters, Math. Anna/en 217 (1975), 271-285. H. Cohen, Variations sur un thème de Siegel et Hecke, Acta Arith. 30 (1976), 63 93. -
-
Bibliography
241
H. Cohen and J. Oesterlé, Dimensions des espaces de formes modulaires, SpringerVerlag Lecture Notes in Math. 627 (1976), 69-78. H. Davenport and H. Hasse, Die Nullstellen der Kongruenz-zetafunktionen in gewissen zyklischen Fallen, J. Reine Angew. Math. 172 (1935), 151-182. P. Deligne, Valeurs de fontions L et périodes d'intégrales, Amer. Math. Soc. Proc. Symp. Pure Math. 33 (1979), Part 2, 313-346. Serre, Formes modulaires de poids 1, Ann. Sci. Ecole Norm. Sup. P. Deligne and 7 (1974), 501-530. L. E. Dickson, History of the Theory of Numbers, Chelsea, 1952. P. G. L. Dirichlet, Werke, Chelsea, 1969. B. Dwork, On the rationality of the zeta function, Amer. J. Math. 82 (1960), 631-648. M. Eichler and D. Zagier, On the Théory of Jacobi Forms, Birkhauser, 1984. Y. Flicker, Automorphic forms on - covering groups of GL(2), Inventiones Math. 57 (1980), 119-182. S. Gelbart, Weil's Representation and the Spectrum of the Metaplectic Group, Lecture Notes in Math. 530, Springer-Verlag, 1976. D. Goldfeld, Sur les produits partiels eulériens attachés aux courbes elliptiques, C. R. Acad. Sc. Paris 294 (1982), 471-474. D. Goldfeld, J. Hoftstein and S. J. Patterson, On automorphic functions of half-integral weight with applications to elliptic curves. In : Number Theory Related to Fermat's Last Theorem, Birkhauser, 1982, 153-193. D. Goldfeld and C. Viola, Mean values of L-functions associated to elliptic, Fermat and other curves at the center of the critical strip, J. Number Theory 11 (1979), 305-320. D. Goldfeld and C. Viola, Some conjectures on elliptic curves over cyclotomic Trans. Amer. Math. Soc. 276 (1983), 511-515. R. Greenberg, On the Birch and Swinnerton-Dyer conjectures, Invenriones Math. 72 (1983), 241-265. B. H. Gross, Arithmetic on Elliptic Curves with Complex Multiplication, LectureNotes in Math. 776, Springer-Verlag, 1980. B. H. Gross and D. Zagier, On the critical values of Hecke L-series, Soc. Math. de France Métn. No. 2 (1980), 49-54. B. H. Gross and D. Zagier, Points de Heegner et dérivées de fonctions L, C. R. Acad. Sc. Paris 297 (1983), 85-87. R. C. Gunning, Lectures on Modular Forms, Princeton Univ. Press, 1962. R. K. Guy, Unsolved Problems in Number Theory, Springer-Verlag, 1981. G. H. Hardy, On the representation of a number as the sum of any number of squares, and in particular of five or seven, Proc.Nat. Acad. Sci. 4 (1918), 189-193. (Collected Papers, Vol. 1, 340, Clarendon Press, 1960.) G. H. Hardy, On the representation of a number as the sum of any number of squares, and in particular of five, Trans. Amer. Math. Soc. 21 (1920), 255-284. (Collected Papers, Vol. L, 345, Clarendon Press, 1960.) G. H. Hardy and E. M. Wright, An Introduci;on to the Theory of Numbers, 4th ed., Oxford Univ. Press, 1960. R. Hartshorne, Algebraic Geometry, Springer-Verlag, 1077. H. Hasse, Number Theory, Springer, 1980. E. Hecke, C1ber die Bestimmung Dirichletscher Reihen durchihre Funktionalgleichung, Math. Annalen 112 (1936), 664-699. (Math. Werke, 591-626.) E. Hecke, Herleitung des Euler-Produktes der Zetafunktion und einiger L-Reihen aus ihrer Funktionalgleichung, Math.-Annalen 119 (1944), 266-287. (Math. Wei--ke, 919-940.) E. Hecke, Lectures on the Theory of Algebraic Numbers, Springer-Verlag, 1981. E. klecke, Lectures on Dirichlet Series, Modular Functions and Quadratic Forms, Vatfdenhoeck_ and, Ruprecht, 1983. K. Ireland and M. Rosen, A Classical Introduction to Modern Number Theory, Springer-
242
Bibliography
Verlag, 1982. N. Katz, An overview of Deligne's proof of the Riemann hypothesis for varieties over finite fields, Amer. Math. Soc. Proc. Symp. PUre Math. 28 (1976), 275-305. N. Katz, p-adic interpolation of real analytic Eisenstein series, Annals of Math. 104 (1976), 459-571. N. Koblitz, p-adic Numbers, p-adic Analysis, and Zeta-Functions, 2nd ed., SpringerVerlag, 1984. N. Koblitz, p-adic Analysis: a Short Course on Recent Work, Cambridge Univ. Press, 1980. N. Koblitz, Why study equations over finite fields?, Math. Magazine 55 (1982), 144-149. W. Kohnen, Modular forms of half integral weight on 1-0 (4), Math. Anna/en 248 (1980), 249-266. W. Kohnen, Beziehungen zwischen Modulformen halbganzen Gewichts und Modul. formen ganzen- Gewichts, Bonner Math. Scriften 131 (1981). W. Kohnen, Newforms of half-integral weight, J. Reine und Angew. Math. 333 (1982), 32-72. W. Kohnen and D. Zagier, Values of L-series of modular forms at the center of the critical strip, lnventiones Math. 64 (1981), 175-198. S. Lang, Algebraic Number Theory, Addison-Wesley, 1970. S. Lang, Elliptic Functions, Addison-Wesley, 1973.. S. Lang. Introduction to Atodular Forms, Springer-Verlag, 1976. S. Lang, Sur la conjecture de Birch-Swinnerton-Dyer (d'après J. Coates et A. Wiles), Sém. Bourbaki No. 503. In: Springer-Verlag Lecture Notes in Math. 677 (1978), 189-200. S. Lang, Elliptic Curves Diophantine Analysis, Springer-Verlag, 1978. S. Lang, Units and class groups in number theory and algebraic geometry, Bull Amer. Math. Soc. 6 (1982), 253-316. S. Lang, Algebra, Benjamin/Cummings, 1984. W. J. LeVeque, Fundamentals of Number Theory, Addison-Wesley, 1977. Yu. I. Manin, Cclotomic fi elds and modular curves, Russian Math. Surveys 26 (1971), 7-78. Yu. I. Manin, Modular forms and number theory, Proc. Int. Cong. Math. Helsinki (1978), 177-186. D. Marcus, Number Fields, Springer-Verlag, 1977. Modular Functions of One Variable IV, Lecture Notes in Math. 476, Springer - Verlag, 1975. C. J. Moreno, The higher reciprocity laws: an example, J. Number Theory 12 (1980), 57-70. D. Mumford, Algebraic Geometry I: Complex Projective Varieties, Springer-Verlag, 1976. S. Niwa, Modular forms of half-integral weight and the integral of certain thetafunctions, Nagoya MO. J. 56 (1975), 147-161. A. Ogg, Modular Forms and Dirichlet Series, W. A. Benjamin, 1969. R. A. Rankin, Modular Forms and Functions, Cambridge Univ. Press, 1977. B. Schoeneberg, Elliptic Modular Functions: an Introduction, Springer-Verlag, 1974. J.-P. Serre, Formes modulaires et fonctions zeta p-adiques, Springer-Verlag Lecture Notes in Math. 350 (1973), 191-268. J.-P. Serre, A Course in Arithmetic, Springer-Verlag, 1977. J.-P. Serre and H. M. Stark, Modular forms of weight 1/2, Springer-Verlag Lecture Notes in Math. 627 (1977), 21-67. L R. Shafarevich, Basic Algebraic Geometry, Springer-Verlag, 1977. G. Shimura, Introduction to the Arithmetic Theory of Automorphic Functions, Princeton Univ. Press, 1971. G. Shimura, On modular forms of half-integral weight. Annals of Math. 97 (1973),
Bibliography
243
440-481.. G. Shimura, Modular forms of half integral weight, Springer-Verlag Lecture Notes in Math. 320 (1973), 59-74. T. Shintani, On construction of holomorphic cusp forms of half-integral weight, Nagoya Math. J. 58 (1975), 83-126. C. L. Siegel, On Advanced Analytic Number Theory, Tata Institute (Bombay), 1961. N. M. Stephens, Congruence properties of congruent numbers, Bull. London Math. Soc. 7(1975), 182-184. H. P. F. Swinnerton-Dyer, The conjectures of Birch and Swinnerton-Dyer and of Tate. In : Proc. Conf Local Fields, Springer-Verlag, 1967. J. Tate, The arithmetic of elliptic curves, Inventiones Math. 23 (1974), 179-206. J. Tunnel], A classical Diophantine problem and modular forms of weight 3/2, 'liven 72 (1983), 323-334. -honesMath. M.-F. Vignéras, Valeur au centre de symmétriè des fonctions L associées aux formes modulaires, Sém. Delange-Pisot-Poitou, 1980. B. L. van der Waerden, Algebra (2 vols.), Frederick Ungar, 1970. J. L. Waldspurger, Correspondance de Shimura, J. Math. Pures et Appl. 59 (1980), 1-132. J. L. Waldspurger, Sur les coefficients de Fourier des formes modulaires de poids demi-entier, J. Math. Pures et Appl. 60 (1981), 375-484. R. J. Walker, Algebraic Curves, Springer-Verlag, 1978. L. Washington, Introduction to Cyclotomic Fields, Springer-Verlag, 1982. A. Weil, Number of solutions of equations in finite fi elds, Bull. Amer. Math. Soc. 55 (1949), 497-508. (Collected Papers, Vol. 1, 399-410.) A. Weil, Jacobi sums as "Grôssencharaktere," Trans. Amer. Math. Soc. 73 (1952), 487-495. (Collected Papers, Vol. II, 63 - 71.) A. Weil, On a certain type of characters of the idèle-class group of an algebraic number-field, Proc. Intern. Symp. on Alg. Num. Theory, Tokyo-Nikko (1955), 1-7. (Collected Papers, Vol. II, 255-261.) A. ,,i1, Ober die Bestimmung Dirichletscher Reihen durch Funktionalgleichungen, Math. Anna/en 168 (1967), 149-156. A. Weil, Review of "The mathematical career of Pierre de Fermat" by M. S. Mahoney, Bull Amer. Math. Soc. 79 (1973), 1138-1149. (Collected Papers, Vol. Ili, 266-277.) A. Weil, Basic Alumber Theory, 3rd ed., Springer - Verlag, 1974. A. Weil, Sur ics ,,ommes de trois et quatre carrés, L'Enseignement Math. 20 (1974), 215-222. (Co/It (l ('d Papers, Vol. 111, 303-310.) A. Weil, La cyclotom:c jadis et naguère, Sém. Bourbaki No. 452. In : Springer-Verlag Lecture Notes in Math. 431 (1975), 318-338 and in l'Enseignement Math. 20 (1974), 247-263. (Collected Papers, Vol. III, 311-327.) A. Weil, Sommes de Jacobi et caractères de Hecke, Gait. Nadir. Nr. .1 (1974), 14 pp. (Collected Papers, Vol. III, 329-342.) A. Weil, Number Theory: an Approach Through History from Hammurapi to Legendre, Birkhauser, 1983. E. Whittaker and G. Watson, A Course of Modern Analysis, 4th ed., Cambridge Univ. Pre ss, 1958. D. Zagier, Nombres de classes et formes modulaires de poids 3/2, C. R. Acad. Sc. Paris 281 (1975), 883-886. D. Zagier, Modular forms associated to real quadratic fields, Inventiones Math. 30 (1975), 1-46. D. Zagier, On the values at negative integers of the zeta function of real quadratic fields, Enseignement Math. 22 (1976), 55-95. D. Zagier, Modular forms whose Fourier coefficients involve zeta-functions of quadratic fields. gnrir 'r Lecture Notes in Math. 627 (197;6, 105-169.
Index
Addition law, 7, 29-35 Affine variety, 52 coordinate ring of, 55 Algebraic geometry, 25-26
Algorithm for congruent number problem, 4, 5, 222 semi-algorithm, 5 Automorphy factor, 148, 152, 153, 177-178
Bernoulli
numbers, 110 polynomials, 54 Bezout's theorem, 32 Birch-Swinnerton-Dyer conjecture, 3, 46, 9093, 218, 221, 222 Branch points, 21, 25
Character additive, 57, 62 conductor, 67 Dirichlet, 62, 75 of group, 61 multiplicative, 57, 62 primitive, 62 quadratic, 82, 176, 187-188, 191-192
/
trivial, 57 Class number, 176, 194, 218 relations, 194 Coates, J., 92 -Wiles theorem, 92, 96, 221 Commensurable subgroups, 165 Complex multiplication, 42, 50, 92, 124, 143, 222 Conductor of character, 67 of elliptic curve, 143 Congruence
subgroup, 99-100 principal, 99 zeta-function, 51, 52 Of elliptic curve, 53, 59 Congruent
number, 1, 3, 5, 46, 70, 92, 221-222 number problem, 1, 2, 4, 221-222 generalized, 8, 123-124, 223-224 Coordinate ri ng, 55 Critical value, 90, 95, 193, 215, 216, 217 Cusp, 103, 106, 108, 126 condition, 125-126, 180-182 form, 108, 117-118, 125, 127, 155, 182 irregular, 144, 182 regular, 144, 1-74, 182
246
Cyclotomic fields, 37
Dedekind eta-function, 78, 121, 122 zeta-function, 56, 88, 89 Deligne, P., 53, 122, 164 111-112, 122, 164 Diagonal hypersurface, 56 Different. 89 Dilogarithm, 76, 78 Diophantus, 1 Dirichlet L-series, 75, 188, 190, 193 series, 80, 141 and modular forms, 140-143 theorem (primes in arithmetic progression), 45, 142 Discriminant modular form, 111-112, 12"1 , 164 of polynomial, 26 Double coset, 165, 204 Doubly periodic, 14
Eigenforms for Hecke operators, 163, 173-174, 201 Euler product, 163 half integer weight, 210-211, 214 normalized, 163 for involution of M2(r0(4)), 146 Eisenstein, F., 177 Eisenstein series, 109-110, 123, 154, 164, 174, 185 of half integer weight, 186-188, 193 Euler product, 199-201 of level N, 131-134 L-function of, 146 normalized, 111, 122 Elementary divisor theorem, 202 Elliptic curve, 9, Il addition law, 7, 29-35 additive degeneracy, 36 complex multiplication, 42, 50, 92, 124, 143, 222 over finite fields, 40-41, 43 inflection points, 13, 35, 41 Legendre form, 224 multiplicative degeneracy, 36 points of order N, 21, 36, 38-40 rank, 44, 46, 51, 91 torsion subgroup, 36, 43.-44, 49-50 Weierstrass form, 24, 26, 33, 120
Index
functions, 14-16, 18, 25 integrals, 27-29, 217 point, 102, 146 -ri(z), 78, 121, 122 Euclid, I
Fermat, 2, 96 curve, 56 last theorem, 2, 5 Fields cyclotomic, 37 of division points, 37 finite, 40 Fourier transform, 71, 83 for finite group, 76 Fractional linear transformation, 98, 102 Functional equation Dedekind eta-function, 78, 121 Dedekind zeta-function, 88, 89 Dirichlet L-series, 77-78 Hasse-Weil L-series, 81, 84, 90, 91 L-series of modular forms, 140-143, 216 Riemann zeta-function, 73-74 theta-functions, 73, 76-78, 85, 88-89, 124 Fundamental domain, 100, 103, 105-107, 146, 231-232 parallelogram, 14
0(X), 107, 142, 148 Galois action on division points, 37-38, 42, 50 Gamma-function, 70-71 Gauss lemma (on quadratic residues), 136 sums, 56, 62, 67-68, 188 Gaussian integers, 14, 41, 42, 65, 165 General linear group, 38, 98 Genus, 53.-54 Good reduction, 43, 90 Grassmannian, 55 Greenberg, R., 222 Gross, B. H., 93, 222
Hardy, G. H., 177 Hasse-Davenport relation, 60, 62-63, 70 Hase-Weil L-function, 3, 61, 64, 75, 79, 81, 84, 90, 141 and modular forms, 143 Hecke, E., 88, 141, 142 character, 81 L-series, 81
247
index
Hecke, E. (cont.) operators, 155, 156, 158, 167, 202 algebra of, 157, 210 Euler product, 158, 160 in half integer weight, 168, 201, 206207, 210 Hermitian, 168, 172 on q-expansion, 161, 163, 207 trace, 175 via double cosets, 167, 168, 202 Heegner, K., 92-93 Homogeneous polynomial, 10, 12, 52 Hurwitz, A., 194 Hypergeometric series, 29 ,-
Irregular cusp, 144, 182 Isotropy subgroup, 102
Jacobi. K., 112 forms, 194 sums, 56, 57, 61 triple product, 219 j-invariant, 105, 119-120, 123-124
Kohnen, W., 214 plus-subspace, 213-214 -Shimura isomorphism, 201, 213-216 -Zagier theorem, 216 Kronecker, L., 194
Lattice, 14, 21, 89, 153 dual, 89 Legendre form of elliptic curve, 224 symbol, 60, 65, 82, 136, 147, 178, 187-188 Level, 99 L-function Dirichlet, 75, 77-78, 188, 190, 193 Hasse-Weil, 3, 61, 64, 75, 79, 81, 84, 90, • 141 of modular form, 140-143, 216 Linear group general, 38, 98 special, 98 Line at infinity, 10-11 Liouville's theorem, 15
Me llin transform, 71, 84, 85, 139
inverse, 142 Möbius function, 188 Modular curve, 91 form, 96-97, 108-109, 117-118, 125, 127, 155 with character, 127, 136-139, 183 and Dirichlet series, 140-143 Euler product, 163 of half integer weight, 3, 176, 178, 182 weight one, 165 function, 108, 119, 125, 127, 155, 182 group SL 2(Z), 99 point, 153 Mordell theorem, 43-44 -Weil theorem, 44 Multiplicity one, 173-174, 214
New form, 174 Niwa, S., 212, 214, 215
Pentagonal number theorem, 123 Petersson scalar product, 168, 169-170, 216 Points at infinity, 10, 11, 13, 103 on elliptic curve, 11, 12 Poisson summation, 72, 83 Projective completion of curve, 11 line, 11, 12, 98, 105 plane, 10 dual plane, 12 space, 11 variety, 52 Pythagoras, 1 Pythagorean triple, 1-2, 3, 5, 8 primitive, 5, 6-7
_ q-expansion, 104, 109, 125, 126 Quadratic character, 82, 176, 187-188, 191-192 form, 176-177, 221 reciprocity-, 66, 82, 153 residue symbol, 60, 65, 82, 136, 147, 178, 187-188
Ramanujan, S., 122 conjecture, 122, 164 -Petersson conjecture, 164 T(n), 122. f23, 164 RamiPe • - • •icx, 144
248
Rank of elliptic curve, 44, 46, 51, 91 Reduction mod p, 43 Regular cusp, 144, 174, 182 Representation, 137, 165 theory, 215 Residue field, 43, 55-56 theorem, 15, 30, 116 Riemann sphere, 10, 21, 98, 105, 119, 120 surface, 53, 54, 143 zeta-function, 27, 51, 64, 70, 73 Root. number, 70, 84, 92, 97
Shimura, G., 177, 215 map, 200-201, 213-215 theorem, 212-213, 215' Shintani, T., 215 Smooth curve, 9, 13, 52-53 Special linear group, 98
Tangency, order of, 13 Taniyama, Y., 91, 143 -Weil conjecture, 91, 143 Tate, J., 88 -Shafarevich group, 218, 222 T(n), 122, 123, 164 Theta-functions, 72, 76-78, 84, 88-89, 96-97, 124, 176-177 Hecke transformation formula, 148 Torsion subgroup, 36-37, 43 ill Torus,1452 Tunnel!, J., 1, 200, 217, 219 theorem, 1, 3, 4, 6, 193, 217, 219-222 Twisting, 81, 127, 142, 176
IndeJ
Type of finite abelian group, 40
Upper half-plane, 99
Variety affine, 52 projective, 52
Waldspurger, J.-L., 215 theorem, 193, 200, 215, 220 Weierstrass p-function, 16-18, 21, 134 derivatives of, 17, 18, 20, 21, 22 differential equation of, 21, 22, 24 form of elliptic curve, 24, 26, 33, 120 Weight, 108, 153-154 half integer, 176 one, 165 of polynomials, 183 Weil, A.,, 44, 91, 141, 142, 143 conjectures, 53-54, 91, 122, 139, 164 parametrization, 91 -Taniyama conjecture, 91 theorem, 142-143, 215 Wiles, A., 92
Zagier, D., 93, 194, 222 Zeta-function congruence, 51, 52 Dedekind, 56, 88, 89 partial, 132 Riemann, 27, 51, 64, 70, 73