Non-vanishing of L-Functions and Applications

Modern Birkh¨auser Classics Many of the original research and survey monographs in pure and applied mathematics publish...

Author: Maruti Ram Murty; Vijaya Kumar Murty

33 downloads 836 Views 2MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

Modern Birkh¨auser Classics Many of the original research and survey monographs in pure and applied mathematics published by Birkh¨auser in recent decades have been groundbreaking and have come to be regarded as foundational to the subject. Through the MBC Series, a select number of these modern classics, entirely uncorrected, are being re-released in paperback (and as eBooks) to ensure that these treasures remain accessible to new generations of students, scholars, and researchers.

M. Ram Murty V. Kumar Murty

Non-vanishing of L-Functions and Applications

Reprint of the 1997 Edition

M. Ram Murty Department of Mathematics and Statistics Jeffery Hall, Queen’s University Kingston, ON, K7L 3N6 Canada

V. Kumar Murty Department of Mathematics University of Toronto 40, St. George Street Toronto, ON M5S 2E4 Canada

ISBN 978-3-0348-0273-4 e-ISBN 978-3-0348-0274-1 DOI 10.1007/978-3-0348-0274-1 Springer Basel Dordrecht Heidelberg London New York Library of Congress Control Number: 2011941445 Mathematics Subject Classification (2010): 11Mxx, 11M41, 11G40, 11R52, 11R42

© Springer Basel AG 1997 Reprint of the 1st edition 1997 by Birkhäuser Verlag, Switzerland Originally published as volume 157 in the Progress in Mathematics series This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in other ways, and storage in data banks. For any kind of use, permission of the copyright owner must be obtained. Printed on acid-free paper Springer Basel AG is part of Springer Science+Business Media (www.birkhauser-science.com)

Fernando Sunyer i Balaguer 1912–1967 ∗∗∗ This book has been awarded the Ferran Sunyer i Balaguer 1996 prize. Each year, in honor of the memory of Ferran Sunyer i Balaguer, the Institut d’Estudis Catalans awards an international research prize for a mathematical monograph of expository nature. The prize-winning monographs are published in this series. Details about the prize can be found at http://www.iec.es/fsbprang.htm Previous winners include – Alexander Lubotzky Discrete Groups, Expanding Graphs and Invariant Measures (vol. 125) – Klaus Schmidt Dynamical Systems of Algebraic Origin (vol. 128)

Fernando Sunyer i Balaguer 1912–1967 Born in Figueras (Gerona) with an almost fully incapacitating physical disability, Fernando Sunyer i Balaguer was conﬁned for all his life to a wheelchair he could not move himself, and was thus constantly dependent on the care of others. His father died when Don Fernando was two years old, leaving his mother, Do˜ na Angela Balaguer, alone with the heavy burden of nursing her son. They subsequently moved in with Fernando’s maternal grandmother and his cousins Maria, Angeles, and Fernando. Later, this exemplary family, which provided the environment of overﬂowing kindness in which our famous mathematician grew up, moved to Barcelona. As the physician thought it advisable to keep the sickly boy away from all sorts of possible strain, such as education and teachers, Fernando was left with the option to learn either by himself or through his mother’s lessons which, thanks to her love and understanding, were considered harmless to his health. Without a doubt, this education was strongly inﬂuenced by his living together with cousins who were to him much more than cousins for all his life. After a period of intense reading, arousing a ﬁrst interest in astronomy and physics, his passion for mathematics emerged and dominated his further life. In 1938, he communicated his ﬁrst results to Prof. J. Hadamard of the Academy of Sciences in Paris, who published one of his papers in the Academy’s “Comptes Rendus” and encouraged him to proceed in his selected course of investigation. From this moment, Fernando Sunyer i Balaguer maintained a constant interchange with the French analytical school, in particular with Mandelbrojt and his students. In the following years, his results were published regularly. The limited space here does not, unfortunately, allow for a critical analysis of his scientiﬁc achievements. In the mathematical community his work, for which he attained international recognition, is well known. Don Fernando’s physical handicap did not allow him to write down any of his papers by himself. He dictated them to his mother until her death in 1955, and when, after a period of grief and desperation, he resumed research with new vigor, his cousins took care of the writing. His working power, paired with exceptional talents, produced a number of results which were eventually recognized for their high scientiﬁc value and for which he was awarded various prizes. These honours not withstanding, it was diﬃcult for him to reach the social and professional position corresponding to his scientiﬁc achievements. At times, his economic situation was not the most comfortable either. It wasn’t until the 9th of December 1967, 18 days prior his death, that his conﬁrmation as a scientiﬁc member was made public by the Divisi´ on de Ciencias, M´edicas y de Naturaleza of the Council. Furthermore, he was elected only as “de entrada”, in contrast to class membership. Due to his physical constraints, the academic degrees for his oﬃcial studies were granted rather belatedly. By the time he was given the Bachelor degree, he had already been honoured by several universities! In 1960 he ﬁnished his Master’s

Fernando Sunyer i Balaguer 1912–1967

vii

degree and was awarded the doctorate after the requisite period of two years as a student. Although he had been a part-time employee of the Mathematical Seminar since 1948, he was not allowed to become a full member of the scientiﬁc staﬀ until 1962. This despite his actually heading the department rather than just being a staﬀ member. His own papers regularly appeared in the journals of the Barcelona Seminar, Collectanea Mathematica, to which he was also an eminent reviewer and advisor. On several occasions, he was consulted by the Proceedings of the American Society of Mathematics as an advisor. He always participated in and supported guest lectures in Barcelona, many of them having been prepared or promoted by him. On the occasion of a conference in 1966, H. Mascart of Toulouse publicly pronounced his feeling of beeing honoured by the presence of M. Sunyer Balaguer, “the ﬁrst, by far, of Spanish mathematicians”. At all times, Sunyer Balaguer felt a strong attachment to the scientiﬁc activities of his country and modestly accepted the limitations resulting from his attitude, resisting several calls from abroad, in particular from France and some institutions in the USA. In 1963 he was contracted by the US Navy, and in the following years he earned much respect for the results of his investigations. “His value to the prestige of the Spanish scientiﬁc community was outstanding and his work in mathematics of a steady excellence that makes his loss diﬃcult to accept” (letter of condolence from T.B. Owen, Rear Admiral of the US Navy). Twice, Sunyer Balaguer was approached by young foreign students who wanted to write their thesis under his supervision, but he had to decline because he was unable to raise the necessary scholarship money. Many times he reviewed doctoral theses for Indian universities, on one occasion as the president of a distinguished international board. The circumstances under which Sunyer attained his scientiﬁc achievements, also testify to his remarkable human qualities. Indeed, his manner was friendly and his way of conversation reﬂected his gift for friendship as well as enjoyment of life and work which went far beyond a mere acceptance of the situation into which he had been born. His opinions were as ﬁrm as they were cautious, and at the same time he had a deep respect for the opinion and work of others. Though modest by nature, he achieved due credit for his work, but his petitions were free of any trace of exaggeration or undue self-importance. The most surprising of his qualities was, above all, his absolute lack of preoccupation with his physical condition, which can largely be ascribed to the sensible education given by his mother and can be seen as an indication of the integration of the disabled into our society. On December 27, 1967, still fully active, Ferran Sunyer Balaguer unexpectedly passed away. The memory of his remarkable personality is a constant source of stimulation for our own eﬀorts. Translated from Juan Aug´e: Fernando Sunyer Balaguer. Gazeta Matematica, 1.a Serie – Tomo XX – Nums. 3 y 4, 1968, where a complete bibliography can be found.

Satyam Jnanam Anantam Brahma

Table of Contents

Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

xi

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

1

Chapter 1 The Prime Number Theorem and Generalizations § 1 The Prime Number Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 Primes in Arithmetic Progression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 3 Dedekind’s zeta function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 Hecke’s L -functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

5 15 19 21

Chapter 2 Artin L-Functions § 1 Group-theoretic background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 Deﬁnition and basic properties of Artin L-functions . . . . . . . . . . . . . . . . . . § 3 The Aramata-Brauer Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 Dedekind’s conjecture in the non-Galois case . . . . . . . . . . . . . . . . . . . . . . . . § 5 Zeros and poles of Artin L-functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 6 Low order zeros of Dedekind zeta functions . . . . . . . . . . . . . . . . . . . . . . . . . . § 7 Chebotarev density theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 8 Consequences of Artin’s conjecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 9 The least prime in a conjugacy class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

25 27 30 32 35 37 41 46 52

Chapter 3 Equidistribution and L-Functions § 1 Compact groups and Haar measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 Weyl’s criterion for equidistribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 3 L-functions on G . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 Deligne’s Prime Number Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

65 66 67 68

ix

x

Table of Contents

Chapter 4 Modular Forms and Dirichlet Series § 1 SL2 (Z) and some of its subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 The upper half - plane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 3 Modular forms and cusp forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 L-functions and Hecke’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 5 Hecke operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 6 Oldforms and newforms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 7 The Sato-Tate conjecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 8 Oscillations of Fourier coeﬃcients of newforms . . . . . . . . . . . . . . . . . . . . . . . § 9 Rankin’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

75 76 77 81 82 83 83 84 90

Chapter 5 Dirichlet L-functions § 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 Polya-Vinogradov estimate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 3 Jutila’s character sum estimate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 Average value of L( 12 , χD ) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 5 Non-vanishing for a positive proportion of characters, I . . . . . . . . . . . . . . § 6 Non-vanishing for a positive proportion, II . . . . . . . . . . . . . . . . . . . . . . . . . . . § 7 A conditional improvement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

93 95 97 104 110 119 128

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions § 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 2 The integrated Polya-Vinogradov estimate . . . . . . . . . . . . . . . . . . . . . . . . . . . § 3 The main terms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 4 Estimates for real character sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 5 Estimates for some weighted sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 6 The statements A± (α) and C ± (α) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . § 7 Proof of main result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

133 141 142 152 158 160 170

Chapter 7 Selberg’s Conjectures § 1 Selberg’s class of Dirichlet series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177 § 2 Basic consequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180 § 3 Artin’s conjecture and Selberg’s conjectures . . . . . . . . . . . . . . . . . . . . . . . . . 181 Chapter 8 Suggestions for Further Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187 Name Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192 Subject Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194

Preface

This monograph brings together a collection of results on the non-vanishing of Lfunctions. The presentation, though based largely on the original papers, is suitable for independent study. A number of exercises have also been provided to aid in this endeavour. The exercises are of varying diﬃculty and those which require more eﬀort have been marked with an asterisk. The authors would like to thank the Institut d’Estudis Catalans for their encouragement of this work through the Ferran Sunyer i Balaguer Prize. We would also like to thank the Institute for Advanced Study, Princeton for the excellent conditions which made this work possible, as well as NSERC, NSF and FCAR for funding. Princeton August, 1996

M. Ram Murty V. Kumar Murty

xi

Introduction

Since the time of Dirichlet and Riemann, the analytic properties of L-functions have been used to establish theorems of a purely arithmetic nature. The distribution of prime numbers in arithmetic progressions is intimately connected with non-vanishing properties of various L-functions. With the subsequent advent of the Tauberian theory as developed by Wiener and Ikehara, these arithmetical theorems have been shown to be equivalent to the non-vanishing of these L-functions on the line Re(s) = 1. In the 1950’s, a new theme was introduced by Birch and Swinnerton-Dyer. Given an elliptic curve E over a number ﬁeld K of ﬁnite degree over Q, they associated an L-function to E and conjectured that this L-function extends to an entire function and has a zero at s = 1 of order equal to the Z-rank of the group of K-rational points of E. In particular, the L-function vanishes at s = 1 if and only if E has inﬁnitely many K-rational points. The analytic continuation of the L-series associated to E has now been established in the work of Wiles and his school for all elliptic curves which have semistable reduction at 3 and 5. So it now makes sense to talk about the values of the L-function of such curves for any point of the complex plane. In recent work of V. Kolyvagin, it was necessary to have a quadratic twist of a given elliptic curve whose L-function has a simple zero at s = 1. This monograph is concerned with the non-vanishing of a general L-function, with special emphasis on classical Dirichlet L-functions, Artin L-functions and L-functions attached to modular forms. The ﬁrst technique to prove a theorem on non-vanishing arose in the work of Hadamard and de la Vall´ee Poussin. It is based on the simple trigonometric inequality 3 + 4 cos θ + cos 2θ ≥ 0. It is remarkable that such a technique is capable of vast generalization. Theorem 1.2 of Chapter 1 is one such generalization. Theorem 4.1 of Chapter 3 is another. Both of these theorems have immediate applications to general questions concerning equidistribution.

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_1, © Springer Basel AG 1997

1

2

Introduction

The prime number theorem is a special case of the more general Chebotarev density theorem. In Chapters 1 and 2, we trace the development of these ideas and discuss in some detail analytic properties of Artin L-functions. The eﬀective Chebotarev density theorem, which plays such an essential role in many questions of an arithmetic and diophantine nature, is also described. In Chapter 3, we discuss a general formalism due to Serre. It becomes clear that questions of uniform distribution reduce to questions about analytic continuation of L-functions through the Tauberian theorems and an appropriate use of trigonometric inequalities. The subject of modular forms and their associated L-functions has its origin in the works of Ramanujan and Hecke. After reviewing quickly some basic notions, we discuss in Chapter 4 the theme of non-vanishing of modular L-functions and their symmetric power analogues. This is done in the context of the Sato-Tate conjecture. We also discuss the application of these ideas to questions of oscillation of Fourier coeﬃcients of cusp forms. It is a ‘folklore’ conjecture that the classical Dirichlet L-function L(s, χ), associated to a Dirichlet character χ(mod q) does not vanish at the central critical point s = 1/2. As of 1995, this is still unproved. In Chapter 5, we discuss this question from a variety of methods. First, one can consider averages such as 1 1 L( , χ) and |L( , χ)|2 . 2 2 χ mod q

χ mod q

By developing asymptotic formulas for these averages, one can obtain the existence of many characters χ for which L( 12 , χ) = 0. To get stronger results, one considers not the above averages, but sums which are weighted with an auxilliary function. In work of Balasubramanian and K. Murty, it is shown that for each suﬃciently large prime q, the number of Dirichlet characters χ(mod q) such that L(1/2, χ) = 0 is at least ≥ (.04)φ(q). This is the content of Theorem 5.1 of Chapter 5. The methods are involved and based on the study of the averages 1 1 L( , χ)Mz ( , χ) 2 2 χ mod q

where Mz (s, χ) is a Dirichlet polynomial which ‘molliﬁes’ the L function. The method of averages to prove non-vanishing of L-functions is developed in Chapter 5 in the context of Dirichlet L-functions, and in Chapter 6 in the context of L-functions of modular forms. The main result of Chapter 6 shows that for a holomorphic modular form f which is a newform of weight 2, there is a quadratic character χ such that the twisted L-function L(s, f, χ) does not vanish at the central critical point. The method of averages can be summarized by considering the following general problem. Suppose we are given a Dirichlet series ∞ an f (s) = ns n=1

Introduction

3

which converges absolutely in some half plane and extends to an analytic function in the region Re(s) > c. Suppose further that all the twists f (s, χ) =

∞ an χ(n) ns n=1

by Dirichlet characters χ(mod q) have the same property. Given s0 ∈ C such that Re(s0 ) > c, does there exist χ(mod q) such that f (s0 , χ) = 0? To answer this, it is natural to study f (s0 , χ) χ mod q

and determine its asymptotic behaviour. More generally, one can study

cχ f (s0 , χ).

q≤Q χ mod q

Such a study was necessary in the recent work of V. Kolyvagin on the Birch and Swinnerton-Dyer conjecture. In his situation f (s) was the L-function of a modular elliptic curve, s0 = 1 and cχ = 0 unless χ is of order 2. We derive asymptotic formulas for such sums in Chapter 6. This technique has been ampliﬁed and expanded in many works such as that of Iwaniec, Luo-Rudnick-Sarnak, Barthel-Ramakrishnan, K. Murty-Stefanicki, and Y. Zhang. There are at least two more important techniques of non-vanishing of Lfunctions that are not discussed in this book. One is the method of Rohrlich which can be termed ‘Galois theoretic’. The other is the ‘automorphic method’ of Bump, Friedberg and Hoﬀstein. The important topic of general automorphic L-functions is not touched in this monograph. In this connection, we refer the reader to the monograph of Gelbart and Shahidi. Finally, in Chapter 7, we discuss Selberg’s conjectures concerning Dirichlet series with Euler products and functional equations. These conjectures imply that no element of the Selberg class vanishes on the line Re(s) = 1. Most likely, the Selberg class coincides with the class of automorphic L-functions. An intriguing pathway of research is to compare and contrast these two points of view.

Chapter 1 The Prime Number Theorem and Generalizations

§1 The Prime Number Theorem It was a century ago that Jacques Hadamard and Charles de la Vall´ee Poussin proved (independently) the celebrated prime number theorem. If π(x) denotes the number of primes up to x, the theorem states that π(x) = 1. x→∞ x/ log x lim

Their method had its origins in a fundamental paper of Riemann written by him in 1860. That paper outlines a ‘program’ for proving the prime number theorem. It begins by introducing the ζ function which is deﬁned for Re(s) > 1 as ζ(s) =

∞ 1 . s n n=1

Riemann then proceeds to show that (s − 1)ζ(s) extends to an entire function and satisﬁes a functional equation s 1−s )ζ(1 − s). π −s/2 Γ( )ζ(s) = π −(1−s)/2 Γ( 2 2 In addition, ζ(s) can be written as an inﬁnite product over the prime numbers p: −1 1 ζ(s) = 1− s p p

Re(s) > 1.

(1)

This equality is an analytic reformulation of the fact that every natural number is a product of prime numbers in an (essentially) unique way. Because the product

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_2, © Springer Basel AG 1997

5

6

Chapter 1 The Prime Number Theorem and Generalizations

is absolutely convergent in Re(s) > 1, equation (1) also reveals that ζ(s) does not vanish in this half-plane. Earlier, Euler1) had noticed that unique factorization of the integers could be written in this way as well as the functional equation for the zeta function, but he treated ζ(s) as a function of a real variable. Riemann emphasized that many intricate questions about the distribution of prime numbers can, by virtue of the above identity, be translated into complex analytic questions involving the ζ function. It took several decades to vindicate Riemann’s approach and put it on rigorous footing. Many new ideas of complex analysis were discovered and developed as a consequence. By the time Hadamard and de la Vall´ee Poussin completed their proof, there was a general method in place for tackling all such questions. At the heart of their proof is the fact that the zeta function does not vanish on the line Re(s) = 1. As it later transpired, this non-vanishing theorem is equivalent to the prime number theorem. It is interesting to note that Hadamard, in his characteristic humility, writes, “Stieltjes avait d´emontr´e que tous les z´eros imaginaires de ζ(s) sont (conform´ement aux pr´evisions de Riemann) de la forme 1/2+it, t ´etant r´eel; mais sa d´emonstration n’a jamais ´et´e publi´ee. Je me propose simplement de fair voir que ζ(s) ne saurait avoir de zero dont la partie r´eele soit ´egale `a 1.” (Oeuvres, p. 183). We now understand this in a better light. The dominant theme that arises from the papers of Riemann, Hadamard, and de la Vall´ee Poussin is the following. Suppose we are given a sequence of complex numbers an and we would like to know the behaviour of an . n≤x

The idea is to study the associated Dirichlet series f (s) =

∞ an ns n=1

as a function of a complex variable and infer from the analytic properties the desired behaviour of the summatory function. Indeed, suppose that all the an ’s are bounded by some constant C. Then the associated Dirichlet series deﬁnes an analytic function for Re(s) > 1. Suppose further that the series can be continued analytically to Re(s) > 1 − δ where δ > 0. Beginning with the fundamental line integral c+i∞ s 1 if x > 1 1 x ds = 1/2 if x = 1 2πi c−i∞ s 0 if x < 1 1)

Equation (1) is referred to as an Euler product. More general Euler products will be introduced later in the chapter.

§1 The Prime Number Theorem

7

for any c > 0, we easily see by term by term integration, c+i∞ 1 xs an = f (s) ds 2πi c−i∞ s n≤x

when x is not an integer. Here c is chosen so that f (s) converges absolutely on Re(s) = c. We can now invoke methods of contour integration and attempt to infer something about the behaviour of the sum in question. For instance, in the case under discussion, if we assume in addition that f (s) = O(logA (|s| + 2)), for some constant A > 0, then it is it is not diﬃcult to deduce that an = O(xθ ) n≤x

for any θ > 1 − δ. Over the subsequent decades, the techniques and methods have been streamlined and made elegant and eﬃcient, notably through the work of Hardy, Littlewood, Ikehara and Wiener. The following theorem represents the quintessence of their work and goes under the parlance of the (Wiener-Ikehara) Tauberian theorem. ∞ Theorem 1.1. Let f (s) = n=1 an /ns be a Dirichlet series. Suppose there exists ∞ a Dirichlet series F (s) = n=1 bn /ns with positive real coeﬃcients such that (a) |an | ≤ bn for all n; (b) the series F (s) converges for Re(s) > 1; (c) the function F (s) (respectively f (s)) can be extended to a meromorphic function in the region Re(s) ≥ 1 having no poles except (respectively except possibly) for a simple pole at s = 1 with residue R ≥ 0 (respectively r). Then A(x) := an = rx + o(x) n≤x

as x→∞. In particular, if f (s) is holomorphic at s = 1, then r = 0 and A(x) = o(x) as x→∞. Remark. Note that we can equally deduce that bn = Rx + o(x) n≤x

as x→∞. We relegate the proof of this theorem to later in this chapter. For the moment, we will quickly proceed to deduce the prime number theorem from the non-vanishing of ζ(s) on Re(s) = 1.

8

Chapter 1 The Prime Number Theorem and Generalizations

Let us begin by observing that for any Dirichlet series, ∞ ∞ an A(n) − A(n − 1) = s n ns n=1 n=1 ∞ 1 1 = A(n) − ns (n + 1)s n=1 n+1 ∞ dx =s A(n) s+1 x n n=1 ∞ A(x) =s dx. xs+1 1

In particular,

∞

ζ(s) = s 1

[x] s dx = −s xs+1 s−1

1

∞

{x} dx xs+1

(2)

where [x] denote the greatest integer less than or equal to x and {x} = x − [x]. Since the fractional part of x is less than 1, the integral

∞ 1

{x} dx xs+1

converges for Re(s) > 0. Hence, we obtain an analytic continuation of (s − 1)ζ(s) in this half-plane. Furthermore, ζ(s) has a simple pole at s = 1 with residue 1. By taking logarithms of both sides in equation (1) and then diﬀerentiating, we observe that ∞ ζ Λ(n) − (s) = ζ ns n=1

where Λ(n) =

log p if n is a power of the prime p 0 otherwise

denotes the von Mangoldt function (after the mathematician who introduced the notation). From the second equation in (2), we see that −ζ (s)/ζ(s) has a simple pole at s = 1 with residue 1. If in addition we knew that ζ(s) does not vanish on Re(s) = 1, then −ζ (s)/ζ(s) is represented in the region Re(s) > 1, by a Dirichlet series with non-negative coeﬃcients and has a meromorphic continuation to Re(s) ≥ 1 with only a simple pole at s = 1 with residue 1. Applying the WienerIkehara Tauberian theorem, we deduce that n≤x

Λ(n) = x + o(x)

§1 The Prime Number Theorem

9

as x→∞. It is now an easy exercise to deduce the prime number theorem from this asymptotic formula (see exercise 1). Hadamard’s proof that ζ(1 + it) = 0 for t ∈ R, is exceedingly simple and can be explained intuitively as follows (see [Ka]). Let us write, log ζ(s) =

∞ an ns n=1

and note that an ≥ 0. Since ζ(s) has a simple pole at s = 1, log ζ(1 + ) = log 1/ + O(1) as →0+ . If ζ(s) has a zero of order m (say) at s = 1 + it, then log ζ(1 + it + ) = −m log 1/ + O(1). Therefore, using an ≥ 0, we deduce that nit −m for most n’s. So m = 1 and n2it 1 for most n’s so that log ζ(1 + 2it + ) = log 1/ which is not possible since 1 + 2it is a regular point of ζ(s) for t = 0. This proves the non-vanishing. Though this is an intuitive proof, it captures the essence of the argument. The traditional proof begins by considering for σ > 1 the combination2) − Re (3 log ζ(σ) + 4 log ζ(σ + it) + log ζ(σ + 2it)) ∞ Λ(n) = (3 + 4 cos(t log n) + cos(2t log n)) . σ log n n n=1

(3)

Since 3 + 4 cos θ + cos 2θ = 2(1 + cos θ)2 , we see that the right side of equation (3) is non-negative. Hence, |ζ(σ)3 ζ(σ + it)4 ζ(σ + 2it)| ≥ 1 for t ∈ R and σ > 1. If ζ(1+it) = 0 for t = 0, then as σ→1+ in the above inequality, the left hand side tends to zero which is a contradiction. This completes the proof that ζ(s) does not vanish on Re(s) = 1. We now present a generalization of this non-vanishing result (see [VKM, p. 199]). 2)

The traditional proof will use the traditional notation, a curious combination of Greek and Roman letters, in writing s = σ + it with √ σ denoting the real part of s and t the imaginary part. i will of course denote −1.

10

Chapter 1 The Prime Number Theorem and Generalizations

Theorem 1.2. Let f (s) be a function satisfying the following hypotheses: (a) f is holomorphic in σ > 1 and non-zero there; (b) on the line σ = 1, f is holomorphic except for a pole of order e ≥ 0 at s = 1; ∞ s (c) log f (s) can be written as a Dirichlet series n=1 bn /n with bn ≥ 0, for σ > 1. If f has a zero on the line σ = 1, then the order of the zero is bounded by e/2. (Here we are writing s = σ + it.) Proof. Suppose f has a zero at 1 + it0 of order k > e/2. Then, e ≤ 2k − 1. Consider the function g(s) = f (s)2k+1

2k

f (s + ijt0 )2(2k+1−j)

j=1

= f (s)2k+1 f (s + it0 )4k f (s + 2it0 )4k−2 · · · f (s + 2kit0 )2 . Then, g is holomorphic for σ > 1 and vanishes to at least ﬁrst order at s = 1 as 4k2 − (2k + 1)e ≥ 4k2 − (2k + 1)(2k − 1) = 1. However, for σ > 1,

log g(s) =

∞

⎛ bn n−s ⎝2k + 1 +

n=1

2k

⎞ 2(2k + 1 − j)n−ijt0 ⎠ .

j=1

Let φn = t0 log n. Then, for σ > 1, Re (log g(σ)) = log |g(σ)| =

∞

⎛ bn n−σ ⎝2k + 1 +

n=1

2k

⎞ 2(2k + 1 − j) cos(jφn )⎠ .

j=1

Now we have the identity

F (k, θ) := 2k + 1 +

2k

⎛ 2(2k + 1 − j) cos(jθ) = ⎝1 + 2

j=1

k j=1

(see exercise 2). Hence, log |g(σ)| ≥ 0 for σ > 1. That is |g(σ)| ≥ 1. This contradicts g having a zero at σ = 1.

⎞2 cos jθ⎠ ≥ 0

§1 The Prime Number Theorem

11

By applying the Tauberian theorem to the function −ζ (s)/ζ(s), which by the non-vanishing theorem satisﬁes the conditions of Theorem 1.1, we deduce the prime number theorem in the form ψ(x) := Λ(n) ∼ x n≤x

as x→∞. The famous Riemann hypothesis asserts that ζ(s) = 0 for Re(s) > 1/2. If we assume this hypothesis, one can prove by methods of contour integration the formula ψ(x) = x + O(x1/2 log2 x). After this brief discussion, we are now ready to prove Theorem 1.1. Proof of Theorem 1.1. If the an are real, it suﬃces to prove the theorem for F (s), for then one can apply such a result to F (s) − f (s) which is a Dirichlet series with non-negative coeﬃcients in the region Re(s) > 1. If not all the an are real, then set ∗

f (s) =

∞

an /ns

n=1

and observe that

1 i f = (f + f ∗ ) + 2 2

f − f∗ i

.

Then it suﬃces to prove the theorem for F (s). We begin by reviewing certain facts from Fourier analysis (see [Rudin]). Let S = {f ∈ C ∞ (R) | for all n, m ∈ Z+ , lim xn |x|→∞

dm f (x) = 0}. dxm

For functions f ∈ S, we have the Fourier inversion: ∞ 1 fˆ(x) = √ f (t) e−itx dt 2π −∞ ∞ 1 f (x) = √ fˆ(t) eitx dt. 2π −∞ Hence, 1 fˆ(x − y) = √ 2π

∞

f (t) eity e−itx dt

−∞

ˆ − y) and f (t) eity are transforms of each other. The formula so that f(x ∞ ∞ f (x) g(x) dx = fˆ(t) gˆ(t) dt −∞

−∞

12

Chapter 1 The Prime Number Theorem and Generalizations

is known as Parseval’s formula. The Riemann-Lebesgue lemma asserts that ∞ lim f (t) eiλt dt = 0, λ→∞

−∞

for absolutely integrable functions. The F´ejer Kernel Kλ (x) =

sin2 λx λx2

has Fourier transform √ 2 2π 1 − 0

ˆ λ (x) = K

|x| 2λ

if |x| ≤ 2λ otherwise.

∞ s Now let F (s) = ≥ 0 be as above and deﬁne an analytic n=1 bn /n , bn function for Re(s) > 1. Put B(x) = n≤x bn . Replacing bn by bn /R, we can suppose without loss of generality that R = 1. Then, by partial summation, we see that for Re(s) > 1, ∞ B(x) F (s) = s dx. xs+1 1 Set x = eu . Then

F (s) = s

Note that

∞

∞

B(eu ) e−us du.

0

e−u(s−1) du =

0

1 . s−1

Hence, putting s = 1 + δ + it, δ > 0, we get ∞ F (1 + δ + it) 1 − = (B(eu ) e−u − 1) e−uδ e−iut du. 1 + δ + it s−1 0 Set g(u) = B(eu )e−u , and h(t) =

hδ (t) =

F (1 + δ + it) 1 − , 1 + δ + it s−1

F (1 + it) 1 − 1 + it s−1

(s = 1 + it),

which is regular for t in R. Our goal is to prove√that g(u) → 1 as u → ∞. The above formula says that the Fourier transform of 2π(g(u) − 1)e−uδ is hδ (t). Applying Parseval’s formula, we deduce ∞ ∞ ˆ λ (t) dt. (g(u) − 1)e−uδ Kλ (u) du = hδ (t)K −∞

−∞

§1 The Prime Number Theorem

13

But note that we also have by Parseval’s formula

∞

−∞

(g(u) − 1)e−uδ Kλ (u − v) du =

∞

−∞

ˆ λ (t)eitv dt. hδ (t)K

ˆ λ has compact support, the limit as δ → 0 of the right hand side exists. Since K The same is true of the left hand side. Hence ∞ ∞ ˆ λ (t)eitv dt. (g(u) − 1))Kλ (u − v) du = h(t)K −∞

−∞

By the Riemann-Lebesgue lemma, we deduce

∞

lim

v→∞

−∞

Thus,

(g(u) − 1))Kλ (u − v) du = 0.

∞

lim

v→∞

−∞

g(u)Kλ (u − v) du = π.

Set −λ(u − v) = α. Then u = v −

vλ

lim

v→∞

−∞

α λ

and so as g is bounded,

α sin2 α g v− dα = π. λ α2

We can now prove the theorem. Since B(x) is monotone increasing, we see that

g(u2 ) ≥ g(u1 )eu1 −u2 ,

√ Thus, for |α| ≤ λ, we have

u1 ≤ u 2 .

α 1 1 − √1 + α − √2 g v− ≥g v− √ e λ λ ≥g v− √ e λ. λ λ λ Since lim sup

v→∞

we deduce,

√ λ

√ − λ

α sin2 α g v− dα ≤ π, λ α2

1 π − √2 lim sup g v − √ e λ ≤ √ . 2α λ v→∞ sin λ √ dα 2 − λ

Since v is arbitrary, changing v to v +

√1 , λ

we get

lim sup g(v) ≤ 1.

v→∞

α

14

Chapter 1 The Prime Number Theorem and Generalizations

The lower bound is obtained similarly: √ λ

lim inf

v→∞

Since

√ − λ

α sin2 α 1 g v− dα ≥ π + O( √ ). λ α2 λ

α 1 1 √2 v+ √1 −v+ α λ λ g v− ≤g v+ √ e ≤g v+√ e λ, λ λ λ

we obtain √λ 1 sin2 α 1 √2 lim inf g v + √ e λ dα ≥ π + O( √ ), √ 2 v→∞ λ λ − λ α so that lim inf g(v) ≥ 1,

v→∞

as desired. Together with lim sup g(v) ≤ 1, v→∞

we deduce limv→∞ g(v) = 1 as needed. This completes the proof of the theorem. By the same method, one can deduce the following variation: Theorem 1.3. Suppose that the function F (s) has the following properties: (a) there exists β > 0 such that for Re(s) > β,

∞

F (s) = s

B(u)e−us du

0

where B(u) is a positive monotone increasing function; (b) there exist constants α > −1, c > 0 such that F (s) =

H(s) and H(β) = cΓ(α + 1) (s − β)α+1

where H is holomorphic in Re(s) ≥ β. Then B(u) = (c + o(1)) uα eβu as u → ∞.

§2 Primes in Arithmetic Progression

Corollary 1.4.

15

Suppose that for Re(s) > β, F (s) =

∞

bn /ns

n=1

with bn ≥ 0, and that for Re(s) ≥ β, F (s) admits a meromorphic continuation with at most a pole of order α + 1 (α > −1) at s = β. Then bn = (c + o(1))xβ logα x n≤x

as x → ∞. This formulation is useful in most applications. There are other variations (e.g. see Ellison [p. 64–65]) where upper and lower bounds for n≤x bn can be obtained from the knowledge of analytic continuation for Re(s) ≥ β, |Im(s)| ≤ T . We can apply this to the Riemann zeta function. Indeed, ∞ ∞ [x] s {x} ζ(s) = s dx = − s dx s+1 s+1 x s − 1 x 1 1 gives a meromorphic continuation of ζ(s) for Re(s) > 0 with only a simple pole at s = 1 and residue 1. If dk (n) denotes the number of ways of writing n as a product of k positive numbers, it is easily seen that ∞ dk (n) ζ (s) = . ns n=1 k

Hence, by Corollary 1.4 we deduce that

dk (n) = (1 + o(1))

n≤x

x logk−1 x Γ(k)

as x → ∞.

§2 Primes in Arithmetic Progression In 1837, Dirichlet proved the inﬁnitude of primes in a given arithmetic progression. Historically, this work preceded Riemann’s paper on the zeta function. Dirichlet proved his theorem by introducing the L-functions L(s, χ) which now bear his name. However, he treated them as functions of a real variable only and therefore obtained results of the form 1 lim = +∞, + ps s→1 p≡a(mod q)

16

Chapter 1 The Prime Number Theorem and Generalizations

where the summation is over primes p ≡ a(mod q). He ﬁrst proved his theorem for prime modulus q and then a year later, treated the general case. In the course of this discovery, he contributed two fundamental ideas: (a) the beginnings of the theory of group characters and (b) the celebrated class number formula. The ﬁrst was essential in ‘sifting’ the primes in a given arithmetic progression. The second was used to establish the non-vanishing of certain of his L-functions needed in his proof. If (Z/qZ)∗ is the multiplicative group of coprime residue classes, let χ : (Z/qZ)∗ →C∗ be a homomorphism. (These are called Dirichlet characters.) One now deﬁnes for any n ∈ Z, χ(n) = χ(n mod q) if (n, q) = 1 0 otherwise and (by abuse of language) we also call these Dirichlet characters. There are φ(q) such characters where φ is Euler’s function and we denote by χ0 the trivial character. Analogous to the Riemann zeta function, we deﬁne L(s, χ) =

∞ χ(n) . ns n=1

Since |χ(n)| ≤ 1 for all values of n, the series converges absolutely for Re(s) > 1. If we write S(x) = χ(n), n≤x

then as in section one, we can write L(s, χ) = s 1

If χ = χ0 , then

∞

S(x) dx. xs+1

(4)

χ(n) = 0

n mod q

so that we easily see |S(x)| ≤ q upon partitioning the interval [1, x] into subintervals equal to or at most of length q. Hence, (4) converges for Re(s) > 0 and this gives us an analytic continuation for L(s, χ) when χ = χ0 , in this half-plane. As in the case of the Riemann zeta function, the multiplicativity of χ(n) and the unique factorization of the natural numbers combine to give the Euler product: L(s, χ) =

−1 χ(p) . 1− s p p

§2 Primes in Arithmetic Progression

17

Notice again, that this product shows L(s, χ) = 0 for Re(s) > 1. Taking logarithms and using the orthogonality relations of the characters, we obtain

1 1 = χ(a) ¯ log L(s, χ) ks kp φ(q)

pk ≡a mod q

χ mod q

valid for Re(s) > 1. Dirichlet noticed that ⎡ ⎤ 1 L(s, χ0 ) = ⎣ 1 − s ⎦ ζ(s) p p|q

and hence lim log L(s, χ0 ) = +∞

s→1+

by the divergence of the harmonic series. For χ = χ0 , we know L(s, χ) is regular for Re(s) > 0. If L(1, χ) = 0 for all χ = χ0 , then we deduce that

lim s→1+

pk ≡a mod q

from which we infer

lim s→1+

p≡a mod q

1 = +∞ kpks

1 = +∞, ps

since the contribution for k ≥ 2 in the penultimate sum remains bounded as s→1+ . We will now establish the non-vanishing of L(s, χ) on Re(s) = 1 by appealing to Theorem 1.2. But before we do, we need the following lemma, which is a variation on a well-known result of Landau (see exercise 5). ∞ s Lemma 2.1. Let g(s) = n=1 an /n be a Dirichlet series where a1 = 1 and an ≥ 0. Suppose that the series is absolutely convergent in Re(s) > 1. Suppose further that g(s) admits an analytic continuation to Re(s) ≥ 1/2. Then g(1/2) = 0. Proof. We can expand g(s) as a Laurent series about the point s = 2 in a disc |s − 2| < 3/2: ∞ g (m) (2) g(s) = (s − 2)m . m! m=0 Computing g (m) (2) explicitly from the Dirichlet series g

(m)

m

(2) = (−1)

∞ n=1

an (log n)m n−2 = (−1)m bm

(say),

18

Chapter 1 The Prime Number Theorem and Generalizations

where bm ≥ 0. Hence, g(s) =

∞ bm (2 − s)m m! m=0

valid for |2 − s| < 3/2. In particular, if 1/2 < s < 2, then g(s) ≥ g(2) ≥ 1 since all the terms are non-negative. Taking lims→1/2+ g(s) gives g(1/2) ≥ 1. This establishes the result. Theorem 2.2.

L(1 + it, χ) = 0 for all t ∈ R and all χ = χ0 .

Proof. We will apply Theorem 1.2 to the function f (s) =

L(s, χ).

χ

Observe ﬁrst that log f (s) =

χ

log L(s, χ) =

pk ≡1 mod q

1 kpks

is a Dirichlet series with non-negative coeﬃcients absolutely convergent in Re(s) > 1. Since L(s, χ0 ) has a simple pole at s = 1 and L(s, χ) is regular in Re(s) > 0 for χ = χ0 , we ﬁnd that f (s) satisﬁes (a), (b), (c) of the theorem with e ≤ 1. Hence any zero of f (s) is of order bounded by 1/2. Therefore L(1+it, χ) = 0 for t ∈ R, t = 0. We must still analyse the possibility L(1, χ) = 0. If L(1, χ) = 0 for some non-real character χ, then L(1, χ) = 0 and f (s) would have a zero at s = 1 contrary to what has already been established. So we need to look at the possibility L(1, χ) = 0 for a real-valued character. If this were the case, we consider g(s) =

−1 L(s, χ0 )L(s, χ) χ0 (p) χ(p) = 1+ 1 − L(2s, χ0 ) ps ps p

which is a Dirichlet series with non-negative coeﬃcients because the Euler product is supported only at primes where χ(p) = +1. Note that if L(1, χ) = 0, then this cancels the pole of L(s, χ0 ) at s = 1 so that g(s) is regular there. In fact, g(s) is regular for Re(s) ≥ 1/2 since the numerator is regular for Re(s) ≥ 0 and the denominator does not vanish for Re(s) ≥ 1/2 essentially by Theorem 1.1. Moreover, since L(2s, χ0 ) has a simple pole at s = 1/2, g(s) has a zero at s = 1/2 which contradicts Lemma 2.1. This completes the proof.

§3 Dedekind’s zeta function

19

Applying the Tauberian theorem to L − χ(a) (s, χ) L χ mod q

we immediately deduce that ψ(x, q, a) :=

Λ(n) ∼

n≤x n≡a mod q

x , φ(q)

which is the prime number theorem for arithmetic progressions.

§3 Dedekind’s zeta function Let K be an algebraic number ﬁeld of ﬁnite degree n over Q. The zeta function of K is deﬁned as ζK (s) = (Na)−s a

where the sum is over all integral ideals of OK , the ring of integers of K. Because of Dedekind’s theorem that every ideal of OK can be factored as a product of prime ideals uniquely, we have the Euler product formula: −1 1 ζK (s) = 1− , Nps p where the product is over all prime ideals of K. Notice that this product shows ζK (s) = 0 for Re(s) > 1. Hecke proved in 1917, that (s − 1)ζK (s) extends to an entire function such that lim (s − 1)ζK (s) = κ =

s→1+

2r1 (2π)r2 hR w |dK |

where r1 is the number of real conjugate ﬁelds, 2r2 is the number of complex conjugate ﬁelds, h is the class number, R is the regulator, w is the number of roots of unity and dK is the discriminant of K. Moreover, ζK (s) satisﬁes a functional equation ξK (s) = ξK (1 − s) where

s |dK | ξK (s) = Γ(s/2)r1 Γ(s)r2 ζK (s). 2r2 π n/2

The functional equation allows one to write the residue in a slightly elegant form ζK (s) hR =− r s w s→0 lim

where r = r1 + r2 − 1. By the Tauberian Theorem 1.1, we immediately deduce

20

Chapter 1 The Prime Number Theorem and Generalizations

Theorem 3.1.

Let am be the number of ideals of K of norm m. Then, am ∼ κx m≤x

as x→∞. Proof. Since ζK (s) has a simple pole at s = 1, the result follows from Theorem 1.1. As before, we can consider −

Λ(a) ζK (s) = ζK Nas a

where

log Np if a = pm for some prime ideal p 0 otherwise, is the number ﬁeld analogue of the von Mangoldt function. It is now clear that we can apply the Tauberian theorem to −ζK (s)/ζK (s) to deduce the prime ideal theorem: Λ(a) =

Theorem 3.2. ≤ x. Then

Let πK (x) denote the number of prime ideals of K whose norm is πK (x) ∼

x , log x

as x→∞. Proof. For Re(s) > 1, log ζK (s) is a Dirichlet series with non-negative coeﬃcients by virtue of the Euler product. Moreover, ζK (s) is holomorphic on Re(s) = 1 except at s = 1 where it has a simple pole. Hence, we can apply Theorem 1.2 to deduce ζK (1 + it) = 0 for all t ∈ R. Applying Theorem 1.1 we deduce Λ(a) ∼ x Na≤x

as x→∞. We now deduce the result by partial summation (see exercise 1). Let us consider the special case K = Q(i). If r(n) denotes the number of ways of writing n as a sum of two integer squares, then Theorem 3.1 gives r(n) ∼ πx n≤x

as x→∞, since Z[i] is a unique factorization domain and thus has class number 1. It is also not diﬃcult to see that a rational prime p is the norm of a prime ideal Z[i] if and only if p can be written as the sum of two integral squares. If p can be so written, and p is odd, there are exactly two prime ideals of Z[i] of norm p. Thus, Theorem 3.2 in this case proves that the number of primes p ≤ x which can be written as the sum of two squares is 1 x ∼ 2 log x as x→∞.

§4 Hecke’s L-functions

21

§4 Hecke’s L-functions We begin by constructing the analogues of Dirichlet’s L-functions. We ﬁrst need to deﬁne the notion of “ideal classes” and then deﬁne characters of these classes. Let K be an algebraic number ﬁeld and f an ideal of OK . A natural starting point is to consider the ideal class group and to deﬁne characters of this group. One can generalize this to obtain a notion of ideal classes mod f as follows. The multiplicative group generated by all ideals coprime to f will be denoted by I(f). The principal ray class P (f) (mod f) is the subgroup of principal ideals of the form (α/β) with (i) α, β ∈ OK and coprime to f; (ii) α ≡ β(mod f); (iii) α/β is totally positive (that is, all its real conjugates are positive). The quotient group G(f) = I(f)/P (f) is called the ray class group mod f. The elements of this group are called ray classes. These will be considered as analogues of the groups (Z/mZ)∗ in the rational number ﬁeld case. Let us note that without the totally positive condition, the construction leads to (Z/mZ)∗ /{±1} if K = Q and f = (m). Let χ be a character of the abelian group G(f). Deﬁne χ(a) L(s, χ) = Nas a where the sum is over integral ideals a of K. This series converges absolutely for Re(s) > 1 as is seen by comparing with the Dedekind zeta function which converges absolutely in that region. We again have the Euler product: −1 χ(p) L(s, χ) = 1− Nps p which is valid for Re(s) > 1. This product shows the non-vanishing of L(s, χ) in that region. If χ = χ0 , the trivial character, Hecke showed that L(s, χ) extends to an entire function. By considering f (s) = ζK (s)L(s, χ)L(s, χ)L(s, χχ) and applying Theorem 1.2, we deduce that f (s) does not vanish on Re(s) = 1 for s = 1. In the latter case, we consider g(s) = ζK (s)3 L(s, χ)4 L(s, χ2 ) which has non-negative coeﬃcients. We can apply Theorem 1.2 again to get the non-vanishing of L(1, χ) provided χ2 = 1. In the last case, we need to consider as before ζK (s)L(s, χ) h(s) = ζK (2s) and apply the reasoning as before. This allows us to deduce

22

Chapter 1 The Prime Number Theorem and Generalizations

Theorem 4.1. class is

The number of prime ideals with norm less than x, in a given ray 1 x |G(f)| log x

as x→∞. We can consider a more general situation. Recall that each ﬁnite prime p deﬁnes a valuation vp : K→Z given by vp (α) = the exponent of p in the prime ideal decomposition of the principal ideal (α). We can extend this deﬁnition to ideals. A generalized ideal a of K is an ideal af together with a set of embeddings {σ1 , . . . , σi } of K into R. We will say that α ∈ K satisﬁes the congruence α ≡ 1(mod a) if α is a unit at all the primes dividing af , vp (α − 1) ≥ vp (af ) and σj (α) > 0 for j = 1, . . . , i. We can now deﬁne G(a) as the quotient group of fractional ideals coprime to af modulo principal ideals (α) with α ≡ 1(mod a). Hecke deﬁned L-series for characters of these generalized ideal class groups and an analogue of Theorem 4.1 is true for prime ideals in a given ideal class. We recomend that the reader study the theory of Hecke L-functions as explained, for example, in Lang [L], both from the classical and adelic points of view (Tate’s thesis).

Exercises 1. Let f (t) have a continuous derivative f (t), for t ≥ 1. Let cn for n ≥ 1 be constants and let C(u) = cn . n≤u

Then, prove that

x

cn f (n) = f (x)C(x) −

f (t)C(t)dt,

1

n≤x

and that

f (n) =

n≤x

x

x

f (t)dt + 1

Deduce that

(t − [t])f (t)dt + f (1) − (x − [x])f (x).

1

Λ(n) = x + o(x)

n≤x

as x→∞ if and only if lim x→∞

π(x) = 1. x/ log x

References

23

2. Prove that 2k + 1 +

2k

⎛ 2(2k + 1 − j) cos(jθ) = ⎝1 + 2

j=1

k

⎞2 cos jθ⎠ .

j=1

Notice that the case k = 1 is the classical trigonometric identity of Hadamard and de la Vall´ee Poussin. 3. Suppose we had a trigonometric polynomial a0 + a1 cos θ + · · · an cos nθ ≥ 0. Show that in the proof of Theorem 1.2, we obtain k ≤ |a0 /a1 |e. Show further that |a0 /a1 | > 1/2. (See F´ejer [Fe].) 4. For (a, b) = 1, compute gcd {φ(an + b) : n ∈ Z}. What about the same question for φ(an2 + bn + c) ? 5. (Landau’s theorem) Suppose an ≥ 0 and f (s) =

∞ an ns n=1

has abscissa of convergence equal to α. Show that f (s) has a singularity at s = α.

References [Ellison] W. Ellison, Prime numbers, John Wiley and Sons, Paris, Hermann, 1985. ¨ [Fe] L. F´ejer, Uber trigonometrische Polynome, J. Reine Angew. Math., 146 (1916), pp. 53–82. [Ka] Jean-Pierre Kahane, Jacques Hadamard, Math. Intelligencer, Vol. 13, No. 1, (1991), pp. 23–29. [L]

S. Lang, Algebraic Number Theory, Springer-Verlag, 1986.

[VKM] V. Kumar Murty, On the Sato-Tate conjecture, in Number Theory related to Fermat’s Last Theorem, (ed. N. Koblitz), Progress in Mathematics, Vol. 26, (1982), pp. 195–205. [Rudin] W. Rudin, Real and Complex analysis, Bombay, Tata Mcgraw-Hill Publishing Co. Ltd., 1976.

Chapter 2 Artin L-Functions

§1 Group-theoretic background In this section, we shall collect together a few group theoretic preliminaries. We begin by reviewing the basic aspects of characters and class functions. Let G be a ﬁnite group. If f1 , f2 : G → C are two C-valued functions on G, we deﬁne their inner product by (f1 , f2 ) =

1 f1 (g)f2 (g). |G| g∈G

If f : G → C is a C-valued function on G, and σ ∈ G, we deﬁne f σ : G → C by f σ (g) = f (σgσ −1 ). We say f is a class function if f σ = f for all σ ∈ G. Let H ⊆ G be a subgroup and f : H → C a class function on H. We deﬁne a class function IndG Hf :G→C on G as follows. Let g1 , . . . , gr (r = [G : H]) be coset representatives for H in G (so that G = ∪gi H). Extend f to a function f˙ on G by setting f˙(g) = Then (IndG H f )(g) =

f (g) 0

g∈H g ∈H

r

1 ˙ −1 f˙(gi−1 ggi ) = f (s gs). |H| i=1 s∈G

Let f1 be a class function on the subgroup H and f2 a class function on G. The Frobenius reciprocity theorem tells us that (f1 , f2 |H ) = (IndG H f1 , f2 ).

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_3, © Springer Basel AG 1997

25

26

Chapter 2 Artin L-Functions

Let H1 , H2 be subgroups of G and let f be a class function on H2 . Suppose that G = H1 H2 . Then one of Mackey’s theorems tells us that G H1 H2 H1

H2

H1 (IndG H2 f )|H1 = IndH1 ∩H2 (f |H1 ∩H2 ).

Let ρ : G → GLn (C) be an irreducible representation of G and set χ = T r ρ, the character of ρ. Then χ is a class function on G and every class function is a Clinear combination of characters χ of irreducible representations. A class function which is a Z-linear combination of characters will be called a generalized character. For each g ∈ G, deﬁne a symbol xg and consider the C-vector space V = ⊕g G Cxg . If |G| = n, then dim V = n. The regular representation regG of G regG : G → GL(V ) is deﬁned by → xσg ) . σ → (xg Its character will be denoted by the same letter and we easily see that regG (σ) =

n 0

In terms of characters regG =

σ = e (identity) σ = e.

χ(1)χ

χ

where the sum is over all irreducible characters of G. In terms of induction, regG = IndG {e} 1 where 1 denotes the (trivial) character of the identity subgroup {e}. The reader is referred to Serre [Se1] for an excellent introduction to the representation theory of ﬁnite groups.

§2 Deﬁnition and basic properties of Artin L-functions

27

§2 Deﬁnition and basic properties of Artin L-functions Now let L/K be a Galois extension of number ﬁelds, with group G. For each prime p of K, and a prime q of L with q|p, we deﬁne the decomposition group Dq to be Gal(Lq /Kp ) where Lq (resp. Kp ) is the completion of L (resp. K) at q (resp. p). We have a map from Dq to Gal(kq /kp ) (the Galois group of the residue ﬁeld extension) which by Hensel’s lemma is surjective. The kernel Iq is the inertia group. We thus have an exact sequence 1 → Iq → Dq → Gal(kq /kp ) → 1. → xNp where Np is the carThe group Gal(kq /kp ) is cyclic with a generator x dinality of kp . We can choose an element σq ∈ Dq whose image in Gal(kq /kp ) is this generator. We call σq a Frobenius element at q and it is only deﬁned mod Iq . We have Iq = 1 for all unramiﬁed p (and in particular, these are all but ﬁnitely many p) and so for these p, σq is well-deﬁned. If we choose another prime q above p,then Iq and Dq are conjugates of Iq and Dq . For p unramiﬁed, we denote by σp the conjugacy class of Frobenius elements at primes q above p. Let ρ be a representation of G : ρ : G → GLn (C). Let χ denote its character. For Re(s) > 1, we deﬁne the partial L-function by det(I − ρ(σp )(Np)−s )−1 Lunramiﬁed (s, χ, K) = p unramiﬁed

where the product is over primes p of K with Iq = 1 for any q of L with q|p. To obtain an L-function which has good analytic properties (such as functional equation), it is necessary to also deﬁne Euler factors at the primes p which are ramiﬁed in L and also at inﬁnite primes of K. Let p be a prime of K which is ramiﬁed in L, and q a prime of L above p. Let V be the underlying complex vector space on which ρ acts. Then we may restrict this action to the decomposition group Dq and we see that the quotient Dq /Iq acts on the subspace V Iq of V on which Iq acts trivially. Now we see that any σq will have the same characteristic polynomial on this subspace and we deﬁne the Euler factor at p to be this polynomial: Lp (s, χ, K) = det(I − ρ(σq )|V Iq (Np)−s )−1 . This is well-deﬁned and gives the Euler factors at all ﬁnite primes. Remark. Since G is a ﬁnite group, once ρ is given, there are only a ﬁnite number of characteristic polynomials that can occur. For example, if we take the trivial onedimensional representation, only the polynomial (1 − T ) occurs. But the subtlety in the Artin L-function is the assignment p → σp . In other words, which one of the ﬁnite number of characteristic polynomials is assigned to a given prime p determines and is determined by the arithmetic of the ﬁeld extension, in particular the splitting of primes.

28

Chapter 2 Artin L-Functions

We have also to deﬁne the Archimedean Euler factors. For each Archimedean prime v of K we set ((2π)−s Γ(s))χ(1) if v is complex Lv (s, χ, K) = ((π −s/2 Γ(s/2))a (π −(s+1)/2 Γ((s + 1)/2))b if v is real. Here a + b = χ(1) and a (resp. b) is the dimension of the +1 eigenspace (resp. −1 eigenspace) of complex conjugation. We shall write γ(s, χ, K) = Lv (s, χ, K). v inﬁnite

The Artin L-function L(s, χ, K) satisﬁes a functional equation of the following type. First, one deﬁnes the Artin conductor fχ associated to χ. It is an ideal of K and is deﬁned in terms of the restriction of χ to the inertia groups and its various subgroups. More precisely, let ν be a place of K. Let w be a place of L dividing ν and let G0 denote the inertia group Iw at w. We have a descending ﬁltration of higher ramiﬁcation groups (see [CF], p. 33]). G0 ⊇ G1 ⊇ · · · . Let V be the underlying representation space for ρ. Deﬁne ∞ |Gi | n(χ, ν) = codim(V Gi ). |G | 0 i=0

Then n(χ, ν) is an integer and is well-deﬁned (that is, it is independent of the choice of w above ν). Moreover, it is equal to zero apart from a ﬁnite number of ν. This allows us to deﬁne the ideal fχ = pn(χ,ν) . ν ν

We also set χ(1)

Aχ = dK NK/Q fχ . Let us set Λ(s, χ, K) = As/2 χ γ(s, χ, K)L(s, χ, K). Then we have the functional equation Λ(s, χ, K) = W (χ)Λ(1 − s, χ, ¯ K) where W (χ) is a complex number of absolute value 1.

§2 Deﬁnition and basic properties of Artin L-functions

29

The number W (χ) itself carries deep arithmetic information. For example, it is related to Galois module structure. The reader is referred to the monograph [Fr] of Fr¨ ohlich for an introduction to this subject. We now recall some of the formalism of Artin L-functions and their basic properties. It is summarized in the two properties: L(s, aχ χ, K) = L(s, χ, K)aχ for any aχ ∈ Z (1) χ

L(s,

χ

IndG H

χ, K) = L(s, χ, LH ) where LH is the subﬁeld of L ﬁxed by H. (2)

Using (1) and (2), we ﬁnd that L(s, χ, K)χ(1) =L(s, regG , K) = L(s, 1, L) = ζL (s) χ irred

=

(1 − (Nq)−s )−1 . q

There is a theorem of Brauer which says that for any irreducible χ, there are subgroups {Hi }, one-dimensional characters ψi of Hi and integers mi Z with χ= mi IndG Hi ψi . i

Using (1) and (2), we see that L(s, χ, K) =

L(s, ψi , LHi )mi . i

If χ is one-dimensional, then Artin’s reciprocity theorem identiﬁes L(s, χ, K) with a Hecke L-series for a ray class character. By Hecke and Tate, we know the analytic continuation of these L-series (see Chapters 13 and 14 of [La]). From the Brauer induction theorem, it follows that any Artin L-function has a meromorphic continuation. Artin’s conjecture asserts that every Artin L function ¯ L(s, χ, K) associated to a character χ of Gal(K/K) has an analytic continuation for all s except possibly for a pole at s = 1 of order equal to the multiplicity of the trivial representation in ρ. (Note that χ determines ρ up to isomorphism and so our notation is justiﬁed). This is a very central and important conjecture in number theory. It is part of a general reciprocity law. The conjecture of Artin is known to hold in many cases. Most of these arise from a combination of the one-dimensional case and group theory. Some examples are given in the exercises. Returning to the general case, we see from the factorization L(s, χ, K)χ(1) ζL (s) = χ irred

Chapter 2 Artin L-Functions

30

that Artin’s conjecture implies that ζL (s)/ζK (s) is entire. In fact, let L/K be a ˜ (not necessarily Galois) ﬁnite extension and let K/K be its normal closure. Say ˜ ˜ G = Gal(K/K) and H = Gal(K/L). Then L(s, IndG H (1H ), K) = L(s, 1H , L) = ζL (s). On the other hand,

IndG H 1H = 1G +

aχ χ

1=χ irred

with 0 ≤ aχ ∈ Z. So, L(s, IndG H 1H , K) = ζK (s)

L(s, χ, K)aχ .

Putting these together, we see that Artin’s conjecture implies that ζL (s)/ζK (s) is entire, whether L/K is Galois or not. This special case of Artin’s conjecture is called Dedekind’s conjecture. Below, we shall discuss it in several cases. In particular, it is known to hold in the case L/K is Galois (Aramata-Brauer) and ˜ in case L/K is solvable (Uchida-van der Waall).

§3 The Aramata-Brauer Theorem Let L/K be Galois with group G. Theorem 3.1

The quotient ζL (s)/ζK (s) is entire.

By the properties of Artin L-functions described in §2, the Theorem follows from the following result. Proposition 3.2 There are subgroups {Hi }, 1-dimensional character ψi of Hi and 0 ≤ mi ∈ Z so that mi IndG regG −1G = Hi ψi . (Note that (regG , 1G ) = (IndG {e} 1, 1G ) = (1, 1G |{e} ) = 1 by Frobenius reciprocity). For any cyclic subgroup A deﬁne θA : A → C by θA (σ) = |A| if σ generates A 0 else and λA = φ(|A|) regA −θA , where φ denotes Euler’s function. Thus,

λA (σ) =

φ(|A|)|A| if σ = 1 −θA (σ) if σ =1

§3 The Aramata-Brauer Theorem

31

Proposition 3.2 will be proved in two steps. Step 1. λA =

mχ χ with mχ ≥ 0, mχ ∈ Z and χ ranges over the characters of A.

Step 2. regG −1G = of G.

1 |G|

G A IndA

λA where the sum is over all cyclic subgroups A

To prove Step 1, it is enough to show that (λA , χ) ≥ 0 for any irreducible χ of A. But (λA , χ) = φ(|A|) − (θA , χ) = φ(|A|) − χ(σ) = (1 − χ(σ)) σ∈A <σ>=A

σ∈A <σ>=A

= T r(1 − χ(σ)) ∈ Z (for any generator σ of A) Now for χ = 1, Re(1 − χ(σ)) > 0 if σ = e and = 0 if σ = e. Then, if A = {1}, (λA , χ) is positive for all χ = 1 and = 0 if χ = 1. If A = {1} then λA = 0. This proves Step 1. To prove the equality of Step 2, it is enough to show that for any irreducible character ψ of G, both sides have the same inner product with ψ. Now (regG −1G )(g)ψ(g) ψ(g) = |G|ψ(1) −

(|G|(regG −1G ), ψ) =

g G

Also, by Frobenius reciprocity, (IndG (λA , ψ|A ) A λA , ψ) = A

A

= {φ(|A|)ψ(1) − ψ(σ)} σ∈A <σ>=A

A

= ψ(1)

φ(|A|) −

A

Now

A

φ(|A|) =

A

σ∈A <σ>=A

ψ(σ).

σ G

1=

1 = |G|.

σ G

This completes Step 2 and the proof of Proposition 3.2.

Chapter 2 Artin L-Functions

32

We illustrate the equality of Step 2 above with an example. Let L/Q be a biquadratic extension (Galois). Then the identity is L K1 K 2 K3 Q

ζL (s) ζ(s)

4

=

ζL (s) ζK1 (s)

2

ζL (s) ζK 2 (s)

2

ζL (s) ζK3 (s)

2

which when unwound, gives the usual factorization ζL (s) = ζ(s)L(s, χ1 )L(s, χ2 )L(s, χ3 ).

§4 Dedekind’s conjecture in the non-Galois case As explained in §2, Artin’s holomorphy conjecture implies that the quotient ζK (s)/ζF (s) is entire even when K/F is not normal. This latter assertion, called Dedekind’s conjecture, is still an open problem in general. Dedekind’s conjecture has been settled in a few cases, notably for extensions K/F whose normal closure has solvable Galois group. This is due to Uchida [Uc] and van der Waall [vdW]. In fact, their method allows us to prove the following. Theorem 4.1 Let K/F be a ﬁnite extension of number ﬁelds and suppose that ˜ the normal closure K/F has Galois group G which is the semidirect product of ˜ H = Gal(K/K) by an abelian normal subgroup A of G. Then Dedekind’s conjecture is true for K/F . That is, ζK (s)/ζF (s) is entire. Proof. Let us write IndG H (1H ) =

mχ χ

where 0 ≤ mχ ∈ Z, m1 = 1 and χ ranges over the irreducible characters of G. Consider IndG mχ χ|A . H (1H )|A = By Mackey’s theorem, A A IndG H (1H )|A = IndH∩A (1H |H∩A ) = Ind{1} 1 = regA .

§4 Dedekind’s conjecture in the non-Galois case

Thus, IndG H (1H )|A =

33

where ranges over all the irreducible characters of A. Thus, for all χ, mχ = 0 or 1 and (χ|A , ) = 0 or 1 for any ∈ Irr(A). Now, take an ∈ Irr(A) such that there is a χ ∈ Irr(G) with mχ = 0 and (χ|A , ) = 1. Let T be the inertia group of : T = {σ ∈ G : σ = }. → (σaσ−1 ).) Of course, T ⊇ A and we can write it (Here, σ is the character a as T = H A where H ⊆ H. We can extend to a character ˜ of T by setting ˜(ha) = (a) for any h ∈ H , a ∈ A. Let us write

IndTA = IndTA ˜|A =

mψ ψ.

ψ∈Irr(T )

Notice that IndTA (g) =

[T : A] (g) 0

Thus, [T : A] = (IndTA )|A =

if g ∈ A . otherwise

mψ ψ|A .

Thus for every ψ ∈ Irr(T ), with mψ = 0, ψ|A is a multiple of . In fact, mψ = (ψ, IndTA ) = (ψ|A , ), and so ψ|A = mψ . It follows that

m2ψ = [T : A].

From this, we deduce that the characters {IndG T ψ} are distinct and irreducible. Indeed, we have G G [T : A] = m2ψ ≤ (IndG A , IndA ) = ( , (IndA )|A ) and IndG A |A = [T : A]

g

where the sum on the right is over a set of coset representatives for T in G. By deﬁnition of T , the conjugates g are distinct and our claim follows. Now, mψ (χ, IndG 1 = (χ|A , ) = (χ, IndG A ) = T ψ). ψ∈Irr(T )

Thus, there is a unique φ = φ(χ) ∈ Irr(T ) with mφ = 1 and (χ, IndG T φ) = 1. By the irreducibility of both characters, it follows that χ = IndG φ. Also, as T ψ|A = mψ , we have φ(1) = mφ = 1. Hence, χ is the induction of a linear character. This proves that IndG H 1H is a sum of monomial characters and the proposition follows.

34

Chapter 2 Artin L-Functions

Corollary 4.2 (Uchida, van der Waall) Let K/F be an extension of number ˜ a normal closure of K/F . Suppose that Gal(K/F ˜ ) is solvable. Then ﬁelds and K ζK (s)/ζF (s) is entire. ˜ ) and H = Gal(K/K), ˜ Proof. As above, we set G = Gal(K/F We proceed by induction on the order of |G|. We may assume that H is a maximal subgroup of G. For if J is a maximal subgroup of G with H ⊂ J ⊂ G, and M is the ﬁxed ﬁeld of J, then ζK (s)/ζF (s) = (ζK (s)/ζM (s)) (ζM (s)/ζF (s)) where the ﬁrst factor on the right is entire by the induction hypothesis and the second by the maximality of J. Also, since G corresponds to the normal closure of K/F, we may assume that H does not contain any proper non-trivial normal subgroup of G. Now let A be a minimal normal subgroup of G. As G is solvable, such an A exists and is (elementary) abelian. Moreover, A is not contained in H. Then HA = G and H ∩ A = {1}. Indeed, the ﬁrst equality is just the maximality of H and the second follows from the minimality of A and the observation that H ∩ A is again a normal subgroup. Thus, G is the semidirect product of A by H and Theorem 4.1 applies. Finally in this section, we can ask the following variant of Dedekind’s conjecture. Let L/K be an extension with group G, and let H be a subgroup. Let ρ be an irreducible representation of G. Then, is the quotient L(s, IndG H (ρ|H ), K)/L(s, ρ, K)

(‡)

entire? This includes the general case of Dedekind’s conjecture (if we take ρ = 1G ). (‡) can be proved by the method of the Proposition above, if G is solvable and ρ is an abelian character. Indeed, we need only make two observations. First, if we write mχ χ IndG H (ρ|H ) − ρ = then restricting to A shows that

mχ χ|A = ρ(1)

.

Moreover, if G is any group and A is an abelian normal subgroup, and is an (irreducible) character of A, then IndG A =

mi IndG T ψi

where ψi (1) = mi and T is the inertia subgroup of . Thus, if ρ(1) = 1, and = 0, then (χ|A , ) 1 = (χ|A , ) = (χ, IndG A ) =

mi (χ, IndG T ψi )

§5 Zeros and poles of Artin L-functions

35

and as before, this implies that there is an i with mi = 1 and χ = IndG T ψi . Even G without assuming ρ(1) = 1, we get χ = IndT ψi for some and i. But we may not know the holomorphy of L(s, ψi ). (Notice also, that we can restrict to the case H is a maximal subgroup. For if J ⊇ H is a maximal subgroup, (M = ﬁxed ﬁeld of J) G J L(s, IndG H (ρ|H ), K)/L(s, IndJ (ρ|J ), K) = L(s, IndH (ρ|H ), M )/L(s, ρ|J , M ).

and so L(s, IndG L(s, IndJH (ρ|H ), M ) L(s, IndG H (ρ|H ), K) J (ρ|J ), K) = · . L(s, ρ, K) L(s, ρ|J , M ) L(s, ρ, K)

§5 Zeros and poles of Artin L-functions There is another approach to the Aramata-Brauer theorem which does not explicitly use the decomposition of regG −χ into monomial characters. To describe it, let us set nχ = nχ (s0 ) = ords=s0 L(s, χ, F ). Then, in [St], the inequality

n2χ ≤ r2 ,

r = ords=s0 ζK (s)

χ∈Irr(G)

is proved. From this, it follows for example that ζK (s)/L(s, χ, F ) is entire except possibly at s = 1, and that the same holds for the product ζK (s)L(s, χ, F ). This raises the question of whether regG −χ can be decomposed as a non-negative sum of monomial characters. This was answered in the aﬃrmative by Rhoades [R]. Some special cases were computed in [Mu1]. Our approach applies in a wider context of an L-function formalism which is satisﬁed by a variety of objects in number theory and algebraic geometry. Let G be a ﬁnite group. For every subgroup H of G and complex character ψ of H, we attach a complex number n(H, ψ) satisfying the following properties: (1) Additivity: n(H, ψ + ψ ) = n(H, ψ) + n(H, ψ ), (2) Invariance under induction: n(G, IndG H ψ) = n(H, ψ). The formalism can be applied to the above case when G is the Galois group of a normal extension K/k and n(H, ψ) is the order of the zero at s = s0 of the Artin L-series attached to ψ corresponding to the Galois extension K/K H . It can also be applied to the situation when E is an elliptic curve over k and n(H, ψ) corresponds to the order of the zero at s = s0 of the “twist” by ψ of the L-function of E over K H (see [MM] for deﬁnitions and details).

Chapter 2 Artin L-Functions

36

We consider the following generalized character introduced by Heilbronn: θH =

n(H, ψ)ψ

ψ

where the sum is over all irreducible characters ψ of H. Our ﬁrst step is to show that Proposition 5.1

θG |H = θH .

Proof. θG |H =

χ

=

n(G, χ)χ|H ⎛

⎞ n(G, χ) ⎝ (χ|H , ψ)ψ ⎠

χ

ψ

where the inner sum is over all irreducible characters of H and the outer sum is over all irreducible characters of G. By Frobenius reciprocity, (χ|H , ψ) = (χ, IndG H ψ) and so G θG |H = n(G, χ)(χ, IndH ψ) ψ. ψ

χ

But now, by property (1), the inner sum is n(G, IndG H ψ) which equals n(H, ψ) by property (2). Thus, θG |H = θH . This immediately implies: Proposition 5.2 Let reg denote the regular representation of G. Suppose for every cyclic subgroup H of G, we have n(H, ψ) ≥ 0. Then n(G, χ) is real for every irreducible character χ of G and

n(G, χ)2 ≤ n(G, reg)2 .

χ

Proof. By Artin’s theorem, every character can be written as a rational linear combination of characters induced from cyclic subgroups and so n(G, χ) is real. By the orthogonality relations, (θG , θG ) =

n(G, χ)2 .

χ

On the other hand, (θG , θG ) =

1 |θG (g)|2 . |G| g∈G

§6 Low order zeros of Dedekind zeta functions

37

By Proposition 5.1, θG (g) = θ g (g) =

n(g, ψ)ψ(g)

ψ

which is bounded by n(G, reg) in absolute value by our hypothesis and property (1). This completes the proof. Similar reasoning implies Proposition 5.3 Let ρ be an arbitrary character of G. Suppose for every cyclic subgroup H of G, and irreducible character ψ of H, we have n(H, ρ|H ⊗ ψ) ≥ 0, then n(G, ρ ⊗ χ) is real for every irreducible character χ of G and

n(G, ρ ⊗ χ)2 ≤ n(G, ρ ⊗ reg)2 .

χ

These results can also be generalized to the context of automorphic forms. Some preliminary work in this direction can be found in [MM].

§6 Low order zeros of Dedekind zeta functions By analogy with the conjecture that the zeros of the Riemann zeta function are simple, one expects that the nχ are bounded. One might ask whether nχ χ(1) or even the stronger nχ 1 holds. We begin by establishing a zero-free region for Dedekind zeta functions. This is due to Stark [St]. This in turn gives a region where Artin L-functions are zerofree except possibly for a simple exceptional zero. Proposition 6.1 Let M be an algebraic number ﬁeld of degree n = r1 + 2r2 where M has r1 real embeddings and 2r2 complex conjugate embeddings. For σ > 1 we have ζM 1 1 1 |dm | r1 Γ Γ − (σ) < + + log + (σ/2) + r (σ). 2 ζM σ σ−1 2 22r2 π n 2 Γ Γ Also, if M = Q, ζM has at most one zero in the region σ ≥1−

1 , 4 log |dm |

|t| ≤

1 . 4 log |dm |

Chapter 2 Artin L-Functions

38

Proof. Consider f (s) = s(s − 1)ζM (s). By logarithmically diﬀerentiating the Hadamard factorization, we get the relation ρ

1 1 1 = + log |dm | s−ρ s−1 2 1 n r1 Γ s Γ ζ − log π + ( ) + r2 (s) − log 2 + M (s). + s 2 2 Γ 2 Γ ζM

The sum on the left runs over zeros ρ of ζM (s) in the strip o < σ < 1 and the terms with ρ and ρ¯ are grouped together. For s = σ > 1 we have 1 1 + > 0. σ − ρ σ − ρ¯ Thus for σ > 1 we have

ρ

1 1 ≤ σ−ρ σ−ρ ρ

where the sum on the left denotes summation over any convenient subset of the zeros ρ which is closed under complex conjugation. In particular, the sum

1 σ−ρ

is positive and we deduce the inequality of the Proposition. Now take s = σ with 1 < σ < 2. All the terms on the right of the above inequality after 12 log |dm | are negative and thus

ρ

1 1 1 < + log |dm |. σ−ρ σ−1 2

If ρ = β + iγ is in the rectangle speciﬁed in the statement, (with γ = 0) then ρ¯ is also in the same rectangle and taking the contribution from ρ and ρ¯ only, we get the inequality 2(σ − β) 1 1 < + log |dm |. 2 2 (σ − β) + γ σ−1 2 But this is false for M = Q at σ = 1 + log 1|dm | < 2. The same value of σ gives a contradiction if there are two real zeroes in this rectangle (or a single real multiple zero). This completes the proof of the Proposition. The following consequence is also due to Stark [St].

§6 Low order zeros of Dedekind zeta functions

39

Corollary 6.2 Let K/F be a Galois extension. For any Artin L-function L(s, χ, F ) of this extension, the region σ ≥1−

1 , 4 log |dk |

|t| ≤

1 . 4 log |dk |

is free of zeros except possibly for a simple zero. This zero exists only if χ is a real Abelian character of a quadratic subﬁeld of K. Next, we examine the case when the Dedekind zeta function may in fact vanish, but the order of zero is small. We shall study this under the assumption that K/F itself is a solvable Galois extension. If we are at a point s = s0 where ζK (s) has a “small” order zero, then it is possible to show more than just the analyticity of ζK (s)/ζF (s) at s0 . We have the following result due to Foote and K. Murty [FM]. Theorem 6.3

Let K/F be a solvable extension and write αt 1 [K : F ] = pα 1 · · · pt ,

p1 < p2 < · · · < pt

for the prime power decomposition of the degree. Suppose that at s = s0 , we have r = ords=s0 ζK (s) ≤ p2 − 2. Then for each χ ∈ Irr(G), the Artin L-series L(s, χ, F ) is analytic at s = s0 . This has the following immediate corollary. Corollary 6.4 If K/F is a Galois extension of odd degree and ζK (s) has a zero of order ≤ 3 at a point s0 then all Artin L-functions of K/F are analytic at s0 . This represents a partial generalization of the result Corollary 6.2 of Stark. Of course, Stark’s result makes no assumption on the Galois group of K/F . We give a brief outline of the proof. Assume the theorem is false, and take G to be a minimal counterexample for which Artin’s conjecture fails, at a point s = s0 where the order of ζK (s) is small as explained in the statement. We want to prove that the generalized character θG deﬁned above is an actual character. We repeatedly use the two key properties of θG namely, θG |H = θH for any subgroup H of G and θG (1) = ords=s0 ζK (s). The ﬁrst follows from Proposition 5.1 and the second follows from the factorization of ζK into the L(s, χ, F ). Moreover, by our assumption of minimality, we may suppose that θH is a character for every proper subgroup H of G. In addition, the

Chapter 2 Artin L-Functions

40

induction hypothesis and the invariance of L-functions under induction allow us to assume that χ is not induced from any proper subgroup of G. Also, we may assume that χ is faithful. For if Ker χ is non-trivial and M (say) denotes its ﬁxed ﬁeld, then by the Aramata-Brauer theorem (Theorem 3.1), ζM (s) divides ζK (s). In particular, ords=s0 ζM (s) ≤ r and the second smallest prime divisor of [M : F ] is ≥ p2 . Since L(s, χ, F ) is the same whether viewed as an L-function of K or M, the analyticity of this L-function at s0 would follow from the induction hypothesis. We now decompose θG into three parts θ1 , θ2 , θ3 as follows. Let θ3 be the sum of all terms nλ λ such that λ is not a faithful character of G. Let −θ2 be the sum of all terms nχ χ for which nχ is negative. Finally, let θ1 be the sum of all terms nψ ψ where ψ is a faithful character with nψ > 0. Again by the assumption of minimality, we see that (θ2 , θ3 ) = 0 and by deﬁnition, θ1 is orthogonal to θ2 and θ3 . Thus, we get the decomposition θG = θ1 − θ2 + θ3 . We will now get further information about the constituents of θ2 by restricting to an appropriate subgroup. As we shall see, a key tool in this is Cliﬀord’s theorem. It provides us with two pieces of information. Firstly, since G is solvable and non-abelian, it has a normal subgroup N of prime index, p say, which contains the center Z(G) of G. Cliﬀord’s theorem tells us that for any χ ∈ Irr(G), χ|N is either irreducible or χ is the induction of a character from N . In particular, if we take for χ a summand of θ2 , it follows that χ|N is irreducible. Secondly, it tells us that any abelian normal subgroup must be central (that is, contained in the center), for otherwise every χ ∈ Irr(G) would be induced from a proper subgroup contradicting the non-triviality of θ2 . Now, every non-trivial normal subgroup of a solvable group contains a nontrivial abelian subgroup which is normal in G. Thus, no irreducible constituent λ of θ3 is faithful on the center. We must therefore have (θ2 |N , θ3 |N ) = 0. Since θG |N = θ1 |N − θ2 |N + θ3 |N is a character of N, it follows that either θ1 |N = θ2 |N or θ1 |N = θ2 |N + φ for some character φ of N . A further argument using Cliﬀord’s theorem in fact eliminates the second possibility. Indeed, choose an irreducible component α of φ = θ1 |N − θ2 |N and let ψ be an irreducible component of θ1 − θ2 such that ψ|N contains α. Notice that the G conjugates of α are also contained in θ1 |N − θ2 |N

§7 Chebotarev density theorem

41

and hence also in ψ|N . It follows that the sum of the distinct conjugates form a character of degree ≤ φ(1). Cliﬀord’s theorem tells us that ψ|N is equal to this sum and so ψ(1) ≤ φ(1) ≤ r. Also, ψ is a constituent of θ1 and so it is faithful by deﬁnition. Now, a theorem of Ito tells us that in a solvable group, a p-Sylow subgroup is abelian and normal if there is a faithful character of degree smaller than p − 1. Thus, the conclusion of the previous paragraph and our assumption that r ≤ p2 −2 1 imply that G has an abelian normal subgroup of order n/pα 1 . This would force G to be nilpotent and Artin’s conjecture is known to hold for such groups as every irreducible character is monomial. This again contradicts the nontriviality of θ2 . We conclude that θ1 |N = θ2 |N . This is the only step in which the assumed bound on r is used. The ﬁnal contradiction now comes by showing that θ1 = θ2 . To do this, take x ∈ G\N . Denote by H the subgroup generated by x and the center of G. As H is abelian, it is a proper subgroup of G. As we observed earlier, every irreducible component λ of θ3 has the property that its kernel Ker λ meets the center non-trivially. Thus, the same holds for IndG H (λ|H ). Now, taking an irreducible component χ of θ2 , we know that χ is faithful and so (χ, IndG H (λ|H )) = 0. By Frobenius reciprocity, (χ|H , λ|H ) = 0 and so (θ2 |H , θ3 |H ) = 0. Now, θG |H = θ1 |H − θ2 |H + θ3 |H and again θG |H is a character of H. Thus, θ1 |H − θ2 |H is either zero or a character. By our earlier argument, we know that θ1 (1) = θ2 (1) and so we must have θ1 |H = θ2 |H . Combined with our earlier result for N it follows that θ1 = θ2 . This contradiction completes the proof. The argument suggests that the condition r ≤ p2 − 2 be replaced by a bound on r involving the least degree of a faithful character. Results in this direction have in fact now been obtained by Foote [Fo] and by Foote and Wales [FW].

§7 Chebotarev density theorem Let K/F be a ﬁnite Galois extension of number ﬁelds with group G. Let C be a subset of G which is stable under conjugation. Thus C is a union of conjugacy classes. Deﬁne πC (x) = #{ν a place of F unramiﬁed in K, NF/Q (pν ) ≤ x and σν ⊂ C}.

Chapter 2 Artin L-Functions

42

The Chebotarev density theorem asserts that πC (x) ∼

|C| πF (x) |G|

where πF (x) denotes the number of primes of F of norm ≤ x. Eﬀective versions of this theorem were given by Lagarias and Odlyzko [LO]. We state two of their results. The ﬁrst of these assumes the Riemann Hypothesis for Dedekind zeta functions. The second is unconditional. Theorem 7.1 Suppose the Dedekind zeta function ζK (s) satisﬁes the Riemann hypothesis. Then πC (x) =

1 |C| |C| πF (x) + O( · x 2 (log dL + nL log x)). |G| |G|

This version of their result is only slightly more reﬁned than the statement given in [LO] and is due to Serre [Se2, p. 133]. The proof of Theorem 7.1 is very analogous to the classical proof of the prime number theorem in arithmetic progressions, as presented, for example, in the monograph of Davenport [D]. However, there are some points of diﬀerence and we now brieﬂy discuss them. As in the classical case, the proof begins by expressing the characteristic function of the conjugacy class C in terms of characters of G. However, we have to deal with the fact that G is non-abelian and that we do not know the analytic properties of Artin L-functions. In particular, we do not know Artin’s conjecture. We have |C| δC = χ(gC )χ |G| χ where δC denotes the characteristic function of the class C and gC is any element in this class. Hence |C| π(x, δC ) = χ(gC )π(x, χ) |G| χ where for any class function φ we set π(x, φ) =

φ(σν ).

Nν≤x

Here the sum is over places ν of F unramiﬁed in K and of norm ≤ x. If we want to include ramiﬁed primes and also prime powers in the sums, we introduce the function π ˜ (x, φ) = φ(σνm ) Nν m ≤x

§7 Chebotarev density theorem

43

where in the case ν is a ramiﬁed prime, we deﬁne φ(σνm ) =

1 φ(g) |Iw |

where Iw is the inertia group at a prime w of K dividing ν and the sum is over elements g in the decomposition group Dw whose image in the quotient Dw /Iw maps to σνm . The advantage in this sum π ˜ is that it is closely related to the logarithmic derivative of the Artin L function. At this point, we use some group theory to replace the Artin L-functions with Hecke L-functions. Indeed, let H be a subgroup of G and h an element of H. Let CH denote its conjugacy class in H and C its conjugacy class in G. Let δ : H −→ {0, 1} denote the characteristic function of CH . Now set φ = IndG H δ. By deﬁnition, we see that φ is supported only on the conjugacy class C and so φ = λδC . The value of λ is easily computed by Frobenius reciprocity: λ

|C| |CH | = (φ, 1G ) = (δ, 1H ) = . |G| |H|

Thus λ=

|CH | · |G| . |H| · |C|

From the inductive property of L-functions, it is not hard to see that π ˜ (x, φ) = π ˜ (x, δ). Now the right hand side is written as a sum involving the characters of H. In particular, if we are given C and we let H be the cyclic subgroup generated by gC then we are able to express π ˜ (x, δC ) in terms of Hecke L-functions. As we know the analytic properties of these L-functions, we are now able to follow rather closely the classical method as developed in [D] to prove Theorem 7.1. Though the above technique has the advantage of replacing the non-abelian L-functions with abelian ones, it does so at some cost. The estimates will now involve the ﬁeld constants (that is, degree, discriminant, etc.) of the ﬁxed ﬁeld M (say) of H. In general, as we do not have any information about M we are forced to majorize its ﬁeld constants by those of K and this magniﬁes the error terms signiﬁcantly. This problem could be avoided if we were able to deal directly with the Artin L-functions. This theme is developed in the next section. We conclude this section by stating some unconditional results developed in [LO] and in [LMO].

Chapter 2 Artin L-Functions

44

Theorem 7.2

If log x nL (log dL )2 , then

|πC (x) −

1 |C| |C| ˜ exp(−cn− 2 (log x) 12 )) Li(x)| ≤ Li(xβ ) + O(|C|x L |G| |G|

˜ is the number of conjugacy classes contained in C, β is the exceptional where |C| β zero of Proposition 6.1, and the term |C| |G| Li(x ) is to be suppressed if the exceptional zero β does not exist. Sometimes it is useful to have an inequality rather than an explicit error term. Such a bound is provided by the following result of Lagarias, Odlyzko and Montgomery [LMO]. Theorem 7.3

We have πC (x)

|C| Li(x) |G|

provided log x (log dL )(log log dL )(log log log e20 dL ). In applying these results, it is very useful to have some estimates for the discriminant of a ﬁeld. These upper bounds are consequences of an inequality due to Hensel, and are developed in [Se2]. Let DK/F denote the diﬀerent of K/F . It is an ideal of OK and its norm dK/F from K to F is the discriminant of the extension. Let ν be a place of F and w a place of K dividing it. Let pν denote the residue characteristic of ν. Hensel’s estimate states w(DK/F ) = ew/ν − 1 + sw/ν where 0 ≤ sw/ν ≤ w(ew/ν ). Here ew/ν is the ramiﬁcation index of pν in K. Using this, one can get an estimate for the norm of the relative discriminant. Let us set nK = [K : Q],

nF = [F : Q]

and n = [K : F ] = nK /nF . Let us also set P (K/F ) to be the set of rational primes p for which there is a prime p of F with p|p and p is ramiﬁed in K. Then, log NF/Q dK/F ≤ (nK − nF ) log p + nK (log n)|P (K/F )|. p∈P (K/F )

This bound does not assume that K/F is Galois. If we know in addition that K/F is Galois, the following slightly stronger estimate holds: log NF/Q dK/F ≤ (nK − nF ) log p + nK (log n). p∈P (K/F )

There is an analogue of this for Artin conductors also. This analogue is needed in the proofs of the results of the next section.

§7 Chebotarev density theorem

45

Proposition 7.4 Suppose that K/F is Galois with group G. Let χ denote an irreducible character of G and denote by fχ its Artin conductor. Then log NF/Q fχ ≤ 2χ(1)nF {

log p + log n}.

p∈P (K/F )

Proof. Firstly, we observe that for each i ≥ 0, 1 χ(a), |Gi |

dim V Gi =

a∈Gi

where Gi is as in Section 2. Thus, for each ﬁnite ν, |Gi | n(χ, ν) = |G0 | i

1 χ(1) − χ(a) . |Gi | a∈Gi

Denote by Oν (respectively Ow ) the ring of integers of Fν (resp. Kw ). Deﬁne a function iG on G by iG (g) = w(gx − x) = max{i : g ∈ Gi−1 } where Ow = Oν [x]. Rearranging gives n(χ, ν) =

χ(1) 1 (|Gi | − 1) − |G0 | i |G0 |

χ(a)iG (a).

1=a∈G0

Applying this formula for χ the trivial character, and the character of the regular representation of G0 , we ﬁnd that

iG (a) =

(|Gi | − 1) = w(DK/F ). i

1=a∈G0

Hence, n(χ, ν) =

1 |G0 |

iG (a)(χ(1) − χ(a)) ≤

1=a∈G0

2χ(1)w(DK/F ) . ew/ν

Now using the above stated estimate for w(DK/F ) we deduce that log Nfχ ≤ 2χ(1)

1 ew/ν

(ew/ν − 1 + sw/ν )fν log pν

Chapter 2 Artin L-Functions

46

and this is ≤ 2χ(1)

fν

eν 1− ew

log pν +

eν fν w(ew/ν ) log pν ew

where eν (resp. ew ) denotes absolute ramiﬁcation index at ν (resp. w) and we have used ew/ν = ew /eν . Also, as w(ew/ν ) = ew νp (ew/ν ) and as K/F is Galois, ew/ν divides n. Thus ⎧ ⎫ ⎨ ⎬ log Nfχ ≤ 2χ(1)nF log p + log n . ⎩ ⎭ p∈P (K/F )

This completes the proof. We remark that there is no analogue of Hensel’s estimate in the function ﬁeld case. This is one source of diﬃculty in extending to this case the eﬀective versions of the Chebotarev density theorem discussed in this and the next section. The reader is referred to [MS] and the references therein for the function ﬁeld analogues.

§8 Consequences of Artin’s conjecture These estimates can be signiﬁcantly improved if we know Artin’s conjecture on the holomorphy of L-series. The improvement is in the dependence of the error term on C. The results of this section are from the paper [MMS]. We shall only discuss the conditional result Proposition 7.1. Let χ be a character of G and denote by π(x, χ) the function π(x, χ) = χ(σν ). Nν≤x

Let δ(χ) denote the multiplicity of the trivial character in χ. As before (see §2) χ(1) Aχ = dK NF/Q (fχ ) and Λ(s, χ) = Λ(s, χ, F ) = As/2 χ γ(s, χ, F )L(s, χ) Proposition 8.1 Suppose that the Artin L-series L(s, χ) is analytic for all s =1 and is nonzero for Re(s) = 12 , 0 < Re(s) < 1. Then 1

π(x, χ) = δ(χ) Li(x) + O(x 2 ((log Aχ ) + χ(1)nF log x)) + O(χ(1)nF log M (K/F )) where 1/nF

M (K/F ) = ndF

p∈P (K/F )

p.

§8 Consequences of Artin’s conjecture

47

Proof. The argument proceeds along standard lines and so we just sketch it here. Artin proved the functional equation Λ(s, χ) = W (χ)Λ(1 − s, χ) ¯ where W (χ) is a complex number of absolute value 1 and χ ¯ is the complex conjugate of χ. We know that (s(s − 1))δ(χ) Λ(s, χ) is entire and we have the Hadamard factorization s Λ(s, χ) = ea(χ)+b(χ)s (1 − )es/ρ (s(s − 1))−δ(χ) ρ where a(χ), b(χ) ∈ C and the product runs over all zeroes ρ of Λ(s, χ) (ncecessarily 0 ≤ Re(ρ) ≤ 1.) From the equality Λ(s, χ) = Λ(¯ s, χ) ¯ we deduce the relation

Λ Λ (s, χ) = (¯ s, χ). ¯ Λ Λ Moreover, the functional equation implies the relation Λ Λ (s, χ) = − (1 − s, χ). ¯ Λ Λ From these two relations, we deduce that Re

Λ 1 ( , χ) = 0. Λ 2

Also, if ρ is a zero of Λ(s, χ) then so is 1 − ρ¯. Hence, Re

1 ( − ρ)−1 = 0 2

as is seen by grouping together the terms corresponding to ρ and 1 − ρ¯ in the absolutely convergent sum. Logarithmically diﬀerentiating the product formula at s = 12 and taking real parts, we deduce that Re(b(χ) + Hence, Re

1 ρ

) = 0.

Λ 1 1 1 (s, χ) = Re( ) − δ(χ) Re( + ). Λ s − ρ s s − 1 ρ

Chapter 2 Artin L-Functions

48

Let N (t, χ) denote the number of zeros ρ = β + iγ of L(s, χ) with 0 < β < 1 and |γ − t| ≤ 1. Evaluating the above formula at s = 2 + it and observing that Re(

1 2−β )= 2 + it − ρ (2 − β)2 + (t − γ)2

is non-negative for all ρ and is atleast 1/5 if |t − γ| ≤ 1 we deduce that N (t, χ) Re

Λ (2 + it, χ). Λ

Since the Dirichlet series for L(s, χ) converges at 2 + it, the right hand side is easily estimated, the essential contribution coming from log Aχ and the number of Γ factors. We get N (t, χ) log Aχ + χ(1)nF log(|t| + 5). By developing an explicit formula as in [LO] or [Mu2] we ﬁnd that

χ(σν ) log Nν = δ(χ)x−

xρ + O(χ(1)nF log M (K/F )) ρ

|γ|<x

Nν≤x

1

+ O(x 2 (log x)(log Aχ + χ(1)nF log x)), where the prime on the sum indicates that we only include places ν that are unramiﬁed in K. The sum over zeros can be estimated by observing that 1 N (j, χ) ρ j j<x

|γ|<x

and using the above estimate for N (t, χ). The estimate for π(x, χ) can be deduced by partial summation. Proposition 8.2 Suppose that all Artin L-series of the extension K/F are analytic at s = 1 and that GRH holds. Then 2 1 |C| πC (x) − Li x xn2F (log M (K/F )x)2 . |C| |G| C

Proof. We ﬁrst observe that 2 1 |C| |C| 1 π(x, 1G ) − Li x = (π(x, 1G ) − Li x)2 . |C| |G| |G| |G| C

§8 Consequences of Artin’s conjecture

49

Expressing π(x, 1G ) in terms of characters, we see that this is ⎞ ⎛ 1 ⎝ ≤ |π(x, χ)|2 + (π(x, 1G ) − Li x)2 ⎠ |G| χ=1

where the sum is over the non-trivial irreducible characters of G. By Propositions 7.4 and 8.1, 1 π(x, χ) − δ(χ) Li x χ(1)nF x 2 (log(M (K/F )x). The result follows on noting that

χ(1)2 = |G|.

χ

Proposition 8.3 Let D be a union of conjugacy classes. Under the same hypotheses as in Proposition 8.2, we have πD (x) =

1 1 |D| Li x + O(|D| 2 x 2 nF log M (K/F )x). |G|

Proof. We have πD (x) −

|D| |C| Li x = πC (x) − Li x |G| |G| C

where the sum is taken over all conjugacy classes C contained in D. Now applying the Cauchy-Schwarz inequality we deduce that 2 12 1 1 |C| |C| πC (x) − πC (x) − Li x ( |C|) 2 Li x . |G| |C| |G| C

C

C

The result now follows from Proposition 8.2. Remark. Using Hensel’s estimate for the discriminant, it is possible to write the error term in Theorem 7.1 as 1

O(|C|x 2 nF log M (K/F )x). 1

Thus Artin’s conjecture allows us to replace |C| with |C| 2 . In some cases, we can also improve Theorem 7.1 even without assuming Artin’s conjecture. We give two such results below.

Chapter 2 Artin L-Functions

50

Proposition 8.4 Let D be a union of conjugacy classes in G and let H be a subgroup of G satisfying (i) Artin’s conjecture is true for the irreducible characters of H (ii) H meets every class in D. Suppose the GRH holds. Then ⎛ ⎛ ⎞ ⎞ 12 2 |D| |C| ⎠ ⎜ 1 ⎟ πD (x) = Li x + O ⎝x 2 ⎝ nF log M x⎠ |G| |CH | C⊆D

where M = M (K/F ) and CH = CH (γ) for some γ ∈ H ∩ C. Proof. Firstly, we have the relation πD (x) = π ˜D (x) + O(

1 1 log dK + nF x 2 ). |G|

Using the estimate from Hensel’s bound, we have 1 log dK nF log M x. |G| Also, π ˜D (x) =

π ˜C (x) =

C⊆D

|C| |H| · π ˜C (x). |G| |CH | H

(8.1)

C⊆D

Now, |C| |H| · (˜ πCH (x) − πCH (x)) |G| |CH | C⊆D ⎛ |H| |C| ⎜ ≤ δC (σm ) + ⎝ |G| |CH | Nν m ≤x H ν C⊆D

m≥2

⎞

⎟ δCH (σν )⎠

Nν≤x ν ramified in L/K

1 |C| |G| |H| 2 1 2 ≤ maxC⊆D · nF x + log dK |G| |CH | |H| log 2 |H| 1 |C| ≤ max (nF x 2 + nF log M x) |CH | and this can be absorbed into the error term. Therefore, we can replace π ˜CH by πCH in the equation (8.1). Now, |C| |H| · πC (x) |G| |CH | H C⊆D ⎛ ⎞ |C| |D| |H| 1 |C | H πCH (x) − = Li x + O ⎝ · Li x⎠ . 1 1 |G| |G| |H| |CH | 2 |CH | 2 C⊆D

§8 Consequences of Artin’s conjecture

51

Now applying the Cauchy-Schwarz inequality and using Proposition 8.2, we ﬁnd that the error term above is ⎛ ⎞ 12 |H| ⎝ |C|2 ⎠ 1 |G| · x 2 nF log M (K/F )x |G| |CH | |H| C⊆D

where F is the ﬁxed ﬁeld of H. This proves the proposition since M (K/F ) M (K/F ). We state one immediate corollary of this result. Corollary 8.5

Under the same hypotheses as above, 1 1 1 |D| |G| 2 πD (x) = Li x + O |D| 2 x 2 nF log M x . |G| |H|

The corollary follows immediately on noting that |C| |G| ≤ . |CH | |H| We now present one further result in this direction. This estimate has the feature that in some cases, it gives a better result than what one deduces from Artin’s conjecture. Proposition 8.6 Suppose the GRH holds. Let D be a nonempty union of conjugacy classes in G and let H be a normal subgroup of G such that Artin’s conjecture is true for the irreducible characters of G/H and HD ⊆ D. Then 1 |D| |D| 2 12 πD (x) = Li x + O x nF log M x |G| |H| where M is as in the previous proposition. ¯ be the image of D in G/H. It is a union of conjugacy classes in G/H Proof. Let D and ¯ · |H| |D| ¯ 12 x 12 nF log M (F /F )x) πD¯ (x) = Li x + O(|D| |G| where F is the ﬁxed ﬁeld of H. As HD ⊆ D, ¯ · |H| = |D| |D| and πD (x) = πD¯ (x) + O((log dK )/|G|). Also, M (F /F ) M (K/F ). The result follows.

52

Chapter 2 Artin L-Functions

Finally, we can ask what the true order of the error term in the Chebotarev theorem should be. Let α(G) denote the number of conjugacy classes of G. Question. Is it true that for any conjugacy set D ⊆ G, |D| πD (x) = Li x + O |G|

|D| α(G)

12

1 2

x nF log M x ?

This would be implied by the Proposition 8.2 if all the terms are of the same order. In the case F = Q and K/F is Abelian, our question is a well-known conjecture of Montgomery.

§9 The least prime in a conjugacy class Let L/K be a ﬁnite non-trivial Galois extension of number ﬁelds with group G. Our main result is an estimate, assuming the Riemann Hypothesis for Dedekind zeta functions (GRH), for the least norm of a prime ideal of K which is unramiﬁed in L and which does not split completely. The results of this section are from [Mu3]. If C is any subset of G stable under conjugation, Lagarias and Odlyzko [LO, pp. 461–462] showed, assuming (GRH) that there is a prime ideal p with NK/Q p (log |dL |)2

(9.1)

for which the Frobenius conjugacy class σp of p lies in C. Here, dL (resp. dK ) denotes the (absolute) discriminant of L (resp. K). In this estimate, an important tool was the eﬀective version of the Chebotarev density theorem proved in [LO]. By the results of the previous section, it follows that the assumption of Artin’s conjecture (AC) on the holomorphy of Artin L- series allows one to prove a sharper version of this theorem. In particular, the assumption of AC implies that the estimate (9.1) may be improved to NK/Q p

(log |dL |)2 (log |G|)2 . |C|

(9.2)

In fact, the term (log |G|)2 may also be removed by using a more detailed argument. The purpose of this section is to show, assuming the GRH, that in the special case C = G − {1}, there is a prime ideal p of K of degree 1 which is unramiﬁed in L which does not split completely and which satisﬁes NK/Q p

log |dL | |G| − 1

2

2

nK log |dL | nL

.

(9.3)

where nK = [K : Q] and nL = [L : Q]. Thus, the estimate (9.3) shows that for the special set C = G − {1}, one can do substantially better than (9.1).

§9 The least prime in a conjugacy class

53

Next, we shall show that for certain subgroups H of G the bound (9.3) may be extended for the least norm of a prime ideal p for which σp does not intersect H. The precise statement is given in Theorem 9.3. We apply these last results to the group of points on an elliptic curve over a ﬁnite ﬁeld. Let E be an elliptic curve without complex multiplication and deﬁned over Q. Denote by N the conductor of E. Let us set T = lcmE |E (Q)tors | where the lcm ranges over elliptic curves E which are deﬁned over Q and are Q-isogenous to E. In [K, Th. 2] Katz proved that gcd |E(Fp )| = T where the gcd is taken over primes p of good reduction. It is well known and easily proved that both sides are divisible by the same primes. Using our results, we can make this eﬀective in the following sense. Let l ≥ 5 be a prime and assume the GRH. If l does not divide T then we show (Theorem 9.4) that there is a prime p so that p ( log N )2 and E(Fp ) does not have a point of order . We begin by proving the estimate (9.3). We recall that L/K is a non-trivial Galois extension. Theorem 9.1 Assume the GRH. Then, there exists a prime ideal p of K (i) p is of degree 1 over Q and unramiﬁed in L (ii) p does not split completely in L and nK NK/Q p ( log |dL |)2 nL where nK = [K : Q] and nL = [L : Q]. Proof. We consider the kernel function of [LMO, §2], namely s−1 2 y − xs−1 k(s) = k(s; x, y) = . s−1 For y > x > 1 and u > 0, it has the property that the inverse Mellin transform 1 ˆ k(u) = k(s)u−s ds 2πi (2) is given by the formulae ˆ x, y) = k(u;

⎧ 0 ⎪ ⎨1

u log 1 ⎪ ⎩ u log 0

y2 u u x2

if if if if

u > y2 xy < u < y 2 x2 < u < xy u < x2 .

Chapter 2 Artin L-Functions

54

Now consider the integral JK

1 = 2πi

ζK − (s) k(s; x, y)ds. ζK (2)

On the one hand, it is equal to (log y/x)2 −

k(ρ; x, y)

ρ

where ρ runs over all zeroes of ζK (s). Write ρ = β + iγ. If NK (r; s0 ) denotes the number of zeroes ρ of ζK (s) with |ρ − s0 | ≤ r then ([LMO, Lemma 2.2]) NK (r; s0 ) 1 + r(log |dK | + nK log(|s0 | + 2)). Since |k(ρ; x, y)| ≤ it follows that

k(ρ; x, y) x−2δ

x−2(1−β) |ρ − 1|2

∞

δ

β≤1−δ −2δ

x

1 dNK (r; 1) r2

(δ −2 + δ −1 log |dK |).

As we are assuming the GRH, we may take δ =

1 2

and we see that

y JK = (log )2 + O(x−1 log |dK |). x On the other hand, the integral is equal to the sum ˆ Λ((N p)n )k((N p)n ; x, y). p,m

The contribution to this sum of ideals pn for which N pn is not a rational prime is

nK (log y)(log y/x) x log x

as in [LMO, (2.6)]. Moreover, the contribution of primes p which ramify in L is y (log N p)x−2 log x p|dL/K

as in [LMO, (2.27)]. (Recall that dL/K is the norm to Q of the discriminant of the extension L/K.) Since [Se2, p. 129] p|dL/K

log N p ≤

2 log |dL |, n

§9 The least prime in a conjugacy class

55

the contribution of primes p which ramify in L is Let us set J˜K =

1 y (log |dL |)x−2 log . n x

∗

ˆ p; x, y) (log N p)k(N

where the sum ranges over primes p of K of degree 1 which are unramiﬁed in L. Then the above estimates imply that ' y y 1 1 y ( J˜K = (log )2 +O x−1 log |dK |+nK (log y)(log ) +( log |dL |)x−2 (log ) . x x x log x n x On the other hand, by an argument similar to that given above, y JL = (log )2 + O(x−1 log |dL |). x Now if we suppose that every prime ideal p of K with N p ≤ y 2 either ramiﬁes or splits completely in L, then JL ≥ nJ˜K . Putting this together with the above estimates, and choosing x=(

α log |dL |) n

and y = bx for some b > 1 and α > (log b)2 we deduce the inequality (n − 1)(log b)2

n nnL (log bx)(log b) n2 (log b) + + 2 n. α α(log |dL |)(log x) α (log |dL |)

For a suﬃciently large value of b, we get a contradiction. This completes the proof. Remarks. 1. This method can also be used to produce an unconditional bound. In terms of its dependence on L the main term is |dL |1/2(n−1) . 2. Note that we used the normality of the extension L/K in asserting that a prime of K which splits completely in L has [L : K] prime divisors in L. 3. We note an interesting consequence of the above. Assume the GRH. Suppose the class number h of K is larger than 1. There exists a non-principal prime ideal p of K of degree 1 over Q with NK/Q p (log |dK |)2 . Indeed, choose for L the Hilbert class ﬁeld of K, and use the fact that dL = dhK .

Chapter 2 Artin L-Functions

56

We describe two variants of Theorem 9.1. (A) Consider the following diagram of ﬁelds. F L1 L Lr 2 ... K M Q Theorem 9.2 Assume the GRH. Let L1 , . . . , Lr be distinct non-trivial Galois extensions of K. Let F be an extension of K containing all the Li and M a subﬁeld of K so that F/M is Galois. Set m = min[Li : K] f = [F : K] and assume that r < m. Then, there exists a prime ideal p of K satisfying (i ) p is of degree 1 over Q and NK/M p does not ramify in F (ii ) p does not split completely in any of the Li , 1 ≤ 1 ≤ r with NK/Q p B 2 where

r B = max

i=1 (log |dLi |)

m−r

) ,

m log |dF | . f (m − r)

Proof. Let S denote the set of degree 1 prime ideals p of K with N p ≤ y 2 for which p ∩ OM does not ramify in F . Suppose that every element of S splits completely in some Li . Then, with notation as in the proof of Theorem 9.1, we have

JLi ≥ m

p∈S

ˆ p; x, y). (log N p)k(N

§9 The least prime in a conjugacy class

57

Using the estimate for JL and J˜K given in the proof of Theorem 9.1, we deduce that y 1 r(log )2 + O( log |dLi |) x x i y m mnK (log y)(log y/x) ≥ m(log )2 + O( log |dK |) + O( ) x x x log x + O(

m 1 (log |dF |) 2 log y/x). f x

Simplifying, and choosing x = αB and y = βx with some β > 1 and α > (log β)2 , we get the inequality (m − r)(log β)2 ≤ O((m − r)(log β)) which is a contradiction if β is suﬃciently large. (B) With L/K a normal extension and G = Gal(L/K) as before, we take a subgroup H of G. We want to ﬁnd a prime p of K so that σp is disjoint from H. Theorem 9.1 had to do with H = {1}. Theorem 9.3. Assume the GRH. Denote by N = NG (H) the normalizer of H in G and let R be the ﬁxed ﬁeld of N . Let H1 , . . . , Hr be a set of normal subgroups of N and L1 , . . . , Lr their respective ﬁxed ﬁelds. Suppose that (1) for each g ∈ G, gHg −1 ∩ N is contained in some Hi (1 ≤ i ≤ r). (2) if m = min[Li : R] then r < m. Then, there exists a prime ideal p of K with 2 NK/Q p BH

and satisfying (a) p is of degree one and does not ramify in L (b) σp is disjoint from H. Here, ) 1 1 m BH = max log |dL |, (log |dL |) . m−r |Hi | |N |(m − r) Proof. Each Li is a Galois extension of R and L is a Galois extension of R containing all the Li . By Theorem 9.2, we can ﬁnd a prime ideal P of R of degree one (over Q) so that p = NR/K P does not ramify in L, P does not split completely in any of the Li and NR/Q P B 2

where B = max

* ) r 1 m log |dLi |, (log |dL |) . m − r i=1 [L : R](m − r)

58

Chapter 2 Artin L-Functions

The splitting completely condition means that σP ∩ Hi = φ Now

1 ≤ i ≤ r.

σp = ∪τ τ σP τ −1

where the union is over a set of coset representatives {τ } for N in G. It follows that σp ∩ H = φ. Hence p satisﬁes (a) and (b). Now as |dLi | ≤ |dL |1/|Hi | , we deduce the stated bound. Remarks. 1. In the case r = 1, the assumptions (1) and (2) may be stated as (1 ) for any g ∈ G, gHg −1 ∩ N is nonempty ⇒ g ∈ N (2 ) H is a proper subgroup of N . 2. If we are only interested in ﬁnding a prime p so that σp = (p, L/K) is not contained in H, then we do not need to consider the conjugates of H at all. Rather, it suﬃces to take a degree one prime P of R such that p = P ∩ OK does not ramify in L and (P, L/R) is not contained in H. But as H is normal in N , this just means that P does not split completely in M = the ﬁxed ﬁeld of H. We can ﬁnd such a P with 2 log |dL | NR/Q P . |N | − |H| Corollary 9.4. Let the notation and hypotheses be as in Theorem 9.3. If C is a subset of G stable under conjugation and H intersects every conjugacy class in C nontrivially, then there is a prime p of K satisfying 2 NK/Q p BH

as well as (a) and (b ) σp is not contained in C. Let E be an elliptic curve deﬁned over Q and let N denote its conductor. For p N , we may consider the group |E(Fp )| of Fp -rational points on E. Its cardinality is given by |E(Fp )| = p + 1 − a(p) for some integer a(p). ¯ which are in the kernel of multi¯ The action of Gal(Q/Q) on points of E(Q) plication by gives a representation ¯ ρ : Gal(Q/Q) → GL2 (F ). It has the property that for p N , ρ (σp ) has trace a(p) and determinant p modulo . Recall that we have set T = lcmE |E (Q)tors | where the lcm ranges over elliptic curves E which are Q-isogenous to E.

§9 The least prime in a conjugacy class

59

Theorem 9.5 Suppose that E does not have complex multiplication and let ≥ 5 be a prime which does not divide T . Denote by N the conductor of E. Assume the GRH. Then, there is a prime p ( log N )2 such that E(Fp ) does not have a point of order . Proof. Let us denote by G the image of ρ . It is known that the ﬁxed ﬁeld of the kernel of ρ contains the ﬁeld of -th roots of unity. Let P G denote the image of G under the natural map GL2 (F ) → P GL2 (F ). It is well known (See [Se2, p. 197]) that one of the following holds: (i) P G contains P SL2 (F ) (ii) G is contained in a Borel subgroup of GL2 (F ) (iii) G is contained in a non-split Cartan subgroup of GL2 (F ) (iv) P G A4 , S4 or A5 (v) G is contained in the normalizer of a Cartan subgroup C but is not contained in C. We shall consider each in turn. (i): Consider the Borel subgroup (see [Se2, p. 197]) ∗ ∗ ⊆ G = GL2 (F ) B= 0 ∗ and the subgroups H=

1 ∗ 0 ∗

,

H =

∗ ∗ 0 1

of B. A simple calculation shows that NG (H) = NG (H ) = B. We also have that for any g ∈ G, gHg −1 ∩ B ⊆ H or H. We apply Theorem 9.3 to get a prime p which is unramiﬁed in L, the ﬁxed ﬁeld of the kernel of ρ and which has the property that σp ∩ H = φ and p x2 where

x=

1 2 (log |dL |), − 1 − 2 ( − 1)

)

−1 (log |d |) . L 4

60

Chapter 2 Artin L-Functions

Now by Hensel’s inequality, log |dL | nL log N 4 log N and so p ( log N )2 . Now consider D = {g ∈ G : trg = 1 + det g}. Clearly, every conjugacy class in D intersects H non-trivially. Hence σp is not contained in D, or in other words σp ∩ D = φ. Thus a(p) ≡ 1 + p(mod ) and this means that |E(Fp )| = p + 1 − a(p) ≡ 0(mod ). (ii): We may suppose (after a suitable choice of basis) that G ⊆ B (with B as above). We are again looking for a prime p such that σp ∩ H = φ where H =G∩

1 ∗ 0 ∗

.

If G = H, then it is clear that divides T and this is excluded by assumption. Thus, we may suppose that G = H. Since H is a normal subgroup of G, it follows from Theorem 9.1 that there exists a prime p with the desired property and p

'

(2 1 log dF [G : H]

where F is the ﬁxed ﬁeld of H. Since F is a Galois extension of Q ramiﬁed only at primes dividing N , we have p (log N )2 . (iii): This is impossible if > 2 since G contains the image of complex conjugation, a matrix with distinct F -rational eigenvalues (namely +1, −1), whereas the eigenvalues of every element of a nonsplit Cartan subgroup are either equal or lie in F2 \F .

Exercises

61

(iv): In this case |G| . By the result of Lagarias and Odlyzko , quoted as (9.1) 2 at the beginning of this section, there exists a prime p whose σp is 2 (say) and p < (|G| log N )2 ( log N )2 . Such a prime has a(p) ≡ 4 ≡ 1 + 4 ≡ 1 + p (mod ). (v): In this case, there is a quadratic character with the property that p N and (p) = −1 ⇒ a(p) ≡ 0(mod ). Let K be the quadratic extension of Q corresponding to . This ﬁeld has the property [Se2, p. 198] that it is unramiﬁed at and can only ramify at primes dividing N . Hence, we can ﬁnd a prime p such that p ≡ 1(mod ) and (p) = −1 with p (log |dK(ζ ) |)2 ( log N )2 where ζ is a primitive -th root of unity. For such a prime, a(p) ≡ 0 ≡ 2 ≡ 1 + p(mod ). This proves the theorem.

Exercises 1.

Let χ be an irreducible character of a ﬁnite group G. If χ is a linear combination with positive real coeﬃcients of monomial characters, then mχ is monomial for some integer m ≥ 1.

2.

Let A be a normal subgroup of the group G and let χ be an irreducible character of G. Then either the restriction of χ to A is isotypic (that is, a multiple of one character) or there is a subgroup H containing A and an irreducible character σ of H such that χ = IndG H σ. (See [Se1, Prop. 24]).

3.

A ﬁnite group G is called supersolvable if there is a sequence of subgroups {1} = G0 ⊆ G1 ⊆ · · · Gn = G with each Gi normal in G and with successive quotients Gi /Gi−1 cyclic. (a) Prove that a nonabelian supersolvable group has a normal abelian subgroup which is not contained in the center. (b) Use (a) and Exercise 2 to prove that an irreducible character of a supersolvable group is monomial (that is, the induction of a one-dimensional character of some subgroup). Exercises 4–7 are based on the paper [R] of Rhoades.

Chapter 2 Artin L-Functions

62

4.

Let F be a set of characters of the ﬁnite group G. We say that a class function θ is semi-orthogonal to F if (θ, φ) ≥ 0 for all φ ∈ F. (a) If F is the set of all characters of G, then a generalized character θ is semi-orthogonal to F if and only if θ is a character. (b) Let F˜ = { xφ φ : 0 < xφ ∈ R, φ ∈ F}. Then a class function θ is semi-orthogonal to F if and only if it is semi˜ orthogonal to F.

5.

If F = {IndG H ψ : H an Abelian subgroup of G} then (a) the generalized character θG is semi-orthogonal to F. (b) if a generalized character θ = mχ χ is semi-orthogonal to F then |mχ | ≤ |θ(1)|.

6.

Let F be a subset of Rk and deﬁne H(F ) = {x ∈ Rk : (f, x) ≥ 0 for all f ∈ F } and

C(F ) = { xi fi : 0 < xi ∈ R, fi ∈ F }

where ( , ) denotes the standard inner product. (a) If F is a subspace, then H(F ) is the subspace of Rk orthogonal to F and H(H(F )) = F and C(F ) ⊂ F . ∗ (b) [R, Lemma 1] If F does not contain the zero vector and all elements of F have non-negative coordinates, then H(H(F )) = C(F ). 7.

Let G be a ﬁnite group and F a subset of characters of G. Expressing the elements of F as a sum of irreducible characters of G, identify F as a subset of Rk for some k. Using Exercise 6(b), show that a generalized character ψ of G can be written as a positive rational linear combination of characters in F if and only if (ψ, θ) ≥ 0 for all θ semi-orthogonal to F. Deduce that for any irreducible character χ of G, regG ±χ can be written as a positive rational linear combination of monomial characters.

8.

Let L/K be a ﬁnite Galois extension with group G. Show that the Artin L-functions L(s, χ, K) (as χ ranges over the irreducible characters of G) are multiplicatively independent over Q. That is, if L(s, χ, K)cχ = 1 χ

for some rational numbers cχ then cχ = 0 for all χ.

References

9.

63

Let F/Q be a ﬁnite Galois extension with group G and let H, H be two subgroups. Denote by K and K the corresponding ﬁxed ﬁelds. (a) Show that ζK (s) = ζK (s) if and only if for every conjugacy class C of G, we have #(H ∩ C) = #(H ∩ C). (b) Let G = S6 (the symmetric group on 6 letters) and consider the subgroups H = {(1), (12)(34), (12)(56), (34)(56)} and H = {(1), (12)(34), (13)(24), (14)(23)}. Prove that the above condition is satisﬁed and deduce that the Dedekind zeta functions of the corresponding ﬁxed ﬁelds coincide. (This is due to Gassman, 1926.)

10. Let 1 < a ∈ Z be a squarefree integer and q a prime. Set K = Q(a1/q ) and prove directly that ζK (s)/ζ(s) is entire. 11. Let f (T ) ∈ Z[T ] be an irreducible polynomial of degree larger than 1. Show that the set {p : f (T ) ≡ 0(mod p) has a solution} has positive density. 12. Let E be a biquadratic extension of Q and let K1 , K2 , K3 be the three quadratic subﬁelds. Show that ζ(s)2 ζE (s) = ζK1 (s)ζK2 (s)ζK3 (s). Deduce a relation amongst the class numbers of the Ki .

References [CF] J. Cassels and A. Fr¨ ohlich, Algebraic Number Theory, Academic Press, 1967. [D] H. Davenport, Multiplicative Number Theory, Springer-Verlag, 1980. [Fo] R. Foote, Non-monomial characters and Artin’s conjecture, Trans. Amer. Math. Soc., 321 (1990), 261–272. [FM] R. Foote and V. Kumar Murty, Zeros and poles of Artin L-series, Math. Proc. Camb. Phil. Soc., 105 (1989), 5–11. [FW] R. Foote and D. Wales, Zeros of order 2 of Dedekind zeta functions and Artin’s conjecture, J. Algebra, 131 (1990), 226–257. [Fr] A. Fr¨ ohlich, Galois Module Structure of algebraic integers, Ergebnisse der Mathematik und ihrer Grenzgebiete, Springer-Verlag, 1983.

64

Chapter 2 Artin L-Functions

[K] N. Katz, Galois properties of torsion points of abelian varieties, Invent. Math., 62 (1981), 481–502. [La] S. Lang, Algebraic Number Theory, Springer-Verlag, 1986. [LO] J. Lagarias and A. M. Odlyzko, Eﬀective versions of the Chebotarev Density Theorem, Algebraic Number Fields, ed. A. Fr¨ ohlich, 409–464, Academic Press, New York, 1977. [LMO] J. Lagarias, H. Montgomery and A. M. Odlyzko, A bound for the least prime ideal in the Chebotarev Density Theorem, Invent. Math., 54 (1979), 271–296. [Mu1] V. Kumar Murty, Holomorphy of Artin L-functions, in: Proc. Ramanujan Centennial Conference, pp. 55–66, Ramanujan Mathematical Society, Chidambaram, 1987. [Mu2] V. Kumar Murty, Explicit formulae and the Lang-Trotter conjecture, Rocky Mountain J. Math., 15 (1985), 535–551. [Mu3] V. Kumar Murty, The least prime which does not split completely, Forum Math., 6 (1994), 555–565. [MM] M. Ram Murty and V. Kumar Murty, Base change and the Birch and Swinnerton-Dyer conjecture, in: A tribute to Emil Grosswald: number theory and analysis, Contemp. Math., 143 (1993), 481–494. [MMS] M. Ram Murty, V. Kumar Murty and N. Saradha, Modular forms and the Chebotarev Density Theorem, Amer. J. Math., 110 (1988), 253–281. [MS] V. Kumar Murty and J. Scherk, Eﬀective versions of the Chebotarev Density Theorem in the function ﬁeld case, C.R. Acad. Sci. Paris, 319 (1994), 523– 528. [R] S. Rhoades, A generalization of the Aramata-Brauer theorem, Proc. Amer. Math. Soc., 119 (1993), 357–364. [Se1] J.-P. Serre, Linear representations of ﬁnite groups, Springer-Verlag, New York, 1977. [Se2] J.-P. Serre, Quelques applications du Th´eor`eme de Densit´e de Chebotarev, Publ. Math. IHES, 54 (1981), 123–201. [St] H. M. Stark, Some eﬀective cases of the Brauer-Siegel theorem, Invent. Math., 23 (1974), 135–152. [Uc] K. Uchida, On Artin L-functions, Tohoku Math. J., 27 (1975), 75–81. [vdW] R. W. van der Waall, On a conjecture of Dedekind on zeta functions, Indag. Math., 37 (1975), 83–86.

Chapter 3 Equidistribution and L-Functions

§1 Compact groups and Haar measures Let X be a compact topological space and C(X) the Banach space of continuous, complex-valued functions on X, with the supremum norm: + , ||f || = sup |f (x)| x ∈ X . Let x1 , x2 , x3 , . . . be a sequence of points of X. Let μ be a Radon measure on X (that is, a continuous linear form on C(X)). The sequence x1 , x2 , x3 , . . . is said to be μ-equidistributed if 1 f (xi ). n→ ∞ n i=1 n

μ(f ) = lim

We will now follow [Se] in our treatement. Lemma 1.1 Let φα be a family of continuous functions on X with the property that their linear combinations are dense in C(X). Suppose that, for each α, the sequence μn (φα ), 1 ≤ n < ∞ where 1 φα (xi ), n i=1 n

μn (φα ) =

has a limit. Then the sequence x1 , x2 , x3 , . . . is μ-equidistributed for some unique measure μ satisfying for all α : μ(φα ) = lim μn (φα ) n→ ∞

Proof. If f ∈ C(X), a familiar argument using equicontinuity shows that the sequence μ1 (f ), μ2 (f ), . . . has a limit μ(f ), which is continuous and linear in f . This proves the lemma.

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_4, © Springer Basel AG 1997

65

Chapter 3 Equidistribution and L-Functions

66

Lemma 1.2 Suppose that x1 , x2 , x3 , . . . is μ-equidistributed. Let U be a subset of X whose boundary has μ-measure zero and for all n, let nU be the number of m ≤ n such that xm ∈ U . Then lim

n→ ∞

nU = μ(U ). n

Proof. We normalize our measure so that μ(X) = 1. Let U 0 be the interior of U . We have μ(U 0 ) = μ(U ). Let > 0. By the deﬁnition of μ(U 0 ), there is a continuous function φ ∈ C(X), 0 ≤ φ ≤ 1, with φ = 0 on X − U 0 and μ(φ) ≥ μ(U ) − . Since μn (φ) ≤ nU /n we have lim inf nU /n ≥ lim μn (φ) = μ(φ) ≥ μ(U ) − ,

n→ ∞

n→ ∞

from which we obtain lim inf nU /n ≥ μ(U ). The same argument applied to X − U shows that lim inf (n − nU )/n ≥ μ(X − U ). n→ ∞

Hence, lim sup nU /n ≤ μ(U ) ≤ lim inf nU /n.

n→ ∞

n→ ∞

which implies the lemma. Example. If X = [0, 1] and μ is the usual Lebesgue measure, a sequence {xn }∞ n=1 of points of X is μ-equidistributed if and only if for each interval [a, b] of length d in [0, 1] the number of m ≤ n such that xm ∈ [a, b] is equal to dn + o(n) as n → ∞.

§2 Weyl’s criterion for equidistribution If G is a compact group, let X denote the space of conjugacy classes of G (that is, the quotient space of G by the equivalence relation induced by inner automorphisms of G). Let μ be a measure on G; its image under the quotient map G → X is a measure on X which we also denote by μ. Proposition 2.1 Let G be a compact group, X its space of conjugacy classes. Let μ be a measure on G. The sequence {xn }∞ n=1 is μ-equidistributed if and only if for any irreducible character χ of G, we have, 1 χ(xi ) = μ(χ). n→ ∞ n i=1 n

lim

Proof. The map C(X) → C(G) is an isomorphism of C(X) onto the space of class functions on G; by the Peter-Weyl Theorem, the irreducible characters χ of G generate a dense subspace of C(X). Hence the proposition follows from Lemma 1.1.

§3 L-functions on G

67

Corollary 2.2 Let μ be the Haar measure of G with μ(G) = 1. Then the sequence ∞ {xn }n=1 of elements of X is μ-equidistributed if and only if for any irreducible character χ of G, χ = 1, we have, 1 lim χ(xi ) = 0. n→ ∞ n i=1 n

Proof. This follows from Proposition 2.1 and (a special case of) the orthogonality relations: μ(χ) = 0 if χ is irreducible = 1 and μ(1) = 1. Corollary 2.3 (H. Weyl) Let G = R/Z, and let μ be the normalized Haar measure on G. Then {xn }∞ = 0 we n=1 is μ-equidistributed if and only if for any integer m have e2πimxn = ◦(N ) n≤N

as N → ∞. Proof. It suﬃces to remark that the irreducible characters of R/Z are the maps x → e2πimx , (m ∈ Z). In [BTD], the image of the Haar measure of SU (2, C) in the space of conjugacy classes is calculated. Each conjugacy class has a representative of the form

which has measure

eiθ 0

0

e−iθ

,0 ≤ θ ≤ π

2 2 π sin θdθ.

§3 L-functions on G Let G be a compact group, X its space of conjugacy classes as above. Suppose that for each prime p, we associate a conjugacy class Xp in X. As p varies, how are the Xp distributed (say, with respect to the normalised Haar measure on X)? To answer this, we deﬁne the L-function associated to each irreducible complex linear representation ρ of G in the following way: Let ρ : G −→ GLn (C) and deﬁne L(s, ρ) =

p

det(1 − ρ(Xp )p−s )−1 .

68

Chapter 3 Equidistribution and L-Functions

Theorem 3.1 Suppose that for each irreducible representation ρ = 1, of G, the L-function L(s, ρ) extends to an analytic function for Re(s) ≥ 1, and does not vanish there. Then the Xp ’s are uniformly distributed with respect to the image of the Haar measure in the space of the conjugacy classes of G. Proof. By the Wiener-Ikehara Tauberian Theorem 1.1 in Chapter 1, together with corollary 2.2 above, the result follows immediately. Example. Let ζn be a primitive n-th root of unity and set K = Q(ζn ), be the usual n-th cyclotomic ﬁeld. Then Gal(K/Q) (Z/nZ)∗ . Moreover, for each prime p n, σp is deﬁned and σp (ζn ) = ζnp . Thus σp depends only on the arithmetical progression to which p belongs mod n. Hence, if π(x; n, a) is the number of primes p ≤ x, p ≡ a (mod n) then π(x; n, a) ∼

1 i x φ(n)

as x → ∞. The L-functions attached to G in this example are the classical Dirichlet L-functions. We therefore obtain the prime number theorem for arithmetic progressions if for t ∈ R, L(1 + it, χ) = 0. Hence, the prime number theorem for arithmetic progressions, and more generally, the Chebotarev density theorem, ﬁt into this general formalism. In [Se2], Serre formulates a ‘motivic’ generalization of the Chebotarev density theorem.

§4 Deligne’s Prime Number Theorem Let G be a compact group. An irreducible character χ of G will be called quadratic if its degree is 1 and its image consists of ±1. Theorem 4.1 Let G be a compact group. Assume that for every non-trivial, irreducible representation ρ of G, L(s, ρ) is holomorphic at s = 1. If χ is quadratic we suppose that L(s, χ) is holomorphic on [1/2, 1]. Then L(1, ρ) = 0 for all irreducible ρ = 1.

§4 Deligne’s Prime Number Theorem

69

Remark. We want to point out that the second condition above is essential. For instance, consider the group G = ±1 of order 2. For each prime p, we will deﬁne Xp = −1. G has only two irreducible characters, the trivial one which gives the Riemann zeta function and the non-trivial one which gives p

1 − p−s 1 ζ(2s) = = . −s −2s 1+p 1 − p ζ(s) p

This function is holomorphic at s = 1 but not at s = 1/2. However, it does vanish at s = 1. Before we prove the theorem, we establish the following which is a variation of the theme of Hadamard and de la Vall´ee Poussin. Lemma 4.2 Let χ be an irreducible character of G. Then there exists a function f on G of the form f= cψ ψ ψ

which is a sum over a ﬁnite set of irreducible characters ψ where cψ ∈ Z. Moreover, Re(f ) ≥ 0, f = 0 and c1 ≤ cχ . If χ = 1 is not quadratic, then c1 < cχ . Proof. We consider three cases. If χ = 1, then let f = 1. If χ is quadratic then let f = 1 + χ. If χ is a character of degree 1, which is not quadratic, then let f = 3 + 4χ + χ2 and we see that Re(f ) ≥ 0 by the inequality of Hadamard and de la Vall´ee Poussin. Now let χ be a character of degree d > 1. If G is ﬁnite, we could take f= ψ(1)ψ ψ

which is the character of the regular representation. We ﬁnd that f (g) =

|G| if g = 1 0 otherwise.

In the case of an arbitrary compact group, we will try and approximate this construction. Given any > 0, there exists a delta like function F on G such that F is continuous, real-valued, non-negative, invariant under conjugation, and F (x) = F (x−1 ) satisfying F (x)dx = 1, G

and

F (x)χ(x)dx > d − G

70

Chapter 3 Equidistribution and L-Functions

for > 0. Note that the latter quantity is a real number since it is invariant under complex conjugation. Now choose a ﬁnite sum that approximates F :

cψ ψ.

ψ

We may assume without loss that cψ are rational numbers. Thus we have a function F =

cψ ψ

ψ

which is a ﬁnite sum, with cψ rational numbers, real valued and non-negative, c1 = 1 and cχ > d − . Now take f = F ∗ F to get non-negative coeﬃcients so that

F (xy −1 )F (y)dy.

f (x) = G

Moreover, if ψ is irreducible, the orthogonality relations yield ψ∗ψ = so that f=

1 ψ ψ(1)

c2ψ ψ. ψ(1)

The coeﬃcients are still rational numbers. The coeﬃcient of the trivial character is 1 and of χ is > (d − )2 /d. Clearing denominators gives us the desired function. Proof of Theorem 4.1 Assume that ρ is not one or quadratic. Choose f = ψ cψ ψ as in the lemma with c1 < cχ . Assume L(1, χ) = 0. Then, L(s, f ) =

L(s, ψ)cψ

ψ

= L(s, 1)c1 L(s, χ)cχ

L(s, ψ)cψ ,

ψ=1,χ

so that L(s, f ) has a zero at s = 1. Therefore, for some positive integer m, −L −m (s, f ) = + O(1) L s−1 as s→1+ . Hence, lim − Re(

s→1+

L (s, f )) < 0 . L

Exercises

71

On the other hand, L(s, f ) has non-negative coeﬃcients and therefore, − Re(

L (s, f )) ≥ 0 L

which implies lim − Re(

s→1+

L (s, f ) ≥ 0 L

which is a contradiction. This completes the proof in this case. If χ is quadratic, consider f (s) = L(s, χ)L(s/2, 1)L(s/2, χ) and suppose that L(1, χ) = 0. By hypothesis, f (s) is holomorphic for s in [1, 2] with a zero at s = 1. It is easily veriﬁed by looking at the Euler factors that the coeﬃcients of −f /f are non-negative. Call β the zero of f on [1, 2] closest to 2. Such a zero exists since f (1) = 0. Therefore, for some positive integer m, −

f m (s) = − + O(1) f s−β

as s→β. But Landau’s lemma (Exercise 5 in Chapter 1)implies −

f →∞ f

as s→β. This is a contradiction. Therefore L(1, χ) = 0 and this completes the proof.

Exercises 1.

Prove that the sequence {log p} as p varies over the prime numbers is not uniformly distributed mod 1.

2.

(Erd¨ os-Tur´ an inequality): Let {xj }nj=1 be a ﬁnite sequence of real numbers. For 0 ≤ α < β ≤ 1. Let A([α, β] : n) be the counting function #{j ≤ n : xj ∈ [α, β](mod 1)}. Deﬁne the discrepancy Dn (x1 , . . . , xn ) =

A([α, β] : n) − (β − α). n 0≤α<β≤1 sup

Then, for any positive integer M , Dn ≤

M n 6 4 1 1 1 + − e2πihxj . M +1 π h M + 1 n j=1 h=1

Chapter 3 Equidistribution and L-Functions

72

Recently, Montgomery [Mo] obtained the following improvement: 1 Dn ≤ +2 M +1 M

h=1

n 1 1 1 + min(β − α, ) e2πihxj . M +1 πh n j=1

*3. Let Fp denote the ﬁnite ﬁeld of p elements. Let ψ : Fp →C∗ be a ﬁxed nontrivial additive character. For each character χ of the multiplicative group F∗p , we deﬁne a Gauss sum G(χ, ψ) = χ(x)ψ(x). x∈F∗ p

If χ = 1, we know that |G(χ, ψ)| = p1/2 so we can write G(χ, ψ) = p1/2 e2πiθp (χ) . Show that the sequence obtained by listing θp (χ) as p ranges over all the prime numbers and for each ﬁxed p, χ ranges over all the non-trivial characters mod p, is uniformly distributed mod 1. 4.

Suppose that

1 χ(xi ). n→∞ n i=1 n

cχ = lim Deﬁne Φ(g) =

cχ χ(g),

χ

where the sum is over all irreducible characters arranged by increasing degrees. Then determine conditions when {xn }∞ n=1 is uniformly distributed with respect to the measure Φ(g)dμ where μ is the normalized Haar measure of G. 5.

In Theorem 4.1, let us suppose further that each L(s, ρ) is holomorphic on Re(s) = 1 with s = 1. Show that L(1 + it, ρ) = 0 for t ∈ R, t = 0. (Hint: for t ∈ R and non-zero, consider St = R/tZ and the group G × St . Deﬁne for each prime p the conjugacy class (Xp , log p) and now apply Theorem 4.1 to the L-functions associated to G × St .)

6.

For each prime p ≡ 1(mod 4), let χ mod p be a character of order 4 and deﬁne J(χ, χ) = χ(x)χ(1 − x). x mod p x =0,1

Show that if we write J(χ, χ) = a +bi with a, b ∈ Z, then a2 +b2 = p. Writing √ J(χ, χ) = peiθp show that the sequence of θp ’s is uniformly distributed mod 1.

References

7.

73

Show that the set of primes p which can be written as x2 + xy + 6y 2 √ with x, y ∈ Z has density 1/6. (Hint: consider Q( −23).)

*8. In the above question, deﬁne φp by √ y 23 tan φp = . 2x + y Show that the φp ’s are equidistributed mod π. 9.

2 2 Show √ that the density of primes represented by x −2y is 1/2. (Hint: consider Q( 2)).

*10. For each prime in question 9, show that the quantity √ Xp = log(|x| + |y| 2) √ is well deﬁned modulo log(3 √ + 2 2). Show that the sequence is uniformly distributed mod log(3 + 2 2).

References [BTD] T. Br¨ ocker, T. tom Dieck, Representations of compact Lie groups, Graduate texts in mathematics, 98, Springer-Verlag, 1985. [Mo] H. Montgomery, Ten lectures on the interface between analytic number theory and harmonic analysis, No. 84, CBMS, Amer. Math. Soc. 1994. [Se] J.-P. Serre, Abelian -adic representations and elliptic curves, Benjamin, New York, 1968; second edition, Addison-Wesley, Redwood City, 1989. [Se2] J.-P. Serre, Propri´et´es conjecturales des groupes de Galois motiviques et des repr´esentations -adiques in Proc. Symp. Pure Math., 55 (1994) p. 377–400.

Chapter 4 Modular Forms and Dirichlet Series

§1 SL2 (Z) and some of its subgroups It was Ramanujan who in a fundamental paper of 1916 introduced his τ -function as the Fourier coeﬃcient of a modular form and then attached a Dirichlet series to it. He established an analytic continuation of the series and a functional equation for it. He then made his famous conjectures about the multiplicativity of these coeﬃcients and their size. The multiplicativity conjecture would allow him to write his Dirichlet series as an Euler product thereby establishing an analogy with classical zeta and L-functions. Subsequently Mordell proved that τ (n) is a multiplicative function but it was left to Hecke to develop a more elaborate theory and establish the existence of an inﬁnite family of such examples. Ramanujan’s conjecture on estimating the size of τ (n) however deﬁed immediate attack. The fundamental method of Rankin and Selberg did allow one to get good estimates for them but they were not optimal. The ﬁnal resolution of Ramanujan’s conjecture came from algebraic geometry when it was shown to be a consequence of Deligne’s proof of the celebrated Weil conjectures. In this chapter, we will give a brief introduction to the fundamental concepts and study the oscillations of the Fourier coeﬃcients from the standpoint of the non-vanishing of various L-functions. We begin with some basic notions. If R is any commutative ring with identity, SL2 (R) denotes the group of matrices a b c d of determinant 1 and a, b, c, d ∈ R. We will consider the case R = Z and look at some subgroups of SL2 (Z). The principal congruence subgroup of level N is denoted Γ(N ) and consists of matrices in SL2 (Z) satisfying a b 1 0 ≡ (mod N ) c d 0 1

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_5, © Springer Basel AG 1997

75

76

Chapter 4 Modular Forms and Dirichlet Series

Since this is the kernel of the natural map (reduction mod N ): →

SL2 (Z)

SL2 (Z/N Z)

Γ(N ) is a normal subgroup of ﬁnite index in SL2 (Z). The Hecke subgroup of level N is denoted Γ0 (N ) and consists of matrices

such that N | c. Since

a b c d

∈

SL2 (Z)

Γ(N ) ⊆ Γ0 (N ) ⊆ SL2 (Z) ,

Γ0 (N ) has ﬁnite index in SL2 (Z). The group Γ1 (N ) consists of matrices γ ∈ SL2 (Z) satisfying γ≡

1 ∗ 0 1

(mod N )

Clearly Γ(N ) ⊆ Γ1 (N ) ⊆ Γ0 (N ) ⊆ SL2 (Z) A congruence subgroup of SL2 (Z), is by deﬁnition, a subgroup which contains Γ(N ) for some N . Γ0 (N ), Γ1 (N ) are examples of congruence subgroups. An element γ ∈ SL2 (Z) is called elliptic if |tr γ| < 2, parabolic if |tr γ| = 2 and hyperbolic if |tr γ| > 2.

§2 The upper half-plane Let h denote the upper half-plane: √ + , h = z = x + −1y : x ∈ R, y > 0 Let GL+ 2 (R) be the group of 2 × 2 matrices with real entries and positive determinant. Then GL+ 2 (R) acts on h as a group of holomorphic automorphisms: γ:z →

az + b , cz + d

γ=

a b c d

∈ GL+ 2 (R)

Let h∗ denote the union of h and the rational numbers Q together with a symbol ∞ (or more suggestively i∞). The action of SL2 (Z) on h can be extended to h∗ by deﬁning a a b ·∞= for c =0 c d c

§3 Modular forms and cusp forms

and

for rational numbers

r s

a b 0 d

77

· ∞ = ∞;

with (r, s) = 1, we deﬁne

a b c d

r ar + bs = s cr + ds

·

with the understanding that when cr + ds = 0, the right side of the above equation is the symbol ∞. The rational numbers together with ∞ are called cusps. If Γ is a discrete subgroup of SL2 (R), then the orbit space h∗ /Γ can be given the structure of a compact Riemann surface, and is denoted XΓ . We will be interested in the case Γ is a congruence subgroup of SL2 (Z). In that case, the algebraic curve corresponding to XΓ is called a modular curve. In case Γ = Γ(N ), Γ1 (N ) or Γ0 (N ), the corresponding modular curve is denoted X(N ), X1 (N ) and X0 (N ) respectively. For further details, the reader may consult [VKM].

§3 Modular forms and cusp forms Let f be a holomorphic function on h and k a positive integer. For γ=

a b c d

∈ GL+ 2 (R),

deﬁne (f |k γ)(z) = (detγ)

k/2

−k

(cz + d)

f

az + b cz + d

.

For ﬁxed k, the map γ : f → f |k γ deﬁnes an action of GL+ 2 (R) on the space of holomorphic functions on h. (Sometimes, we simply write f |γ for f |k γ.) Let Γ be a subgroup of ﬁnite index in SL2 (Z). Let f be a holomorphic function on h such that f |k γ = f for all γ ∈ Γ. Since Γ has ﬁnite index,

1 1 0 1

M

=

1 M 0 1

∈Γ

for some positive integer M . Hence f (z + M ) = f (z) for all z ∈ h. So, f has a “Fourier expansion at inﬁnity”: f (z) =

∞ n=−∞

n an qM ,

qM = e2πiz/M

78

Chapter 4 Modular Forms and Dirichlet Series

We say that f is holomorphic at inﬁnity if an = 0 for all n < 0. We say it vanishes at inﬁnity if an = 0 for all n ≤ 0. Let σ ∈ SL2 (Z). Then σ −1 Γσ also has ﬁnite index and (f |σ)|γ = f |σ for all γ ∈ σ−1 Γσ. So for any σ ∈ SL2 (Z), f |σ also has a Fourier expansion at inﬁnity. We say that f is holomorphic at the cusps if f |σ is holomorphic at inﬁnity for all σ ∈ SL2 (Z). We say that f vanishes at the cusps if f |σ vanishes at inﬁnity for all σ ∈ SL2 (Z). Let N be an integer ≥ 1 and a Dirichlet character mod N . A modular form on Γ0 (N ) of type (k, ) is a holomorphic function f on h such that a b a b (i) f| = (d)f for all ∈ Γ0 (N ) c d c d and (ii) f is holomorphic at the cusps. (Note that (i) implies f |γ = f for all γ ∈ Γ1 (N ) and so (ii) is meaningful.) Also the Fourier expansion of such a form is : f (z) =

∞

an qn ,

q = e2πiz

n=0

The integer k is called the weight of f . Such a modular form is called a cusp form if it vanishes at the cusps. The modular forms on Γ0 (N ) of type (k, ) form a complex vector space Mk (Γ0 (N ), ) and this has a subspace Sk (Γ0 (N ), ) consisting of cusp forms. The subspace has a canonical complement: Mk (Γ0 (N ), ) = Ek (Γ0 (N ), )

-

Sk (Γ0 (N ), )

and the space Ek is called the space spanned by Eisenstein series. These spaces are ﬁnite dimensional. Moreover, one can deﬁne an inner product (Petersson) on Sk (Γ0 (N ), ) by dxdy f, g = f (z)g(z)y k 2 y h/Γ0 (N) Examples 1. Let k ≥ 4 be even. Then Gk (z) =

m,n∈Z (m,n) =(0,0)

(mz + n)−k

§3 Modular forms and cusp forms

79

is a modular form of weight k for SL2 (Z). Its Fourier expansion is Gk (z) = 2ζ(k) + 2

∞ (2πi)k σk−1 (n) q n (k − 1)! n=1

where σk (n) = d|n dk , and ζ denotes the Riemann zeta function. If we normalize Gk so that the constant term is 1, and use the well-known formula 2ζ(k) = − we ﬁnd Ek (z) = 1 −

Bk (2πi)k k!

∞ 2k σk−1 (n)q n Bk n=1

Here Bk is the k-th Bernoulli number deﬁned by ∞

Bk tk t = et − 1 k! k=0

Ek is called the k-th Eisenstein series. 2. Ramanujan’s cusp form Deﬁne Δ(z) = q

∞

(1 − q n )24 , q = e2πiz

n=1

Then Δ(z) =

∞

τ (n) q n

n=1

is a cusp form of weight 12 for SL2 (Z); τ (n) is called the Ramanujan function. Ramanujan conjectured in 1916 that (i) τ (nm) = τ (n)τ (m) (n, m) = 1 (ii) |τ (p)| ≤ 2p11/2 , p prime (i) was proved by Mordell in 1928 and (ii) by Deligne in 1974. More generally, if f ∈ Sk (Γ0 (N ), ), then its Fourier coeﬃcients satisfy an = O(n

k−1 2 +δ

) for any δ > 0.

For k ≥ 2, this is due to Deligne. If k = 1, this is a theorem of Deligne-Serre. 3. Deﬁne Bn,χ by

∞ teat Bn,χ tn χ(a) ct = e − 1 n=0 n! a=1 c

80

Chapter 4 Modular Forms and Dirichlet Series

where χ is a Dirichlet character modulo c. Let

E1,χ

⎛ ⎞ ∞ 2 ⎝ =1− χ(d)⎠ q n B1,χ n=1 d|n

If χ is odd ( i.e., χ(−1) = −1), then E1,χ ∈ M1 (Γ0 (c), χ). One can also show that if χ is even (i.e., χ(−1) = 1) then

E2,χ

⎛ ⎞ ∞ 4 ⎝ =1− χ(d)d⎠ q n B2,χ n=1 d|n

is in M2 (Γ0 (c), χ). For higher weights, one has analogous results. See for instance [La]. Theorem 3.1

Let Sk (N ) = Sk (Γ0 (N ), 1). Then dim S2 (N ) = 1 +

where i(N ) = N i2 (N ) = i3 (N ) = i∞ (N ) =

p|N

i(N ) i2 (N ) i3 (N ) i∞ (N ) − − − 12 4 3 2 1 1+ p

0 .

if 4 | N otherwise

0 .

if 2 | N or 9 | N otherwise

−4 1 + p|N p

−3 p|N 1 + p

φ ((d, N/d))

d|N

This formula occurs in [Kn, p. 272] with a misprint which has been corrected above. One can write down analogous formulas for dim Sk (N ) for k ≥ 2. (See for example, [Shi] or [CO].) For explicit examples, see Frey [F]. For k = 1, no such simple formula is known. However, for N prime, it is conjectured [D] that dimS1 (N ) =

1 h(−N ) + O(N ) 2

√ where h(−N ) denotes the class number of Q( −N ).

§4 L-functions and Hecke’s theorem

81

§4 L-functions and Hecke’s theorem If f ∈ Sk (Γ0 (N ), ) and f (z) = attach an L-function by

∞

n=1

an e2πinz is its Fourier expansion at i∞, we ∞ an ns n=1

L(s, f ) =

Since y k/2 |f (z)| is invariant under Γ1 (N ), and hence represents a function on the compact Riemann surface X1 (N ), it is bounded on h. Therefore 1/2 −2πny e an = f (x + iy)e−2πinx dx −1/2

y −k/2 Setting y = 1/n gives an = O(nk/2 ). This shows that L(s, f ) represents an analytic function for Re s > 0 −1 . It is not an element of SL2 (Z). However, Let WN = N 0

k+2 2 .

WN Γ0 (N )WN−1 ⊂ Γ0 (N ) and so f → f |WN preserves Mk (Γ0 (N )) and Sk (Γ0 (N )). Moreover, f |WN2 = f . WN is called the Atkin-Lehner involution. Since WN is a linear transformation of the vector space Sk (Γ0 (N )) and WN2 = 1, it decomposes the space into Sk+ (Γ0 (N )) and Sk− (Γ0 (N )) corresponding to the eigenvalues ±1. Note that if Sk (Γ0 (N )) = 0 then k is even. Theorem 4.1 function and

(Hecke) Let f ∈ Sk± (Γ0 (N )). Then L(s, f ) extends to an entire Λ(s, f ) = N s/2 (2π)−s Γ(s)L(s, f )

satisﬁes the functional equation Λ(s, f ) = ±(−1)k/2 Λ(k − s, f ) Proof. Since f |WN = ±f , we ﬁnd i f = ±N k/2 ik y k f (iy) Ny Since N −s/2 Λ(s, f ) =

∞

f (iy)y s−1 dy

0

we see that N

−s/2

√ 1/ N

f (iy)y s−1 dy +

Λ(s, f ) = 0

∞

√ 1/ N

f (iy)y s−1 dy

82

Chapter 4 Modular Forms and Dirichlet Series

In the ﬁrst term, we replace y by 1/N y and use the modular relation to get N

−s/2

Λ(s, f ) = ±N

k/2 k

i

∞

√ 1/ N

f (iy)y

k−s−1

dy +

∞ √ 1/ N

f (iy)y s−1 dy

which gives the analytic continuation and functional equation. Corollary 4.2

Let f ∈ Sk (Γ0 (N )). Then L(s, f ) extends to an entire function.

§5 Hecke operators

∞ Let p denote a prime number and f (z) = n=0 an q n be a modular form on Γ0 (N ) of type (k, ). The Hecke operators Tp and Up are deﬁned by f |Tp = f |Up =

∞ n=0 ∞

anp qn + (p)pk−1

∞

an q np

if p N

n=0

if p | N

anp qn

n=0

It is not diﬃcult to show that f |Tp , f |Up are also modular forms on Γ0 (N ) of type (k, ), and they are cusp forms if f is a cusp form. Theorem 5.1 (Hecke) The Tp ’s are commuting linear transformations of Sk (Γ0 (N ), ). As such, the space can be decomposed as a direct sum of eigenspaces. Let f ∈ Sk (Γ0 (N ), ). We will say that f is an eigenform if f is an eigenfunction for all the Hecke operators Tp ’s and Up ’s. If ∞ f (z) = an e2πinz n=1

is the Fourier expansion at i∞, and a1 = 1, we call it normalized. Two eigenforms will be called equivalent if they are in the same eigenspace in Sk (Γ0 (N ), ) under the action of the Hecke operators. Theorem 5.2 (Hecke) The space Sk (Γ0 (N ), ) has a basis of normalized eigenfunctions for all Tp ’s. For each normalized eigenform f , L(s, f ) =

−1 −1 ap ap (p) 1− s 1 − s + 2s+1−k p p p

p|N

which converges absolutely for Re s >

pN

k+2 2 .

Remark. The product converges for Re s >

k+1 2 .

See [Ogg] for further details.

§7 The Sato-Tate conjecture

83

§6 Oldforms and newforms Hecke’s theorems give no correlation between L-functions having functional equations and those having Euler products. The reason for this diﬃculty is two-fold. If d | N , then an element of Sk (Γ0 (d)) can also be considered as an element of Sk (Γ0 (N )) and an eigenfunction for all Hecke operators Tp , p d in Sk (Γ0 (d)) is also an eigenfunction for all Hecke operators Tp , p N in Sk (Γ0 (N )). Also if f ∈ Sk (Γ0 (d)), then f (N z/d) ∈ Sk (Γ0 (N )), as a trivial calculation shows. We can combine both of these observations in the general context of Sk (Γ0 (N ), ). Suppose N | N and that is a Dirichlet character modulo N . If f is a cusp form on Γ0 (N ) of type (k, ) and dN | N , then z → f (dz) is a cusp form on Γ0 (N ) of type (k, ). The forms on Γ0 (N ) which may be obtained in this way from divisors N of N , N = N , span a subspace Skold (Γ0 (N ), ) called the space of oldforms. Its orthogonal complement under the Petersson inner product is denoted Sknew (Γ0 (N ), ) and the eigenforms in this space are called newforms. We have Sk (Γ0 (N ), ) = Skold (Γ0 (N ), )

-

Sknew (Γ0 (N ), )

Theorem 6.1 (Atkin-Lehner) If f is a newform then its equivalence class is onedimensional. If f is a newform of level N , then L(s, f ) extends to an entire function, has an Euler product and satisﬁes a functional equation. We say that f is of CM type if there is a quadratic ﬁeld K such that a(p) = 0 whenever p N and p is inert in K. The analytic behaviour of the coeﬃcients of f varies according as f is or is not of CM-type.

§7 The Sato-Tate conjecture Let f (z) =

∞

a(n)e2πinz

n=1

be a newform of weight k and level N which is also a cusp form. Let us write for each prime p N , a(p) = 2p

k−1 2

cos θp .

Since we know the Ramanujan conjecture, the θp ’s are real. Inspired by the SatoTate conjecture for elliptic curves, Serre [Se] conjectured that if f is not of CMtype, then θp ’s are uniformly distributed with respect to the Sato-Tate measure 2 sin2 θ. π

84

Chapter 4 Modular Forms and Dirichlet Series

If we consider the group SU (2, C) and consider its space of conjugacy classes, we can make the following assignment: iθ e p 0 p → conjugacy class of . 0 e−iθp The Haar measure on the space of conjugacy classes in SU (2, C) is 2 sin2 θ. π Therefore, following the formalism of the previous chapter, we see that the SatoTate conjecture is true if a certain family of L-functions admit an analytic continuation to Re(s) ≥ 1 and do not vanish there. More precisely, consider for each m ≥ 1, −1 m eiθp (m−2j) Lm (s) = 1− . ps j=0 pN

Clearly, Lm (s) converges for Re(s) > 1. It is in fact conjectured that each Lm (s) extends to an entire function. By Theorem III.3.1, we see that the Sato-Tate conjecture is true if and only if Lm (1 + it) = 0 for every real t and m ≥ 1. The fact that L1 (s) extends to an entire function follows from Hecke’s theory. In 1939, Rankin and Selberg (independently) introduced a powerful method into the theory of numbers and as a consequence established that (s − 1)ζ(s)L2 (s) extends to an entire function. It was Shimura who using the theory of modular forms of half-integral weight managed to isolate L2 (s) and established an analytic continuation for all s ∈ C. By the powerful methods of the Langlands program, Shahidi has established the analytic continuation for L3 (s), and L4 (s) to Re(s) ≥ 1. (See [Sh].) In some cases, he has obtained better results establishing a meromorphic continuation and deﬁning sets where possible poles may exist. Since we do not need these here, we will not go into these details. Ogg [Ogg2 ] has shown that if for each r ≤ 2m, Lr (s) has an analytic continuation to Re(s) > 1/2 − δ for some δ > 0, then Lm (1 + it) = 0. K. Murty [VKM2] showed that it suﬃces to have analytic continuation of each Lm (s) up to Re(s) ≥ 1.

§8 Oscillations of Fourier coeﬃcients of newforms Deligne’s theorem proving the Ramanujan-Petersson conjecture implies that if f is a newform of weight k, then |an | ≤ n

k−1 2

d(n)

§8 Oscillations of Fourier coeﬃcients of newforms

85

where d(n) is the divisor function. It is known that the maximum order of d(n) satisﬁes d(n) = O(exp(c log n/ log log n)) for some constant c > 0. Therefore, an = O(n

k−1 2

exp

c log n ). log log n

We would like to know if this is best possible. This question has a long history. Before we discuss the past accomplishments on this question, we recall the Ω notation. Let g be a positive function and f any function. We say f (x) = Ω(g(x)) if there is some constant c > 0 such that |f (x)| > cg(x) for inﬁnitely many x→∞. We also write f (x) = Ω± (g(x)) if there exists a constant c > 0 such that f (x) > cg(x) for inﬁnitely many x→∞ and f (x) < −cg(x) for inﬁnitely many x→∞. Hardy proved that an = Ω(n Rankin showed

k−1 2

).

|an | = +∞. (k−1)/2 n n→∞

lim sup

Then Joris proved that for δk = 6/k2 , we have an = Ω(n(k−1)/2 exp(c(log n)δk − ). This was improved by Balasubramanian and R. Murty to δk = 1/k. In the special case of the Ramanujan τ -function, they showed τ (n) = Ω(n11/2 exp(c(log n)2/3− )). In 1983, R. Murty [RM] proved the following result.

86

Chapter 4 Modular Forms and Dirichlet Series

Theorem 8.1 For any normalized newform f of weight k and level N , there is a constant c > 0 such that k−1 c log n an = Ω± n 2 exp . log log n In view of the above discussion, this is best possible apart from the value of the constant c > 0. For any cusp form of weight k, he also established the following result: Theorem 8.2 For any cusp form f of weight k and level N , there is a constant c > 0 such that k−1 c log n 2 an = Ω n exp . log log n To prove these, we ﬁrst need the non-vanishing of L3 (s) and L4 (s) on Re(s) = 1. To do this, we prove a slightly more general theorem: Theorem 8.3 If Lr (s) has an analytic continuation up to Re(s) ≥ 1/2 for 1 ≤ r ≤ 2m, then L2m−1 (1 + it) = 0 for t ∈ R. If L2r (s) has an analytic continuation up to Re(s) = 1 for 1 ≤ r ≤ m, then L2m (1 + it) = 0. Corollary 8.4

L3 (1 + it) = 0 and L4 (1 + it) = 0 for all t ∈ R.

Proof of the Corollary. By the history described at the end of the previous section, we know that L1 (s), L2 (s), L3 (s) and L4 (s) extend to analytic functions for Re(s) ≥ 1. The result now follows from the theorem. Proof of Theorem 8.3. We ﬁrst show that L2m (1 + it) = 0. Consider f (s) = L0 L2 · · · L2m . Then, by the trigonometric identities (see exercise) sin(n + 12 )θ 1 + cos θ + cos 2θ + · · · cos nθ = 2 2 sin(θ/2) and cos θ + cos 3θ + · · · cos(2n − 1)θ = we see that log Lr (s) =

sin(r + 1)nθp n,p

sin nθp

sin nθ 2 sin θ 1 . npns

Therefore, as 1+

sin(2n − 1)θ sin 3θ sin 5θ + + ··· + = sin θ sin θ sin θ

sin nθ sin θ

2 ,

§8 Oscillations of Fourier coeﬃcients of newforms

87

we ﬁnd that log f (s) is a Dirichlet series with non-negative coeﬃcients. Moreover, the Euler product shows that f (s) does not vanish in σ > 1. An application of Theorem I.1.2 with e ≤ 1 gives the result. Now consider, g(s) = (L0 L1 · · · L2m−1 )2 L2m . An easy computation gives

log g(s) =

⎛ ⎝(2m + 1) +

n,p

2m−1

⎞ 2(j + 1) cos(2m − j)nθp ⎠

j=0

Since 2m + 1 +

∞

2(j + 1) cos(2m − j)θ =

j=0

sin(m + 1/2)θ sin(θ/2)

1 . npns

2 ,

we see that log g(s) is a Dirichlet series with non-negative coeﬃcients. If L2m−1 (1+ it) = 0, with t = 0, then g(s) has a zero of order ≥ 2 on Re(s) = 1, s = 1. As g(s) has a pole of order 2 at s = 1, we get a contradiction again by Theorem I.1.2. We need to consider L2m−1 (1) = 0. If this happens, then g(s) is regular. By a wellknown theorem of Landau (see exercise 5 in Chapter 1), we ﬁnd that log g(s) has a singularity at its abscissa of convergence. As L0 (s) = ζ(s) has zeros in Re(s) ≥ 1/2, g(s) has zeros in this half-plane. Therefore the abscissa of convergence of log g(s) is σ0 ≥ 1/2, and as g(s) is analytic in Re(s) ≥ 1/2, σ0 is a zero of g(s). But then g(σ) ≥ 1 for σ > σ0 . We get a contradiction by letting σ→σ0+ . This completes the proof of the theorem. Theorem 8.4 Suppose Lr (s) has an analytic continuation up to Re(s) ≥ 1/2 for all r ≤ 2m + 2. Then (i)

for r ≤ m + 1,

(2 cos θp )2r =

p≤x

1 r+1

2r r

(1 + o(1))

as x→∞, and (ii) for r ≤ m,

(2 cos θp )2r+1 = o(x/ log x)

p≤x

as x→∞.

x log x

88

Chapter 4 Modular Forms and Dirichlet Series

Proof. By Theorem 8.3, we know that Lr (s) does not vanish on the line σ = 1. Observe further that r sin(r + 1)θp i(r−2j)θ = e . sin θp j=0 Therefore, by the Tauberian theorem, we deduce for 1 ≤ r ≤ 2m + 2, sin(r + 1)θp = o(x/ log x) sin θp p≤x

and Tn (cos θ) = cos nθ for each n ≥ 1, we as x→∞. Writing Un (cos θ) = sin(n+1)θ sin θ ﬁnd for 2 ≤ r ≤ 2m + 2, Tr (cos θp ) = o(x/ log x), p≤x

because of the identity 2Tn (x) = Un (x) − Un−2 (x). Note that Tn (x) and Un (x) are the familiar Chebycheﬀ polynomials. Also, 1 x T2 (cos θp ) = (− + o(1)) , 2 log x p≤x

and

T1 (cos θp ) = o(x/ log x),

p≤x

as L1 (s) is regular and non-vanishing for Re(s) ≥ 1. Now deﬁne T0 (x) = 1/2. Then the inverse relation for the Chebycheﬀ polynomials gives r r r (2 cos θ) = 2 Tr−2k (cos θ) k k=0

where r = [r/2] (see the exercises). Therefore, p≤x

r

(2 cos θp ) = 2

r

Tr−2k (cos θp ).

k=0 p≤x

By the above results, the inner sum is o(x/ log x) unless r − 2k = 2 or 0. Hence, (ii) is deduced. In the case of (i), we ﬁnd that x 2r 2r 2r + (1 + o(1)) (2 cos θp ) = − r−1 r log x p≤x

as x→∞. The term in the brackets is easily seen to be the coeﬃcient stated in (i). To prove the omega theorem, we will need a few preliminary combinatorial identities. We collect them below and leave them as exercises.

§8 Oscillations of Fourier coeﬃcients of newforms

Lemma 8.5 (i) (ii)

89

−2j r r 2j 2 2r + 2 j −2r−1 (−1) =2 , j j j+1 r+1 j=0 r 2−2r 2r + 2 r 2j + 2 2j + 2 2−2j j (−1) = . j+1 j+2 j j+1 r+2 r+1 j=0

Proof. Exercise. Theorem 8.6 Suppose that Lr (s) has an analytic continuation up to Re(s) ≥ 1/2 for all r ≤ 2m + 2. Then, each of the statements holds for a set of primes of positive density: 2 (i) for any δ > 0, −δ < 2 cos θp < , δ(m + 2) ) 4m + 2 (ii) for any > 0, |2 cos θp | > − , m+2 1 2m+1 1 2m + 2 (iii) for any > 0, 2 cos θp > βm − , where βm = . 4(m + 2) m + 1 There is a corresponding result for negative values of ap . Proof. For (i), consider the polynomial Pm (x) = (x2 − 4)m (x − α)(x − β), where α, β will be suitably chosen later. By Theorem 8.4 and Lemma 8.5, we deduce log x αβ 1 2m + 2 Pm (2 cos θp ) ∼ (−1)m + . m+1 x 2 m+2 p≤x

Examining the graph of Pm (x) and choosing α, β so that αβ > −

2 , m+2

if m is even and αβ < −

2 , m+2

if m is odd, we set α = −δ to get the desired result. To prove (ii), consider Qm (x) = x2m (x2 − γ) where γ shall soon be chosen. By Theorem 8.4, log x 1 1 2m + 2 2m −γ . Qm (2 cos θp ) ∼ x m+2 m+1 m+1 m p≤x

90

Chapter 4 Modular Forms and Dirichlet Series

Examining the graph of Qm (x), we note that if −1 m + 1 2m + 2 4m + 2 2m γ= = , m m+2 m+1 m+2 we obtain (ii). To prove (iii), we begin by noting 1 |2 cos θp |2m+1 ≥ (2 cos θp )2m+2 . 2 p≤x

p≤x

By Theorem 8.4, we ﬁnd 4 (2 cos θp )2m+1 p≤x a(p)>0

1 m+2

2m + 2 m+1

x log x

as x→∞. Thus, for a positive proportion of the primes 1 2m+1 1 2m + 2 2 cos θp > − . 4(m + 2) m + 1 This completes the proof of the theorem. The proofs of Theorem 8.1 and 8.2 are now immediate. We relegate these to the exercises below. One should consult the appendix due to Serre in [Sh] for certain improvements of these results.

§9 Rankin’s theorem Rankin [R] proved the following theorem: Theorem 9.1 Let f be a normalized newform which is a cusp form of weight k with respect to Γ0 (N ). Let an denote the n-th Fourier coeﬃcient and write a(n) = an /n(k−1)/2 . Given β ≥ 0, let F (β) = Then

2β−1 β (2 + 32−β ) − 1. 5

|a(n)|2β x(log x)F (β) .

n≤x

In particular, if k = 2, then

|a(n)| x(log x)−1/18 .

n≤x

The proof of Rankin’s theorem can be found in [R] or [Sh, p. 174–175]. It relies in a crucial way on the analytic continuation of L4 (s) to Re(s) ≥ 1.

Exercises

91

Exercises 1.

Show that [SL2 (Z) : Γ(N )] = N 3

1 1− 2 p p|N p prime

2.

Show that: (i) Γ0 (N ) is not a normal subgroup of SL2 (Z) if N > 1. . (ii) Γ0 (N ) has index N p|N 1 + p1 in SL2 (Z). (iii) Γ1 (N ) is not normal in SL2 (Z) but is normal in Γ0 (N ). Compute its index. (iv) γ ∈ SL2 (Z) has ﬁnite order if and only if γ is elliptic.

3.

Show that dim S2 (2) = 0.

4.

Assuming the Sato-Tate conjecture, determine the largest positive constant c such that the estimate (in the notation of Section 9),

|a(n)| x(log x)−c

n≤x

is valid for all suﬃciently large x. 5.

6.

Show that if f ∈ Sk (Γ0 (N ), ), then f |WN ∈ Sk (Γ0 (N ), ). Derive the functional equation for L(s, f ). √ Using the fact that 2 cos θp > 2 − for a positive proportion of primes deduce that an = Ω(n(k−1)/2 exp(c log n/ log log n)), for some positive constant c. Deduce a corresponding result for negative oscillation.

7.

Given any cusp form f of level 1, show that there are numbers mi which are not all zero so that r mi Ti (f ) i=1

is an eigenform. Here, Ti denotes the i-th Hecke operator and r is the dimension of the space of cusp forms of weight k. Now deduce Theorem 8.2 for any cusp form of level 1.

92

Chapter 4 Modular Forms and Dirichlet Series

References [CO] H. Cohen and J. Oesterl´e, Dimensions des espaces de formes modulaires, Lecture Notes in Mathematics, 627(1977), 69–78. [D] W. Duke, The dimension of the space of cusp forms of weight one, Int. Math. Res. Not., 1995, pp. 99–109. [F]

G. Frey, Construction and arithmetical application of modular forms of low weight, in: Elliptic curves and related objects, pp. 1–21, eds. H. Kisilevsky and R. Murty, CRM Proceedings and Lecture Notes, vol. 4, Amer. Math. Soc., Providence, 1994.

[Kn] A. Knapp, Elliptic Curves, Princeton University Press, 1992. [La] S. Lang, Introduction to modular forms, Grundl. Math. Wiss. 222, SpringerVerlag, Berlin, New York 1976. [Ogg] A. Ogg, Survey of modular functions of one variable, in: ‘Modular functions of one variable I’, (Ed. W. Kuijk), Lecture Notes in Math., 320 (1972) pp. 1–36, Springer-Verlag. [Ogg2] A. Ogg, A remark on the Sato-Tate conjecture, Inventiones Math., 9 (1970) p. 198–200. [RM] M. Ram Murty, Oscillations of Fourier coeﬃcients of modular forms, Math. Annalen, 262 (1983) p. 431–446. [R] R. Rankin, Sums of powers of cusp form coeﬃcients II, Math. Ann., 272 (1985) p. 593–600. [VKM] V. Kumar Murty, Introduction to abelian varieties, CRM Monograph Series, Volume 3, 1993, American Math. Society, Providence, USA. Chapters 9–11. [VKM2] V. Kumar Murty, On the Sato-Tate conjecture, in: Number Theory related to Fermat’s Last Theorem, (ed. N. Koblitz), Birkh¨ auser-Verlag, Boston, 1982, pp. 195–205. [Sh] F. Shahidi, Symmetric power L-functions for GL(2), in: Elliptic curves and related objects, (ed. H. Kisilevsky and M. Ram Murty), CRM Proceedings and Lecture Notes, Vol. 4 (1994) pp. 159–182. [Shi] G. Shimura, Introduction to the arithmetic theory of automorphic functions, Publ. Math. Soc. Japan, vol. 11, Iwanami Shoten, 1977.

Chapter 5 Dirichlet L-Functions

§1 Introduction Let χ denote a Dirichlet character and L(s, χ) the associated Dirichlet L-function. Let us begin by considering how one would approach the problem of showing that L( 12 , χ) = 0. In the following, we assume that χ is deﬁned modulo a prime q. We ﬁrst study the average 1 L( , χ). 2 χ(mod q)

By the approximate functional equation, one can show that χ(n) 1 √ . L( , χ) ∼ 2 n √ n< q

Hence, χ(mod q)

χ(n) 1 √ ∼ φ(q). L( , χ) ∼ 2 n √ χ n< q

Similarly, χ(mod q)

χ(n1 )χ(n 1 ¯ 2) |L( , χ)|2 ∼ √ 2 n n2 1 χ n ,n
2

1 ∼ φ(q) n n
M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_6, © Springer Basel AG 1997

93

Chapter 5 Dirichlet L-Functions

94

Therefore,

2 1 φ(q) L( , χ) 2 ⎛ ⎞ 1 |L( , χ)|2 ⎝ 1⎠ 2 1 L( 2 ,χ)=0 ⎛ ⎞ φ(q)(log q) ⎝ 1⎠ 2

L( 12 ,χ)=0

and so

1 φ(q) = 0} . #{χ mod q : L( , χ) 2 log q

This kind of argument can in fact be made precise. The reader is referred to the paper [B] of Balasubramanian where only a slightly weaker result is proved. We may try to consider higher moments as well. In general, one might expect that for some c > 0 depending on k,

dk (r)2 1 |L( , χ)|2k ∼ cφ(q) 2 r k r
where dk (r) denotes the number of decompositions of r as a product of k positive integers. Now, dk (p) = k so the Dirichlet series dk (r)2 rs has Euler product

1+

p

k2 + · · · ps

which near s = 1 behaves like ζ(s)k ∼ (s − 1)−k . 2

2

Thus for some c > 0, depending on k, 2 dk (r)2 ∼ c X(log X)k −1 r<X

and so we might expect

2 1 |L( , χ)|2k ∼ ck φ(q)(log q)k . 2

(∗)

§2 Polya-Vinogradov estimate

95

If we assume this and try to apply the above argument, we do not seem to get anything better. Of course, (∗) is still very highly conjectural. The largest value for which an asymptotic formula is known is k = 2, and prime q. This is a result of Heath-Brown [HB]. In spite of our diﬃculty in showing that L( 12 , χ) = 0 holds often, no example is known for which L( 12 , χ) = 0. Siegel [Sie] has shown that any point on the line σ = 12 is a limit point of zeroes of the L(s, χ) as χ ranges over all Dirichlet 1 characters. More precisely, he shows that for any [T1 , T2 ], with T2 −T1 (log q)− 2 , there exists a χ(mod q) with L(s, χ) = 0 at some point s = σ + it in the rectangle 1 1 1 σ∈[ , + ], t ∈ [T1 , T2 ]. 2 2 log log q The dimensions of this rectangle can be somewhat reduced now. A variant of these problems is to restrict attention to real characters χ. In this case, one can average over such characters only and show that many of them do not vanish. In this chapter, we shall look at both of these problems. After establishing the fundamental estimate of Polya and Vinogradov in Section 2, we discuss in §§3–4 the work of Jutila which establishes an asymptotic formula for the sum 1 L( , χD ) 2 ' ( where χD is the real character given by D. . From this, one can deduce the nonvanishing of inﬁnitely many such L-functions. In §5, we describe the work of R. Balasubramanian and V. K. Murty [BM] which considers Dirichlet L-functions to a prime modulus q. Here, it is proved that for a positive proportion of the χ(mod q), we have L( 12 , χ) = 0. Finally in §6, we brieﬂy describe the work of R. Murty [RM] which establishes a strengthening of §5 assuming the Riemann Hypothesis .

§2 Polya-Vinogradov estimate In this section, we shall describe an estimate for character sums involving a Dirichlet character. Let q ≥ 1 be an integer and let χ denote a character mod q. We shall consider a sum of the form χ(n) n≤x

or more generally,

y≤n≤x

χ(n).

Chapter 5 Dirichlet L-Functions

96

Since χ is periodic mod q, we certainly have the trivial estimate χ(n) ≤ min(q, x − y + 1). y≤n≤x

If χ is the trivial character, then this is best possible. However, for nontrivial χ it is possible to do substantially better. Theorem 2.1

We have the estimate 1 χ(n) q 2 log q. y
Remark. If χ is primitive, then the implied constant may be taken to be one. In general, sharp estimates for the constant, as well as connections of this problem with diophantine approximation have been given in work of Hildebrand [Hil]. Proof. First, suppose that χ is primitive. We have the identity in terms of Gauss sums: q 1 an χ(n) = χ(a)e( ¯ ) g(χ) ¯ a=1 q where e(α) = exp(2πiα). Summing both sides over n, we have to estimate q 1 an χ(a) ¯ e( ). g(χ) ¯ a=1 q y
Since the inner sum is a geometric series, it can be evaluated exactly and we ﬁnd it to be (M + 12 N + 12 )a sin πN a/q e q sin πa/q where M = [y] + 1 and N = [x] − [y] − 1. Hence, using the fact that the Gauss 1 sum g(χ) ¯ has absolute value q 2 we deduce that |

χ(n)| ≤ q− 2

y≤n≤x

1

q−1

1 . sin πa/q a=1

Using the estimate sin πβ > 2β

if 0 < β <

1 2

we deduce that q−1

[q/2] 1 1 <1+q < 1 + q log q/2. sin πa/q a a=1 a=1

§3 Jutila’s character sum estimate

Thus |

97

1

χ(n)| < q 2 log q.

y≤n≤x

The proof of the estimate in the case that χ is not primitive is left as an exercise. One can ask whether the estimate provided by the Polya-Vinogradov inequality is best possible. Montgomery and Vaughan [MV] have shown that assuming the Riemann Hypothesis for Dirichlet L-functions, we have the sharper estimate 1 χ(n) q 2 log log q. n≤x

It is possible to remove the factor log q entirely if we work with even characters and integrate over the summation parameter. For a character of any parity, it is possible to eliminate the log q factor if we integrate and take the mean square. These observations play a crucial role in Chapter 6.

§3 Jutila’s character sum estimate The character sum referred to is the following: 2 D S(X, Y ) = n n≤Y |D|≤X where the outer sum is over D which belong to the set D of integers satisfying: (i) D is not a square and (ii) D ≡ 1(mod 4) or D = 4N with N ≡ 0(mod 4). Note that if (D/·) is a non-principal character, then D ∈ D. is

If we apply the Polya-Vinogradov estimate to the inner sum, we ﬁnd that it |D|(log |D|).

From this, we deduce that S(X, Y ) X 2 (log X)2 . In [Jut1], Jutila proves the following. Theorem 3.1

For X ≥ 3 and Y ≥ 1, we have S(X, Y ) XY (log X)2 .

Remark. In [Jut1], one actually ﬁnds the weaker estimate XY (log X)8 . Several authors, including Jutila (see [Jut2, p. 155]), observed that the methods of [Jut1] in fact yield Theorem 3.1.

Chapter 5 Dirichlet L-Functions

98

The main tool in the proof of Theorem 3.1 is a Lemma of Vinogradov [Vino] which gives an explicit and eﬀective smooth approximation to the characteristic function of an interval. We state the lemma as follows. Lemma 3.2

Let α, β and δ be real numbers satisfying 0<δ<

1 , δ ≤ β − α ≤ 1 − δ. 2

Then there exists a periodic function ψ(x) with period 1, and satisfying (i) ψ(x) = 1 in the interval [α + 12 δ, β − 12 δ] (ii) ψ(x) = 0 in the interval [β + 12 δ, 1 + α − 12 δ] (iii) 0 ≤ ψ(x) ≤ 1 in the intervals [α − 12 δ, α + 12 δ] and [β − 12 δ, β + 12 δ] (iv) ψ(x) has the Fourier expansion ψ(x) = β − α +

∞

(am e(mx) + a ¯m e(−mx))

m=1

where am = (2πim)−1 (e(−mα) − e(−mβ)){

sin( 12 πmδ) 2 } 1 2 πmδ

and |am | min(β − α, m−1 , δ −2 m−3 ). In the proof of Jutila’s estimate, one uses not a single function ψ but a family of such functions. Denote by ψ(x, u) the function ψ(x) of Lemma 3.2 with the parameters 1 1 1 δ = X − 2 , α = δ, β = Y u−1 + δ. 2 2 Then one has the estimates ∂ψ(x, u) δ −1 ∂x and dψ(n/u, u) X 2. du u Moreover Y dam,u 2 min(1, m−2 δ −2 ). du u

§3 Jutila’s character sum estimate

99

Proof of Theorem 3.1. First, we observe that the estimate is easy in certain ranges of the parameters. For example, the case Y ≥ X/4 follows from the Polya1 Vinogradov estimate and the case Y ≤ X 2 also follows easily and is left as an 1 exercise. We are now left with the essential range X 2 ≤ Y ≤ X/4. Moreover, the Polya-Vinogradov estimate also implies that √ S( XY , Y ) XY (log XY )2 . Hence, we may assume in the summation over |D| that √ Y ≤ XY ≤ |D| ≤ X. Using the fact that n ψ( , |D|) = |D|

|D| √ X

1 if

0 if Y +

≤n≤Y |D| √ X

≤ n ≤ |D|

and the above inequality on |D|, we see that n D n D ψ( , |D|) = ψ( , |D|) + |D| n |D| n √ 1 n≤|D|

Using the fact that

√ Y
n ψ( , |D|) |D|

√ |D| Y + √ ≤ Y + X < |D| X

the last sum can be extended to the range Y < n < Y + D S(D) = . n

Then, we can decompose the inner sum as follows: S(D) = S1 (D) + S2 (D) + S3 (D) n D S1 (D) = ψ( , |D|) |D| n n≤|D| n D S2 (D) = (1 − ψ( , |D|)) |D| n 1 n≤X 2

and

S3 (D) =

Y ≤n≤Y +X

ψ( 1 2

n , |D|) |D|

D n

.

√ X. Let us write

n≤Y

where

D n

X
n≤X 2

−

D n

.

Chapter 5 Dirichlet L-Functions

100

We consider the sum S=

∗

|S(D)|2

√ XY <|D|≤X

3 i=1

∗ √

|Si (D)|2

XY <|D|≤X

where ∗ denotes a sum over fundamental discriminants. Now, we can use Vinogradov’s Lemma to estimate these sums. Suppose now that D is a fundamental discriminant. First, we have ∞ D D S1 (D) = gD am,|D| +a ¯m,|D| m −m m=1 ' ( where gD = g( D. ) is the Gauss sum. Let us split the sum over m into two 1 1 segments S11 (D) and S12 (D) depending on m ≤ X 2 and m > X 2 . We have 2 ∗ D 2 |S11 (D)| |D| am,|D| m √ √ m≤X 12 XY <|D|≤X XY <|D|≤X which on expanding is equal to r,s≤X

1 2

√

|D|ar,|D| a ¯s,|D|

D rs

.

(3.1)

XY <|D|≤X

To estimate this, ﬁrst consider the case when rs is a square. Using the estimates provided by Vinogradov’s lemma, we have to consider 2

1 r<s≤X 2 rs=t2

|D|≤X

|D| min

Y 1 Y 1 min . |D| r |D| s

The inner sum over D is split into three subsums depending on the value of the minima, namely sY < |D| ≤ X, rY < |D| ≤ sY and |D| ≤ rY . Consider the ﬁrst range. The sum is

1 r<s≤X 2 rs=t2

sY <|D|≤X

Y

2

r<s<X/Y rs=t2

Y2

s≤X/Y

|D|

Y2 |D|2

X 1 log + O( ) Ys Ys

log

X Ys

t<s s0 |t

1 + O(Y 2

(3.2)

s≤X/Y

1 ). Ys

§3 Jutila’s character sum estimate

101

Here s0 is deﬁned as follows: if pa ||s then pb ||s0 where b is the least integer ≥ a/2. It is easy to show that ∞ n/n0 = ζ(2s − 1)ζ(s)/ζ(2s). ns n=1

Hence,

n/n0 x(log x).

n≤x

Inserting this into (3.2), using partial summation, and observing that the O term is O(Y log X), we deduce that (3.2) is

Y2

s≤X/Y

s X log XY (log X/Y ). s0 Ys

For the sum in the range rY < |D| < sY we have an estimate

1 r<s≤X 2 rs=t2

|D|≤X

|D|

Y 1 XY |D| s

1 r<s<X 2 rs=t2

1 s 1 XY s s s0 1 s<X 2

XY log X by estimates similar to the ones described above. Finally, in the remaining range |D| ≤ rY , we have that the sum is

|D|≤X

|D|

1 |D| 2 Y
1 Y (log X)2 1 |D| t2 |D| 2 rs=t

|D|≤X

XY (log X) . 2

Now we estimate the contribution of terms with rs not a perfect square. We introduce the functions D fr,s (u) = uar,u a ¯s,u and gr,s (u) = . rs |D|≤u

Let p(n) = 0 if n is a square and p(n) = 1 otherwise. Then the part in consideration is by partial summation * X p(rs) gr,s (X)fr,s (X) − √ gr,s (u)fr,s (u)du . 1

r,s≤X 2

XY

Chapter 5 Dirichlet L-Functions

102

Using the estimates fr,s (X) Xr−1 s−1 ,

1

gr,s (u) (rs) 2 log X

and fr,s (u)

Y −1 (r + s−1 ) u

we ﬁnd that the sum in question is 1 1 X(log X)(rs)− 2 + (rs) 2 (log X)Y (log X/Y )(r−1 + s−1 ) 1

r,s≤X 2

X 3/2 log X + XY (log X)(log X/Y ) XY (log X)2 . Next to estimate S12 we proceed in an analogous fashion. Now r, s run over 1 the range r, s > X 2 . Explicitly, we are trying to estimate D |D|ar,|D| a ¯s,|D| . (3.3) rs √ √ XY ≤|D|≤X

r,s> X

The pairs for which rs is a square give a contribution X2 d(t2 )X 2 n−6 X 3/2 (log X)2 . t2 >X When rs is not a square, the estimates above for fr,s and fr,s are replaced by

fr,s (X) X 3 (rs)−3 and

X 2 Y −1 (r + s−1 ) + X 2 (rs)−3 . ur2 s2 Inserting these, one ﬁnds that the contribution of S12 is (rs)1/2 (log X)(X 3 (rs)−3 + X 2 Y (log X/Y )(rs)−2 (r−1 + s−1 )) fr,s (u)

√ r,s> X

and this is X 3/2 log X + XY (log X)(log X/Y ) XY (log X)2 . Next we consider the sums S2 and S3 . The sum S3 is Y
1 2

|D|≤X

ψ(

r s , |D|)ψ( , |D|) |D| |D|

D rs

.

§3 Jutila’s character sum estimate

103

The contribution of pairs r, s with rs a square is estimated by bounding the ψ values by 1 giving a total estimate of X

1X

1 Y
d(t2 ) 1

t≤Y +X 2

XY (log X)2 . For the nonsquare terms, we have by partial summation that the sum is

r s p(rs) ψ( ,X)ψ( ,X)gr,s (X) − X X 1

* r s gr,s (u)d ψ( ,u)ψ( ,u) . √ u u XY

X

Y
Now

√ s ψ( , u) = 0 if u < (s − Y ) X. u

By the estimate dψ( nu , u) X 2 du u and the Polya-Vinogradov estimate, we see that

X

√

XY

X √ r s X gr,s (u)d ψ( , u)ψ( , u) rs log X du. √ √ 2 u u u max((s−Y ) X, XY )

Inserting this into the sum over r, s we ﬁnd that it is √ Y
√ √ r X log X

√ Y <s
√

s √ max(s − Y, Y )

and this is

√ Y
⎧ ⎨ √ √ r X log X ⎩

√ Y <s
√ s √ + Y

√ √ Y + Y <s
⎫ √ ⎬ s s−Y ⎭

XY (log X)2 . A similar estimate holds for S2 . To complete the proof of the Theorem, we have to estimate the contribution of imprimitive characters (that is, the contribution from those D which are not

Chapter 5 Dirichlet L-Functions

104

' ( ' ( fundamental discriminants). If d. is the primitive character inducing D. and D = de2 we have D d | |2 = | |2 n n n≤Y n≤Y

(n,e)=1

=|

δ|e

≤ d(e)

d μ(δ) δ

δ|e

m≤Y

δ −1

d m

|2

d | |2 . n −1 n≤Y δ

Hence, for ﬁxed values of d and δ with |d|δ ≤ X we have a contribution to S(X, Y ) an amount which is √ √ d d X X ≤| |2 d(e)| |2 d(δ) log . n n |d|δ |d|δ −1 2 −1 −1 e ≤X|d| n≤Y δ n≤Y δ δ|e

Now the summation over |d| ≤ Xδ −1 for a ﬁxed δ gives a contribution √ X d(δ) δ

|d|≤X/δ

√ 2 1 d X log . n |d| n≤Y /δ |d|δ

Applying the result for fundamental discriminants and using partial summation, we deduce that this is √ ) X XY X d(δ) (log )2 . δ δ δ δ Now a summation over δ accomplishes the proof of Theorem 3.1.

§4 Average value of L 12 , χD In this section, we outline the proof of Jutila [Jut2] of the mean and mean-square of L( 12 , χD ). The results are as follows. Theorem 4.1

We have ∗ 0
1 L( , χd ) = c1 Y log Y + c2 Y + O(Y 3/4+ ) 2

where the sum is over fundamental discriminants and c1 is a non-zero constant.

§4 Average value of L

Theorem 4.2

'1

2 , χD

(

105

We have ∗ 1 L( , χd )2 = c3 Y (log Y )3 + O(Y (log Y )5/2+ ) 2

0
where the sum is over fundamental discriminants and c3 is a non-zero constant. Similar statements hold for negative discriminants also. Corollary 4.3 Let N (Y ) denote the number of fundamental discriminants 0 < d ≤ Y such that L( 12 , χd ) = 0. Then N (Y ) Y / log Y. Proof. By Cauchy’s inequality, 2 ⎛ ⎞ ∗ ∗ 1 1 L( , χd ) ⎝ L( , χd )2 ⎠ · N (Y ). 2 2 0
∗ d dw n

0
the summation ranging over fundamental discriminants, and fY (n) = fY (n, 0). For a real primitive character χ mod q > 1, and an X ≥ 1 we have the identity ∞ 1 L( , χ) = χ(n) exp(−n/X)n−1/2 2 n=1 q −s Γ( 1 (a + 1 1 2 − L( − s, χ) 2πi (− 12 − ) 2 π Γ( 12 (a +

1 2 1 2

− s)) Γ(s)X s ds + s))

which follows easily from the functional equation. Here, a = 12 (1 − χ(−1)). Summing over χ corresponding to d > 0, and observing that a = 0 in this case, we get ∗ 0
∞ 1 L( , χd ) = fY (n) exp(−n/X)n−1/2 2 n=1 ∞ Γ( 1 − s ) 1 1 − fY (n, −s)ns− 2 π s 41 2s Γ(s)X s ds 2πi (− 12 − ) n=1 Γ( 4 + 2 )

=S−I

Chapter 5 Dirichlet L-Functions

106

say. When n is a square, we have 1

fY (n) = cn Y + O(Y 2 d(n)) where cn =

3 1 (1 + )−1 . 2 π p p|n

Moreover, for Re(s) > 0, fY (n, s) = cn

Y 1+s + O((|s| + 1)d(n)Y 1+s

1 2 +σ

).

We apply this ﬁrst to S. Indeed, in S the squares contribute an amount ∞

(cm Y + O(Y 2 d(m2 ))) exp(−m2 /X)m−1 1

m=1

=Y

∞ 1 cm exp(−m2 /X) + O(Y 2 (log X)3 ). m m=1

From Exercise 5, we know that ∞ 1 cm 3 1 1 exp(−m2 /X) = 2 1− ( log X + c) + O(X − 2 + ) m π p p(p + 1) 2 m=1 for some constant c. For non-square values of n, we apply the estimate of Exercise 3, namely 2 ∗ d N Y (log N )4 . (4.1) n 1
−1/2

fY (n) exp(−n/X)n

n=1

∞ 1 1 ( fY (n)2 exp(−n/X)) 2 (log X) 2 . n=1

Now using (4.1) and partial summation, we deduce that the right hand side is Y 1/2 X 1/2 (log X)5/2 . Thus

S = c1 Y log X + c Y + O(Y 2 X 2 (log X)5/2 ) 1

where c1 =

1

3 1 1 − =0 2π 2 p p(p + 1)

§4 Average value of L

'1

2 , χD

(

107

and c is some constant. In I, the squares contribute an amount ∞ 1 Y 1−s 2s−1 2 1+ − m cm + O((|s| + 1)d(m )Y ) 2πi (− 12 − ) m=1 1−s πs

Γ( 14 − 2s ) Γ(s)X s ds. Γ( 14 + 2s )

Using the fact that

∞

cm m−s =

m=1

where P (s) =

1− p

3 P (s)ζ(s) π2 1 (p + 1)ps

,

we see that the above integral gives 1 s 1 1 Y 1−s 3 s Γ( 4 − 2 ) − ζ(1 − 2s)P (1 − 2s)π Γ(s)X s ds + O(Y 1+ X − 2 − ). 1 2πi (− 12 − ) 1 − s π 2 Γ( 4 + 2s ) An easy calculation shows that ζ(1 − 2s)Γ(s) = −

1 3γ/2 + − γ2 + · · · 2 2s s

and so the main term above is 1 3 P (1) −Y {c + 2 log X/Y + O((X/Y ) 2 − −1 )} π 2 for some constant c . Next, we observe that when summation is restricted to the non-squares, the integrand is ∞ −1/2− exp(−|t|)X |fY (n, −s)|n−1− n=1 n =m2

where t is the imaginary part of s. Using the above character sum estimate (4.1) and partial summation, it follows that for σ ≥ 0 we have |fY (n, s)|2 (|s| + 1)2 N Y 1+2σ (log N )4 . n≤N n =m2

Hence the integral over t is −1 Y 1+ X −1/2− . Hence I = −Y {c +

3 P (1) 1 1 log X/Y + O((X/Y ) 2 − −1 )} + O( −1 Y 1+ X − 2 − ). 2 π 2

Now choosing X = Y

1 2

gives the result. This proves Theorem 4.1.

For the proof of Theorem 4.2, we need several lemmas.

Chapter 5 Dirichlet L-Functions

108

Lemma 4.4

We have the following estimate. D (k) Y X 12 d(k)2 (log XY )17 . d(n)χ0 (n) n |D|≤X n≤X

Proof. See exercise 6. For any A > 0, we have the estimate ∗ d 1 1 d(n) Y 2 X(log XY )5/2 log log XY + Y X 2 (log XY )−A n

Lemma 4.5

n≤X

0
where the sum is taken over integers n which are not squares and fundamental discriminants d. Proof. Let us consider the double sum where the inner sum is taken over d ≡ 1(mod 4). The other cases are similar. The sum in question is ⎛ ⎞ m ⎝ d(n) μ(a)⎠ n 2 m≤Y a |m

n≤X

m≡1(mod 4)

=

a≤Y

(2)

μ(a)χ0 (a)

1 2

(a)

d(n)χ0 (n)

h≤Y /a2 h≡1(mod 4)

n≤X

h n

= S1 + S2 where the sum S1 is over a ≤ Z = (log XY )B for a suitable constant B > 0 and S2 is the remainder. Applying Cauchy’s inequality twice, we get ⎞ 12 ⎛ ⎜ (a) S1 ≤ ⎝ a−1 ⎠ ⎝ a d(n)χ0 (n) a≤Z a≤Z n≤X ⎛

h≤Y a−2 h≡1(mod 4)

2 ⎞ 12 h ⎟ ⎠ n

⎧ 2 ⎞⎫ 12 ⎛ ⎨ h ⎬ (log Z) a d(n)2 ⎝ ⎠ ⎩ a ⎭ n n n 1 2

h

⎛ ⎞1 2 2 1 1 h ⎠ X 2 (log X)3/2 (log log XY ) 2 ⎝ a . n a n h

The double sum over h and n is estimated by (4.1) above. We ﬁnd it is 2 h Y Xa−2 (log X)2 . n n h

§4 Average value of L

'1

2 , χD

(

109

Hence, summing over a we get 1

S1 Y 2 X(log X)5/2 log log XY. In S2 we sum ﬁrst over n using Lemma 4.4. Taking into account also the values of h which are squares, we get

S2

Z
(Y a−2 X 2 d(a)2 (log XY )17 + Y 2 a−1 X(log X)) 1

1

1 2

1 2

1

Y X (log XY )20−B + Y 2 X(log XY )2 . Combining the above estimates we get the stated result. Now we are ready to establish the mean square of L( 12 , χd ). Proof of Theorem 4.2. Again from the functional equation, we have the following identity: ∗

1 L( , χd )2 2

0
=

fY (n)d(n) exp(−n/X)n− 2 1

n=1

1 − 2πi

{

∞

1

fY (n, −2s)d(n)ns− 2 }

(−3/4) n=1

Γ2 ( 14 − 2s ) Γ(s)(π 2 X)s ds Γ2 ( 14 + 2s )

= S(X, Y ) + I(X, Y ). Using Lemma 4.5, the non-squares in S(X, Y ) contribute an amount

√

XY (log XY )5/2 log log XY + Y (log XY )−A .

For the integral, the sum over the non-square values of n is split at a point U ≥ Y say. For the initial segment, the line of integration is moved to −η (say) for some 0 < η < 12 . This gives an estimate 1 2

U Y

1 2

Y2 UX

η (log U Y )5/2 (log log U Y ).

For the tail, one again uses Lemma 4.5 and partial summation to get an estimate 1

U 2Y

1 2

Y2 UX

3/4 (log U Y )5/2 log log U Y.

Chapter 5 Dirichlet L-Functions

110

If we choose U = X = Y , the error terms give a total contribution of Y (log Y )5/2 (log log Y ). Finally, it is easy to check that the squares in the sum over n in S(X, Y ) and in I(X, Y ) give a total contribution of cY (log Y )3 + O(Y (log Y )2 ) where

1 4p2 − 3p + 1 c= 2 1− = 0. 8π p p4 + p3

Combining all of these estimates proves Theorem 4.2.

§5 Non-vanishing for a positive proportion of characters, I We shall discuss the following result from [BM]. Theorem 5.1

Suppose that q is prime and suﬃciently large. Unconditionally, #{χ(mod q) : L(1/2, χ) = 0} ≥ cφ(q)

where c ≥ .04. The assumption on q can be weakened signiﬁcantly but only at the cost of making the proof more intricate due to the presence of imprimitive characters. To prove the theorem, we have to consider a molliﬁer polynomial M (s, χ) = λ(n)χ(n)n−s n≤Z

where λ(n) are the Barban-Vehov weights and Z is a parameter to be chosen. √ More precisely, we will choose Z = q, and Y = (log q) and ⎧ 1≤n≤Y ⎨ μ(n) log Z/n λ(n) = μ(n) log Y ≤n≤Z Z/Y ⎩ 0 n ≥ Z. The molliﬁer polynomial is supposed to be a good approximation to 1/L(s, χ). Let us also set a(n) = λ(d). d|n

Observe that a(1) = 1,

a(n) = 0 f or 1 < n ≤ Y.

§5 Non-vanishing for a positive proportion of characters, I

111

Now, we consider the integral 1 1 1 L( + w, χ)M ( + w, χ)X w Γ(w)dw. 2πi (2) 2 2 Here X is a parameter to be speciﬁed later. We ﬁnd that the integral is S(χ) =

∞ a(n)χ(n) n exp(− ). 1 X 2 n n=1

On the other hand, moving back the line of integration, we see that it is 1 1 1 1 1 L( , χ)M ( , χ) + L( + w, χ)M ( + w, χ)X w Γ(w)dw. 2 2 2πi (−η) 2 2 Applying the functional equation in the integral, we get 1 1 1 1 L( − w, χ)M ¯ ( + w, χ)γ( + w, χ)X w Γ(w)dw 2πi (−η) 2 2 2 where γ(s, χ) =

g(χ) 1 ia q 2

12 s− 12 π 2 2π sin (a + s) Γ(1 − s) π q 2

where a = 0, 1 and χ(−1) = (−1)a and g(χ) is the Gauss sum determined by χ. In applying the functional equation, we are assuming that χ is primitive. Now, we can expand L( 12 − w, χ) ¯ as a Dirichlet series provided η > 12 . We split this series at Z and so we get two integrals and the formula 1 1 S(χ) = L( , χ)M ( , χ) + I(χ) + J(χ). 2 2 Now we compare mean and mean square estimates for S(χ), I(χ) and J(χ) and show that they are incompatible with the frequent vanishing of L( 12 , χ). Choosing X = q, we prove the following estimates: S(χ) ∼ φ(q) (1)

5 |S(χ)|2 < φ(q) 2 2 |I(χ)| ∼ cφ(q)

where c= and

(2) (3)

4 2 1 1 + 1 − ∼ .374750 π 2 p>2 (p − 2)(p + 1) p>2 (p − 1)2

|J(χ)|

q . log q

(4)

Chapter 5 Dirichlet L-Functions

112

What is the role of the speciﬁc molliﬁer weights in all of the above estimates? The key points are that by an important result of S. Graham [Gra], we have N log N/Y Y
Now, |

χ:L( 12 ,χ)=0

I(χ) + J(χ)| ≤

χ:L( 12 ,χ)=0

|I(χ)| +

|J(χ)|

1/2 q ≤ φ(q)1/2 |I(χ)|2 + O( ) log q √ cφ(q).

Therefore,

S(χ) (1 −

√

c)φ(q).

χ:L( 12 ,χ)=0

On the other hand, by Cauchy-Schwarz, 1/2 1 S(χ) ≤ #{χ : L( , χ) = 0}1/2 |S(χ)|2 . 2 1 χ:L( 2 ,χ)=0

Thus,

√ 1 2 = 0} (1 − c)2 φ(q). #{χ : L( , χ) 2 5 Now we shall give a few more details on this outline. We begin with two results on the Barban-Vehov weights . tions

Let 1 ≤ z1 ≤ z2 . Following Barban and Vehov [BV], we introduce the func μ(n) log(zi /n) if n ≤ zi Λi (n) = 0 if n > zi ,

for i = 1, 2. We also deﬁne

⎧ μ(n) Λ2 (n) − Λ1 (n) ⎨ log(z2 /n) λ(n) = = μ(n) log(z 2 /z1 ) ⎩ log(z2 /z1 ) 0

1 ≤ n ≤ z1 z1 ≤ n ≤ z2 n > z2 .

(5.1)

§5 Non-vanishing for a positive proportion of characters, I

Let us deﬁne a(n) =

113

λ(d).

d|n

Graham [Gra] has found asymptotic estimates for the mean square of the a(n). We recall his main result. We have ⎧ N 1) ⎨ N log(N/z + O 2 2 log (z2 /z1 ) log (z2 /z1) |a(n)|2 = N N ⎩ n≤N log(z2 /z1 ) + O log2 (z2 /z1 )

Proposition 5.2

if z1 < N < z2 if z2 ≤ N.

Applying the Cauchy-Schwarz inequality and Proposition 5.2, we deduce the following. Proposition 5.3

Let r ≤ N be prime and (b, r) = 1. We have

|a(n)|

b
N φ(r)1/2 (log

z2 /z1 )1/2

.

We next obtain an estimate for a shifted convolution. Proposition 5.4

Let 1 ≤ k ∈ Z, t ∈ R and k ≤ M < N. Then we have

a(n)a(n − k)

M
k N + z22 φ(k) (log z2 /z1 )2

The proof will require two preliminary results. We begin by recalling a result from Graham [Gra, Lemma 2]. Lemma 5.5

For any integer r, and any c > 0, μ(n) ' ( Q r log = + Oc σ− 12 (r) log−c (2Q) . n n φ(r) n≤Q (n,r)=1

Lemma 5.6

We have for 1 ≤ d1 , d2 ≤ z2 and r1 , r2 ≥ 1 that

1≤j1 ≤z1 /d1 , 1≤j2 ≤z2 /d2 (j1 ,j2 )=(j1 ,r1 )=(j2 ,r2 )=1

Λ1 (d1 j1 )Λ2 (d2 j2 ) j1 j2

d1 r1 d2 r2 + σ− 12 (d1 r1 ) + σ− 12 (d2 r2 ) . φ(d1 r1 ) φ(d2 r2 )

The same estimate holds even if we drop the condition that (j1 , j2 ) = 1.

Chapter 5 Dirichlet L-Functions

114

Proof. The sum in question is Λ1 (d1 j1 )Λ2 (d2 j2 ) j1 j2

μ(e) =

e|(j1 ,j2 )

μ(e)

Λ1 (d1 j1 )Λ2 (d2 j2 )

e≤z1 /d1

j1 j2

,

the inner sum ranging over j1 , j2 satisfying 1 ≤ j1 ≤ z1 /d1 , 1 ≤ j2 ≤ z2 /d2 j1 , j2 ≡ 0 (mod e),(j1 , r1 ) = (j2 , r2 ) = 1. Let us set r = r1 r2 and d = d1 d2 . Then the sum is seen to be μ(e) Λ1 (d1 e1 )Λ2 (d2 e2 ) 2 e 1 2 e≤z /d ≤z /d e, ≤z /d e 1 1 (e,r)=1

= μ(d1 )μ(d2 )

1

1 1 2 2 2 (1 ,r1 )=(2 ,r2 )=1

⎧ ⎪ μ(e) ⎨

e≤z1 /d1 (e,dr)=1

e2 ⎪ ⎩ ×

= μ(d1 )μ(d2 )

e≤z1 /d1 (e,dr)=1

1 ≤z1 /d1 e (1 ,d1 er1 )=1

⎧ ⎪ ⎨ ⎪ ⎩

⎫ ⎪ μ(1 ) log(z1 /d1 e1 ) ⎬ ⎪ 1 ⎭

2 ≤z2 /d2 e (2 ,d2 er2 )=1

⎫ ⎪ μ(2 ) log(z2 /d2 e2 ) ⎬ ⎪ 2 ⎭

2 μ(e) dk erk −c 2zk 1 + O σ (d er ) log c k k −2 e2 φ(dk erk ) dk e k=1

using Lemma 5.5. The main terms contribute an amount μ(e) μ(d1 )μ(d2 )dr dr · . 2 φ(d1 r1 )φ(d2 r2 ) e≤z /d φ(e) φ(d1 r1 )φ(d2 r2 ) 1 1 (e,dr)=1

The product of the O-terms contributes an amount 1 σ 1 (d1 r1 )σ 12 (d2 r2 )σ− 12 (e)2 σ− 12 (d1 r1 )σ 12 (d2 r2 ). e2 − 2 The cross-terms contribute an amount 1 + d1 er1 2z2 −c · σ 1 (d2 er2 ) log e2 φ(d1 er1 ) − 2 d2 e e≤z /d 1 1 (e,dr)=1

d2 er2 2z2 , · σ− 12 (d2 er2 ) log−c φ(d2 er2 ) d2 e d1 r1 d2 r2 σ− 12 (d2 r2 ) + σ− 12 (d1 r1 ) φ(d1 r1 ) φ(d2 r2 ) +

§5 Non-vanishing for a positive proportion of characters, I

115

since the series σ− 12 (e)/eφ(e) converges. This proves the ﬁrst statement. The second statement is easy to verify since there is now no condition relating j1 and j2 . We argue as above setting e = 1. Now we are ready to prove the estimate of the shifted convolution. Proof of Proposition 5.4. Again, we consider the sum ⎛ ⎞⎛ ⎞ ⎝ Λ1 (d)⎠ ⎝ Λ2 (e)⎠ M
d|n

(5.2)

e|n−k

and we ﬁnd that it is equal to

Λ1 (d)Λ2 (e)

1.

(5.3)

M
d,e

We see that the inner sum is zero unless (d, e)|k and if (d, e) divides k it is N −M + O(1). [d, e] Thus, (5.3) is (N − M )

Λ1 (d)Λ2 (e) + +O( |Λ1 (d)Λ2 (e)|) [d, e] d,e d,e (d,e)|k

(5.4)

(d,e)|k

The O-term is easily seen to be z1 z2 . To evaluate the main term, we see that the sum over d, e is Λ1 (d)Λ2 (e) φ(m). de d,e m|d (d,e)|k

m|e

This is seen to be equal to φ(m) Λ1 (md0 )Λ2 (me0 ) . m2 d0 e0 m|k

d0 ,e0

Here, the inner sum ranges over pairs d0 , e0 satisfying 1 ≤ d0 ≤

z1 , m

1 ≤ e0 ≤

z2 m

(d0 , m) = (e0 , m) = 1.

(5.5)

Chapter 5 Dirichlet L-Functions

116

Also note that in the outer sum m must be squarefree for otherwise Λ1 (md0 ) = Λ2 (me0 ) = 0. Thus, invoking Lemma 5.6, we ﬁnd that the main term in (5.5) is 2 μ2 (m)φ(m) m + σ− 12 (m) m2 φ(m) m|k

k . φ(k)

Hence the main term in (5.4) is

k N. φ(k)

Summarizing, (5.3) is k N + z1 z2 . φ(k)

The proposition follows.

Next, we discuss the molliﬁer polynomial. We introduce the parameters Y = (log q) Z = q 1/2

.

Corresponding to the choices z1 = Y and z2 = Z, we have the weights λ(n) =

Λ2 (n) − Λ1 (n) . log(Z/Y )

We deﬁne the Dirichlet polynomial M (s, χ) =

λ(n)χ(n) ns

n≤Z

where χ is a Dirichlet character. Then, we have L(s, χ)M (s, χ) = where a(n) =

∞ a(n)χ(n) ns n=1

λ(d)

d|n

satisﬁes a(1) = 1 a(n) = 0 for 1 < n ≤ Y. We record the following estimate.

§5 Non-vanishing for a positive proportion of characters, I

117

For σ = 12 , we have

Lemma 5.7

|M (s, χ)|2

χ(mod q)

1 (q + Z) 1 · (q 2 −σ · + Y 1−2σ ). (1 − 2σ) (log q)2

Proof. We use the large sieve inequality [D] to get

|λ(n)|2 n2σ n≤Z ⎧ ⎫ ⎨ 1 log Z/n 2 1 ⎬ (q + Z) + · 2σ ⎩ n2σ log Z/Y n ⎭ n≤Y Y
|M (s, χ)|2 (Z + q)

χ(mod q)

The result follows from our choices of Y and Z. Now we consider the basic equation that relates the above quantities. Let us deﬁne ∞ a(n)χ(n) −n/q S(s, χ) = S(s, χ, q) = e . ns n=1 Let s ∈ C with 1 > σ = Re(s) ≥ 12 . Using the well-known identity

1 2πi

X w Γ(w)dw = e−1/X ,

(2)

we ﬁnd that for a character χ, S(s, χ) =

1 2πi

L(s + w, χ)M (s + w, χ)q w Γ(w)dw. (2)

Moving the line of integration to the left, we ﬁnd that S(s, χ) = L(s, χ)M (s, χ) +

1 2πi

L(s + w, χ)M (s + w, χ)q w Γ(w)dw

(5.6)

(−η)

where σ < η < 1. We can decompose the integral along the line −η into two parts as follows. Suppose that χ is non-trivial. We apply the functional equation L(s, χ) = γ(s, χ)L(1 − s, χ) ¯

Chapter 5 Dirichlet L-Functions

118

where g(χ) γ(s, χ) = 1 ia q 2

12 s− 12 π 2 2π sin (a + s) Γ(1 − s). π q 2

(Here g(χ) is the Gauss sum, a = 0, 1 and χ(−1) = (−1)a .) Then we truncate the Dirichlet series expansion of L(1 − s − w, χ) at Z. Let us set * χ(n) 1 I(s, χ) = γ(s + w, χ) M (s + w, χ)q w Γ(w)dw 2πi (−η) n1−s−w n
and 1 J(s, χ) = 2πi

γ(s + w, χ) (−η)

⎧ ⎨ ⎩

n≥Z

⎫ χ(n) ⎬ M (s + w, χ)q w Γ(w)dw n1−s−w ⎭

Thus, we get S(s, χ) = L(s, χ)M (s, χ) + I(s, χ) + J(s, χ).

(5.7)

If L(s, χ) = 0, then S(s, χ) and I(s, χ) + J(s, χ) are equal. We will therefore try to show that, in general, they are not equal and for this purpose we study their mean values. We begin with J(s, χ) which is the easiest of the three to estimate. For | Im s| < 1 and 0 ≤ σ ≤ 1, we have

Proposition 5.8

1=χ(mod q)

q 2 −σ |J(s, χ)| log q 3

Proof. From Stirling’s formula, we know that γ(s, χ) (q(|s| + 1)) 2 −σ . 1

Using this and the deﬁnition, we ﬁnd that

|J(s, χ)| q

1=χ(mod q)

|

n≥Z

1 2 −σ+η

q

−η

χ(mod q)

(|w| + 1) 2 −σ+η 1

(−η)

χ(n) ||M (s + w, χ)||Γ(w)||dw| n1−s−w

which by a double application of the Cauchy-Schwarz inequality is ⎛ ⎞ 12 χ(n) 1 ⎝ (|w| + 1)1−2σ+2η | q 2 −σ |2 |Γ(w)||dw|⎠ n1−s−w χ(mod q)

×

12 |M (s + w, χ)|2 |Γ(w)||dw|

n≥Z

§6 Non-vanishing for a positive proportion, II

⎛ q

1 2 −σ

⎝

(|w| + 1)1−2σ+2η |

⎝

⎞ 12

n≥Z

χ(mod q)

⎛

119

χ(n) n1−s−w

|2 |Γ(w)||dw|⎠ ×

⎞ 12 |M (s + w, χ)|2 |Γ(w)||dw|⎠ .

χ(mod q)

Using the large sieve inequality and Lemma 5.7, we ﬁnd that

|J(s, χ)| q 2 −σ 1

⎧ ⎨ ⎩

(q + n)n2(σ−η−1)

n≥Z

1=χ(mod q)

⎫ 12 ⎬ ⎭

×

* 12 1 (q + Z) q 2 −σ+η 1−2(σ−η) × ·( +Y ) 1 − 2(σ − η) (log q)2 12 1 q 1 1 −σ σ−η q2 Z + × Z |2(σ − η) − 1| |σ − η| * 1 1 1 (q + Z) 2 q 2 ( 2 −σ+η) × . 1 log q |2(σ − η) − 1| 2 Now, let us choose η so that it satisﬁes 1 1 > |η − σ| > 4 8

(say)

if σ < 34 . We would then have 1=χ(mod q)

q 2 −σ |J(s, χ)| log q 3

(5.8)

which proves the result.

§6 Non-vanishing for a positive proportion, II Next, we study the mean and mean square of S(s, χ) Proposition 6.1

For any > 0, we have S(s, χ) = φ(q) + O (q 1−σ+ ). χ (modq)

Moreover, the same estimate holds if we sum only over non-trivial characters.

Chapter 5 Dirichlet L-Functions

120

Proof. By deﬁnition, we have that

S(s, χ) =

χ(mod q)

∞ a(n) −n/q e ns n=1

= φ(q)

∞ n=1 n≡1(mod q)

χ(n)

χ(mod q)

a(n) −n/q e . ns

Using the bound |a(n)| ≤ d(n) n , we ﬁnd that the sum is −1/q

e

+ O

The O-term is

1 q σ−

∞

−σ

t

exp(−t) .

t=1

q −σ+ .

It thus follows that

S(s, χ) = φ(q) + O (q 1−σ+ ).

χ(mod q)

Finally,

a(n) e−n/q q 1−σ+ ns

S(s, 1) =

(n,q)=1

as before. This proves the result. Proposition 6.2

We have χ(mod q)

1 1 5 |S( , χ)|2 = φ(q) + O(q(log q)− 2 ). 2 2

Proof. We see that the sum is equal to ∞

a(n1 )a(n2 ) exp(−(n1 + n2 )/q) χ(n1 )χ(n2 ) 1 2 n1 ,n2 =1 (n1 n2 ) χ(mod q) which is seen to be φ(q)

∞ n1 ,n2 =1

a(n1 )a(n2 ) exp(−(n1 + n2 )/q), 1 (n1 n2 ) 2

(6.1)

§6 Non-vanishing for a positive proportion, II

121

where the sum ranges over pairs (n1 , n2 ) satisfying n1 ≡ n2 (mod q),

(n1 , q) = (n2 , q) = 1

We split the double sum into three pieces Σ1 + Σ2 + Σ3 . In Σ1 , we have n1 < n2 , in Σ2 we have n1 > n2 , and in Σ3 we have n1 = n2 . The estimation of Σ1 and Σ2 is the same, so we only consider Σ1 . We have Σ1 =

∞ n1 =1 (n1 ,q)=1

∞

a(n1 ) exp(−n1 /q) 1 2

a(n2 ) exp(−n2 /q) 1

n1

.

(6.2)

n22

n2 =1 n2 ≡n1 (mod q) n2 >n1

We begin by considering the sum over n2 . We must necessarily have n2 > q for if n2 ≤ q, then n1 ≤ q also and so the congruence n2 ≡ n1 (mod q) would force n1 = n2 . We split Σ1 into three subsums Σ11 , Σ12 and Σ13 where in Σ11 we have n2 ≥ q log q in Σ12 we have q ≤ n1 < q log q and n1 < n2 < q log q. in Σ13 we have n1 < q and q < n2 < q log q. In Σ11 , we see, by partial summation, that the sum over n2 is ⎧ ⎫ ⎪ ∞ ⎪ ⎨ ⎬ 1 q−1 |a(n)| u− 2 e−u/q du. ⎪ q log q ⎪ ⎩ n1
We have from Proposition 5.3 that |a(n)| n1
u 1 2

1

φ(q) (log q) 2

.

Thus, we ﬁnd that the integral is

1 3/2 q (log q)1/2

∞

u 2 e−u/q du 1

q log q

and this is (log q)−1/2

∞

v 2 e−v dv 1

log q

q

−1

.

Inserting this into the n1 -sum, using Proposition 5.2, the Cauchy-Schwarz inequality and partial summation, we have 1

Σ11

q2

1 (log q) q 1 2

q− 2 (log q)− 2 . 1

1

Chapter 5 Dirichlet L-Functions

122

Now we consider the contribution of Σ12 . This is

a(n1 )e−n1 /q 1 2

1

n1

q≤n1
a(n2 )e−n2 /q

.

n22

n1
We split the n1 sum into O(log log q) sums of the form

a(n1 )e−n1 /q 1 2

1

n1

U
a(n2 )e−n2 /q

.

n22

n1
Let us write n2 = n1 + jq. The above double sum may therefore be written as

e−j

e−2n1 /q

a(n1 )a(n1 + jq) 1

1

(6.3)

n12 (n1 + jq) 2

U
j
If we drop the condition (n1 , q) = 1, then we introduce an additional sum

e−j

j
e−2k

U
a(kq)a((k + j)q) 1 1 . (kq) 2 ((k + j)q) 2

(6.4)

Observe that as q is prime, and λ(n) = 0 for n > Z = q 1/2 , we have a(kq) =

λ(d) =

d|kq

λ(d) = a(k).

d|k

Therefore, we have the estimate |a(kq)| ≤ d(k) k . A similar estimate holds for a((k + j)q). Using this in (6.4), we see that it is q−1

e−j

j
and this is

U
e−2k k

1 2 −

(k + j) 2 − 1

q−1 .

The sum in (6.3) may thus be replaced by j
e−j

U
e−2n1 /q

a(n1 )a(n1 + jq) 1

1

n12 (n1 + jq) 2

(6.5)

§6 Non-vanishing for a positive proportion, II

Let us set G(u) =

123

a(n1 )a(n1 + jq).

U
By Proposition 5.4, we see that for U < u, G(u)

j φ(j)

(u + (j + 1)q) (log q)2

.

The sum over n1 in (6.5) can be estimated using partial summation. We ﬁnd that it is equal to 2U 2u 2u 2U G(u)e− q e− q + G(u)d . 1 1 1 1 u 2 (u + jq) 2 U u 2 (u + jq) 2 U Using the estimate for G(u) quoted above, we see that this is 1

e−2U/q

j (U + jq) 2 U (log q)−2 . 1 φ(j) q 2 U

Incorporating these estimates into the sum over j, we ﬁnd that (6.5) is for σ =1

(U + jq) 12 U 12 j −j e−2U/q (log q)−2 e q φ(j)

j
which is

1 1 U 2 e−2U/q −j j e (U + jq) 2 q(log q)2 φ(j)

j
q (log q)−3/2 1 2

U 2 −2U/q e . q

Now summing this over U, we ﬁnd it is (log q)−3/2 . Now we discuss the contribution of Σ13 . By the Cauchy-Schwarz inequality, we see that ⎛ ⎞ 2 −n2 /q a(n1 )2 a(n2 )e ⎜ ⎟ |Σ13 |2 exp(−2n1 /q) ⎝ ⎠. 1 n1 q
n1 ≤q

2 n2 ≡n1 (mod q)

2

Chapter 5 Dirichlet L-Functions

124

The ﬁrst factor above is O(1) as can be seen from our discussion of Σ3 below. As for the second factor, we see that it is equal to

a(n2 )e−n2 /q a(n2 )e−n2 /q (n2 ) 2

1

1

n22

q
.

Again, we split this sum into three sums according as n2 < n2 , n2 = n2 , and n2 > n2 . The third is the same as the ﬁrst. Also, we note that the ﬁrst sum is just Σ12 which we have estimated above as being (log q)−3/2 . As for the second, we see that it is equal to q≤n2
a(n2 )2 e−2n2 /q . n2

Using Proposition 5.2 and partial summation, this is (log q)−1 . Inserting this into the above, we deduce that Σ13 (log q)−3/4 . Finally, we discuss the estimation of Σ3 , namely the terms with n1 = n2 . Thus, Σ3 =

∞ a(n)2 exp(−2n/q) = + + . n n=1 n>q n≤Y

(n,q)=1

(6.6)

Y
Since a(n) = 0 for 1 < n ≤ Y , we have = exp(−2/q)

(6.7)

n≤Y

Also, by partial summation and Proposition 5.2, we ﬁnd that (log(z2 /z1 ))−1 . n>q

Thus, we see from (6.6) - (6.8) that Σ3 = 1 +

a(n)2 exp(−2n/q) + O((log q)−1 ) n Y
(6.8)

§6 Non-vanishing for a positive proportion, II

125

Let us denote the sum on the right by S. We ﬁnd that a(n)2 n S= 1+O . n q Y
Now, the O-term is

1 a(n)2 q Y
1 1 q q log(z2 /z1 ) (log q)−1 . The main term is equal to

a(n)2 . n

Y
Finally, using Proposition 5.2, we get a(n)2 a(n)2 = + + . n n n
Z
Y
The ﬁrst sum is equal to 1 since a(n) = 0 for 1 < n ≤ Y . Using Proposition 5.2 and partial summation, we see that the second sum is (log n/Y ) 1 1 + O . · (log Z/Y )2 n log Z/Y Y
=

1 +O 2

1 log q

.

Similarly, the third sum is Z
1 1 +O log Z/Y n

which is =1+O

1 log q

1 log q

.

Putting these together we deduce that

a(n)2 5 1 = 1 + O( ) . n 2 log q n
This completes the proof of the proposition. We need also a result on the mean square of I( 12 , χ). We only state the result and refer the reader to [BM, Proposition 10.1] for the proof.

Chapter 5 Dirichlet L-Functions

126

We have

Proposition 6.3

1 q log log q |I( , χ)|2 = cφ(q) + O( ). 2 log q

Now we can put together the above results to prove Theorem 5.1. Proof of Theorem 5.1. Let s0 ∈ C satisfy (5.7). For χ = 1, we have

1 2

≤ Re s0 < 1 −

1 logq .

We return now to

S(s0 , χ) = L(s0 , χ)M (s0 , χ) + I(s0 , χ) + J(s0 , χ). Thus,

S(s0 , χ) =

'

( I(s0 , χ) + J(s0 , χ) + S(s0 , χ)

χ=1

where ranges over χ = 1 such that L(s0 , χ) = 0 and non-trivial χ(mod q). By Proposition 6.1, we have

over the remaining

' ( S(s0 , χ) = φ(q) + O q 1−σ+ .

χ=1

Thus, we have

S(s0 , χ) = φ(q) −

' ( ' ( I(s0 , χ) + J(s0 , χ) + O q 1−σ0 +

and consequently,

' ( S(s0 , χ) ≥ φ(q) − | I(s0 , χ) + J(s0 , χ) | ' ( + O q 1−σ+

Now, assuming | Im s0 | < 1, ' ( | I(s0 , χ) + J(s0 , χ) | ≤ |I(s0 , χ)| + |J(s0 , χ)| (1 1' ≤ φ(q) 2 |I(s0 , χ)|2 2 3 q 2 −σ +O log q by Proposition 5.8. Now using Proposition 6.3, we have 1 ' 1 ( √ 1 log log q 2 | I( , χ) + J( , χ) | ≤ cφ(q) + O q + O(q(log q)−1 ). 2 2 log q

§6 Non-vanishing for a positive proportion, II

127

Thus, 1 1 √ 1 log log q 2 + | S( , χ)| ≥ (1 − c)φ(q) + O(q 2 ) + O q 2 log q + O(q(log q)−1 ). On the other hand, by the Cauchy-Schwarz inequality, setting N (s0 , q) to be the number of χ(mod q) with L(s0 , χ) = 0, we get

|

( ' S(s0 , χ)|2 ≤ N (s0 , q) |S(s0 , χ)|2 .

We have from Proposition 6.2 χ(mod q)

1 1 5 |S( , χ)|2 = φ(q) + O(q(log q)− 2 ). 2 2

We deduce that / √ 2 1 2 log log q 1 φ(q)(1 − c) + O q ≤ N ( , q) 1 + O((log q)− 2 ) . 5 log q 2 Thus,

/ √ 2 1 log log q 2 N ( , q) ≥ φ(q)(1 − c) + O q . 2 5 log q

The methods of this section in fact prove a signiﬁcantly stronger result [BM, Theorem 11.1]. We state it below. Theorem 6.4 primes q,

Fix a σ in the interval

1 2

≤ σ < 1. Then, for all suﬃciently large

L(σ, χ) =0 for a positive proportion of the characters χ(mod q). Remark. The proof produces a lower bound for this proportion. How large q must be taken will depend on σ. Finally, we state another result [BM, Theorem 12.1] which can be proved by reﬁning the techniques described above. Theorem 6.5 Let q be a suﬃciently large prime. There is an absolute constant c > 0 such that for a positive proportion of the characters χ mod q, L(σ, χ) = 0 in the interval 1 c + ≤ σ ≤ 1. 2 log q

Chapter 5 Dirichlet L-Functions

128

§7 A conditional improvement It is an interesting question to ask whether Theorem 5.1 can be strengthened by assuming the Riemann Hypothesis. Of course, the Riemann Hypothesis tells us that there are no zeroes for Re(s) > 12 , but it gives no direct information about s = 12 . The following result is due to R. Murty [RM]. Theorem 7.1 Let q be a prime. Assume the Riemann Hypothesis for all the Dirichlet L-functions L(s, χ). The number of characters χ mod q with L( 12 , χ) =0 is at least 12 φ(q). The proof depends on the explicit formula method. Let us write ∞ L − (s, χ) = Λ(n)χ(n)n−s . L n=1

Proposition 7.2 Let F be a function satisfying the following hypotheses: (i) For some > 0, F (x) exp((1 + )x) is integrable and of bounded variation. (ii) The function (F (x) − F (0))/x is of bounded variation. For such a function deﬁne the transform ∞ φ(γ) = F (x)eiγx dx. −∞

Then γ

φ(γ) = 2F (0) log

√ q 1 ∞ Γ + φ(−i/2)δχ − (1 + it)φ(t)dt 2π π −∞ Γ −2

∞ Λ(n)χ(n) F (log n) n n=1

where the sum on the left hand side is over γ such that L( 12 + iγ, χ) = 0 and 1 1 2 ≤ Re( 2 + iγ) ≤ 1 and 1 if χ is the principal character δχ = 0 otherwise. The following two lemmas are proved easily by straightforward calculations. Lemma 7.3

Let T > 0 and deﬁne F (x) = 2T − |x| 0

if |x| ≤ 2T otherwise.

Then F satisﬁes the conditions of Proposition 7.2 and 2 2 sin(γT ) φ(γ) = . γ

Exercises

Lemma 7.4

129

Let T > 1. Then 2 ∞ Γ sin tT (1 + it) dt T. Γ t 0

Proof of Theorem 7.1. We choose the function F given in Lemma 7.3 and apply the explicit formula to the Dedekind zeta function of Q(ζq ): Z(s) = L(s, χ). χ(mod q)

As we are assuming the Riemann Hypothesis, all of the γ are real. Let us set T =

1 log x. 2

Let rχ denote the order of zero of L(s, χ) at the point s = 12 . Then we have the inequality 1 √ rχ (log x)2 ≤ 2φ(q)(log x)(log q/2π) + 4x 2 + O(φ(q) log x) − 2φ(q) χ

×

n≤x n≡1(mod q)

1 x Λ(n) log . n n

We can discard the last sum on the right as it is non-negative. Now choosing x = φ(q)2 gives

rχ ≤

χ

1 φ(q) + O(φ(q)/ log q). 2

This proves the result.

Exercises 1.

Prove the Polya-Vinogradov estimate for imprimitive characters as follows. Let χ mod q be induced from χ1 mod q1 and write q = q1 r. Then χ(n) = χ1 (n). n≤x

n≤x (n,r)=1

Express the condition (n, r) = 1 in terms of the M¨ obius function to deduce that the quantity to be estimated is μ(d)χ1 (d) χ1 (m). d|r

m≤x/d

Chapter 5 Dirichlet L-Functions

130

Now apply the Polya-Vinogradov estimate for the inner sum and deduce that the above quantity is 1

q12 log q1

1

|μ(d)| q 2 log q1 .

d|r

2.

1

Prove the estimate of Theorem 3.1 in the case Y ≤ X 2 as follows. (i) First, D S(X, Y ) ≤ + d(n2 ) 1. rs 2 2 r,s≤Y rs =a2

Show that

|D|≤X

n ≤Y

|D|≤X

d(n2 ) Y (log Y )2

n≤Y

to deduce that the second sum is XY (log Y )2 . (ii) As for the ﬁrst sum, prove that if we include square values of D in the inner sum, we increase the whole quantity only by an amount which is 1 Y 2 X 2 ≤ XY . (iii) View the inner sum as ∗ χ(D) |D|≤X

where χ(D) = (D/rs) is a character of conductor ≤ 4rs and the sum is over integers D in the speciﬁed range and which satisfy (ii) in the deﬁnition of D. Show that this sum can now be expressed by a ﬁnite number of character sums with nonprincipal characters of conductor ≤ 16rs. Apply the Polya-Vinogradov estimate to deduce that the sum 1 over D is (rs) 2 log Y . Now sum over r, s to deduce that the whole sum is Y 3 log X ≤ XY log X. 3.

Prove the following variant of the estimate of Theorem 3.1: 2 ∗ D XY (log X)4 . n |D|≤X n≤Y Here, the sum over n is restricted to fundamental discriminants. To deduce this from Theorem 3.1, begin by writing the inner sum as ⎞ ⎛ ∗ D D ⎠ ⎝ = μ(d) n n 2 n≤Y

n≤Y

d |n

References

and rearrange as

d≤Y

4.

1 2

131

D μ(d) . n n≤Y d2 |n

Prove that when n is a square, 0
d Y 1+s ds = cn + O((|s| + 1)d(n)Y n 1+s

1 2 +σ

)

for σ > 0. Here, the summation is over fundamental discriminants. 5.

Prove that for some constant c, we have ∞ 1 cm 3 1 1 2 exp(−m /X) = 2 1− ( log X + c) + O(X − 2 + ). m π p(p + 1) 2 p m=1

6.

Using the mean-value estimate of Jutila [Jut3] |D|≤X

T

−T

1 |L( + it, χ)|2 XT (log XT )16 2

deduce the estimate of Lemma 4.4.

References [B]

R. Balasubramanian, A note on Dirichlet’s L-functions Acta Arith., (1980), 273–283

38

[BM] R. Balasubramanian and V. Kumar Murty, Zeros of Dirichlet L-functions, Ann. Scient. Ecole Norm. Sup., 25 (1992), 567–615 [BV] M.B. Barban and P.P. Vehov, On an extremal problem Trans. Moscow Math. Soc., 18 (1968), 91–99 [D] H. Davenport, Multiplicative Number Theory, Springer-Verlag, 1980. [Gra] S. Graham, An asymptotic estimate related to Selberg’s sieve J. Numb. Thy., 10 (1978), 83–94 [HB] R. Heath-Brown, The fourth power mean of Dirichlet’s L-functions, Analysis, 1 (1981), 25–32 [Hil] A. Hildebrand, Large values of character sums, J. Numb. Thy., 29 (1988), 271–296 [Jut1] M. Jutila, On character sums and class numbers, J. Numb. Thy., 5 (1973), 203–214

132

Chapter 5 Dirichlet L-Functions

[Jut2] M. Jutila, On the mean value of L( 12 , χ) for real characters, Analysis, 1 (1981), 149–161 [Jut3] M. Jutila, On mean values of L-functions and short character sums with real characters, Acta Arith., 26 (1975), 405–410 [KM] V. Kumar Murty, Non-vanishing of L-functions and their derivatives, in: Automorphic forms and analytic number theory, pp. 89–113, ed. R. Murty, CRM, Montreal, 1990. [Me] J.-F. Mestre, Formules explicites et minorations de conducteurs de vari´et´es alg´ebriques, Comp. Math., 58 (1986), 209–232 [MV] H.L. Montgomery and R.C. Vaughan, Exponential sums with multiplicative coeﬃcients, Invent. Math., 43 (1977), 69–82 [RM] M. Ram Murty, Simple zeroes of L-functions, in: Number Theory, ed. R. Mollin, pp. 427–439, de Gruyter, 1989. [Sie] C. L. Siegel, On the zeros of the Dirichlet L-functions, Annals of Math., 46 (1945), 409–422 [Vino] I. M. Vinogradov, The method of trigonometrical sums in the theory of numbers, Interscience, London-New York, 1955.

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

§1 Introduction Statement of results Let f be a holomorphic cusp form for Γ0 (N ) of weight 2 and character . We assume that f is a normalized newform for the Hecke operators. Denote by L(s, f ) the L-function attached to f. For Re(s) > 3/2, it is given by an absolutely convergent Dirichlet series ∞ a(n) L(s, f ) = . ns n=1 For L(s, f ) we have the functional equation As Γ(s)L(s, f ) = ωA2−s Γ(2 − s)L(2 − s, f¯),

where A = deﬁned by

√ N /2π, ω is a complex number of absolute value 1 and L(s, f¯) is L(s, f¯) =

∞ a ¯(n) ns n=1

for Re(s) > 3/2 and by analytic continuation for all values of s. (This series actually converges (conditionally) for Re(s) > 5/6. Thus, the functional equation and the Dirichlet series serve to deﬁne L(s, f ) for all values of s.) ' ( For any D, let χD denote the quadratic character D. mod D. Let us set LD (s, f ) =

∞ a(n)χD (n) . ns n=1

Let D be a fundamental discriminant (that is, the discriminant of a quadratic ﬁeld). Thus, D ≡ 1(mod 4) and D is squarefree, or D = 4D0 , D0 ≡ 2, 3(mod 4)

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_7, © Springer Basel AG 1997

133

134

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

and D0 is squarefree. If (D, N ) = 1, then LD (s, f ) satisﬁes the functional equation (A|D|)s Γ(s)LD (s, f ) = ωχD (−N ) (D)(A|D|)2−s Γ(2 − s)LD (2 − s, f¯). The problem considered in this chapter is whether there exists a fundamental discriminant D prime to N such that LD (1, f ) = 0. This question has been considered in the case = 1 and settled aﬃrmatively by Waldspurger [W1], [W2]. Using a completely diﬀerent approach, K. Murty [M] showed that the answer is always aﬃrmative. Theorem 1.1 There exist inﬁnitely many fundamental discriminants D such that LD (1, f ) = 0. The purpose of this chapter is to give an exposition of this result. For forms with nontrivial character, Shimura [Sh] showed some years ago that there exists a twist (not necessarily quadratic) such that the twisted L-function does not vanish at s = 1. This has been generalized to number ﬁelds and general points by Rohrlich [Ro] who shows that given any point s0 ∈ C there exists a twist (not necessarily quadratic) such that the twisted L-function does not vanish at s0 . It should be pointed out that in the context of number ﬁelds, there are examples of Waldspurger to show that in general, we may not be able to ﬁnd a quadratic twist such that the twisted L-function does not vanish at the central critical point. Indeed, Waldspurger works with cuspidal automorphic representations of PGL(2) over any number ﬁeld, and produces a necessary and suﬃcient condition for the existence of such a twist. This condition is always satisﬁed when the ﬁeld is Q and the representation corresponds to a holomorphic modular form. Some references to more recent work on this question are given at the end of this section as well as in Chapter 8. Though we have stated the result for forms of weight 2, we remark that it remains valid for holomorphic cusp forms of any weight ≥ 2. We shall in fact prove the following result from which Theorem 1.1 will follow immediately. Theorem 1.2

Let a ≡ 1(mod 4), (a, 2N ) = 1. Then, 1 Y Y LD (1, f ) dt = C(f )Y + O( ) Y 1 (log Y )ν |D|≤t D≡a(mod 8N )

where C(f ) = 0 and 0 < ν < ρ = 1 −

√ √ 2 + 3 3) = .0652 . . . .

1 √ ( 5 2

Note that in Theorem 1.2, the sum is over all D with |D| ≤ t, D ≡ a(mod 8N ) and not only over fundamental discriminants. To deduce Theorem 1.1 from Theorem 1.2, one has only to check that if there were only a ﬁnite number of fundamental discriminants D0 such that L = 0, then the quantity estimated in √D0 (1, f ) Theorem 1.2 can be shown to be O( Y (log Y )). (The interested reader is referred to Exercise 1 below, or to [MM, pp. 450–451] where a similar calculation is carried out in detail.)

§1 Introduction

135

Notation We begin by introducing some notation. If D0 is a fundamental discriminant which is coprime to N, then we may write the functional equation in the unsymmetric form LD0 (1 + s, f ) = ωχD0 (−N ) (D0 )|D0 |−2s A−2s Let us deﬁne μ ˜ by

Γ(1 − s) LD (1 − s, f¯). Γ(1 + s) 0

∞ 1 μ ˜(n, f ) = . L(s, f ) n=1 ns

Any D ≡ 1(mod 4) can be written as, D = D0 δ 2 , where D0 is a fundamental discriminant and μ ˜(d, f ) D0 LD (s, f ) = LD0 (s, f ) . ds d 2 d|δ

For β = ±1, let us set f˜Yβ (n, s; a) =

0<βD≤Y D≡a(mod 8N )

fYβ (n, s; a)

=

D n

0<βD0 ≤Y D0 ≡a(mod 8N )

Let us also set 1 = Y

and gYβ (n, s; a)

|D|s ,

D0 n

f˜Yβ (n; a) = f˜Yβ (n, 0; a),

g˜Yβ (n, s; a)

1 = Y

D unrestricted

|D0 |s ,

D0 fundamental

fYβ (n; a) = fYβ (n, 0; a).

Y

f˜tβ (n, s; a) dt 1

Y

ft (n, s; a)dt. 1

We shall write d(n) for the number of positive divisors of n. Also, b stands for a generic integer. Thus the statement nd = b2 means that nd is not the square of an integer. For an integer n we shall often write n = n1 n2 where (n2 , 2N ) = 1 and p|n1 =⇒ p|2N. The complex numbers s0 and s1 have real parts σ0 and σ1 (respectively). We set log+ x = log x if x > 1 1 otherwise.

136

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

Outline of Proof. Let D be an integer with D ≡ 1(mod 4). As in [MM], we begin by considering the integral 1 2πi This is equal to

LD (1 + s, f )X s Γ(s)ds. (2)

∞ a(m) D exp(−m/X). m m m=1

On the other hand, moving the line of integration to the left, we obtain a residue at s = 0 equal to LD (1, f ) and an integral 1 2πi

LD (1 + s, f )X s Γ(s)ds. (−η)

Here, 0 < η < 1 and the integration is on the line Re(s) = −η. In the above notation, LD (s, f ) = LD0 (s, f )

μ ˜(d, f ) D0 d|δ 2

ds

d

.

Applying the functional equation for LD0 (s, f ) we see that the integral is (D0 )ωχD0 (−N )× 1 × 2πi

−2s

|D0 |

LD0 (1 − s, f¯)

(−η)

s μ ˜(d, f ) D0 Γ(1 − s) X d|δ 2

d1+s

d

Γ(1 + s)

A2

Γ(s)ds.

Now, let us ﬁx an integer a such that a ≡ 1(mod 4) and (a, 2N ) = 1. Unlike [MM], we now sum the above equation over all D satisfying |D| ≤ Y with D ≡ a(mod 8N ). Thus, we include both positive and negative values of D. This is an important ingredient in controlling the error terms. (See the discussion later in this section and also Lemma 2.2 and Lemma 5.4). If we take η > 12 , then LD0 (1−s, f¯) is given by an absolutely convergent Dirichlet series. Inserting this and rearranging, we ﬁnd that LD (1, f ) = S(X, Y ) + I(X, Y ) |D|≤Y D≡a(mod 8N )

where

∞ 0 a(m) ˜+ S(X, Y ) = fY (m; a) + f˜Y− (m; a) exp(−m/X) m m=1

§1 Introduction

137

and I(X,Y ) = ω (a) 1 × 2πi

a μ ˜(d,f ) ¯(δ 2 ) × N d √ 2 δ≤ Y (δ,2N )=1

d|δ

0 Γ(1 − s) X s a ¯(n) + − f 2 (nd,2s;a) − fY /δ2 (nd,2s;a). Γ(s)ds. n1−s Y /δ Γ(1 + s) A2 d (−η)

We shall prove that 1 Y where

Y

S(X, t)dt = C(f )Y + O(X(log X)−ρ )

1

1 a(n1 n22 ) a φ(n2 ) C(f ) = . 8N n ,n n1 n22 n1 n2 1

2

This is similar to the constant that occurs in [MM, Theorem 1] and an analogous argument shows that it is nonzero. Actually, we work with a more general series. For (h, 2N j) = 1, and Re(s0 ), Re(s1 ) ≥ 0, deﬁne C(s0 , s1 , j, h) =

1 8N (1 + 2s0 )(1 + s0 )

n=n1 n2 n2 h=b2 (n2 h,j)=1

a(n) n1+s1

a n1

φ(n2 h) . n2 h

That the series converges in this domain follows from estimates in §3. Moreover, we note that C(f ) = C(0, 0, 1, 1). The error estimate above is obtained by applying the integrated Polya-Vinogradov inequality and an estimate of Rankin. For the integral, we show that a smoothened version of it is 1 2

X Y

1 2

Y X

4/5 (log Y )ν

for any 0 < ν < ρ/10. Choosing X = Y (log Y )10ν in the above estimates, the main theorem follows. The above estimate for the mean value of I(X, t) is the technical heart of this chapter. It requires an integrated and reﬁned version of the Polya-Vinogradov estimate (§2, §4), together with an iterated argument to estimate certain weighted sums of Fourier coeﬃcients and Dirichlet characters (§6). Preliminary estimates for such weighted sums are obtained in §5. The treatment of the main terms is discussed in §3 and the theorems are proved in §7.

138

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

There are several estimates which we shall repeatedly use. Firstly, for Fourier coeﬃcients, we have the estimate of Rankin |a(n)|2 X 2 . n≤X

There is also the estimate of Rankin-Shahidi [Ra] (see Theorem IV.9.1)

|a(n)| X 3/2 (log X)−ρ .

n≤X

Secondly, for character sums, there is the estimate of Fainleib and Saparnijazov [FS] (see also [MM, Lemma 1]) which is a generalization of Theorem V.3.1 of Jutila [J]: 2 h (N 2 /φ(N ))dXY (log Xd)2 . nd n≤X 0<βh≤Y 2 h≡a(mod 8N ) n2 =b

Remarks on the proof. To prove Theorem 1.2, we consider averages over both positive and negative discriminants, and in addition, we introduce a further smoothing factor. Thus, we consider 1 Y

1

Y

LD (1, f ) dt =

|D|≤t D≡a(mod 8N )

|D|≤t D≡a(mod 8N )

|D| 1− LD (1, f ). Y

If = 1, then ω = ±1 and the functional equation gives the relation (1 − ωχD (−N ))LD (1, f ) = 0. Thus, the requirement that LD (1, f ) = 0, imposes a condition on D. In particular, ' ( if D ≡ a(mod 8N ) is ﬁxed, we require sgn D to be chosen so that sgn D = ω Na . In our arguments, the importance of choosing the correct value of sgn D is seen as follows. When we sum over D of a ﬁxed sign, we expect (and get) contributions to the main term from (the analogues of) both the sum S(X, Y ) and the integral I(X, Y ). Together, this contribution is a 1+ω (sgn D) cY N for a nonzero constant c. Thus, the “wrong” choice of sgn D would cause the main term to be cancelled. For essentially, the same reasons, summing over D of both signs doubles the contribution of S(X, Y ) to the main term and cancels the contribution of I(X, Y ). (See the proof of Theorem 1.2 in §7). If = 1, then

§1 Introduction

139

the root number does not give an obstruction to the choice of the sign of D. By including both values of sgn D, therefore, we get a statement valid in all cases. There is a second, and deeper, reason for including both positive and negative values of D. In order to handle the error terms that come from S(X, Y ), we need to estimate character sums of the form D m |D|≤Y D≡a(mod 8N )

when m2 is not a square. If χ is a nontrivial Dirichlet character modulo q, the Polya-Vinogradov estimate (Theorem V.2.1) is √ χ(D) q(log q) 0<βD≤Y

for β = ±1. If in addition, χ is even, Hua has shown (see Lemma 2.1 below) that 1 Y

Y 1

χ(D)dt

√ q.

0<βD≤t

Thus, for even characters, the extra averaging allows us to save a factor of log q. Now, by summing over both positive and negative values of D, and integrating over t, we are able to ﬁlter out odd characters (see Lemma 2.2) and use this improved estimate. To estimate the integral 1 Y

Y

I(X, t)dt 1

we ﬁnd that the device which ﬁltered out odd characters in the sum has exactly the opposite eﬀect in the integral. (This is because of the sgn character that is introduced by the functional equation). Therefore, we need an analogue of Hua’s estimate which is valid for all characters. In §4, we give such an estimate in mean square. Indeed, a special case of Lemma 4.6 gives 1 Y Y 0 n≤X

0<βD≤t D≡a(mod 8N )

where log+ a = and β = ±1.

D n

2 X dt X 2 (log+ )2 Y

log a if a > 1 1 otherwise

140

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

In addition to this, we need a sharp estimate for the quantity a(n) 1 Y ' ( ft+ (nh, 2s0 ; a) − ft− (nh, 2s0 ; a) dt. 1+s 1 n Y 1 n≤X n2 h =b2

It turns out that the required estimate is intimately connected with estimates for two other quantities. We shall now brieﬂy describe these. For the purpose of this exposition, we state them approximately as follows. (The reader who wishes a more precise statement is referred to the relevant sections.) Let α > 0, β = ±1. We consider the following statements: 1 Y D Aβ (α) : |D|2s0 LD (1 + s1 , f )dt = main term + Y 1 h 0<βD≤t D≡a(mod 8N )

+ O(Y

1−σ1 +2σ0

√ 0 h(1 + |s0 | + |s0 − s1 |)2 (log Y h)α log(h log Y ) )

for all σ0 ≥ 0, 0 ≤ σ1 < 12 , (ah, 2N ) = 1, a ≡ 1(mod 4). The “main term” here grows like Y 1+2σ0 . Thus, Aβ (α) does not give an asymptotic formula unless σ1 > 0. Y a(n) 1 β C β (α) : f (nh, 2s0 ; a)dt n1+s1 Y 1 t n≤X n2 h =b2

' =O Y

√

1 2 +2σ0

X

1 2 −σ1

h (2σ1 − 1)

( × (1 + |s0 | + |s0 − s1 |)2 (log Y h)α log(h log Y ) for all σ0 ≥ 0, 0 < σ1 < 12 , a ≡ 1(mod 4), and (ah, 2N ) = 1, together with a similar estimate for the sum over n ≥ X and 12 < σ1 < 1. To produce the required estimate of I(X, t), we need to know that for any λ > 0, C β (λ) holds. After reviewing some basic estimates from [MM] in §5, we study the above statements in §6 . We replace C β (α) by a smoothed version, again denoted C β (α). This version suﬃces for our application. Then, we show that Aβ (2) holds (Lemma 6.1), that Aβ (α) implies C β (α) (Proposition 6.3) and that C β (α) implies Aβ (4α/5) (Proposition 6.4). This chapter is based on the preprint [M]. We note that Iwaniec [I] has found a beautiful method which proves the main result of [MM] with an improved error term of O(Y θ ) with a θ < 1. He accomplishes this by using Gauss sums to introduce an extra averaging. His method can be used to prove the asymptotic formula of Theorem 1.2 also and this is developed in [MS ]. Recently, Friedberg and Hoﬀstein [FH] proved the non-vanishing result (but not the asymptotic formula) over any number ﬁeld. Their method involves the Rankin-Selberg convolution and metaplectic Eisenstein series and is quite diﬀerent from our techniques.

§2 The integrated Polya-Vinogradov estimate

141

§2 The integrated Polya-Vinogradov estimate We begin with the following well known integrated version of the Polya-Vinogradov estimate. Lemma 2.1. Let β = ±1, and let χ be an even, nontrivial Dirichlet character modulo q. Then, 1 Y √ χ(D) dt q. Y 1 0<βD≤t

Proof. This is due to Hua (see [BC, eqn.(7)]). Lemma 2.2

If n2 h = b2 , and (ah, 2N ) = 1, then 1 Y

1

Y

|D|≤t D≡a(mod 8N )

D nh

1

|D|2s0 dt (|s0 | + 1)(nh) 2 Y 2σ0 .

Proof. We have

|D|≤t D≡a(mod 8N )

D nh

|D|2s0

D ψ(D) |D|2s0 nh ψ(mod 8N) |D|≤t 1 ¯ D −1 2s0 = ψ(a) ψ(D) |D| (1 + ψ(−1) ). φ(8N ) nh nh

1 = φ(8N )

ψ

¯ ψ(a)

0
' . ( Each character ψ nh is nontrivial as n2. h is. Thus, we may apply Lemma 2.1 and partial summation to the inner sum to complete the proof. Lemma 2.3

For β = ±1, and σ0 ≥ 0, we have

0<βD≤t D≡a(mod 8N ),(D,n2 h)=1

|D|2s0 =

1 φ(8N n2 h) t1+2s0 + O(t2σ0 d(n2 h)(|s0 | + 1)). φ(8N ) 8N n2 h 1 + 2s0

This is proved by partial summation and some elementary estimates. The details are left to the reader. As an immediate application, we state the following asymptotic estimates for average values of L-functions, the proof of which is left as an exercise. (We shall not need these in the remainder of the chapter.)

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

142

Proposition 2.4 For (ah, 2N ) = 1 and σ0 ≥ 0 we have 1 Y D |D|2so LD (1 + s1 , f )dt Y 1 h |D|≤t D≡a(mod 8N )

= C(s0 , s1 , 1, h)Y 1+2s0 + E(s0 , s1 , h, Y ) where if σ1 > 1, E(s0 , s1 , h, Y ) is 1

(|s0 | + 1)Y 2σ0 (h 2 ζ(σ1 ) + d(h)2 ) and if

1 2

< σ1 < 1, it is

Y 1−σ1 +2σ0 (log Y )3(1−σ1 ) (|s0 | + 1)h 2 ((1 − σ1 )−1 +

1

Y log Y

12

1 |Γ(σ1 − )|)). 2

We remark that using the results of §6, it is possible to reﬁne the error terms. To establish asymptotics for values of s1 inside the critical strip, we shall need a more reﬁned approach. Moreover, we shall need a substitute for Lemma 2.1 that holds for odd characters. These two problems are addressed in the next two sections.

§3 The main terms We are interested in estimating the quantity 1 Y LD (1 + s1 , f )dt Y 1 0<βD≤t D≡a(mod 8N )

for 0 ≤ σ1 < 1/2. However, it is important to treat a more general sum. For σ0 ≥ 0, 0 ≤ σ1 < 12 , a ≡ 1(mod 4), (ahj, 2N ) = 1, (h, j) = 1, β = ±1, we consider 1 Y D |D|2so LDj 2 (1 + s1 , f ) dt. Y 1 h 0<βD≤t D≡a(mod 8N )

Towards this end, for D ≡ a(mod 8N ), consider the integral 1 LDj 2 (1 + s1 + w, f )X w Γ(w)dw. 2πi (2) This is

(n,j)=1

a(n) n1+s1

D n

exp(−n/X).

§3 The main terms

143

Moving the line of integration to Re s = −η, with 0 < η < 1, we obtain a(n) D exp(−n/X) n1+s1 n (n,j)=1 1 = LDj 2 (1 + s1 , f ) + LDj 2 (1 + s1 + w, f )X w Γ(w)dw. 2πi (−η) Writing Dj 2 = D0 δ 2 with D0 a fundamental discriminant, we see that the integral is equal to w 1 μ ˜(d, f ) D0 X L (1 + s + w, f ) Γ(w)dw. D0 1 2πi d|δ2 d1+s1 (−η) d d j|δ

1 2

Suppose now that η > integral becomes

+ σ1 . Then, on applying the functional equation, the

a 1 μ ˜(d, f ) ¯(n) D0 × ωχD0 (−N ) (D0 ) 2πi d|δ2 d1+s1 (−η) n1−s1 −w nd j|δ

Γ(1 − s1 − w) (X/d)w Γ(w)dw. Γ(1 + s1 + w) ' ( Multiplying through by |D|2s0 D h , summing over ×(A|D0 |)−2(s1 +w)

0 < βD ≤ t, D ≡ a(mod 8N ), and integrating over t, we see that 1 Y D |D|2s0 LDj 2 (1 + s1 , f ) dt Y 1 h 0<βD≤t D≡a(mod 8N )

is equal to the sum of (n,j)=1

and

a(n) 1 n1+s1 Y

1

a − βω (a) A−2s1 N

1 × 2πi ×

(−η)

⎛ Y

⎜ ⎝

0<βD≤t D≡a(mod 8N )

2

¯(δ )δ

δ2 ≤Y j|δ (δ,2N )=1

a ¯(n) n1−s1 −w

D nh

1 Y /δ 2

4s0

δ2 h

⎞ ⎟ |D|2s0 dt⎠ exp(−n/X)

d|δ 2

μ ˜(d, f ) d1+s1

Y /δ 2

ftβ (ndh, 2s0 1

Γ(1 − s1 − w) (X/dA2 )w Γ(w)dw. Γ(1 + s1 + w)

− 2s1 − 2w; aδ¯2 )dt

144

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

Here, as usual, we are writing ftβ (ndh, 2s0

− 2s1 − 2w, aδ¯2 ) =

0<βD0 ≤t D0 δ2 ≡a(mod 8N )

D0 ndh

|D0 |2s0 −2(s1 +w) .

We have now to determine the size of the sum and the integral. For the sum, we expect that the main term will come from the sum over those values of n for which n2 h = b2 . This “main term” therefore is ⎛ ⎞ ⎟ Y a(n) a ⎜ ⎜1 ⎟ 2s0 ˜ β (s0 , s1 , j, h) = M |D| dt ⎜ ⎟ exp(−n/X). 1+s 1 n n1 ⎝ Y 1 0<βD≤t ⎠ n h=b2 2 (n2 ,j)=1

D≡a(mod 8N ) (D,n2 h)=1

When there is no possibility of confusion, we shall simply write ˜ β (s0 , s1 , j, h). M = M It is more convenient to work with the series

def

Mβ (s0 , s1 , j, h) =

n2 h=b2 (n2 ,j)=1

a(n) n1+s1

a n1

⎛

⎞

⎜ Y ⎜1 ⎜ ⎝Y 1

0<βD≤t D≡a(mod 8N ) (D,n2 h)=1

⎟ ⎟ |D|2s0 dt⎟ . ⎠

For σ ≥ 0, and (d, 2N D) = 1, deﬁne Fd (s, f, D) =

a(m) m1+s

m2 d=b2 (m,D)=1

In terms of this function, we see that 1 Y Mβ (s0 , s1 , j, h) = Y 1

a m1

.

|D|2s0 Fh (s1 , f, jD)dt.

0<βD≤t D≡a(mod 8N ) (D,h)=1

˜ β and Mβ are related by the following estimate. The two series M Lemma 3.1

We have

3 1 1 ˜ Mβ (s0 , s1 , j, h) = Mβ (s0 , s1 , j, h) + O( 1 + 3/4 X −σ1 − 8 Y 1+2σ0 ). p p|j

uniformly in h.

§3 The main terms

145

The proof will be given later in this section. As for the integral, we again expect that the “main term” will come from the sum of those values of n where n2 dh is a perfect square. The “main term” is therefore 2 a δ −βω (a) A−2s1 ¯(δ 2 )δ 4s0 N h δ2 ≤Y j|δ (δ,2N )=1

×

μ ˜(d, f ) 1 a ¯(n) a 1+s1 2πi 1−s1 −w d n n 1 (−η) n dh=b2 d|δ 2 2 ⎛

⎜ Y /δ2 ⎜ 1 ×⎜ ⎜ Y /δ 2 1 ⎝ ×

0<βD0 ≤t D0 δ2 ≡a(mod 8N ) (D0 ,2N n2 dh)=1

⎞

⎟ ⎟ |D0 |−2(s1 +w−s0 ) dt⎟ ⎟ ⎠

Γ(1 − s1 − w) (X/dA2 )w Γ(w)dw. Γ(1 + s1 + w)

For η ≥ σ1 , the “main term” may be written in terms of the function Fd as follows: 2 a δ μ ˜(d, f ) N = −βω (a) A−2s1 ¯(δ 2 )δ 4s0 N h d1+s1 2 δ2 ≤Y d|δ

j|δ (δ,2N )=1

Y /δ2 1 |D0 |2s0 −2s1 Y /δ 2 1 w 1 X Γ(1 − s1 − w) ¯ × Fdh (−s1 − w, f , D0 ) Γ(w)dw dt. 2πi (−η) dA2 |D0 |2 Γ(1 + s1 + w) ×

If 0 < σ1 < 18 , we move the line of integration to the right, and get a residue at w = 0 equal to 2 a Γ(1 − s1 ) δ μ ˜(d,f ) def −2s 2 4s 1 0 Nβ (s0 ,s1 ,j,h) = βω (a) A ¯(δ )δ N Γ(1 + s1 ) δ2 ≤Y h d1+s1 2 d|δ

⎛ ⎜ 1 ×⎝ Y /δ 2

j|δ (δ,2N )=1

Y /δ 2 1

⎞

⎟ |D0 |2(s0 −s1 ) Fdh (−s1 , f¯,D0 )dt⎠ .

0<βD0 ≤t D0 δ2 ≡a(mod8N )

We also set Nβ (s0 , s1 , j, h) = 0

if

1 1 < σ1 < . 8 2

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

146

Remark. The point here is that the residue is insigniﬁcant if σ1 is bounded away from zero. The choice of 1/8 as a breaking point was only for convenience, and any other point will work just as well. Lemma 3.2

For 0 ≤ σ1 <

1 8,

we have

N = Nβ (s0 , s1 , j, h) + O(Y For

1 8

5 8 +2σ0

X 16 −σ1 ). 3

≤ σ1 < 12 , we have N Y 1+2(σ0 −σ1 +η) X −η |Γ(−η)|

for any σ1 ≤ η < 1. The proof will be given later in the section. We expect that 1 Y

Y

1

0<βD≤t D≡a(mod 8N )

D h

|D|2s0 LDj 2 (1 + s1 , f )dt

is asymptotic to Mβ (s0 , s1 , j, h) + Nβ (s0 , s1 , j, h). It will be necessary to have estimates on the growth of these functions. This will be based on a study of the function Fd (s, f, D) introduced above. Analysis of Fd (s, f, D) Let us deﬁne Bp (s) =

∞ a(p2 ) =0

p2s

.

Let us also deﬁne the function Fd0 (s, f ) =

a(d0 n2 ) (d0 n2 )s

for Re(s) > 1. Then, by Exercise 3, −1

Fd0 (s,f ) = L(2s,Sym )ζ(4s − 2) 2

p|d0

(p)p 1 − 2s p

1−

1

−1

p4s−2

and this gives the analytic continuation of Fd0 (s) for Re(s) > 3/4.

1 + (p)p . ps

§3 The main terms

Now, we have the relation ⎛ ⎜ Fd (s, f, D) = ⎝ p|m1 ⇒p|2N (m1 ,D)=1

a(m1 ) m1+s 1

a m1

147

⎞⎛ ⎟⎜ ⎠⎜ ⎝

m2 d=b2 (m2 ,D)=1

⎞ a(m2 ) ⎟ ⎟. ⎠ m1+s 2

The ﬁrst factor is entire for σ > − 12 , and uniformly bounded for σ ≥ −3/16. Now, let us write d = d0 d21 with d0 squarefree. As for the second factor, we see then that it is equal to ∞ −1 a(p2 ) a(n2 d0 ) = Fd0 (1 + s, f ) . (n2 d0 )1+s p2(1+s) =0 p|2ND

(n,2ND) = 1

Suppose that (D, 2N ) = 1. Then, we can write Fd (s, f, D) = Fd0 (1 + s, f )G(s, f )

Bp (1 + s)−1

p|D

where we have set

⎛

⎜ G(s, f ) = ⎝

p|m⇒p|2N (m,D)=1

⎞ a(m) a ⎟ Bp (1 + s)−1 . ⎠ m1+s m p|2N

It is clear that G(s, f ) is analytic for Re(s) > − 21 . Combining this with what was stated earlier, it follows that Fd (s, f, D) is analytic for Re(s) > −1/4. Lemma 3.3 Write d = d0 d21 with d0 squarefree, and suppose that (d, 2N D) = 1 and (D, 2N ) = 1. Then, for σ ≥ −3/16, and an absolute constant c1 > 0, 1 1 ν(d ) Fd (s, f, D) c1 0 (d0 )−σ− 2 |L(2s + 2, Sym2 )ζ(2 + 4s)−1 | (1 + 1+2σ )3 . p p|D

Proof. The proof of [MM, Lemma 13] shows that ν(d0 )

Fd (1 + s, f ) c2

(d0 )−σ− 2 |L(2s + 2, Sym2 )ζ(2 + 4s)−1 | 1

for an absolute constant c2 > 0 and σ > −1/4. It is easily proved that for σ ≥ −3/16,(say), ∞ −1 3 −1 a(p2 ) 1 1 1 + 1+2σ 1 − 2+4σ p p p2(1+s) =0 and this is

1+

1 p1+2σ

3 .

It follows that the desired result holds with a possibly larger value of c2 .

148

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

Remark. It should be clear that it is only essential that σ be larger than and bounded away from −1/4. Now we give the proofs of results stated earlier. Proof of Lemma 3.1. We have

M=

1 2πi

(2)

n2 h=b2 (n2 ,j)=1

a(n) n1+s1 +w

⎛ ⎞ ⎜ Y ⎟ a ⎜1 ⎟ |D|2s0 dt⎟ X w Γ(w)dw. ⎜ n1 ⎝ Y 1 0<βD≤t ⎠ D≡a(mod 8N ) (D,n2 h)=1

Moving the line of integration to Re(w) = −σ1 − 18 , we see that 1 2πi

M = Mβ (s0 , s1 , j, h) +

(−σ1 − 18 )

Mβ (s0 , s1 + w, j, h)X w Γ(w)dw.

By Lemma 3.3, we see that the integral is

1−

p|j

1 p3/4

−3

1 1 X −σ1 − 8 Y 1+2σ0 |Γ(−σ1 − )|. 8

This proves the result. Proof of Lemma 3.2. Suppose ﬁrst that 0 ≤ σ1 < 18 . By deﬁnition, N − Nβ (s0 , s1 , j, h) is equal to 2 a δ μ ˜(d, f ) −2s1 2 4s0 −βω (a) A ¯(δ )δ N h d1+s1 2 δ2 ≤Y d|δ

j|δ

Y /δ 1 |D0 |2s0 −2s1 Y /δ 2 1 w 1 X Γ(1 − s − w) 1 Fdh (−s1 − w, f¯, D0 ) Γ(w)dw dt 2πi (η) dA2 |D0 |2 Γ(1 + s1 + w) 2

for some

1 4

> η > 0. Using the factorization Fd (s, f, D) = Fd0 (1 + s, f )G(s, f )

p|D

Bp (1 + s)−1

§3 The main terms

149

described earlier, we see that Y /δ2 1 |D0 |2s0 −2s1 Y /δ 2 1 w 1 X Γ(1 − s1 − w) ¯ Fdh (−s1 − w, f , D0 ) Γ(w)dw dt 2πi (η) dA2 |D0 |2 Γ(1 + s1 + w) is equal to 1 ¯ ¯ Fd (1 − s1 − w, f)G(−s 1 − w, f ) 2πi (η) 0 ⎛ ⎞ Y /δ2 1 ⎝ |D0 |2s0 −2s1 −2w Bp (1 − s1 − w)−1 dt⎠ Y /δ 2 1 0<βD0 ≤t p|D0 w X Γ(1 − s1 − w) Γ(w)dw. dA2 Γ(1 + s1 + w) By an identity stated earlier, we get an estimate −1

|Bp (1 + s)| Thus,

1+

2σ0 −2σ1 −2η

|D0 |

p|D0

3 16

1−

1

−1 .

p4σ+2

Bp (1 − s1 − w)−1

p|D0

0<βD0 ≤t

and choosing η =

3

p2σ+1

|D0 |2s0 −2s1 −2w

0<βD0 ≤t

1

3 −1 1 1 + 1−2σ1 −2η 1 − 2−4σ1 −4η p p 1

− σ1 (say), this is

|D0 |2σ0 − 8 σ−5/8 (D0 )3 t2σ0 + 8 . 3

5

0<βD0 ≤t

Also, ¯ |L(2 − 2s1 − 2w, Sym2 )ζ(2 − 4s1 − 4w)−1 | Fd0 (1 − s1 − w, f) ⎧ ⎫ −1 ⎬ ⎨ p 1 × 1 + 2−2σ1 −2η 1 − 2−4σ1 −4η pσ1 +η . ⎭ ⎩ p p p|d0

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

150

Inserting all this into the integral, we see that the entire expression that we are trying to estimate is

δ 4σ0

δ2 ≤Y j|δ

×

d 12 d(d) d1+σ1

d|δ 2

1+

p|d0

* −1 3 −σ 2σ0 + 58 1 Y X 16 1 3/16 1 − p δ2 d p13/8 p5/4 p

which is Y 2σ0 + 8 X 16 −σ1 5

3

δ −5/4

δ2 ≤Y j|δ

d 12 d(d) d|δ 2

d19/16

d3/16 σ−5/8 (d)

and simplifying, we see that this is Y 2σ0 + 8 X 16 −σ1 . 5

For

1 2

3

> σ1 > 18 , Lemma 3.3 implies that for σ1 ≤ η < 1, N Y 1+2(σ0 −σ1 +η) X −η |Γ(−η)|.

This proves the lemma. Let (h, 2N ) = 1. We have for σ1 = 0, and −σ1 +

Lemma 3.4 1 2πi

1 8

> c ≥ −σ1

Nβ (s0 , s1 + w, j, h)X w Γ(w)dw |Γ(c)|Y 1+2(σ0 −σ1 −c) X c .

(c)

Proof. By deﬁnition, the integral of the lemma is equal to 2 a δ μ ˜(d, f ) −2s1 2 4s0 βω (a) ¯(δ )δ A N δ2 ≤Y h d1+s1 2 d|δ

j|δ

Y /δ 1 |D0 |2s0 −2s1 Y /δ 2 1 w 1 X Γ(1 − s − w) 1 Fdh (−s1 − w, f¯, D0 ) Γ(w)dw dt. 2πi (c) dA2 |D0 |2 Γ(1 + s1 + w) 2

By Lemma 3.3, we see that uniformly in d and h, the inner integral is |Γ(c)|

X dA2 |D0 |2

c σ−7/8 (D0 )3 .

§3 The main terms

151

Hence, the sum over D0 is

|D0 |2(σ0 −σ1 )

which is

X dA2

X dA2 |D0 |2

c

c σ−7/8 (D0 )3 |Γ(c)|

t1+2(σ0 −σ1 −c) |Γ(c)|.

Inserting this into the big expression, we see that 1 Nβ (s0 , s1 + w, j, h)X w Γ(w)dw 2πi (−σ1 ) c 1+2(σ0 −σ1 −c) |˜ μ(d, f )| X Y 4σ0 δ |Γ(c)| 1+σ 1 d d δ2 2 δ2 ≤Y d|δ

j|δ

|Γ(c)|Y 1+2(σ0 −σ1 −c) X c

δ −2+4(σ1 +c)

δ 2 ≤Y

d|δ 2

d(d) d

1 2 +σ1 +c

|Γ(c)|Y 2(σ0 −σ1 −c)+1 X c . We conclude this section with an estimate that we shall use in §7. Lemma 3.5

We have

˜ β (0, 0, 1, 1) = 1 C(0, 0, 1, 1, )Y + O(Y X −1/8 ) + O((log X)3 ). M 2 Proof. Inserting the asymptotic formula provided by Lemma 2.3 into the integral, we see that it is equal to * a(n) a 1 Y 1 φ(n2 ) t + O(d(n2 )) dt exp(−n/X). n n1 Y 1 8N n2 2 n2 =b

The error term contributes an amount which is

|a(n1 )| |a(b2 )| n1

n1

b

b2

d(b2 ) exp(−b2 /X)

|a(p)| |a(p2 )| d(b2 ) 1 + + + · · · exp(−b2 /X). p p2 b b

p|2N

The ﬁrst factor depends only on N and can be absorbed into the implied constant. For the sum over b, we see by standard estimates that it is (log X)3 .

152

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

Now consider the contribution of the main term. We see that it is equal to Y a(n) a φ(n2 ) exp(−n/X) 16N n n1 n2 2 n2 =b

which is Y 1 16N 2πi

F˜1 (w, f, 1)X w Γ(w)dw (2)

Here, for (d, 2N D) = 1, we have set F˜d (s, f, D) =

m2 d=b2 (m,D)=1

a(m) m1+s

a m1

φ(m2 d) . m2 d

Using the properties of F˜d given in Exercise 4, and moving the line of integration to the left, we see that this is ⎧ ⎫ ⎨ ⎬ Y a(n) a φ(n2 ) 1 + F˜1 (w, f, 1)X w Γ(w)dw . ⎭ 16N ⎩ n n1 n2 2πi (−1/8) 2 n2 =b

Now applying the estimate for F˜1 of Exercise 4, we see that this is Y ˜ F1 (0, f, 1) + O(Y X −1/8 ) 16N as required.

§4 Estimates for real character sums In this section, we develop a substitute for Lemma 2.1 in the case of odd Dirichlet characters. Lemma 4.1

Let χ be a ﬁxed Dirichlet character and h an integer. Then, n≤X

∗

n|L(1, χ

. )|2 X 2 hn

' . ( is a nontrivial charwhere the sum over n only includes values such that χ hn acter, and the implied constant depends only on the conductor of χ.

§4 Estimates for real character sums

153

Proof. This follows by a small modiﬁcation in the proof of Jutila [J, Theorem 3] and partial summation. Indeed, we have . χ(m) m log X L(1, χ ) = + O( √ ) hn m hn X m≤X ' . ( for ﬁxed χ and all n ≤ X such that χ hn = 1. It follows that 2 . χ(m) m ∗ ∗ 3/2 2 n|L(1, χ )|2 = n + O(X (log X) ). hn m hn n≤X

n≤X

m≤X

Expanding the ﬁrst term, we see that it is the sum of χ(r2 ) ∗ A = n χ(m)2 2 r 2 r≤X n≤X

m|r

(r,hn)=1

and

B =

m1 ,m2 ≤X m1 m2 =b2

χ(m1 m2 ) m1 m2

S(d) =

n

n≤X

It is clear that A = O(X 2 ). (In fact, it is for B, suppose χ is a character mod r. We d | n | ≤ X 3/2 n n≤X n≤X p|n⇒p|r χ(d) = ( d ) hn Let us set

∗

m m 1 2 . hn

asymptotic to cX 2 for some c = 0). As note that −1 1 √ X 3/2 1 − p−1/2 . n

n≤X

p|r

d n . n

As r is ﬁxed, we deduce that d d ∗ n = S(d) + O(X 3/2 ). hn h n≤X

Therefore, B X 3/2 (log X)2 +

d(m) |S(m)|. m m≤X 2 m =b2

If we set T (Z, d) =

d n≤Z

n

,

then S(d) = XT (X, d) −

and [J, Theorem 1] (see also [MM, Lemma 1]) asserts that |T (Z, d)|2 ZW (log W )2 . d≤W d =b2

n≤X−1

T (n, d)

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

154

By partial summation, it follows that 1 |T (Z, d)|2 Z(log W )3 . d d≤W d =b2

Thus, we see that for 1 ≤ n ≤ X, we have ⎛

⎞1/2

d(m) d(m)2 ⎠ |T (n, m)| ≤ ⎝ m m 2 2 m≤X m≤X

⎛

⎞1/2

⎜ 1 ⎟ 2⎟ ⎜ |T (n, m)| ⎝ ⎠ m m≤X 2

m =b2

m =b2

(log X) n

2 1/2

It follows that

3/2

(log X)

= n1/2 (log X)7/2 .

d(m) |S(m)| X 3/2 (log X)7/2 . m m≤X 2 m =b2

It follows that B = O(X 3/2 (log X)7/2 ) and this proves the lemma. Recall that we have set

log a if a > 1 . 1 otherwise Lemma 4.2. Let χ be a ﬁxed Dirichlet character and (h, 2N ) = 1. Then 2 1 Y D Xh 2 ∗ χ(D) dt hX 2 (log+ ) Y hn Y 0 log+ a =

n≤X

0
where the sum over n is as above. ' . ( Proof. Suppose ﬁrst that ψ = χ hn is a primitive character, mod q (say). Polya has derived the expression D 1 − e−2πimt/q q log q ¯ χ(D) = g(ψ) ψ(m) + O(1 + ). hn 2πim H 0
0<|m|≤H

(See [BC] or [MV].) Here g(ψ) is the Gauss sum attached to ψ. Integrating with respect to t, we deduce that ⎛ ⎞ Y 1 D ⎠ ⎝ χ(D) dt Y 0 hn 0
= g(ψ)

0<|m|≤H

¯ ψ(m) qlogq qg(ψ) + O(1 + )+ 2πim H 4π 2 Y

0<|m|≤H

¯ ψ(m) −2πimY /q 1−e . m2

§4 Estimates for real character sums

Now,

155

¯ ψ(m) 1 ¯ ¯ + O( q )). = (1 − ψ(−1))(L(1, ψ) 2πim 2πi H

0<|m|≤H

Thus, letting H −→ ∞, we ﬁnd that ⎛ ⎞ 1 Y⎝ D ⎠ χ(D) dt Y 0 hn 0
is equal to g(ψ)

1 ¯ ¯ + O(1) + qg(ψ) (1 + ψ(−1))L(2, ¯ ¯ (1 − ψ(−1))L(1, ψ) ψ) 2πi 4π 2 Y ∞ ¯ qg(ψ) ψ(m) −2πimY /q 2πimY /q ¯ e . − + ψ(−1)e 4π 2 Y m=1 m2

If ψ(−1) = −1, then 2πmY mY 2πimY /q ¯ e−2πimY /q + ψ(−1)e = −2isin . q q Hence, in this case, 1 Y

⎛

Y 0

=

⎝

0
χ(D)

⎞ D ⎠ dt hn

∞ 3/2 g(ψ) 1 mY ¯ + O(1) + O( q L(1, ψ) min(1, )) 2 πi Y m=1 m q

and this is equal to g(ψ) ¯ + O(1) + O(q 12 log+ q ). L(1, ψ) πi Y If ψ(−1) = +1, then by Lemma 2.1 ⎛ ⎞ 1 Y⎝ D ⎠ √ χ(D) dt q. Y 1 hn 0
Hence, in all cases, ⎛ ⎞ 2 Y 1 D ¯ 2 + O(1) + O(q(log+ q )2 ). ⎠ ⎝ χ(D) dt q|L(1, ψ)| Y hn Y 0 0
Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

156

Note that q is O(hn). Hence, summing over those n such that χ

'

. hn

(

= 1, we get

⎛ ⎞ 2 Y 1 D ∗ ⎝ ⎠ χ(D) dt Y hn 0 0
n≤X

hX 2 . In the last step, we used the estimate of Lemma 4.1. If Xh ≥ Y, then

∗

hn|L(1, χ

n≤X

Y /h
. )|2 + hn

hn(log

Y
hn 2 ) Y

and this is

Xh 2 ) . Y If ψ is not primitive, we let ψ∗ denote the primitive character mod q ∗ say, that induces ψ. Let us write q = q ∗ r. Then, we have ⎛ ⎞ ⎛ ⎞ Y Y 1 D ⎠ 1 ⎜ ⎟ ⎝ χ(D) dt = ψ∗ (D)⎠ dt ⎝ Y 0 hn Y 0 0
0
(D,r)=1

and the latter is equal to d|r

Thus,

1 μ(d)ψ∗ (d) Y /d

⎛

Y /d 0

⎝

⎞ ψ∗ (D)⎠ dt.

0
⎛ ⎞ 2 1 Y D ∗ ⎝ ⎠ χ(D) dt Y hn 0

n≤X

0
⎛ ⎞ 2 1 Y /d ∗ ∗ ⎝ ⎠ dt . d(r)2 ψ (D) Y /d 0 n≤X 0
n≤X

∗

∗ + q 2 ∗ 2 ∗ ∗ d(r) q |L(1, ψ )| + O(1) + O(q (log ) ) . Y 2

d|r

§4 Estimates for real character sums

Since

r L(1, ψ) φ(r)

|L(1, ψ∗ )| it follows that the above is

n≤X

∗

¯ 2 q|L(1, ψ)|

⎛

3

d(r) r + O⎝ φ(r)2

157

⎞

⎛

d(r)3 ⎠ + O ⎝

n≤X

d(r)3 q ∗ (log+

n≤X

∗

⎞

q 2⎠ ) . Y

Since d(r) r and φ(r) ≥ r/(log log r), the quantity d(r)3 r/φ(r)2 is bounded. Thus, by Lemma 4.1, the ﬁrst term above is hX 2 . The second O term above is clearly X(log X)7 . Finally, as d(r)3 q ∗ ≤ q, the ﬁnal O term is hX 2 (log+ Xh/Y )2 ). Lemma 4.3. With notation as above and σ ≥ 0, we have 2 ⎛ ⎞ 1 Y D ⎠ s + Xh 2 ∗ 2 2σ 2 ⎝ χ(D) t dt Y h(1 + |s| )Y X (log Y ) hn 1 n≤X

0
Proof. This follows from Lemma 4.2 by integration by parts and an application of the Cauchy-Schwarz inequality. Lemma 4.4 With notation as above and σ ≥ 0, we have ⎛ ⎞ 2 1 Y D Xh 2 ∗ s⎠ ⎝ χ(D) |D| dt (1 + |s(s − 1)|2 )hY 2σ X 2 (log+ ) Y hn Y 1 n≤X

0
Proof. This essentially follows by partial summation from Lemma 4.3. Lemma 4.5 Let a, h be integers with (ah, 2N ) = 1. With notation as above and σ ≥ 0, we have 2 1 Y D Xh 2 |D|s dt (1 + |s(s − 1)|2 )hY 2σ X 2 (log+ ) . Y hn Y 0 n≤X 0
Proof. The sum over D may be written as 1 φ(8N )

χ mod 8N

χ(a) ¯

χ(D)

0
D hn

|D|s .

Applying the Cauchy-Schwarz inequality, each term in the sum over n is 2 1 1 Y D s χ(D) |D| dt . φ(8N ) χ n≤X Y 0 hn hn2 =b2

0
158

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

D If hn2 = b2 , then hn is a nontrivial character of conductor prime to 8N, and 2 ' . ( so χ hn is a nontrivial character. Hence, the above summand is 2 Y 1 D ∗1 s χ(D) |D| dt φ(8N ) χ hn Y 0 n≤X

0
where the asterisk on the sum ' . has ( the same meaning as earlier (that is, we range over those n ≤ X so that χ hn = 1.) Applying Lemma 4.4, we deduce the result. Lemma 4.6 Let a, h be integers with (ah, 2N ) = 1 and β = ±1. With notation as above and σ ≥ 0, we have 1 Y Y 0 n≤X 2

hn2 =b

0<βD≤t D≡a(mod 8N )

D hn

2 Xh 2 s |D| dt (1 + |s(s − 1)|2 )hY 2σ X 2 (log+ ) . Y

Proof. This follows immediately by noting that the previous result holds if ( ' even we sum over 0 < −D ≤ t. (This is clear since we can factor out χ(−1) −1 from hn the sum over D.)

§5 Estimates for some weighted sums We need to record some lemmas that are minor variants of estimates that appear in [MM]. We collect them here. Lemma 5.1 Let β = ±1. For Re s0 = σ0 and Re s1 = σ1 , 0, a ≡ 1(mod 4), and (ah, 2N ) = 1, we have n≤U,n2 h=b2

0 ≤ σ1 < 1/2,

a(n) (j) χ (nh)˜ gYβ (nh, 2s0 ; a) n1+s1 0 (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 log U h.

If σ1 > 12 , we have n>U,n2 h=b2

a(n) (j) χ (nh)˜ gYβ (nh, 2s0 ; a) n1+s1 0 (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 log U h.

σ0 >

§5 Estimates for some weighted sums

159

Proof. This is essentially Lemma 4 and Lemma 8 of [MM]. For example, to prove the ﬁrst estimate, one checks that n≤U n2 h =b2

a(n) (j) χ (nh) n1+s1 0

m

0<βm≤t m≡a(mod 8N )

nh

m2s0

is (|s0 | + 1)h 2 U 2 −σ1 t 2 +2σ0 (log U h). 1

1

1

Integrating over t then gives the stated result. Lemma 5.2 Let β = ±1. If Re s0 = σ0 , and Re s1 = σ1 , 0, a ≡ 1(mod 4), (ah, 2N ) = 1, then n≤U,n2 h=b2

0 ≤ σ1 < 1/2,

σ0 >

a(n) β g (nh, 2s0 ; a) (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 (log Y )(log U h). n1+s1 Y

If σ1 > 12 , then n>U,n2 h=b2

a(n) β g (nh, 2s0 ; a) (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 (log Y )(log U h). n1+s1 Y

Proof. This is essentially Lemma 5 and Lemma 9 of [MM]. Lemma 5.3

We have

|a(n)| √ x(log x)−ρ n

n≤x

for some ρ > 0. Proof. This is due to Rankin [Ra] (see [MM, Lemma 17] or Theorem IV.9.1). Lemma 5.4.

1 Y

For 0 ≤ σ1 < 1, we have ⎛ a(n) ⎜ Y ⎝ n1+s1 1 2

n2 h=b

|D|2s0

|D|≤t D≡a(mod 8N ) 1

(|s0 | + 1)h 2 Y 2σ0 Proof. Use Lemma 2.2 and Lemma 5.3.

D nh

⎞ ⎟ dt⎠ exp(−n/X)

X 1−σ1 (log X)−ρ . 1 − σ1

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

160

§6 The statements A± (α) and C ± (α) Let β = ±1. In §3, we had deﬁned Mβ (s0 , s1 , j, h) and Nβ (s0 , s1 , j, h). Consider the following statement: 1 Y

β

A (α) :

1

Y

0<βD≤t D≡a(mod 8N )

D h

|D|2so LDj 2 (1 + s1 , f )dt

= Mβ (s0 , s1 , j, h) + Nβ (s0 , s1 , j, h) √ + O(Y 1−σ1 +2σ0 h(1 + |s0 | + |s0 − s1 |)2 × (log Y h)α log(h log Y ) + σ−3/4 (j)3

,

for a ≡ 1(mod 4), (ahj, 2N ) = 1, σ0 ≥ 0 and 0 ≤ σ1 < 12 . Lemma 6.1

Let β = ±1. Then Aβ (2) holds.

Proof. Let a ≡ 1(mod 4) and (a, 2N ) = 1. Suppose also that σ0 ≥ 0 and that 0 ≤ σ1 < 12 . Let X ≥ Y and consider the integral 1 Y

1

Y

0<βD≤t D ≡ a(mod 8N )

D h

|D|2s0

1 2πi

LDj 2 (1 + s1 + w, f )X w Γ(w)dw dt. (2)

On the one hand, it is (n,j)=1

a(n) β g˜ (nh, 2s0 ; a) exp(−n/X). n1+s1 Y

(Recall that we are writing g˜Yβ (n, 2s0 ; a) =

1 Y

1

Y

|D|2s0

0<βD≤t D≡a(mod 8N )

D n

dt

as usual.) Let us estimate the contribution of those n for which n2 h is not a square. Applying the Cauchy-Schwarz inequality, we see that the terms with n ≤ X are ⎛

|a(n)|2 ⎝ n≤X

n

5 4 +σ1

⎞ 12

⎛

⎜ exp(−n/X)⎠ × ⎝

1

n≤X n2 h =b2

n

3 4 +σ1

⎞ 12 2 β ⎟ g˜Y (nh,2s0 ;a) exp(−n/X)⎠ .

We apply Rankin’s estimate for the ﬁrst factor. We ﬁnd that it is X 2 ( 4 −σ1 ) . 1

3

§6 The statements A± (α) and C ± (α)

161

For the second factor, we apply Lemma 4.6, and ﬁnd that it is (1 + |s0 (2s0 − 1)|)h 2 Y 2σ0 X 2 ( 4 −σ1 ) log+ 1

1

5

Xh . Y

Putting these estimates together, we get an estimate of def

1

E1 = (1 + |s0 (2s0 − 1)|)h 2 Y 2σ0 X 1−σ1 log+

Xh Y

Similarly, the sum over terms n > X is majorized by

12

|a(n)| exp (−n/X) n2σ1 2

⎛ ⎜ ⎝ n>X n2 h =b2

n>X

⎞ 12 1 β ⎟ |˜ g (nh, 2s0 ; a)|2 exp (−n/X)⎠ n2 Y

and this is E1 also. If we choose X ≤ Y (log Y )γ , we see that E1 (1 + |s0 (2s0 − 1)|)h 2 Y 1+2σ0 −σ1 (log Y h)γ(1−σ1 ) log(h log Y ). 1

The sum corresponding to the squares is equal to ⎛ ⎞ ⎟ Y ⎜ a 1 a(n1 n2 ) ⎜ 2s0 ⎟ |D| ⎜ ⎟ dt exp(−n/X). Y n h=b2 (n1 n2 )1+s1 n1 ⎠ 1 ⎝ 0<βD≤t 2 (n,j)=1

D≡a(mod 8N ) (D,2N n2 h)=1

By Lemma 3.1, this is = Mβ (s0 , s1 , j, h) + O(σ−3/4 (j)3 X −σ1 − 8 Y 1+2σ0 ). 1

On the other hand, moving the line of integration to the line Re(w) = −η, we get a residue at w = 0 equal to 1 Y D |D|2s0 LDj 2 (1 + s1 , f )dt. Y 1 h 0<βD≤t D ≡ a(mod 8N )

plus an integral along the line Re(w) = −η. We proceed as in §3 and rewrite it as a μ a ˜(d, f ) 1 ¯(n) βω A−2s1 (a) δ 4s0 ¯(δ 2 ) 1+s 1−s1 −w 1 2πi N d n (−η) 2 δ2 ≤Y d|δ

(δ,2N h)=1 j|δ

Γ(1 − s1 − w) × gY /δ2 (ndh, 2(s0 − s1 − w); aδ¯2 ) (X/A2 d)w Γ(w)dw. Γ(1 + s1 + w)

162

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

To estimate this integral, we split the sum over n into those for which n2 dh is a square, and the remaining values. If 18 ≤ σ1 < 12 , then by Lemma 3.2, we see that the ﬁrst set contribute an amount which is Y 1+2σ0 X −σ1 . If 0 ≤ σ1 < 18 , then it is Nβ (s0 , s1 , j, h) + O(Y

5 8 +2σ0

X 16 −σ1 ). 3

Using Lemma 5.2, we shall estimate the contribution of the second set. Split the sum over n at an intermediate point U to be speciﬁed later. For the terms with n < U, we move the line of integration to a line Re(w) = −η1 , with 1 σ1 ≤ η1 < σ1 + , 2 and for those with n > U, we move the line of integration to a line Re(w) = −η2 , with 2 > η2 > 1. Let us set Z = Y /δ 2 . The terms with n < U then contribute an amount

δ 4σ0

δ 2 ≤Y

d 12 d(d) X −η1 |Γ(−η1 )|(1 + |s0 − s1 + η1 |) d1+σ1 A2 d 2 d|δ

×(dh) 2 U 2 −(η1 −σ1 ) Z 2 +2(σ0 −σ1 +η1 ) (log Z)(log U dh). 1

1

1

Now, if we choose U = Y 2 /X, then X −η1 U 2 −η1 +σ1 Y 1

1 2 +2σ0 −2σ1 +2η1

We choose η1 = σ1 +

= Y 3/2+2σ0 X − 2 −σ1 . 1

1 −ν 2

for a ν satisfying 1 1 3 < ν < min( , σ1 + ) 4 2 8 say. (This choice ensures that η1 > 1/8 and in particular is bounded away from zero.) Then the sum over δ above is δ 2 ≤Y

δ −1+4(ν− 2 ) 1

d|δ 2

d 2 −ν d(d) 1

§6 The statements A± (α) and C ± (α)

and this is

163

δ 4ν−3 (δ 2 ) 2 −ν d(δ 2 )2 1

δ 2 ≤Y

δ 2ν−2 d(δ 2 )2 1.

δ 2 ≤Y

Hence the above is h 2 Y 3/2+2σ0 X − 2 −σ1 (log Y )(log Y h)(1 + |s0 − s1 |). 1

1

The contribution of the terms with n > U is entirely similar to the above with η2 replacing η1 . In moving to the line −η2 , we also encounter a residue at w = −1. Using Lemma 5.2, this residue is easily shown to be √ 1 (1 + |s0 − s1 |) hX − 2 −σ1 Y 3/2+2σ0 (log Y )(log Y h). Now we choose (say) η2 = 3/2. Then we see the total contribution is (1 + |s0 − s1 |)h 2 Y 3/2+2σ0 X − 2 −σ1 (log Y )(log Y h) 1

1

and for X ≥ Y this is (1 + |s0 − s1 |)h 2 Y 1+2σ0 −σ1 (log Y )(log Y h). 1

This proves Aβ (2). To proceed further, we introduce the following smoothing operator. We deﬁne IU0 (f ) = f,

IU (f ) = IU1 (f ) =

1 U

2U

f (u)du. U

Moreover, for n ≥ 1, we set IUn (f ) = Let us also set A(u) =

1 U

n
and for σ1 > 12 , B(u) =

n>u n2 h =b2 (n2 ,j)=1

2U

Itn−1 (f )dt. U

a(n) β g˜ (nh, 2s0 ; a) n1+s1 Y

a(n) β g˜ (nh, 2s0 ; a). n1+s1 Y

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

164

Lemma 6.2 U ≥Y

Assume Aβ (α) and suppose σ0 ≥ 0. If 0 ≤ σ1 < 12 , then we have for

IU4 (A) Y 1−σ1 +2σ0

√

0 h(1+|s0 |+|s0 −s1 |)2 (logY h)α log(hlogY ) + σ−3/4 (j)3 .

and U ≥ Y , then we have √ 0 IU3 (B)Y 1+2σ0 U −σ1 h(1+|s0 |+|s0 −s1 |)2 (logY h)α log(hlogY )+σ−3/4 (j)3 .

Moreover, if 1 > σ1 >

1 2

Proof. Let us set

A∗ (u) =

n
a(n) β g˜ (nh, 2s0 ; a). n1+s1 Y

Then IU4 (A∗ (u)) =

1 2πi

(c)

⎧ ⎪ ⎨1 ⎪ ⎩Y

1

Y

0<βD≤t D≡a(mod 8N )

D h

|D|2so LDj 2 (1 + s1 + w, f )dt

⎫ ⎪ ⎬ ⎪ ⎭

IU4 (uw )

dw w

provided σ1 + c > 12 . We move the line of integration to Re(w) = c1 < 0 where c1 is chosen so that 0 < σ1 + c1 < 12 . Appealing to Aβ (α), this is equal to Mβ (s0 ,s1 ,j,h) + Nβ (s0 ,s1 ,j,h) 1 dw + (Mβ (s0 ,s1 + w,j,h) + Nβ (s0 ,s1 + w,j,h))IU4 (uw ) 2πi (c1 ) w √ 0 1−σ1 +2σ0 2 α +O Y h(1 + |s0 | + |s0 − s1 |) (logY h) log(hlogY ) + σ−3/4 (j)3 . The condition U ≥ Y is used in obtaining the O-term. Using the deﬁnition of Nβ and Lemma 3.3, it is easy to check that Nβ (s0 , s1 , j, h) Y

1 2 +2σ0

σ − 12

(log Y )(|s1 | + 1)1−2σ1 d(h0 )h0 1

for 0 ≤ σ1 < 12 and σ0 ≥ 0. (Here, h = h0 h21 with h0 squarefree.) The same estimate holds for the integral above involving Nβ . Next, we observe that 1 dw Mβ (s0 , s1 , j, h) + Mβ (s0 , s1 + w, j, h)IU4 (uw ) 2πi (c1 ) w gives the contribution of squares in IU4 (A∗ (u)) (that is, those n for which n2 h = b2 ). This proves the ﬁrst statement of the lemma. For the second part, consider the sum a(n) β S = IU3 ( g˜ (nh, 2s0 ; a)). n1+s1 Y n>u (n2 ,j)=1

§6 The statements A± (α) and C ± (α)

165

We shall estimate it using partial summation. Set

C(x) =

u
S = IU3 (

We have

∞

a(n) β g˜Y (nh, 2s0 ; a). 1 n 2 +s1

t− 2 dC(t)) = IU3 ( 1

u

1 2

∞

C(t)t−3/2 dt)

u

and this is equal to ⎛ ⎞ Y 1 1 D 1 2dw ⎜1 ⎟ |D|2so LDj 2 ( + s1 + w, f )dt ⎠ IU3 (uw− 2 ) ⎝ 2πi (c) Y 1 0<βD≤t h 2 1 − 2w D≡a(mod 8N )

provided σ1 +c > 1 and 0 < c < 12 . Now move the line of integration to Re(w) = c1 where 12 ≤ σ1 + c1 < 1. (This is possible because 12 < σ1 < 1.) Now apply Aβ (α) on the line c1 . The “main term” in Aβ (α) gives 1 1 1 1 2dw {Mβ (s0 , s1 + w − ) + Nβ (s0 , s1 + w − )}IU3 (uw− 2 ) . 2πi (c1 ) 2 2 1 − 2w The Mβ term gives the square terms in S. The Nβ term is zero if σ1 > 5/8. If σ1 ≤ 5/8 then it is (|s1 | + 1)1−2(σ1 +c1 − 2 ) U c1 − 2 Y 1

1

1 2 +2σ0

c +σ1 − 12

h01

d(h0 ).

The O term in Aβ (α) gives an amount which is 1 1 √ Y 1−(σ1 +c1 − 2 )+2σ0 U c1 − 2 { h(1 + |s0 | + |s0 − s1 |)2 × (log Y h)α log(h log Y ) + σ−3/4 (j)3 }. Choosing c1 =

1 2

− σ1 gives the result.

Next we consider the following statement which provides estimates for functions similar to A(u) and B(u) in which we restrict the sum to fundamental discriminants. 4 C β (α) : IX (

n≤u n2 h =b2 (n,j)=1

a(n) β g (nh, 2s0 ; a)) n1+s1 Y

Y 2 +2σ0 X 2 −σ1 √ 0 1 h(1 + |s0 | + |s0 − s1 |)2 (log Y h)α log(h log Y ) + σ−3/4 (j)3 2σ1 − 1 1

1

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

166

for a ≡ 1(mod 4), (ah, 2N ) = 1, (2N h, j) = 1 σ0 ≥ 0 and 0 < σ1 < 12 . Moreover, for X ≥ Y 3 IX (

n>u n2 h =b2 (n2 ,j)=1

Y

1 2 +2σ0

a(n) β g (nh, 2s0 ; a)) n1+s1 Y

√ 1 X 2 −σ1 { h(1 + |s0 | + |s0 − s1 |)2 (log Y h)α (log(h log Y )) + σ−3/4 (j)3 }

for a ≡ 1(mod 4), (ah, 2N ) = 1, σ0 ≥ 0 and 1 > σ1 > 12 . In the next result, we exhibit the relationship between Aβ (α) and C β (α). If Aβ (α) holds, then C β (α) holds.

Proposition 6.3

Proof. For the ﬁrst half of C β (α), we have ⎛ ⎞ ⎜ 4 ⎜ IX ⎜ ⎝ n≤u

⎟ a(n) β ⎟ g (nh, 2s ; a) ⎟ du 0 n1+s1 Y ⎠

n2 h =b2 (n,j)=1

⎛

⎞

⎜ 4 ⎜ = IX ⎜ ⎝ n≤u n2 h =b2 (n,j)=1

=

g 2 ≤Y (g,2N h)=1

a(n) 1 n1+s1 Y

Y 1

0<βD≤t D≡a(mod 8N )

D nh

|D|2s0

g 2 |D

⎛

⎜ 4 ⎜ μ(g)g 4s0 IX ⎜ ⎝

⎟ ⎟ μ(g)dt⎟ ⎠ ⎞

n≤u n2 h =b2 (n2 ,jg)=1

⎟ a(n) β 2 ⎟ g ˜ (nh, 2s ; a¯ g ) ⎟. 2 0 n1+s1 Y /g ⎠

To estimate this, we split the sum over g into two parts at ) Y V = . X Lemma 4.6, Rankin’s estimate and the Cauchy-Schwarz inequality imply that the ﬁrst part is 2σ0 √ Y g 4σ0 X 1−σ1 h(1 + |s0 (2s0 − 1)|) log+ (Xhg 2 /Y ) 2 g g≤V

and this is

) √ Y 2σ0 1−σ1 h(1 + |s0 (2s0 − 1)|)Y X (log h) X √ 1 1 h(1 + |s0 (2s0 − 1)|)Y 2 +2σ0 X 2 −σ1 (log h).

§6 The statements A± (α) and C ± (α)

167

For the second part, Lemma 6.2 implies that it is

√ Y >g>V

g

4σ0

Y g2

1−σ1 +2σ0

0 √ (1 + |s0 | + |s0 − s1 |)2 h(log Y h)α (log(h log Y )) + σ−3/4 (j)3 ) 2σ1 −1 Y 1−σ1 +2σ0 −1 Y (1 − 2σ1 ) X 0 √ (1 + |s0 | + |s0 − s1 |)2 h(log Y h)α log(h log Y ) + σ−3/4 (j)3 and this is Y

1 2 +2σ0

X 2 −σ1 (1 − 2σ1 )−1 1

√ (1 + |s0 | + |s0 − s1 |)2 h(log Y h)α log(h log Y ) + σ−3/4 (j)3 .

This proves the ﬁrst part of C β (α). For the second half, we can apply Lemma 6.2 directly to the entire range of the sum over g. Proposition 6.4

If C β (α) holds then Aβ ( 45 α) holds.

Proof. We proceed exactly as in the proof of Lemma 6.1, only in place of Lemma 5.2, we use C β (α) to estimate the contribution of the second set. For each ﬁxed value of δ, we will split the sum over n at an intermediate point uδ to be speciﬁed later. For the terms with n < uδ , we move the line of integration to a line Re(w) = −η1 , and bounded away from zero, with 1 σ1 < η1 < σ1 + . 2 For those with n > uδ , we move the line of integration to a line Re(w) = −η2 , with σ1 + 12 < η2 < 1 if σ1 < 1/8 1 < η2 < 1 + σ1 if 1/8 < σ1 < 12 . When η2 > 1, we also pick up a residue R from the pole at w = −1. (We remark that the point 1/8 was only used as a convenient breaking point. The essential diﬀerence between the two cases is whether σ1 is bounded away from zero or not.) Let us set Z = Y /δ 2 and U = Uδ = Z(log(Zh + 100))γ

168

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

for some 2 > γ > 0 to be speciﬁed later. By the ﬁrst part of C β (α) and the mean value theorem, there exists V = Vδ in the interval (Uδ , 2Uδ ) such that a ¯(n) IV3 ( g β (ndh, 2(s0 − s1 − w); aδ¯2 )) 1−s1 −w Z n n
−(η −σ )

Z 2 +2(σ0 −σ1 +η1 ) Uδ2 1 1 √ 1 × dh(1 + |s0 − s1 − w| + |s0 |)2 (2(η1 − σ1 ) − 1) ( × (log Y dh)α log(dh log Y ) + σ−3/4 (j)3 1

Moreover, as V ≥ Z a ¯(n) IV3 ( g β (ndh, 2(s0 − s1 − w); aδ¯2 )) 1−s1 −w Z n n>u n2 h =b2 (n2 ,j)=1 1

−(η −σ )

Z 2 +2(σ0 −σ1 +η2 ) Uδ2 2 1 √ × dh(1 + |s0 − s1 − w| + |s0 |)2 (log Y dh)α log(dh log Y ) + σ−3/4 (j)3 . 1

For each δ we apply the operator IV3δ with respect to the variable uδ to both sides of the basic equation. Then, the left hand side and the residue do not change (as they are independent of uδ ). And the n < uδ summed over all δ contribute an amount d 12 d(d) 1 1 4σ0 δ log d|Γ(−η1 )|(X/d)−η1 Z 2 +2(σ0 −σ1 +η1 ) U 2 −(−σ1 +η1 ) 1+σ1 d δ 2 ≤Y d|δ 2 √ 0 1 2 α 3 dh(|s | + |s − s | + 1) (log Zdh) log(h log Y ) + σ (j) 0 0 1 −3/4 (η1 − σ1 − 12 ) X −η1 (Y (log Y h)γ ) 2 +σ1 −η1 d 12 +η1 d(d) 1 −2+2σ1 −2η1 × δ d1+σ1 (η1 − σ1 − 12 ) δ2 ≤Y d|δ 2 √ (|s0 | + |s0 − s1 | + 1)2 dh(log Y h)α log(h log Y ) + σ−3/4 (j)3 Y

1 2 +2(σ0 −σ1 +η1 )

1

The above may be simpliﬁed to Y 1+2σ0 −σ1 +η1 X −η1 (log Y h)γ( 2 +σ1 −η1 ) 1

1 (η1 − σ1 − 12 )

√ (|s0 | + |s0 − s1 + η1 | + 1)2 h(log Y h)α log(h log Y ) + σ−3/4 (j)3 δ −2+2σ1 −2η1 d(d)dη1 −σ1 log d. δ 2 ≤Y

d|δ 2

§6 The statements A± (α) and C ± (α)

169

This is seen to be def

E2 = Y 1+2σ0 −σ1

Y X

η1

(logY h)γ( 2 +σ1 −η1 ) 1

√ 1 2 α 3 (|s | + |s − s + η | + 1) h(logY h) log(hlogY ) + σ (j) 0 0 1 1 −3/4 (η1 − σ1 − 12 ) Now, using the second part of C β (α), we see that the sum over δ of the terms with n > uδ contribute an amount η2 1 Y def 1+2σ −σ 0 1 E3 = Y (log Y h)γ( 2 +σ1 −η2 ) X √ 0 h(|s0 | + |s0 − s1 + η2 | + 1)2 (log Y h)α log(h log Y ) + σ−3/4 (j)3 . We now choose X so that Y 2σ0 X 1−σ1 = Y 1+2σ0 −σ1 This is ensured if γ =

1 2

Y X

η1

(log Y h)α+γ( 2 +σ1 −η1 ) . 1

α + 2η1 − 2σ1

and X = Y (log Y h)γ . With this choice, (and with E1 as in Lemma 6.1), α(1−σ1 )

E1 + E2 Y 1+2σ0 −σ1 (log Y h) 2 +2η1 −2σ1 0 √ 1 2 3 (|s | + |s − s + η | + 1) h log(h log Y ) + σ (j) . 0 0 1 1 −3/4 (η1 − σ1 − 12 ) 1

Simplifying, we see that if η1 = σ1 + 12 − ν, with ν = 1/8 (say) then the exponent of log Y h is α α (1 − σ1 )( 1 ) = (1 − σ1 ) 3 + 2η − 2σ − 2ν 1 1 2 2 and this is ≤

4 α. 5

Thus, E1 + E2 Y 1+2σ0 −σ1 (log Y h) 5 α 4

1

{(|s0 | + |s0 − s1 | + 1)2 h 2 (log(h log Y )) + σ−3/4 (j)3 }.

170

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

The error term E3 gives a similar quantity with the exponent of log Y h equal to 1 α α − (2η2 − σ1 − ) . 2 2η1 − 2σ1 + 12 We choose

η2 =

σ1 + 5/8 if σ1 < 1/8 17/16 if 1/8 < σ1 < 1/2.

Then, the above is ≤

4 α. 5

Hence, E3 Y 1+2σ0 −σ1 {(|s0 | + |s0 − s1 | + 1)2 h 2 (log Y h) 5 α log(h log Y ) + σ−3/4 (j)3 }. 1

4

The second part of C β (α) easily implies a similar estimate holds for R. We conclude that E1 + E2 + E3 + R Y 1+2σ0 −σ1 {(|s0 | + |s0 − s1 | + 1)2 h 2 1

4

(log Y h) 5 α log(h log Y ) + σ−3/4 (j)3 }. and so Aβ ( 45 α) holds.

§7 Proof of main result As a consequence of the results of the previous sections, we deduce the following crucial result. Theorem 7.1.

Let β = ±1, and λ > 0. Then C β (λ) holds.

Proof. From Lemma 6.1, we know that Aβ (2) is true. From Proposition 6.3, we deduce that C β (2) holds. Suppose we have established C β (α). By Proposition 6.4, β 4α we deduce that Aβ ( 4α 5 ) is true. By Proposition 6.3, we deduce that C ( 5 ) holds. Iterating this proves the result. Remark. We see also that given λ > 0, the statement Aβ (λ) holds. Proof of Theorem 1.2. Let a ≡ 1(mod 4), (a, 2N ) = 1. Consider the integral 1 Y

Y 1

|D|≤t D≡a(mod 8N )

1 2πi

LD (1 + w, f )X w Γ(w)dwdt. (2)

§7 Proof of main result

On the one hand, it is

⎛ 1 a(n) ⎜ Y ⎝ Y n 1

|D|≤t D≡a(mod 8N )

D n

171

⎞ ⎟ dt⎠ exp(−n/X).

Let us estimate the contribution of those n for which n2 is not a square. We apply Lemma 5.4 and ﬁnd that the sum is X(log X)−ρ . Thus, if we choose X ≤ Y (log Y )ν with 0 < ν < ρ, we see that this is Y (log Y )−κ for some κ > 0. Now, the contribution of those values of n for which n2 is a square is seen to be equal to ⎛ ⎞ Y ⎟ a(n) a ⎜ ⎜1 ⎟ 1 dt⎟ exp(−n/X) ⎜ n n ⎝ Y ⎠ 1 1 2 |D|≤t n2 =b

D≡a(mod 8N ) (D,n)=1

and from Lemma 3.5, we know that this is = C(0, 0, 1, 1)Y + O(Y X −1/8 ) where we recall that

1 a(n1 n22 ) a φ(n2 ) C(0, 0, 1, 1) = . 8N n1 n22 n1 n2

(Note that we are summing over both positive and negative values of D here.) On the other hand, moving the line of integration to the line Re(w) = −η, 0 < η < 1, we get a residue at w = 0 equal to 1 Y LD (1, f )dt Y 1 |D|≤t D≡a(mod 8N )

and an integral a μ a ˜(d, f ) 1 ¯(n) 2 ω (a) ¯(δ ) × 1−w N d 2πi n (−η) 2 δ2 ≤Y d|δ

⎛ ⎜ δ2 ×⎜ ⎝Y ×

(δ,2N )=1

1

Y /δ

2

|D0 |−2w (sgnD0 )

|D0 |≤t D0 δ2 ≡a(mod 8N )

Γ(1 − w) (X/A2 d)w Γ(w)dw Γ(1 + w)

D0 nd

⎞ ⎟ dt⎟ ⎠×

(∗)

172

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

which has been simpliﬁed in the (by now) familiar way. Let us write it as Σ1 + Σ2 where in Σ2 we only include terms with n2 d = b2 and Σ1 contains the remaining terms. First, consider the contribution of the terms with n2 d = b2 . We have |D0 |−2w (sgnD0 ) |D0 |≤t/δ2 D0 δ2 ≡a(mod 8N ) (D0 ,n2 d)=1

=

1 ¯ ¯2 (n d) ψ(aδ ) μ(m)χ0 2 (m2 )ψ(m2 )m−4w φ(8N ) √ ψ m≤ t/δ (n d) × χ0 2 (h)ψ(h)|h|−2w (sgn h) |h|≤t/δ 2 m2

and it is not hard to check that the above is 2η+ 12 t d(8N n2 d) 2 . δ Inserting this into the big expression above, we see that 2η+ 2 d 12 d(d) X −η Y |Γ(−η)| d(8N d). 2 d d δ 2 2 1

Σ2

δ ≤Y d|δ

Choosing η = 1/4 for example, we see that Σ2 Y X −1/4 . We now estimate Σ1 . By C β (λ) we know that for 0 < η1 < −η1 , there exists U ∈ (X, 2X) such that ⎛ ⎞ 2 2 Y /δ a ¯(n) β ⎜δ ⎟ IU3 ⎝ f (nd, −2w; aδ¯2 )dt⎠ 1−w t Y 1 n n≤u

1 2

n2 d =b2

(2η1 − 1)−1 d 2 (|w| + 1)2 (Y /δ 2 ) 2 +2η1 X 2 −η1 ' ( (log Y d/δ 2 )λ (log d log Y /δ 2 ) . 1

1

1

As d ≤ δ 2 ≤ Y we see that the above is d 2 (|w| + 1)2 (Y /δ 2 ) 2 +2η1 X 2 −η1 (log Y )λ log(d log Y ). 1

1

1

and Re(w) =

Exercises

Also, for

173

< η2 < 1, and Re(w) = −η2 , 2X ≥ U ≥ X ≥ Y ,

1 2

⎛ ⎜ δ2 IU3 ⎝ Y

⎞

Y /δ 2 1

n>u n2 d =b2

a ¯(n) β ⎟ f (nd, −2w; aδ¯2 )dt⎠ n1−w t

d 2 (|w| + 1)2 (Y /δ 2 ) 2 +2η2 X 2 −η2 (log Y )λ log(d log Y ). 1

1

1

We shall choose η1 and η2 bounded away from 0 and 1 respectively. Using the estimate 1 |˜ μ(d, f )| d(d)d 2 we deduce that Σ1

2

Y

1 2 +2ηi

X 2 −2ηi (log Y )λ (log log Y ) × 1

δ 2 ≤Y

i=1

d 12 d(d)

1 δ 1+4ηi

d

d|δ 2

1

dηi d 2 log d

and this is 1 2

1 2

X Y (log Y ) (log log Y ) λ

d(δ 2 )2 δ

δ

log δ

Y Xδ

2η1

+

Y Xδ

2η2 * .

Simplifying, we see that if we set X = Y (log Y )ν then Σ1 Y (log Y )ν/2+λ−2η1 ν (log log Y ). Now, if we choose λ = ν/10 for any 0 < ν < ρ, and η1 = 2/5(say) then Σ1 Y (log Y )−ν/5 (log log Y ). Together with our earlier estimate of Σ2 , this proves the main theorem.

Exercises 1.

Deduce Theorem 1.1 from Theorem 1.2 by showing that if there are only a ﬁnite number of fundamental discriminants D with LD (1, f ) = 0 then |D|≤Y D≡a(mod 8N )

LD (1, f )

√

Y (log Y ).

174

2.

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

For a prime p not dividing N , let us deﬁne

Bp (s) =

∞ a(p2j ) j=0

p2js

and Cp (s) =

∞ a(p2j+1 ) j=0

p(2j+1)s

.

Then, we have the identities Bp (s) =

αp2 1 − 2s p

−1

βp2 1 − 2s p

and

−1

(p)p 1 − 2s p

αp2 Cp (s) = p−s (1 + (p)p) 1 − 2s p

−1

−1

βp2 1 − 2s p

1−

1 p4s−2

−1

1

where a(p) = αp + βp and |αp | = |βp | = p 2 . 3.

For d not dividing N , deﬁne the function

Fd (s, f ) =

∞

a(dn2 )(dn2 )−s .

n=1

Then,

Fd0 (s, f ) =

∞ a(n2 ) n2s n=1

Bp (s)−1 Cp (s).

p|d0

Using the identity ∞ a(n2 ) = Bp (s) = L(2s, Sym2 )ζ(4s − 2)−1 , 2s n p n=1

deduce that Fd (s, f ) = L(2s, Sym2 )ζ(4s − 2)−1 −1 (p)p 1 1 + (p)p 1 − 2s 1 − 4s−2 . p p ps p|d

All of these formal manipulations are valid for Re(s) > 1.

Exercises

4.

175

For (d, 2N D) = 1, consider

F˜d (s, f, D) =

m2 d=b2 (m,D)=1

a(m) m1+s

a m1

φ(m2 d) . m2 d

Show that F˜d (s, f, D) is deﬁned for Re(s) ≥ 0 and has an analytic continuation for Re(s) > −1/4. Moreover, for Re(s) ≥ −3/16 (say), it satisﬁes the estimate of Lemma 3.3: in the notation of that lemma, 1 ν(d ) F˜d (s, f, D) c1 0 (d0 )−σ− 2 |L(2s + 2, Sym2 )ζ(2 + 4s)−1 |

(1 +

p|D

5.

1 3 ) . p1+2σ

Prove that if f has trivial character, ∞ a(n) −2πn/√N e . n n=1

L(1, f ) = 2 Deduce that

LD (1, f ) Y

|D|≤Y

where the sum ranges over D which are prime to N . State and prove a similar result for forms f with non-trivial character. 6.

Let

∞

A(X, χ) =

a(n)χ(n)n−1 e−2πn/X .

n=1

Using the large sieve inequality, prove the estimate ∗ |A(X, χ)|4 (X + Y )2+ . D≤Y χ mod D

Here, the sum over χ ranges over primitive characters. Deduce the estimate of Iwaniec [I] |LD (1, f )|4 Y 2+ . |D|≤Y

*7. Prove the asymptotic formulae for averages of higher derivatives: 1 Y

Y 1

|D|≤t

for j ≥ 0 and constants cj .

(j)

LD (1, f )dt ∼ cj Y (log Y )j

176

Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions

References [BC] P. T. Bateman and S. Chowla, Averages of character sums, Proc. Amer. Math. Soc., 1 (1950), 781–787. [FH] S. Friedberg and J. Hoﬀstein, Non-vanishing theorems for automorphic Lfunctions on GL(2), Annals of Math., 142 (1995), 385–423. [FS] A. S. Fainleib and O. Saparnijazov, Dispersion of real character sums and the moments of L(1, χ), (Russian) Izv. Akad. Nauk. USSR, Ser. Fiz.-Mat. Nauk, 19 (1975), 24–29. [I]

H. Iwaniec, On the order of vanishing of modular L-functions at the critical point, S´em. de Th´eorie des Nombres Bordeaux, 2 (1990), 365–376.

[J]

M. Jutila, On character sums and class numbers, J. Number Theory, 5 (1973), 203–214.

[MV] H.L. Montgomery and R.C. Vaughan, Mean values of character sums, Canadian J. Math., 31 (1979), 476–487. [M] V. Kumar Murty, A non-vanishing theorem for quadratic twists of modular L-functions, preprint, 1991. [MM] M. Ram Murty and V. Kumar Murty, Mean values of derivatives of modular L-series, Annals of Math., 133 (1991), 447–475. [MS] V. Kumar Murty and T. Stefanicki, Non-vanishing of quadratic twists of Lfunctions attached to automorphic representations of GL(2) over Q, preprint, 1994. [Ra] R. Rankin, Sums of powers of cusp form coeﬃcients II, Math. Ann., 272 (1985), 593–600. [Ro] D. Rohrlich, Non-vanishing of L-functions for GL2 , Invent. Math., 97 (1989), 381–403. [Sh] G. Shimura, On the periods of modular forms, Math. Ann., 229 (1977), 211– 221. [W1] J. Waldspurger, Sur les valeurs de certaines fonctions L automorphe en leur centre de sym´etrie, Comp. Math., 54 (1985), 173–242. [W2] J. Waldspurger, Correspondances de Shimura et quaternions, Forum Math., 3 (1991), 219–307.

Chapter 7 Selberg’s Conjectures

§1 Selberg’s class of Dirichlet series In a fundamental paper [S], Selberg deﬁned a general class of Dirichlet series and formulated basic conjectures concerning them. Selberg’s conjectures concern Dirichlet series, which admit analytic continuations, Euler products and functional equations. The Riemann zeta function is the simplest example of a function in the family S of functions F (s) of a complex variable s satisfying the following properties: (i) (Dirichlet series) For Re(s) > 1, F (s) =

∞ an , ns n=1

where a1 = 1 and we will write an (F ) = an for the coeﬃcients of the Dirichlet series; (ii) (Analytic continuation) F (s) extends to a meromorphic function so that for some integer m ≥ 0, (s − 1)m F (s) is an entire function of ﬁnite order; (iii) (Functional equation) There are numbers Q > 0, αi > 0, Re(ri ) ≥ 0, so that Φ(s) = Qs

d

Γ(αi s + ri )F (s)

i=1

satisﬁes Φ(s) = wΦ(1 − s¯) for some complex number w with |w| = 1; (iv) (Euler product) F (s) = Fp (s) p

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_8, © Springer Basel AG 1997

177

178

Chapter 7 Selberg’s Conjectures

where

∞ bpk Fp (s) = exp pks

k=1

kθ

where bpk = O(p ) for some θ < 1/2, where p runs over prime numbers. (v) (Ramanujan hypothesis) an = O(n ) for any ﬁxed > 0. Note that the family S is multiplicatively closed, and so is a multiplicative monoid. All known examples of elements in S are automorphic L-functions. In all of these cases, Fp (s) is an inverse of a polynomial in p−s of bounded degree. Selberg [S] introduced this family to study the value distribution of ﬁnite linear combinations of Dirichlet series with Euler products and functional equations. For this purpose, he introduced the important concept of a primitive function and made signiﬁcant conjectures about them. A function F ∈ S is called primitive if the equation F = F1 F2 with F1 , F2 ∈ S implies F = F1 or F = F2 . As we shall see below, one of the most serious consequences of the Selberg conjectures is that S has unique factorization into primitive elements. It is not diﬃcult to show that every element of S can be factored into primitive elements. This is a consequence of an old theorem of Bochner [B], though Selberg [S] and more recently Conrey and Ghosh [CG] seem to have found it independently. Selberg conjectures: Conjecture A: For all F ∈ S, there exists a positive integer nF such that |ap (F )|2 = nF log log x + O(1). p

p≤x

In Proposition 2.5, we shall describe nF more explicitly. Conjecture B: (i) for any primitive function F , nF = 1 so that |ap (F )|2 = log log x + O(1); p

p≤x

(ii) for two distinct primitive functions F and F , ap (F )ap (F ) = O(1). p

p≤x

Thus, in some sense, the primitive functions form an orthonormal system.

§1 Selberg’s class of Dirichlet series

179

In his paper [S], Selberg investigates the consequences of his conjectures to the value distribution of log F (σ + it) for σ = 1/2 or σ very near to 1/2. Selberg also conjectures the analogue of the Riemann hypothesis for the functions F ∈ S. It is not diﬃcult to see that Conjecture B implies Conjecture A. By Proposition 2.4 below, Conjecture B also implies that the factorization into primitives in S is unique. It seems central, therefore, to classify the primitive functions. To this end, it is natural to deﬁne the dimension of F as dim F = 2αF where αF =

d

αi .

i=1

By Proposition 2.2 below, this concept is well-deﬁned. Selberg conjectures that the dimension of F is always a non-negative integer. This question was previously raised by Vign´eras [V]. Bochner’s work can be used to classify primitive functions of dimension one in the case α1 = 1/2. They are (after a suitable translation) the classical zeta function of Riemann and the classical L-functions of Dirichlet. If the αi are rational numbers, then this is also a complete list of (primitive) functions of dimension one (see [Mu]). We will show that Conjecture B implies Artin’s conjecture concerning the holomorphy of non-abelian L-series attached to irreducible Galois representations. More precisely, let k be an algebraic number ﬁeld and K/k a ﬁnite Galois extension with group G. Let ρ be an irreducible representation on the n-dimensional complex vector space V . As explained in Chapter 2, for each prime ideal p of k, let Vp be the subspace of V ﬁxed by the inertia group Ip of p. Set Lp (s, ρ) = det(1 − ρ(σp )Np−s Vp )−1 where σp is the Frobenius automorphism of the prime ideal p of k, N is the absolute norm from k to Q. Deﬁne L(s, ρ; K/k) =

Lp (s, ρ).

p

(Sometimes, we write L(s, ρ) if the ﬁeld extension is clear. Since L(s, ρ) depends only on the character χ of ρ, we will also sometimes write L(s, χ) or L(s, χ, K/k) for L(s, ρ, K/k).) Clearly, the Artin L-function L(s, ρ) is a product of L-functions attached to irreducible constituents of ρ. We have: Artin’s conjecture: If ρ is irreducible = 1, then L(s, ρ, K/k) extends to an entire function of s.

180

Chapter 7 Selberg’s Conjectures

§2 Basic consequences We record in this section the results of Bochner [B], Selberg [S] and Conrey-Ghosh [CG]. Proposition 2.1

(Bochner [B]) If F ∈ S, and αF > 0, then αF ≥ 1/2.

Remark. In their paper, Conrey and Ghosh [CG] give a simple proof of this and also treat the case αF = 0. They prove that the constraint bn = O(nθ ) for some θ < 1/2 implies there is no element in S with αF = 0 except the constant function 1. Proposition 2.2 (Selberg [S]) Let NF (T ) be the number of zeroes ρ = β + iγ of F (s) satisfying 0 < γ ≤ T . Then, αF NF (T ) = T (log T + c) + SF (T ) + O(1), π where c is a constant and SF (T ) = O(log T ). If F = F1 F2 , then clearly NF (T ) = NF1 (T )+NF2 (T ) so that αF = αF1 +αF2 . Thus, if F is such that αF < 1, then F is necessarily primitive. The following is now immediate. Proposition 2.3 (Conrey-Ghosh [CG]) Every F ∈ S has a factorization into primitive functions. Proof. If F is not primitive, then F = F1 F2 and by the above, αF = αF1 + αF2 . By Proposition 2.1, each of αF1 , αF2 is strictly less than αF . Continuing this process, we ﬁnd that the process terminates because Proposition 2.1 implies the number of factors is ≤ 2αF . This completes the proof. We see immediately that the Riemann zeta function and the classical Dirichlet functions L(s, χ) with χ a primitive character are primitive in the sense of Selberg. Indeed, the Γ-factor appearing in the functional equation is Γ(s/2) or Γ((s + 1)/2) and the result is now clear from Proposition 2.1. Conjecture B forces the factorization in Proposition 2.3 to be unique. Indeed, suppose that F had two factorizations into primitive functions: F = F1 . . . Fr = G1 . . . Gt where F1 , . . . , Fr , G1 , . . . , Gt are primitive functions. Without loss of generality, we may suppose that no Gi is an F1 . But then, ap (F1 ) + · · · + ap (Fr ) = ap (G1 ) + · · · + ap (Gt ) so that ap (F1 )(ap (F1 ) + · · · + ap (Fr )) ap (F1 )(ap (G1 ) + · · · + ap (Gt )) = . p p p≤x

p≤x

As x → ∞, the left hand side tends to inﬁnity, whereas the right hand side is bounded since no Gi is an F1 . This contradiction proves:

§3 Artin’s conjecture and Selberg’s conjectures

181

Proposition 2.4 (Conrey-Ghosh [CG]) Conjecture B implies that every F ∈ S has a unique factorization into primitive functions. In the next proposition, we describe nF . Proposition 2.5 (a) If F ∈ S and F = F1e1 · · · Frer is a factorization into primitive functions, then Conjecture B implies nF = e21 + · · · + e2r . (b) Conjecture B implies that F is primitive if and only if nF = 1. Proof. We have ap (F ) =

r

ei ap (Fi )

i=1

and so computing the asymptotic behaviour of |ap (F )|2 p

p≤x

and using Conjecture B yields the result.

§3 Artin’s conjecture and Selberg’s conjectures We now discuss Artin’s conjecture in the context of Selberg’s conjectures. We begin by showing that Selberg’s conjectures imply the holomorphy of non-abelian Lfunctions. Let χ be the character of the representation ρ. We will write L(s, χ, K/k) for L(s, ρ, K/k). Theorem 3.1

Conjecture B implies Artin’s conjecture.

˜ be the normal closure Proof. We adhere to the notation introduced above. Let K ˜ ˜ of K over Q. Then, K/k is Galois, as well as K/Q, and χ can be thought of as a ˜ character χ ˜ of Gal(K/k). By the property of Artin L-functions (see [A]), ˜ L(s, χ, ˜ K/k) = L(s, χ, K/k). ˜ ˜ Moreover, if Ind χ ˜ denotes the induction of χ ˜ from Gal(K/k) to Gal(K/Q), then ˜ ˜ L(s, χ, ˜ K/k) = L(s, Ind χ, ˜ K/Q), by the invariance of Artin L-functions under induction. Hence, we can write L(s, χ, K/k) =

φ

m(φ) ˜ L(s, φ, K/Q)

182

Chapter 7 Selberg’s Conjectures

˜ where the product is over irreducible characters φ of Gal(K/Q) and m(φ) are non˜ negative integers. To prove Artin’s conjecture, it suﬃces to show that L(s, φ, K/Q) ˜ is entire for each irreducible character φ of Gal(K/Q). By Brauer’s induction theorem and the Artin reciprocity law, we can write L(s, χ1 ) ˜ L(s, φ, K/Q) = L(s, χ2 ) ˜ where χ1 and χ2 are characters of Gal(K/Q) and L(s, χ1 ), L(s, χ2 ) are entire functions, being products of Hecke L-functions. Thus, they belong to S and hence, by Proposition 2.4, have a unique factorization into primitive functions. We can therefore write m L(s, φ) = Fi (s)ei , ei ∈ Z. i=1

By comparing the p-th Dirichlet coeﬃcient of both sides, we get φ(p) =

m

ei ap (Fi )

i=1

from which we obtain m 2 |φ(p)|2 1 = ei ap (Fi ) . p p i=1

p≤x

p≤x

Conjecture B gives the asymptotic behaviour of the right hand side: m |φ(p)|2 = e2i log log x + O(1). p i=1 p≤x

Decompose the sum on the left hand side according to the conjugacy class C of ˜ Gal(K/Q) to which the Frobenius automorphism σp belongs: |φ(p)|2 1 = |φ(gC )|2 , p p p≤x

p≤x

C

σp ∈C

where gC is any element of C. By the Chebotarev density theorem 1 |C| = log log x + O(1). p |G| p≤x σp ∈C

Hence,

|φ(p)|2 |C| = |φ(gC )|2 log log x + O(1). p |G|

p≤x

C

§3 Artin’s conjecture and Selberg’s conjectures

183

But φ is irreducible and so, |C| C

|G|

|φ(gC )|2 = (φ, φ) = 1.

Therefore, the left hand side is log log x + O(1) as x→∞. We deduce that

m

e2i = 1

i=1

from which follows m = 1 and e1 = ±1. Thus, L(s, φ) = F (s) or 1/F (s), where F (s) is primitive and analytic everywhere except possibly at s = 1. However, L(s, φ) has trivial zeroes and so the latter possibility cannot arise. We conclude that L(s, φ) = F (s) is primitive and entire. Corollary 3.2 Let K/Q be Galois and let χ be an irreducible character of Gal(K/Q). Conjecture B implies that L(s, χ) is primitive. Proof. This is evident from the last line in the proof of the previous lemma. Or we can derive it as follows. By the previous theorem, F = L(s, χ) ∈ S and by the Chebotarev density theorem, nF = 1. The result now follows from Proposition 2.5 (b). Of course, Dedekind’s conjecture that the zeta function of a number ﬁeld is always divisible by the Riemann zeta function follows from Artin’s conjecture. However, it is rather interesting to note that the unique factorisation conjecture is ˜ its Galois closure, and suﬃcient to deduce this. Indeed, if K is a number ﬁeld, K ζK (s) is the Dedekind zeta function of K, then ζK˜ (s)/ζK (s) = F (s) is entire by the Aramata-Brauer theorem. By the same theorem, ζK˜ (s)/ζ(s) = G(s) is also entire. Since ζ(s) is primitive, it appears as a primitive factor in ζK˜ (s) = ζ(s)G(s). Since ζK˜ (s) = ζK (s)F (s) and F is entire, ζ(s) must appear in the unique factorization of ζK (s). This is Dedekind’s conjecture. The Selberg conjectures refer to the analytic behaviour of Dirichlet series at the edge of the critical strip. (There are other conjectures relating special values of Dirichlet series inside the critical strip, namely the Deligne conjectures and the Birch-Swinnerton-Dyer conjectures to cite speciﬁc instances.) A consequence of

184

Chapter 7 Selberg’s Conjectures

Conjecture B is that if F is any primitive function which is not the Riemann zeta function, then ap (F ) = O(1). p1+it p≤x

In particular, no primitive function should vanish on σ = 1. Thus, Selberg’s conjectures imply that no element of S vanishes on Re(s) = 1. Most likely, the Selberg class consists only of automorphic L-functions in which case there is a general non-vanishing result of Jacquet and Shalika [JS]. Many of our interesting consequences, notably the Artin conjecture, utilised the unique factorization conjecture. Perhaps this can be attacked by other means. Indeed, given r distinct primitive functions F1 , . . . , Fr , one would expect the existence of complex numbers s1 , . . . , sr such that Fi (sj ) = 0 if and only if i = j. If this were the case, then clearly, the unique factorization conjecture is true. The classiﬁcation of primitive functions is a fundamental problem. From the work of Bochner and Vign´eras, it follows that if F has dimension 1 and all the αi are rational numbers, then d = 1 and α1 = 1/2. It then follows, essentially from the same works, that F must either be the Riemann zeta function or a purely imaginary translate of a classical Dirichlet L-function attached to a non-trivial primitive character. It is shown in [Mu] that if π is an irreducible cuspidal automorphic representation of GL2 (AQ )), then L(s, π) is primitive if the Ramanujan conjecture is true. In particular, the L-function attached to a normalised holomorphic cuspidal Hecke eigenform is a primitive function which is in Selberg’s class (by Deligne’s theorem).

Exercises 1.

If F and G ∈ S, deﬁne F ×G=

p

Hp (s)

where Hp (s) = exp

∞

−ks

kbpk (F )bpk (G)p

.

k=1

Deﬁne F ∈ S to be simple if F × F extends to an analytic function for Re(s) ≥ 1/2 except for a simple pole at s = 1. Show that if F ∈ S is simple and entire, then F (1 + it) = 0 for all t ∈ R. (Hint: a simple function has at most a simple pole at s = 1.) 2.

If F ∈ S is simple and F × F has analytic continuation to Re(s) = 1, then show that F (1 + it) = 0 for all t ∈ R.

3.

Let F ∈ S and assume that an (F ) ≥ 0. If F = 1, show that Selberg’s conjectures imply that ζ|F .

References

185

4.

Show that the Dedekind zeta function of Q(21/3 ) factors into a product of two distinct primitive functions, one of which is the Riemann zeta function and the other of dimension 2.

5.

If F, G ∈ S are such that ap (F ) = ap (G) and ap2 (F ) = ap2 (G) for all but ﬁnitely many primes p, show that F = G.

6.

For F ∈ S, denote by ZF (T ) the multiset of zeros of F (s) in the region Re(s) ≥ 1/2 and | Im(s)| ≤ T. For F, G ∈ S, suppose the symmetric diﬀerence satisﬁes |ZF (T )ΔZG (T )| = o(T ) as T →∞. Show that F = G.

Exercises 5 and 6 are from [MM].

References [A] E. Artin, Collected papers, Springer-Verlag, New York-Berlin, 1982. [B]

S. Bochner, On Riemann’s functional equation with multiple gamma factors, Annals of Mathematics, 67 (1958) 29–41.

[CG] B. Conrey and A. Ghosh, On the Selberg class of Dirichlet series, Duke Math. Journal, 72 No. 3, (1993) 673–693. [JS] H. Jacquet and J.A. Shalika, A non-vanishing theorem for zeta functions of GLn , Inventiones Math., 38 (1976) p. 1–16. [M] M. Ram Murty, A motivated introduction to the Langlands program, in Advances in Number Theory (eds. F. Gouvea and N. Yui), pp. 37–66, Clarendon Press, Oxford, 1993. [M1] M. Ram Murty, Selberg’s conjectures and Artin L-functions, Bulletin of the Amer. Math. Soc., 31 (1) (1994) p. 1–14. [MM] M. Ram Murty and V. Kumar Murty, Strong multiplicity one for Selberg’s class, C.R. Acad. Sci. Paris, 319 (Series I) (1994) p. 315–320. [Mu] M. Ram Murty, Selberg conjectures and Artin L-functions, II, in Current Trends in Mathematics and Physics, A tribute to Harish-Chandra, (edited by S. D. Adhikari), Narosa Publishing House, 1995. [S]

A. Selberg, Old and new conjectures and results about a class of Dirichlet series, Collected Papers, Volume II, pp. 47–63, Springer-Verlag.

[V] M.F. Vign´eras, Facteurs gamma et ´equations fonctionelles, Lecture notes in mathematics, 627 Springer-Verlag, Berlin-New York, 1976.

Chapter 8 Suggestions for Further Reading

In [Iw], Iwaniec considers a weighted sum

μ2 (D)LD (1, f )F (D/Y )

D

where F is a smooth function, compactly supported in R+ with positive mean value. He establishes an asymptotic formula for it of the form αY log Y + βY + O(Y

13 14 +

)

with some constants α = 0 and β which depend on f and the test function F . From this, he is able to deduce that given any > 0, and Y > C( ), the number 2 of fundamental discriminants of D ≤ Y such that LD (1, f ) = 0 is at least Y 3 − . He is able to do this by establishing the upper bound

|LD (1, f )|4 Y 2+

d≤Y

and then using the Cauchy-Schwarz inequality: |

μ2 (D)LD (1, f )F (D/Y )| ≤ #{D ≤ Y : LD (1, f ) = 0}3/4 { |LD (1, f )|4 }1/4 .

D

D

The method is capable of generalization and extension. For instance, see Murty and Stefanicki [MS] and Stefanicki [St], as well as [PP]. In [GV], Goldfeld and Viola formulate the following conjecture. Let us suppose that we have a Dirichlet series ∞ an L1 (s) = ns n=1

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_9, © Springer Basel AG 1997

187

188

Chapter 8 Suggestions for Further Reading

which converges absolutely in some half-plane. Deﬁne L2 (s) =

∞ an2 . ns n=1

Let N are R be ﬁxed integers. Let (D, N R) = 1, with D a fundamental discriminant. For any real character χ mod |D|, we assume that L1 (s, χ) given by L1 (s, χ) =

∞ an χ(n) ns n=1

extends to an entire function and satisﬁes a functional equation of the following type: Asχ Tχ (s)L1 (s, χ) = wχ Ak−s ¯ χ Tχ (k − s)L1 (k − s, χ) where Aχ > 0,

k > 0,

wχ = w (D)χ(R)

with |w| = 1, and a primitive Dirichlet character mod N , and where Asχ Tχ (s)L1 (s, χ) is an entire function of s. Here Tχ (s) denotes a product of gamma factors . Tχ (s) =

J+ Γ(s .i=1 J− i=1 Γ(s

+ αi+ )

if D > 0

αi− )

if D < 0

+

for positive integers J + , J − , and real numbers αi+ , αi− > −k/2 depending only on L1 (s). For convenience, we will write Tχ (s) =

J

Γ(s + αi )

i=1

it being clear that J and the αi depend on the sign of D. We also assume that Aχ = f (|D|) where f (x) is a non-decreasing C 1 function of x ≥ 1. We also suppose that L2 (s) has an Euler product: L2 (s) =

2λ ' (−δ 1 − γp,i p−s i p i=1

Chapter 8 Suggestions for Further Reading

189

and a pole of order ρ ≥ 0 at s = k. Under these conditions, Goldfeld and Viola conjecture that as D→∞, D L1 k/2, · |D|≤X

= (1 + o(1))

2λ 1 ρ [z L2 (k + 2z)]z=0 (1 + wχ ) logρ f (|D|) (1 − γp,i p−k )δi , ρ! i=1 |D|≤X

p|D

where the dash on the sum means that we sum over fundamental discriminants. In Chapters 5 and 6, we have established special cases of this conjecture. It is natural to consider this conjecture for the quadratic twists of a ﬁxed automorphic L-function on GLr . (See [Mu] for an introduction to terminology and notation.) It is then possible to prove upper bounds of general r. This has recently been done by Y. Zhang [Z, pp. 54–60]. She obtains that if π is an irreducible cuspidal automorphic representation of GLr (AQ ), where AQ denotes the adele ring of the rational number ﬁeld, and π has trivial central character, then D L(1/2, π ⊗ ) X (r+1)/2 log2 X · |D|≤X

for r > 1. In case r = 1 or r = 2, one can establish the Goldfeld-Viola conjecture. Indeed, in case r = 1, the method of Chapter 5 can be extended to grossencharacters and we leave it as an exercise for the reader. The case of r = 2 has been elucidated in Chapter 6 for holomorphic cuspidal automorphic representations. In the non-holomorphic case, this was dealt with in [MS]. The non-vanishing of L-functions at the center of the critical strip often seems to have arithmetic meaning as is seen by the Birch and Swinnerton-Dyer conjectures or more generally the conjectures of Deligne and Beillinson. To cite another instance, the famous theorem of Waldspurger shows that given a cuspidal automorphic representation π of P GL2 (AF ), the corresponding representation under the Howe correspondence is an automorphic representation of the metaplectic cover of SL2 (AF ) if and only if there is a quadratic character χ such that L(1/2, π⊗χ) = 0. (See [PS].) In some cases, the non-vanishing result of the L-function twisted by a Hecke character, not necessarily quadratic, is already enough for certain arithmetical applications as in the work of Ash and Ginzburg [AG]. The methods are equally adaptable for average values of quadratic twists of L-functions evaluated not at the center of the critical strip but at other points of the complex plane. One may also consider other twists and get analogous nonvanishing theorems. For instance, Barthel and Ramakrishnan [BR] have proved that given any irreducible, unitary, cuspidal automorphic representation π of GLr over a ﬁeld F , and any complex number s0 with Re(s0 ) ∈ / (1/(2r − 2), 1 − (1/(2r −

190

Chapter 8 Suggestions for Further Reading

2))), there are inﬁnitely many ray class characters χ of F such that L(s0 , π⊗χ) = 0. Non-vanishing near Re(s) = 1 is the subject of investigation in [HR]. Non-vanishing at s = 1/2 would have consequences for the construction of p-adic L-functions associated to cuspidal automorphic representation of GL2r as in the work of Ash and Ginzburg [AG]. Rohrlich [R1], proved the following non-vanishing theorem on GL2 . Let π be an irreducible cuspidal automorphic representation of GL2 over any number ﬁeld F and let s0 be a complex number. Then, there are inﬁnitely many ray class characters of F (of ﬁnite order) such that L(s0 , π ⊗ χ) = 0. Some applications are given in [R2]. Friedberg and Hoﬀstein [FH] give necessary and suﬃcient conditions for the existence of a quadratic ray class character with this property. There are other related results such as the recent work of Luo, Rudnick and Sarnak [LRS] on the Selberg eigenvalue conjecture. This again is an extension of the methods outlined in the previous chapters. Recent work of B¨ocherer, Furusawa and Schulze-Pillot [BFS] raises the question of the simultaneous non-vanishing of quadratic twists of two Hecke eigenforms. There is the problem of Merel [Me] which asks for non-vanishing at s = 1/2 of the L-functions of the twists of a given eigenform by even Dirichlet characters. Such a result will have applications in determining good upper bounds for torsion of elliptic curves over cyclotomic ﬁelds.

References [AG] A. Ash and D. Ginzburg, p-adic L-functions for GL(2n), Invent. Math., 116(1994), 27–73. [BFS] S. B¨ ocherer, M. Furusawa and R. Schulze-Pillot, On Whittaker coeﬃcients of some metaplectic forms, Duke Math. J., 76(1994), 761–772. [BR] L. Barthel and D. Ramakrishnan, A nonvanishing result for twists of Lfunctions of GL(n), Duke Math. J., 74(1994), 681–700. [FH] S. Friedberg and J. Hoﬀstein, Non-vanishing theorems for automorphic Lfunctions on GL(2), Annals of Math., 142 (1995) pp. 385–423. [GV] D. Goldfeld and C. Viola, Mean values of L-functions associated to elliptic, Fermat, and other curves at the center of the critical strip, Journal of Number Theory, 11 (1979) pp. 305–320. [HR] J. Hoﬀstein and D. Ramakrishnan, Siegel zeros and cusp forms, Int. Math. Res. Not., 1995, pp. 279–308. [Iw] H. Iwaniec, On the order of vanishing of modular L-functions at the critical point, S´eminaire de Th´eorie des Nombres, Bordeaux, 2 (1990) pp. 365–376. [LRS] W. Luo, Z. Rudnick and P. Sarnak, On Selberg’s eigenvalue conjecture, Geom. and Func. Anal., 5(1995), 387–401.

References

191

[Me] L. Merel, private communication, 1995. [Mu] R. Murty, A motivated introduction to the Langlands program, in Advances in Number Theory, (eds. F. Gouvea and N. Yui), Oxford University Press, 1994. [MS] V. Kumar Murty and T. Stefanicki, Non-vanishing of quadratic twists of Lfunctions attached to automorphic representations of GL(2) over Q, preprint, 1994. [PP] A. Perelli and J. Pomykala, Averages of twisted L-functions, to appear in Acta Arithmetica. [PS] I. Piatetski-Shapiro, The work of Waldspurger, in Springer Lecture Notes, 1041 pp. 280–302. [R1] D. Rohrlich, Non-vanishing of L-functions for GL(2), Inventiones Math., 97 (1989) pp. 381–403. [R2] D. Rohrlich, Non-vanishing of L-functions and the structure of Mordell-Weil groups, J. reine angew. Math., 417 (1991) pp. 1–26. [St] T. Stefanicki, Non-vanishing of L-functions attached to automorphic representations of GL(2), Ph.D. Thesis, McGill University, 1992. [Z]

Y. Zhang, Some analytic properties of automorphic L-functions, Ph.D. Thesis, McGill University, 1994.

Name Index

Artin, E., 181 Ash, A., 190 Balasubramanian, R., 94, 95, 125, 127 Barban, M.B., 112 Barthel, L., 189 Bateman, P.T., 141, 154 B¨ocherer, S., 190 Bochner, S., 178, 180 Br¨ocker, T., 67 Cassels, J., 28 Chowla, S., 141, 154 Cohen, H., 80 Conrey, B., 178, 180 Davenport, H., 42, 117 Dieck, T., 67 Ellison, W., 15 Fainleib, A.S., 138 F´ejer, L., 23 Foote, R., 35, 39, 41 Frey G., 80 Friedberg, S., 140, 190 Fr¨ ohlich, A., 28, 29 Furusawa, M., 190 Ghosh, A., 178, 180 Ginzburg, D., 190 Goldfeld, D., 187 Graham, S., 112, 113

Heath-Brown, R., 95 Hildebrand, A., 96 Hoﬀman, J., 190 Hoﬀstein, J., 140, 190 Iwaniec, H., 176, 187 Jacquet, H., 184 Jutila, M., 97, 104, 131, 153 Kahane, Jean-Pierre, 9 Katz, N., 53 Knapp A., 80 Lagarias, J., 42, 44, 48, 54, 61 Lang, S., 92 Luo, W., 190 Merel, L., 190 Montgomery, H., 44, 54, 72 Montgomery, H.L., 97, 154 Murty, M. Ram, 35, 46, 85, 95, 128, 134, 136, 137, 140, 147, 153, 159, 179, 185, 189 Murty, V. Kumar, 9, 35, 39, 46, 48, 77, 84, 95, 125, 127, 134, 136, 137, 140, 147, 153, 159, 185, 189 Odlyzko, A. M., 42, 44, 48, 54, 61 Oesterl´e, J., 80 Ogg, A., 82, 84

M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1, © Springer Basel AG 1997

192

Name Index

Perelli, A., 187 Piatetski-Shapiro, I., 189 Polya-Vinogradov, 95 Pomykala, J., 187 Ramakrishnan, D., 189, 190 Rankin, R., 90, 137, 138 Rhoades, S., 35, 61 Rohrlich, D., 134, 190 Rudin, W., 11 Rudnick, Z., 190 Saparnijazov, O., 138 Saradha, N., 46 Sarnak, P., 190 Scherk, J., 46 Schulze-Pillot, R., 190 Selberg, A., 177, 178, 180 Serre, J.-P., 26, 42, 61, 65, 68, 83 Shahidi, F., 90

Shalika, J.A., 184 Shimura, G., 80, 134 Siegel, C. L., 95 Stark, H. M., 35, 37, 38 Stefanicki, T., 140, 187, 189 Uchida, K., 32 van der Waall, R. W., 32 Vaughan, R.C., 97, 154 Vehov, P.P., 112 Vign´eras, M.F., 179 Vinogradov ,I.M., 98, 100 Viola, C., 187 Waldspurger, J., 134 Wales, D., 41 Zhang, Y., 189

193

Subject Index

L-function formalism, 35 algebraic number ﬁeld, 19 approximate functional equation, 93 Aramata-Brauer theorem, 30, 35, 40, 183 Archimedean Euler factors, 28 Artin conductor, 28, 44 Artin’s conjecture, 29, 46, 50, 52, 179, 181, 183 Artin’s reciprocity theorem, 29 Artin’s theorem, 36 Atkin-Lehner involution, 81 averages of higher derivatives, 175 Barban-Vehov weights, 110, 112 Birch and Swinnerton-Dyer conjectures, 189 Borel subgroup, 59 Brauer induction theorem, 29 Brauer’s induction theorem, 182 Cartan subgroup, 59, 60 character sums, 132 Chebotarev density theorem, 2, 42, 52, 68, 182 Chebycheﬀ polynomials, 88 class function, 25 class number formula, 16 classiﬁcation of primitive functions, 184 Cliﬀord’s theorem, 40

CM-type, 83 combinatorial identities, 88 compact groups, 65, 67–69 compact Riemann surface, 77, 81 congruence subgroup, 76 conjugacy class, 27 cusp forms, 77, 78, 133 cuspidal automorphic representations, 189, 190 decomposition group, 27 Dedekind’s conjecture, 30, 32, 183 Dedekind’s zeta function, 19, 21, 37, 39, 42, 52, 63, 185 Deligne’s Prime Number Theorem, 68 dimension, 179 Dirichlet polynomial, 2, 116 Dirichlet series with positive coeﬃcients, 87 discriminant, 44 eigenform, 82 Eisenstein series, 78 elliptic curve, 1, 53, 58 elliptic curves over cyclotomic ﬁelds, 190 equicontinuity, 65 equidistribution, 1, 65, 66 Erd˝ os-Tur´ an inequality, 71 estimate of Rankin-Shahidi, 138

194

Subject Index

Euler product, 16, 18, 19, 21, 177 explicit formula method, 128 factorization into primitives, 179 F´ejer Kernel, 12 Fourier inversion, 11 Frobenius element, 27 Frobenius reciprocity, 25, 30, 31, 36, 43 functional equation, 6, 177 fundamental discriminant, 100, 104, 105, 130, 134, 165, 173, 187 Galois module structure, 29 generalized ideal, 22 Goldfeld-Viola conjecture, 189 Haar measures, 65, 67, 72, 84 Hadamard factorization, 38 Hadamard’s proof, 9 Hecke operators, 82 Hecke subgroup, 76 Hecke’s L-functions, 21 Hecke’s theorem, 81 Hensel’s estimate, 44, 46, 49 Hensel’s inequality, 60 higher ramiﬁcation groups, 28 Hilbert class ﬁeld, 55 Howe correspondence, 189 hyperbolic, 76

195

Langlands program, 84 large sieve inequality, 117, 175 least prime in a conjugacy class, 52 line integral, 6 Mackey’s theorems, 26 mean-value estimate of Jutila, 131 metaplectic Eisenstein series, 140 method of averages, 2 minimal normal subgroup, 34 modular L-functions, 2 modular curve, 77 modular elliptic curve, 3 modular forms, 1, 77 molliﬁer polynomial, 116 monomial characters, 33, 61 newforms, 83 non-abelian L-functions, 181 normalized eigenfunctions, 82 oldforms, 83 omega theorem, 88 oscillation of Fourier coeﬃcients, 2, 75, 84

Jutila’s character sum estimate, 97

parabolic, 76 Parseval’s formula, 12 Peter-Weyl Theorem, 66 Petersson inner product, 83 Polya-Vinogradov estimate, 95, 97, 129, 130, 139 Polya-Vinogradov inequality, 97, 137 positive proportion, 127 prime ideal theorem, 20 prime number theorem, 2, 5, 6 primes in arithmetic progression, 15 primitive function, 178 principal congruence subgroup, 75

kernel function, 53

quadratic twists, 133

ideal class group, 21 ideal classes, 21 inductive property of L-functions, 43 inertia group, 27 inner product, 78 integrated Polya-Vinogradov estimate, 141

196

Subject Index

Ramanujan conjecture, 75, 83, 184 Ramanujan’s cusp form, 79 Ramanujan-Petersson conjecture, 84 Rankin’s estimate, 166 Rankin’s theorem, 90 Rankin-Selberg convolution, 140 ray class characters, 190 ray class group, 21, 22 real character sums, 152 regular representation, 26 relative discriminant, 44 Riemann hypothesis, 11, 95, 97, 128, 129 Riemann zeta function, 177, 183 Riemann-Lebesgue lemma, 12, 13 Sato-Tate conjecture, 2, 83, 84, 91 Selberg eigenvalue conjecture, 183, 190 Selberg’s class, 177 Selberg’s conjectures, 3, 177 semidirect product, 34 smooth approximation, 98 smoothing operator, 163

Stirling’s formula, 118 supersolvable group, 61 Tate’s thesis, 22 Tauberian theorem, 11, 19, 88 Tauberian theory, 2 theorem of Brauer, 29 trigonometric identities, 86 trigonometric inequality, 2 trigonometric lemma, 69 uniform distribution, 2 unique factorization of L-functions, 182 upper bounds for torsion, 190 upper half-plane, 76 Vinogradov’s lemma, 98, 100 weighted sums, 137, 158, 187 Weil’s conjectures, 75 Weil’s criterion, 67 Wiener-Ikehara Tauberian theorem, 7, 8, 68