This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
2, p,re, and we let nb nz, n3, ... be the quadratic non-residues. Denote by Vb Vz, ... the solutions to
n
+ py == nz,...
(mode).
If y == Vi (mod e), then py + n is a quadratic non-residue mod e, and is therefore not a square. We may therefore discard those y == Vi (mode). We may further discard more values of y by choosing different values of e until the number of trials is small enough not to be troublesome. X Z == 73 (mod 127). We try to solve x 2 = 127y + 73 where 1 ::;;; y ::;;; 31. We take e = 3, ni = 2. From 73 + 127y == 2 (mod 3), that is y == 1 (mod 3), we see that the remaining values for y are:
Example. Solve
2,3,5,6,8,9,11,12,14,15, 17, 18,20,21,23,24,26,27,29,30. We next take e = 5, ni = 2, n2 = 3. From 127y + 73 == 2,3 (mod5), we have VI == 2, V2 == 0 (mod 5) and so the remaining values for yare now 3,6,8,9,11,14,18,21,23,24,26,29. We next take e = 7, ni = 3, n2 = 5, n3 = 6. From the congruences 127y + 73 == 3,5,6 (mod 7), or y + 3 == 3, 5, 6 (mod 7) we have y == 0,2, 3 (mod 7), so that we are left with only the six values 6,8,11,18,26,29 for the trials. In fact 73 the solutions.
+8 x
127 = 1089 = 33 2 so that x == ± 33 (mod 127) are
Note. In this method, having taken e and e', there is no need to take ee'. Again, having taken an odd e, there is no need to take 2e.
All we discuss here is related to the work of Gauss. We see therefore that this "Prince of mathematics" is not only a theoretician, but also an expert problem solver.
44
3. Quadratic Residues
3.5 The Number of Roots of a Quadratic Congruence Theorem 5.1. Let I> 0, p,tn. If p > 2, then the congruence X2 = n (mod pI) has 1 + (~) solutions. If p = 2, then we have the following three cases. 1) 1= 1. There is one root. 2) I = 2. There are two or no roots depending on whether n = 1 or 3 (mod 4). 3) I > 2. There are four or no roots depending on whether n = 1 or n =1= 1 (mod 8). Proof We first discuss the three cases associated with p = 2. 1) This is trivial. 2) The congruence X2 = 1 (mod 4) has the solutions ± 1 (mod 4) and the congruence X2 = 3 (mod 4) has no solution. 3) If X2 = n (mod 2') is soluble, then x must be odd, say 2k + 1. Since . k(k + 1) (2k+l)2=4k(k+l)+1=8' 2 +1=1
(mod 8),
it follows that the congruence is not soluble if n =1= 1 (mod 8). Suppose now that n = 1 (mod 8). When I = 3, there are clearly the four roots 1,3,5,7. We now proceed by induction on I. Let a satisfy a 2 = n (mod 2' - 1 ). Then
We take b = (n - a 2)j2'-l. Then a + 2' - 2b is a solution with respect to mod2'. Therefore a solution to X2 = n (mod 2') certainly exists. Let Xl be a solution, and let X2 be any solution. Then x~ = (Xl - X2)(Xl + X2) = 0 (mod2'), and since Xl - X2, Xl + X2 are both even it follows that t(Xl - X2) . t(Xl + X2) = 0 (mod 2' - 2 ). But t(Xl - X2) and t(Xl + X2) must be of opposite parity, since otherwise their sum Xl cannot be odd. Therefore we have either Xl = X2 (mod2 ' - l ) or Xl = - X2 (mod2 ' - l ), and this means that X2 = ± Xl + k2 ' - l (k = 0 or 1). Hence there are at most four solutions to X2 = n (mod 2'). Since ± Xl> ± Xl + 2' - 1 are actually incongruent solutions we see that the congruence has exactly four solutions. When p > 2 and I = I, the result is trivial, and the remaining part of the theorem follows from Theorem 2.9.3. D
xi -
From the results of Chapter 2 we can determine the number of solutions to a quadratic congruence to any integer modulus m.
3.6 Jacobi's Symbol Throughout this section m denotes a positive odd integer. Definition. Let the standard factorization of m be PI ... Pt, where the Pr may be repeated. If (n,m) = 1 then we define the Jacobi's symbol by
45
3.6 Jacobi's Symbol
(-mn) =0 G) t
r=1
Examples.
r
(~) = 1. If (a,m) = 1, then (~) = 1.
Note: If (:) = 1, it does not follow that x 2
=n (modm) is soluble.
Theorem 6.1. Let m and m' be positive odd integers. (i) If n
= n'
(modm) and
(n, m) = 1, then (:) = (:). (ii) If(n, m) = (n, m') = 1, then (:) (;,) = (m:'). (iii) If(n,m) = (n',m) = 1, then (:)(:) = (:'). Theorem 6.2. (
-:n 1)
D
= ( - l)t(m-l).
Proof. It suffices to prove that t
t
L
Pi - 1 2
i=1
=
OPi- 1 (mod 2),
i=1
2
which certainly holds when t = I. Given any two odd integers u, v we always have
u - 1 v-I -2- + -2-
=-uv2- -1
(mod 2)
(or (u - 1)(v - I)
=0 (mod4)).
It follows by induction that t
Pi -
1
t-l Pi -
1
Pt -
1
L=i= L - +2i= 1 2 1 2 t-l
o
_
Theorem 6.3.
i=1
Pi -
1
1
0 Pi -
= 222 +~
1
i=1
(mod 2).
(~) = (_ 1)~mL1).
Proof. This is similar to the above, except that we replace (I) by U2 V 2 -
8
1
u2 - 1 v2 - 1 = - 8 - + - 8 - (mod 2).
D
Theorem 6.4. Let m, n be coprime positive odd integers. Then ( -m)(n) n m
~.'!!..::..! 2.
= ( - 1) 2
D
(1)
46
3. Quadratic Residues
Proof Let m = TIp, n =
=
TI q. Then n-lm-l
p-lq-l
TITI (- 1)-2--2- = (- 1)-2--2p
where we have used (1).
q
0
In using the Legendre's symbol we must always ensure that the denominator is a prime. In using Jacobi's symbol however, we can avoid the factorization process. For example:
383) (443) ( 60 ) ( 22 ) ( 15 ) ( 15 ) ( 443 = - 383 = - 383 = - 383 383 = - 383 8 8 = C1 :) = ( 5) = (25) = 1. If we delete the condition that m, m' are positive in Theorem 6.4, then we have:
Theorem 6.5. Let m, n be coprime odd integers.
(Imln)(m) jnf = - ( -
Otherwise, the required value is ( -
If m, n are both negative, then m-ln-l
1)-2--2-.
l)t<m-l).!
Example. Determine the solubility of x 2
=-
0
286 (mod 4272943).
Here p = 4272943 is a prime, and we have 10 evaluate (-:86). Since
(~1) = _ 1, (~) = 1, we have (-:86) = We now determine
C;3)
(~1 )(~)C;3) = _ C;3).
as follows: We have
4272943 = 29880 x 143 + 103*, 143 = 2 x 103 - 63, 103 = 2 x 63 - 23, 63 = 2 x 23 + 17*, 23 = 2 x 17 17 = 2 x
11,
11 - 5*,
11=2x5+1
47
3.7 Two Terms Congruences
where each step with a * denotes a change of sign. Therefore
p
( 143) = (-I? =-1. Thus ( as
;86) = I,
and the congruence is soluble. Gauss determined the solutions
± 1493445.
3.7 Two Terms Congruences Let p be prime. We now discuss the congruence
Xk
== n (modp).
Theorem 7.1. The congruence Xk
== I (modp)
(I)
has (k,p - I) roots. Proof I) Let d = (k,p - I) and let s, t be integers such that sk + t(p - I) then have :x;
=
== I (modp),
d. We
(2)
and conversely. 2) It suffices to prove that (2) has d roots. From Theorem 2.9.1 the number of roots for (2) certainly cannot exceed d. Also, there are p - I roots to xP- 1 == I (modp). Again, by Theorem 2.9.1 the number of roots for
xp -
1 -
I
p-l
- . , - - - = (Xd)-d-- 1
:x;
+ ... + x d + I == 0
(modp)
does not exceed p - I - d, so that the number of roots for (2) must be at least d. The theorem is proved. 0 Theorem 7.2. Either the congruence (k,p - I) solutions.
Xk
== n (modp), p,rn has no solution or it has
Proof If Xo is a solution, then (X;;-l X)k == follows from Theorem 7.1. 0
XkX;;-k
== I (modp). The required result
Theorem 7.3. If x runs over a reduced set of residues mod p, then (p - I)/(k,p - I) different values.
Xk
take
Proof From Theorem 7.2 we see that there are (k,p - I) distinct residues whose k-th power have the same residue modp. The p - I residues are now partitioned into (p - I)/(k,p - I) classes, and there is a one-to-one correspondence. 0
48
3. Quadratic Residues
Definition. Let h be an integer, and (h, n) = 1. The least positive integer I such that h' == I (modn) is called the order of h (modn). Theorem 7.4.
If hm == I (modn), then 11m.
Proof Suppose the contrary. Then there are integers q, r such that m = ql + r, 0< r < I. Now hr == hm(h,)-q == I (modn) contradicts with the definition of I. 0 Theorem 7.5. Let lip - I, and denote by ({)(/) the number ofincongruent integers with order I. Then ({)(I) is the Euler's function.
Proof We first establish certain properties of (()(I). I) If (110 12) = I, then ({)(l1/2) = ({)(lr)({)(l2). Let hI and h2 be integers with orders 11 and 12 respectively, and let Ibe the order of h 1h 2. From I == (h1h2)"2 == h';2 (modp), and Theorem 7.4 we see that / 11112. Since (110 12) = I, we have 111/, and similarly /211. Therefore I = 1112, that is· the order of h1h2 is 11/2. Thus, given any hI, h2 with orders 11,/2, we can construct h1h2 whose order is 11/2. We now prove that if we do not have hI == h~,h2 == h~ (modp), then h1h2 i= h~h~ (modp). For if h1h2 == h'lh~ (modp), then h 1h'1-1 == h~h21 (modp). But the order of h1h~-1 divides 11 and the order of h~h21 divides 12, so that h1h~-1 ==h~h21 == I (modp) which contradicts our assumption. Conversely, if h is an integer with order 11/2 where (/r,/2) = I, then hI = h'2, h2 = h" are integers with orders 110 12. Therefore ({)(lr)({)(/2) = (()(l1/2). 2) If q is prime, then ({)(qt) = qt - qt-1. The number of roots of xqt - 1 == 0 (mod p) is qt. If x satisfies this congruence and its order is not qt, then it must satisfy ~t-' _ I == 0 (modp). But the number of roots of this congruence is qt-1. Therefore ({)(qt) = qt _ qt-1. That (()(I) is Euler's function follows from the two properties in I) and 2). 0
3.8 Primitive Roots and Indices From Theorem 7.5 we see that there are ({)(p - I) incongruent numbers with order p - I (modp). Definition 1. A positive integer whose order is p - I is called a primitive root of p. Let g be a primitive root of p. Then gO, gl, . .. , gP- 2 are incongruent (modp). Definition 2. Corresponding to each integer n not divisible by p, there exists a such that
n == ga
(modp),
O~a
We call athe index ofn (modp) and we denote it by indg n or simply ind n. If b is such that n == gb (modp), then b == indn (modp - I).
49
3.9 The Structure of a Reduced Residue System
The function ind is similar to the logarithm function in that there are following properties: I) indnm == indm + indn (modp - l),p,rmn; 2) indn' == lindn (modp - 1),p,rn. Note: We do not define indn when pin; this is similar to not defining log O. Definition 3. Let p,rn. If the congruence
Y!' == n (modp)
(1)
is soluble, then we call n a k-th power residue mod p; otherwise we call n a k-th power non-residue. Theorem 8.1. A necessary and sufficient condition for n to be a k-th power residue modp is that (k,p - I) divides indn.
Proof Let a = indn and y = indx. Then (I) is equivalent to ky == a (modp - 1), and a necessary and sufficient condition for this to be soluble is that (k,p - 1) divides a. D "Base interchange formula". It is clear that the index depends on the primitive root chosen. Let gi be another primitive root and gi == gb (modp). Then n == g~ == (gb)a (modp) or
This is similar to the base interchange formula for the logarithm function. We list the least primitive roots for all the primes up to 5000 at the end of this chapter.
3.9 The Structure of a Reduced Residue System Let m be a natural number. We ask whether there exists
g
such that
gO, gi, g2, ... , g,,(m)-i (modm) form a reduced residue system. If g exists, then we
call it a primitive root of m. Theorem 9.1. A necessary and sufficient condition for m to have a primitive root is that m = 2,4,p' or 2p', where p is an odd prime.
Proof 1) Let the standard factorization of m be
Pi
From Euler's theorem, any integer a not divisible by Pi must satisfy
50
3. Quadratic Residues
Let I be the least common mUltiple of cp(it'), ... , cp(p!s) so that a l = I (modm). Therefore there can be no primitive root if 1< cp(m). If p > 2, then cp(pl) is even, so that m cannot have two distinct odd prime divisors. If m has a primitive root, then m must be of the form 21, i or 2cpl. If c ~ 2, then cp(2C) = 2C- 1 is also even, and so 2ci cannot have primitive roots. Therefore m must be of the form 21,pl or 2pl. 2) m = 21. If I = I, then I is a primitive root. If 1= 2, then 3 is a primitive root. Let I ~ 3. We prove by induction that for all odd a, we have a 2' - 2
=
I
(mod 21).
This is easy, since if then
Therefore there is no primitive root for m = 21 (/ > 2). 3) m = i. The case I = I has already been settled in §8. Let g be a primitive root of p. If gP-l - I =1= 0 (modp2), then we take r = g; if gP-l - I = 0 (modp2), then we take r = g + p. We then have
Therefore such an r is a primitive root of p2. Let rP -
1 -
I
=
kp, p,tk.
Since s~O,
we can prove as before that
Hence rpl - 2 (p-l)
= I + kp l-l
(mod pi) ,
I
~
2.
(1)
If the order of r is e, then el(p - I)pl-l = cp(i). Since r is a primitive root of p, we see that(p - 1)le. We deduce from (I) that e = cp(Pl); that is r is a primitive root ofi· 4) m = 2pl. We take g to be a primitive roqt of pl. If g is odd, then g is also a primitive root of 2pl; if g is even, then g + pi is a primitive root of 2pl. D Theorem 9.2. Let I > 2. Then the order of 5 with respect to the modulus 21 is 21- 2.
Proof We first prove that, for a
52a - 3
~
3,
= I + 2a -
1
(mod2 a ).
51
3.9 The Structure of a Reduced Residue System
This clearly holds when a 5 2a - 2 = (5 2a - 3 f
Therefore 521-3 (mod 21). D
=1=
= 3,
and we now proceed by induction. We have
== (1 + 2a- 1 + k2a)2 == 1 + 2a (mod2 a+ 1 ).
1 (mod 21) and 52' - 2 == 1 (mod 21). That is, the order of 5 is 21- 2
Theorem 9.3. Let I > 2. Then, given any odd a, there exists b such that a-I
a == ( - 1)-2-5 b
(mod 21),
b
~
0.
Proof If a == 1 (mod 4), then by Theorem 9.2, 5b (0::;;; b < 21- 2) gives 21- 2 distinct numbers mod 21; moreover they are all congruent 1 (mod 4). Therefore there must be an integer b such that a == 5b (mod 21). If a == 3 (mod 4), then - a == 1 (mod 4), and the required result follows from the above. D
Theorem 9.4. Let m = 21 . pili . .. p~s (standard factorization) with I ~ 0, 11 > 0, ... , Is > 0. We define (j to be or 1 or 2 according to whether 1= 0, 1 or 1= 2 or I > 2
°
respectively. Then the reduced residue system ofm can be represented by the products of s + (j numbers. Proof 1) Suppose that m = m'm", (m', m") = 1. Let ar, .. . , aq>(m') be a reduced residue system mod m', and that ai == 1 (modm") (this is always possible). Let br, ... , bq>(m") be a reduced residue system mod m" and that bj == 1 (modm'). Then aibj represen t a reduced residue system mod mm', and its num ber is q>( m'm"). Also, if aibj == asb t (modm'm"), then ai == as (modm'), bj == b t (modm"). 2) From Theorems 9.1 and 9.3 we know that the reduced residue system modm, where m = pi (p > 2), is the product of a single number. If m = 21 where I > 1, then the reduced residue system is the product of (j numbers. Combining this with 1), the theorem is proved. D
This theorem points out an important principle. In group theory this result is known as the Fundamental Theorem of Abelian groups. Exercise. Prove that if k < p, n, = kp2
+ 1 and
2n -
1
== 1 (modn),
then n is a prime number. Hints: (i) First prove.that n has a prime divisor congruent 1 (modp). Let dbe the least positive integer such that 2d == 1 (mod n). Deduce that d,tk, din - 1 and pld. Then obtain the conclusion from pldlq>(n). (ii) Deduce from n = kp2 + 1 = (up + 1)(vp + 1) that n cannot be composite. Note: Taking p = 2127 - 1, k = 180, Miller and Wheeler proved, with the aid of a computer, that 180(2127 - 1)2 + 1 is prime. (Nature 168 (1951),838).
52
3. Quadratic Residues
The least primitive roots for primes less than 5000. An asterisk indicates that lOis a primitive root. p
p-1
g
p
p-1
g
p
p-1
g
3 5 7* 11 13 17* 19* 23* 29* 31 37 41 43 47* 53 59* 61* 67 71 73 79 83 89 97* 101 103 107 109* 113* 127 131* 137 139 149* 151 157 163 167* 173 179* 181* 191 193* 197 199 211 223* 227 229* 233* 239
2 22 2·3 2·5 22.3 24 2.3 2 2·11 22.7 2·3·5 22.3 2 23 .5 2·3·7 2·23 22.13 2·29 22.3.5 2·3·11 2·5·7 23 .3 2 2·3·13 2·41 23 .11 25 .3 22.5 2 2·3·17 2·53 22.3 3 24 .7 2.3 2.7 2·5·13 23 ·17 2·3·23 22.37 2.3.5 2 22.3.13 2.3 4 2·83 22.43 2·89 22.3 2.5 2·5·19 26 .3 22.7 2 2.3 2·11 2·3·5·7 2·3·37 2·113 22.3.19 23 ·29 2·7·17
2 2 3 2 2 3 2 5 2 3 2 6 3 5 2 2 2 2 7 5 3 2 3 5 2 5 2 6 3 3 2 3 2 2 6 5 2 5 2 2 2 19 5 2 3 2 3 2 6 3 7
241 251 257* 263* 269* 271 277 281 283 293 307 311 313* 317 331 337* 347 349 353 359 367* 373 379* 383* 389* 397 401 409 419* 421 431 433* 439 443 449 457 461* 463 467 479 487* 491* 499* 503* 509* 521 523 541* 547 557 563
24 .3.5 2.5 3 23 2·131 22 ·67 2.3 3 .5 22.3.23 23 .5.7 2·3·47 22.73 "2.3 2.17 2·5·31 23 .3.13 22.79 2·3·5·11 24.3.7 2·173 22.3.29 25 ·11 2·179 2·3·61 22.3.31 2.3 3 .7 2·191 22.97 22.3 2·11 24.5 2 23 .3.17 2·11·19 22.3.5.7 2·5·43 24.3 3 2·3·73 2·13·17 26 .7 23 .3.19 22.5.23 2·3·7·11 2·233 2·239 2.3 5 2.5.7 2 2·3·83 2·251 22 ·127 22.5.13 2.3 2.29 22.3 3 .5 2·3·7·13 22 ·139 2·281
7 6 3 5 2 6 5 3 3 2 5 17 10 2 3 10 2 2 3 7 6 2 2 5 2 5 3 21 2 2 7 5 15 2 3 13 2 3 2 13 3 2 7 5 2 3 2 2 2 2 2
569 571* 577* 587 593* 599 601 607 613 617 619* 631 641 643 647* 653 659* 661 673 677 683 691 701* 709* 719 727* 733 739 743* 751 757 761 769 773 787 797 809 811* 821* 823* 827 829 839 853 857* 859 863* 877 881 883 887*
23 .71 2·3·5·19 26 .3 2 2·293 24 .37 2·13 ·23 23 .3.5 2 2·3·101 22.3 2·17 23 .7.11 2·3·103 2.3 2.5.7 27 .5 2·3·107 2·17·19 22 ·163 2·7·47 22.3.5.11 25 .3.7 22.13 2 2·11·31 2·3·5·23 22.5 2·7 22.3.59 2·359 2.3.11 2 22.3.61 2.3 2.41 2·7·53 2.3.5 3 22.3 3 .7 22.5.19 28 .3 22 ·193 2·3·131 22 ·199 23 ·101 2.3 4.5 22.5.41 2·3·137 2·7·59 22.3 2.23 2·419 22.3.71 23 ·107 2·3·11·13 2·431 22.3.73 24.5.11 2.3 2.72 2·443
3 3 5 2 3 7 7 3 2 3 2 -3
:,
11 5 2 2 2 5 2 5 3 2 2 11 5 6 3 5 3 2 6 11 2 2 2 3 3 2 3 2 2 11 2 3 2 5 2 3 2 5
53
3.9 The Structure of a Reduced Residue System
p
p-1
g
p
p-1
g
p
p-1
g
907 911 919 929 937* 941* 947 953* 967 971* 977* 983* 991 997 1009 1013 1019* 1021* 1031 1033* 1039 1049 1051* 1061 1063* 1069* 1087* 1091* 1093 1097* 1103* 1109* 1117 1123 1129 1151 1153* 1163 1171* 1181* 1187 1193* 1201 1213* 1217* 1223* 1229* 1231 1237 1249 1259* 1277 1279
2·3·151 2'5'7·13 2'3 3 '17 25 ·29 23 '3 2'13 22'5.47 2'11·43 23 '7'17 2·3·7·23 2·5'97 24 ,61 2·491 2.3 2-5-11 22'3'83 24 '3 2'7 22 ·11·23 2'509 22. 3· 5 ·17 2'5·103 23 .3.43 2·3·173 23 ·131 2'3'5 2'7 22'5'53 2'3 2'59 22'3'89 2'3·181 2'5·109 22'3'7'13 23 ·137 2'19·29 22 ·277 22'3 2'31 2·3·11·17 23 '3'47 2'5 2'23 27 .3 2 2'7·83 2'3 2'5'13 22'5'59 2·593 22 ·149 24 '3'5 2 22.3'101 26 ·19 2·13·47 22,307 2·3·5·41 22'3'103 25 .3.13 2·17'37 22'11.29 2.3 2'71
2 17 7 3 5 2 2 3 5 6 3 5 6 7 11 3 2 10 14 5 3 3 7 2 3 6 3 2 5 3 5 2 2 2 11 17 5 5 2 7 2 3 11 2 3 5 2 3 2 7 2 2 3
1283 1289 1291* 1297* 1301* 1303* 1307 1319 1321 1327* 1361 1367* 1373 1381* 1399 1409 1423 1427 1429* 1433* 1439 1447* 1451 1453 1459 1471 1481 1483 1487* 1489 1493 1499 1511 1523 1531* 1543* 1549* 1553* 1559 1567* 1571* 1579* 1583* 1597 1601 1607* 1609 1613 1619* 1621* 1627 1637 1657
2·641 23 '7.23 2'3'5'43 24 '3 4 22. 52 ·13 2·3·7·31 2·653 2·659 23 .3.5.11 2'3·13·17 24 '5'17 2·683 22'7 3 22'3'5'23 2·3·233 27'11 2'3 2'79 2·23'31 22. 3· 7 ·17 23 ·179 2'719 2'3'241 2'5 2.29 22. 3.11 2 2'3 6 2.3.5'7 2 23 '5'37 2·3·13·19 2'743 24 .3'31 22'373 2'7·107 2·5·151 2'761 2.3 2. 5 ·17 2·3·257 22'3 2'43 24 '97 2·19·41 2'3 3 '29 2·5·157 2'3'263 2·7·113 22. 3· 7 ·19 26 ,5 2 2 ·11· 73 23 '3'67 22'13'31 2·809 22'3 4 '5 2·3·271 22,409 23 ,3 2 ,23
2 6 2 10 2 6 2 13 13 3 3 5 2 2 13 3 3 2 6 3 7 3 2 2 5 6 3 2 5 14 2 2 11 2 2 5 2 3 19 3 2 3 5 11 3 5 7 3 2 2 3 2 11
1663* 1667 1669 1693 1697* 1699 1709* 1721 1723 1733 1741* 1747 1753 1759 1777* 1783* 1787 1789* 1801 1811* 1823* 1831 1847* 1861* 1867 1871 1873* 1877 1879 1889 1901 1907 1913* 1931 1933 1949* 1951 1973 1979* 1987 1993* 1997 1999 2003 2011 2017* 2027 2029* 2039 2053 2063* 2069* 2081
2·3·277 2'7 2'17 22'3'139 22'3 2'47 25 '53 2·3·283 22'7'61 23 '5'43 2·3'7·41 22'433 22'3'5'29 2· 32·97 22'3'73 2'3'293 24 '3'37 2.3 4 '11 2·19'47 22'3'149 23 '3 2.5 2 2·5·181 2·911 2'3·5·61 2·13'71 22'3'5'31 2'3'311 2'5·11'17 24 '3 2'13 22'7.67 2·3'313 25 '59 22. 32'19 2'953 23 ·239 2·5·193 22'3'7'23 22 ·487 2'3'5 2'13 22'17'29 2·23·43 2·3·331 22'3'83 22'499 2.3 3 .37 2·7·11·13 2·3·5·67 25 .3 2.7 2 ·1013 22'3'13 2 2·1019 22.3 3 '19 2·1031 22 ·11·47 25 .5'13
3 2 2 2 3 3 3 3 3 2 2 2 7 6 5 10 2 6 11 6 5 3 5 2 2 14 10 2 6 3 2 2 3 2 5 2 3 2 2 2 5 2 3 5 3 5 2 2 7 2 5 2 3
54
3. Quadratic Residues
p
p-I
g
p
p-I
g
p
p-I
g
2083 2087 2089 2099* 211l 21l3* 2129 2131 2137* 2141* 2143* 2153* 2161 2179* 2203 2207* 2213 2221* 2237 2239 2243 2251* 2267 2269* 2273* 2281 2287 2293 2297* 2309* 2311 2333 2339* 2341* 2347 2351 2357 2371* 2377 2381 2383* 2389* 2393 2399 2411* 2417* 2423* 2437* 2441 2447* 2459* 2467 2473*
2·3·347 2·7·149 23 .3 2.29 2·1049 2·5·21l 26 .3 ·Il 24.7.19 2·3·5·71 23 .3.89 22.5.107 2.3 2.7.17 23 ·269 24.3 3 .5 2.3 2.1l 2 2·3·367 2·1l03 22.7.79 22.3.5.37 22.13.43 2·3·373 2·19· 59 2.3 2.5 3 2·11·103 22.34.7 25 ·71 23 .3.5.19 2.3 2.127 22.3.191 23 .7.41 22.577 2·3·5·7·1l 22. II· 53 2·7·167 22.3 2.5.13 2·3·17·23 2.5 2 .47 22 .19.31 2·3·5·79 23 .3 3 ·Il 22.5.7.17 2·3·397 22.3.199 23 .13.23 2· 11·109 2·5·241 24 ·151 2·7·173 22 .3.7.29 23 .5.61 2 ·1223 2 ·1229 2.3 2.137 23 .3.103
2 5 7 2 7 5 3 2 10 2 3 3 23 7 5 5 2 2 2 3 2 7 2 2 3 7 19 2 5 2 3 2 2 7 3 13 2 2 5 3 5 2 3 II 6 3 5 2 6 5 2 2 5
2477 2503 2521 2531 2539* 2543* 2549* 2551 2557 2579* 2591 2593* 2609 2617* 2621* 2633* 2647 2657* 2659 2663* 2671 2677 2683 2687* 2689 2693 2699* 2707 271l 2713* 2719 2729* 2731 2741* 2749 2753* 2767* 2777* 2789* 2791 2797 2801 2803 2819* 2833* 2837 2843 2851* 2857 2861* 2879 2887 2897*
22.619 2.3 2.139 23 .3 2.5.7 2·5·1l·23 2.3 3 .47 2·31·41 4.7 2.13 2.3.5 3 .17 22.3 2.71 2·1289 2·5·7·37 25 .3 4 24.163 23 .3.109 22.5.131 23 .7.47 2.3 3 .7 2 25 .83 2·3·443 2·1l 3 2·3·5·89 22.3.223 2.3 2.149 2·17·79 27 .3.7 22.673 2·19·71 2·3·1l·41 2·5·271 23 .3.113 2.3 2.151 23 .11.31 2·3·5·7·13 22.5.137 22.3.229 26 .43 2·3·461 23 .347 22.17.41 2.3 2.5.31 22.3.233 24.5 2.7 2·3·467 2 ·1409 24.3.59 22 ·709 2.7 2.29 2.3.5 2.19 23 .3.7.17 22.5. II· 13 2 ·1439 2·3·13·37 24.181
2 3 17 2 2 5 2 6 2 2 7 7 3 5 2 3 3 3 2 5 7 2 2 5 19 2 2 2 7 5 .3 3 3 2 6 3 3 3 2 6 2 3 2 2 5 2 2 2 II 2 7 5 3
2903* 2909* 2917 2927* 2939* 2953 2957 2963 2969 2971* 2999 3001 301l* 3019* 3023* 3037 3041 3049 3061 3067 3079 3083 3089 3109 3119 3121 3137* 3163 3167* 3169 3181 3187 3191 3203 3209 3217 3221* 3229 3251* 3253 3257* 3259* 3271 3299*' 3301* 3307 3313* 3319 3323 3329 3331* 3343* 3347
2·1451 22.727 22.3 6 2·7·1l·19 2·13·1l3 23 .3 3 .41 22.739 2 ·1481 23 .7.53 2.3 3 .5 ·Il 2 ·1499 23 .3.5 3 2·5·7·43 2·3·503 2 ·151l 22·3·1l·23 25 .5.19 23 .3.127 22.3 2.5.17 2·3·7·73 2.3 4.19 2·23·67 24 ·193 22.3.7.37 2·1559 24.3.5.13 26 .7 2 2·3·17·31 2·1583 22·3 2.1l 22.3.5.53 2.3 3 .59 2·5·1l·29 2·1601 23 .401 24.3.67 22.5.7.23 22.3.269 2.5 3 .13 22.3.271 23 .11.37 2·3·181 2·3·5·109 2·17·97 22.3. 52. II 2·3·19·29 24.3 2.23 2·3·7·79 2·1l·151 28 .13 2.3 2 .5.37 2·3·557 2·7·239
5 2 5 5 2 13 2 2 3 10 17 14 2 2 5 2 3 II 6 2 6 2 3 6 7 7 3 3 5 7 7 2 II 2 3 5 10 6 6 2 3 3 3 2 6 2 10 6 2 3 3 5 2
55
3.9 The Structure of a Reduced Residue System
p
p-1
g
P
p-1
g
3359 3361 3371* 3373 3389* 3391 3407* 3413 3433* 3449 3457 3461* 3463* 3467 3469* 3491 3499 3511 3517 3527* 3529 3533 3539* 3541 3547 3557 3559 3571* 3581* 3583 3593* 3607* 3613 3617* 3623* 3631 3637 3643 3659* 3671 3673* 3677 3691 3697 3701* 3709* 3719 3727* 3733 3739 3761 3767 3769
2·23·73 25 '3.5.7 2·5·337 22.3.281 22. 7 .11 2 2·3·5·113 2·13·131 22'853 23 • 3 ·11·13 23 ·431 27 ,3 3 22'5'173 2·3·577 2·1733 22. 3 .17 2 2'5'349 2·3·11·53 2'3 3 '5'13 22'3'293 2·41·43 23 ,3 2,7 2 22 ·883 2·29·61 22'3.5'59 2'3 2.197 22.7.127 2·3'593 2·3·5'7·17 22.5.179 2.3 2.199 23 '449 2·3·601 22'3'7'43 25 '113 2'1811 2.3'5.11 2 22. 32. 101 2·3·607 2·31'59 2·5'367 23 '3 3 '17 22'919 2'3 2'5'41 24 .3'7.11 22'5 2'37 22 • 32·103 2.11.13 2 2· 34 ·23 22'3'311 2·3·7·89 24 '5'47 2'7·269 23 '3'157
11 22 2 5 3 3 5 2 5 3 7 2 3 2 2 2 2 7 "2 5 17 2 2 7 2 2 3 2 2 3 3 5 2 3 5 21 2 2 2 13 5 2 2 5 2 2 7 3 2 7 3 5 7
3779* 3793 3797 3803 3821* 3823 3833* 3847* 3851* 3853 3863* 3877 3881 3889 3907 3911 3917 3919 3923 3929 3931 3943* 3947 3967* 3989* 4001 4003 4007* 4013 4019* 4021 4027 4049 4051* 4057* 4073* 4079 4091* 4093 4099 4111 4127 4129 4133 4139* 4153* 4157 4159 4177* 4201 4211* 4217* 4219*
2 '1889 24 '3'79 22 ·13· 73 2 ·1901 22'5'191 2'3'7 2'13 23 '479 2·3·641 2· 52. 7·11 22. 32·107 2·1931 22'3'17'19 23 '5'97 24 '3 5 2.3 2'7'31 2·5·17·23 22 ·11· 89 2·3·653 2·37'53 23 ·491 2·3·5·131 2'3 3 '73 2 ·1973 2·3·661 22 ·997 25 .5 3 2·3·23·29 2·2003 22'17'59 2'7 2'41 22'3'5'67 2·3·11·61 24 .11.23 2.3 4 '5 2 23 • 3 '13 2 23 '509 2·2039 2'5·409 22.3.11'31 2·3·683 2·3·5·137 2·2063 25 • 3 ·43 22 ·1033 2·2069 23 .3.173 22 ·1039 2.3 3 . 7'11 24 .3 2'29 23 '3'5 2'7 2'5'421 23 '17'31 2·3·19'37
2 5 2 2 3 3 3 5 2 2 5 2 13 11 2 13 2 3 2 3 2 3 2 6 2 3 2 5 2 2 2 3 3 10 5 3 11 2 2 2 17 5 13 2 2 5 2 3 5 11 6 3 2
p
4229* 4231 4241 4243 4253 4259* 4261* 4271 4273 4283 4289 4297 4327* 4337* 4339* 4349* 4357 4363 4373 4391 4397 4409 4421* 4423* 4441 4447* 4451* 4457* 4463* 4481 4483 4493 4507 4513 4517 4519 4523 4547 4549 4561 4567* 4583* 4591 " 4597 4603 4621 4637 4639 4643 4649 4651* 4657 4663
p-1
g
22'7'151 2'3 2'5'47 24 '5'53 2·3· 7 ·101 22 ·1063 2'2129 22'3.5'71 2'5'7·61 24 .3.89 2·2141 26 '67 23 '3'179 2·3'7·103 24 .271 2'3 2.241 22 '1087 22. 32.11 2 2·3·727 22 ·1093 2·5·439 22'7'157 23 ·19·29 22. 5·13 ·17 2· 3 ·11· 67 23 • 3· 5· 37, 2'3 2'13'19 2'5 2'89 23 '557 2·23·97 27 '5'7 2'3 3 '83 22 ·1123 2·3'751 25 .3'47 22 ·1129 2.3 2.251 2·7'17·19 2'2273 22'3'379 24 .3.5'19 2'3'761 2·29'79 2· 33 • 5 ·17 22'3'383 2·3 ·13'59 22 .3.5'7.11 22'19'61 2'3'773 2·11·211 23 '7'83 2'3'5 2'31 24 .3'97 2.3 2'7.37
2 3 3 2 2 2 2 7 5 2 3 5 3 3 10 2 2 2 2 14 2 3 3 3 21 3 2 3 5 3 2 2 2 7 2 3 5 2 6 11 3 5 11
5 2 2 2 3 5 3 3 15 3
56
3. Quadratic Residues
p
p-I
g
p
p -I
g
p
p-I
g
4673* 4679 4691* 4703* 4721 4723 4729 4733 4751 4759 4783* 4787 4789
26 .73 2·2339 2·5·7·67 2·2351 24 .5.59 2·3·787 23 .3.197 22 .7.13 2 2.5 3 .19 2·3·13·61 2·3·797 2·2393 22 .3 2 .7.19
3 II 2 5 6 2 17 5 19 3 6 2 2
4793* 4799 4801 4813 4817* 4831 4861 4871 4877 4889 4903 4909 4919
23 .599 2·2399 26 .3.5 2 22 .3.401 24 .7.43 2·3·5·7·23 22 .3 5 .5 2·5·487 22 .23.53 23 .13.47 2·3·19·43 22 .3.409 2·2459
3 7 7 2 3 3 II II 2 3 3 6 13
4931* 4933 4937* 4943* 4951 4957 4967* 4969 4973 4987 4993 4999
2·5·17·29 22 .3 2 ·137 23 .617 2·7·358 2·3 2 ·5 2 ·II 22 .3.7.59 2·13·191 23 .3 3 .23 22 ·II·II3 2.3 2 .277 27 .3.13 2.3.7 2 .17
6 2 3 7 6 2 5 II 2 2 5 3
Chapter 4. Properties of Polynomials
4.1 The Division of Polynomials We consider polynomialsf(x) with rational coefficients and we denote by 13°f the degree of the polynomial.
Definition 1.1. Let./{x) and g(x) be two polynomials with g(x) not identically zero. If there is a polynomial h(x) such that./{x) = g(x)h(x), then we say that g(x) divides j{x), and we write g(x)I'/{x) or glf If g(x) does not divide ./{x), then we write g,tf Clearly we have the following: (i)flf; (ii) ifflg and gil, thenfand g differ only by a constant divisor, and we call them associated polynomials; (iii) if fig and glh, then Jlh; (iv) if fig, then 13°f ~ aOg. Ifflg and g,tI, then we callfa proper divisor of g and it is easy to see that, in this case, 13°f < 13° g. Theorem 1.1. Let./{x) and g(x) be any two polynomials with g(x) not identically zero. Then there are two polynomials q(x) and r(x) such that f = q . g + r, where either r = 0 or aOr < aOg. Proof We prove this by induction on the degree off If 13°f < aOg, then we can take q = 0, r =f If aOf~ aOg, we let f=
IXnXn
+ ... ,
g = Pmxm
+ ... ,
aOf= n, 13° g = m,
so that
From the induction hypothesis, there are two polynomials h(x) and r(x) such that
where either r
so that f
=
0 or aOr < aOg. We now put
= qg + r as required. D
58
4. Properties of Polynomials
Definition 1.2. By an ideal we mean a set I of polynomials satisfying the following conditions: (i) If f, gEl, then f + gEl; (ii) IffE I and h is any polynomial, then fh E I. Example. The multiples of a fixed polynomial fix) forms an ideal.
Theorem 1.2. Given any ideal I, there exists a polynomial f E I such that any polynomial in I is a multiple off; that is I is the ideal of the set of multiples off Proof Let f be a polynomial in I with the least degree. If g is a polynomial in I which is not a multiple off, then, according to Theorem 1.1, there are polynomials q(x) and r(x) (1' 0) such that g
= qf + r,
Since f E I, it follows from (ii) that qfE I, and hence from (i) that g - qfE I, that is rEI. But this contradicts the minimal degree property of f The theorem is proved. D Definition 1.3. Let f and g be two polynomials. Consider the set of polynomials of the form mf + ng where m, n are polynomials. From Theorem 1.2 we see that this set is identical with the set of polynomial which are multiple of a polynomial d. We call this polynomial dthe greatest common divisor offand g, and we write (f, g) = d. For the sake of uniqueness we shall take the leading coefficient of (f, g) to be I, that is a monic polynomial. Theorem 1.3. The greatest common divisor (f, g) has the following properties: (i) There are two polynomials m, n such that (f, g) = mf + ng; (ii) For every pair of polynomials m, n we have if, g)lmf + ng; (iii) If Ilf and Ilg, then 11(f, g). D Definition 1.4. If(f, g) = I, then we say thatfand g are coprime. Theorem 1.4. Let p be an irreducible polynomial. If plfg, then either plf or pig. Proof If p,tf, then (f, p) = I. Thus, from Theorem 1.3 there are polynomials m, n such that mf + np = 1 so that mfg + ngp = g. Since plfg, it follows that pig. D
4.2 The Unique Factorization Theorem Theorem 2.1. Any polynomial can be factorized into a product of irreducible polynomials. If associated polynomials are treated as identical, then, apart from the ordering of the factors, this factorization is unique. D
59
4.2 The Unique Factorization Theorem
The theorem can be proved by mathematical induction on the degree of the polynomial. Theorem 2.2. Letj(x) and g(x) be two polynomials with rational coefficients, and that j(x) be irreducible. Suppose that f(x) = 0 and g(x) = 0 have a common root. Then j(x)lg(x). Proof Sincefand g have a common zero, it follows that (f, g) # l. Let d(x) be the greatest common factor of j(x) and g(x). Then d(x) and j(x) are associated polynomials, because j(x) is irreducible. Therefore j(x)lg(x). 0
From this theorem we deduce the following: Ifj(x) is an irreducible polynomial of degree n, then the zeros
are distinct. Moreover, if 9(i) is a zero of another polynomial g(x) with rational coefficients, then the other n - I numbers are also the zeros of g(x). Theorem 2.3. Let f and g be monic polynomials:
where Pv are distinct irreducible monic polynomials. Then
where
Cv
= min (a v , bv )' 0
Definition 2.1. Letfand g be two polynomials. Polynomials which are divisible by bothfand g are called common multiples offand g. Those common multiples which have the least degree are called the least common multiples, and we denote by [f, g] the monic least common multiple. Theorem 2.4. Under the same hypothe~is as Theorem 2.3 we have
where dv
= max (a v , bv ). 0
From this we deduce: Theorem 2.S. A least common multiple divides every common multiple. Theorem 2.6. Let f, g be monic polynomials. Then fg
=
[f, g](f, g).
0
0
60
4. Properties of Polynomials
4.3 Congruences Let m(x) be a polynomial. If m(x)lfix) - g(x), then we say that fix) is congruent to g(x) modulo m(x) and we write
fix)
= g(x)
(modm(x)).
With respect to any modulus m(x) we have: (i)f=f(modm); (ii) iff= g (modm), then g =f(modm); (iii) iff= g, g = h (modm), thenf= h (modm); (iv) iff= g, fl gl (modm), thenf ±fl g ± gl,ffl ggl (modm). Being congruent is an equivalence relation which partitions the set polynomials into equivalence classes. From (iv) we see that addition and multiplication can be defined on these classes. We denote by 0 the class whose members are divisible by m(x). If m(x) is irreducible we can even define division on the set of equivalence classes (except by 0, of course). Specifically, if fix) is not a mUltiple of m(x), then there are polynomials a(x), b(x) such that a(x}f{x) + b(x)m(x) = 1 which means that there is a polynomial a(x) such that a(x)f(x) = 1 (modm(x)). We state this as a theorem.
=
=
=
Theorem 3.1. Let m(x) be irreducible. Then any non-zero equivalence class has a reciprocal. That is, if A is a non-zero equivalence class, then there exists a class B such that for any polynomials fix) and g(x) in A and B respectively we have fix)g(x) = 1 (mod m(x)). D We now give an example to illustrate the ideas in this section. Let m(x) = x 2 + 1, an irreducible polynomial. Each equivalence class contains a unique polynomial ax + b which we may take as the representative. The addition and subtraction of classes is given by ax + b ± (alx + b l ) = (a ± al)x + (b ± bl)' Multiplication is given by (ax + b)(alx + b l ) = aalx 2 + (ab l + alb)x + bb l = (ab l + alb)x + bb l - aal (modx 2 + 1). Using the ordered pair (a, b) to denote the class containing ax + b we then have
(a,b)
± (abb l ) =
(a, b)(ah b l )
(a
± abb ± bl),
= (ab l + bal, bb l - aal)'
From
(ax
+ b)( -
ax
. . ( we see thatthe Inverse of (a, b) IS
+ b) = a2 + b2
(modx 2
+ 1),
b)
a 2' 2 2 2 ' In other words we have the a +b a +b arithmetic of the complex number ai + b. Extending the idea here, if m(x) is a monic polynomial of degree n, then each equivalence class possesses a unique polynomial with degree less than n, say -
and the arithmetic of the congruence modulo m(x) becomes the arithmetic of these
61
4.4 Integer Coefficients Polynomials
polynomials. The sum of two such polynomials is obtained by adding the corresponding coefficients, and the product is the ordinary product polynomial reduced modulo m(x). Exercise 1. Let OCl, OC2, OC3 be distinct. Determine a quadratic polynomial j(x) satisfying j(OC1) = /31 '/(OC2) = /32, j(OC3) = /33'
Answer: The Lagrange interpolation formula ft..x) = /31
(x - O(2)(X - O(3) (OCI - O(2)(OCl - O(3)
+ /32
(x - O(3)(X - OCl) (OC2 - O(3)(OC2 - OCl)
(x - OCl)(X - O(2)
+ /33...,..-----,---(OC8 - OCl)(OC3 - O(2)
Exercise 2. Let ml(x) and m2(x) be two non-associated irreducible polynomials. Let fl(X) andf2(x) be two given polynomials. Prove that there exists a polynomialj(x) such thatj(x) =/;(x) (modmi(x)), i = 1,2.
4.4 Integer Coefficients Polynomials It is clear that the set of integer coefficients polynomials is closed with respect to addition, subtraction and multiplication. A set of integer coefficients polynomials is called an ideal if (i) f + g belongs to the set whenever f and g belong to the set, (ii) fg belongs to the set whenever f belongs to the set, and g is any integer coefficients polynomial. Theorem 4.1. (Hilbert) Every ideal A possesses a finite number of polynomials fl' ... ,J,. with the following property: Every polynomial f E A is representable as f = glfl + ... + gnfn where gb' .. , gn are integer coefficients polynomials.
Proof 1) Denote by B the set ofleading coefficients of members of A. We claim that B forms an integral modulus. To see this, we observe that if a, bEB, where ft..x) = axn + .. " g(x) = bxm + .. " then by (ii) we know thatj{x)xm, g(x)x" E A so that
j(x)xm ± g(x)xn
=
(a
± b)xm+n + ...
are in A. Therefore a ± bEB which proves our claim. From Theorem 1.4.3 members of B are multiples of an integer d. Let the corresponding polynomial with leading coefficient d be
2) Let fEA. Then there are two polynomials q(x) and r(x) such that ft..x) = q(X)fl (x) + r(x) where oOr < OOfl or r = O. This is certainly so if the degree of fis less than that offl' Ifj(x) = axn + ... + an (n ~ I), then by 1) we see that dla, and
62
4. Properties of Polynomials
is a polynomial with degree at most n - I. If the degree here is greater than or equal to I, then its leading coefficient is again divisible by d. Continuing the argument we see that our claim is valid. 3) If every member of A has degree at least I, then the theorem is proved. Otherwise we let d' be the greatest common divisor of the leading coefficients of . members of A whose degree are less than I, and we let f2
= d'xl' + d'lX"-l + ...
(did')
be the corresponding polynomial in A. From the above, we see that members of A whose degree lies between l' and I can be written asfix) = Q(X)f2(X) + r(x) where aOr < a 2f2 or r = O. Continuing this argument the theorem is proved. 0
4.5 Polynomial Congruences with a Prime Modulus In this section all the polynomials have integer coefficients and p is a fixed prime number. Definition 5.1. If the corresponding coefficients of two polynomials fix) and g(x) differ by multiples of p, then we say thatf(x) and g(x) are congruent modulo p, and we writefix)~g(x) (modp). By the degree aOfofj(x) modulo p we mean the highest degree of f(x) whose coefficient is not a multiple of p. For example 7x 2 + 16x + 9~2x + 2 (mod 7), and a°(7x 2 + 16x + 9) = I (mod 7). But with respect to the modulus 3, a 2(7 x 2 + 16x + 9) = 2. Clearly we have (i) j(x)~j(x) (modp); (ii) if f~g (modp), then g~f (modp); (iii) if f~g, g~h (modp), thenf~h (modp); (iv) iff~g,Jl ~gl (modp), thenf ±fl ~g ± gl and ffl ~ggl (modp). We note particularly that (f(xW
~j(xP)
(modp).
Definition 5.2. Letf(x) and g(x) be polynomials with g(x) not identically zero mod p. If there is a polynomial h(x) such thatj(x) ~h(x)g(x) (modp), then we say that g(x) dividesf(x) modulo p. We call g(x) a divisor ofj(x) modulo p, and we write g(x)lj(x) (modp). Example. From XS + 3x4 - 4x 3 + 2 ~ (2X2 - 3)(3x3 - x 2 + I) (mod 5) we see that 2X2 - 31x s + 3x4 - 4x 2 + 2 (mod 5). We have the following: (i) f(x)lj(x) (modp); (ii) if j(x)lg(x) and g(x)lf(x) (modp), thenj(x) and g(x) differs only by a constant factor; that is, there exists an integer a such thatj(x)~ag(x) (modp). In this case we say thatj(x) and g(x) are associated modulo p. It is easy to see that every polynomial has p - I associates
63
4.6 On Several Theorems Concerning Factorizations
modulo p. Moreover, there is a unique monic associated polynomial. (iii) Ifflg, glh (modp), thenflh (modp). (iv) Letfix) and g(x) be two polynomials with g(x) not identically zero modulo p. Then there are two polynomials q(x) and r(x) such that fi.x)~q(x)g(x) + r(x) (modp), where either aOr < aOg, or r(x)~O (modp). Definition 5.3. If a polynomial fix) cannot be factorized into a product of two polynomials with smaller degrees modp, then we say that f(x) is an irreducible polynomial modp, or thatf(x) is prime modp. Example. We take p = 3. There are three non-associated linear polynomials, namely x, x + 1, x + 2, which are irreducible. There are nine non-associated quadratic polynomials, namely x 2 , x 2 + x, x 2 + 2x, x 2 + 1, x 2 + X + 1, x 2 + 2x + 1, x 2 + 2, x 2 + X + 2, x 2 + 2x + 2. Of these there are 6 (= (x + a)(x + b)) which are reducible, and the three irreducible ones are x 2 + 1, x 2 + X + 2, x 2 + 2x + 2.
We note that if a polynomial is irreducible mod p, then it is irreducible and from this we deduce that x 2 + 2x + 2 has no rational zeros. The determination of the number of irreducible polynomials modp of degree n is an interesting problem which we shall solve in §9. Theorem 5.1. Any polynomial can be written as aproduct of irreducible polynomials modp, and this product representation is unique apartfrom associates and ordering of the factors. 0 We can define, similarly to §1, the greatest common divisor and the least common multiple. If we denote by (f, g) the monic greatest common divisor, then we have Theorem 5.2. Given polynomials j(x) and g(x), there are polynomials m(x) and n(x) such that m(x)f(x) + n(x)g(x)~(f(x), g(x)) (modp). 0
4.6 On, Several Theorems Concerning Factorizations Definition 6.1. Letj(x) = anxn + an_1x"-1 + ... be a polynomial. The polynomial + (n - 1)an_lxn- 2 + ... is called the derivative ofj(x) and is denoted by
nanx"-l f'(x).
Clearly we have (f(x) + g(x))' = f'(x) that (f(x)g(x)), = f'(x)g(x) + g'(x)j(x).
+ g'(x),
and it is not difficult to prove
Definition 6.2. If a polynomial j(x) is divisible by the square of a non-constant polynomial modp, then we say thatfix) has repeated/actors modp. For example, x 5 + X4 - x 3 - x 2 + X + 1 has the repeated factors (x 2 + 1)2 modulo 3.
64
4. Properties of Polynomials
Theorem 6.1. A necessary and sufficient condition for j(x) to have repeatedfactors is that the degree of (j(x),f'(x» is at least 1. D Theorem 6.2. Ijp,(n, then X' - 1 has no repeatedfactors modp. Theorem 6.3. Let (m,n)
=
d. Then (x'" - 1, xn - 1) =;xd - 1.
D
D
Theorem 6.4. Let (m, n) = d. Then
4.7 Double Moduli Congruences Definition 7.1. Let p be a prime number and q>(x) be a polynomial. Iff1 (x) - fix) is a multiple of q>(x) mod p, then we say that f1 and f2 are congruent to the double moduli p, q>(x) and we write
f1(X) §. f2 (x)
(moddp, q>(x».
For example, x 5 + 3x4 + x 2 + 4x + 3 §. 0 (modd 5, 2X2 - 3). Double moduli congruences have the following properties: 1) j(x)§.j(x) (moddp, q>(x»; 2) If f§.g (moddp, q», then g§.f(moddp, q»; 3) If f§.g and g§.h (moddp, q», thenf§.h (moddp, q»; 4) If f§.g and f1 §.gl (modd p, q», then f ±f1 §.g ± gl and ff1 §.ggl (moddp, q»; 5) Suppose that the degree of q>(x) (modp) is n. Then every polynomial is congruent to one of the following polynomials
0::;;; ai::;;;p - 1.
(1)
It is clear that there are pn polynomials in (1), no two of them are congruent (moddp, q>(x», and any polynomial must be congruent to one of them (moddp, q>(x». Definition 7.2. We call the pn polynomials in (1) a complete residue system (moddp, q>(x». By discarding those polynomials which are not coprime with q>(x) we have a reduced residue system (moddp, q>(x».
Theorem 7.1. Let (g(x), q>(x» = 1. Then, asj(x) runs through a complete (or reduced) residue system (moddp, q>(x», so does f(x)g(x). Proof If g(X)f1 (x) §. g(X)f2(X) (moddp, q>(x», then from (g(x), q>(x» = I we deduce that f1 (x) §. f2 (x) (moddp, q>(x». The required result follows easily from this. D
65
4.8 Generalization of Fermat's Theorem
4.8 Generalization of Fermat's Theorem Let p be a prime number, and
(x). Then ffl' ... ,jJp" _ I is also a reduced residue system. Therefore p"-l
p"-l
n /;(x) n (f(x)f;(x)) ~
i= I
(moddp,
L. This shows that, when I > L, the first t + k (k ;;:: 0) terms of the power series in p representing a" a, + 1, a, + 2, ••• must be equal. Since t can be arbitrarily large the required result follows. We have proved that all the power series in p of the form (3), finite or infinite, together give the whole set of p-adic numbers.
15.9 Application Although the notion ofap-adic number is introduced as such only in this chapter, it has appeared several times already in this book. An example of this was pointed out in the beginning of this chapter. Th,e generalization of this example is known as Hensel's Lemma. Theorem 9.1 (Hensel). Let f(x) be a polynomial with integer coefficients, and f(x) == go(x)ho(x) (modp), where go(x) and ho(x) are coprime polynomials. Then, among the p-adic numbers, there are two polynomials g(x), h(x) such that g(x) == go(x), h(x) == ho(x) (modp), andf(x) = g(x)h(x). Proof Let g,(x), h,(x) be two polynomials satisfying
and
Clearly g, and h, are coprime (modp). Let
and
422
15. p-adic Numbers
so that we have
Let
j(x) - g,(x)h,(x)
------;-,- - == t(x) (modp). p
Since g,(x) and h,(x) are coprime (modp), there are two polynomials cp(x) and t/J(x) such that t(x) == cp(x)hl(x) + t/J(x)Mx) (modp). Therefore j(x) - gl+ 1 (x)h l + 1 (x) ==j(x) - g,(x)hl(x) - pl(cp(x)hl(X)
+ t/J(x)g,(x»
== p'(t(x) - cp(x)h,(x) - t/J(x)g,(x» == 0 (modp'+ 1). Since the degree of t(x) does not exceed the degree of g,(x)h,(x) we may assume that the degrees of cp(x) and t/J(x) do not exceed the degrees of g,(x) and h,(x) respectively. The coefficients of g,(x) and hl(x) are cp-convergent, and so they converge to g(x) and hex) respectively. The theorem is proved. 0
Note: Lemma 7.10.1 can be given an interpretation in p-adic numbers.
Chapter 16. Introduction to Algebraic Number Theory
16.1 Algebraic Numbers Definition 1.1. By an algebraic number we mean a number 8 which is a root of the algebraic equation (1)
where the coefficients a r are rational numbers.
J=l
Examples of algebraic numbers are j2, i = and the rational numbers themselves. By clearing the denominators of all the fractions a r in equation (1) we obtain an algebraic equation with integer coefficients. From now on we shall call an ordinary integer a rational integer to distinguish it from an algebraic integer, which we shall define later. We see therefore that algebraic numbers may also be defined as the roots of algebraic equations with rational integer coefficients. If the equation (1) is irreducible and an i= 0, then we call n = aOfthe degree of the algebraic number 8. For example, rational numbers have degree 1, and the number i has degree 2. Let the equation (1) be irreducible and let 8(1), 8(2), ••• , 8(n) be all its roots. From Theorem 4.2.2 we know that 8(j) 'are distinct, and if 8(j) satisfies a rational coefficient equation g(x) = 0, then so do the remaining n - 1 numbers. We see therefore that the degree of an algebraic number is uniquely determined. Theorem 1.1. The sum, the diflerence, the product and the quotient (not dividing by zero) of two algebraic numbers are algebraic. 0 With the aid of the symmetric polynomial theorem, the proof of this theorem is only a simple exercise, and we shall omit many of the proofs in this chapter. Definition 1.2. If the irreducible algebraic equation defining 8 has rational integer . coefficients and leading coefficient 1, then we call 8 an algebraic integer. Examples of algebraic integers are themselves.
j2, i, (1 + fi)/2, and the rational integers
Theorem 1.2. Any algebraic integer which is rational must be a rational integer.
0
424
16. Introduction to Algebraic Number Theory
Theorem 1.3. The sum, the difference and the product of two algebraic integers are algebraic integers. D Theorem 1.4. Let 8 be an algebraic number. Then there exists a natural number q such that q8 is an algebraic integer. D Definition 1.3. If 8 and 8 - 1 are both algebraic integers, then we call 8 a unit. Examples of units are i and 3 - 2)2. Theorem 1.5. A necessary and sufficient condition for 8 to be a unit is that 8 satisfies an algebraic equation with rational integer coefficients, and with leading coefficient 1 and last coefficient ± 1. D
16.2 Algebraic Number Fields Definition 2.1. Let Fbe a set of complex numbers with at least two distinct members. Suppose that, given any two members in F, their sum, difference, product and quotient (not dividing by 0) are also members of F. Then we call Fa number field, or simply a field. An example of a field is the set of rational numbers which we shall, from now on, denote by R. It is clear that every number field must contain the rational field R. Theorem 2.1. Let 8 be an algebraic number ofdegree n. Then the set ofnumbers of the form (1)
where ak are rational numbers, forms a field. Moreover numbers represented by (1) are all distinct.
D
Dermition 2.2. The field in Theorem 2.1 is called the single extension of R by 8, and we shall denote it by R(8). For example, R(i) is the field of numbers of the form a rational numbers.
+ ib where a and bare
Theorem 2.2. /f8 "# 0, then R(8) is largest set ofnumbers obtainedfrom 8 by means of addition, subtraction, multiplication and division (except by 0). D Definition 2.3. Let 81> ... ,8, be algebraic numbers. The field obtained from addition, subtraction, multiplication and division (except by 0) of these numbers is called a finite extension of R and is denoted by R(8 1 , .•• , 8,).
425
16.3 Basis
Theorem 2.3. Every finite extension of R is a single extension. That is, given any finite extension R(8l>"" 8,), there exists an algebraic number 8 such that R(8) = R(8l> ... ,8,). 0 From this theorem we need only consider single extensions R(8), which we now call algebraic number fields. We also call the degree of 8 the degree of the field R(8). For example, R(i) is a quadratic field, and R is the only field with degree I. Theorem 2.4. Let D run over all the rational integers not equal to 1 with no square divisors. Then R(JD) runs over all the quadratic fields. 0
16.3 Basis In this section R(8) denotes an algebraic number field of degree n. We set 8 = 8(1), and let 8(2), ••• ,8(n) denote the remaining n - 1 roots of the irreducible equation defining 8. From the previous section we see that each number aE R(8) is representable as
where
aj
are rational numbers.
Dermition3.1. Leta(l) = a. Weputa(k) = a(8(k»,k conjugates of a. We also call the numbers Sea) N(a)
= 2,3, ... ,nand we call them the
= a(1) + ... + a(n) = a(8(1» + ... + a(8(n», = a(l) ... a(n) = a(8(1» ... a(8(n»,
the trace and the norm of a respectively.
It is easy to see that S(a + f3) = Sea) + S(f3) and N(af3) = N(a)N(f3). Also, from the symmetric polynomial theorem, we see that Sea) and N(a) are rational numbers, and if a is rational then Sea) = na and N(a) = an. Next, if a is an algebraic integer, then so are a(i), and hence so are Sea) and N(a); but Sea) and N(a) are known to be rational so that they must be rational integers. If a is a unit, then from N(a)N(a- 1) = N(aa- 1) = N(I) = 1 and the fact that N(a), N(IX- 1) are rational integers we deduce that N(IX) = ± 1. Conversely, if ais an algebraic integer and N(IX) = ± I, then 1X- 1 = ± a(2) ... a(n) is also an algebraic integer and so IX must be a unit. Therefore a necessary and sufficient condition for an algebraic integer IX to be a unit is that N(IX) = ± 1. Theorem 3.1. Let aE R(8) and let the irreducible equation satisfied by IX be hex) = 0, aOh = I. Also, let g(x) = rr~= 1 (x - a(v». Then g(x) is a polynomial with rational coefficients, and g(x) = c(h(x»"/', where lin and c is a rational number.
426
16. Introduction to Algebraic Number Theory
Proof That g(x) is a polynomial with rational coefficients follows at once from the symmetric polynomial theorem. Let oc = a(8). Then from h(oc) = 0 we have h(oc(v) = h(a(8(v» = 0, so that every root of g(x) = 0 is also a root of hex) = O. Since hex) is an irreducible polynomial we must have h(x)lg(x). Let g(x) = h(x)gl(x). If gl(X) is a constant, then the required result is proved; otherwise gl(X) has zeros and these must also be zeros of hex), so that h(x)lgl(x). Let gl(X) = h(x)g2(x). We can repeat the argument, and since the degree of g(x) is finite we finally obtain g(x) ::= c(h(x»n/l. 0
From this theorem we see that if 0( is an algebraic number of degree I, then there are I distinct numbers among OC(l), ••• , O(n) and each of them occurs nil times. Definition 3.2. Suppose that there exists a set of numbers OC1, ••• , O(m in R( 8) such that any number in R(8) is uniquely representable as alO(l + ... + amOCm where aj (1 ~j ~ m) are rational numbers. Then we call 0(1)'' • ,O(m a basis for R(8). It is easy to see that no one of 0(1, ••• , O(m is expressible as a linear combination of the other m - 1 numbers with rational coefficients. From Theorem 2.1 we know that 1,8, ... , 8 n- 1 forms a basis for R(8), so that basis certainly exists. Following the proof of Theorem 14.9.2 the reader can easily prove
Theorem 3.2. Every basis for R(8) has precisely n elements.
0
Let 0(1" •• , O(n and PI" .. , Pn be two bases for R(8). Then, from the definition ofa basis, there are rational numbers ajk (1 ~j, k ~ n) such that all
n
O(j
=
L ajkPk
~j~n),
(1
lajkl
a1 n
= ............ # O.
k=l
Definition 3.3. Let number
OCl, ••• , OCn E
R( 8). By the discriminant of 0(1, •••
L1(CX1, ... ,ctn) =
,O(n
we mean the
.......... .
Theorem 3.3. The discriminant
J(O(1>' •• , OCn) possesses the following properties: is a rational number; and if 0(1) ••• , OCn are algebraic integers, then J(O(1>' •• ,O(n) is a rational integer. 2) Let 0 ( 1 ) ' ' ' ' O(n and PI,"" Pn be two bases for R(8), and aj = L~= 1 ajkPk (1 ~j ~ n). Then
1)
J(O(1>""
O(n)
3) A necessary and sufficient conditionfor 0(1)' J(O(l> ... ,O(n)
# O.
•• ,
OCn
to be a basis for R(8) is that
0
Theorem 3.4. Suppose that, among the numbers 8(1), ... , 8(n), rl of them are real, and r2pairs of them are complex conjugates (r1
+ 2r2 = n). Then,for any basis
0(1)' •• ,
O(n
427
16.4 Integral Basis
of R(8), we have
Proof From Theorem 3.3 we need only examine the case = 8n - 1 • Now
0(1
= 1,
0(2 =
8, ... ,
O(n
Let us denote by !J the complex conjugate of 8. When 8(k) i= !JU) we have «8(j) - 8(k»(!J(j) - !J(k»)2 > 0, and (8(j) - !JW)2 < O. Therefore (- 1),2.1(1,8, ... , 8 n -
1)
> O.
16.4 Integral Basis In the remaining part of this chapter we shall use the word integer to mean an algebraic integer. Definition 4.1. Let Oh, . .. , Wm be m integers in R(8). If every integer in R(8) can be expressed uniquely as a1 W1 + ... + amw m, where a1' ... ,am are rational integers, then we call W1>'" ,Wm an integral basis for R(8). Theorem 4.1. Integral basis exists. More specifically let W1, ... ,Wnbe a basis where Wj (l ~j ~ n) are integers such that ILI(w1>"" wn)1 is least. Then W1>"" Wn is an integral basis. Proof We can choose a natural number q so that q8 is an integer, and now 1, q8, (q8)2, ... , (q8)"-1 are integers which form a basis for R(8). Therefore a basis 0(1) .•• ,lXn consisting of integers certainly exists. We shall now prove the set W1,' .. ,Wn of integers forming a basis which makes 1.1(0(1>' •• ,00n)lleast is an integral basis. Suppose the contrary. Then there exists an integer W = a1W1 + ... + anWn> where some ai is not a rational integer. We may assume without loss that a1 is not a rational integer, say a1 = g + t where g is a rational integer and 0 < t < 1. Then W'l = W - gW1 = tW1 + a2W2 + ... + anWnis also an integer, and w~, W2," . ,Wn still forms a basis for R(8). But
contradicting the minimal property of ILI(w1>' .. ,wn)l. The theorem is proved.
0
From this theorem we see that an integral basis is a basis, so that each integral basis consists of n elements. Theorem 4.2. All integral basis have the same discriminant. That is, if W1> ... , Wn and W'l' ... ,w~ are two integral bases, then L1(W1> ... ,wn) = L1(w'1" .. ,w~). 0
428
16. Introduction to Algebraic Number Theory
Definition 4.2. By the discriminant of the field R(.9) we mean the discriminant of its integral basis. We shall denote the discriminant of R(.9) by LI(R(.9» or simply LI. Theorem 4.3 (Stickelberger). The discriminant ofafield satisfies LI == 0 or 1 (mod 4). Proof Let il>' .. , in be a permutation of 1, 2, ... , n, and let (jil ....• i be 1 or - 1 depending on whether il>' .. , in is an even or odd permutation. Then, from the expansion of a determinant, we have n
"
(j.
~
. w(ill .•• w(in)
It""f1n
1
n
(i1to .. ,i n )
L
W~ll ... W~in)
+ 21] =
a
+ 21],
(it, ... ,i n )
where I] is an algebraic integer, and a = L(il ..... i n ) W~l) ••• w~in) is a symmetric function of .9(1), ••. , .9(n), so that a is rational and hence a rational integer. Therefore
Since the integer 1](1] + a) = (LI - a 2 )/4 is rational, it is a rational integer. Therefore LI == a 2 == 0 or I (mod 4). D We shall now examine the quadratic field R(jD) where D is a square-free rational integer. Each number in R(jD) is representable as 0( = (a + bjD)/2 where a, b are rational numbers. The trace and the norm of 0( are given by S(O() = a,
a2 _ b 2 D N(O()=--'4
Theorem 4.4. In the quadratic field R(jD), a necessary and sufficient conditionfor 0( to be an integer is that a, b are both rational integers satisfying a
== b (mod 2),
a == b == 0 (mod 2),
when
D == 1 (mod4);
when
D == 2,3
(mod 4).
(1)
Proof Since, in a quadratic field, 0( is an integer if and only if S(O(), N(O() are rational integers, the sufficiency of the condition (1) follows at once. Conversely, if 0( is an integer, then a and (a 2 - b 2 D)/4 are rational integers, so that
is also a rational integer. Since D is square-free, the number b must be rational. The necessity of the condition (1) now follows from a 2 - b 2 D == O. D
429
16.4 Integral Basis
When D == 1 (mod4), (l
+ jD)/2 is an integer in R(jD).
From
a + bjD a - b 1 + jD ----,,--'--- = - - + b ----'--
2
2
2
and 1
1jD
1 12 _jD =4D,
1
1
l+jD
I-jD
2
2
2
=D,
we have the following:
Theorem 4.5. Let D be a square-free rational integer, and let
D
Ll- { - 4D'
when
D== 1
(mod 4),
when
D == 2,3
(mod 4).
Then Ll is the discriminant of R(jD) , and 1, w is an integral basis. The numbers 1, (Ll + Jii)/2 also form an integral basis. D
From this theorem we see that, in a quadratic field, we may choose an integer w such that 1, w form an integral basis. This is not true in general; that is, if R(8) is a field of degree n ~ 3, we may not always find an integer w such that 1, w, ... ,wn - 1 is an integral basis for R(8). Example. Let oc be a zero offix) = x 3 - x 2 - 2x - 8. We shall prove that no integer w, with the property that 1, w, w 2 is an integral basis for R(oc), exists. Since ± 1, ± 2, ± 4, ± 8 are not zeros offix) , we know thatfix) is irreducible so that R(oc) is definitely a cubic field. It is easy to show that Ll(I, oc, o( 2 ) = - 4 x 503. Since p = 4/oc is a zero of g(y) = y3 + y2 + 2y - 8, it follows that pis an integer in R(oc). Let us denote by oc' and oc" the two remaining zeros of fix). Then
Ll(I,oc,P)
=
oc oc' oc"
4/oc 4/oc' 4/oc"
42
2
=
2
(N(oc»
1 oc 1 oc' 1 oc"
oc 2
2
OC,2 OC"2
Since Ll(l, oc, P) i= 0, the numbers 1, oc, Pform a basis. Indeed 1, oc, p must be an integral basis for R(oc), since otherwise the discriminant Ll of the field must satisfy ILlI < 503, and from Theorem 3.3 there exists a natural number a i= 1 such that - 503 = a 2 Ll, which is impossible because 503 is a prime number. Now let w be any integer in R(oc). Then there are rational integers a, b, c such that w = a + boc + cpo Now
430
16. Introduction to Algebraic Number Theory
8
1X2
= IX + 2 + - = 2 + IX + 2{J,
{J2
= -
IX
{J - 2
8
+ Ii = -
2
+ 21X -
(J,
so that w 2 = a 2 + b 2(2 =
+ IX + 2{J) + e2( - 2 + 21X - (J) + 2abIX + 8be + 2ae{J (a 2 + 2b 2 - 2e 2 + 8be) + (b 2 + 2e 2 + 2ab)1X + (2b 2 - e 2 + 2ae){J,
and hence .d(I, W, w 2)
=
1 a 0 b o e
a 2 + 2b 2 - 2e 2 + 8be 2 b2 + 2e 2 + 2ab . .d(1, IX, (J) 2b 2 - e2 + 2ae
== 0 (mod 4 ·503).
Therefore 1, w, w 2 cannot be an integral basis for R(IX).
16.5 Divisibility Definition 5.1. Let IX and {J be two integers. Suppose that there exists an integer y such that IX = {Jy. Then we say that {J divides IX and we write {JIIX. We also say {J is a divisor of IX, or that IX is a multiple of (J. Theorem 5.1. Let g(x) = IX,X'
+ ... + lXo,
1X, "# 0,
where the numbers IX, (J are integers, and let g(x)h(x)
= Y,+mx,+m + ... + Yo.
If there exists an integer (j satisfying (jlyu
o ~ w ~ m).
(0
~
u ~ 1+ m), then (jllXv{Jw (0 ~ v ~ I,
0
The consideration of divisibility leads naturally to the problem of factorization of algebraic integers and the uniqueness of factorization. However, the factorization of integers in the field of all algebraic numbers has little meaning since an integer may be a product of infinitely many integers. For example 2 = 2! x 2! x 2t .... From this we see that we must somehow restrict the domain of the divisors, and therefore we only discuss the factorization problem within a certain algebraic field R(9). Next, there may be infinitely many units in an algebraic field. If e is a unit, then every integer may be written as IX = e . e-llX, and therefore IX has infinitely many
431
16.6 Ideals
factorizations whenever R(B) has infinitely many units. For example, the numbers (1 + J2)n (n = ± 1, ± 2, ... ) are all units in R(J2) so that integers in R(J2) have infinitely many factorizations. In order to avoid this difficulty we introduce the notion of association. Definition 5.2. Two integers associates of each other.
0(,
Pwhich differ only from a unit divisor are called
Being associates is an equivalence relation. Definition 5.3. Let 0( be an integer in R(B). If there exist non-unit integers p, y such that 0( = Py, then we say that 0( is non-prime; otherwise we call 0( a prime in R(B). Theorem 5.2. Every algebraic integer in R(B) can be factorized into a product of primes in R(B). Proof If 0( is a prime, then there is nothing to prove. If 0( = py wher p, yare not units, then IN(O() I = IN(P)I . IN(y)l. Since p, yare not units the natural numbers.IN(p)l, IN(y) I are proper divisors of IN(O()I, so that IN(O()I > IN(P) I > 1 and IN(O()I > IN(y)1 > 1. The proof can now be completed by induction on IN(O()I. D It remains to consider the uniqueness of the factorization, and this is an important problem in algebraic number theory. We shall now examine the quadratic field R(J=5) and show that there is no unique factorization. Since - 5 == 3 (mod 4), every integer in the field takes the form 0( = a + bJ=5 where a, b are rational integers. We shall show that 2,3, 1 ± J=5 are primes in the field, and that 2, 3 are not associates of 1 ± J=5, so that from 6 = 2 . 3 = (1 + J=5)(1 - J=5) we see that there is no unique factorization in R(J=5). First 2, 3 cannot be associates of 1 ± J=5 because IN(2)1 = 4, IN(3)1 = 9 and IN(1 ± J=5)1 = 6. Next, if 2 is non-prime in R(J=5), we let
2 = O(P,
IN(O() I >
1,
IN(P) I > 1.
Write 0( = a + bJ=5. Then, from IN(2) I = 4, we have IN(O() I = a 2 + 5b 2 = 2 and this is impossible. Therefore 2 is a prime in R(J=5). Similarly 3, 1 ± J=5 are also primes in R(J=5). In order to overcome this problem Kummer invented the notion of ideals.
16.6 Ideals We shall now consider a fixed algebraic number field R(B) of degree n. Definition 6.1. Let 0(1) ••• , O(q be any q integers in R(B). The set of integers of the form
432
16. Introduction to Algebraic Number Theory
where I'/l> ••• ,I'/q are integers in R(8) is called an ideal generated by OCl> ••• ,ocq, and is denoted by [OCl> ••• ,OCq]. We shall use the capital Gothic letters
~, ~, (£:,
!l, ... to denote ideals.
Definition 6.2. An ideal [ocJ generated by a single integer oc is called a principle ideal. The set [OJ containing only the integer 0 is an ideal, but we shall assume that our ideals are distinct from [0]. The ideal [IJ contains all the integers in R(8), and is called the unit ideal which we shall denote by .0. Theorem 6.1. Ideals possess the following properties: 1) If oc, 13 are in the ideal, then so are oc ± 13; 2) If oc is in the ideal and 1'/ is an integer in R(8), then We see from this theorem that if 1 E~, then
~
I'/OC
is in the ideal.
0
= [1].
Definition 6.3. Let ~ = [OCl> ••• ,ocqJ and ~ = [f3l> . •. ,f3rJ be two ideals. If ~ and ~ contain exactly the same integers in R(8), then we say that they are equal and we write ~ =~. Theorem 6.2. A necessary and sufficient condition for two ideals [OCl> ••• , ocqJ and [131, ... ,f3rJ to be equalis thatthere are integers ~ij, I'/ji (1 ~ i ~ q, 1 ~ j ~ r) such that OCi
=
L ~ijf3j, j= 1
In particular, if [ocJ
q
f3j =
L I'/jiOCi· i= 1
= [f3J, then oc and 13 are associates. 0
Let OCl> ••• ,ocq be any q rational integers with greatest common factor d. Then there are rational integers Xl> . .• , Xq such that d = X10Cl + ... + XqOCq, and hence, in the rational number field, [OCl> ••• , ocqJ = Cd]. In other words there are only principal ideals of the rational number field. On the other hand we know from our discussion in the last section that, in R(.j=5), the ideal [2, 1 + .j=5J cannot be reduced to a principal ideal, so that non-principal ideals exist. Definition 6.4. Let ~ = [OC1, ••• , ocqJ and ~ = [131, ..• , f3r J be two ideals. We call the ideal [OClf3l> ... , OClf3" OC2f3l> . •• ,OC2f3" •.• ,ocqf3rJ the product of ~ and ~; we shall denote it by ~ . ~. Theorem 6.3. The product of~ and ~ is independent of the choices OCi, f3i. That is, if
then
=
, 13'1'···' OC2, 13't'···' OCs'f3'J [ OC'13' t • 1 1'···' OC ,1 13't' OC2
o
433
16.7 Unique Factorization Theorem for Ideals
This can easily be proved from the definition of equality for ideals. Also we have ~ for any ideal ~, and that multiplication of ideals is commutative and associative. We can then use induction to define ~1 ••• ~m and ~m, where m is a natural number, and show that the usual rules of indices hold.
D .~ =
Definition 6.5. Let ~, mbe two ideals. Suppose that there exists an ideal <£: such that ~ = m<£:. Then we say that mdivides ~ and we write ml~. We call m, <£: the divisors of~.
Clearly we have: 1) if <£:Im, ml~, then <£:I~; 2) if ml~ and 3) for any ideal ~ we always have DI~, ~I~.
1) is any ideal, then
m1)I~1);
Theorem 6.4.
If ml~ then
Proof Let ~ = the form
~
is a subset of m.
m<£: where m =
[Pl>' .. ,Pr], <£:
= [Yl>' .. ,Yr]. Then each odn ~ is of
IX = j~lk~l1]jkPjYk = Jl (Jl1] jkYk)Pj where 1]jk are integers in the field, and hence
IX lies in m.
D
We shall see in the next section that the converse of this theorem also holds; that is, if every integer in ~ lies in m, then mmust divide ~. From Theorem 6.4 we see that if ~ID then ~ = D.
16.7 Unique Factorization Theorem for Ideals Theorem 7.1. Given any idea/~ there exists an idealm such that the product ~ . misa principal ideal [a] generated by a natural number a.
Proof Suppose first that
~
is a principal ideal, say [IX]. Then we take
m= [1X(2), • •• ,1X(n)] where 1X(2), ••• , lX(n) are the conjugates of IX, and with a = IN(IX)I we see at once that ~ . m= [1X1X(2) ••• lX(n)] = [a]. IX,X'
Suppose now that ~ + ... + 1X0, and set
=
[IX" . .. , 1X0] is not a principal ideal. Let fix) =
g(x) = Pm xm
+ ... + Po
(m = (n - 1)/)
so that
n (IX?)X' + ... + n
fix)g(x) =
IX~)
= c,+mx'+m + ... + co,
j= 1
where Ci are rational integers, so that Pj are integers in R(B). Now put m= [Pm,"', Po] and a = (C'+m,"" co), and we shall prove that ~ . m= [a].
434
16. Introduction to Algebraic Number Theory
Since alck (0 ~ k ~ 1+ m), it follows from Theorem 5.1 that alexl'P. (0 ~ Jl ~ I, and hence exl'P. are numbers in [aJ. Conversely, from the greatest common factor definition for a, there are rational integers dl +m, . .. , do such that a = cl+mdl +m + ... + codo. Also, from
o ~ v ~ m),
Ck
=
L
exl'P.
(0
~
k
~
1+ m),
I'+.=k O~JL~1 O~v~m
we have
1'=0.=0
where 111'. are integers in R(8), and so a lies in m: . ~. Therefore m: . ~ Theorem 7.2.
If m: . (£; = m: . 1),
=
[aJ.
0
then (£; = 1).
Proof We choose ~ and a natural number a so that m:. ~ = [a]. This then gives [a] . (£; = [a] . 1), and this equation means that the set of integers in (£;, when
multiplied by a, coincides with the set of integers in 1) when multiplied by a. Therefore (£; = 1). 0 Theorem 7.3. If (£; is a subset of m:, then m:1(£;. Proof We take ~ and a so that m: . ~ = [aJ. Then the ideal ~ . (£; is a subset of m: . ~ = [a] so that we may write ~ . (£; = [aYl> . .. , aYq] = [a] . [Yl> . .. ,Yq] = ~ . m: . [Yl> ... ,Yq], which gives (£; = m: . [Yl>· .. ,yqJ. 0
From this theorem and Theorem 6.4 we see that a necessary and sufficient condition for ~Im: is that m: is a subset of ~. We shall now consider the factorization of ideals and the uniqueness problem associated with it. Definition 7.1. By a prime ideal we mean an ideal with only two divisors, namely D and the ideal itself. We shall denote a prime ideal by '-P. It is easy to see that in the rational field the prime ideals are [P], where p is an ordinary rational prime.
Theorem 7.4. Given any two ideals m: and ~, there exists a unique ideal 1) with the properties: 1) 1)1m:, 1)1~; 2) lf1)llm:, 1)11~, then 1)111). Furthermore each integer in 1) is expressible as ex + Pwhere ex E m:, PE ~. 0
Definition 7.2. We call the ideal 1) the greatest common divisor of m: and ~, and we denote it by (m:, ~). We also define (m:l> ... , m:m-l> m:m) = «m:l> ... , m:m-l), m:m). If (m:, ~) = D then we say that m:, ~ are coprime. It is easy to see that if (m:, ~)
= 1), then (m:(£;,
~(£;)
= 1)(£; for any ideal (£;.
435
\6.7 Unique Factorization Theorem for Ideals
Theorem 7.5. Let
'l' be a prime
ideal. Suppose that
Proof Since 'l',r~ we have ('l', ~)
have
'l'1~.
'l'1~~
= .0 and so ('l'~, ~~) =
and ~.
'l',r~.
Then
'l'1~.
Since 'l'1~~ we now
D
Theorem 7.6. Every ideal has finitely many distinct divisors. Proof Given the ideal ~ we choose ~ and a natural number a such that ~ . ~ = [al Therefore ~ contains a, and any divisor of~ also contains a. Thus it
suffices to show that there is at most a finite number of ideals containing a fixed natural number. Let 9Jl = [0(10 ••• ,O(m] be an ideal which contains a, and let Wl, ... , Wn be an integral basis for R(8) so that each O(j can be written as O(j = gjlWl + ... + gjnWn (1 ~j ~ m), where gjk are rational integers. Now set (0
~
Yj
=
n
pj =
L:
k=l
so that
rjk < a), n
qjkWk,
L rjkWk,
k= 1
O(j = apj + Yjo Since a lies in 9Jl, we have 9Jl
=
[aPl
+ Yl,· .. , apm + Ym, a] = [Yb . .. , Ym, a].
Since there is at most a finite number of sets Yb ... , Ym the required result follows. D Theorem 7.7 (Fundamental theorem for ideals). Any ideal ~ distinct from .0 can be factorized into a product ofprime ideals. Furthermore, apart from the ordering of the factors, this factorization is unique. Proof Since each ideal has at most a finite number of divisors we can use induction on the number of divisors of ~. We first establish the existence of a factorization. If ~ is a prime ideal, then there is nothing more to prove; otherwise we let ~ = ~(£; (~ "# .0, (£; "# .0). Since the numbers of divisors of~ and of(£; are less than that of~, the required result follows by induction. We now prove the uniqueness of the factorization. Suppose that
m ;::, 1,
I;::, 1.
If ~ is a prime ideal, then I = m = 1 and there is nothing to prove. If ~ is not a prime ideal, then I > 1, m > 1. Since 'l'd'l"l ... 'l'~, there must be a 'l'j (l ~j ~ m) such that 'l'l = 'l'j. We may assume without loss that j = 1 so that 'l'2 ... 'l', = 'l'~ ... 'l'~, and the required result follows from the induction hypothesis. D We have proved that every ideal distinct from .0 can be written as ... 'l'~r where 'l'j are distinct prime ideals, and aj are natural numbers. The representation is unique apart from the ordering of 'l'j. 'l'~''l'~2
436
16. Introduction to Algebraic Number Theory
16.8 Basis for Ideals Let Wi, . .. , Wn be an integral basis for R(8), and let ~ be any ideal of R(8). Since each member of ~ is representable as a linear combination of Wb ..• ,Wn with rational integer coefficients we see, from Theorem 6.1, that ~ can be viewed as a linear module. Also, corresponding to the ideal ~, there is an ideal ~ and a natural number a such that ~~ = [a], so that aWb"" aWn all lie in ~; and since these n numbers are linearly independent we see that ~ is actually a linear module of dimension n. From our discussion in Chapter 14, section 9, this module ~ must have a basis, and every basis must have exactly n integers. In particular, we have: Theorem S.l. Let such that
~
be an ideal of R(8). Then we can find n integers DCb' •• ,DCn in
where aij are rational integers, aij > 0 (l ~ i ~ n), 0 ~ aji < aii (l DCi, ... , DCnform a standard basis for~. D Let
DCb' •• , IXn
and flb' .. , fln be two basis for
~
~
~
i <j ~ n), and
and let
n
lXi
=
L uijflj
(i=I, ... ,n).
j= 1
Then the coefficient matrix (Uij) must be a modular matrix so that A(DCb" . ,DCn) = A(fl 1, ... ,fln)' Thus the discriminant of a basis of an ideal is independent of the choice of the basis so that we may write this as A(~). We shall now examine the standard basis for ideals of the quadratic fields R(.ji5). Let I, W be an integral basis for R(.ji5); the definition of W is given in Theorem 4.5. From Theorem 8.1 we can find two integers a, b + ew to form a standard basis. Here a, b, e are rational integers and we may suppose that a > 0, e > 0, 0 ~ b < a. However we should note that not all pairs of integers of the above form always form a basis for the ideal; there are other conditions on a, b, e. It is easy to see that a, b + ew form a standard basis for a certain ideal only when aw, web + ew) are representable as xa + y(b + ew), where x, yare rational integers. From aw = xa + y(b + ew) we have a = ye, ax + by = 0, so that cia, elb. Let a = em, b = en. Then from e(n
+ w)w = e(n + w)(n + w + w') - e(n + w)(n + w') = - eN(n + w) + e(n + w)(n + Sew»~,
where Sew) and N(n + w) represent the trace and the norm of wand n + w respectively, we see that a necessary and sufficient condition for em, e(n + w) to be a
437
16.9 Congruent Relations
standard basis for a certain ideal is that N(n
+ w) == 0
(1)
(modm).
From Theorem 4.5 we see that (1) is equivalent to LI
== {(2n + 1)2 (2n)2
(mod4m), (mod4m),
if
D
if
D
== 1 == 2,3
(mod 4); (mod 4).
(2)
Therefore we have: Theorem 8.2. A necessary and sufficient condition for a pair of integers cm, c(n + w) (c > 0, m > 0, 0 ~ n < m) to be a standard basis for a certain ideal of R(JD) is that either (1) or (2) holds. 0
16.9 Congruent Relations Definition 9.1. If~I[IX], then we say that see that ~IIX means that IX is in ~.
~
divides IX, and we write ~IIX. It is easy to
We can follow the discussion in Chapter 14, section 9, and define a congruent relation on the integers of the field R(8) with respect to an ideal. Definition 9.2. If~11X - f3, where IX, f3 are integers in R(8), then we say that IX and f3 are congruent modulo ~, and we write IX == f3 (mod ~). The integers of the field R( 8) are now partitioned into equivalence classes, called the residue classes modulo ~. We shall denote by N(~) the number of these residue classes, and we call N(~) the norm of~. From Theorem 14.9.3 we have: Theorem 9.1. Let W1, ... , Wn be an integral basis for R(8), and let 1Xl> ... , IXn be any basis for the ideal~. If lXi = = 1 aijWj, then N(~) is equal to the absolute value of the determinant of the coefficients, that is N(~) = lIaijll· 0
I:;
From this theorem we deduce at once: Theorem 9.2. Let LI be the discriminant of R(8), and LI(~) be the discriminant of the basis for ~. Then we have LI(~) = (N(~»2L1. 0 Theorem 9.3. The norm of a principal ideal [IX] satisfies N([IX]) Theorem 9.4.
N(~m)
=
= IN(IX)I. 0
N(~)N(m).
Proof Since ~ contains ~m, from Theorem 14.9.4, the members of ~ are partitioned into residue classes modulo ~m, and the number of classes is equal to N(~m)/ N(~). It remains to prove that the number of classes is also equal to N(m).
438
16. Introduction to Algebraic Number Theory
Let flI>' .• , flN('B) denote the residue classes mod~. There exists an integer OCE 21 such that ([oc] , 21~) = 21. Now ocflI> .•. ,OCflN('B) all lie in 21, and if} "# k (l ~), k ~ n), then ocflj 1= OCflk (mod 21~). From ([oc], 21~) = 21, we know that corresponding to any y in 21, there are integers 1], (j such that y = I]OC + (j, (j E 21~. Also, corresponding to the integer 1], there is an integer fl and a natural number) (1 ~) ~ N(~» such that I] = flj + fl so that y = ocflj + ocfl + (j == ocflj (mod 21~). This shows that every member of 21 must be congruent to exactly one of OCfll' ... ,OCflN('B) modulo 21~, and therefore the number of classes concerned must be equal to N(~) as required. 0 Theorem 9.5. Let 'l' be a prime ideal, and let oc be any integer not divisible by 'l'. Then ocN(\jJ) - 1 == 1 (mod'l'). Proof Let 0, 1tl> 1t2,' .. , 1tN(\jJ)-l denote the residue classes mod 'l'. Since 'l',j'oc, the numbers 0, OC1tl, OC1t2, ... , OC1tN(\jJ) - 1 also represent the residue classes mod'l'. Therefore
and the theorem follows.
0
16.10 Prime Ideals Theorem 10.1. Every prime ideal 'l' must divide a rational prime p. Moreover,p is the least positive rational integer in 'l' so that it is unique. Proof From Theorem 7.1 there must exist a rational integer a such that 'l'1[a]. Let a = IIp be its factorization, so that there must be a prime p such that 'l'1[p], or ~ jp. Suppose, if possible, there exists a positive rational integer b such that b < p and 'l'lb. Then bE 'l' so that(p, b) = 1 also lie in 'l' giving 'l' = [1], which is impossible. Therefore p is the least positive rational integer in 'l'. 0
Let the prime ideal factorization for [p] be 'l'1'l'2 ... 'l't. Then, on taking the norm, we havepn = N([p]) = N('l'1)N('l'2) ... N('l't). It follows that the norm ofa prime ideal must be a prime power. If N('l') = pI, then we calIf the degree of 'l'. Concerning the factorization of [p] there is the following important theorem which we shall not prove. Theorem 10.2 (Dedekind's discriminant theorem). A necessary and sufficient condition for 'l'21p is that piA. 0 Let us examine the factorization of [p] in the quadratic field R(fo). Clearly there can only be the following three possibilities. 1) [p] = 'l'; 2) [p] = 'l'.Q, 'l' "# .0, N('l') = N(.Q) = p; 3) [p] = 'l'2, N('l') = p. Concerning the factorization of [p] in a quadratic field, we have:
439
16.10 Prime Ideals
Theorem 10.3. Let A be the discriminant of R(jD). Then 1),2) or 3) in the above holds according to (~) = - 1, + 1, or O. Here (~) is the Kronecker's symbol.
Proof If ~ is a prime divisor of [pJ, and N(~) = p, then either [pJ = ~.Q or [pJ = ~2. Let cm, c(n + w) be a standard basis for the ideal. Then N(~) = c2 m = p, so that c = 1, m = p. From (2) in section 8, we now see that (~) is either + 1 or O. Let us suppose, conversely, that (~) = + 1 or O. We first consider the case p =f. 2. I) If (~) = 1, then there exists a such that p j a and A == a 2 (mod p). Since p =f. 2, we have (p, 2a) = 1 so that [p,a
+ flJ[p, a - flJ
[
~ - fl,-----;-
=
[pJ p,a
+ fl,a
=
[pJ [p,a
+ fl,2a, a 2 ;
AJ
A ,1
J=
[p].
Also [p, a + flJ =f. [p, a - flJ, since otherwise we have [p, a + flJ = [p,a - flJ = [p, a + fl, 2aJ = [1J and this is impossible; [p, a + flJ and [p, a - f l J are not .0. Therefore, when p =f. 2 and (~) = 1, [p J is the product of two distinct prime ideals. 2) If (~) = 0, then piA, so that
[p,flJ 2 = [p,flJ[p,flJ
=
[PJ[p,fl,~J.
But A = D or 4D, p =f. 2 and 1) is square-free, so that (p, :) =-1 and hence [pJ = [p, flJ2. That is, if p =f. 2 and (:) = 0, then [pJ is the square of a prime ideal. Let us now consider the case p = 2. Since (i) =f. - 1 we must have D == 2, 3 (mod 4) or D == 1 (mod 8). As before we can prove: 3) When D == 2 (mod4), we have (1) = 0 and [2J = [2, jDJ2; 4) When D == 3 (mod4), we have (1) = 0 and [2J = [2, 1 + jDJ2; 5) When D == 1 (mod 8), we have (i) = 1 and
[2J = [2, 1 +
fDJ·
Since the two factors here are distinct, ideals. D
[2, 1 -
2jDJ.
[2J is now the product of two distinct prime
Theorem 10.3 establishes Dedekind's discriminant theorem for quadratic fields. We shall now examine a specific example for a cubic field. Let oc be a zero ofj(x) = x 3 - x 2 - 2x - 8. We saw in §4 that R(oc) is a cubic field with discriminant 503, that 1, oc, f3 = 4/oc form an integral basis, and that f3 is a zero of g(y) = y3 + y2 + 2y - 8. We now consider the factorization of [503J in R(oc). Let ~, .0, ~ denote prime ideals of R(oc). Then the factorization of [503J must take one of the following five situations:
440
16. Introduction to Algebraic Number Theory
1) 2) 3) 4) 5)
[503] = ~.Q9t; ~, .0, 9t distinct and N(~) = N(.Q) [503] = ~2.Q; ~ =F.Q and N(~) = N(.Q) = 503; [503] = ~3; N(~) = 503; [503] = ~.Q; N(~) = 503, N(.Q) = 503 2 ; [503] = ~; N(~) = 503 3 •
= N(9t) = 503;
In each of the first four situations, [503] has a prime divisor ~ with norm 503. Let us first examine these four situations. Let ao, bo + bllX, Co + CllX + C2fJ be a standard basis for ~ so that bo < ao', Co < ao, Cl < b l . Also, since aolX, aofJ lie in ~ we have, in addition, that b l ~ ao, C2 ~ ao, and from N(~) = aOb l C2 = 503, we obtain ao = 503, b l = 1, C2 = 1, Cl = O. Therefore ~ must take the form [503, a + IX, b + fJ], and 503, a + IX, b + fJ form a standard basis for ~. Since a + IX, b + fJ E ~ and N(~) = 503, we have N(a + IX) == N(b + fJ) == 0 (mod 503). But a + IX and b + fJ are the roots of fix - a) = 0 and g(y - b) = 0 respectively so that N(a + IX) = Ifl- a)1 and N(b + fJ) = Ig( - b)l. Therefore a and b satisfy the cubic congruences a 3 + a2 - 2a + 8 == 0 (mod 503) and b3 - b2+ 2b + 8 == 0 (mod 503), which give the solutions a == 149, 149,204 and b == 395, 395, 217 (mod 503). Therefore ~ must be one of the following four ideals: [503,149
+ 1X,395 + fJ],
[503,204
+ IX, 217 + fJ],
[503,149
+ 1X,217 + fJ],
[503,204
+ 1X,395 + fJ].
The third ideal is not 1X(217
+ fJ) -
~,
since otherwise
217(149
+ IX) + 65(503) = 4 -
217·149
+ 65·503 = 366
would be in ~, and from (366, 503) = 1 we would have ~ = .0. Similarly the fourth ideal is not ~. Next, from (149
+ IX)IX = -
46(503)
+ 150(149 + IX) + 2(395 + fJ),
(149
+ lX)fJ = -
117(503)
+ 149(395 + fJ),
(395
+ fJ)1X = -
117(503)
+ 395(149 + IX),
(395
+ fJ)fJ = -
310(503)
+ 2(149 + IX) + 394(395 + fJ),
we see that 503, 149 + IX, 395 + fJ do form a standard basis for the prime ideal [503, 149 + 1X,395 + fJ]. Similarly 503, 204 + IX, 217 + fJ do form a standard basis for the prime ideal [503, 204 + IX, 217 + fJ]. Finally the two ideals [503,149 + IX, 395 + fJ] and [503, 204 + IX, 217 + fJ] are distinct divisors of the ideal [503] and we therefore conclude that our situation 2) is the only possibility, and computation shows that actually [503] = [503,149
+ 1X,395 + fJ]2
. [503,204
+ IX, 217 + fJ].
441
16.12 Ideal Classes
16.11 Units We have the following result on units: Among all the units in R(8) we can choose = r1 + r2 - 1 of them, say e1> . .. , e" such that every unit is representable as pe~l ... e!r (I = 0, ± 1, ± 2, ... ); here p is a certain root of unity in R(8). Here we shall only concern ourselves with quadratic fields R(jD). Let a unit be x + yw so that N(x + yw) = ± 1. We need therefore to solve these equations in rational integers for the units in R(jD). Now r
N(x
+ yw) = (x + yw)(x + yw') if D == 1 (mod 4), if D == 2,3
(mod 4).
When D < 0, the equations (2x + y)2 - y2 D = 4 and x 2 - y2 D = 1 have only finitely many solutions, so that R(jD) can have only finitely many units. In fact if we denote by w the number of units in R(jD), it is not difficult to show that w = 6, 4 or 2 according to whether LI = - 3, - 4 or LI ~ - 7. Consider next D > O. Now the equations (2x + y)2 - y2 D = ± 4 and x 2 - y2 D = ± 1 are the Pell equations we considered in Chapter 10. Therefore there exists a unit 1'/ in R(jD) such that any unit in R(jD) is representable as ± 1'/n, n = 0, ± 1, ± 2, .... This number 1'/ is called the fundamental unit of R(jD).
16.12 Ideal Classes Definition 12.1. Let m: and mbe two ideals. Suppose that there exist two principal ideals [ae] and [p] such that [ae]m: = [p]m. Then we say that the two ideals m: and m belong to the same ideal class, and we write m: '" m. It is easy to see that being in the same ideal class is an equivalence relation, and moreover we have 1) m: '" .0 if and only if m: is a principal ideal; 2) if m: '" mand (£; '" :n, then m:(£; '" m:n; 3) if m:(£; '" m(£; then m: '" m. The ideals of R(8) are now partitioned into classes called ideal classes. Theorem 12.1. The number of ideal classes of R(8) is finite. Proof It suffices to show that there exists a positive number M, depending only on R(8), such that every class contains an ideal satisfying N(m) ~ M. This is because
m
there can only be finitely many ideals having a given norm. Let (£; be any ideal of R(8). We already know that there exists an ideal m: such that m:(£; '" .0, and if we can choose an ideal msuch that m:m '" .0 and N(m) ~ M, then our theorem is proved. This is because m:m '" m:(£; so that m'" (£;.
442
16. Introduction to Algebraic Number Theory
Let Oh, ... ,Wn be an integral basis for R( 8) and let n
s= 1
We define the natural number k by k n ~ N(21) < (k + l)n. Among the (k + l)n integers Xl W1 + ... + XnWn (Xm = 0,1, ... ,k) there are at least two which are congruent modulo 21, say
here 0 ~ Ym
~
Zm
in 21. Since IYm - zml
~
IN((X)I
Since
(X
~
=
k, 0
~
k, and we now have the non-zero integer
k it follows that
IS~l mt1 (Ym - zm)w~) I ~ S~l mt1 klw~)1 = knM ~ M· N(21).
is in 21 we see that 211 [(X], and we may write [(X] = = IN((X)I ~ M· N(21) or N(~) ~ M as required. D
21~
which gives
N(21)N(~)
Theorem 12.2. Let h be the number of ideal classes of R(8). Then,for any ideal 21, we have 21h - .0. Proof Let 21b ... , 21h be ideals that belong to different classes. Then so are 2121b ... ,2121h and hence 211 ... 21h - (2121 1) ... (2121h), or 21h - .0. D
16.13 Quadratic Fields and Quadratic Forms Let Ll be the discriminant of the quadratic field R(jD). We shall now establish the relationship between the ideal classes of R(jD) and the classes of quadratic forms having discriminant Ll. Let 21 be an ideal of R(jD) and let (Xl> (X2 be a basis for 21 satisfying (1)
where (X'1' (X~ are the conjugates of (Xl> (X2. Corresponding to 21 we construct the quadratic form F(x,y) =
N((X1 X + (X2Y) ((X1 X + (X2Y)((X~ X + (X~) 2 = = ax N(21) N(21)
+
b xy
+ cy
2
.
Since a = N((X1)/N(21), b = (N((X1 + (X2) - N((X1) - N((X2»/N(21), c = N((X2)/N(21), and (Xl> (X2, (Xl + (X2 are in 21 we see that a, b, c are rational integers. Also, the
443
16.13 Quadratic Fields and Quadratic Forms
discriminant of F(x, y) is b2 - 4ac = (OC1OC~ - OC'lO(2)2/N(91)2 = A. We say that F(x,y) is a quadratic form belonging to 91. When A < 0 the quadratic field R(.ji5) is imaginary so that a > 0 and F(x, y) is positive definite. Also, it is not difficult to see that as OCl, OC2 run through the basis for 91 satisfying (I) we obtain all the quadratic forms equivalent to F. Theorem 13.1. Every indefinite or positive definite quadratic form F(x,y) = ax 2 + bxy + cy2 with rational integer coefficients and discriminant A belongs to an ideal 91 with basis OC1> OC2'
Proof We first show that a, (b - fl)/2 form a basis for the ideal IDl = [a, (b - fl)/2]. Observe that (b - fl)/2 satisfies the equation x(b - x) = ac so that it is an integer. Also we have w = (s(w) + fl)/2, where sew) = 0 or I, and sew) aw=
+b-
(b - fl)
sew) a=
2
+b
2
b - fl a-a
b - fl b - f l sew) - b + b + f l b2 - A 2 w = 2' 2 = ~a
+
2
'
sew) - b b - f l 2
.
2
'
where (s(w) ± b)/2 and (b 2 - A)/4a are rational integers, so that a, (b - fl)/2 do indeed form a basis for IDl. If a > 0 we take 91 = IDl, OCl = a, OC2 = (b - fl)/2, and from N(IDl) = a we have the quadratic form
(ax
+ t(b -
fl)y)(ax a
+ tcb + fl)y) = ax 2 + bxy + cy 2,
---=------'------=-----'---
so that IDl is the required ideal. If a < 0, then, since the quadratic form is not negative definite, A > 0 and we now take 91 = flIDl, OCl = afl and a2 = (b - fl)fl/2. It is easy to see that OC1> OC2 form a basis for 91 satisfying (I). Also N(91) = - aA and we can now construct the quadratic form
- A(ax
+ tcb-
fl)y)(ax - aA
+ tcb + fl)y) = ax 2 + bxy + cy 2.
------'=------'--------==------'---
The theorem is proved.
0
From the above we see that if F belongs to 91, then every quadratic form equivalent to Falso belongs to 91. However, given a quadratic form F, there may be two different ideals 91 and mto which Fbelongs. This then establishes a relationship between 91 and m. Definition 13.1. Let 91 and mbe two ideals. Suppose that there are integers oc and fJ such that [oc]91 = [fJ]m and N(ocfJ) > O. Then we say that 91 and mare equivalent in the narrower sense, and we write 91 ~ m.
444
16. Introduction to Algebraic Number Theory
It is clear that being equivalent in the narrower sense is a special case of being equivalent.
Theorem 13.2. Equivalent quadratic forms belong to ideals which are equivalent in the narrower sense. Conversely, quadratic forms belonging to ideals which are equivalent in the narrower sense are equivalent forms. 0 Let ho denote the number of ideal classes (not in the narrower sense), and let h denote the number of classes under the narrower sense of equivalence. Assume that the discriminant of the field concerned is Ll. Then h is the class number of quadratic forms with discriminant Ll. If ~ '" m then either ~ ~ m or ~ ~ [flJm, and we deduce that h ~ 2h o. In fact, if ~ '" m, then there are integers oc, f3 such that [ocJ~ = [f3Jm. (i) If Ll < 0, then N(ocf3) > 0 so that ~ ~ m, and whence ho = h. (ii) If Ll > 0 and the fundamental unit 1] satisfies N(1]) = - 1, then [ocJ~ = [f3Jm = [1]f3Jm and one of N(ocf3), N(ocf31]) must be positive, so that we still and ho = h. have ~ ~ (iii) If Ll > 0 and the fundamental unit 1] satisfies N(1]) = I then ~ cannot be equivalent in the narrower sense to both m and m[flJ, so that ho = h/2. Therefore we have
m
h' { ho = ~ 2'
if Ll < 0
or
Ll > 0,
if Ll > 0,
N(1])
N(1])
=
{1]2, 1],
1;
= + 1.
Also if we replace d by D in Theorem 11.4.4 and define B
=-
if Ll > 0, if Ll > 0,
B
accordingly, then
N(1]) = - I; N(1]) = + I.
Again, from our results on the class number in Chapter 12 we have: Theorem 13.3. Let ho denote the number of ideal classes. Then
W
h,
[tILlIl(Ll)
~ 2(2 _(~)) ,~,
1] h o =
if Ll < 0,
-; ,
£t
if Ll > O.
s,
Ll
s= 1
Example 1. In R(i) we have Ll = - 4, W = 4 so that ho
=
Example 2. In R(J="3) we have Ll ho
=
±(~) =
4 2(2 - 0)S=1
s
I.
= - 3, W = 6 so that
6 2(2 - (- 1»
±(-=2) =
s= 1
S
I.
0
445
16.14 Genus
Example 3. In R(J=5) we have LI
ho =
I (-
2 2(2 - 0).=1
Example 4. In R(.J"=T9) we have LI
ho =
20, W
= -
=-
20) s
19, W
f. (-
2 2(2-(-I))s=1
2 so that
=
= 2.
= 2 so that
19) = 1. s
Example 5. In R(J2) we have LI = 8, e = 3 + 2J2. Since - 1 and 1]2 = e, 1] is a fundamental unit. Also (1
+ J2)h = O
n
(sin ns)_ (~) = sin 3n
8
s= 1
so that ho
8
I
1]
= 1 + J2 has norm
sin ~ = (l
8
+ J2),
= 1.
16.14 Genus Let R(.ji5) be a fixed quadratic field with discriminant LI, and we shall assume in this section that the ideal classes are derived from the equivalence relation on ideals being equivalent in the narrower sense. Definition 14.1. If a quadratic form F(x, y) belongs to an ideal m: then we call the character system for F(x, y) (see Definition 12.6.1) the character system for m:. That is, if Pi>' .. ,Ps are the odd prime divisors of LI, we take an integer IX in m: so that (N(IX)jN(m:), 2L1) = 1 and we call
(N(IX)~~(m:))
(i= 1, ... ,s)
and 1[N(a)
]
"(IX)
= (-
1)2 N(~)-1
e(lX)
= (-
1)8 N(~) -1,
"(IX) e(IX),
1[(N(a»),
if
,
]
LI D=-=.3 4
(mod 4);
LI 4
if
-=.2 (mod 8);
if
-=.6
LI 4
(mod 8)
the character system for m:. Since ideals belonging to the same class have the same character system we may speak of the character system for an ideal class. Definition 14.2. Two ideal classes with the same character system are said to belong to the same genus. There is now a one-to-one correspondence between ideal classes in the quadratic field R(.ji5) and classes of primitive forms having discriminant LI.
446
16. Introduction to Algebraic Number Theory
Theorem 14.1. The values of the character systemfor ~m correspond to the products of the values of the character systems for ~, m.
Proof If a, 13 belong to
~,
m respectively, then af3 belongs to N(a)
N(f3)
--
--
N(~)
N(m)
~m.
Also
and
and if ( N(a)
N(~)'
2.1) = 1,
The theorem is proved.
N(f3) ( N(m) ,
2.1) = 1,
then
( N(af3)
N(~m)
,
2.1)
=
1.
D
From this theorem we deduce at once: 1) The character system for the product of two classes is the product of the two character systems. 2) If {~} and {m} belong to a genus, and {~d{md belong to a genus, then {~~d and {mmd also belong to a genus. Definition 14.3. We call the class to which the unit ideal .0 belongs the principal class, and the genus to which the principal class belongs the principal genus. Also, if ~m = [a] where a is a natural number, then we call {m} the inverse of the class {~}.
From Theorem 7.1 we see that the inverse of any ideal class always exists. Also = {~}. Since the values of the character system for the principal class, as well as for all the classes in the principal genus, are all 1, it follows that the product of any two classes in the principal genus, and the inverse of any class in the principal genus, are classes in the principal genus. (The family of all ideal classes forms a group with respect to class multiplication, and the sub-family of ideal classes in the principal genus forms.a sub-group.) {.o}{~}
Theorem 14.2. Every genus has the same number of classes.
Proof We let ~ be the principal genus, and we let ~{~} denote the family of classes obtained from the product of classes in ~ with {~}. We put all the ideal classes into various families (1)
where {~i} is any class not belonging to~, ~{~2}' ... '~{~i- d. It is easy to see that there is no ideal class which belongs to two of the families in (1).
447
16.15 Euclidean Fields and Simple Fields
From Theorem 14.1 we know that in each family in (1) all the classes belong to the same genus, and distinct families belong to different genera, so that each family in (1) forms a genus. Since any two classes in 3{~i} are distinct the theorem is proved. 0
16.15 Euclidean Fields and Simple Fields Definition 15.1. If ho = 1, then we call the field a simple field. It is clear that, in a simple field, every ideal is a principal ideal. Therefore we have:
Theorem 15.1. The unique factorization theorem holds for integers in a simple field. 0
There is a type of simple fields, called Euclidean fields, having properties which are very similar to those of the rational field. Definition 15.2. If, corresponding to any two integers ~, 1] (1] -# 0) in R(jD), there exist two integers K, A. such that IN(A.) I < IN(1])I,
(1)
then we call R(jD) an Euclideanfield. An alternative definition is: Defmition 15.3. If, corresponding to any b in R(jD), there exists an integer K such that IN(b - K)I < 1,
(2)
then we call R(jD) an Euclidean field. Theorem 15.2. Every Euclidean field is a simple field.
Proof Let R(jD) be Euclidean. In order to prove that R(jD) is simple it suffices to show that every ideal is a principal ideal. Let ~ be any ideal in R( jD) and let 1X1' 1X2 be a basis for ~, and we may assume without loss that 0 < IN(1X1)1 ~ IN(1X2)1. Since R(jD) is Euclidean there are integers IX~ and /32 such that 1X2 = 1X~1X1 + /32, IN(/32)1 < IN(1X1)1· If /32 -# 0, then there are lX'l and /31 such that 1X1 = 1X'1/32 + /31> IN(/31)1 < IN(/32)1. Continuing with the argument, which must terminate after a finite number of steps because IN(1X1)1 is a natural number, we arrive at an integer IX such that ~ = [1X1> 1X2] = [IX]. The theorem is proved. 0
448
16. Introduction to Algebraic Number Theory
Theorem 15.3. There are only five quadratic imaginary Euclidean fields, namely
R(~), R(j=2), R(j=3), R(~) and R(F-U). Proof 1) Let D == 2, 3 (mod 4). Put" = r + sJD, K = x + yJD. Then the condition (2) becomes: corresponding to any pair of rational numbers r, s there are rational integers x, y such that (3) Settingr = s = tthecondition (3) gives± + IDI± < 1, or IDI < 3. Therefore R(JD) cannot be Euclidean if D ~ - 3. On the other hand, if r, s are given rational numbers we can always find rational integers x, y such that Ir - xl ~ t, Is - yl ~ t so that corresponding to D = - 1, - 2, the inequalities I(r - X)2 - D(s - y)21 ~ ± + IDI± < I hold so that R(~) and R(j=2) are Euclidean. 2) Let D == 1 (mod4). Put" = r + sJD, K = x + y(1 + JD)/2 so that
Setting r = s = ± we have /6 + /61DI < 1 or IDI < 15. Therefore there can only be the three Euclidean fields R(j=3), R(~) and R(F-U), and these fields are indeed Euclidean because, given rational numbers r, s we may choose rational integers x, y such that 12s - yl ~ t, Ir - x - (y/2) I ~ t, and therefore when D = - 3, - 7, - 11,
I(r -
x -
~)2 2
_
D (s _
~)21 ~ ~ + IDI ~ ~ < 2
""" 4
16""" 16
l.
D
In §13 we calculated the class number for R(F-l9) to be l. We see therefore that there are simple fields which are not Euclidean. From Theorem 12.15.4 we know that there are only finitely many imaginary fields which are simple. The question then is exactly how many? It is not difficult to prove that R(JD) is simple when D = - 1, - 2, - 3, - 7, - 11, - 19, - 43, - 67, - 163.
It has also been proved thatthere is at most one more value of D, and that ifit exists, then D < - 5 . 109 • (In fact no extra D exists; see Notes.) Concerning real Euclidean fields we have:
Theorem 15.4. The field R(JD) is a real Euclidean field only when D = 2,3,5,6,7,11,13,17,19,21,29,33,37,41,57,73.
D
Various Chinese mathematicians, including the author, made contributions to this problem, which in principle was eventually settled by Davenport. The proof of the theorem is beyond the scope of this book.
449
16.16 Lucas's Criterion for the Determination of Mersenne Primes
16.16 Lucas's Criterion for the Determination of Mersenne Primes We first sharpen Theorem 9.S for the quadratic field R(JiJ), D > O. From Theorem 10.3 we know that all the prime ideals can be separated into three classes according to whether (~) = 0, + I or - l. We shall write q for a prime number satisfying (~) = I so that q = .0,0; we write r for a prime number satisfying (1) = - 1 so that r itself is a prime ideal in R(JD). From Theorem 9.S we have, if Q'(O(, then
=1
(mod .0),
(1)
O(r'-l=1
(modr).
(2)
O(q-1
and if r,(O( then
Theorem 16.1. Suppose that q, r are not 2. O(q-1
=1
If q,(O(,
then
(modq),
(3)
and if r,(0(, then O(r+ 1
=
(mod r).
N(O()
(4)
Observe that (1) and (3) are equivalent, and that (2) follows (4).
Proof Let 0( = a + b(.1 + fl)/2 where a, b are rational integers. Let p be an odd prime so that, from Fermat's theorem, O(P
=a
P
+ bP
.1P
+ (fl)p 2P
b p-1 =a + -(.1 + .1 2 fl) 2
=a + ~(.1 + (;)fl) Therefore if p = q, then O(q = (modr) which gives (4). 0
0(
(modp).
(modq) which gives (3), and if p = r, then O(r = IX
Now let p be an odd prime and we shall examine the nature of the Mersenne number M = Mp = 2P - l. If there exists .1 > 0 such that
(~) = and there exists a unit
1
e in R(fl) satisfying N(e) =
where e' is the conjugate of e.
-
1, then we let
450
16. Introduction to Algebraic Number Theory
Theorem 16.2. A necessary and sufficient condition for M to be prime is that rp - l
== 0 (modM).
(6)
Proof 1) Assume that M is a prime. From (5) we know that M is of the type r, and so from Theorem 16.1 we have eM + 1 == - 1 (mod M) and therefore
2) Assume that M is composite, say M = ql ... q s r 1 ... r t • From (5) we know that at least one of the prime divisors of M is of type r. If (6) holds, then Mlr p - l or
and hence e2P
== - 1 (mod M),
(7)
== 1 (modM).
(8)
and on squaring e 2P + 1
Let 'P be a prime ideal divisor of M and let I be the least positive integer satisfying el == 1 (mod 'P). Then, by (8), 1I2P + 1, and so by (7), 1= 2P + 1. If 'P is a divisor of a certain q, then eq - 1 == 1 (mod 'P) by Theorem 16.1, and hence 2P + 11 q - 1, which is impossible because q cannot exceed M. If 'P is a certain r, then er + 1 == - 1 (mod r) by Theorem 16.1. This then gives 2P + 112(r + 1) andsor = 2Pm - 1. Butr ~ Mso thatm = 1, r = M. That is Mmust be prime after all. 0 Example. Take L1
= 5, e = (1 + .j5)/2 so that
Ifwe take p = 7, Mp = 127, then the residues mod 127 for r m (m = 1,2,3,4,5,6) are 3,7,47,48,16, O. Therefore 127 is a prime. Of course the full power of the theorem is not revealed in this specific example. However, with the aid of electronic computers, the same method can be used to show, for example, that the 687 digit number M 2281 = 2 2281 - 1 is prime. Indeed all the large known Mersenne primes are found by essentially the same type of method.
16.17 Indeterminate Equations The invention of the theory of ideals to tackle Fermat's problem is an important development in algebraic number theory. From the standpoint of mathematics this theory is far more important than that of settling a difficult problem. Let p be an
451
16.17 Indeterminate Equations
odd prime and p
= e21ti/ p •
If we can prove that
has no integer solutions in the field R(p), then obviously Fermat's Last Theorem is established. The expression ep + 1]P can be factorized into linear terms in R(p) so that the problem is easier to start with. Indeed this is Kummer's starting point in his research on Fermat's problem, but the principal difficulty lies with the absence of a unique factorization theorem. It is for this reason that Kummer invented his theory of ideals which has now become an indispensable part of mathematics. It is not easy to understand Kummer's method. That is, even if we assume that there is unique factorization in R(p), we still need a deep theorem of Kummer's before we can settle Fermat's pr.oblem. The theorem concerned is as follows: A necessary and sufficient condition for a unit B in R(p) to be a p-th power of another unit is that B is congruent to a rational number mod (1 - PY. We can only consider two simple examples in this book. Theorem 17.1. The equation (1)
has no solution in integers in R(J=1). Proof The unique factorization theorem holds in the field R(J=1), that is every ideal is a principal ideal. We may therefore assume without loss that (e,1]) = 1. 1) Let A = 1 - i. Then A is irreducible, and A2 = - 2i and 2 = i(l - i)2 are associates. Also N(2) = 4 so that every integer in R(J=1) must be congruent to one of the four numbers 0, 1, i, 1 - i (mod 2). Since 0, 1 - i are divisible by A, any integer DC not divisible by Amust satisfy DC == 1 or i (modA2) so that DC = 1 + PA 2 or DC = i + PA2, and hence (2)
Now let e, 1],. satisfy (I). Suppose, if possible, that e, 1] are not divisible by A. From (2) and (1) we have 2 == .2 (modA 6 ). Since 2 = A2 i we see that AI •. Write • = AY so that A,j'y, and iA 2 == A2/ (mod A6) or / == i (mod A4 ). On squaring this we deduce from (2) that 1 == y4 == - 1 (mod A4), which is impossible. Therefore one of e, 1] is divisible by A. By symmetry we may assume that Ale, and we now write e = Anb, n ;;:: 1, A,j'b, so that we have
2) We now prove a more general result, namely that there are no integers b, ., 1]
in R(
J=1) such that
B
unit,
A,j'b1],
(b,1]) = I,
n;;:: 1.
(3)
452
16. Introduction to Algebraic Number Theory
The proof is divided into two steps. In the first step we show that if (3) is soluble then
n must be at least 2; in the second step we show that if (3) is soluble for a certain n, then it is soluble for n - 1 also. The theorem therefore follows from this contradiction. If(3)holdsforintegersD,r,I]thenAt!"r.SinceN(A) Let r = 1 + JJ.A so that on squaring we have'
= 2weseethatr == 1 (mod A).
Also, by (2), (4) so that, by (3),
Thus AIJJ. and we may write r
= 1 + VA 2 ,
r2 = 1 + 2VA2
+ V2A4 =
1 + A4V(i
+ v).
(5)
Since v, i + v form a complete residue system mod A we have v(i + v) == 0 (mod A) giving r2 == 1 (modA 5 ). From (3) and (4) we deduce that GA 4n D4 == r2 - 1]4 == 0 (mod A5), and we conclude that n ~ 2. Now assume that D, r, I] satisfy (3) with n ~ 2. Then GA 4n D4 = (r - I]2)(r + 1]2). From (5) we have r == 1 (mod A2 ), and on the other hand, since At!"I] we have (6)
it follows from (7) that A4 (n-l) must divide one of these two divisors. We may assume that A4 (n-1) actually divides the latter divisor, since otherwise we may replace I] by il]. From (7) we have r
+ .,,.2
_
14(n-1)
~-G211.
rp
4
(At!"rpO", (0", rp)
where G1, G2 are two units. Thus
21]2
il]2
= Y = G2 A4(n-l)rp4 - G10"\
= 1),
453
16.17 Indeterminate Equations
or
where 83 = - 8di, 84 = 82/i are also units. Since n ~ 2, A-/,a we see from (2) that 1]2 == 83 (mod A4) and hence, by (6), 1 == (mod A2). Therefore 83 is either + 1 or - 1 and not ± i, that is
A-/,q>a,
83
(q>, a) = 1.
Ifwe take the negative sign here then our second step follows at once, and if we take the positive sign then the same result is obtained by replacing 1] by i1]. 0 Theorem 17.2. The equation (8)
has no solution in integers in R(p), p = (- 1 + ~)/2. Proof Since R(p) is a simple field we may assume that (~, 1]) = 1. ·1) Let A = 1 - p, so that 1 - p2 = - p2(l - p) = - p2A and N(A) = _ p2A2 = 3. Therefore Ais irreducible and all the integers are partitioned into three classes represented by 0, 1, - 1. Therefore, if A-/,~, then ~ == ± 1 (mod A). We shall now show that (9)
Let
We need only consider the ~ = 1 + f3A so that ~3
_
+ sign case, since otherwise we may replace ~ by -
~.
= f3A(f3A + 1 - p)(f3A + 1 _ p2) = f3A(f3A + A)(f3A - p2 A) = A3 f3(f3 + 1)(f3 _ p2).
1 = (~
-
1)(~
- p)(~ - p2)
Since f3, f3 + 1, f3 - p2 are incongruent mod A, and N(A) = 3 there must be one of them which is divisible by A. We deduce that if A-/'1], then (10)
Now if A-/, ~1]', then 0 == ~3 + 1]3 + ,3 == ± 1 ± 1 ± 1 (mod A3). The possible choices are ± 1, ± 3 and none of them is divisible by A3, so that one of ~,1]" must be divisible by .Ie. Let it be , = Any, n ~ 1, A-/,y so that (~,1])=
1,
A-/,y,
n~
1.
2) We shall now prove a more general result, namely that (~, 1]) =
1,
A-/,y, n
~
1,
(11)
454
16. Introduction to Algebraic Number Theory
where 8 is a unit, has no integer solutions in R(p). As in the proof of Theorem 17.1 we separate into two steps where, in the first step, we show thatif(ll) has a solution, then n ~ 2, and in the second step we show that if (11) has a solution, then n may be replaced by n - 1 and there is still a solution. The theorem then follows by this contradiction. If (11) has a solution, then by (10)
Since + 1 + 1 and - 1 - 1 are not divisible by A we see that - 8A 3n y 3 == 0 (mod A4 ), and so n ~ 2. Suppose that e, 1], yare solutions to (11). From 1 == p == p2 (mod A) we deduce that e + 1] == e + P1] == e + p21] (mod A) and hence - 8A 3ny 3 = e 3 + 1]3 = (e + 1])(e + p1])(e + p21]) where the three divisors are all multiples of A. It is not difficult to show that (e + 1])/A, (e + P1])/A, (e + p21])/A are pairwise coprime. In fact, for example, from (e + 1]) - (e + P1]) = A1] and p(e + 1]) - (e + P1]) = - Aeweseethat(e + 1])/Aand(e + P1])/A are coprime. Thus one of the three divisors. in the factorization
_ 8A 3 (n-l)y3
=
e
+ 1] e + P1] e + p21] A
A
A
must be a multiple of A3 (n-l), and we may assume that it is (e we can replace 1] by P1] or p21]. Hence
+ 1])/A since otherwise (12)
where 81> 82, 83 are units and jl, v, u are pairwise coprime integers not divisible by A. From (12) we have
giving
(V,u)
=
1,
A,(jl
(13)
where 84, 85 are also units. From (13) we have v3 + 84U3 == 0 (mod A2) and here, by (10), ± 1 ± 84 == 0 (modA 2). Among the units ± 1, ± p, ± p2 only 84 = ± 1 can satisfy this congruence. Hence 84 = ± 1 and we see that (13) is the same as (11) with n replaced by n - 1. The theorem is proved. 0
16.18 Tables We conclude this chapter with two tables displaying all the quadratic fields R(.ji5) with - 100 < D ~ 100. We list their integral basis, discriminants, ideal classes and the quadratic forms associated with the ideal classes together with their
16.18 Tables
455
character systems. We also display the continued fraction representations for OJ and the fundamental units in the second table. More precisely: In Table I, the first column is the value for D. The second column is OJ (see the definition in Theorem 4.5). The third column is the discriminant d. The fourth column displays the ideal classes of R(~ji). The fifth column indicates the relationship between the ideal classes. The sixth column displays the quadratic forms representing the classes of forms corresponding to the ideal classes. The seventh column is the character systems associated with these classes of forms. In Table II, the first two columns are as before. The third column displays the continued fractions expansion representing OJ when D is square-free and representing JD when D is not square-free. The fourth column is the discriminant d. The fifth column displays x + yJD when D is square-free and it is the fundamental unit 1] of R(JD); when D is not square-free it displays the least positive integer solutions to x 2 - y2 D = ± I (if x 2 - y2 D = - I is soluble, then x + yJD satisfies x 2 - y2 D = - I, otherwise x, y satisfy x 2 - y2 D = + I). The sixth column is N(x + yJD). The last four columns are the same as the last four columns in Table I.
456
16. Introduction to Algebraic Number Theory
Table I D
00
LI
Ideal classes
-I
)=l
_22
-2
)-2
-3 -5
-6
Quadratic forms
Character systems
(I)
X2 +y2
+1
_23
(I)
2X2+y2
+1
-3
(I)
X2+xy+y2
+1
)-5
-2 2 ·S
(I)
A2
SX2+y2
+1, +1
(2, I +) -S)
A
3X2 +2xy+2y2
-I, -I
)-6
-2 3 .3
(I)
A2
6X2+y2
+ I, +1
A
3x2 +2y2
-I, -I
2X2+xy+y2
+1
1+)-3 2
(2,) -6) -7 -10
-\I
-13
-14
1+)-7 2 )-10
1+)-11
-7
(I)
-5.2 3
(I)
A2
IOx2 + y2
+1, +1
(2,) -10)
A
5x 2 +2y2
-I, -I.
3x2 +xy+y2
+1
A2
13x2 +y2
+1, +1
(2, I +) -13)
A
7X2+2xy+2y2
-I, -I
(I)
[4
14x2 + y2
+1, +1
(3,2+) -14)
[3
6x 2 + 4xy + 3y2
-I, -I
(2,) -14)
[2
7x 2+2y2
+ I, + I
Sx2+2xy+3y2
-I, -I
A2
4X2+xy+y2
+1, +1
(2, I +(0)
A
(I)
3X2+3xy+2y2 17x2+y2
-I, -I
[4
+ I, +1
(3,2+) -17)
[3
7X2+4xy+3y2
-I, -I
(2,1 +) -17)
[2
9x2 + 2xy + 2y2
+ I, + I
(3, I +) -17)
6x 2 +2xy+3y 2
-I, -I
-19
(I)
Sx 2 +xy+y2
+1
-3.2 2 .7
(I)
A2A~
+2Ix2+y2
+1,+1,+1
AAI
6x 2 +6xy+Sy2
-1,-1,+1
AI
7x 2 +3y2
+1,-1,-1
(2, 1+) -21)
A
IIx2+2xy+2y2
-1,+1,-1
(I)
A2
22x2+y2
+1, +1
(2,) -22)
A
IIx 2 +2y2
-I, -I
-II
(I)
)-13
-2 2 .13
(I)
)-14
_7.2 3
2
Relations
(3,1+)-14) -IS
-17
-19 -21
1+)-15 2 )-17
1+)-19 2 )-21
-3·S
-2 2 .17
(I)
(5,3+) -21) (3,)-21)
-22
)-22
-23 .11
457
16.18 Tables Table I (continued) D
-23
w 1 +) -23 2
-29
)-29
Quadratic forms
Character systems
(1)
[3
6x 2+xy+y2
+1
[2
4x 2 + 3xy + 2y2
+1
3x2+2xy+2y2
+1
2,1+ (2,
)-26
Relations
-23 (
-26
Ideal classes
-2 3 .13
-2 2 .29
1+)-23) 2
1+~-23) (1)
[6
26x2+y2
+1, +1
(5,3+)-26)
[5
7x 2+6xy+5y 2
-1, -1
(3,1 +) -26)
[4
9X2+2xy+3y2
+1, +1
(2,) -26)
[3
13x2+2y2
-1, -1
(3,2+)-26)
[2
10x2+4xy+3y 2
+1, +1
(5,2+)-26) (1)
[6
6x 2+4xy+5y 2 29x 2+y2
-1, -1 +1, +1
(3,2+) -29)
[5
l1x2 +4xy+3y 2
-1, -1
(5,4+) -29)
[4
9x 2+8xy+5y 2
+ 1, + 1
(2,1 +) -29)
[3
15x2+2xy+2y 2
-1, -1
(5,1 +) -29)
[2
6x 2+2xy+5y2
+ 1, +1
(1)
A2A:
10x2+2xy+3y 2 30X2+y2
+1,+1,+1
(2,) -30)
AAI
15x2+2y2
-1, -1, +1
(3,) -30)
Al
10x2+3y2
+1,-1,-1
(5,) -30)
A
-1, +1,-1
(1)
[3
6x 2+5y2 8x 2+xy+y2
(2,w)
[2
4X2+xy+2y2 5x 2+3xy+2y 2
+1
(1)
A2A:
33x2+y2
+1,+1,+1
(2,1 +) -33)
AAI
17x2+2xy+2y2
-1,-1,+1
Al
11x2+3y2
-1,+1,-1
(6,3+)-33)
A [4
7x 2+6xy+6y 2 34x2+y2
+1,-1,-1
(I)
(5,4+) -34)
[3
IOx 2+8xy+5y 2
-I, -I
(2,)-34)
[2
17x2+2y2
+1, +1
(3,1 +) -29) -30
-31
)-30
W+)-31)
-2 3 .3.5
-31
(2,1 +w) -33
)-33
_22 ·3·11
(3,) -33) -34
)-34
-2 3 .17
(5,1 +) -34) -35
W+)-35)
-5·7
-1, -1
+1 +1
+1, +1
7x 2+2xy+5y 2
-I, -I
(I)
A2
9X2+xy+y2
+1, +1
(5, 5+~ -35)
A
3x 2+ 5xy + 5y2
-I, -I
458
16. Introduction to Algebraic Number Theory
Table I (continued) D
w
Ll
Ideal classes
Relations
Quadratic forms
-37
j-37
-2 2 '37
(1)
A2
37x 2+y2
+1, +1
(2,1 +j -37)
A
-1, -1
(1)
[6
19x2+2xy+2y 2 38x 2+y2
(3,2+j -38)
[5
14x2 + 4xy+ 3y2
-1, -1
(7,2+j-38)
[4
6x 2+4xy+7y2
+1, +1
(2,j -38)
[3
19x2+2y2
-1, -1
(7,5+j-38)
[2
9x 2+ 10xy + 7y2
+1, +1
[4
13x2+2xy+ 3y2 lOx 2+xy+y2
-1, -1
(1) (2,1 +w)
[3
(3,1 +w)
[2
-38
j-38
-2 3 .19
(3,1 +j -38) -39
to+j-39)
-3,13
j-41
-2 2 '41
j-42
-43
!O+j-43)
-46
j-46
-3.2 3 .7
-43 -2 3 .23
to+j-47)
-47
41x2 + y2
+1, +1
(3,2+j-41)
[1
15x2+4xy+3y 2
-I, -1
(5,3+j-41)
[6
lOx2 +6xy+5y 2
+1, +1
(7,6-+'j-41)
[5
Ilx2
-1, -1
(2,1 +j -41)
[4
21x2+2xy+2y2
+1, +1
(7,1 +j -41)
[3
6x 2+2xy+7y 2
-1, -1
(5,2+j -41)
[2
9x 2+4xy+5y 2
+1, +1
(1 )
A2Af
14x2+2xy+3y 2 42x2+y2
+1,+1,+1
(7,j -42)
AAI
6x 2+7y2
+1, -1,-1
(3,j -42)
Al
14x2+3y2
-1,-1,+1
(2,j-42)
A
21x2+2y2
-1, +1,-1
(1)
I
(1)
[4
Ilx2+xy+y2 46x 2+y2
+1, +1
(5,3+j-46)
[3
Ilx 2 + 6xy + 5y2
-1, -1
(2,) -46)
[2
23x2+2y2
+ I, +1
lOx 2+4xy+5y 2
-1, -1
!O+j-51)
-3·17
+ 12xy+7y 2
-1, -1
+1
(1)
[5
12x2+xy+y2
+1
(2,w)
[4
+1
(3,2+w)
[3
6X2+xy+2y2 6x 2 + 5xy+3y2
(3,w)
[2
4X2+xy+3y2 7x 2 + 3xy + 2y2
+1
(1)
A2
(3,1 +w)
A
13x2+xy+y2 5x 2+3xy+3y2
+ I, +1 -1, -1
(2,1 +w) -51
+1, +1
[8
( 1)
(5,2+j-46) -47
+1, +1 -1, -1 -1, -1
(3,1 +j -41) -42
+1, +1
5X2+xy+2y2
(2,w) -41
6x 2+3xy+2y 2 4x 2 + 3xy+3y 2
Character systems
+1 +1
459
16.18 Tables Table I (continued) D
w
Ll
Ideal classes
Rela,tions
Quadratic forms
-53
)-53
-2 2 .53
(I)
[6
53x 2+y2
+ I, + I
(3,2+)-53)
[5
19x2+4xy+3y 2
-I, -I
(9,8+)-53)
[4
13x2 + 16xy+9y 2
+1, +1
(2, 1+) -53)
[3
27x2+2xy+2y2
-I, -I
(9, 1+) -53)
[2
6x 2+ 2xy + 9y2
+1, +1
18x2+2xy+3y 2
-I, -I +1, +1 -I, -I
(3, 1+) -53) -55
W+)-55)
-5·11
(I)
[4
14x2+xy+y2
(2, I +w)
[3
(5,2+w)
[2
8x 2+3xy+2y 2 4x 2 + 5xy+ 5y2
)-57
+1,+1,+1
(I)
A2Ai
(2, 1+) -57)
AA.
29x2+2xy+2y2
-1,-1,+1
(3,) -57)
A.
19x2+3y2
+1,-1,-1
A A2
IIx 2 + 6xy+6y 2
-I, +1,-1
58x2 +y2
+ I, + I
(2,) -58)
A
29x 2+2y2
-I, -I
(I)
[3
15x2+xy+y2
+1
(3, 5+~ -59)
[2
7x 2 +5xy+3y 2
+1
5X2+xy+3y2
+1
-3.2 2 .19
(6,3+) -57) -58 -59
)-58 W+)-59)
-2 3 .29
(I)
-59
(3, -61
)-61
-2 2 .61
1+~-59)
)-62
-2 3 .31
-I, -I
(I)
[3
61x2 + y2
+ I, +1
(5,3+)-61)
[2
14x2 + 6xy+ 5y2
+ I, + I
(5,2+) -61)
-62
+1, +1
7X2+xy+2y2 57x 2 +y2
(2,w)
-57
Character systems
13x2 +4xy+5y 2
+1, +1
(7,4+) -61)
A[2
IIx 2 + 8xy+ 7y2
-I, -I
(7,3+) -61)
A[
IOx 2 +6xy+ 7y2
-I, -I
(2, 1+) -61)
A
31x 2 + 2xy + 2y2
-I, -I
(I)
[8
62x2+y2
+ I, +1
(3,2+)-62)
[7
22x2+4xy+3y2
-I, -I
(7, 1+) -62)
[6
9x 2+2xy+7y 2
+ I, + I
(11,2+) -62)
[5
6x 2 +4xy+ lIy2
-I, -I
(2,) -62)
[4
31x 2 +2y2
+1, + I
(11,9+) -62)
[3
13x2+ 18xy+ lIy2
-I, -I
(7,6+) -62)
[2
14x2 + 12xy+ 7y2
+1, +1
2Ix2+2xy+3y2
-I, -I
(3, I +) -62)
460
16. Introduction to Algebraic Number Theory
Table I (continued) D
w
Ll
Ideal classes
Relations
Quadratic forms
Character systems
-6S
j-6S
-2 2 'S'13
(I)
14
6Sx 2+y2
+1,+1,+1
(3,2+j-6S)
13
23x 2+4xy+3y 2
-1,+1,-1
(9,4+j -6S)
12
9x 2 + 8xy + 9y2
+1,+1,+1
(3, I +j -6S)
22x2+2xy+3y2
-1,+1,-1
(II, 10+j -6S)
AI3
ISx 2+ 20xy + IIy2
+1,-1,-1
(2, I +j -6S)
AI2
33x2+2xy+2y2
-1,-1,+1
(II, I +j -6S)
Al
6x2+2xy+lly2
+1,-1,-1
(S, j -6S)
A 14
13x2+Sy2 66x 2+y2
-I, -I, +1
(I) (S,3+j-66)
[3
ISx 2+6xy+Sy2
-1,+1,-1
(3,j -66)
12
22x 2 + 3y2
+1,+1,+1
14x2+4xy+Sy2
-1,+1,-1
AI2
IOx 2 + 4xy + 7y2
+1, -1,-1
AI2
6x 2+ IIy2
-I, -I, +1
(7,S+j -66)
Al
13x2+ 10xy+ 7y2
+1, -1,-1
(2,j -66)
A
33x 2+2y2
-1,-1,+1
(I)
14
17x2+xy+y2 69x 2+y2
+1,+1,+1
(7,6+j -69)
13
ISx 2+ 12xy+ 7y2
+1,-1,-1
(6, 3+j -69)
12
13x2+6xy+6y 2
+1, +1, +1
(7, I +j -69)
IOx 2+2xy+7y 2
+1,-1,-1
(S, I +j -69)
A[3
14x2+2xy+Sy2
-1,-1,+1
(3,j -69)
A[2
23x2+3y2
-1,+1,-1
(S,4+j-69)
Al
17x2+8xy+Sy2
-1,-1,+1
(2, I +j -69)
A
(I)
A2A~
3Sx2+2xy+2y 2 70X 2+y2
+1,+1,+1
(7,j -70)
AAI
IOx 2 + 7y2
-I, -I, +1
(S,) -70)
AI
14x2+Sy2
+1, -1,-1
(2,j-70)
3Sx 2+2y2
-1,+1,-1
(I)
A 17
71x2+y2
+1
(2, 3+~ -71)
16
IOx 2+3xy+2y 2
+1
(s, 7+~
IS
6x 2+7xy+Sy2
+1
-66
j-66
-2 3 '3'11
(S,2+j-66) (7,2+j-66) (11,j -66)
-67 -69
-70
-71
1
j-70
1
-67 -2 2
'3.23
-2 3 'S'7
-71
(I)
-71)
+1,+1,+1
+1
-1,+1,-1
461
16.18 Tables Table I (continued) D
Ideal classes
Relations
Quadratic forms
Character systems
(3, 5+~ -71)
[4
8x 2 + 5xy+3y 2
+1
1+~-71)
[3
6X2+xy+3y2
+1
[2
4x 2+3xy+5y 2
+1
9X2+xy+2y2
+1
w
-71
(3,
(5, 3+~ -71) (2, I +~ -71) -73
)-73
-2 2 .73
(I)
[4
73x 2 +y2
+1, +1
(7,5+) -73)
[3
14x2 +lOxy+7y 2
-I, -I
(2, I +) -73)
[2
37x 2 +2xy+2y 2
+1, +1
IIx 2+4xy+7y 2
-I, -I
(I)
[5
74x 2 +y2
+ I, +1
(11,6+)-74)
[4
}OX2+ 12xy+ IIy2
+1, +1
(3,1+)-74)
[3
25x 2+2xy+3y 2
+1, +1
(3,2+) -74)
[2
26x 2 +4xy+3y 2
+1, +1
(7,2+)-73) -74
)-74
-2 .37 3
9x2 + IOxy + lIy2
+ I, + I
(5,4+)-74)
A[4
18x2 +8xy+5y 2
-I, -I
(6,4+) -74)
A[3
15x2+8xy+6y 2
-I, -I
(6,2+) -74)
A[2
13x2 +4xy+6y 2
-I, -I
A[
15x2 +2xy+5y 2
-I, -I
(2,)-74)
A
(I)
[4
37x2 +2y2 77x 2 +y2
+1,+1,+1
(3,2+) -77)
[3
27x2 +4xy+3y 2
-1,+1,-1
(14,7+)-77)
12
9x2 + 14xy+ 14y2
+1,+1,+1
(11,5+)-74)
(5,1+)-74) -77
)-77
- 22 .7 ·11
26x2 +2xy+3y 2
-1,+1,-1
(6,5+)-77)
AI 3
17x2+ IOxy+6y 2
-1,-1,+1
(7,) -77)
A[2
IIx 2 + 7y2
+1,-1,-1
(6, 1+) -77)
Al
13x2+2xy+6y 2
-1,-1,+1
(2, 1+) -77)
A A2A~
39x 2 + 2xy + 2y2 78x 2 +y2
+1,-1,-1
(I) (2,) -78)
AAI
39x 2 +2y2
-1,-1,+1
(13,)-78)
AI
6x 2 +13y2
+1,-1,-1
(3,) -78)
A
26x 2+3y2
-1,+1,-1
(3, 1+) -77)
-78
)-78
-2 3 .3.13
-I, -I
+1,+1,+1
462
16. Introduction to Algebraic Number Theory
Table I (continued) D
w
Ll
Ideal classes
Relations
Quadratic forms
Character systems
-79
t(l +) -79)
-79
(I)
[s
20x 2+xy+y2
+1
1+~-79)
[4
IOx 2+xy+2y2
+1
(5,9+~-79)
[3
8x 2 +9xy+5y 2
+1
1+~-79)
[2
4x 2+xy+5y 2
+1
llx 2+ 3xy + 2y2
+1
(2,
(5,
(2, 3+~ -79) -82
)-82
(I)
[4
82x 2+y2
+1, +1
(7,4+)-82)
[2
14x2 +8xy+ 7y2
-I, -I
(2,) -82)
[2
41x2+2y2
+1, +1
13x2 +6xy+ 7y2
-I, -I
-2 3 .41
(7,3+) -82) -83
W+)-83)
(I)
[3
2Ix2+xy+y2
+1
(3,5+~-83)
[2
9x 2+5xy+3y 2
+1
7X2+xy+3y2
+1
A2A~
85x 2 +y2
+1, +1, +1 -1,-1,+1
-83
(3, -85
)-85
-2 2 .5.17
1+~-83) (I)
AAI
17x2 +5y2
(10, 5+) ~85)
AI
llx 2+ IOxy+ lOy2
+1,-1,-1
(2, I +) -85)
A
-1,+1,-1
(I)
[10
43x 2 + 2xy + 2y2 86x2 +y2
(3,2+) -86)
[9
30x 2 +4xy+3y 2
-I, -I
(9,2+)-86)
[8
IOx 2+4xy+9y 2
+1, +1
(5,2+) -86)
[1
18x2 +4xy+ 5y2
-I, -I
(17, 13+) -86)
[6
15x 2+ 26xy+ 17y2
+1, +1
(5,)-8S(
-86
)-86
-2 3 .43
(2,) -86)
IS
43x 2+2
-I, -I
(17,4+) -86)
[4
6x 2 +8xy+17y 2
+1, +1
(5,3+) -86)
[3
19x2 +6xy+5y 2
-I, -I
(9,7+) -86)
[2
15x2 + 14xy+9y2
+ I, + I
29x 2+2xy+ 3y2
-I, -I
[6
22x2+xy+y2
+ I, + I
15
12x2+3xy+2y 2
-I, -I
(3, 1+) -86) -87
W+)-87)
+1, +1
-3·29
(I) (2,
3+~-87)
463
16.18 Tables Table I (continued) D
(()
-87
Ideal classes
Relations
Quadratic forms
Character systems
(7, 5+~ -87)
[4
4x 2+5xy+7y2
+ I, + 1
(3, 3+~ -87)
[3
8x 2+ 3xy + 3y2
-I, -I
(7,9+~-87)
[2
6x 2+9xy+7y 2
+ I, + I
11x2+xy+2y2
-I, -I
(2, I +~ -87) -89
j-89
-2 2 .89
(I)
[12
89x 2+y2
+1, +1
(3,2+j -89)
[II
3Ix 2+4xy+3y 2
-I, -I
(17,9+j-89)
[10
IOx 2+ 18xy+ 17y2
+1, +1
(7, 3+j -89)
[9
14x2 +6xy+ 7y2
-I, -I
(5,4+j -89)
[8
21x 2 + 8xy+ 5y2
+1, +1
(6, I +j -89)
[1
15x2+2xy+6y 2
-I, -I
(2, I +j -89)
[6
45x 2+2xy+2y 2
+1, +1
(6,5+j-89)
[5
19x2+lOxy+6y 2
-I, -I
(5, I +j -89)
[4
18x2+2xy+5y 2
+ I, + I
(7,4+j -89)
[3
15x2 +8xy+ 7y2
-I, -I
(17,8+j-89)
[2
9x 2+ 16xy+ 17y2
+ I, + I
30x 2+2xy+3y 2
-I, -I
(3, I +j -89) -91
-93
-94
1+j -91 2
j-93
j-94
-7·13
-2 2 .3.31
-2 2 .47
(I)
A2
23x 2+xy+y2
+1, +1
(7,7+~-91)
A
5x 2 + 7xy+ 7y2
-I, -I
(I)
A2Af
93x 2+y2
+1, +1, +1
(6,3+j-93)
AAI
17x 2+6xy+6y 2
-I, -I, +1
(3,j -93)
Al
31x 2+3y2
+1,-1,-1
(2, I +j -93)
A [8
47x 2+2xy+2y 2 94x2+y2
-1,+1,-1
(I) (5,4+j -94)
[1
22x2 + 8xy + 5y2
-I, -I
(7, 5+j -94)
[6
17x2 + IOxy+ 7y2
+1, +1
(l1,4+j -94)
[5
IOx 2+8xy+ 11y2
-I, -I
(2,j -94)
[4
47x2+2y2
+ I, + 1
(11,7+j-94)
[3
13x2 + 14xy+ 11y2
-I, -I
+1, + I
464
16. Introduction to Algebraic Number Theory
Table I (continued) D
-94 -95
Ideal classes
Relations
Quadratic forms
Character systems
(7,2+)-94)
[2
14x2+4xy+7y 2
+ I, + I
19x2+2xy+5y 2
-I, -I
w
to +)-95)
-5·19
(5, 1+) -94) (I)
[8
24x2+xy+y2
+1, +1
(2, I +~ -95)
[1
12x2 +xy+2y2
-I, -I
1+~-95)
[6
6x 2+xy+4y 2
+1, +1
(3, 5+~-95)
[5
IOx 2 +5xy+3y 2
-I, -I
(5, 5+~ -95)
[4
6x 2+5xy+5y 2
+1, +1
(3, 1 +~ -95)
[3
8X2+xy+3y2
-I, -I
(4, 7+~ -95)
[2
9x2+7xy+4y 2
+1, +1
13x2+3xy+2y2
-I, -I
(4,
(2, 3+~-95) -97
)-97
-2 2 '97
(I)
[4
97x 2+y2
+1, +1
(7,6+)-97)
[3
19x2 + 12xy+ 7y2
-I, -I
(2, 1+) -97)
[2
49x2+2xy+2y2
+1, +1
14x2+2xy+ 7y2
-I, -I
(7, 1+) -97)
Table II
?' 00
D
w
Continued fractions
2
J2
[1,2]
3
)3
[1,1,2]
J
x+Y)D
N(x+y)D)
Ideal classes
23 3.2 2
I+J2
-I
(I)
2+)3
+1
(I)
Relations
Quadratic forms
Character systems
_2X2+y2 _3x2+y2
+1
_x 2+3y2 w
-I
(I)
5+2)6
+1
(I)
5
W+)S)
[I, I]
6
)6
[2,2,4]
5 3.2 3
7
Ji
[2,1,1,1,4]
22.7
8+3)7
+1
(I)
3+JS
(1)
[2,1,4]
8 10
)10
[3,6]
5.23
3+)10
+1 -I
11
)11
[3,~,6]
22.11
10+3Ji1
+1
12
[3,2,6]
13 W+Ji3) 14 )14 15
JI5
7+2)12
_x 2+6y2 _7x 2+y2 _x 2+ 7y2
+1, +1 -I, -I
A2
-IOx 2+y2
+1, +1
(2,)10)
A
_5x2+2y2
-I, -I
(I)
I
_l1x2+y2 _x 2+ IIy2
+1, +1 -I, -I
-3X2+xy+y2 _14x2+y2 _x2+14y2
+1, +1 -I, -I
-15x 2+y2 _x2+15y2
-1,+1,-1
I+w
+1 -I
(I)
[3, 1,2, 1,6]
15+4ji4
+1
(I)
[3,1,6]
3.2 2.5
4+)15
+1
(I) (2, I +ji5)
17 W+JI7)
[2, I, I,~]
17
3+2w
-I
(I)
A2
A
"'"
+1, +1 -I, -I +1 +1, +1 -I, -I
13 7.2 3
[2,~]
_X2+xy+y2 _6x 2+y2
>-l
'"
a'
+1
+1,+1,+1
-7x 2+2xy+2y 2 -2x2 -2xy+ 7y2
-I, -I, +1
_4X2+xy+y2
+1
+1,-1,-1 ~
0-
VI
oj:>.
Table II (continued)
0\ 0\
0)
Continued fractions
18
[4,4,8]
19
)19
[4,2, 1,3, 1,2,8]
D
.d
x+y)D
N(x+y)D)
17+4)18
+1
22 ·19
170+39)19
+1
9+2)20
+1
2+0)
+1
Ideal classes
Relations
Quadratic forms
Character systems
(I)
_19x 2+y2 _x 2+19y2
+1, +1 -I, -I
(I)
_5X2+xy+y2 _x2 _xy+5y2
+1, +1 +1, +1 -I, -I
20
[4,2,8]
21 !O+)21)
[2,U]
3·7
197+42)22
+1
(I)
_22x2+y2 _x2 +22y2
24+5)27
+1
(I)
_23x 2+y2 _x2 + 23y2
+1, + I -I, -I
5+)24
+1 -I
(I)
-26x2+ y2
+1, +1
22
jii
[4,1,2,4,2,1,8]
23·11
23
)23
[4, 1,3, 1,8]
22.23
[4,1,8]
24 26
)26
23 ·13
[5,1"0]
5+)26
(2,)26) [5,5,1"0]
27
[5,~, 2, 3,
28 29 !O+)29) 30
)30
1"0]
[3,5] [5,2,1"0]
29 3.5.2 3
32
)31
[5, I, 1,3,5,3, I, I, 1"0] [5, I, I, I, 1"0]
22 .31
+1
127 +24)28 2+0)
+1 -I
(I)
11+2)30
+1
(I)
1520 + 273)31
+1
17+3)32
+1
A
_13x 2+2y2
-I, -I
?'
5'
::; 0
0-
<=:
26+5)27
()
g. =s
(2,)30) 31
A2
-I, -I
(I)
A2
A
_7X2+xy+y2 _30X 2+y2
+1,+1,+1
_x2 +30y2
-1,+1,-1
-15x2+2y2 _2X2+ 15y2
-1,-1,+1 +1,-1,-1
-31x2 +y2 _x 2+3Iy2
+1, +1 -I, -I
+1
0 ;I> dQ
...'"
a'
~.
Z 3
'"
a'
... ..., ::r '0..."
'<
Table II (continued)
-
?' 00
Continued fractions
..1
x+yJD
N(x+yJD)
Ideal classes
33 W+J33)
[3,2,1,2,S]
3·11
19+80)
+1
(I)
34
[5,1,4, I, 1"0]
23 '17
35+6J34
+1
(I)
D
0)
Relations
Quadratic forms
Character systems
-8X2+xy+y2 _x2_xy+8y2
+1,+1 -I, -I
-34x2 +y2 _x 2+34y2
+1,+1
...;
II>
~ '"
35
J34
j35
[5, I, 1"0]
37 W+J37) 38
[3, I, I, S]
J38
[6,6,1"2]
39
J39
[6,4,1"2]
40 41 W+J41) 42 J42
[6,~, 1"2] [3,1,2,2,I,S] [6,2,1"2]
22 .5.7
37 ,19
23
3.2 2 .13
41 3.23 '7
6+J35
+1
A2
(3,I+J34)
A
(I)
A2
(2,1 +j35)
A
+1, +1
-llx2 +2xy+3y 2 -I, -I -3x2_2xy+ 11y2 -1,-1 _35x2+y2 +1,+1,+1 _x2+35y2 +1, -1,-1 -17x2+2xy+2y2 _2X2 -2xy+ 17y2
-1,+1,-1 -1,-1,+1
-9X2+xy+y2 -38r+y2 _x2+38y2
+1 +1,+1 -I, -I
5+20)
-I
37+6J38
+1
(I) (I)
25HJ39
+1
(I)
A2
-39r+y2 _x2+39y2
+1, +1, +1 -1,+1,-1
(2,I+J39)
A
-19x2+2xy+2y2 _2X2 -2xy+ 19y2
-1,-1,+1 +1,-1,-1 +1 +1,+1,+1 -I, -I, +1 -I, +1,-1
19+3J40 27+100)
+1 -I
13+2J42
+1
(I) (I)
A2
-IOx2 +xy+y2 -42r+y2 _x2+42y2
(2,j42)
A
-2Ix2+2y2
.j:>..
0\ -.J
oj:>.
Table II (continued)
0\ 00
D
w
Continued fractions
LI
J43
[6, I, 1,3, 1,5, 1,3, I, I, 1"2]
22 .43
x+yJD
N(x+yJD)
Ideal classes
Relations
42 43
3482+531J43
+1
(I)
Quadratic forms
Character systems
-2x2+2Iy2 -43x2+y2
+1,-1,-1
-x2+43y2
-I, -I
+1, +1
+1, +1
44
[6, I, I, 1,2, I, I, I, 1"2]
199 + 30J44
+1
45
[6,1,2,2,2, I, 1"2]
161 + 24J45
+1
46
J46
[6,1,3, I, 1,2,6,2, I, 1,3, I, 1"2]
23 .23
24335 + 3588J46
+1
(I)
_46x 2+y2 -x 2+46y2
-I, -I
47
J47
[6,1,5,1,1"2]
22 .47
48+7J47
+1
(I)
_47x 2+y2 -x2+47y2
-I, -I
48
[6, I, 1"2]
7+J49
50
[7,1"4]
7+J50
+1 -I
50+7J51
+1
51
J51
[7,7,1"4]
3.22 .17
(I)
A2
_5Ix2+y2 _x 2+5Iy2
+1, +1
+1,+1,+1
?'
-1,+1,-1
== ::t
+1, -1,-1
1'5
0
(3,J51) 52
[7,4,1,2,1,4,1"4]
53 W+J53) 54
[7,2,1,6,1,2,1"4]
55
J55
[4,7] [7,2,2,2,1"4]
53 22 .5.11
649+90J52
+1
3+w
-I
485 + 66J54
+1
89+12J55
+1
A
-17x2+3y2 -3x2 + 17y2
-1,-1,+1
0-
g. ==
0 -13x2+xy+y2
(I)
+1
~
"cr"...
(I)
(2,1 +J55)
A2
A
~.
-55x2+y2 -x2+55y2
+1,+1,+1
-27x 2+2xy+2y2 -2x 2 -2xy+27y2
-1,-1,+1
...g'
-1,+1,-1
>-l ::r
+1, -1,-1
Z c:
8
"... 0
'<
Table II (continued)
?' 00
D
w
57 W+J57)
59
J58 J59
60
N(x+yJD)
3·19
15+2J56 131 +40w
+1
(I)
[7, I, I, I, I, I, I, 1"4]
23 .29
99 + I3J58
-I
(I)
[7,1,2,7,2, I, 1"4]
[4,2,2,7] [7,1,6, I, 1"4]
63
[7,1,1"4]
65 W+J65)
[4, I, 1,7]
J66
x+yjD
Ideal classes
[4,~, I, I, 1,3,7]
22 .59
[7,1,2, I, 1"4]
61 W+J61) 62 J62
66
J
[7,2,1"4]
56
58
Continued fractions
[8,8,1"6]
530+69J59 31 +4J60
61 2 .31 3
5·13
3.2 3 .11
+1
(2,J58) (I)
17+5w
(I)
63+8J62
+1
(I)
8+J64 7+2w
+1 -I
J67
[8,5,2,1,1,7,1,1,2,5,1~]
22 .67
48842 + 5967 J67
Character systems
-14x 2 +xy+ y2 _x2 -xy+ 14y2
+1, +1
A2
-58x2+y2
+1, +1
A
-29x 2+2y2 _59x2+y2
-I, -I
-x2+59y2
-I, -I
-15x 2+xy+ y2 _62x2+y2 _x2 + 62y2
+1, +1 -I, -I
[8,4,1"6]
33+4J68
'"~ en
+1
+1 +1
+1, +1
+1
A2
-16x2+xy+y2
+1, +1
( 5 2I+J65) +2- ,
A
-2x2+5xy+5y2
-I, -I
(I)
A2
_66x2+y2 -x 2+66y2
+1,+1,+1
A
_22x2+3y2 -3x2+22y2 _67x2+y2
-I, +1,-1
(I)
_x2 + 67y2 68
....,
-I, -I
(I)
(3,J66) 67
Quadratic forms
+1
+1 -I
65+8J66
Relations
-I, -I, +1 +1,-1,-1 +1, +1 -I, -I oj:>.
0\
1.0
...,.
Table II (continued)
-.J
0
D
(0
69 W+j69) 70
j70
Continued fractions
L1
x+yjD
N(x+yjD)
Ideal classes
[4, I, I, 1,7]
3·23
11+3(0
+1
(I)
[8, 2, I, 2, I, 2, 1"6]
5'7.23
251 +30j70
+1
(I)
(2,j70) 71
j71
22 '71
[8,2,1"6]
72
73 W+)73) 74
[8,2,2, 1,7,1,2,2,1"6]
j74
[4,1,3,2, I, 1,2,3, 1,7] [8, I, I, I, I, 1"6]
73 23 ,37
(I) (I)
A2
-I8x2+xy+y2 _74x 2+y2
+ I, + I
(2,j74)
A
-37x 2+2y2
-I, -I
+1
(I)
+1
(I)
[8, 1,7, I, 1"6]
22 '79
+1, +1 -I, -I
-1,-1,+1
-I
4+(0
)79
-7Ix2+y2 _x 2+7Iy2
-I
+1
79
-I, +1,-1
43+5j74
57799 + 6630j76
3 .2 3 '13
-35x 2+2y 2 _2X2+35y2
A
943+250(0
[8, 1,2, I, 1,5,4,5, I, 1,2, I, 1"6]
[8,1,4, I, 1"6]
+1,-1,-1
+1
76
fo
+1, +1, +1
_x 2+ 70y2
17+2)72
+1
78·
+1, +1 -I, -I
A2
-I7x2+xy+ y2 _x 2 -xy+ I7y2 _70X 2+y2
(I)
26+3)75 7 ·11
Character systems
+1
[8, I, I, I, 1"6] [4, 1,7]
Quadratic forms
3480+4I3j71
75 77 W+)77)
Relations
53+6j78
80+9)79
+1
+1 ?'
a.... 0
Q.
c:
g.
A2
(2,j78)
A
(I)
[2
0
:s
-I9x2+xy+y2 -x 2-xy+ I9y2
+ I, + I -I, -I
0
_78x2+y2 _x2+78y2
+1,+1,+1
dO 0....
-1,+1,-1
-39x 2+2y 2 _2X2+39y2 _79x 2+y2
-1,-1,+1
_x2 + 79y2
-I, -I
+1,-1,-1 +1, +1
>
"
~.
Z c: 3 if ....
,..,
::r
".... 0
'<
Table II (continued)
?' 00
D
Continued fractions
00
L1
x+yJD
N(x+yJD)
79
Ideal classes
Relations
Quadratic forms
Character systems
(3,2+J79)
[2
-25x 2+4xy+3y 2 - 3x2 - 4xy + 25y2
-I, -I
(3, I +J79) [8, I, 1"6]
80 82
[9,1"8]
J82
9+J80 23 '41
9+J82
+1 -I
83
[9,9,1"8]
J83
22 '83
[9,6,1"8]
84
[5,9]
85 W+J85)
86
J86
87
J87
[9,~,
I, I, 1,8, I, I, 1,3, 1"8] [9,~, 1"8]
82+9J83 55+6J84
5 ·17
4+00
+1 +1 -I
-I, -I
_82x 2+y2
+1, +1
(3, I +J82)
[2
-27x 2+2xy+3y2
-I, -I
(2,J82)
[2
-4Ix2+2y2
+1, +1
-26x2+4xy+3y2 _83x 2+y2
-I, -I
_x 2+83y2
-I, -I
+ I, + I
(I)
A2
-21x 2+xy+y2
+1, +1
( 52+ I +J85) -2- ,
A
-3x2+5xy+5y2
-I, -I
-86x2 +y2 _x2+86y2
+1, +1
-87x 2+y2 _x2+87y2
+1, +1, +1
23 ,43
10405 + 1122)86
+1
(I)
3.2 2 .29
28+3J87
+1
(I) (2, I +J87)
A2
A
'"
+1, +1
[4
(I)
'~"
+1, +1
- 26x 2 + 2xy + 3y2 - 3x 2- 2xy + 26y2
(I)
(3,2+J82)
...,
-I, -I -I, +1,-1
-43x 2+2xy+2y 2 -I, -I, +1 _2X2 -2xy+43y 2 +1,-1,-1 ~
-.J
~
Table II (continued)
-..I
N
D
w
Continued fractions
88
[9,2, I, I, 1,2, 1"8]
89 to+)89) 90
[5,4, I, I, I, 1,4,9]
91
)91
.d
89
[9,2,1"8] [9, I, 1,5, 1,5, I, I, 1"8]
22 '7'13
x+y)15
N(x+yjD)
197+21)88 447 + 106w
+1 -I
19+2}90
+1
1574+ 165J91
+1
---[9, I, 1,2,4, 2, I, I, 1"8]
92 93 to+)93) 94
)94
Ideal classes
Relations
(I)
)95
97 to +)97) 98
+1,+1,+1 -1,+1,-1
(2, I +J91)
A
-45x2+2xy+2y2 _2x2_2xy+45y2
+1,-1,-1
-23x 2+xy+ y2 -x2 -xy+23y2
+1, +1 -I, -I
_94x 2+y2
+1, +1 -I, -I
1151 +120)92
+1
3·31
13+3w
+1
(I)
[9,1,2,3, I, 1,5, 1,8, 1,5,
23 ,47
2143295 + 221064J94
+1
(I)
-x 2+94y2
22 '5'19
99
[9, 1,3,1,1'8]
39+4)95
+1
49+5)96 5035+ 1138w
+1 -I
[9, I, 8, I, 1"8]
99+10)98
+1
[9, I, 1"8]
10+)99
+1
[5,2,2,1,4,4,1,2,2,9]
+1
_9Ix2+y2 _x2+9Iy2
(I) (2, I +)95)
96
-22x2+xy+y2 A2
[5,~, 9]
[9,1,2,1,1'8]
Character systems
(I)
I, 1,3,2, I, 1"8] 95
Quadratic forms
97
(I)
A2
A
-I, -I, +1
?'
:s
::;
-95x2 +y2 -x2+95y2
+1,+1,+1
0 Po
+1,-1,-1
-47x2+2xy+2y2
-1,-1,+1
a.0
_2x2_2xy+47y2
-1,+1,-1
-24x2+xy+y2
+1
c:
:s
0 ;I> OQ ro a' ...,
~.
Z c:
3 ...,g' >-l ::r ro
0..., '<
Notes
473
Notes 16.1. The problem concerning the number of imaginary quadratic fields that are simple fields was solved by H. M. Stark [55J and A. Baker [3J independently; see also [4J.
Chapter 17. Algebraic Numbers and Transcendental Numbers
17.1 The Existence of Transcendental Numbers A real number can be represented as a point on a straight line, so that a collection of real numbers is sometimes called a point set. For example, {lIn: n = 1,2, ... } is a point set, the set of rational numbers in the interval (a, b) is a point set. Definition 1.1. Let A, B be two point sets. Suppose that there exists a one-to-one correspondence between A and B (that is, there exists a bijection from A to B). Then we say that A and B are equipotent, or A and B have the same cardinal number. Being equipotent is an equivalence relation. Definition 1.2. Any set which is equipotent to the set of natural numbers is called enumerable. A countable set is one which is either finite or enumerable.
The set of natural numbers itself is, of course, enumerable; so is the set {lln:n = 1,2, ... }. Any sequence of numbers is a countable set.
Theorem 1.1. The countable union of countable sets is countable. Proof Let the countable sets be Mb M 2 , ••• where displaying the union as the rectangular array !Y.11
!Y.12 .,(
1X21
1X13
Mi
= (lXil,""
!Y.ij, •• •).
On
!Y.14
.,(
1X22
!Y.23
.,(
!Y.31
1X32
we can form the sequence (1X11, !Y.12, !Y.21, !Y.13, !Y.22, !Y.3b !Y.14, ••• ) following the arrows. The theorem is proved. D Theorem 1.2. The set of rational numbers is countable. Proof From Theorem 1.1 it suffices to show that the rational numbers in the interval [O,lJ is countable. We first arrange the reduced fractions in [O,lJ
475
17.1 The Existence of Transcendental Numbers
according to the size of their denominators, and when two fractions have the same denominator, we then arrange them according to the size of their numerators. This then gives the sequence Q 1. 1. 1. 1. LJ. 1. 1. l. .4. 1'1' 2' 3' 3' 4' 4' S' S' S' S'· ..
and so the theorem is proved.
D
Theorem 1.3. The set of real numbers in the interval (0, 1) is not countable. Proof Suppose the contrary and let OCt> OC2, OC3, ••• be an enumeration of the real numbers in (0, 1). Each OCi has a decimal expansion
We define a real number bi
fJ = 0 . b 1b 2 a ii + 1, { =
aii -
1,
•••
bn ... by setting
if 0 ~ aii ~ 5, if 6 ~ aii ~ 9.
We note that fJ is a real number in the interval (0, 1) which is different from OCi, for every i, because they differ at the i-th decimal place. This gives the required contradiction. (Observe that a terminating decimal may have two decimal representations, for example 0.12 = 0.11999 .... However the decimal representation of fJ does not contain any 0 or 9.) D Exercise 1. Determine the position of the reduced fraction alb in the proof of Theorem 1.2. Exercise 2. Show that a subset of a countable set is countable. In the previous chapter we defined an algebraic.number equation
eto be a root of the
where an, an- b ... , ao are rational integers. If this equation is irreducible and an =F 0, then is called an algebraic number of degree n, and if an = 1 then is called
e
e
an algebraic integer of degree n. Theorem 1.4. The set of all algebraic numbers is countable. Proof LetN = n + lanl + la n-11 + ... + laol sothatN ~ 2. Corresponding to each fixed N there can only be a finite number of polynomial equations, and each equation has only a finite number of roots so that the number of algebraic numbers corresponding to N is also finite. We denote by EN this set of algebraic numbers and consider the sequence E 2 , E 3 , ••• , EN, . . . . Let E~ be the subset of EN whose members are not already members of E 2 , ••• , EN -1. We then form the sequence of finite sets E 2 , E~, E~, .... From Theorem 1.1 the union of these sets is countable and the required result is proved. D
476
17. Algebraic Numbers and Transcendental Numbers
Definition 1.3. A number which is not algebraic is called a transcendental number. Theorem 1.5. Transcendental numbers exist. Proof From Theorem 1.3 and Exercise 2 we know that the set of all real numbers is uncountable. Since the set of real algebraic numbers is countable the required result is proved. D
17.2 Liouville's Theorem and Examples of Transcendental Numbers Theorem 2.1 (Liouville). Any algebraic number of degree n is not approximable by rationals to an order greater than n. That is, if is an algebraic number of degree n, then to every (j > 0 and A > 0 the inequality
e
(1)
has only finitely many rational integer solutions in p and q. Proof Suppose that
esatisfies th~ equation
There exists M = M(e) such that IF(y)1 < M for all y in the interval 1 < y < + 1. If p/q (q > 0) is a rational number near we may assume that 1 < p/q < + 1 andf(p/q) =F 0, so that
ee-
e
e
e
also
where '1 lies between p/q and
e. Therefore we have
(j > 0 and A > 0, the inequality (1) can have at most a finite number of solutions in p and q. D
It follows that given
We can now construct two transcendental numbers using this theorem.
477
17.2 Liouville's Theorem and Examples of Transcendental Numbers
Theorem 2.2. The two numbers
and
1 - 1 - 1 ... e=10 + 10 2! + 10 ! + 3
are transcendental. Proof 1) Let q
= lOn !
so that P
0<
e- q=
1 1O(n+1)!
+ ...
2
<
1O(n+1)!
2
= qn+1'
e
where n can be arbitrarily large. Therefore, by Theorem 2.1, cannot be algebraic. 2) Let
111
e= 10 + 102! + 10
31
+ ... = [0, at> a2, a3,·· .J,
and let Pn/qn be its n-th convergent. Then
Now an+1
= 10(n+ 1)1, and (n
so that
qn < (a1
<
+ 1)(a2 + 1) ... (an + 1)
(1 + ~)(1 +~) ... (1 + _1 )a 1a 2" 'a n 10 10 Ion
Therefore
Pn I 1 n1+-
so that, as before,
n+1
n
emust be transcendental.
1
n
D
n
~
1),
478
17. Algebraic Numbers and Transcendental Numbers
Exercise. Construct an uncountable set of transcendental numbers. (Suggestion: Show that if (an) is an increasing sequence of natural numbers, then 1
1
1
+ -lOa2·2! - + -lOa3"3! - + ... lOa! is a transcendental number.)
17.3 Roth's Theorem on Rational Approximations to Algebraic Numbers Liouville's theorem can be made much sharper. Let us denote by K the least positive number with the following property: given any real algebraic number ~ of degree n (~ 2) and given v> K, the inequality
I ~-~I<~ q qV
has only finitely many rational integer solutions in p and q (q > 0). From Liouville's theorem we see at once that K ~ n. Liouville's theorem was successively improved by Thue, Siegel and Dyson who proved that K ~!n + 1, K ~ minl ~s~n-l . (s + n/(s + 1», and K ~ respectively. Finally in 1955, Roth proved that K ~ 2, and this is the best possible result since given any irrational number ~ there are always infinitely many integers p, q (q > 0) such that I~ - p/ql < l/q2.
fo
Theorem 3.1 (Roth). Let ~ be any irrational algebraic number and let 0 > O. Then the inequality
1 ~_~I<_1 q q2H has only finitely many rational integer solutions in p and q (q> 0).
D
The proof of this celebrated theorem can be found in An Introduction to Diophantine Approximation by J. W. S. Cassels, Cambridge University Press, 1957.
17.4 Application of Roth's Theorem Theorem 4.1. Let n
~
3 and let
be an irreducible homogeneous polynomial with rational integer coefficients. Suppose that g(x,y)
=
L r+s~n-
grsx')' 3
479
17.4 Application of Roth's Theorem
is a polynomial with degree at most n - 3 and with rational coefficients. Then the equation f(x,y) = g(x,y)
(1)
has only finitely many solutions in integers (x,y). Proof We need only consider the case Ixl ~ Iyl. If y = 0, then there is at most one solution. Suppose now that y > 0. Let 0 ( 1 ) ' ' ' ' O(n be the roots of the equation f(x, 1) = 0, and let G = max(lgrsl). From (1) we have
(2)
Therefore there exists v such that
here Cl> and in what follows, C2, C3, C4, Cs are positive constants. When JJ. =I- v, and y is greater than a sufficiently large C2, we have Ix - O("yl
= I(O(v -
+ (x -
O(,,)y
so that by (2) and (3), Ix - O(vyl <
CS/y2
I
o(v -
O(vY) I > C3Y -
l-~ C1Y
n>
C4Y,
(3)
or
~I<
;! .
By Roth's theorem this inequality has only finitely many integer solutions, and the situation when y ~ C2 is also clear. The theorem is proved. 0 Theorem 4.2 (Thue). Let n ;;:: 3 and let
be an irreducible homogeneous polynomial with rational integer coefficients. If a is a rational integer, then the equation g(x, y) = a has only finitely many integer solutions in x,y. Proof This is a special case of Theorem 4.1.
0
Theorem 4.3 (Thue). In the previous theorem if a =I- 0, then the hypothesis that g(x,y) be irreducible can be relaxed, provided that g(x,y) is not the n-th power of a linear form or the n/2-th power of a quadratic form. Proof If g(z) = g(z, 1) is irreducible then there is nothing more to discuss. If g(z) is a power of an irreducible polynomial h(z) with degree m ;;:: 3, so that g(z) = (h(z))n/m, then the problem becomes that of the solutions to the equation ymh(x/y) = am/no If
480
17. Algebraic Numbers and Transcendental Numbers
a mln is a rational integer, then the problem is reduced to that in Theorem 4.2, and if a mln is not a rational integer the equation concerned has no solutions. We now
suppose that g(z) = g1(Z)g2(Z) where g1(Z) and g2(Z) are integer coefficient polynomials with degrees rand s respectively, and with no common factors. The problem is now transformed into that of the solutions to
Given a, the number of pairs at> a2 is finite. If there is a solution y =F 0, y"g1(Z) = at> ySg2(Z) = a2 have common roots so that
But g1 (z) and g2(Z) have no common factor and g2(Z) =F ± a2, 9 1 (z) =F this equation cannot hold. The case y = 0, ± 1 is trivial. 0
± 1, then
± a1 so that
Note: The condition a =F 0 is necessary, since the equation x 3 - y~ = 0 has infinitely many solutions. Also (sx + ty)n = an, (x 2 - 2y2)1 = 1 both have infinitely many solutions so that the remaining conditions are also necessary.
17.5 Application of Thue's Theorem Theorem 5.1 (Landau-Ostrowski-Thue). Let n ~ 3, b 2 Then the indeterminate equation ay2
+ by + c = dx"
-
4ac =F 0, a =F 0, d =F O.
(1)
has only finitely many solutions. Proof We rewrite (1) as (2ay + b)2 - (b 2 - 4ac) = 4adx", and we see that if the equation yi - (b 2 - 4ac) = 4adx n has only finitely many solutions, then the same holds for the original equation. We can therefore assume that a = 1, b = 0, that is, it suffices to prove that if n ~ 3, k =F 0, I =F 0, then y2 _ k = Ixn
(2)
has only finitely many solutions. 1) Suppose that k = m 2 so that (2) becomes (y - m)(y
+ m) =
Ix".
If x = 0, then y = ± m. Suppose now that x =F 0 so that y =F ± m. If p ,r 2ml, then p cannot be a prime divisor of both y + m and y - m. Therefore y + m and y - m have the representations
481
17.5 Application of Thue's Theorem
+m =
±p~1
... p,/zn = qz",
O::;;;ri::;;;n-I,
Y- m=
± p~1
... pjiwn = twn,
0::;;; Si
y
::;;;
n -
I,
where Ph'" ,Pj are the prime divisors of 2m I, and q, t may take only finitely many non-zero values. Corresponding to a set q '# 0, t '# 0, the functionj(z) = qzn - t has no repeated zeros, so that the hypothesis to Theorem 4.3 is satisfied. It follows that the indeterminate equation qzn - twn = 2m has only finitely many sets of non-zero solutions, and so the theorem is proved. 2) Suppose that k is not a perfect square and let k > 0, k < O.
We need only consider the solutions to (2) with x > 0, that is the solutions to y2 _ k = Ix n ,
x> O.
(3)
Let x, y be any set of solutions to (3). By Theorem 6.10.5 there are integers rand q such that (4)
Let qr - rx = s, so that
Isl<Jx
(5)
s == qy (mod x).
(6)
and Also, let
since k is not a perfect square, t '# O. From (6) and (3) we have
so that t is an integer. Also, from
It I ::;;; ( S2 + q2lkl)n < (x + xlkl)n = (l + Ikl)n, x
x
we see that, corresponding to any fixed nand k, the integer t may take only finitely many values. We next define p = (s - q8)"(y + 8)/xn, = s + q8 so that
e
t(y
+ 8) = pen.
(7)
482
17. Algebraic Numbers and Transcendental Numbers
From (s - q8)n
= (q(y
- 8) - rx)"
=
(AI
+ A 28)(y -
8)
+ (-
l)"rnxn,
we have xnp
= (s = (AI
+ 8) = (AI + A 28)(y2 - k) + (- l)"rnxn(y + 8) + A 28)lxn + (A3 + A48)Xn = (As + A 68)xn, q8)n(y
or (8)
where A I, A 2, ... ,A6 are integers depending on x, y. From
and
lsi + ql81 ~ Jx + Jx JikI =
Jx (l + JikI),
we have lAs
± A 681 =
I(s =+= q8~:(Y ± 8) I
~ (l + JikI )"(Jlk I + III + JikI). Therefore from and
we see that, given fixed integers n, k, I the numbers As, A6 may only take a finite number of distinct values, that is the number of values that pmay take is finite. We already know that t may take only finitely many values so that, given fixed n, k, I there are only finitely many equations (7). From (7) we have t(y
+ 8) = (As + A 68)(s + q8)",
t(y - 8) = (As - A 68)(s - q8)".
Therefore
Given n, t (=F 0), As, A 6, if we can show that the right hand side of the above equation is a polynomial g(s, q) satisfying the hypothesis of Theorem 4.3, then (9)
483
17.6 The Transcendence of e
has only finitely many sets of solutions (s, q) so that from (7) we see that there are only finitely many different values for y, and hence the equation (2) also has only finitely many solutions. It remains therefore to show that g(s, q) satisfies the hypothesis of Theorem 4.3, and for this it suffices to show that J(z)
1
= "8 [(As + A68)(z + 8)" -
(As - A68)(z - 8)nJ
has no repeated zeros. But this is a simple matter. For ifJ(z) = O,J'(z) = 0 have a common solution z = zo, then Zo must satisfy As + A6 8 As - A68
= (~)n = (~)n-l Zo
+8
Zo
+8
Since (z - 8)/(z + 8) =f. 1, this is impossible. The theorem is proved.
D
Exercise 1. Let n be an odd integer greater than 1. Arrange the integers which are either a square or an n-th power into an increasing sequence (zr). Prove that Zr+1 - Zr-? 00 as r-? 00. Exercise 2. Let
<0 = min(e -
[eJ, [eJ + 1 -
e).
Prove that
17.6 The Transcendence of e The existence of transcendental numbers has been established, and indeed our proof shows that almost all real numbers are transcendental since the algebraic numbers form a countable set. We now turn to the problem of the transcendence of specific given numbers such as e, n or sin 1. This problem is much more difficult. In this and the next section we prove the transcendence of e and n, but up to the present no one has established the transcendence, or otherwise, of e + n. Similarly we do not know the transcendence of Euler's constant y= lim
(1 +~+
+~-IOgn),
... 2 n and in fact we do not even know whether it is rational or irrational. This is a part of the famous seventh problem of Hilbert and the other part is the main theme of sections 8 and 9. n-+oo
Theorem' 6.1. The number e is irrational.
Proof We shall prove that e- 1 is irrational. Let e- 1 = (Tn + Pn where 00 n ( _ l)k (Tn
=
L -k-'-' •.
k=O
k=n+l
k!
484
17. Algebraic Numbers and Transcendental Numbers
It is easy to see that
0< (- 1)n+ 1p = n
1 (n
+ I)!
1
1
+ ... < _--,--.,. (n + 2)! (n + I)! .
-
Therefore 0< n!Pn( - 1)"+1
1 n+l
< - - < 1,
so that
cannot be an integer.
D
Theorem 6.2. Let n
I(x) = L amxm, m=O m
F(x) = L j
F(O)e X
-
F(x) = Q(x).
Then n
IQ(x)1 ::;; e1xI L lamllxl m. m=O Proof We have the following identity: n n m! F(x) = L L am x m- k k=O m=k (m - k)! n m m! n m m! = L am L x m- k = L am L _Xk. m=O k=O (m - k)! m=O k=O k!
In particular we have
n
F(O) = L amm!. m=O
Therefore
=
Lam L m' _'xk I Im=O k=m+ k! n
00
1
n
00
::;; L laml L Ixlkj(k - m)! m=O k=m+ 1 n 00 lxi' n = L lamllxl m L - , ::;; e1xI L lamllxlm. m=O '=1 I. . m=O
D
485
17.6 The Transcendence of e
Theorem 6.3 (Hermite). The number e is transcendental. Proof Suppose that e is a zero of P(x) where m
L ghx1',
m>O, h=O and gh are rational integers. Let p be a prime number greater than max(m, Igol), and let P(x) =
m
xp -
1
I(x) =
IT (h -
x)P
h=l (p - I)!
k=O Since h is a zero of I(x) of order p we can write + A Px P + ... B p,h (x - h)P I(x) = (m!)PxP-l (p - I)!
+ B p+ 1 ,heX -
h)P+l
+ ...
(p - I)!
where A, B are rational integers. We can now construct F(x) and Q(x) according to Theorem 6.2, and we have m
m
m
h=O
h=O
h=O
We already know that m
n
m
n
h=O
k=O
h=l
k=O m
h= 1 is a rational integer. Observe that, on the right hand side of this equation, p-t'go(m!)p and the remaining terms are all multiples of p, so that
I hto ghF(h) I ~ 1. If we can prove that there exists a prime p > max(m, Igol) such that
IhtO ghQ(h) 1< 1, then the required contradiction will be obtained from equation (l). By Theorem 6.2 it suffices to prove that, for any fixed x, we have n
as k=O but this is easy because, as p
p -+ 00,
-+ 00, m
IT
IxIP- 1 (h + Ixl)P L lakllxl k ~ _ _"-h=-=l_ _ _ -+ O. (p - I)! k=O n
D
486
17. Algebraic Numbers and Transcendental Numbers
17.7 The Transcendence of 1C Theorem 7.1. The number n is irrational. Proof (Niven). Suppose the contrary, so that n = alb where a, b are positive integers. Let xn(a - bx)n f(x)=---
n!
and F(x)
= f(x) - j<2)(X) + j<4)(X) - ... + ( - 1)"f(2n)(x).
It is easy to see thatf(x) together with its derivatives are integers when x so that F(O) and F(n) are integers. Now d -(F'(x) sin x - F(x)cosx) dx
=
(F"(x)
+ F(x» sin x
= 0 and n,
=f(x) sin x,
so that
" ff(x)SinXdX
=
(1)
F(n) - F(O)
o
is an integer. But, for 0 < x < nand n sufficiently large, we have nna n 1 o
" 0< ff(x)SinXdX < I, o
giving a contradiction to (1) being an equation in integers.
0
Theorem 7.2 (Lindemann). The number n is transcendental. Proof We shall prove that in is not algebraic. Suppose the contrary and let in satisfy
the equation
so that ain satisfies the equation
am-lf(~) = xm + al~-l + ...
= O.
Our supposition that in is algebraic implies that ain is also algebraic. The required
487
17.7 The Transcendence of 11:
contradiction will therefore follow if we can prove that ain does not satisfy any equation of the form P(y)
= ym + k m_ 1ym-l + ... + ko = 0,
m>O.
Let m
P(y) =
n (y h=l
alXh)'
Since 1 + e i1t = 0, it suffices to show that m
n (eO -
R=
e"h) =F O.
h=l Now R can be written as
R = c + Le"
+ Led'" + ... = c + efJl + efJ2 + ... + efJ
r,
where c is the number of the 2m terms in which the power of the exponential is zero, and 131,132,'" , f3r are non-zero numbers. Let p be a prime greater than max(c, a, n~= 1 alf3hl) and define I(x) by (ax)p-l I(x) =
n (ax -
af3h)P
h=l
(p - I)!
Similarly to the proof of Theorem 6.3 we have YP,h(X - f3h)P
+ Yp+ l,h(X -
f3h)P+ 1
+ ...
(p - I)!
where A" being symmetric functions of af31, ... , af3r and hence symmetric functions 'of alXb" . , alXh, are rational integers and A p- 1 =1= 0 (modp). On the construction of the corresponding F(x) and Q(x) we have F(O)R = F(O)
(c + tl h
efJh)
tl
t
= cF(O) + h F(f3h) + h Q(f3h),
so that cF(O) = C(Ap-l
+ pAp + ... )
is a rational integer which is not a mUltiple of p. Also
L F(f3h) = L (PYP,h + pcp + 1)yp+ l,h + ... )
h=l
h=l
L
L
Yp,h + pcp + 1) Yp+ l,h h=l h=l = pCp + pcp + I)Cp+l + ... ,
=p
+ ...
488
17. Algebraic Numbers and Transcendental Numbers
where cP' cp + 1>" ., being symmetric functions of apt> . .. , apr> are integers. It follows that L~ = 1 F(Ph) is a mUltiple of p and whence
IcF(O) + htl F(Ph) I~ 1. It only remains to show that, for sufficiently large p,
But, as p
--+ 00,
n
L lakllxlk ~ k=l
L
(alxl)P-l (alxl + alPhl)P _ _ _:.:....h=-=l'--_ _ _ _ --+ 0 (p - I)!
so that the result follows from Theorem 6.2.
D
Remark. This theorem settles the problem of "squaring the circle" - it is impossible to construct a square equal in area to a given circle, using only straight edge and compass. Exercise 1. Prove that sinh
eis transcendental whenever eis rational.
Exercise 2. Prove that sin 1 is transcendental by proving that ei is transcendental.
17.8 Hilbert's Seventh Problem In the year 1900 Hilbert gave a list of 23 unsolved problems which he believed to be worthy of the attention of mathematicians in the twentieth century. We already mentioned the first part of his seventh problem, and the remaining part is the following: Let (J( and Pbe algebraic numbers with (J( =F 0, 1 and Pirrational. Does it follow that (J(P is transcendental? As specific examples he asked for the proofs of the transcendence of 2J2 and e" = ( - 1) - i. In 1929 the Russian mathematician A. O. Gelfond made an important contribution to the solution of this problem. He proved the transcendence of e" and pointed out that his method can be used to settle Hilbert's problem when plies in an imaginary quadratic field. In 1930 Kusmin used Gelfond's method to settle the case when P lies in a real quadratic field and proved in particular that 2J2 is transcendental. Then in 1934 the complete solutions to Hilbert's problem were given independently by Gelfond and Schneider. It may be of some interest to recall that, when discussing this problem, Hilbert was of the opinion that the solution would not be available before the solutions to the Riemann's hypothesis and Fermat's last theorem. It seems therefore that it is very difficult to judge the difficulty of an unsolved problem before a solution is available.
489
17.8 Hilbert's Seventh Problem
Let K be an algebraic number field of degree h, and let /31, ... , /3h be an integer basis, so that every integer in K has the unique representation a1/31 + ... + ah/3h where al> . .. , ah are rational integers. We shall denote by loci the maximum of the modulus of the conjugates oc(i) (1 ~ i ~ h) of oc, that is
loci = max loc(i)I. 1 ~i~h
In the following we let c, Cl> C2 be natural numbers depending on K and its basis /3l> ... , /3h. It is easy to show that if oc is an algebraic integer with oc = a1/31 + ... + ah/3h, then lad ~ clocl· Lemma 8.1. Let 0 < M < N, and ajk be rational integers satisfying lajkl ~ A (A ~ 1, 1 ~j ~ M, 1 ~ k ~ N). Then there exists a set ofrationalintegers Xl> ... , XN, not all zero, satisfying 1
~j~
M,
(1)
and M
IXkl
~
[(NA)N-M],
1 ~ k ~ N.
(2)
Proof Let 1 ~j~M,
so that this defines a mapping from rational integers (x 1, ••• , XN) to rational integers (Yl>· . . ,YM). We write N-M M H = [(NA)N-M] so that NA «H+ 1)~, and hence
NAH + 1 ~ NA(H
N
+ 1) < (H + I)M.
(3)
For any set of integers (Xl, . .. , XN) satisfying (4)
we have
where - B j and Cj represent respectively the sum of the negative and positive coefficients of Yj, so that the number of values assumed by Yj cannot exceed NAH + 1. The number of sets (Xl> ... , XN) satisfying (4) is (H + I)N and the corresponding number of sets (Yl> .. . ,YM) is at most (NAH + I)M. It follows from (3) that there must be two sets (x~, ... , x~) and (x~, ... , x~) which correspond to the same set (Yl> .. . ,YM). Let Xk = x~ - x~ (1 ~ k ~ N) so that (Xl> .. . , XN) is now the required set satisfying (1) and (2). D
490
17. Algebraic Numbers and Transcendental Numbers
Lemma 8.2. Let 0 < p < q, and let a.kl (1 ~ k ~ p, 1 ~ I ~ q) be integers in K satisfying Ia.kll ~ A. Then there exists a set ofalgebraic integers e1,' .. , eq in K, not all ' zero, satisfying· l~k~p
(5)
and 1~ I
Proof Let el=XllfJ1 integers. Let
+ ...
+XlhfJh (1
~/~q)
~
(6)
q.
where Xll, ... ,X,h are rational (7)
where aklrl> ... ,aklrh are also rational integers. For 1 ~ k q
q
~
p we have, from (5), that
h
o = L a.kle, = L a.kl L x'rfJr 1=1
1=1
r=l
= rt J1 X'r ut aklrufJu = ut Ct1 J1 aklrUXlr)fJU' Since fJl>' .. , fJh are linearly independent we have the hp number of equations h
q
L L ak'rux'r = 0,
1~ u
~
h,
(8)
r=l'=l
with hq number of unknowns. From (7) and our remark preceeding Lemma 8.1 we see that laklrul ~ cmax 1';;i';;h IfJd A ~ c2A. It now follows from Lemma 8.1 that the system (8) has a non-trivial set of solutions in rational integers satisfying 1~ I
~
q,
1~ r
~
h.
Therefore
le,l Taking
C1 =
~
IX llllfJ11 + ... + IX'hllfJhl
~
c2h(l
+ (hqC2A y/(q-P».
c2h the lemma is proved.
D
17.9 Gelfond's Proof Let a. and fJ be algebraic numbers with a. i= 0, 1 and fJ irrational, and we have to prove that a. Pis transcendental. Suppose the contrary, so that y = a. P = ePlog~ (where log a. may be any fixed value of the logarithm of a.) is also algebraic. We shall derive a contradiction.
491
17.9 Gelfond's Proof
Suppose that
0(,
f3, y lie in an algebraic field with degree h. Let m
where q2
= t
=
2h
q2 n=2m
+ 2,
is a square of a natural number and is a multiple of 2m. Also, let
P1, P2, ••. , PI represent the t numbers
(a
+ bf3) log 0(,
1 ~ a ~ q,
1 ~ b ~ q.
We introduce the integral function (1)
where the coefficients 1'/1>' . . ,1'/1 are determined by the following conditions. We solve the system of mn homogeneous linear equations
o~ k ~ n -
1~ I
1,
~
m,
(2)
in the t = 2mn unknowns 1'/1", .,1'/1' The coefficients of this system are numbers in K and
~
1
I
~
m,
1 ~ a,
b ~ q,
O~k~n-l.
Let C1> C2, ••• denote natural numbers which are independent of n. There exists C1 such that C10(, Clf3 and C1Y are all integers in K, so that on multiplying each of the coefficients of the system by c~-lc~qc~q = C~-l + 2mq (~ c~), the resulting coefficients become integers in K. Moreover the absolute value of the conjugates of the various coefficients is at most
It follows from Lemma 8.2 that there is a non-trivial set of integers solutions 1'/1>' . ·,1'/1 in K such that
1~ k
~
t.
Since the numbers Pl, ... , PI are distinct, the function R(x) is not identically zero. For suppose otherwise; then on expanding the right hand side of(1) we have 1'/lP~
which implies 1'/1 R(x)
+ 1'/2P~ + ... + 1'/IP~ =
= 1'/2 = ... = 1'/1 =
=
anix _l)n
0,
k = 0,1,2, ...
0, a contradiction. Thus we see from (2) that
+ an+l,/(X _
/)n+1
+ ... ,
1 ~ I ~ m,
(3)
where an,I> an + l,I> ••• are not all zero. Hence there must be a natural number r such
492
17. Algebraic Numbyrs and Transcendental Numbers
that R(k)(/) = 0,0 ~ k ~ r - I, I ~ I ~ m. But for I so that we see from (3) that r ~ n. Let us now examine the number
~
10
~
m we have R(r)(lo)"# 0
(4) c~ + 2mq p
This number lies in K, and
is an integer in K so that
IN(p) I > c 1h(r+2m q )
s
> c r.
(5)
On the other hand (6)
We now determine a suitable upper bound for Ipl. We apply Cauchy's integral formula to the function S(z)
R(z) m r TI (z-/o) k=l k *'0
= r!
(/0---k)r . z-k
'We then have
f
I S(z) p = (logoc)-rS(/o) = (logoc)-r-. --dz, (7) 2m z - 10 c where Cis the circle Izl = m(l + r/q), so that 10 (~ m) lies inside C. Asz varies on the circle we have .
IR(z)1 ~ t max l'1kle(Q+qIPIl}oglal.m(l +~) ~k~t
1 ~
...."
tcnnt(n+l)Cr+Q 4
~
9
""
Cr rt (r+3) 10
,
mr ( + qr) - m = q'
Iz - 101 ~ Izl - 1/01 ~ m I Iz - kl
~
mr
-, q
m
1
(z-/o)-rJI
I ~ k ~ m,
(I -
k)rl :-k
~C~1 (q)mr -; ,
k*lo
IS(x)1
~ r!c~ort(r+3)S1 (~rr ~ c;2rtr(3-m)+~.
From (7) we now have Ipi
~
I -I(logoc)-rl 2n .
II I
S(z) - Idzl z - 10
c
493
Notes
From (6) and (8) we have
Replacing m by 2h
+ 2 we now have
and from (5) we deduce that rt'-~h
< c'14 c'5 -- c'15'
Since r ~ n, this cannot hold for sufficiently large n, and the required contradiction is obtained. D
Notes 17.1. The proof of Roth's theorem has been omitted in this English edition (seeJ. W. S. Cassels [15]). W. M. Schmidt [51], [52] has given the following important generalization of this famous theorem: Let 0(10"', O(n be real algebraic numbers such that 1,0(10"', O(n are linearly independent over that rational field R. Then, given any B > 0, the inequality
has at most a finite number of sets of integer solutions
q10 ... ,qn'
17.2. A. Baker [2] has made the following important improvement on Thue's theorem: Let g(x,y) be a homogeneous irreducible polynomial of degree n (~3) with rational integer coefficients, and let m be a positive integer. Then all the integer solutions to the equation g(x,y) = m can be effectively determined. More specifically, if H exceeds the absolute values of all the coefficients of g(x, y), then all the integer solutions to g(x,y) = m must satisfy max(lxl, Iyl) < exp{(nH)<10n)5
+ (logm)2n+2}.
17.3. A. Baker [1] has made the following important generalizations of the Gelfond-Schneider theorem: (i) If 0(10"', O(m Po, P1o' .. ,Pn are non-zero algebraic numbers, then e/lOO(~' ..• O(~n is transcendental. ii) Let 0(10 ••• ,O(n be algebraic numbers not equal to 0 or 1, and let P10 ... ,Pn be algebraic numbers such that 1, Pl," . ,Pn are linearly independent over the rational field R. Then O(~' ... O(~n is transcendental.
Chapter 18. Waring's Problem and the Problem of Prouhet and Tarry
18.1 Introduction In the year 1770 Waring wrote the following in his Meditationes Algebraicae: Every positive integer is the sum offour squares, nine cubes, nineteen biquadrates, and so on. We may interpret "and so on" to mean that there exists an integer s(k) such that every positive integer is the sum of s(k) k-th powers. Well over a hundred years later Hilbert gave the first proof of the existence of s(k) for every positive integer k. We can restate the problem more precisely as follows: Let k be any fixed positive integer. We ask if there exists an integer s = s(k) such that, for any n > 0, the equation
n = ~ + ... + x!,
(1)
is always soluble in integers xv. We now denote by g(k) the least of all integers s with this property. Then Waring's statement becomes: "g(2)
= 4,
g(3) = 9,
g(4) = 19,
and so on."
We also denote by G(k) the least number swith the property that (1) is soluble for all sufficiently large n. Then clearly we have G(k)
~
g(k),
but in actual fact there is a great difference between the two numbers. In this chapter we only prove some very special results. The proof of the WaringHilbert theorem (that is g(k) < (0) is given in the next chapter. The proof, which Khintchin described as one of the three pearls in number theory, is due to Linnik and is much simpler than the original proof by Hilbert.
18.2 Lower Bounds for g(k) and G(k) Theorem 2.1. g(k) Proof Let q
~
2k
+ [@k]
- 2.
= [(W] and consider n = 2kq - 1 < 3k.
495
\8.2 Lower Bounds for g(k) and G(k)
This number ncan only be the sum of the powers lk and 2k, and in fact the least sfor the decomposition is given by n
= (q - 1)2k
+ (2k
- 1)1\
that is, n requires (q - 1) lots of 2k and 2k - 1 lots of 1\ giving g(k) ~ 2k
+ q - 2. D
From this theorem we see at once that g(2) ~ 4,
Theorem 2.2.
If k
~
~
9,
2, then G(k)
~
g(3)
g(4) ~ 19,
k
g(5)
~
37, ....
+ 1.
Proof Denote by A(N) the number of positive integers not exceeding N which are expressible in the form X~
We may suppose that
+ ... + x~,
Xl.' •. , Xk
are arranged so that
Hence A(N) cannot exceed the number of solutions to this set of inequalities, that is [N!lk]
A(N) ~
Xk
Xk-I
X2
L ... L
L L
1.
We claim that the sum on the right hand side is B(N) = -
1
k!
([N 1 / k]
+ 1)([N 1 /k] + 2)' .. ([N 1 /k] + k).
We can use induction to prove this. The claim clearly holds when k = 1, and so it remains to prove that
±(X +: -1) (Y +k), =
x=o
k
1
k
and this is easy to establish. When N -? 00, N
2
B(N)- k! <3 N ,
and so, for sufficiently large N, we have A(N) <~N.
This shows that the A(N) numbers concerned cannot include all the positive
496
18. Waring's Problem and the Problem of Prouhet and Tarry
integers less than N, and therefore G(k) ~ k
+ 1.
D
For certain values of k it is possible to raise the lower bound slightly by congruence considerations. For example: since
== 0 or 1 (mod 16)
X4
it follows that numbers of the form 16m sum, and therefore G(4) ~ 15. But if 16n = xi
+ 15 require at least 15 biquadrates as their + ... + xis
Moreover, 31 cannot be the sum offewer than 16 biquadrates so that 16·31 cannot be the sum of 15 biquadrates. We see therefore that . G(4) ~ 16.
In general we can use this argument to prove: Theorem 2.3.
If k =
2~ ~
4, then G(k) ~ 4k.
Proof We have already proved the case 8difficult to see that Xk
= 2. Suppose now that 8- > 2. It is not
== 0 or 1 (mod 4k).
Let n be any odd number. Suppose that 2.9-+ 2n is the sum of 2.9-+ 2 - 1 or fewer k-th powers. Then each of these k-th powers must be even, and hence is a multiple of2k. But 2k > 2.9-+ 2 and 2-/,n so that our supposition is impossible. Therefore the theorem is proved. D
18.3 Cauchy's Theorem Let q > 1. In this section we discuss the condition for the solubility of the congruence x~+···+x~==n
(modq).
From the Chinese remainder theorem we see that we can restrict our discussion to the congruence x~
+ ... + x~ == 0
(modp'),
(1)
497
18.3 Cauchy's Theorem
where p is a prime number. Since n = n - 1 + 1k we may also assume in what follows that p-fn. We first prove the following: Theorem 3.1 (Cauchy). Let Xl>'" ,Xm be m incongruent numbers (modq) and Yl>'" ,Yn be n incongruent numbers (modq). Suppose that there exists Yi such that (Yi - Yi,q) = 1 whenever j i= i. Then the number of incongruent numbers (modq) represented by Xu + Yv (1 ~ u ~ m, 1 ~ v ~ n) is at least min(m
+n -
1, q).
Proof The theorem is trivial if n = I. Suppose therefore that n ~ 2 and we may also assume that i = 1. We use an inductive argument. Let Zl>'" ,Zt be incongruent numbers (modq) of the form Xi + Yi- If t = q the required result is established. We suppose therefore that t < q and we denote by X, Y, Z the sets Xl>'" ,Xm ; Yl>'" ,Yn; Zl>'" ,Zt respectively. Consider Xl + Yl + A(Yn - Yl)' When A = 0, 1 all such numbers belong to Z. Since (q,Yn-Yl)=1 there must exist a Ao such that xl+Yl+(Ao-I)(Yn-Yt) E Z and Xl + Yl + AO(Yn - Yl) if; z. Let (j = Xl + Yl + AO(Yn - Yl) + Yl' Then (j - Yl if;Z and (j - YnEz. We can arrange Yl>'" ,Yn so that {
Clearly r
~ n -
~
(j -
Ys if;Z
(1
(j -
Ys'EZ
(r <
~
r),
S' ~
n).
s
I. Write
Z' = {z:z = Xu
+ Ys; u =
1,2, ... ,m; s = 1,2, .... ,r}.
Then (j - Ys,if;Z'; otherwise from (j - Ys' = Xu + Ys we have (j - Ys = Xu + Ys'EZ. If we denote by t' the number of incongruent numbers (mod q) in Z', then t ' ~ t - (n - r). On the other hand, from the induction hypothesis we have t' ~ m + r - 1. Therefore t ~ m + n - 1. 0 Definition. Suppose that ptllk. Then we define
+ I, "C + 2,
y = {"C
Theorem 3.2.
when p > 1; when p = 2.
If the congruence (2)
is soluble, then, the congruence Xk
== a (mod pI)
is also soluble whenever I> y. Proof Let Y be a solution to (2), and g be a primitive root of p' (if p = 2, we take g = 5). We fix b ~ 0 so that
498
18. Waring's Problem and the Problem of Prouhet and Tarry
(3) and hence gb == 1 (modpY). Therefore pt(p - 1)lb. Let b clearly replace the exponent b in (3) by
where h is any integer. Let k
= pt(p - l)b l . We can
= ptkb (kbP) = 1. We can then take h so that
We then have
The theorem is proved.
0
Theorem 3.3. Suppose that the congruence (1) has a solution when I = y. Then it has a solution when I> y. Proof By hypothesis there are Yb' .. ,Ys such that y~
+ ... + y~ == n
(modpY).
Since p,tn there must be a y, which we may take as Yb such that p,tYl, and so from
y~ == n' - y~ - ... - y~ and Theorem 3.2, there exists ~
Xl
(modpY)
such that
+ y~ + ... + Y: == n
(modpl).
0
Theorem 3.4. If k = 2" then (1) is always soluble with s ~ '4k; if k =I- 2" then (1) is always soluble with s ~ 3k + 1. Proof Clearly it suffices to consider the case I ~ y, and by Theorem 3.3 we need only ' consider the case I = y. 1) If k = 2 t , then pY = 2t+2 = 4k. The congruence x~
+ ... + x: == n
(mod 2Y)
is clearly soluble when s ~ 4k. 2 l )p = 2, k = 2tko, ko > 1, 2,tk o. Here k ~ i' 2Yso that, when s ~ 3k > 2Y, (1) is soluble. 2 2 ) p > 2,p - 11k. Here k ~ pt(p - 1) > pY/3 so that, when s ~ 3k > pY, (1) is soluble. 2 3 ) p > 2, (p - l),tk, p,tk. Here y = 1. From p - l,tk, Theorems 3.7.2 and 3.7.3, we see that as x runs over a set of reduced residues mod p, Xk gives
499
18.4 Elementary Methods
d=
p - l >1 (k,p - 1)
incongruent numbers (modp). From Theorem 3.1, gives
x~
+ ... + X:(P,tXb'"
,x.)
min(d + (d - 1)(s - 1),p) incongruent numbers (modp). When p-l
p-l
Id
d-l
s~ 2k>-1-~--'
we have min(d + (d - 1)(s - 1),p) = p so that the theorem follows. 24) p > 2, (p - 1),tk, k = ptko, p,tko. From
and (p - 1),tko, we see that Xk runs over at least (p - 1)/(p - l,ko) (> 1) incongruent numbers (modp). Therefore X~
+ ... + x~,
gives min (
p-l (p - 1, k o)
+ (P-l
(p - 1, k o)
-
1) (s - 1) pY) ,
incongruent numbers modpY. From s - 1 ~ 3k
~
2pk -p-l
pY
~ -----~
1 p-l 2 (ko,p - 1)
pY - 1 ----_p_-_l__ l (ko,p - 1)
we see that x~ + . . . + x~(p,t Xl, ... ,X.) gives pY incongruent numbers. The proof of the theorem is complete. 0
18.4 Elementary Methods In the study of Waring's problem an elementary method usually gives rather poor results. We now introduce several examples which prove the existence of upper bounds for G(k) and g(k) for some special k. Sometimes we can even determine explicitly such an upper bound, but such a result will not be sharp. From Theorem 8.7.8 we already have that g(2) = 4.
500
18. Waring's Problem and the Problem of Prouhet and Tarry
Theorem 4.1. g(4)
~
50.
Proof We start with the identity 6(a 2
+ b 2 + c2 + d 2 )2 = (a + b)4 + (a - b)4 + (c + d)4 + (c - d)4 + (a + C)4 + (a - C)4 + (b + d)4 + (b - d)4 + (a + d)4 + (a - d)4 + (b + ct + (b - C)4.
Since a 2 + b 2 + c2 + d 2 can represent any positive integer, it follows that the left hand side of the identity represents 6x 2 where x is any integer. Now any integer n can be written as n
= 6N + r,
r
= 0, 1,2,3,4,5
so that n = 6(xi
+ x~ + x~ + x~) + r.
By the identity 6xi is representable as a sum of 12 biquadrates. Therefore n is the sum of at most 4 x 12 + 5 = 53 biquadrates. We take one further step. Any n ~ 81 is expressible as n
= 6N + t
where N~O, and t=0,1,2,8i,16 and 17 corresponding to n=:0,1,2,3,4,5 (mod 6). But 17 = 24
+ 1.
Therefore, following the method above, if n ~ 81, then it is the sum of 4 x 12 + 2 = 50 biquadrates. We can deal with n ~ 80 easily: If n ~ 50, then trivially n = n ·14. If 50 < n ~ 80, then n = 3 . 24 + (n - 48) . 14 and this is the sum of 3 + n - 48 < 50 biquadrates. D The same method together with the identity 5040(a 2
+ b 2 + c2 + d 2 )4 =
6L(2a)8
+ 60L(a±b)8 + L(2a ± b ± C)8
+6L(a±b±c±d)8,
(2)
can be used to prove that g(8) < 00. In this identity there are 840 8th powers on the right hand side, and since every n ~ 5039 is expressible as a sum of at most 273 numbers 18 and 28 , we see that g(8)
Theorem 4.2. G(3)
~
13.
~
840g(4)
+ 273 ~ 42273.
501
18.4 Elementary Methods
Proof We start with the identity 4
L «Z3 + Xi)3 + (Z3 -
X;)3)
= 8z 9 + 6Z 3(Xi + X~ + X~ + x~).
(1)
i= 1
If a number is expressible as (2)
then from (1) this number must be a sum of 8 cubes; this is because m is expressible as xi + x~ + x~ + x~, and Xi ~ Z3. Let z be a positive integer congruent to 1 (mod 6). We denote by /z the interval (3)
Clearly, for sufficiently large z, we have q>(z
+ 6) <
t/I(z),
(4)
that is the intervals /z overlap. Hence, for sufficiently hirge n, there must exist z such that (3) holds. We define r, sand N as follows: n
== 6r (mod Z3),
1 ~ r ~ z3,
n
== s + 4 (mod 6),
o ~ s ~ 5,
Then
so that (5)
We now prove that n - N is expressible as (2). Now n- N
== 6r - (r + 1)3 - (r - 1)3 + 2r 3 == 0 == 8z 9 (mod Z3),
and n - N == s
+4 -
(r
+ 1) -
(r - 1) - 2(Z3 - r) - sz
== s + 4 - z(s + 2) == 2 == 8 == 8z9 (mod 6), so that n - N - 8z 9 is a multiple of 6z 3 , that is
n=N
+ 8z9 + 6mz 3 •
The theorem then follows from 0 ~ m ~
Z6,
which is a consequence of (5).
D
502
18. Waring's Problem and the Problem of Prouhet and Tarry
Theorem 4.3. g(3)
~
13.
Proof 1) First we prove that qJ(z + 6) that
~
",(z) for Z
~
373, or when (
~
379 we have
or 6)9 14 ( 1-t
Now (1 - l5)m
~
~
3 128 1 12+-+-+-. t3
t6
(6)
t9
1 - ml5 whenever 0 < 15 < 1, so that ( 1_
~)9,~ t
.
1 _ 54. (
We see therefore that (6) will follow if we' have 54) 14 ( 1 - t
~
3 128 1 12+ - + - + -9 , 3 t
t
(6
and hence if we have 3 128 2(t - 7 x 54) ~ 2" + - 5 t
t
1
+ 8"' t
But t ~ 379 = 7 x 54 + 1 so that (6) is proved. So when z ~ 373 the various intervals /z overlap. This means that when
n must lie in one of the intervals /z. Also 1025 > 14 (373)9 so that any integer exceeding 1025 is expressible as a sum of 13 cubes. 2) We next prove that numbers not exceeding 10 25 are also sums of 13 cubes. From tables it is known that all numbers up to 40000 are sums of 8 cubes with the two exceptions of23 and 239 which require 9 cubes. Thus, if240 ~ n ~ 40000 then n is the sum of 8 cubes. Now if N ~ 1 and m = [N l / 3 ], then
N - m 3 = (N l /3)3 _ m 3 ~ 3N2/ 3(Nl /3 - m) <
3~/3.
Suppose that and let
n = 240 + N, Then N = m 3 + Nt.
m = [N l /3],
0< Nl < 3N 2/ 3,
+ N2,
ml = [N~/3],
0< N2 < 3Ni /3 ,
Nl = mi
503
18.5 The Easier Problem of Positive and Negative Signs
Therefore
n = 240
+ N = 240 + Ns + m 3 + mi + m~ + m~ + m!.
From
we have 240 and so 240
+ Ns
~
240
+ N5 <
35240 < 40000,
is the sum of 8 cubes. The theorem therefore follows.
D
From the identity
we can prove that g(6)
~
184g(3)
+ 59 ~ 2451.
18.5 The Easier Problem of Positive and Negative Signs Denote by v(k) the least natural number s such that, for any integer n, the equation
is soluble. Here
Xi
are integers and we can attach any v(k)
~
± sign for them. Clearly
g(k).
Indeed the problem of the existence of v(k) is quite easy. Theorem 5.1. v(k) ~ 2k -
1
+ tk!.
We shall require the following: Theorem 5.2. Let
AI(x) = I(x Then
where d is an integer.
+ 1) -
I(x) ,
504
18. Waring's Problem and the Problem of Prouhet and Tarry
If f(x) is a polynomial of degree k with leading coefficient a, then Llf(x) is a polynomial of degree k - 1 with leading coefficient ka. The theorem follows from repeated applications of this fact. 0 ProofofTheorem 5.1. We can consider Llk-1Xk as a sum of2 k - 1 terms of ± Xk. Now any integer n is expressible as
n - d= k!x
+ I,
that is
From
the theorem follows. Theorem 5.3. v(k)
~
0 G(k)
+ 1.
Proof We take y sufficiently large so that n + yk exceeds a certain sufficiently large number. From the definition of G(k) we have n+yk-xk 1
and hence the result.
+ ...
+~G(k)'
0
Theorem 5.4. v(2) = 3. Proof We have, by Theorem 5.1, v(2) ~ 3. But 6 cannot be a sum of two squares and moreover the difference between two squares is either odd or a multiple of 4. Therefore v(2) > 2. 0
Theorem 5.5. v(3) is either 4 or 5. Proof Since n 3
so that v(3) Next
~
-
n is divisible by 6 we can let n 3
-
n
= 6x. Now
5. y3 == 0, 1 or - 1 (mod 9)
so that, if n v(3)
~
4.
=
9m
± 4,
then n cannot be the sum of three cubes. Therefore
0
It is an open problem whether v(3) is 4 or 5, and it is conjectured that v(3) = 4. It has been verified by Chao Ko that every integer with absolute value ~ 100 is the sum of four cubes.
505
18.(; Equal Power Sums Problem
Theorem 5.6. v(4) is either 9 or lO. Proof Consider the identities:
+ 4 = 2(2x + 3)4 + (2x + 6)4 + 2(2x2 + 8x + 11)4 - (2X2 + 8x + lO)4 - (2X2 + 8x + 12)4; 48x - 14 = 2(2x + 5)4 + (2x + 8)4 + (x 2 + 6x + 9)4 + (x 2 + 6x + 12)4 - (x 2 + 6x + 8)4 - (x 2 + 6x + 13)4; 24x = (4y + 11)4 + (2y - 87)4 + (y - 9)4 + (y - 41)4 + (y - 83)4 + (y + 125)4 + (y2 + 603)4 + (y2 + 625)4 _ (y2 + 602)4 _ (y2 + 626)4, 48x
where y = x - 10319691; 24x - 8
= (4y + 11)4 + (2y
+ (y + 883)4 + (y - 933)4 + (y - 975)4 + 39851)4 + (y2 + 39873)4 _ (y2 + 39850t
- 87)4
+ (y + 1017)4 + (y2 _ (y2 + 39874)4,
where y = x - 120858614086. We can replace x by - x and obtain similar identities for 48x - 4, 48x + 14, 24x + 8. Now it is easy to see that if n is a multiple of 8, then n is the sum of 10 biquadrates. If n is not a mUltiple of 8, then we can write n as n
= 48z + y,
- 24 < y < 24,
and we can easily prove that there are integers Xl> X2, X3 such that y ± xi
± xi ± xj == ± 4, ±
14 (mod 48).
This proves that v(4) ~ lO. From ± y4 == 0, ± 1 (mod 16) we see that numbers of the form 16x + 8 require at least 8 biquadrates to represent them, and that they must all take the same sign. But numbers such as 24, lO4 cannot be expressed this way. Therefore v(4) ~ 9, and the proof of the theorem is complete. 0
18.6 Equal Power Sums Problem Let N(k) be the least integer s such that there exist Xl, .•. , xs; Yl>'" ,y., but Yl>'" ,Ys not merely a permutation of Xl>' .. , x., with the property that Xl
+ ... + Xs = Yl + ... + y., (I)
506
18. Waring's Problem and the Problem of Prouhet and Tarry
We also let M(k) denote the least s so that (1) holds, and furthermore, (2) Theorem 6.1. M(k) ;;:: N(k) ;;:: k
+ 1.
Proof From
X~
+ . . . + x~ = y~ + . . . +
Y:,
we have
so that Yl>' .. ,Yk is only a permutation of Xl>' .. , Xk'
D
Theorem 6.2. N(k) ~ M(k) ~ 2k.
Proof Let Xl>"" Xs;Yl>'" ,Ys be solutions to (1) and (2). Then (3) i= 1
i= 1 s
L ((Xi + d)k+2 + y~+2)"# L (X~+2 + (Yi + d)k+2).
(4)
i= 1
i= 1
The proofs of these two formulae follow from the expansions of (3), (4) and applying (I), (2). Thus, if M(k) exists, then taking s = M(k) we have M(k + 1) ~ 2M(k). But M(1) = N(I) = 2, so that the theorem follows by mathematical induction. D Theorem 6.3. N(k) ~ tk(k
+ 1) + 1.
Proof Suppose that n > s! Sk. Let ai (i = 1,2, ... ,s) run over 1,2, ... ,n. Then there are nSsets al> a2, ... ,as. Each fixed set al> a2, ... ,as has s! permutations. It follows that among the nSsets al> a2,' .. , as there are at least nS/s! sets in which every set is a permutation of a certain other set. Let
h = 1,2, ... ,k.
Then Therefore there are at most k
IT (sn h h= 1
s
+ I) < skntk(k+ 1)
507
IS.7 The Problem of Prouhet and Tarry
sets of different Sl (a),
(5)
s2(a), ... , sk(a).
Take s = tk(k + 1) + 1. Then, from n > s!s\ we have
Therefore there are at least two different sets a1> a2, ... , as such that (5) takes the same values. Since these two sets are not permutations of each other, it follows that N(k) ~ s, and the theorem is proved. 0 We now write to represent (l) and (2). From Theorem 6.1 and the following examples, we have: Theorem 6.4. If k
~
9, then M(k)
=
N(k)
=
k
+ 1.
[0,3]1 = [1,2]1>
[1,2,6]2 = [0,4,5]2' [0,4,7,11]3 = [1,2,9,10]3, [1,2,10,14,18]4 = [0,4,8,16,17]4, [0,4,9,17,22,26]5 = [1,2,12,14,24,25]5, [0,18,27,58,64,89,101]6 = [1,13,38,44,75,84,102]6, [0,4,9,23,27,41,46,50]7 = [1,2,11,20,30,39,48,49]7, [0,24,30,83,86,133,157,181,197]8 = [1,17,41,65,112,115,168,174,198]8, [0,3083,3301,11893,23314,24186,35607,44199,44417,47500]9
= [12,2865,3519,11869,23738,23762,35631,43981,44635,47488]9' []
IS.7 The Problem of Prouhet and Tarry In this and the next sections we shall prove that
M(k)<;(k+l)
IOg~(k + 2)J ) ( [10g(I+D +1
-k'logk.
Actually our eventual result gives even more than this. We first prove several
18. Waring's Problem and the Problem of Prouhet and Tarry
508
lemmas. In this and the next sections the constants ct. C2, ••. as well as the constant implied by the O-symbol depend only on k. Moreover, Cl, C2," • are positive. Theorem 7.1 (Bunyakovsky-Schwarz). Let ai, bi (i = 1,2, ... , n) be real numbers. Then
with equality sign only when - = - = ...
bl
b2
=
bn
Proof The required result follows at once from
i<j
Theorem 7.2. Let H be any given number. Then there exists a set ofpositive integers ai, a2, ... , ak depending only on k and H such that the product of the terms of the main diagonal of the determinant
a~-l
... a:- l
is greater than H times the sum of the absolute values of the remaining terms of the expansion of D k. Proof We use mathematical induction to prove the theorem. Suppose that j ~ k and let C{Jj(at. . .. , aj) denote the product of the terms of the main diagonal of D j minus the sum of the absolute values of the remaining terms of the expansion of D j • Then clearly we have
where", is a polynomial in aj of degreej - 2. From the induction hypothesis we can take at. ... ,aj-l so that C{Jj-l(at. ... ,aj-l) > O. For this set at. ... ,aj-l we can clearly set aj so that C{Jj> O. But C{Jl(al) = 1, so that the theorem is proved. D Theorem 7.3. Let at. ... , ak be a set ofpositive integers satisfying Theorem 7.2. Let Q ~ 1 and Xl," ., X k be positive integers belonging io the intervals (i= 1,2, ... ,k).
Denote by N the number of sets (Xt. ... , X k) such that X k1
+ ... + X k'k
X k1 - l
+ ... + X kk- l , ... , X 1 + ... + X k
509
18.7 The Problem of Prouhet and Tarry
lie in intervals with lengths O(Qk - 1), O(Qk - 2), ... , O(Q), O( 1)
respectively. Then N
= 0(1).
Proof Let (Xl>' .. , X k) and (X'l' ... , X~) be two sets which satisfy the conditions of the theorem. Then
Let Y i = Xi - X;. Then AllYl
+ ...
+AlkYk = O(Qk-l),
so that A .. IJ
=
X~-i J
+ X~-i-1X'. + ... + X,.k-i J J J
(1 ~ i,j ~ k).
Thus
The ratio of the product of the terms of the main diagonal of the determinant IA k- i + d to that of Dk in the previous theorem is clearly greater than klQk-l+k-2+ ... +2+l
= klQtk(k-l).
Also the ratio of the absolute value of each remaining term in the expansion of IA k- i+ 1,A to the corresponding absolute value term for Dk is smaller than 2tk(k-l)k lQtk(k-l).
We now take H
= 2tk(k-l) in Theorem 7.2, so that we have
It is then easy to see that O(Qk-l)
A 12 "'A lk
. . . . . . . . . . . . . . . . . .. = O(Qtk(k-l». 0(1)
Ak2 ... Akk
Therefore Y l = 0(1).
510
18. Waring's Problem and the Problem of Prouhet and Tarry
Similarly we have Y2 = 0(1),
The theorem is proved.
... , Yk = 0(1).
D
Theorem 7.4. Suppose that the conditions in Theorem 7.3 are satisfied. Let A1 ;;:: 0, A2 ;;:: 0, ... ,Ak ;;:: 0. Then the number of sets (Xl> ... , X k) such that X~
+ ...
+X~, X~-1
+ ...
+X~-1,,,,,X1
+ ...
+Xk
lie in intervals with lengths
respectively is
Proof Since an interval with length 0(Qk-i+A k- i+1 ) can be divided into O(Q"k-i+l) intervals with lengths O(Qk-i), the required result follows at once from Theorem 7.3. D Now let fJ = kj(k + 1) and a1, ... , ak + 1 be a set of positive integers satisfying the conditions of Theorem 7.2 (where we have replaced k by k + 1). We suppose that (1 ~ u ~ k
+ i,
1 ~ v ~ I).
Denote by r(n1' ... ,nk) the number of solutions to the system k+ 1 I (l ~ h ~ k). y~v = nh
L L
u= 1 v= 1
We now prove the following theorems: Theorem 7.5. There exists a set of integers N 1, ... ,Nk such that
Proof The numbers of different sets (Yuv) must be 1 k+ 1 I au QPV-l;;:: C2Q(k+1)(1+P+·oo+pl-l) 2 u=1 v=1 = C2 Q(k + 1)2(1- PI).
;: - n n
Since Inhl ~ C3Q\ the number of different sets (nh) is
511
18.8 Continuation
Therefore there must be a set of integers r(N1>"', N k ) >c:;..-
C2
N1>' .• , Nk
such that
Q(k+ 1)2(l_pl)_tk(k+ 1).
D
C4
Theorem 7.6. The number of solutions to the system k+ 1
I
L L y~v = Nh
(l ,,;;, h ,,;;, k
+ I)
u= 1 v= 1
is at most
Proof From k+1
k+1
I
L y~l = Nh - L L y~v
(l,,;;,h,,;;,k+l)
and (l ,,;;, u";;' k
+ I,
1 ,,;;, v ,,;;, I),
we see that 1 + ... + yk+ 1 Yk+ 11 k+1,l' yk11 + ... + ykk+1,l''''' Y 11 + ... + y k+1,l
lie in intervals with lengths
respectively. We take A. u = ufJ - (u - I)
~
0 in Theorem 7.4. Then, from
k+1
L:
{ufJ - (u - I)} = tfJ(k + I)(k + 2) - tk(k + I) = t k,
we see that the number of sets (Y11,'" ,Yk+ l,d is O(QkI2). Corresponding to each fixed set (Y11,' .. ,Yk+ 1,1) the sums k+ 1 y 12 + " ' + yk+1,2 Yk12+ 1 + " ' + yk+1,2'"''
clearly lie in intervals with lengths O(Q(k+ 1)P2),
O(Qk P2 ),
... ,
O(QP 2)
respectively. Replacing Q by QP in Theorem 7.4, we see that the number of different sets Y12"" ,Yk+1,2 is O(QkPI 2). Continuing this way the theorem is proved. D
18.8 Continuation Theorem 8.1. Denote by W(k,j) the least integer s such that the system (l ,,;;, h ,,;;, k),
512
18. Waring's Problem and the Problem of Prouhet and Tarry s
L ~/1"# L X~q+1, i= 1
(p "# q, 1 ~ p, q ~j)
i= 1
is soluble in integers. Then
([ IOg~(k++D + 2)J
+ I)
W(k,j)'; (k
IOg(1
) I .
Proof This theorem is an immediate consequence of the following theorem.
Theorem 8.2. Let
([ IOg~(k++D + 2)J
, ;;. (k
+ I)
)
log (I
I
.
Then, given any j, there are integers
such that the system
(l
~
h
~
k),
is soluble. Proof Let r(Nb' .. , N h ) be as defined in the previous section. By Theorem 7.5 there are N 1 , ••• , Nh such that
Corresponding to a set of solutions (Yuv) to the system k+ 1
I
L L Y:v =
Nh
u= 1 v= 1
there is clearly a number M such that k+ 1
I
L L y~:l =
M.
u=l v=l
If such an M has only e ( ~ j - 1) different values, say M b M 2, .•. ,Me, then, by Theorem 7.6, the number of solutions to the e-system
513
Notes
is at most cseQtk(k+ l)(l-P'). From the definition of Mi the number of solutions to this e-system is at least r(Nb' .. , N k ). On the other hand, if we take
~
I > {lOg (k
+ 2) flOg (1 + ~)} ,
then, for large Q, we have
giving a contradiction. Our theorem is proved.
D
Notes 18.1. Concerning the value of g(k) in Waring's problem there is the following result: When k > 6 and
we have
(See Hua [30].) Moreover K. Mahler [41J proved that there exists a constant ko such that the above inequality holds whenever k > k o. Unfortunately the method which is based on Roth's theorem is ineffective in the sense that it does not allow us to make a computation for the value of k o. J. R. Chen [18J proved that g(5) = 37. R. Balasubramanian proved that 19 ~ g(4) ~ 21 (see [5J). 18.2. I. M. Vinogradov [61J has improved on his own result on G(k) in Waring's problem: For sufficiently large k we have G(k) < k(210gk
+ 410glogk + 2 log log logk + 13).
Chapter 19. Schnirelmann Density
19.1 The Definition of Density and its History The purpose of this chapter is to prove the following two important results: "There exists a positive integer c such that every positive integer is the sum of at most c primes." "Let k be any positive integer. Then there exists a positive integer Ck (depending only on k) such that every positive integer is the sum of at most Ck k-th powers." These two rf':::;Jlts are obviously related to the Goldbach problem and the Waring problem. Indeed we can even say that these two results are the most fundamental first steps towards these two famous problems. We shall call them the Goldbach-Schnirelmann theorem and the Waring-Hilbert theorem respectively. In this chapter we introduce the notion of density created by Schnirelmann. This notion is extremely elementary, and yet it allows us to establish the two historic results. Our proof of the Goldbach-Schnirelmann theorem differs slightly from Schnirelmann's original proof in that we replace the application of Brun's sieve method by Selberg's sieve method. Again our proof of the Waring-Hilbert theorem is not the original proof due to Hilbert, nor that due to Schnirelmann. We shall give instead the proof by Linnik, given in 1943, with some simplifications and modifications. In both these proofs the notion of Schnirelmann density occupies an important place. The definition of density is as follows: Definition 1. Let ~ denote a set of (distinct) non-negative integers a. Denote by A(n) the number of positive integers in ~ which do not exceed n; that is A(n)
L
= 1
1.
~a~n
Suppose that there exists a positive number IX such that A(n) ;;:: IXn for every positive integer n. Then we say that the set ~ has positive'density, or that ~ is a positive density set. The greatest IX with this property is then called the density of ~. Obviously we have the following simple properties: (i) Since A(n) ~ n, it follows that IX ~ 1. (ii) If IX = 1, then A(n) = n for all n and so ~ must include all the positive integers. Exercise. Let't ;;:: 1. Determine the density of the set 1 + ['ten - I)J, n = 1,2, ....
515
19.2 The Sum of Sets and its Density
19.2 The Sum of Sets and its Density We now introduce the symbols m, b, B(n), {3 and (t, c, C(n), y. The definitions for them are analogous to those for~, a, A(n), oc: thatisbEm, B(n) = L1':;b':;n 1, and{3 is the density of the positive density set m. Definition. The set of integers of the form a + b (aE~, b Em) is called the sum of the sets ~, m, and is denoted by (t. We also write ~ + m = (t. Theorem 2.1. Let 0 E ~ and (t
= ~
+ m.
Then y ;;:: oc
+ {3 -
oc{3.
Proof Since {3 > 0 we see that 1 Em. The following three types of numbers are positive integers in (t; they are all different and are at most n. (i) In m we write b 1 = 1, bz , ... ,bB(n), the numbers being arranged in increasing order. Since 0 E ~ we see that b1, bz, ... ,bB(n) are members of (t, and that there are B(n) such members. (ii) Corresponding to any v where 1 ~ v ~ B(n) - 1, the various numbers a + bv , with a E ~ and 1 ~ a ~ bv + 1 - bv - 1, are distinct positive integers not exceeding n in the set (t. This is because
and
so that
It is clear that the two types of numbers in (i) and (ii) are mutually distinct. For . each fixed v (l ~ v ~ B(n) - 1), there are A(b v + 1 - bv - 1) such numbers a + bv in (t. (iii) For a E~, 1 ~ a ~ n - bB(n), the numbers a + bB(n) are distinct positive integers not exceeding n in the set (t. Since a + bB(n) ;;:: 1 + bB(n) we see that these numbers of type (iii) are different from those in types (i) and (ii), and there are A(n - bB(n» such numbers a + bB(n). From the results of (i), (ii) and (iii) we have B(n)-l C(n) ;;:: B(n) + L A(b v + 1 - bv - 1) + A(n - bB(n» v=l
B(n)-l ;;:: B(n) + L oc(b v + 1 - bv
-
1)
+ oc(n -
bB(n»
v= 1
= B(n) + oc{bB(n) - b1 - (B(n) - 1) + n - bB(n)}
= B(n) + oc{n
- B(n)} ;;:: (l - oc){3n
= n(oc + (3 - OC(3),
+ ocn
516
19. Schnire1mann Density
and hence C(n) --
n
~
0(
+ 13 - 0(13,
y
~
0(
+ 13 - 0(13· D
Note: This theorem is not the best concerning the density of the sum of sets. The sharpest result should be y ~ min (1, 0( + 13), a theorem proved by Mann in 1942. The proof of Mann's theorem is more complicated, and since there is no fundamental improvement concerned with the applications to the principal results in this chapter, we do not include it in this book. Let us now take ~ and ~ both to be sets of positive integers congruent to 1 mod q, and assume also that 0 E~. Then ~ + ~ include all the positive integers congruent to 1, 2 mod q. Obviously the densities of ~ and ~ are l/q while the density of ~ + ~ is 2/q. Therefore the result of Mann cannot be improved.
Theorem 2.2. Let 0 E ~ and 0( + 13 contains all the positive integers.
~
1. Then y = 1; that is the set <£: =
~
+~
Proof Suppose that y < 1, so that there is a least positive integer n ¢ <£:. From 13 > 0 We deduce that 1 E~, and from 0 E ~ we further deduce that 1 E <£:, so that n ~ 2. Again, from 0 E ~ we have n ¢ ~ also. Consider the following natural numbers a and n - b not exceeding n - 1:
a,
1
~a~n-l,
n - b,
1
~b~n-l,
aE~,
Each a is different from n - b, because a = n - b would give n = a + bE<£:, a contradiction. Also since both a and n - b do not exceed n - 1, the number of such numbers is at most n - 1. On the other hand the number of such numbers a and n - b is A(n - 1) + B(n - 1). From A(n - 1)
~
O(n - 1)
B(n - 1)
= B(n) ~
and f3n > f3(n - 1),
we arrive at A(n - 1)
+ B(n -
1) > O(n - 1)
+ f3(n
- 1)
= (0( + f3)(n
- 1)
~
n - 1,
contradicting our earlier claim that the number of numbers a and n - b does not exceed n - 1. The theorem is proved. D Theorem 2.3. Let
~
have density So
0(
= 2[
< 1 and let log2 ] -log(1 - O()
+ 2.
517
19.2 The Sum of Sets and its Density
If 0 E~, then every positive integer is the sum of So numbers in ~. ljO ¢~, then every positive integer is the sum of at most So numbers in ~. Proof The second part of the theorem follows trivially from the first part by artificially inserting 0 into the set ~. We now prove the first part of the theorem - that is we shall assume that 0 E ~. Let ~h = ~ + . . . + ~, the h-fold sum of~. We denote by rxh the density of ~h and we proceed to prove by induction that rxh ;;:: 1 - (1 - rx)h. When h = 1 we have rxl = rx. From ~h = ~ + ~h - 1, Theorem 2.1 and the induction hypothesis, namely rxh-l ;;:: 1 - (1 - rx)h-l, we have
rxh;;::
rx
+ rxh-l
- rxrxh-l
= rx + (1
- rx)rxh-l
;;::rx+(l-rx){I-(l-rx)h-l}
= 1 - (1 - rx)h. Therefore rxh ;;:: 1 - (1 - rx)h holds for h So
-=
[
2
log2 -log(I -
] rx)
= 1, 2, 3, .... Now
+1>
log2 -log(l -
, rx)
so that (l -
rx)'o
12 :::;:;
(1 -
log2 rx)-Iog(l-~)
=e
log2 log(l-~)
----·Iog(l-~)
1 = 2'
and hence
rxso /2 ;;:: 1 - (1 - rx)'o/2 ;;:: 1 -
t = t.
Since 0 E ~so/2 the set ~so = ~so/2 + ~so/2 must, QY Theorem 2.2, include all the positive integers and therefore every positive integer is expressible as the sum of So members of ~. 0 Theorem 2.4. Let ~* be a collection of non-negative integers, with multiplicity of membership being allowed. Let ~ be the largest set from ~* without multiplicity of membership. Let rea) denote the multiplicity of a in ~. Suppose that
1 n holds for all n ;;:: 1. Then
_C=-"'=~L"':":'"
~
n,--r.,....(a_))_'
r2(a)
~ a'
has a positive density
(> 0),
rx ;;::
rx/.
Proof From the Bunyakovsky-Schwarz inequality (Theorem 18.7.1) we have
C"'~"'n Y: ;: "'~"'n rea)
1
r2(a) 1
"'~"'n 12 = A(n) "'~"'n r2(a), 1
518
19. Schnirelmann Density
so that A(n) 1( -n- ~ ~
L
r(a)
)2/,
1 ~a~n
The theorem is proved.
L
r2(a) ~
(X'.
1 ~a~n
D
19.3 The Goldbach-Schnirelmann Theorem In §§3 - 5, the letters Ch §§3 - 5 is to prove
C2, ...
denote absolute positive constants. The purpose of
Theorem 3.1. There exists a positive integer c such that every integer greater than 1 is the sum of at most c prime numbers. We define m:* to be the collection of numbers 1 together with Pl + P2 where Ph P2 run through all the prime numbers. Note that members of m:* may have multiplicity. We also define m: to be the largest set from m:* without multiplicity of membership. In order to prove Theorem 3.1 it suffices to prove Theorem 3.2. m: has positive density
Cl.
By Theorem 2.3 any positive integer m is expressible as a sum of at most So members ofm: (that is, a sum of terms involving 1 and numbers of the formpl + P2). This implies that m is the sum of at most 2so numbers which are primes or 1. Therefore, for any n > 2, we have n = 2 + (n - 2) = 2 + b . 1 + where the number of primes P being summed is at most 2so - b. Since 2 + b is expressible as a sum of at most b + 1 primes, it follows that n is expressible as a sum of at most 2so + 1 primes. Therefore Theorem 3.1 follows from Theorem 3.2. We now let r(l) = 1 and r(a) be the multiplicity of a in the collection m:*, that is
LP,
I, r(a)
=
if a
L
{
1,
= 1,
if a ~ 2.
Pl +P2=a
Ll.,;;a
Following Theorem 2.4 our aims are to find a lower bound for <';" r(a) and an upper bound for <';a<';" r2(a). The former task is not at all difficult while the latter task is the main concern of the next section.
Ll
Theorem 3.3.
If n ~ 2,
then
L
r(a) ~
1 <';a<';"
Proof Let n
~
n2 C2-2 -·
log n
4. From Theorem 5.6.2 we have
L 1 ~a~n
r(a)
= 1+
L
L
4~a~n
Pl+P2=a
(1)
519
19.4 Selberg's Inequality
~ (C3~/IOg~)2 ~ c~~. 2
2
4 log n
If n = 2 or 3, then 2:r(a) = 1, so that the theorem follows by taking C2
.
(c~ log2 2 log2 3)
= min 4' -4-' - 9 - · 0
From Theorem 2.4 and r(l) = 1 we see that the crux of the matter is now to prove Theorem 3.4.
If n ~ 2,
then (2)
In other words if Theorem 3.4 is proved, then from
and Theorem 2.4, we see that Theorem 3.2 follows. It remains therefore to prove Theorem 3.4.
19.4 Selberg's Inequality Although we can do without the following theorem in this section, nevertheless the reader should know the result Theorem 4.1. Let ai > 0 (i = 1,2, ... , n)andb i (i = 1,2, ... ,n) befixedrealnumbers. The minimum value of2: 7= 1 aix ?, subject to the constraint 2: 7= 1 biXi = 1, is given by (2:7=1 bNai)-l. Moreover, the minimum value is attained when
bi ai
Xi=~'
i = 1,2, ... ,n.
2:~ i=l
ai
Proof From the Bunyakovsky-Schwarz inequality (Theorem 18.7.1) we have
520
19. Schnirelmann Density
Therefore n
L aiX ; ~ - n - - i=1
(1)
b~a:-1
"
L.-
I
I
i= 1
Again, by Theorem 18.7.1, we know that the condition for equality in (1) is that there exists a real number to such that r;;.x.I
YUi
1
= tob·-r:: I
(i= 1,2, ... ,n),
v ai
that is (i
= 1,2, ... , n).
Hence 1=
n
n
i= 1
i= 1
L bixi = L b;ai- 1t o,
or to=-n--"L.- b~a.-1 I I i= 1
Therefore (i= 1,2, ... ,n).
(2)
"L.- b~a:-1 I I i= 1
The theorem is proved.
0
Theorem 4.2 (A. Selberg). Let {b} be a set of M integers such that the number of integers b in the set which are divisible by a positive integer k is
Lb 1 = g(k)M + R(k),
(1)
klb
where R(k) is a certain remainder term, and g(k) is a positive valued multiplicative function satisfying g(p) < 1. Let N~ denote the number of integers b in {b} which are not divisible by any prime ~ Then
e.
where
521
19.4 Selberg's Inequality
f{k) = Ak =
~J1(d)lg(~)*,
J1(k) L J12(m) f{k)g(k) 1 ';;m';;l;/k f{m) (m,k)= 1
(4)
I
L J1 2(m). 1 ';;m';;l; f{m)
(5)
Proof Let 1 = ,1.1, ,1.2, ... ,,1.[1;) be real numbers. Since the least common multiple of k1' k2 is k1k2/(k1> k 2), we have, from (3), that
L Ak.Ak2 Lb 1 = L Ak.Ak2 1';;kl.k2';;1; kdb 1';;kl.k2';;1; k21b
1
L Ak.Ak2 ( g { k1k2 } M ';;k lo k2';;1; (k1' k 2)
+R{
Lb I
k1k2 }) , (k1> k 2)
e
where plb => p > means that the prime divisors of b ar:e greater than Theorem 6.2.4 we have NI;
~ MQ + 1';;kL.k,';;1; Ak.Ak2R {:1;}, 1,
1
Q=
e. From (6)
2
L A A g(k1)g(k2) 1';;kl.k,';;1; kl k2g{(kb k 2)}'
From (4) and Theorem 6.4.1, we have
1,;;~,;;/(d)t';;~';;1; Akg(k)f
(7)
dlk
From (5) and Theorem 6.2.1 we see that ,1.1 = 1 (the reader can use Theorem 4.1 to prove that Q is actually the minimum for this choice of ,1.1>" . , Am). Let S
=
L
J1 2 (m). 1';;";';;1; f{m)
(8)
From Theorem 6.2.2 we see thatf{n) is also multiplicative so that, by (5) we have
*
When k is square-free,j(k) = (ljg(k» Oplk(l - g(p» > O.
522
19. Schnirelmann Density
L
Jl(mk) Jl(m)-l~m~l;lk sfimk) (m,k)= 1
L
Akg(k) = Jl(k) Jl2(m) sfik) 1 ~m~l;lk fim) (m,k)=l Jl(mk) Jl(m)--, 1 ~m~l;lk sfimk)
L
and hence, by Theorem 6.3.2, Jl(m)
L
sfim)
L
Akmg(km) =
1 ~k~l;lm
A,g(r).
1 ~'~l;
m!,
Therefore, by (7) and (8) we have
Q= The
requi~....d
L
1 ~d~l;
fid) {Jl(d)}2 sf(d)
= 12 s
L
1 ~dq
result follows from (6) and (8).
= : = ~.
Jl2(d) fid)
s
s
D
Theorem 4.3. Let the conditions in Theorem 4.2 hold. multiplicative function, and gl(P) = g(p), then
If glen)
is a completely
M
Nl;~----
L
gl(k)
We first establish the following:
Theorem 4.4. Let fin) be a completely multiplicative function satisfying 0 If f3n ~ 0, then
L
f3nf(n)
IT {1
- fip)}
-1
L
~
L
fin)
13m,
min
~n~l;
1
~
pl;';;=O>P!k m
where pi;';; => plkm means that n/m has only the prime divisors of k m. Proof
L
f3nfin)
IT {I -
fip)}
-1
=
L
00
IT L:
f3nfin)
p!k n
=
L
00
f3nfin)
IT L fipm) = L p!k n
=
L l~n~~
L r=l
p!,=o>p!k n
00
f3J(n)
f(nr) =
L l~n~~
00
f3n
L
,=1 p!,=o>p!k n
m=O
00
f3n
{fip)}m
m=O
L s=l
n!s
pl~ =o>p!k n
fis)
fir)
f(p) < 1.
523
19.4 Selberg's Inequality 00
Lfts)
f3n ?:-
L
s= 1
1
~n~,:
L 1
fts)
~s:::;~
nls
L
fts)
l"'s"'~
f3n
nls
pi; =>Plkn =
L 1 ~n~~
pi; =>Plkn
L
0
f3n·
nls
pi; =>Plkn
Proof of Theorem 4.3. We have, by (4),
+ J-l(p) = _1__ 1 =
f(p) = J-l(1) g(p)
g(1)
1 - g(p) .
g(p)
g(p)
If k is square-free, then, by Theorem 6.2.2,
2(k) ( ) ng1(p) _J-l_ = J-l2(k) gl p = J-l2(k)_:..:..plk~_ _ ftk) plk 1 - gl(P) {I - gl(P)} plk =J-l 2(k)gl(k)n{1-g1(p)}-1. plk
n
The above still holds when k Theorem 4.4,
L
n
(9)
= 1 and when k has square divisors. Therefore, by
J-l2(k) Irk) =
1 "'k"'~ J\
L 1 "'k"'~
?:-
L l"'k"'~
J-l 2(k)gl(k)n{1-g1(p)}-1 plk gl(k)
L J-l2(m). mlk pl~ =>plm
Let dk be the greatest square-free divisor of k, so that dklk. If p\!!-.-, thenplk and
dk
so pldk. Therefore dk is a number satisfying the condition on m, so that
J-l2(k) -k-?:- L gl(k). l"'k"'~ ft) 1H"'~ L
(10)
From (9) we see that J-l 2(k)lf(k) ?:- 0 and so, by (5) and (9) we have that
~ J-l2(k) _ J-l2(k) ~ n{l ( )}-1 . k ....". ftk)g(k) - ftk)gl (k) "" plk - gl P
1,1.1
When k = 1 or k is square-free, g(k) = gl(k), and if k is not square-free, J-l(k) = 0; therefore the above holds for all k. The theorem now follows from (10) and Theorem 4.2. 0 Theorem 4.5. Let A ?:- 0, M ?:- 3 and denote by n(A ; M) the number ofprimes between A and A + M. Then
524
19. Schnire1mann Density
n(A;M)
~
2M {1'+
10gM
o (IOgIOgM)}. 10gM
The implied constant here is independent of A and M. Proof Let
L
=
S(A;M)
1,
A+JM
so that n(A;M)
L
=
1 + S(A;M) ~
JM + S(A;M).
(11)
A
We now take {b} to be the set of all integers n satisfying A < n the notation of Theorem 4.3 we have
~
A
+ M. With
S(A;M) ~ N~,
uniformly in A. We now estimate
~
N~.
1 = [ A : MJ -
(12)
From
[~J = ~ + R(k),
IR(k)1 ~ 1,
A
we have gl(k) = 11k, giving
From Theorem 5.9.3 we have
( 1)-1
IT(1-g1(P»-1=IT 1-plk plk P
IT (1)-1 1-=O(logk).
~
p';;k
P
Hence
L 1R ( kk1k2k )1 IT(1-g1(P»-lIT(1-g1(P»-1 (1) 2)
1 ';;k!.k2';;~
=o(
plkl
plk2
L IOgk110gk2)=0(~210g2~). 1 ~klfk2~;
Therefore N~ ~
M log ~
+ 0(1)
+ O(~ 2 log2 ~).
Taking ~
= M! jlog2 M,
we have NMt/log2M
2M(
~ 10gM 1 + 0
M)) .
(lOg log 10gM
Substituting this into (11) and (12), the required result follows.
D
525
19.5 The Proof of the Goldbach-Schnirelmann Theorem
19.5 The Proof of the Goldbach-Schnirelmann Theorem Theorem 5.1.
If a ~ 2,
then r(a):::;:; c S -I -
a 2
Jl2(k)
-L-k - '
og a kla
Proof When a = 2 or 3 we have r(a) = 0, and if a is odd and a = Pi + P2 then either Pi or P2 is 2 so that r(a) :::;:; 2. Since the theorem is trivial for these values of a we shall assume that a ~ 4 and a is even in the following. We have r(a)
=
L
1:::;:;
L
L
1+
L
1+
1:::;:; S(a)
+ 2Ja,
(1)
PI+P2=a PI.P2> J~
where
L
S(a) =
1.
Pl+P2=a PI.P2> J~
We now define a set of integers bc
= c(a
- c) (c
= 1,2, ... ,a). If Pi + P2 = a
> Ja, thenpi(a - Pi) = P2(a - P2) = PiP2 is not divisible by any prime : :;:; Ja. Using the notation of §4, we have
andp1>P2
S(a):::;:; N~,
(2)
Denote by M(k) the number of solutions to the congruence x(a - x)
== 0
(modk),
O:::;:;x
Then
ct
1=
[~JM(k) + T(k) ,
cIa-c) =O(modk)
where 0 :::;:; T(k) :::;:; M(k). Therefore M(k) Lb1 :::;:;--a klb k
+ M(k)
and M(k) - - 1 M(k) = - a - M(k). [a] M(k) > (a) k k
Lb 1 ~ klb k
Let M(k) g(k) =-k-'
so that
(3)
526
19. Schnire1mann Density
Lb 1 = g(k)a
+ R(k),
(4)
klb
where IR(k)1
~
M(k)
~
k.
(5)
From Theorem 2.8.1 we see that M(k) is a multiplicative function of k and hence so is g(k). Also I, M(p) = { 2,
pia p,(a.
(6)
Therefore, by (3), pia, (7)
p,(a.
Since 21a we have g(2) = t and therefore 0 < g(p) < 1 so that Theorem 4.3 is applicable. If k = p~1 ... p~r, then from (3) and (6) we have
1 r
h(k)
~ksl)l (1 +as)=T' vs.(a
where r
h(p~' ... p~r) =
TI (1 + as),
Pl, ... ,Pr distinct primes.
(8)
s= 1
Vs.(a
From Theorem 4.4 we have
TI (1--1)-1 via
P
1 ( 1)-1
L gl(k)~ L h(k)-TI I - 1 ';;k';;~ 1 ';;k';;~ k via P 1 ~
L - L
l,;; kq
k
h(m).
mlk
vl;""vl a
If we write k = p~1 ... p~'q~1 ... q~u, where PT and qu are all distinct primes and PTla, qu,(a, then m can take all the numbers of the form m= where 0
~ Cl ~
p~1
al, . .. , 0
k ... p~'
~ Ct ~
= pal-C! ... pa,-c'qbl ... qbu 1
t
1
u ,
at. For such m we have, by (8), that
527
19.5 The Proof of the Go1dbach-Schnire1mann Theorem
Therefore, by Exercise 6.5.1, we have
,=
L k1 (1 + a1) ... (1 + a )(1 + b1) ... (1 + b
u)
t
l';;k';;~
Hence
~ c71og2~ {
p.2(k)}-1 L-. kla
Secondly, if k
= TIplkP c, then
TI {1 -
~
gl(P)} -1
(9)
k
from
{1 - gl(2)} -1{1 - gl(3)} -1
plk ~ 2.3
TI
5 ';;pl~
( 2)-1 < 1- 5
TI
{1 - gl(P)}-l
5';;plk 6
TI (1 + c) =
6d(k) ~ 6k,
plk
together with Theorem 4.3, (5) and (9), we have that
1 a p.2(k) Sea) ~ N~ ~ - . - 2 C7 log ~ kla k
L -- +
k1k2
L
1';;kl.k2,;;~(kl>k2)
6k 1 . 6k 2
1 a 112(k) ~-'-2-1'""-+ 36~6. C7 log ~ kla k
L
We take ~
= a1o,
D
and the theorem follows from (1).
Proof of Theorem 3.4. When n
L 1 ';;a';;n
r2(a) ~
1+
~
2, we have
L c; 4';;a';;n
a2 -4-
112(k ) 112(k ) L _1'""_1_ L _1'""_2_
log a kda
k1
k21a
k2
528
19. Schnirelmann Density
l:::;a~n
k,k2 (k •• k2)
Ia
n
k1k2 (kl> k 2) Since (kl> k 2) ~ min{kl> k 2} ~ Jk 1k 2, it follows that ,,2
-
n
2
2
"
n
r (a) ~Y"+ Cs -1- 4 L. k k 3/2 1 "'a"'n og n 1 "'k •• k2 "'n ( 1 2) L.
1
~ + C~~( ~ log4 n
_1_)2
'::1 k
k
3/2
n3
~ C4-4 -- '
log n
The theorem is proved.
D
Exercise 1. Let x, k, 1 be positive integers, and (I, k) = 1. Denote by n(x; k, I) the number of primes in the arithmetic progression kn + 1(n = 1, 2, ... ) not exceeding x, and let 0 < (j < 1. Prove that, for k < xo, n(x;k,/)~
2x
(
x q>(k) logk
1+0
((lOg log logx
where the implied constant depends at most on
X)2)) ,
(j.
Exercise 2. When p, p + 2 are both primes, we call them a pair of "prime twins". Denote by Z2(N) the number of pairs of "prime twins" not exceeding N. Prove that N Z2(N) ~ Cs - 2 - ' log N
and that the series
1
L-;, p'
p
where the summation is over all "prime twins" p*, is convergent.
19.6 The Waring-Hilbert Theorem In §§6 -7 theletters c, Cl> C2, ••• denote positive constants depending only on k. The constants implied by the O-symbol also depend at most on k. The purpose of §§6 - 7 is to prove
529
19.6 The Waring-Hilbert Theorem
Theorem 6.1 (Hilbert). Corresponding to each positive integer k there exists a positive integer c such that every positive integer is the sum of at most c k-th powers. We now define 21;" to be the collection of integers x~ + ... +.x;- where each Xm runs over all the non-negative integers. We define 21t to be the largest set of distinct elements from 21;". Let
The proof of Theorem 6.1 is divided into sections of a chain: Theorem 6.2.
If k
~
2, then 21q has positive density.
We see that Theorem 6.1 can be deduced at once from Theorem 2.3 and Theorem 6.2. We define rea) to be the number of solutions to
We first prove: Theorem 6.3.
If n ~
1, then
L
rea) ~ c2(k)nq /k •
1 ~a~n
Proof Clearly we can assume that n >
L
rea)
= -
Cl.
L
1+
O~a~n
1 ~a~n
Then
L
k =a xk+.··+x ,
~ -1
+
L
o ~x, ~(n/q)'/k
L
o ~X
D From Theorem 6.3 and Theorem 2.4 we see that the vital link in the chain is to prove: Theorem 6.4. If k
~
2 and n
~
1, then
L
r2(a) ~ c4(k)n 2q /k -
1•
l:S:a:S;n
If this theorem is proved, then Theorem 6.2 follows at once from Theorem 2.4 and Theorem 6.3. We now transform Theorem 6.4 to the following:
530
Theorem 6.5.
19. Schnirelmann Density
If k
;;:: 2 and P ;;:: 1, then
II 1
xto e21tixka
r C
!
~ cs(k)P2q
drx
-k.
o
We take P integer q,
=
[nl/k] so that, when n is large, C1Pk > n. We also note that, for any if q
=
0,
if q # O. o
From Theorem 6.5 we have
1,,;~,,;nr2(a) ~ O,,;a~qpkC~+ ...~x~!=a ly O'::::;Xi~P 1 ~i~Cl
-II f. ... f. e21ti(X~+ +x~!)aI2 1
...
-
X!=o
o
drx
XC! =0
1
I
= Ixto e21tiXk"12C1 drx
~ cs(k)p 2q-k
o ~
c4(k)n 2C !lk-l
giving Theorem 6.4. Our aim therefore is to prove Theorem 6.5. Exercise. Deduce Theorem 6.5 from Theorem 6.4.
19.7 The Proof of the Waring-Hilbert Theorem Theorem 7.1. Let X, Y;;:: 1, n be an integer, and q(n) denote the number of integer solutions to
(Ixml
~
X,
IYml
~
Y, m = 1,2).
(1)
Then q(n)
~
{
27 X 3 / 2 y 3 / 2 , 1
60XYL din
d'
if n = 0; if n #
o.
(2)
531
19.7 The Proof of the Waring-Hilbert Theorem
Proof 1) n = O. Here the values taken by Xl> X2 and Yl cannot exceed 2X + 1, 2X + 1 and 2Y + 1. When Xl> X2, Y1 are specified, Y2 can only take one value. Therefore q(O) ~ (2X + 1)2(2 Y + 1) ~ (3X)2(3 Y) = 27 x 2Y,
and similarly q(O) ~ 27 XY 2 , and hence q(O) ~ min(27 x 2Y, 27 Xy2) ~ )27 x 2Y . 27 XY 2 = 27 X 3/2y3/2. 2) n i= O. We can assume without loss that X integer solutions to XlYl
+ X2Y2
=
n
~
Y. Let ql(n) be the number of
((Xl>X2) = 1, IX21 ~ Ixd ~ X, IYml ~ Y, m = 1,2).
(3)
Clearly Xl i= 0, since otherwise X2 = 0 giving n = 0, contradicting our present hypothesis. Next, for a fixed set Xl, X2 with (Xl' X2) = 1, IX21 ~ IXll ~ Xwe denote by q2(n; Xl> X2) the number of integer solutions in Yl> Y2 for (3). From Theorem 1.8.2 we see that (3) is soluble, and ifi l , Y~ is a set of solutions, then all the solutions are given by t
integer.
Therefore Itl=
Y+ Y 2Y IY~ Xl- Y21 ~--=-, IXll Ixd
and hence the number of values taken by t does not exceed 2Y
4Y + X
IXll
IXll
2'-+I~
5Y
~-,
IXll
that is
Therefore ql(n)~ ~
5Y 21xd + 1 -~5Y l""lx,I""X IX21""lxd Ixd l""lxd""X IXll 5 y. 3 . 2X = 30XY.
L
L
L
It follows that, with the condition (Xl, X2) = 1, the number of solutions to (1) does not exceed 2 . 30XY = 60XY. Next, if(xl' X2) = d i= 1, din, then we let x~ = xt/d, x~ = x2/d, so that we now seek the number of integer solutions to .
and we see from the above that this number does not exceed 6~ . Y.
532
19. Schnirelmann Density
Therefore, when n =F 0, q(n) ~ 60XY
L -.1 din
The proof of the theorem is complete.
d
0
Theorem 6.5 is obviously a consequence of the following Theorem 7.2. Let k
~
2, andf(x) be a polynomial with degree k having integer valued
coefficients:
Then
(4) o
Proof When k
= 2, the left hand side of (4) is the number of integer solutions to
a2
=
0(1),
al
=
O(P),
1~m
~
4.
(5)
Let Xi - Yi = Z;, a2(xi + Yi) + al = Wi (1 ~ i ~ 4). We see that the number of solutions to (5) does not exceed the number of integer solutions to (Zi
= O(P), Wi = O(P), 1 ~ i
~
4).
(6)
If we denote by q(n) the number of integer solutions to
= O(P), Wi = O(P), m = 1, 2 where the constants implied by the O-symbol are the same as those of (6», then the number of solutions to (6) is Llnl ~C6PZ q2(n). From Theorem 7.1, we have
(Zi
=
0(P
6)+ 0 (p4
1
~dl'~~C6PZ d d dl~ I 1 2
(dl.dz) n 1 ~n~c6P2
1)
533
19.7 The Proof of the Waring-Hilbert Theorem
and the required result (4) follows. Suppose now that k ~ 3. We proceed by mathematical induction, and assume as induction hypothesis that the theorem holds when k is replaced by k - 1. From
£
£
Ix=o e 21tij(X)a.12 = x=o e- 21tij(x)a.
I
e21tij(x + h)a.
-x~h~P-x
P
I' I'
e 21tih
+ P,
(7)
O
I'
where means that the summation is over those integers in the relevant part of the set in the interval, and cp(x, h) = t(fix + h) - fix» (h "# 0). Viewing cp(x, h) as a polynomial in the variable x, we see that cp(x, h) is a polynomial of degree k - 1 satisfying the conditions of the theorem. Writing P
ah
=
I'
e 21tih
x=o
we have
If
I' ahl :::;;P, IO
.
then clearly the required result follows. Otherwise we repeatedly use the Bunyakovsky-Schwarz inequality to give
:::;; (3P)8 k -
L
I'
l
lahl
8k - 2
O
=
0 (
p8 k - 2 -
1
I'
I' P
I
O
8k 2 e 21tih
(8)
534
19. Schnire1mann Density
Let 1
From 0
~
x
~
x~~ e21tihtp(x,hllX
= ~A(n)e21tih"".
Z
8k
1
_
(9)
P, we deduce that
=
n
o(
From (9) and the induction
ICP(X,h)l) =
max
O(Pk -
1 ).
o~x~p
hypothesi~,
we have
1
IA(n)1
=
Iflx~~ e21titp(X,hlPI8k-Z e-21tinPdPI o 1
~f
Z
8k
1
xto e 21titp(x,hlP 1
-
dP
= O(P 8k - Z-(k-l».
o
We raise (8) to the 4th power, and then integrate with respect to oc over 0 giving
~
oc
~
1
o 1
= O(p4.8k-Z-4f( o =
O(p4.8k_Z-4
If O
£f e21tihtp(X'hlIX18k_Z)4 dOC)
1
x=o
»)
I
A(n 1 )A(n 2)A(n 3)A(n 4
n1h 1 + n2h2 = n3h3 + n4h4 o
n;=O(pk-l) i= 1,2,3,4
= O(P4.8k->-4p3kp4.8k->-4(k-l» The proof of the theorem is complete.
=
O(p8 k- 1 -k).
D
Notes 19.1. Bombieri's theorem on n(x; k, /) (see Note 5.4) can also be used to prove the Goldbach-Schnirelmann theorem. There is a long list of references on this problem in a paper by R. C. Vaughan [58] where he used a mean value theorem on n(x; k, /) by Davenport and Halberstam to prove that every sufficiently large odd number is a sum of 5 odd primes and thus every sufficiently large even number is a sum of at most 6 primes. Vaughan also proved that every even number is a sum of at most 27 primes.
Chapter 20. The Geometry of Numbers
20.1 The Two Dimensional Situation In this section we restrict ourselves to the two dimensional space to give a brief description of the fundamental results in the chapter. Definition 1. Let c be a simple closed curve in the plane and denote by R the region within c. Suppose that, given any two points of R, the midpoint* also lies in R. Then we call R a convex region. For example: Circles, ellipses, parallelograms and regular n-sided polygons are all convex regions. The area of a convex region always exists. (We can define it as the supremum of the sums of the areas of networks of small squares whose vertices lie in the region.) Throughout the chapter we make use of various properties concerning convexity, and strictly speaking we need some knowledge of measure theory and topology to give the proper justifications. The reader, however, can understand without difficulty the fundamental content of the chapter without this knowledge, and in any case when it comes to our specific applications and examples the results can be obtained without this knowledge. Theorem 1.1 (Minkowski's Fundamental Theorem). Let R be a convex region symmetrical about the origin with area greater than 4. Then R must contain a lattice point differentfrom the origin. (A lattice point is a point with integral coordinates.)
Proof (Haj6s). Denote by S2r.2s the square with side length 2 centre at the even coordinate point (2r,2s). If S2r.2s intersects with R, then we translate this intersection to the square So.o with the transformation x - 2r = x', y - 2s = y'. This process moves the whole of R into So.o and since the region R has an area greater than 4 there must be at least two points of R which have gone into the same point in So.o. Let these two points come from the two different squares S2r.2s and S2r',2s' so that the original coordinates of the two points are (xo
+ 2r,yo + 2s)
and
(xo
+ 2r', Yo + 2s').
Since R is symmetrical about the origin, the point ( - (xo
*
+ 2r'),
(Yo
+ 2s'»
It is not difficult to prove that, in this case, the whole line joining the two points lies in R.
also
536
20. The Geometry of Numbers
lies in R. Since R is also convex, the midpoint of the two points (xo
+ 2r,yo + 2s)
and
(- Xo - 2r', - Yo - 2s'),
that is, the point (
xo
+ 2r -
(xo 2
+ 2r')
Yo
+ 2s -
'
must lie in R. The theorem is proved.
(Yo 2
+ 2S'») =(r-r's-s') '
0
It is not difficult to deduce the following:
Theorem 1.2. Ifwe change the hypothesis in Theorem 1.1 by replacing "greater than 4" with "at least 4", then the conclusion becomes "Then there must be a lattice point different from the origin which lies in R or on its boundary". Application 1. Take R to be the parallelogram I~I ~b,
1111
~ c,
(1)
where ~
= ocx + {3y,
11 =
yx
+ by,
ocb - {3y = A(:;t: 0),
and oc, {3, y, b are real numbers. The inequalities (1) define a parallelogram symmetrical about the origin, and is therefore a convex region. The area of this region is equal to
I~I ~b
Iql ~c
I~I ~b Iql ~c
I~I ~b
Iql ~c
Therefore, if (4bc/lAI) ;;:: 4, then there is a lattice point different from (0,0) which satisfies (I). That is: Theorem 1.3. If b > 0, c > 0, bc ;;:: 1.11, then there must be a lattice point (x,y) (:;t: (0,0» satisfying (1). 0 Taking in particular oc = b = 1, y = O. Then there is a lattice point (x,y) (:;t: (0,0» such that
Ix + {3yl that is
This is Theorem 6.10.6.
~ b,
537
20.1 The Two Dimensional Situation
Application 2. Take R to be the ellipse (2)
This clearly satisfies the hypothesis of Theorem 1.1. The area of the region in (2) is given by
II
dxdy =
II
IO(X,Y) o(e,I])
Ided
I] -
~ IAI
If nr2 ~ 41AI, then there is a non-zero lattice point (x,y) satisfying (2). Now any ellipse centre at the origin has the equation (3)
Let b
e= Jax + 2V-r::a Y'
I]
=
Jc _
2
b y. 4a
Then (3) can be written as (2) with A=
Jac - (~y.
Therefore: Theorem 1.4. If a > 0, ac - (t)2 > 0, A = point (x,y) i= (0,0) such that
ax 2 + bxy
J ac 4
+ cy2 ~ -A. n
(t)2, then there exists a lattice
D
This is not the best possible result; in fact we can replace 4/n by 2/./3.
Application 3. Take R to be the region given by the hyperbola (4)
This region is not convex so that we cannot apply Theorem 1.1 directly. Our method is to make a convex region with area at least 4 from this region. We have (5)
and (6)
538
20. The Geometry of Numbers
is a convex region whose area is given by
I~I
II +
Iql '" 2,
dxdy= I~I
d -~ IO(X'Y)ld o(~, Yf) ~ Yf - ILII
II +
Iql ",2,
II
I~I + Iql '" 2,
Therefore: Theorem 1.5. There must be a non-zero lattice point satisfying
From (5) we deduce at once: Theorem 1.6. There must be a non-zero lattice point satisfying
Again this is not the best possible result; it has been proved that replaced by 1/)5.
t can
be
20.2 The Fundamental Theorem of Minkowski Let R be a bounded region in n-dimensional space. Suppose that the mid-point of any two points belonging to R also belongs to R. Then we say that R is a 'convex region, or a convex body. Theorem 2.1. Let R be a convex body in n-dimensional space which is symmetrical about the origin and has a volume> 2n. Then R must contain a non-zero lattice point. It is not difficult to generalize the proof of Theorem 1.1 to n-dimensional space. We now give another method to prove Theorem 2.1 in this section. Proof(Mordell). Let t be a fixed positive integer, and let q, run over all the integers. The various planes 2q,
r
X,=-, t
= 1,2, ... ,n
divide the space into cubes, with each cube having volume (2/t)n, and (2qi/t, . .. ,2qn/t) is a corner point. Denote by N(t) the number of corners in the region R, and by A the volume of R. Then, from measure theory, we have lim 1-00
(~)n N(t) =
A.
t
If A > 2n, then N(t) > t n for sufficiently large t.
539
20.2 The Fundamental Theorem of Minkowski
On the other hand there are at most t n sets (ql> .. . , qn) whose corresponding coordinates are incongruent numbers mod t, so that there must be two points
in R satisfying qi - q; == 0 (mod t). Since R is symmetrical about the origin, it contains the point ( _ 2q'l , ... , _ 2q~) . t
t
Since R is also convex, it contains the mid-point of the two points and
( _ 2q'l , ... , 2q~) , t t
that is, the point
which is a lattice point. The theorem is proved.
D
As before we also have: Theorem 2.2. In Theorem 2.1 we may replace" > 2 n " by " ;;:: 2n " provided that we also replace" R must contain a non-zero lattice point" by "there must be a non-zero lattice which lies in R or on its boundary". D
We can make the result sharper in the following sense. Theorem 2.3. Denote by Q the mid-point of the line joining the origin 0 to the point P on the convex body R. As P runs over the points of R, the point Q describes a convex body which we denote by R t . Under the hypothesis of Theorem 2.2 we may strengthen the conclusion by assuming that the lattice point concerned lies outside R t . Proof Denote by (j the greatest distance between 0 and a boundary point of R. Take the integer N satisfying 2N - 1 ~ (j < 2N, so that the distance between 0 and any boundary point of R2 - N is less than 1. Since R2 - N has no non-zero lattice point, the lattice point in Theorem 2.2 must lie outside R2 -N. Therefore there exists an integer m with the property that inside or on the boundary of R2 -m, but outside R 2 -m-l, there is a lattice point (Xl> ... ,xm). Now the lattice point
lies inside or on the boundary of R but outside Rt .
D
540
20. The Geometry of Numbers
20.3 Linear Forms Let
a,rs
be real numbers, with the determinant :;t:0
LI= and let
r =
1,2, ... ,no
(1)
Take R to be the region
This is a convex body symmetrical about the origin, and its volume is given by
f. f f··· f I f··f
dXl . dX2 ... dX n
l~d"'A!.···,I~nl"'An
O(Xl,X2,""Xn) o(el> e2,"" en)
I~d
Idel ' de2'"
den
'" Al,···.I~nl '" An
1
ILII
Therefore if A1A2 ... An > ILlI, then R contains a non-zero lattice point, and if A1A2 ... An ~ ILlI, then there is a non-zero lattice point in R or on its boundary. Therefore: Theorem 3.1. Let el>' .. , en be n real linear forms in Xl, ... , Xn with determinant LI. Let Al>"" An be positive numbers satisfying A1A2 ... An ~ ILII. Then there exist integers Xl> X2,' .. , x"' not all zero, such that
Theorem 3.2. The conclusion of Theorem 3.1 can be strengthened to the following: there exist integers xl> X2,' .. , x"' not all zero, such that
Proof Let:. '> O. By Theorem 3.1 there are integers Xl, ... , X"' not all zero such that
lell
~ (l
1
+ e)n- Al> le21
~
A2 l+e
- - < A2, ... , lenl
~
An l+e
- - < An'
Now let e -+ O. From the discrete nature of integral points the theorem is proved. 0
541
20.3 Linear Forms
+ 1, and take
If we replace n by n
e. =
x.
(1 ~ v ~ n),
A. =
t 1/n
(1 ~ v ~ n),
1
= t'
An+ 1
then, from Theorem 3.2, we have: Theorem 3.3. There are always integers Xb . .. ,Xn and y, not all 0, such that
and Ix.1 ~ t 1/", where t is any positive number.
D
Again if we take (1
1
v ~ n),
(1 ~ v ~ n),
A+1=•
~
t
then we have: Theorem 3.4. Let 1X1' ••• , IXn be real numbers and t lattice point (X,YbY2, . .. ,Yn) such that 1 IIX.x - Y.I <-,
1
~
~
1. Then there exists a non-zero
x
~
tn.
t
In other words there are n rational numbers ydx, .... , Yn/x, with common denominator x, such that
I
IX. -
Let then
Cn
Y.
~
I<
1
X 1 + 1/n '
1 ~ v ~ n.
D
be the greatest real number with the following property: If 0 < c <
I
IX. -
I
Y. < X
Cn>
1
cx
1 + lin'
=-J"5.
has infinitely many sets of solutions. From Theorem 10.4.4 we know that C1 The determination of Cn for n ~ 2 is an unsolved problem. In Theorem 3.2 we cannot replace le11 ~ A1 by le11 < A1 as well. For example:
.. "' (2)
542
20. The Geometry of Numbers
Then from led < 1 we have Xl = 0, and from le21 < 1 we have X2 giving only the origin satisfying lell < 1, le21 < 1, ... ,lenl < 1. Finally we let
= 0, and so on,
n
(1 ~ v ~ n)
be a unimodular transformation. Substituting this into (2) will give a set of equations with properties similar to those in (2). Question: Apart from the situation in the given example, can we have lei I < AI? This is the famous Minkowski's problem, and for decades the situation was settled only for n ~ 7, until in 1942 the Hungarian mathematician Hajos settled it generally.
20.4 Positive Definite Quadratic Forms Consider the ellipsoid R: (1)
In order to prove that this is a convex body it suffices to prove that
Since i
= 1,2, ... ,n,
the inequality (2) clearly follows. Since the volume of the n-dimensional sphere with radius r is nn/2rn / r(~ follows that the volume of R is given by
1
ILlI
f...f de· .. de 1
1
= - rn
n ILlI
Therefore we have: Theorem 4.1. There exist integers
Xl> ... , X n ,
not all 0, such that
ei + ... + e; ~ 4 ( ILlI)2/n I n
'
ntn r(tn
+ 1)
+ 1), it
.
543
20.5 Products of Linear Forms
where
J.
~
(" )
. r ~+1
We can rewrite Theorem 4.1 differently. A positive definite quadratic form n
n
L L arsxrx.,
Q(X1>' .. ,Xn) =
ars
= asr
r= 1 s= 1
can be represented by
e
The determinant LI of 1> ••• ,en is equal to the square root of D = larsl. This is because A = (a rs ) is a positive definite matrix so that there exists a matrix B such that A = BB', LI = IBI = Dt. Therefore Theorem 4.1 can be stated as follows: Theorem 4.2. Let Q(Xl,' .. , xn) be a positive definite quadratic/orm with determinant D. Then there exists a non-zero point (Xl, . .. ,xn) such that (3)
Let Yn be the least constant with the following property: There exists a non-zero lattice point such that
In §1 we already remarked that Y2 = 2/.j3. Up to the present mathematicians have only determined the values of Yn for 2 ~ n ~ lO: Y4
= Ji, Ys
=
2,
Ys = Y9
18,
= 2,
In general, we know that Yn
<;;2 (r(2 + 2n))2/n
( -~ ne
as
20.5 Products of Linear Forms We first consider the region R: (1)
544
20. The Geometry of Numbers
This region is clearly symmetrical about the origin, and from
~ + ~' I ~ I~I + I~'I
I
2
""
2
we see that R is a convex body whose volume is given by
f·-f 1
ILl I
Therefore: Theorem 5.1. There exists a non-zero lattice point
(Xl, •.. ,
xn) such that
(2)
~2
When n = 2, this is the best possible result. For if we take X - y, then ILII = 2 and (2) becomes I~ll + 1~21 ~ 2. But
~l
=
X
+ y,
=
so that if this is less than 2, then x = y = 0. When n = 3, Minkowski proved that there is a non-zero lattice point (Xl, X2, X3) such that
and that 108/19 is best possible. The problem for n > 3 is unsolved. We now discuss the product of linear forms. We shall use the following result, known as the arithmetic-geometric means inequality. Theorem 5.2.
If al
~
0, ... ,an
~
0, then
Proof 1) n = 2k. We use induction on k. Since
545
20.5 Products of Linear Forms
we see that the result holds when k = I. Assume now that the result holds when n = 2k- 1 . Then when n = 2k we have 1
1
1
(al'" a2k)2k = {(al'" a2k-I)2k-l(a2k-l+l'" a2k)2k-l}t
~ {(a 1 + ... + a 2k-I)(a2k-l+l + ... + a2k)}t 2k -
~
al
2k -
1
1
+ ... + a2k
""
2k
2) (Backward induction.) We now show that if the result holds for n holds for n. Take
+ 1, then it
Then, from our induction hypothesis, we have
);:tT = (al ... an+l)n+l + ... + an) 1
1 ( -na1 ... an(al
_1_
=
_1_ {a n+1 1
~
al
+ ... + an+1 n+ 1
+ ... + an + ~n (a 1 + ... + an)}
which gives
The theorem is proved.
0
From Theorem 5.1 and Theorem 5.2 we have at once:
Theorem 5.3. There exists a non-zero lattice point such that
lei'" enl
n!
~ -ILlI· nn
0
Note. We can also deduce from Theorem 3.1 that there is a non-zero lattice point such that
Since n! < nn whenever n > 1, our Theorem 5.3 here gives a better result. Denote by I'n the least positive constant such that, whenever y ~ Ym there is a non-zero lattice point satisfying
Up to the present we only know that 1'2
= 1/)5 and Y3 = t (Davenport).
546
20. The Geometry of Numbers
20.6 Method of Simultaneous Approximations Theorem 6.1. Let OCt> ••• , OC n be real numbers. Then there exist a non-zero lattice point ~
(Xl> ... , xn) and an integer Y
I such that
i = 1,2, ... ,n. Proof We first consider
IXi - ociyl
+ I~I,,;; r,
I ,,;; i,,;; n,
This is a convex body symmetrical about the origin, and its volume is given by
f. f I~d
(
dXl··· dXndy
here ~i = Xi - OCiY, I ,,;; i ,,;; ~n+l
n,)
= y/t
+ I~n+ d <S;,
i= 1, ... ,n
I~d
+ I~n+ d <S;, i= 1, ... ,n
f··f
=Itl I~d
d~l···d~nd~n+l
+ I~n+ d <S;,
i= 1, ... ,n
~i+~"+ 1 ~r
i= 1, ... ,n
~i~O,~n+ l~O
2 + lit I =_ _ rn+l. n
n
+I
Therefore there is a non-zero lattice point (Xl, ... , Xn, y) such that
lx, _ "",I +
I~I.; (n ~ 1)"~'
From Theorem 5.2 we have
, ; _n_ (n + 1);;+1, + It I 1
n
I
i = I, ... ,n.
547
20.7 Minkowski's Inequality
Hence
I
O(i -
Xi
I~
Y
n (n
1 '
i = 1,2, ... , n.
0
+ 1)/+-;;-
This theorem is a slight improvement on Theorem 3.4. The best results at the present are: (Minkowski),
n+ 1 { 1 + (n---l)n+3}1/n
cn~--
n+1
n
(Blichfeldt).
Exercise. Let 0(. = fl. + iy. (v ::: 1, ... , n) be complex numbers. Then there are complex integers Zl>'" ,Zm W such that
20.7 Minkowski's Inequality For ai
~
0 (i
= 1, ... ,n), r > 0 we define I
Mr(a) = { ;;(a~
+ ... + a~)
}l/r
(1)
.
When r < 0, and some ai = 0, then the equation (1) has no meaning. In this case, we define (a~ + ... + a~)l/r = O. Therefore, when ai ~ 0, ri:O we can always define Mr(a) =
H(a~ + ... + a~)r/r.
From now on wedenoteai ~ 0 (i = 1, ... ,n) by (a). We write (a) > 0 to mean ai > 0 (i = 1, ... , n), and (a) i: 0 to mean that not all the ai are zero. We also denote by max a and min a the largest and the smallest numbers in ai respectively. If there are non-zero real numbers A., Jl such that A.ai = Jlb i (i = 1, ... , n), then we say that (a) and (b) are proportional. Theorem 7.1. limr _
oo
Mr(a)
= maxa.
Proof We can suppose that r > 0, so that }l/r I { ;;(maxa),
or
~ Mr(a) ~
{
(maxa)'
}l/r
,
548
20. The Geometry of Numbers
(;;l)l/r max a ~ Mr(a) ~ maxa. Since
. (l)l/r = (1)0 - = 1,
hm r-
n
+00
we have limr_+ oo Mr(a) = maxa.
n
D
Theorem 7.2. lim r__ 00 Mr(a) = min a. Proof We can suppose that r < O. We first consider the case (a) > O. We have
so that by Theorem 7.1, 1
lim Mr(a) =
1
( ) = - - = mina.
• 1 1 hm M-r maxa a Finally when one ai = 0, and r < 0, we see that both Mr(a) and min a are zero. The theorem is proved. D
r- -
00
-';-+00
We write the geometric mean of ai' Theorem 7.3. limr_o Mr(a)
=
G(a).
Proof 1) r < 0, and some ai = O. Thi~/case is trivial. 2) r =F 0, (a) > O. From (1) we have
I Mr(a) = { ;;a~
+ ... + a~)
1 {1
r
r }
1 +···+a) = er-log -(a n "
We now let r
-+
r
•
0 and apply L'Hospital's rule, giving
1 {I
lim -log r-O
}l/r
-(a~
n
+ ... + a~)
}
=
1
n
n
i-1
- L a~loga· lim r-O
1
-(a~
n
-
I
I
+ ... + a~)
1~ logai'
= -
n
L.
i= 1
549
20.7 Minkowski's Inequality
Therefore · M r () I1m a
1 {l
= I'1m er-log -(a n
r-O
r l
+···+ar )} n
r-O
= 0. We can assume that al > 0, ... ,as> 0, as+ 1 = as+ 2 = ... = an = 0, s < n. Then we have 3) r > 0, and some ai
I Mr(a)= { ~(a~ =
1 + ... + a.) }l/r = {s~'~(a~ + ...
+a~)
}l/r
(~ylrg(a~ + ... + a~)r/r.
From our earlier result we have I
lim { -(a~
+ ... + a~) }l~ = (al
... as)l/s,
S
r-+O
and, since s < n, s)l/r lim ( n
= 0.
r-+O
Therefore lim Mr(a)
=
r-+O
Lemma 1. Let 0(
+ {3 =
lim {(-s)l/r{1-(a~ n s
+ ... + a~) }l/r}
r-+O
1,0( > 0,{3 > 0, Then/or s;;:: 0, t;;:: 0, we have ~t(J ~
with equality only when s
sO(
+ t{3
= t.
Proof The lemma is trivial if s = t or if one of s, tis 0. We assume therefore that s, t are distinct positive numbers. If s > t, then sit> l. Also, < 0( < I, 1 - 0( = {3, so that
°
(n~ From
fy~ sit
1 = 0(
f sit
- I dy
~ 0(
dy
= 0(
(f -
I).
550
20. The Geometry of Numbers
we have SXt fJ ~
Finally if SXt fJ =
SIX
+ t{3,
SIX
+ t{3.
then
fy~ sit
IX
f sit
- 1 dy =
IX
dy,
or
f (y~-l sit
which is impossible unless s = t.
- l)dy
= 0,
0
Lemma 2 (Holder's inequality). Let IX not proportional we have
+ {3 = 1, IX > 0, {3 > O.
When (a) and (b) are
n a~bf < (n.L ai)~( .Ln bi)fJ . .L ,=1 ,=1 ,=1 Proof Since (a) and (b) are not proportional, there exists i (I
~
i
~
n) such that
a· b· --'-=/:--'-. n n
L aj
j= 1
L bj
j= 1
Therefore, by Lemma 1, n
i= 1
(
.~ aj)~( .~ bj)fJ J= 1
J= 1
<
.~
,=1
{ ( - ; - ) IX L aj j= 1
+ (~){3} = IX + {3 =
L bj
1,
j= 1
or
Lemma 3 (Holder's inequality). Let k > O,k =/: 1, (Ilk) + (Ilk') = 1. Suppose that (a k ) and (b k ') are not proportional, and that (ab) =/: O. Then
n
(n
.L aibi < ,=1 .L a7 ,=1
)llk( .Ln br )llk' ,=1
(k > 1),
(2)
551
20.7 Minkowski's Inequality
(k < 1).
(3)
Proof 1) k> 1. Here k' = kl(k - 1) > 1, 0 < 11k < 1, 0 < 11k' < 1, 11k = 1. By Lemma 2 we have
n
i~l aibi
n
=
i~l (a~)l/k(bnl/k' <
(n
i~l a~
)l/k(
2) 0 < k < 1. Here k' = kl(k - 1) < O. If some bi the beginning of the section we have
n
i~l b~'
)l/k'
n
i= 1
Therefore
When (b) > 0, from 0 < k < 1, we have
From Lemma 2 we have
i= 1
i= 1
i= 1
From
we have (k < 1). Theorem 7.4. Let 0 < r < s, Then Mr(a) < M.(a),
.
= 0, then by the definition in
L b~' )l/k' = O.
(
+ 11k'
0
552
20. The Geometry of Numbers
= SIX, 0 <
Prol'1f Let r
Mr(a)
IX
{I
< 1. Then
{I
/
= ;; (a~ + ... + a~) }l/r =;; (a~" + ... + a~") }I S" =
Gttl
(aD" . 1
})I /S".
By Lemma 2 we have Mr(a)
(n1{.Ln
= -
(aD" .
{I
P -"}) 1 Is" < - (n .L af)"( .Ln 1)1-"}I/S" n
.=1
= { -I
n
(nL af)"n
1 -"
}I/
S
.=1
.=1
"
i=1
= (a~ + .~~ + a~)"y/S" =(a~ + .~. +a~y/s
= Ms(a). 0 Theorem 7.5. Let r > 0, r "# 1.
If (a)
and (b) are not proportional, then (r> 1)
and (r < 1). Proof 1) r > 1. Let r' 3, we have
= r/(r - 1). Then r' > (land l/r + l/r' = 1. By (2) in Lemma
n
n
n
i= 1
i= 1
i= 1
L (ai + bi)' = L ai(ai + bi)'-1 + L bi(ai + b i)'-1
n )l/r{ i~1n + (i~1 b~
(ai
}-r r-l
+ bi)'
553
20,7 Minkowski's Inequality
r-l
= {( ,Ln a~ )l/r + (n ,L b~ )l/r}{ ,Ln (ai + b;)r}r. ,=1
,=1
,=1
MUltiplying through by
{,=,±
r-l (ai
+ bi)'}--r,
1
we have
{i~ln (ai + bi)' }l/r < (ni~l a~ )l/r + (ni~l b~ )l/r, 2) 0 < r < 1. If ai + bi = 0 for i = 1,. , . , n, then we deduce, from ai ~ 0, bj ~ 0, that ai = bi = 0 for i = 1"., ,n. This means that (a) = (b) = 0 which implies that (a) and (b) are proportional. Therefore, under the hypothesis of the theorem, we can assume that there is some i, 1 ~ i ~ n, such that aj + bi > O. In fact it is clear that we can assume without loss that ai + bi > 0 for i = 1, ... ,n. Now let r' = r/(r - 1). Then, from (3) in Lemma 3, we have n
n
n
i= 1
j=l
i= 1
r-l
r-l
n )l/r{ n }-r (n )l/r{ n }-r = ( j~l a~ j~l (aj + bi)' + j~l b~ i~l (ai + bj)' n
= {( ,L a~ ,=1
)l/r
+ (n ,L b~ )l/r}{ ,Ln (ai + bj)'}~r. ,=1
,=1
Multiplying through by
{,±
r-l (ai
+ bi)'}--r,
,= 1
we have
This theorem is commonly called Minkowski's inequality.
554
20. The Geometry of Numbers
20.8 The Average Value of the Product of Linear Forms Theorem 8.1. Let n ~ 2, and ~ 1, ... , ~n be linear forms in Xl, ... , Xn with determinant Ll =F O. Suppose that there are s pairs offorms with complex conjugate coefficients and r forms with real coefficients, where r + 2s = n. If (J ~ 1, then there is a non-zero lattice point such that
Proof By Theorem 7.5, (1)
represents a convex body symmetrical about the origin. We first evaluate the integral
r··f
A=
Let ~r+j = '1r+j + i'1r+s+j, ~r+s+j = ~r+j(j = 1,2, ... ,s) be thes pairs oflinear forms with complex conjugate coefficients. Then
r··f
A=
l~tI"+"'+I~rl"+2
Io(~1>""o(X1>""xn) ~r> '1r+1>""
I '1r+2s)
'i,' (q;+q;+}"'2"nT"
J=r+ 1
2S ILlI
r··f
l~tI"+···+I~rl"+2
'i,' (q;+q;+ j),,'2 "nT"
j=r+l
We make the following substitutions:
1 ~j ~ s.
555
20.8 The Average Value of the Product of Linear Forms
This then gives
f··f
A = 2s • 2r @2sl u
ILII
f ... f o
r+s
V=IJ.1 Pv
) dp1'" dPr+s
pf + ... +pC:+ s~nTO' Pv~O
2"
x
(
dO "'r+ 1 ... dBr+s
0
f··f
ILII
(
r+s
n
) Pv dp1'" dpr+s'
v=r+l
pf + ... +pC:+ s~nTa Pv~O
Let
p~
= nTu-c v , v~ = 1,2, ... , r + s, so that j
tl+···+tr+s~l tv~O
When
A
~
2n ,
that is when
T>-( (~Jn-;r(l +~}LlI )l/n ?"
2s
(
1) ( 2) ,
2--;;rr 1 +~ rs 1 +~
there is a non-zero lattice point satisfying (1). The theorem is proved.
0
Theorem 8.2. Let the hypothesis in Theorem 8.1 be satisfied. Suppose that Ab ... ,An are positiv~ numbers, with Ar+t = Ar+s+t (t = 1, ... , s) and (A1 ... Ar+2s) ~ (~)SILlI. Then there is a non-zero lattice point such that
556
20. The Geometry of Numbers
The reader can supply the proof for this. Theorem 8.3. Let the hypothesis in Theorem 8.1 be satisfied, and let (l ~r+v
=
I'/r+v
Suppose that Ai ... An
~
+ il'/r+s+v,
~
er+s+ v
v ~ r),
= ~r+ v
(l
~
v ~ s).
IA1/2 s. Then there is a non-zero lattice point such that 1 ~ v ~ n.
Proof The absolute value of the determinant for I'/l> 1'/2, ... , I'/n is IA1/2s, and therefore the theorem follows at once from Theorem 3.1. 0
20.9 Tchebotaref's Theorem Let n
~i
=
L
(XijXj
(i= 1, ... ,n),
j= 1
where
(Xij
are real numbers, and the determinant for the coefficient is (X11 ••• (Xln
A = ........ #0. (Xnl ••• (Xnn
A famous conjecture of Minkowski is as follows: Corresponding to any set of real numbers Pi' ... , Pn there is a set of integers Xl, ... , Xn (possibly all zero) such that
Minkowski himself proved this for n = 2, and it has also been settled for n As for the general case we only have the following theorem. Theorem 9.1 (Tchebotaref). Let m be the lower boundfor when Xl> ••• , Xn take integer values. Then we have
I(~l
Proof We can assume, without loss of generality, that A given B > 0, there are integers xi, ... , x: such that
=
-
= 3,4,5.
pd ... (~n -
Pn)1
1 and m > O. Then,
o ~ 8 < B.
557
20.9 Tchebotaref's Theorem
Let (i= 1, ... ,n),
so that n
e; = L [3ij(Xj -
xj)
(i=I, ... ,n),
j= 1
and the absolute value of the determinant D for the forms is
IDI
n let - p;l)-1 =1-m- -8. n
=
(
j=l
Since nj= 1 lei - pd ~ m, it follows that
Ii Ie; + 11 = Ii Iee~ -=- Pi. I~ 1 - 8.
i=l
i=l
i
p,
Similarly n
n Ie; -
11
~ 1 - 8.
i= 1
Therefore n
i= 1
Define the convex region C': (i=I, ... ,n).
We now prove that C' has no non-zero lattice point. Suppose that C' has a non-zero lattice point. Then the corresponding e~, must satisfy
... ,e~
(i=I, ... ,n).
If e;2
-
1 > - (1 - 8)2 for some i, then le;2 - 11 < (1 - 8)2 for thati, and therefore n
n le;2 -
11 <
(1 - 8)2
i= 1
which is impossible. Hence (i= 1, ... ,n),
and so (i = 1, ... , n).
558
20. The Geometry of Numbers
Thus, when 8 is very small, this lattice point in C' must be very close to the origin, and this leads at once to a contradiction. For, according to Theorem 2.3, any nonzero lattice point must lie outside C t , and this clearly contradicts le;1 ~ (i = I, ... , There is therefore no lattice point in C' besides the origin. Now, by Theorem 2.1, we have
fo
n).
2"{ 1 + (1 - 8)2}"/2 - ' - - - - - - - ~ 2"
IDI
""
or
As e --+ 0, so that 8
--+
0, we have m ~ 2-"/ 2.
D
20.10 Applications to Algebraic Number Theory Let Wb' .. , W" be an integral basis for the algebraic number field R(8) of degree n. Suppose that there are r1 real numbers, and r2 pairs of complex conjugate numbers (r1 + 2r2 = n) in 8(1), ... ,8("). Then in the n linear forms (i
= 1,2, ... ,n),
there are r1 forms with real coefficients, and r2 pairs of forms having complex conjugate coefficients. It is also clear that the absolute value of the determinant of this set offorms is where L1 is discriminant of the field R(8). Let oc = OC(l), and take (J = 1 in Theorem 8.1, so that there is a set of rational integers Xl, • •. ,Xm not all zero, such that
M
L
IN(oc) I1/" ~ -1 " loc(i)1 ~ n i= 1
((4)'- 2 --;;M nI )1/"' n
n
and this means that there is a non-zero algebraic integral oc satisfying
IN(oc) I ~
(-n4)'2nl~M· n"
Since IN(oc) I is a natural number, and 2r2
~ n,
we deduce that
n" (n)"/2 n" M>- (-n)'2 ->-4 -. 4 n! n! 7
7
Let
" _(n)zn" - -
v "
4
(1)
n!'
(2)
559
20.10 Applications to Algebraic Number Theory
so that
In (
Vn +1 - = - - 1 + -l)n
2
Vn
n
~ n1-
> 1.
Thus {V n } is an increasing and unbounded sequence. Also, when n = 2,
JiAI ~
V2
=
I>
1.
We have therefore the following two theorems: Theorem 10.1. The only field with discriminant 1 is the rational number field.
D
Theorem 10.2. IfLl is a rational integer, then there is afinite number n(Ll) such that any algebraic number field with discriminant Ll has degree at most n(Ll). D
Actually we can take one step further and prove: Theorem 10.3. Corresponding to each fixed rational integer Ll, there are at most a finite number of algebraic number fields with discriminant Ll.
Proof By Theorem 10.2 it suffices to prove that, given any natural number n, the number of algebraic number fields with degree n and discriminant Ll is finite. Let R(8) be a field with discriminant Ll and of degree n, and let W1, ... , Wn be its integral basis. Let (i= 1, ... ,n),
and define r1> r2 as before. We can assume, without loss of generality, that OC(1) = oc, OC(2), ••• , O(r ll have real coefficients, and oc(r, + 1), ••• , O(n) have complex coefficients, and that
Let
so that, by Theorem 8.3, there are rational integers xi, ... , x:' not all zero, such that
... ,
(3)
Therefore there exists a constant c, depending only on n, such that (i= 1,2, ... ,n).
If we can prove that (i
= 1, ... , n - 1),
(4)
560
20. The Geometry of Numbers
then, from Theorem 16.3.1, we see that 0(* is an algebraic number of degree n, and it is easy to prove that R(8) = R(O(*). Let the irreducible equation satisfied by 0(* be (5)
so that each ak must satisfy (k
= 1, ... ,n).
(6)
Thus any field R(8) of degree n and with discriminant LI must be the same as a certain R(O(*), where 0(* is a root of a certain irreducible equation (5) which satisfies the condition (6). Since the number of such irreducible equations is finite, the theorem follows. Therefore it remains to prove that (i= 1, ... ,n-l).
If '2 = 0, then O(*(v) =
'1~
(7)
(v = 1, ... , n). From (3) we have
1 1 ~ IN(O(*)I ~ 2n - 1 100*(n)l. But (i = 1, ... , n - 1),
so that (7) is established. If '2 > 0, then, for 1 ~ v ~
'2 -
1,
Thus
But 10(*(i)1
~ ~ < 2!
and O(*(r, +r2) =F O(*(n), since otherwise '1:
1 ~ IN(O(*)I
=
~
i =F n,
0, so that 100*(n)1
~
t, giving
1 F,
(y 2)n+2
,
which is impossible. Therefore (7) also holds when proved. D
'2
> 0. The theorem
IS
20.\1 The Least Value for
561
IAI
Exercise 1. Prove that we can always select an integer 0( from the ideal IN(O() I ~
M
0
such that
N(o).
Exercise 2. Prove that, given any ideal class, there is an ideal
0
satisfying
N(o)~M·
IAI
20.11 The Least Value for
We sawin the previous section that the discriminant A of an algebraic number field of degree n satisfies
1.11
~
(-n)2r2(nn)2 -. n!
"'"' 4
(mod 4), and (- 1)'2.1 > 0, we can construct the
Moreover, from A == 0 or following table:
'2 = I
'2 = 0 n=2 n=3 n=4 n=5
A A A A
~ ~ ~ ~
But actually the least value for
4 21 \16 680
A A
A A A
~
- 3 - 15
A~-71
A
~
- 419
(I) A A
~ ~
44 260
1.11 can be calculated to give
'2 = I
'2 = 0 n=2 n=3 n=4
~
'2 = 2
=5 = 49 = 725
A A A
===-
3 23 275
'2 = 2 (II) A
=
\17
The case n = 2 in Table (II) follows at once from considering the quadratic fields R(fi) and R(~). . When n = 3, if 8- satisfies x 3 + x 2 - 2x - I = 0, then the discriminant of R(8-) is 49, and if 8- satisfies x 3 - x - I = 0, then the discriminant of R(8-) is - 23. When n = 4, we let 8- be a root of
The following can then be proved: I) When a = 7,p = 29, we have r2 = 0, A = 725; 2) When a = 3,p = 11, we have r2 = I, A = - 275; 3) When a = - I,p = 13, we have r2 = 2, A = 117.
562
20. The Geometry of Numbers
The actual construction of Table (II) presents a problem. The case n = 2 in the table is very easily settled. When n ;;:: 3, the proof of Theorem 10.3 gives us a method whereby after a "finite number" of calculations we can arrive at the results given in Table (II). However, in actual practice, this method requires the calculations of the roots of about one thousand polynomial equations and the determination of the discriminants of the corresponding algebraic number fields. In order to solve this concrete problem we need a practical method. We now examine the situation when n = 3. Suppose that the cubic field R(8) in our discussion has discriminant Ll which satisfies 0 < it ~ 49 (r2 = 0), or - 23 ~ Ll < 0 (r2 = 1). From §10 we see that there is a non-zero integer oc in this field such that (1)
and
The degree of oc is either 3 or 1. Suppose that the degree of oc is definitely 3 so that oc cannot be a rational integer and hence R(8) = R(oc). From the inequality (1) we can determine a bound for the coefficients for the equations satisfied by oc, and the eventual result can be obtained after a finite number of calculations. Unfortunately we have no way of ensuring that oc is not a rational integer. On the contrary, from r > 3, we see that oc = ± 1 do satisfy (1) and ± 1 belong to R(8); therefore this method is not applicable. Let p > 3 and consider the convex body B:
jell + le21 + le31
~ p,
lel + e2 + e31 < 3 «
p),
where
and Wb W2, W3 is an integral basis for R(8). It is easy to see that B is a convex body symmetrical about the origin. Denote by F(t) the area of the intersection between the convex body A:
and the plane el + e2 decreasing. Therefore
+ e3 = t.
Then F(t) = F( - t), and when t;;:: 0, F(t) .is
563
20.11 The Least Value for ILlI 3
Volume of B
=2
p
f F(t)dt = 2~ f FGU )dU o
o p
~ 2~ fF(U)dU = ~ x Volume of A. P
p
o
But
Volume of A
=
{
233!~' 2
3
(
)
r2
when
r2 = 1.
=
0;
3
1 1t P ---
4
when
3!)23'
Therefore, by Minkowski's theorem, there is a non-zero integer a in R(f) satisfying when
(2)
when and (3)
Now we see from (3) that a certainly cannot be a rational integer. Therefore a has degree 3 and R(f) = R(a). Let the irreducible equation satisfied by a be (4)
Then g3 i= 0, and we can assume that g3 > O. For, if otherwise, from - a satisfying the equation
and R(f) = R(a) = R( - a), and - a also satisfying (2) and (3), we can replace g3 by - g3' From the relationship between the roots and the coefficients we have
so that Ig11
~
2 and g3
= 1. Finally we find a bound for g2 by
564
20. The Geometry of Numbers Ig21
+ OC(l)OC(3) + OC(2)OC(3)1 ~ IOC(1)OC(2)1 + IOC(l)OC(3)1 + IOC(2)OC(3)1 (loc(l)l + 11X(2)1 + IOC(3)1)2 't'2
=
IOC(1)OC(2)
~
~-<5,
3
3
so that Ig21 ~ 4. But when r2 = 0 we actually have Ig21 ~ 3. For we now have 1X(i) (i = 1,2,3) being all real, so that either they all have the same sign or there are exactly two of them having the same sign. In the former situation we have
+ 11X(1)1X(3)1 + 11X(2)OC(3)1 + 11X(2)1 + IOC(3)1)2 (1X(1) + 1X(2) + OC(3»2
Ig21 ~ 11X(l)OC(2)1 (11X(1)1
~
3
=
3
< 3,
and in the latter situation, we can assume that 1X(l)OC(2) > 0 and 1X(1)OC(3) < 0, so that
that is Ig21 ~ 3. Summarizing the above, in any cubic field R(8) with discriminant L1 satisfying 0< L1 ~ 49 (r2 = 0) or - 23 ~ L1 < 0 (r2 = 1) there is an integer oc such that R(8) = R(oc), and IX satisfies an irreducible equation
with Igti ~ 2, Ig21 ~ 4 (when r2 = 0, Ig21 ~ 3). Therefore, in order to determine cubic fields R(8) with discriminant L1 satisfying 0 < L1 ~ 49 (r2 = 0) or - 23 ~ L1 < 0 (r2 = 1), we need only examine these irreducible equations. But the number of such equations is at most 45 (at most 35 when r2 = 0). Moreover, when gl = g2 the equation has the root 1, and when gl + g2 + 2 = othe equation has the root - 1, so that we have no need to examine these reducible equations. Finally since the roots of x 3 - g2x2 + glX - 1 = 0 are the reciprocals of the roots of x 3 - glx2 + g2X - 1 = 0, and R(8) = R(I/8), the reciprocal equation to (4) need not be examined either. We are then left with 27 (18 when r2 = 0) equations to be considered. We then calculate the roots 8 of these 27 (or 18) equations and then determine the discriminants for R(8) to arrive at the results for n = 3 in Table (II).
Bibliography
I. Baker, A.: Linear forms in the logarithm of algebraic numbers. Mathematika 13 (1966) 204 - 216. (II) Mathematika 14 (1967) 102-107. (III) Mathematika 14 (1967) 220-228. (IV) Mathematika 15 (1968) 204-216 2. Baker, A.: Contribution to the theory of Diophantine equations I: On the representation of integers by binary forms. Phil. Tran. Roy. Soc. London, A 263 (1967) 273 -291 3. Baker, A.: On the class number of quadratic fields. Bull. London Math. Soc. 1(1969) 98-102 4. Baker, A.: Transcendental number theory. Cambridge University Press (1975) 5. Balasubramanian, R.: On Waring's problem: g(4) ..;; 21. Hardy-Ramanujan Journal 2 (1979) 1- 32 6. Barban, M. B.: Arithmetic functions on thin sets. [Russian]. Dokl. UzSSR 8 (1961) 9-11 7. Barban, M. B.: The density of the zeros of Dirichlet L-series and the problem ofthe sum of primes and almost primes. [Russian]. Mat. Sbornik (N. S.) 61 (103) (1963) 418-425 8. Bombieri, E.: Sulle formula di A. Selberg generalizzate per classi di funzioni aritmetiche e Ie applicazioni al problema del resto nel "Primzahlsatz". Riv. Mat. Univ. Parma 2; 3 (1962) 393-440 9. Bombieri, E.: On the large sieve. Mathematika 12 (1965) 201-225 10. Bombieri, E.: Le grand crible dans la theorie analytique des nombres. Societe Mathematique de France 18 (1974) II. Bombieri, E., and Davenport, H.: Small differences between prime numbers. Proc. Roy. Soc. Ser. A, 293 (1966) 1-18 12. Burgess, D. A.: The distribution of quadratic residues and non-residues. Mathematika 4 (1957) 106-112 13. Burgess, D. A.: On character sums and primitive roots. Proc. London Math. Soc. 12 (1962) 179-192 14. Buchstab, A. A.: New results in the investigation of the Goldbach-Euler problem and the problem of prime pairs. [Russian]. Dokl. Akad. NaukSSSR 162(1965) 735 -738 = Soviet Math. Dokl. 6 (1965) 729 -732 15. Cassels, J. W. S.: An introduction to Diophantine approximation. Cambridge Tracts in Mathematics 45 (1957) 16. Chao, K.: On the diophantine equation x 2 = y' + I, xy io O. Sci. Sin. 14, 3 (1965) 457 -460 17. Chen, J. R.: On the circle problem. [Chinese]. Acta Math. Sinica 13 (1963) 299-313 18. Chen, J. R.: On Waring's problem: g(5) = 37. [Chinese]. Acta Math. Sinica 14 (1964) 715 -734 19. Chen, J. R.: On the representation ofa large even integer as the sum ofa prime and the product of at most two primes. [Chinese]. Kexue Tongbao 17 (1966) 385-386 20. Chen, J. R.: On the representation of a large even integer as the sum a prime and the product of at most two primes. Sci. Sinica 16 (1973) 157 -176 21. Diamond, H., and Steinig, G. J.: An elementary proof of the prime number theorem with a remainder term. Inventiones Math. II (1970) 199 -258 22. Dickson, L. E. : History of the theory of numbers. (Three volumes). Carnegie Institute, Washington (1919, 1920, 1923) 23. Elliot, P. D. T. A., and Halberstam, H.: Some applications of Bombieri's theorem. Mathematika 13 (1966) 196-203 24. Estermann, T. : Introduction to modern prime num ber theory. Cambridge Tracts in Mathematics 41 (1952) 25. Gauss, C. F.: Disquisitiones arithmeticae. Leipzig, Fleisher, (1801). English translation: A. A. Clarke, Yale University Press (1966)
566
Bibliography
26. Hagis, Jr., P.: A lower bound for the set of odd perfect numbers. Math. Compo 27; 12l (1973) 951-953 27. Hagis, Jr., P., and McDaniel, W. L.: On the largest prime divisor of an odd perfect number, II. Math. Comp., 29 (1975) 922-924 28. Halberstam, H., and Richert, H.-E.: Sieve methods. Academic Press, London (1974) 29. Hardy, G. H., and Wright, E. M.: An introduction to the theory of numbers. 4th ed. Oxford (1960) 30. Hua, L. K.: Die Abschiitzungen von Exponentialsummen und ihre Anwendung in der Zahlentheorie. Enzykl. Math. Wiss., J, 2, Heft 13. Teil I. Leipzig (1959) 31. Huxley, M. N.: On the difference between consecutive primes. Inventiones Math. 16 (1972) 191-201 32. Huxley, M. N.: Small differences between consecutive primes. Mathematika, 20; 2 (1973) 229-232 33. Ingham, A. E.: The distribution of prime numbers. Cambridge Tracts in Mathematics 30 (1932) 34. Kolesnik, G. A.: The refined error term of the divisor problem. [Russian]. "Mat. Zametki" 6 (1969) 545-554 35. Korobov, N. M.: On the estimation of trigonometric sums and its applications. [Russian]. Uspeki Math. Nauk SSSR 13 (1958) 185-192 36. Landau, E.: Handbuch der Lehre von der Verteilung der Primzahlen. (2 Biinde). Leipzig, Teubner (1909) 37. Landau, E.: Vorlesungen iiber Zahlentheorie. (3 Biinde). Leipzig, Hirzel (1927) . 38. Landau, E.: Uber einige neuere Fortschritte der additiven Zahlentheorie. Cambridge Tracts in Mathematics 35 (1937) 39. Lavrik, A. V., and Soberov, A. S.: On the error term of the elementary proof ofthe prime number theorem. [Russian]. Dokl. Adad. Nauk SSSR 211 (1973) 534-536 40. Linnik, Yu. V.: The dispersion method in binary additive problems. Leningrad, (1961). = Providence, R.1. (1963) 41. Mahler, K.: On the fractional parts of the powers of a rational number, II. Mathematika 4 (1957) 122-124 42. Minkowski, H.: Geometrie der Zahlen. Leipzig, Teubner (1910) 43. Minkowski, H.: Diophantine Approximation. Leipzig, Teubner (1927) 44. Montgomery, H. L.: Topics in Multiplicative Number Theory. Springer Lecture Notes 227 (1971) 45. Pan, C. T.: On the least prime in an arithmetic progression. [Chinese]. Sci Rec., New Ser. 1 (1957) 283-286 46. Pan, C. T.: On the representation of an even integer as a sum of a prime and an almost prime. [Chinese]. Acta Math. Sinica 12 (1962) 95 -106 = Chinese Math.-Acta 3 (1963) 101-112 47. Pan, C. T.: On the representation of even numbers as the sum of a prime and a product of at most 4 primes. [Chinese]. Acta Sci. Natur. Univ. Shangtung 2 (1962) 40-62 = Sci. Sinica 12 (1963) 455 -474. [Russian] 48. Pan, C. T., Ding, X. X., and Wang, Y.: On the representation of a large even integer as a sum of a prime and an almost prime. Kexu Tongbao 8 (1975) 358-360 49. Richert, H.-E.: Zur multiplikativen Zahlentheorie. J. reine angew. Math. 206 (1961) 31-38 50. Roth, K. F.: On the large sieves of Linnik and Renyi. Mathematika 12 (1965) 1-9 51. Schmidt, W. M.: Simultaneous approximations to algebraic numbers by rationals. Acta Math. 125 (1970) 189-201 52. Schmidt, W. M.: Diophantine Approximations. Springer Lecture Notes 785 (1980) 53. Sierpiilski, W.: Elementary theory of numbers. Warszawa (1964) 54. Slowinski, D.: Searching for the 27th Mersenne prime. J. Recreational MatJ;!.ematics 11 (1979) 258-261 55. Stark, H. M.: A complete determination of the complex quadratic fields of class number 1. Michigan Math. J. 14 (1967) 1-27 56. Stepanov, S. A.: On the estimation of Weyl's sums with prime denominators. [Russian]. Uzv. Akad. Nauk. SSSR, Ser. Mat. (1970) 1015 -1037 57. Titchmarsh, E. c.: The theory of the Riemann zeta-function. Oxford (1951) 58. Vaughan, R. c.: A note on Snirel'man's approach to Goldbach's problem. Bull. London Math. Soc. 8 (1976) 245-250 59. Vinogradov, A. I.: The density hypothesis for Dirichlet L-series. [Russian]. Izv. Akad. Nauk SSSR Ser. Mat. 29 (1965) 903 -934. Corrigendum: ibid. 30 (1966) 719-720
Bibliography
567
60. Vinogradov, I. M.: On a new estimation of a function W + it). [Russian]. Izv. Akad. Nauk SSSR, Ser. Mat. 22 (1958) 161-164 61. Vinogradov, I. M.: On the problem of the upper estimation for G(m). [Russian]. Izv. Akad. Nauk SSSR, Ser. Mat. 23 (1959) 637 - 642 62. Wang, Y.: On the least primitive root. [Chinese]. Acta Math. Sinica 9 (1959) 432-441 63. Wang, Y.: On the estimation of character sums and its applications. [Chinese]. Sci. Record (N. S.) 7 (1964) 78-83 64. Wirsing, E.: Elementare Beweise des Primzahlsatzes mit Restglied, II. J. Reine Angew. Math. 214/215 (1964) 1-18 65. Yin, W. L.: On Dirichlet's divisor problem. Sci. Rec., New Ser. 3 (1959) 131-134
Index
Abel's lemma 120 Aequartro identica ratis abstrura 208 Algebraic number fields 425 Argument 338 Artin, E. 39 Association 431 -, left 368, 382 -, right 368 - modulo p 62 Baker, A. 493, 565 Balasubramanian, R. 513, 565 Barban, M. B. 565 Basis 399,426 -, integral 427 -, standard 402 Base interchange formula 49 Bertrand's postulate 75, 82 Blichfeldt, H. F. 547 Bombieri, E. 100, 249, 565 Brun, V. 74, 514 Buchstab, A. A. 565 Burgess, D. A. 185, 337, 565
Cassels, J. W. S. 478, 493, 565 Chao Jung-Tze 255 Chao, K. 299, 504, 565 Character 152 - system 314, 445 -, improper 157 -, primitive 156 -, principal 152 -, standard factorization of 156 Chebyshev, P. L. 82 Chen, J. R. 99, 100, 147,513,565 Chowla, S. 100 Cofactor 372 -, algebraic 372 Commute 372 Congruent 22, 416 - modulo m 437 - modulo 9Jl 402 Conjugates 425
Continued fraction 250 -, complete quotient of 252 -, n-th convergent of 250 -, periodic 260 -, simple 251 Convex body 538 - region 535 Coprime 5, 434 Countable 474 Cross ratio 342
Davenport, H. 100, 448, 534, 545, 565 Degree 423, 425 - of III 438 Density, asymptotic 113 -,p-
210
-, real 210 -, Schnirelmann 514 Diamond, H. 249, 565 Dickson, L. E. 276, 565 Dimension 399 Ding, X. X. 566 Diophantine equations 276 Diophantus 276 Dirichlet series 143 Dirichlet's divisor problem 147 Discriminant 300, 426 - of R(8) 428 -, fundamental 322 Divisor 2, 57, 430 -, elementary 387 -, greatest common 5, 58, 394, 434 -, ideal 433 -, proper 2 -, right 389 Dyson, F. J. 478
Eisenstein, F. G. 39 Elliot, P. D. T. A. 101, 565 Elliptic 341 Enumerable 474 Equipotent 474 Equivalent 257, 350, 369
570
Index
- form 301 - form modq 309 - in the narrower sense 443 Erdos, P. 217 Euclidean algorithm 5 Euclidean distance 347 Euler, L. 76 - -Binet formula 290 - -'s constant 88, 112, 483 - -'s criterion 36 - -'s identity 191 Extended complex plane 339 Extension, algebraic 69 -, finite 424 -, single 424 - , ((i-
416
((i-convergent sequence 415 ((i-limit 415 Factor, invariant 387 -, repeated 63 Farey sequence 125 Fermat, P. de 288 - solution 25 - last theorem 151,451,488 Fibonacci sequence 252 , Field 68, 424 -, Euclidean 447 -, simple 447 Finite order 342 Fixed point 341 Form, binary quadratic 300 -, (in)definite 301 -, primitive 307 -, reduced 304 Franklin, F. 197 Function, arithmetic 102 -, Chebyshev 217 -, (completely) multiplicative 13, 102 -, divisor 103, 111 -, Euler 103 -, generating 143 -, Mobius 103 -, Riemann zeta 144,219 -, slowly decreasing 226 -, von Mangoldt 103 Fundamental circle 358 Fundamental region 351 Fundamental sequence 415 Furtwiingler 39
Gauss, C. F. 37, 39, 47, 329, 565 Gelfond, A. O. 488 Genus 314, 445 -, principal 446
Geodesic 345 Goldbach's problem 74,99, 151,514 Graph 195 -, (self-)conjugate 196 Group 340 -, abelian 68 -, adjoint 391 Hagis, Jr., P. 566 Hajos 535, 542 Halberstam, H., 100, 101,534,566 Hardy, G. H. 101, 566 Heath-Brown, D. R. 100 Hensel, K. 405 Hensel's lemma 421 Hilbert, D. 39, 483, 488, 494, 514 Heilbronn, H. 329 Hua, L. K. 513,566 Huxley, M. N. 100, 566 Hyperbolic 341 Ideal 58, 68, 432 - class 441 - divisor 433 -, prime 434 -, principle 432 -, product of 432 -, unit 432 Index 48 Inequality, arithmetic-geometric means -, Bunyakovsky-Schwarz 508 -, Cauchy 330 -, Holder 550 -, Minkowski 547,553 Ingham, A. E. 566 Integer 1 -, algebraic 423 -, rational 423 Inverse transformation 339 Involution 342 Iwaniec, H. 100 Jarnik, M. V. 123 Jacobi's symbol 44, 159
Khintchin, A. 494 Kolesnik, G. A. 147, 566 Korobov, N. M. 248, 566 Kronecker's symbol 185, 304 Kummer, E. 39, 431, 451 Kusmin 488 Lagrange interpolation formula
61
544
571
Index Lambert series 146 Landau, E. 566 Large sieve 100 Lattice point 40, 112 Lavrik, A. V. 566 Law of Quadratic Reciprocity 39 Lehmer, D. H. 26 Lehmer, D. N. 4 Legendre, A.-M. 39 Legendre's symbol 35, 152 Lobachevskian geometry 348, 354 Loxodromic 341 Linnik, Yu. 100, 101, 494, 514, 566 Littlewood, J. E. 73, 101
Mahler, K. 513,566 Mann, H. B. 516 Markoff, A. A. 288 Matrix, adjoint (modular) 373, 390 -, composite 389 -, irreducible 389 -, (positive) modular 365, 372 -, (standard) prime 390 Mediant 127 Mersenne number 38, 449 Mersenne prime 450 Miller, J. C. P. 51 Minkowski, H. 544,547,556,566 Mobius (inverse) transform 108 Modular transformation 257 Modulus 4 -, double 64 -, integral 68 Montgomery, H. L. 100,566 MordeII, L. J. 538 Multiple 2 -, (left) least common 394, 8, 59
Niven, I. 486 Norm 425 - ofIDl 402 Normal form of Hermite 369, 384 - - of Smith 370, 386 NuII sequence 415 Number, algebraic 423 -, cardinal 474 -, composite 3 -, Markoff 260, 288 -, Mersenne 38, 449 -, perfect 13 -, prime 3 -, square-free 113 -, triangular 191 -, transcendental 476
Order 48,68 Otto 255
Pan, C. T. 100,566 Parabolic 342 Partition 187 -, (self) conjugate 195, 196 Period 342 Point at infinity 339 P6lya, G. 185 Polynomial, associated 57 -, integral valued 17 -, (ir)reducible 20,63 Primary solution 282 Prime in R(8) 431 - modp
63
- twins 74 Primitive root 48, 49, 68 Principal class 446 Proper solutions 279
Quadratic algebraic numbers 349 Quadratic, (non)-residue 35
Reduced points 351 Reduced quadratic form 358 Renyi, A. 100 Residue class 22 moth - moddp, rp(x) 67 ....:., (non)-k-th power 49 -, quadratic (non) 35 Residue system, complete 22, 64 - -, reduced 24, 64 Richert, H. E. 100, 147, 566 Riemann hypothesis 185,488 Riemann sphere 339 Roth, K. F. 100,478,566
Schmidt, W. M. 493, 566 Schneider, T. 488 Schnirelmann, L. 514 Selberg, A. 217,514 Siegel, C. L. 329,478 Sierpinski, W. 566 Sieve of Eratosthenes 3 Slowinski, D. 566 Soon Go 276 Squaring the circle 488 Standard factorization 3 Stark, H. M. 566 Steinhaus, H. 123 Steinig, G. J. 249, 565 Stepanov, S. A. 566
572 Symbol, Jacobi 44, 159 -, Kronecker 185,304 -, Legendre 35, 152
Takagi, T. 39 Theorem, Bombieri 101 -, Cauchy 497 -, Chebyshev 73,79,89,266 -, Chinese remainder 22, 29 -, Dedekind's discriminant 438 -, Dirichlet 73,97,243 -, Eisenstein 20 -, Erdos-Fuchs 138 -, Euler 24, 36, 76 -, Fermat 18, 24 -, Fundamental - of arithmetic I, 3, 6 -, Fundamental - for ideals 435 -, Gauss 20, 37 -, Hardy-Ramanujan 95 -, Heilbronn-Siegel 329 -, Hermite 485 -, Hilbert 61,529 -, Hurwitz 256 -, Ikehara 228 -, Jacobi 208 -, Jacobsthal 176 -, Khintchin 266 -, Lagrange 208 -, Landau-Ostrowski-Thue 480 -, Legendre 261 -, Liouville 476 -, Lindemann 486 -, Mayer 288 -, Miller 96 -, Minkowski's Fundamental 535, 538 -, P61ya 172 -, prime number 73 -, Roth 478 -, Selberg 233, 520 -, Schur 329 -, Siegel 331,335 -, Sierpinski 134
Index -, Soon Go 286 -, Stickel berger 428 -, Tchebotaref 556 -, Thue 479 -, unique factorization 58 -, Voronoi 137 -, Weyl 270, 272 -, Wilson 33 -, Wolstenholme 33 Thue, A. 478 Titchmarsh, E. C. 566 Trace 425 Transformation 373 -, (uni)modular 348, 373 Triangle 347 Turan, P. 95
Unit 424 -, fundamental - circle 338
441
Valuation 408, 409 -, (non)Archimedian 411 -, equivalent 410 -, identical 409 - , p-adic 409 Vaughan, R. C. 534, 566 Vinogradov, A. I. 100, 566 Vinogradov, I. M. 74, 173,248,513,567
Wang, Y. 185, 337, 566, 567 Weil, A. 185 Wheeler, D. J. 51 Wiener, N. 217 Wirsing, E. 249, 567 Wright, E. M. 566
Yin, W. L.
137, 147, 567