This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
(148.11)
ORDERED STRUCTURES
570
Since P is a semimodule, the rule:
fl -a.y-P EP=; -xr P holds because (/3 - a) + (y
x. Because of (148.2) this means
that (148.10) is satisfied. Since, by (148.2), (148.11) may be represented in the form
/3-x=0, f - aEP, - (i3-x)EP, we see from (148.1) that the assertion concerning (148.11) is true. From (148.2) the validity of (148.3) follows at once. If S is a ring, then from the left-hand side of (148.4)
/3-x,1'EP follows from (148.2). As P is a semiring, so
(8-a);'EP, i.e..
Yi3 - ya, /By - x;' E P. Because of (148.2) this implies the validity of the right-hand side of (148.4). This proves the first part of the theorem.
We now suppose that S is ordered and. that (148.3), (148.4) hold. [If S is a module then only (148.3) holds.] We denote by P the set of elements a of S with the property a > 0. From the supposition a, /3 > 0, by (148.3), (148.4), we deduce that a + /B, aj9 > 0. Hence it follows that P is a semimodule or semiring, according as S is a module or a ring. Further, because of the supposition, for every element a of S exactly one of the possibilities
a=0, 0
is true. We can write for this
a=0, x>0, -a>O. Hence we see that for P the condition required in (148.1) is satisfied. Hence P is a positivity domain. From the equivalence of the three propositions
a
ORDERED STRUCTURES
571
For two elements a, P (a < f) from an ordered structure S, by the interval (a , f) we mean the set of those elements of S which satisfy the condition
A module or a ring is called orderable if it can be ordered in at least one way.
THEOREM 367 (LEVI'S theorem). A module M (0 0) is orderable if, and
only if, its characteristic is 0. If this is so and N is an arbitrary subsemimodule of M which does not contain the element 0, then M can be ordered in at least one way so that all the elements of N become positive. If M is ordered in some manner and a is an element different from 0 in M, then either c > 0 or a < 0. Hence, and from (148.9), not > 0 or
na < 0, respectively, for all n = 1, 2, .... Since then we have in both cases no, 0, M must, in fact, be of characteristic 0. Now suppose that M has characteristic 0. Then every element of M different
from 0 generates a subsemimodule which does not contain the element 0. i.e., which has the property required in the theorem. Therefore it suffices to prove only the last assertion of the theorem. With this end in view we consider the set of those subsemimodules of M
which contain N but do not contain 0. This set contains, according to Theorem 48, a maximal element P. Since this P is a subsemimodule of M which contains N, we have only to prove that P is a positivity domain. Since, further, P does not contain the element 0, it suffices to show that for every element a (# 0) of M either a or -a, but not both, belongs to P.
Since a + (-a) = 0 and P is a semimodule which does not contain 0, so of a, -a at most one can belong to P. We suppose that neither of these elements belongs to P; the contradiction arising from this will prove the theorem. On account of the supposition the semimodules {P, a}, {P, -a} are proper extensions of P, therefore both must contain the element 0. Accordingly, the equations
It+ma=0,
r-na=0
hold with appropriate 11, r (E P), m, n (E . 1 ) From these we have
nla+In r=0. But since the left-hand side is an element of P and the right-hand side not, we have a contradiction. Consequently Theorem 367 is proved. Because of this theorem we need only further examine a ring (skew field, field) R for orderability. Since, if R is ordered, the module R+ is also always
ordered (according to the same positivity domain), we may restrict ourselves, by Theorem 367, to the case where R is of characteristic 0.
ORDERED STRUCTURES
572
First, we observe the following: if R is ordered and a, j3 are arbitrary elements of R, other than 0, for which the refore one of the four case 1)
a>0, P>0.
2)
1a>0,
3)
(a<0,
4)
I J3>0.
1f3<0.
Jx<0, J3<0
must hold, then correspondingly we have
aJ3>0, af<0. a43<0, a#>0.
(148.12)
In case 1) the assertion follows from (148.4). In case 2) we have a > 0,
-# > 0, whence it follows first that -al4 > 0. so that a# < 0. Case 3) is the dual of 2). In case 4) we have -a > 0, -9 > 0, whence afl > 0 follows. Rule (148.12) is now proved. Since in all four cases a(3 # 0, this implies that every orderable ring is necessarily zero-divisor-free. Further, the next more general rule follows from (148.12): in an ordered ring R every product
al ... an (21, ..., an # 0)
(148.13)
is positive or negative according as the number of the negative factors is even or odd.
Now we introduce the following fundamental concept in an arbitrary ring R. We consider all the prod ucts of the form as /9i9 ... vv with factors from R different from 0 and the further products which are obtained from them by permutation of the factors (e.g., a2, afl2a, aflai9). The semiring P0 generated by these products is called the positive kernel of the ring R.
This terminology is justified by the circumstance that according to the rule formulated in (148.13) the generators mentioned, and so, generally, all the elements of P0, must be positive, no matter how R was ordered. We call a ring (skew field, field) formally real when its positive kernel does not contain the element 0. (The reason for the term "formally real" will be given in § 156.) THEOREM 368 (JOHNSON'S theorem). A ring R (# 0) is orderable if, and only if, it is formally real. According to what has been stated above the assertion "only if" is true.
In order to prove -if" we suppose that R is formally real, i.e., its positive kernel P0 does not contain the element 0. To every subsemiring H of R with the property H
Po
(148.14)
we construct a certain subsemiring H' of R such that H
(148.15)
ORDERED STRUCTURES
573
as follows: we consider all the products of factors (s 0) of R with the property that a number (> 0) of the factors belong to h' and the remaining factors may be collected in pairs of equal elements. The semiring generated
by these products is to be taken for H'. It is evident that this satisfies (148.15). We then denote by> the set of those subsemirings H of R with the property (148.14) for which the subsemiring H' does not contain the element 0. Since, in particular, for H = P, evidently H' = P,,, the set k) is not empty. For every chain
Hl cH., c....
which is formed of the elements of 5i, the semiring
H = H1UH.,U... exists, by Theorem 47. Evidently for the corresponding semirings H'. H H;, ... the similar relation
H' does not contain the element 0, therefore H E .. This implies that the semirings H1, Hz, ... have the upper bound H (in s)). From the Kux&TOwsKI-ZORN lemma (Theorem 16) it follows that 5')
contains at least one maximal element. We denote by P such an element and show that P is a positivity domain of R, which proves the theorem. (In addition, P' = P, but this may be disregarded.)
Since P is a subsemiring of R, we have only to prove that the condition (148.1) is satisfied. Because P E 9), 0 is not an element of P. Therefore
it will suffice to show that for an arbitrary a (E R, # 0) exactly one of a, -a belongs to P. We cannot have a, -a E P, for this implies that x + (-a) = 0 belongs to P, which is false.
We suppose that neither of a, -a belongs to P. The following contradiction will prove the theorem. Since a lies outside P, the subsemiring Pt = {P, a}
of R is larger than P, whence it follows that
0EPi.
(148.16)
We denote in general by rro a product of factors (o 0) of R for which a number (> 0) of factors belong to P and for which the remaining factors
ORDERED STRUCTURES
574
may be collected in pairs of equal elements. Further, we denote by nl every such element of R for which and is a no. It is evident that the semimodule generated by all the no, nt is equal to P;+. Because of (148.16) we therefore have an equation of the form
Yno +Ynl=0,
(148.17)
where the sums are finite and at least one is not empty. Moreover, since every no belongs to P' and 0 does not belong to P', it follows that, in (148.17),
the second sum is not empty.
But now, on account of the above supposition, the elements a., -a play the same role. Thus, there is also an equation
E n;, - Y nl = 0,
(148.18)
similar to (148.17), where again the second sum is not empty and the no, ni are defined exactly as the n and n,, respectively. From (148.17), (148.18) we get
(>no)(Eno)+(Y_ no (z a,') = 0. Since every product no no is a no and every product n, ni is likewise a pro,
it follows that the left-hand side belongs to P', thus 0 E P'. This contradiction proves Theorem 368.
NOTE. According to this theorem, "orderable" and `formally real" define the same property for a ring. Of these notations, both equally valid, we prefer the latter. It is evident that the subrings of a formally real ring are also formally real.
Theorem 368 is essentially simplified when a skew field or even a field is under consideration. Before discussing this we first note that the positive
kernel of a skew field F consists of all the non-empty sums of the form
(c,EF, 00; k>0).
a ...ak
(148.19)
This follows by complete induction from the definition of the positive kernel of rings and from m floc = Q2aV
(a, Y E F; Y o 0; e = aN ; U = 1' -').
... ak = (al ... ak)2, it also follows that the positive kernel of afield consists of all the sums of the form Further, since in the commutative case a` 72
+ ... + an
(ac, E F, 0 0; n > 0).
(148.20)
ORDERED STRUCTURES
575
THEOREM 369 (PICKERT-SZELE theorem). A skew field F is formally real
and only if, -1 may not be written as a sum of the form
1... aK
(ai E F).
(148.21)
Let P0 denote the positive kernel of F. Because of JOHNSON'S theorem (Theorem 368) it suffices to show that P0 contains the element 0 if, and only if, -1 is representable in the form (148.21). If this is so, then
+Ea...ak=0,
from which it follows that because 1 = 12, 0 E Po. Conversely, if 0 E P0, i.e., we have an equation of the form w1 .
.
. wr + E 4 ... Mk = 0
lwi, ai E F; wi # 0),
we obtain
1 = I (w; 1)2 ... (a) 1)2 a ... aR Consequently, Theorem 369 is proved. THEOREM 370 (ARTiN-SCHREIER theorem). A field F is formally real if,
and only if, -1 may not be written in the form a1 +
... + a
(ai E F).
(148.22)
This follows immediately from Theorem 369. Cf. ARTIN-SCHREIER (1926), SZELE (1952), JOHNSON (1952). Theorem 369 (for
the countable case) was first published without proof by PICKERT (1951) (p. 248, Exercise 13). Lately PODDERJUGIN (1954) has extended Theorem 368 to non-associative
rings (also with zero divisors), where one has to modify the ordering slightly.
THEOREM 371. If G I F is a field of (finite) odd degree and if F is formally
real, then G is also formally real.
For the proof we put n = [G : F] _- 1 (mod 2). Because of the supposition and LEVI'S theorem (Theorem 367), F is of characteristic 0, thus we may put G - F(ad). Let f(x) be the minimal polynomial of s over F. If n = 1 the theorem is true. If n >-- 3 we assume its truth for all odd integers less than n instead of n. If G is not formally real, then an equation of the form
-1=
k
i=I
(f(x))2 +.f(x)9(x)
(f1 x ......fk(x), 9(x) E F[x]), (148.23)
where all the fi(x) are different from 0 and of degree < n - 1, follows from the fact that G ki F[x]/(f(x)) and from Theorem 370. Let r denote the maximum of the degrees of f1(x)...., fk(x). Because of (148.23), the
ORDERED STRUCTURES
576
degree of f(x)g(x) is at most 2r. It must be exactly 2r, since otherwise from the comparison of the coefficients of x2r it would follow that in F the sum of the squares of certain elements other than 0 vanishes, which is impossible, since F is formally real. Consequently, the degree of g(x) is equal to 2r - n, and so an odd number which is smaller than n. Therefore g(x) has an irreducible factor gl(x) also of odd degree, smaller than n. We now take an extension field G, = F(t91) of F such that g1(O1) = 0. According to the induction hypothesis G1 is formally real. On the other hand, it follows from (148.23) that
This contradicts Theorem 370 thus proving Theorem 371. Of course we must modify the concept of isomorphism and substructure when applied to ordered structures. We call an isomorphism between two ordered structures S (148.24) S' (x -k x') an order preserving isomorphism of S with S', if the condition
x<#=* a'<X
(148.25)
is also satisfied. (Of course this also holds for "C+" instead of "r*".) If S = S' we then speak of an order preserving automorphism. It is evident that an isomorphism such as (148.24) is order preserving if, and only if, the image of the positivity domain of S is exactly the positivity domain
of S'. Two ordered structures are considered as being not essentially different when they are obtained from each other by an order preserving isomorphism.
Of two ordered structures S, T we say that T is an order preserving sub-
structure of S when T is a substructure of S and every relation a < (3, valid in T, retains its validity in S. We also say that the ordering of S is an xtension of the ordering of T.
If instead of a substructure T an ordered structure T' is embedded in an
ered structure S so that every relation a < fl, valid in S, retains its ty after the embedding, then we speak of an order preserving embedding.
tly this is possible if, and only if, T is order preserving isomorphic
a ollowing assertion is almost trivial: if S is a structure ordered sa to the positivity domain P, then every substructure S' of the as S is ordered according to the positivity domain
P,=S'nP ring of S is an extension of that of S'.
(148.26)
ORDERED STRUCTURES
577
It will suffice to prove the assertion, e.g., for the case where S, S' are rings. Then P, P' are semirings. Since, for an element x of S, of the three possibilities (148.1) exactly one is true, it follows from (148.26) that for an element x of S', of the three possibilities
a=0, aE_ P', - aE P' exactly one is true. According to this S' is ordered according to the positivity
domain P'. Finally, if for two elements a, (3 of S' the relation a < (3 (P') holds, then fi - a E P', and therefore (3 - a E P, i.e., x < (3 (P). THEOREM 372 (GRATZER - S CHMIDT theorem). If S is a zero-divisor-free
Everett extension ring of an ordered ring R (0 0), then the ordering of R may be extended in one, and only one, manner to an ordering of S.
Let P be that positivity domain of R according to which R is ordered. It is to be proved that P may be extended to a positivity domain of S in only one way. We denote by P' the set of additive inverses of P, i.e., that of negative elements of R and show first of all that for every a (E S, 0) either
Pa, aP S P or Pa, aP S P'.
(148.27)
Since R is an ideal of S and P c R, so Pa, aP S R. Moreover, since S is zero-divisor-free, Pa, aP S PUP'.
Thus in order to prove (148.27), it suffices to show that for any two
elements , i of P ya E P
ar E P.
(148.28)
part According to the principle of duality it will suffice to prove the of this. For this we suppose that the left-hand side of (148.28) holds. Then Earl E P, thus the right-hand side of (148.28) holds. Hence (148.28), therefore also (148.27), is proved. Now in order to prove the theorem we first suppose that Q is a positivity domain of S for which (148.29) Q ? P. We have to prove the uniqueness of Q. We even prove that Q consists of the a such that Pa, aP S P
(a E S)
(148.30)
holds.
For a E Q, because of (148.29), Pa, aP c Q. But Pa, aP S R. Therefore, since according to (148.26), P = R fl Q, (148.30) follows. Conversely, if (148.30) holds for a a, then a E Q follows because of (148.29).
ORDERED STRUCTURES
578
Hence we have proved that Q is the set of elements a having the property (148.30).
Conversely, let Q denote the set being considered. We shall show that Q is then a positivity domain of S and (148.29) holds, which will prove the theorem. From the definition of Q it follows that it is a semiring and that (148.29) holds. If we take (148.27) into consideration then it also follows that for every a (E S) exactly one of the three possibilities
a=0, aEQ, -aEQ holds. Therefore Q is a positivity domain of S. Consequently Theorem 372 is proved. Cf. GRATZER-SCHMIDT (1957).
THEOREM 373. The ordering of a ring R with at least one central element
(;4 0) may be extended in one, and only one, way to an ordering of the quotient ring R' of R. Let P denote the positivity domain of R belonging to the given ordering. Because of the supposition, R must be zero-divisor-free. Therefore [according to (47. 1)] the elements of R' are given by
x9-1
(a E R; fi E Z, 0 0),
(148.31)
where Z denotes the centre of R. (Cf. also Theorem 94.) First, we assume an extension of the ordering of R to an ordering of R'. Let P' be the positivity domain of R'. Then P = R n P'
(148.32)
holds. We shall show that the element (148.31) belongs to P' if, and only if, (148.33),
a.(3 E P,
whence the assertion of uniqueness of the theorem follows. If the element (148.31) belongs to P', then, because (32 > 0, so also does 2 x(3 = 00-1 F(148.32).
(148.34)
belong to P', thus (148.33) follows from If, conversely, (148.33) holds, then by (148.32) ocfl E P', consequently the element (148.31) belongs to P', because of (148.34) and f-2 E P'. The assertion is now proved. In order to prove the existence proposition in the theorem we denote by P' the set of those elements (148.31) for which (148.33) holds. We shall show that P' is a semiring. For that purpose we consider two elements. y(3
', ;)b-I
(x, y E R; /3, 6 E Z,
0)
ORDERED STRUCTURES
579
of P' for which therefore the conditions af3, yb E P
are satisfied. There
of -1 +
yb-'
= ((xb + yj9)
xii '
yb
xy(f b)-'.
(148.35)
Since the elements
xy 3b = a,B yb
(ab + fly) (f1b) = xf9b2 +
belong to P on account of the assumption, the left-hand sides of equation (148.35) belong to P'. Consequently P' is a semiring. Moreover, since of the three possibilities
xf1=0, af1EP. -x(3EP
(xER;fEZ,
O)
exactly one is true, for an arbitrary element (148.31) of R', of the three possibilities a. f3-' = 0,
x,B-' E
P',
- xft __' E P'
likewise exactly one is true. Accordingly P' is a positivity domain of R'. Since P' P, the ordering of R' according to P' is the extension of that of R according to P. Consequently Theorem 373 is proved. As GRATZER-SCHMIDT (1957) have observed, Theorem 373 includes the
commutative case of Theorem 372, since all the ideals (A 0) of a zerodivisor-free commutative ring have a common quotient field. In every ordered ring with unity element e, this is positive because e = s2. Consequently s, 2e, ..., must all be positive, too. Because of Theorem 373, the following then holds: THEOREM 374. The ring 7 and the field .,-.-0 hare each only one ordering, and their positivity domains consist of the numbers m and mn-'. respectively, (m, n E <.Y ).
NoTE. This simple theorem is of great importance as it furnishes the abstract algebraic definition of the positive integers and the positive rational
numbers. Namely, the positive rational numbers are essentially the elements of the positivity domain P of an ordered prime field Fo of characteristic 0. Further the positive integers are essentially the common elements of the module generated by the unity element of Fo and of P. Hence-
forth, if an ordered ring with unity element is being considered we tacitly identify the unity element with the number 1; this means the order preserving embedding of 7 or, for an ordered skew field or field, the order preserving embedding of.7o.
ORDERED STRUCTURES
580
EXAMPLE 1. The positive kernels of 3 and .moo consist of the positive integers and the positive rational numbers, respectively. EXAMPLE 2. Contrary to Theorem 374, the module .7+ has exactly two orderings
in which the positivity domain consists, in the one case, of 1, 2, . . ., and in the other, of -1, -2..... Yet, these two ordered modules are not essentially different since n -- -n is an order preserving isomorphism between them.
EXAMPLE 3. Let V be the 3-vector space with the basis a),, ..., co.. The elements of the form n) (CA > 0; ek, ..., C. E .7; 1 < k CkWA + . . . + of V constitute a subsemimodule which is obviously a positivity domain P of V. In V. ordered according to this P, a1W, + .
.
. + anw < b1w, + ... + bn("
(a1 b, E 9)
if. and only if, for some k (1 < k < it) ak< bk. EXAMPLE 4. If a ring R is formally real and P denotes its positive kernel, then, for arbitrary elements n, i4 (# 0) of R, efl2, nx1 E P0
(n == 1, 2, ....),
whence it follows that a2f2, nay ; 0, and then that 0, nac 96 0. This is the direct proof that a formally real ring must be zero-divisor-free and of characteristic 0 which, of course, follows from JOHNSON'S theorem (Theorem 368), too. EXAMPLE 5. The complex field .F-, (i) is of degree two and because i2 + 1 = 0
is not formally real. Accordingly, Theorem 371 loses its validity for fields G I F of even degree. EXAMPLE 6. The minimal extension ring with unity element of a formally real ring
is formally real as a consequence Of SZENDREI'S theorem (Theorem 201) and of Theorem 372. (Cf. the original proof of JOHNSON (1952) for his Theorem 368.) EXERCISE. From Theorem 97 it follows that: if a semi-skew-field is a positivity domain of its difference ring then this is a skew field.
§ 149. Archimedean and Non-Archimedean Orderings
An ordering of a structure S is called an Archimedean ordering, if for every
pair of elements x, t3 (> 0) of S there is a natural number n such that
not > .
(149.1)
Otherwise we speak of a non-Archimedean ordering. The above terms are derived from the axiom of Archimedes according to which.
for two intervals AB, CD, there is always a natural number n such that nAB > > CD. THEOREM 375. An Archimedean ordered ring is necessarily commutative.
and if it has a unity element then it has no non-identical order preserving automorphism.
ARCHIMEDEAN AND NON-ARCHIMEDEAN ORDERINGS
581
Let R denote an Archimedean-ordered ring. We consider two positive ) there is, according to (149.1), an elements a, /3 of R. To every m (E
n (E 4'') such that (n - 1)o c::!9 m(3 < na.
(149.2)
Hence
m(af - floc) = a m# - mil a < a na - (n - I )o:
of. = a2 .
Since this holds for all in (E -t'), it follows, again from (149.1), that
a/3-floc <0. By applying this to /3, a instead of a, /3 we obtain j9a - of <_ 0, so that a/B= floc. Accordingly, R is in fact commutative. Moreover we suppose that R contains the unity element e. Fore instead of a, (149.2) reads as follows:
(n - 1)85mfl
(n - 1)e_<m#'
-e<m(/l-/3')<8
for all in, whence We then have 0' = 0 so proving the theorem.. If an element a (> 0) of an ordered ring R with unity element has the
property that for all n = 1, 2.... either a > n or na < 1 holds, then we call a an infinitely large or infinitely small element, respectively. We show that the ordering of R is Archimedean if and only if, R has neither infinitely large nor infinitely small elements.
The assertion -only if" needs no proof. In order to prove "if" we con-
sider two positive elements a, /3 of R. If there are elements in, n (= 1, 2, ..
such that
a
then a < mnf follows. This proves the assertion. The propositions a > n, na-1 < 1 are identical for an arbitrary element
a (> 0) in a field. Thus if a is infinitely large, then x-1 is infinitely small and conversely. Hence it follows that the ordering of a field is Archimedean if it has no infinitely small elements.
The extension of a non-Archimedean ordering can, of course, be only non-Archimedean. On the other hand, the following holds: THEOREM 376. In the case of an algebraic field G I F the extension of an
Archimedean ordering of the field F to an ordering of G is necessarily Archimedean.
ORDERED STRUCTURES
582
COROLLARY. An absolute-algebraic field can only be Archimedean-ordered.
in order to prove the theorem it suffices to show that to every positive element a of G there is a k (= 1, 2, ...) with a < k. Let n-t 1=0
be the minimal polynomial of a.. By hypothesis there is a k (= 1, 2....) such that
k> 1 -a, if a >_ k, then a
(1 =0,....n- 1).
1-a;, i.e.
a,> 1 -a
(i=0,...,it - 1),
from which it follows that n-t f(a) ;> an + (I - a)
57,
a' = 1.
i=0
Since this contradicts f(a) = 0, we have a < k. Consequently, we have proved the theorem and, on account of Theorem 374, the corollary also follows. EXAMPLE 1. Let V be the.3-vector space with basis w1, ..., w,. The ordering of V given in § 148, Example 3, is evidently non-Archimedean. We obtain an Archimedean
ordering of V if we take a real transcendental number 0 and define the positivity domain as the semimodule of those elements C1w1 + ... + Cn(un
(Cl, ..., c,, E .7)
of V, for which we have c1+c211
(This example relies, of course, on the usual ordering of the field of real numbers with which we shall only deal later on.) EXAMPLE 2. Let R = .fo[x]. The semiring consisting of the polynomials f(x)(r R)
with positive leading coefficients is a positivity domain of R. The corresponding ordering of R is non-Archimedean (although it is the extension of the Archimedean ordering of .? ). On the other hand, we obtain an Archimedean ordering of R if we take a real transcendental number 0 and define as positivity domain the semiring of those f(x) for which f($) > 0. EXAMPLE 3. The last two orderings of R = .$-0[x] obviously furnish, by virtue of Theorem 373, a non-Archimedean and an Archimedean ordering of the quotient field .7,,(x). In the first case an element a0x" + ... + any
b of T (x) is positive if ao b > 0. Moreover, all the order preserving automorphisms of .7-.(x) are then given by
x + cx + d, where c, d (e > 0) are elements of .?-,,. (For the second case cf. Theorem 375.)
583
ARCHIMEDEAN AND NON-ARCHIMEDEAN ORDERINGS
EXAMPLE 4. We construct a non-Archimedean-ordered skew field as follows: let
(r,(x)) = (..., r0(x), r1(x), ...) be an infinite sequence in both directions with terms from P-,(x) which, however, vanish to the left from a certain place on. in the set of these sequences we define addition and multiplication by (r,(x)) + (s,(x)) = (r;(x) + s,(x)), (ri(x)) (si(x))
rk(x) s,-k(x°k)), k
where we have to sum over k = 0, ± 1, ..., and a denotes a non-identical automorphism of , 0(x). It is easy to show that is a non-commutative skew field. Now order ."o(x) in a non-Archimedean way as in Example 3, and take an order preserving automorphism of .F'0(x) for ,s. Let an element of be positive if its first nonvanishing term is positive. A positivity domain is now defined in i. As for the rest, we can take the elements of it in terms of a new indeterminate y as a formal power series (cf. § 167)
The construction (without ordering) also holds if we take a field of prime characteristic p instead of .7-0, and leads in this case to a non-commutative skew field of the same characteristic p. EXERCISE 1. A homomorphism S - S' (fin -- o') of ordered structures is said to be order preserving if from e < a (o, a E S) o' < a' always follows. Show that the kernel of such a homomorphism, especially in the case of rings S, S' (0 0), consists only of 0 and infinitely small elements. (POLLAK.)
EXERCISE 2. Every ordered ring without infinitely large elements admits of an order preserving homomorphism, where the image of the ring is different from 0 and Archimedean-ordered. (POLLAK.)
§ 150. Absolute Value in Ordered Structures If S is a module, ring, skew field or field, then we obtain a very simple
classification of S by taking any two elements a, - a (a # 0) to form a class and taking 0 alone as a class. If, moreover, S is ordered, then every class contains exactly one non-negative element by which the classes are then represented. This happens in the following important definition: by the absolute value I a I of an element a of an ordered structure S we understand the non-negative member of the pair a, - a. The absolute value has the following properties:
10I=0,IaI>0(a0 0), ja.+
-a,
(150.1) (150.2)
(150.3)
(In modules only (150.1), (150.2) are to be considered.)
584
ORDERED STRUCTURES
Of these only (150.2) needs proof. Since
it follows that
x+ji.-a-rg
This establishes (150.2). We shall prove that from (150.1) and (150.2)
>_ilal-Iii
follows.
(150.4)
We apply (150.2) to a - f7 instead of x: i.e.
Similarly
Ia:_!9a-h'"+
Ia-i
flIlaI-
These may, because of (150.13), be summarized in
(lal - IfD
la! -
From this and (150.14), (150.4) follows. As a ccnsequence of (150.Q and (150.3) should be noted :
-'=x'".
(150.5)
NOTE. If S = .70 the absolute value I a I (a E .. T) is, because of Theorem 374, uniquely determined. Accordingly, the absolute value in an arbitrary S is
to be regarded as a generalization of the absolute value defined in S. An extensive further generalization in fields will be the subject of the next chapter. EXAMPLE. An ordered ring S is homomorphically mapped by at -- I a I, with respect to multiplication, onto the semigroup of non-negative elements of S.
CHAPTER X
FIELDS WITH VALUATION Soon after STEINITZ created the general theory of fields, KURSCHAK (1913) guided by the p-adic fields of HENSEL (1908) made a far-reaching discovery that the concept of absolute value in fields is capable of generalization. From this he developed the theory of valuation for fields, one of
the most important chapters of algebra with the widest possibilities of application. In this theory, among many others, the pure algebraic foundation of the notion of the field of real numbers was developed, where it came to light that the p-adic fields are analogous concepts. OsTxowsiu (1918, 1935)
has contributed largely to the further development of the theory of valuation. Concerning the treatment of the theory of numbers on a valuationtheoretical foundation cf. HASSE (1949). § 151. Valuations Let a field _9" and an ordered field F be given. The ordering relation in F is denoted by "<", the absolute value in it by "I I". Unless otherwise indicated, small Roman and Greek letters (also marked with indices) denote elements of .7 and F, respectively. The unity element is denoted in both fields by 1 without risk of ambiguity. Since, according to LEVI'S theorem (Theorem 367), F must be of characteristic 0, we may identify the prime field of F .70. These conventions also apply to the following paragraphs. We call .5T a field with valuation when 'a mapping a -+ T(a) of .5' into F is
given with the properties
qu(o)-0, q(a)>0(av
o).
(151.1)
qq(a + b) 5 q,(a) + q-(b)
(151.2)
T(ab) = q(a)p(b)
(151.3)
.
Then, q7(a) is called the value of the element a and F the value field and T itself is called a valuation, or, more precisely, the 99-valuation. 585
586
FIELDS WITH VALUATION
We have q{a - b) >_ I p(a) - p(b) I
(151.4)
,
q,(-a) = p(a) P(1) = I q.(a ' I) = p(a) I
(151.5)
(151.6)
,
(a # 0)
.
(151.7)
For b = 1, 99(a) = 99(oc)99(1) follows from (151.3). This together with (151.1) proves (151.6). (151.7) follows from (151.6) and (151.3). (99(- 1))' = 1 follows from(151.7), therefore 99 (-1) 1. This, together with (151.1), implies that q' (-1) = 1. Consequently, (151.5) follows from (151.3). 99 (a - b) >_ T (a) - T (b) follows from (151.2), with a-b instead of a. Similarly, 9,(b - a) ? 9'(b) - 9%(a)
also holds. Because of (151.5) the left-hand sides are equal and one of the right-hand sides agrees with the right-hand side of (151.4), therefore (151.4) is also true. Now we have to consider an important special case where -7itself is an ordered field and F = _9'. Let T(a) = I a I
This is possible since then (151.1), (151.2), (151.3) are satisfied according to (150.1), (150.2), (150.3). Hence in ordered fields the absolute value is a valuation which we call the absolute valuation and often denote by co:
co(a) = j a i
.
(151.8)
We have now shown that valuation is a generalization of absolute value. Evidently
q(0) = 0 , qu(a) = I (a # 0) ,
(151.9)
is always a valuation and is called the trivial valuation. THEOREM 377. In every field with valuation all the roots of unity have the value 1.
COROLLARY. The finite fields and, more generally, the absolute-algebraic fields of prime characteristic can have only the trivial valuation.
If 99 (a) > 1, then 99 (a) = 1 + S for some b > 0. Hence, and from the binomial theorem, 9' (a") = (1 + b)" > I f o r all n = 1, 2, ..., therefore a is not now a root of unity. If, furthermore, 92 (a) < 1, then q, (a-') > 1, thus a just as a-1 is again not a root of unity. This proves the theorem.
VALUATIONS
587
Since, in a finite field, the elements other than 0 are roots of unity, the corollary follows from the theorem. NOTE. We shall see that, besides the fields mentioned in the corollary, all other fields have non-trivial valuations. It is this generality which gives
valuations such a wide range of applications. According to (151.3), a valuation is always a homomorphic mapping of . into F with respect to multiplication. More precisely, according to (151.1), it maps .' into the semifield consisting of the zero element and the positivity domain of F. According to (151.2), homomorphism with respect to addition is not required. That, of course, would be impossible, too, since it would involve an isomorphism of F which is incompatible with (151.1).
§ 152. Convergent Sequences and Limits
We consider a valuation (p of the field .7 with the value field F and retain the conventions made in § 151 with respect to notation which we extend, however, by using here and in the following paragraphs i, j, k, n, r, s, N, N' to denote elements of 41'. The ordering relation is denoted by < both in 41" and in F. A countably infinite sequence al, a2, . . . consisting of elements of Y is denoted by [a,]. In particular, [a] will denote the sequence a, a,. . . We denote positive elements of F by s, S, p, v. We call [ai] a (p-convergent sequence if for every s (E F, > 0) there is a natural number N such that
4p(ai - ak) < a
(1, k > N).
(152.1)
We say that a (E.9') is the cc-limit of a sequence [ai] if, for every s (E F, > 0),
there is a natural number N such that
(i > N).
,p(ai - a) < a
(152.2)
We denote this by 92-lim ai = a.
(152.3)
(If necessary, the more precise notation' lim instead of lim is used. If 92-lim a; = 0
,
(152.4)
then we call [ail a p-zero sequence. This means, in other words, that for every
s (E F, > 0) there is a natural number N such that
rp(ai)<s
(i > N).
(152.4)
Further (152.3) is equivalent to the fact that [ai - a] is a q,-zero sequence. For a fixed valuation, we omit the attribute "q2" in the notation now introduced (and also in those to be introduced later on) when there is no
588
FIELDS WITH VALUATION
possibility of ambiguity. If Yis an ordered field and unless otherwise indicat-
ed, then in place of q' we have to understand the absolute valuation co. This will always be related to the ordered field F (instead of -7). The above concepts are well known in analysis for the case when 'P- (= F) is the field of real numbers, or of complex numbers, and ' means the "absolute valuation" in those fields. We shall not take this into consideration, however, but we shall obtain these fields as special cases of the present investigations.
We can write every element of .7as a limit, for a = lim [a]. We shall show that the limit of a sequence, if it exists, is always uniquely determined. To prove this we suppose that the sequence [a;] has two limits, a and b. Then, not only (152.2) holds, but, with a suitable natural number N'. also
q(a,-b)N').
For i > N, N' we have
9,(b-a)=q#a;-a)-(a;-b))<_q,(a;-a)+92(a,-b)<2e. If a
b fore=-2
THEOREM 378. Every sequence with a limit is convergent. Assume the validity of (152.3); then for every e (E F, > 0) there is a natural
number N such that (p(ai -- a),
q(ak - a) < 2
(i, k > N).
Since this gives
T(a,-ak)= p((a,-a)-(ak-a))
THEOREM 379. A sequence [a;] converges if, and only if, for arbitrary nl, n2, ... (E [a; - a;+,,,] is a zero sequence. COROLLARY. In afield, the set of convergent sequences is uniquely determined
by that of the zero sequences.
The assertion "only if" is trivial. In order to prove "if" we suppose that [a;] does not converge. Then there is an s (E F, > 0) such that we can find
for every i (> 0) suitable r, s (> i) with gq(a,, - a,) > 2e. Since we have 2e < q(ar
- a) < q(af - a,) +
cp(ai
- a,)
589
CONVERGENT SEQUENCES AND LIMITS
the first or the second term of the right-hand side is at least equal to e; we put ni = r - i or ni = s - i, respectively. Since, accordingly, we always have 99(ai - ai+R,) z e, it follows that [ai - ai+,,,] is not a zero sequence. Consequently the theorem is true, from which the corollary follows immediately. THEOREM 380. To every convergent sequence [ai] from Jr there is a p (E F, > 0) such that 92(ai) < i
(i = 1, 2, ...);
(152.5)
moreover, if [ai] is not a zero sequence, then there is also a I' (E F, for which, with a suitable natural number N, we always have cp(a) > v
(i > N) ;
0)
(152.6)
further, in the special case." = F, q' = co, this N may be chosen so that
either ai > r (i > N) or ai < - v (i > N).
(152.7)
From (152.1), it follows for k = N + I that q2(a,) < e + p(aN+i)
(i > N).
Accordingly none of the T (al), 97(a9).... is greater than ,u = max ((9'(a>.). .... q.(aN), e + (p(aN+I)) ,
so that (152.5) is proved.
If the assertion in (152.6) were false, then to every pair r, N there would be a k (> N) such that 92(ak) < r .
According to (152.1). N may be so chosen that for all i (>N) we have Ip(ai - ak) < I'. Hence 9,(a,) < 92(a, - ak) + 97(ak) < 2v .
If we apply this with v = - , then we see that [ai] is a zero sequence. This contradiction proves the assertion in (152.6). For the special caseY= F, 99 = w, (152.6) implies that for every i (> N) either a, > r or -a, > I'. Consequently, if (152.7) is false, for every N there
is a pair i, k (> N) such that a, > v, -ak > r.
Since
I
ai - ak I > 2r
follows from this, [ai] is not convergent. This proves Theorem 380. THEOREM 381. If [ai] converges so does [q,(a)]. If [ai] has a limit, then [92(a)] has too, and then lim 92(ai) = cp(lim a).
(152.8)
590
FIELDS WITH VALUATION
Since, according to (151.4), we always have I 9v(u) - 90) I < 9w(u - v) , from the definitions (152.1) and (152.2) it follows that the theorem is true. We define the sum and the product of two sequences [ai], [bi] by
[ai] + [bi] = [ai + bi], [ai] [bi] = [aib,] In this way we obtain a commutative ring which is isomorphic with the complete direct sum of a countably infinite number of fields.7. From here on we consider the sequences [ai] as elements of this ring. It is obvious that here [0] and [1 ] are the zero element and the unity element, respectively. The additive inverse of [bi] is [-b,] so for subtraction we have
[ai] - [bi] = [a, - bi] The multiplicative inverse of [bi] is [bt 1] provided that all the bl, b.2, .. . are distinct from 0; so for division we have [a,] [bi]-1 = [a, b, 1] THEOREM 382. LetO denote each of the four fundamental compositions. If the
sequences [a,], [bi] converge, then [a,Obi] converges; if they have a lim-
it, then [a,Obi] also has a limit, and lim (a,(Dbi) = lim a,Olim b, ,
provided that for division b1, b2, . . . are different ,from 0 zero sequence.
(152.9)
and [bi] is not a
We shall prove the theorem first for addition and multiplication. Let [ai], [bi] be two arbitrary sequences. Because of the identities
(u+v)-(u'+v')=(u-u')+(v-v') and uv - u'v' = v(u - u') + u'(v - v')
we have
9q((ai + b) - (at + bk)) < 92(ai - ak) + 9p(bi - bk) ,
(152.10)
92(aibi - akbk) < 9p(br)9v(a, - ak) + V(ak)99(b, - bk).
(152.11)
If [ai], [bi] are convergent, then for every s (E F, > 0) there is an N (> 0) such that E
92(ai - ak), 49(bi - bk) <
2
(i, k > N) .
From this, and from (152.10), it follows that [ai + bi] is convergent. Further, by (152.5), there is a p (E F, > 0) with 99(bi), 99(ak) < p p
(i, k = 1, 2, ...)
CONVERGENT SEQUENCES AND LIMITS
591
and to every s (> 0) an N (>0) such that 4v(a, - ak) , 92(b, - bk) < 2µ
(i, k > N).
Then the right-hand side of (152.11) is less than e, thus [a, b,] is convergent.
We now suppose that lim ai = a, lim bi = b. The above reasoning may be repeated if ak is everywhere replaced by a and
bk by b. We obtain as a result lim (a, + b,) = a + b, lim a,b, = ab. This proves that part of the theorem which relates to addition and multiplication.
Since if [b,] converges then so also does [-b,], and if lim b, = b then lim (-b,) = -b, the assertions relating to subtraction are reduced to the case of addition. Finally, in order to prove the assertions referring to division we suppose that the conditions stipulated at the end of the theorem are satisfied. We shall show that if [b,] converges then so also does [b, 1] and that if Jim b, = b then lim b, 1= b - '. This reduces that part of the theorem which refers to division to the case of multiplication.
If [b,] converges, then according to (152.6) there are certain v and N such that always (i > N). p(b) > v For every e (E F, > 0) N may be chosen so large that 99(bk - b) < v2e
(i, k > N).
From these it follows that 4'(bi 1 - bk 1) = 99(b,)-19P(bk)-199(bk - b) < e ,
therefore [bi 1] converges.
If, in addition, lim b, = b (# 0), then we may similarly conclude that lim bj 1 = b-1. For this, we have only to write everywhere b instead of bk and to subject v to the further condition 92(b) > v. Consequently Theorem 382 is proved. THEOREM 383. The convergent sequences [a,] constitute a ring in which the set it of zero sequences constitutes a maximal ideal. This it is called the ideal of zero sequences. The first assertion and the ideal property of it are a consequence of Theo-
rems 380, 382. It remains to be proved that the ideal it is maximal. For this it is sufficient to show the following: if [a,], [b,] are convergent se_ 20 R.-A.
592
FIELDS WITH VALUATION
quences, but [ai] is not a zero sequence, then [bi] is an element of the ideal
(n, [ai]). This assertion is equivalent to the following one: There is a convergent sequence [xi] such that [ai] [xi] _ [b,] (mod n)
.
(152.12)
Now, because of (152.6), the sequence [ai] contains only a finite number of vanishing terms, consequently, after the addition of a suitable zero sequence, a sequence [ai ] is obtained from [ai] with only terms different from 0. Then [a'] = [ail (mod n) . Since furthermore, according to Theorem 382, [a; ] is convergent and is not
a zero sequence, it follows, again from Theorem 382, that the equation
[ail [xil = [bil determines a convergent sequence [xi]. Thus (152.12) holds and Theorem 383 is true. We understand by a perfect field a field with valuation in which every convergent sequence has a limit. Sometimes the valuation of a perfect field is called a perfect valuation.
It is evident that a field with the trivial valuation is always perfect. We shall become acquainted with other perfect fields later. In perfect fields we shall introduce the concepts of infinite series and infinite products as follows: let. rbe a perfect field with respect to the valuation 99. To every countably infinite sequence [ai] formed of the elements of.Twe can assign, temporarily but formally, i.e., without any more closely defined sense, an infinite sum (i.e., consisting of an infinite number of terms) 00
i=1
ai=a1+a2+...
(152.13)
and an infinite product (i.e., consisting of an infinite number of factors) 00
ai = a1a2...;
(152.14)
q=
these are in general called an infinite series and an infinite product, respectively. The finite sums and products
si=a1+... +ai, pi=a1...a,, formed from them, are called the partial sums of (152.13) and the partial products of (152.14). More precisely, si is called the i`s partial sum and pi the ith partial product. We now call (152.13) a WW-convergent infinite series,
CONVERGENT SEQUENCES AND LIMITS
593
if the sequence [s,] formed from the partial sums is 97-convergent, i.e., because of the perfectness ofSf it has a 99-limit; then we say that this infinite series "exists" and is equal to this limit:
at = a1 + a2 + ... = 92-lim sl = 92-lim (a, + ...a)
.
1=1
Otherwise we call the infinite series 97-divergent. For this, it is usual to say that the infinite series does not exist. Furthermore, we call (152.14) a 92-convergent infinite product, if [pi] has a 92-limit different from 0; for this we say that the infinite product "exists" and is equal to this limit: OD
fl of = a1 a2 ... _ 92-lim pt = 92-lim a1 ... a; .
t=1
Otherwise we say that the infinite product is 92-divergent or "does not exist". From the definition it immediately follows that every existing limit may
be turned by the formula 92-I'M c, = c1 + (c2 - c1) + (c3 - c2) + ...
(152.15)
into an infinite series. Similarly we have the formula 92-lim c, = c1(c2cl 1)(c3e2 1) ...,
(152.16)
provided that the left-hand side does not vanish and no ci is equal to 0. Obviously, for the convergence of a1 + a2 + ... and a1a2..., the conditions lim a, = 0 and lim a, = 1, respectively, are necessary but, in general, not sufficient conditions. If necessary (152.13) and (152.14) are written more precisely as and
92-s al
99-11 a; , i=1
i=1
respectively.
Following the corollary of Theorem 379 we define two valuations of a field with the same ideals of zero sequences as equivalent.
NOTE. Let F be an ordered extension field of F whose ordering is an extension of the ordering of F. If 99 is a valuation of a field Y with the value
field F, then by the statement
(a) = 99 (a) (a E Y) a valuation 0 of. F is
defined with the value field F. Although these valuations at first sight appear
to have the same status, it may very easily happen that they are not equivalent. For example, if [al] is a 92-zero sequence whose terms are different from 0, and if F has a positive element a such that there is no element
sequence. (Cf. the e with 0 < e < E, then [a,] is evidently not a Example at the end of § 153.) However, we shall deal with the equivalence of valuations later in § 157.
594
FIELDS WITH VALUATION
§ 153. Perfect Hull It is of great importance that every field .7 with valuation - as will be seen - may be extended to a perfect field Y in which all the sequences, convergent in3, have a limit. Moreover we can subject .Y to certain further
conditions by which we attain (in a certain sense) uniqueness, but Y will depend essentially on the valuation of.''. This last fact affords many possibilities for the construction of overfields of a given field. By way of introduction we give some definitions. We say that a valuation q/ of an extension field 3' of the field 3, with valuation q', is an extension of the valuation 9), if the value field of 3' contains that of Jr in an order preserving way and, for the elements a of 3, q' (a) = 99'(a). If a fixed extension of 99 is considered, we often likewise denote it by 99.
A perfect extension field .3' of a field with valuation _9' is called a perfect hull of 3, if the valuation, of ' is an extension of the valuation 9) of .7 and consists of the qi-limits of the 92-convergent sequences formed from
the elements of 3. By a (bicontinuous or) topological isomorphism of a field .7 with a field c.P,
both with valuation, we understand an isomorphism of .7 with c.P which transfers the zero sequences, formed from .5r, just into the zero sequences, formed from cP. If there is such an isomorphism we call the fields Jr, cO (continuous or) topologically isomorphic. If between two fields 3, cP an isomorphism a --> a° holds and 99 is a valuation of .i , then a valuation 99° (with unaltered value field) is evidently defined in c.P by 99°(a°) = 97(a), such that a is now a topological isomorphism between _" and 4t.
Two extension fields c0, 4g' of a field .7 (all with valuation are called topologically equivalent (over ..3"), if their valuations are extensions of those of 3 and there is a topological isomorphism c.P 1.7 -- dt' 13. From now on let a field.7 with the valuation 99 and the value field F be given. (The notations are governed by the conventions made at the beginning of § 151, 152.) We shall construct a perfect hull of -7 and show afterwards that it is, in the sense of topological equivalence, uniquely defined.
To do this, we denote the ring of 97-convergent sequences [a;] by R and the ideal of zero sequences of ll by n. Since according to Theorem 383, n is maximal,
3 = T/n
(153.1)
is, by Theorem 130, a field. We denote its elements by [a1 ]' = residue class [a1] (mod n).
(153.2)
The special elements [a]' evidently constitute a subfield .7" of .7 such that
.7
.7' (a -# [a]') .
(153.3)
PERFECT HULL
595
We suppose that in .}' the corresponding embedding of Y instead of .g' is carried out, while we retain this notation for Y after the embedding, so that Y is now an overfield of Y. We define a certain valuation - (with a suitable value field) in Y, which will be an extension of q.'
We first deal with the case J r = F, q' = a). For this case we write F instead of .}'. In conformity with (153.2) we can take an arbitrary ele-
ment of F in the form [a;]' = residue class [a;] (mod n) , where it now denotes the ideal of zero sequences of F and [a;] is an w-convergent sequence. We call this element positive and write it as [aj]' > 0,
if there is a S (E F, > 0) and an N (E J) such that (i > N). a; > b (153.4) We have to show that the positive elements of F are uniquely determined by this. For this purpose we consider an element [fi]' of F equal to [a;]'. It has to be proved that the condition corresponding to (153.4) is also satisfied for fl;. The assumption implies the congruence [a;] _ [fli] (mod n). Since accordingly [(3; - aj] is a zero sequence, we can choose N above to be so great that besides (153.4) we also have
< Hence
a;
- 2s
,
(i>N).
then, because of (153.4),
(i>N), by which we have shown that in F the positive elements are uniquely defined.
Evidently these constitute a subsemiring of F which we denote by P. We show that P is even a positivity domain of F. For this purpose we consider an element [a;]' other than 0, of F, for which the sequence [a;] is therefore convergent, but not a zero sequence. It has to be shown that of the elements [a;]', - [a;]' exactly one belongs to P. According to Theorem 380, (152.7), there are a v (E F, > 0) and N (E 4") such that we have
either aj > v (i> N) or aj < - v (i > N). For the latter we may write -a; > v, so that condition (153.4) is satisfied,
for exactly one of the a,, - aj, i.e. of the elements [a;]', - [a;]' (= [-a,]') exactly one is positive. Accordingly P is in fact a positivity domain of F.
FIELDS WITH VALUATION
596
We put
P=Pf1F.
(153.5)
Since F is embedded in F, P consists of those a (= [a]') for which a > 0 (in F). This implies that P is the positivity domain of F. Since, because of
(153.5), P e P, the ordering of F according to the positivity domain P is an extension of the ordering of F. According to this the corresponding absolute valuation of F, which we denote by 11 11 or by w, is likewise an
extension of the absolute valuation of F. This will be taken as the required valuation of F, which we shall retain in what follows. Before passing to the general case we wish to show, as a preparation for it, that every co-convergent sequence [at] in F has an 6-limit and for it w-lim at = loci], .
(153.6)
We consider an arbitrary positive element [et]' of F. According to the definition in (153.4) there are then a 6 (E F, > 0) and N (E such that we always have et > 6 (i> e"). On account of the assumption, we may choose N so that
ai - akI <
2
(i,k>J )
also holds. Accordingly 6
6
et+ai - ak> 2 , et- at +ak> T. Again by (153.4), we have from this [et + at]' > ak, [ei - at]' > - Mk i.e.,
Vi]' + [ail' > ak,
[et]' - [ai]' > - ak ,
so that finally 11 Mk - [at]' 11 < [et]' for all k (> N). This implies (153.6).
We call attention to the consequence of (153.6) that every w-zero sequence is an c-o-zero sequence, which we shall often apply without reference.
We now consider the general case. We wish to define in .3r a valuation (P which is an extension of qq. (The preceding special case is not excluded. The
valuation to be given for this special case - as will be seen - turns out to be 161.)
Let [at]' be an arbitrary element of Y. We shall show that by 0([ai]') = w-lim q'(ai) (=[m(a,)]')
(153.7)
PERFECT HULL
597
a valuation 0 of . 1 ' (with the value field F) is defined with the desired properties.
For this, we must first of all show that the right-hand side of (153.7) exists and is uniquely determined by [ai]'. Since [ai] is qr-convergent, [ip(ai)] is, according to Theorem 381, an ca-convergent sequence. Hence, and from
what was proved above, the existence of the right-hand side of (153.7) follows.
It still remains to be shown that the equation cw-lim 9' (a) = w-lim 9' (b)
(153.8)
follows from [ai]' = [bi]' . By hypothesis [ai - bi] is a 99-zero sequence, consequently [q9 (ai - bi)] is, according to Theorem 381, an co-zero sequence. Because 99 (a) - q' (b,) I < 9, (ai - bi), 1
[9, (a) - 99 (b;)] is then an co-zero sequence. Hence, and from Theorem 382, (152.9) follows the assertion (153.8). What we have proved so far may
also be expressed by saying that a mapping 99 of Yinto F is defined by (153.7).
Further we have to show that ? is a valuation of Y. If [a,]' = 0, i.e., [ai] is a 9,-zero sequence, then [q:(ai)] is, according to Theorem 381, an co-zero sequence, thus, according to (153.7), ([a,]') = 0. If, on the other hand [a,]' 0 0, i.e., the p-convergent sequence [ai] is not a 9,-zero sequence, then (153.7) and (152.6) show that 99([a,]') > 0. Accordingly, 0 has the property given in (151.1). If 0 had not the property given in (151.2),
then there would be two elements [a,]', [bi]' of .i ' such that
([a,]' + [bi]')
- ([a,]') - 9'([b,]') > 0.
Hence, by (153.7), the existence of an s (E F, > 0) and N (E d) such that
,(ai + b) - 9,(a) - 9p(bi) ? s
(i > N)
would have followed. Since this is absurd, according to (151.2), it follows from this contradiction that Q' has the property given in (151.2). From (153.7) and Theorem 382, it follows that, together with 99, also has the property
given in (151.3). Accordingly, qi is in fact a valuation of Y. It is also an extension of q', for since Yis embedded in Y, it follows from (153.7), that 9'(a) = m([a]) = co-lim q,(a) = qu(a).
FIELDS WITH VALUATION
598
It still remains to be shown that in the special case Y = F, q = w we have 0 = w. For this case the definition (153.7) becomes: ik[ar]') = w-lim w(a;) . The right-hand side is, according to (153.6), equal to [w(a1)]'. This, according to the definition of tv, is equal to Hence the assertion has been proved. For the following we fix for the valuation 9 defined by (153.7), which in the case Y = F, 99 = w becomes the absolute valuation w. We now prove the following theorem: THEOREM 384. Every field J r with valuation has, to within topologically equivalent extensions, one, and only one, perfect hull, and that is the field Y defined above by (153.1) and (153.7).
First of all we prove that 3' is a perfect hull of Jr. (We continue to use the previous notation.) We begin with the proof that for every 99-convergent sequence [at], (153.9) -lim a, = [a,]'. (In the special case." = F, 99 = w, (153.9) becomes (153.6), which has already
been proved, but the following proof is also valid for it.) Take an element
is (> 0) of.}'. From (153.4) there exists a 6 (E F, > 0) such that
b<E. For every term a of
it follows from (153.7) that
([a,]' -
w-lim 99(a; - a.).
But there is an N (E,4) for which we always have
4'(ai-
a 2
<
E
2
(i,n>N).
Because of the preceding work we may choose N so that also
From these we get
<2' ([a,]' -
E
(n > N),
by which (153.9) is proved. This means that Yconsists of the -limits of the q-convergent sequences [a;].
In order to complete the proof that.3is a perfect hull ofYwe still have to verify only that
' is perfect. For this purpose we consider a qi-convergent
PERFECT HULL
599
sequence [ai] (formed of elements ai of 7), in order to prove that it has a -limit, which will verify the assertion. (We deal with a generalization of one part of the former statement involved in (153.9).) Now, first of all, we consider the case where we may form an (5-zero sequence [St] of the elements of F consisting only of positive terms. (There
is no need that this should always happen;
cf. HAUSCHILD-POLLAK
(1965).) Since according to (153.9) every element of.}' is the -limit of a 99-convergent sequence, there is for every i an ai (E 5) such that
,&i - ai) < bi
(i = 1, 2, ...) .
Further there is for every s (E F, > 0) an N(E- ) such that E
0(a-i - ak) , bi <
3
(1,k>N).
Since it follows from this that
9'(ai - ak) = 0(ai - ak) = m(Cai - ak) - (a, - ai) + (ak - ak)) < E (i, k > N), [ai] is a 99-convergent sequence. We write
a = t-lim ai . Then, for every s (> 0) from F there is an.N' such that
T(ai-a)<2
>
We can choose N' so great that also 1
a;<2 Hence
9'(ai -a-) = 0((ai - a'', + (ai - a.)) < e ,
so that O-lim ai = a. This implies that the assertion is true for the case considered.
In the other case there is no w-zero sequence in Y consisting only of positive terms. As [g1(ai - ai+i)] is an w-zero sequence, it has only finitely
many terms different from 0. According to this, there is an N (E "40") such that a, = dN(i > N) whence ip-lim ai = dN. This proves that.' is a per-
fect hull of Y. The proposition of uniqueness of Theorem 384 still remains to be proved. For this, we need the following preparatory 20/a R. - A.
600
FIELDS WITH VALUATION
THEOREM 385. Let.r "be a non perfect field with valuation with the value field F ands" a perfect hull of Y with value field F. To every positive element of F there is then in F a smaller positive element. (In this theorem ,7, F need not mean the fields so denoted, prior to Theorem 384.) For the proof let 99 and q denote the valuations of .7 ands', respectively.
We also consider-7 as being valued by 9-9 and denote this valuation of .' by 991. (The two valuations 99, q' of .7 differ from each other only in that F and F are their respective value fields.) We denote by 91 and 91, the ring of the q,- and q'1-convergent sequences, respectively, formed from.7. Further we denote by n and n1 the corresponding ideals of zero sequences of R and 911, respectively. We shall show that then 91 = ll and n = n1. As .9" is a perfect hull of .7, every 9,-convergent sequence from .7 has a -limit, thus it is a fortiori -convergent, i.e., qq,-convergent. According to this we have 919; B11. On the other hand, every 9,1-convergent sequence from .7 is also 9-9-convergent, thus, because F 9 F, it is even 9,-con-
vergent. Since now 911 c 91, it follows that 91= M1. Furthermore, since every 9,1-zero sequence from,7 is a 9-9-zero sequence, it is, again because F c F, a 9,-zero sequence. Therefore n1 c U. Since nl is, by Theorem 383, a maximal ideal of (91, =) 91, we have 111 = it. Now lets be a positive element of F. We have to show that there is an a in F such that 0 < s < s. First we consider the case where we can form a qq-zero sequence [ai] from .7 with infinitely many terms different from 0. We have already
proved that [a;] is a --zero sequence, so there is a term a, such that 0 < 99(a,) < e. Then, q (a;) = 99 (a,) belongs to F, so the theorem is true in this case. In the other case every 9,-zero sequence from 3 contains only finitely many terms different from 0. In every 9-convergent sequence from.' the
terms are then equal except for finitely many. This implies that every q-limit belongs to-9', i.e., Y=,7. Since this is, however, a contradiction, Theorem 385 is proved.
Finally the proposition of uniqueness of Theorem 384 is contained in the following as the special case .7 = cPt, a = 1: THEoREM 386. A bicontinuous isomorphism
' -- co (a - aa)
(153.10)
between two fields .7, a with valuation may always be extended to a bicontinuous isomorphism between two arbitrary given perfect hulls of .7T' and cQt.
We denote perfect hulls of _57 and &t by .t and
the valuation and
value fields of .7, &t, -7, c by q, V, , and F, G, F, G, respectively. If f is perfect, there is for every 9,-convergent sequence [ac] from . an element a of,7 such that a = T-lim a,. Since then a = q-lim a; must also hold,
PERFECT HULL
601
3 = .7. Also from the supposition, it follows that c0 is perfect, therefore i4 = c4. According to this the theorem is true in this case, so that we may henceforth assume that neither .7 nor cet is perfect. We denote an arbitrary 92-convergent sequence from.' by [ai] and shall show that (153.11) -lim a, --> i-lim aa, is a bicontinuous isomorphism of ..'onto c0. Since (153.11) is a continuation of (153.10), this will prove the theorem. The left- and right-hand sides of (153.11) run through all the elements of .f and 4 respectively, therefore 3 is mapped by (153.11) onto c4. We shall
show that this mapping is one-to-one. The proof will rely on the direct consequence of Theorem 385 that a sequence in Y is a i-zero sequence if, and only if, it is a p-zero sequence. ('If"' follows from Theorem 385, "only if" is trivial.) For two 42-convergent sequences [ai], [bi] from .7
-lim a, = -lim bi if, and only if, [a, - b,] is a -zero sequence, i.e., a 99-zero sequence, i.e., if [oai - vbi] is a y,-zero sequence, i.e., a yi-zero sequence, i.e., if -lim aai = +p-lim orb,.
This proves that the mapping (153.11) is one-to-one. Its homomorphy property follows from Theorem 382. If we show that every i-zero sequence becomes a yrzero sequence under (153.11), then for the same reason the remaining requirement will hold, thus completing the theorem. We take a sequence [a,] from Y. Every term ai of this sequence may be considered as a limit a, = 99-lim aik k
(i = 1, 2, ...) ,
where the aik are suitable elements of.'. We prove that [a-i] is a i-zero sequence if, and only if, to arbitrary natural numbers N1, N2, ... may be assigned further natural numbers n1(> N1), n2 (> NO ...
(153.12)
92-lim a,,, = 0 .
(153.13)
with the property r
First, we suppose that ri] is a -zero sequence and N1, N2.... are any natural numbers. To every positive element a of F there is then an N (E.4*) such that always
j(ai)<2
(i>N).
FIELDS WITH VALUATION
602
Also we may chose n1,, n2, ... satisfying (153.12) and such that IG (a,,,, - ai) < Hence 99 (a,n)
(t = 1, 2, ...) .
2
(ai,,,) < e, and we have proved (153.13).
Conversely, we suppose that for arbitrary N1, N2, ... the n1, n2, ... with the property (153.12) may be chosen in such a way that (153.13) is valid. Let is be an arbitrary positive element from F. According to Theorem
385 there is a positive element E of F such that 2e < E. We choose the N1, N2, ... in such a way that we always have &en, - ai) < e (ni > Ni, i 1, 2, ...) . Because of the supposition the nI, n.2,
. .
. may then be chosen in such a way
that for a suitable N (E J") we always have faint) _
faint) < e
(i > N).
Hence 45 (a) < 2 e < 9 (i > N), i.e., 15-lim a, = 0. Consequently we have shown that the assertion concerning the f)-zero sequences formulated in (153.12) and (153.13), formed from ", is true.
For the same reason a similar statement holds also for the
TP-zero
sequences formed from 4. From both these, and (153.10), it immediately follows that by (153.11) every q-zero sequence is in fact mapped into a w-zero sequence. Thus Theorem 386 is proved. EXAMPLE. In Theorem 385 it is necessary to suppose that F" is not perfect. Suppose
that the field.. has the trivial valuation, so that it is perfect and consequently equal to its perfect hull S r, and let F = i and F = .moo (x) be the value fields of .7" and X. respectively, where F, as in § 149, Example 3, shall be ordered in a non-Archimedean way. (See Theorem 374.) Now x-1 is a positive element of F, and there is no element
eofFsuch that 0<e<x-'.
§ 154. The Field of Real Numbers
As one of the most important applications of valuation theory first of all arises the field of real numbers which we define as the perfect hull of the field 5 with the absolute valuation and denote by o). The elements and subfields of 9( 'o) are called real numbers and real number fields, respectively. By Theorems 384, 385, o; is an Archimedean-ordered field provided with the absolute valuation. We always denote the ordering, the absolute value and the limit by <, I I and lim, respectively. THEOREM 387. Every Archimedean-ordered field F, perfect with respect to
the absolute valuation, is order preserving isomorphic with .f(o). Further every Archimedean-ordered module M or ring (field) R is order preserving isomorphic with a submodule or subring (subfield) of ol.
THE FIELD OF REAL NUMBERS
603
It may be supposed that is a subfield of F. We denote by co or I I the absolute value in F. By Theorem 386, for the first assertion of Theorem 387, it will suffice to prove that every element a of F is the limit of a sequence from Y0. By the supposition, to every natural number n there is one and only one integer kn with kn 5 not < k,, + 1. We put an = n-'I,,. Then we have an 5 a < an + + n-1, so that
(n= 1,2,...).
Ia - ocI < n-I
On the other hand, for every element a (> 0) of F there is a natural num-
ber N (> s-1). Then for n > N
Ian-al
exists and depends only on a. Further a -* a is an order preserving isomorphism of M into 9'( ). The assertion about R may be reduced to the case where R is a field. For, according to Theorem 375, R is commutative and zero-divisor-free, so that, by Theorem 373, the ordering of R may be extended to an ordering of its quotient field F, which is obviously Archimedean. The ordering of F may be extended further to an ordering of its perfect hull F. Since, according to the first assertion, already proved, of the theorem, F is order preserving isomorphic with 7(o), the proof is now complete. A function f (x) defined in o) is called continuous at the point x = c
if, to every element a (> 0) of
0?,
there is a further element b (> 0)
from 30) such that the condition
I1(c+h)-f(c)I <e
(IhI <6)
is satisfied. Iff(x) is continuous at every point of the interval (a, b), then we say that f(x) is continuous on this interval. Finally, a function f(x) is said to be continuous if it is continuous at every point of its domain of definition. THEOREM 388 (BOLZANO'S theorem). If a function f(x) defined in
O) is
continuous on the interval (a, b) and f(a) < 0 and f(b) > 0, then it has at least one zero in this interval.
For the proof, we suppose that the theorem is false. We write a0 = a, b0 = b and determine the real numbers a,,, b,, (n = 1, 2, ...) so that an+I = an, bn+I = 2 (an + b,,) or an+I = 2 (an + bn), bn+l = bn ,
FIELDS WITH VALUATION
604
respectively, according as f12 (a
+
I
is positive or negative. Then
111
ao < al S ... and bo > bl. >_ ..., and
f(a,J<0, f(b,J>0, b"-a"=2-"(b-a)
(n=0,1,...).
It is evident that the sequences [a"], [b"] have a common limit. We put
c=lima"=limb". Then a S c < b and
a"
(n=0,1,...).
Furthermore, by hypothesis, J(c) 96 0.
Therefore, and because of the continuity of f(x), we can choose a number S (> 0) so that f(x) is either positive everywhere or negative everywhere on the interval (c - 8, c + b). But since, for sufficiently large n, a", b" lie in this interval, we reach a contradiction, so proving Theorem 388. so
If two functions f(x), g(x) are continuous at a point x = c, then also are f(x) + g(x), f(x) - g(x), f(x)g(x) and (for g(c) # 0)
f(x)g(x)-1, too. We need not prove this since the proof is similar to that of Theorem 382. It follows that every polynomial f(x) (E 9( 'O)[x]) is continuous. Tm;oREM 389. Every polynomial fl x) over .5o) of odd degree has at least one zero in o). We may suppose that f(x) is a principal polynomial:
fix) = x" + alx"-' + ... + a"
(2,f' n)
.
Take a real number c such that
c> l,c> Ia,I +... + Then f (c) > 0, f (- c) < 0. This, by Theorem 388, proves Theorem 389. Tm?OREM 390. Every equation
x"-a=0
(n>-1; aE.5o),>0)
(154.1)
has exactly one positive root in -7(o. Since the left-hand side of (154.1) is negative for x = 0 and positive for x = 1 + a, so by Theorem 388, (154.1) has at least one positive root. There
cannot be two positive roots since b" < c" follows from 0 < b < c. The theorem is now proved.
THE FIELD OF REAL NUMBERS
605
Because of this theorem we introduce for the positive root of the equation (154.1) the notation
a(154.2)
We define more generally the power a' for a positive real basis a and for a rational exponent ,bmy putting amn-1
= (a n-I
(a E _7(o), > 0; m E 7, n E -44'
.
(154.3)
It is now necessary to show that the right-hand side depends only on a and mn-1. In other words, we have to show that we always have (am)"-' = (ak)'-' (m, k E .7; n, I E -t'; mn-1 = kl-1). The left-hand side and the right-hand side are positive roots of the equations
x" = am and x = ak , respectively. Because of Theorem 390, we may replace these equations, without altering their positive roots, by xn' = ank.
xn1 = am' ,
Now, because ml = nk, these equations are identical, whereby the uniqueness of definition (154.3) is proved. We shall prove that
(ab)' = arbr
,
arts = eras,
(a')s = ars
(a, b E (o), > 0; r, s E Yo) .
(154.4)
First of all, it follows from Theorem 390 that (154.43) is true ifs is a natural
number. Let us therefore take two natural numbers m, n such that rm, sn are integers, then ((ab)r)m = (ab)rm = a'mb'm
= (ar)m
(b')m
= (arb')m,
(ar+s)mn = a(r+s)mn = armn asmn = (ar)mn (as)mn = (ar a3)mn ,
((a)s)mn = (ar)smn = arsmn = (ars)mn .
From these equations, and again from Theorem \\390, it follows that (154.4) is true. Further, we shall prove the two properties of monotonicity
0
(a,bE_9(o); rEYo,>0), (a E 3(o), > 1; r, s E .7o)
.
(154.5) (154.6)
606
FIELDS WITH VALUATION
For the proof of (154.5) let m denote a natural number such that rm is an
integer. Then a'' < b'm, (a'), < (b')', from which (154.5) follows. To prove (154.6) we take two natural numbers m, n such that rm, sn are integers. Because rmn < smn we have 1 < asm ,", and arm" < asm", whence (154.6) follows. We define the power a` for every positive real number a and for every real number c by
d'= lim a(n
(c=limes),
(154.7)
where [c"] is an arbitrary sequence from 70 with limit c. To justify this definition we must show that the right-hand side of (154.7) always exists and is independent of the special choice of [c,, ], i.e., it is uniquely determined by a, c. From this, and because c = lim c, it follows that (154.7) becomes a
valid equation for a rational c, thus giving a generalization of the concept of power.
We shall shorten the proof somewhat. We shall show, first, that to arbitrary elements a (> 1), e (> 0) of .5 o) there is an N (E 41') such that
an-' < 1 + e
(n > /V).
(154.8)
Since (154.8) is equivalent to a < ([ + E)" and this is valid for every n such that a < I + n e, the assertion is true. Now we shall show that the right-hand side of ([54.7) exists for every convergent sequence [c"] from .70. If a = 1 it is trivial. Furthermore, since (a-')`" = (a`")-', it suffices to consider the case a > 1. We take an integer r such that r > C1,C2,... Let a be an arbitrary positive real number. On account of (154.8), we take an n with
a"-'<1+a'e.
Then we take an N' (E a-) for which we always have
Ck - c,I
(k,1>N').
Then
a`k - ac' 15 a'(d`k`h' - 1) < a'(a"-I - 1) < g. Since accordingly [a`"] is convergent, the right-hand side of (154.7) exists.
In order to prove the uniqueness of the definition (154.7), we take a further sequence [d"] from .ro such that c = lim d,,. We denote the "mixed" sequence c1, d1, c2, d,., ... by [e"]. According to the preceding work, all three limits lim a'R , lim a" d , lIm a`"
THE FIELD OF REAL NUMBERS
607
exist. Since the first two limits separately must be equal to the third, they are equal to each other. Accordingly, a` is uniquely defined by (154.7). We leave to the reader the simple proof that (154.4), (154.5), (154.6) also remain valid for real numbers r, s and then a` is a continuous function of both variables a, c. The definition of an ordered module will henceforth also be related to the multiplicative notation. The meaning of an ordered Abelian group is now obvious. We have to interprete correspondingly order preserving automorphisms and isomorphisms, in connection with Abelian groups, and we may speak, in the corresponding sense, of the order preserving isomorphisms between two structures, one of which is an Abelian group, the other a module. We denote the positivity domain of .7jo) by P. This is the subsemifield of .7"(o) consisting of the positive elements. Correspondingly, the group of positive real numbers is denoted by P". This is ordered and has for its positivity domain the semigroup of those real numbers which are greater than 1. THEOREM 391. All the order preserving automorphisms of the group of positive real numbers are given by
a-->a`,
(154.9)
where c can be any positive real number. (This theorem may be regarded as a definition of the power a` for a, c > 0. Since a° = 1 , a = (a`)-1, we can now arrive at the complete definition.) For the proof of the theorem we denote the group of positive real numbers, as above, by P'<.
First of all we shall show that an order preserving automorphism of P"
is given by (154.9). If b is an arbitrary element of P", then (b`-')` = b. Hence we see that P" is mapped onto itself by (154.9). Furthermore since, as seen above, (154.41), (154.5) hold for c instead of r, (154.9) is in fact an order preserving automorphism of P'. Conversely, we consider an order preserving automorphism a -->f(a)
(154.10)
f(ab) = f(a)f(b) ,
(154.11)
a < b = f(a) < f(b) .
(154.12)
of P". Then for a, b E P"
We first show that
f(d) = (f(a))`
(a E P", I E J) .
(154.13)
608
FIELDS WITH VALUATION
For t = 0 it is trivial. Further, according to (154.11), f(1) = 1, f(a-t) = = (f(at))-1, while (f(a))-t = ((f(a))t)-1, so that it is sufficient to prove (154.13) for t > 0. We put t = kl-' for two natural numbers k, 1. Because of (154.11) we then have ((d ))h' = f(a) = f(ak) = (J (a))k = ((f(a))t )f .
Hence, and from Theorem 390, (154.13) follows. We shall also show that every equation
a"=b
(a,bE. (0,; a>0,b>0; a
1)
(154.14)
is solvable with a real number x. For, it is obvious that there are integers k, l such that ak < b < ar. Hence, and from the continuity of ax, according to BoLZANO's theorem (Theorem 388) the assertion follows. We now take a fixed element b (# 1) and an arbitrary element a of P" . According to what has been proved for (154.14),
f(b) = b` , a = bd
(154.15
for two suitable real numbers c, d. We take two sequences [r"], [s"] from .moo such that
r1, r2, ... < d < sI, s2, ... ,
lim r" = lim s" = d.
(154.16)
(154.17)
From (154.152), (154.16) we get
b'"
(n= 1,2,...).
Hence, and from (154.12),
.f(b'")
(f(b))'" < f(a) < (f (b))s"
,
so that, by (154.17) and (154.15), .f(a) = (f(b))d = (bc)d = (bd)` = a` .
Here, because of (154.12), c must be positive. Consequently Theorem 391 is proved. By this proof we have also shown that (154.14) always has a solution x in Yob. Of course there is only one solution. We call it the logarithm of b
THE FIELD OF REAL NUMBERS
609
(to the base a) and denote it by logo b. This definition is representable by the formula (154.18) (a, b E .3lo). > 0; a # 1) . b aIO$ab For log. b we write log b when the base a is fixed. Without any difficulty we can obtain the rules
log. be = log. b + log. c loga b` = c log. b"
(b, c > 0) , (b > 0) .
(154.19) (154.20)
THEOREM 392. All the order preserving isomorphisms of the group of positive real numbers onto the module of real numbers are given by a -+ log, a,
(154.21)
where c can be any real number > 1. (This theorem may also be regarded as a definition of the logarithm log, a for c > 1. From here, since log,_ I a = - log, a, we can obtain the complete definition.) The assertion of the theorem that (154.21) yields only the desired isomorphisms is easy to see on account of (154.19) and (154.6). We choose one of the isomorphisms (154.21) and denote it by at,. All the required isomorphisms are then given by
a=ace, where e runs through the order preserving automorphisms of the group of positive real numbers. According to Theorem 391 these a are given by
a--Boa=a', where r is a positive number. For a (E F'(o), > 0) we have, because of (154.20)
as = a. ea = aca' = loge a' = r loge a. But since
(Cr-I)rbca = clogca = a
we have as = logs a with d = c'-'. This proves Theorem 392. As regards Theorem 391, it should be noted that the group of positive real numbers also has automorphisms which are not order preserving. The same holds with respect to Theorem 392. Cf. HAMEL (1905). For the theory of real numbers see KALMAR (1952).
610
FIELDS WITH VALUATION
EXERCISE 1. A set S of real numbers bounded above has one and only one upper bound c with the property that every interval (c - h, c) (h > 0) contains at least one element of S. We call c the Weierstrass upper bound of S. EXERCISE 2. By a partial sequence of a sequence [a1] let us understand every sequence [a,,] with 1 < n, < n2 < ... Every sequence of real numbers in an interval has convergent partial sequences (BOLZANO-WEIERSTRASS theorem).
§ 155. The Field of Complex Numbers
For every real number a, a2 + 1 > 0, so that the polynomial x2 + 1 is irreducible over .7(O). Thus according to Theorem 295 the field .o)(i) with i2 + I = 0 exists and we call it the field of complex numbers. Its elements and subfields are called complex numbers and complex number fields, respectively. The real number fields also occur among these but are usually excluded.
Since -7to) (i) is algebraically of the second degree over 20), the complex
numbers may be taken uniquely in the form a + bi, where a, b denote real numbers. These are sometimes called the real part and the imaginary part of the complex number. THEOREM 393. When, in an ordered field F, every positive element is a square
and every polynomial from F[x] of odd degree has a zero in F, then the complex field F(i) is algebraically closed. COROLLARY (the so-called Fundamental Theorem of Algebra). The field of complex numbers is algebraically closed.
In order to prove the theorem, first of all let us refer to the fact that, according to the JOHNSON'S theorem (Theorem 368), F is formally real.
Since, according to this, the equation a2 + b2 = 0 has only the trivial solution a = b = 0 in F, by Theorem 177, the complex field F(i) does in fact exist. We consider an arbitrary non-constant principal polynomial f(x) over F(i). We have to prove that this has a zero in F(i). First of all we shall prove this assertion for the special case where f(x)
lies in F[x]. Let n = 2`no
(2` 11 n)
(155.1)
be the degree of f(x). If t = 0 then 2 ,j' n, therefore f(x) has a zero in F. If t > 0 we assume the assertion for t - 1 instead of t. We put
f(x)_(x-00...(x- 00, where al, ..., a are elements of a splitting field
G = F(i, al, ..., cc ) of f(x) over F(i). It will suffice to prove that at least one of the al, ..., an necessarily lies in F (i).
THE FIELD OF COMPLEX NUMBERS
611
For this purpose we put /I* _
I2 "I
= 2'-'no(n - 1)
(155.2)
and form the polynomials
Fk(x)_
fl
(x-a,as-k(a,+x,,))
(k=1,...,u*+1) (155.3)
of degree n*. Temporarily we consider the a.I, ...,x as indeterminates. Fk (x) is then a symmetric polynomial in al, ..., x,,. Hence by the main theorem for symmetric polynomials (Theorem 268) it follows that Fk(x), together with f(x), lies in F[x]. Further, because2,f'no(n - 1) (since t > 0 and (155.1), holds), and for the degree n* of Fk(x) the factor decomposition
(155.2) holds, it follows from the induction hypothesis that every Fk (x) has at least one zero in F(i). Now the number of pairs r, s occurring in (155.3) is equal to n*, whereas the number of polynomials Fk(x) equals n* + 1. Consequently there is a pair r, s (1 < r < s <- n) and two distinct k, 1(= 1, .... n* + 1) such that
a,a, + k(a, + a,) , x,as + 1(a, + x.,) both he in the field F(i). This is of characteristic 0, so that aS E F(i).
(155.4)
Hence
(ar - a,)2 E F(i) .
(155.5)
But the formula
(a + bi) (2a + 2 , /a2 + b''=) _ (a + Jag + b2 + bi)'2
(a, b E F) (155.6)
holds in F(i) where the radical Ja` + b2, according to the supposition, is an element of F. We may suppose that this is non-negative. The second factor of the left-hand side of (155.6) is then a non-negative element of F and
vanishes only for a = b = 0. Again according to the supposition, it also follows from (155.6) that every element of F(i) is a square in it. Therefore (155.5) may be replaced by or, - a,, E F(i). Hence, and from (155.42), we obtain a a, E F(i) by which the assertion for the special case f(x) E F[x] is proved.
From here the step leading to the general case f(x) E F(i)[x] is not difficult. We write f(x) = g(x) + ih(x)
FIELDS WITH VALUATION
612
with two polynomials g(x), h(x) from F[x]. The product
(g(x) + ih(x)) (g(x) - ih(x)) = (g (x))2 + (h (x))2 lies in F[x], and so has a zero a + bi in F(i) (a, b E F). If it is a zero of the first factor, then the proof is complete. Otherwise
g(a + bi) - ih(a + bi) = 0. Applying the isomorphism i --> -i of F(i) I F we get
g(a - bi) + ih(a - bi) = 0, i.e., f(a - bi) = 0. Consequently Theorem 393 is proved. As the suppositions of this theorem, according to Theorems 389, 390, are valid for F = Y(o), the corollary is likewise true. The above proof goes back to LAGRANGE whose explanations were, however, incomplete in one point. The first complete proof of the Fundamental Theorem of Algebra was due to GAUSS. According to STEINITZ'S theorem (Theorem 300) every field has an algebraically
closed extension field but only the existence of such a field is proved so that the importance of the Fundamental Theorem of Algebra remains undiminished. Although
it was from the very first called the Fundamental Theorem of Algebra, it achieved the character of a pure algebraic theorem only since the researches of ARTIN - SCHREIER
(1926). Furthermore it should be pointed out that the Fundamental Theorem of Algebra, in spite of its importance, in the present stage of development of algebra is not correctly named, but this name is retained for historical reasons.
By Theorem 328, . 7(0) (i) I.9r o) has only one non-identical automorphism
i ->- - i. Since here a + bi becomes a - bi (a, b E Y(o)), we call these numbers conjugate complex numbers (more precisely "conjugate complex numbers over .Fto,"). Here two conjugate complex numbers are denoted by a, &, and we say that a is the (complex) conjugate of a.
The field Y(O)(i) is not orderable since i2 = - 1, so that we can not define any absolute value in it in the usual sense of the word. But we call the non-negative real number I
a I = (acac)1 = (a2 + b2)1 ,
(155.7)
by long-established custom but very incorrectly, however, the absolute value of a for a complex number a = a + bi (a, b E to)) We shall show that this defines a valuation which we (again incorrectly) call the absolute valuation of Jr(o)(i). [Since we are dealing with an extension of the absolute valuation of Y(o), the absolute value in _7 (o) and Y(o)(i) can be similarly denoted.]
It is evident that I a = 0 only for oc = 0. For two arbitrary complex numbers a, i we have aP6i4 = oca(3B, whence, because of (155.7), the homo-
THE FIELD OF COMPLEX NUMBERS
613
morphy property I ap I = I a I I P I follows. It still remains to be shown that
I«+flI
This inequality is, because of (155.7), equivalent to any of the following:
«+# I2
(Ial+I
I)2,
I, («P+«p)2<4«aRP,
(aPizfl)250.
The last inequality is trivially true, since the real part of MA - a# vanishes. This proves the assertion. The absolute algebraic complex numbers are called algebraic numbers. These constitute a subfield of .7to) (i) which we denote by vl?o and call the field of algebraic numbers. According to Theorem 297, ,,eo is algebraically closed and at the same time the algebraic hull of .70 in.9r(o)(i). It follows also that o is the algebraic closure of Yo. (This explains the notation we have introduced.) The algebraic number fields shall simply mean the subfields of -eo. The real elements of,,-go are called the real algebraic numbers. These constitute the field of real algebraic numbers which is obviously equal to
o n .(o). THEOREM 394. Every ordered absolute algebraic field is order preserving isomorphic with a real number field. This follows from the corollary of Theorem 376 and from Theorem 387. EXAMPLE 1. For the square root of complex numbers we have the formula
Ja + bi =
ja+Vas+b$ + 2
i
-a+ Jas+b2 2
(a, b E Jr(,))
(155.8)
where we have to take for the radical aE -+b2 its non-negative value and for the remaining two radicals on the right-hand side such values that their product results in b. [This formula can also be easily obtained from (155.6).] EXAMPLE 2. Although K,0)(t) is not orderable, there are orderable (non-real) complex
number fields. In particular it follows from Theorem 394 that a complex number field of finite degree is orderable if, and only if, it is isomorphic with a real number field. For instance, let us take the three roots al, aE, as of x3 - 2 = 0 from X (0)(t). Among the three conjugate fields Kto) (ak) (k = 1, 2, 3) there is one real and two complex ones. Thus all three are orderable. EXAMPLE 3. From Theorem 394 we obtain the following more general consider-
ation: Let F be an absolute algebraic field of characteristic 0. In order to determine all the orderings of F, take all its isomorphic mappings into ddo fl .(o). To each such isomorphism there belongs an ordering of F, namely that for which the considered isomorphism is order preserving. All the orderings of F arise in this way.
614
FIELDS WITH VALUATION
EXAMPLE 4. In particular it follows from Example 3 that a number field of finite degree has as many orderings as isomorphisms in c74 n .F_(0). For instance, ,F'0( r2-) has two orderings. Which? EXAMPLE 5. We wish to denote, analogously to dlo, the algebraic closure of the prime field ., (p > 0) by c 1l, This field may also be obtained as the union of a suitable
chain of its finite subfields. For this purpose take a sequence n1, n2, ... of natural numbers with n, 1 n2 1 ... such that every natural number is a divisor of a suitable nk. Then take for every nk the uniquely determined finite field Fk of order pk contained in d4,. It is also evident that F1 c F2 s .. , and c4, = F, U F2 U ... EXAMPLE 6. The field (p > 0) has noteworthy properties. All its automorphisms are the mappings a -- a°k, where k may be any integer (cf. § 142, Example 2). They form an infinite cyclic group. They all map every subfield of d4, onto itself. For every natural number n, cl, has exactly one subfield of degree n. Also at,, has an infinite number of infinite subfields. For example, for every prime number q the union field of the subfields of degrees q, q2, . . . is an infinite subfield. The group of the roots of unity of order prime to p is isomorphic with c4;. EXAMPLE 7. If R is a countable ring, then, because of Theorem 13, there are only
countably many polynomials f(x) of a fixed degree n over it. Hence, and from Theorem
12, it follows that R[x] is countable. It is now obvious that all the fields at,, (p > 0) are countable. It follows more generally that the algebraic closure of a countable field is countable. We similarly arrive at the conclusion that, for a countable field F. every polynomial ring F[x,, ..., x.], furthermore the polynomial ring F[x1, x2, ...] of countably many indeterminates and the quotient field F(x1, x2, ...) are countable. Since.((,) is not countable (cf. the Example on Theorem 11), it follows from the above and STEINITZ'S second main theorem (Theorem 322) that the degree of transcendence of 9-(,,) is not countable. EXAMPLE 8. From the Fundamental Theorem of Algebra it follows that every irreducible polynomial over is linear or of degree two. EXAMPLE 9. In (93.1), (93.2), (93.3) we defined the character group G of a finite Abelian group Al. We now wish to specialize this definition so that we take for the
group, denoted there by 3, the group
(i)* of complex numbers other than 0.
This is possible, since this group, by the Fundamental Theorem of Algebra (corollary of Theorem 393) and by Theorem 314, contains exactly one cyclic subgroup of order m for every natural number m. For the specialization under consideration, the characters of the group 4 are called its complex characters. As it is usual to consider only these
characters (for finite Abelian groups), they are briefly called the characters of 4. The definition therefore runs as follows: let G be the set of homomorphic mappings of 4 into P',0, (i)*. Denote by as (E . 10, (i), # 0) the image of a (E 4) under an a (E G), for which (c(ab) = )
aab = as a b
(a, b E 4; a E G)
(155.9)
and define the product ai3 of two elements a, ig (E G) by ((ai3)a =) (a E 4; a, i3 E G). ai9a = as fla
(155.10)
G now becomes a group, namely the group of (complex) characters, or the (complex) character group of 4. All the assertions in § 93 hold for this case, as well as the fol low-
ing important character relations: _ In for a = e,
as
a C4 -
10 for a ll
e,
aE as
_
n for a = c 10 for a
e,
(155.11)
REALLY CLOSED FIELDS
615
where n = 0(4) (= O(G)), while e and e denote the unity elements of 4 and G, respectively. It is usual to call the sums in (155.11) character sums. Because of the principle of duality for finite Abelian groups it is sufficient to prove, e.g., (155.11,). Let S(a) denote the left-hand side of (155.111). For every b (E 4), it follows from b4 = 4 and (155.9) that S(a) = Y aab = Y, ((xa ab) = ab Y as = ab S(a), a
a
a
so that ((xb - 1)S(a) = 0.
If at 96 e, then there is a b such that ab 96 1, whence S(a) = 0. On the other hand ea = 1 for all a, thus S(e) = n. Consequently (155.11) is proved. For the applications of complex characters to number theory cf. HASSE (1950). EXAMPLE 10. The algebraic numbers with minimal polynomials in .R70[x]are called integral algebraic numbers. These constitute a ring (proof by Theorems 207, 268) with the quotient field c-4o; this ring is not a field and has no irreducible elements.
§ 156. Really Closed Fields
A formally real field without proper formally real algebraic extension fields is said to be really closed. (It would be more correct to say "really algebraically closed".) THEOREM 395. In a really closed field F every algebraic equation of odd degree is solvable, and F may only be ordered so that the positive elements are squares.
For the proof we take a polynomial f(x) of odd degree over F. We have to show that f(x) has at least one zero in F. Since f(x) has an irreducible factor of odd degree we may assume that f(x) itself is irreducible. Then since, according to Theorem 372, the extension field F(a) with f(a) = 0 is formally real, we must have F(a) = F, a E F. The first assertion of the theorem is now proved. In order to prove the second assertion, we denote the set of squares other than 0 from F by P, and suppose that F is ordered in some manner. It suffices to prove that P is necessarily the positivity domain of F. For that purpose consider an element y ( 0) of F - P. Then since x2 - y is irreducible, there exists an extension field G = F( .,/y). Because G D F, G is not formally real, so that there exists an equation
- 1 = k=1 , (k + Yk V r)2
lake Pk E F; k = 1, ..., n).
In this the coefficient of .Jy must be equal to 0, consequently
-1 =k (ak +Yky). QQ
(156.1)
616
FIELDS WITH VALUATION
Since F is formally real, it follows that y is not a sum of elements from P. Since such a sum cannot be 0 either, we see that P is closed with respect to addition. Since a similar statement holds for multiplication, P is a semiring. Since, moreover, the elements n
n
k=1
k=1
lie in P and (156.1) agrees with a + fly = 0, -y = q9(#-1)2 belongs to P. It follows that for every element S of F one of the propositions
6=0, 6EP, -bEP is valid. But since two of them evidently cannot hold, P must be the positivity domain of F. Consequently the theorem is proved. THEOREM 396. A formally really field F is really closed if, and only if, the complex field F(i) is algebraically closed. If F is really closed, then, according to Theorems 393, 395, F (i) is algebrai-
cally closed. Conversely, let us assume the latter. Since [F(i) : F] = 2 is a prime number, there is no field G such that F c G c F (1). Since, on the other hand, according to the assumption, F(i) is the algebraic closure of F, it follows that every proper algebraic extension field of F is isomorphic with F(i). Since this field is not formally real, because i2 = -1, F is really closed. This proves Theorem 396. THEOREM 397. The field ,eo (1 'tol of the real algebraic numbers is, apart from isomorphism, the only really closed absolute algebraic field, and the field - 'o of the algebraic numbers has degree 2 over it. We write
For every element a = a + bi of vgo (a, b E Y(o) the conjugate a = a - bi t?o. As i E -o, so
lies in
a=.(a+a), b=4(a-a) o and so also in .7. Hence a E 9'(i), and consequently ufo = .}'(i). This means, by Theorem 396, that Y is really closed, proving that the last assertion of Theorem 397 is true. Conversely, let us consider a really closed absolute algebraic field LP. By Theorem 394, we may assume that 4g is a subfield of .7. The subfield 4 (i) of o is algebraically closed, because of Theorem 396, so that 4 (1) = moo.
lie in
Since [vfo :.f] = [,,go : 4t], and 4
Y, it follows that dot = Jr. This
completes the proof of Theorem 397. For a proof of the existence and uniqueness of the really closed algebraic field which makes no use of d,, see VAN DER WAERDEN (1955).
ARCHIMEDEAN AND NON-ARCHIMEDEAN VALUATIONS
617
§ 157. Archimedean and Non-Archimedean Valuations
Those valuations in which the value field is Archimedean-ordered are the ones most frequently used. In this case, by Theorem 387, we can assume without loss of generality that the value field is equal to.7(o), and then we speak of a real valuation. Henceforth we shall consider only these valuations. First, the following theorem holds : THEOREM 398. Two real valuations 99, ip of a field . are equivalent if, and
only if, there exists a positive real number K such that V(a) = (gp(a))K
.
(157.1)
This is evidently true when cp or ' is the trivial valuation. In the rest of the proof we shall disregard this case. The assertion "if" is true, since in (157.1) the q'-zero sequences and p-zero sequences formed from -9ragree with one another. In order to prove the "only if" part we assume that q, tp are equivalent. First we shall show that
q7 (a) < p (b) a ip (a) < ip (b) .
(157.2)
From the left-hand side it follows that 99(ab-1) < 1, therefore [(ab-I)"] is a 99-zero sequence, consequently a p-zero sequence. This means that +o(ab-1) < 1, i.e., the right-hand side of (157.2) holds. By symmetry this proves (157.2). We now take a fixed element c such that q'(c) > 1. Such a c always exists,
since q' is not the trivial valuation and 9(c)9(c-1) = I for c 0. Because of (157.2), +p(c) > 1. We consider an arbitrary element a (0 0) of.9rand put 93(a) = (q'(c))" ,
V(a) _ (+Y (c))"'
for two suitable real numbers a, a'. For these we show that a = a'. We take two integers in, n (n 0 0) with
Then (92(c))mn -' < (92(c))"
= 92(a) ,
and so 99(c'")<99(an
From this and from (157.2), we get o(c'") <
and then (Y'(c))n n
tV(a"),
' < V(a) = (w(c))"
(157.3)
FIELDS WITH VALUATION
618
As this gives
m
n
<_ cc', so a <_ a'. Likewise a' < a, consequently a = a'.
Finally we put ip(c) =
(99(c))K
for some positive real number K. Hence, and from (157.3), the validity of (157.1) follows, since a is equal to a' for all a 0. Even a = 0 is not an exception, therefore the theorem is true. A real valuation 99 of a field Y is said to be an Archimedean or nonArchimedean valuation, according as there are elements n (= 2, 3, ...) in .s' such that 9'(n) > 1, or qq(n) < 1 for all n. Since the trivial valuation is non-Archimedean, it follows from the corollary of Theorem 377 that the fields of prime number characteristic have only non-Archimedean real valuations. The absolute valuations of Y (0) and of .7(0)(i) are Archimedean-. THEOREM 399. A real valuation T of a field.Y is non-Archimedean if, and only if, we always have p(a + b) < max (qu(a), qq(b)) . (157.4)
If (157.4) holds, it follows by induction that gq(al +
... +
a") < max (9q(a1), ..., 99(a")) .
Therefore 99(n) 99(1) = 1 for all n = 2, 3, ..., and so 99 is nonArchimedean. We now suppose that 99 is non-Archimedean. It follows that
((p(a + b))" = 99 ((a + b)") <
ko
On account of the supposition 99
((;)an_kbk)
11
kI
1
(n = 2, 3, ...) .
<_ 1. Therefore from the homo-
morphy property of 99, (q9 (a + b))" 5 (n + 1)µ" (,u = Max (qu(a), qq(b))). If now a and b are not both 0, then
(j&-'99(a + b))" < n + 1
.
(157.5)
On the other hand, for every positive 6
(l+by >1+(;)a2 and so it follows from (157.5) that p -'99(a + b) < 1, i.e., (157.4) holds. Since (157.4) holds also in the case a = b = 0, Theorem 399 is proved.
619
ARCHIMEDEAN AND NON-ARCHIMEDEAN VALUATIONS
NoTE. Since from (157.4) it even follows that qq(a + b) < 99(a) + p(b), we can also characterize the non-Archimedean valuations as follows: a non-
Archimedean valuation 99 of a field Y means a homomorphic mapping a -> qu(a) of .7+ into . 'tot such that (157.4) and 99(0) = 0, q(a) > 0 (a hold. Since, from (157.4), for every positive real number x (99(a + b))" < max (9,(a))K
0)
(9' (b))K) ,
,
this implies that both qu(a) and V(a) = (qu(a))" are non-Archimedean valuations. By Theorem 398, we see that an Archimedean and a non-Archimedean valuation can never be equivalent. EXAMPLE. In connection with Theorem 398 it should be noted that (157.1), for a given Archimedean valuation 99, does not yield for all x (> 0) a valuation w, since for this to be the case we must have
(P(a + b))" 5 (q,(a))" + (99(b))". The K such that 0 < it < 1 are always suitable, since then from 9'(a + b) < qu(a) + 9'(b) according to a well-known theorem we get (99(a + b))' S (qu(a) + 9p(b))" < (9p(a))" + (9v(b))'
In the special case F =
.
(.), qu(a) = ; a I no x > I is suitable.
§ 158. Exponent Valuations By contrast to what we have done up to this point, from now on we shall denote a field with valuation by F, while.{(,) will always represent a value field.
We consider a non-Archimedean valuation 99 of a field F. By the note at the end of § 157, this implies a mapping a --> qu(a) of F into .7(o) with the properties c(o) = 0, 99(m) > 0 (a
0) ,
(158.2)
97(aJ) = 9'(a)99(J9) ,
cp(a + (9) < max (qu(a), T(f))
(158.1)
.
(158.3)
Instead of this function q, it is convenient to introduce the function w(a) = -I og" 99 (a)
(a 0 0) ,
(158.4)
where we take an arbitrary fixed real number g (> 1) as base of the logarithm. (Here we have excluded a = 0 although this is immaterial.) We call
620
FIELDS WITH VALUATION
w an exponent valuation of F and shall prove that it has the following properties :
w(a) E 9o;
(cc E F, * 0) ,
(158.5)
w(aq) = w(a) + w((3) (aft # 0) ; (158.6) (158.7) w(a + P) > min (w(a), w(f)) (a, 6, a + f # 0) . Before giving the proof we note that Z min (w((xl), ..., w(Q) (al, . . ., a,,, al + ... + (Xn#0) w(al + ... + (158.8) follows more generally from (158.7) If, moreover, a is a root of unity in F, q9(a) = 1, and so w(a) = 0. Besides, from g (-a) = qu(a), q9(a-) = 99 (a)-1 the properties w(- cc) = w(a)
w(a 1) = - w(a)
(0100), (0,00)
(158.9) (158.10)
follow.
The equivalence of the two definitions of w above becomes obvious if we write (158.4) in the form 99(a) = 9-W(a)
(a 96 0) .
(158.11)
(The appearance of w (158.11) in the exponent is the reason for the term `exponent valuation".) NoTE. We also use the exponent form w, defined by (158.4), of a real valuation 99 if we can prove that this valuation is non-Archimedean. We see from (158.5), (158.6), (158.7) that an exponent valuation w of a field F means a homomorphic mapping a --> w(a) of the group F* into the module . '(,,,) which also satisfies condition (158.7).
If necessary we suppose that the definition of w (a) is extended to include
the case at = 0 by putting w(0) = oo, which means that w(0) is "greater" than all the other w(a). Here (158.7) remains true. (158.6) remains also true if we put oo + w(a) = Co. We emphasize once more that non-Archimedean valuations and exponent valuations differ from each other only immaterially, so that the previous definitions and statements, so far as they concern non-Archimedean valuations, may also be applied directly to exponent valuations. Similarly, future statements about one of these valuations will be taken as valid for the others. We say that the (non-Archimedean) valuation 99 and the expo-
nent valuation w correspond to each other, if (158.4) [or (158.11)] holds for a fixed g (> 1). For instance, a sequence [a;] from F is a w-zero sequence if, and only if, to every real number C there is an N (E -/") such that w(a;) > C
(i > N) .
EXPONENT VALUATIONS
621
On account of Theorem 398, the exponent valuations equivalent to an exponent valuation w(a) are given by ctii'(a) ,
where c can be any positive real number. THEOREM 400. Instead of (158.7) for w(a) # w(fl) we even have
w(a + p) = min (w(a) , w(p))
(158.12)
.
Let us assume, e.g., that w(a) > w(8). If the assertion is false, then, because
of (158.7), w(a + fl) > w(8). On the other hand, from (158.7) and (158.9), w(fl) =
a + a + p) > min (w(oc), w(a + 6))
.
From both results w(fl) Z w(a). This contradiction proves the theorem. Exponent valuations are, in several respects, simpler than Archimedean valuations, as we shall see later. First of all, in addition to Theorem 379, the following also holds : THEOREM 401. A sequence [a;] from afield F with the exponent valuation
w is convergent, if [a; - ai+i] is a zero sequence. For this, it is necessary that the terms of the sequence [w(a;)], except for finitely many, are equal, unless [a;] is a zero sequence.
If [a; - a,+,] is a zero sequence, then, for every real number C there is a natural number N such that
w(a;-(zi+I)>C
(i>N).
Because 1-1
ak - al = X (ai - a! +I) !=
(0
and because of (158.8), it follows that
W(ak - a;) > C
(k, 1 > N).
The first assertion of the theorem is now proved. In order to prove the second assertion, we suppose [a;] to be convergent and not a zero sequence. To every real number C there exists a natural number N such that we always have w(a; - (xk) > C (1, k > N). (158.13) If the assertion is false, then for every i (> N) there is also a k (> N) with w (a;) # w (ak) .
FIELDS WITH VALUATION
622
For such a pair a;, ak, because of (158.9) and Theorem 400, (158.13) becomes min (w(00, w((Xk)) > C C.
From this we obtain w(a;) > C so that [a;] is a zero sequence nevertheless. This contradiction proves Theorem 401. THEOREM 402. If [a;] is a convergent sequence, but not a zero sequence, from a field F, which is perfect with respect to an exponent valuation w, then
w(lim a) = w(aN) = w'(aN+1) =
...
(158.14)
for a suitable N.
In order to prove this, we put a = lim a,. By Theorem 400 we have: w(M) 0 w(a)
w(a, - a) = min (w(at), w(a)) .
But since [a; - a] is a zero sequence, Theorem 402 follows. We now consider an arbitrary field F with an exponent valuation w, in order to define the following important notions. By the value module of F we understand the submodule of Y o) consisting of all the values w(a). When it is denoted by %, then according to (158.6) we have the homomorphism
F* -
(a - w(a))
.
(158.15)
Further we take into consideration those elements a of F for which w((X) z 0.
(158.16)
[Thus, these elements are 0 and the elements of F which are mapped onto the non-negative elements of fin the homomorphism (158.15)]. We shall show that these elements a constitute a subring of F, which we denote by R = R,4, and call the valuation ring of F (with respect to the exponent valuation w). We notice at the same time that, because w(l) = 0, R contains the unity element of F. In order to verify the ring property of R, let us take two elements a, fi from R. Because w(a), w(f) >- 0, it follows from (158.6), (158.7), (158.9) that w(a - A), w(afl) >_ 0 ,
and then a - fl, afi E R. Accordingly, R is, in fact, a ring. We shall show that the elements a with w(a) = 0 are all the units of R.
(158.17)
EXPONENT VALUATIONS
623
For, the units of R are those elements a of F for which both w(a) Z 0 and w(a-') >_ 0 are satisfied. Because of (158.10), the assertion follows. We shall show that the elements a of F with the property w (a) > 0
(158.18)
i.e., the elements of R other than the units constitute a maximal ideal of R, which we denote by p = pw and call the valuation ideal with respect to the exponent valuation w. We shall also show that p is a prime ideal. The ideal property of p follows at once from (158.6) and (158.7). The maximal property of p follows from the fact that every ideal of R containing
a unit must be equal to R. Since, by (158.6), if w(a/4) > 0, then either w(a) > 0 or w(fl) > 0, so p is a prime ideal. (This also follows from Theorem 186 as well as from the above.) is a field, From Theorem 130 it follows that the factor ring R/.p = which we denote by 2a = tSW and call the valuation factor field with respect to the exponent valuation w. THEOREM 403. Two exponent valuations w, w' of afield F are equivalent if, and only if, the valuation rings R, R, are equal. It is clear that Rw = F if, and only if, w is the trivial valuation. Hence the theorem follows for the case where one of w, w' is the trivial valuation. Therefore we exclude this case in future. If now w, w' are equivalent, then according to Theorem 398, w'(a) = cw(a)
(a E F)
(158.19)
for a positive real constant c. Since the propositions w(a) z 0, w'(a) > 0 are then equivalent, R,, = R,,.. Conversely, let us assume the latter. From this it follows that ,pW = ipW therefore
w(a) > 0 . w'(a) > 0. We consider two elements at, # such that w(a), w(f) > 0, whence w'((x), w'(fl) > 0. For arbitrary integers m, n, according to (158.6), w(a'"(3") = mw(a) + + nw(fl). The same holds for w' instead of w, thus
mw(a) + nw(f) > 0 a mw'(a) + nw'(f) > 0. Hence it evidently follows that w(a)(w(fl))-' = w'(x)(w'(f))-'. This implies the existence of a positive real number c which has the property that (158.19) holds for all a such that w(a) > 0. Because of (158.10), the same is true for
w(a) < 0. Since, finally, the units a of Rw = Rw are characterized both by w(a) = 0 and by w'(a) = 0, (158.19) is true generally. Thus Theorem 403 is proved. 21 IL-A.
FIELDS WITH VALUATION
624
According to this theorem the determination of the possible (non-equivalent) exponent valuations of a field F is reduced to the examination of all its valuation rings. THEOREM 404. All the valuation rings of a field F are F itself and those maximal subrings of F with unity elements which are not fields. First of all we suppose that R (A F) is a valuation ring of F. Let w denote
the exponent valuation of F belonging to it. Because w(1) = 0, we have 1 E R. From the definition in (158.16) it follows that for every element a of F other than 0 at least one of (x, ac-' belongs to R. Since then F is the quotient field of R, R itself cannot be a field. We still have to show that for every element a of F - R the ring {R, a) is equal to F. By hypothesis we have w(a) < 0. Consequently for every element # of F -nw(a). Hence there is a natural number n such that w(/3a - ") ? 0, fa " E R, # E { R, a},
therefore {R, a} = F. We now consider, conversely, a maximal subring R of F containing the unity element which is not a field. We have to prove that R is a valuation ring of F. First of all we show that, for every element a of F other than 0, one of a, a-1 lies in R. With this aim we suppose that a is not an element of R, so that it follows that the ring {R, a) is equal to F, and hence a
I =f(a)
for a polynomial f(x) over R. Consequently there is a principal polynomial F(x) over R with
F(a-I) = 0.
(158.20)
Let n denote the degree of F(x). Since, by hypothesis, F is the quotient field of R, there is an element P (A 0) of R with e,
ea_I....
Pa-("-1) E R
.
(158.21)
Here we may postulate that e is not a unit in R. Hence it follows, because
PP-' = 1, that e-' does not lie in R. If e-' lay in the ring {R, a-1}, this would imply that 9 e- = g((X-' )
for a polynomial g(x) over R, whose degree may be taken as n - 1 at most, because of (158.20). But, because of (158.21), it follows from this that
P-I = PP 2 = Pg(a-')
EXPONENT VALUATIONS
625
belongs to R, which is impossible. Since, according to this, lie in the ring {R, a-1} then
e_2
does no
R c {R,a-1} c F. Now by hypothesis, " = " must hold on the left-hand side, so that it follows that a-1 E R, as was asserted. We now denote by U the group of units of R and by that module which arises from the factor group F*/U after passing over to the additive notation. Then, f consists of the cosets ocU (a E F, 0); further the rule aU -I- 19U = ajU
(158.22)
holds inffor the addition of elements. a relation < by We define in ocU < flu cs a-'# E R - U .
(158.23)
This is well defined, since if ocU = a1U, 19U = fl1U then the propositions " fl E R - U, al-'til E R - U are equivalent. Since, for a, fl, y (E F, 0) the rule
a
f-'y ER-U=a-')'ER-U
holds, this relation < is transitive. For any two elements ocU, flu of4', of the three propositions
aU = flu, aU < flu, flu < aU exactly one holds, for if a laFof does not belong to U, then according to what has been stated exactly one «_ Ill,
,q_la ( =
(a-1N)-1)
belongs to R - U. Hence, we have defined an ordering relation < in 44f by (158.23).
Addition is monotone since, if
aU
0) it follows from (158.23) that ocyU < flyU,
i.e., by (158.22),
aU -j- yU < flu -- yU
.
According to the second part of Theorem 366 the modulecAf is ordered; further its positivity domain is the semimodule consisting of the elements
626
FIELDS WITH VALUATION
aU (> U), where we have to take into consideration that U is the zero
element of.
We show that the ordering of
is Archimedean. For this purpose
let us consider two elements aU, 9U (> U) of Lam. Since a'1 does not belong to R, the ring {R, a-1} is equal to F, and so 14-1 E {R, a-1}. Accordingly
14-1 = h (a-') for a polynomial h(x) over R. Therefore it follows from a E R that fl- Ian E R for a suitable natural number n. This implies that 14U < naU, so that f is in fact Archimedean-ordered. For any two elements a, fl of F with a, fl, a + j rA 0 the relation (a + fl)U >_ min (oU, 14U)
(158.24)
also holds. Let aU <_ 14U. Then according to (158.23) a-116 E R, thus a -'(a + 14) E R, i.e., aU < (a + #)U. Hereby (158.24) is proved. By Theorem 387, /K is order preserving isomorphic with a submodule of tat. We carry through the corresponding embedding, but keep the previous notation for the elements of L f. By (158.22), (158.24), an exponent
valuation of F is then defined by w(a) = aU (cc E F, # 0). The valuation ring of F belonging to it consists of those elements a (E F) for which either
a = 0 or a U >_ U. Now we see from (158.23) that these a are exactly the elements of R. Consequently Theorem 404 is proved. A supplement of this theorem is the following THEOREM 405. The valuation ideal of a valuation ring R (# F) of a field F is the only proper prime ideal of R. Since, by Theorem 404, R is different from 0 and not a field, there is an element a in it which is neither 0 nor a unit. Since a-1 does not belong to R, the ring {R, a-' I is equal to F. Therefore for every element fi (& 0) of a proper
prime ideal p of R an equation #-' = f(a-) holds for some non-constant polynomial f(x) over R. If its degree is denoted by n, it follows that a"f(a -') E R,
a" = 9 . a"J (a_ 1) E ) .
Since p is a prime ideal, it follows that a E p. Accordingly, we have found that the elements other than the units of R all belong to ,p. Since, on the
other hand, p can contain no units, it is necessarily the valuation ideal. Since this is evidently a proper prime ideal of R, the proof is complete. We see that the definition of an exponent valuation of the field.F (,) relies only on its module .""( , and also remains meaningful if instead of this module we have recourse to an arbitrary ordered module. In this way, according to KRIJLL (1932), the so-called general valuation w of a field F is defined as a homomorphic map-
ping a - w((x) of the group F* onto an ordered module.,it with the property w(a + j9) > min (w(a), w(8))
(a, fi E F; a, /3, at + f 96 0).
DISCRETE VALUATIONS
627
Again of is called the value module, which occurs instead of the value field. If f is Archimedean-ordered, then we return to the exponent valuations Accordingly, general valuations essentially include all the exponent valuations, but not the Archimedean valuations. Cf. also PICKERT (1951) and FucfLS (1951).
ExAMPLE. In a field which is perfect with respect to an exponent valuation, an infinite series a, + a$ + . . . is convergent, by Theorem 401, if, and only if, lim a, = 0.
§ 159. Discrete Valuations A non-trivial exponent valuation w of a field F with cyclic value module
is called a discrete valuation. So we may now take the value module in the form {c}, where c is a positive real number. Because of the order preserving isomorphism .Y* ti { c} (n -+ nc) it is possible in the case of a discrete
valuation, by passing over to an equivalent valuation, to take,7+ as a value module. In this case we call the discrete valuation normed. THEOREM 406. A non-trivial exponent valuation is discrete if, and only if,
among the positive elements of the value module there is a minimal one. We have to prove only the assertion "if". We consider a non-trivial expo-
nent valuation w such that among all the positive values w(a) there is a minimal value w(b). To every w(a) with a # 0 we may assign an integer n such that nw(b) 5 w(a) < (n + 1)w(b) . Then
0 5 w(a) - nw(b) = w(ab-") < w(b). Now w(a) = nw(b) follows from this because of the assumption. Hence the theorem is proved. THEOREM 407. The valuation ring belonging to a discrete valuation is a
principal ideal ring with (prime decomposition and to within associates) only one prime element which is a generator of the valuation ideal. Conversely, if a principal ideal other than 0 belongs to an exponent valuation as valuation ideal, then the valuation is discrete. In order to prove this, let us consider an exponent valuation w of a field
F with the valuation ring R and the valuation ideal p. First of all we suppose w to be discrete. Let a be a proper ideal of R. Since the elements of a have non-negative values, a contains an element
a (# 0) such that for all the elements P (# 0) of a, w(f) > w(a) > 0. Hence w(fla-1) Z 0, i.e., floc-' E R. This means that P lies in the principal ideal (a), i.e., a = (a). Thus R is a principal ideal ring. In the following, for the sake of convenience, we suppose that w is normed. Let n denote an element of p such that w(n) = 1. For every element a (00) of
R, w(a) = k is a non-negative integer. From w(aaa-k) = 0 it follows that
FIELDS WITH VALUATION
628
an-k = e is a unit of R. Because a = nnk, R has the single prime element n. Asp consists of 0 and the a such that w(a) = k > 0, sop = (n). Conversely, let us suppose that the valuation ideal p is a principal ideal (n) other than 0. For every element a of p, an-1 E R, thus w(an-1) Z 0, w(a) Z w(n). According to this w(n) is the least positive element of the value module. This implies, by Theorem 406, the validity of Theorem 407.
THEOREM 408. Let F be afield, perfect with respect to the nonmed discrete
valuation w, with the valuation ring R and the valuation ideal p. Take fixed elements ..., n_1i no, nI,... from F with
(i = 0, ± 1, ...)
w(n;) = i
(159.1)
and a system of representatives 91 of R mod p, containing the element 0. Then the infinite series a = amnm + am+lnm+1 + ...
(am,
am+1,
.. E 91; am }o 0)
(159.2)
are all the different elements (# 0) of F, where m can be any integer, and w(a) = m .
(159.3)
(If, on account of Theorem 407, we choose an element n such that p = (n), we may then write n; = n'.) We already know (cf. the Example at the end of the preceding paragraph) that the infinite series (159.2) is convergent. In order to prove (159.3) we denote the i`h partial sum of the right-hand side of (159.2) by a;. Because of (159.1) and Theorem 400, w(i) = w(amnm) = m
(i = 1, 2, ...) .
Since, on the other hand, a = lim at,
lim (aj - a) = 0. Hence, and from Theorem 400, it follows that w(a) m is impossible and so (159.3) is true. Then we show that every element a (# 0) of F may be written in the form (159.2). For if we put m = w(a), then w(an;,f) = 0 follows, thus anml is a unit in R. Accordingly am (mod p)
for some am (
0) from 91. Since w(mn
- xm) >_ 1, so
w(a - amnm) >_ m + 1.
629
DISCRETE VALUATIONS
We suppose that for an s (> m) with any an, ..., as (E R) whatever, W (a - an,n,n - ... -- a ac) >- S -1-
1.
(159.4)
This is, according to the above, in fact the case particularly for s = m. If we
now temporarily denote the expression in brackets on the left-hand side s + 1, and so w (fins+1) >_ 0. Accordingly of (159.4) by fl, then w
s+l = a:+l (mod p) for an as+1 from RJR, thus we have
w(f -as+1n.,+1)Zs+2. This means that (159.4) is satisfied for s + 1 instead of s. By induction it follows that elements am (s' 0), am+l, ... of ll exist such that (159.4) holds for all s = m, m + 1, .... This implies the validity of (159.2). Lastly, in order to prove the uniqueness of the representation of a in the form (159.2), it should be noted that, by (159.3), the number m on the righthand side of (159.2) is uniquely determined by a. Consequently every further representation, similar to (159.2), of a is of the type : a = I9mnm + 1m+1nm+1 +
where Ym, m+t,
are elements of M. After subtraction we obtain
0=(01m-Ym) am +((Xm+1-Nm+1)nm+1+ If a,, - fl, were the first non-vanishing coefficient on the right-hand side, then the right-hand side would have the value s which is, however, false. This completes the proof of the theorem. § 160. ,p-adic Valuations
So far we have become acquainted with only one application, though a very important one, of valuation theory which consists in constructing the field (o) as the perfect hull of the field .5 with the absolute valuation and, proceeding from here, the field,) (i) as the algebraic closure of Y(o). Now we shall explicitly give certain further valuations which similarly
lead to important applications. Let a field F and a subring R of F with unity element and with prime decomposition be given. Suppose also that F is the quotient field of R.
FIELDS WITH VALUATION
630
Let n be a prime element of R. The elements a (96 0) of F may then be represented in the form
a=
aw(")
(K, A E R; n X K, A)
T,
(160.1)
where the exponent w (a) indicates an integer uniquely defined by a. (Notice
that if a E R, 0 then Tcw(") I I a.) We shall show that w is a discrete and normed exponent valuation of F, which we call the z-adic valuation of F and denote by w,,. Further, we shall show conversely, that every discrete valuation is equivalent to a FI-adic valuation. It should be noted that w,, (for fixed F and R) depends only on the prime ideal ,p = (n), therefore w,, is also denoted by w, and called a p-adic valuation. We repeat that the normed discrete valuations and the (n-adic or) p-adic valuations are identical.
For the proof we consider a further element fi (s 0) of F and write this as in (160.1) in the form
vER;aXµ,v).
_IFIw(") V
(160.2)
Since
Av
w(aj9) = w(a) + w(fl). If further, e.g., w(a) < w(p), then from (160.1) and (160.2) we obtain
a+#=
(rcv + Ap.,k),w(a)
(k = w(fl) - w(a) Z 0) .
vv
This gives w(a + i) w(a). Thus, w(a + fi) Z min (w(a), w(,B)) always holds. According to this, w is an exponent valuation. It is also discrete and normed, since the value module is evidently equal to 7+. Let us consider, conversely, a discrete valuation w of F. Let the valuation ring and the valuation ideal be denoted by R and ,, respectively. By Theorem 404, F is the quotient field of R. Furthermore, R, according to Theorem 407 is a ring with prime decomposition and ac is the unique prime element for which 4 = (ac). Thus the ,p-adic valuation w,, of F certainly exists. We even show that wand w , are equivalent. For this purpose let us consider an arbitrary element a (A 0) of F. This may be represented in the form
x= TK nwv
,
where K, A are units of R. Thus w(a) = w(9c) wp(a), so that W. wy are, in fact, equivalent.
P- ADIC VALUATIONS
631
Now, in general, we understand by a p-adic field any field which is perfect with respect to a p-adic valuation. (In the literature, it is usual to call only certain special cases of ?these "p-adic fields". Cf. HASSE (1949). Further we call the perfect hull of a p-adic valuated field the p-adic hull of this field. The p-adic hulls of algebraic number fields are called p-adic number fields. The elements of a p-adic number field are said to be p-adic numbers. We know from Theorem 408 that the elements of a p-adic field F (since this is just a field perfect with respect to a discrete valuation) may be uniquely given as infinite series
a = an,'G"' + am+1 arm+I +
(am, am+l, ...E R; am # 0)
,
(160.3)
where n is a generating element of the valuation ideal p and 91 denotes a system of representatives mod p of the valuation ring R of F, containing the element 0. We call (160.3) a normed p-adic series. (Hence the term p-adic field. Of course (160.3) is convergent and represents an element of the field,
are arbitrary elements of the valuation ring. Then when am, am+l, (160.3) is simply called a p-adic series.) _ In particular, we have to consider the p-adic hull F of a field F. Let w denote the p-adic valuation, R the valuation ring of F belonging to it and p itself the valuation ideal of R. We denote by w the valuation of F (in exponent form). Every element (-A 0) of r arises as a = w-lim ai , where [a;] is a w-convergent sequence from F. According to Theorem 402,
w(x;)=c for a suitable N (E 4") and an integer c for all i > N, whence w(a) = lim w(a,) = c .
Consider a further element P (# 0) of F : # = w-lim 9, , where [i4;] is a suitable w-convergent sequence from F. If, moreover,
a+#
0, then w(ac + 0) = lim w(a; + 9,) .
Hence, it immediately follows that the valuation w (together with iv) is non-Archimedean. We have also found that the value module of w like 21/a R.-A.
632
FIELDS WITH VALUATION
that of w is .7+, thus w is a discrete valuation of F. Let the valuation ring R and the valuation ideal belong to it. Then w itself is the -adic valuation of F and consequently is a P-adic field. Let n be a generating element of ,p, i.e., an element from F such that w(n) = 1. Since w is a continuation of w, w (7c) = w (n) = 1, thus 7L is also a generating element of . Furthermore, since the elements ( : A 0) of F are the w-limits of w-convergent sequences from F, this means, in other words, that the infinite series (160.3)
formed from F furnish all the elements a (0 0) of F. Since an, elements (
0, the
0) of R and P are given by m
0 and m > 0, respectively. We wish to apply this to the case F = . With this end in view we
proceed from the subring 7 of .9' ; this has . for quotient field and is a ring with prime decomposition. Consequently, a p-adic valuation of 3 belongs to every prime number p, which we call the p-adic valuation of 5 and denote by wp. wp (a) is, according to the general definition (160.1), that integer for which an equation of the form
a=
pw,(a)
(k, I E J"; p X k, 1)
(160.4)
holds, where a can indicate any element of .50 other than 0. The corresponding p-adic hull of 0 is denoted by p) and called the p-adic number field.
(Of course p1 has characteristic 0.) Its elements are the p-adic numbers. Since in J' the numbers 0, . . ., p - 1 constitute a system of representatives mod p, the p-adic numbers are given by the p-adic series a = am pm + am+l pm+l + .. .
(ann am+1, ... = 0, ..., p - 1) , (160.5)
where m can be any integer. The cases where am # 0 in (160.5) are the normedp-adic series, which furnish all the different p-adic numbers. Ap-adic number a is called a p-adic integer, if a may be represented in the form (160.5) with m Z 0. The p-adic integers constitute a ring, which is called the ring of p-adic integers. This is the valuation ring of pl with respect to wp. The units of this ring are called the p-adic units. These are the special cases of (160.5) given by m = 0, am 0. From (160.5) it follows that the elements (0 0) of 3 p) may be uniquely given in the form pop, where @ is a p-adic unit and m an integer. It is evident that the (infinitely many) p-adic number fields .21, Y(3), . . . are not pairwise isomorphic. For an easy application of p-adic number fields see HASSE (1935). EXAMPLE 1. The p-adic field contains non-rational algebraic elements, i.e., algebraic elements lying outside F.. For instance, let us take an equation such as
x"- a =0
(a E J; p,f'n, a),
(160.6)
633
P-ADIC VALUATIONS
which has no solution in .f`-,. This means that a is not an nth power in J. Furthermore, let us suppose that in 3 the congruence
x" - a = 0 (mod p)
(160.7)
has a solution at, (= 1, ..., p - 1). We shall show that (160.6) then has a solution a in .",,, for which moreover a = ao (mod p) . (160.8) Since ao - a = 0 (mod p), and because of the polynomial theorem, we may determine
a sequence of integers a,, a2, ... (= 0, . . ., p - 1) so that all the congruences
(a,+alp+...+a,p'?-a=0 (modp'+') (i=0,1,...)
...
hold. It is evident that the p-adic number a = ap + a1 p +
is then a solution
of (160.6), for which (160.8) also holds. EXAMPLE 2. The field .t',,, is not algebraically closed, since, e.g., the equation xE - p = 0 has no solution in it. This follows simply from the fact that p is a prime element in the valuation ring of . ,,,. We call the algebraic elements of . ",,, algebraic p-adic numbers. EXAMPLE 3. The field .3r",,, is not countable, as is obvious from (160.5). Hence, by the same inference as in § 155, Example 7, it follows that the degree of transcendence
of F(,) is not countable. EXAMPLE 4. The field .97,,, is not orderable. For the case p 96 2 we see this as follows: Since the congruence x2 + p - 1 = 0 (mod p) is solvable in .7, there exists, according to Example 1, a p-adic number a such that a2 + p - 1 = 0. Because -1 = = aQ + p - 2 the assertion follows from the ARTIN- SCHREIER theorem (Theorem 371). If p = 2 prove as in Example I that in ' (2) the equation as + 7 = 0 is solvable. Because - I = a2 + 6 the former inference again holds. EXAMPLE 5. The polynomial x°-1 = 0 splits into linear factors over the field
.-%,,) [i.e., -2-&) contains p - 1 (p - 1)`h roots of unity ]. This follows from Example 1,
since the congruence x" = 0 (mod p) has the solutions 1,
.
. .,p - 1.
§ 161. Ostrowski's First Theorem THEOREM 409 (OSTROWSKI's first theorem). All the real valuations of the fields F are (apart from equivalent valuations) the trivial valuation, the absolute valuation and the p-adic valuations. In the first place we know that the valuations listed above are not equi-
valent to one another. We now consider an arbitrary non-trivial real valuation q' of .5. At first we suppose 92 to be Archimedean. Since p(1) = 1, we have qq(n) 5 n (n = 0, 1,
. .
.). Because q7(- a) = qu(a)
,p(n)<=Ink
,
(n=0,±1,...).
(161.1)
We shall show that p(b) < (max (1, qu(a)))'°s"b
(a, b = 2, 3, ...) .
(161.2)
FIELDS WITH VALUATION
634
For every natural number n we may put
b' =co+ cia+... +Cmam (0
where co, ..., c., are integers. Since then a' _< b", it follows that
m<_nlog,b.
(161.4)
Further from (161.1) and (161.3)
(99(b))" < a (1 + q7(a) + ... + (9,(a))) . If, therefore, we put
It = max (1, 9' (a)),
(161.5)
then because of (161.4) (qq(b))" < a(m + 1),u"' < a(l + n log, b)µ" Iogab , thus, (µ-IoSab qq(b))"
(n = 1, 2, ...) .
< a (1 + n logo b)
(161.6)
Because of the inequality
(1+S)">1+in)
2
(6>0; n=2,3,...)
the left-hand side of (161.6) can therefore be at most 1, i.e., q,(b) < , Iosab
By (161.5) this proves (161.2). By hypothesis there is an integer b (> 1) such that qq(b) > 1. Hence; and from (161.2), we see the validity of q'(a) > I for all integers a (> 1). Thus, according to (161.2) p(b) 5 q,(a)loeab = bloga O(a)
i.e., 1ogb q ( ( b ) 5 log. p (a)
(a,b = 2, 3, ...) .
Since, a, b may be interchanged, "=" holds here. Consequently
log. p(a) = K
(a = 2,3,...)
OSTROWSKI'S FIRST THEOREM
635
for a fixed positive number K, because q,(a) > 1. Hence, because q,(-a) _ = q,(a) and 99(0) = 0, q7(1) = 1,
9,(a)=IaII` This then holds because of the homomorphy property of 99 for all a from .9. According to this, and Theorem 398, 99 is equivalent to the absolute valuation .70. Secondly we suppose q, to be non-Archimedean. Now (p (a) 5 1
(a E J) .
We shall show that especially the a such that p(a) < I constitute an ideal
of 7. This follows from the fact that for two elements a, b of ..7 in the case q, (a) < 1 we always have ,(ab) = q,(a)q,(b) < 1
and if p(a) < 1, p(b) < 1 q,(a - b) 5 max(q,(a), q,(-b)) < 1
.
This ideal is prime, since for q,(ab) < I at least one of p(a) < 1 and p(b) < 1 must hold. Let (p) be this ideal, where p is a prime number. Then
p(p) < 1 .
(161.7)
Furthermore q,(a) = 1 for all the integers a prime to p. For every element a it follows that ( 0) of . 99(a)
= (9, (p))wP(a) ,
where wp denotes, the p-adic valuation of .,. Thus if we pass from q, to an exponent valuation according to the rule 9'(a) = g-w(a)
where g denotes a fixed real number (> 1), then w(a) = Kw,(a) for a fixed real number K, which, because of (161.7), must be positive. Consequently, according to Theorem 398 the proof of Theorem 409 is completed. § 162. Hensel's Lemma
We prove the following theorem, important in itself, which at the same time is preparatory to the following paragraphs. THEOREM 410 (HENSEL'S lemma). Let F be afield, which is perfect with respect to a non-trivial exponent valuation w, with the valuation ring R and
FIELDS WITH VALUATION
636
the valuation ideal p. Then, if f (x), go(x), ho(x) are three polynomials over R such that the leading coefficients of go(x), ho(x) are prime to P and
f(x) * 0 (mod p),
(162.1)
fl x) - go(x)ho(x) (mod i,) ,
(162.2)
(4,, go(x), ho(x)) = 1
(162.3)
hold, then there exist two further polynomials g (x), h (x) over R such that g(x), go(x) are of the same degree,
f(x) = g(x)h(x) ,
(162.4)
and
g(x)
go(x) (mod ip)
,
h(x) _ ho(x) (mod ,p).
(162.5)
COROLLARY. For every irreducible polynomial
f(x)=aco+ar1Y+...+ax" over F
min (w(arn), ..., w(a)) = min (w(acJ, To prove this we denote the degrees of the polynomials f(x), go(x), ho(x)
by n, r, s, respectively, (n > r + s; r, s >_ 0). We may assume that r since otherwise the theorem is true. By (162.2),
f(x) - go(x) ho(x) - 0 (mod ,) .
(162.6)
On account of (162.3) there exist polynomials G(x), H(x) over R such that
go(x)G(x) + ho(x)H(x) - I - 0 (mod ,p) .
(162.7)
We take from p an element n (# 0) for which w(n) is not greater than the least value of the coefficients of the left-hand sides of (162.6) and (162.7). Then
J (x) - go(x)ho(x) (mod a) ,
go(x)G(x) + ho(x)H(x) - I (mod a)
(162.8) .
(162.9)
We prove that there are polynomials uj(x), vj(x) (i = 0, 1, ...) over R
of degrees 5 r - 1 and 5 n - r such that for the polynomials gj(x), hj(x) defined recursively by
gj+i(x) = gj(x) + ni+luj(x) ,
(162.10)
hj+1(x) = hj(x) + nj+IVj(x)
(162.11)
HENSEL'S LEMMA
637
the congruences
f(x) = g.(x)h,(x) (mod n'+1)
(i = 0, 1, ...)
(162.12)
hold.
According to (162.6), (162.12) holds for i = 0. We suppose that for an m (> 0) the ui(x), v;(x) (i = 0, . . ., m - 1) are determined as required so that (162.12) is satisfied for i = 0, . . ., m. Then we wish to determine a suitable pair of polynomials um(x), v,,,(x) such that (162.12) is also satisfied
for i = m + 1. This requirement is that f(x) = gm+t(x)hm+](x) (mod n"'+2)
for which we can write, because of (162.10), (162.11), f(x) - gm(x)hm(x) °- am+1(gm(x)vm(x) + hm(x)um(x)) (mod n'"+2). (162.13)
The left-hand side is, according to the case i = m of (162.12), equal to nm+'Fm(x), where the second factor is a polynomial of degree 5 n over R. Thus because of (162.10) and (162.11), (162.13) is identical with go(x)vm(x) + ho(x)um(x) - F,(x) (mod n) .
(162.14)
On the other hand, it follows from (162.9), on multiplication by Fm(x), that go(x) G(x) Fm(x) + ho(x) H(x) Fm(x) _- F,(x) (mod n).
In order to obtain from this a suitable solution of (162.14), we determine two polynomials qm(x), um(x) by Euclidean division so that H(x)Fm(x) = go(x)gm(x) + um(x)
and um(x) is of degree <- r - 1. Moreover we put vm(x) = G(x)Fm(x) + ho(x)gm(x)
.
Then (162.14) holds, because of (162.15), but the degree of vm(x) will, in general, not be <- n - r. However, this inadequacy is avoidable, if we omit from vm(x) the terms divisible by n. If we continue to denote the remaining polynomial by vm(x), then (162.14) remains unaltered, and the new vm(x) must be of degree < n - r. Otherwise, go(x)vm(x) would be a polynomial of degree > n with a leading coefficient not divisible by n, but this is an obvious contradiction of (162.14), since ho(x)um(x), Fm(x) are of degree 5 n. From this contradiction it follows by induction that the above assertion concerning (162.12) is true. Let us notice that by (162.10), (162.11) the degrees of the polynomials
g;(x), hi(x) are equal to r and, at most, n - r respectively (i = 0, 1, ...). If we now denote, for an arbitrary k (Z 0), the coefficients of xk in the
FIELDS WITH VALUATION
638
polynomials gi(x), h,(x) by ai and fli, respectively, (i = 0, 1,.. .), then by (162.10), (162.11)
n'+1 I ai+I - 'Xi, A+I - lgi , whence the existence of the limits a(k) = w-lim ai, 1(k) = w-lim Ni
follows. Thus if we write g(x) = a(,)x' + ... + a(o),
h(x) = N(n-,)xn' + ... 1g(o)
then it follows from (162.12) that (162.4) is satisfied. Furthermore because of (162.10) and (162.11), (162.5) is also satisfied. Hence the theorem is proved.
In order to prove the corollary, let us multiplyf(x) by a suitable element (
0) of F, by which we obtain min (w(orn),
..., w(a )) = 0 ,
so that now f(x) E R[x]. If the corollary were false, then we would have 0, so that w(an), f(x) __ (an +
... + a,x')
l (mod p)
(w(a,) = 0)
would hold for an r such that 0 < r < n. Hence according to the theorem the reducibility of f(x) follows. This contradiction proves the corollary. EXAMPLE 1. Let f(x) be a primitive polynomial over .I, and p be a prime number. Let us assume that f(x) = gjx)h::(x) (mod p) (go(x): h)(x) E J[xl), (p, go(x), h,(x)) = 1
where the leading coefficients of go(x), ho(x) are prime to p. Then a factor decomposition
f(x) = g(x)h(x),
holds in .F-(,)[xl where the coefficients of the factors are p-adic integers, the congruences
g(x) = gu(x), h(x) = h,(x) (mod p) hold and g(x), go(x) have the same degree. This follows from HENSEL'S lemma applied
to F = 5 7(,). Of course, the fact that h(x), ho(x) are of the same degree may be substituted for the condition about the equality of the degrees of g(x), go(x). A similar fact is valid for HENSEL'S lemma in the general case.
EXAMPLE 2. If f(x) is a primitive polynomial over 9, and p a prime number and c an integer such that
f(c) - 0 (mod p), f(c) * 0 (mod p),
HENSEL'S LEMMA
639
then there is an element a of .K&1 such that
f(a) = 0, a = c (mod p). This is the special case go(x) = x - c of Example 1. By further specialization to the case f(x) = x" - a (p,}'n, a) we again arrive at Example 1 of § 160.
§ 163. Extensions of Real Perfect Valuations for Field Extensions of Finite Degree THEOREM 411. If F is afield, which is perfect with respect to a real valuation
99 and G I F is a field of degree n (n >- 2), then G admits one, and only one, real valuation V, which is an extension of 97, and that is given by (163.1)
tV(a) = 9'(NGIF(a))e .
Furthermore G is perfect with respect to V. (If 99 is Archimedean, then the assumptions are fulfilled, according to the later Theorem 412, only for n = 2.) First of all we shall show that a valuation +p of G is defined by (163.1). It is evident that V(0) = 0 and V(a) > 0 (a 0 0). Since, according to Theorem
176, a - NGIF(a) is a homomorphic mapping of G" into F', the property yt(afl) = yr(a)tp(j) follows. We still have to show that' has the additive property of valuations. First, let us consider a non-Archimedean valuation 99 and show at the same time that tp is also non-Archimedean. Let a, j9 be two elements of G
with a, fi, a + # # 0. Without loss of generality we may assume that +p(a) 5 1p(f). We have to show that then
tp(a + 9) < t'O Since the left-hand side is equal to 4p(af -1 + 1) p(9) and moreover ep(a9-1) 5 1, it suffices to show the following: for every element y of G such that tp(y) 5 1. (163.2) V (Y + 1) 5 1. Let
f(x)=xm+alx'n-1 +... +a,,,
(163.3)
be the minimal polynomial of y over F. According to Theorem 360, (-1)nram.
NF(Y)IF(Y)=
Hence, and from Theorem 359, (Y) _
(-I)na.,:-1
.
FIELDS WITH VALUATION
640
Thus, from (163.1), V(y) = (9' (am))"' .
(163.4)
But f (x - 1) is the minimal polynomial of y + 1. Its constant term is equal to f (- 1), and so from (163.3) and (163.4),
tV(y+ 1)=(9(1 -al+... ±a,,,)),.
(163.5)
Let w denote the exponent valuation corresponding to the valuation P. Then w(1) = 0. Further, from ip(y) 5 1 and (163.4), first p(am) 5 1, then w(am) > 0. Because of the irreducibility of f(x), the validity of w(aI) >_ 0 (i = 1, ..., m) follows from the corollary to HENSEL'S lemma (Theorem 410). Accordingly,
w(1 -al+...±am)z0. This means that the right-hand side of (163.5) is at most 1, by which (163.2) is proved. Secondly, let us consider the case of an Archimedean valuation q'. According to Theorem 412 of the following section we may restrict our-
selves - as has been already noted - to the case n = 2. We again denote by y an element of G for which qq(y) 5 1. Similarly, as above, it is sufficient to prove that q,(y + 1) < y,(y) + 1.
(163.6)
We may suppose that y lies outside F. Because n = 2, the minimal polynomial of y over F may then be put in the form
f(x)=x2+ax+b. As before V(V) = (cv(b))'
+V(y + 1) = (,p(1 - a + b))*.
,
Accordingly, assertion (163.6) means that
T(1 - a + b) < ((p(b))* + 1)2.
Since qq(1 - a + b) <- 1 + p(a) + ga(b), it suffices to show that ,P(a) < 2(99(b))t
,
i.e., (99(a))2 < 499(b)
We assume the reverse: (99(a))2 > 4q((b).
(163.7)
REAL PERFECT VALUATIONS
641
Since f(x) is irreducible, VA 0. Hence, and from (163.7), m :A 0. We form
a sequence [c,] from the elements of F such that a
c1=2, c,+1=-a-
b c
(i=1,2,...).
(163.8)
This is possible, since the c1, c2i ... turn out to be different from 0, indeed, we shall show that
(i = 1, 2, ...).
99c. ? 2 99a)
(163.9)
Because (p(2) 5 2,
If (163.9) is then assumed for some i, then, because of (163.7) and (163.8), we obtain 99(c1+L) = 4'(a +
Al
> 9?(a)-
9%(b) > 4'(a) - 2
Vi(a) >
2 9'(a)
Hence (163.9) follows by induction. We now show that our sequence [c,] is convergent. We have
c,+s - c1+1= -
b c1+1
+
b c,
= b c1+1 - c, c, c,+I
Therefore from (163.9) it follows that m(c1+2 - c,+1) < (99(a))2
9'(c,+1- c)
If the first factor of the right-hand side is denoted by q, then according to (163.7) 0 < q < 1, and we obtain 99(c,+1 - c,) 5 q'-1 9'(c2 - c1)
(i = 1, 2,. ..).
This gives
,-1 99(c, - Ck) <
i=k
92(c,+1 - c) 5 92(c2- c1) Y. q1
qk-1
1
< 1-
whence we see that for 1 < k < 1, [c,] is ,p-convergent.
q
9' (c2 - c1)
FIELDS WITH VALUATION
642
We put c = 9'-lim c,. By (163.8),
c= -a - b, c2+ac+b=0. c But, because c E F, this contradicts the irreducibility of f(x). Consequently we have shown that (163.1) defines in both cases a valuation of G. From now on we deal with both cases together. For a E F, NGIF (a) = a", and then, from (163.1), V(ac) = 99(cc). Accordingly, y, is an extension of T. To continue the proof we need the following PROPOSITION. Let wl, . ., co,,
be an F-basis of afield G F of degree it.
Further let F be perfect with respect to the real valuation 99, and op be a real valuation of G, which is an extension of 99. A sequence [ai] from G with the basis representation
at = an) col + ... + a(,") co"
(a;i), ..., a(in) E
F; i = 1, 2,...)
of its terms is yp-convergent if and only if the sequences
[a;II ],
(163.10)
..., [a,">]
are (p-convergent, and then
o-lim a; = a(i'col + ... + a(")COn
(163.11)
where
a(k) = 97-lim a(k)
(k = 1, ..., n).
(163.12)
Since from (163.10) and (163.12) for every i (= 1, 2, ...) y!(act -a(' (j), -
... -
n
a(' W") :!9
k=]
p(a ") - a(k)) 'P(Wk)
both the "if" assertions of the proposition are true. In order to prove the "only if" assertion, we call the sequence [at] an m-sequence, if a;"'
a(") = 0
(i = 1, 2, ...).
For the 1-sequences the assertion is true, since for these
We denote by m one of the numbers 2, . . ., n, and assume that the assertion for the (m - 1)-sequence is true. We consider a p-convergent m-sequence [act]. It suffices to prove the assertion for this sequence, since from this the assertion will follow by induction for the case m = n, i.e., for the general case.
REAL PERFECT VALUATIONS
643
if the sequence [aJ')] is 99-convergent, then the sequence loci - aim) CJm] _ [all)wl + ... + a(m-1)wm_1J 77
is p-convergent and at the same time an (m - 1)-sequence. For this case it follows from the induction hypothesis that the assertion is true. If, on the other hand, the sequence [a(im)] is not 99-convergent, then we infer from this a contradiction and so prove the proposition.
By hypothesis there exists a positive number s and natural numbers n1, n2, ... such that g9(aim) - a;+Rr) > s
(i = 1, 2, ...).
(163.13)
We put (a; - ai+", )
f'i = (asm) -
(I= 1, 2,. ..).
(163.14)
Since now, according to Theorem 379, [a; - a;+";] is a 'N-zero sequence, so also, because of (163.13), (163.14), [,;] is a si-zero sequence. A glance
at (163.10) and (163.14) shows further that [f; - wm] is an (m - 1)sequence. This is, according to the above, p-convergent, and W-lim WI - wm) = -wm . i
By the induction hypothesis we obtain from the formula (163.11) an equation of the form -co. = b1w1 + ... + bra-lwm-1 with elements b1, . . ., bm_1 from F. Since, because of the basis property of w1, ..., w", this is impossible, the proposition is proved. Hence the assertion of Theorem 411 that G is perfect with respect to V follows immediately.
We still have to prove the uniqueness assertion of the theorem. For that purpose we anticipate a special inference from the proposition. If p-lim a; = 0, then, from (163.11), i
a(k) = q'-lim aik) = 0 i
(k = 1, ..., n).
On the other hand, by (163.10) and Theorem 360, NGIF(a,) is a homogeneous polynomial in the ail), ..., o(,) of degree n with coefficients from F, which
depend only on the wl, ..., w" and their conjugates. Hence there follows for the sequences [a;] considered in the proposition, the rule: y,-lim a; = 0
qi-lim Nc i F((X;) = 0.
(163.15)
644
FIELDS WITH VALUATION
We now suppose that vp is any real valuation of G and an extension of (p. We have to prove that (163.1) necessarily holds for this +p. We suppose
that for an element a of G (163.1) is false. Then we have either 1P((X") < < 99(N(a)) or V(an) > q)(N(a)), where we have put N = NGIF. Corresponding to these cases we write N(a)
Mn
which implies that +'(Q) < 1.
(163.16)
N(e) = 1.
(163.17)
Further evidently
From (163.16) +p-lim e'= 0. Then, according to (163.15), 4p-lim N(e) = 0, i.e., 99-lim (N(CO))' = 0, contradicting (163.17). Consequently we have
proved Theorem 411. EXAMPLE. On account of Theorem 359, definition (163.1) may be replaced by (163.18)
VW = T (NF(-) F(a))IF(a):F1-1.
Hence we see that rp((x) depends only on F, p, a. Thus Theorem 411, apart from the statement '°G is perfect with respect to +p", holds for all algebraic field extensions G I F of a field F which is perfect with respect to gyp.
§ 164. Ostrowski's Second Theorem THEOREM 412 (OSTROWSKI's second theorem). The Archimedean-valued perfect fields are topologically equivalent to one of the fields .5o), 5o)(i) valued by the absolute valuation.
Let F denote a field which is perfect with respect to an Archimedean valuation 92. Since the characteristic of F must be equal to 0, it may be supposed that .9 is a subfield of F. From OsTROwsKI'sfirst theorem (Theorem
409) it follows that the valuation of ..5 must be equivalent to the absolute valuation. Thus we may assume that F contains the perfect hull 5o) of Y0 and that (p is an extension of the absolute valuation of J 57(o). First, we distinguish two cases which we shall, however, soon consider together. If x2 + I is reducible over F, then it may be supposed that 5o)(i), too, is a subfield of F. Since furthermore [J9( '0,(!) : 5r(o)] = 2, it follows from Theorem 411 that for the elements a + bi of o)(1)
99(a + bi) _ 1(a + bi)(a - b 1)11 = (a2 + b2)'
a + bi
OSTROWSKI'S SECOND THEOREM
645
i.e., ois also valued by the absolute valuation. For this case we shall show that F = ob(i). If, on the other hand, x2 + I is irreducible over F, then the complex field
G =F(i) over F exists and we may suppose that .5o)(i) is a subfield of G. Because [G: F] = 2 it follows from Theorem 411 that G has one, and only one, real valuation, which is an extension of 99, and that it again agrees with the absolute valuation for the elements of.$o)(i). Here G is, according to Theo-
rem 411, perfect. It means that for G the former case holds, whence, anticipating the result, it follows for that case that
Jp(i) = G D F
.off,
[7o)(i) :. 0)] _ [G : F] = 2.
Since F = .Yjo) follows from this, it suffices to prove for both cases the following :
If F is an extension field of o}(1) and (p a real valuation of F, which is an extension of the absolute valuation of 90;(i), then F = .jo)(i).
For the purpose of the proof we suppose that F D .o)(i). Let a be an element of F - .7(0,(i). We denote by a the Weierstrass lower bound (cf. § 154, Exercise 1) of the set of all values (p(a - z) for z E .7()(i). (This means that for these z we always have q^(a - z) >_ a and that there is for every h (> 0) a z such that q(ac - z) < a + h.) In any case, a Z 0. First of all we show that there exists a zo (E Jo)(i)) such that q:(a - zo) = a.
(164.1)
According to the definition of a we may form from .9o)(i) a sequence such that lim 97(u -
a.
(164.2)
Since, on the other hand, we have
5 97(a - zj + qu(a), there is a real number C (> 0) such that I Z. l = 4i(zn) < C. According to the BOLZANO - WEIERSTRASS theorem (§ 154, Exercise 2) we may
therefore form a convergent partial sequence from [z.]. If we retain the for this, then (164.2) remains valid, and notation
limz=zo,
646
FIELDS WITH VALUATION
for a zo from -o) (i). It then follows that
a=lim p(oc by which we have proved (164.1).
Together with the preceding a, a - zo also lies in F(o)(i). Therefore, if we write a instead of a - zo then (164.1) goes over into qu(a) = a. Since a # 0 here, it follows that a > 0. Thereby we have the result that there is an element a of F and a positive real number a, for which the following always holds :
T(a - z) = qu(a)
(z E F(o)(1), I z I < a).
(164.3)
We shall show that for every a with the property (164.3) 'AM - z) = qq(a)
(z E 5o)(11, I z I < a).
(164.4)
For this purpose we take a natural number n and a primitive nth root of unity a from Y0)(i). Then, according to (164.3), Pn_IZ)
9p(a - z)9:(a - aaZ) ... 92(a -
=
p(an
- Zn) S
s q)(an) + p(z") = a"+ I z I".
If we apply (164.3) to the factors of the left-hand side from the second on, then we get 4'(a - z)an-1 < an + I Z I., i.e., n
rr
9'(a-z)Sa1+ [
(n=1,2, ...).
a 1
J
For I z I < a this gives (p(a - z) < a. Hence, and from (164.3), follows (164.4).
We generalize (164.4) to 99(a - nz) = qu((X)
(z E
z I < a; n = 1, 2, ...).
(164.5)
For n = 1, (164.5) is the same as (164.4). Let (164.5) be assumed for some n. If we then apply (164.3) to nz + z instead of z, (164.3) follows
from the assumption, thus also (164.4) for a - nz instead of a. Thus, again from the assumption,
tq(a - nz - z) = tp(a - nz) = p(a) (z E .(o)(i); I z I < a). Hence (164.5) is proved by induction. Now, from (164.5)
tP(a - z) = T(a) = a
(z
E'o)(')).
OSTROWSKI'S SECOND THEOREM
647
So, for any two complex numbers z1, z2,
Iz1-z2I=9q(z1-Z2)<4'(a-z1)+q'(a-z2)=2a. By this contradiction we have proved Theorem 412. EXAMPLE. Every Archimedean-valued field F is topologically equivalent to a sub-
field of the field . (,)(i) valued by absolute valuation. This follows immediately, when we apply Theorem 142 to the perfect hull of F.
§ 165. Extensions of Real Valuations for Algebraic Field Extensions The proposition of uniqueness in Theorem 411 (see also § 163, Examples does not remain valid, if continued valuations of algebraic field extensionof non-perfect fields (with valuation) are included. Concerning this the fol lowing theorem is valid: THEOREM 413. Let an algebraic field G I F and a real valuation 92 of F be given. Let r denote the perfect hull of F, and N an algebraic extension
field of F which continues a subfield, isomorphic with the normal field of G I F. Let the real valuation of N, which continues p, be denoted again by 97. Then all the real valuations w of G, which continue the valuation of F, are given by +V(a) = q7(a'),
(165.1)
where a -+ a' denotes every isomorphic mapping of G I F into N. Let a, fi be arbitrary elements of G. According to (165.1) tV(afl) = 97(((c)') =
= 92(a')990') = w(000'
'(a + fi) _ q((a + a)') = 92(a + fl) < 99(a') + q0') = o(a) + o(fl), thus , is a real valuation of G. When a E F, we have a' = a, tN (a) _
(a),
thus tN is an extension of q7.
Conversely, let p denote a valuation of G of the required type. Furthermore let G denote the perfect huff of G, where we may assume that F is a subfield of G. The valuation of G may likewise be denoted by v. The subfield F(G) of G has an extension field No isomorphic with N over Let a -b a' be a corresponding isomorphism:
No IG,& NIF(a-bal.
648
FIELDS WITH VALUATION
Similarly, as above, we see that a valuation 92t of N is defined by p(a) = g21(a') which is an extension of the valuation 99 of r, and thus identical with the valuation 92 of N. Because G c No, (165.1) is fulfilled as required. Of course formula (165.1) does not, in general, yield only different valuations V. Concerning this cf. PICKERT (1951).
EXAMPLE. We consider the special case of Theorem 413, for a simple algebraic extension G = F(8). Let f(x) be the minimal polynomial of ?? over F and
f(x) = fi(x) ...f.(x) its irreducible factor decomposition over F. Then it will suffice to admit only the isomorphisms 0 -- 0k (k = 1, ..., r), where each @l, . . ., @, (E N) is a zero of fl(x), .. , f,(x), respectively. We see from the trivial consequence of Theorem 411 that in the continued valuation +p, given there, two conjugate elements over F have the same +p-value.
§ 166. Real Valuations of Number Fields of Finite Degree
We apply Theorem 411 in order to determine all the real valuations of the fields of finite degree. For prime characteristic we have to deal with finite fields, which, however, can have only the trivial valuation, so that
we may restrict ourselves to the algebraic number fields of the form G = .7(o)(O). According to OSTROWSKI's first theorem (Theorem 409) there has are two cases to be distinguished according as the prime field
the absolute valuation or a p-adic valuation. So we are considering the Archimedean or non-Archimedean valuation of G, respectively. In both cases we obtain the required valuations on account of Theorems 411, 413
(where in both cases we have to take into consideration the Examples concerned). We denote by f(x) the minimal polynomial of 9 (over .5J, and by n its degree. We first determine the Archimedean valuations of G. Since Y0 has, with respect to the absolute valuation, the field . io> for perfect hull, so we must determine the irreducible factors of f(x) over 5 ol. We proceed most easily from the factor decomposition
f(X) = (X - 01) ... (X -
(&'1 = ) Let r1 and 2r2 be the number of real or non-real conjugates of 0, respectively (r1 + 2r2 = n). Then f(x) splits into the product of r1 linear and r2 quadratic irreducible factors over ,. If 19k is real, then a valuation of G is given by over
(166.1)
V)(9(6)) = 19('0k) I
where g(x) denotes the polynomials over F of degree < n. If '9k is not real, then we have to put _ V(9(0)) = 19(O,) 9(9k) 1
,
REAL VALUATIONS OF NUMBER FIELDS
649
where Ok denotes the complex conjugate of 11k. The right-hand side is again equal to I g(0k) 1, therefore all the Archimedean valuations of G are given by formula (166.1), where ?9k has to run through the conjugates 01, ..., of 0, however, so that of each pair of conjugate-complex ele-
ments only one is admitted. The r1 + r2 Archimedean valuations so obtained are evidently distinct. Secondly, we define the non-Archimedean valuations of G. We give these more easily as exponent valuations. Let cop denote the p-adic valuation
of.$, where p may be an arbitrary prime number. We also denote the extension of w, to ible factors:
0)
by wp. We decompose f(x) over 5p) into irreduc-
f(x) = f1(x) ... f,(x) All the exponent valuations w of G, which continue the valuation wp of .7o, are then given by
w(g(o)) =
WVp(N(9(0k)))
(166.2)
k
where Ok denotes a zero of fk(x) in a suitable extension field ofpl, nk the degree of f(k)(x), and N the norm with respect to f(p)(0k) 1 -7;pl, while g(x) means the same as in (166.1). In connection with the above see § 170, Example.
§ 167. Real Valuations of Simple Transcendental Field Extensions Let a simple transcendental extension field F(x) be given, i.e., the rational function field of an indeterminate x over a fundamental field F. We wish to determine those real valuations of F(x) which are extensions of the trivial valuation of F. Since the required valuations of F(x) are non-Archimedean, we may take them in the form of exponent valuations. Therefore, let w denote an exponent valuation of F(x) such that w(a) = 0 for a E F, 0 0. We distinguish two cases, one when we have always
w(f(x)) > 0
(167.1)
for the non-constant polynomials f(x) (E F[x]), the other when
w(f(x)) < 0
(167.2)
holds, too, but we note in advance that these two cases will not be essentially distinct. When wv(f(x)) = 0 always holds, all the polynomial quotients have
FIELDS WITH VALUATION
650
w-value 0, thus F(x) is then trivially valued as the quotient field of F[x]. As we wish to disregard this, we suppose in the first case that ">" occurs in (167.1).
In the first case letp(x) denote a polynomial over F such that w(p(x)) > 0. Among the irreducible factors of p(x) at least one must occur with positive value, therefore we may suppose that p(x) itself is an irreducible principal polynomial. Now, if we replace w, if necessary, by an equivalent valuation, we may also assume that w(p(x)) = 1. (167.3)
If f(x) is now a polynomial non-divisible by p(x) over F, then we shall show that w(f(x)) = 0. Let p1(x), f1(x) be such that 1 = P(x)Pj(x) + Ax) ft(x)
(P,(x),.fi(x) E F [x])
Hence, because of (167.1),
o = w(1) z min (w(p(x)) + w(pl(x)), w(f(x)) + w(fl(x))) Z min (w(p(x)), w(f(x))).
From this w(f(x)) < 0, and so w(f(x)) = 0 follows from (167.1).
We now consider an arbitrary polynomial g(x) over F. This can be written uniquely as
g(x) = (P(x))k4(x)
(9(x) E F[x],P(x) f' 9(x))
From what has been proved so far and from (167.3) it follows that
w(g(x)) = k . Because of the rule w
g(x) f h(x)
w(g(x)) - w(h(x))
(167.4)
the values of all the elements of F(x) are now known. Further, we see [cf. (160.1)] that w is that .p-adic valuation of F(x) which belongs to the prime polynomial p(x). Since, conversely, all the irreducible principal polynomials
p(x) (E F[x]) are admissible and yield non-equivalent .p-adic valuations, the present case has been entirely explored. In the second case we take a principal polynomial p(x) over F of minimal
degree with the property w(p(x)) < 0 .
(167.5)
We shall show that p(x) must be linear. For let us suppose that
p(x) = x° + PA(x)
(n>_2),
(167.6)
SIMPLE TRANSCENDENTAL FIELD EXTENSIONS
651
where p1(x) aenotes a polynomial of degree < n, then by hypothesis we have
0,
w(pt(x)) >_ 0, w(x)
w(x")
0.
w(p1(x)))
0.
Hence, from (167.6), w(p(x)) > min (w(x")
,
Since this contradicts (167.5), p(x) is in fact linear. Then
p(x)=x+a for some element a from F. But, because of w(f) = 0 and Theorem 400, it follows from (167.5) for every element 9 (# 0) of F that w(p(x) + j9) = w(p(x)).
This together with (167.5) means that all the linear polynomials of K [x] have a common negative value. When we thus replace w, if necessary, by an equivalent valuation, then we get
w(x) = -1. For every polynomial
9(x)=aoxm+... +am
(am00)
over F we have w(g(x)) _ -m as a result of Theorem 400. If we again apply rule (167.4) we obtain for the value of an arbitrary element of F(x) other than 0, the formula wl
x
h(x) j= n - m,
(167.7)
where m and n denote the degree of the numerator and denominator, respectively. (Moreover (167.7) also holds for g(x) = 0, since then, according
to our previous convention m = - oo, while on the other side w(0) = Co.) Conversely, we shall show that a (discrete) exponent valuation of F(x) is defined by (167.7), which we call the degree valuation of F(x).
If gf(x) is a polynomial (5& 0) over F of degree m, (i = 1, . . ., 4), then 91(x) w(---
93(x) I 92(x) 94(x) 1
_ m2+m4-ml-m3=w
91(x) 92(x)
+w I
and W
93(x)
91.(x) l
92(x)1 + 194(x) I =
91(x)94(x) + 92(x)93(x)
w 1
92(x)94(x)
93(x) 9a(x)
,
FIELDS WITH VALUATION
652
On the right-hand side of the last equation the degree of the numerator is at most max (m1 + m4, m2 + m3), but that of the denominator equals m2 + m4, thus the right-hand side of this equation is at least
m2+m4-max(ml+ m4, m2+m3)= = min (m2 - m1, m4 - m3) = min w 91(x)
,
x 9a(x)
92(x)1
19.I(x)1
According to this, (167.7) is in fact an exponent valuation. To sum up we have the following THEOREM 414. All the real valuations of the rational function field F(x), which extend the trivial valuation of a field F, are (apart from equivalent valuations) the ,p-adic valuations belonging to the irreducible principal polynomials p(x) (E F [x]) and the degree valuation of F(x). (Clearly the above valuations are not equivalent to one another.) It is important, as we have already suggested, that the exceptional position of the degree valuation of F(x) as opposed to the .p-adic valuations of F(x) (in a sense to be stated precisely below) is only apparent. It is well known, but is worth stressing, that the indeterminate x does not
have an invariant meaning, for the field F(x); it may in fact be replaced by any element Y
_ ax+ ;tx +
(z, "'
y, b r-- F; ab -
0) .
(167.8)
Then, according to Theorem 323, all the primitive elements y of F(x) for which F(y) = F(x) are given by (167.8). We choose, in particular,
y_x 1
(167.9)
and denote by wj, the .p-adic valuation of F(y) (= F(x)) corresponding to the prime polynomial p(y) = y. Let g(x) h(x)
be an arbitrary element (0 0) of F(x). In order to determine its wy value, we denote by m and n the degree of the numerator and denominator, respectively. We have g(x)
9(y-1)
y
y"h(y-1)
653
SIMPLE TRANSCENDENTAL FIELD EXTENSIONS
Both numerator and denominator of the second factor of the right-hand side are polynomials of y with a constant term other than 0, therefore they are prime to y. Consequently x
tip,,
h(x) = n - m.
We now see the correspondence with (167.7), so that the p-adic valuation xY of F(y) (= F(x)) is identical with the degree valuation of F(x). Thus we have seen that the degree valuation of F(x), after we have introduced the primitive element y =
Ix
,
is converted into the p-adic valuation corrrespond-
ing to the prime polynomial y of F [y].
The p-adic valuation of F(x) corresponding to the polynomial x - a is also called the valuation (of F(x)) belonging to the place a. Correspondingly,
the degree valuation is also called the valuation belonging to the place 00
(since, after we have introduced the primitive element
I,x this becomes
the valuation belonging to the place 0). The above valuations are called the point valuations of F(x). If, in particular, F is algebraically closed, i.e., only linear polynomials are irreducible over it, then point valuations are all the valuations of F(x). Let w,, denote the point valuation of F(x) belonging to the place a (a E F or a = oo). If, for an element r(x) of F(x), w.(r(x)) = n or
-n
(n = 1, 2, ...)
,
then we say, using the terminology of function theory, that x = a is an n -fold zero or an n -fold pole of r(x), respectively. As to every valuation in general, so also to every point valuation wa there belongs a perfect hull of F(x). Its elements, according to (160.5) may be uniquely given by the (p-adic) infinite series f(x) _= am(x - oc)m + an:+1(x - 00"' + ...
am+I, ... E F) ,
(167.10)
where m can indicate any integer. These are similar in form to the power series in the theory of functions, therefore they are called formal power series. For the degree valuation (x = co ), we have to replace x - a in (167.10) by
-.xI
In function theory one is generally interested only in the substitution values f(E) for 5 E F in connection with a power series f(x), i.e., we understand f(x) as a function of the variable x. Therefore the notion of power series in algebra and that in function
654
FIELDS WITH VALUATION
theory are related in the same way as the polynomial notions of both theories. (Cf. Exercise 2.) EXERCISE 1. Quite generally, if 99 is a valuation and a -- a' an automorphism of a field F, then tp(a) = p(al) defines a valuation ip of F. Prove that all the place valuations of the above-considered field F(x) can be obtained from one of them in this way by means of F-automorphisms. EXERCISE 2. The above notions introduced with respect to F(x) are meaningful even when F is arbitrarily valued (but now Theorem 414 is no longer valid). Let e.g.,
F = F-(,) be the p-adic number field. We take into consideration a formal power series (167.10) where now a, am, am+1, ... are elements of JF,," . With an element of F-(," we carry out the substitution x = :
!l') = am( - ex), + am+l(E - (z) '+1 + .. . However, this equation has a meaning only if the right-hand side is convergent, and then represents an element of .F'(,). The , for which this is the case, constitute the domain of convergence of the formal power series (167.10). Show that if p 96 2 the domain of convergence of ap
f(x) =A E
1l
(E) x"
n
which are not units, and that then
consists of those p-adic integers
(f($))2 = 1 + $ . For further investigations of this kind cf. HASSE (1963). EXERCISE 3. Take for basis the rational function field F = .70(x1, ..., x"), form the rational function field G = F(x) and denote by G the perfect hull of G belonging to the degree valuation. Put
AX) = (x - x1) ... (x - xn) = x" + g(x)
In G we have I
X - x,
_
I
a0
x
X= "
xi
1
x'r , wx)
_
1
x" i
gx) ''
00
(
X" )
If these are inserted into
"
1
f' (x)
x - x,
f(x)
we obtain a short proof of WARING'S formula (Theorem 279).
CHAPTER XI
GALOIS THEORY The main object of the theory of GALOIS is to obtain a detailed survey of the subfields of a separable normal field of finite degree. The applications
of this theory far surpass its objective and are deeply involved in many questions of algebra, number theory and geometry. It was one of the earlier
established branches of knowledge of modern algebra and is admirably neat. The twenty years old GALOIS, killed in a duel to the great sorrow of posterity, far exceeded with his life-work the stage of development of the mathematical sciences of that time. As to generalization cf. CARTAN (1947) and NAKAYAMA (1949-50).
§ 168. Fundamental Theorem of Galois Theory A separable normal field N I F of finite degree is said to be a Galois field and the group of its F-automorphisms is called the Galois group of N I F. If F is a prime field, then N is called an absolute Galois field. In this case the Galois group is the full automorphism group of N. We can express the definition of Galois fields in two other forms: Galois fields over an arbitrary field F agree with those separable algebraic fields F(ad) for which the minimal polynomial of 0 splits into a product of linear factors over F(O). The validity of this transformation of the above definition follows from Theorems 299, 303, 356. Furthermore, it follows from Theorems 331, 336 that a field N I F is a Galois field if, and only if, its degree [N : F]
is finite and equal to the number of automorphisms of N I F. Galois fields constitute the object of the theory of GALOis, or briefly Galois theory, and in this the Galois groups as equally important topics will also be considered. More precisely, we shall mostly deal with a fixed Galois field and its Galois group only. We shall show that two isomorphic Galois fields N I F, N' I F have isomorphic Galois groups. We shall even show that from every isomorphism
NIF;zt N' I
(a->. soc)
follows the isomorphism
d ^ 4'
(a - sas-1)
between the corresponding Galois groups c0, cdi'. 22 R. -A. 655
GALOIS THEORY
656
It is evident that a->sas-1 is an isomorphic mapping of N' onto itself for every a E 44, i.e., an automorphism of N'. The elements of the fundamental field F remain fixed here, thus sas-1 belongs to 4'. Accordingly s4ts-1 e_ C cp'. Likewise s-1c4 s S 4, i.e., 4' c s4s-1. Therefore sc4s-1 = 4t'.
This means that a -* sas' is a mapping of cP onto 4'. This mapping is obviously one-to-one and homomorphic, so proving the assertion.
Galois theory is concerned with certain other fundamental notions in connection with a Galois field N I F and its Galois group 4. In order to introduce these we next consider a quite arbitrary field N and its full automorphism group cP. If G is a subfield of N, we denote by G the set of those elements a of 44 for which the equation as = a
is satisfied by all a E G. If furthermore V is a subgroup of c0, then we denote similarly by - the set of those elements a of N for which the given equation
is satisfied for all a E A. Evidently G is a subgroup of 4, furthermore is a subfield of N. We call G and X the invariance group of G and the invariance field of, respectively. (In other words : G is the group of G-automorphisms of N, furthermore A' is the field of those elements of N which
are fixed under the automorphisms contained in X.) According to the above we have defined a unique correspondence
X->X where X runs through the subfields of N and the subgroups of 4. We call this the correspondence in the sense of Galois theory. Substantially it means a certain mapping by which the set of subfields of N is mapped into the set of subgroups of co and this into the former set. We say that G is the subgroup of c belonging to the subfield G (c N) and the subfield of N belonging to the subgroup ° (9 c.4). From the definitions we obtain the following rules immediately: G1 c G2
G,
G2
1719- Ir2 Ir1X2
(G1, G2 S N) .
G 1,'
2C(4)
(168.1)
(168.2)
Further we shall prove that GDG
(G S N),
(168.3)
Since, for every element a of G, the condition as = a is satisfied by all the elements a of G, we have a E G, whence (168.3) follows. (168.4) is proved similarly.
FUNDAMENTAL THEOREM OF GALOIS THEORY
657
We prove further that
(G c N),
G=G (
c c4)
(168.5) (168.6)
.
From (168.1) and (168.3) it follows that the left-hand side of (168.5) is contained in the right-hand side. Also the converse of this follows from (168.4) by application to = G. Hence, (168.5) is proved and (168.6) may be proved similarly.
TinoREM 415. For a group Re of automorphisms of a field N, the relative field Nla' is of finite degree if, and only if, W is finite, and then (168.7)
[N : Jr] = O(M).
(168.8)
is Thus, if N The automorphism group of N I is of finite degree, then l° is finite, consequently according to (168.4) X, too, is
finite. Conversely, we suppose that M° is finite. We next show that N I A' is then of finite degree and
[N:?]s0(
(168.9)
.
For this purpose we write n
(168.10)
It is sufficient to show that any n + I elements w1, ..., can+1 of N over Jr are necessarily linearly dependent. We consider the system
(awl) x1 + ... + (awn+l)xn+1 = 0
(a E -7)
of n homogeneous linear equations with n + 1 unknowns x1,. .., xn+1. This has, according to Theorem 248, a non-trivial solution. There is thus a least natural number m (_< n + 1), for which the equation system
(awm),n = 0
(a E M°)
(168.11)
is satisfied by any elements 1, ..., $n, of N, where m does not vanish. Of course, we may assume that (168.12) $m E X. We take an arbitrary element b of X, replace a in (168.11) by b-1a, and then
carry out the automorphism b: (acal) (b$1) + ... + (awm)
0
(a, b E a°)
CALOIS THEORY
658
Hence, and from (168.11), (168.12), we obtain by subtraction,
(awl) ($t - b$t) -I- ... + (awm-i) (em-i -
0.
Because of the minimal property of m
bpi = i
(i = 1, ..., m - 1; b E)
must then hold. Therefore fit,
- , m-i E .
. This and (168.12) imply,
for the special case a = 1 of (168.11), that oli.... , alm, and a fortiori are linearly dependent over X. (168.9) is now proved. oh, ..., Since, accordingly, the field NIA' is of finite order, so, from Theorem 335, for the order of its automorphism group the inequality
O(,YC) 5 [N : C]
(168.13)
follows. Since, moreover, from (168.4) 2C ', equality must hold here as well as in (168.9) and (168.13). This implies the validity of (168.7) and (168.8), and consequently Theorem 415. THEOREM 416. A field N I G is a Galois field if, and only if, there exists a finite group c° of automorphisms of N such that
G = X.
(168.14)
For the proof we suppose, first, that for afinitegroup l° of automorphisms of N the condition (168.14) is fulfilled. Hence, and from (168.8), [N : G] _
= O(. Moreover, it follows from (168.7) that G = T. Then [N : G] = O(G).
(168.15)
Since G is the automorphism group of N I G, so, according to the above, (168.15) means that N I G is a Galois field. Conversely, we suppose N I G to be a Galois field, i.e., that (168.15) is satisfied. We write A' = G. Since ° is finite, [N : G ] _ [N : G] follows G, i.e., = G. from (168.8) and (168.15). This and (168.3) give Consequently Theorem 416 is proved. THEOREM 417 (fundamental theorem of GALOIs theory). Let N I F be a Galois field and 4 its Galois group, and let X --> X be the correspondence in the sense of Galois theory. Then the set of subfields of N I F is mapped oneto-one onto the set of subgroups of c4 by
G->G
(F9G9N)
(168.16)
and
[N : G] = O(G)
(168.17)
FUNDAMENTAL THEOREM OF GALOIS THEORY
659
for elements assigned to each other. The inverse mapping of (168.16) is
r-. jr
( ° c Co.
(168.18)
SUPPLEMENT 1. For arbitrary subfields G1, G2 of N I F and for arbitrary
subgroups -71, -72 of cp
621
X1 c 4°2 a dr1 {Gl, (:;2} = Gl (1 G2, {DL
i'
2}_
1fl
2,
`2
{G1 n G2 = {Gl, G2},
1n
(168.19) (168.20) (168.21)
2= ['?'I, I2). (168.22)
SUPPLEMENT 2. For the conjugates aG (a E cot) of a subfield G of N I F
aG =aGa 1.
(168.23)
SUPPLEMENT 3. A subfield G of N I F is (normal i.e.) a Galois field over F if, and only if, G is normal in c.P, and then the Galois group of G I F is isomorphic
with thefactor group c4/G (= F/G). If the elements of a system of representatives mod G of cog act on the elements of G, all the different automorphisms of the field G I F are obtained, i. e. the elements of its Galois group. NOTE. Apart from (168.17) the contents of the fundamental theorem and of Supplement 1 may be so stated that the first or second lattice of the subfields of N I F is isomorphically mapped by (168.16), onto the second or first lattice of the subgroups of cot, respectively. In Galois theory, particularly
in Supplement 3, lies the justification for the similarity of the terms "normal field" and "normal divisor". The fundamental theorem may be applied to an arbitrary separable field G I F of finite degree as one has recourse to its normal field N I F which is evidently a Galois field and is called the Galois field of the field G I F; then the subfields of G I F are assigned, because of (168.21), to those subgroups of the Galois group of N I F which contain the invariance group G of G. In order to prove Theorem 417 we denote the full automorphism group of N by Since cP is the Galois group of N I F, F = d, .
(168.24)
On the other hand, there is, according to Theorem 416, a subgroup coo of vL' such that F=coo.
GALOIS THEORY
660
By Theorem 415, c o =o. and so F = 4io. Hence, and from (168.24), it follows that 40 = 4i, consequently
F=4.
(168.25)
Further, from Theorem 415, [N : cQ] = 0(c4). Hence, and from (168.24), (168.25),
[N : F] = O(F) .
(168.26)
We now consider an arbitrary subfield G of N I F. Since both N I F and N I G are Galois fields, we may apply (168.24), (168.25), (168.26) instead of F. Consequently (168.27) G=G and (168.17) holds. Since G ? F, according to (168.1), G c F, thus, according to (168.24), G c co. Accordingly, by (168.16), the set of subfields of N I F is mapped into the set of subgroups of cQ.
In order to show that this mapping is onto the latter set, we consider 9 4 it then an arbitrary subgroup 2l° of c.P. We put G = R. Because follows from (168.2) and (168.25) that G ? F, therefore G is a subfield
of N I F. Since V is finite, it follows from Theorem 415 that ' = 'V. Since then
AT, the mapping (168.16) is in fact onto the set of subgroups
of c.4.
In order to verify that this mapping is one-to-one, we suppose that for two subfields G, H of N I F the assigned groups are equal: G = R. Since, besides (168.27), VI = H also holds, it follows that G = H, i.e., that the mapping (168.16) is one-to-one. The mapping G
G, inverse to this, may be given, according to (168.16),
as G -* G. Since here G runs through all the subgroups A' of 4, this mapping agrees with (168.18). Consequently Theorem 417 is proved.
From the fact that (168.16) and (168.18) are one-to-one, and from (168.1), (168.2) we obtain the rules (168.19), (168.20).
In order to prove (168.21) we bear in mind that {G1, G2} Q G1, G2. According to this and (168.1), {G1, G2} c Gl, G2 ,
thus
{G2} S G1 fl G2 . On the other hand, the right-hand side is evidently a part of the left-hand side, whence (168.211) follows.
Similarly (168.221) is obtained from (168.2). (i = 1, 2). In order to prove (168.212) we apply (168.221) to By the subsequent application of the mapping X -k X, because X = X, we obtain (168.212).
FUNDAMENTAL THEOREM OF GALOIS THEORY
661
Similarly we obtain (168.222) from (168.211) by application to G. = Xt (i = 1, 2). Consequently Supplement I is proved.
In order to prove Supplement 2 we denote the elements of G by a. Then aG consists of the elements am. Consequently, the group aG consists of those elements s of 4 for which the condition
saa = as
(168.28)
is satisfied for all or E G. For (168.28) we may write a-1saa = or. Thus the required s are those elements of d for which a'sa E G, i.e., s E aGa-1. Hence Supplement 2 is proved. Lastly, in order to prove Supplement 3, we must notice that, because of Theorem 331, G I F is normal if, and only if, G has no conjugates in N over F except itself. This implies that
aG = G
(168.29)
for all a E c.0. Since (168.29) means the same as aG = G, for which we can
write, according to (168.23), aGa-1= G, the above-mentioned condition consists in the fact that G is normal in 4. Hence the first assertion of Supplement 3 is proved.
For the following statements we assume that G I F is normal, i.e. a Galois field. For every a (E c4) let a1 denote the mapping
a-* ax of G. This is, according to the above, an automorphism of G I F. It is again evident that all the different al form a group c01. For this the homomorphism 4 '" X11 (a-* a)
holds trivially. Its kernel consists of the elements a of cP for which the equation as = a is satisfied for all a E G. Since, accordingly, this kernel is equal to G, the isomorphism
4 /G
c.01 (a G -* a)
(168.30)
follows. This results in
[N : F] = 0(4) = O(G) O(4l) whence, according to (168.17) and Theorem 292,
O(Q = [N : F] [N : G]-1= [G : F].
(168.31)
Now cPl is a subgroup of the Galois group of G I F. Both are of the same order because of (168.31), and so equal. This, with (168.30), completes the proof of Supplement 3.
GALOIS THEORY
662
THEOREM 418. Let N I F1, N F2 be two Galois fields and (41, CP2 their Galois groups. With respect to F = FI fl F2 as fundamental field, N is a Galois field if, and only if, the group c44 _ {c4l, 4t2} is finite, and then this is exactly the Galois group of N I F.
If N I F is a Galois field, then F is its Galois group. On the other hand, according to (168.212),
F = F1 fl F2={FI,F2}=(4I,c02)=4. Hence it follows that co is finite and is the Galois group of N I F.
If, conversely, c4 is finite, then, according to Theorem 416, the field N 14 is a Galois field. Since, moreover, according to (168.221), at = {41, (^ti2) = 41
n42= F1 fl F2 = F ,
Theorem 418 is proved. THEOREM 419. Let N I F be a Galois field and P I F a further field, both with a common overfield. Then the field {N, P} I P is a Galois field and every
automorphism of it is the extension of one, and only one, automorphism of N I F. Furthermore the Galois group of {N, P} I P is isomorphic with a subgroup of the Galois group of N I F.
For the proof we put N = F(O)
.
Then {N, P} = P((9).
Letf(x) and g(x) denote the minimal polynomials of $ over F and P, respectively. Then g(x) I f(x).
(168.32)
Since f(x) is separable, the separability of g(x) follows from (168.32). Since furthermore f(x) splits into linear factors over N, it follows from (168.32) thatg(x) splits into linear factors over {N, P}. Accordingly {N, P} I P is, in fact, a Galois field.
If 0 -* 0' is an automorphism of the last mentioned field, so that g(X) = 0, then, because of (168.32), f(O') = 0; we are therefore concerned with the extension of an automorphism of N I F. Since both automorphisms are uniquely determined by 19', one is uniquely determined by the
other. If we assign the first to the second, then evidently we obtain an isomorphism of the Galois group of {N, P} I P with a subgroup of the Galois group of N I F. Theorem 419 is thus proved.
FUNDAMENTAL THEOREM OF GALOIS THEORY
663
TmOREM 420. Let N I F be a Galois field with the Galois group 4 and
(F =) Go c G1 c
... c Gr (= N)
(168.33)
a chain of subfzelds of it, and let
(C4=)
'oD'r1D...Z) -'r(=1)
(168.34)
be the chain of the related invariance groups. All the fields
Gi I Gi_1 (i = 1,
..., r)
(168.35)
are Galois fields if, and only if, (168.34) is a normal series of 4. Furthermore, the Galois groups of the fields (168.35) are then isomorphic with the factor groups
,:77i-1/ri
(i= 1,...,r).
(168.36)
COROLLARY. The Galois group c.P of N I F is solvable if, and only if, there is a
chain of fields (168.33) in which all the fields (168.35) are Galois fields of prime degree. The theorem follows immediately from the Supplement 3 of Theorem 417. From Theorem 138 it then follows that the corollary is also true.
In certain cases we name the Galois fields according to the properties of their Galois group. Thus a field N I F is called cyclic, Abelian or solvable, respectively, if it is a Galois field and its Galois group is cyclic, Abelian or
solvable, respectively. Of course a Galois field of prime degree is always cyclic.
§ 169. Stickelberger's Theorem on Finite Fields
According to Theorem 306, the number of automorphisms of a finite field is equal to its degree, thus finite fields are absolute Galois fields and, according to the same theorem, cyclic. Although the fundamental theorem of the Galois theory is almost trivial for them, nevertheless its application yields valuable results. An example of this is the following important theorem). If f(x) is a polynomial of degree THEOREM 421 n without multiple factors over a finite field F of characteristic p A 2 and m the number of irreducible factors of f(x), then its discriminant is a square if, and only if 2 1 n - m. For another proof cf. the Example at the end of § 174. For the proof we may suppose that f(x) is a principal polynomial. We first consider the case wheref(x) is irreducible, i.e., m= 1. Then we have recourse to the finite field G = F(a) where f (a) = 0. Since G is (absolute) cyclic, 22/a R.-A.
GALOIS THEORY
664
so G I F is cyclic of degree n. Let s be a primitive element of its Galois group. Then
f(x) = (x - a) (x - sa) ... (x - sn-1a) We write S=
11 (sta - ska) 051
.
.
(169.1)
Then, according to Theorem 277,
D=b2 is the discriminant of f(x). On the other hand, it clearly follows from (169.1) that sb=(-1)n-'6.
Since, according to this, (because p 0 2) b lies in F if, and only if, 2 n - 1, the theorem is proved for the case m = 1. For the general case an irreducible factor decomposition f(x) = f1(x) ... fm(x) holds in F[x]. Hence, and from the multiplication theorem for discriminants (§ 113, Example 2), it follows that the discriminant of f(x) is of the form D = DI ... DmL02 ,
(169.2)
where Di denotes the discriminant of f;(x) (i = 1, ... , m) and N an element of F. Now the group F*, according to Theorem 306, is cyclic, furthermore O(F*) is an even number because p # 2. Consequently it follows from (169.2) that D is a square in F if, and only if, the number of those Dl which are not
squares is even. If ni denotes the degree of f(x) (i = 1, . . ., m), the above condition, according to the case already dealt with, implies that
(n1- 1)+...+(nm- 1)=n-m is even. The theorem is now proved.
§ 170. The Quadratic Reciprocity Theorem Let a, b (b # 0) be relatively prime integers. If the congruence x2 =- a (mod b)
(x E -7)
is solvable, then we say that a is a quadratic residue mod b.
665
THE QUADRATIC RECIPROCITY THEOREM
For every odd prime number p and for every integer a prime top we define
a
the Legendre symbol pj so that this is equal to 1 or -1, according as a is a quadratic residue or not a quadratic residue mod p.
If a' denotes the residue class a (mod p), then j_!_j = 1 means that a'
P is a square in the group ..5 of the prime residue classes mod p. This group is cyclic of even order, according to Theorem 225. Hence the rule abo
p 1=
a (Aj t
(P X a, b)
(170.1)
I
immediately follows. THEOREM 422 (quadratic reciprocity theorem). For arbitrary different odd prime numbers p, q 1)p21 °21
P)fqJ
(170.2)
IqP = (-
THEOREM 422' (first complementary theorem to the quadratic reciprocity theorem). For every odd prime number p p-i
=c-1) 2
(170.2')
THEOREM 422" (second complementary theorem to the quadratic reciprocity theorem). For every odd prime number p p'1I2
tP _ (-
1)
8
(170.2")
We shall carry out the proof applying STICKELBERGER's theorem (Theorem
421) to the special case
F=Srp, f(x)=x"- 1 (Only the cases n = q, is = p 2
((n,2p)=1).
l will be taken into consideration.)
Since f'(x) = nx"-1, the Sylvester determinant of J (x), f'(x) is equal to n". Thus the discriminant of J (x) is D
=
(-
1)(Dn"
GALOIS THEORY
666
Because p .4' n, this is an element of J r, other than 0, thus f (x) has no multiple factors over .3p. Because 2 f' n we have
(it)
n 2
1
(mod 2).
Further the factor n"-1 of D is a square in .3,. Consequently we have, according to the theorem mentioned, P"-1
(- 1) z it )
( l
(170.3)
=
where m is the number of irreducible factors of f(x) over .3D. Next we put it = q. Then J (x) = (x - 1) Fq(x) ,
where FF(x) indicates the qa` cyclotomic polynomial over. Thus from 92(q) = q - 1 and Theorem 317 it follows that
m=1+ q-1
(e =.o (p(mod q))).
e
Accordingly, we now have
it - m =
q
e
1
Because of the meaning of e, p is the q e
(mod 2).
1
th power of a primitive element
of the group rq . Accordingly, p is a square in -7'* if, and only if, q e
1
is
even. Consequently, (170.3) now becomes q-1
( 1(-1) 2 q
Pj
P
q
q-1 (
1(
1
l
`q1 1P1
((
l P
2
(170.4)
THE QUADRATIC RECIPROCITY THEOREM
667
Since the left-hand side remains unaltered after interchanging p and q it also follows that p 1
(_1)2
((- l)z
P
q
1
(170.5)
Moreover
P11(-1)z P-1
3
,
(170.6)
3, it follows from (170.5) for q = 3.
since, if p = 3, this is trivial, and if p Because
()=_i.
1, (170.6) agrees with (170.2'), by which Theorem 422' is proved. 31
Because of (170.2') the equation (170.4) goes over into (170.2). This proves Theorem 422'. z We now write n = P
2+
1
.
[The condition (n, 2p) = 1 is fulfilled ! ]
Then we have n I p4 - 1, thus
f(x)Ix"-x. Hence it follows, according to Theorem 308, that the irreducible factors of f(x) over Yp can be only of degrees 1, 2, 4. According to the same theorem the product of those irreducible factors of f(x), whose degree is 1 or 2, is equal to the greatest common divisor
(AX), x1' - x) Because
x" ' - 1 - x(x" - x) = x2 - 1
(y
J lx) I x1'+1 - 1,
this greatest common divisor is equal to
(f(x), x1' - x, x2 - 1) , thus finally equal to x - 1, where we have taken into consideration the fact that because 2 I' n, x + I is not a divisor of f(x). This shows that the irreducible factors of f(x) over.$ , except the factor x - 1, are all of degree 4. Thus m=l+pz-1 8
Accordingly, we now have 2
n
-m-p
_ 8
1
(mod 2).
OALOIS THEORY
668
Since further, n
2
1
=
p2 - 1
is even, (170.3) becomes
4 p2 + 1
a
2
p Because of (170. 1) the left-hand side is equal to
12 . Consequently Theorem I
P1
422" is proved. The proof as carried out is essentially due to MIRIMANOFF-HENSEL (1905) and is one of the most elegant of almost a hundred proofs for the quadratic reciprocity theorem and the supplementary theorems. The quadratic reciprocity theorem may be formulated in field theoretical terminology
independently of the Legendre symbol as follows: Let p, denote the proposition that p is a square in .X",. If among the different odd prime numbers p, q at least one
is = 1 (mod 4), then the propositions p q, are equivalent. If on the contrary both are =- -1 (mod 4), then exactly one of the propositions p qp holds. (A really admirable correlation between .g', and X.!) ExAMPLE. As an application let us determine by the process derived in § 166 all the real valuations of the number field
G =.P-.(V-11 ) of degree two. The minimal polynomial of
11 is
f(x) = x2 - 11. We take an arbitrary element (96 0) of G in the form
a+b
11
(a, b E .X"o).
Since, over P'(0), the decomposition
f(x) = (x - J11) (x + J11), holds, we see, according to (166.1), that G has exactly the following two Archimedean valuations:
4gi(a+b 11)=Ia+b
11I,
.p2(a+b 11)=Ia-b
11`.
In order to determine the non-Archimedean valuations, we denote by w, that valuation of 9'(P) which extends the p-adic valuation of .9'o, where p can be any prime number. If f(x) is irreducible over X(,), then (166.2) yields the one and only one valuation w(a + b
11) = 2 w,,(a2 - l 1 b2)
of G ; if on the other hand f(x) is reducible over . ,), i.e., there is an element y of X(,) such that y2 = 11, then we likewise obtain the two valuations vi(a + b Vi -I) = w,(a + by),
v2(a + b
l 1) = w,(a - by)
THE QUADRATIC RECIPROCITY THEOREM
669
of G. The first case always occurs if x2 = 11 (mod p)
(170.7)
11
is insolvable, i.e., - _ -1. This condition, according to Theorem 422, may be p-i expressed in the form (-1) 2 111) = -1. The left-hand side depends only upon
the residue class p (mod 44), where we are concerned with the prime numbers p = 3, 13, 15, 17, 21, 23, 27, 29, 31, 41 (mod 44).
To this first case belong p = 2 and p = 11, too, since then xt - 11 (mod p2)
is insolvable. The rest of the p are given by 111 I = I and they may be characterized by p = 1, 5, 7, 9, 19, 25, 35, 37, 39, 43 (mod 44).
Since for these p congruences (170.7) has a solution co (0 < co < p), for which
f(x) _ (x - co) (x + co) (mod 11), the existence of a required element y follows for them from HENSEL'S lemma (Theorem
410), so that these p all belong to the second case. In order to determine in this case the n,-values (1 = 1, 2) of the elements of G it is still necessary, of course, to compute y explicitly. For that purpose we put
y = co + ct p + cz p2 + ...
(0 S cl, cz.... < p)
and compute recursively c1, c2, ... from the congruences (co + cl p + ... + ck pk)z m 11 (mod pk+')
(k = 1, 2, ...) .
Thus we obtain, e.g., if p = 5, co=1,
c1 = 1 ,
c2 = 2,
c3 = 0,
cs = 0,
c5=2,...,
thus y = 1 + 5 + 2.52 + 2.55 + ... Naturally for given p, a, b we need to calculate only a finite partial sequence c0, ... ck of co, c1, ..., for which already
a ± b (co + ... + ck p) r 0 (mod pk+1) holds, for it suffices to determine the above v,-values (1 = 1, 2).
§ 171. Cyclotomic Fields
Among finite fields the cyclotomic fields (§ 135) belong to the simplest kind of absolute Galois fields; of course, these are also very important. We shall now consider them more explicitly.
670
GALOIS THEORY
THEoREM 423. Let N denote the nth cyclotomic field of characteristic 0.
It is abelian of degree q(n), and its Galois group is isomorphic with the group ,71(n)* of prime residue classes mod n. If, in particular, n = q is a prime number, then the field N is cyclic of degree q - 1. N is the splitting field of the polynomial x" - 1 over Y0, therefore we may always suppose that N is a subfield of o)(i). As stated above N is of finite degree, separable and normal, i.e., a Galois field. Since all the nth roots of unity from 9 O)(i) are powers of a primitive nth root of unity,
N = Y(e) , where a denotes a zero of the nth cyclotomic polynomial F"(x), i.e., a primitive nth root of unity. Since F"(x), according to Theorem 315, is of degree qq(n) and, by Theorem 316, is irreducible, it follows that N has degree T(n).
Further F"(x) = fl (x - `) a
where a has to run through those numbers 1, ..., n which are prime to n. These give, in the form e-> ea
all the different automorphisms, - i.e., elements of the Galois group -
of N. The product of two such automorphisms a -> ea, a -* eb is, because (e")b = cab, equal toe - a"b. Since, further, in ° the exponent a is only taken mod n, because o(e) = n, we see that the Galois group of N has the property required in the theorem. Since this group is Abelian, N is also Abelian. Because of Theorem 225, it also follows that the last assertion of Theorem 423 is true. THEOREM 424. For a prime number q the splitting field G of the polynomial x9 - 1 over an arbitrary field F is cyclic, and its degree [G : F] is a divisor
ofq-1.
Let p be the characteristic and F,, (p >_ 0) the prime field of F. If q = p then G = F, so that we need only consider the case q 0 p. It follows, from the supposition, that G contains a splitting field Z of.x4 - 1 over Fp. Furthermore
G={F,Z}.
Now Z I F. is a cyclic field such that
[Z:F,,]Iq-I since, for p = 0, this follows from Theorem 423 and for p > 0 from Theorem 317. On account of Theorem 419,
{Z,F}IF=GIF
671
CYCLOTOMIC FIELDS
is likewise cyclic, moreover, the divisibility relations
[G:F]I[Z:Fp]Iq-] hold. Consequently Theorem 424 is proved. EXAMPLE 1. Let N denote, for a prime number q, the q1 cyclotomic field of characteristic 0. We determine all the subfields of N. Since N, according to the end of Theorem
423, is cyclic of degree q - 1, exactly one subfield of N of degree e belongs to every divisor e of q - 1, and no further subfields of it exist. Let
of=q-1.
(171.1)
Let a (E N) denote a primitive qt° complex root of unity and g a primitive number mod q. Then N= Jro(e). According to GAUSS the sums
n, = e`` + e``+` + ... + e°`+`('
,)
(i = 0, ..., e - 1)
(171.2)
are called f-termed periods of N. Since 1, . . ., e°-$ is a basis of N, the same holds
with respect to e, ..., e°-1. (The second is a normal basis.) After rearrangement of the elements, this basis may be written as follows: (171.3)
Consequently it follows from (171.2) that the periods no, ... ne_1 are different. Now let s denote the primitive automorphism e-
(171.4)
ea
of N, which is a primitive element of the Galois group of N. Then, according to (171.2),
snl = nt+1
(i = 0, ..., e - 1; ne = no).
(171.5)
Hence
skn,= r,
eIk
for every i = 0, ..., e - 1. This implies that all the subfields
G, = .
0(x,)
(i = 0, ..., e - 1)
of N have the same invariance group G, = {se} ,
and so by the fundamental theorem of GALOIS theory (Theorem 417), are equal. Moreover the relation [N : Gt] = O({se}) =
q-1 e
follows. Consequently we have obtained the result that Go (= G1 = ... = G,-,) is just the subfield of N of degree e. EXAMPLE 2. We see from (171.2) that the symmetric polynomials of no, ..., ne_1 are also symmetric polynomials of the basis elements (171.3). Consequently
h(x) _ (x - no) ... (x - ne_1)
(171.6)
GALOIS THEORY
672
is a polynomial over `.o. Since it is of degree e, because of Example I it is irreducible,
thus it is the (common) minimal polynomial of the f-termed periods no, ..., ne_1.
These are thus conjugate as we see also from (171.5). Hence it follows that the periods no, ..., napart from the order, are independent of the particular
choice of e and g. It also follows that h(x) = 0 is the defining equation of the subfield of degree e of the qth cyclotomic field and the periods no, ..., ne._l constitute a normal basis of it. EXAMPLE 3. We compute the polynomial (171.6) for the case q = 7, e = 3 (f = 2). As primitive number mod 7 we take g = 3. Because 3'= 1, 3, 2, 6, 4, 5 (mod 7)
(i =0,...,5),
according to (171.2),
no=e+eo, n1=28+e', na=22+eb. We compute
no=2+na, ni =2+no, ns=2+n,, noel = nl + n8 , n1n2 = no + X2 , Wont = no + al
If ak denotes the kth elementary symmetric polynomial of no, all nq, then it follows that
al = -1, QE=2al=-2,
as = none - na = (al. + na)nz = 2 + al = 1.
Consequently (171.6) now reads
h(x)=x3+x'-2x-1. § 172. Cyclic Fields Of the Galois fields, the simplest are the cyclic ones. We have dealt with examples of cyclic fields, and shown that all the finite fields and certain cyclotomic fields are absolutely cyclic. Here we wish to consider cyclic fields in general. In particular, we shall study those which are of prime degree, as these will be sufficient for our later purposes. We need the following preparation. If a is an element of a field F and n (>--_ 2) a natural number, then the (ntb) radical ./a means according to our
former definition a (in general not uniquely determined) root of the socalled binomial equation
x"-a=0.
(172.1)
We now extend this notion by admitting roots of(172.1) as values of . Ja which lie in any overfield of F. In this case we call /au more precisely a radical over the of F, which field F. Every such radical furnishes an extension field
CYCLIC FIELDS
673
we say arises from F by the adjunction of a radical. Of course, F(f) is, in general, not uniquely determined by F, cc and n. There is an extremely important special case, when the equation (172.1) corresponding to the radical '.Ja is irreducible. Then we call " a an irreducible radical. For this case, according to Theorem 295, the field F(Ja) is uniquely determined up to equivalent extensions (over F). We call " a a reducible radical, when (172.1) is reducible. (Cf. Examples
1, 2, 3.) When we say that "a field F contains the nth roots of unity", then it means that the polynomial x" - 1 over F splits into linear factors, i.e., all the radicals \/i over F already lie in F. THEOREM 425. A field F(Ja) I F is cyclic if the fundamental field F contains the nth roots of unity and its characteristic is not a divisor of n.
It will suffice to consider the case a 0. For the sake of brevity let us put 19 = Ja. Then we deal with the field G = F(ZJac) = F(8)
,
where 0 is a root of the equation (172.1). By hypothesis, F contains a primitive nth root of unity The factor decomposition
x" - a=x" - 0"= (x -0)(x - e8)... (x - e"-I8) holds over G. Since 0, . . ., a"-t0 are distinct, it follows that G I F is separable
and normal, therefore a Galois field, and that its automorphisms may be given in the form
0->eh9. The product of this with a further automorphism8-*et8 is 8-*e'ei8. Thus the Galois group of G I F is a homomorphic image of the cyclic group {e}, consequently it is itself cyclic. This proves Theorem 425.
One part of this theorem admits the following important converse. THEOREM 426. If G I F is a cyclic field of degree q of characteristic p (>_ 0),
where q is a prime number other than p and the fundamental field F contains the qth roots of unity, then there exists an element a of F such that G = F(.,/a). Let a denote a qth root of unity (in F), 0 a primitive element of G, for which G = F(d)
therefore holds. Also let s be a primitive automorphism of G I F. Then we call every expression (e, 8') = 0 + e(s8) + e2(s'o) + ... + eq-I(3q-I8,)
(172.2)
GALOIS THEORY
674
a Lagrange resolvent. It will be shown that among these Lagrange resolvents occurs a suitable element a of G. Because se = e we have
S(0, ,O) = s8 + e(s'0) + ... + eq-1(sgo) .
Since further sq = 1, comparison with (172.2) results in the equation
s(e, t) = e-1(e, ) . Then (172.3)
s ((e, o')q) _ (e, 0)q
On the other hand, it follows from (172.2) after summation for all the that
E (e, ) = q8 . e
The right-hand side is not 0, because p r q, therefore it lies outside F. Consequently, for at least one e, (e, t9) lies outside F and therefore is a primi-
tive element of G. We take such a e and put co = (e, 0) .
According to what has been said, G = F(co). Since s denotes a primitive element of the Galois group of G I F, it follows from (172.3) that coq belongs to F. Hence, Theorem 426 is proved. Theorem 426 even holds, according to FRIED (1956), for all natural numbers q such that p }' q. As regards the generalization of Theorem 426 for an arbitrary n (= 1, 2, ...) instead of q, see BoORBAKI (1939) and LUGOWSKI-WEINERT (1960).
THEOREM 427 (ABEL'S theorem). A polynomial (172.4)
of prime degree q over afield F is reducible if, and only if, a is a q`b power in F.
In this case (172.4) splits into linear factors over F if, and only if, F contains the q`s roots of unity. For the proof we suppose that (172.4) is reducible over F, i.e., there exists
a principal polynomial f(x) (E F[x]) of degree n (1 < n < q - 1) such that
f(x)Ixq-a.
(172.5)
Over a suitable extension field of F,
xq - a = (x - 13) (x - elI) ... (x -
eq-
)
(172.6)
675
CYCLIC FIELDS
with 994 = a,
e4 = 1
(172.7)
.
We take the constant term of f(x) in the form
(8 E F). From
(172.5) and (172.6),
P=
(172.8)
'0k0"
for some integer k, which we do not determine more precisely. According to (172.7) and (172.8),
N4=a"
Now the equation
qy + nz = I has a solution y, z (E J) because (q, n) = 1R.zIt follows that
a = qy+nz = (ayY)"
thus a is a qts power in F. From now on we suppose that a = 994 for some 99 (E F). Then
x4- (X =x4-994 has the factor x - 99. What has been stated so far proves the first part of the theorem. Since x4 - a = 994((99-Ix)4 - 1) ,
we see that x4 - a splits over F into linear factors if, and only if, the same holds for x4 - 1. This proves the second part of Theorem 427. The generalization of the first part of this theorem is the following THEOREM 428 (VAHLEN-CAPELLI theorem). Over afield F a polynomial
x" - a
(n ? 2; a E F, 0 0)
(172.9)
is reducible if, and only if, a = YRd
(d l n, > 1; j9 E F)
(172.10)
or
4 In,
a= -4y4
(y E F).
(172.11)
The proof is founded upon the following PROPOSITION. If f(x) is a separable principal polynomial and g(x) a nonconstant polynomial over a field F, and the irreducible factor decomposition
g(x) - 99 = fl gi(x) 1=1
(172.12)
676
GALOIS THEORY
holds over the extension field G of F, defined by
G = F(s)
(f(z9) = 0) ,
(172.13)
then the polynomial
h(x) = f(g(x))
(172.14)
has the irreducible factor decomposition over F:
h(x) _ [J N(g;(x))
(N denotes NG(X),F(X))
.
(172.15)
Thus the degrees of the irreducible. factors of h(x) are multiples of the degree
of f(x) and, in particular, h(x) is irreducible over F if, and only if, g(x) - 0 irreducible over G.
First of all we prove the Proposition. Because of (128.4), f(x) _ = N(x - 0). If x is substituted here for g(x), equation (172.15) follows from (172.12), (172.14). Since in this the factors of the right-hand side belong to F[x], we have only to show that they are irreducible over F. With this end in view we denote an irreducible divisor of h(x) over F by k(x). From (172.12), (172.15) it follows that for some i (= 1, ...,.r) gi(x) I k(x).
(172.16)
We can write
1(x) _ (x - iI) ... (x - 0.) so that N = F(t91,...,
(0 = 'b),
is the normal field of G I F. Because of the
separability of f(x), X91, ..., 0," are distinct. Now N(gi(x)) is the product of m factors, which result from g,(x) by applying the isomorphisms # --> 0, (s = 1, ..., m) of G I F (in N), and which, according to (172.12), are, in order,
divisors of g(x) - 01, ..., g(x) - 1,,, and consequently pairwise relatively prime. Since because of (172.16) they are all divisors of k(x), which follows
by applying the same isomorphisms, we find that N(gr(x)) I k(x).
Since the right-hand side is irreducible over F, it follows that every irreducible factor of h(x) over F is associated with one of the r factors of the righthand side of (172.15). This proves the Proposition. We shall now prove Theorem 428. For the assertion "if" this is simple, for if (172.10) holds, then
x"-a=x"-fld is divisible by x"" - f, but if (172.11) holds, then we have R
R
R
x" - a = x" + 4y4 = (x 2 - 2yx 4 + 2y2)( x 2 + 2yx 4 + 2y2).
CYCLIC FIELDS
677
Conversely, we suppose that (172.9) is reducible over F. We have to show that then (172.10) or (172.11) is true. If n is a prime number, then, according to the first part of Theorem 427, (172.10) is true. Now let n be composite. We suppose that the assertion is false for n but true for smaller n. Hence it follows that all the polynomials
xd - x
(d I n, < n)
(172.17)
are irreducible over F.
We denote by p (Z 0) the characteristic of F and first consider the case p,f'n. We begin with the case
n = q`
(q prime number, e >_ 2).
Since x4 - a, by (172.17), is irreducible over F and, because q # p, is even separable, a separable field G such that [G : F] = q is defined over F by G = F(O)
(VQ = 0C).
(172.18)
Since, moreover, we have
TV - a = (xV-')a - a, it follows from the reducibility of (172.9) and from the Proposition that
x°`-' is reducible over G.
-
19
(172.19)
If q # 2, it follows hence and from the induction hypothesis that there is an equation of the form
P_A4 On the other hand (since p
(AEG).
2), a = NGIF (0')
follows from (172.18). Since we now have a = (NGIF (A))4,
we have obtained the result that (172.10) is true for d = q.
If, however, q = 2, then it follows from the reducibility of (172.19) and from the induction hypothesis that
either 0 = (a# + v)2 or t = -4 (0 + v)4,
GALOIS THEORY
678
where µ, v denote suitable elements from F. In the first case 0 = µ2a + v2 + (21Av - 1)19,
thus µ2a + V2 = 0, 2µv - 1 = 0, and hence (because p
2)
a= -4v4. In the second case
0=0+ 4(µ2a+ v2+2µv+9)2= = 4(p2a + v2) + 16µ2 v2oc + (16p v(µ2a + v) +1)0,
thus (µ2a + v2)2 + 4µ2v2a = 0,
16µv(µ2a + v2) + 1 = 0,
and hence
j4.
1 8µv
So we see (in both cases) that (172.11) is t rue.
We now consider the case where n is not a prime power, i.e., it has a decomposition
n=uv
(u,vE7; (u,v)=1; u,vZ2).
Since, according to (172.17),
x" - a, x° - a are irreducible over F, and also
x"-a=(x°)"-a=(x')°-a, it follows from the Proposition that every irreducible factor of x" - a over F has a degree divisible by u and v, i.e., by n. This results in the contra-
diction that x" - a is irreducible over F. Lastly we consider the case p I n. Since, by hypothesis, a is not a pt power in F, the field G defined by
(P = a)
G = F ($)
(172.20)
is, according to Theorem 341, pure inseparable over F with [G : F] = p. Then
x"-a=(xp-," -#)°
(172.21)
An irreducible factor over F of this polynomial cannot be a power of .Cp-1n
- 0,
(172.22)
679
CYCLIC FIELDS
since 6, . . ., #°-' lie outside F. Consequently (172.22) is reducible over G. Therefore, because of the assumption, 4 1 n,
O= -4y4,
where fi, y are elements of G. (In the second case p must be odd, but we do not need this.) Since a = #P, it follows that a = (F'P)d
or
at
= -4 (yP)',
respectively. Since now G I F is pure inseparable and of degree p, it follows
from Theorem 343 that PP and yP lie in F. This contradiction completes the proof of Theorem 428. Cf. CAPELLI (1901), who considered merely a number field of finite degree.
Evidently the validity of Theorem 426 disappears if q = p. In this case, however, the following holds : TttEoi t 429. If G I F is a cyclic field of degree p and of characteristic p (p prime), then there is a A such that G = F(A) and AP - A E F.
Let s denote a primitive automorphism of G F and Y(s) the natural group ring of {s}. For an element of this and for an element of G we define the operator product n
w= E a,(s'w)
ais` 1
(ai E 7; w E G).
t=o
It is evident that G+ is transformed by this into an Y(s)-module (with the customary properties of an operator), furthermore F+ is an admissible submodule. In particular, (because sP = 1 and pw = 0)
(s- 1)P0.) =(SP- 1)w=0.
Now we take an co from G - F. For this (s - 1) w = sco - w 0 0, thus there is a k (1 <- k <- p - 1) such that
(s - I)kw
0,
(S - 1)k+1 CO = 0.
We put
e = (s - 1)k-1 Co. Then
(s- 1)e's0, (s- 1)Ze=0.
GALOIS THEORY
680
For the element e
7.=
(s - 1)e
so
sA
se
SQ
s(s-1)e = (s- 1)ae+(S- 1)e = (s- l)e =a + i,
then holds, as also does
s(AA-A)=(sA)"-sA=(A+ 1)°- (A + 1)=Ap-A. Since, according to this, A E G - F, AP - A E F, so A satisfies the required conditions, which proves Theorem 429. This simple proof is due to FRIED (1956). EXAMPLE 1. The field .g", (V2-5) is not uniquely defined as a subfield of .W(,) (10,
since .J25 has the four values 511"
(n=0,...,3).
Thus .F'0(.J25) can be either 3s"0(
) or .g-O( ,r--5 s ). These are not isomorphic. EXAMPLE 2. The n' roots of unity (n > 2) over a field may be written as Vi. This radical is always reducible.
EXAMPLE 3. If F is a field of prime characteristic p, then the field
F(°
m)
(a E F; n > 1) is, apart from equivalent extensions, always uniquely determined. It suffices
to prove this for the case where a is not a pth power in F. Then the assertion follows from the fact that, according to Theorem 428, is irreducible. (The same results from Theorem 427.) Since, moreover, V is uniquely determined within this radical is usually denoted by a°-". (Cf. § 142, Example 2.) EXERCISE. The following generalization of Theorem 426 holds. If G I F is a cyclic field of degree n and of characteristic p (>_ 0) where p,?n and F contains the nth roots of unity, then we can obtain G from F by the adjunction of an nth radical.
§ 173. Solvable Equations
Let us take a field F. We take an integer t (>_ 0) and give, over F, polynomials of the form
9; = gi(x1, ..., x,_1)
g = 9(x1, .. , xr)
(1
(173.1)
and integers q1, . . ., qt (>_ 2). (In particular, g1 is an element of F.) If t91, ..., 0t are elements of an extension field of F, where
..., $;_1)
(1= 1, ..., t),
(173.2)
then we call
0 =g(131,...,$e)
(173.3)
SOLVABLE EQUATIONS
R
681
a radical expression over F belonging to the polynomials g1, ..., g1, g and the root exponents q1, . . ., q, The 01, . . ., 0, are themselves called generating radicals. Allowing for the obvious ambiguity, if necessary we say more precisely that 0 is a value of this radical expression. We can (173.3) an irreducible radical expression, if the generating radicals occurring in it are irreducible (i.e., every 13; is an irreducible radical over F (01, . . ., 0I-1)).
Radical expressions can be easily characterized. In order to do this, we put Gi = F(01,
. .
(i = 0, ..., t).
., 0I)
(173.4)
Then G; = Gr-1(01),
MP E G,_I
(i = 1, ..., t),
(173.5)
(173.6)
0 E Lr L.
Hence we see that 0 is a radical expression over F if, and only if, there exists a chain of fields
F=Go9G1c...9G,
(173.7)
such that every term arises from the preceding by the adjunction of a radical and 0 is an element of Gt. Incidentally we sometimes call (173.7) a chain of fields belonging to the radical expression 0.
As an example of a radical expression (with t = 2) we have:
x+ A V a+ (µ + v (.,/ a)3), J e+ or
+
(Ja)z
(a, K, ..., r E F).
We call the irreducible equation
f(x) = 0
(f(x) E F[x])
(173.8)
solvable (by radicals) if it has a root which is a radical expression over F. By the above this means that there is a chain of fields (173.7) whose last term contains a root of (173.8) and that moreover (173.5) holds. On the other hand, suppose that f(x) is the product of separable polynomials over F. Let N I F be a splitting field of f (x), which is thus a Galois field. We call it and its Galois group the Galois field and the Galois group of the equation f(x) = 0, respectively (but cf. § 174). Since N I F is, apart from equivalent extensions, uniquely determined, the Galois group of an equation is uniquely determined to within isomorphism. The solution of equations by radicals was the main problem in early
algebra which had no other (algebraic) methods for the solution of an algebraic equation. Although Theorem 299 entirely solves this problem by the help of the splitting field, nevertheless solvable equations still form
an important branch of algebra, this being explained by the fact that
GALOIS THEORY
682
at least one solution of them is easily determined, i.e. by reducing them to
the solutions of binomial equations. One of the best results of Galois theory will turn out to be the close connection between the solvability of an equation and its Galois group. Before beginning our considerations we take a simple example: every cyclotomic equation F (x) = 0 is solvable (over . o), since every root of it may be given as the
radical expression V1. Still this is not a satisfactory solution of the problem of determining the solutions of this equation, since not all the values of this radical are solutions, but only the primitive nth roots of unity among them. This remark is reasonable,
in particular, already for the case n = 3, where the equation x2 + x + 1 = 0 is involved;
now, besides the previous expression, the radical expression 2 (-1 + V=3) is also a solution of the equation, and that, "fortunately", for all its possible values, therefore it is far preferable to the other.
For a solvable equation (173.8) an irreducible radical expression is called
a solution formula for this equation, if all its values are solutions of the equation and, conversely, all the solutions are furnished by its values. Next we prove the following THEOREM 430. Every irreducible radical expression over a field F, which furnishes a solution of an irreducible (solvable) equation (173.8), is a solution formula for this equation.
For this we suppose that the radical expression 0 given by (173.3) is irreducible and a solution of (173.8). Here 0, is a solution of the equation xq` - g,(01, ..., oi_1) = 0
U= 1, ..., t)
(173.9)
whose left-hand side is irreducible over F(81, . . ., O'i-D We consider a further value 0' of the same radical expression. For this
0' = 01, ..., 07,
(173.10)
where 0l, ..., 6' are elements of an extension field of F such that 0 is a solution of xqt
- g.(1 , ..., O'_-D = 0.
(173.11)
We have to prove that 0' also satisfies the equation (173.8).
To do this, we show that the isomorphisms F(01,
.,,&j) I F x F(a9l, ... 0r) I F (01 -* #1, ..., 0, -+ 0j'),
(i = 1, . . ., t) (173.12)
hold. For i = 1 it is true since, according to (173.9) and (173.11), 01, O' are solutions of the equation xq' - g1 = 0, irreducible over F. We then consider an i (= 2, . . ., t) and make the induction hypothesis that the
683
SOLVABLE EQUATIONS
isomorphism (173.12) holds for i - 1 instead of i. By this the left-hand side of (173.9) is converted into that of (173.11). Since, of these two polynomials, the first is irreducible over F(01, . . ., #t_1), and since t; and 0; are roots of (173.9) and (173.11), respectively, it follows from Theorem 298 that (173.12) is true for the i concerned and therefore in general.
We now take into consideration the isomorphism (173.12) for i = t. Because of (173.3) and (173.10) this maps 0 onto 0'. Hence it follows that, together with 0, 0' is also a root of (173.8) as asserted. We have still to prove that conversely every root of (173.8) is a value
of the radical expression (173.3). With this end in view we consider a normal field N of F(01, ..., i9) 1 F. From f($) = 0 and (173.3) it follows that N contains a splitting field of f (x). Therefore it is sufficient to prove that all the zeros of f(x) lying in N occur among the values of the radical expression (173.3).
Let 0' be such a zero :
f(0') = 0, 0' E N. By Theorem 329 there exists an isomorphism e -> sLo of F(01, ..., 0) 1 F such that 0' -1- s O. Now from (173.3)
so = 9(sh'1, ..., so), and it follows from (173.9) that sli, is a root of x41 = 9,(s#1, ..., s$,_1)
(i = 1, ..., t).
Accordingly, 0' = sO is, in fact, a value of the radical expression (173.3). Consequently Theorem 430 is proved. As a further preparation we prove the following THEOREM 431. Every normal subfield G I F of a solvable field N I F is solvable.
Since the assertion is trivial for G = F and N, we may assume that F c c G c N. Let co denote the Galcis group of N I F, which is, by hypothesis, solvable. Further let cY, denote the invariance group of G. According to Supplement 3 of the fundamental theorem of GALOis theory (Theorem 417),
Y is a normal subgroup of cPi, furthermore the Galois group of G I F is . isomorphic with the factor group d I Now, according to the JORDAN-HOLDER theorem (Theorem 136), the
normal series c0 X D 1 may be refined to a composition series
(`j _)Xp aYl
...
Xr (_ t) D ... D 1.
According to Theorem 138 in this all the factors Xi/
r+l are of prime
GALOIS THEORY
684
order. According to the second isomorphy theorem (Theorem 126')
`4/X D t1/
D ... D Xr1J7(= 1)
is likewise a composition series whose factors are again of prime order. Accordingly 4g/P, on account of Theorem 138, is solvable, so that the proof of Theorem 431 is complete. THEOREM 432 (main theorem for solvable equations). Let
f (X) = 0 (f(x) E F[x])
(173.13)
be a separable equation over a field F of characteristic p (Z 0), further let cP be its Galois group. If (173.13) is solvable, then C4 is likewise solvable. If, conversely, 4 is solvable and in the case p > 0 all the prime factors of 0 (4) are smaller than p, then (173.13) is likewise solvable and has a solution formula in which all the root exponents are prime numbers, which do not exceed the greatest prime factor of 0 (4). In order to prove the first part of the theorem we suppose (173.13) to be solvable. Then there is a radical expression t over F with f(t9) = 0.
(173.14)
On account of the rule
we may suppose that in 0 all the root exponents are prime numbers. These, and the corresponding radicals, are denoted by q1, . . ., q, and' 191, ..., so that i97' E F(191, ..., X9,_1)
(i = 1, ..., t),
0EF(61,.. hold. We show that q1i . . ., qt (if p > 0) may be supposed to be different from p.
First let us consider the case, where for some i (>_ 0)
g1, ,qr#p; q,+1=...=q,=p By Theorem 341, of the fields F(61,
..., 0,) 1 F,
F(191, ..., 't) I F(01, ..., 0t)
the first is separable, the second pure inseparable. Since 19 is separable, it follows that 19 E F(01, ..., 19,). Accordingly, the (superfluous) radicals a91+1, , 0t may be omitted, so that the assertion is true for this case.
SOLVABLE EQUATIONS
685
Secondly consider the case, where there exists an i with
qi = p, q;+1 9 p It suffices to prove that 0 may be transformed into a radical expression, where the root exponents
q1, . , q;- l, q;+1,p,p,q,+2,-- ,qi occur in order of succession, since hence, and from what has been said above, the assertion will follow by induction. For our purpose we insert in the sequence of radicals 791, . . ., 09, between
19; the term
u*_0+1' from which we obtain the sequence $1,
..., $;-1, (97 (_ X91+1), 991, O'i+1, ..., 19'.
Since
0*Q'+' = ( +i')° E F(19i , . . ., 9f) c F(191, . . ., 0r-1), = 07' E Fo91, . . ., 0,-,) c F(61,
..., 49;-v $*),
091P+1 = 6* E F(01, ..., 09;-1, #
,
01)
hold, the truth of the assertion follows. We now put G = F(01, ..., 0,). The splitting field of the polynomials
x41 - 1,...,x4'over G is denoted by G'. Then G' contains a splitting field F of the san. polynomials over F, furthermore evidently
G'={F',G}. By Theorem 424, F may be obtained from F by successive cyclic field extensions. Since finite cyclic groups are solvable, it follows from Theorem 420 that here extensions of prime degree suffice.
Further let us denote the normal field of G' F by N. Together with this, N I F is separable and thus a Galois field. Lets denote an automorphism of it.
Because of the supposition we can put (Oct-, E
Hence sO,
=
°j sx; --I.
GALOIS THEORY
686
Evidently we obtain N from F if we first successively adjoin all the conjugates sh1 of #,, by which we obtain a normal field over F, then we successively adjoin to this all the conjugates she of 02, by which we again obtain a normal field over F, and continue in the same way. But since F contains all the q;th roots of unity (i = 1, . . ., t), it follows from Theorem 427 that the adjunction of each s1, effects a cyclic extension whose degree is q, or 1. Together with the preceding statement this results in the existence of a chain of fields
F=No cN1 c... cN,=N, in which all the N,+1 1 N, are Galois fields of prime degree. This means, on account of Theorem 420, that the Galois group of N I F, i.e. N I F itself, is solvable.
Since, on the other hand, because of (173.14), N I F contains a splitting
field of f(x) over F, i.e., the Galois field of equation (173.13) as a normal subfield, it follows from Theorem 431 that this field, and thus also the Galois group c0 of (173.13), is solvable.
In order to prove the second part of the theorem, let us suppose that it is true for smaller O(o2). We denote by G I F the Galois field of the equation (173.13) and consider a composition series of its Galois group: (`xd =) -7p D'71 D ... Z) -7r (= 1). To this belongs, according to Theorem 420, a chain of fields
(F=)Go cGxc
... cG,(=G),
in which G; I G,_1 is cyclic of a prime degree q, (i = 1, ..., r). Hence
O(d)=q1...q,. A splitting field of the polynomials
xqI - 1,...,xgr- 1 over G is denoted by N. This contains a splitting field No of the same polynomials over F, which is obtained from F by the adjunction of all the q,th roots of unity (i = 1, . . ., r). Because of the induction hypothesis it follows from Theorem 424 that we can obtain No from F by successive adjunctions of irreducible radicals, where only prime numbers occur as root exponents which are smaller than the greatest prime factor of 0(c4). Finally, we consider the fields
N,={No,G,}(9 N)
(i= 1,...,r; N,=N).
Since we have
N, = {Ni-,, Gi},
687
SOLVABLE EQUATIONS
Ni I N;_1 is, according to Theorem 419, a Galois field, and [Ni Ni-11191 Since, furthermore, N,_ 1 contains all the qih roots of unity, either we have
Ni = Ni_1 or we obtain Ni from Ni_1 according to Theorem 426 by the adjunction of an irreducible qih radical (i = 1, . . ., r). If these radicals are adjoined to No, we obtain N, = N. By consideration of the above, because G c N. the second part of Theorem 432 follows. EXAMPLE 1. In the second part of Theorem 432 the condition "in the case p > 0
all the prime factors of 0(4) are smaller than p" may not generally be omitted. In order to give examples for this, we remark in advance that in Theorem 432 for a finite field F (since then by (173.13) a finite field is again defined) there are very simple conditions, as in this case equation (173.13) is always solvable and its Galois group c1/j is cyclic. For, the solvability of (173.13) follows from the fact that every element of a
finite field is a radical, and is either 0 or a root of unity. Nevertheless it can happen
that (173.13) does not have a solution formula. This is trivial in the special case F = .`, ffor non-linear equations (173.13)] since E has only the two elements 0, 1, therefore over it there exist no irreducible radicals at all. (There is no contradiction to Theorem 432, since now the above condition is not fulfilled.) EXAMPLE 2. Let f(x) be an irreducible polynomial of degree p over .2",. According to the above observation the equation f(x) = 0 is solvable, yet it has no solution formula. Otherwise it must also have (cf. the beginning of the proof of Theorem
432) a solution formula wherein only prime numbers other than p occur as root exponents, but this is impossible, since, by the equation f(x) = 0, a field of degree p is defined. EXAMPLE 3. Over the rational function field F = .z-$ (z) let us consider the separable equation
x2+x+z=0.
Its Galois group is of second order, thus solvable, although the equation itself is not solvable. (Of course, the condition quoted in Example I is not fulfilled.) Every root of unity over F is algebraic over 9 g. Thus if we adjoin to F arbitrary roots of unity, then the equation (because of the transcendence of z) remains irreducible. After adjoining suitable roots of unity we may always restrict ourselves to the adjunction
of irreducible radicals. When, for such an adjunction, the root exponent is odd, then the equation again remains irreducible. Finally, the adjunction of square roots leads to a pure inseparable extension, consequently it does not furnish a solution of the equation.
§ 174. The General Algebraic Equation First of all let us consider over a field F an equation (174.1) f(x) = 0, whose left-hand side is the product of separable principal polynomials
which we suppose without loss of generality to be different. Let ce, denote the Galois group of this equation. We put f (x) = (x - oc1) ... (x - xn) 23 R.- A.
(%I,. .., an
G),
(174.2)
GALOIS THEORY
688
where G = F(a1, . . ., an) denotes the Galois field of (174.1) over F, so that 4 is simultaneously the Galois group of G I F. If now t' -* n' is an arbitrary automorphism of G I F, then, because of (174.2),
J (X) = (x - aI) ... (x - an). Hence it follows that al ... and
a1...an
(174.3)
is a permutation of the roots of (174.1). Since furthermore al, ..., an are generators of G (over F), so the above-considered automorphism N -> t)' is already determined by the permutation (174.3); consequently the permutations
(174.3) must constitute a group isomorphic with c4. Generally we tacitly identify the Galois group of the equation (174.1) with this permutation group of the roots al, . . ., an. The index of this group, in the full permutation
group of the set
that we arrive in this way at different permutation groups, which moreover are of different degree, but necessarily are always isomorphic with one another.
According to the above preliminary observation we define the general algebraic equation of degree n as an equation
xn - xlxn-1 + ... + (-1)'-x, = 0
(174.4)
over the rational function field F(x1, ..., xn), where F can be an arbitrarily given field. This notation is justified by the circumstance that every algebraic equation of degree n over F arises from (174.4), when the indeterminates x1, ..., xn are replaced by elements of F. THEOREM 433. The general algebraic equation of degree n is separable and affectless. COROLLARY (RuFFirn-ABEL theorem). The general equations of degree
at least five are insolvable. Let a1, . . ., a,, be the roots of the equation (174.4) in an extension field of F(x1, ..., x,,). Then x1, . . ., x,, are the elementary symmetric polynomials of a1, . . ., an:
x1=act +...+an,
xn=a1...an.
THE GENERAL ALGEBRAIC EQUATION
689
Hence we see that F(al, . . ., (zn) is the splitting field of the equation (174.4) over the given field F(xi, . . ., x,,). We have recourse to the equation
(x-ti)...(x-tn)=x"-six"-1+...+(-1)"s,,=0,
(174.5)
whose roots t1i . . ., t" are new indeterminates over F. The s1, ..., s" are the elementary symmetric polynomials of t1,. .., t":
S1ti+...+to Sn = ti .. to .
Let us consider (174.5) as an equation over the field F(s1, ..., sn). Since its roots t1i ..., t" are different and F(t1, ..., tn) is its splitting field, this is, at the same time, its Galois field over F(s1i ..., sn). Its Galois group consists of all the permutations of
.
. ., XnJ ^' F [51, . . ., Sn] 77
f(X1, . . ., xn) -*J(Sl,
holds. In every case (174.6) holds with "-." instead of
.
. ., Sn))
(174.6)
", i.e., as a homo-
morphism. The kernel of this homomorphism is formed by the polynomials f(x1, ..., xn) for which f(si, ..., s,) = 0. But this equation is valid because of the proposition of uniqueness of the main theorem for symmetric polynomials (Theorem 268) only if f(t1i . . ., tn) = 0, i.e., f(x1, ..., x,,) = 0. Since, according to this, the kernel of the abovementioned homomorphism is 0, we have proved (174.6). The isomorphism (174.6) may be uniquely continued to an isomorphism F(x1,
.
. ., x,,) :.,
F(si.... , sn)
of the quotient field. Since in this the polynomials occurring in (174.4) and
(174.5) correspond to each other, it may be further extended to an isomorphism F(ai,
..., an)
F(ti, ..., tn)
of the splitting fields. This isomorphism maps the roots ai, ..., an of (174.4) in any order into the roots ti, . . ., to of (174.5). Hence it follows
GALOIS THEORY
690
that al, . . ., a are different, i.e., equation (174.4) is separable. Simultaneously it also follows that the Galois groups of the fields F(al, ..., an) I F(x1, . . ., xn),
F(t1,
.
. ., tn) I F(s1,
. .
., sn)
are isomorphic. Hence, according to what has been stated above, Theorem 433 is proved. Since the full permutation group of degree n is not solvable
if n > 5 according to Theorems 84 and 85, it follows that the corollary is also true. in the above proof the main theorem for symmetric polynomials might easily be dispensed with. In this way we could obtain a second proof of this theorem. Cf. VAN DER WAERDEN (1955).
According to the above corollary the splitting field of an equation of degree at least 5 generally cannot be obtained by adjunction of radicals from
the fundamental field. The most we can establish in this respect is the following
THEomm 434. Let f(x) denote a separable principal polynomial of degree n
with discriminant D over a field F of characteristic p 0 2, further let N I F be the Galois field off (x) and c0 its Galois group. D is a square in F if, and only if, 4t is a subgroup of the alternating group vgn. If this is not the case, then N contains a subfield F(.JD) of second degree over F, and the Galois group of N I F(J) is equal to the intersection c.P For the proof we put f ( x ) = (x
- a) ... (x - a,.)
(a1, ..., a. E N).
Then 4 and vgn may each be interpreted as a subgroup of the full permutation group of the set
Let s denote an element of co. sS = S if, and only if, s belongs to Accordingly, 6 lies in F if, and only if, 4 is contained in vgn. Hence, the first assertion of the theorem is proved, the remaining assertions follow trivially. EXAMPLE. We can obtain from Theorem 434 a second proof for STICKELEERGER's
theorem (Theorem 421). With this end in view, we apply Theorem 434 to a finite field F of characteristic p (> 3). Now, by Theorem 306, 4 is cyclic of order n, thus 4 is a subgroup of or,, if, and only if, n is odd. Hence we see that from the first assertion of Theorem 434 it follows that Theorem 421 is true for the special case m = 1. We then
obtain the general case of this theorem as in the second part of the first proof.
TSCHIRNHAUS TRANSFORMATION OF POLYNOMIALS
691
§ 175. Tschirnhaus Transformation of Polynomials
Let a non-constant principal polynomial f(x) over a field F be given. Take a further polynomial 99(x) over F. Let the factor decomposition
f(x) =x"+alx"-1+...+a,;=(x-aI)...(x-a") (175.1) hold over a splitting field of f(x) where the al, ..., a" are zeros of f(x). We form the principal polynomial
g(x) _ (x - Oai)) ... (x - 99(a,.))
(175.2)
with the zeros ..., 4p(a"). From the principal theorem for symmetric polynomials (Theorem 268) it follows that g(x) lies in F[x]. We introduce the notation g(x) = f (x) "
(175.3)
and say that this polynomial arises from the polynomial f(x) by the Tschirnhaus transformation with the polynomial 99(x).
THEOREM 435. If the result g(x) = f(x)) of the above Tschirnhaus transformation has only different zeros P1,
..., P", then we compute the (necessarily
distinct) zeros al, . . ., a", off(x) according to the rule
x - a; = (f(x), (p(x) - #,)
(i = 1; . . ., n).
(175.4)
By (175.4) we also select an order of succession of the al, . . . , a", moreover F(a,) = F((3,), as the proof shows, whence F(al, . . ., a") = F(f1, ..., #") for the splitting fields of f(x) and g(x).
For the proof of the theorem we choose the order of the al, ..., a" according to (175.2) by
9'W _ f,
(i = 1, ..., n).
Since then cv(ak) - f, = 0 only if k = i, x - a, must be the greatest
common divisor of f(x) and 99(x) - fl,. Hence the theorem is proved. Accordingly, with regard to the determination of all the solutions of an
equation f(x) = 0 (with distinct roots) the transformed polynomials g(x) =Ax)", quoted in the theorem, are equivalent to the given polynomial f(x). In the application we often proceed so that we seek a q((x) for which as many coefficients as possible of
g(x)=x"+blx"-I+...+b"
GALOIS THEORY
692
vanish. As the most simple example we take a linear polynomial
gp(x)=x+c
(cE F).
Because of (175.1) and (175.2)
g(x)=f(x-c)=(x-c)"+al(x-c)"-1+...+a" now holds. Here the coefficient of x"-1 is equal to -nc + a1. This vanishes for
provided that n is not divisible by the characteristic of F. So, by this we have settled that the problem of solving all equations of degree n over a field of characteristic p (p,r n) may be restricted to equations of the form
x"+b2x"-2+ ...+b"=0.
(175.5)
(In this special application it is not necessary to presuppose the distinctness
of the roots.) Thus if
instead of the general equations of degree n we need to consider only equations (175.5) with indeterminates b2,. . ., b". Of course, we could also have obtained the result (175.5) without the general notion of Tschirnhaus transformations. For further applications cf. BIEBERBACHBAUER (1933) and GARETT (1956).
EXAMPLE. If we restrict ourselves, as in Theorem 435, to the case in (175.3), where
f(x) and g(x) have no multiple zeros, then the similar equation
(g(x)) _ (1(x))' ,
(175.6)
follows from (175.3), where on the right-hand side we have to understand the Tschirnhaus transformation [with op(x)] of the ideal (Ax)) (cf. §122). Namely, the elements of
the ideal on the right-hand side of (175.6) are those polynomials h(x) over F, for which the condition h(p(x)) E (1(x))
is satisfied. This condition is identical to f(x) I h(q0)), i.e., according to (175.1), it is equivalent to the validity of all the equations h(cv(a,)) = 0
(t = 1, ..., n).
Since, according to (175.2), this amounts to the same as g(x) I h(x), it follows that (175.6) does, in fact, exist.
§ 176. Equations of Second, Third and Fourth Degree We consider for n = 2, 3 or 4 an equation of degree n
x"+aix"-I+...+a"=0
(176.1)
over a field F of characteristic p (>_ 0). Here we always suppose that p # 2
EQUATIONS OF SECOND, THIRD AND FOURTH DEGREE
693
and when n = 3, 4 we also assume that p 3. In both the last cases we also assume for the sake of convenience that al = 0, by which, according to the inference of the preceding paragraph, we lose no generality. Since the full permutation group of at most, fourth degree is solvable, by Theorem 432, equation (176.1) is solvable under the convention adopted,
and it also has a solution formula. We now intend to deduce this formula.
The procedure consists in assuming from the outset a splitting field ..., x,J, where xI, . . ., x denote all the roots of (176.1), then to
F(x1,
construct this splitting field by the help of adjunctions of irreducible radicals
and to express the roots xI, . . ., xn by the same radicals. In this way we have not only set up a solution formula, but also simultaneously effected the actual decomposition of the left-hand side of (176.1) into the form
x"+alx"-I+...+a,, =(x-xl)...(x-xn), where the x1, . . ., x,, are given by radical expressions over F. In connection with this we shall show that this decomposition also remains valid in the
reducible case, by which we obtain the complete solution of (176.1). We denote by D the discriminant of (176.1). We may put
JD = II
(xi - xk),
(176.2)
since, according to Theorem 277, the square of the right-hand side of (176.2) is equal to D. It should be noted that, by Theorem 434, the Galois
group of (176.1) over F(JD) is equal to a subgroup of the alternating group vf,, . CASE n = 2: (176.1) now reads
x2+alx+a_-=0.
(176.3)
On account of (113.4) we have
D = ai - 4a2,
(176.4)
JD = xl - x2 .
(176.5)
further because of (176.2)
2 = 1, so, according to the above statements, F(xl, x2) = F(-,/Y)). Accordingly, x1, x2 may be expressed as elements of F(.,/5). This can be done using equation (176.5) and the equation As t
xl+x2=--a1
GALOIS THEORY
694
arising from (176.3). Then we obtain
x1=
2
(-al+ / ), x2= 2 (-a1-.J),
(176.6)
where we have taken into consideration that p # 2. Since, conversely,
x2+a1x+a2=(x-x) (x-x2) always follows from (176.4) and (176.6), the formula (176.6) gives all the solutions in the present case. CASE n = 3: We now write (176.1) (with a1 = 0) in a simpler notation in the form
x-3+ax+b=0.
(176.7)
In order to avoid the necessity to distinguish between different cases, we assume that the coefficients a, b are indeterminates over F. This means that (176.7) is regarded as being over the rational function field F (a, b) as a fundamental field, which we still denote, for the sake of simplicity, by F. According to the final observation of the preceding paragraph, (176.7) is then like the general equation of third degree, by which we mean that Theorem 433 can be applied to equation (176.7), i.e., this is affectless. The result for this case can be applied to every other case, if we replace the indeterminates a, b by elements of the fundamental field originally given. Since equation (176.7) is affectless, it follows, as above, that its Galois group over F(,FD) is equal to f3. On account of (113.5) we now have
D = - 4a3 - 27b2.
(176.8)
Further, according to (176.2),
VD = (x] - x2) (x1 - x3) (x2 - x3) =
xix., - Y- x32Lx3 , 3
(176.9)
3
where the symbol > is defined for arbitrary functions f(xl, x2, x3) by 3
E J (XI, x2, x3) = J
x2, x3) + fl X21 x3, x1) + f(x3, x1, x3]
a
As we have O(vE3) = 3, the Galois field F(x1, x2, x3) I F(N46) is cyclic of degree three. We adjoin both third roots of unity
_=
2 (- 1 + J- 3), 02= 2
(176.10)
EQUATIONS OF SECOND, THIRD AND FOURTH DEGREE
which is equivalent to the adjunction of the radical ,
695
. Of course,
also the field F(x1, x2, x3, J-3) 1 F(JD ,) is cyclic of degree three. We take the Lagrange resolvent [(172.2)]: y = x1 + Cx3 + e2x3 .
(176.11)
(The following will be a similar construction to that in the proof of Theorem 426.) From (176.11) and p3 = 1 we have
y3 = E xi + 3e E xix2 + 3e2 E xix3 + 6x1x2x3 . 3
3
3
After the substitution of (176.10) and then (176.9) we obtain y3 =
xi 3
2
6
xix2 + 6x1x2x3 + 2 V - 3 JD ,
(176.12)
where E denotes the summation over all the permutations of <x1, x2, x3>. 6
From the formula p3 = - ai + 3a1a2 - 3a3 (§ 115, Example 1) we get Z Al = - 3b. 3
Further,
Y, xl 3 After substituting
E x1x2 = E xix2 + 3x1x2 x3. 3
6
Y_ x1 = x1 + x2 + x3 = 0,
x1x2x3 = - b
3
we obtain
E xix2 = 3b. 6
Thus, from (176.12),
y= 3I- 2 b+ 2 y-3D , where
- 3D =
(176.13)
JD (therefore the occurrence of J - 3D does
not imply that any new radical has been adjoined). Since y is obviously an irreducible radical of degree three over the field F(%D , /----3), it follows that
F(x1, x2, x3, J- 3) = F(/D, J- 3 , y).
(176.14)
Hence, it is possible to compute x1, x2, x3 as elements of the right-hand side of (176.14). 23/a R.-A.
696
GALOIS THEORY
For this purpose we have at our disposal the equations
x1+x2+x3=0
(176.15)
and (176.11). Similarly to (176.11) we put (176.16) y' = x1 + e2x2 + ex3. For this a formula similar to (176.13) holds (see below), but it can also be computed as follows: because e3 = I and (176.10),
yy'
_ 3
+ (e + e2) E xlx2 =
3
3
xi -
3
x1x2 = (Z x1)2 3
- 31 x1x2 3
follows from (176.11) and (176.16). According to (176.15) and
E x1x2 = x1x2 + xlx3 + x2x3 = a 0
we have
(176.17)
YY' = -3a, therefore y' is determined in terms of y.
We now compute the roots x1, x2, x3 from (176.15), (176.11), (176.16),
taking into consideration e2 + e + 1 = 0 and p # 3,
(Y+y'),
x1=
x2 = 3 (e2Y + ey') ,
(176.18)
1
X3 = 3 W + e2Y')
The equations (176.8), (176.10), (176.13), (176.17), (176.18) together furnish the required result. Since with the radical expressions x1, x2, x3 so computed, the decomposition
x3+ax+b=(x-x1)(x-x2)(x-x3) holds, this also remains true in the case where the indeterminates a, b are replaced by arbitrary elements of the original fundamental field, so that our result, apart from the case when a = 0, is generally valid.
We can express the result in another form too. If /- 3 is replaced by
- J- 3, e becomes e2, thus (176.11) becomes (176.16). Accordingly, from (176.13) we obtain
y' = 3 -
22
b-3
- 3D
(176.19)
697
EQUATIONS OF SECOND, THIRD AND FOURTH DEGREE
with respect to which it should be noted, however, that the radicals (176.13), (176.19) are not arbitrary inasmuch as (176.17) must bold for them. After substituting (176.8), (176.10), (176.13), (176.19) in (176.18) we now obtain all three roots of the equation (176.7) in the form
x=
a
-
b
+ JI 212 + 13 I3
+ s - 2-
X2)2
+ Tr (176.20)
where the radicals occurring here are always to be chosen so that the
product of the two terms of the right-hand side should be -
. This
formula (176.20) is called the Cardan formula (for the equation of3 degree of course, the case a = 0 is no longer an exception. Finally, denote the two radicals in (176.20) by u and v, so that (176.20)
then assumes the form x = u+ v uv = six pairs of values are possible: u, v; 0u, 02v;
02U,
I. Then for u, v the following 3
Ova v, u; 0v,
e2U;
e2v, eu ,
of which the first three already furnish the three solutions of (176.7).
CASE n = 4: We can now take the given equation (176.1) (because a1= 0) in the form
x4+ax2+bx+c=0,
(176.21)
Again it will suffice to consider the case where a, b, c are indeterminates, i.e., that a rational function field F (a, b, c) is the fundamental field, which we denote, for the sake of simplicity, by F. The solution of (176.21) will be obtained by its reduction to the previous case n = 3. Because of the supposition, the Galois group of (176.21) is the full permutation group '94. All its composition series are,, according to Theorems 84, 85, the
.94D f4JG4z) HD 1,
(176.22)
where G4 denotes the four-group consisting of the permutations 1, (1 2) (3 4), (1 3) (2 4), (1 4) (2 3) ,
(176.23)
and H is an arbitrary subgroup of order two of G4. The invariance fields belonging to the subgroups (176.22) form a chain
F c F(,JD) c F. c F12 c F(x1, . . ., x4)
(176.24)
OAI.OIS THEORY
698
with [Fe : F] = 6, [F12 : F] = 12. Our next intention is to construct the field F6.
With this end in view we consider the elements
Y1=(x1+x2)(x3+x4), (176.25)
Y2 = (X1 + x3) (x2 + x4) ,
Y3 = (x1 + x4) (x2 + x3)
and assert that
F6
(176.26)
= F(Y1, y2, y3).
On the one hand (176.25) furnishes fixed elements of the permutations (176.23), whence F(Y1, Y2, Y3) c Fa
(176.27)
.
On the other hand, we determine the equation of degree three with the roots yl, y2, y3. For this we must determine the elementary symmetric polynomials of yl, y2, y3. These are in shortened notation [Yll = 2[x1x2] = 2a [YIY21 = [xix221 + 3 [xix2x3] + 6 x1x2x3x4
YLY2y3 = [x3jx2x3] + 2 WX2x3x4] + 2 [x2jx2x3] + 4 [xlx2xsx4l
where, e.g., [x2jx2] denotes that symmteric polynomial whose terms all have the coefficient 1 and result from xix22 by permutations of <x1, ..., x4). Since [xix2]2 _ [x2jx22] + 2 [xlx2x3] + 6 x1x2x3X4 = a2,
[x1] [XIX2x31 =
[x2jx.2x3] + 4 xlx2x3x4 = 0
,
X1x.2x3x4 = C
[xlx2x3] = - 4c,
[xix2]=a2+2c, [YLY2l =a2-4c. Furthermore [x1] [x1x21 [xlx2x3l = [xix2x3] + 3 [xlx2x3x4] + 3 [x1 2xg] + 8 [x2jx2x3x4] = 0 ,
[JGIX2X3X4] + 2 [ 'IX2x3X4] = 0 , [X1X2x3]2 =
[X1X21X1X2x3X1 =
[x2jx2x321 + 2 [x2Ix2X3x41 = b2,
[xix2x3x4] = ac
EQUATIONS OF SECOND, THIRD AND FOURTH DEGREE
699
hold, so that we have
[xixaxs] _ - 2ac + bz , 2ac, [-41x2x3x4] [xix22x3] = 4ac - 3b2 ,
Y1Y2Ys = - Y.
Consequently yl, y2, y3 are the three roots of the third degree equation
y3-2ay2+(a2-4c)y+b2=0,
(176.28)
which (or, at least its left-hand side) is called the cubic resolvent of equation
(176.21). [Often y3 + 2ay2 + (a2 - 4c) y - b2 = 0 is also called so.] We shall show that the discriminant of (176.28) agrees with that of (176.21). We obtain from (176.25)
Y1 - Y2 = (xl - x4) (x3 - x2) ,
Y1 -/Y3 = (x1 - x3) (x4 - x2) ,
Y2 - Y3 = (X1 - x2) (x4 - x3)
whence the assertion follows. Since D is not now a square in F, it follows from what has been proved,
and from Theorem 434, that the equation (176.28) is affectiess, i.e., [F (3'1, )'21 Y3) : F] = 6. Hence, and from (176.27), (176.26) follows. From (176.28), y1, y2, y3 may be computed by the Cardan formula. Accordingly, it still remains to construct the extension field F(xl, ..., x4) of F(y1, y2, y3) by the adjunction of radicals and to express XI, ..., x4 in terms of these radicals. This is easy since this field, according to (176.22), (176.24), has the four-group G 4 for Galois group, thus the adjunction of two suitable square roots will be sufficient. In order to carry through this construction, we take into consideration XI + x2 + x3 + X4 = O ,
(176.29)
therefore (176.25) may be written as y1 = - (X1 + x0)2,
y2 = - (XI + x3)2,
y3 = - (X1 + x4)2.
Accordingly,
x1 + x2 =I1 X1 + X3 =
-Y21
X1 -+4
Y3,
(176.30)
whence evidently F(xI,
. .
., x4) =
(176.31)
GALOIS THEORY
700
According to what was said in (176.31) a square root is dispensable. We also prove this directly by showing that Yi -,I-Y2 %-Y3
= -b .
(176.32)
Since, for the product of the left-hand sides of (176.30),
(x1 + x2) (xi + x3) (xl + x4) = xi [x11 + [xlx2x3l = -b, so
(176.32) is true. Lastly we obtain from (176.29), (176.30) 1
x1= 2 (,/-Y1+
2+
3),
/
G,
X2
(176.33)
1
x4 = 2 (- N
/
Yl -
/
-Y2 + V -Y3)
These formulae thus furnish all the roots of the biquadratic equation (176.21), where yl, y2, y3 denote the three roots of the cubic resolvent (176.28) and the three square roots in (176.33) satisfy condition (176.32). The solution formula (176.20) is wrongly attributed to CARDAN as he obtained it from TARTAGLIA, but it is not known whether the latter was the discoverer of this formula. Formulae (176.33) are essentially due to FERRARI. EXAMPLE 1. We obtain the Cardano formula by a simple but clever method as follows: put x = u + v, with two new unknowns u, v, in (176.7). We then have
(u + v)3 + a(u + v) + b = 0, i.e.,
u3+v3+(3uv+a)(u+v)+b=0.
if we subject u, v subsequently to the condition
3uv+a=0, then the former equation reads
u3 + v3 + b = 0. The solution of the system consisting of the last two equations entails no difficulties
and leads at once to (176.20). Of course, this elementary process gives no real insight into the cubic equation. (See also the following paragraph.)
EQUATIONS OF SECOND, THIRD AND FOURTH DEGREE
701
EXAMPLE 2. Also the biquadratic equation (176.21) admits an elementary solution, if we seek a factor decomposition
x°+axs+bx+c=(x2+rx+s)(x2-rx+t) of its left-hand side. For this purpose one must determine a solution r, s, t of the system of equations
s+t-r$=a, r(-s+t)=b, st=c,
which can be easily done. EXAMPLE 3. Over a field with characteristic other than 2 the polynomial
y2 + bx +
JOY -
2 ay - 4 (a$ - 4c)
4
n the two indeterminates x, y has the property that its discriminants, with respect to y and x, are the biquadratic polynomial x4 + ax2 + bx - c and its cubic resolvent y3 - 2ay2 + (a2 - 4c)y + b2, respectively. [Cf. R DEI (1959c)].
§ 177. The Irreducible Case
First of all let us consider an arbitrary polynomial over Col without multiple factors which we decompose into the product of irreducible factors : ,f (x) = 11(x) ...1,(x) q1(x) ... qs(x)
The lk shall be assumed to be linear and the qk quadratic. Since the discriminant of an 1k is equal to 1 and that of a qk is negative, it follows from the multiplication theorem for discriminants (§ 113, Example 2) that the discriminant of J (x) is positive if, and only if, s is even, i.e., the number of
the non-real zeros of f(x) [in 501(i)] is divisible by 4. We now examine a cubic equation over 9'tO) without multiple roots, which we assume without restriction of generality to be in the form
x3+ax+b=0,
(177.1)
further we denote the discriminant of (177.1) by D. According to the above, the number of the real roots of (177.1) is equal to 3 or 1 according as D is positive or negative.
On the other hand, we know that for computing the roots of (177.1) according to the Cardan formula it suffices to adjoin to the fundamental field for a fixed value of _-3D the three values of 22
b+2
(177.2)
702
GALOIS THEORY
[cf. (176.13)]. If there is only one real root, i.e., if D < 0, then among the values of the radical (177.2) a real one always occurs, so that this single real
root of (177.1), on account of the Cardan formula, may be computed within .7(o. It is different if all three roots are real, i.e., if D > 0, since then the radical (177.2) has no real value at all. This case of the cubic equation, in which it is has three real roots, is called the irreducible case. The terminology refers to
the fact that in this case equation (177.1), applying the Cardan formula, is reducible only after the adjunction of non-real radicals. [From (176.18) it
is clearly seen how the three real roots then arise as a sum of pairwise conjugate complex numbers.] One might think that in the irreducible case the difficulty could be avoided; namely, the Cardan formula might be replaced by another, which
in the case of three real roots would only require the adjunction of real radicals. The centuries old efforts regarding this problem were brought to an end with the Galois theory in the following theorem. THEOREM 436. If f(x) is an irreducible polynomial of third degree with three real zeros over a subfield F of then none of these is expressible by real radicals (over F). For, let D (> 0) be the discriminant of f(x). After the adjunction of .JD
we obtain a field F(.JD) of at most second degree over F, so that f(x) remains irreducible. Hence and from Theorem 434 it follows that the Galois
is the alternating group 'e, group of f(x) over We suppose that f(x), after the adjunction of certain further real radicals 01, ...,14',,, will be reducible. We may assume that all the root exponents therein are prime numbers. In the considered sequence of radicals there is a term #k= /w (q a prime number) such that, of the fields G = F(.[, t91, ..., Ok-1),
G1
= G (;/w)
f(x) is irreducible over the first and reducible over the second. If the radical Jco is reducible, then co is, according to ABEL'S theorem (Theorem 427), a qt' power in G, whence, because .Jw E 9o), we obviously have .,/w E G, G1= G. But this is absurd, thus ,A.) must be irreducible. Hence [Gl : G ] = q. Since, on the other hand, G I contains a zero of f(x), we have 3 1 q, i.e., q = 3. According to what has been said above, the Galois field of J (x) over G is of degree 3. Since G1 contains a zero of f(x) and is likewise of degree 3 over G, so G1 I G itself must be a Galois field. Thus the irreducible polynomial
f(x) over G decomposes into linear factors over G.I. Likewise, according to Theorem 427, G1 must contain all third roots of unity which is impossible,
because G1 9
o). Consequently the above theorem has been proved.
EQUATIONS OF THIRD AND FOURTH DEGREE
703
§ 178. Equations of Third and Fourth Degree over Finite Fields
If f(x) = 0 is an equation over a field F, then the question arises as to how many roots lie in the fundamental field F itself. If, e.g., F = .off, and f(x) is a polynomial of third degree (over Y(0)) without multiple zeros, then the number of these roots of f(x) = 0 is 1 or 3. On the other hand, for other fundamental fields this number can have the three values 0, 1, 3, e.g., if F is a finite field. Here, and from now on, we shall deal only with this case and we shall examine the question raised with regard to equations of third and fourth degree. The KONIG-RADOS theorem (Theorem 313) answers this question generally for finite fields, for equations of arbitrary degree, but much simpler results of a quite different kind hold in the special cases mentioned here. THEOREM 437. Let F denote the finite field of characteristic p (> 3) such that O(F) = q, further let n =
q-1
q+1 or
6
(178.1)
6
denote the integer nearest to 6 . Let the cubic equation
x3+ax+b,0
(178.2)
over F be given with the discriminant
D = -4a3+27b2#0.
(178.3)
If D is not a square in F, then (178.2) has exactly one root in F. If D is a square in F, and
t, (-27b2D'1) = 0 .
(178.4)
where
V(X)=l
2n
Jx2...+2n
(178.5)
then (178.2) has three roots in F. In the remaining cases (178.2) has no roots in F. In order to prove the theorem, let us denote by nz (= 0, 1, 3) the number
of roots of equation (178.2) lying in F. Then we have m = 1 if, and only if, the left-hand side of (178.2) splits into the product of two irreducible factors over F. This is the case, according to STICKELBERGER'S theorem
(Theorem 421), if, and only if, D is not a square in F. Hence we have shown that m = I if, and only if, D is not a square in F.
704
GALOIS THEORY
Henceforth we suppose that D is a square in F, whence m = 3 or 0. It suffices to prove that (178.4) is true if, and only if, m = 3.
Let Fe denote the field of 6" degree over F, for which we then have O(F6) = q6. We denote by F2 and F3 the subfields of F of degrees 2 and 3, respectively, over F. Let x1, x2, x3 be the roots of (178.2) in F6. But since
m = 0 or 3, so (178.6)
xI, x2, x3 E F3.
Since a -+ aq is a primitive automorphism of F6 I F, if m = 0 we can choose
the notation so that (178.7)
X1= x2, X2= X3, x3= x1. Because of the definition of n,
q=6n+e
(e=±1).
(178.8)
Certainly F2 (possibly also F) contains the radical ,/-3, therefore, also, the
third roots of unity
o2(-1+
,
022(-1-%J-3).
(178.9)
Furthermore, because of the equivalence of the propositions
OEF, x2+x+lIx°-1-1, x3-1Ixq-1-1, 31 q-1, the rule
EFae=1
holds.
(178.10)
We now consider the Lagrange resolvents y = x1 + 0X2 + 02x3 , y' = x1 + 0`'x2 + 0x3.
(178.11)
These lie in Fe. Just as in (176.13), (176.19) now
y3=2 (- 27b +
-27D), y'3=2 (- 27b -
27D). (178.12)
Here also y, y' are different from 0, for otherwise we have (27 b)2 = -27 D, a = 0, although we have restricted ourselves to the case a 0 0. We compute yq, where we distinguish, according to the different
values of m (= 0, 3) and e (= 1, - 1), four cases. We prove that
yq=o2y (m = 0, e = 1); yq = oy' (m = 0, e = - 1) ; yq=
(178.13)
y(m=3,e= 1) ; yq= y'(m=3,e= -1). (178.14)
EQUATIONS OF THIRD AND FOURTH DEGREE
705
If m = 0, according to (178.7) and (178.111), we have yq = x2 + ogxs + o2gxl = o2q(x1 + ogx2 + o2gx3)
But, according to (178.10), we have oq = o or oq = 02, according as e = 1 or e = -1. Hence, and from (178.11), the assertion (178.13) follows. If m = 3, x1i x2, x3 E F, then yq = x1 + ogx2 + 02gx3 .
If the cases are similarly distinguished as formerly, then (178.14) follows. From (178.13), (178.14) we obtain proper equations if we exchange o for o2 and y for y'. By division of the corresponding equations we obtain (yy'_1)q-I=o(m= 0,e= 1)
;
(yy'_I)q_1= 1(m=3,e= 1);
(yy'-I)q+1= o-1(m=0,e= -1); (yy'-1)q+1= 1(m=3,e= -1).
Accordingly, m = 3 if, and only if,
(yy'-1)q_e
= 1. This condition, by
(178.8) and (178.12), may be represented in the form
27b +
- 27D 2n=
27b-
-27D)
1
and then in the form (1 +
227b2 D- I)2n - (1 -
- 27b2D_ 1)2" = 0
.
Hence, after applying the binomial theorem and dividing by 2,/ - 27 b 2 D-1 we obtain equation (178.4). Consequently Theorem 437 is proved. THEOREM 438. Let a biquadratic equation
x4 + axe + bx + c = 0
(b & 0)
(178.15)
without multiple roots be given over a finite fielal F of characteristic p (> 3).
Let m and m' (< m) denote the number of the roots of the cubic resolvent
ys + 2ay2 + (a2 - 4c)y - b2 = 0
(178.16)
lying in F, and the number of those roots of (178.16) which are also squares in F, respectively. If m > m', then (178.15) has no roots in F; if, on the other hand, m = m', then (178.15) has exactly m + 1 roots in F. (It should be noted that for the pair m, m' only the following five possibilities may be taken into consideration: 0, 0; 1, 0; 1, 1; 3, 1; 3, 3.)
GALOIS THEORY
706
For the purpose of the proof, let n denote the number of roots of (178.15) lying in F. The theorem can be divided into the following part-assertions:
Assertion 1. n = 1 if, and only if, m = 0.
Assertion 2. n = 2 if, and only if, m = m' = 1. Assertion 3. n = 4 if, and only if, m' = 3. We denote the roots of (178.15) by x1, . . ., x4 in a suitable overfield of F. Because x1 + ... + x4 = 0, the three elements [cf. (176.25), (176.28)] Y1 = - (x1 + x2) (x3 + x4) = (x1 + x2)2, (178.17)
Y2 = - (x1 + x3) (x2 + x4) = (xj + x3)2, Y3 = - (x1 + X4) (x2 + X3) = (x1 + x4)2
are then the roots of (178.16). Hence it follows [cf. (176.31)] that F(x1i
.
. ., x4) = F(Jyl,
Y2,
Thus x1, ..., x4 E F if, and only if, we have Jy1,
.,/Y3)
Y2,
(178.18)
.
Ys E F. Thus Asser-
tion 3 is proved. We denote the left-hand side of (178.18) by G. Because of (178.18), the degree of G I F(Y1, Y21 Y3) is a power of 2. Thus the degree of G I F is divisible by 3 if, and only if, the same holds for the degree of F(y1, Y2, y3)1 F. Thus we have established Assertion 1. In order to prove Assertion 2, we first suppose that n = 2. We may even suppose that x1, x2 lie in F. Then 1
x3=- 2(x1+x2)+Jw, x4=- 2(x1+x2)-Jw 1
where w is an element of F, which is not a square in F. Consequently, accord-
ing to (178.17), we have z
Y1 = (xl + xz)2, Y2 = ( 2
(xl - x2) + ',/;) , Y3 = l2 (xl -
7e2)
-
2
Therefore m = m' = 1. Conversely, we suppose that m = m' = 1. Then among the yl, y2, Ys exactly one is a square in F. We may suppose that it is yl. According to (178.17) we then have x1 + x2 = r,
y1 = r2
(178.19)
707
EQUATIONS OF THIRD AND FOURTH DEGREE
where r is an element of F. After substitution in (178.16) we get
r6 + 2ar4 + (a2 - 4c)r2 - b2 = 0
(r # 0).
(178.20)
Also 4
i
(x - xI) = x2 - rx +
r3+ar+b IIx2 + rx + r3+ar- b ) 2r
2r
(178.21)
holds, since the right-hand side, because of (178.20), agrees with the lefthand side of (178.15). Because b 0 0 it follows immediately from (178.15) that the six sums x1 + xk (1 <- i < k 5 4) are distinct. Hence, and from (178.191), it follows that the two factors of the right-hand side of (178.21) must be equal to the products
f(x) = (x - x)(x - x2) , g(x) = (x - x3) (x - x,1) .
(178.22)
Since, accordingly, the polynomials (178.22) lie in F[x], their discriminants (x1 - X2)2,
(178.23)
(x3 - x4)2
are elements of F. Since, on the other hand, y2, y3 are the zeros of an irreducible polynomial of second degree over F, (Y2 - Y3)
is an element of F, but not a square in F. Further according to (178.17) and
xl + ... + x4 = 0 we have Y2 - y3 = (XI + x3)2 - (XI + x4)2 = (x1 - X2) (;3 - x4) , thus
(Y2 - Y3)2 = (XI - x0)2 (x3 - X4)2.
Since the group F* is cyclic and of even order, it follows that of the two factors of the right-hand side exactly one is a square in F. These factors are the discriminants (178.23) of the polynomials (178.22), so that of these two polynomials f(x), g(x) E F[x]) 'bxactly one is reducible. That means that n = 2, by which Assertion 2 and Theorem 438, too, are proved. EXAMPLE. We see that Theorem 437 determines the number of roots of (178.2) lying in F. These roots themselves, of course, may be computed according to the Car-
dan formula, where possibly an extension of the fundamental field is needed; but in the majority of cases it is possible in another way, too. For instance, if (178.2) has exactly one root in F (i.e., if D is not a square in F), then this root corresponding to the cases 3 1 q - 1, 3 1 q + 1 is equal to q+2
- 3a-1 (w 3 + w'
q}2 3
) or w
20-1
2q-1 3
+ w'
3
(178.24)
GALOIS THEORY
708 respectively, where
ru
b
3
lE
+
Y
}
`3 (178.25)
bi_
_
(A 2)+
3
E
.
((
=3)
The formulae (178.24) are radical free, thus applicable within F, since after replacement by (178.25) the odd powers of the radical disappear owing to the forming of sums. Cf. also REDEI (1946-48b). EXERCISE. Prove the assertion concerning (178.24).
§ 179. Geometrical Constructibility
We shall see that the problem of geometrical constructibility (with compass and ruler) is solved by GALoIs theory. It will be shown that this problem may be reduced to the theory of solvability of equations. As a preliminary we need the following theorem, very important also in itself. THEOREM 439. Every finite p-group has a centre and is solvable.
Let c0 denote a finite p-group. Since 0 (4) is a power (>->- p) of p, so in
the class equation (42.32) for cPt the number of the terms equal to I is divisible by p and is consequently at least p. Hereby the first assertion of the theorem is proved. As the centre is a normal divisor, the second assertion follows by induction.
Let a Euclidean plane be given where we admit imaginary points, but exclude infinite points. When we speak of points or sets of points, these will always belong to the Euclidean plane. We consider a fixed set C of points,
which consists of at least two points. Then we consider the straight lines which pass through at least two points of G5, and the circles which have a point of G5 for centre and pass through at least one other point of G. The set of points which belong to at least two of the above straight lines or circles is denoted by `G' 1. Here, G51 ? G5 holds trivially. We repeat the above process with 61 instead of 6, so obtaining a set G52 of points, etc. This leads to a chain of sets G5 9- S19- G29- ...
whose union
S=G UG51U...
(179.1)
is called the set of (geometrically) constructible points of the points of G.
Note that every point of 9 belongs to a s,,, so that its geometrical construction may be carried out in finitely many steps. The straight lines and
GEOMETRICAL CONSTRUCTIBILITY
709
circles occurring when determining C5, further the lines, arcs, angles and plane
figures (triangles, etc.) determined by the former, are also called constructible. Different authors define geometrical constructibility differently. We can assume as given elements straight lines, circles, lines and angles in addition to points, and further admit, while constructing, the help of arbitrarily chosen points. Within this general framework we can, e.g., undertake the task of constructing for a given straight
line g the parallel straight line through a given point P. This task may be brought within our definition by a slight modification so that we give the straight line g by two
points A, B of it, then try to define in the set S = a fourth point Q such that the lines AB, PQ are parallel. We have purposely chosen as far as possible a simple formulation for the definition of constructibility, but it is sufficient, as the above example shows, for all the usual construction problems. For the sake of brevity we shall not carry out completely the simple elementary geometrical reasonings to be applied. For this, we refer the reader to BIEBERBACH (1952).
Our first aim is the algebraic characterization of the points of S. With this end in view we introduce rectangular point coordinates with the same metric on both axes x, yin our plane. As the origin of the (say right-oriented)
coordinate system and as unit point of the x-axis we take points of S. Then we adjoin to .70 both coordinates of all the points (x, y) of e, by which we obtain a subfield F of 9o)(i). We call F the field belonging to the set (5 of points. This is permissible, since F is uniquely determined by C. If we
choose, as above, another pair of points of e for fundamental points of the coordinate system, this implies a coordinate transformation in the determining equations of which only elements of F occur as coefficients, therefore the field F is again generated by the "new" coordinates of the points of e. We adjoin to F the square roots (E .9o)(i)) of all the elements a of F, by which we obtain a field FI. If we repeat this process for F1 instead of F, etc. then we obtain a chain of fields
F(= Fo) S F1 S F2 c
... (c
According to Theorem 47, the union of the i lds
F=FUF1U...,
(179.2)
exists and is called the 2-closure of F. It is evident that r is that minimal field
between F ando,(i), which contains the square roots of its elements. (This means, in other words, that all the quadratic polynomials f(x) over F are reducible.) In particular, F (and even F1) contains the element i = J -1 . The algebraic characterization of S then reads as follows:
710
GALOIS THEORY
The set e of the points, constructible from the points of G5, consists of those points both of whose coordinates belong to the field F.
From the definition of G and F it follows by elementary geometrical
reasoning that the coordinates of an arbitrary point of 6 are radical expressions over F, where only square roots occur as radicals, thus they belong to F. We have still to show that, conversely, every point with coordinates in F belongs to S. The proof depends upon the following propositions I, II, where a, j9 denote complex numbers: 1. A point (a, /1) lies in S if, and only if, (a, 0), (fl, 0) lie in G. II. When (a, 0), (fl, 0) (j 0) lie in ig, then
(a - fl, 0), (4-1, 0), (N/«, 0)
,
also lie in G. We shall show, first, that from I and II follows the previous statement,
viz., that every point (a, fE) with a, 9 E F lies in 8. Let P, a.... be the coordinates of the points of C. It follows from I that all the (e, 0), (a, 0), ... lie in G. Since, on the other hand, we have F = Y0(e, a, ...), it follows Further it from one part of II that all the (x, 0) with 1c E F(= FO) lie in follows from II that if for some n (= 0, 1, ...) all the (IC, 0) with K E F. lie in 8, then all the (ic, 0) with x E F,,+1 also lie in G. It follows by induction
that all the points (K, 0) lie in 9, in which K belongs to one of the fields F, F1, ... This implies, according to (179.2), that all the (x, 0) with K E F lie in ig, whence the assertion follows, according to I. I and II themselves have to be proved only for the case where C, apart from the points (0, 0), (1, 0), contains only the point (a, j9) or the points (a, 0), (f, 0), respectively. The proof is known from elementary geometry. We now prove the following THEOREM 440. Let a set G of points, consisting of at least two points, be given in a Euclidean plane extended by the imaginary points. Take two arbitrary points of G as zero point and positive unit point, respectively, on the
abscissa-axis of a rectangular coordinate system and extend .5 by the adjunction of the coordinates of the points of G to afield F. In order that a point (a, j9) of the plane should then be constructible from the points of it is necessary and sufficient that a, fl be algebraic over F and that the degree of the Galois field of F(a, fi) I F be a power of 2.
According to I it is sufficient to prove the theorem for one point of the special form (a, 0). First, we suppose that ((x, 0) is constructible. Then
a E F, where F denotes the 2-closure of F. This implies that a is a radical expression over F, wherein only square roots occur as radicals. Accordingly a is algebraic. Let N be the Galois field of F(a) over F. N is obtained from F by adjoining all the conjugates of square roots in N which
GEOMETRICAL CONSTRUCTIBILITY
711
occur in a. These adjunctions may be ordered so that the degree of the field
is doubled at every step. Hence it follows that [N : F] is a power of 2. Now we suppose, conversely, that a is algebraic and the Galois field of F(a) I F has a power of 2 as its degree. Its Galois group is a 2-group and consequently, according to Theorem 439, is solvable. Hence, it follows, according to the second part of the principal theorem for solvable equations (Theorem 432), that a. is a radical expression over F, in which only square roots occur as radicals. According to what has been stated above, the point (a, 0) is therefore constructible, thus Theorem 440 is proved. There is a particularly important special case of this theorem when the given set G5 of points consists of only real points and the constructibility of
another real point is required. In this "real case" of the theorem we can restrict ourselves to a real Euclidean plane. This may be understood as a Gauss number plane, so that we can identify the point (x, y) with the complex number x + yi. The special choice of the coordinate system described in the theorem is to ensure that the numbers 0, 1 (i.e., the corresponding points) should belong to C. In addition, we have to make the slight modification that we take F(i) for fundamental field instead of the above F. (This is quite permissible, since the point i, i.e. the positive unit point of the y-axis, is constructible.) When we again write F instead of F(i) then F may also be obtained from . by adjoining the complex numbers belonging to the points of G, their conjugates and also i. If now a + bi is a point of the Gauss plane (a, b E then, according to Theorem 440 it is constructible if, and only if, the field F(a, b) I F is algebraic and its Galois field has a power of 2 for its degree. However, these conditions may be replaced by the similar conditions relating to the field
F(a + bi) I F. The simplest way to recognize this is that a, b are radical expressions composed of square roots over F if, and only if, the same holds
for a + bi, and so, because i E F, also for a - bi. The result is expressed in the following THEOREM 440'. Let a set S of points, which among others shall contain the points corresponding to the numbers 0, 1, be given in a Euclidean plane considered as a Gauss number plane. Let F denote ;he field, which arises from .2 after the adjunction of i, of the complex numbers corresponding to the points of G5 and their conjugates. Then in order that an arbitrary point of the plane should be constructible from the points of S, it is necessary and sufficient that for the corresponding complex number a the field F(a) I F should be algebraic and its Galois field should have a power of 2 for its degree. EXAMPLE 1. We examine for which numbers is (> 3) a regular n-gon can be constructed. This construction problem is known as the cyclotomy problem. The problem is defined as follows: let two points 0, A be given. It is required to discover whether a regular n-gon can be constructed geometrically, one vertex of which is A and whose remaining vertices all lie on a circle with centre O. We proceed according to Theorem
712
GALOIS THEORY
440'. Since we take 0 and A for the points 0 and I of a Gauss plane, F = .57'o(i) is the fundamental field, and the angles of the regular n-gon are given by the complex roots of the equation
X"- I = 0.
These are of the form 1, a, ..., a"-', where a is a primitive nt root of unity, i.e., a root of the n`h cyclotomic equation F"(x) = 0. Of course, it is sufficient to consider the case in which a is constructible. Now .91-h(a) I F
is the n" cyclotomic field and this, according to Theorem 423, is a Galois field of degree 99(n). Because F = F"0(i) it follows from this, and from Theorem 419, that F(a) I F
is a Galois field of degree q9(n) or p(n). It now follows according to Theorem 440' that the regular n-gon is constructible if, and only if, p(n) = 2k
(179.3
for some integer k. This condition, on account of (84.22), implies that n has no multiple
odd prime factors and its odd prime factors are all of the form (s = 1, 2, ...).
2' + 1
There s must be a power 2' of 2, for ifs had an odd divisor d (> 1), then 2°-'' + 1 would be a proper divisor of 2' + 1. The prime numbers of the form 22' + 1
(179.4)
are called Fermat prime numbers. The final result now reads as follows: only those regular n-gons (n > 3) are constructible whose number of sides is of the form
n = 2'p, ... pk
(e > 0, k > 0),
(179.5)
where Pt, ..., Pk are different Fermat prime numbers. This is the famous theorem which Gauss proved in his youth. It should be noted that at present only those Fermat prime numbers are known which are given by t = 0, 1, 2, 3, 4; these are: 3, 5, 17, 257, 65537.
(179.6)
The case t = 5 of (179.4) is a number divisible by 641, thus not a prime number. According to (179.5) and (179.6), those n (3 < n < 20) for which the regular n-gon is constructible are the following: n = 3, 4, 5,-6, 8, 10, 12, 15, 16, 17, 20. EXAMPLE 2. By trisection we understand the problem of dividing an angle by geometric construction into three equal parts. This is equivalent to the following: let a complex number w (96 1) be given, where I co I = 1, furthermore let a = /w, where it is immaterial which of the three values of this radical are taken. We want to know whether the point a can be constructed from the points 0, 1, co. (According to the well-known de Moivre formula the construction of the third part of a given angle
is involved.) In order to clarify this problem by applying Theorem 440', We take F = Y0(i, J- 3 , co) as fundamental field. This is permissible, since we can construct
the point J- 3 . Then the polynomial X3 - W
(179.7)
over F is either irreducible or splits into linear factors. Correspondingly, the degree of the Galois field of F(a) I F is divisible by 3 or equals 1. Thus the trisection for a
GEOMETRICAL CONSRUCTIBILITY
713
given co is possible if, and only if, (179.7) is reducible over F. Since this is certainly not the case for a transcendental co, it follows that trisection is, in general, insolvable, but, at the same time, infinitely many particular cases are solvable, e.g., for all the co
a
_ I
i
s
a-1
(a E .7o).
EXAMPLE 3. The duplication of the cube or the Delian problem requires the construction of a cube with double the volume of a given cube, in other words, the construction of a segment of a straight line of length .J2 of a given unit segment is desired. This is like the preceding example, where now the polynomial
x3 - 2
(179.8)
occurs instead of (179.7). Since (179.8) is irreducible over .F-0 (1), a cube cannot be duplicated by a geometrical construction. EXAMPLE 4. The quadrature of a circle (squaring the circle) requires the construction of a square which has the same area as a given circle. As in the preceding example
this involves the construction of a straight line of length .n of a given unit straight line, where n (= 3.14. . .) i; the Ludolph number. Since ,z, according to the wellknown theorem of LINDEMANN, is transcendental, the quadrature of a circle, because of Theorem 440', is insolvable.
§ 180*. Remarkable Points of the Triangle We take a Euclidean plane without infinite points, containing all the geometric figures with which we deal. The elements constructible from the set of vertices of a triangle (points, straight lines and circles) are called constructible elements of this triangle. As triangles we only admit real triangles, but where necessary, we consider imaginary points, too. Certain constructible points of the triangle, in the construction of which no vertex is given preference, one usually calls "remarkable points of the triangle". Early examples are the centroid, orthocentre and circumcentre of the triangle; indeed we see that, e.g., the three medians, alti-
tudes and perpendicular bisectors of the sides of the triangle cut each other in these points. As a further, instructive example we consider the centres of the four tangent circles of the triangle, which we may obtain as points of intersection of the (internal and external) bisectors of the angles yet we now observe that of these four points, according to our subsequent general definition, one, namely the centre bf the inner tangent circle is called a remarkable point, and all four are called its "conjugates". From the beginning of the last century about twenty other remarkable points were discovered and their properties thoroughly investigated, but no doubt on account of some surprising difficulties, a systematic theory of remarkable points of the triangle has not been established, until now, nor has a general definition been given for them. This gap will now be filled up as briefly as possible. We shall make use of some well-known geometrical concepts and the theory of point sets without defining them.
OALOIS THEORY
714
Some preliminary considerations of geometrical constructions are also necessary.
We shall introduce for the natural numbers a, b, c, d, the following five symbols : 1 abed
(a, b, c, d different),
1(ab)(cd)
(aob; cod; aoc),
lab(cd)
(a96 b; c96 d),
2(ab)(cd)
(aob,c; d0b,c),
2ab(cd)
(a, b, c, d different),
(180.1)
which we call construction steps. The first three and the last two are called, more precisely, single-valued and two-valued construction steps, respectively. Thus an n-valued construction step (n = 1, 2) always occurs under the symbols nabcd , n(ab) (cd) , nab(cd) ,
(180.2)
of which the first is to be considered only for n = 1. In any case we call max (a, b, c, d) the order number of the construction step. We shall also consider an ordered set S, of v points (v = 2, 3, ...) in our plane which may include also imaginary finite points. The kte element of S,, is denoted by Ak. For two different points P, Q we denote by PQ the straight line through P and Q, and by (PQ) the circle through Q with centre P. Let q(n) be an n-valued construction step with an order number < v. According as q(n) is of the form (180.21), (180.22) or (180.23), we define the set S, ' ) as the difference set (Aa Ab fl A,. Ad) - S,,, ((Aa Ab) fl (Ae Ad)) - S,., (Aa Ab) n (Ae Ad)) - S
respectively. If this set S, (") is neither empty nor infinite, it consists of at most two points, which lie outside S, and are constructible from S,. If SF") consists of exactly n points, then we say that the ordered set S, is admissible for the construction step (E(n). Further on we shall conconsisting of n sider an S, of such a kind; then we consider the set points as a (somehow) ordered set, unless otherwise indicated. The latter condition for n = I is superfluous, since this set then consists of one point, which may be regarded (uniquely) as an ordered set. However, if n = 2 the definition of our ordered set is two-valued. Thus, in both cases: if S, is admissible for (E(n), then the (ordered) set S?(") has exactly n different meanings.
Let A denote a triangle which is non-degenerate, i.e., its vertices do
not lie on one straight line. By an ordered triangle we understand
REMARKABLE POINTS OF THE TRIANGLE
715
an ordered set of three points not lying in one straight line. A triangle thus leads to six ordered triangles. We often assume triangles as ordered without always expressing this in the notation (thus we also apply A as the notation of ordered triangles) and we even omit the adjective "ordered" where there can be no ambiguity. For any elementary ordered sets S', S" we denote the ordered set obtained
from them by juxtaposition by (S', S"). By a construction process $ for an ordered triangle A we understand a finite sequence C, ..., Sir (r >-- 1) of construction steps such that the ordered sets of points A1, ..., Ar, defined recursively by
Al = A, Ak+1 = (Ak, Akk)
(k = 1, ..., r - 1),
(180.3)
exist, i.e., Ak is admissible for +k (k = 1, . . ., r - 1), and also Ar is admissible for 19r, taking the many-valuedness of the A2, ..., Ar into consideration in all cases. It should be noted that if the construction step lZk is nk-valued, then Ak has exactly nI ... nk_1
(180.4)
distinct meanings and consists of 3 + n1 + ... + n1,_1 points. Therefore this
number must necessarily be an upper bound for the order number of
%k (k= 1,...,r- 1).
In (180.4) we particularly emphasize that the number of all the Ar is equal to
n1 ... nr_I .
(180.5)
In any case the set Air, now considered as unordered, consists of nr points. The whole set of
nl ... nr
(180.6)
points (which are not necessarily different) are constructible points of A.
We say that they are obtained from A by the construction process Their set is denoted by A% and called the set of conjugate points arising from A by $. We also say that these points are the conjugates of any one of them. It should be emphasized that, because of the (necessary) generality
of the concept of construction steps, every constructible point of A is contained in a suitable A$. To illustrate this, let us consider the construction process $ (for p) consisting, in order, of the construction steps 212(13), 212(23), 2(34)(43), 2(36)(63), 1881011
(so that now r = 5, n1 = n2 = ns = n4 = 2, n., = 1). The reader must realize that A consists of the centres of the four tangent circles of L and from p each of these four points is obtained exactly four times by $, but that all this holds only for those p which do not have two equal sides meeting at either the first or second vertex.
716
GALOIS THEORY
C5
will always denote an everywhere dense and similarity-invariant
set of ordered triangles; by the first of these two properties we understand that for three arbitrary circles 01, 02, 03 there exists at least one triangle A (E Cam) whose kte vertex lies in Ok (k = 1, 2, 3). If $ = 3c is simul-
taneously a construction process for all the Q (E G), then we call it a general construction process for triangles. This is said to be regular, if the number of elements of A (i.e., that of the conjugates) is the same for
all the A (E t) and if these elements occur each time in equally many Q(gr A '. We call them totally real, if for the A (E C5) all the
consist of only real points [the 91, ..., (9r and A 1, ..., A. mean the same as in (180.3)]. Further we call our general construction process $ continuous, if the correspondence A --> A (A E Cam) is continuous and we then extend the notion of the set A of the conjugates to all the ordered triangles A with the definition
Q = lim A) k- oo
(180.7)
... is a sequence from C5 converging to A. If the sets A consist of an equal number of points for all the ordered triangles A
where A(l), A(2),
then we call 3 non-degenerate. Lastly we call 3 irreducible, if no further general construction process 3' exists such that A W c A$ for all the ordered triangles A. We now make the following definition: for a correspondence A --->. PA of points P. to the real triangles A of a Euclidean plane without infinite points, Pa is said to be a remarkable point of the triangle A, if the following axioms are fulfilled :
1. Exactly one PA belongs to every Q.
II. P. is real. III. The correspondence A - . P. is continuous. IV. The correspondence A -p. P. is similarity-invariant. V. P. is a constructible point of A. VI. For the elements A of an everywhere dense and similarity-invariant
set S of ordered triangles, PA is obtained from A by a general construction process is regular. is totally real. is continuous. is non-degenerate. XI. is irreducible.
VII. VIII. IX. X.
XII. The correspondence Q - A is similarity-invariant. Of these axioms I-V concern only the remarkable point Pp itself, VI-XII refer also to the conjugates of Pp. Our axioms are not independent, for II is contained
REMARKABLE POINTS OF THE TRIANGLE
717
in VIII, and XII follows from the other axioms. But III is not contained in IX nor is IV in XII. As we shall see, III is designed to separate Pp from its conjugates. For a generalization of the remarkable points see Examples 3, 4.
Our further aim is to determine the remarkable points on the basis of their above definition. By this we mean an analytical determination, where, at first, we shall make use of rectangular coordinates, but later for the sake
of elegant results we shall pass over to invariant, barycentric triangular coordinates.
We take a rectangular coordinate system in our plane. Let (x, Y E .o); Y # 0)
A (x, Y)
(180.8)
denote the ordered triangle with vertices at (0, 0), (1, 0), (x, y). We denote the 2-closure of an arbitrary field F by F and consider the fields, rings, field or ring elements occurring as substructures or elements, respectively, of (oi (I, X, tj, Zit ..., Z Z, t, ql, q2, q3)
(S
0),
where g, ..., q3 denote indeterminates. (We shall decide on the number s of the zI, . . ., z, later.) We put F=.o
(180.9)
We make use of the 2s sequences
(e) = (el, . . ., e,)
(el, . . ., e, _ ± 1),
(180.10)
from a sequence [dependent on (e)]
el o-cl, ..., e,
(180.11)
of square roots (supplied with the factors el, ..., e,
MI = a, a, = li + cel aI, a3 = b + eel
+ (f + gel 1 J e2
1) defined by:
al +
(a, li, ... E F)
with (independent of el, ..., e,) fixed a,.... b, (The reader will understand
the general law of formation of the al, ..., a, even without further explanations.) It should be noted that depends only on el, . . ., ek-1. We then take two elements, also dependent on (e):
_(e) _
.f ((, t7, el
(e> _ 9 (X, t), eI
al, ..., e,
/al, ..., e. a,),
718
GALOIS THEORY
where f (g, t), z1, ..., z,), g (g, tj, zr, ..., z5) are fixed polynomials, linear with respect to each Zk, from F[z1, . . ., z,], thus their total degree with respect to all the z1, . . ., z, is equal to s. Call the pair of elements
(3e, D) = (3ecer J(e))
(180.12)
a (2s-valued) general (constructible) point. Of the two values of /u- for a complex number u (96 0) we call that with
. The main value of /0 must of course be 0. Thus, in particular, the main value of the square root of a positive number is positive. If we carry out, in (180.12), the substitution the argument x (0:5 rc < n) the main value of
g -> x, t) -> y
(x, Y E
o); Y 96 0),
(180.13)
while giving their main value to all the square roots which occur, we denote the 2s, not necessarily different, points so obtained by
((x, Y), ¶(x, Y)) = ('(e) (x, Y), `ce) (X, M.
(180.14)
According to the previous paragraph these are constructible points of A (x, y), which we consider as assigned to this triangle. [Of course, (180.14)
may become meaningless owing to the vanishing of denominators, but we shall deal only with general points (180.12), where such exceptional cases do not occur.] After these preliminaries we consider a remarkable point P. We take A itself as in (180.8) in the form A = A (x, y), so that for P,, we may also write PD(x,Y). Further we consider a general construction process $ = $.y determined according to VI. This may consist, with the same notation as above, of the construction steps (Y1, ...,r so that (Yk is nk-valued
(k = 1, .., r). From our triangles we temporarily consider only the A = A(x, Y) E 3.
(180.15)
For Qk [cf. (180.3)] we write, correspondingly, Ak(x, y). In order to determine the set A(x, y)$ of the conjugates of P,(.,, Y) we have to take care how the coordinates of each of the nk points of the nl ... nk_1 sets [cf. (180.4)] Ak(x, Y)°k
(k= I,.. ., r)
are to be computed one after another. For this we suppose that
nk,=... =nk,=2
(1
(180.16)
REMARKABLE POINTS OF THE TRIANGLE
719
and all the remaining nk are equal to 1, whence
nl...n,=2S follows. Evidently (180.16) then consists of one point with coordinates of the field 7o (x, y) for each k = 1, . . ., k1 - 1. (Here and in the following
we shall take (180.3) into consideration.) First of all in k = k1 a twovaluedness occurs in the calculation by the occurrence of a square root from an element of the former field, which must be adjoined. It continues
in this way, where for (and only for) k = k2, ..., k, a new square root is adjoined. Take these square roots, taking into consideration their twovaluedness, in the form
el / , ..., e, J
(el,
..., e, = ± 1),
where every term depends upon the preceding one. The , / always indicates
the main value. This short discussion already shows that we can give a general point (Y., J) = (X(e) SD(,,)) defined in (180.12) such that if (180.15)
holds then the 2S points (180.14) are exactly the (n1 ... n, _) 2S points [calculated in (180.6)] of all the n1 ... n,_1 unordered sets A,(x, y)ae. Hence, A(x, y)% is defined as the union of these sets. From VII it also follows that, as in case (180.15), the number of conjugates of PA(X, y), i.e., that of the different points occurring in (180.14) is a fixed divisor 21
(0 < l < s)
(180.17)
of 2S and these points, i.e., the elements of the set A (x, y)'P, all have the same multiplicity 2S-'
(180.18)
among the points (180.14).
We want to deduce from this result a formula which yields all the points of A(x, y)*, and only these, and each of them once only. For this purpose, using the indeterminates z, t we form, first, the 2S linear expressions (180.19)
(e) + J(e) s,
then the principal polynomial, which has these as all its zeros,
F(t) _ fl (t - Y.(e) - lD(e) z), (e)
(180.20)
where we have to multiply over all the 2' sequences (e). For fl we may (e)
write fl ... fl, where every e,, assumes both values ± 1. If we then carry e,
24 R.-A.
e,,
GALO[S THEORY
720
out successively the multiplication for e ..., e1, we see at once that this product lies in 70 (K, l) [z, t], so that [cf. (180.9)] we can write F(t) = FQ, l , z, t) E F [z, t].
(180.21)
According to our results, in the substitutions (180.13) [cf. (180.9)] satisfying (180.15), F(t) turns into the 2s-t-th power of an element from .52(O)(i)[z, t]. (Because of VIII, i might be omitted here, although this is unimportant at this stage.) Since the points satisfying condition (180.15) are everywhere dense on the plane, this implies that F(t) = G(t)21-'
(180.22)
G(t) = G(X, t , z, t) E F [z, t]
(180.23)
where
is a principal polynomial of degree 21 with respect to t. We shall show that G(t) is irreducible over F(z). Otherwise the elements (180.19) would have a degree < 2' over F (z). From this, it would follow that a general construction process V = $,, exists such that A(x, y)$' c c A(x, y) for all the A(x, y) satisfying (180.15). Then, because of IX and XII, AV e A$ would hold for all triangles A. Since this contradicts XI, the irreducibility of G(t) is proved. It now follows from (180.20), (180.22), (180.23) and the irreducibility of G(t) that there are exactly 2' distinct elements (180.19), and these are all the zeros of G(t) and conjugate over F(z). Therefore our general point (3c, P) = (ke), Dte) (especially, e.g., for el = ... =e,=I) may be expressed by a sequence such as
Jw1,
w, (E F)
a, . .
of square roots (instead of the previous
Cok E F (Jwl, ... 11IN-1)
(180.24) .,
(k = 1,
a,.), for which ., 1)
(180.25)
..., G).
(180.26)
. .
and
FcF
cF
/w2) c
... c
F (Jw1,
Because of (180.25) and (180.26), we say that (180.24) is an irreducible square root sequence over F. After this transformation we retain for our general point the previous notation (3r, J), where now 3r, SJ are certain elements of F. [In the former meaning we no longer use (., J). ] The 2' points arising from (X, J) by the substitution (180.13), taking into considera-
REMARKABLE POINTS OF THE TRIANGLE
721
tion all the possible many-valuednesses of the substitution values of (180.24),
are denoted (without expressing the many-valuedness) by (3(x, Y), ¶'1(x, A.
(180.27)
We shall show, for these 2' points (180.27), that they are real and distinct for every A (x, y) and just constitute the set A (x, y)V of conjugates of Po(x. vY
This is clear for the case (180.15) according to the above result, because
of VIII, (180.22) and the definition (180.20) of F(t). For the remaining A (x, y), the assertion follows at once from IX, X and definition (180.7). [It should be noted that in the second case even the existence of the points (180.27) has only now been proved.] In order to prove a further property of our (X, SD) let us now consider from XII that the correspondence A -> A is similarity-invariant. Since A(x, y) becomes A(x, - y) by reflection in the x-axis, it follows that
A(x, y)`t' and Q(x, -y)$ are obtained from each other by the same reflection. This means the equality of the two sets consisting of the points (180.27) or of the points
(T(x, -Y), - J(x, -Y)), respectively. We now assign to the points (180.27) the 2' different linear expressions formed by the indeterminate z: (x, Y) +
)(x, Y)
Z.
Y
The set of these expressions is, according to the above, invariant with respect to the substitution y -y. Therefore, if using the further indeterminate 1, we form the relative norm
NO = H(X, t), z, t) = NcIF(z)T t
-
-
z
(180.28)
which with respect to i is a principal polynomial of degree 2', then the validity of H(x, y, zz, t) = H(x, - y, z, t)
follows for all the real x, y (y 36 0). Consequently
H(X, , z, t) = HQ, -t), z, t), which we express briefly, but clearly, by saying that H(XC, t), z, t) involves (besides 2), z, t) only p`
GALOIS THEORY
722
On the other hand, because of (180.28), H(t) has the zero
whence its irreducibility over F(z) also follows. [We understand the latter property most easily by the fact that it holds for the polynomial (180.23) and we have H(g, t, t)z, t) = G(g, t, z, t).] From both these results follows the possibility of such a choice of the square root sequence (180.24) that [(180.25), (180.26) remain valid and] in
wl, ..., Jwr, E,
(180.29)
only 112 is involved (besides ). The importance of this apparently simple result will become evident in connection with barycentric coordinates
to which we now turn. The vertices and sides of the ordered triangle . are denoted in order by A1, A2, A3 and 91, 92i 93i so that Ak, gk are opposite to each other. By the barycentric coordinates x1, x2, x3 with respect to , of a point P we mean the quotients
xk =
P9k
(k=1,2,3),
Ak9k
where numerator and denominator denote the distance, together with sign, of the point P or Ak from the side gk, such that xk is positive if, and only if, these points he on the same side of gk. We Put P = (x1, x2, x3). As is well known,
x1 + x2 + x3 = 1 .
(180.30)
We denote the squares of the lengths of the sides opposite to the angles AI, A2, A3 of Q in order by q1, q2, q3 and for , write more precisely (180.31)
A = x(q1, q2, q3) .
Accordingly, q1, q2, q3 always denote positive numbers subject to the triangle-inequalities q1 I +
q2 I + I
q3 I > 2 I ,I qk I
(k = 1, 2, 3) .
(180.32)
Such numbers are, briefly, called admissible. Our aim is to give PA and its conjugates for the triangle (180.31) with barycentric coordinates as functions of the side-length squares qI, q2, q3.
REMARKABLE POINTS OF THE TRIANGLE
723
It is a paradox of remarkable points that - as we shall see - in their analytic description the side-length squares q1, q2, q3 are suitable parameters but the side lengths themselves are unsuitable. We obtain the transformation formulae between the rectangular coordinates il and the barycentric coordinates x1, x2, x3 of the triangle A (x, Y)
x5, x3= Y
x2=
Y
Y
Hence, and from (180.27), it follows that the 21 conjugates of Pots. yt, expressed by such barycentric coordinates, are given by
(x, Y) + (x - 1) I(y, Y) (x, Y) - g
(y, Y) ,
+1
(y, Y) l
,
(180.33)
.
Since these points arise in the substitution (180.13) from the triple
+(N-
+
(180.34)
(180.34) may be regarded as another form of our general point (3r, J), namely that adapted to the barycentric coordinates related to Q(x, y). Since every triangle is similar to a Q(x, y), because of XII, we can easily generalize the result (180.33) to all triangles instead of Q(x, y). For this purpose we take the arbitrarily ordered triangle Q as in (180.31) as Q(q1, q2, Q. The A(x, y), similar to this, are (taking into consideration the arrangement of the vertices) defined by the conditions
(x - 1) 2+ Y2
= qI
q3 '
ti+
x2
q2
2
Y-
=
q3
'
which we can also write as X
q2 + q3 - qI 2 q3
,
y2 =
2(g1g2 + g1g3 + g2g3) - g1 - qz - qs 4qi (180.35)
The numerator in (180.352) is sixteen times the square of the area of A (q1, q2, q3), and so is positive. Therefore we always obtain exactly two solu-
tions x, y as was obvious from the outset. The solutions differ only in the sign of y, but this difference does not affect the following arguments.
GALOIS THEORY
724
Keeping the relation (180.35) in view, in the general point (180.34) we make the substitution q2 + q3 - q1
2q3 2
2(g1g2 + g1g3 + g2g3) - ql - q2 -
q32
(180.36)
4q3
%%
where q1, q2, q3 are three new indeterminates; for the "homogeneous" rational field over 70, generated by them, we introduce the notation
Q0 = -ro
ql
q2
gl
qg3 _
g3 17-0
q3
q3
1 q2
q2
,
q1
ql
'YO
01 : q2 (180.37)
Since because of (180.29) only l 2 is involved (besides Xr) in (180.34), no "new" square roots occur after the substitution (180.36) in (180.34); the only change is that new indeterminates q1, q2, q3 will appear in (180.34) after
this substitution. More precisely we have the following: if, after the substitution (180.36), we introduce for our general point (180.34) the notation (21, £2,
2(q1, q2, q3), X3(gl, q2, q3)) (180.38)
3) = (31(g1, q2, q3),
and retain the notation,Jcol, ...,
Y coo
introduced in (180.24), for the
square roots occurring in (180.38) we now have X1, X2, X3 E Qo (/w1,
., Jam,) ,
(180.39)
then [instead of (180.24), (180.25), (180.26)]
Jwl, .. , Jw! E Qo .Jwk E Qo (Jw1,
..., ,Jcok-
(180.40)
(k = 1, ...,1),
(180.41)
Qo C Qo (Jw1) C Qo (Jwl, Jw2) c ... c Qo (Jwi, ..., JWt) (180.42) now hold. From the above it is now clear that by the substitution 3
q1 -- q1, q2 --,, q2, q3 -->- q3 (qk > 0; m'1
4m I > 2 1 Jgk I ; k = 1, 2, 3) (180.43)
we obtain from (180.38) exactly 2' different real points, which we denote by
REMARKABLE POINTS OF THE TRIANGLE
725
barycentric coordinates related to A(g1,g2,q3) (without expressing the many-valuedness) such as (180.44)
(E1(g1, q2, q3), X2(g1, q2, q3), .3(q1, q2, q3)),
and that these points constitute exactly the set A(ql, q2, q3)` of the conjugates of PD(q, q, q,J. [It should be noted that this result refers to all
triangles, for all may be taken in the form A(q1, q2, q3); the fact that A(q1, q2, q3) is only defined up to position by q1, q2, q3, has, because of XII,
no disturbing effect, since the barycentric coordinates are invariant with respect to similar mappings. I To distinguish it from the previous (X, J) we call (180.38) a barycentric general (constructible) point. (Of course this concept is capable of wider generalization but this is unnecessary for our purposes.) It is now necessary to examine how the Po(q q,, q,) itself can be separated from the currently existing 2' conjugates (180.44) of PA(q,, q,, Q. The answer to this important question is unexpectedly simple. According to what has been proved concerning (180.27), the square roots (180.40) occurring in
(180.38) assume only real values other than 0 in all the substitutions (180.43) (considering all the possible many-valuednesses). The substitution values, to which the remarkable point Po(q , q,, Q in (180.44) always belongs,
must then keep their sign because of III. Then we find that these substitution values, belonging to Po(q q,, Q, are always positive, for this only requires that the square roots (180.40) occurring in (180.38) have suitable signs, which is permissible because of the irreducibility of the square root sequence (180.40). From now on this will be assumed. Then, according to (180.44), Po(gl, q,, q.) = (X I (q1, qs, q3)+,
Y-2(gl, q2, q3) +,
X3(g1, q2, q3)+)
,
(180.45)
where "+" indicates that only positive substitution values are to be taken for the square roots which occur. (This notation will be applied later in similar cases.) Finally, we consider that, according to I, exactly one Po belongs to every A, but, by (180.45), a Po(q,, q, q,) was ascribed to every ordered triangle A(q1, q2, Q. Therefore the Po(q,', q.', q,) belonging to the six permutations
2 3l 1'2'3' 1
(180.46)
must be equal to one another. This means the validity of 3Ek'(gl, q2, q3)+ _ 3k(gl, q2, q3)r
(k = 1, 2, 3)
GALOIS THEORY
726
for all permutations (180.46) and for all admissible q1, q2, q3. Hence, it follows that X2(gl, q2, q3)+ _ Y-1(g2, q3, q1)+, 2e3(g1, q2, q3)+ = 3e1(g3, q1, q2)+,
X1(g1, q2, q3)+ = 31(gl, q3, q2)+
Obviously even the rules 2E2(g1, q2, Q3) = 3E1(g2, q3, q1),
(180.47)
-T3(g1, q2, Q3) = 'f1(g3, q1, Q2) ,
Y1(Q1, q2, Q3) = X1ig1, q3, Q2)
are then valid.
In order to formulate this result conveniently we shall use homogeneous barycentric coordinates Yl, Y2, Ys (Y1 + Y2 + y3 # 0), and stipulate that the point determined by them shall be denoted by (yl, y2, y3) and defined
as usual as 1
e=
(Y1, Y2, Y3) _ W11 QY2, US)
l
,
Y1 + Y2 + Y31
where "ordinary" barycentric coordinates are intended on the right-hand side.
As further preparation we introduce the field ( D QO)
Q = -70'(gl, q2, Q3)
and consider an arbitrary element 97 = 9'(q1, g2,g3) of Q together with its minimal polynomial 0(t) = 0(q1, q2, q3, t )E Q[t]
over Q. If 0(t) lies in [q1, q2, Q3,t], then we call the element integral. If, in addition, there are two integers j (> 0), k (>_ 0) such that the polynomial O(ql, q2, g8, tk) is homogeneous, then we call the element 99 homogeneous. If further P1, 9'2, P3 are elements of the field Q with non-vanishing sum Cr,
then we call the relative degree [Y0(g1, q2, q3, a-1 991, a-I 922,
a-1
q'3)
:
01, q2, q3)]
the degree of the triple ON, 9221 993). This is a power of 2 and evidently has the property that in the substitution (180.43) at most as many different points are obtained from (991, 9'2, 993) as the degree of this triple.
REMARKABLE POINTS OF THE TRIANGLE
727
THEOREM 441. In order to give the remarkable points of a triangle, take a homogeneous integer element q'(Q1, q2, q3) of the 2-closure of the rational function field Q = Y0(Q1, q2, Q3) such that (180.48)
9901, q2, Q3) = 901, Q3, q2) and
9'(q1, q2, q3) + 9'(Q2, q3, q1) + 9'(Q3, qI, q2) , 0;
(180.49)
further, suppose that for (9,(ql, q2, q3) , 9;02, q3, q1)
9'(Q3, ql, q2))
(180.50)
9'(q3, q1, q2)) ,
(180.51)
,
the number of different real points (9' (ql, q2, q3),
9, (q2, q3, q) ,
(where homogeneous barycentric coordinates are being used) arising by any substitution (180.43), is equal to the degree of (180.50). Then the point PA = (9%(ql, q2, q3)+, 9'(q2, q3, q1)+ ,
9p(g3, q1, q2)+)
(180.52)
assigned to the triangle A = A(q1, q2, q3) with the side-length squares q1, q2, q3
is a remarkable point of A, and all the remarkable points of the triangle arise
in this way. The points (180.51) are there exactly the conjugates of PA. [For the signs "+" in (180.52) see the remark after (180.45).] The assertion of the theorem that every remarkable point PA and its conjugates may be given by (180.52) or (180.51), respectively, is deduced from the above results. With this end in view we have to consider that, according to (180.39), 2E1 = 3E101, q2, q3) is an element of the 2-closure Q0 of the field Qo defined in (180.37). Therefore we put X1(81, q2, q3) _
9'(q1, q2, Q3)
(Q1, , qsQ
,
(180.53)
where p and r are homogeneous integer elements of Q. It may be assumed that Z lies in the fundamental field Q, and therefore in the polynomial ring o[ql, q2, q3], where we can restrict ourselves to a symmetric homogeneous polynomial T. [Since in the substitutions (180.43) no denominator can vanish,
in 31 we may suppose moreover that neither do the substitution values r(q1i q2, q3).] Afterwards the denominator r may be cancelled, in so far as we now employ homogeneous (barycentric) coordinates. This proves, according to (180.38), (180.44), (180.45), (180.47) that Po and its conjugates
arise as described in the theorem. The easy proof of the other assertion of the theorem that a remarkable point and its conjugates are always furnished by (180.52), (180.51) is left for the reader. 24/a R.-A.
GALOIS THEORY
728
If the degree of (180.50) is equal to 1, then we call the remarkable point Pp rational.
These remarkable points are thus characterized by the circumstance that they have no conjugates other than themselves. [Consequently, we may assume (180.51) for them instead of (180.52).] They are easily given, and are furnished by those homogeneous polynomials 9901, q2, q3) over .F70, which are subject only to (180.48) and the con-
dition that the sum of the coordinates in (180.51) vanishes for no A. In the general case it would be desirable to complete Theorem 441 by examining the condition with regard to the diversity of the conjugates (180.51), but this appears to be no easy task. No matter which remarkable point PA is concerned, it is clear that, for the particular case of an isosceles or equilateral triangle A, this lies on the axis of symmetry or at the centroid of A, respectively. In the following few examples we restrict ourselves almost exclusively to cases already known (cf. ALTSHILLER- COURT (1952).)
EXAMPLE 1. For 92 = 1, qi - (q2 - q3)2, ql (q1 - q2 - Q9 q, the centroid, vertex, circumcentre and Lemoine point of the triangle are obtained from (180.52). EXAMPLE 2. For T = ti/ 4i, y4i - \/q2 - vg3, q1 - (v/q2 - N /Q2 the cen-
tres of the four tangent circles, the four Nagel points and the four Gergonne points, are obtained from (180.51) in order. From among each of these, sets of four conjugate points only one is a remarkable point, namely that given by (180.52). EXAMPLE 3. An easy generalization of the remarkable points arises if we admit exceptions in (180.51). To these belong 4' _ (q1 - q2) (q1- q3), (q1 + qz - q3 (q1 + q3)) (q2, + q3 - q2 (q1 + q3)), VIQ
01 - q2 - q3) v
where El indicates the numerator of (180.362). Here the Steiner point, Tarry point and the two isodynamic points of the triangle are the points under consideration. In each case only equilateral triangles are exceptions. EXAMPLE 4. A further generalization will arise if we admit reducible radical expres-
sions for T in Theorem 441. This includes the case 99 = q1(q1 - q2) 01 - q3) (q2 - q3)2 ((q2 + q3) (q1 - q2) (q1- q3) +
+ N/01 - q2)2 (q1 - q3)2 (q2 - Q2), which furnishes the two Brocard points. (Equilateral triangles again consitute an exception, since then the two Brocard points coincide. On the other hand isosceles triangles obviously do not form any "essential" exceptional cases.) EXAMPLE 5. If 99, y, are two elements of Q, satisfying the conditions of Theorem 441,
to which rational remarkable points belong, then the same applies to
wtV*p+(1-w)p*tp
(WE,9-p)
where tV* _ Y_
q2., q3.),
V* = E
q0,
where we have to sum over the three even permutations of the form (180.46). Hence it follows that on any straight line through two rational remarkable points of a triangle there exist infinitely many rational remarkable points of the same triangle and these are equally dense everywhere on the straight line. If we proceed to three remarkable points not lying on one straight line, then it follows by repeated application that all the remarkable points of a triangle without equal sides are everywhere dense on the plane - a surprising and disillusioning fact.
REMARKABLE POINTS OF THE TRIANGLE
729
ExERCISE. Define by suitable axioms the remarkable straight lines and circles of a triangle and construct for them the analogue of Theorem 441. PROBLEM. How can we define analytically those remarkable points which are definable by a construction process consisting only of single-valued construction
steps? (Of the centroid, orthocentre and circumcentre of the triangle only the second is included here.)
§ 181. Determination of the Galois Group of an Equation Over a field F we consider a not necessarily irreducible equation such as
f(x)=0
(181.1)
without multiple roots and want to elaborate a process for the determination of its Galois group c4. Of course, this is also suitable for the determination of the Galois group of an arbitrarily given Galois field, since this may always be given as a Galois field of a suitable equation. For our purpose we put
f(x) = (x - a) ... (x -
(G = F(ai, ...,
(181.2)
where G I F is the Galois field of (181.1). Furthermore we have recourse to the polynomial ring G [u1, . . .,
(181.3)
consider in it the element
I
(181.4)
and form with the help of a new indeterminate z the polynomial
F(z) _ fi (z - s ,O) ,
(181.5)
su
where s has to run through all the permutations of the set
C11,.. ., «n. Consequently, it follows from the main theorem for symmetric polynomials (Theorem 268) that F(z) lies in the polynomial ring
F[z],,= F[z,u1,...,u,,].
(181.6)
In this, let the irreducible factor decomposition
F(z) = Fi(z) ... Fk(z)
(181.7)
GALOIS THEORY
730
hold, where we may assume that the factors are principal polynomials and
z - 0 FF(z).
(181.8)
For every s,, let s,, denote the corresponding permutation of the set
a.>. (More precisely sa and s,, applied to
We consider those permutations s with respect to which the F1(z) is invariant. The group consisting of them is denoted by co. and we prove that the permutations sx, corresponding to the elements of C4,,, constitute exactly the Galois group 44 of the equation (181.1), hence 4 c.Pt . With this end in view we first show that the product of all the (181.9)
is equal to F1(z). For we have z - su# I F1(z)
if, and only if,
z - 01su1F1(z).
(181.10)
The right-hand side is, because of (181.5) and (181.7), equal to an Fk(z) for every s,,. Hence, and from (181.8), it follows that (181.10) has the same meaning as the condition s,-,1F1(z) = F1(z), which can also be expressed
s with this property constitute exactly the
as
group c4,,, F1(z) is in fact the product of the polynomials (181.9).
Now we show that the product of the polynomials
z - so
(sE4)
(181.11)
is similarly F1(z). We denote this product by P(z). If we apply an arbitrary
element of 4, the polynomials (181.11) undergo a permutation, so that the coefficients of P(z) He in F. More precisely, P(z) E F[z],,.
Furthermore, according to (181.11), z - 0 is, in particular, a divisor ofP(z), whence, because of (181.8), and the irreducibility of F1(z), follows the divisibility Fi(z) I P(z) .
(181.12)
Again, because of (181.8), all the divisibilities z - sO I sF1(z)
(181.13)"
GALOIS GROUP OF AN EQUATION
731
also hold. But since the coefficients of F1(z) lie in the fundamental field F, s may be cancelled on the right-hand side of (181.13). Consequently, (181.13) means, on account of the definition of P(z), that P(z)!F1(z). This and (181.12) prove that F1(z) = P(z). It now follows immediately from (181.4) that, for an arbitrary s and the corresponding sa, sx(su$) = 0 always holds, i.e., s ,,O = s, 1t'. Hence it follows that the polynomials (181.9), apart from the order of succession, coincide with the polynomials z - sa 1'+9'
(s, E (4.) ,
where co. denotes the group of permutations corresponding to the s, (E This result may also be expressed so that the polynomials (181.9), apart from the order of succession, coincide with the polynomials
z - sat
(S« E Cjx)
According to the above, these coincide, apart from the order of succession, with the polynomials (181.11). This means that co = 4., whereby the above assertion is proved. The direct application of the process so obtained for the determination of the Galois group of an equation involves almost insurmountable difficulties of calculation, in return for which we easily prove by its help the following very useful THEOREM 442. Let R denote a ring with prime decomposition and n a prime element of R, for which the factor ring R = R/(n) is zero-divisor-free and the homomorphism
R -.R(a `a)
(181.14)
holds, where a denotes the residue class a (mod n). The quotient fields of R and R are denoted by F and F. Let f(x) (E F[x]) be a principal polynomial without multiple factors and with coefficients from R, furthermore let f(x) (E F [x]) be its image under the homomorphism (181.14). If this has no multiple factors, then the Galois group 4 of f(x) = 0 (regardless of the meaning of the permuted elements) is a subgroup of the Galois group d off(x) = 0. In the proof we use some of the above notations. According to GAUSS'S theorem (Theorem 207) the factors of the right-hand side of (181.7) lie in the
polynomial ring
R[z]n= R[z,u1,...,un] Furthermore, after the application of the homomorphism (181.14) a not necessarily irreducible factor decomposition F ( z ) = Ft (z)
... Fk (z)
(181.15)
GALOIS THEORY
732
is obtained from (181.7) in the polynomial ring
R[ZIU = R[z, u1,..., Since the permutations s (E dL) leave the polynomial Fj(z) fixed and the remaining permutations of the set
This theorem, for R = 7, n = p, turns out to be particularly simple, since then F = R = gyp, thus the Galois group c, is, according to Theorem 306, cyclic. As an application, we shall show that over Jr, there are affectless equations of degree n for every natural number n. (In this respect Theorem 433
only says in somewhat weaker form that over every rational function field of at least n variables there are affectless equations of degree n. Against which, over finite fields, for the same reason, there are no affectless equations
whatever of degree >_ 3.) To prove this assertion some preparation is needed.
A permutation group 9 of a set S is called transitive, if for every pair of elements a, b of S there is a permutation in 9 which carries a into b. From Theorem 329 it follows that the Galois group of an irreducible (separable) equation is transitive. We show the following: if a transitive permutation group 99 of the set
<1, ..., n> contains a transposition and a cycle of order n - 1, then .53 is the full permutation group of degree n.
We may assume that (1 2
... n - 1) E -0.
(181.16)
Since 93 contains a transposition and is transitive, it follows from Theorem 82 that it also contains a transposition of the form (in). According to the same theorem, and on account of (181.16), 9s contains all the transpositions (1 n), .., (n - 1 n). Therefore, because
(r s) = (r n) (s n) (r n)
(r, s = 1, . . ., n - 1),
it contains all the transpositions. Hence the theorem is proved.
We can now construct for n > 2 an affectless equation f(x) = 0 of with coefficients in 7 as follows : let us take three distinct degree n over prime numbers P1, P21 P3 (say Pl = 2, P2 = 3, p3 = 5) and three polynomials f1(x), f2(x), f3(x), of degree n over,7 such that the first is irreducible mod pl,
the second has an irreducible factor mod P2 of degree (n - 1), and lastly among the mod p3 irreducible factors of the third, one factor is of degree 2 and the remaining factors are of odd degree. All this is possible according
GALOIS GROUP OF AN EQUATION
733
to Theorem 306 (cf. also § 133, Example 1). Then take a principal polynomial
f(x) of degree n over Y with
f(x) - f (x) (mod pi)
(i = 1, 2, 3) ,
which is likewise possible according to the Chinese remainder theorem (corollary of Theorem 198). Let &, denote the Galois group of the equation
f(x) = 0 over . and cdc the Galois group of the equation f(x) = 0 over Jr, (i = 2, 3). Since fi(x) is irreducible mod pl, so f(x) is all the more irreducible over 5, thus 4 is transitive. Since, furthermore, 42i 4t3 are cyclic, it follows by hypothesis that they contain a cycle of order (n - 1) and a transposition, therefore, according to Theorem 442, the same holds for c.P. Consequently, co is, according to the above, the full permutation
group of degree n which is the same as saying that the equation f(x) = 0 is affectless. VAN DER WAERDEN (1933) has shown that among the equations with integers those without affect prevail (in a certain sense). It is a so far unsolved problem, whether equations with integer coefficients exist whose Galois group is prescribed. With regard to this cf. NOETHER (1918). As regards equations without affect see also BAUER (1932).
§ 182. Normal Bases In § 147 we have already defined the normal bases of separable norma I fields of finite degree, i.e., of the Galois fields and, in Theorem 365, proved their existence for the case of finite fields. THEOREM 443. Every Galois field N I F has a normal basis.
According to what has been stated above, the case where N is infinite still remains to be considered. In order to prove this case we need the following PaoposITTON. If N I F is an infinite Galois field of degree n and Al, ...,
A are all the elements of its Galois group, then for a polynomial f(xl, ..., xn) over N f (Ala, . . .,
0
(x E N)
(182.1)
always holds if only f(xt, ..., xn) = 0 . In order to prove the proposition, we take a basis coi, ..., w,, of N I The elements of N may then be taken in the form
a = aiool + ... + anwn
(a,, ..., an E
It follows that A,a = > ajA,wi /=I
(i = 1, ..., n).
F) .
F.
734
GALOIS THEORY
We introduce the polynomial
g(Yi, ...,
(E N [Yi, ...,
=.f(E [y1A¢Ai , ..., E yiAnwi) i i
(182.2)
where we have to sum over the j = 1, ..., n. (The coefficients A,ai are deliberately placed on the right side.) Because of (182.1),
(a,,...,a,, E F).
g(al,...,a.) = 0 From this, because of Theorem 206,
0
g(Y1, ...,
arises from f(xl, . . ., follows. But g(yl, . . ., the linear transformation
xi -
(182.3)
because of (182.2), by
(i= 1, ..., n)
yyA,pii
(182.4)
whose matrix is (A,i). On account of (145.10) the square of the determinant I A,wi I is equal to the discriminant of the basis elements w1, ..., w,,. This discriminant is, according to Theorem 362, different from 0, therefore the same also holds for A,wi I. This means that the matrix (Ap) is invertible. also arises by a linear transformation Since, according to this, f(xi, ..., from g(yl, . . ., it follows from (182.3) that the proposition is true. We can now prove the theorem. Keeping the former notations we write A;A; = A(i.i)
(i, j = 1, . . ., n) ,
(182.5)
where (1, j) denotes the function of i, j with the range <1, . . ., n> just defined by (182.5). Using this function we form the determinant
f(xl, ...,
I xy, i) I
(E F [x1, . . ., x )) .
(182.6)
For every a (E N), according to (182.5) and (182.6),
.f(Ala, ..., AA) = I A(i, )a I = I AiA,a 1.
(182.7)
But it follows from (182.5) that the condition (i, j) = 1 between i (= 1, . , n) and j (= 1, ..., n) produces a one-to-one relation. This means, because of (182.6), that f(l, 0, ..., 0) is a determinant which contains in each row and column exactly one element other than 0 and this element is equal to 1. Such a determinant is different from 0 (and equal to 1 or -1). 0 0, it follows from the Since accordingly we have, a fortiori, f(xl, . . ., proposition, and from (182.7), that an a (E N) exists such that I AiA,,x 1
0.
(182.8)
NORMAL BASES
735
The square of the left-hand side is, according to (145.10) the discriminant of the elements Ala, ..., Ana. (182.9) Since this discriminant, because of (182.8), is different from 0, the elements (182.9) constitute, because of Theorem 362, a basis, and this is evidently a normal basis of N I F. For the above proof cf. BOURBAICI (1939). For another proof, which also holds for finite fields, cf. PICKERT (1951).
EXAMPLE 1. Let N I F be a Galois field of degree 2 and of characteristic p (Z 0).
If p 96 2, then N = F (,) for a suitable element d from F. Now .1
+ ,Fd ,
1 - Jd
is a normal basis of N I F. If, on the other hand, we have p = 2, then, according to Theorem 429, N = F(8), 02 + 0 + a = 0 for a suitable element a of F. Now
0,1+0 are conjugate; consequently they constitute a normal basis of N I F. EXAMPLE 2. In the n`h cyclotomic field F (of characteristic 0) the primitive nth roots of unity 91, ..., e9,(")
(182.10)
constitute a basis, thus a normal basis if, and only if, n has no multiple prime factors.
We first prove the "if" part. This is true if n is either 1 or a prime number. If n = = pl . . . pk (k > 2) is the product of different prime numbers pl, . . ., pk, then it follows from the preceding statements and from Theorem 292 that the products
al...ak
(182.11)
constitute a basis of F, if a; runs through the primitive PC roots of unity (i = 1, . . ., k). But these products (182.11) now coincide with the elements (182.10), so that the assertion is also true in this case. In order to prove the "only if" part, we denote the n1h cyclotomic polynomial by F"(x). If d is the product of the different prime factors
of n, then F"(x) = Fd(x° '").
If n > d it follows that the sum of the elements (182.10) vanishes, therefore these elements now constitute a basis. This completes the proof. For an application cf. REDEI (1959, 1960).
EXAMPLE 3. The solution of the Exercise in § 172 is obtained as follows. Let us take an element 0 of G whose conjugates constitute a normal basis of G I F, together with a primitive automorphism s of G I F and a primitive n`h root of unity a (E F). The Lagrange resolvent [cf. (172.2)]
A = 0 + e(s6) + ... + e"-'(s"-'0) is then different from 0, moreover (as quoted above) sA = e''7. and A" E F. Accordingly
G=F(.jx)for rc=A" (E F).
CHAPTER XII
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES A non-commutative structure is called one-step non-commutative when its proper substructures (of the same kind) are all commutative. The determination of these structures is a difficult task, and has only been solved for groups, rings and semigroups; furthermore, we have to note WEDDER-
theorem (Theorem 318) which implies that no finite one-step non-commutative skew fields exist. Finite one-step non-commutative structures are of great importance because every finite non-commutative BURN'S
structure contains at least one one-step non-commutative substructure. § 183.* Finite One-step Non-commutative Groups In compliance with the above general definition a non-commutative group
with only commutative proper subgroups is called a one-step
non-
commutative group. All the finite groups of this kind are fully determined by the following two theorems. THEOREM 444. The finite one-step non-commutative p-groups are: the quaternion group (of order 8), every group (of order pn,+n+l) for given m, n (E .f') defined by the equations Apm = 1, BO = 1, C° = 1, AC = CA, BC = CB, BAB-' = AC (m>_n>_ 1)
(183.1)
and every group (of order p,n+n) defined by the equations
Ae = 1, B°" = 1, BAB-' =
Al+vm-'
(m >_ 2, n > 1; pm+n > 8). (183.2)
These groups are not isomorphic. THEOREM 445. The remaining finite one-step non-commutative groups are obtained as follows: Given two different prime numbers p, q and a natural number n, put (183.3) m = o(p (mod q)) , 736
GROUPS
737
take the finite field F such that
O(F) = p"
(183.4)
and from the group F* an arbitrary fixed element co such that
o(w) = q.
(183.5)
Then a group (of order pmq") exists such that by the help of certain special elements
Px(a E F), Q (Pi, the unity element; o(Pj = p for at
0; o(Q) = q") (183.6)
all its distinct elements may be written uniquely in the form P,.Qi
(aE F; i=0,...,q"- 1)
(183.7)
and the rule for multiplication of elements is
(a, P E F; i, k = 0, ..., q"- 1).
P«Qi PpQk = PQ+.r# Qi+k
(183.8)
Taking for each of the above triples p, q, n such a group, we just obtain the mutually non-isomorphic finite one-step non-commutative groups which are distinct from the p-groups.
It has to be proved that every finite one-step non-commutative group is isomorphic with one of the groups given in Theorems 444, 445 and that, conversely, these groups are one-step non-commutative and not isomorphic with each other. We begin the proof with the first of these assertions. We denote by 4 a finite one-step non-commutative group and show first that it is not simple. For the proof we assume c.P to be simple. We first remark that for two proper subgroups a°, . Y of c4, for which 2c° is maximal and .2' not contained in :72, "'
1 (15Y= 1. For, because of the supposition,
'Xj
C4
Since moreover
, W' are Abelian, ' (1 r is normal in JT and 5Y
and therefore also in 4. Because of the simplicity of 4 the truth of the remark follows. Since c.P is not commutative, 0(4) is a composite number. Let a maximal subgroup of 4 be denoted by A' and its index by h = O(C
:
Let Al, .., Ah
(183.9)
738
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
be a left representative system of 4 mod .Y. We show that all the conjugates
1i = A;,7,4; I of fT.
(i= 1, ..., h)
(183.10)
are pairwise distinct. We assume that Xi = X, for two different
i, j. Itsfollows that Aj 1A,YoAr 1 Aj
so that
=
°,
is normal in the subgroup
_{
,A.'A,}
of 4. But since X is maximal, we have 4.l = 4. Since, according to this, Y is normal in 4 and 4 is simple, it follows that ' = 1. This contradicts the maximal property of Y, so that we have proved that the conjugates (183.10) are pairwise distinct.
Hence, and from the above remark, it even follows that
tf210;= 1
(1 <-i<-j_
Thus 2C1i ...,h, because of (183.9) and (183.10), contain altogether exactly
h(O(Jf) - 1) = 0(4) - h
(183.11)
different elements (# 1) of 4. We take a further element (0 1) of 4. This is contained in a maximal
subgroup .7 of 4. We put
k = 0(4 :
(183.12)
Here 7 is, according to its definition, different from all the X1, . . ., Xh If what was said above is applied to 7 instead of AT, then it follows that the k conjugates 1, . . ., Ak of contain altogether exactly
0 () - k
(183.13)
different elements (0 1) of 4. Since furthermore, as has been said,'' is different from the conjugate subgroups °l, ..., °h, so the same holds for its conjugates X1, . . ., Xk. It follows from the above remark that
min a,,= 1
(i= 1,...,h; j= 1,...,k).
According to this and (183.11), (183.13) all the X1, Xj contain altogether
20(4) - h - k elements of 4 different from 1, so
20(4) - h - k < 0(4)
,
GROUPS
739
and so
h + k > 0(c4). But no natural number is smaller than the sum of two proper divisors, therefore we have proved by contradiction the non-simplicity of 4. We show that cP is even solvable. Let l° denote a maximal proper normal
subgroup of 4. The factor group c4/a° is then simple. Thus, according to what has been proved above, it is certainly not one-step non-commutative. On the other hand, since its proper subgroups are commutative, it is itself commutative. Since it is simple, it is, consequently, of prime order. Since
furthermore r is Abelian, 4 is in fact solvable. We show that in c0 there are two elements A, B with
AB # BA.
(183.14)
such that for the commutator
C = A'1BAB-1
(183.15)
AC= CA
(183.16)
the equation holds. We take a normal divisor cY,
of 4 of prime index and an element
B from the difference set 4 - 7°. Since :c° is Abelian and 4 =
B},
2C has an element A which does not commute with B. Since, together with A and BAB-1, C also lies in (183.16) follows. This proves the assertion. Because of Theorem 195 we may also suppose that o(A), o(B) are prime powers :
o(A) = p', o(B) = q' !
(183.17)
where the prime numbers p, q need not be different. Because of (183.14),
d4 = {A, B).
(183.18)
We denote by 4' and Z the commutator group and the centre of 4. Henceforth we distinguish two cases. First let us consider the case
C4, g Z.
(183.19)
Because of (183.15) we have C E c4', then C E Z. Accordingly, we now have
BC = CB.
(183.20)
We now need the following useful theorem whose proof can easily be carried out by induction by the reader.
740
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
THEOREM 446. If A, B, C are three elements of a group with the properties (183.15), (183.16), (183.20) then the formulae
BkA,B-k = A'C`k , (AiBkC)r = AirBkrClrtttkr(r-1)
(183.21) (183.22)
hold for all integers i, k, 1, r. We may suppose for the above A, B that APB = BAP, ABq = BqA,
(183.23)
for whenever; if one of these equations does not hold, then we may replace A, B by AP, B or A, Bq, respectively, where (183.14) remains satisfied and
in (183.17) one of the exponents m, n is reduced by 1. By sufficiently many substitutions of this kind we achieve our result. From (183.21), (183.23) we have C° = 1, Cq = 1. Because C 0 1 this gives
o(C) = p
(183.24)
and p = q, so that (183.17) becomes o(A) = pm, o(B) = p".
(183.25)
From (183.15), (183.16), (183.20), (183.24), (183.25) it follows that all elements of c0 may be written in the form A`BkCi
((i i=0 = O,,..
1;
k = 0, ..., p" - 1;
1 = 0, ...
,p-
1).
(183.26)
Of course, these elements need not be different. Every element (183.26) has, according to (183.22), a power of pas order, so that cfd is a p-group. In the sequel the case where 4 is the quaternion group will be disregarded. We choose the A, B above so that
m+n
(183.27)
is minimal. We then show that with a suitable choice of A, B
{A} n {B} = 1.
(183.28)
For, if it were false, then there would be a power other than 1 of A which is equal to a power of B. When we replace A by a suitable primitive element of {A}, which is evidently permissible, we can assume an equation such as
Ai''m' =BP""(o 1) (0<m'<m; 0
(183.29)
GROUPS
741
Since A and B may be interchanged with one another, we can assume that
(0<)m'5n'. For the element
A' = AB-
y,.-m.
(183.30)
(183.31)
we have
A'p'"' = C-
(183.32)
according to (183.22), (183.29).
If the right-hand side is equal to 1, then o(A') I pm'. Since, further, A'B & BA', because of (183.15) and (183.31), A may be replaced by A'. But since we have m' + n < m + n, we are contradicting (183.27). The case still remains, where the right-hand side of (183.32) is different from 1, i.e., the exponent of C is not divisible by p. This means, because of (183.30), that
p=2, m'=n'= 1.
Hence and from (183.24), (183.32),
o(C) = 2, A'2 = C, o(A) = 4. Because of (183.31) we have A'B 0 BA', thus because of (183.27) we have o(A) 14. But as m > m', we have m > 2 and so, by (183.251) ,
o(A) = 4. Because of (183.31) A' = AB-1. Since we now have (AB-1)2 = C, it follows from (183.15) that
A2 = B2, o(A) = o(B) = 4. Therefore if our assertion is false, then any two elements of order 4 of cP which do not commute have the same square. Since A', B are such elements, because A'2 = C we also have A2 = B2 = C. Hence and from (183.15) it follows that BAB'1 = A3. This means that 4 is the quaternion group. As we have excluded this case, the assertion (183.28) is true. Accordingly we have in any case 0(44) > p'+", thus, since C4 is a p-group,
p'+" 10(4). Consequently 0(c4) = pm+"+1 or 0(4) = pm+",
(183.33)
according to whether the elements (183.26) are all different or not. For the case (183.331) we may suppose that m > n, since otherwise we could interchange A, B. We see from (183.15), (183.16), (183.20), (183.24), (183.25) that the equations (183.1) are fulfilled. Since, on the other hand,
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
742
the group defined by (183.1) is evidently of order at most p"'+'+', (183.331) means that cP is this group. For the case (183.332) the elements (183.26) are not all different. Hence and from (183.28) it follows that C is equal to an A'Bk. Because C E Z, p1 l, k. From (183.22), (183.24) it also follows that p'"-I 11 and p"-1 I k. Accordingly an equation such as C = APO-" B""-' y (183.34)
holds, where both exponents must be divisible by p: pI
p"'_ix, pn_1y.
(183.35)
We prove that after we have suitably chosen A and B one of x, y will be divisible by p. Namely if we have p I' x, y, then it follows from (183.35) that m, n >_ 2. Symmetry allows us to assume that
mznZ2. We determine a solution z of the congruence yz
x (mod p)
(183.36)
and put B' = AJ "n-"zB .
As AB'
(183.37)
B'A, (183.27) implies the necessity of o(B') z o(B). Since,
on the other hand, because of (183.22), (183.25) and n >_ 2 ,
B'P"= 1
,
it follows from (183.252) that o(B') = o(B). Accordingly we may interchange B and B'. Because of (183.15), (183.16), (183.37),C remains unaltered.
Moreover, according to (183.22), (183.34), (183.36), (183.37), -IPM -'y
B'P"-'y = CC2
-Iy-1)
If the second factor on the right is 1, i.e., C = B"'"-'y, then the assertion is true. If the above-mentioned factor is not equal to 1, then the exponent in it is not divisible by p. This means that
p = 2, m=n=2. Then we have
o(A) = 4, o(B) = 4.
(183.38)
GROUPS
743
Further, because of (183.34), we have C = A2B2 .
Hence and from (183.15) A2B2 = A-1BAB-1. Thus, because of (183.38), (AB)2 = 1. We have found an element of order two, non-interchangeable with A. But this contradicts (183.27) because of (183.382). This contradiction proves that it may be assumed in (183.34) that p I x or p I y. Then C belongs to {B} or to {A}. If A, B are interchanged, then according to (183.15), (183.16), (183.20) C changes into C-1, so that we can put C E JA}.
If a suitable primitive element of {B} is substituted for B, we can show, according to (183.21), (183.24), that even
C = A""'-'.
(183.39)
Moreover, because of (183.14) and (183.20) we must have m Z 2. We see from (183.15), (183.25), (183.39) that the equations (183.2) are satisfied. But as the group defined by (183.2) is evidently of order at most pm+", so (183.332) implies that & is exactly this group. Consequently we have
proved our assertion for the case (183.19), and have shown that in this case c4 occurs among the groups enumerated in Theorem 444. Secondly we consider the case where c4' is not contained in Z. We denote, as above, a normal divisor of cP with prime index by A. In the above proof of the assertions concerning (183.14), (183.15), (183.16) we have seen that we may also require that
A E r° a (In the previous case keeping to this additional requirement would have been superfluous and even prejudicial.) Furthermore (183.23) may also be assumed, since the proof was independent of assumption (183.19). We show that in addition to (183.231) we may also assume that A° = 1
.
For since according to (183.15) we have BAB-1 = AC, it follows from (183.16) that
BA°B-1 = ARC". This results, because of (183.23,), in
C°= 1 On the other hand, we have
.
BC0CB,
since from BC = CB and (183.15), (183.16) it would follow that cj' _
744
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
_ {C} c ;x which contradicts our assertion. Since, accordingly, we have
4={C,B} and also C E ', we only need to take C instead of A in order to ensure that the requirements for A shall be satisfied. If we write Q for B we can express what has been said so far as follows : In c0 there are two elements A, Q such that AE
(183.40)
o(A) = p .
(183.41)
O(Q) = q",
(183.42)
f
AQ 96 QA ,
cP={A,Q} AQQ = QQA .
(183.43) (183.44)
(183.45)
[These are not independent, (183.43) and (183.44) being equivalent, while (183.45) follows from (183.40) and (183.42), since T is an Abelian normal divisor of index q in c &..l
We put
(i=0,1,...; A0=A).
Ai=Q`AQ-I
(183.46)
According to this we have (i, k = 0, 1, ...) .
QkAiQ-k = Al+k
(183.47)
From (183.45), (183.47) it follows that
A,=AI
(i=j(mod q)).
(183.48)
From (183.41), (183.46),
o(A) = p
(i = 0, 1, ...) .
(183.49)
By (183.40) and (183.46), A. E
'
(I= 0, 1, ...) .
(183.50)
Because of (183.50) the Al generate a subgroup Iwo
= {A0, AI, ... }
(183.51)
GROUPS
745
of which is thus likewise Abelian, moreover because of (183.49) it is an elementary p-group. From (183.44), (183.46) it follows that o is a normal
divisor of 4. In the sequence A0, Al, ... there is a first term Am, for which we have Am E {Ao, ..., Am_i} , and, because of (183.48), certainly
1 <_ m S q .
(183.52)
We choose the starting element A (without altering OL , Q) so that
m is minimal.
(183.53)
From the assumption it follows, because of (183.47), that Am+' E {Ai,
..., Am-1+1} f
then by complete induction
Am+' E {A0,.. ., A.-,I
for all i = 0, 1,
.... Accordingly Xo = {Ao, ..., Am_i} .
(183.54)
Further, as A0, ..., Am_1 are independent, they constitute a basis of whence
O(fo) = p'a .
(183.55)
In the following we use polynomials f(x), g(x), F(x) ... in an indeterminate x with integral coefficients, which are considered mod p, so that they are also to be understood as polynomials over the prime field F,,. To every polynomial such as
f(x)=ax'+...+ao we assign the element AX) = Aar
.. Aa,o
of moo. This correspondence is unique, because of (183.49). Moreover it is evident that o consists of all the different f(x). Evidently f(x) + g(x) = MAX)
(183.56)
0 = 1, - f(x) = (Ax))-'
(183.57)
c f (x) _ (f(x))`
(183.58)
and more generally (c E Y) .
746
FINITE ONE-STEP NON-COMMUTATIVF STRUCTURES
From (183.47) we have
(k = 0, 1, ...) .
xkf(x) = Qkj(x) Q-k
(183.59)
Thus the formula
(x - 1) f(x)
(Q, Ax)) = Q AX-) Q-1(f(x))
(183.60)
holds for the commutator. Since, further, according to (183.48), AA, -l
=
= 1, it follows that
x9-1=1.
(183.61)
Because of (183.50), (183.54), (183.55) we have
A. Am_i ... Ao = 1
(183.62)
with exponents uniquely defined mod p. Correspondingly we introduce the polynomial (183.63) F(x) = x' + cm._.1xin-1 + ... Co. Then
F(x) = 1
(183.64)
.
We prove that
(x - 1)11
(r=0, 1,...).
(183.65)
For this we denote the left-hand side by B,. Here, B0
B1=A1Aa1
1.
Then let r>- 2 and assume the assertion for smaller r. According to (183.60), QB1-1 Q-1Bi l 1 = BI
(i = 1 , 2, ...) .
Since, because of the supposition, B,_1 1, we have B,_2 Q that d4 _{B,_2,Q).
QB,_2, SO
Since the commutator B,_1 of these generators Q, Br_2 is interchangeable furthermore since dj' does not lie with B,_2 (because B,_1, Br_2 lie in in Z, it follows that Br_i, Q cannot be interchangeable. This implies that Br 0 1, so proving (183.65). Since, in FP[x], xP - 1 = (x - 1)P, it follows from (183.61), (183.65) that
p# q.
(183.66)
GROUPS
747
We prove that F(x) is irreducible. For this purpose we consider a factor decomposition (f(x), g(x) E FP[x]) F(x) = f(x)g(x) where x*_1
f(x) = xs + ds_l
+ ... + do
(s > 0) .
Because of (183.64), (183.65) we may assume that
f(x)#x- 1. It is sufficient to prove that necessarily s = m, since from this the irreducibility of F(x) follows because of (183.63). We therefore write
B = g(x),
Bi = Q'BQ-`
(i=0,1,... ; Bo = B) .
Because of the assumption we have
F(x) 0 (x - 1) g(x) .
From this, because of (183.60) and the definition of F(x) it follows that
QBQ-'B-' + 1, and so
di ={B,Q}. Since, furthermore, B E A o and so o(B) = p, B satisfies the requirements for A in (183.40) to (183.45). But we also have s
F(x) =
=0dkX'AX)
whence it follows, because of (183.56), (183.58), (183.59), (183.64), that
Bs Bds'l s-1
Bd. 0 =1
therefore, because of (183.53), s>-m, i.e., s = m. This proves the.irreducibility of F(x). Now, because of (183.56), we have a homomorphism Fp[xl+ ,.,
(f(x) -)-f (x))
,
(183.67)
the kernel of which we denote by a. Then
f(x) E a a f(x) = 1
.
(183.68)
We show that a is an ideal of FP[x]. By definition a is a submodule and so because of (183.59), (183.68) an 'ideal of Fp[x].
748
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
0 1, a o Fp[x] follows from (183.67). FurtherSince we have more F(x) is, according to (183.64), (183.68), an element of a. From both, because of the irreducibility of F(x): a = (F(x))
(183.69)
follows.
Because of (183.61), (183.65) and (183.68),
x4-IEa, x-11a. Hence and from (183.69) owing to the irreducibility of F(x), it follows that x9 - 1 F(x)
X- 1
'
(183.70)
so that for the degree m of F(x), by § 133, Example 3,
m = o(p(mod q)).
(183.71)
We make use of the finite field F over FP such that
O(F) = p'
.
(183.72)
Since F(x) is irreducible and of degree m, we may put F = FP(w)
(183.73)
F(w) = 0 .
(183.74)
where
Because of (183.70), wQ = 1, co 0 1 also holds, thus o(w) = q .
(183.75)
is normal in d d, the elements of ci, because of (183.44), may be Since written in the form .f(x) Q' . From (183.68), (183.69),
f(x) = g(x) «f(x) = g(x) (mod F,(x)) . But, according to (183.73), (183.74),
(x) = g(x) (mod F(x)) a f(w) = g(w) .
GROUPS
749
Since, accordingly, the arbitrary element (,-r) of 2l° is completely deter. mined by the element f(co) of F, we may write the elements of X0 in the form f(x) = Pn.)-
(183.76)
Then the PP
(183.77)
(a E F)
are all the different elements of Ta. Because of (183.42), according to the above, the (a E F; i =0,...,q" - 1) (183.78) P. Q' are all the different elements of 4. Since from (183.59), (183.56) -f(X)Q' . g(x)Q" = f(x) + x'g(x)Q' +k
,
after writing the elements in the form (183.76) we get P,Q i
k - Pa+w'fQi+k PeQ-
(183.79)
Thus the elements of 44 have to be multiplied according to this rule. With regard to (183.71), (183.78), (183.79) we see that (4 is completely characterized by the determining elements p, q, n, co. The elemdht co of F according to (183.75), (183.74) is there subjected to the conditions o(co) = q, F(co) = 0. The second of these conditions may be neglected, i.e.,
we may take for co in (183.79) a quite arbitrary element of F such that o(w) = q, because from this, as will be shown immediately, we obtain groups isomorphic with 4. All the co such that o(w) = q may be given by any arbitrary one of them in the form wr (r q - 1) Now according to (183.79), P«Qri
PpQri =
Pa+wrrJ5Q('+k)
.
The comparison with (183.79) shows that the substitution of Qr for Q has the consequence that in the multiplication rule (183.79) the element to is exchanged for co". This establishes the required isomorphy. From this last proof and from (183.66), (183.71), (183.72), (183.78), (183.79) we see that c4 is in fact one of the groups given in Theorem 445. It still remains to be proved that, conversely, the groups given in Theorems
444, 445 are one step non-commutative and are not isomorphic with each other.
First of all it is evident that the quaternion group is one step noncommutative. Since, furthermore, condition (183.28) is not satisfied here, it is not isomorphic with any of the remaining groups given in Theorem 444.
750
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
The groups defined by (183.1) and (183.2) are denoted by co, and 4 2 respectively. First of all we consider coil. It is evident that its elements may
be written as
(i==0,...,p' - 1;k=0,...,p"- 1;1=0,...,p- 1),
A'BkCI
(183.80)
If we show that equality holds here, it follows so that O(d4j) that all the different elements of coil are given by (183.80). For this purpose we consider the set G of triples i, k, I
of integers i mod p', k mod p", 1 mod p. From the full permutation group .9(S) we take the following three permutations
_
i
kl
B'
_ ik / C'-lik1+1)' = ik+1i+1)' r
k
l
(183.81)
which are evidently of orders p', p" and p. According to (1 83.81)
B'A'B" _
i
k+1 i+l
_
i
k1
i+1k+11+l+1,i+1 k1+1,
A'C'
'
and also A'C' = C'A', B'C' = C'B'. Accordingly, the equations (183.1) are satisfied by (183.81), from which by Theorem 101 the homomorphy
dil x {A', B', C) follows. Therefore it is sufficient to show that the group on the right-hand side is of order at least p"'+"''. It is not commutative and because
B'P =
I
k l)
ik+pl,
it contains the direct product {A'} ® (B'P} ® {C'}. This is of order p'+" and its index in (A', B', C) is at least p. Hence the assertion is proved. Since, according to this, the elements (183.80) are distinct, it follows also that cPil is not commutative. In order to show that c4JI is one-step noncommutative we have to demonstrate that any two non-interchangeable elements X = A`BkCI , Y = A"Bk'('c'
GROUPS
751
of 4, are already generators of this group. Here Ymay be replaced by any X'Y which, because of (183.1), allows us to restrict ourselves to the case where i
or i' is equal to 0. We can put i' = 0. The assumption and Theorem 446 implies that p,f' V. If we then replace X by a suitable XY', we also obtain k = 0. Since two elements of the form X = A'C' ,
Y = B"C" ,
where p,j' i, k', are evidently generators of d41, so the assertion has been proved.
The corresponding results may be proved for dg, as follows: What we have proved concerning co, did not necessitate the assumption m >_ n, and so remains true for all m >_ 2, n >_ 1. As APm-', C are central elements of dul, so {APm-' C-1} is a normal divisor of order p of 4g,. A glance at (183.1) and (183.2) shows that `42
zt; 41 / {APm-` C-1} ,
implying that co, is in fact of order p'"+" and one-step non-commutative. (This proof holds even for the case p°'+" = 8, although we have not yet made use of this fact.) We see from Theorem 446 that in c& the commutator group {C} is prop-
erly contained in a cyclic subgroup only when p = 2 and m = n = 1, i.e., O(c.Pg,) = 8. Since for 42, on the other hand, O(c42) > 8 and its commutator group {APm-'} is properly contained in {A}, it follows that no d%g, can be isomorphic with any c42. Both in 4,, if 0(ddl) > 8, and in 42 because of Theorem 446 the maximum of the orders of the elements is equal to p,a,(m,") Hence it is obvious that neither the d41 nor the 442 are isomorphic with each other. Finally, we consider the group defined in Theorem 445 which we denote
by cog. As opposed to the previous cases we have here to prove first the existence of cog, for cog is not defined by equations like cogs and 42, but by the
multiplication rule (183.8). We make the observation that all integers i, k may be admitted in (183.8) because o(w) = q and o(Q) = q". For the product of any three elements of c4 we have, according to (183.8), (PPQ1'PPQk) PyQI =
p.'d'3Q'-1k pYQl
-
Pn+w"/3+uat+k'Q1+k+1
P«Q'PP"u,'Qk+1 _ P.+w,(P+ruky)Qi+k+l
Hence the multiplication (183.8) is associative. Evidently P0 Q° is the unity element. The left inverse of PgQk is P_,0_k8Q-k. Thus we have proved the existence of the group jog. 25 R. - A.
752
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
It is obvious that cQ,l is not commutative. We still have to show that c4 is generated by any two non-permutable elements
X=P«Q', Y=p'Qk. Since we may replace X and Y by XY' and X'Y, respectively, we may assume, e.g., that k = 0. Equation (183.5) and the assumption imply that q,j'i. Since X may be replaced by every primitive element of {X}, we may assume that i = 1. Since now
X=P,Q, Y=P,j, 0, we have fro ni (183.8)
where
(i=0,...,m - 1).
X'YX-l = Since
cof,
...,
O)m-I Q constitute th(e'
a basis of F, it follows that the group
P. (o E F). It then also contains P_,X = Q, con{X, Y} contains all sequently it is equal to L. Accordingly, dj is one-step non-commutative. Because of the diversity of the orders, df cannot be isomorphic to either ddI or c1762.
Since, according to (183.8),
Q'PIQ-` Pl I = P.,-1
(i=0, ..,m- 1),
the P. (a E F) make up the commutator group of c0,1, which is then of order
p'. From this it evidently follows that no two of the d3 are isomorphic. Consequently we have proved Theorems 444, 445. For finite one-step non-commutative groups cf. MILLER- MORENO (1903), SCHMIDT (1924), REDEI (1950a, 1958b). Further literature and applications see in GOLFAND (1948), IT6 (1955), REDEI (1951a, 1955-56), SUZUKI (1957).
EXAMPLE 1. From Theorem 445 it follows that there is at least one one-step non-commutative group of order pq" (p, q different prime numbers, m, n > 1) if, and only if, one of the conditions in = o(p (mod q)), n = o(q (mod p)) is satisfied. There are two such groups if, and only if, both these conditions are satisfied.
There are never more than two such groups. EXAMPLE 2. The centre of the group given in Theorem 445 is the cyclic subgroup (Q°} of order q"-1. The only centre free case is that with n = 1. In this case the commu-
tator group is the one and only one proper normal divisor. EXERCISE 1. A one-step non-commutative group (of order 8) is defined in the (excluded) case p'+" = 8 by the equations (183.2), which is, however, isomorphic with one of the groups given by (183.1). EXERCISE 2. Every finite one-step non-commutative p-group is the homomorphic image of a group defined by (183.1).
GROUPS
753
EXERCISE 3. Every finite one-step non-commutative group is a homomorphic image of the group defined by the equations ABTA-1 = Bt+1,
B;B, = Bi$,
(i,.% E J).
PROBLEM. Are there infinite one-step non-commutative groups?
§ 184.* Finite One-step Non-commutative Rings
In virtue of our general definition a one-step non-commutative ring means a non-commutative ring with only commutative proper subrings. The finite rings of such a kind are determined by the following two theorems.
As a preliminary, let us consider elements e, a from a ring R and a polynomial from 7[x]:
f(x) = c + g(x)
(c E J ; 9(x) E Y [x] = xxY [x]).
Though f(e) is then in general meaningless, the "product" f(e)a = Ca + A00, may always be defined. (The reader will easily see that this is an operator product and that this is essentially a special case of the construction in § 62, Exercise 5.) Then f(e)e is the substitution value of f(x)x for x = e. THEOREM 447. In order to determine the non-nilpotent finite one-step non-commutative rings, we define for every prime number p and every natural number m a ring
RI = {e, a}
(184.1)
p"', = 0, pa = 0, e2 = e, ea = or, ae = 0, a2 = 0;
(184.2)
by the equations
further, for any (not necessarily distinct) prime numbers p, q and natural numbers m, e, n (n < q) we define the ring
R11={e,a}
(184.3)
by the equations
e =0, pa=0, F(e)e=0, F(e)a=0, are = epa,
a2 = 0
(P = p" °e-,),
(184.4)
where F(x) (E J [x]) denotes a fixed arbitrarily chosen irreducible principal polynomial mod p of degree qe the particular choice of which is of no importance since the R11 belonging to fixed p, q, m, e, n and different F(x) are
754
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
isomorphic. The rings RI (belonging to all the p, m), the rings opposed to these and the rings RII (belonging to all the p, q, m, e, n) then constitute a complete system of non-isomorphic non-nilpotent finite one-step noncommutative rings. There, RI is of order O(RI) = pm+l,
(184.5)
it has the basis elements e, a, where
o+(e) = p', o+(a) = p,
(184.6)
and
(ae + ba) (ce + do) = ace + ado,
(a, b, c, d E -.7),
(184.7)
for the product of two elements. RII is of order O(R1I) = p(m+1)WW ;
(184.8)
its elements may be uniquely given as
a(e) + b(e)a
(a(x) E 7[xlo, b(x) E 7[xl),
(184.9)
where a(x), b(x) run through a representative system of J [xb mod (pmx, F(x)x) and of J' [x] mod (p, F(x)), respectively. The rule
(a(e) + b(e)a) (c(e) + d(e)a) = a(e)c(e) + (a(e)d(e) + b(e)c(e)0a (a(x), c(x) E J' [x b, b(x), d(x) E J [x]) (184.10)
holds for the product of two elements. THEOREM 448. The nilpotent finite one-step non-commutative rings are the rings (184.11)
Rn1
defined by the equations
pme=0, p"o=0, er=0, a,=0, o2a = eoe = ae2,
(184.12)
eat = aoa = ate, pea = pae
and their non-commutative homomorphic images; here p is a prime numbe
and m, n, r, s are natural numbers such that m S n and r, s
2. R111 is of
order O(R1I1) =
p1+m (r- 1) s+ n (s- 1)
(184.13)
755
RINGS
with basis elementsoa - o, e'a'(01) for which
o+(Pa-aO)=p; o+(ea')=p' (1 <-i
o+(aj)=p"(1 <j<_s- 1), and here the rule
(a(er - aee) + t,j E au 'a) (b(Pa - aN) + Yk,lbkl?ka) _ (184.15)
_ -aoibio(Na - erg) + E aljbkl°'+kaj+l i,j.k.l
holds for the product of two elements where a, a,j, b, bkl denote integers and we have to understand finite sums. NOTE. The equations (184.12) are invariant with respect to substituting e, m, r for a, n, s. Hence it follows that we obtain all the different RIII, when we
prescribe that if m = n then r < s. It would be easy to show that then the RIII are not isomorphic with each other. However, if we wished to state Theorem 448 in a similar perfect form, like that of Theorem 447, then it would be necessary to solve the homomorphy problem for the rings Rill, but it seems to be a difficult problem. PROOF OF THEOREM 447. We first make these trivial observations concern-
ing one-step non-commutative rings: These rings are characterized by the fact that we can generate them by any two non-interchangeable elements.
The homomorphic images of these rings are commutative or one-step non-commutative. Every such ring is directly indecomposable, thus in the case with which we are concerned, i.e., in the finite case, it is, because of Theorem 197, necessarily a p-ring. The commutator a4B - floc of two ring elements a, j3 is always denoted by [a, 9]. We denote by R a non-nilpotent finite one-step non-commutative ring, which therefore is a p-ring. We want to prove that R is either a ring RI or a ring opposed to an RI ring or a ring R,I. We denote the radical of R by n. Because of the second part of Theorem
259, n is nilpotent. Hence n s R. According to Theorem 260 the factor ring R/n is radical free, thus, as a finite ring, semisimple. Consequently from the first structure theorem of WEDDERBURN - ARTIN (Theorem 256) the
direct decomposition R/n = Fl (D
... q)
Fk
(184.16)
follows, where the F1 are full matrix rings over (finite) fields of characteristic p.
Since the F1 are homomorphic images of R, they are partly commutative,
756
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
partly one-step non-commutative. But as a full matrix ring of rank > 1 over a field always contains non-commutative proper subrings, so F1, . . ., Fk must be fields.
We show that k = 1. For this we suppose that k >_ 2 and denote by RI,
.
. ., Rk those subrings of R, for which
R,/n = F;
(i = 1, . . ., k).
(184.17)
For every i we take an element er from Rf such that the residue class
Ic,=ei+tt
(i= ],...,k)
(184.18)
is a generator of the field F,. Let.fi(x) be the minimal polynomial of x,.
(We suppose here that the coefficients of f(x) are integers considered mod p but fixed somehow.) The constant term of f, (x) is denoted by c,. 0, so p x c,. We put As Ici g,(x) = f ,(x) x = c,x + hi(x)
(i = 1, ..., k)-'
(184.19)
where the h,(x) are polynomials divisible by x2. Because of (184.18) and g,(rc,) = f (rq)IC; = 0 we have
gi(e) E it
(i = 1, ..., k).
(184.20)
Since the RI, ..., Rk, because k >_ 2, are proper subrings of R, they are commutative. Since, further, by (184.17), they contain it, it follows from (184.20) that
[g,(e,), e;] = 0
(i, j = 1, . . ., k).
(184.21)
On the other hand, it follows from (184.16) that icircj = 0 (i 0 j), i.e., because of (184.18), ere; E it (i 0 j). Since it is an ideal of R, it follows that ej E
Q
i,j= 1,...,k; a,b>_ 1).
Therefore, since it c R, and by the commutativity of R,: c-1 c-1 ere, = erC-1 ere; = ereier = eer er = e,er (c This, (184.19), (184.21) and x2 I hi(x) result in
0 = [crer + hi(ei), e,] = [tier, e,] = cr[er, e,]
2; i, j = ], ..., k). (i, j = 1, .
.
., k).
Since p,f'c and R is a p-ring, it follows that [ei, e;] = 0 (i, j = 1, . . ., k). Since, on the other hand, according to (184.16), (184.17). (184.18)
R={el,...,ek,tt}, R,={er,n}
(i= 1,...,k)
RINGS
757
and the R, are commutative, R is also commutative. This contradiction proves the assertion that k = 1. From this and (184.16), R/n is a finite field. We write R/n = F.
(184.22)
We take an element o of R, for which the residue class e + it is a generator of the cyclic group F*, and so, a fortiori, of the field F:
F = {e + n},
o(o (mod n)) = O(F) - 1.
(184.23)
From (184.22), (184.231) it follows that R = {o, n}.
We show that the degree of F is a prime power (>_ 1), i.e. (Q = q`; q prime, e >_ 0).
O(F) = pQ
(184.24)
Otherwise F would contain two proper subfields F1, F2 such that F = {F1, F2}.
We denote by R1, R2 the proper subrings of R determined by R1/n = F1,
R2/n = F.2 ,
whence R = {R1, R2}.
(These F,, F2, R1, R2 have nothing to do with those in (184.16), (184.17).)
Any two elements 01, Q2 of RI and R2, respectively, may be taken in the form 01 = fl(o) + P1 ,
e2 = f2(o) + v2 ,
where f1(x), f2(x) are polynomials from 7 [x]o and I. 1'2 elements from it. Since R1, R2 are commutative and contain it, so [f1(e), v2] = 0, Since, further,
(f2(o)3 v11 = 0,
[v1, v2] = 0.
[f1(e),f2(o)] = 0 trivially, it follows that [o1, 021 = 0.
We have shown that R is commutative, a contradiction by which we have proved that the degree of F is a prime power. From (184.232) and (184.24) it follows that PQ 10
= o (mod n).
(1 84.25
758
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
Because R = {o, it) there is an element a of it such that [o, a]
0
(184.26)
whence (184.27)
R
We define recursively
(f = 1, 2, ...).
al = [e, al, ai+I = [e, (1,]
(184.28)
As it is an ideal of R, so, together with a E it, all the statements
a, al, a2i ...E IT
(184.29)
a, al, a2, ... # 0.
(184.30)
hold.
We prove that
0, therefore, a fortiori, a
According to (184.26), (184.281) al
0.
In order to prove next that a2 0 0 we assume that a2 = 0. This means according to (184.28) that
o(oa - an) - (oa - ao)o = 0, i.e.,
ao2
= 2nao - o2a.
Hence, generally,
an' = io1-lao - (i - 1)o'o
(i = 2, 3, ...),
(184.31)
since
ao2 - (i - 1)olao =
ao'+i = ao`o =
= 2iotao - io'+1a - (i - 1)o'ao = (i + 1)0'ao - io'+Ia. On the other hand from (184.25) it follows that npkQ
- o (mod n)
(k = 1, 2, ...).
As it is commutative and contains a, from this and from (184.26) it follows that [QpkQ,a]
0
(k= 1,2,...).
However, if we apply (184.31) to some i = pkQ with pkQ i = 0, then we obtain aopkQ = opkQa,
which is a contradiction. This proves that a2 write [o, [o, a]] 0.
0, for which we may also
RINGS
759
In order to complete the proof of (184.30) we suppose that a, e 0 has been proved for some i (>_ 2). Because [o, a,_1] = a, 0 0 (and (184.29)) we may apply the result now obtained to a,_1 instead of a. Then we obtain [o, [e, a,-I]] # 0, i.e., [because of (184.28)] a,+1 generally.
0. Consequently (184.30) is proved
We show that among the a defined in (184.26) there is one with
a2 = 0, pa = 0.
(184.32)
Since it is commutative and contains the elements or, a1, Oa1, pe, we have
a1 = eaa1 - aQOa1 = Oaa1 - gala = Qaal - eaa1 = 0,
pal=P[A,a]= [peg a]=0. These mean that (184.32) is satisfied for a1 instead of a. Since according to (184.28), (184.30) we also have [O, all = a2 96 0, al is an element equivalent to a so proving the assertion concerning (184.32). Later we also assume (184.32) and show that for the principal ideal
(a) (C n)ofR (a)2 = 0 and p(a) = 0.
(184.33)
From the commutativity of (a) it follows that (a)2 = (a2). According to this and (183.321), (184.331) is true. From (184.322) (183.332) follows trivially.
Here we distinguish two cases and deal first with the easier case, where among all the possible pairs e, a there is at least one such that aO = 0 or
ea = 0. We show that R is then an RI or anti-isomorphic with an RI which is just a part of the above assertion. It is sufficient to show that from the supposition ae = 0
(184.34)
it follows that R is equal to R1. Because of (184.25) and the nilpotence of it, (e - el Q)k = 0 for some k >- 1. This equation, because pQ >- 2, may be written in the form ek
25/a R. -A.
= ekf(e)
760
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
where f(x) is a polynomial from 7 [x]0. Hence 12k = Qk(f(e)),
(i = 1, 2, ...).
If we apply this with 1 = k + 1, we obtain Qk = Q2kg(Q)
(184.35)
for some polynomial g(x) from 7 [.Y]o. Since, according to this, we have QkQI = Q2kg(o)Qi
(i = 1, 2, ...),
it follows immediately that Qkg(Q) = Q2k(g (9))2.
This and (184.34) imply that for the element o = Qkg(Q)
of R,
o2 =Q ',
(184.36)
aQ'=0.
(184.37)
We show that
Q'a 0 0.
(184.38)
If Q'a = 0, then from (184.35), (184.36) we have Qka = 0. We denote by d the least natural number with the property Qda = 0.
(184.39)
From (184.26), (184.34) it follows that Qa 0 0, therefore d other hand, from (184.34)
2. On the
a(o - QPQ) = 0.
Because of (184.25) both factors lie in n, therefore, since It is commutative, (Q - QPQ )a = 0.
Then, a fortiori,
(9d-1
-
Qd+PQ-2 )a
= 0.
Because of (184.39) and since pQ >- 2, we have Qd-1a = 0. Since this contradicts the minimal property of d, we have proved (184.38). We put a,
= e'a.
Because of (184.33), (184.37) and (184.38) we then have
a'2=0,pa'=0, Q'a'=a'00, a'go'=0.
(184.40)
RINGS
From this [o', a]
761
0, and so
R={o,
(184.41)
We now put o+(e') = pm
(184.42)
for some natural number m. From (184.37), (184.40), (184.41), (184.42) it follows that R is a homomorphic image of the ring Rl defined by (184.1) and (184.2): RI - R.
Since, further, R is not commutative, so from (184.42) it follows that pm+l 10(R).
But, on the other hand, (184.1) and (184.2) imply that every element of RI
may be written in the form ao +ba (a=0,...,pm-1;b=0,...,p- 1), thus we have O(RI) I pm+'.
From these three relations the required isomorphism of R with RI follows. For the other far more difficult case, where for all the possible pairs o, a oa
0 and ae
0,
(184.43)
we shall prove that R is then a ring R11 First, we show that o'aoj
0
(i, j = 0, 1, ...).
(184.44)
For this we suppose that (184.44) is false for some pair i, j, i.e. o'aoj = 0,
where, according to (184.43), i + j z 2. If i Z I then
o'-lae1+1= 0 since otherwise a' = o'- aej would be an element of it with
oa'=0, a'o56 0. [o,a']00, which is impossible because of the supposition in (184.43). One similarly
proves that if j z 1 o'+t aoi-1= 0.
762
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
By induction it follows that all the equations
ease`+j_p=0
(a=0,...,i +j)
hold. On account of the definition (184.28) this gives ai+j = 0. Since this is impossible according to (184.30), we have proved (184.44). Since al is, as mentioned already, an element of it equivalent to a, it follows from (184.44) that e'aiej 0, i.e., (184.45) (1,.1 = 0, 1, ...). [e, eaoj] # 0
We denote by F(x) the minimal polynomial of the generating element + it of F [cf. (184.231)], for which therefore F(e + n) = 0;
(184.46)
with regard to (184.24), F(x) is then a principal polynomial of degree Q, which we may suppose to be an irreducible element of.7 [x] modp. Of course,
the coefficients of F(x) (apart from the leading coefficient 1) are determined mod p, but we shall later choose them suitably fixed. For the sake of brevity we write (184.47) G(x) = F(x)x. Then from (184.46) (184.48) G(e) E n.
We prove that by suitably choosing a the equation
G(e)a = 0
(184.49)
holds.
Because of the nilpotence of It the existence of a natural number k with (G(e))ka = 0
follows from (184.48). If k = 1, (184.49) follows. If k z 2 we assume the assertion for smaller k. If 0, [e, (G(e))k-'al
then a' _ (G(e))k-la is an element of it equivalent to a; since we have G(e)a' = 0, the assertion concerning (184.49) is now true. In the other case we have
0 = [e, (G(e))k-la] _ (G(e))k-'[e, a] = (G(e))k_la1.
Thus if we replace a by the (equivalent) element a1, then instead of k we get a smaller number. Because of the assumption this proves the assertion concerning (184.49).
RINGS
763
From (184.48), (184.49) and the commutativity of n it follows that aG(e) = 0.
(184.50)
Because of (184.47), (184.49), (184.50),
F(e)eae = 0,
eaeF(e) = 0.
Thus if we replace a by the element eae which, according to the case i = j = I of (184.45), is its equivalent, we even have [over and above (184.49), (184.50)]
F(e)a=0 and aF(e)=0,
(184.51)
which we now also assume. We show that e>1
(184.52)
[which, according to (184.24), means that the common degree Q of F and F(x) is at least q].
If e = 0, then O(F) = p according to (184.24), thus F(x) = x + c for an integer c (which is prime to p, though this will not be considered). Hence and from (184.51) it follows that
(e+c)a=0, a(e+c)=0, and then ea = ae. This contradiction proves (184.52). As a refinement of (184.51) we prove for elements a' (# 0) of the ideal (a) and polynomials f(x) from ..7 [x] the rule :
J(e)a' = 0 a a'f(e) = 0 a f(x) - 0 (mod p, F(x)).
(184.53)
For this we interpret the ideal (a) of R as an Y [x]-double-module such that we define for any r (E (a)) and J(x) (E -' [x]) the left- or rightoperation, respectively, by
f(x)* = f(e)r, rlx) = 'Cf(e). We then denote by I and r the annihilating left and right ideals, respectively,
of the element a', which are thus ideals of 7[x]. Both contain, because of (184.332) and (184.53), the maximal ideal (p, F(x)) of 7[x]. On the other hand, they are different from 7[x], since a' 0 0, thus both are equal to (p, F(x)). This proves (184.53).
764
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
For the time being we write F(x) explicitly :
F(x) = xQ + a1xQ-' + ... + aQ (a,, ..., aQ E .Y).
(184.54)
We have recourse to the so-called Horner polynomials
Fi(x) = xJ + alxj-1 +
... + a,
(j = 0, ..., Q; Fo (x) = 1, FQ (x) = F(x))
(184.55)
and put at = aeQ-1 + F1(e1aeQ-2 + ... + F0_1(e")a
(i = 1, ..., Q - 1).
(184.56)
Al] these r, are elements of the ideal (a). Since according to (184.55) we have
(1= 0, ..., Q - 1).
xF,(x) = Fj+1(x) - aj+l it follows from (184.56) that
o"r, = (Fi(ed) -a
j)aeQ-1 +
i.e.,
e,"rt = rte - aF(e) + F(el)o
... + (FQ(e") -aQ)a, (i = 1, ...,
Q-
1).
The last two terms vanish because of (184.53), therefore Tie = e°fTi
(i = 1, ..., Q - 1).
(184.57)
We prove that not all the rl, ..., TQ_1 vanish. For let us assume that r1 = ... = TQ_1 = 0 and put
Q - 1),
ei = e°,
j=aeQ-j_1
(j=0,...,Q- 1).
(184.58)
(184.59)
Because of (184.56) the assumption then reads as follows :
o + F1(e)51 + ... +
0
(i = 1, ..., Q - 1).
(184.60).,
The matrix of the coefficients of this homogeneous system of linear equations is
1 FI(e)
... FQ_1(e)
1 F1(eQ_1) ... FQ_1(QQ-1)
RINGS
765
After cancelling theft column a square matrix is obtained whose determinant is denoted by Dj(I = 1, . . ., Q). From (184.60), according to CRAMER'S rule (Theorem 172), it follows that (184.61)
0.
According to (184.55) suitable multiples of the columns may be subtracted from subsequent columns in DQ so that the Vandermonde determinant, consisting of the rows
1, el'...,
eQ_2
(i = 1, ..., Q
is obtained. After a similar process DQ_1 becomes the determinant consisting of the rows
1, or, ..., PQ-$, PQ-1 + atoQ-2
(I= 1, ..., Q - 1).
Hence (§ 115, Exercise 1),
DQ_1=(ei+... + oQ_1+a1)DQ thus according to (184.59), (184.61), DQ(aw + (got + ... + QQ_1 + at)a) = 0.
(184.62)
According to the above, because of (184.58), we have:
Do = d(o) with 4(x) _
fl
(xP' - xP).
15I<j5Q-1
But as F(x) is irreducible mod p and is of degree Q, on account of Theorem 308, 4(x) # 0 (mod p, F(x)). Hence and from (184.62) it follows, according to rule (184.53), that
ao +
(e1 + . .
. + oQ_ 1 + a1)a = 0.
(
4.63)
Because of (184.54) and Theorem 311:
x + x° + ... + xP°-' + a1 = 0 (mod p, F(x)). On account of rule (184.53) and according to (184.58),
(o+Ql+ ...+oQ-1+a1)a=0, thus (184.63) becomes ae - ea = 0. This contradiction proves the assertion that not all the rl, . . . 7.Q_ I vanish. For these elements we further prove that if aI # 0 then [o, c;l
0.
(184.64)
766
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
The validity of (184.64) is, according to (184.57), equivalent to (eP, - Q)TI # 0.
Furthermore, because r, ,-E 0 and by rule (184.53),-this is identical with
0 (mod p, F(x)).
XP, - x
Since the latter follows from (184.231), (184.46) and from Theorem 308, the assertion in (184.64) is proved. Of the TI, ..., TQ-1 we now take a T, other than 0 (0 < t < Q). Because of (184.64) this is not interchangeable with n and because of (184.56) it lies in (a), therefore, a fortiori, in n, consequently it is an element equivalent to or. When we replace the previous or by this Ti retaining for it the notation a, we may now assume that ao = e"`a.
(184.65)
[The reader will see that (184.26), (184.27), (184.29), (184.30), (184.32), (184.33), (184.44), (184.45), (184.51), (184.53) remain valid.] We prove that
q`-1 I t.
(184.66)
As a preliminary we show that R = (ok, n}
R
(184.67)
for every natural number k. From the right-hand side the left-hand side follows trivially. Now assume that the right-hand side is false, so that ok and a are interchangeable with each other. Hence it follows, because of (184.27), that ok lies in the centre of R. Furthermore, since it is commutative, IQ', u} is also commutative, i.e., properly contained in R. Hence (184.67) is true. The proof of (184.66) will be obtained by the following transformation of
(184.67). Its left-hand side, because of (184.22), means that {ek, n}/n = F. It also means that the residue class Qk + it = (o + n)k is a generator of F. This, because of (184.23.), (184.24) and Theorem 307, means that
pq` - I I' k(p9`-' - 1). On the other hand, according to (184.23), (184.24), (184.46), (184.53), (184.65) and according to § 133, Example 9, we have R
(okpr
- ek) a 0 O q xkPl - xk - 0 (mod p, F (x)) ok,a} - [nk,a] 0 q to xk(Pl-1) - 1 (mod p, F(x)) pq` - 1.}' k(p` - 1).
767
RINGS
According to this the rule (184.67) means that pq`-1
-1=
(pqe
- 1,p` - 1),
whence (184.66) follows.
Since, according to (184.24) we have Q = qe, we may write t, because
of (184.66) and 0 < t < Q, in the form t = n qe-1, where n is a natural number and n < q (exactly as in Theorem 447). Then (184.65) may be written as
Qa=QPa,
(184.68)
where P has the same meaning as in (184.4). We prove the generalization of (184.68): af(e) = (.f(Q))Pa
(Ax) E 5' [x]).
(184.69)
From (184.68) we know that af(O) = f(o' )a and hence, because of rule (184.53), (184.69) also follows. From (184.27), (184.321), (184.65) it follows that all the elements of R are expressible in the form
a(e) + b(Q)a
(a(x) E 7 [x]o, b(x) E 7[x]).
(184.70)
For this we prove the rule a(e) + b(O)a = 0 - a(e) = 0,
b(O)a = 0.
(184.71)
To do so we assume the left-hand side. It is sufficient to prove that b(O)a = 0. 0 (mod p, F(x)), whence
If this is false, then according to (184.53), b(x)
we can infer the existence of a polynomial f(x) from Y [x] such that f(x)b(x)m 1 (mod p, F(x)). Again according to (184.53), f(Q) b(O)a = a. Hence
and from the assumption we have f(e) a(e) + a = 0. Because [Q, a] 0 0 we have got a contradiction, therefore (184.71) is now proved. All the a(x) from . Y [x]o with the property a(Q) = 0 constitute an ideal of . Y [x]o, which we denote by ao. This definition is expressed in the rule a(e) = 0 c* a(x) = 0 (mod Q.
(184.72)
Hence and from (184.53), (184.71) we obtain the rule
a(e) ± b(O)a = 0 .* a(x) = 0 (mod ao), b(x) = 0 (mod p, F(x)).
(184.73)
Before determining ao we prove that from the assumption (f(x'))P * f(x) (mod p, F(x))
(f(x) E .7[x]o)
(184.74).
768
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
follows the existence of a polynomial g(x) such that
g(f(x)) - x (mod ao)
(g(x)
According to (184.53), (184.74), (f(o))"a (184.69), it follows that [f(o), a]
0,
(184.75)
f(n)a. Hence, according to
R = {f(o), a},
E {f(o), a}.
Because of these relations, by similar reasoning to that by which we obtain ed (184.70), we obtain the equation o = g(f f(°)) + h(e)a
(g(x) E 7 [x]o, h(x) E 7 [x]).
In this equation the last term must vanish according to the rule (184.71); then from this and from rule (184.72) the validity of (184.75) follows. We now want to determine the ideal ao; this work will be somewhat bothersome. According to the third paragraph of § 123 we have ao = xa
(184.76)
for some ideal a of 'Y [x]. Since R is a p-ring, 0+ 0) = Pm
(184.77)
for some natural number m. Then p'"o = 0, thus from (184.53) and (184.76) it follows first that pmx E ao and then that
p E a.
(184.78)
From (184.47), (184.48) and the nilpotence of it it follows that (F(o)g)" = 0 for a natural number u. From (184.72) (F(x)x)" E ao, therefore according to (184.76) afortiori (184.79) (F(x)x)" E a.
On the other hand, we can show that
a c (p, F(x)).
(184.80)
If a(x) E a, then according to (184.76), a(x)xE ao therefore, according to (184.72), a(p)e = 0, consequently a(o)oa = 0. Hence it follows, according to (184.53), that a(x) x E (p, F(x)), a(x) E (P, F(x)). This proves (184.80).
RINGS
769
Because of (184.79) a is a primitive ideal of 7 [x], which according to (184.78) contains a power (> 1) of p. A simple application of the KRONECKER - HENSEL theorem (Theorem 286) shows that
a = (p`',pel GI(x), ...,pek-IGk-I(x), Gk(x))
(k ? 1),
(184.81)
where G.(x) is a principal polynomial of degree nj from _'Y [x] and
eo>...>ek_1>O; 0
(i = 2, ..., k).
(184.82)
Hence and from (184.79) we get at once (F(x)x)" __ 0 (mod p, GK(x)). Since therein F(x) and x are irreducible principal polynomials mod p of different degree, it follows that Gk(x) _- (F(x))°x'(mod p)
(184.83)
for non-negative integers v, w. According to (184.81), (184.82), (184.83) we have
a c (p,(F(x))°x').
(184.84)
Hence and from (184.80), because a # 7[x] we have V
We prove that
v=1 and w=0.
(184.85) (184.86)
If (184.861) is false, then, according to (184.85), v > 2, whence from (184.84) a c (p, (F(x))2), so that according to (184.76) ao c (px, (F(x))zx) .
(184.87)
We take a polynomial a(x) with
(a(x))' * a(x) (mod p, F(x)), a(x) E.' [xb
(184.88)
[because of (184.232) we could take, e.g., a(x) = x], further a polynomial b(x) with
a'(x) + F'(x)b(x) = 0 (mod p, F(x)), b (x) E 7 No
(184.89)
and put
f(x) = a(x) + F(x) b(x).
(184.90)
770
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
This, because of (184.88), satisfies condition (184.74). Thus there exists a
polynomial g(x) satisfying (184.75), for which, because of (184.87), a fortiori g(f(x)) = x (mod px, (F(x))-x). After differentiation we get
f'(x) g'(f(x)) = I (mod p, F(x)). But, on the other hand, according to (184.89), (184.90) we have
f'(x) = a'(x) + F'(-x) b(x) = 0 (mod p, F(x)). These two congruences are contradictory, so proving (184.861). In order to prove (184.862) notice that because of (184.232) (cf. § 133, Example 9)
x2tP-1t *I (mod p, F(x)), i.e., (184.74) is satisfied by f(x) = x2. According to (184.75) there is thus a polynomial g(x) with g(x2) = x (mod ao),
g(x) E -7 [x]o .
Hence by (184.76), (184.84), g(x2) = x (mod px, x`°+1), so that (184.862) follows.
According to (184.83), (184.86) we have Gk(x) = F(x) (mod p). But since
F(x) is only determined mod p, so after a suitable choice we have even Gk(x) = F(x).
(184.91)
We show that necessarily k = 1. For this purpose we assume that k ? 2. Then according to (184.82) and (184.91) Gk_I(x) is a non-constant principal polynomial such that F(x) = 0 (mod p, Gk - 1(x)). But since F(x) is irreducible
mod p, we have a contradiction and so we have proved that k = 1. According to this and (184.81), (184.91), a = (pep, F(x)), therefore from (184.76) ao = (pe°x, F(x)x).
On the other hand, from (184.72), (184.77) it follows at once that m is the least natural number such that pmx E no, thus necessarily eo = in. This means that ao = (p'"x, F(x)x);
this is the desired determination of ao.
(184.92)
RINGS
771
Because of (184.72) and (184.92), pmQ = 0 and F(o)o = 0. Hence and from (184.27), (184.32), (184.511) and (184.68) it follows that R is a homomorphic image of the ring R11 [defined by (184.3) and (184.4)]: R11 - R.
(184.93)
From (184.70), (184.73) and (184.92) we have (184.94)
O(R) = p(m+1)¢
where it is taken into consideration that F(x) is of degree Q = qe. On the other hand, it follows from (184.3) and (184.45,6) that the elements of R11 may be written in the form (184.9), moreover because of (184.41,2.3.4) it is clear that every element of R11 arises (at least once), when a(x), b(x) are restricted to the representative systems listed according to (184.9), so that 0(RI1) <
p(m+l)qc.
(184.95)
From (184.94) and (184.95) it follows that the homomorphy (184.93) is an isomorphy, as was asserted for this case.
Summarizing, we have so far proved the assertion of Theorem 447 that every non-nilpotent finite ring is either a ring R1 or a ring opposed to a ring R1 or a ring R11.
In order to prove the remaining assertions of Theorem 447, let us consider first the ring R1 [defined by (184.1) and (184.2)]. We have recourse to the (free) ring SJ'I = {x, y}
(184.96)
with the free generators x, y. (We assume the subring {x} of 91 to be identified
with 7 [x]0.) Because of (184.1) and (184.2), the isomorphism R1 czz; 91/r (O -- x + r, a -- y + r)
(184.97)
holds for the ideal r of R1, where
r = (p'"x, py, x2 - x, xy - y, yx, y) It is obvious that the elements
ax + by (a=0,...,pm- 1; b=0,...,p- 1)
(184.98)
represent all the residue classes ofd{ mod r. We show that here each class will be represented only once. It is sufficient to prove that in particular
only for a=b=0.
ax + by E
(184.99)
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
772
By the substitution x = 1, y = 0 the relation a E (p') is obtaineu from (184.99), where on the right is meant an ideal of 7, so that a = 0. According to this, (184.99) is reduced to by E r. If we carry through the substitution
x=E11=
f0
y=E12=1o
01'
off,
where we have matrix units over the prime field of characteristic p, then we obtain b E12 = 0, thus b = 0. Consequently we have proved that every residue class mod t of R is represented exactly once by (184.98). Since the number of these representatives is prntl, the assertion (184.5) follows from (184.97). As a result of (184.97) the representatives (184.98) themselves give by
means of the substitution x = e, y = a all the different elements of R1. This means that e, a are basis elements of RI and (184.6) holds. The validity of (184.7) follows trivially from (184.23.4.5,6) Because of (184.62), a is diffej ent from 0, thus RI according to (184.24,5)
is not commutative. In order to prove that RI is one-step non-commutative, it is thus sufficient to show that RI is generated by any two noninterchangeable elements
a=aee+ba, j9=cO+da
(a,b,c,dEJ).
By hypothesis, [a, fi] = ap - fSa 0 0. This, according to (184.7), means
the same as (ad - bc)a # 0, whence according to (184.62) it follows that p f' ad - bc. This means, by (184.6), that a, fi are basis elements, indeed, generators of RI. Therefore RI is one-step non-commutative. According to (184.23) and (184.61), 0 is an idempotent element other than 0, so that RI is not nilpotent. Because of (184.5), p and m are invariants of RI. Hence it follows that none of the RI belonging to the pairs p, m are isomorphic.
RI has, according to (184.7), the p left annihilators ba (b = 0, ..., p - 1), but only the right annihilator 0. This, together with what has
been said above, means that among all the R, and the rings opposed to them, none is isomorphic. Consequently we have proved the assertions of Theorem 447 referring to the rings R,. We still have to consider the rings RII [defined by (184.3) and (184.4)].
In order to prove (184.8) to (184.10) we will construct a skew product of the factor rings A
=,Y[x]o/(pn,X'F(x)x),
F = Y[x]/(p, F(x)).
(184.100)
Here,
0(A) =
p,nq'
,
O(F) = pqe
,
(184.101)
RINGS
773
furthermore F is a field, since (p, F(x)) is a maximal ideal of 7 [x]. We denote by R' the set of all the (ordered) pairs (a, fl)
(184.102)
((x E A, fi E F)
and adopt the convention that (a(x), b(x))'
(a(x) E,7 [x]o, b(x) E 7 [x])
(184.103)
shall mean the element (184.102) of R' formed from the residue classes
a = a(x) + (pmx, F(x)x), f = b(x) + (p, F(x)).
(184.104)
Then we define addition and multiplication in R' by (a(x), b(x))' + (c(x), d (x))' =
= (a(x) + c(x), b(x) + d(x))',
(184.105)
(a(x), b(x))' (c(x), d(x))' =
= (a(x) c(x), a(x) d(x), + b(x) (c(x))P)',
(184.106)
where a(x), c(x) and b(x), d(x) are polynomials of Y [x]o and 3' [x], respectively, and P denotes the same power of p as in (184.4). Because of (184.100) it is evident that both compositions (184.105), (184.106) are unique. Moreover, because of (184.102), O(R') = O(A) O(F).
(184.107)
Next we show that R' is a ring. Because of (184.105) R' evidently constitutes a module. In order to verify
the associativity of the multiplication, we denote by A and B the first and second factor, respectively, on the left-hand side of (184.106), further we take a third element C = (f(x), g(x))' of R' where f(x) E,7 [x]o, g (x) E E 3[x]. Then, according to (184.106), (with abbreviated notations)
AB = (ac, ad + bc1')',
BC = (cf, eg + df P)',
(AB)C = (acf, acg + adfP+ bc"f P)' = A(BC). This proves the associativity of multiplication. The distributivity is seen at once from (184.105) and (184.106) since (u(x))P + (v(x))"
(u(x) + v(x))P (mod p)
Accordingly R' is a ring.
(u(x), v(x) E
774
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
We show that R' is generated by the two special elements (x, 0)', a, _ (0, 1),
(184.108)
i.e. that R'
o' a'),
(184.109)
and show that the arbitrary element (184.103) of R' may be expressed in the form (a(x), b(x))' = a(o) + b(e)a. (184.110)
Namely, if b(x) lies even in 7[x]o, then, according to (184.105) and (184.106), the left-hand side of (184.110) is equal to (a(x), 0)' + (b(x), 0)'(0,1)'. Again, from (184.105), (184.106) and (184.1081),
(u(x), 0)' = u((x, 0)') = u(Q)
(u(x) E .7[x]o).
According to this and (184.1082) the previous expression becomes the right-
hand side of equation (184.110), which now proves this. In the general case we may put b(x) = bo(x) + k
(bo(x) E 7[xlo, k E 7).
According to the previous special case of (184.110) and (184.1082) (a(x), b(x))' _ (a(x), bo(x))' + k(0, 1)' =
= a(o) + bo(o) a + ka = a(o) + b(o)a. This completes the proof of (184.110). For the elements (184.108), from the definition of (184.103) and from (184.105), (184.106), (184.110) we have the following:
pm = (Pmx, 0)' = 0, P a' = (0, P)' = 0,
F(o') o' _ (F(x)x, 0)' = 0, F(o') a' _ (0, F(x))' = 0,
UV _ (0, 1)' (x, 0)' _ (0, x")' _ (x", 0)' (0, 1)' = o' Pa
a'2=(0,1)'2=(0,0)'=0. Since these equations agree formally with (184.4), the homomorphy relation as in (184.93) with R' instead of R follows from (184.3) and (184.109). The order relation similar to (184.94) likewise holds with R'
instead of R because of (184.101) and (184.107). From these two relations R' which is no longer
and from (184.95) we obtain (the isomorphy R
RINGS
775
of interest, and at the same time) the validity of (184.95) with "_" instead i.e., that of (184.8) and, furthermore the validity of the part of of the assertion about (184.9) which remained unproved before (184.96), (namely that we have uniqueness as asserted in (184.9)). The rule (184.10) follows at once from (184.42,5,6).
For the proof of the remaining assertions about R11 we need some preparation.
The following rules hold for the elements of RII with the notations used in (184.9):
a(o) + b(o) a = 0 = a(o) = 0, b(o)a = 0, a(o) = 0
(184.111)
0 (mod p'" x, F(x) x),
a(x)
(184.112)
b(o)a = 0 -tz> b(x) = 0 (mod p, F(x)).
(184.113)
These follow from the uniqueness proved for (184.9). Since (p, F(x)) is a prime ideal of ..9'[x], we have
f(o)a 0 0, g(o) a 0 0 =f(o)g(o) a
(184.114)
0,
for polynomials f(x), g(x) from.Y[x] because of (184.113). For the commutator of two arbitrary elements a = a(o) + b(e) a, fl = c(o) + d(o)a (a(x), c(x) E _7 [x]o; b(x), d(x) e 7 [x])
(184.115)
of R11 we have from (184.10) that
[a, R] == ((a(e) - a(o)' (d(e) + b(o)(c(o)' - c(o)))a,
(184.116)
whence according to (184.113)
[a, j9] = 0 p (a(x)' - a(x))d(x) = b(x)(c(x)1 - c(x))
(mod p, F(x)). (184.117)
We now want to prove that R11 is one-step non-commutative. According
to (184.45) we have [o, a] = (o - o")a. But according to Theorem 308
x° - x
0 (mod p, F(x)).
From these and from (184.113) [o, a] 0 0, so that RII is not commutative. We consider two arbitrary non-interchangeable elements a, 9 of RI1 in the form (184.115), in order to show that then necessarily R1, = {a, #1,
(184.118)
776
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
whereby we shall have proved that R11 is one-step non-commutative. We carry out the proof so that we first reduce the assertion by several steps to a simpler case. By interchanging x and j9 only the two sides of the congruence in (184.117) are interchanged. As the order of succession of these elements is immaterial and this congruence itself, because of the assumption [x, fl] 0 0, is not satisfied [thus at least one side of it is # 0 (mod p, F(x))], so we may assume that
a(x)' * a(x) (mod p, F(x)).
(184.119)
Because of (184.119) a polynomial f(x) exists such that f(a(x)) = x (mod p, F(x)),
f(x) E 7 [x]o
.
(184.120)
For, on account of Theorem 310 the assertion is in the first place true with 7[x] instead of 7[x]o, but then obviously (184.1202), too, may be required. As a first reduction step we show that in the proof of (184.118) we may
restrict ourselves to the case c(x) = 0 (mod p, F(x)).
(184.121)
For, we deduce from (184.1201) the existence of a polynomial g(x) _
= c(f(x)) such that g(a(x)) = c(x) (modp, F(x)).
(184.122)
With this g (x) we form NI = fB - g(x).
(184.123)
We write this element of RI, [as in (184.1152)] in the form
f1 = cllo) + di(ct)a
(c1 (x) E ,Y[x]o ,
d1(x) E 7'[x]).
(184.124)
Since according to (184.123) {x, #1} = {x, 13}, the assertion (184.118) means the same as R11 = {x, #1;. Since, moreover, from (184.123) and 0 evidently follows, it is sufficient from [x, j3] 0 the inequality [r, NI] to show that (184.121) is satisfied for c1(x) instead of c(x). To do this we bear in mind that it follows from (184.1151) that g(x) = g(a(O)) (mod o) (in R1i). Similarly the congruence g(a(o)) = c(s) (mod pp, a)
RINGS
777
follows from (184.43) and (184.122) [because g(a(x)), c(x) E J [x]6]. Because of (184.1152) and (184.123), Nl =_
0 (mod po,a).
According to this and (184.42,5,2), Pr = 0. Because of (184.46) and (184.124)
we can replace this by c1(o)a = 0. Since, by (184.113), this means that (184.121) is satisfied by cl(x) instead of c(x), we may in fact restrict ourselves to the case (184.121). Because [a,#] 0 0 it follows from (184.117) and (184.121) that
d(x) * 0 (mod p, F(x)).
(184.125)
As a second reduction we show that we may restrict ourselves [in the proof of (184.118)] to the case d(x) = 1.
(184.126)
a' = d(o)a.
(184.127)
For this purpose we put Because of (184.125) there is a polynomial u(x) (E,7 [x]) with u(x)d(x) __ 1 (mod p, F(x)). According to (184.42,4), u(o)d(o)a = a. Hence and from (184.127) it follows that
a = u(o)a'.
(184.128)
Because of (184.3), (184.127), (184.128) we have R11 = {o, a} = {o, a'}. Again, it is evident that the equations (184.42.4,5,() are satisfied for a' instead
of a. Both together mean that o, a' and o, a are equivalent pairs of generators of R11. Since (184.115), according to (184.127) and (184.128), may be represented in the form a = a(o) + b(o)u(o)a', /1 = c(o) + a',
this means [after comparison with (184.115)] that we may in fact restrict ourselves to the case (184.126). Accordingly, (184.115) transforms [with unaltered a(x), c(x), but with new b(x)] into (184.129) a = a(e) + b(o)a, fl = c(o) + a.
As a final reduction step [for the proof of (184.118)] we show that we may restrict ourselves to the case b(x) = 0.
(184.130)
778
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
With this end in view we put
al =
(184.131)
with some polynomial h(x) from J [x]o, to be soon defined more explicitly below. Since from (184.131) [for any h(x)] the relation {al, #19 {a, #1 follows, it will be sufficient to prove our assertion (184.118) for 0c1
instead of a. In order to choose h(x) suitably we first transform (184.13 1). According to (184.1291) we have h(a) = h(a(Q) -}- b(Q) a) .
Hence, according to (184.45,6), h(a) = h(a(e)) + v(o)a
(184.132)
for some v(x) from r [x]. Because of (184.42, 4) and (184.121), c(o)o = 0. Thus, according to (184.10), (184.1292), (184.132) h(a)/3 = h(a(o))c(p) + h(a(o))o.
From this and from (184.1291), (184.131) we obtain
a.1= a(o) - h(a(o))c(o) + (b(o) - h(a(e))a.
(184.133)
We now choose h(x) so that the part multiplied by a on the right-hand side vanishes. Since this requirement, according to (184.113), means that b(x) - h(a(x)) = 0 (mod p, F(x)), it is, according to (184.120), sufficient if we take h(x) = b(f(x)). With this choice of h(x), (184.133) becomes (184.134) al = a1(o) for some polynomial a1(x) from -X[x]o, for which, according to (184.121),:
al(x) = a(x) (mod p, F(x)).
(184.135)
Comparison of (184.134) with (184.129) shows that we may in fact restrict ourselves to the case (184.130). When we also take (184.135) into consideration, the result obtained means that in the proof of (184.118) for [a, 14] 0 0 only the case where a = a(o), 9 = c(o) + a
(184.136)
RINGS
779
remains, where (184.119) retains its validity for the "new" a(x); since # has not been altered in the last reduction step, (184.121) also holds unaltered. (Because of (184.118), (184.119) and (184.121), [x, #] 96 0 is also valid
for (184.136), but this is not required in the part of the proof to follow.) Now it is easy to prove directly (viz., without further reduction) the assertion (184.118) for (184.136). It is sufficient to show that e E {a}, for, from this and from (184.1362), it follows that a E {a #}, thus {e, a} C C {a, #} which, because of (184.3), is equal to (184.118). The remaining assertion that p E {a} requires the existence of a polynomial k(x) from 5' [x]o such that e = k(a). Because of (184.1361) and the rule (184.112) we have thus only to verify the possibility of fulfilling the conditions x =_ k(a(x)) (mod p'", F(x)),
k(x) E
[x]o .
(184.137)
(This is sufficient since it already follows, from (184.1362) and the fact that a(x) E 7 [x]0, that x I k(a(x)).) Because of (184.120) the condition (184.137) without the exponent m is to be satisfied. Suppose that (184.137) holds for an exponent I (= 1, .. m - 1) instead of m. Then we have
x =_ k(a(x)) + p'w(x) (mod F(x)) for some polynomial w(x) from.7 [x]o, thus according to (184.120)
x = k(a(x)) + p'(w(f(a(x)))) (mod p1+1, F(x)).
This means that (184.137) can be satisfied for I + 1 instead of m, thus by induction also (184.137) itself. This completes the proof that R11 is one-step non-commutative.
From (184.112) it follows that all the Q' (i = 1, 2, ...) are different from 0, so that R11 is not nilpotent. We now turn to the assertion of Theorem 447 that R11 does not essentially
depend on the special choice of F(x), i.e. we obtain a ring isomorphic with it when F(x) is replaced by another, irreducible mod p principal poly-
nomial F1(x) (E 7[x]) of the same degree q0. It is sufficient to verify the existence of an element jol in R11 with R11 = {pig a},
(184.138)
for which the equations (184.41,2..,5) are satisfied with of and F1 instead of a and F. As a preliminary we show that there is a polynomial a(x) with
F(a(x)) = 0 (mod p', F(x)),
a(x) E,7[x]0
(184.139)
780
FINITE ONE-STEP NON-COMMUTATIVE. STRUCTURES
and note that for this a(x) the relations
F;(a(x)) * 0 (mod p, F(x)),
(184.140)
a(x)" * a(x) (mod p, F(x)) (184.141) necessarily hold. Without the exponent m (184.139,) can be satisfied, according to Theorem 312, with an a(x) from J' [x]. Evidently (184.1392) may also be prescribed. Further from (184.139), according to the same theorem, (184.140) and (184.141) also follow. Suppose that (184.139) is satisfied for an exponent !(= 1, . . ., m - 1) instead of m. We show that then
F,(a(x) + plb(x)) = 0 (modp'+', F(x))
has a solution b(x) (E7[x]o), so proving our statement by induction. The last congruence is identical with
Fl(a(x)) + p'F,(a(x))b(x) = 0 (modp'+I, F(x)). Because of the supposition and because of (184.140) this has a solution of the required type. Consequently the assertion is proved. We now form with the help of a polynomial a(x) from (184.139) the element
ei = a(e) of R,I and show that for this element (184.138) and the properties postulated there hold. Since, according to (184.139), we have
Fi(a(x)) a(x) = 0 (mod p'"x, F(x)x) .
it follows from the rule (184.112) that
FI(a(e)) a(g) = 0 . From (184.139) itself
FI(a(e)) or = 0 according to the rule (184.113). According to (184.10)
aa(e) = a(e)" a. From (184.41) pa(o) = 0. These four equations mean that the equations (184-41,3,4,5) are satisfied by e, and F, instead of o and F. Furthermore it follows from (184.113) and from (184.141) that a(g)pcr 0 a(g)o r.
781
RINGS
Thus according to what has been said above a(e)a 0 aa(e), i.e., [a(e), a] 0. Hence (184.138) follows. This proves the assertion of Theorem 447 concerning the neutrality of the choice of F(x). We now prove that p, q, m, e, n are invariants of R11; hence it will follow that of the RII given in Theorem 447 none are isomorphic. According to what has been proved concerning (184.9), RII has the basis elements e`
(i= 1,. ..,qe),
e'a(i=0,...,qe- 1),
and the first or last of these have the additive order p' or p, respectively. Hence it follows that p, m, q, e are invariants of RII. The invariance of n has still to be demonstrated. On account of (184.3) and (184.45.6) it is sufficient to prove the following: If e', a', t are two elements of R and an integer, respectively, with
R _ f e', a' }
(184.142)
,
a'2 = 0,
aq' = e' 'a'
(184.143)
(184.144)
,
0
(184.145)
then t = nq` -1 .
(184.146)
For the proof we write e' = a(e) + b(Lo)g, a' = c(e) + d(e)a (a(x), c(x) E -'Y [xb ; b(r), d(x) E ..-' [x]) .
(184.147)
From (184.46) and (184.1472) it follows that a'a = c(e)a. According to this, (184.10) and again (184.1472), a12 or = c(o)2a. Because of (184.143) this gives c(e)2 a = 0, whence, from the rule (184.114) even
c(e)a = 0.
(184.148)
By (184.142), [o; a'] + 0. From formula (184.116) and (184.147), (184.148) we then have (184.149) (a(e) - a(e)") d(e)a 0 . But, from (184.10), (184.147), and (184.148) we obviously have
a'o' = a(e)c(e) + a(e)"d(e)a, o °`a' = a(e)"c(e) + a(e), d(o)a
.
782
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
According to (184.144) and (184.111) we therefore have
(a(o)' - a(o)") d(e)o' = 0 . This and (184.149) give, according to rule (184.114):
(a(Q) - a(P)P) r
0, (a(P)" - a(o)°`) cr = 0.
These imply, by rule (184.113), the same as
a(x)" * a(x) (mod p, F(x))
,
a(x)°` - a(x)P (mod p, F(x))
.
Consequently, if on account of (184.1002) we take a generator x of F, where F(K) = 0, then it follows that (a(K)) P
a (K) ,
(a(K))" = (a(K))P .
Because of (184.1012) it follows from the first that a(K) is also a generator
of F, and from the latter that t = n qn-I (mod qe). Because of (184.145), it then follows that the equality t = n qe-I, i.e., (184.146) holds. We have proved that p, q, m, e, n are invariants of RI,, thus in the RII given in Theorem
447 there are no isomorphic rings. When we have proved that no ring RII is isomorphic with a ring RI or with one of the rings opposed to the R1, we shall have proved Theorem 447. The truth of this assertion follows from the fact that RI has left annihilators
other than 0, as noted above, while obviously there are no left or right annihilators different from 0 in RII. Consequently the proof of Theorem 447 is now completed. PROOF OF THEOREM 448. This needs some preparatory work. LEMMA 1. If for a ring R and a subring S of R
R = R2 + S
,
(184.150)
then
R = Rk + S
(k = 2, 3, ...) .
(184.151)
Therefore, for a nilpotent ring R, (184.150) can be satisfied only by S = R. Let us assume (184.151) for some k (>_ 2). Hence and from (184.150)
R=(R2+S)k+S c Rk+I+Sk+S= Rk+I+S. This implies that (184.151) holds for k + t instead of k, and so generally. LEMMA 2. In a minimal system of generators 01, w2, ... of a nilpotent ring R, no co; may be replaced by an w; coy or coj w; (j arbitrary).
Let S denote the (proper) subring {co, ..., co;_,, w;+I.... } of R. If the assertion is false, we have R = {S, w; ww} or R = {S, w;coif'
RINGS
783
for some i, j. Then the subring R2 + S of R contains a system of generators of R, whence (184.150) follows. Therefore, according to Lemma 1, S = R. This contradiction proves Lemma 2. In order to prove Theorem 448, let us first consider a nilpotent finite
one-step non-commutative ring R. We have to prove that this is a
homomorphic image of a ring Rill. Because of the assumption, R is necessarily a p-ring, therefore there are two elements e, a in it with i.e., {e, a} 56 0.
(184.152)
We put
o+(e) = P', o+(q)=p", o(e) = r , o(a) = s ,
(184.153)
where m, n z 1 and r, s z 2 necessarily hold. Because of the nilpotence of R, r and s denote the least natural numbers for which e` = 0 and a' = 0, respec-
tively. Of all the possible pairs e, a we choose one for which m + n is minimal. ThenPe, a-is not a system of generators of R, and so we have p [e, a] = [Pe, a] = 0, i.e., pea = Palo .
(184.154)
In all our assumptions the order of succession of e, a is immaterial, therefore
we can also assume that m 5 n. Because of (184.1521), Lemma 2 implies that all the four products o2, ea, ae, a2 are interchangeable with both a and a. Therefore e2a = eae = ao2
,
eat = aea =ate .
These equations and (184.1521), (184.153), (184.154) imply that R is a homomorphic image of the ring R1; [defined by (184.11) and (184.12)], as stated. Conversely, let us consider the ring RIII. In order to prove assertions (184.13) to (184.15), we have again recourse to the free ring 91 _ {x, y}, introduced in (184.96). For its ideal r = (Pmx, P"y, x', y', x2y - xyx, xyx - Yx9, xy2 - YxY, YxY - y2x,
p(xy - yx))
(184.155)
we have according to (184.11) and (184.12) the isomorphism
Rlll : SR/r(e-x+r,a-.y+r). 26 R.-A.
(184.156)
784
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
Because of (184.155) and since m < n, the elements
a =0,...,p -1, au=0,...,p'-1;i>0
ayx + Y_ a;jx'y'
.i
(184.157)
a01=0,...,p^-1
represent all the residue classes mod r of !1, where we have to sum over the i, j for which
0
r-1; 0<_j<_s-1; i+j>1.
We show that each class is represented here only once. It is sufficient to prove that (184.157) lies in r only if a and all the a,1 vanish. In the following proof we shall use the concepts "degree", "homogeneous"
and "homogeneous component" in the free ring 91 with similar meaning as in polynomial rings. We assume that (184.157) lies in r. Since r, s Z 2 and because of (184.155) r S (px, py, x2, xy, y2). Consequently it follows by hypothesis that ayx + a10x + aoly E (px, py, x2, xy, y2) .
Since the ideal on the right-hand side has homogeneous generators, it must contain (cf. § 67, Examples 6, 7) the homogeneous components of the left-
hand side, and, in particular, ayx E (px, py, x2, xy, y2)
From this it follows that ayx E (px, py), thus p I a. But since a only takes the values 0, ..., p - 1, we have
a=0.
(184.158)
We still have to show that all the a1 are equal to 0. We suppose that this is false and denote by u (0 < u <_ s - 1) the largest integer with the property
that all the au such that j < u are equal to 0. If (184.157) is multiplied from the right by ys-1-", then the product so obtained lies in the ideal r, thus, because of (184.158) Y_
ar,.x'ys-1-"+j E r
.
e..i
Here, according to the definition of u, the terms for which j < u disappear. Since, further, according to (184.155) we have y' E r, the terms for which s - 1 - u + j > s also disappear, i.e., the terms for which j > u may be omitted. After this only the terms for which j = u remain : a, x'ys-1 E r .
RINGS
785
(According as u = 0 or u > 0, one has to sum here over i = 1, ...,p - 1 or i = 0, . . ., p - 1, respectively.) The terms of the left-hand side are of different degree. But t has, according to (184.155), only homogeneous generators, so that the mentioned terms must each lie in t. Among these terms
there is, according to the definition of u, one which is different from 0. Suppose that ar" 0. According to what has been said we have armxrys-I E t .
Because of (184.155), army 'YS -1 E (pmx, p"y, xr, .3's, xy - Yx) .
since (after adding xy - yx) the cancelled generators of t lie in the principal
ideal (xy - yx) . Because i < r and s - i < s it follows that a;" xry"-1 E (pmx, p"Y, xy - yx)
(184.159)
.
The substitution y = x results in a,,,x`+s'' E (P-x, P"x) , where we now have an ideal of .7 [xb on the right. Because m _:!9 n it follows that Pm
a:"
But since for i > 0 a;,, only takes the values 1, ..., p-1, because of (184.157) and since a1 # 0, i = 0 (and u > 0). In the remaining case i = 0, (184.159) reads as follows: aouys-1 E (pmx, p"y, xy - yx)
After the substitution x = 0, y = x we obtain
.
E (p"x), thus
p' I a... But since, because of (184.157) and because ao, 0 0, aou only takes the values
1, ...,p"-1, we. have proved by this contradiction that the elements (184.157) represent all the different residue classes of . mod r. The number of these representatives (184.157) is 1
,
whence, because of (184.156) the validity of (184.13) follows. From (184.11), (184.123, 4, s.6) it follows at once that the elements 'Oo' -
eY, mentioned before (184.14), are generators of the module Ri1I+. On the other hand, from (184.1 21, 2, 7) and m < n it follows that for these elements equations (184.14) hold with "<-" instead of °°_". But since the
786
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
product of the right-hand sides of equations (184.14) is exactly the right-hand side of equation (184.13) as already proved, we see at once that the generating elements mentioned constitute a basis of R111, i.e., of and all the equations (184. 14) are true.
If we multiply oa - aO from the left or right by a or a, then according to (184.125,6) we always obtain 0. Hence (184.15) follows.
Because of (184.125,6) the module of R1, (k 3) has the generators o'a"(i + j > k) . Hence and from (184.123,4) we have Rii1S-1 = 0 and so RI,I is nilpotent. Lastly we wish to show that the ring RIII is one-step non-commutative. Since the element ga - op, according to (184.141), is different from 0, RIII is not commutative. Therefore it is sufficient to prove that R111 is generated by any two non-interchangeable elements a, fl. For the proof we take these two elements a, j3 as the first or second factor, respectively, on the left-hand side of (184.15). It then follows from (184.15) that [a,
P]=ap-fa=la0a0(0a-ago). bl,
1
Because of (184.141) and the assumption that [a,#] 0 0, the determinant on the right-hand side is not divisible by p. Therefore a and a may be written as a sum of two elements, of which one belongs to the submodule of R11, generated by a and 8, the other to the ring RII1. Since; according to (184.11), p, a are generators of R111, we obtain R111=S+ RI11,
where S denotes the subring {a, f} of R111. Because of the nilpotence of Rm, if we apply Lemma 1, we obtain S = 8111. This means that R111 is generated
by a and f, i.e., it is one-step non-commutative. Consequently Theorem 448 is proved. Cf. R>DEI (1957).
EXAMPLE. The ordinary quaternion field is a one-step non-commutative skew field.
§ 185.* Finite One-step Non-commutative Semigroups
According to the general definition a one-step non-commutative semigroup means a non-commutative semigroup whose proper subsemigroups are all commutative. THEOREM 449. In order to give the finite one-step non-commutative semigroups, we define first for natural numbers r, a (r > a > 1) the semigroup
S1 = (, a}
(185.1)
SEMIOROUPS
787
by the -equations er = ea , a2 = ,7' Ua = (?,
2
2
,,
(185.2)
secondly for the natural numbers r, a, e (r > a > 1) and for a prime number p we define the semigroup (185.3)
S11 = {Lo, a}
by the equations
or = e, ape+I _ a, ape = eap, LOaiI = e, L02ak = eakQ = ake2
(k = 1, 2, ...) ,
(185.4)
thirdly for the natural numbers r, a, s, b (r > a > 1, s > b > 1) we define the semigroup Sin = {Lo, a}
(185.5)
by the equations Qr = 9a, or'
oure = aLo2 ,
Dr
(185.6)
a2LO = aqa = LOa2.
In addition, we denote by S1V a semigroup consisting of two right units. These semigroups S1, . . ., S1V, and their non-commutative homomorphic and
anti-homomorphic images and, finally, the finite one-step non-commutative
groups defined in Theorems 444, 445 are, apart from isomorphy, all the finite one-step non-commutative semigroups. S1 has order
O(S1) = r + 1
(185.7)
ee'(i= 1,...,r - 1),a,ago,
(185.8)
O(Sn) = pe(r + p - 1)
(185.9)
and consists of the elements
S11
has order
and consists of the elements ahoak, oiak, a1+k
(i=2,...,r- 1; h=0,...,p- 1; k=0,...,pe- 1),
(185.10)
finally, 5111 has order 0(S111) = rs
(185.11)
and consists of the elements
Loiak(i+k>-,1; i=0,...,r- 1; k=0,...,s- 1),aLo. (185.12) 26/a R.- A.
788
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
NoTE. In order to give a complete system of non-isomorphic finite one-
step non-commutative semigroups, we still have to solve the homomorphy problem for the semigroups S, S11, 5111. This is simple for S1 and Sn, but for S111 it is an extremely fatiguing task. In order to prove the theorem, first of all we have to take into considera. tion the fact that, according to Theorem 33, a finite group has only groups
as subsemigroups. It follows that the only groups occurring among the finite one-step non-commutative semigroups are the finite one-step noncommutative groups. Consequently our proof should consider only those semigroups which are not groups. We now wish to prove the assertions listed in (185.7) to (185.12). From the defining equations (185.2), (185.4), (185.6) for S1i 511, S111 it follows that all the elements of these semigroups occur among the elements quoted in (185.8) (185.10) and (185.12) respectively. Therefore it is sufficient to
prove that these elements are all distinct in the three cases, for (185.7), (185.9) and (185.11) then follow by counting the elements. The fact that the mentioned elements are different, will be proved by Theorem 101, when we consider three semigroups belonging to the three equations (185.2), (185.4) and (185.6), respectively, for which we retain the notations (185.1), (185.3), (185.5), and show that the relations (185.7), (185.9), (185.11) hold for them
with Z instead of =. In all three cases we shall define the generators 'o, a as certain mappings of a set S1, S11 or S111, respectively, into itself.
For non-negative integers t, in, n (0 < m < n) we denote by (t);, that integer, for which
(t)M' = t (t < n) or
(t)om,-I (mod n - m), m 5(t)7
We now accomplish the definition of the semigroup S1 = {p, a}, starting from the set
S1=<0',1',0,...,r- l>
(where 0, 1' are to be regarded as pure symbols) and defining the mappings e, a of this set into itself by
QO'=1, 0l'=2, gx=(x+1)Q (x=0,...,r- 1), and
a0'=0', al'= 1 ' ,
F
ax =x (x=2,...,r- 1).
From this definition we have P'0' = (i)Q, O'1' = (i + 1)Q, c'x = (x + i)Q
(x=0,...,r - 1; 1 = 1,2,...);
SEMIGROUPS
789
and in particular
e'= e Further it is easy to show that
a2=a, ea= e,
ae2= e'
-
This means that SI belongs to the equations (185.2). Since, moreover, the images
eb=i(i=l,...,r- 1), aO=O', aeO=1' of the element 0 of SI are different, it follows that 0 (S1) z r + 1. For the definition of S11 = { e, a) we denote by S11 the set of elements x, (Y, X), [u, z]
(x=0,...,pe; y=0,...,p- 1; z=0,...,pe- 1; u=2,...,r- 1), which are to be regarded as pure symbols without any particular meaning, and define the mappings e, a of this set into itself by
ex = (0 , (x),) ,
a(y, z) _ [2, (y + z),e] ,
e [u, z] _ [(u + 1)a, z]
and
ax = I + (x)p<, a(y, z) _ ((y + 1)p, (y + z + 1 - (y + a[u, z] = [u, (z + 1)pe] ,
where (t)e denotes the least non-negative residue oft mod c (t = 0, 1, ...; c = 1, 2, . . .). Then the following relations hold: e`X = [(i)a, (x),,e], e'(Y, z) = [(i + 1)4 , (y + z),O], e,[u, z] _ [(u + i)a, z]
(i=2,3,...), akx = 1 + (x + k - 1)p"
ak(y,z)=((y+k)p,(y+z+k)-(y+k)p)_), ak [u, z] = [u, (z + k)p,]
(k= 1,2,...). Hence we see that for i > a, j z a, i m j (mod r - a) we always have ej = e', and in particular,
e'= a.
790
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
Moreover ape+i
Further, it follows that apx = 1 + (x + p - 1),e ,
ap(y, z) _ (y, (z + P),) ,
ap[u,z]=[u,(z+P)p], and then apox = (0, (x + P),,) = eapx,
ape(y, z) = [2, (y + z + p)] = ea°(Y, z) , ape [u, z] = [(u + 1)a, (z + P),] = eae [u, z] ,
so that
ape= eap.
We also obtain ap°X = 1 + (x + Pe
ap`(y, z) _ (y, Z),
ap`[u, z] = [u, z] .
Hence
oa,'x = (0, (x),e) = ex, ea°`(y, z) = e(y, z)
,
eap1[U, z] = e[u, Z],
so that
oap`=o.
Finally we obtain o2x = [2,(x),]
,
e2 (y, z) = [(3)a, (y + z),] , o2[u, z] = [(u + 2)a, z] ,
oakx = (0, (x + k)p.) , Qak(y, z) _ [2, (y + z +
k),9]
,
eak[u, z] = [(u + 1)a, (z + k),,] (k = 0, 1, ...) , and then e2akx
= [2, (x +
eakex
= ake2X ,
e2ak(Y, z) = [(3)a, (Y + z + k)] = oake(y, z) = ake2(y, z)7,
02ak[u, Z] = [(u + 2)a, (z + k),,] = eake[u, z] = ake2[u, z] ,
so that e2ak = eake = ako2 (k = 1, 2, ...)
791
SE IIGROUPS
Consequently S11 belongs to the equations (185.4). Since the images
akeakO=o'(0,k)=(h,k) (h=0,..1; k = 0,...,pe- 1), e'akO= e'--'(O, k)= [i, k] (i=2,...,r- 1; k=0,...,pe- 1) .
ai+k 0 = 1 + k
(k = 0, ..., pe - 1)
of the element 0 of S15 are different, O(S51) >- pe (r + p - 1), as was required. For the definition of SIIl we denote by S1II the set of elements
(x,y)
(x=0,...,r- 1; s=0,...,s- 1), 9,
which are again to be regarded as pure symbols, and denote the mappings
a of this set into itself by 60(x, y) = ((x + 1)p, y),
eQ = (2, 1)
and
a(x, y) = (x, (y + 1)b) (except for x = 1, y = 0)
,
a(l,0)=d2, crQ=(1,2). Then
(i = 2, 3, ...) .
60'(x, Y ) = ((x + i)Q, y), o'Q = ((1 + i)a, 1)
and
(k=2, 3, );
ak(x,Y)=(x,(Y+k)b). akSa=(1,(1 +k)%) so that 60' = 60° , as = b Moreover we have the following relations 60 2(x, y) _ ((x + 2)a, y) ,
602Q
((3)a, 1)
,
a2(x, y) _ (x, (y + 2)b) , a2Q _ (1, (3)b) ,
ea(x, y) = ((x + 1)a , (y + 1)b) , eaQ = (2, 2) ,
so that we obtain easily e2a(x, y) _ ((x + 2)0 , (y + 1),) = eae(x, y) = ae2(x, y) , e2a.Q = ((3)a ,
2) = eaeQ =
ae2.Q
and
a2e(x, y) = ((x + l )y , (y + 2)b) = aea(x, y) = ea2(x, y) , a2eQ = (2,(3)b) = aeaQ = ea2Q .
792
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
Hence
e2d'=eae-cie2, atLo =uea=Lo a2. Consequently SI1I belongs to the equations (195.6). Since the images
e'ak(0, 0) = e'(0, k) = (1, k)
(i+kZ 1; i=0,...,r- 1; k=0,...,s- 1), and
ae(0,0)=cr(1,0)=Q
of the element (0, 0) of SIII are different, 0 (SI11) >_ rs. Consequently the assertions in (185.7) to (185.12) follow. We denote the semigroups defined in (185.1) to (185.6) again by SI, SII, SIR, and now prove that they are one-step non-commutative. (The same holds for SIv.) It is sufficient to prove that the semigroups mentioned have non-interchangeable elements and are generated by any two such elements. Of all the elements (185.8) of SI only a and a, according to (185.2), consitute an (unordered) pair of non-interchangeable elements. From (185.1) it follows that they generate SI. Of all the elements (185.10) of S11, according to (185.4), only the
K=a"eo°, a=aw
(u=0,...,p- 1; v=0,...,pe- 1; w= 1,. ,pe;
P,f'w)
constitute non-interchangeable pairs of elements. If we determine a natural number t by wt = 1 (mod pe), then according to (185.4)
a=0.W1 =R'
0=
ap,Loap'=O,Pe_"KO'P'
=
= )(P'_u)1K;i.(Pe_v)t.
Hence, and from (185.3) it follows that K, ? are generators of SII. Among the elements (185.12) of SIII, according to (185.6), only a and Cr
constitute a non-interchangeable pair of elements and, by (185.5), they generate S111. Since, by the above, Si, . . ., S,v are finite and onestep non-commutative, the first part of the Theorem is proved. In order to prove the second part, we consider a finite one-step non-commutative semigroup S which is not a group. We may assume this to be of the form S
(185.13)
SEMIGROUPS
793
We have to show that S is a homomorphic or an anti-homomorphic image of one of the semigroups S1,. .., SIV. First of all we prove some propositions about S. PROPOSITION 1. If pS c S and
a, b, c, d, e,...
(185.14)
is a finite non-empty sequence of natural numbers with then
a+c+e+...?4,
(185.15)
eaabQcad ... _ pa+c+... ob+d{...
(185.16)
Before giving the proof we make the observation, important also for what follows, that oS is a subsemigroup of S and so by virtue of the assumption, commutative. We now denote by k and K the number of terms in the sequence (185.14) and the left-hand side of (185.16), respectively.
For k = 1, 2, the assertion (185.16) is trivial. For k = 3 we distinguish two cases : if c 2, then paab and p` lie in pS, thus we have K=
paab
pe = p'paab ,
so that (185.16) holds. In the other case, when c = 1, according to (185.15), we have a >_ 3. Hence, by an analogous reasoning, we obtain K = p2 . pa
tab p1
= pa
tab pe+Q,
so that we have reduced this case to the preceding one. The case k = 4, since
K= may be reduced to the case k
paob pc . Off,
3. Finally the case k >_ 5, since
paab . p°Qd oe = eeadpe+aab
may be reduced to a case with smaller k. Hence Proposition 1 is proved. PRoPosmoN 2. If eS e S, then OiOkp!_e'+Jok
(185.17)
For jk = 0 this is trivial. We suppose that (185.17) does not hold for some triple i, j, k of natural numbers. This means that the elements peak, pi are not interchangeable, i.e., S = (p'ak, ell .
794
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
In particular, a can then be expressed as a finite product or = e"a°eW ...
(u, v, w.... Z 1).
(185.18)
Since e, a are not permutable, the case a = e" is excluded in (185.18), i.e., at least v must exist in addition to u. Let us substitute for a factor a of the right-hand side of (185.18) this right-hand side itself. Then we again obtain
an equation of the form (185.18), where u + w + ... >_ 2. After a further such substitution we even have u + w + ... >_ 3. Then Proposition 1 may be applied to both the products (ea =) a"+Ia°ew ..., (ae =)
a"a"e"
... e,
by which their equality is established, i.e., ea = ae. This contradiction proves Proposition 2. PROPOSITION 3. If Se = S, then e°(e) is a right identity for S.
From the assumption it follows that S = So = See = ... Since S is finite, so the mapping at -* aek ((x E S) is a. permutation of S for every natural number k. Consequently there are two natural numbers k, 1(k < 1) such that ao' = aek for all at. If we write this equation in the form aek er-k = = aek and consider that Sek = S, then we see that eI-k is a right unit for S. Hence, in particular, e'-k+I = Q. Let n denote the least natural number
(therefore existing) with a"+' = e. Then e" is a right unit for S since Se = S but clearly n = o(e). Consequently Proposition 3 holds. PROPOSITION 4. If eS c S, Se = S, then all the powers e, e2, ... lie outside the centre of S.
This is obvious from eS, e2S,... c S and Se = See = ... = S. PROPOSITION 5. If there exists a natural number n with a"+' = or, then
ea`'s aye
((k,n)= 1; k= 1,2,...).
(185.19)
For the proof take some 1(>_ 1) with kl =_ 1 (mod n). From ea ae and a = (ak)I the assertion follows. PROPOSITION 6. If, for a fixed natural number n and for all the natural numbers k,
eak = ake a (k, n) > 1
(185.20)
holds, then n is a prime power (>_ 1).
If the assertion were false, we could write n = kl, where k z 2, 1 > 2, (k, 1) = 1. From (185.20) e would then be interchangeable with ak and a1, and so also with ak+I. But according to (185.20) and (k + 1, n) = 1, this is impossible, so Proposition 6 is true.
SEMIGROUPS
795
We now want to prove the assertion stated after (185.13). Not all the
eS, Se, cS, Sa can be equal to S, for then from (185.13) S = aS = Sa would follow for all elements a of S, which is contrary to the assumption that S is not a group. Since the order of succession of e, a, and an anti-automorphism of S are immaterial we may assume that (185.21)
OS C S.
In the following proof we distinguish four cases.
1. Let Se = Sa = S. According to Proposition 3 (applied also with a instead of e) S contains two right units of the form (m, n ? 1),
9m, an
from which we obtain em+1
= e
a+1 = a.
If we have eman = an
em.
i.e. em = a", then [partly from (185.21) and Proposition 21 we have
ae = an+ie = em . ae = emae = em+l a = U. But since this is false, we must have ?n
a tae. n
IT m
Hence S contains two non-permutable right units, which are therefore necessarily generators. Thus S contains no further elements whatever and is, consequently, isomorphic with Siv.
2. Let Se = S, Sa c S. According to the Propositions 3, 4, S contains a right identity em which is not a central element. Hence from (185.13) ema # aem.
Because eS c S, and Se = S, we also have eS c S; Sem = S, so that we may take em instead of e. In other words this means that a may be assumed to be a right unit. Then every element of S is of the form ak
(k = 1, 2, ...) or ea! (1= 0, 1, ...).
Next we suppose that there is a k (> 2) for which eak
A ake.
From this it follows that S = {e, ak} so that or E {e, ak}, and a is of the form
a=ak' (x
1)
or a =eak-v (p
0).
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
796
But the second case is impossible because p is a right unit for which ea 0 a, consequently the preceding case remains the only possibility. Because kx Z 2 there are thus natural numbers n for which
a"+1=a. We assume n minimal. Then kx 1 (mod n), thus (k, n) = 1. This result, together with Proposition 5, implies that the rule (185.20) holds. According to Proposition 6 we thus have n = pe (e z 0) for some prime number p.
If e z 1 then (p, n) > 1. Since (185.20) now holds and a is a right unit, we have
eaP+I =
ooP
, a = aPe , or = aP , a = aP+1 = aP+ie.
But this contradicts (185.20), so only the case e = 0, n = 1, a2 = a remains. By what has been said above all the elements of S occur among
a, e, ea. Then
aea
e
e = ea,
whence it follows that Sa = S. But as this contradicts the supposition, all the equations
(k = 2,3,...),
eak=ake must hold, in particular,
eat=ate=a2. From the above we must also have e2 = e,
ae = a.
The finiteness of S implies that
a' = a"
(for some r > a ? 1).
Here necessarily a > 1, since otherwise ea = ea' = a'e = ae would follow, which is false. We have found that all the equations hold which result from (185.2) after interchanging a and a. Consequently S is now a homomorphic image of S1.
3. Let Se c S, Sa = S. If aS c S, then after interchanging a and a [with regard to (185.21)] the assumptions of the preceding case are satis-
797
SEMIGROUPS
fled; therefore it is sufficient to consider the case aS = S. For the sake of convenience we summarize the suppositions:
eS,SocS; aS=Sa=S. These are self-dual, a fact which we have to consider repeatedly in the sequel. According to Proposition 2 we have
(i,j= 1, 2, .. .; k=0, 1,...),
of+Jak = Qiakef = ak`O1+J
and, in particular, 02ak = oako =
ako2
(k = 1, 2, ...).
From Proposition 3 (applied with a instead of o) ao(, is the unity element of S, thus a"+1 = a,
a"o = oa" = o
(n = o(a)).
We consider a k (>_ 2) for which oak # ako. Then it follows that S = {L0, ak}
and a E {o, ak}, so that we have an equation of the form a = ak-o''akZ, where x + y + z > 0. Since oa ao, necessarily y = 0. Consequently we can write a = a' (t > 0). Since here we have kt Z 2, it evidently follows that kt = I (mod n), thus (k, n) = 1. This result and Proposition 5 imply that rule (185.20) holds and so, according to Proposition 6, we have n = p° (e _>_ O, p prime). But because oa ao, n = 1 is not possible, so it necessarily follows that e >_ 1. Further, because of (185.20), we have gap = apLo-
On account of the finiteness of S an equation of the form Q
r = Qa
holds for r > a >_ 1. But here we must have a > 1, since the assumption a = 1, because r Z 2, gives the contradiction oa= o'a = ao' = ao The result is that all the equations (185.4) are satisfied, so that S is a homo-
morphic image of S. 4. Let So, Sa c S. If we have a S = S, then [with regard to (185.21)] after dualization and interchanging of o and a we obtain the case before last. Accordingly, we have only to consider the case aS c S. We now have the (self-dual) suppositions eS, So, aS, Sa c S.
798
FINITE ONE-STEP NON-COMMUTATIVE STRUCTURES
According-to Proposition 2 it follows that 92a
= o e = ae2 and
°.2e = QLoa = Nat.
From the finiteness of S we must have equations of the form Lo"=N
o'=o'b,
where r > a z I and s > b ? 1. But here a > 1, b > 1 must hold, since e.g., if a = 1 we obtain the contradiction Loa = 'a = aLor = a Lo. Since according to this all the equations (185.6) are satisfied, S is a homomorphic image of Sn1. We have now confirmed the assertion in all four cases. Consequently Theorem 449 is proved. EXAMPLE. The (infinite) semigroup defined by the equations
eEa = eae = aez, ate = aea = ea: consisting of the (distinct) elements
e'ak(i+k>0; i,k=0, 1,...), ae, has e, a as its one and only one non-permutable pair of elements. Consequently it is one-step non-commutative. EXERCISE. Solve the homomorphy problem for the semigroups SI, SE, Sm of Theorem 449.
BIBLIOGRAPHY ALBERT, A. A. (1937) Modern Higher Algebra. University of Chicago Press, Chicago. ALBERT, A. A. (1939) Structure of Algebras. Amer. Math. Soc., New York. ALBERT, A. A. (1956) Fundamental Concepts of Higher Algebra. University of Chicago Press, Chicago. ALTSCHILLER-COURT, N. (1952) College Geometry. 2nd ed. Barnes-Noble, New York. ARTIN, E. (1944) Galois Theory. 2nd ed. University of Notre Dame, Paris.
ARTIN, E. (1947) The free product of groups, Amer. J. Math. 69, 1-4. ARTIN, E., NESBITT, C. J. and THRALL, R. M. (1946) Rings with Minimum Condition.
University of Michigan Press, Ann Arbor. ARTIN, E. and SCHREIER, O. (1926) Algebraische Konstruktion reeller KSrper, Abh. Math. Sem. Univ. Hamburg 5, 85-89. ASANO, K. (1949) Ober die Quotientenbildung von Schiefringen, J. Math. Soc. Japan 1, 73-78. BAER, R. (1942) Inverses and zero-divisors, Bull. Amer. Math. Soc. 48, 630-638. BAER, R. (1946) Absolute retracts in group theory, Bull. Amer. Math. Soc. 52, 501-506.
BAUER, M. (1923) Ganzzahlige Gleichungen ohne Affekt, Math. Z. 16, 318-319. BAUER, M. (1932-34) Uber die alternierende Gruppe, Acta Sci. Math. Szeged 6, 222-223. BEAUMONT, R. A. (1948) Rings with additive group which is the direct sum of cyclic
groups, Duke Math. J. 15, 367-369. BENADO, M. (1954) Les ensembles partiellement ordonnes et le theoreme de raffinement de Schreier, 1, Czechoslovak. Math. J. 4, (79) 105-129. BENADO, M. (1955) Les ensembles partiellement ordonnes et le theoreme de raffinement de Schreier, I I, Czechoslovak. Math. J. 5, (80) 308-344. BIEBERBACH, L. (1952) Theorie der geometrischen Konstruktionen. Birkhauser Verlag, Basel.
BIEBERBACH, L. and BAUER, G. (1933) Vorlesungen fiber Algebra. 5. Aufl. Teubner Verlag, Leipzig-Berlin. BIRKHOFF, G. (1948) Lattice Theory. Amer. Math. Soc., New York. BOERNER, H. (1955) Darstellungen von Gruppen. Springer Verlag, Berlin. BOURBAKI, N. (1939) Elements de mathematique. Premibre partie: Les structures fondamentales de l'analyse. Livre I: Theorie des ensembles; Livre II: Algebre; Livre III:
Topologie generale. Hermann, Paris. BROWN, B. and McCoy, N. H. (1946) Rings with unit element which contain a given
ring, Duke Math. J. 13, 9-20. BRUIJN, N. G. (1953a) On the factorization of finite Abelian groups, Nederl. Akad. Wet. Proc. Ser. A 56, 258-264. BRUIJN, N. G. (1953b) On the factorization of cyclic groups, Nederl. Akad. Wet. Proc. Ser. A 56, 370-377. BURNSIDE, W. (1897) Theory of Groups of Finite Order. 2nd ed. University Press, Cambridge. CAPELLI, A. (1901) Sulla riduttibilita della funcione x"-A in un campo qualunque, Math. Ann. 54, 602-603. CARTAN, H. (1947) Theorie de Galois pour les corps non commutatifs, Ann. Sci. 1'Ecole Normale Sup. 64, 59-77. 799
800
BIBLIOGRAPHY
CHATELET, A. (1925) Les groups abeliens finis et les modules des points entiers. ParisLille. CHEBOTAREV, N. G. (1950) Grundzilge der Galois'schen Theorie. Noordhoff, Groningen-
Djakarta. CHEVALLEY, C. (1951) Introduction to the Theory of Algebraic Functions of One Variable.
Amer. Math. Soc., New York. CLIFFORD, A. H. and PRESTON, G. B. (1961) The Algebraic Theory of Semigroups.
Amer. Math. Soc., New York. vol. I. DELSARTE, S. (1948) Fonctions de Mobius sur les groupes abeliens fins, Ann. Math. 49, 600-609. DEURING, M. (1935) Algebren (Ergebn. d. Math. IVI). Springer Verlag, Berlin. DICKSON, L. E. (1901) Linear Groups with an Exposition of the Galois Field Theory, Teubner Verlag, Leipzig. DICKSON, L. E. (1927) Algebren and ihre Zahlentheorie. Orell-FUssli Verlag, ZUrichLeipzig.
DIEUDONNE, J. (1943) Les determinants sur un corps non commutatif, Bull. Soc. Math. France 71, 27-45. DIEUDONNE, J. (1955) La geometrie des groupes classiques (Ergebn. d. Math., neue Folge, Heft 5) Berlin. DIEUDONNE, J. (1963) La geometrie des groupes classiques. 2nd ed. Springer Verlag, Berlin.
DORROH, I. L. (1932) Concerning adjunctions to algebra, Bull. Amer. Math. Soc.
38, 85-88. DUBREIL-JACOTIN, M. L., LESIEUR, L. and CROISOT, R. (1953) Legons sur la teorie des treillis des structures algebriques ordonnees et des treillis geometriques. GauthierVillars, Paris. EGERVARY, E. (1955) Ober die Faktorisation von Matrizen and ihre Anwendung auf
die Losung von linearen Gleichungssystemen, Zeitschr. angew. Math. u. Mech. 35, 111-118. EICHLER, M. (1952) Quadratische Formen and orthogonale Gruppen. Springer Verlag, Berlin. EILHAUER, R. and WEINER, H. J. (1963) Zur Holomorphentheorie der Ringe, Acta Sci. Math. Szeged 24, 28-33. ELLIS, D. (1949) An algebraic characterization of lattices among semilattices, Port.
Math. 8, 103-106. EVERETT, C. J. (1942a) An extension theory for rings, Amer. J. Math. 64, 363-370. EVERETT, C. J. (1942b) Vector spaces over rings, Bull. Amer. Math. Soc. 48, 312-316. FARY, 1. (1949) Die Aquivalente des Minowski-HajSschen Satzes in der Theorie der topologischen Gruppen, Comment. Math. Aely. 23, 283-287. FErr, W. and THOMPSON, J. G. (1963) Solvability of groups of odd order, Pacific J. Math. 13, 775-1029. FRAENKEL, A. (1946) Einleitung in die Mengenlehre. 3. Aufl. Dover, New York. FRIED, E. (1953-54) Ober als echte Quotientenkorper darstellbare Korper, Acta Sci. Math. Szeged 15, 143-144. FRIED, E. (1956) Algebraically closed fields as finite extensions, Mat. Lapok 7, 47-60. FROBENIUS, G. (1878) Theorie der linearen Formen mit ganzen Coeffizienten, J. reine angew. Math. 86, 146-208. FROBENIUS, G. and STICKELBERGER, L. (1879) (Jber Gruppen von vertauschbaren Ele-
menten, J. refine angew. Math. 86, 217-262.
Fucxs, L. (1950) A note on the idealizer of a subring, Pub!. Math. Debrecen 1, 160-161.
Fucxs, L. (1951) The generalization of the valuation theory, Duke Math. J. 18, 19-26.
BIBLIOGRAPHY
801
FUCHS, L. (1952a) A simple proof of the basis theorem for finite Abelian groups, Norske Vid. Selsk. Forh. Trondheim 25, 117-118. FUCHS, L. (1952b) On subdirect unions, 1, Acta Math. Acad. Sci. Hung. 3, 103-119. FUCHS, L. (1958a) On the possibility of extending Haj6s's theorem to infinite Abelian
groups, Publ. Math. Debrecen 5, 338-347. FUCHS, L. (1958b) Abelian Groups. Akademiai Kiad6, Budapest. FUCHS, L. (1959) Abelian Groups. Pergamon Press, London. FUCHS, L. (1963) Partially Ordered Algebraic Systems. Pergamon Press, Oxford. GACSALYI, S. (1951-52) On algebraically closed Abelian groups, Publ. Math. Debrecen 2, 292-296. GARETT, J. R. (1956) Reduction of equations to normal form in fields of characteristic p,
Duke Math. J. 23, 241-251. GODEL, K. (1953) The Consistency of the Axiom of Choice and of the Generalized Continuum-hypothesis with the Axioms of Set Theory. University Press, Princeton. (Golfand) Form4IaHA,10. A. (1948) 0 gruppakh, vse podgruppy kotorykh spetsialnye, Doklady Akad. Nauk SSSR 60, 1313-1315. GRATZER, G. and SCHMIDT, E. T. (1957) tlber die Anordnung von Ringen, Acta Math. Acad. Sci. Hung. 8, 259-260. HAJbs, Gy. (1942) Ober einfache and mehrfache Bedeckung des n-dimensionalen Raumes mit einem Wurfelgitter, Math. z. 47, 427-467. HAJbs, Gy. (1950a) Sur la factorisation des groupes abeliens. Casopis 74, 157-162. HAJbs, Gy. (I 950b) Sur Ie problkme de factorisation des groupes cycliques, Acta Math. Acad. Sci. Hung. 1, 189-195. HALL, M. (1959) The Theory of Groups. Macmillan, London HAMEL, G. (1905) Eine Basis aller Zahlen and die unstetigen Losungen der Funktional-
gleichung f(x + y) = f(x) = f(y), Math. Ann. 60, 459-462. HASSE, H. (1935) Elementarer Beweis des Hauptsatzes uber ternare quadratische Formen mit rationalen Koeffizienten, J. reine angew. Math. 172, 129-132. HASSE, H. (1963) Zahlentheorie. 2. Aufl. Akademie Verlag, Berlin. HASSE, H. (1950) Vorlesungen fiber Zahlentheorie. Springer Verlag, Berlin. HASSE, H. (1951) Hihere Algebra. Walter de Gruyter, Berlin, Bd. I-II. HASSE, H. and KLOBE, W. (1952) Aufgabensammlung zur hi heren Algebra. 2. Aufl. Walter de Gruyter, Berlin. HAUPT, O. (1952, 1954) Einfiihrung in die Algebra. 2. Aufl. Geest and Portig, Leipzig, Bd. 1-II. HAUSDORFF, F. (1949) Grundziige der Mengenlehre. Chelsea, New York. HENSEL, K. (1905) (Jber die zu einem algebraischen Korper gehorigen Invarianten, J. refine angew. Math. 129, 68-85. HENSEL, K. (1908) Theorie der algebraischen Zahlen. Teubner Verlag, Leipzig. HERMES, H. (1955) Einfiihrung in die Verbandstheorie. Springer Verlag, Berlin. HERSTEIN, I. N. (1954) An elementary proof of a theorem of Jacobson, Duke Math. J. 21, 45-48.
HIGMAN, G. (1951) A finitely related group with an isomorphic proper factor group, Journ. London Math. Soc. 26, 59-64. HOCHSCHILD, G. (1947) Co-homology and representations of associative algebras, Duke Math. J. 14, 921-948. HOLDER, O. (1895) Bildung zusammensetzter Gruppen, Math. Ann. 46, 321-422. HURWITZ, A. (1919) Vorlesungen iiber die Zahlentheorie der Quaternionen. Springer Verlag, Berlin. ISBELL, J. R. (1959) On the multiplicative semigroup of a commutative ring, Proc. Amer. Math. Soc. 10, 908-909. IT6, N. (1955) On the number of isomorphic classes of non-normal subgroups in a finite
group, Acta Sci. Math. Szeged 16, 9-11.
802
BIBLIOGRAPHY
JACOBSON, N. (1943) The Theory of Rings. Amer. Math. Soc., New York. JACOBSON, N. (1951, 1953, 1964) Lectures in Abstract Algebra. D. Van Nostrand,
New York, vols. I, Ila, III. JACOBSON, N. (1956) Structure of Rings. Amer. Math. Soc., Providence.
JOHNSON, R. E. (1952) On ordered domains of integrity, Proc. Amer. Math. Soc. 3, 414-416. JOHNSON, R. E. (1958) Rings with unique addition, Proc. Amer. Math. Soc.9, 57-61. KALMAR, L. (1938-40) On the possibility of definition by recursion, Acta Sci. Math. Szeged 9, 227-232. KALMAR, L. (1949-50) Ober die Cantorsche Theorie der reellen Zahlen, Publ. Math. Debrecen 1, 150-159. KALMAR, L. (1952) Another proof of the Markov-Post theorem, Acta Math. Acad. Sci. Hung. 3, 1-27. KALOUJNINE, L. (1939) Une methode de construction des sous-groupes infra-invariants,
C. R. Acad. Sci. Paris 208, 1869-1871. KAPLANSKY, 1. (1949) Groups with representation of bounded degree, Canad. J. Math. 1, 105-112. KAPLANSKY, I. (1954) Infinite Abelian Groups. Ann Arbor.
KERTESZ, A. (1951-52) On groups every subgroup of which is a direct summand, Publ. Math. Debrecen 2, 74-75. KERTESZ, A. (1952) On fully decomposable Abelian torsion groups, Acta Math. Acad.
Sci. Hung. 3, 225-232. KERTESZ, A. (1955-56) The general theory of linear equation systems over semi-simple
rings, Pub!. Math. Debrecen 4, 79-86. KERTESZ, A. (1959a) A remark on the general theory of modules, Publ. Math. Debrecen
6, 86-89. KERTESZ, A. (1959b) Simple proof of a fundamental theorem of field theory, Amer. Math. Monthly 66, 804. KERTESZ, A. (1960) On independent sets of elements in algebra, Acta Sci. Math. Szeged 21, 260-269. KERTESZ, A. (1962) Vorlesungen uber Artinsche Ringe. Akademiai Kiado with Teubner Verlag, Budapest-Leipzig. KOCHENDORFFER, R. (1955) Einfdhrung in die Algebra. Deutscher Verlag der Wissenschaften, Berlin. KORINEK, V. (1942) Der Schreiersche Satz and das Zassenhaussche Verfahren in Ver-
banden, Vestnik Krkl. Ceske Spol. Nauk 14, 1-28. KRASNER, M. (1939) Une generalisation de la notion des sous-groupes invariants, C. R. Acad. Sci. Paris 208, 1867-1869. KRBEK, F., von (1954) Eingefangenes Unendlich. 2. Aufl. Teubner Verlag, Leipzig. KRONECKER, L. and HENSEL, K. (1901) Vorlesungen fiber Zahlentheorie. Teubner Verlag, Leipzig.
KRULL, W. (1932) Allgemeine Bewertungstheorie, J. reine angew. Math. 167, 160196.
KRULL, W. (1935) Idealtheorie (Ergebn. d. Math. IV3). Springer Verlag, Berlin. KUROSH, A. G. (1953a) Gruppentheorie. Akademie Verlag, Berlin. (KuROSB) Kypoln, A. r. (1953b) Teoriya grupp. 2nd. ed. lzd. Tekhniko-teor. Liter., Moscow.
KuRosH, A. G. (1955-56) The Theory of Groups. Chelsea, New York, vols. la, II. KURSCHAK, J. (1913) Ober Limesbildung and allgemeine KSrpertheorie, J. reine angew.
Math. 142, 211-253. LANDAU, E. (1930) Grundlagen der Analysis. Akademische Verlagsgesellschaft, Leipzig. LEEUVEN, L. C. A., van (1958) On the holomorphs of a ring, Nederl. Akad. Wet. Proc.
Ser. A 61, 162-169.
803
BIBLIOGRAPHY
LEEUVEN, L. C. A., van (1965) Holomorphe von endlichen Ringen, Nederl. Akad. Wet.
Proc. Ser. A. 27, 632-645. LORENZEN, P. (1939) Die Definition durch vollstandige Induction, Monatsch. Math. Phys. 47, 356-358. LuGOWSKI, H. and WEINERT, H. J. (1960) Elementarer Beweis eines Satzes uber zyklo-
sche Korpererweiterungen, Wissenschaftliche Zeitschrift der Padagogischen Hochschule Potsdam 5, 175. (LYAPIN) JIanHH, E. C. (1960) Polugruppy. Gosudarstvennoie lzdatelstvo Fizikomatematicheskoi Literatury, Moscow. MACDUFFE, C. M. (1933) The Theory of Matrices (Ergebn. d. Math. II.). Springer Verlag, Berlin.
McCoy, N. H. (1948) Rings and Ideals. Mathematical Association of America, Baltimore.
MACLANE, S. (1938) A lattice formulation for transcendence degrees and p-bases, Duke Math. J. 4, 455-468. MACLANE, S. (1958) Extensions and obstructions for rings, Illinois J. Math. 2, 316-345. MALTSEV, A. (1937) On the immersion of an algebraic ring into a field, Math. Ann. 113, 686-691. (MALTSEV) ManueB, A. (1939-1940) 0 bklyuchenii assotsiativnykh sistem v gruppy,
I, II, Mat. sb. nov. ser. 6, 331-336; 8, 251-264. (MARKOV) MapxoB, A. A. (1947) Hevozhmozhnosty nekotorykh algorifmov v teorii assotsiativnykh sistem, I, 11, Dokl. Akad. Nauk SSSR 55, 583-586; 58, 353-356. MILLER, G. A. and MORENO, H. C. (1903) Non-Abelian groups in which every subgroup is Abelian, Transact. Amer. Math. Soc. 4, 398-404. MIRIMANOFF, D. and HENSEL, K. (1905) Sur la relation
procit6, J. reine angew. Math. 129, 86-87.
D-_ p
(-1)` at ]a loi de reci-
MOTZKIN, A. (1949) The Euclidean algorithm, Bull. Amer. Math. Soc. 55, 1142-1146. NAKAYAMA, T. (1949-50) Galois theory for general rings with minimum condition,
J. Math. Soc. Japan 1, 203-16. NOETHER, E. (1918) Gleichungen mit vorgeschriebener Gruppe, Math. Ann. 78, 221-229. NOETHER, E. (1934) Zerfallende verschrankte Produkte and ihre Maximalordnungen. Hermann, Paris. (NovlKOv) HOBHKOB, II. C. (1955) Ob algoritmicheskoi nerazreshimosty problemy tozhdestva slov v teorii grupp, Trudy mat. inst. im. V. A. Steklova XLIV, Moscow. NOBAUER, W. (1962) Einige Bemerkungen fiber die Tschirnhaussche Transformation
von Polynomidealen, Acta Sci. Math. Szeged 23, 242-246. ORE, O. (1933) On a special class of polynomials, Transact. Amer. Math. Soc. 35, 559-584.
ORE, O. (1934) Errata in my paper "On a special class of polynomials", Transact. Amer. Math. Soc. 36, 275. OSTROWSKI, A. (1918) Uber einige Losungen der Funktionalgleichung T(x) q,(y),
= p(xy), Acta Math. 41, 271-284. OSTROWSKI, A. (1935) Untersuchungen zur arithmetischen Theorie der Korper, Math. Z. 39, 269-404. PICKERT, G. (1947-49) Bemerkungen zum Algebrenbegriff, Math. Ann. 120, 158-164. PICKERT, G. (1951) Einfuhrung in die hohere Algebra. Vandenchoeck - R uprecht, Gottingen. PICKERT, F. and GORKE, L. (1958) Aufbau des Systems der reellen Zahlen, Grandzifge
der Mathematik. Vanderhoeck-Ruprecht, Gottingen, 1. (PODDERYUGIN) Iloupeploruu, B. A. (1954) Usloviye uporyadochibayemosti proisvolnogo kolna, Usp. mat. nauk. 9, 211-216.
804
BIBLIOGRAPHY
(POLLAK) rlonnaK, r. (1955) Hovoye dokazatelstvo prostoty znakoperemennoi gruppy,
Acta Sci. Math. 16, 63-64. POLLAK, Gy. (1964) Bemerkungen zur Holomorphentheorie der Ringe, Acta Scl. Math. Szeged 25, 181-185. POLLAK, Gy. and HAUSCHILD, K. (1965) Uber ein Problem von L. R6dei, Acta Sci. Math. Szeged 26, 231-232. PosT, E. L. (1947) Recursive unsolvability of a problem of Thue, J. Symb. Log. 12, 1-11. RADOS, G. (1836) Zur Theorie der Congruenzen hSheren Grades, J. refine angew. Math. 99, 258-260. REDEI, L. (1946-48a) Uber einige merkwurdige Polynome in endlichen KSrpern mit zahlentheoretischen Beziehungen, Acta Sci. Math. Szeged 11, 39-54. REDEI, L. (1946-48b) Bemerkung zu meiner Arbeit »Uber die Gleichungen dritten
and vierten Grades in endlichen KSrpern<, Acta Sci. Math. Szeged 11, 184-190. REDEI, L. (1947) Zwei LOckensatze Uber Polynome in endlichen Primkorpern mit Anwendung auf die endlichen Abelschen Gruppen and die Gaussischen Summen, Acta Math. 79, 273-290. REDEI, L. (1949) Kurzer Beweis des gruppentheoretischen Satzes von Haj6s, Comment.
Math. Hely. 23, 272-282. REDEI, L. (1949-50) Vereinfachter Beweis des Satzes von Minkowski-Haj6s, Acta Sci. Math. Szeged 13, 21-35.
REDEI, L. (1950a) Die Anwendung des schiefen Produktes in der Gruppentheorie, J. reine angew. Math. 188, 201-227. REDEI, L. (1950b) Ein Beitrag zum Problem der Faktorisation von endlichen Abelschen
Gruppen, Acta Math. Acad. Sci. Hung. 1, 197-207. REDEI, L. (1951a) Ein Satz Uber die endlichen einfachen Gruppen, Acta Math. 84, 129-153.
REDEI, L. (1951b) Die Einfachheit der alternierenden Gruppe, Monatsh. Math. 55, 328-329.
REDEI, L. (1952a) Die Verallgemeinerung der Schreierschen Erweiterungstheorie, Acta Sci. Math. Szeged 13, 252-273. REDEI, L. (1952b) Uber die Determinantenteiler, Acta Math. Acad. Sci. Hung. 3, 143-150.
REDEI, L. (1952c) Kurzer Beweis der Waringschen Formel, Acta Math. Acad. Sci. Hung. 3, 151-153. REDEI, L. (1953-54) Uber die Ringe mit gegebenem Modul, Acta Sci. Math. Szeged 15, 251-254. REDEI, L. (1954a) Uber das Kreisteilungspolynom, Acta Math. Acad. Sci. Hung. 5, 27-28. REDEI, L. (1954b) Algebra I. Akad6miai Kiadb, Budapest. REDEI, L. (1954c) Die Holomorphentheorie fur Gruppen and Ringe, Acta Math. Acad. Sci Hung. 5, 169-195. REDEI, L. (1955a) Zetafunktionen in der Algebra, Acta Math. Acad. Sci. Hung. 6, 5-25. REDEI, L. (I955b) Neuer Beweis des Haj6sschen Satzes Uber die endlichen Abelschen Gruppen, Acta Math. Acad. Sc!. Hung. 6, 27-40. REDEI, L. (1955c) Die gruppentheoretischen Zetafunktionen and der Satz von Haj6s, Acta Math. Acad. Sci. Hung. 6, 271-279. REDEI, L. (1955-56) Die endlichen einstufig nichtnilpotenten Gruppen, Publ. Math. Debrecen 4, 303-324. REDEI, L. (1956) Aquivalenz der Satze von Kronecker-Hensel and von Szekeres fur
die Ideale des Polynomringes einer Unbestimmten Uber einem kommutativen Hauptidealring mit Primzerlegung, Acta Sci. Math. Szeged 17, 198-202.
BIBLIOGRAPHY
805
REDEI, L. (1957) Die einstufig nichtkommutativen endlichen Ringe, Acta Math. Acad. Sci. Hung. 8, 401-422. REDEI, L. (1958a) Uber die algebraisch-zahlentheoretische Verallgemeinerung eines
elementarzahlentheoretischen Satzes von Zsigmondy, Acta Sci. Math. Szeged 19, 98-126. REDEI, L. (1958b) Eine Bemerkung uber die endlichen einstufig nichtkommutativen Gruppen, Acta Sci. Math. Szeged 19, 127-128. REDEI, L. (1958c) Zur Theorie der Polynomideale fiber kommutativen nullteilerfreien Hauptidealringen, Math. Nachr. 18, 313-332. REDEI, L. (1959a) Naturliche Basen des Kreisteilungskorpers, 1, II, Abh. Math. Sem. Univ. Hamburg. REDEI, L. (1959b) Algebra I. Akad. Verlag, Leipzig. REDEI, L. (1959c) Ein spezieller Diskriminantensatz Uber Polynome, Acta Sci. Math.
Szeged 20, 233-237. REDEI, L. (1960) Halbgruppe and Ringe mit Linkseinheiten ohne Linkseinzelemente, Acta Math. Acad. Sci. Hung. 11, 217-222. REDEI, L. (1965a) Ein Uberdeckungssatz fur endliche abelsche Gruppen im Zusammenhang mit dem Hauptsatz von Hajbs, Acta Sci. Math. Szeged 26, 55-61. REDEI, L. (1965b) Logische Dualitat der Frobenius-Stickelbergerschen and Hajbsschen Hauptsatze der Theorie der endlichen Abelschen Gruppen, Acta Math. Acad. Sci. Hung. 16, 327. REDEI, L. (1965c) Theory of Finitely Generated Commutative Semigroups. Pergamon Press, Oxford. REDEI, L. (1965d) Powers in groups, Amer. Math. Monthly 73. REDEI, L. and STEINFELD, O. (1952) (Jber Ringe mit gemeinsamer multiplikativer Halbgruppe, Comment. Math. Hely. 26, 146-151. REDEI, L. and SZELE, T. (1947) Algebraisch-zahlentheoretische Betrachtungen fiber Ringe, I, II, Acta Math. 79, 291-320. REDEI, L. and SZELE, T. (1950a) Algebraische-zahlentheoretische Betrachtungen fiber Ringe, I, II, Acta Math. 82, 209-241. REDEI, L. and SZELE, T. (1950b) Die Ringe "ersten Ranges" Acta Sci. Math. Szeged 12a, 18-29. REDEI, L. and TuRAN, P. (1959) Zur Theorie der algebraischen Gleichungen uber endlichen Korpern, Acta Arithmetica 5, 245-247. REMAK, R. (1911) Uber die Zerlegung der endlichen Gruppen in direkte unzerlegbare Faktoren, J. reine angew. Math. 139, 293-308. RENYI, A. (1947) On the minimal number of terms of the square of a polynomial, Hung. Acta Math. 1, 30-34. SACHS, H. (1956-57) Untersuchungen uber das Problem der eigentlichen Teiler, Wiss. Z. Martin Luther Univ., Halle-Wittenberg, Math. Nat. Reihe 6, 223-259. SANDS, A. D. (1957) On the factorisation of finite Abelian groups, Acta Math. Sci. Hung. 8, 65-86. SANDS, A. D. (1959a) The factorization of Abelian groups, Quart. J. Math. Oxford. Ser. (2) 10, 81-91. SANDS, A. D. (1959b) On the factorization of finite Abelian groups, 11, Acta Math. Acad. Sci. Hung. 13, 153-169. SANDS, A. D. (1962) The factorization of Abelian groups, (II), Quart. J. Math. Oxford. Ser. (2) 13, 153-169. SANDS, A. D. (1964) Factorization of cyclic groups, Proc. Colloq. Abelian Groups (Tihany 1963), Akademiai Kiad6, Budapest. SCHMIDT, O. J. (1924) Uber Gruppen deren samtliche Teiler spezielle Gruppen rind, Recucil Math. d. Moscow 31, 366-372. SCHMIDT, O. J. (1928) Uber unendliche Gruppen mit endlicher Kette, Math. Z. 29,34-41.
806
BIBLIOGRAPHY
SCHREIER, J. and ULAM, S. (1937) Uber die Automorphismen der Permutationsgruppe
der natiirlichen Zahlenfolge, Fund. Math. 28, 259-260. SCHREIER, O. (1926a) Uber die Erweiterung von Gruppen, I, II, Monatsh. Math. Phys. 34, 165-180. SCHREIER, O. (1926b) Uber die Erweiterung von Gruppen, 1, Il, Abh. math. Sem. Hamburg 4, 321-346. SCHREIER, O. (1927) Uber den Jordan-HOlderschen Satz, Abh. math. Sent. Hamburg 6, 300-302. SCHWARZ, S. (1956) On the reducibility of polynomials over a finite field, Quart. J. Math. Oxford. Ser. (2) 7, 110-124. SEIBT, H. (1965) Zur Quadratbildung von Polynomen, Acta Sci. Math. Szeged. 26, 163-170.
SERES, I. (1956) Losung and Verallgemeinerung eines Schur'schen Irreduzibilitats-
problems fur Polynome, Acta Math. Acad. Sci. Hung. 7, 151-157. SPECHT, W. (1956) Gruppentheorie. Springer Verlag, Berlin. SPEISER, A. (1956) Die Theorie der Gruppen von endlicher Ordnung. 4. Aufl. Birkhauser Verlag, Basel.
STEINITZ, E. (1910) Algebraische Theorie der Korper, J. reine angew. Math. 137, 167-309.
STEINITZ, E. (1930) Algebraische Theorie der Kdrper. Walter de Gruyter, Berlin. STICKELBERGER, L. (1898) Uber eine neue Eigenschaft der Diskriminanten algebraischer
Zahlenkorper, Verhandl. des I. Internat. Math. Kongr. Leipzig 182-193. SuzuKu, M. (1957) The nonexistence of a certain type of simple groups of odd order, Proc. Amer. Math. Soc. 8, 686-695. SzAsz, F. A. (1957) Note on rings in which every proper left-ideal is cyclic, Fund. Math. 44, 330-332. Sza,sz, F. A. (1960) Uber Ringe mit Minimalbedingung fdr Hauptrechtsideale, I-III, Pub!. Math. Debrecen 7, 54-64. SzAsz, F. A. (1961) Acta Math. Acad. Sci. Hung. 12, 417-439. SzAsz, F. A. (1963) Acta Math. Acad. Sci. Hung. 14, 447-461. SzAsz, G. (1953-54) Die Unabhangigkeit der Assoziativitatsbedingungen, Acta. Sci. Math. Szeged 15, 20-28. SzAsz, G. (1963) Introduction to Lattice Theory. Akad6miai Kiadb, Budapest. SZEKERES, G. (1952) A canonical basis for the ideals of a polynomial domain, Amer. Math. Monthly 59, 379-386. SZELE, T. (1949) Die Ringe ohne Linksideale, Bul. Sti. Bucuresti 1, 783-789. SZELE, T. (1949-50a) Neuer vereinfachter Beweis des gruppentheoretischen Satzes von Hajbs, Pub!. Math. Debrecen 1, 56-62. SZELE, T. (1949-50b) Zur Theorie der Zeroringe, Math. Ann. 121, 242-246. SZELE, T. (1949-50c) On Zorn's lemma, Publ. Math. Debrecen 1, 254-256. SZELE, T. (1950) Sur les groupes ayant un sous-group parfait, Bull. Sci. Math. Paris 74, 207-209.
SZELE, T. (1951-52a) On direct sums of cyclic groups, Publ. Math. Debrecen 2, 76-78. SZELE, T. (1951-52b) On arbitrary systems of linear equations, Pub!. Math. Debrecen 2, 297-299. SZELE, T. (1952) On ordered skew fields, Proc. Amer. math. Soc. 3, 410-413.
SZELE, T. (1954) Simple proof of the Wedderburn-Artin structure theorem, Acta Math. Acad. Sci. Hung. 5, 101-107. SZELE, T. and SZENDREI, J. (1951) On Abelian groups with commutative endomorphism
ring, Acta Math. Acad. Sci. Hung. 2, 309-324. SZENDREI, J. (1949-50) On the extension of rings without divisors of zero, Acta Sci. Math. Szeged 13, 231-234.
BIBLIOGRAPHY
807
SZENDREI, J. (1954) Eine neue Definition des Holomorphes der Gruppe and der Holomorphe des Ringes, Acta Math. Acad. Sci.Hung. 5, 197-201. SZENDREI, J. (1955-56) Zur Holomorphentheorie der Ringe, Publ. Math. Debrecen 4, 450-454. TEICHMULLER, O. (1939) Braucht der Algebraiker das Auswahlaxiom? Deutsche Math. 4, 567-577.
TUKEY, J. W. (1940) Convergence and Uniformity in Topology. University Press, Princeton. WAERDEN, VAN DER, B. L. (1933) Die Seltenheit der Gleichungen mit Affekt, Math.
Ann. 109, 13-16. WAERDEN, VAN DER, B. L. (1948) Free products of groups, Amer. J. Math. 70,527-528. WAERDEN, VAN DER, B. L. (1955) Algebra. 3 Aufl. Springer Verlag, Berlin Bd. I-1I. WEINERT, H. J. (1961a) Ober die Einbettung von Ringen in Oberringe mit Einzelement,
Acta Sc!. Math. Szeged 22, 91-105. WEINERT, H. J. (1961b) Zur Existenz and Homogenitat des grossten gemeinsamen
Teilers and des kleinsten gemeinsamen Vielfachen, Acta Sci. Math. Szeged 2, 283-285 WEINERT, H. J. (1962-63-64) Ober Halbringen and Halbkorper, I, II, III, Acta Math. Acad. Sci. Hung. 13, 365-378; 14, 209-227; 15, 175-194. WEINERT, H. J. (1963) Ein hinreichendes and notwendiges Kriterium fur die ZPE-
Eigenschaft in kommutativen, regularen Halbgruppen, Acta Sci. Math. Szeged 24, 24-27. WEINERT, H. J. (1964) Halbgruppen ohne Frattinische Unterhalbgruppe, Acta Math. Acad. Sci. Hung. 15, 309-323. WEINERT, H. J. (1965) Zur Theorie der Algebren and monomialen Ringe, Acta Sci. Math. Szeged 26, 171-186. WEINERT, H. J. (1965) Monomiale Ringe mit speziellen Faktorensystemen, Acta Sci. Math. Szeged 27, WERDENIUS, W. (1949) On the number of terms of the square and the cube of polynomials, Nederl. Akad. Wet. Proc. Ser. A. 52, 1220-1226. WILKER, P. (1951-52) Ober die Zwischenkorper einfacher algebraischer Erweiterungen, Math. Ann. 124, 289-290. WITT, E. (1931) Ober die Kommutativitat endlicher Schiefkorper, Abh. Math. Sent. Univ. Hamburg 8, 413. ZASSENHAUS, H. (1958) The Theory of Groups. 2nd. ed. Chelsea, New York. ZORN, M. (1935) A remark on method in transfinite algebra, Bull. Amer. Math. Soc. 41, 667-670.
INDEX Abelian field 663 Abelian group 35 Abel's theorem 674
Algebraically independent 516 Algebraically independent set 516 Alphabet 163 Alternating group 136 Alternating ring 280 Alternating units 280 Amicable double homothetism 196 Anharmonic group 90 Annihilator ideal 396 Annihilator left ideal 396 Anti-automorphism 117 Anti-isomorphism 117 Archimedean ordering 580 Archimedean valuation 618 Artin ring 430 theorem 575 Ascending chain 424 Associated elements 310 Associated factor systems 178 Associated function pairs 178 Associated function quadruple 191 Associative 30 Associativity conditions of the structure constants 243 Autohomomorphism 100 Automorphism 100 group 101
Absolute algebraic element 481 Absolute algebraic field 481
Absolute conjugate 529 Absolute degree
481
Absolute different 556 Absolute discriminant 556 Absolute field 481 Absolute transcendental element 481 Absolute transcendental field 481 Absolute valuation 586 Absolute value 583, 612 of a complex number 612 Absolutely irreducible 449 Absorption rule 214 Abstract structure 89 Addition 28
mod m 65 theorem for determinants 286 Additive commutator 83 Additive order 59 Adjoint 287 determinant 290 matrix 289 Adjunction 107, 481 of indeterminates 259 Admissible factor structure 225 Admissible substructure 223 Affect of equation 688 Affectless Algebra
system 181
47, 152, 153, 240, 245 representation 152, 153, 364 transformation 277, 279 vector 273
688
Basis
34, 241
Algebraic closure
494
Algebraic element 480 Algebraic equation
Between-field
290
Algebraic equation system 290 Algebraic field
479
Bicontinuous isomorphism Bimultiplication 198 Binomial coefficient 62 Binomial equation 672 Binomial theorem 62 Biquadratic 264 equation 697, 705 Bolzano's theorem 603
613
Algebraic hull 516 Algebraic number 613 Algebraic number field 613 Algebraically closed 494, (516) Algebraically dependent 515 Algebraically equivalent 520 am
27/a R.-A.
176
Automorphism-free Schreier extension
594
INDEX
810
Bolzano-Weierstrass theorem 610 Boole algebra 222 Boole lattice 222 Boole ring 222 Breakable semigroup
Commutator ideal 207 Commutator ring 83 Comparable elements 19
Cancellation 45
Central Schreier extension 182 Centralizer 132 Centrally isomorphic subgroup 234
Complementary lattice 221 Complementary theorems to the quadratic reciprocity theorem 665 Complete field 541 Complete group 185 Complete induction 11 Complete invariant system Complete prime ideal 320 Completely reducible 151 Complex character 614 Complex field 297
Centre of a structure 82
Complex number 610
Centre-free 84 Chain 3 Character 377 group 377 relations 614
Complex number field 610 Complex ring 297 Complexes 118-21 Component 143 representation 143 Composite number 50 Composition 28, 78
Cantor's theorem 17 Cardan's formula 699
Cardinal number 16 Cauchy's theorem Cayley's table 41 Cayley-Hamilton theorem subgroup 294
Central automorphism 234
sum 615 Characteristic of a structure 64 Characteristic matrix 293 Characteristic polynomial
293, 551
Characteristic polynomial of an algebra element 295 Characteristic series 229, 232 Characteristic subgroup 184 Characteristic submodule 230 Characteristic subring 198 Characteristically simple 229 Chevalley -Jacobson density theorem 424 Chinese residue theorem
Choice axiom 8 Choice function
8
Class 9
Class composition 97 Class equation 133 Classification 9 Closed 70, 223
Codeterminant 291 Coefficient 33, 57, 259 Coefficient ring 259 Column rank 419 Column vector 269 Combined product 119 Commutative 32 Commutator 83 Commutator group 83
332
Compatible 98
Complement 287 of an element 221
series
211
Concomitant classification 11 Concomitant matrix 553 Congruence 154 Congruent elements 154 Conjugate 54, 121, 528 Conjugate complex 121, 612 Conjugate element 54, 528 Conjugate fields 528 Conjugate quaternions 300 Conjugate subgroups 121 Conjugates of an element 54, 528, Constant 259
Content of a polynomial 342 Continuation of a mapping 5 Convergent infinite product 593 Convergent infinite series 592 Convergent sequence 587 Coset 122 Countable set 15 Countably infinite set 15 Cramer's rule 291 Cross-cut 2 Cross product 252 Cubic 264 Cubic equation 694, 703 Cubic resolvent 699 Cycle
55
555
811
INDEX
Cyclic determinant 561 Cyclic field 663, 672 Cyclic group 54, 360 Cyclic matrix 507 Cyclic module 59, 396 Cyclic permutation 55 Cyclic ring 394 Cyclotomic field 508, 669 Cyclotomic polynomial 509 Cyclotomy 711
Dedekind ring (example) 315 Defining equation of a field 487 Definition domain 488 Degree of an element 293, 480 Degree of a polynomial 260, 264 Degree of a rational function 439 Degree of a skew field 482 Degree of incompletability 544 Degree of inseparability 541 Degree of transcendence 520 Degree valuation 651 Deli problem 713 Delsarte function 382 Delsarte inversion formula 382 Delsarte's theorem 382
Dense set of mappings 424 Density theorem 424 Derived structure 96 Descending chain 423 Determinant 282 Determinant divisor 399 Determinant of a linear equation system 291
Determinant polynomial 284 Diagonal 269 Diagonal matrix 269 Difference 58 Difference field 162 Difference module 162 Different 556 Differential quotient 434
Differential ring 162 Differential set 2 Differential skew field 162 Differential structure 162 Dihedral group 169 Dimension 239 Direct component 141 Direct composition 141 Direct decomposition 148 Direct factor 141 Direct product 105, 141
Direct sum 105, 141 Direct summand 141 Dirichlet mapping 379 Discrete valuation 627 Discriminant of an element 556 Discriminant of a polynomial 451 Disjoint sets 2 Distributive 33, 219 Distributive lattice 220 Divisibility 50, 304 Division 57 Division ring 37 Divisor 59 Domain of definition 7
Dorroh extension ring 110 Dorroh's theorem 110 Double algebra 246 Double endomorphism 195 Double homothetism 195 Double homothetism ring 196 Double module 244 Double operator domain 228 Double vector space 245 Dual 38, 215 zeta function 391 Duality 39, 215 Duality principle 39 of determinants 285 of finite Abelian groups 378 of lattices 215 Duplication of a cube 713 Dyck's theorem Eisenstein's theorem 463 Element I Elementary divisor
402
Elementary group 366 Elementary symmetric polynomials 442 Elementary transformation 402 Embedding 108 Endomorphism 100 Endomorphism ring 113, 115 system
189
Equation 8 system
8
Equipment 14 Equivalence axioms 10 Equivalence classes 10 Equivalence relation 9 Equivalent elements 9 Equivalent Everett extensions 191 Equivalent extension fields 480 Equivalent extensions 182
INDEX
812 Equivalent Schreier extensions 183
Finite field 499 Finite one-step non-commutative group
178,
Equivalent sets 14 Equivalent valuations 593 Euclidean algorithm 328 Euclidean ring 325 Euclid's theorem 330 Euler function 333
736
Finite one-step ring 753 Finite one-step semigroups 786 Finite set 15 Finitely generated structure
First main theorem for finite Abelian
Euler's theorem 436 Even permutation 135 Everett extension ring 187 Everett's fundamental theorem 188 Everett's sum 188
groups
Form 264
Formal degree 261 Formally real 572 Four-group 90 Fraction 57 Fractional linear group 279 Frattini substructure 81 Free field 172
33, 47
of an element 535 of a field 535 Exponent valuation 620 Expression 78 Extension 5, 106 Extension field 71
Free generators 166 Free group 166
of a finite degree 488 Extension group 71 Extension module 71 Extension of the fundamental ring
Free module 172 449
Extension of an ordering 576 Extension of a valuation 594
362
Factor decomposition 49 Factor field 99 Factor group 98, 126 Factor module 98 Factor ring 98 Factor semigroup 98 Factor skew field 99 Factor structure 98 Factor system 116, 175, 188
Factors of normal series 211 96
F-basis 482
Fermat prime numbers 712 Fermat -Euler theorem 334 Field 36
of algebraic number 613 of complex numbers 610 of prime number characteristic of rational functions 438 of real numbers 602
Free product 172 Free ring 164
Free semigroup 164 Free structure 166 Free sum 173 Frobenius classes 54 Frobenius's theorem 406 Frobenius-Stickelberger main theorem
Extension ring 71 Extension semigroup 71 Extension set I Extension skew field 71 Extension structure 71
Faithful representation
369
First structure theorem 427 Fixed element 4
284
Expansion of a determinant
Exponent
80
First general isomorphy theorem 200
498
Full automorphism group 101 Full double endomorphism ring 196 Full endomorphism ring 113, 115 Full invariant series 229, 230 Full invariant subgroup 229 Full linear group 279 Full matrix ring 274 Full permutation-group 111 Full set lattice 214 Function 7 Function pair 175 Function quadruple 188 Fundamental domain 242 Fundamental field 242, 478 Fundamental ring 242 Fundamental skew field 242 Fundamental solutions 418 Fundamental theorem for elementary number theory 329 Fundamental theorem for finite generable Abelian groups 406
813
INDEX
Fundamental theorem for the Galois theory
Icosahedron group Ideal
658
140
128
Ideal of a semi gr oup Galois correspondence 656 Galois field 655 of an equation 659 of a field
133
Ideal of zero sequences
591
Idealizer 132
Idempotent 48
681
Identical mapping
Galois group 655
Identical operator
5 34
Galois theory 655 Gauss rings 335
I mage 4 Incomplete field Indecomposable
Gauss theorem 342
Independent
Gaussian integers 154 General algebraic equation 688 General valuation 626 Generalized associativity 42 Generating elements 77 Generating system 77 Generators 77 Geometrical constructibility 708 Greatest common divisor 310 Greatest common left divisor 308
Independent transcendental elements 517 Indeterminate 259 Index 122
of an equat ion
681
,
729
Greatest lower bound 216 Gratzer-Schmidt theorem 577 Group 35, 51 of units 307 ring 255
Haj6s product 369 Haj6s's main theorem 369 Hamilton's principle 105 Hasse diagram 216 Hausdorff-Birkhoff theorem 25 Hensel's lemma 635 Hilbert's basis theorem 464 Holomorph of a group 184 Holomorph of a ring 198 Homogeneous 264 Homogeneous components 265 Homogeneous linear equation system Homogeneous linear rational function 439
Homomorphic invariant 93 Homomorphic mapping 87 Homomorphic structure 92
Homomorphism 92 Homomorphy 92 Homomorphy problem 96 Homomorphy theorem 99 Hua's theorem 102 Hypercomplex ring 232 Hypercomplex system 241 Hypermatrix 422
Index set
541 151
152, 153
21
Induced congruence 156 Induced equivalence relation 10 induced mapping 5 Inertia theorem of a finite Abelian group 388
Infinite product 42, 592 Infinite series 592 Infinite set 15 Infinitely large 581 Infinitely small 581 Inner automorphism 101, 102 Inner automorphism group 101, 102 Inner double homothetism 195 Inner product 271 inseparable 532 Integral domain 64 Integral quatemion 345 Integrity domain 64 Interchangeable 7, 46, 120 Interpolation 458 Intersection 2 invariance field
Invariance group
656 656
Invariance of the greatest common divisor
339
Invariants of a finite Abelian group 364
Invariants of a module 409 Inverse
29
Inverse complex 120 Inverse composition 57 Inverse image 4 Inverse mapping 5 Inversion 135 Invertible element 29 Invertible matrix 279 Irreducible case 702 Irreducible element 310
INDEX
814
Irreducible equation 487 Irreducible factor decomposition 310 Irreducible radical 673 Irreducible radical expression
681
Isbell's theorem 80 Isobaric polynomial 446 Isomorphic invariant 89 Isomorphic mapping 87 Isomorphic normal series 211 Isomorphic structures 87 Isomorphism 87 of a field 527 Isomorphy 87 Isomorphy principle 89 Isomorphy theorems for groups 203 Isomorphy theorems for rings 204 Iterated polynomial 262
Jacobi's theorem 358 Jordan-Holder theorem 212 Juxtaposition 17
Left inverse 44
Left module 230 Left multiple 50 Left multiplication 232 Left operator 33 Left principal ideal ring 321 Left principal series 232 Left regular 45 Left representative system 123 Left residue polynomial 338 Left side cancelling rule 45 Left unity element 43 Left vector space 239 Left zero divisor 45 Left zero element 44 Legendre symbol 665 Letters 164 Levi's theorem 571 Lexicographical ordering 265 Limit 587 Linear algebra
395
Linear diophantian equation system
Kernel of a homomorphism 95 Konig-Rados theorem 507 Kronecker symbol 275 Kronecker's factor decomposition 560 Kronecker's rank theorem 422 Kronecker-Hensel normal form 472 Kronecker- Hensel theorem 471
Krull's theorem 206 Kuratowski-Zorn lemma 22
406 Linear equation system
Linearly independent Lagrange resolvent
Lagrange's formula of interpolation
Logarithm 608 Liiroth's theorem
458
Lattice 214
Leading coefficient 261 Leading term 261
Least common left multiple 308 Least common multiple 310 Least upper bound 216 Left annihilator 45 Left associated elements 306 Left cosets 122 Left distributivity 59 Left divisor 50 Left Euclidean division 325 Left Euclidean ring 325 Left ideal 129
153, 153, 395,
410
674
Lagrange's theorems 122, 348 Laplace's expansion of determinants Laplace's theorem 287
291. 414
Linear form module 265 Linear group Linear mapping 267 Linear polynomial 264 Linear rational function 439 Linear transformation 274 Linearly dependent 410 Linearly equivalent 411
287
525
Main theorem for finite Abelian groups 369 for finitely generated Abelian groups 406
Main theorem for solvable equations 684 for symmetric polynomials 442
for zeta functions for finite Abelian groups 386 Mapping 4 Matrix 65, 269 Matrix units 276 Maximal condition 424 Maximal element 19 Maximal ideal 205 Maximal left ideal 205
Maximal normal subgroup 205
INDEX
Maximal set 2 Maximal subfield 72 Maximal subgroup 72 Maximal submodule 72 Maximal subring 72 Maximal subsemigroup 72 Maximal sub-skew-field MacCoy's theorem 432 Meromorphism 100 Minimal condition 423 Minimal element 19 Minimal generating system 80 Minimal polynomial 480 Minimal set 2 Minimal subgroup 73 Minimal submodule 73 Minimal subring 73 Mixed group
321
Modular lattice 219 Module 15, 57 of a congruence
of a ring
155
59
Monomial algebra 254 Monomial basis 254 Monomial double algebra 254 Monomial ring 254 Monotonicity 219, 569 of zeta functions 390 Mobius function 385 Mobius inversion formula 385 Mobius theorem 385 Multiple divisor 440 Multiple factor 440 Multiple root 440 Multiple zero 440 Multiplication (6), 28
Multiplication mod m 65 Multiplication table 41 Multiplication theorem for determinants 285 Multiplication theorem for discriminants 453 Multiplication theorem for resultants 450
Natural group ring 256 Natural homomorphism 98 Natural numbers 11 Natural operator 47 Natural polynomial ring 263 Natural semigroup ring 256 Negative
568
Neutral element 29
815
Newton formulae 453 Newton's method of interpolation process 459
Nil left ideal 430 Nil ring 64 Nilpotence degree 49 Nilpotent 49, 423 Non-Archimedean ordering 580 Non-Archimedean valuation 618
Norm 551 Normal automorphism 234 Normal basis 560, 733 Normal divisor 125, (183) Normal field 496 of an algebraic field
498
Normal form of a matrix 402 Normal series 211 Normal subgroup 125, (183) Normal subring 129 Normalizer 132 of a ring
133
Normed class 307 Normed discrete valuation 627 Normed elements 307 Normed polynomial 338
Norming 307 n-tuple
17
Null module 38 Null operator 34 Null ring 38
Odd permutation 135 One-sided ideal 129 One-step non-commutative
structure
736
Operation 33 Operator 33
automorphism 224 compatible classification domain 33
endomorphism 224 Operator group 229 homomorphism 224 isomorphism 224 module 230 ring 231 simple 220 structure 40, 223
Opposed structure 117 Order ideal 399 Order of an element 48 Order of a structure
38
Order preserving automorphism 576
INDEX
816
Order preserving embedding 576 Order preserving isomorphism 576 Order preserving substructure 576 Orderable 571 Ordered Abelian group 607 Ordered field
Ordered module 568 Ordered ring 568 Ordered set 19 Ordered skew field 568 Ordering relation 19 Ordinary quaternion field 299 Ore polynomial ring 559 Ostrowski's first theorem 633 Ostrowski's second theorem 644
55, 111
Pole
575
653
Polynomial 259 Polynomial coefficient 61 Polynomial ring 259, 266 Polynomial theorem 61 Polynomial with integer coefficients Positive 568
Positive kernel 572, 574 Positivity domain 568 Potency
14
Power 46 Power set 2
Overstructures of the same kind p-adic field 631 p-adic hull 631 p-adic number 631 p-adic number field 631 p-adic series 631 p-adic valuation 630 p-adic number 632 p-adic number field 632 p-adic series 632 p-adic unit 632 p-adic valuation 632 p-basis 544 p-component 331 p-dependent 543 p-equivalent sets 544 p-group 331 p-independent 543 p-independent set 543
630 195
Partial product 392 Partial sequence 610
63
Polynomial vanishing differential quotient 435
Overfield 71 Overgroup 71 Overmodule 71 Overring 71 Overset 1 Overstructure 71
Pair 17 of mappings
group
Pickert-Szele theorem Place valuation 653
568
n-adic valuation
Periodic group 331 Periodic kernel 331 Permutable elements 46, 120 Permutation 5
71
Power sum 453 Primary component 331 Primary element 335 Primary group 331
Primary number 335 Primary prime number 335 Primary quaternion 355 Prime 312 Prime decomposition 312 Prime element 312
Prime factors 314 Prime field 477 of a skew field 478 Prime ideal 320 Prime number 50, 329 Prime polynomial 339
Prime power 314 Prime power basis 364 Prime power decomposition 314 Prime power factor 314 Prime power module 408 Prime residue class 333 Prime simplex 369 Primitive element 482 Primitive ideal 466
Partial sum 592
Primitive number mod w 394
Peano's axioms 14 Peirce decomposition 233 Perfect field 592 Perfect hull 594 Perfect valuation 592 Period 48, 671
Primitive polynomial 342 Primitive quaternion 345 Principal class 122 Principal ideal 131 Principal ideal ring 322
with prime decomposition
323
INDEX
Principal left ideal 131 Principal left ideal ring 321 Principal minor 287 Principal polynomial 339 Principal series 229, 232 Product 28
of mappings 6 of matrices 271 of sets 2 Projection 145 Proper degree 261 Proper left divisor 308 Proper subgroup 72
Proper submodule 72 Proper subring 72 Property of finite character 24 PrUfer group 66 Pure inseparable 538 Pure transcendental 517 q-polynomial 558 Quadratic 264
Quadratic equation 693 Quadratic reciprocity theorem 665 Quadratic residue 664 Quadrature of a circle 713 Quadruple 17 Quasi ideal 133 Quaternion
69, 300
with integer coefficient 347 Quaternion field 69, 299 Quaternion group 70 , 298 Quaternion ring 299 Quaternion units 300 Quotient 57 Quotient field 158 Quotient group 158 Quotient ring 158 Quotient semigroup 157 Quotient structure 157
Radical exponent 49 Radical expression 681 Radical free 430 Radical of an element 49
Radical of a ring 430 Radical over a field 672 Range (of a function) 7 Rank 239, 412, 419 of a linear equation system 415 of a matrix. 399 Rank theorem for matrices 419 Rational function 439
817
Rational function field 438 Rational number field 40, 163 Rational numbers 163 Real number 602 Real number field 602 Real valuation 617 Really closed field 615 Recursive definition 12 Reduced degree of an element 530 Reduced degree of a field 530 Reduced norm 296, 301 Reduced trace 296, 301 Reducible element 310 Reducible radical 673 Refinement of a normal series Reflexive relation 3
Regular multiplication
211
31
Regular norm 296 Regular representation 112, 295 Regular semigroup 33 Regular trace 296 Related classifications Relation 3
132
Relative automorphism 480 Relative conjugate 529 Relative different 556, 557 Relative discriminant 556, 557 Relative field 232, 478 Relative isomorphism 182, 480 Relative meromorphism 480 Relative norm 551 Relative trace 551 Relatively prime 324 Remak decomposition 235
Remak-Krull-Schmidt theorem 235 Remarkable points of the triangle 713 Representation 96 Representation problem 96 Representative 9 Representative system 9, 123 Residue 325 Residue class 124 Residue polynomial 338 Residue system 124 Residue theorem for polynomials 340 Resultant 444 Riemann zeta function 391 Ring 36, 59 defined by equations 167 of integer quaternions 345 of integers 40, 329
with greatest common left 308
divisor
INDEX
818
Ring with irreducible factor decomposition 310 with least common left multiple 308 with prime decomposition 313 with unity element 108, 336 Root 8, 49, 262, 290 of unity 509 Root exponent 49 Root system 291 Row rank 419 Row vector 269 Ruffini-Abel theorem 688 Scalar
272
Scalar factor
272
Scalar matrix
276
Schlicht product 120
automorphism-free
Schreier extension, 181
Schreier extension, factor-free 181 Schreier extension, group 174 Schreier extension, semigroup 183 Schreier's fundamental theorem 176 Schreier's main theorem 211
Schur's lemma 423 Second general 200
isomorphy
theorem
Second main theorem for finite Abelian groups 369 Second maximal subgroup 140 Second structure theorem 428 Semifield
36
35, 40 defined by equations 167
Semigroup
of a ring 59 Semigroup ring
255
with factor system 256 Semilattice 222 Semimodule 35 Semiordered set 19 Semiordering relation Semiring 36
Set
24
Set composition
119
extension
Simplex 369
Single-value mapping Skew field 36, 67 Skew product 105 Solution 8, 290 Solution formula 415, 682 Solution system 291 Solution vector 418 Solvable equation 680 Solvable field 663 Solvable group 214 Splitting Everett extension 194
Splitting field of polynomials 490 Splitting Schreier extension 181 Steinitz's first main theorem 494 Steinitz's interchange theorem 411 Steinitz's principle 89 Steinitz's second main theorem 520
Steinitz's theorem 494 of exchange 411 Stickelberger's theorem 663 Structure 34 belonging to equations 167 defined by equations 167 Structure axioms 38 Structure constants 241 Structure extension 106 Subdeterminant 286, 399 Subdirect product 151 Subdirect sum 151
40
Subsemigroup 70 Subset 1 Sub-skew-field 70 Substitution 262
principle of polynomials 262 theorem for polynomials 263 value 262
Substructure
70
of the same kind
1
Set chain
Simple structure 95 Simple transcendental field 483, 523
Submodule 70 Subring 70
19
of natural numbers 40
Separable elemen3 532 Separable hull 538 Sequence 17, 21
Simple algebraic field extension 483, 523 Simple field extension 482, 549
Subfield 70 Subgroup 70 Submatrix 270
of non-negative integers
Sensisimple 427 Semi-skew-field 36 Semistructure 37
Set lattice 220 Simple adjunction 482
71
Substructure problem 96 Subtraction 58
INDEX
Sum 28 of matrices 271 Sylvester determinant 444 Symmetric polynomial 44l Symmetric relation 3 System 1, 19 Szekeres normal-form 467 Szekeres's theorem 466 Szendrei's theorem 335
Taylor's theorem 438 TeichmUller-Tukey lemma 24
Theorem of inertia of finite Abelian group 388 Topological isomorphism 594 Topologically equivalent 594
Torsion free ring 331 Torsion group 331
272
Transposition 55 Triple 17 Trisection 712
Trivial homorphism 95 Trivial ring 64 Trivial solution 418 Trivial valuation 586 True representation 96 Tschirnhaus transformation of an ideal473
Tschirnhaus transformation of a polynomial
691
Union of structures 76 Union set 2 Uniqueness theorem for finite generable Abelian groups 409 Unit
306
Unit factor 314 Unitary module 230 Unity element 29 Unity group 38
Unity ideal 324 Unity matrix 275
Unity of a lattice 219 Unknown 3, 80 Upper bound 22 Vahlen-Capelli theorem 675 Valuated field 585 Valuation 585 Valuation factor field 623 Valuation ideal 523 Valuation ring 622 Value field 623 Value' module 627 Value of an element 585 Vandermonde determinant 457 Variable 7 Vector space
Trace 551 Transcendence basis 520 Transcendental element 480 Transcendental field 480 Transfer rule of equivalence 10 Transfinite induction 26 Transformation 278 Transformed matrix 280 Transitive permutation group 732 Transitive relation 3 Transpose of a matrix
819
239
Vector space over a skew field 412 Waring's formula 454 Wedderburn-Artin structure theorems 426, 428
Wedderburn's theorem 513 Weierstrass's upper bound 610 Weight 446
Well-ordered set 21 Well-ordering theorem 25 Word problem 172 Word sum 164 Words
164
Zassenhaus's lemma 208 Zassenhaus's subgroup 208 Zassenhaus's subring 208 Zermelo's postulate 8 Zermelo's theorem 25 Zero class 124 Zero divisor 45 Zero-divisor-free 64 Zero-divisorless 64 Zero element 32
Zero element of a lattice 219 Zero endomorphism 114 Zero of a polynomial 262 Zero polynomial 260 Zero ring 64 Zero sequence
587
Zeta function for finite Abelian groups 385
Zsigmondy's theorem
394
OTHER TITLES IN THE SERIES IN PURE AND APPLIED MATHEMATICS Vol.
1
WALLACE - An Introduction to Algebraic Topology
Vol. 2 PEDOE - Circles Vol. 3 SPAIN - Analytical Conics Vol. 4 MIKHLIN - Integral Equations Vol. 5 EGGLESTON - Problems in Euclidean Space: Application of Convexity Vol. 6 WALLACE - Homology Theory on Algebraic Varieties Vol. 7 NOBLE - Methods Based on the Wiener-Hopf Technique for the Solution of Partial Differential Equations Vol. 8 MiKUSiNsIU - Operational Calculus Vol. 9 HEINE - Group Theory in Quantum Mechanics Vol. 10 BLAND - The Theory of Linear Viscoelasticity Vol. 11 KURTH - Axiomatics of Classical Statistical Mechanics Vol. 12 FucHSS - Abelian Groups Vol. 13 KuRATOwsKi - Introduction to Set Theory and Topology Vol. 14 SPAIN - Analytical Quadrics Vol. 15 HARTMAN and MIKUSINSKI - The Theory of Lebesgue Measure and Integration
Vol. 16 KulczYCKI - Non-Euclidean Geometry Vol. 17 KURATOWSKI - Introduction to Calculus Vol. 18 GERONIMUS - Polynomials Orthogonal on a Circle and Interval Vol. 19 ELSGOLC - Calculus of Variations Vol. 20 ALExrrs - Convergence Problems of Orthogonal Series Vol. 21 FUCHS and LEVIN - Functions of a Complex Variable, Volume II Vol. 22 GOODSTEIN - Fundamental Concepts of Mathematics Vol. 23 KEENE - Abstract Sets and Finite Ordinals Vol. 24 DrrKIN and PRUDNIKOV - Operational Calculus in Two Variables and its Applications
Vol. 25 VEKUA - Generalized Analytic Functions Vol. 26 AMIR-MoEZ and FASS - Elements of Linear Spaces Vol. 27 GRADSHTEIN - Direct and Converse Theorems Vol. 28 FUCHS - Partially Ordered Algebraic Systems Vol. 29 POSTNIKOV - Foundations of Galois Theory Vol. 30 BERMANT - A Course of Mathematical Analysis, Part II Vol. 31 LUKASIEWICZ - Elements of Mathematical Logic Vol. 32 VULIKH - Introduction to Functional Analysis for Scientists and Technologists
Vol. 33 PEDOE - An Introduction to Projective Geometry Vol. 34 TIMAN - Theory of Approximation of Functions of a Real Variable Vol. 35 CSASZAR - Foundations of General Topology Vol. 36 BRONSHTEIN and SEMENDYAYEV - A Guide-Book to Mathematics for Technologists and Engineers Vol. 37 MOSTOwsKI and STARK - Introduction to Higher Algebra
Vol. 38 GODDARD - Mathematical Techniques of Operational Research 821
OTHER TITLES IN THE SERIES
822
Vol. 39 TIKHONov and SAMARSKII - Equations of Mathematical Physics
Vol. 40 McLEOD - Introduction to Fluid Dynamics Vol. 41 MoISIL - The Algebraic Theory of Switching Circuits Vol. 42 OTTO - Nomography Vol. 43 RANKIN - An Introduction to Mathematical Analysis Vol. 44 BERMANT - A Course of Mathematical Analysis, Part I KRASNOSEL'SKU - Topological Methods in the Theory of Non-Linear Integral Equations Vol. 46 KANTOROVICH and AKILOV - Functional Analysis in Normal Spaces
Vol. 45
Vol. 47 JONES - The Theory of Electromagnetism Vol. 48 FEJES T6TH - Regular Figures Vol. 49 YANO - Differential Geometry on Complex and Almost Complex Spaces Vol. 50 MIKHLIN - Variational Methods in Mathematical Physics Vol. 51 Fucxs and SHABAT - Functions of a Complex Variable and Some of their Applications, Vol. I Vol. 52 BUDAK, SAMARSKH and TIKHONOV - A Collection of Problems on Mathematical Physics Vol. 53 GILES - Mathematical Foundations of Thermodynamics Vol. 54 SAIJL'YEV - Integration of Equations of Parabolic Type by the Method of
Nets
Vol. 55 PONTRYAGIN et al. - The Mathematical Theory of Optimal Processes Vol. 56 SOBOLEV - Partial Differential Equations of Mathematical Physics Vol. 57 SMIRNOV - A Course of Higher Mathematics Vol. I Vol. 58 Vol. 1I Vol. III Pt. I Vol. 59 ,. Vol. III Pi. 2 Vol. 60 Vol. IV Vol. 61 Vol. 62 Vol. V Vol. 63 NAIMARK - Linear Representation of the Lorentz Group
Vol. 64 BERMAN - A Collection of Problems on a Course of Mathematical Analysis MESHCHERSKII - A Collection of Problems of Mechanics Vol. 66 ARSCOTT - Periodic Differential Equations Vol. 67 SANSONE and CONTI - Non-linear Differential Equations Vol. 65
VOLKOVYSKil, LUNTS and ARAMANOVICH - A Collection of Problems on Complex Analysis Vol. 69 LYUSTERNIK and YANPOL'SKII - Mathematical Analysis - Functions, Limits, Series, Continued Fractions Vol. 68
Vol. 70 KUROSH - Lectures in General Algebra BASTON - Some Properties of Polyhedra in Euclidean Space Vol. 72 FIKHTENGOL'TS - The Fundamentals of Mathematical Analysis, Volume I Vol. 73 FIKHTENGOL'TS - The Fundamentals of Mathematical Analysis, Volume 2 Vol. 74 PREISENDORFER - Radiative Transfer on Discrete Spaces Vol. 71
Vol. 75 FADDEYEV and SOMINSKII - Elementary Algebra Vol. 76 LYUSTERNIK, CHERVONENKIS and YANPOL'SKII - Handbook for Computing Elementary Functions
Vol. 77 SHILOV - Mathematical Analysis - A Special Course Vol. 78 DITKIN and PRUDNIKOV - Integral Transforms and Operational Calculus Vol. 79 PoLOZHII - The Method of Summary Representation for Numerical Solution of Problems of Mathematical Physics Vol. 80 MISHINA and PROSKURYAKOV - Higher Algebra - Linear Algebra, Polynomials, General Algebra Vol. 81 ARAMANOViCH et a!. - Mathematical Analysis - Differentiation and Integration
OTHER TITLES IN THE SERIES
823
Vol. 82 RiDai - The Theory of Finitely Generated Commutative Semigroups Vol. 83 MIKHLIN - Multidimensional Singular Integrals and Integral Equations Vol. 84 LBBEDEV, SKAL'SKAYA and UFLYAND - Problems in Mathematical Physics Vol. 85 GAKHOV - Boundary Value Problems Vol. 86 PHILLIPS - Some Topics in Complex Analysis
Vol. 87 SHREmER - The Monte Carlo Method Vol. 88 P000RZELSKI - Integral Equations and their Applications, Vol. 1, Parts 1, 2 and 3 Vol. 89 SvEsHNIKOV - Applied Methods of the Theory of Random Functions Vol. 90 GUTER, KuDRYAVTSEV and LEVITAN - Elements of the Theory of Functions
Vol.92 GEL'FoND and LINraK - Elementary Methods in the Analytic Theory of Numbers
Vol. 93 GuRBVicH - The Theory of Jets in an Ideal Fluid Vol. 94 LANCASTER - Lambda-matrices and Vibrating Systems Vol. 95 DINCULEANU - Vector Measures
Vol.96 SLUPECKi and BoRKOwsKI - Elements of Mathematical Logic and Set Theory
Vol. 97 Rim - Foundations of Euclidean and Non-Euclidean Geometries according to F. Klein