46 EXERCISES 178-183 178. Explain concisely the principle of mathematical induction. Use induction to prove that
179. (a) Evaluate (b) Show that 180. (a) If dy/d# = j>/# for all non-zero x, and y = 1 when # = 1, find jy in terms of x for # > 0. Determine whether or not the information above is sufficient to determine the value of y when x = — 1. (b) If Ay I Ax = 1 — {2xy/(x2 + 1)} for all x9 and jy = 1 when # = 1, find y in terms of x for all real x.
Second paper 1967 181. Find, with proofs, the suprema and infima of the sets l)
n
^ 4 ^ = 1,2,3,...j and {n.2-: n = 1,2,3,...}.
It may be assumed that rn -> 0 as n -> oo if \r\ < 1. 182. Let a, b, c denote respectively the vectors (1,2,3), (1,0, - 1 ) , (7,4,1). Show that a, b, c are linearly dependent. Find all vectors d which are such that a, b, d are linearly dependent and c.d = 0 and d.d = 1. 183. Show that for two polynomials P and Q to have no common factor it is necessary and sufficient that there exist two polynomials A and B such that 2 A B +
(Any theorem used in the argument must be clearly stated, but need not be proved.) Expand j-^—TTT~I—T\
m
partial fractions.
EXERCISES 184-189 47 184. Give the expression for nCr in terms of factorials, where nCr denotes the coefficient of an~rbr in the expansion of (a + b)n by the binomial theorem. Prove that if p is a prime then p divides pCr for 1 ^ r < p — 1. Hence prove that the divisibility of 2 m — 2 by m is a necessary condition for m to be prime, and by consideration of m = 341 = (2 1 0 -l)/3 show that the condition is not sufficient. 185. Prove that if a < b < c and the function / is continuous on the intervals [a, b] and [b> c] then it is continuous on [a, c]. Prove that the function \x\ is continuous on any interval. Hence prove that if/ and g are continuous on an interval / then so are max(/,£) and min (/,#), where {max (/,£)}(#) is the greater of f(x) and g(x) and {min(f,g)}(x) is the lesser of f{x) and£(#). (Standard theorems on the continuity of sums and differences may be quoted without proof.) 186. Find a, b, c, d so that the function/, defined by the conditions f(x) =-x2 for x<0, f(x) = a + bx + cx2 + dx* for /(#) = x for x > 1,
0 < x < 1,
has a continuous derivative for all x. Determine whether or not / has an inverse. 187. Give, with proofs, the values of \02X
-.
,
,.
am —2-T and
hm *[i/*r 0 where [y] means the greatest integer not greater than y. 188. Explain, in terms of Riemann sums, what is meant by the existence rb
of
f(x) dx. From this definition, prove that Ja
J
m
b-k
rb
f(x + k)dx=
a—k
Ja
f(x)dx.
State, and prove similarly, a similar result for
J
*6/m
f(mx)dx
(m>0).
aim
189. Prove that every solution of the differential equation/"(6) +f(d) = 0 is of the form f{6) = A sin d + B cos 6f where A and B are constants.
48 EXERCISES 190-196 Hence obtain the addition formulae for the functions sin, cos, and tan. Solve the equation 2tan- 1 (l/3) + tan- 1 (l/^) = \-n, 190. Explain what is meant by the statement * the function £-sin* + /(l-cos*) for 0 < t < n defines a simple differentiate arc*. Find the length of the arc so defined. Special paper 1967 191. Let N be the set of natural numbers. Prove that the Cartesian product NxN can be put in one-to-one correspondence with N. Prove also that the positive rational numbers can be put in one-to-one correspondence with the natural numbers. 192. Write down the natural numbers less than 8 which have no common factor (other than 1) with 8, and show that they form an Abelian group under the operation of multiplication modulo 8. Do the same thing with 8 replaced by 5; prove that the two groups that have been obtained are not isomorphic. 193. Find the number m which has a four-digit representation ending with 11 in both the decimal scale and the scale with base 7. Find also two other scales in which the representation of m ends with 11. 194. By using the identity (a + bf = az + bz + 3ab(a + b)y or otherwise, prove that there are no integers py qy ry not all zero, such that = 0. 195. For ordered pairs of rational numbers, let addition and multiplication be defined by (a, b). (cy d) = (ac + bdy ad+be).
Prove that addition and multiplication are commutative and associative, and that multiplication is distributive with respect to addition. Determine whether or not the pairs form a field with respect to these operations. 196. If z± and z2 are two complex numbers, and z\ is the conjugate of zly prove that , _ , , , , , , ,, , Fll — l^ll> \Z1Z2\ — Pi I P2|-
EXERCISES 197-200 49 Prove that the product of two numbers, each of which is the sum of squares of two unequal natural numbers, can be expressed as the sum of squares of two unequal natural numbers in at least two distinct ways (a?-\-b2 and b2 + a2 counting as the same). Express 14,645 as the sum of squares of two natural numbers in four ways. 197. If a, b, c are any three vectors, prove that aA(bAc) = ( a . c ) b - ( a . b ) c . Show that the condition a A c = 0 is sufficient for aA(bAc) = (aAb)AC. Determine whether or not it is a necessary condition. 198. Let rational numbers be written in the formp/q, where/) and q are integers with no common factor. Prove that in any interval there are rational numbers with q even and also rational numbers with q odd. Let a function/be defined as follows: f{Pk) = * if ^ is even, /(/>/?) = 0 if q is odd, /(#) = 0 if x is irrational. 1 f Prove that f(x) dx does not exist as a Riemann integral. Jo 199. Defining ex as the function inverse to log x, prove that if / is a function such that e? >f(x) for x > 0 then rx e* > 1 + f(t) dt for x > 0. Jo Hence prove that
e*> l+x + ^ + ...+—. for 2 n\
x>0.
Prove that, corresponding to a given natural number n, there is a number A(n) such that e? > xn for x > A(n). 200. Prove that if/ is continuous for a < x < b and differentiate for a < x < b, then there is a number c such that f(b) —f{a) = (b — a)f\c) and a < c < b. (If Rolle's theorem is quoted then it must be proved.) Prove that if/is a cubic function then c may be chosen so that 2a + b —^— ^c
.
SOLUTIONS AND COMMENTS FOR EXERCISES 1-50
1. The information conveyed is as follows: (i) B is contained in (is a subset of) A. Equivalently: A includes B\ or, there is no element of B which is not also an element of A. (It is possible to say this in other ways, but one must be careful not to exclude the possibility that B may be empty. In symbols, we usually write B^ A or A ^ B—reserving cz and =^ for occasions when we wish to assert that the inclusion is strict, giving the additional information that A =}= B.) (ii) The same as (i). (iii) A and B do not intersect. Equivalently: there is no element which belongs both to A and to B. (iv) A is the empty set and B is the empty set. Equivalently: A and B are both empty; or, neither A nor B is a proper set. (v) The same as (iii). (vi) A is contained in B. The explanations are simple. For instance, (iv) states that the elements, if any, that belong to A but not to B are precisely the elements that belong to B: since an element cannot both belong and not belong to B> we conclude that B = 0 ; thus A\ 0 = 0 , and therefore A = 0 (since A \ 0 = A for any set A). 2. By definition, AKJB consists of those elements x for which xeA or xeB (the word 'or' being used here in the usual 'wide* sense, not the ' exclusive' sense). Thus, by the symmetry of the definition, A\JB = BUA.
To prove that AKJ(B\JC) = {A\JB)\JC, suppose first that SO that xeA or xeBvC. If xeA then xeAuB and hence xe(AuB)<J C; and if xeBuC then xeAvB or xeC, so that again xe(A\jB)vC. Thus
XEA\J{B\JC\
50
SOLUTIONS AND COMMENTS 2 51 We can similarly demonstrate the opposite inclusion, and so establish the required equation. If a is the single element of A then either (i) a of A u B corresponds to a of A u C, or (ii) a of A u B corresponds to, say, c of A u C while a of AKJC corresponds to, say, b of A u 5 , where C G C and 6 e JB. In case (i) the remaining correspondences between the elements of A u B and those of A\JC give a one-to-one correspondence between B and C, and in case (ii) we can let b correspond to c and then have the remaining correspondences as in case (i). (The hypothesis that A be disjoint from BKJ C is not superfluous, as can be seen by letting A = {1}, B = {2}, and C = A\J B. We can allow A to have more than one element, but not infinitely many: for example, if A consists of all the even numbers, B of one odd number, and C of two odd numbers, then Au B and A\J C are in one-to-one correspondence but B and C are not.) If A, B, and C are disjoint finite sets, having m, n, and p elements respectively, the above results show respectively that m + n = n + my
m + (n+p) = and (m = 1 in the third case) n=p
if « + ! = / > + 1 .
These are the fundamental laws of addition for natural numbers. (The first two of these laws of addition are respectively the commutative law and the associative law; from the third we can deduce, by mathematical induction, the cancellation law, which states that n = p if there is a natural number m such that n + m=p + m. The practice of using the symbol + to denote addition of numbers seems to have originated, together with the use of — for subtraction, towards the end of the fifteenth century, in Germany; it was adopted by the sixteenthcentury Welsh mathematician Robert Recorde, who was the first person to write a noteworthy mathematical book in English, and the first to use = as a mathematical symbol. The symbol •+• is now used for various binary operations, but its use is generally confined to operations that are commutative and associative. The symbol u, denoting union of sets, was introduced, together with n for intersection, by the nineteenth-century Italian mathematician G. Peano; but for much of the first half of the twentieth century + was commonly used instead of u, the term 'sum', or 'logical sum', being used instead of 'union*. Peano's notation has now been universally adopted, and the expressions AvB and A + B can
52
S O L U T I O N S AND C O M M E N T S 3-4
occur in the same context but with different meanings: if A and B are subsets of a set with a binary operation + then, by definition,
and A\B
A+B = {a + b: aeA, beB}; is not written A-B. See 8, 15, 51, 52, 56, 57, 81, 84.)
3. The principle of mathematical induction can be stated as follows. Suppose that for each natural number n (1,2,3,...) we have a certain proposition Pn: then Pn is true for every n if (i) Px is true and (ii) for every n, Pn implies P n + 1 . The principle depends for its validity on the fact that every non-empty set of natural numbers has a least element: thus if Pn were false for some n there would be a least such w, say w0; this could not be 1 if condition (i) holds, so there would then be an w, namely n0 - 1 , for which Pn is true but Pn+1 is false, contrary to condition (ii). (a) If Pn is the proposition * the set Et u E2 u ... u En is finite provided that each of the sets EvE2i ...,i? n is finite' then condition (i) obviously holds, and condition (ii) holds because
and the union of any two finite sets is finite. (If two sets have respectively m elements and n elements, their union has at most m + n elements.) (b) If Pn is the proposition
then condition (i) obviously holds, and condition (ii) is easily established by elementary algebra (adding (2n + 1 ) 2 to both sides of the equation that represents Pn, and then re-arranging the right-hand side). (Mathematical induction is a particular mechanism of deduction, and is thus fundamentally different from scientific induction, which is a method of inference having an inherent possibility of error. However, when we use mathematical induction—often called * induction' for short—we are usually seeking to prove a conjecture to which we have been led by a process of scientific induction, or by some other heuristic device which is not valid as a method of proof. Use of the imagination to produce interesting conjectures, which one subsequently tries to prove or to disprove, is an important part of creative work in mathematics.) 4. The classical proof that there are infinitely many prime numbers can be adapted in the following way. Let pv ..., pn be prime numbers of the
SOLUTIONS AND COMMENTS 5-6
53
form 3& + 2 (that is, congruent to 2 modulo 3). Ifpip2 • • -pn 1S divided by 3, the remainder is 1 if n is even, 2 if n is odd. Hence eitherpxp2 ...pn + lor P1P2 • • • Pn + ^ is of the form 3k + 2. Now, neither of these two numbers is divisible by any of the numbers ply ..., pni but any number of the form 3& + 2 has at least one prime factor of that form (since every prime number other than 3 is either of the form 3£ +1 or the form 3k + 2, and a product of numbers of the form 3k +1 is of the same form). Thus there is a prime number of the form 3& + 2 which is not one of the numbers Pv *">Pn' This shows that there are infinitely many prime numbers of the form3& + 2. The same type of argument can be used for prime numbers of the form 4& + 3. (Note, however, that it will not work in the case of prime numbers of the form 3k + 1 , which is much more difficult.) 5. The l.c.m. of the numbers 1,..., n can be found as follows. Express each of the numbers 2,..., n as a product of powers of prime numbers, and for each of the distinct prime numbers so obtained choose the highest power of it that occurs in any of the factorizations: then the product of these highest prime powers is the l.c.m. (This follows from 'the fundamental theorem of arithmetic*.) Thus the l.c.m. is 2mk> where k is a product of powers of odd primes and is therefore odd, and 2m is the highest power of 2 that divides any of the numbers 2,...,«. Since a factor of any of the numbers 2, ..., n is itself one of these numbers, m is the greatest integer such that 2m < n. Now consider the integers 2mk/j\ where/ = 1,2,3,...,«. One of these is odd (the one for which/ = 2m), but all the others are even: the sum of these integers is therefore an odd number, say r. If s = 2mky which is an even number, then . . . 2 3
n s
In a similar way, but considering powers of 3 instead of powers of 2, we can show that, if n > 1, 1 1 1 1+ 3+5+-+2^1 is the ratio of a number which is not divisible by 3 to a number which is divisible by 3. 6. We know that the h.c.f. of the integers m and n can be expressed in the form rn + sniy where r and s are integers (which can be calculated by Euclid's algorithm). If m and n have no common factors (other than 1)
54
SOLUTIONS AND COMMENTS 7
their h.c.f. is 1, so that 1 =rn + sm; and neither m nor n is 0, so that mn 4= 0, and \ r s | mn m n Now any rational number can be expressed in the form
hh-K' where a is an integer and bv ..., bn are powers of distinct prime numbers. We wish to show that there are integers av ..., an such that a-i
a
± i
==
(in i
5*
Assume, as induction hypothesis, that this is the case for some value of n. (It is obviously so when n = 1.) Let bn+1 be a power of another prime number (possibly the first power), so that bn+1 has no factor in common with any of the numbers bv ..., bn. Then
and by the result proved above there are integers a{, d[, ..., any d'n such b b
i n+i
Thus
K
bn+i' a
' KK+i -*1
oxb%...bnbn^
I
bn
K
K+i*
1 a'n I
bn
bn+1
where an+1 is the integer a{ + ... +«^. The desired result now follows by induction. 7. For any integer w greater than 1, let M be the set of integers 0, 1, ..., m—\. If x, yeM we define x+y to be the remainder after division of x +y by m. Thus the equation
x+y = z mr has the same meaning as the congruence x +y = z
(mod m).
Similarly, we define x x y to be remainder after division of xy by m. It is easy to verify that with + and x as * addition' and * multiplication' respectively, M is a commutative ring: this is the ring of integers modulo m
SOLUTIONS AND COMMENTS 8
55
(also called the ring of remainders, or of residues, modulo m); it has the integer 0 as its zero element, and it has a unit element, namely 1. For this ring to be a field, it is necessary and sufficient that for each non-zero element x of M there should be an element x' of M such that x' x x == 1, m
that is, x'x = 1 (mod m). (A field is by definition a commutative ring in which every non-zero element x has a reciprocal, or Multiplicative inverse'—an element whose product with x is multiplicatively neutral; see 9.) If m is not prime then m = xy, where x, y e M; then x xy = 0, so that if x' x x = 1 then m
y = (xr x x) xy = xf x(x xy) = 0,
whereas y 4= 0 because w =J= 0. Thus the ring cannot be a field if m is not prime. If m is prime, let x be any non-zero element of M: then the products Oxx, 1 xx, ...Am — l)xx are all different; for if h x x = k x xt r
m '
m '
fK
'm
'
mm
where 0 ^ h < k < my then (k -h) x = 0 (mod m), and this implies, if m is prime, that m divides k — h or x, both of which are less than m. The set of these products is thus the whole of M; so one of the products is 1, and it follows that the ring is a field. (The above proof can be analysed as follows, (i) The ring in question has divisors of zero—that is, 0 can be expressed as a product of non-zero elements in the ring—if and only if m is composite, (ii) There can be no divisors of zero in any ring which is afieldor a part of a field, (iii) A commutative ring that has a unit element and no divisors of zero—that is, an integral domain—is necessarily a field if it is finite. Note that an infinite integral domain need not be a field—though if it is not itself a field it can always be regarded as a part of a field, in the way that the ring of integers can be regarded as a part of the field of rational numbers.) In the ring of integers modulo 7, 3 is the reciprocal of 5, since 3 x 5 = 1 (mod 7). Since 2 x 5 = 1 (mod 9), 5 has a reciprocal, namely 2, in the ring of integers modulo 9. 8. By the symmetry of the definition,
AAB=BAA for any subsets A> B of E; the binary operation A is thus commutative. Also, x G A A B if and only if x belongs to A or to B but not to both. Hence xeA A (B A C) if and only if x belongs either to only one of A, B, C, or else to all three; and we have the same criterion for x to be an element of
56 SOLUTIONS AND COMMENTS 9 (A A B) A C, so the operation A is associative. For any subset A of E, A A 0 = A, so 0 is a neutral element for A; and so ^4 has an inverse (namely itself) with respect to A. Thus the subsets of E form an Abelian group with respect to A. The operation r» is commutative and associative. Also, if A, B, C are any subsets of E, that is, n is * distributive over A*. Thus, with A as addition and r\ as multiplication, the subsets of E form a ring. Clearly, this ring has a unit element, namely E. If AnB = E then A = E, so that E is the only element that has a reciprocal in the ring. (We have here an example of a * Boolean ring', so called after the nineteenth-century Anglo-Irish mathematician George Boole, whose work on 'the laws of thought* was very influential in the development of pure mathematics; see 80, 106. The symmetric difference of two sets has sometimes been called their ' sum modulo 2 ' ; it is the same as their union if and only if the sets are disjoint. The operation A is used in the subject known as ' combinatorial topology'—which is a part of' algebraic topology*. Intersections of sets were formerly called 'products', or 'logical products'.) 9. A field is a commutative ring in which the non-zero elements form a group with respect to multiplication. (See 57.) Inanyring,
^ = x{0
+
0) = xO+xO,
and therefore xO = 0; similarly, Ox = 0. Hence so that ( — x) (— y) = — (— x) y; similarly,
(-x)y + xy = (~x + x)y = 0, so that (— x)y = — xy. Thus ( — x) (—y) = — (— xy) = xy. If the ring is a field, and x 4= 0, then x has a unique reciprocal xr1: since xx~x = 1, it follows from what has just been proved that so that — x~x is the reciprocal of — x. Also, if xy = xz and x ={= 0 then
y = {x~xx)y = x-^xy) = xr\xz) = (x^x) z = z.
S O L U T I O N S AND COMMENTS 10-11
57
(Note that we have made no use here of the commutativity of multiplication, or even of the commutativity of addition; we have however used the fact that a ring is a group with respect to addition, and we have used both the left-hand distributive law and the right-hand distributive law. Thus the rules of elementary algebra that are sometimes loosely expressed by the statement' two minuses make a plus' can be seen to be valid in a fairly general context. Up to the point where we took the ring to be a field, we made no use of the associativity of multiplication.) 10. Suppose that, in a given field, x and y are square roots of the same element. Then x2 = y2y so that (x-y)(x+y) = x2-y2 = 0. Now, in a field a product is zero only if at least one of the factors is zero. Hence if x =J= y then x +y = 0, so that x = —y. Thus no element of a field has more than two square roots in the field. If 2 had a rational square root there would be integers p and q, with no common factor, such that / \2 _ o that is, p2 = 2q2. Thus 2 would divide p2t and would therefore divide p (since 2 is prime); but if p = 2r, for some integer r, then q2 = 2r2, and this implies that 2 divides qy so th&tp and q would have 2 as a common factor. This contradiction shows that 2 has no rational square root. (In the same way we can show that no other prime number has a rational square root.) Since 3 2 = 2 (mod 7), 3 is a square root of 2 in the field of integers modulo 7. (4 is another square root of 2 in this field.) 11. Since F is a field it has a unit element (with respect to the usual operation of multiplication); this is a non-zero rational number u such that u2 = w, and therefore u = 1, the rational unit. If F contains a positive integer n then it contains n +1; hence, by induction, F contains every positive integer. Also, F has a zero element; this is a rational number z such that z + 1 = 1, and therefore z = 0, the rational zero. It follows, since F is a field, that F contains every integer, and every quotient of an integer by a non-zero integer. Thus F consists of all the rational numbers. (This result can be expressed by saying that the field of rational numbers has no proper subfield, or that it is a minimal field. For any prime number p, the field of integers modulo p is also minimal in this sense.) 3
WSE
58
S O L U T I O N S AND C O M M E N T S 12
12, A totally ordered field is a field F in which a particular set P is specified in such a way that (i) if x eP a n d y e P then # + y e P and xy eP, and (ii) if xeF then either # e P or —xeP or x is zero, these last three possibilities being mutually exclusive. T h e elements of P are the positive elements of the totally ordered field. The conditions (i) and (ii) imply that if x is any non-zero element of F then x2eP; so that if 1 is the unit element of F then leP. Hence the sums 1 + 1,1 + 1 + 1,... belong to P . These sums are all different, for if two of them were equal then one of them would be zero and would consequently not belong to P . Hence if we write ^ ^ ., .,
1 + 1+ ... +1 = nl, where n is the number of terms on the left, the correspondence n~ nl is one-to-one. We can extend this correspondence by associating the rational number (m — n)jp with the element {ml — nl)jpl of F, for any natural numbers m, n, p, and we thereby establish an isomorphism between the field of rational numbers and a certain subfield, Q say, of F; and the positive rational numbers then correspond to the positive elements of Q. In this sense F 'contains' the system (totally ordered field) of rational numbers; we can identify 1 with 1, nl with n, and so on. If x and y are elements of a totally ordered field, the statement '# is greater than y(x>y)\ or equivalently 'y is less than x(y<x)\ means that x —y is positive. Thus, in such a field, every element other than x is either less than x or greater than x. The relations > and < are transitive, because sums of positive elements are positive; and 'positive' has the same meaning as 'greater than zero'. (Negativemeans 'less than zero'.) The system R of real numbers is a totally ordered field with the following property: for any non-empty sets E and H such that every element of E is less than every element of H, there is at least one element £ of R such that no element of E is greater than £ and no element of H is less than £. This property distinguishes R from all other totally ordered fields (except isomorphic copies of itself); see 62,67. The system of rational numbers can be regarded as a part of R, but there are irrational real numbers also. In a totally ordered field, any quotient of positive elements is positive: hence if we are given a positive element x we can find a positive element less than x, for instance \x. Now if c is a positive real number, let E be the set of all positive real numbers whose squares are less than c, and H the set of all positive real numbers whose squares are greater than c.
Then E and H are non-empty (leE if x > l,ceE'tfc^
1, zndc + leH),
and every element of E is less than every element of H. Hence there is a
S O L U T I O N S A N D C O M M E N T S 13-14
59
positive real number £ such that no element of E is greater than £ and no element of H is less than £. If £2 < c, let 8 be a positive number which is less than £ and also less than (£-£ 2 )/3£: then so that £ + 8eE, although £ + # > £ ; this contradiction proves that £2 ^ £. We can show similarly that £2 ^ c, and it follows that £2 = c. Any subfield of R is a totally ordered field. If c is a positive rational number, the numbers of the form r + Sy/c, where r and s are rational, form such a subfield: this is not the whole of R, for it does not contain the square root of every positive number; and it is not the field Q of rational numbers if <Jc is irrational (for example if c — 2). See 68. 13. An upper bound of a set E of real numbers is a real number b such that no element of E is greater than b. If E = 0 then every real number is an upper bound of E; but if E # 0 and 2? has upper bounds then (by the special property referred to in 12) Ehas a /*«£ upper bound (necessarily unique): this is the number sup2?. (The symbol 'sup' is an abbreviation of the word 'supremum', whose meaning must be carefully distinguished from that of ' maximum': the existence of sup E does not imply the existence of max E, but if max E exists then sup E = max E. There is a similar distinction between 'infimum' and 'minimum'.) If E consists of all the rational numbers less than x then x is an upper bound of E, so sup E ^ x: if sup E < x there is a rational number r such that supE < r < x, contrary to the definition of supZ?; thus supE1 = x. (See 62, 67.) 14. If x > c +1, where c > 0, then so that if xz ^ c then x ^ c + 1 . Thus c+\ is an upper bound of the set E of all real numbers x such that Xs ^ c. Let b — sup E. There are some positive numbers in E (leE if c> ly and cel? if c <, 1): hence b > O.lfb3 < c let ^ be a positive number which is less than b and also less than (c-b3)/7b2: then cy so that b + 8eE, contrary to the definition of b. Thus bz ^ c.l£bz > c let S be a positive number which is less than b and also less than (b3 — c)/4b2: then, by the definition of b, there is an x in 2? such that x > b — 8y and then 3-2
60
SOLUTIONS AND COMMENTS 15-16 z
but x ^ c, since ceE. It follows that bz = c. Hence ( — b)z = — c. Thus every positive or negative real number has a real cube root; and, of course, 0 is a cube root of 0. (See 66.) (In fact each real number has only one real cube root, since xz -yz
= (X -y) (x2 + xy +y2) = (x - y ) {(* + iy)2 + 3//4}
and (x + \y)2 + 3j>2/4 > 0 if x and y are real and not both zero. However, in the larger field of complex numbers—which is not a totally ordered field—every non-zero number has three distinct cube roots.) IS. A set of real numbers is bounded if and only if it has upper bounds and lower bounds. Let b = sup E and c = sup H, where E and H are non-empty sets of positive real numbers. If xeE and ye H then x ^ b and y < c. The first of these inequalities implies, since y > 0, that xy < by; and the second implies, since b > 0, that by < be. Hence xy < be, so that be is an upper bound of the set EH. Thus EH is bounded (0 being a lower bound), and sup EH < be. Let a = supEH. If a < be let * = (bc-a)/(b + c). Then b-i
< b and
c-£ < ct so that, for some x inland some y in H, x > b-dandy > c — 8; then
xy > (b-3)(c-d) = bc-(b + c)8 + d2 > bc-(b + c)S = a,
whereas xy < a, since xyeEH. It follows that a = fc. If E consists of the numbers 0 and — 1, and if H = E> then supi? = sup if = 0 but sup EH — 1, so that in this case sup EH 4= sup E sup H. 16. If 0 < a < b in a totally ordered field then, as is easy to prove, a < x < b if and only if a2 < x2 < b2. Now the statement that 1-022 is an accurate estimate of x to 3 decimal places means that and hence that
1-0215 < * < 1-0225, 1-04346225 < x2 < 1-04550625,
which does not imply that 1-0435 < x2 < 1-0445. Thus we cannot infer, from the given information about x, that 1-044 is an accurate estimate of x2 to 3 decimal places. However, since (1-0105)2 < 1-0215 and
1-0225 < (1-0115)2,
S O L U T I O N S A N D C O M M E N T S 17-18
61
the given information does imply that 1-0105 ^ V^ < 1-0115, which is to say that 1*011 is an accurate estimate of *Jx to 3 decimal places. 17. Since k > 1, inf E > 0 if and only if kmlE>miE. But, by the definition of inf E, a > iniE if and only if there is an x in E such that x < a. T h u s inf E > 0 if and only if there is an x in E such that x < k inf E. Since Ex and E2 are sets of positive numbers, inf Ex ^ 0 and inf E2 ^ 0. If inf Ex > 0 there is an element hrn of Ex such that so that k'11"1 < infEl9 which is impossible since k~n~1eE1. Thus infEx = 0. To deal with E2, let E be the set of all numbers knjn\ with n > k2 -1; then E c E2i and therefore inf E > inf £ 2 . If inf E > 0 there is an n greater than k2 - 1 such that kn -x
7
18. Let / ( # ) = ax2 + 2foe + c> where a, b> c are real, and suppose that f(x) ^ 0 for every real x. If a = 0 then 6 = 0, for otherwise we could take x to be — (b2 + c)/2bt making f(x) negative: thus if a = 0 then b2 = ac. If a > 0 then ^ as/ _ £j a \ = ac^ so b2 < ac. T h e case in which a < 0 does not arise, because, if a =N 0, 6\2
c b2]
«/
«
a2)
which has the same sign as a when |#| is sufficiently large. Now let
a = a2+ ...+
a2
b = a1b1+ ...
+anbn,
62 Then
SOLUTIONS AND COMMENTS 19 /(*) = (a1x + bxf + ... + (an x + bn)\
so that/(#) ^ 0 for all real values of x. Hence b2 < ac. Therefore
(a1 + b1?+ ... + K + &J2 = a + 2b + c «S a + 2(ac)i + c = (ai + c*)2, so that (Note that the last inequality is strict unless b2 = ac, and that if al9..., an are not all zero then b2 = ac only if there is a real number x for which f(x) = 0, that is, only if the n simultaneous equations have a solution. Note too that we can reverse the signs of all the numbers bv ..., bn—a change which does not affect the right-hand side; and that the last inequality can then be interpreted geometrically as stating that the length of any side of a triangle in n-dimensional Euclidean space is less than the sum of the lengths of the other two sides—hence the name * triangle inequality'. The inequality
(axbx+ ... +anbnf < (a?+ ... +« n 2 )(V+ - + V ) can be interpreted in terms of 'scalar products'—see 72; it has various generalizations, and is associated with the names of the nineteenthcentury mathematicians Cauchy, Schwarz, and Bunjakowski—respectively French, German, and Russian.) 19. A complex number is an ordered pair, {xyy\ of real numbers; that is to say, it is a 2-dimensional real vector. Complex numbers are added and multiplied according to the following definitions of sum and product:
With these definitions the set of all complex numbers becomes a field, in which the elements of the form {x> 0) constitute a subfield isomorphic, under the correspondence (x, 0) ~ x, with the field of real numbers. It is customary to identify (x, 0) with x, and to denote (0,1) by /, so that (x,y) may be written as x + iy. If we write z for x + iy, we often write re# for x and im# for y, and call these real numbers the real part and the imaginary part of z respectively; \z\> the modulus ofz, is the non-negative real number (x2 +y2)%. The complex numbers / and — i are square roots of
S O L U T I O N S A N D C O M M E N T S 20
63
— 1; and i is sometimes written as ^( — 1). (For reasons of their own, electrical engineers often denote ^( — 1) by/ rather than /.) A square root of (x,y) is a complex number (§, rj) such that
that is, such that
£2 - rj2 = x
and Ifyj = y.
If jy = 0 and a? ^ 0 these equations are satisfied if rj = 0 and £ = #i; if j> - 0 and x < 0 they are satisfied if £ = 0 and 9/ = ( — x)b. If 3; 4= 0 they are satisfied if TJ = y/21; and 2 y and this last equation is equivalent to the equation
which is satisfied if £ is a square root of the positive number
Thus (x,y) has a square root in every case. (Alternatively, we can observe that (z+ \z\)2 = pz, where p = 2(\z\ +re#); if z is not real then p > 0 and hence p~i(# + \z\) is a square root of z. If # is real and negative then i(—z)b is a square root of z.) Any quadratic equation in the field of complex numbers can be solved by the method of * completing the square', which succeeds because of the existence of square roots in all cases. (The field of complex numbers has a much more remarkable property, discovered at the end of the eighteenth century by the German mathematician C. F. Gauss: it is an algebraically closedfield,in the sense that every polynomial equation whose coefficients are in the field has a root in the field. The theorem that tells us this is known as the fundamental theorem of algebra, though it is essentially a theorem of analysis. Algebraic methods—methods involving only rational processes and * root-extraction '—are available for solving all cubic and quartic equations in the field of complex numbers, but it is known that no such method can exist for the general equation of degree n if n > 4. See 71, 85.) 20, It is easy to verify that, with the given operations, F x F is a commutative ring in which the field F is isomorphically embedded by the correspondence x ~ (x> 0); and this is true for any field F. If (and only if) F contains no square root of — 1, then x2 +y2 = 0 in F if and only if x — 0
64 SOLUTIONS AND COMMENTS 21 andy = 0, so that any non-zero element (x,y) of the ring FxF has a reciprocal in F x F> namely and so Fx Fis a field. If z = (0,1), or if z = (0, - 1 ) , then z2 = ( - 1 , 0 ) so that #2 +1 = 0 . (The interesting point here is that the method of constructing the field of complex numbers from that of the real numbers can be applied to any field in which there is no square root of — 1, giving a larger field in which there is a square root of — 1. Similar constructions can be used to 'create' other square roots. See 71.) If/> is a prime number, the field of integers modulo/) contains a square root of — 1 if, for example, p — 1 is a perfect square. But there are prime numbers p (3 and 7 being among them) such that if F is the field of integers modulo p then FxF9 with the given operations, is a field having p2 elements. 21. For any complex number zy the conjugate number z* (sometimes denoted by z) is defined to be rez — iimz. Thus re#* = re* and imz* = — imz. It follows at once that z** = z for every complex number z, and that the real numbers are the self-conjugate complex numbers. (Conjugacy can thus be interpreted geometrically as a symmetry of the plane of complex numbers about the 'real axis'.) The correspondence z ~ z* is one-to-one, and if a — x+iy and /? = u + ivy where x, y, u, v are real, then (a + fl)* = x + u-i{y + v) = x-iy + u-iv = <x* + fi*9 (a/?)* = xu —yv — i(xv +yu) = (x—iy) (u — iv) = a*/?*. The correspondence is thus an isomorphism of the field of complex numbers with itself (it is an automorphism of the field). Hence if /(A) = a0Xn + a1Xn~1+ then
... +an_1X + anJ
/(A*) = ao( A*)» + ^(A*)"" 1 + ... + an_x A* + an
= ao(\")* + a1(\«-i)*+ - . +an..1A* + any /(A)* = aQ*( A*)* + af{ A""1)* + ... + a*^ A* + a*. It follows that if aQi al9 ..., an are all real then/(A # ) =/(A) # , so that, in this case,/(A*) = 0 if (and only if )/(A) = 0. (Another property of the automorphism z ~ z* is that |#*| = \z\ for every complex number z. It is interesting to note that there are only two 'modulus-preserving* automorphisms of the complex field, the other being the one in which every number corresponds to itself. To prove this,
SOLUTIONS AND COMMENTS 22
65
observe first that for any automorphism z ~ z' of the field, V = 1 and hence r' = r for every rational number r. Then if x is any real number, and r is rational, \x'-x\ < | * ' - r | + | r - * | = |(*-r)'| + | # - r | , so that if moduli are preserved then \x' -x\ ^ 2 \x—r\; and since r can be chosen as close to x as we please, it follows that \x' — x\ = 0, which is to say that x' = x. Finally, (i'f = (i2)' = ( - 1 ) ' = - 1 , so that i' = ±i. See 71, 88. We have here an example of an 'isometry': see 73.) 22. A 3-dimensional real vector is an ordered triad, (x,y,z), numbers (scalars). If r l s ..., r n are n such vectors, say *1 = (*l>yi> *l)»
••>
r
n = {XwVw
of real
Z
n)>
and if ocv...,ocn are n scalars the linear combination oc1r1+ ... + a n r n is, by definition, the vector (x,y, z) for which x = oc1x1+ ... +anA:n,
in abbreviated notation, Ht*>v*v = v
v
(li^X^CC^^OC^). v
v
If E consists of all the linear combinations of r l f ..., r n , let vl9..., v m be m elements of E. Then there are mn scalars such that Vi = Ti0Ci,vrv> •••> ym = Tt(X"m,vrv V
Hence, for any scalars
V
where yv = otlvfi1+ ... +0Lmvfim (^=1,...,«). Thus any linear combination of elements of 2? is itself an element of E. Suppose that p, q, and r are linear combinations of r x and r 2 . If p = 0 then p = Oq + 0r, so suppose that p 4= 0. Then p = a1r1+a2r2 and the scalars ocv a2 are not both zero; suppose that oc± =f= 0. Then r x is a linear combination of p and r 2 (in fact rx = af x p - <%f2 a2 r 2 ); so q, being a linear combination of rx and r2, is a linear combination of p and r 2 : we can write q = ^ ! p + ^ 2 r 2 . If /?2 = 0 then q = / ^ p + Or, so suppose that /?2 4= 0. Then r 2 is a linear combination of p and q, and hence rx also is a linear combination of p and q (since it is a linear combination of p and r 2 ). It follows that r is a linear combination of p and q.
66
SOLUTIONS AND COMMENTS 23-24 (Note that in this discussion we have used the fact that the system of real numbers is a field, but we have used no other properties of this system. Accordingly, the whole discussion is valid for * vectors over a field F\ provided that the term 'scalar' is understood to mean 'element of F\ Moreover, the vectors need not be 3-dimensional: the number 3 can be replaced here by any natural number, provided that this ' dimension' is the same for all the vectors considered. The discussion can be extended, in an obvious way, to provide a proof of the basis theorem. See 119.) 23. If r x = (xvyly zx) and r 2 = (x2,y2y ^2) then, by definition, and r ^ i ^ is called the scalar product of the vectors r x and r 2 (evidently, xx. r 2 = r 2 . r x ). The length, |r |, of a real vector r is the non-negative scalar ^(r. r). The statement that the vectors rx and r 2 are orthogonal to each other means that r1. r 2 = 0. If p = (1,0,2) and q = (2,2, - 1 ) then p . q = 2 + 0 - 2 = 0, so that p and q are orthogonal to each other. A vector (x,y, z) is orthogonal to both p and q if
x + 2z = 0 and 2x + 2y-z = 0, and these equations are satisfied if x = — 2z and y = fz, z being arbitrary. Now if r = ( — 2z, f #, z) then
|r| = H|(-2,f,l)| = HfV5 = fH|p| so that |r| = |p| if z = + f. Thus each of the vectors ± ( - f , f, f) has the same length as p and is orthogonal to both p and q. (A general method of finding a vector r orthogonal to each of two 3-dimensional vectors p and q is to calculate the vector product p A q and then take r to be a(pAq) for any scalar a. If p A q =J= 0, a can be chosen so that r has a prescribed length. Note that in defining the length of a real vector we use the fact that the system of real numbers is a totally ordered field in which every positive element has a square root: we cannot assign a 'length* in this way to a vector over an arbitrary field.) 24. If rt = (xvyv
z±) and r 2 = (x2iy2j #2) then, by definition,
r x A r 2 = (yt z2 -y2 zl9 zx x% - z2
Thus r 2 A rx = (y2 z± -yx # 2 , z2 xx - zx x2y x2yx - xxy2) = - r± A r 2
SOLUTIONS AND COMMENTS 25
67
Also, r x . r x A r 2 = x±(yi z2 -y2 zt) +y1(z1 x2 - z2 xt) + z1{x1y2 - x2yx) = 0, and ra.TiATa = -rg.raAT! = 0, so that r x Ar 2 is orthogonal to rx and tor 2 . r^tiktz = 1 and r = a 1 r 1 + a 2 r 2 + a 3 r 3 If then
r. r 2 A r 3 = ax r x . r 2 A r3 + a 2 r 2 . r2 A r 3 + oc3 r 3 . r 2 A r 3 = ax ,
since r 2 A r 3 is orthogonal to r 2 and to r3. Similarly, we find that and
a 3 = r.r 1 Ar 2 .
(We write rx.r2 A r 3 for r x . (r2 A r3) since there is no risk of ambiguity; and since r 1 .r 2 Ar 3 = r 1 Ar 2 .r 3 this scalar triple product is sometimes denoted by [rl9 r2, r 3 ]. Note that if any two of the factors in a scalar triple product are interchanged then the sign of the product is reversed. Note also that although the vectors involved in this discussion must be 3-dimensional, they need not be real: the whole discussion is valid for 3-dimensional vectors over any field. The theory of scalar triple products is essentially the theory of 3-by-3 determinants. See 118.) 25. If a and ft are any scalars then
but this is 0 because UAV is orthogonal to u and to v. Thus UAV is orthogonal to every linear combination of u and v. Hence if r = au + /?v + yuAv, then
r . u A v = y|uAv| 2 .
Now suppose that UAV 4= 0 (so that |UAV| 4= 0) and that r = 0. Then 7 = 0. Hence A 1 0= a and therefore (since u A 0 = 0) 0 = au
Au
+ /?u A v = /?u A v,
so that /? = 0. Therefore 0 = au, so that a = 0 (u == j 0 since UAV #= 0). Thus, if u A v 4= 0, the equation r = 0 is satisfied only if the scalars a, /?, and y are all zero; and this means that the vectors u, v, and UAV are linearly independent. It is a consequence of the basis theorem, for 3-dimensional vectors, that no 4 vectors can be linearly independent (see 119). Thus for any
68
SOLUTIONS AND COMMENTS 26-27
4 vectors u, v, w, r there are scalars, a, /?, y, 8, not all zero, such that = 0. If u, v, and w are linearly independent then 8 =t= 0 (whatever r may be), so that r is a linear combination of u, v, and w. In particular, if u A v 4= 0 then any vector r is a linear combination of u, v, and u A v. But we have seen that if r
=
2
then r.uAV = y |UAV| ; thus if r is orthogonal to UAV, and UAV =J= 0, then y = 0, so that r is a linear combination of u and v. (Note that we have proved that if UAV 4= 0 then a vector is a linear combination of u and v if and only if it is orthogonal to UAV; and that the whole discussion is valid for 3-dimensional vectors over any field.) 26. The vectors r for which r 2 = 1 (the unit vectors) constitute the sphere with centre 0 and radius 1. Those for which r . e = 1 constitute the plane through | e orthogonal to e. The set C is the intersection of these two surfaces. Since (r_Je)2 = r2_§r.e + i r lies on C if and only if r. e = 1 and (Thus C is a circle, with centre Je and radius ^§.) We can choose non-zero vectors f and g, of prescribed length A say, which are orthogonal to each other and to e. Then r. e = 1 if and only if r = Je + #f+j;g, where x andy are real numbers. Since e, f, and g are linearly independent, this last equation establishes a one-to-one correspondence r ~ (xyy), between the plane r . e = 1 and the complex plane. Since the points of C are represented, in this correspondence, by the complex numbers (x y) for which , . .„ v t J1 (x92+y29)A2 = 2f. Thus if we take A to be ^f we obtain a one-to-one correspondence between C and the unit circle of complex numbers. 27. According to the given definition of multiplication, ((*i> h) (r2> *2)) (r3> ^) = ( r i A r a + *! r a +1 2 rv t± tz-rx. r2) (r3, *3)
SOLUTIONS AND COMMENTS 27
69
where r = (rjA^Arg + ^rgAra + ^riArg + ^ ^ - r i . r g J r g t = ^1^^-r1.r2^-r1Ara.r3-^1ra.r3-^r1.r8;
and
(*i h) ((*2> '2)fa»*8)) = (*i, '1) (ra A r 3 +1 2 r 3 + *8 r2, *2 *3 - r 2 . r3) where r' =
Now
t' = * x V3-*i r 2- r 3- r i- r 2 A r 3-*2 r 3- r i-*3 r i- r 2-
r-r' = (^Ar^A^-riA^Ar^-ri.rgrg + ra.rgr! = 0 by the given identity, so r = r'; and t = t' because r 1 Ar 2 .r 3 = r 1 .r 2 Ar 3 . Thus (r, t) = (r', *'), so that multiplication is associative. It is not commutative, since (*i> 0) (r 2,0) = (rx A r2, - v1. r 2 ), and
r 1 Ar 2 4=r 2 Ar 1
(r2,0) (rv 0) = (r2 A rv - r 2 . r 2 ), if r 1 Ar 2 4=0.
For any vector r and any scalar t, (0,l)(M) = (r,0
and (r,*)(0,1) = (r,t),
by the laws of vector algebra. Also,
If (r, ^) =}= (0,0) then ^2 + r 2 is a positive real number; and if s =
then
_ (^2 + r 2)-i r
and w = (t2 + r2)"1
(s, W )(r,0 = (r,0(s,«) = (0,l).
(With multiplication defined in this way, and with addition defined in the usual way, the 4-dimensional real vectors form a ring; this is a division ring, in that every non-zero element has a two-sided multiplicative inverse. The elements of this ring are known as quaternions. Those of the type (0, t) form a subring isomorphic with the field of real numbers; and there are subrings isomorphic with the field of complex numbers. If i, j , k is a right-handed orthonormal sequence, the four
70
SOLUTIONS AND COMMENTS 28-29
quaternions (i, 0), (j,0), (k,0), (0,1) have an interesting multiplication table. If the w-dimensional real vectors, with addition defined in the usual way, are given a scheme of multiplication so as to form a division ring R containing, in a certain natural sense, the field of real numbers, then by a famous theorem due to the nineteenth-century German mathematician F. G. Frobenius, either n = 1 and R is the field of real numbers, or n - 2 and R is the field of complex numbers, or n = 4 and R is the ring of quaternions: there are no other possibilities. Quaternions were introduced—though not in the way described here—by the nineteenth-century Irish mathematician W. R. Hamilton, who wrote extensively on their uses in mathematical physics.) 28. To find the h.c.f. of polynomials p(x) and q(x) we can use Euclid's algorithm, to give a succession of remainders, of decreasing degrees, each of which is divisible by every polynomial that divides both.p(x) and q(x): the last non-zero remainder divides both^>(#) and q(x), and is therefore the h.c.f. In particular,
so that the h.c.f. of the given polynomials is x2 + x +1, unless thefieldof coefficients is such that 2 = 0 (in which case the given polynomials are equal). As a check, note that
29. I f # > 0 t h e n
= \*oPn + a±Pn-\ +... + ViW"" 1 + an t\ > and this is an integer ifp and q and the coefficients a0, alt..., an are integers. Moreover, this integer is positive unless f(p/q) = 0, so df(plq) 4= 0 then « - | / ( p / « ) | > l , that is By the remainder theorem, if /(A) = 0 then, for any number x,
SOLUTIONS AND COMMENTS 30
71
where the coefficients b0, bv ..., bn_1 do not depend on x. Now unless f(x) = 0 for every x> which we assume not to be the case, the equation /(*) = 0 has not more than n real roots; so if A is an irrational root there exists a positive number 8 such that the equation has no rational root between A — 8 and A + 8, Let/> be an integer, and q a. positive integer, and suppose first that \(p/q)- A| < 8. Then f(pjq) # 0, so that \f(p/q)\ > l/? n ; and if x = p/q then \x\ < 8 + | A|, so that
which is a positive number, c say, independent of x. Thus
l/S"/?)- A| > 5 then
Hence if k is the lesser of the positive numbers 8 and \\c then, in either case
'
(What we have proved here is a famous theorem due to the nineteenthcentury French mathematician J. Liouville. It is the foundation of a substantial theory concerning irrational numbers that are * badly approximable' by rational numbers. A real number is said to be algebraic if it is a root of a polynomial equation with integer coefficients, and to be transcendental otherwise; the algebraic numbers form a subfield of the field of real numbers, but one can deduce from Liouville's theorem, by an argument which he devised, that transcendental numbers exist in every open interval.) 30. By definition, f~\E) = {x: f(x) e E}; this set is empty if the domain of/ has no points x such that/(#) e E. If x ef~\E u H) then/(#) eEvH, that is/(#) belongs to EortoH (or to both), so xef-\E) or x ef-\H), which is to say that xef~\E) \jf~\H)\ thus
The opposite inclusion can be established similarly, by showing that if xef~\E) or xef-^H) then xef-^EvH). In essentially the same way it can be shown that
f~\E n H) =f~\E) nf-\H).
72
SOLUTIONS AND COMMENTS 31-32
Let X be the set of all numbers that are square roots of positive integers, and let/be the mapping x -> x2 of X into the set Y of all positive integers. Then the range of/is the whole of Y, and if E is a subset of Y consisting of a single integer n t h e n / " 1 ^ ) is the set consisting of the two numbers + *Jn (so / has no inverse). 31. The domain of the function/o 9 is the domain of 9, namely the set Y; and for eachy in Y, (focp)(y) is defined to bef(cp(y)). T h u s / 0 9 m a P s Y into Y. The domain of 9 0/is the domain of/ namely X, and 9 o/maps X into X according to the formula ) (xeX). For any subset E of X> (9 of) (E) is a subset / / of X, and xeHif and only if there is #' in E such that (9 o/) (xf) — x. Thus if E = 0 there is no # in ff; that is to say, (
(f)HE)
i(f
(If X =}= Y then / 0 9 4= 9 0 / because these functions have different domains. If X = Y the functions may or may not be equal. For example, let X consist of the natural numbers, and let / be the mapping x -> x2: if 9 is the mapping x -> x3 then / o 9 = 9 o/; but if 9 is the mapping x -> x +1 then / o 9 4= 9 0 / For any non-empty set X, we have o as a binary operation for the set of all functions that map X into X; but, as is easily seen, this operation is not commutative unless X has only one element; it is, however, necessarily associative. The operation may be commutative within a particular set of functions: consider, for example, the set consisting of a function/ mapping X into X, together with all its iterates, / 0 / / 0 / 0 / , . . . . See 81.) 32. The given equation holds when s = t = 1, so that and it follows immediately that/(I) = 0. For any positive rational numbers s and t, f(s)=f(st-H)=f(st-i)+f(t), so that/(s) = f(t) if and only if/(st'1) = 0; hence if this last equation can
SOLUTIONS AND COMMENTS 33
73
hold only when st"1 = 1, that is when s = t, then/($) =!=/(£) if s =J= t. Now so that if 2/(5) =/(2) then/(s 2 ) = / ( 2 ) ; with the given assumption, this implies that s2 = 2, and hence that s is not rational (see 10). (This last result can easily be generalized. If r is any positive rational number, it can be seen from the fundamental theorem of arithmetic that riin [s i rra tional if n is a sufficiently large integer; then, if/satisfies the given conditions, there is no rational number s such that/(s) =f(r)/n. It follows that if P is the set of all positive rational numbers then/(P) does not include P ; for i f / ( r ) e P t h e n / ( r ) / w e P \ / ( P ) . Moreover,/(P) does not consist entirely of rational numbers: indeed, if s and t are distinct prime numbers then f(s) or /(*), or both, must be irrational; for if f(s) = m/n and/(J) = pjq, where m, ny p, q are integers, then f(snp) = mp and f(tmQ) = mp, so that snp = tmq, which is impossible if s and t are distinct primes. Note that, under the given conditions, / establishes an isomorphism between the multiplicative group P and a subgroup of the additive group of all real numbers; we have shown that this subgroup neither includes nor is included in the subgroup consisting of the rational numbers. The existence of such a function/cannot, therefore, be proved within the context of elementary arithmetic. A proof of existence is supplied by the theory of the logarithmic function. See 43, 61.) 33. (i) In any interval / with more than one point (any non-degenerate interval) we can choose rational numbers xl9 x2 such that xx < x2, and irrational numbers xz, #4 such that xz < #4. Then, for the given function/,
T h u s / i s not monotonic on /. (ii) If x is rational then/(/(#)) =f(x) = x; if x is irrational then so is -x, and therefore/(/(*)) = / ( - * ) = - ( - * ) = x. Thus/(/(*)) = x for every xy which is to say that/is inverse to itself. (Note that if a monotonic function / is inverse to itself then either f(x) = x for every x orf(x) = —x for every x. To see this, suppose first that/is an increasing function: if x is such that/(#) < x then which is impossible if/(/(#)) = x; if x is such that/(#) > x then
74 SOLUTIONS AND COMMENTS 34-35 which is also impossible if/(/(#)) = x; thus/(#) = x for every x i f / i s inverse to itself. If/ is decreasing and inverse to itself a similar argument, with —/in place of/, shows that/(#) = — x for every x. Since the function / given in the exercise is neither of these two functions, we have an alternative proof that/is not monotonic.) 34. For the given function /, and any real number y, it is clear that f(x) > \y\, and hence f(x) > y, if x > \y\^\ and that f(x) < —\y\, and hence f(x) < y, if x < — \y\$. Thus if / i s the interval of all real numbers, / ( / ) contains points on either side of y. B u t / i s a continuous function, and therefore / ( / ) is an interval; therefore y G / ( 7 ) , which is to say that y = f{x) for some x in /. It is easy to see that/is an increasing function. Thus/has an inverse, f*1; and/" 1 is continuous because/is continuous. The domain of/" 1 is the range of/ which consists of all real numbers. If/" 1 could be represented by a polynomial, of degree n say, t h e n / o / " 1 could be represented by a polynomial of degree 5w. B u t / o / " 1 is represented by a polynomial of degree 1, and no function can be represented, on an interval with more than one point, by two distinct polynomials (because a polynomial equation can have at most finitely many roots, and an interval with more than one point has infinitely many points). Thus / - 1 cannot be represented by a polynomial (on any non-degenerate interval). 35. Let
„ . xt x f(x) = x3 + x-l for any real number x. Then/is an increasing function, and consequently there cannot be more than one value of x for which f(x) = 0. Now f(x) = x(x2 +1) - 1 , so that /(O68) = 0-68 x 1-4624-1 = -0-005568 < 0, /(0-69) = 0-69 x 1-4761 - 1 = 0-018509 > 0. Since/is continuous, it follows that/(#) = 0 for some x between 0-68 and 0-69. (We have calculated/(0-68) and/(0-69) with unnecessary accuracy. It would have been enough to observe that /(0-68) < 0-68 x 1-47-1 = -0-0004 < 0, /(0-69) > 0-69 x 1 4 7 - 1 = 0-0143 > 0. Computational labour can often be saved in this sort of way. For an * algebraic* solution of the given equation, see 85.)
SOLUTIONS AND COMMENTS 36-37
75
36. We must assume that y is fully specified numerically, as a rational number or as a number that can be rationally calculated to any degree of accuracy. If x is a rational number in / it may happen that/(#) = yy but if f(x) 4= y we can compute f(x) with sufficient accuracy to decide whether f{x) < y orf(x) > y. Suppose that, after a few trials (chosen, perhaps, by shrewd judgment), we find rational numbers £x and £2 m I such that
Since/is an increasing function, £x < £2; and since/is continuous, the open interval ]gv £2[ has a point xQ such that/(# 0 ) = y. There is only one such point xQ, and if £, = & + * & - & )
and & = & + § & - & )
we can decide whether x0 lies in ]gl9 £4[ or in ]£3, £2[ (it may, of course, lie in both these intervals). Thus we locate x0 within an interval of length f(£2 —£i). Repeating this procedure, we can eventually show that x0 lies in an open interval, say ]xv x2[, of length (f ) n (£2 — £x), for any given positive integer n. But if n is sufficiently large then (|) n < e/(£2 - ^ ) , because
inf{(f)«: n = 1,2,3,...} = 0
(see 17). Then 1^ — x2\ < e, and, since xx < x0 < x2>
Since f~\y) = x0, this process locates/" 1 ^) within the interval ]^x,^2[; so that any rational number in this interval represents/" 1 ^) with an error less than the given positive number e, and the mid-point of the interval gives an error less than \e. (This is not necessarily the best method of computing f~\y) in a particular case. More-refined methods are available for functions that satisfy special conditions. Any method is legitimate provided that it leads to numbers xx and x2 that are shown to satisfy the required conditions.) 37. The given information implies that /(3-16) < 4-9975 and /(3-17) > 5-0105. Since/is continuous, it follows that there is a number x between 3-16 and 3-17 such that/(#) = 5. Since / i s increasing, there is only one such xt and this i s / - 1 ^ ) . Thus 3-16 < / - 1 ( 5 ) < 3-17.
76
SOLUTIONS AND COMMENTS 38-39
If/(3-16) and/(3-17) had been specified to 2 decimal places only, as 5*00 and 5-01 respectively, we should have known only that 4-995 (3-16) < 5-005 and 5-005 (3-17) < 5-015. From these inequalities we cannot infer that there is a number x such that f(x) = 5; if this fact is known in some other way, we can infer that f-\S) < 3-17 but not that/- x (5) > 3-16. 38. Suppose that 0 < x± < x2 < 1. Then [0, x±] £ [0, #2]> a n ^ therefore /([0, *J) <=/([0, *2]). Since, for any x in [0,1], 9(*) = sup/([0,*]), it follows that 9(#x) < 9(^2)- Thus 9 is a non-decreasing function. Now suppose that/is continuous, and let e be any positive number. Then there is a positive number 8 such that if xe [0,1] and l^c-^l < 8 then \f(x)-/(*i)| < e. Hence if l ^ - ^ l < 8 and 0 ^ x ^ x2 then either x < xlf in which case/(#) < 9(^1) by the definition of 9 ^ ) , or |^ —^x| < 8, in which case f(x) (^i) + e; in each case/(x) < 9(#1) + e; therefore 9(^2) ^ 9(^1) + e by the definition of 9(^2)- Since y(x^) ^ 9(^2)> ft follows that if I x1 — x21 < 8 then 19(^1) — 9(^2)! < e* Thus 9 is continuous. (It is clear that the domain of/could be an interval other than [0,1], I f / i s continuous then 0 if x 4= 0, and that, for every real number x, r, x r, x Thus the curve represented by the equation y =/(*) lies above the #-axis, except that it contains the point (0,0), and it is symmetric about the jy-axis. Its shape can be indicated by plotting, to a suitable scale, the points (x,y) for which
= |^| for every real x} and if/2(#) = xi for every non-negative x, t h e n / = / 2 o / 1 . Now the function fx is continuous by virtue of the
77 SOLUTIONS AND COMMENTS 40 triangle inequality, and / 2 is continuous because it is the inverse of a continuous univalent function. Since composites of continuous functions are continuous, it follows that/is continuous. 4 --
-16
0
-12
40. If
x = -1
x2 = 1
then
so
12
[x2] = 1.
If
- 1 < x < 1 then
0 ^ x2 < 1
If
1<#
1 < x2 < 2
If V2 < x < V3 If If
V3 <
x
x =2
<2 then
then tnen then
2
3 ^ ^2 < 4
2
x =4 4 .-
3 --
2 --
1 --
-1
< *2 < 3 so
2
so so so
so
[* ] = 4.
[x2] = 0. [>2] = 1.
M = 2[x2] = 3.
16
78
SOLUTIONS AND COMMENTS 41 The graph thus consists of the points ( — 1,1) and (2,4), the points of the #-axis between ( — 1,0) and (1,0), and all but the right-hand endpoints of the three straight segments joining respectively (1,1) and (^/2,1), (^2,2) and (^3,2), (^3,3) and (2,3). The left-hand end-points of these three segments are marked by dots on the diagram to show that they belong to the graph. (These three dots, and the other two, represent points on the graph of the function x -> x2.) On each of the intervals ] - 1 , 1 [ , [1, <J2[, [V2, V3[, U/3,2[, the function x -> [x2] has a constant value; it is therefore integrable over each of the corresponding closed intervals, and each integral is the product of the constant value of the function and the length of the interval. By the additivity of the integration process, the function is therefore integrable over the interval [ — 1,2], and -1
41. For n = 1,2,3,..., let P(n) be the proposition l 2 + 2 2 + ... +n2 = \n{n-\ Evidently P{\) is true. If P(n) is true then
thus P(n) implies P(n + 1). Accordingly, by the principle of mathematical induction, P(n) is true for every n. Let the interval [0,1] be divided into n closed subintervals by the points r/n, where r = 0,1,2, ..., w. If (r— \)jn < x < r/n, where r > 1, then x2 has minimum value (r — \)2jn2 and maximum value r2/n2. Accordingly, if .+w 2 )/n 3
and
sn = (02
then S^ and ^n are respectively upper and lower Riemann sums for x2 dx. But, by the propositions P(n) and P(n — 1), , 1 1 /, 1\ 1 1 //. 1 \ dn] n = ^ + o - ( 1 + T-) a n d n 3 2 « \ n 3 2 3/ Hence 5n ^ J < *Sn and 5 n - J n = l/«. It follows that ^ 2 d^ Jo lS
SOLUTIONS AND COMMENTS 42
79
I This result can be generalized in the following way. If k is any positive integer and c is any real number, and if sn = (0k +1* + . . . + ( « - If) (c/n)k+\
and
r
then Sn and sn are respectively upper and lower Riemann sums for xk dx; and Sn — sn = ck+1/n, which we can make as small as we like by
taking n sufficiently large (see 67). Now the well-known identity shows that, for any number r, where there are h +1 terms on the right. If r > 1 none of these k + 1 terms is less than (r — \)h or greater than rk\ therefore
if k — 1,2, ...,w, and consequently nh'1-(n-l)k+1\/c\ J
n*
that is, 5n < T—^- < Sn. It now follows that
From this result, and the linearity and additivity of the integration process, we can integrate any polynomial function over any bounded interval. In particular, it is easy to show that for any real numbers a, b, c> and any non-negative integer k,
42. If 0 < x < 1, {(1 +**)V(l - ^ ) } 2 = (1 + ^ + |^ 2 )(l -x) = 1 -f* 2 {(1 + f«) V(l - x)Y = (1 + * + £*2) (1 - *) = 1 + **(3 Now if 0 < x < J then 8# + 4x2 < 3; therefore, for these values of x, -x) < 1 < (1 +
80 SOLUTIONS AND COMMENTS 43 whence the required inequalities follow. If 0 < x < | then 0 ^ x2 and therefore . Hence, by the linearity and positivity of the integration process,
that is, since
i 2 x dx = ^4, Jo 25
Ci
dx
19
43. By definition, if x > 0,
iog*=j;i«. Hence if t ^ 0,
log(l +0 = j*+t(^
= £+<—^—.
Now if « is any positive integer, and a 4= — 1,
1+a
1+a
Using this well-known identity, with a = u — 1, and using the fact that ri+t ri+t for any positive integer k, we find from the linearity of the integration process that
But, by the positivity of the integration process,
It follows that the number
exceeds log(l + 0 by a number which is not greater than t2n+2/(2n + 2). (In fact, if * > 0 the excess is positive and less than t2n+2/(2n + 2), by the strict positivity of the integration process for continuous functions.)
SOLUTIONS AND COMMENTS 44
81
Using this result with t = J and n = 1, we find that i - ^ + Tb", which is - j ^ , exceeds logf by not more than TOT$J a n d s o by less than 10"3. (More generally, this method enables us to calculate log(l + (l/k))> which is log(k + l)-\ogky to any degree of accuracy, for any positive integer k. Since log 1 = 0, we can thus calculate log 2, log 3, and so on in succession, with an error which can be kept below a prescribed bound. However, to calculate log 2 with high accuracy by this method it is necessary for n to be very large, and for this initial step it is better to use a more sophisticated method, or even the crude method of computing Riemann sums.) 44. Let JR be the set of all real numbers. (i) We know that the mapping x -> x, of R into R, is continuous. (This, of course, is an immediate consequence of the definition of continuity.) Products of continuous functions are continuous: hence, for each natural number «, the mapping x -> xn is continuous. Also, sums of continuous functions are continuous: hence if f(x) = #4 + x* + x2 for every x in R then/is continuous. If x > 1 then/(^) > x, s o / i s unbounded (though, since f(x) = x2{(x + J) 2 + f}, / is bounded below). (ii) Since every constant function is continuous, we can show as in case (i) that every polynomial function is continuous; in particular, the mapping x -> 1 + x2, of R into R, is continuous. If x e R then 1 + x2 4= 0: hence, by a theorem on quotients of continuous functions, if then/is continuous. If x e R then 2\x\ < 1 + x2 (equality occurring when x = ± 1): therefore - | 1 + x2. Since composites of continuous functions are continuous, the mapping x-> log(l +x2)> of R into R, is continuous. Hence iff(x) = #log(l +x2) then/, being a product of continuous functions, is continuous. Since log(l +x2) ^ log 2 if x > 1, / is unbounded. (iv) The exponential function is continuous (being the inverse of a continuous univalent function): hence iff(x) = e~x% for every x in R then /, being a composite of continuous functions, is continuous. The range of the exponential function (of a real variable) is the set of all positive numbers: therefore, if xeR> f(x) > 0. The exponential function is an increasing function, and its value at 0 is 1: therefore, since — x2 < 0, f(x) < 1. Thus / i s bounded.
82
SOLUTIONS AND COMMENTS 45 (We know that the image of an interval by a continuous function is necessarily an interval: hence the range, /(.R), of each of the functions /considered here is an interval. In case (i),/(i?) = {y: y ^ 0}; in case (ii), = [ - i « ; ^ case (in), f(R) = R; in case (iv),f(R) = ]0,1].) 45. We know that the logarithmic function is univalent, and that its range is the set of all real numbers. Its domain is the set of all positive numbers. Thus we know that it has an inverse function, and that this maps the set of all real numbers on to the set of all positive numbers. This inverse function is, by definition, the exponential function (of a real variable), whose value at x is denoted by ex. Since, if x > 0,
a fundamental theorem of the differential calculus tells us that
dx
s
x
Another such theorem (on the derivative of an inverse function) now tells us that, if x is any real number,
Let a, bf c, a, /?, y be real numbers such that, for every number x in a given open interval /, oteax + fiebx + yecx = 0. (1) Then, for every x in /,
(ax
+ fibx +
cx
) = 0,
that is, by fundamental rules of the differential calculus, aoceax + bfiebx + cyecx = 0. Consequently, for the same reasons, tfaeox + pp#x + C2yecx = o
(2) (3)
for every x in /. By a standard process of elimination, or by the theory of 3-by-3 determinants, we deduce from equations (1), (2), and (3) that either (a - b) (b - c) (c - a) = 0 or, for every x in /, a eax = fiebx = y ecx = 0. The first of these alternatives is excluded if a, b, c are distinct; and since 0 is not a value of the exponential function, the second alternative implies that a = /? = y = 0. (We have proved that a certain set of 3 functions is * linearly independent'. There is a similar result for n functions of the same type.)
SOLUTIONS AND COMMENTS 46-47 83 46. We know that since / is differentiable at every point of [a, b] it is continuous on [a,b]. Hence we know that f([a, b]) is a bounded closed interval. The right-hand end-point off([a, b]) isf(x0), where x0 is a point (not necessarily unique) of [a, b]. Thus/(#) a and a < x < x0 then
whence it follows, by the definition of/'(# 0 ), that/'(# 0 ) ^ 0. If x0 < b and
*<*«**»
f(x)-fM x-x0
^ '
and therefore f'(x0) < 0. (This discussion shows that if the differentiable function / on [a, b] takes its greatest value at a point xQ which is neither a nor b then/'(# 0 ) = 0. It is easy to see, from a simple example, that/may take its greatest value at b although f\b) 4= 0. Similar considerations apply in the case of the least value. And of course/' may take the value 0 at a point x of [a, b] such that/(#) is neither the greatest nor the least of the values taken b y / o n [a,b].) 47. If 9 has a local minimum at a point x of / which is not an end-point then, by the definition of the term 'local minimum', / has an open subinterval / ' which contains x and is such that
(*')-?(*) x'-x is non-negative if x' > x and is non-positive if x' < x, provided that x' E /'. But if 9 is differentiable at x, and e is any positive number, then by the definition of (p'(x)>
provided only that \x' — x\ is sufficiently small and not 0. It follows that
84 SOLUTIONS AND COMMENTS 48 its continuity on [a, b], 9 has a least value, c say; and 9 has a local minimum at any point x of [ay b] for which 9(3?) = c. If there is such a point x in ]a, b[ then (p'(x) — 0 and therefore/'(#) = g'(x). If not, then 9 takes the value c only at a and b, so that c = 0 and 9 ^ ) ^ 0 for every # in ]a, b[; in this case there must evidently be a point x of ]
f(x)=g'(X). (Note that this argument gives a proof of the mean-value theorem if we take g to be defined from/by the equation
Note too that the functions considered need not be differentiable at the end-points: for example, / satisfies the stated conditions if /(*) = (*-*)*(*-*)*
(a<x^b),
but in this case/is not differentiable at a or at b.) 48. Since / is an interval and a and b are points of 7,
Therefore/, being differentiable throughout 7, is differentiable throughout [a, b]. Hence if a < x < b then g is differentiable at x> and
Since rj lies between f'(a) and /'(^) it follows that 0 lies between the numbers g\d) and g'(b)f so that one of these numbers is positive and the other negative. Now since g is differentiable throughout [0, b] it is continuous on [a,b]; therefore the set g([a, b]) has a least element and a greatest element. Let x0 and x'o be points of [a, b] such that g(xQ) = infg([a, b]) and^(^o) = SUP£([#> b]). If x0 > a, in particular if xQ = b, then^'(.x:0) ^ 0; and if x0 = a then£'(#0) ^ 0 (see 46). Similarly, if x'o = a then^'(^o) < 0 and if x'o = b then £'(*o) > 0. Thus, since g\a) and g\b) are not both positive or both negative, x0 and x'o cannot both be end-points of [a> b]: one of them, which we may call £, must be in the open interval ]a, b[. Since g(g) is either inf^([a, b]) or supg([a, b]), g has either a local minimum or a local maximum at £; therefore, since £ is not an end-point of [a, b], g'{£) = 0 (see 47). Thus/'(£) = 77. Since 7] is an arbitrary number between/'(a) and/'(6), it follows that the set/'(7) is an interval.
SOLUTIONS AND COMMENTS 49 85 (Note that the function / ' need not be continuous: for example, if f(x) = # 2 sin(l/#) for every real number x other than 0, and if/(0) = 0, then/is differentiate everywhere b u t / ' is not continuous on any nondegenerate interval containing 0. A function that maps intervals to intervals is said to have the Darboux property, after the nineteenthcentury French mathematician G. Darboux; this property represents an intuitive idea of continuity, and that it does not imply continuity is one of the surprising facts of analysis. See 86.)
49. The mean-value theorem asserts that, under the stated conditions, there is a number £ in the open interval ]a>b[ such that b-a
-
/ U ;
'
The conditions (i) and (ii) enable us to apply the mean-value theorem to the case in which/is the function sin, a = 0, and b = 6, where 6 can be any number between 0 and JTT. Since sin 0 = 0, and the derivative of sin is cos, we see that there is a number £ (depending on the choice of 0) such that 0 < £ < 6 and
Since sin# increases from 0 to 1 as 6 increases from 0 to JTT, condition (iii) shows that cos2 6 decreases from 1 to 0, and condition (ii) shows that cos 6 is non-negative for these values of Q\ therefore cos 0 decreases from 1 to 0 as 0 increases from 0 to \TT. Accordingly, if 0 < £ < d then cos# < cos£ < 1. Thus, for any number 6 between 0 and \n> n sin# i cose? < —7T- < 1. o
(A special case of the mean-value theorem, in which it is assumed that f(a)=f(b) = 0, is known as *Rolle's theorem', after Michel Rolle, a French mathematician of the late seventeenth century, who was one of the first critics of the unsound arguments used by early practioners of the calculus; for his theorem he considered only polynomial functions, so that it was, in effect, a theorem about the real roots of polynomial equations with real coefficients. The importance of the mean-value theorem was not fully understood before the nineteenth century. A simple example shows that the theorem can be false for complex-valued functions—a fact which is easily overlooked.)
86 SOLUTIONS AND COMMENTS 50 50. The polynomials (x + 3)2 and (x + 2)3 have no common factor. Hence there are polynomials p(x) and q(x) which satisfy the identity x3 + 4*2 - 1 0 = p(x) (x + 3)2 + q(x) (x + If; and since the degree of the polynomial xz + 4x2 —10 is less than that of (x + 3f(x + 2f, we can assume that the degrees of p(x) and q(x) are at most 2 and at most 1 respectively. These polynomials are then uniquely determined, and they can be calculated in the following way. Let p(x) = ao(x + 2)2 + ax(x + 2) + a2 and q(x) = bo(x + 3) + bv Since a2 — />( — 2), a simple computation shows that a2 = —2; and since bx = q( — 3) we similarly find that bx = 1. Hence {ao(x + If + ax{x + 2)} (x + 3)2 + bQ(x + 3)(x + If = 0 and therefore
. +ox2) + fljw (^?,+o\3),+Lb/ (x,+o\2 f (,x? {a 2)2 =n 0. o o
From this identity we find, on taking x to be —2, then —3, then any other number, that ax = 0, that b0 = 0, and finally that aQ = 0. It follows that if x is not — 2 or — 3 then
The interval [ — 1,1] does not contain —2 or —3, and therefore f1
dx
= J —- = — ^f. (We have illustrated a systematic method of * resolution into partial fractions'. Other methods may be used in particular cases, the only criterion of legitimacy being the correctness of the result. To guard against errors, of reasoning or of computation, one should always check the correctness of the result by ' multiplying out' and then comparing coefficients.)
HINTS AND COMMENTS FOR EXERCISES 51-100
51. B\A has 12 elements (28 -16). C\(A u B) has 9 elements (27 - (14 +13 - 9)). Hence A KJ B <J C has 64 elements (43 +12 + 9). (This is an exercise on the additivity of number: if E and H are disjoint sets having m elements and n elements respectively then EKJH has m + n elements. In a more general form, this notion of additivity is the basis of the important branch of mathematics known as * measure theory', which includes the foundations of the theory of probability.) 52. Two finite sets can be put into one-to-one correspondence with each other if and only if they have the same number of elements; and if A has m elements and B has n elements, then AxB has mn elements. Thus the three results correspond respectively to the commutative law, the associative law, and the cancellation law, for multiplication of natural numbers. 53. The' if' implication is easy to establish. For the * only if' implication, suppose that ~ , as defined in terms of Q, is an equivalence relation. Then ~ is reflexive, and from this it easily follows that 1 e Q. Suppose that qsQ and q'eQ, where q < q'. Then q'q ~ q, q ~ 1, and 1 ~ q'\ hence, since ~ is transitive, q'q ~ 1 and q ~ q\ and therefore q'q e Q and q'/qeQ. It follows that if Q contains numbers other than 1, and if h is the least of these, then Q contains 1, h, h2, ... and no other numbers. 54. Use the sieve of Eratosthenes. 55. If the sum were an integer, s say, then pip2"*pns integer divisible by pv But
would be an
Plp2-~Pns=p2p3'~Pn+Plr> where r is an integer; a n d ^ / ^ ••• Ai *s n o t divisible by p x . 56. Binary operation, Q, on a non-empty set E: mapping, (x,y) ->xQy, of ExE into E. 87
88
H I N T S AND COMMENTS 57 Commutative law, for a binary operation Q on a set E:
xCly = y£lx for any elements x, y of E. Associative law, for a binary operation Q. on E:
(xQ,y)Q,z = xQ,(yQ.z) (=xQ,yQ.z) tor any elements x, y, z of E. (i) For (say) the set of non-negative integers, the binary operation
(xiy)->\x-y\ is commutative but not associative, (ii) For any set having more than one element, the binary operation
(x,y)->x is associative but not commutative. (See also 27 and 77.) (A binary operation that obeys the commutative law is an example of a symmetric function of two variables; scalar multiplication, in vector algebra, is another example (see 23). Vector multiplication (see 24) is a binary operation which is neither commutative nor associative.) 57. Group: set G with an associative operation Q, (making a semigroup —see 103) such that for any elements a,boiG there are elements x, y of G satisfying the equations Abelian group: group whose operation obeys the commutative law. (A group—or a semigroup—may be written in the multiplicative notation, in which xy is written for xQ.y, or, if the operation is commutative, in the additive notation, in which x+y is written for x£ly. Every group has a unique element u such that
uQ,x = x = x£lu for every element x of the group; in the multiplicative notation this * neutral' element u is often denoted by 1 and referred to as the unit element, and in the additive notation it is usually denoted by 0 and referred to as the zero element or simply 'zero'. Every element x of a group has a unique inverse, which is an element y of the group such that
x£ly = u = ytlx; this inverse is denoted by x*1 in the multiplicative notation, by — x in additive notation. The name 'Abelian group' commemorates the nineteenth-century Norwegian mathematician N. H. Abel.)
HINTS AND COMMENTS 58-61
89
Ring: set R with two associative operations, called addition and multiplication (and denoted accordingly) such that R with addition is an Abelian group (the additive group of the ring) and x(y + z) = xy + xz
and
(x +y) z = xz +yz
for any elements x> y, z of R. (The requirements expressed by these two identities are respectively the left-hand distributive law and the righthand distributive law, which coalesce into the distributive law in the case of a commutative ring—a ring in which multiplication obeys the commutative law. See 81.) (Every ring has a unique zero element, namely that of its additive group. It may or may not have a unit element—multiplicatively neutral—and it cannot have more than one such element (see 103). Any ring can be regarded as a sub ring of a ring which has a unit element (see 108).) The element p is the zero element of the ring (see 9). Any Abelian group can be made into a ring by writing it in the additive notation and defining multiplication by the rule that all products are to be zero. Less trivially, consider the multiples of 4 in the ring of even integers modulo 8. 58. The ring Rx x R2 has divisors of zero: if xx =N 0x in Rx and x2 + 02 in R2 (where 0x and 02 are the zero elements of R± and of R2 respectively), then (0v x2) and (xv 02) are non-zero elements of R± x R2, and their product is zero. Such a pair of elements cannot exist in a field. For an element (xv x2) of Rx x R2 to have a reciprocal in R1 x R2 it is necessary and sufficient that xx and x2 have reciprocals in Rx and in R2 respectively. 59. R2 = {(l-u)x: XGR}. (An element u such that u2 = u is said to be idempotent. In a ring that has a unit element, u is idempotent if and only if 1 — w is idempotent; and 1 is the only idempotent element having a reciprocal. See 106.) 60. In the field of integers modulo 2, each element has exactly one square root. If each of n elements of a field has two square roots in the field then the field has at least 2n +1 elements; hence it is impossible for every non-zero element of a finite field to have two square roots in the field. 61. If a + a = 0 and a 4= 0 then 1 + 1 = 0 , since a + a = a (1 + 1); hence x + x = 0. Let x -> x1 be an isomorphism of G into the additive group of F. Then I' = 0. If (— 1)' = 0 then —1 = 1 since the mapping is one-to-one; if
90 H I N T S AND COMMENTS 62-65 ( - 1 ) ' = ^ ^ 0 then a + a = 0 since ( — I) 2 = 1. Hence, in either case, x + x = 0 for every x in F. Therefore if x = y\ where yeG, then y2 — 1, and this implies that y = + 1 = 1. Thus G has only one element, so that F has only two elements. (This result shows that a * logarithmic function' cannot be defined on the whole of the multiplicative group of a field, with values in the field, except in the trivial case of a field with only two elements. See 32.) 62. (The property of F described by any of the three equivalent statements is called completeness. The system of real numbers is a complete totally ordered field, and can be shown to be essentially the only one: any two such fields are related by an order-preserving isomorphism, and are thus * abstractly identical'; see 12, 13, 67. For another characterization of completeness see 132.) 63. If r = <Jp + <Jq then q = (r-jp)2
=
r2+p-2rjpy
so that, since r 4= 0 (being a sum of positive numbers), which is rational if r is rational. 64. For the first pair of inequalities, consider the identity
xn-l
=(x-l)(xn-1+
... +1)
(which is related to the standard formula for the sum of n consecutive terms of a geometrical progression). The first inequality of the second pair shows that if a > b > 0 then an > bn: thus distinct positive numbers have distinct nth. powers. 65. Since x — x = 0, E contains 0. Hence if xeE then — xeE> since — x = 0 — x. Hence if x e EandyeEthen x+yeE, since x+y = x — (— y). Thus E is a subgroup of the additive group of the real numbers. If there are infinitely many points of E between 0 and 1, consider a division of the interval ]0,1[ into k subintervals of equal length: at least one of these subintervals must contain infinitely many points of E (by * the pigeon-hole principle'). Thus there is an interval of length ljk which contains distinct points x and y of E: let h = x—y. Then heE and, because E is an additive group, nheE for every integer n. Also 0 < \h\ < 1/k, and so every interval of length \jk contains nh for some integer n.
H I N T S AND C O M M E N T S 66-67
91
Now if A is irrational and E consists of the numbers m + wA, for all integers m and n, then distinct pairs of integers give distinct points of E (if m + nX = m' + n'X then m = tri and n = n')y and for each non-zero integer n there is an integer m such that 0 < m + nX < 1; hence there are infinitely many points of E between 0 and 1. 66. It is clear that c is an upper bound of E. Let d be a real number less than c. If n is any integer greater than 1/c, there is a positive integer m such that 9 m22 2 n
n
2
2
(m + 1) m 2m+ 1 3m ^ =• = o~^^"> nl nl nl nl 2 2 /3m\ 9 m 9c \w2/ w2 w2 n2 2 so that if « is large enough to ensure that n > 9cj(c — d)2 then m
Then
m c—^<^ ?r
2
m2 c--^ 2 < n j
and so
,
. c-d,
W2
d <—xl < c. n
Thus d is not an upper bound of E. Therefore c — sup E, (Constructions of this kind are often used in analysis. In devising them, one does not necessarily foresee all the details in advance: in this case, for instance, one begins with the idea of finding a rational number whose square is less than c but 'not too much less J , and then makes the details precise as one goes along. See 14.) 67. (A totally ordered field having the property described by any of the equivalent statements (i)-(v) is said to be Archimedean—though an idea resembling that expressed by (iv) was propounded by Eudoxus, a Greek mathematician who preceded Archimedes. Any subfield of an Archimedean field is itself Archimedean. A totally ordered field is necessarily Archimedean if it is complete. That the system of real numbers is essentially the only complete totally ordered field can be proved by considering (v); see 13. Any Archimedean field can be isomorphically embedded—for example by Dedekind's constructionf—in a complete t J. W. R. Dedekind, 1831-1916; German. 4-2
92
H I N T S AND C O M M E N T S 68-70
totally ordered field; thus the system of real numbers is essentially the largest Archimedean field. Totally ordered fields that are not Archimedean are known to exist, but not within the present realm of elementary mathematics.) 68. Any subfield of the field of real numbers must contain 1, and hence all rational numbers. Hence any subfield containing <Jp must contain F%, which is easily shown to be a subfield. (See 12.) In any isomorphism between Fv and FQi 1 must correspond to 1, and hence each rational number corresponds to itself. If, in such an isomorphism, r + s^Jp is the element of Fv that corresponds to the element ^qoiFq, then r2 + s2p + Irs^p corresponds to q, so rs^Jp must be rational; therefore if *Jp is irrational then r = 0 or s = 0. If r = 0 then s2p corresponds to q, so s2p = q, which is impossible if p and q are distinct primes; if 5 = 0 then r = ^]q, which is impossible if q is a prime. (This exercise shows that the real field has infinitely many nonisomorphic subfields. If Jp is rational then Fv is simply the rational field; if not, the irrational elements of FP are called quadratic surds over the rational field, and two such surds are said to be conjugate to each other if their sum—and hence also their product—is rational. The idea of a quadratic surd is prominent in the general theory of'algebraic fields', although that theory is mainly concerned with roots of polynomial equations of degree greater than 2. See 19, 20, 21, 71.) 69. The recurrence relation determines un+2 once un and un+1 are known; and it shows that un+2 belongs to any field that contains a, b, c, un, un+1. Hence, by induction, un is determined for every n if ux and u2 are known, and un belongs to any field that contains a, b, c, uv u2. In particular, un is real for every n if a, b, c> uv u2 are real; but in such a case A and /i may not be real, for example if a and c are positive and b = 0. 70. That C± is convex can easily be deduced from the triangle inequality (the modulus of the sum of two complex numbers does not exceed the sum of their moduli). The complement of C2 is convex, so Cx \ C2 is the intersection of two convex sets. That C2 \ C x is not convex can be seen by letting z± = 3(1 — i) and z2 = — zx: in this case z1eC2\ C±
and z2eC2\Cly
but \zx + \z2eCv
(A diagram can be helpful here, but of course it cannot be a substitute for a proof. Note that the idea of convexity is available for sets in ^-dimensional Cartesian space'.)
HINTS AND COMMENTS 71-73 93 71. Let h = £' - £, where £ is an element of F such that £' =1= £ and £" = £. Then A * 0, and ^, = = = _ ^ so A' 4= K and therefore /*<££. But
(h?)' =h'h' = {-hf = h\ 2
so A e2?. Now 1 4= - 1 in F, since h^h! = -h\ hence division by 2 is possible in F. Since Z? is a field, 2" 1 eE.lfzeF and #" = #, let
« = 2-\z + z'\
y = (2k)-1 (z - zr).
Then z = x + hyy x = x\ and, since (A"1)' = (Z*')"1 == ~^~1> J = JV'(It is shown here that if a field F has a non-trivial automorphism then it has a largest subfield E on which the automorphism acts trivially, and that the automorphism is inverse to itself—is an involution—if and only if F is isomorphic with afieldobtained from E by the standard process of * adjoining a square root'; F\E then consists of quadratic surds over E, conjugate surds corresponding to each other in the automorphism (see 19, 20, 21, 68). The rational field has no non-trivial automorphism, since it has no proper subfield (see 11). The automorphisms of a field form a group in an obvious way; in the branch of algebra known as * Galois theory'—after the nineteenth-century French mathematician Evariste Galois—the solubility of polynomial equations within certain fields is related to structural properties of groups of automorphisms and associated groups of permutations. The insolubility by algebraic methods of the general equation of degree greater than 4 is established by this theory. See 19, 85.) 72. The main point here is that the squared length of a vector r is, by definition, the scalar product r . r, and is positive unless r = 0. Thus if r =
aj^-^
0 ^ |r| 2 = a 1 2 | r 1 | 2 - 2 a 1 a 2 r 1 . r 2 + a 2 2 |r 2 | 2 , equality holding on the left if and only if <x1 r± = oc2 r2. So if rx and r 2 are such that a1r1 4= a 2 r 2 unless a 1 = a 2 = 0we can show (by taking A to be a,/a 2 or aJcc^) that , , ,, , |ri-r2| <
r
i|
|r2|-
(We have here a special case of the Cauchy-Schwarz-Bunjakowski inequality; see 18. One often writes r 2 for |r| 2 .) 73. (Here, if we write /(r) for r', we have a mapping /, of the set of all 3-dimensional real vectors into itself, which * preserves lengths and
94 HINTS AND COMMENTS 74 distances'. The argument shows that such a mapping * preserves scalar products' and is 'linear* (see 131). In fact it is easy to show that if/ satisfies any two of the following conditions then it satisfies the third: (i) /preserves length; (ii) / preserves distances; (iii) / is linear. Simple examples show that a mapping / can satisfy any one of these conditions and fail to satisfy the others. A mapping that preserves distances is called an isometry, and is necessarily one-to-one; a linear isometry is sometimes called a unitary transformation, If £ is any isometry, a n d / i s defined by the equation
then / preserves lengths and distances, and g(r) =/(r)+£(0) for every vector r; thus every isometry is a composite of two isometries, of which one is a translation—addition of a fixed vector—and the other is lengthpreserving and consequently linear. These ideas and results are available for a large class of vector spaces with lengths and distances defined in terms of scalar products, and they have applications in several branches of mathematics. For an isometry of the Euclidean plane see 21.) 74. By taking r 3 to be r x (or r 2 ) in the identity (r 1 Ar 2 )Ar 3 = r 3 . r 1 r 2 - r 2 . r 3 r 1 we can show that if r x and r 2 are linearly independent then t1 A r 2 4= 0; the converse proposition follows from the fact that r A {at) = 0. (The arguments here apply to 3-dimensional vectors over any field. See 24.) (Identities like (^Ar^Arg + ^ A r ^ A r j + ^gAr^Ara = 0 occur in contexts other than vector algebra. The general setting is that of an Abelian group, written in the additive notation, with a binary 'multiplication' x which is distributive but not necessarily associative. Such a system—which is a ring if the multiplication is associative—may be such that, for any of its elements x, y, z, (x xy) x z + (y x z) x x + (z x x) xy = 0; this condition is known as Jacobfs identity—after the nineteenth-century German mathematician C. G. J. Jacobi. An example other than that of vector algebra can be constructed by taking an arbitrary ring and defining
HINTS AND COMMENTS 75
95
x x y to be the commutator xy —yx, for any elements x and y of the ring. In this case, as in that of vector algebra, we also have the identity x x x = 0. Systems that satisfy this together with Jacobi's identity are associated with the name of the nineteenth-century Norwegian mathematician Sophus Lie; for instance, the system of 3-dimensional real vectors, with scalar products disregarded, belongs to a class of systems known as * Lie algebras \ The identity x x x = 0 implies, because of the distributive laws, that the operation x is anti-commutative in that
xxy = —y xx for all elements x and y. When this condition is satisfied Jacobi's identity has the equivalent form x x (y x z) + y x (z x x) + z x (x xy) = 0.) 75. To show that p ' . q = — q'. p, consider (p + q). (p + q)'. To show that r' = to A r, note that if r = xi +j/j + #k then
r' = xi'+yy+zk\ and that
i' = i'.ii + i'.jj + i ' . k k = i ' . j j - k ' . i k ,
etc.
IfuAr = co A r then (u — to) A r = 0; and if this holds for every vector r then u — co = 0 (see 74). (If we think of each point of * space' as being occupied by a moving particle in such a way that the distance between any two particles remains constant, we have the idea of a—space-filling—* rigid body' in motion. If r represents the position at a particular instant of a typical particle relative to some reference particle P of the body, and if ft(r) represents this relative position after a lapse of time t, then ft is a linear isometry; see 73. By making some assumptions about the existence of an instantaneous relative velocity r' for the particle at the relative position r in these circumstances, we can infer that the mapping r -» r' is linear; and, since the rate of change of |r| is zero, that r .r' = 0. We then see that the motion of the whole body, relative to P, is instantaneously represented, through the equation , by the vector co, which is uniquely determined once P has been chosen. If we consider another reference particle Q, whose position relative to P is represented by a vector q, then the position and velocity of the typical
96 HINTS AND COMMENTS 76 particle relative to Q are respectively r — q and r' — q'; and since q' = (OAq, , , , . n n r - q ' = coA(r-q). Thus co does not depend on the choice of reference particle; it is called the instantaneous * angular velocity1, or' spin*, of the body. These results can be used to discuss the motion of a rigid body which does not fill the whole of space: one imagines a 'rigid extension' of the moving body to the whole of space, such an extension being uniquely possible if the body is not 1-dimensional.) 76. IfUAT = A u - v t h e n
0 = u. (u A r) = Au2 — u. v, so A = u. v/u2. The given identity suggests a solution of the form r = auAv, where a is some scalar; in fact UA(auAv) = v . u a u —u.auv = a(u.vu —u2v), so if A = u.v/u 2 we get a solution on taking oc to be 1/u2. But if r is a solution then so is r + q if (and only if) q is a scalar multiple of u. (If the equation r' = co A r (see 75) is taken to represent the instantaneous motion of a rigid body relative to one of its particles P which has instantaneous velocity v relative to points fixed in space, and if Q is a fixed point whose position relative to P is represented instantaneously by p, then the instantaneous velocity of a typical particle of the body relative to Q is r' + v, the position of this particle relative to Q is represented instantaneously by r — p, and
= co A (r - p) + Aco if co A p = Aco — v. Hence if we assume that co 4= 0, and take A to be co . v/co2, we see from the exercise—with u = co and r = p—that Q can be chosen so that the instantaneous motion of the body relative to Q is compounded of a * rotational motion' characterized by the angular velocity co, and a *translational motion' with velocity Aco. Thus if A + 0 the motion is instantaneously that of a * screw'; A is the * pitch' of the screw, and the screw is * right-handed' or * left-handed' according as A is positive or negative; Q is not uniquely determined but can be any point of a certain line, instantaneously fixed, whose direction is that of co: this line is the * axis' of the screw.)
HINTS AND COMMENTS 77-78
97 2
77. If A = (J Jj and £ = / J jj) then AB ==t B4; and ^ = O, where (Sums of 2-by-2 matrices over F are defined by the rule \ cx + c2 dx + d2)' \ci d-J \c2 dj With this definition and that of matrix multiplication, the 2-by-2 matrices over F form a ring—in fact a non-commutative ring which cannot be isomorphically embedded in a division ring. This ring has a unit element, /, and it has a subring isomorphic with F. The matrices / , yX
\-y
*/
form a subring which is isomorphic with the field of complex numbers if F is the field of real numbers, but is not a field if F is the field of complex numbers. The rules defining addition and multiplication of 2-by-2 matrices can be generalized in an obvious way to the case of n-by-n matrices over F9 giving, for each natural number w, a ring with interesting properties ; and F need not be a field, but can be any ring. The theory of matrices is a large branch of algebra, with many applications; it is closely connected with the theory of groups, with the theory of rings, and with * linear algebra'—an important subject in which ideas of linearity are studied in a general setting based on the axiomatic concept of a vector space. In linear algebra one is often concerned with matrices in which the number of rows is not the same as the number of columns: for example, vectors can be regarded as matrices. The problem of inverting a matrix—finding its reciprocal, if this exists, in an appropriate ring—is important in certain kinds of numerical work; this problem is essentially that of solving a set of 'simultaneous' linear equations. See 131.) 78. The binomial theorem states that if a and b are elements of a commutative ring, and n is any natural number, then
(a + b)n = an + nan'1b+ ... +l)an-rbr+
... +bn,
where, for r = 0,1,2,...,«, the coefficient I I is the natural number n\ ,, —r-. (the v(r + l)th number in the v(w + l)th row of 'Pascal's trir\(n-r)\ v ' ' angle'f). t Blaise Pascal, 1623-1662; French.
98
H I N T S AND C O M M E N T S 79
If the ring is a field, the theorem can be stated equivalently in the form n
= l+nx + ... +ln \xr + ... +xn,
where x is any element of the field. If p is a prime number and r is a natural number less than p, then p is a factor of I I since p and r\ are factors of p \j{p — r)! having no common factor. Let p be any prime number, and let n be a natural number such that nv =n
(mod/));
for example, n could be 1. Then, by the binomial theorem applied to the ring of integers, {l+ny = l+np+... Since p is a factor of each term on the right other than the first and the last, it follows that
(1 +n)p ~ 1 +np = 1 +n (modp). Hence, by induction, nv = n (mod p) for every natural number n. (The last result states that p is a factor of nv — n. Since
nv-n = ^nP-1-!), it follows that/), being prime, is a factor of n or of nv~x — 1. Thus, for any natural number n, if p is a prime number which is not a factor of n then nP-x = \
(mod/)).
This result is known as Fermafs theorem, after the seventeenth-century French mathematician P. Fermat; some interesting parts of the elementary theory of numbers are closely connected with it. See 184.)
79.
n2 1 -g—r = 1 + -g—T , so, by the binomial theorem applied to the field
of rational numbers, -o—H
= l + - o — i + ... > 1 + - + . . . > 1 + - ,
the terms represented by ... being positive. The second of the required inequalities is equivalent to the first, as can be seen by writing it in the form
/
n
y
/«+ir + 1
H I N T S AND C O M M E N T S 80-81
99
and noting that this is equivalent to n
Vi-1/ \n + l) The third inequality is equivalent to
>
and we know that xn < 1 if 0 ^ x < 1. The fourth inequality follows from the binomial theorem together with the fact that, for r = 2,3,..., m +1, 1 \ L r-l\ =/1 \ m] " \ m )
The fifth inequality can be deduced from the third, with the aid of the fourth if m < n or of the second if m > n. /
\\n
/
\\-n
(For each natural number n let xn= 11 + -1 and yn — 11 — I Then the facts established here include the following: the sequence {xn} is increasing, the sequence {yn} is decreasing, and each term of the first sequence is less than every term of the second. It follows that {xn} has a least upper bound x, that {yn} has a greatest lower bound yy and that x < y. In fact x = y, since y — x
corresponds to n in the ring of sets. It is an example of a Boolean ring. See 106.) 81. To show that J^is not a ring when A is the additive group of the integers, consider (for example) the functions/andg defined as follows: f(a) = a + l, g(a) = a {as A). Comparison o£fo{g+g) withfog+fog in this case shows that J?/, with o as ' multiplication', is not subject to the left-hand distributive law.
100
H I N T S AND C O M M E N T S 82-83
(A function that belongs to £% is known as an endomorphism of the Abelian group A; and & with + and o is the ring of endomorphisms of A Such rings are important in the general theory of rings.) 82. By the second of the two given equations, whence it follows that^(O) = 0 and that if a can be chosen so that g(a) 4= 0 then/(0) = 1. If /(0) = 1 the given equations show that, for any element b of A, f( — b) = f(b) and^( — b) = —g(b) (that is, that/is an even function and^ is an odd function), and hence also that \x(a)\ = 1 and x(a + b) = X(a)x(b)With the further assumption about the range of/ we know that this range includes the interval [ — 1,1]; so that if x and y are real numbers such that x2+y2 = 1 then, since - 1 < x < 1, there is an element a of A such that/(a) = x. But, by the first of the given equations, so if f(a) = x then g(a) = ±y, and hence x(a) o r x( ~~ a) i s x + *V« (This exercise isolates certain aspects of the theory of the circular functions—aspects which are essential to the theory of polar representation of complex numbers. It also illustrates the idea of a * group character': a function x that maps A into the set of complex numbers of unit modulus in such a way that x(a + b) = X(a)x(b)> f° r a n v elements a and b of A, is called a character of the Abelian group A; the characters of A form a group under pointwise multiplication, and this is called the character group of A. These ideas are fundamental to the subject known as ' harmonic analysis'. Note that a character x n a s t n e property that ^(0) = 1, and that if there is a non-zero element p of A such that x(p) = 1 t n e n for every element a of A, which is to say that % is a periodic function, having/) as a period; the periods, including 0, of any function on A form a subgroup of A. A complex-valued function on A has this property of periodicity if and only if its real and imaginary parts—the functions/and g in the case considered here—have the same property; and if it has the property then so does i/r o x if ^ is any function defined on the unit circle— for instance if, when \z\ = 1, \lr(z) is a sum of constant multiples of zn for integers n of which some may be negative. See 100.) 83. If x ^ 0 then x is clearly an upper bound of the set {f(x'):xf < x}; and if y < x there is a rational number x' such thatjy < x' ^ x and conse-
HINTS AND COMMENTS 84-87 101 quently y (#'): thus x is the least upper bound,
, x , e x , yx t. x /(*)+*(*)< sup (/+*)(/),
and therefore, since inf/(7) (#),
mff(I)+g(x)<sup(J+g)(I); hence sup(/+^)(/)-inf/(7) is an upper bound oig(I), so that )-inf/(7). 85. (This exercise shows how a cubic equation of a certain type can be solved in terms of square roots and cube roots. The method gives what is known as 'Cardan's solution', after a sixteenth-century Italian mathematician who was given the idea by a contemporary, Tartaglia. Although it is of theoretical interest, it has little practical value: in numerical work, where rational approximations of prescribed accuracy are required, polynomial equations are usually * solved* by systematic iterative methods, or by sequences of good guesses. See 35, 36, 37.) 86. Let / be any bounded closed interval with more than one point. If 0 e / then I contains l/2n and l/(2n + 1) for some integer w, and therefore g(I) contains 1 and — 1, so that (from the definition of continuity) g is not continuous on /. If 0 <£ / then g is continuous on / since g restricted to / is then a composite of two continuous functions; hence in this case g(I) is a bounded closed interval. If 0 e / there is a bounded closed subinterval Jof /which does not contain 0 but contains l/2n and l/(2w +1) for some integers: then g(J) is a bounded closed interval containing 1 and — 1, so g(I) ^ [ — 1,1]; but g(I) is contained in the range of/, which, by hypothesis, is contained in [-1,1]. Thus^(/) is a bounded closed interval in every case. (It is a fundamental property of any continuous function g of a real variable that if / is any bounded closed interval in the domain of g then g(I) is a bounded closed interval. One might be tempted to conjecture that every function that has this property is continuous; but the exercise provides a counter-example, since, as is easily shown, a function /having the stated properties does exist.) 87. Consider (for example) the function/defined on [1,2] by the equation /(*)=*»-2.
102
H I N T S AND C O M M E N T S 88-91
88. From the identity/(# + y) = f(x) +f{y) we deduce by induction that f(nx) = nf(x) for every positive integer n and every real number x; in particular,/(w) = w/(l). Also,/(0) = 0 since
If n is a negative integer then -n is a positive integer and therefore /(n)-»/(l) =/(»)+/(-n) = /(«-«) = 0. Thus f(n) = nf{\) for every integer n. If r = m/n, where m and n are integers and w 4= 0, then »/(r) =/(«r) =/(m) =
HI/(1),
so/(r) = r/(l) = ar. If/ is continuous, let x be any real number and let e be any positive number. Then there is a positive number 8 such that \f(x)—f(r)\ < e if r is any number such that |# — r| < #. But there is a rational number r such that |# —r| < #and \ax — ar\ < e: since/(r) = ar, it follows that \f(x)-ax\ < 2e and hence, since e is an arbitrary positive number, that/(#) = ax. (The last part of this argument has two ingredients which are used very often in analysis: we have appealed to the triangle inequality, and to the fact that 0 is the only number whose modulus is less than every positive number. See 126, 128.) 89. To show that/is continuous, it is enough to show that/is continuous on {x: x < 1} and also on {x: x ^ 1}; and on each of these intervals/is a composite of functions that are known to be continuous. (There is no need to appeal directly to a definition of continuity.) 90. By definition, 2X = exlog 2. Let y be any positive number. Since the domain of the logarithmic function contains y, and since log 2 4= 0, f(x) = y if x — log j/log 2. Since the exponential function is continuous and monotonic, so also is/; hence/has a continuous monotonic inverse. (The function/" 1 gives logarithms to the base 2.) 91. From the definition
: = [**? (X>% and from fundamental properties of the integration process, it is easy to deduce that log# < x — 1 (equality occurring if and only if x = 1). To
103
9-
Q
o ~
/
7 -. 6 -
/
5-
/
4-
/
3-
/
2 -
i
-2
I
-1
89
4 -
2 -
-2
90
104 HINTS AND COMMENTS 92-94 deduce the fourth of the required inequalities from the third, let r
for r = 1,...,«.
=
(pi+~. +A,K pa++paJ
A = r
Pr
(The number is the weighted arithmetic mean of the numbers a^, ...,
vA1/(J)i+ - +Pn)
is the correspondingly-weighted geometric mean. The 'ordinary* means are obtained by taking px = ... = pn = 1. Thus the exercise gives a generalization of the well-known inequality between the arithmetic and geometric means of two positive numbers. It is interesting to consider what is implied by the equality of the two means in the general case.) 92. The first proposition states that f(x) -> 0 as x -> 0; that is, that lim /(*) = 0. (The concept of a limit is a subtle one, requiring careful thought for its proper understanding. To define it one has to make a statement whose logical structure involves a sequence of three quantifiers; and in arguments involving the concept it is easy to make serious logical mistakes— and hence serious mathematical mistakes—by taking the quantifiers in a wrong order.) 93. Let g be the function of a real variable defined by the equation g{y) = \y\aThen g is continuous (being a composite of continuous functions), and g(0) = 0. Therefore if e is any positive number there is a positive number 7) such that if \y\
g(y) = \g(y)-g(0)\<e; but since f(x) -> 0 as x -> 0, there is a positive number S (depending on rj, and hence on e) such that if \x\ < 8 then \f(x)\ < rj. Thus g(f(x)) -> 0 as 94. If f(x) = 2X, for every real number x, then / is continuous, and /(0) = 1 (see 90). Hence f(x) -> 1 as x -> 0. If x < 0 then (b/a)1^ < 1, so that
< (1'* + W*y* = a{\ + (b/a)1'*}* ^ a2*;
H I N T S AND C O M M E N T S 95-98 llx
hence (a
llx x
105
llx
+ b ) - > t f a s # - > 0 + . I f # < 0 then (a/b) 1
b < (a^ + b ^
x
x
^ 1, so that x
= b{(albf + \} ^ bl .
95. (The function sinh is monotonic, and its range consists of all the real numbers; in fact—as one can see by solving a quadratic equation—if y is any real number then sinh# = y if x = \og{y + ^(y2 +1)}, so that
If, for any real number x, u = cosh# and v = sinh#, then
and on the other hand, if u and v are any real numbers satisfying this equation there is a unique real number x such that \u\ = cosh# and v = sinhtf. The set {(u,v): u2 — v2 = 1} is an example of a 'hyperbola': hence the name 'hyperbolic functions'. The theory of these functions has some formal similarities with that of the circular functions, but is considerably simpler. See 100, 138, 139.) 96. (If we restrict/to {x: x> 0} or to {x: x ^ 0} we get, in either case, a function differentiable at 0; the derivative at 0 being 1 in the first case and — 1 in the second case.) 97. (This result, giving an' expansion' for the nth derivative of a product of functions, is known as Leibniz's theorem, after the seventeenth-century German mathematician G. W. Leibniz, who shares with his English contemporary Isaac Newton the credit for inventing 'the differential calculus'.) 98. Let y be a real number. There is at most one (positive) prime number p such that p2y is one of the numbers p, p + 1 , . . . , p2 — 1. If there is one such p, and k = p2y, then y = /(#), where ^
1 //>2 V Vr
i \//>2 JlJr
;f
k—t> ~~"x *
If there is no such/) theny =f(y). In either case there is exactly one real number x such that y =/(#). T h u s / " 1 exists and has the same domain as/. Since/(0) = 0, ,, ,
if this limit exists. Let d be a positive number less than \, and suppose that 0 < |x\ < 3. If x = k/p2 then k cannot be p2 — 1, and if k is one of the
106
H I N T S AND C O M M E N T S 99-100
numbers/>, p +1, ..., p2 - 2 then/(#) = (k + \)jp\ so that
1 < ^ = 1 + 1^1+1^1+*p-l >2. y p T h u s / - 1 is not differentiate at 0. (This exercise gives a counter-example to the conjecture—which might be thought plausible—that if/ is a function that maps the set of all real numbers one-to-one on to itself, and if/ has a non-zero derivative f\x) at a point x, then/" 1 is differentiate, with derivative 1//'(#)> at the point f(x). This conjecture becomes a theorem if we add the hypothesis that/is continuous on some open interval containing x.) so that
JfHv)
99. When x 4= 0 the differential equation is equivalent to
f'(x) + x-*f(x) = 0, and this is equivalent to
-r- {e~llxf(x)}
= 0.
(The 'integrating factor' e~llx can be found by a standard procedure; see 138.) It follows that, when x > 0, /(*) = ce"*, where c is a constant; and since/(I) = 1, c must be e~\ (Note that the c o n d i t i o n a l ) = 1 does not determine a solution of the differential equation outside the interval {x: x > 0}.) 100. T h e required identity can be established by induction or by 'the method of differences'. In either case one uses the identity 2C0S9 sin6 = sin(9 + d) — sin(9 — 6). Another method is based on the fact that if x 1S t n e complex-valued function defined by the equation %(0) = cos 6 + /sin 6 then, for any integer k, cos kO is the real part of {x(@)}k- Using this, we can deduce the required result from the polynomial identity
H I N T S AND COMMENTS 100 107 (which is related to the standard formula for the sum of n consecutive terms of a geometrical progression). Thus, let z = x(@): t n e n z* = !/*> so that, if z 4= z*, z* —x and now we have only to * consider real parts'. (The method of differences, for condensing an expression a± + a2+ ... +any depends on the discovery of a sequence bl9 ..., bn, bn+1 such that ak = bk+1 — bk for k = 1, 2, . . . , « ; it can be used, for instance, when ak = l/&(& + l)—and one uses it, with the mean-value theorem, to prove the fundamental theorem of the calculus. The method involving the polynomial identity is more sophisticated, and has the advantage that if we consider imaginary parts instead of real parts we get an expression for s in0 + sin30+ ... + s i n ( 2 n - l ) 0 ; in fact this method is a general one for deducing pairs of identities involving the circular functions from polynomial identities. In the ^
cos# + cos3#+ ... +cos(2»-l)#,
and the corresponding expression with sin instead of cos, we have examples of * Fourier series', so called after the nineteenth-century French mathematician J.B.Fourier; the general theory of such series is an important part of harmonic analysis. See 82, noting that ^ is a character of the additive group of the real numbers, and that its real and imaginary parts occur in the representation of * simple harmonic motion' since they satisfy the differential equation of 137. This function x c a n be expressed in terms of the exponential function of a complex variable, through the formula eid = cos 6 + i sin d, which is associated with the name of the eighteenth-century Swiss mathematician Leonhard Euler; an adequate discussion of this requires an excursion into * complex analysis'—a subject not understood in Euler's time; but one can easily see, with the aid of the addition formulae for cos and sin, that Euler's formula suggests a way to define complex powers of e—and hence of other numbers—which partly respects the * laws of indices' and leads to a unified theory of the circular, exponential, and hyperbolic functions.) 5-2
APPENDIX 1
The examination papers from which Exercises 141-200 are taken relate to the following schedule (which was proposed in 1963 in connexion with 'the Swansea scheme* for pure mathematics in schools).
EXAMINATION FOR THE GENERAL CERTIFICATE OF EDUCATION OF THE WELSH JOINT EDUCATION COMMITTEE
PURE MATHEMATICS ADVANCED LEVEL (ALTERNATIVE SYLLABUS)
The examination will consist of two papers, and an optional Special paper, of 3 hours each. Questions may be set on any part of the syllabus in each of the papers, but a choice of questions will be allowed. The questions will not call for a high degree of ingenuity or for remarkable feats of memory, but candidates will be expected to show an understanding of the nature of mathematical reasoning, and to use symbols intelligently. No question will require the use of mathematical tables, or of any instrument other than a pen. Syllabus 1. Sets (subsets of a given set): unions, intersections, complements; the empty set; de Morgan's laws; Cartesian products; one-to-one correspondence. Elements of mathematical logic. 2. Fundamental properties of the system of natural numbers (in particular, the commutative, associative, and distributive laws for addition and multiplication). The principle of mathematical induction, and simple applications of this. Scales of notation. Construction of the ring of integers (illustrating the ideas of equivalence class, Abelian group, isomorphism). Divisibility: expression of the highest common factor of m and n in the form rm + sn; Euclid's algorithm. Proof of the existence of infinitely many prime numbers; the sieve of Eratosthenes. Unique decomposition of a natural number into prime factors. Congruences. 108
APPENDIX 1 109 3. Construction of the rational numbers. The concept of a field. Finite fields (in particular, the field of remainders modulo a prime). The concept of a totally ordered field. Moduli. Inequalities, and the laws governing their manipulation. Upper and lower bounds of sets in a totally ordered field; supremum and infimum. The non-existence of AJ2 in the field of rational numbers. Postulates for the system of real numbers. Existence of square roots of positive real numbers. Approximation of irrational numbers by rationals. Approximate computations and the checking of accuracy. 4. The field of complex numbers, and its identification with the Cartesian plane. Conjugates. Quadratic equations. The modulus of a complex number. Distance in the plane. The triangle inequality. The modulus of a product. 5. Algebra of 3-dimensional real vectors: linear combinations, linear dependence, the basis theorem; the scalar product and the vector product; determinants of order 3. Elements of the Euclidean geometry of lines and planes, treated algebraically. 6. The ring of polynomials in a single indeterminate, over any field: the remainder theorem; highest common factor; the field of quotients; partial fractions. The binomial theorem (for a positive integral index). 7. The idea of a function. Images and counter-images of sets. Restriction and extension of functions. Functions of a real variable: step-functions, rational functions, bounded functions, unbounded functions, monotonic functions; inverse functions; composite functions; linear combinations, products, quotients of functions. Sketching of graphs in simple cases. Continuity in the large (that is, uniform continuity on bounded closed intervals). Fundamental mapping properties of continuous functions. Continuity of combinations of continuous functions. Existence and continuity of simple algebraic functions. 8. Upper and lower Riemann sums of a bounded function on a bounded closed interval. The idea of integrability and the idea of area. Integrability of continuous functions and of bounded monotonic functions. Fundamental properties of the integration process (linearity, positivity, additivity). (Knowledge of Darboux's theorem on integrability will not be required.) 9. Definition and fundamental properties of the logarithmic function (of a positive real variable). The number e, and the exponential function. Meaning of axy where a is positive and x is real. Logarithms to bases other than e. 10. The idea of a limit. Continuity in terms of limits. Fundamental
110 APPENDIX 1 theorems about limits of compound functions. Differentiability. Theorems underlying the technique of differentiation. Differentiation of rational functions, of the logarithmic function, of the exponential function, and of simple compounds of these. 11. The mean-value theorem and some of its applications, including 'the fundamental theorem of the calculus'. Principles of systematic integration. Calculation of logarithms. Linear differential equations of the first order. 12. Analytical definition of the number TT and of the circular functions (of a real variable). Fundamental properties of these functions and of their inverses. Polar representation of complex numbers, and simple applications of this. The length of a simple differentiable arc in the plane; definition of angle. (No knowledge of infinite series will be required.) Notes (i) The order in which the topics are arranged in the above list is compatible with the logical structure of the subject, and could therefore be followed in a course of study. Some variations of this order could however be made without loss of coherence: for instance the item listed fifth could be taken at a later stage, since nothing else in the syllabus depends on it. (ii) Items 1-9 represent a lower level of sophistication than the rest of the syllabus, in that they involve no explicit consideration of local properties of functions. These items, with the possible omission of the fifth, might form the basis of a first-year course. Anyone embarking on such a course should, if possible, have had a preview of the subject, and to this end some of the material of the first two or three items might be incorporated in an Ordinary-level course. (iii) Items 7-12 call for some skill and imagination on the part of the teacher, and appreciable intellectual effort on the part of the pupil; but the difficulties should not be exaggerated. It is important to avoid unrigorous (that is, question-begging) arguments, but this is not to say that one should never appeal to a theorem that one has not proved. In elementary analysis there are several crucial theorems that are quite hard to prove, and in an introductory course it is legitimate to state these theorems without proof (provided that the individual pupil who wishes to study a proof is given an opportunity of doing so). Honest omission of certain proofs, or parts of proofs, is much less dangerous than the use of vague or incomplete definitions.
APPENDIX 2
AN APPROACH TO ELEMENTARY ANALYSIS (A note based on an address given by J. D. Weston at a symposium held in April 1964 at Syracuse, Sicily, in celebration of the work of Archimedes.)
Elementary analysis is essentially the theory of the calculus, and one of its main themes is the possibility of inferring global properties of functions from local properties. (We make such an inference when, for example, we obtain the general solution of a differential equation.) The concept of a local property is a fairly subtle one; it was formed at a comparatively late stage in the development of mathematics, and it was not thoroughly understood until quite recent times. This, no doubt, is one of the reasons for the widespread practice of teaching the calculus without giving adequate attention to its logical foundations. Many books have been written in which the calculus is presented as though it were an inductive science like physics. This may sometimes have been done through ignorance, but probably in most cases it has been done in the fallacious belief that a difficult subject can be made easier by concealing its logical structure from the student, at least in the early stages of instruction. That this kind of approach is unsatisfactory becomes increasingly clear as *pre-calculus mathematics' becomes increasingly pervaded by the spirit of pure mathematics. An intelligent student who has tasted the delights of the axiomatic method will want the calculus to be presented as a strictly deductive discipline.! It is still not easy to do this, but some of the traditional difficulties can now be reduced. Although local properties are relatively difficult to understand, it is t After describing the joy of his early introduction to * Euclid', Bertrand Russell wrote (in the first volume of his Autobiography, published in 1967) 'My mathematical tutors had never shown me any reason to suppose the Calculus anything but a tissue of fallacies.' The kind of teaching to which he had presumably been exposed was denounced as 'an educational sham' and 'a sin against the spirit of mathematical progress' as long ago as 1889 by Chrystal (in the preface to the second volume of his Algebra), but it has not yet been generally abandoned. Ill
112 APPENDIX 2 usual to introduce them at or near the beginning of a course in elementary analysis, with little or no preliminary discussion of global properties. Generally, local continuity is discussed before uniform continuity, and the notion of a derivative is discussed before that of an integral. However, a reversal of this order can be advantageous, particularly (but not exclusively) in the case of students who, though not unintelligent, cannot give much time to the study of mathematics. Let us consider the main features of a course based on this pedagogical notion. From the axiomatic point of view, an appropriate starting-point for a course in elementary analysis is the concept of a totally ordered field, of which the system of rational numbers is a familiar example. It is easy to show that the algebraic rules by which inequalities are manipulated are valid for any totally ordered field, and that the system of rational numbers is isomorphically embedded in any such field (and, indeed, that it is essentially the only minimal one). One can then introduce the idea of exact bounds of sets in a totally ordered field, and prove that if the field is complete, in the sense that every non-empty set that has upper bounds has a least upper bound, then every positive element has a square root in the field. This motivates the postulation of the system of real numbers as a complete totally ordered field. That such a system exists and is unique to within isomorphism should be stated, but need not be proved at this stage. (It is easy enough to define the real numbers in terms of the rational numbers, as Dedekind sections for example, but to verify that one then has a system with all the required properties is a somewhat tedious matter.) The system of real numbers having been postulated in this way (or in some equivalent way), the idea of a function of a real variable can be considered, and illustrated by simple examples. Such functions can be classified according to certain global properties that they may or may not have, including boundedness, monotonicity, univalence, representability by rational expressions. Continuity also can be defined at this stage as a global property, in fact as uniform continuity on bounded closed intervals. The fundamental mapping theorems for continuous functions, and the usual theorems about the continuity of compound functions, can be deduced from this definition with no greater difficulty than one has with local continuity; indeed some simplifications are possible. Once these theorems are available it is very easy to establish the existence and continuity of various algebraic functions, and the student has a substantiali amount of useful equipment at his disposal. (The fundamental mapping theorems state that if / is a continuous function on a bounded closed interval /then (i) the set/(7) is a bounded closed interval, and (ii) if/is
APPENDIX 2
113
also univalent on I—which, by virtue of (i), is the case only if/is strictly monotonic on /—then the inverse function/" 1 is continuous on/(/).) The next step is to consider upper and lower Riemann sums of a bounded function on a bounded closed interval, and to derive the notion of integrability. With the definition of continuity that has been adopted, it is trivially easy to prove that continuous functions are integrable. Some numerical computation of integrals can now be done; this serves to emphasize that integration is essentially a process of approximation, and that an approximation is worthless unless it is accompanied by an estimate of accuracy (obtained in this case by computing both upper and lower sums). The fundamental properties of the integration process— linearity, positivity, additivity, integrability of the modulus—can be established in a straightforward way, but in a short course these need only be stated. One is then in a position to define the logarithmic function and to establish its functional equation (log xy = log x + logjy), its continuity, and the fact that it has a continuous inverse, which is the exponential function. This provides an efficient way of assigning an unambiguous meaning to the symbol ax, where a > 0 and x is any real number. The circular functions can be introduced in an essentially similar way (independently of trigonometry), but the detailed study of these functions is best deferred until the calculus has been more fully developed.! It may be presumed that a student who has reached this stage in the study of analysis is prepared to appreciate the concept of a limit. Continuity can now be expressed in terms of this concept through Heine's theorem (which states that a function has the global property of being uniformly continuous on a bounded closed interval / if it has the local property of being continuous at each point of / ) , and the fundamental theorems about limits of compound functions can be proved by arguments of a type with which the student is now familiar. At this point the concept of a derivative can be introduced, and the fundamental theorems on which the technique of differentiation is based can be proved without difficulty. By way of the mean-value theorem one then reaches the fundamental theorem of the calculus, which confers the power of evaluating integrals by * antidifferentiation'; the technique of this (' systematic integration') can be developed as part of the wider, and more interesting, technique of solving differential equations, and it is natural to develop concurrently the theory of the elementary transcendental f An account of the circular functions which is suitable from this point of view is given in the pamphlet Notes on the circular functions (second edition, Swansea, 1966) by J. D. Weston. 14 pp.
114 APPENDIX 2 functions of a real variable (the non-algebraic functions already mentioned, and functions simply related to these). It is by no means necessary to use infinite series to represent either numbers or functions in work at this level; it is only in more advanced analysis that infinite series are genuinely required, and their theory and usefulness can perhaps be best appreciated by the student who already has a sound knowledge of the calculus. The introduction to analysis thus briefly described does not differ greatly from what is now usual in first-year university courses; but the order of development allows some economy in the use of time, and gives the student an opportunity to acquire some understanding of analysis while he is developing his skill in the calculus. It is also to some extent a historical order: Archimedes knew something about integration; a long time later came the idea of logarithms, and then the differential calculus. Of course it remains true that some of the fundamental theorems are rather hard to prove. The non-specialist student should learn to appreciate the significance of these theorems, but he need not be obliged to learn proofs of them.
APPENDIX 3
For ease of reference, some definitions and explanations not given elsewhere in the book are collected here. Certain terms denoting very primitive notions—for instance set, function (mapping), one-to-one correspondence, natural number, proof (deduction)—or very elementary concepts—for instance union, equivalence relation, integer (difference of natural numbers), rational number (quotient of integers), polynomial —are not explicitly defined anywhere in the book; but see the index for guidance as to their meanings and use. For sets A and B (subsets of a given set), the complement ofB relative to A, written A\B, is {x: xeA, x$B}\ this is 0 if A c B. When it is understood that all sets under discussion are subsets of a certain given set X, the complement of a set E relative to X is referred to simply as the complement ofE, and may be denoted by cE. Thus cE is the set H, uniquely determined by E, such that
EuH = X
and
EnH=0.
Hence ccE = Ey for any subset E of X. By a family of sets we mean an assignment of a subset EL of X to each member i of some 'index set' I; that is, we mean a function whose domain is I and whose values are subsets of X. For example, I might consist of the numbers 1 and 2, in which case we might prefer to write, say, E and H rather than Ex and E2; but I might be an infinite set (as in the case of an infinite sequence of sets, which is a family whose index set consists of all the natural numbers). The sets EL need not be distinct (a function need not be univalent). De Morgan's laws state that, for any family of sets, (i) the complement of the union of the sets is the intersection of their complements, and (ii) the complement of the intersection of the sets is the union of their complements : in symbols, (i) c U EL = fl cEn tel
(ii) cf)EL=\J
i€l
tel
cELt tel
or, in the special case of two sets E and H (possibly identical),
(i) C(EKJH) = cEncH,
(ii) c(EnH) = cEvcH. 115
116 APPENDIX 3 Since each set is the complement of its complement, the laws (i) and (ii) are * dual' to each other in that each can be deduced from the other by 'taking complements'. (Augustus De Morgan, 1806-1871; British, first president of the London Mathematical Society.) The Cartesian product, \[Eiy of a family of sets is the set of all functions/that map I into U EL in such a way that/(j) is an element of eel
EL for each i in I. In the special case of two sets E and Hy this product is often denoted by E x Hy and consists of all ordered pairs (xyy) for which x e E and yeH. The Cartesian plane is R x R, where R is the set of all real numbers; since RxR consists of the complex numbers, it is often called the complex plane \ and when considered with the usual notion of distance it is called the Euclidean plane. (Rene Descartes, or Des Cartes, 1596-1650; French.) If F is a field, a vector space over F is an Abelian group G, written in the additive notation, together with a mapping (a, x) -> ax of F x G into G such that (i) a(x+y) = ocx + ay; (Hi) (oLfi)x = a(fix) ( = aflx);
(ii) (iv) lx = *,
for any elements <xyfiofF and any elements xy y of G. Linear combinations (weighted sums) of elements of a vector space are defined in an obvious way (as for 3-dimensional real vectors). A set in a vector space is said to be linearly independent if no element of it is a linear combination of other elements of it. A basis of a vector space is a linearly independent set B such that every element of the space is a linear combination of elements of B. In a vector space over Fy if xx =f= x2 the line through xx and x2 is {x: x — axx + {\ —a)x2y oceF}.
(If, in the definition of a vector space, we relax the condition that F be a field, stipulating only that it be a ring, and not insisting on (iv), we obtain the definition of a module over a ring—a unital module if the ring has a unit element and (iv) holds. Any Abelian group can be regarded as a unital module over the ring of integers; and every ring is a module over itself. Thus the theory of modules includes the theory of Abelian groups, the theory of rings, and the theory of vector spaces.) An interval is a non-empty set / of real numbers such that if xx e / and x2 e /then every number between xx and x2 belongs to 7. (In other words,
APPENDIX 3
117
an interval is a convex set of real numbers.) An end-point of an interval / is inf/(left-hand) or sup /(right-hand). The end-points of / constitute a set Ej which may have 0, 1, or 2 points; and / is said to be closed if Ej c= Iy open if Ej n I = 0 . If a < b there are four intervals whose end-points are a and b: [a,b] = {x: a < x < b}\ ]a, b[ = {x: a < x < b}\ [a, b[ = {x: a < x < b}\ ]a, b] = {x: a < x < b}. The length of each of these intervals is b - a. The length of an interval consisting of one point is defined to be 0. A compact interval is an interval that is bounded and closed. A subinterval of an interval / is an interval contained in /. A real-valued function/whose domain is an interval / is called a stepfunction if (i) for each compact subinterval 7 of / , / ( / ) is a finite set, and (ii) for each value y of /, f~\{y}) is a union of finitely many intervals. For a step-function / with bounded domain, each value y determines finitely many disjoint bounded intervals, no two of which have a common end-point, whose union is f~\{y}); the sum of their lengths, Xy say, is uniquely determined b y / and y, and we denote the sum of all the products yXy by J/. (If/ is a non-negative step-function with bounded domain, jy is the area of a certain rectilinear figure in the Euclidean plane.) If / is a real-valued or complex-valued function of a real variable, to say that/is continuous on a compact interval J in the domain of/is to say that for each positive number e a positive number 8 can be found with the property that if xeJ and x'eJ and |tf'-#| < d then \f(xf)-f(x)\ < e. If / is any interval in the domain of/, to say that/is continuous on I is to say that/is continuous on every compact subinterval of /. (For a function / whose domain is a non-degenerate interval /, it is clear that if / i s continuous on / then, for any point x of /, / is continuous at x in the sense that/(#') ->/(#) as x' -> x. Heine's theoremf asserts that a function is continuous on an interval / if it is continuous at each point of /.) Composites of continuous functions are continuous, and so are functions obtained from continuous functions by rational processes (addition, subtraction, multiplication, and division); hence, for example, every rational function is continuous on each interval contained in its domain. (Important properties of continuous functions are expressed by *the fundamental mapping theorems': see Appendix 2.) t H.E.Heine, 1821-1881; German.
118 APPENDIX 3 Let / be a bounded interval, let / be a real-valued function whose domain includes /, and suppose that / is bounded on I (that is, that the image / ( / ) is a bounded set). If g and/* are step-functions with domain /, and if , for every x in /, then §g is a lower Riemann sum, and Jh an upper Riemann sum, for/on 7. Let L be the set of all the lower Riemann sums, and C/the set of all the upper Riemann sums, for/ on /. These sets are non-empty, since / and/(7) are bounded; moreover, each point of L is a lower bound of U, and each point of U is upper bound of L. Hence sup L < inf U. The function / may be such that supL = inf U; f is then said to be integrable on 7, in the sense of Riemann, and the common bound is the Riemann integral of/ over 7, written I / . If a and b are the end-points of 7, Ji
and a ^ b, this integral is also written as
f(x) dx (without ambiguity, Ja
since the integral is unaltered if 7 is altered by the inclusion or omission of an end-point); and, by definition,
f(x) dx = — / . Jb
Ji
A function is integrable on a compact interval 7 if it is continuous on 7, or if it is monotonic on 7, but not if it is unbounded on 7. A function integrable on an interval 7 is integrable on every subinterval of 7. The integration process has the following properties: Linearity. If/ and g are integrable on 7, and if a and /? are real constants, then a/+ fig is integrable on 7, and
(This is true for complex constants oc and fi if integrals of complexvalued functions are defined in the obvious way, in terms of real and imaginary parts.) Positivity. If/ is integrable on 7 and/(#) > 0 for every x in 7 then
1
0; and (strict positivity) i f / i s continuous on 7 and/(#) ^ 0 for
every x in 7 then
/ = 0 only if 7 has only one point or f(x) = 0 for
every x in 7. Additivity. If/ is integrable on 7 and on 7, where 7 and 7 are disjoint intervals such that I\J J is an interval, then/is integrable on 7 u 7, and
f '=
Jiuj
APPENDIX 3 If/is integrable on /then so is |/|, and
119
IJ/hJ> and if eel and Fc is defined on / by the formula
Fe(x)= jj(u)du
(xel)
then Fc is continuous on /. (G. F. B. Riemann, 1826-1866; German.) Let x be a point in the domain of a function / (real-valued or complexvalued), and suppose that there is an open interval / containing x such that the domain of/either contains I or else contains all points of / on one side of x and no points of I on the other side. Then there may be a number f\x) such that j\
j
J\
^ >/'(#)
as
x
if so, / is said to be differentiable at xy and the number /'(#)> which is uniquely determined by/and x> is called the derivative off at x; it is often denoted by Df(x) or by , The technique of differentiation (calculating derivatives) is based on this definition and on the following five propositions deducible from it: (1) If f+g and fg are defined in the usual way (pointwise) for functions / and g which are differentiate at x then
(In particular, differentiation is a linear process.) (2) If/is differentiate at x and/(#) 4= 0 then
(3) A composite function fog is differentiate at x if / and g are differentiate at g{x) and at x respectively, and in this case
120 APPENDIX 3 (4) If the domain of / is an interval on which / is continuous and strictly monotonic, i f / i s differentiate at xy and if/'(#) 4= 0, then the inverse function/" 1 is differentiable at/(#), and (/- 1 )'(/(*)) = !//'(*)• (5) If/ is continuous on an interval / having more than one point, and if
Fc(x)=
Jc
f(u)du
(xel),
then Fc is differentiable at each point x of /, and Fc'(x) = / ( * ) . Differentiability of vector-valued functions of a real variable—an important idea in kinematics and in differential geometry—can be defined in an obvious way. Rules for differentiating scalar products and vector products can be easily deduced from the rule for products of real-valued functions. The fundamental theorem of the calculus states that if / is integrable on [a, b], and if F is a function which is continuous on [a, b] and is such that F\x) =f(x) for every point x of ]a} b[, then
f
= F(b)-F(a).
Ja
The number e is log" 1 1; that is, e is defined by the equation
Since
1 = log' 1 = lim l o g ( 1 + ^ = lim log (1 + xfl*, x-+0 1
and since log" is a continuous function,
The number TT, encountered in the theory of the circular functions, is defined by the equation . x n = 4 -A ^ d#.
Jo 1 + * 2
APPENDIX 3
121
For analytical definitions of the circular functions, and deductions of their fundamental properties, the reader is referred to the pamphlet mentioned in the footnote on p. 113. This pamphlet has the following section-headings: The number n, and the functions sin and cos; The addition formulae) The functions tan, sec, cot, cosec, and tan" 1; The irrationality of n2; The integration of rational functions; The functions sin"1 and cos"1; Polar forms; Arc-length and angle; Approximation to sin 6 and cos 6 by polynomials in 6; The circular functions of a complex variable.
INDEX The numbers refer to pages. Reference is made to definitions of terms (on pages indicated here by italic numerals), to topics on which there are exercises, to important ideas that are discussed or mentioned, and to words whose mathematical uses are exemplified or explained. (Certain 'ordinary* English words have special meanings in mathematical contexts, and the correspondence between the words and their meanings is not one-to-one.) Some of the cross-references indicate significant linkages of ideas: the reader may find it instructive to consider the nature of these linkages, and to trace others. Abel, N. H. 88 of sets 1,2,11,17-18,38,44,108, Abelian group 2, 12, 18, 24, 48, 88, 115; and see intersection, notation, 94, 108, 116; and see additive, relative complement, symmetric character, field, multiplicative, difference, union ring, unit circle of 3-dimensional real vectors 109; abstractly identical 90; and see idenand see vector tification, isomorphism algebraic field 92 accuracy 3, 7, 44, 109, 113; and see function 109, 112 geometry 29 calculation, computation, error methods 63; and see Galois theory, addition 1, 2,4,12,48, 51, 54, 57, 89, 97, 108, 117; and see additive, rational processes laws, plus, pointwise, sum, sumnumber 71 mation, translation rules 112; and see algebra, reasoning formulae for the circular functions topology 56 algebraically closed field 63 40, 48, 107, 121; and see polar analysis 33, 91, 102, 111-14; and see for the hyperbolic functions 21 calculus, continuity, convergence, modulo 2, 17, 56 function, fundamental theorem of table 25 algebra, harmonic analysis, inthe same as subtraction 24; and see equality, limit, real number symmetric difference additive function 18, 19, 30-1; and analytical definition 110, 121 see endomorphism, isomorphism, angle 110, 121; and see arc-length, linear circle, polar angular velocity 96 group of a ring 13, 18, 73, 89 notation 18, 23, 88; and see minus, anti-commutative multiplication 95 antidifferentiation 113; and see difplus, zero ferential equation, systematic inteadditivity of number 87; and see gration pigeon-hole principle of the integration process 78-9, antipodal points of a sphere 29 approximation 3, 7, 8, 34, 101, 109, 109, 113, 118 113, 121; and see accuracy, badly, adjoining a square root 93 error, interval adjoint 32 arbitrary positive number 83, 102; afnne geometry 29 and see small as we like algebra see binary, identity, isomorphism, laws, linear, matrix, num- arc see simple -length 48, 110, 121; and see inteber, polynomial, rational, quatergration, simple nion, vector 123
124
INDEX
Archimedean field 91, 92 Archimedes 91, 111, 114 area 109, 117; and see integration argument see reasoning, similar, unsound arithmetic mean 104 associative law 6, 12, 16, 18, 23, 25, 43, 48, 51, 57, 72, 87, 88, 94,108; and see semigroup automorphism of a field 64, 93 axiomatic method 111 point of view 112 axis see graph, screw, symmetry badly approximable numbers 71 base of logarithms 102, 109 of a scale of notation 48 basis 27-8, 32, 116 theorem 28, 66-7, 109 belongs to (6) 12-13; and see element of a set betweenness 3, 7, 13-14, 31, 116; and see approximation, convexity, greater than, interval, less than, mean binary operation 12,18, 24-5, 48, 72, 87, 88-9, 94-5; and see addition, algebra of sets, field, group, multiplication, notation, ring, semigroup, vector product relation 11, 40; and see equivalence, inclusion, inequality representation of numbers 38 binomial coefficient 17, 21, 47 theorem 17, 47, 97, 109 Boole, George 56 Boolean ring 24, 56, 99 bound 39, 41, 81, 112, 118; and see lower, upper bounded function 8, 9, 19, 81, 109, 112-13, 118 interval 9, 19, 101, 109, 112-13, 118 sequence 33 set of real numbers 3, 40, 60; and see bounded interval Bunjakowski, V. Ja. 62 calculation 8, 14, 28, 44, 53, 81, 86, 110; and see computation calculus 105, 111-14; and see differentiation, integration, unsound arguments
cancellation law 23, 24, 51, 87; and see divisors of zero, field, group, integral domain Cantor, Georg 34 Cardan's solution 101 Cartesian plane 29, 43, 45, 109, 116 product 4 , 1 1 , 1 2 , 2 5 , 4 2 , 4 8 , 1 0 8 , 116 space see w-dimensional Cauchy, Augustin L. 34, 62 Cauchy—Schwarz—Bunjakowski inequality 62, 93 Cauchy sequence 33, 34 centre see circle, mid-point, sphere character of an Abelian group 100, 107 group 100 characteristic function of a set 17 characterization of completeness 90; and see completeness of linear transformations by matrices 32 of orthogonality 28 Chrystal, G. I l l circle 5, 18, 27, 30-1 circular functions 10, 22, 39, 40, 42, 46-8, 100, 107, 110, 113, 120-21; and see differential equation, hyperbolic functions class 23, 26, 28; and see equivalence, set classification of functions 112; and see algebraic, transcendental close to 65; and see distance, sufficiently small closed interval 9, 19, 42, 101, 109, 112—13, 117; and see end-point, maximum, minimum coefficient see binomial, differential equation, linear combination, polynomial, weight collinear 27, 28, 31; and see line combinations of functions 109 combinatorial topology 56 common factor 2, 38, 46, 48-9; and see highest, partial fractions multiple see least commutative law 6, 12, 16, 43, 48, 51, 54-5, 72, 87, 88, 108; and see additive notation ring 24, 30, 54-5, 89; and see binomial theorem, field, integral domain
INDEX commutator 95 compact interval 117, 118 complement 108, 115, 116; and see relative complete totally ordered field 34, 90-2, 112; and see real number completeness 34, 90, 112; and see totally ordered field completing the square 63 complex analysis 107 number 4, 14-15, 18, 27, 29-31, 33, 35, 39, 40, 43, 45, 48, 60, 62, 64-5, 69, 97, 100, 106-7, 109-10, 118 plane 27, 30-1, 64, 68, 116 powers 107 projective plane 29 -valued function 18, 35, 48, 85, 100, 106-7, 117-20 variable 107, 121 composite function (o) 6, 18, 41, 72, 76-7, 81, 94, 100, 109, 117, 119 number 55 compound functions 110, 112-13 ; and see composite, inverse, linear combinations, product, reciprocal, sum computation 7, 34, 97, 101, 113; and see accuracy, approximation, calculation, unnecessary condensing 107; and see summation configuration 31 congruence ( = ) 17, 40, 44, 54, 98, 108; and see equation, modulo conies 29; and see hyperbola conjecture 52, 106 conjugate complex numbers 4, 48, 64, 107, 109 quadratic surds 92, 93 consecutive roots 43 constant 35-7, 44, 47, 78, 81, 118 construction 29, 42, 64, 91, 94,108-9 contain 3, 58; and see embedding, inclusion continuity 8, 9, 47, 76, 103, 112-13, 117; and see continuous in the large 109; and see global, uniform continuous function 7-9, 19, 20, 41, 43, 47, 49, 101, 109, 112-13, 117-20 contradiction 57, 59
125
convergence 33-4 convergent sequence 33 converse 26, 32-3, 38, 94 convex set 14, 15, 117 convexity 92; and see betweenness, convex coordinates 29; and see Cartesian space, point correspondence see mapping, one-toone, representation cosecant (cosec) 121; and see reciprocal, sine coset 24 cosh see hyperbolic functions cosine (cos) see circular functions cotangent (cot) 121; and see reciprocal, tangent counter-example 101, 106; and see conjecture, disprove -image 6, 71, 109 criterion 55, 86 cube root 3, 60, 101 cubic equation 7, 19, 63, 101 function 49; and see polynomial curve 39, 76; and see graph, sketch damping 37 Darboux, G. 85 Darboux property 85 Darboux's theorem on integrability 109 decimal places 3, 7 representation of numbers 3, 7, 38, 48 decomposition 108; and see factorization decreasing 33, 39, 99 Dedekind, J. W. R. 91 Dedekind sections 112 Dedekind's construction 91 deduction 2, 52, 111, 115, 121; and see induction, proof, reasoning deductive discipline 111 degree of a polynomial 70, 74, 86, 92-3 delights 111 demonstrate 51; and see prove De Morgan, Augustus 116 De Morgan's laws 108, 115 denominator 2 derivative of a function 119; and see differentiability, differentiation Descartes, Rene* 116
126
INDEX
determine 2, 46,106; and see calculation, establish, uniquely determined determinant 27, 32, 42, 67, 82, 109 development of mathematics 56,111, 114 diagram 78, 92; and see graph of a function difference 47, 115; and see method, minus, subtraction, symmetric differentiability 9, 21, 43-4, 49, 106, 109, 119, 120 differential calculus 82,105,114; and see differentiation equation 21-2, 34-7, 39, 40, 44, 46-7, 106-7, 110-11, 113 geometry 120; and see arc-length differentiation 9,10, 21, 39,41, 46-7, 110, 113, 119-20 dimension 66 direction of a line 28, 29, 45; and see axis, orthogonal of a non-zero vector 29 disjoint sets 1, 11, 51, 56, 117-18 disprove 52; and see counterexample, negation dissecting 41; and see division, disjoint distance 15, 27-9, 94-5, 109; and see close to, isometry, length, modulus, triangle inequality, radius distinction 59 distributive law 43, 48, 56-7, 89, 94-5, 99, 108 divisibility 17, 38, 40, 47, 108; and see congruence, factor, prime, remainder division 93, 117; and see Euclid's algorithm, factorization, quotient, reciprocal, subinterval by 2 93 ring 69, 70, 97; and see field divisors of zero 30, 55, 89, 97; and see division ring, matrix domain of a function 6, 20-1, 33, 41, 115, 117-19; and see function, non-empty set dynamical systems 37
electrical engineers 63 elementary algebra 57 analysis 110—14 arithmetic 73 transcendental functions 113-14 element of a set 1, 11, 44; and see belongs to, point, subset elements of mathematical logic 108; and see reasoning elimination 82 embedding 12, 23-5, 29, 63, 91, 97, 112; and see contain, identification, isomorphism empty set (0) 1, 6, 38, 59, 108, 115 end-point 9, 78, 83-4, 117, 118 endomorphism of an Abelian group 100 of a vector space 33 equal ( = ) 51 equality 15, 28, 102, 104; and see equation, identity, inclusion, inequality equation 48; and see cubic, differential, identity, linear, polynomial, quadratic, quartic, simultaneous equipment 112 equivalence class 24, 29, 40, 42, 108; and see coset, direction, integer, point (of a projective plane), rational number relation 11, 24, 29, 40, 115; and see congruence, equality equivalent statements 13-14, 23, 27, 32; and see implication Eratosthenes see sieve error 44, 52, 75, 81, 86; and see accuracy, computation, reasoning establish 51, 58; and see prove estimate 3; and see accuracy, approximation, inequality Euclid 111 Euclidean geometry 29, 109 plane 94, 116, 117 Euclid's algorithm 40, 53, 70, 108 Eudoxus 91 Euler, Leonhard 107 Euler's formula 107 evaluate see calculation e 22, 44, 99, 107, 109, 120; and see even function 100; and see cosh, exponential function cosine, even power educational sham 111 number 1, 25, 49, 89 eighteenth century 24, 63, 107 power 43
INDEX exact bound 112; and see infimum, supremum exceed 8; and see greater than existence 25, 35, 73, 101; and see infimum, inverse function, limit, maximum, minimum, non-Archimedeanfield,reciprocal, Riemann integral, solution, supremum, system of real numbers, transcendental number, unit element expansion 44, 105; and see binomial theorem, identity, partial fractions exponential function of a complex variable 107 of a real variable (log-1) 9, 20, 35-7, 39, 42, 44, 49, 82, 102, 106, 109-10, 113-20; and see differential equation expression 39, 40, 107-8, 112; and see form, notation, representation extension 96, 109 factor 24, 67, 98; and see common factor, divisibility, factorization, integrating factor factorial (!) 3, 47, 97 factorization 11, 53; and see decomposition, fundamental theorem of arithmetic, highest common factor, polynomial fallacies 111 family of sets 115 Fermat, P. 98 Fermat's theorem 98 field 2-4, 12-16, 25, 28-30, 38, 42, 44-5, 48, 55, 56, 57-8, 63-71, 89-94, 97-8, 109, 112, 116 of quotients 109; and see integral domain, partial fractions fifteenth century 51 fifty 25 finite field 2, 4,12, 25, 45, 55, 57, 64, 89, 90, 109 group 24; and seefinitefield set 1, 11, 27, 42, 52, 87, 117; and see empty set first-order differential equation 21, 34-5, 40, 44, 46, 106, 110 form 1, 2, 53-4, 98; and see notation formula 72, 90,107,119,121; and see addition formulae, identity 4-dimensional real vector 5, 69, 70; and see 2-by-2 matrix
127
Fourier, J. B. 107 Fourier series 107 Frobenius, F. G. 70 function 6-10, 17-22, 30-7, 39-49, 85, 88, 100, 105-21; and see mapping functional 33 equation 113; and see addition formulae, additive function, identity, logarithmic function fundamental mapping theorems 7, 109, 112-13, 117 property 101, 102, 109, 113, 121 rules of the differential calculus 82, 119-20 theorem of algebra 63 of arithmetic 53, 73; and see unique decomposition of the calculus 107,110,113,120 theorems about limits of compound functions 109-10; and see composite function, limit, local continuity, product, sum Galois, fivariste 93 Galois theory 93 Gauss, C. F. 63 general principle of convergence 53, 34 solution of a differential equation 35-6, 39, 40, 44, 47, 111 generalization 28, 34, 57, 62-3, 66-8, 73, 79, 81-2, 87, 92-5, 97, 104, 107, 116; and see unified geometric mean 104 geometrical interpretation 43, 45, 62, 64 progression 90, 107; and see polynomial, well-known geometry 29, 30, 109, 120; and see angle, arc, area, axis, betweenness, centre, circle, configuration, conies, convexity, direction, distance, Euclid, intersection, isometry, length, line, normal, orthogonality, parallelism, parallelogram, plane, point, position, Pythagoras property, radius, rectilinear figure, representation, space, sphere, straight segment, surface, symmetry, transformation, translation, triangle, vector, vertex
128
INDEX
increasing function 7, 33, 39, 73-4 sequence 33, 99 incremental ratio 83; and see differentiation, mean-value theorem indeterminate 38, 42, 109; and see polynomial index 17, 107, 109; and see power set 115 induction 1, 8, 17, 21, 25, 46, 51, 52, 54, 57, 92, 102, 106, 108 inductive science 111 inequality (^, ^, <, >) 3,4,6-10, 13-15, 17, 19, 20, 25-6, 27-8, 33-4, 39, 44-5, 49, 58, 62, 79, 91, 93,102,104,109,112,118-19; and Hamilton, W. R. 70 see interval, monotonic harmonic analysis 100, 107 inertia 37 motion see simple Heine, H. E. 117 inference 7, 52, 111; and see reasoning Heine's theorem 113, 117 heuristic device 52 infimum (inf) (greatest lower bound) highest common factor (h.c.f.) 6, 53, 3, 9, 13-14, 19, 33, 40, 46, 59, 75-6, 99, 109, 117-18 108-9 infinite sequence 26, 33-4, 99, 115; homogeneous coordinates 29 and see system of natural numbers hyperbola 105 series 26, 110, 114 hyperbolic functions 20, 21, 105, set 115; and see infimum, inter107; and see differential equation, val, space, supremum, system exponential function hypothesis see postulate, superfluous, infinity 29; and see infinite insolubility 93 theorem instantaneous angular velocity 96 motion 95—6 i 62; and see complex number position 95-6 idempotent 24, 89 velocity 95-6 identification 4, 5, 29, 43, 58, 62, 109; and see embedding, iso- instrument 76, 108 integer 115; and see composite, even, morphism odd, prime, ring, square identity 6, 15-16, 21-2, 28, 48, 79, modulo m 2,45,54-5,57,64,89; 80, 86, 90, 94-5, 102, 106-7; and and see remainder see expansion, functional equation, laws, polynomial, recurrence rela- integrability 43, 47, 49, 78, 109, 113, tion 118, 119-20 function 41 integral 112; and see integration, image 82,109,118; and see value of a Riemann integral function domain 55; and see field, ring of imaginary part of a complex number integers, ring of polynomials (im) 15, 27, 39, 45, 62, 64, 107 index 109; and see integer, power of a complex-valued function 100, integrand 39; and see function, inte118 gration imagination 52, 110 integrating factor 106 implication 52, 87; and see reasoning integration 8, 10, 34, 39, 41, 43-4, 46-7, 49, 79, 102, 109, 112-14, impossible 61,89; and see contradic118-21; and see arc-length, area tion, insolubility, non-existence inclusion ( £ , 2 , c , z>) 6, 23, 50-1, interesting conjectures 52 multiplication table 70 61, 71; and see subset global properties 111-13 graph of a function 8, 19, 20-1, 39, 45, 109 greater than (>) 58; and see inequality greatest element 13, 84; and see maximum lower bound (inf) see infimum value 83; and see maximum group 12-13, 24, 57, 73, 88, 90, 93; and see Abelian group, character, division ring guesses 101; and see conjecture
INDEX interesting (cont.) point 64 properties 97 intersection (n) 1, 2, 6,11-12,14,18, 20, 23, 26-9, 38-9, 44-5, 50-1, 56, 68, 92, 99, 108, 115 intuitive idea of continuity 85 interval 26,74,109,112-13,116,117; and see betweenness, bounded, closed, compact, end-point inverse 55-6, 69, 88; and see field, group, reciprocal function 7, 20-1, 42, 47, 49, 72, 81-2, 102, 109-10, 113, 119-20; and see composite, involution, univalent inverting a matrix 97 involution 93; and see conjugate irrational number 6, 7, 13-14, 18, 49, 58, 59, 71, 92, 99, 106, 109 irrationality of n2 121 isometry 65, 94, 95 isomorphism 12-15, 23-5, 48, 58, 62-4, 69, 73, 90-3, 97, 99, 108, 112; and see automorphism, linear isometry iterates of a function 72 iterative methods 101; and see computation j 63 Jacobi, G. C. J. 94 Jacobi's identity 94, 95 joining see segment judgment 75 kinematics 120; and see motion, spin, velocity Lagrange, J. L. 24 Lagrange's identity 28 theorem 24 laws for binary operations see associative, cancellation, commutative, De Morgan's, distributive of addition 1, 51, 88-9; and see algebra of indices 107 of inequalities 109; and see totally ordered field of multiplication 11, 87-9; and see algebra of thought 56; and see reasoning
129
of vector algebra 69; and see identity, product, vector space least common multiple (l.c.m.) 1, 53 element 13, 52, 84; and see minimum upper bound (sup) see supremum value 83-4; and see minimum left coset 24 -hand distributive law 57, 89, 99 end-point 78, 83, 117 -handedness 27, 96 -neutral 23 legitimacy 75, 86, 110 Leibniz (Leibnitz), G. W. 105 Leibniz's theorem 105 length of a side of a triangle 62 of a simple differentiable arc 48, 110 of a vector 5, 15, 28, 66, 68, 93-4 of an interval 75, 90, 117 \ and see integration less than ( < ) 58; and see inequality level surface 31, 32 Lie, Sophus 95 Lie algebras 95 limit of a function 20, 34, 42, 47, 104, 109-10, 113, 119-20; and see derivative of a sequence 33, 34, 46 line 28, 29-31, 39, 45, 96, 109, 116 \ and see collinear at infinity 29 linear algebra 33, 97; and see determinant, linearity, matrix, vector combination 4, 5, 32, 43, 65, 67-8, 109, 116; and see polynomial of functions 109; and see linearity, product, sum dependence 46, 109 differential equation 21, 34-7, 39, 40, 46-7, 110-11, 113 equations 44-5, 62, 97 function 30, 31-2, 33; and see linear mapping functional 33 geometry 30 independence 5, 28, 32, 40, 67-8, 82, 94, 116 isometry 94—5 mapping 94—5; and see linear function operator 33 process 119
130
INDEX
linear (cont.)
transformation 33; and see linear mapping linearity 97; and see linear of the integration process 79-80, 109, 113, 118 linearly dependent vectors 46 independent functions 82 set in a vector space 116 vectors 5, 28, 32, 40, 94 Liouville, J. 71 local continuity 112-13, 117 maximum 9, 35, 84 minimum 9, 35, 83-4 properties 110-13 locate 75 logarithmic function (log) 8, 9, 20, 34, 41, 45, 47, 49, 73, 81-2, 90, 102, 105, 109-10, 113, 120 logarithms 34, 81, 99, 102, 109-10, 114; and see logarithmic function logic 108; and see reasoning logical foundations 111 mistakes 104 product 56 sum 51 structure 104, 110-11 logically equivalent 23; and see equivalent statements London Mathematical Society 116 lower bound 13-14, 60, 109, 118; and see infimum Riemann sum 8, 41, 79, 109, 113, 118 manipulation of inequalities 109, 112; and see inequality mapping 93-5, 115-16; and see automorphism, embedding, endomorphism, function, functional, involution, isomorphism, one-toone correspondence, operation, operator, permutation, sequence, transformation, translation properties 101, 109, 112-13, 117; and see continuous functions mathematical induction see induction logic 108; and see laws of thought mistakes 104 physics 70; and see motion progress 111 reasoning 108; and see reasoning tables 108
matrix 16, 17, 32, 97 multiplication 97 maximum (max) 45, 47, 59, 76, 78; and see greatest element, greatest value, local mean 104; and see betweenness, midpoint mean-value theorem 10, 42, 84, 85, 107, 110, 113; and see Rolle's theorem measure theory 87 method see algebraic, axiomatic, calculation, computation, construction, induction, iterative, procedure, process, rule, sieve, solution, summation, technique of differences 106-7 metrical geometry 29 mid-point 18, 75 minimal field 57, 112 minimum (min) 45, 47, 59, 78; and see least element, least value, local minus (—) 51—2, 57; and see additive notation mistakes 104 module 116 modulo (mod) 2, 17, 40, 44-5, 48, 53-7, 64, 89; and see congruence, ring modulus of a number 5, 6, 20-1, 27, 33, 40, 43, 45, 47-8, 62, 63-5, 88, 92-3, 100, 102,105, 109, 117; and see distance, triangle inequality of a function 113, 119 monotonic 20, 30, 33, 73-4,102,105, 109, 112-13, 118-19; and see decreasing, increasing, non-decreasing, non-increasing motion 95-6, 107 multiple 44, 100; and see least common multiple, scalar multiplication 2-6, 11-12, 16, 24-5, 30,43,48, 54, 57, 62, 69, 70, 88-9, 94, 97, 117; and see binary operation, linear combination, product table 25, 70 multiplicative group of a field 13, 56, 73, 90; and see division ring inverse 55, 69 notation 24, 88 w-dimensional Cartesian space 92; and see Cartesian plane
INDEX w-dimensional (cont.) Euclidean space 62 vector 28, 66, 70; and see vector space wth derivative 21, 39, 46, 105 root 13, 73; and see cube root, square root w-times differentiable 21 natural number 1, 11, 24, 26—7, 38, 42, 48-9, 58, 66, 87,108, 115; and see finite set, positive integer, sequence necessary condition 25, 41, 46-7, 49, 89; and see only if negation 20 negative 58; and see number neutral element 23, 24, 55-6, 88-9; and see unit element, zero Newton, Isaac 105 nineteenth century 34, 51, 56, 62, 70-1, 85, 88, 91, 93-5, 107, 111, 116-17, 119 non-Archimedean field 92 non-associative multiplication 94; and see vector product non-commutative ring 97 non-decreasing 8, 33 non-degenerate interval 73; and see interval non-empty set 1, 11, 13-14, 23, 52, 58, 60, 72, 87, 112, 116, 118; and see domain non-existence 7, 12, 45, 63, 89, 90; and see empty set, insolubility non-increasing 34, 76 non-isomorphic subfields 92 non-negative see distance, length, modulus, positive, zero non-positive 83; and see negative, zero non-trivial automorphism 93 non-vanishing 36; and see reciprocal of a function non-zero elements of a ring 30, 55—6, 64, 89; and see divisors of zero of a field 13, 58; and see multiplicative group vectors 29, 45, 68 normal to a plane 45; and see orthogonal notation 51, 65; and see additive, identification, multiplicative, scale, symbol
131
number 11, 87; and see complex, integer, natural, rational, real numerator 2 numerical information 3, 7; and see computation work 97, 101; and see computation odd function 100; and see odd power, sine, sinh number 1, 25, 44, 49 power 43 1-dimensional 96 one-to-one correspondence 1, 5, 11-12,15, 24, 31-2, 42,48, 58, 94, 106, 108, 115; and see involution, isometry, isomorphism, permutation, univalent only if see implication, necessary condition open interval 9, 20, 30, 71, 75, 82-5, 106, 777, 119 operation see binary operator see linear opposite direction 29 inclusion 51, 71 or 50 order see first-order, second-order, totally ordered field ordered pair 42, 48, 62; and see Cartesian product, sequence triad 29, 65; and see Cartesian product, sequence order-preserving isomorphism 90 orthogonal lines 28, 39 vectors 5, 16, 28, 66, 68 orthogonality 28; and see orthogonal orthonormal basis 28, 32; and see orthogonal vectors, unit vectors sequence 16, 69 n
39, 46, 48, 110, 120, 121; and see circular functions parallel lines 28-9, 31 planes 32 parallelism 29; and see parallel parallelogram 31 partial fractions 10, 41, 46, 86, 109 particle 95—6 particular solution of a differential equation 21-2, 35-7, 44, 46 Pascal, Blaise 97 Pascal's triangle 97 Peano, G. 51
132
INDEX
pedagogical notion 112 perfect square 38, 64; and see natural number, square period 100 periodic function 100 periodicity 100 permutation 32, 93 pigeon-hole principle 90; and see additivity of number, finite set pitch of a screw 96 plane 32, 39, 45, 68, 109; and see Cartesian, complex, Euclidean, protective plus( + ) 51, 57; and see additive notation point see complex number, element, geometry, graph, interval, local, real number at infinity 29 pointwise 30, 100, 119 polar forms 121 representation of complex numbers 100, 110; and see circular functions, modulus polynomial 6,7,38,42-3,46,109,115, 121; and see ring, vector space equation 4, 7, 30, 43, 63, 71, 74, 85, 92-3, 101; and see cubic, quadratic, quartic function 6-8, 36-7, 49, 79, 81, 85; and see pointwise identity 24, 38, 48, 60, 79, 86, 106-7; and see binomial theorem, Euclid's algorithm, factorization, geometrical progression, partial fractions, remainder theorem, ring position 95-6; and see point positive 58; and see number integer see natural number square root (V) 3 positivity of the integration process 80, 109, 118 postulation 109,112; and see reasoning power 2, 90; and see even, exponential function, idempotent, iterate of a function, wth root, odd, polynomial, product, reciprocal, square prime 1, 2, 11-12, 14, 17, 21, 44, 47, 52-3, 57, 73, 98, 108 primitive notion 115 principle 110; and see general, pigeon-hole
of
mathematical induction 1, 46, 52y 78, 108; and see induction probability 87 problem 16, 97 procedure 75, 106; and see method process see linear, rational product 4, 25, 49, 53, 56, 78, 89, 92, 109, 117; and see Cartesian, composite, divisors of zero, factorial, multiplication, multiplicative, power, ring, scalar, vector of functions 17, 21, 81, 105, 120; and see composite, pointwise set 3; and see Cartesian product progress 111 projective geometry 29 plane 28, 29 proof 52, 55, 84, 92, 110, 112-15; and see deduction, method proper subfield 42, 57, 93 property 4, 31-2, 100, 118; and see Darboux, equivalent statements, fundamental, interesting, global, local, Pythagoras, remarkable, special proposition 20, 25, 38, 52, 119; and see converse, statement, theorem prove see proof, show pure mathematics 56,111 Pythagoras property 28 quadratic equation 4, 14, 19, 30, 63, 105,109; and see complex number, square root surd 9 2 , 9 3 quantifiers 104 quartic equations 63 quaternions 69, 70 quotient 57-8, 109, 115; and see division, field, ratio, rational number, reciprocal of functions 81,109; and see pointwise, rational function radius see circle, sphere range 6,18, 20, 33,41, 81-2,101,105 rate of change 95; and see derivative ratio 10, 53, 83; and see quotient rational function 109-10, 112, 117, 121; and see polynomial, partial fractions, quotient number 2, 3, 6, 7, 13-14, 18-19, 34, 41-2, 44-5, 48-9, 54-5, 57-8,
INDEX rational number (cont.) 71,73,75,92-3,98,109,112,115; and see computation, totally ordered field process 117; and see calculation, computation, elimination, Euclid's algorithm, method, procedure, resolution into partial fractions real number 3, 13, 109; and see Cantor, complete totally ordered field, Dedekind, interval, irrational, number, point, scalar part of a complex number (re) 15, 27, 39, 45, 62, 64, 106-7 of a complex-valued function 100, 118 protective plane 29 -valued function see range, real number variable 82, 112; and see domain, real number vector see complex number, quaternion, vector reasoning 108; and see argument, contradiction, converse, counterexample, deduction, demonstrate, determine, disprove, error, establish, hypothesis, implication, induction, laws, logic, logical, logically, mistakes, necessary condition, negation, only if, proof, proposition, pure mathematics, quantifiers, result, rule, show, sufficient condition, theorem, valid inference, verify reciprocal 2, 12, 14, 25, 30, 55, 64, 89, 97 of a function 119; and see pointwise, quotient recurrence relation 14 Recorde, Robert 51 rectilinear figure 117 reflexive 87; and see binary relation relation 28, 40; and see betweenness, binary, recurrence relative complement ( \ ) 1, 11, 44, 52, 115 motion 95—6 position 95-6 velocity 95-6 remainder 44, 53-5, 70, 109; and see Euclid's algorithm, modulo theorem 6, 109
133
remarkable property 63 representation 39, 68; and see binary, decimal, expression, motion, notation, polar, position residue 55 resilience 37 resolution into partial fractions 86 restriction 101, 105, 109 result 86; and see theorem reversal of sign 62, 67; and see anticommutative, odd function Riemann, G. F. B. 119 Riemann integral 43, 49, 118; and see integration sums 47, 81; and see lower, upper right-cancellation law 23 coset 24 -hand distributive law 57, 89 end-point 78, 83, 117 -handed sequence of vectors 16,69 -handedness 27; and see screw -neutral 23 rigid body 95-6 ring 2, 12, 18, 24-5, 30, 42, 54-7, 63-4, 69-70, 89, 94-5, 97-100, 116; and see field of endomorphisms 100 of integers 25, 42, 98, 108, 116 modulo m 2, 54, 55, 89; and see finite field of polynomials 42, 109; and see integral domain, quotient Rolle, Michel 85 Rolle's theorem 49, 85 root-extraction 63; and see nth. root roots of a polynomial equation 4, 7, 14, 19, 30, 39, 43, 63, 71, 74, 85, 92; and see wth root rotational motion 96 rule 23, 57, 97, 112, 120; and see formula, laws, method Russell, Bertrand A. W. I l l scalar 5, 15-16, 31-2, 39, 45, 65, 66; and see real number, vector algebra multiple 96; and see linear combination product 5, 28, 32, 62, 66, 95, 109, 120; and see vector algebra triple product 5, 28, 67; and see 3-by-3 determinant -valued function 31
134
INDEX
scale 76 of notation 38, 48, 108; and see decimal scheme of multiplication 70 Schwarz, H. A. 62 screw 96 secant (sec) 121; and see cosine, reciprocal second-order differential equation 35-7, 39, 47 segment see straight semigroup 23> 24, 88; and see associative law, group, ring sequence 26, 33-4, 99, 107, 115; and see ordered, recurrence relation, right-handed series 107, 110, 114 set 1-6, 11, 17-18, 23, 31, 38, 42, 44, 46, 108, 115-16; and see algebra, analysis, Cartesian product, class, coset, domain, empty, finite, geometry, infinite, interval, range, space, subset, system seventeenth century 85, 97-8, 105, 116 show see demonstrate side of a real number 74, 119; and see greater than, less than of a triangle 62; and see segment sieve of Eratosthenes 87, 108 sign 61; and see minus, negative, plus, positive, reversal significant figures 3 similar argument 2, 51, 74 expression 5 simple differentiable arc 48, 110, 121; and see compact interval, complex-valued function, differentiability, one-to-one correspondence, range, straight harmonic motion 107; and see circular functions, differential equation simplification 112 simultaneous congruences 40, 42 equations 44-5, 62, 97 sin 111 sine (sin) see circular functions sinh see hyperbolic functions sixteenth century 51, 101 sketch 8, 19-21, 39, 45 skill 110, 114
small as we like 79; and see continuity, limit solubility of polynomial equations 93 solution see equation, general, particular, problem sophistication 81, 107, 110 space 95-6; and see Cartesian, Euclidean, vector special property 3 sphere 29, 32, 68 spin 96 square 13, 24-5,44,49; and see idempotent, perfect square, squared length root 2-4, 12, 34, 38, 57, 63-4, 66, 89, 93, 101, 109, 112; and see quadratic surd squared length 15, 93; and see length of a vector starting-point 112 statement see equivalent, proposition step-function 109, 7/7, 118; and see graph of a function straight line see line segment 14', 78; and see arc strict inclusion (<=,=>) 23, 50 inequality ( < , >) 62; and see greater than, less than positivity of the integration process for continuous functions 80, 118 strictly monotonic 113, 120; and see decreasing, increasing structure see logical subfield 4, 14-15, 42, 58-9, 62, 71, 91-3; and see field subgroup 13, 24, 73, 90, 100; and see group subinterval 90,101, 7/7,118; and see interval, Riemann integral subring 12, 69, 89, 97; and see ring subsequence 33 subset 1, 2, 11, 23, 38, 72, 99, 108, 115; and see inclusion, set subtraction 24, 51, 117; and see Abelian group, additive notation, difference, minus sufficient condition 25, 41, 46-7, 49, 89; and see implication sufficiently large 61, 75, 79; and see limit of a sequence small 83; and see continuity, limit of a function
INDEX sum 4, 12, 26, 28, 51, 56, 58, 90, 92, 97, 116-17; awd see addition, additive, linear combination, Riemann, mean, polynomial of functions 17-19, 47, 81; and see linear combination, pointwise, polynomial function of squares 24, 49 summation 1, 8, 22; and see condensing, geometrical progression, method, Riemann sums superfluous 51 supremum (sup) (least upper bound) 3, 8, 9, 13-14, 18-19, 33, 40, 46, 59, 99, 109, 112, 117-18 surd see quadratic surface 31-2, 68 surprising fact 85 syllabus 108-10 symbol 51-2; and see algebra, notation symmetric difference (A) 2, 17-18, 56 function 88 symmetry 50, 55, 64, 76 system 37, 95 of natural numbers 108; and see sequence of rational numbers 3, 34, 58, 112; and see minimal field, totally ordered field of real numbers 3, 34, 58, 90-2, 110, 112; and see complete totally ordered field of 3-dimensional real vectors 95; and see vector algebra systematic integration 110, 113; and see integration methods 7, 101 table 11; and see addition, multiplication tangent (tan) 39, 48 Tartaglia 101 technique 110, 113, 119 term 115; and see sequence, sum theorem 6-8,17, 24, 28, 46-7, 49, 53, 63, 66-7, 70-1, 73, 82, 85, 97-9, 105-7, 109-10, 112-14, 117, 120; and see proposition, result theory of Abelian groups 116 of algebraic fields 92 of conies 29
135
of of of of of of of of of of of
determinants 27, 67, 82 groups 97 limits 20 matrices 97 modules 116 numbers 98, 108 probability 87 rings 97, 100, 116 scalar triple products 67 the calculus 111 the circular functions 100, 105, 107, 113, 120 of the elementary transcendental functions 107, 113-14 of the hyperbolic functions 105, 107 of vector spaces 116 thermometers 76 3-by-3 determinant 32, 67, 109 matrix 32 3-dimensional real vector 4, 65; and see vector time 76, 95, 112, 114 topology 29, 56 totally ordered field 3, 13-14, 25, 34, 58, 59, 60, 66, 90-2, 109, 112 transcendental number 71 function 113-14; and see circular, exponential, hyperbolic, logarithmic transformation see linear, unitary transitive 58, 87; and see binary relation translation 94 translational motion 96 trial 7, 75 triangle 62, 97 inequality 15, 33, 62, 77, 92, 102, 109; and see distance, length, modulus trigonometry 113; and see circular functions twentieth century 108, 111 2-by-2 matrix 16, 17, 97 2-dimensional real vector 62 two minuses 57 unambiguous 113 unbounded function 9, 81, 109, 118 set of real numbers 14 unified theory 107; and see generalization uniform continuity 109, 112-13
136
INDEX
union ( u ) 1, 2, 6, 11, 13, 18, 38, 44, 50-2, 87, 108, 115-8 unique decomposition 108; and see fundamental theorem of arithmetic uniquely determined 14-15, 27, 32, 86, 95-6, 115, 117, 119 unit circle 5, 18, 68, 100 element of a group 88; and see multiplicative of a ring 2, 12, 25, 30, 89, 97; and see field, reciprocal vectors 16, 68; and see length, orthonormal basis unital module 116 unitary transformation 94 univalent function 77, 81—2, 112—13, 115; and see one-to-one, strictly monotonic unnecessary accuracy 74 unsound argument 42, 85, 110—11 upper bound 3, 13-14, 59, 60, 91, 100-1, 109, 112, 118; and see supremum Riemann sum 8, 41, 79, 109, 118 useful equipment 112
variable see complex, domain, element, real, vector vector see complex number, matrix, w-dimensional, quaternion, 3dimensional algebra 4-6,15-16,27-9,31-3,39, 40, 43, 45-6, 49, 65-70, 88, 93-6, 109 over a field 28, 66-8, 94; and see ndimensional vector, vector space product 5, 6, 15-16, 28, 43, 49, 66, 94-6, 109, 120 space 94, 116 -valued function 120; and see transformation velocity 95—6 verify see prove vertex 31 weighted mean 104 sum 116 weight 104; and see coefficient well-known identity 79, 80; and see geometrical progression Welsh Joint Education Committee 108 worthless approximation 113
valid inference 7; and see reasoning zero (0) see additive notation, algebra principle 34 of sets, field, number rule 57, 112 element of an Abelian group 88; value of a function 44, 46, 82-3, 117; and see additive and see constant, image, maxiof a ring 89; and see divisors of mum, minimum, range, univalent zero of a variable 3 9,42; and see domain vector (0) 5; and see vector