This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
-. (P)the partition 0 we obtain from (11.8) and (11.9) the relation I f(x) dx = I f (
--+ 0 and differ in sign if det -+ n 0,'!j;(x)to the= f(x) sionIfofwethenowfunction entire- f(real- x),line,which thenis an odd exten Using Theorem 1 , or more precisely the corollary to it, we find that e ' if X > 0 , 00 1 eix� d� = if X .= 0 , 27r J a + . � - oo 0 , if X < 0 ; +oo 1 ae i x � d � = e - a l x l . J ' 7r a2 + �2 e ' if X > 0 , +oo x � �ei d� = 0 ' if X = 0 , 7r J a 2 + � 2 -oo -eax if X < 0 . All thetheintegrals here arein understood inabsolute the senseconvergence, of the principal value,be although second one, view of its can also understood in thethe sense of animaginary ordinary parts improper integral. Separating real and i n these find the Laplace integrals we have encountered earlier last two integrals, we +oo J acosx� 2 + e d� = +oo � J sinx 2 2 d� = -1 .y2 dy + O(>.. - =) ) = = e>.S(xo) ( I ( (f o .y2 dy + 0 (>.. - oo ) ) ; E: .y2 dy + O(r=) ) . Under the requirements stated above, the function (f o .y 2 m dy ' 0 . -+ +oo, c 1 '1/J ( n\( o ) r ( n + 1 ) .x - W . / 'I/J (y)e - >.y2"' dy _ 2m n=O n. 2m O But, 'I/J ( 2k+ 1 ) (0) 0 since '1/J(y) is an even function; therefore this last � -. i yi 2 dy , I
---+
Now let us consider the general case of transformation of forms of any degree. Let
C
r1 ,
C
:
, T
P
:
Now let us consider how to carry out the transformation of forms practice.
m
12.5 Elementary Facts about Differential Forms
205
Example 1 3. In the domain V IR� let us take the 2-form w dx i 1 1\ dxi 2 • i i 1 m Let x
=
=
r
, . . . , r
matrix via the formulas t: i - ox i
C
r
j
=
C
=
, . . . , r
i
j
i ox ( t ) T , i � utJ ( t ) Tl , �2 � ut J 2 ( The summation on j runs from 1 to m . ) '> 1 -
=
=
1, . . . , n .
Thus,
Consequently, we have shown that If we useandproperties 28 ) andof( 12.the2 9last ) for the operation of transformation ofequality: forms9 repeat the( 12.reasoning example, we obtain the following 9 If (12.29) is used pointwise, one can see that
206
12 Surfaces and Differential Forms in IR.n
Weformremark thattheifargument we make ofthecp*formal change of variablethe xdifferentials x(t) in the that is on the left, express 1 , . . . , dtm , and gather like terms dxin 1the, . . resulting . , dxn in terms of the differentials dt expression, usingsidetheofproperties of the exterior product, we obtain precisely the right-hand Eq. (12.31). Indeed, for each fixed choice of indices i 1 , . . . , ip we have =
Summingthe such equalitiessideover all ordered sets 1 :::; i 1 < < ip :::; n, we obtain right-hand of (12.31). tance.Thus we have proved the following propostion, of great technical impor · · ·
C
If a differential form w is defined in a domain V �" and cp U is a smooth mapping of a domain U �m into V, then the coordinate expression of the form cp*w can be obtained from the coordinate expression
Proposition. : --+ V
C
of the form w by the direct change of variable x = cp(t) (with subsequent transformations in accordance with the properties of the exterior product). Example 1 4 . In particular, if = n = relation (12.31) reduces to the equality (12.32) Hence, if we write f(x ) dx 1 1\ 1\ dxn in a multiple integral instead of 1 n f ( x ) dx dx , the formula J f (x ) dx = J f (cp(t)) det cp'(t) dt p,
m
· · ·
· · ·
V =
U
207
12.5 Elementary Facts about Differential Forms
change of variable inwhen a multiple integral via an orientation-preserving dibyforffthe eomorphism (that is, det cp ( t ) > 0) could be obtained automatically substitution ( t ) , just as happened in the one-dimensional case, andformal it could be givenxthe= cpfollowing form: (12. 3 3) J w = J cp*w . We remark in conclusion that if the degree p of the form w in the domain V IR� is larger than the dimension m of the domain U !R m that is mapped into V via cp : U ---+ V, then the formP cp*w onPU corresponding to w isinjecti obviously zero. Thus the mapping cp* : .f.? (V) ---+ .f.? (U) is not necessarily v e in general. On the other hand, if cp : U ---+ V has a smooth inverse cp - 1 V ---+ U, then by (12. 30) and the equalities cp - 1 cp = e u, cp cp - 1 = ev, we - 1 )* eb and (cp- 1 )* cp* ey. And, since eb and ev find that (cp)* (cp are :the.f.?P(V) identity mappings on( cp-.f.?P(U) respectively, the mappings and .f.?P(V) 1 ) * : .f.?P(U) ---+ .f.?P (V) , as one would expect, and cp*turn out to ---+be .f.?P(U) inverses of each other. That is, in this case, the mapping ---+ .f.?P (U) is bij ective. cp* :We.f.?P(V) alongaswith the map ping cp*note thatfinally transfersthatforms, one thecanproperties verify, also(12.28)-(12. satisfies the3 0)relation (12. 34) cp*(dw) = d(cp*w) . important equality shows in particular that the oper atiis actually oThis n of theoretically differentiation of forms, whi c h we defined in coordinate notation, independent of thebe coordinate in whichin Chapt. the differentiable form w is written. This will discussed insystem more detail 15. '
U
C
C
o
=
o
o
o
=
:
1 2 . 5 . 5 Forms o n Surfaces
We say that a differential p-form w is defined on a smooth surface S !Rn if a p-form w(x) is defined on the vectors of the tangent
Definition 3. C
plane TSx to S at each point x E S. Example If the smooth surface S is contained in the domain D !Rn in which a form woneis can defined, then,thesincerestriction the inclusion TBxto TSTD.x Inholdsthisatwayeacha poi n t x E S, consider of w(x) form w l 8 arises, which it is natural to call the restriction ofxw to S. As weLetknow,cp : aU surface can be defined parametrically, either locally or globally. ---+ S = cp (U) D be a parametrized smooth surface in thedomain domainU ofDparameters and w a form can transfer form w to the and onwriteD. cp*Then w in wecoordinate formthein accordance 1 5.
C
C
C
12 Surfaces and Differential Forms in IR n
208
with the algorithm given above. It is clear that the form cp*w in U obtained in this way coincides with thecp'(t)formTUtcp* (w-+I 8 ).TB is an isomorphism between We remark that, since x TUt and TSx at every point t E U, we can transfer forms both from S to U and fromlocally U to S, and so just as the smooth surfaces themselves are usually defined or globally forms onof local them,charts. in the final analysis, are usually definedbyin parameters, the parameterthedomains Example Let w� be the flux form considered in Example 8, generated by the velocity field V of a flow in the domain D of the oriented Euclidean space JR3. If S is a smooth oriented surface in D, one may consider the restriction w� l s of the form w� to S. The form w� l s so obtained characterizes the flux acrossIf eachI -+element of the surface S. S is a local chart of the surface S, then, making the change of variablethex coordinate cp(t) in the coordinate expression (12. 2 2) for the form w�, we obtain for thecoordinates form cp*w�of thecp*surface. (w� I s ), which is defined on the square I,expression in these local Let w}F acting be theinwork form considered in Example 7, generated byExample the force field a domain D of Euclidean space. Let I -+ cp(I) D be a smooth path ( is not necessarily a homeomorphism). Then, accordance with the general principle of restriction and transfer of forms, aina(t)form cp*w} arises on the closed interval I, whose coordinate representation dt can (12.21) be obtained the change of variable x cp(t) in the coordinate expression for thebyform w}. :
1 6.
'P
:
=
=
1 7.
C
'P
'P
:
=
1 2 . 5 . 6 Problems and Exercises 1. Compute the values of the differential forms w in IR n given below on the indicated sets of vectors: a) w = x 2 dx 1 on the vector e = ( 1 , 2, 3) E TIR (3 , 2 , 1 ) · b) w = dx 1 /\dx 3 +x 1 dx 2 Adx4 on the ordered pair of vectors e 1 ' e 2 E TIR[1 ,0,0,0 ) . c) w = df, where f = x 1 + 2x 2 + · · · + n x n , and e = (1, -, 1 , . . . , ( - 1 ) n 1 ) E -
TIR n( 1 , 1 , . . . , 1 ) .
2 . a) Verify that the form dx i1 1\ · · · A dx i k is identically zero if the indices i 1 , . . . , i k are not all distinct . b) Explain why there are no nonzero skew-symmetric forms of degree p > n on an n-dimensional vector space. c) Simplify the expression for the form
2 dx 1 1\ dx 3 1\ dx 2 + 3 dx 3 1\ dx 1 1\ dx 2 - dx 2 1\ dx 3 1\ dx 1 . d) Remove the parentheses and gather like terms:
(x 1 dx 2 + x 2 dx 1 ) 1\ (x 3 dx 1 1\ d x 2 + x 2 dx 1 1\ dx 3 + x 1 dx 2 1\ dx3 ) .
12.5 Elementary Facts about Differential Forms
209
e) Write the form df 1\ dg, where f = In ( 1 + l x l 2 ) , g = sin l x l , and x l ( x , x 2 , x 3 ) as a linear combination of the forms dx i 1 1\ d x i 2 , 1 ::::; i 1 < i 2 ::::; 3.
=
f) Verify that in IR n
dt 1\ · · · 1\ d r ( x ) = det
( ��;) (x) dx 1 1\ · · · 1\ dxn .
g) Carry out all the computations and show that for 1 ::::; k ::::; n dt 1\ . . . 1\ df k = 3. a
a) Show that a form a of even degree commutes with any form (3, that is, 1\ (3 = (3 1\ a. n b) Let w = L dp; 1\ dq i and w n = w 1\ · · · 1\ w ( n factors) . Verify that
i= l
n ( n -1) W n = n! dp 1 1\ dq 1 1\ · · · 1\ dpn /\ dq n = (- 1) 2 dp 1 1\ · · · l\ dpn l\ dq 1 1\ · · · 1\ dq n . 4. a) Write the form w = df, where f ( x ) = ( x 1 ) 2 + ( x 2 ) 2 + · · · + (x n ) 2 , as a combination of the forms dx 1 , . . . , dx n and find the differential dw of w. b) Verify that d 2 f 0 for any function f E c C 2 ) (D , IR ) , where d 2 = d o d, and d is exterior differentiation. c) Show that if the coefficients a; 1 , ... ,ik of the form w = a; 1 , ... ,ik ( x ) dx i 1 1\ · · · I\ ik dx belongs to the class cC 2) (D, IR) , then d 2 w 0 in the domain D. x d) Find the exterior differential of the form d'2 X � 2dy in its domain of definition. =
=
Y
5.
y
If the product dx 1 · · · dx n in the multiple integral J f ( x ) d x 1 · · · dx n is interD
preted as the form dx 1 1\ · · 1\ dx n , then, by the result of Example 14, we have the possibility of formally obtaining the integrand in the formula for change of variable in a multiple integral. Using this recommendation, carry out the following changes of variable from Cartesian coordinates: a) to polar coordinates in IR 2 , b) to cylindrical coordinates in JR 3 , c) to spherical coordinates in JR3 . ·
6. Find the restriction of the following forms: a) dxi to the hyperplane x i = 1 . b ) dx 1\ dy t o the curve x = x (t ) , y = y (t ) , a < t < b. c) dx 1\ dy to the plane in JR 3 defined by the equation x = c. d) dy 1\ dz + dz 1\ dx + dx 1\ dy to the faces of the standard unit cube in IR 3 . e) w; = dx 1 1\ · · · 1\ dx i - 1 1\ dx i 1\dx i + l 1\ · · · 1\ dx n to the faces of the standard unit cube in IRn . The symbol stands over the differential dx i that is to be omitted in the product. �
210
1 2 Surfaces and Differential Forms in .IR. n
7. Express the restriction of the following forms to the sphere of radius R with center at the origin in spherical coordinates on .IR.3 : a) dx, b) dy, c) dy 1\ dz. 8. The mapping 'P : .IR. 2 -+ .IR. 2 is given in the form ( u , v ) 1--t ( u · v , 1) = (x, y) . Find: a) 'P* (dx) , b) 'P* (dy) , c) 'P* (y dx) .
Verify that the exterior differential d : Q P (D) -+ !] P +1 (D) has the following properties: a) d(w 1 + w 2 ) = dw 1 + dw 2 , b) d(w 1 1\ w2 ) = dw 1 1\ w 2 + ( - 1 ) deg w 1 w 1 1\ dw 2 , where deg w 1 is the degree of the form w 1 . c) Vw E W d(dw) = 0. d) Vf E il 0 df = f: *!r dx ; . t= l Show that there is only one mapping d : QP (D) -+ !]P +1 (D) having properties a) , b) , c) , and d) . 9.
1 0. Verify that the mapping 'P* : QP (V) -+ QP (U) corresponding to a mapping 'P : U -+ V has the following properties: a) 'P* (w l + w2 ) = '{J*w l + '{J*w 2 . b) 'P* (w l 1\ w 2 ) = '{J*w l 1\ '{J*w 2 . c) d'{J*w = 'P*dw. d) If there is a mapping 'ljJ : V -+ W , then ('1/J o '{)) * = 'P* o 'ljJ*.
1 1 . Show that a smooth k-dimensional surface is orientable if and only if there exists a k-form on it that is not degenerate at any point.
1 3 L ine and S urface Int egrals
1 3 . 1 The Integral of a Differential Form 13. 1 . 1 The Original Problems, Suggestive Considerations, Examples The Work of a Field Let F(x) be a continuous force field acting in aa.indomain G of the Euclidean space rn: n . The displacment of a test particle thebyfieldthe isfieldaccompanied byunitwork.testWeparticle ask how weacangivencompute the more work done in moving a along trajectory, precisely, a smooth path I ---+ "'((/) G. We have already touched on that this reason problemwewhen we studied thethe solution applica tions of the definite integral. For can merely recall ofwilthel beproblem atwhat this point, noting certain elements of the construction that useful in follows. It is known that in a constant field F the displacement by a vector e is associated with an amount of work (F, e). Let t H x(t) be a smooth mapping I ---+ G defined on the closed interval I = {t E IR I a :::; t :::; b } . We take a sufficiently fine partition of the closed interval [a, b]. Then onequality each x(interval Ii = { t E II ti� 1 :::; t :::; ti } of the partition we have the t) - x( ti) ;::::; x' ( t) ( ti - ti� 1 ) up to infinitesimals of higher order. To the displacement vector Ti ti+l - ti from the point ti (Fig. 13.1) there 'Y
C
:
'Y
=
Fig. 1 3 . 1 .
:
13 Line and Surface Integrals
212
corresponds a displacement of x(ti ) in !Rn by the vector Llxi = xi +l - Xi , which can atbex(tregarded as equal to the tangent vector �i = x(ti )Ti to the trajectory )i with the same precision. Since the field F(x) is continuous, it canLlA be regarded a locallytoconstant, for thatJ with reasonsmall we canrelative compute work corresponding the (time)andinterval errortheas i
i
or Hence,
A = L LlA i L(F (x(t i ) ) , x(t i ))Llt i i i �
weandfindso, passing that to the limit as the partition of the closed interval I is refined, (13.1) A '= / (F ( x(t) ) , x(t)) dt . If thetheexpression (F ( x(t) ) , x(t)) dt is rewritten as (F(x), dx), then, as n suming coordinates coordinates, we can give this n dxn , after which we can write (13.1) as expression the form F 1 dxin1 +!R are+ FCartesian A = J F 1 dx 1 + + Fn dxn (13. 2 ) or as (13. 2') Formula the precise meaning the integrals 1-form along(13.1) the pathprovides "'( written in formulas (13. 2of) and (13. 2'). of the work Example Consider the force field F = ( - x2 ty2 , x2�y2 ) defined at all the origin. Let us compute the work of this pointsalong of thetheplane IR2"Y except field curve l defined as x = cost, y = sin t, 0 t 2n, and along curve defined by x(13.= 22'),+ coswe t,findy = sin t, 0 t 2n. According to formulasthe (13.1), (13. 2 ), and J WF1 = J - X2 +Y y2 dx + X2 +X y2 dy = 211" ( sin t (- sin t) cos t cos t ) =J cos2 t + sin2 t + cos2 t + sin2 t dt = 27r b
a
· · ·
'"'(
· · ·
1.
::;
'"'11
'"'11
0
·
::;
::;
·
::;
and
13. 1 The Integral of a Differential Form
J
72
21T
213
� -ydx + xdy - � - sint(- sint) + (2 + cost)(cost) dt WF1 -2 2 2 2 X + y sin (2 + cos t) t + 72 0 1T 0
_
k
2cost dt = J 1 + 2cost dt + J 1 + 2cos(27r - u) du = = J 51 ++ 4cost 0 0 5 + 4cost 1T 5 + 4cos(27r - u) 1T 1 + 2cost r 1 + 2cosu = J 5 + 4cost dt - }0 5 + 4cosu du = O . 0 Example Let r be the radius vector of a point (x, y, z) E JR3 and r = l r l . Suppose a force field F = f(r)r is defined everywhere in JR 3 except at the origin. is a so-called central force field. Let us find the work of F on a path This [0, 1 ] JR3 \ 0. Using (13. 2 ), we find J f(r)(xdx + ydy + zdz) = � j f (r)d(x2 + y2 + z2 ) = 2.
'Y :
7
--+
1
= �J 0
7
f(r(t) ) dr2 (t) =
1
� J0 f ( vfuW) du(t) = = � J f ( v'U) du = 4i(ro, r 1 ) . r1
2
r
5
Here, as one can see, we have set x2 (t)+y2 (t)+z2 (t) = r2 (t), r2 (t) = u(t),
ro = r(O), and r 1 = r(1). Thus in any centralr fieldandther work onbeginning a path and has turned out path to depend only on the distances of the end of the from 0 the center 0 of the field. 1 'Y
for the gravitational field � r of a unit point mass located at theIn particular, origin, we obtain
b. The Flux Across a Surface Suppose there is a steady flow of liquid (or gas) in a domain G of the oriented Euclidean space JR 3 and that x V(x) issurface the velocity field of this flow. In addition, suppose that a smooth oriented S has been chosen in G. For definiteness we shall suppose that the orientation of S is given by a field of normal vectors. We ask how to determine H
214
13 Line and Surface Integrals
the ) outflow or flux of fluid across the surface S. More precisely, weunitask(volumetric how to finddirection the volume of fluid thatorienting flows across the surface toS per time in the indicated by the field of normals the surface. To solveandtheequalproblem, we remark that if the velocity field of the flow is constant to V, then the flow per unit time across a parallelogram spanned onby thevectors e 1 and e 2 equals the volume of the parallelepiped constructed vectors V, e 1 , e 2 . If is normal to and we seek the flux across i n the direction of it equals the scalar triple product (V, e 1 , e2 ), provided and the frame e 1 , e 2 give the same orientation ( that is, if , e 2 is a frame having the given orientation in JR3 ) . If the frame e l , e 2 githeveesldirection the orientation opposite to the one given by in then the flow in of is - (V, e 1 , e 2 ) . We nowthatreturnthe toentire the original statement of the problem. For simplicity let usS assume surface S admits a smooth parametrization
II
TJ,
TJ,
TJ
TJ
II
TJ
II
TJ
II,
--+
c
=
=
=
Fig. 1 3 . 2 .
TJ,
13. 1 The Integral of a Differential Form
Summing the elementary fluxes, we obtain
215
where w�(x) (V(x) , ) is the flux 2-form ( studied in Example 8 of Sect. 12. pass totoassume the limit, interval I,5 ) it. Ifis wenatural thattaking ever finer partitions P of the ( 13. 3 ) This last symbol is the integral of the 2-form w� over the oriented surface S. ( 12. 2 2 ) of Sect. 12. 5 ) the coordinate expression for the fluxRecalling form w� (informula Cartesian coordinates, we may now also write ( 13. 4 ) :F J V 1 dx 2 1\ dx3 + V2 dx 3 1\ dx 1 + V 3 dx 1 1\ dx 2 . We havealldicussed here only thegivegeneral principle for solving this problem. Inandessence we have done is to the precise definition ( 13. 3 ) of the flux :F introducedcomputational certain notation ( 13. 3 ) and ( 13. 4 ) ; we have still not obtained anyWeeffective formula similar to formula ( 13.1 ) for the work. remark that formula ( 13.1 ) can be obtained from ( 13. 2 ) by replacing x\ . . . , x n with the functions (x\ . . . , x n ) (t) = x(t) that define the path "'(. We recallof (theSect.form12.w5 ) defined that suchin Ga tosubstitution can be Iinterpreted as the transfer the closed interval = [a, b]. In a completely analogous way, ofa the computational formula forofthethefluxsurface can beintoobtained by direct substitution parametric equations In( 13.fact,4) . ·, ·
=
=
s
and The form cp* w� is defined on a two-dimensional interval I IR 2 . Any 2-form in I has the form f(t) dt 1 1\ dt 2 , where f is a function on I depending on the form. Therefore c
But dt1 1\ dt 2 (r 1 , r 2 ) Tf · Ti is the area of the rectangle Ii spanned by the orthogonal vectors r 1 , r2 . =
216
13 Line and Surface Integrals
Thus,
As the partition is refined we obtain in the limit
J f (t) dt1 I
(\
dt 2 =
J f (t) dt 1 dt2 ' I
(13.5)
where, waccording to (13.3), the left-hand side contains the integral of the 2 = f(t) dt 1 1\ dt 2 over the elementary oriented surface I, and the 2-form right-hand sideonlythe tointegral ofthatthethefunction f over the rectangle I. It remains recall coordinate representation f(t) dt 1 1\ dt2 ofby thetheform cp*w� is obtained from the coordinate expression for the form w� direct substitution x = cp(t) , where cp : I -+ G is a chart of the surface s. Carrying out this change of variable, we obtain from (13.4)
last integral,I. as Eq. (13.5) shows, is the ordinary Riemann integral overThis the rectangle Thus we have found that (13.6)
where x = cp( t) = ( cp l , cp2 , cp:�) ( t 1 , t 2 ) is a chart of the surface S defining the same orientation as the field of normals we have given. If the chart cp : I -+ S gives S the opposite orientation, Eq. (13.6) does not generally hold. But, as follows from thesides considerations at thein beginning of case. this subsection, the left andThe right-hand will di ff er only si g n in that finalfluxes formulaL1:F(13.6) is obviously merely the limit of the sums of the elementary ;::::; (V(x i ) , e 1 , e 2 ) familiar to us, written accurately in i the Wecoordinates t 1 and t 2 . have considered thecancasebe decomposed of a surface into definedsmooth by apieces single Schart. In general a smooth surface i having essentially another,theandpiecesthenS we can find the flux through S asno theintersections sum of thewithfluxesonethough i.
13. 1 The Integral of a Differential Form
217
Example 3 . Suppose a medium is advancing with constant velocity V = (1, 0, 0). If we take any closed surface in the domain of the flow, then, since
the density of the medium does not change, the amount of matter in the volume bounded by this surface must remain constant. Hence the total flux of the medium through such a surface must be zero. In this case, let us check formula (13.6) by taking S to be the sphere x 2 + y 2 + z 2 = R2 . Up to a set of area zero, which is therefore negligible, this sphere can be defined parametrically
x = R cos 'ljJ cos
0
where <
j I
V = (1,
0, 0) are substituted in
8x 1 8x 2 7r / 2 2 7r 8t 1 8t 1 d
j
j
0.
Since the integral equals zero, we have not even bothered to consider whether it was the inward or outward flow we were computing. Suppose the velocity field of a medium moving in IR. 3 is defined in Cartesian coordinates x, y, z by the equality V(x, y, z) (V l , V 2 , V 3 ) (x, y, z) = (x, y , z) . Let us find the flux through the sphere x 2 + y 2 + z 2 = R2 into the ball that it bounds (that is, in the direction of the inward normal) in this case. Taking the parametrization of the sphere given in the last example, and carrying out the substitution in the right-hand side of (13.6), we find that
Example 4.
R cos 'ljJ cos
0
2 7r
=
7r / 2
j d
- 7r / 2
We now check to see whether the orientation of the sphere given by the curvilinear coordinates (
13 Line and Surface Integrals
218
In this case the result is easy to verify: the velocity vector V of the flow has magnitude equal to R at each point of the sphere, is orthogonal to the sphere, and points outward. Therefore the outward flux from the inside equals the area of the sphere 47r R 2 multiplied by R. The flux in the opposite direction is then -47r R3 . 13. 1 . 2 Definition of the Integral of a Form over an Oriented Surface
13.1.1
The solution of the problems considered in Subsect. leads to the defi nition of the integral of a k-form over a k-dimensional surface. First let S be a smooth k-dimensional surface in �n , defined by one stan dard chart
13. 2 ).
i
the mesh A(P) of the partition tends to zero. Thus we adopt the following definition: Definition 1 . (Integral of a k-form w over a given chart smooth k-dimensional surface.)
Jw s
:=
lim
.\ ( P ) --t O
---+
S of a
L(
z
(13. 7)
If we apply this definition to the k-form f (t) dt 1 1\ · · · 1\ dt k on I (when
J f (t) dt 1 1\ . . . 1\ dtk I It thus follows from (13. 7 ) that
J=
S (I)
(13. 8 )
JI f(t) dt 1 . . . dtk .
(13 . 8)
J
(13. 9 )
=
w=
I
and the last integral, as Eq. shows, reduces to the ordinary multiple integral over the interval I of the function f corresponding to the form
13.1 The Integral of a Differential Form
219
We have derived the important relations (13. 8 ) and (13. 9 ) from Definition 1,In but they themselves could have been adopted as the original definitions. particular, if D is an arbitrary domain in !R n (not necessarily an interval) ,
then, so as not to repeat the summation procedure, we set
J
f(t) dt 1 A · · · A dt k :=
D
J
j (t) dt 1 · · · dt k .
(13. 8 ')
D
and for a smooth surface given in the form cp : D --+ S and a k-form w on it we set
J
w :=
J
cp *w .
(13. 9 ')
D
S = cp ( D )
If S is an arbitrary piecewise-smooth k-dimensional surface and w is a k-form defined on the smooth pieces of S, then, representing S as the union Ui Si of smooth parametrized surfaces that intersect only in sets of lower dimension, we set w := � w . )
J
2
S
J
(13.10
S;
In the absence of substantive physical or other problems that can be solved using ( ) such a definition raises the question whether the magni tude of the integral of the partition u si is independent of the choice of the i parametrization of its pieces. Let us verify that this definition is unambiguous.
13. 1 0 ,
Proof. We begin by considering the simplest case in which S is a domain Dx in JR k and cp : D t --+ Dx is a diffeomorphism of a domain D t C JR k onto D x · In Dx = S the k-form w has the form f(x) dx 1 A · · · A dx k . Then, on the one hand implies
(13. 8 )
J
f(x) dx 1 A · · · A dx k =
Dx
J
f(x) dx 1 · · · dx k .
Dx
13. 9' ) and (13. 8 '), j w : = J cp*w = j f (cp(t)) det cp' (t) dt 1 · · · dt k .
On the other hand, by ( Dx
D,
D,
But if det cp' (t) > 0 in D t , then by the theorem on change of variable in a multiple integral we have
J
D x = cp ( D, )
f(x) dx 1 · · · dx k =
/ J (cp(t) ) det cp' (t) dt 1 · · · dt k .
D,
220
13 Line and Surface Integrals
Hence, assuming that there were coordinates x 1 , . . . , x k in S = Dx and curvilinear coordinates t 1 , . . . , t k of the same orientation class, we have shown that the value of the integral J w is the same, no matter which of these two s
coordinate systems is used to compute it . We note that if the curvilinear coordinates t 1 , . . . , t k had defined the op posite orientation on S, that is, det
i
follows from the additivity of the ordinary multiple integral (it suffices to consider a finer partition obtained by superimposing two partitions and ver ify that the value of the integral over the finer partition equals the value over each of the two original partitions) . D
On the basis of these considerations, it now makes sense to adopt the following chain of formal definitions corresponding to the construction of the integral of a form explained in Definition 1 . Definition 1 ' . (Integral of a form over an oriented surface S
a ) I f the form f(x)dx 1 1\
J
D
· · ·
1\ dx k i s defined i n a domain
J(x) dx 1 1\ · · · 1\ dx k :=
J
D
c
D
f(x) dx 1 · · · dx k .
b) If S c !Rn is a smooth k-dimensional oriented surface,
JRn . ) c JR k , then
: D --+ S is a
13.1 The Integral of a Differential Form
J w J �.p*w , :=
S
±
221
D
where the + sign is taken if the parametrization i.p agrees with the given orientation of S and the - sign in the opposite case. c ) If S is a piecewise-smooth k-dimensional oriented surface in JR n and w is a k-form on S ( defined where S has a tangent plane ) , then
where 51 , . . . , Sm , . . . is a decomposition of S into smooth parametrizable k-dimensional pieces intersecting at most in piecewise-smooth surfaces of smaller dimension. We see in particular that changing the orientation of a surface leads to a change in the sign of the integral. 13. 1 . 3 Problems and Exercises
a) Let x, y be Cartesian coordinates on the plane JR 2 . Exhibit the vector field whose work form is w = - x2t y2 dx + x2: y 2 dy. b) Find the integral of the form w in a) along the following paths 1; : [0, JT] 3 t � (cos t, sin t) E JR 2 ; [O, JT] 3 t � (cos t, - sin t) E JR 2 ; 1.
13
consists of a motion along the closed intervals joining the points ( 1 , 0), ( 1 , 1 ) , ( - 1 , 1 ) , ( - 1 , 0) i n that order; 14 consists of a motion along the closed intervals joining ( 1 , 0) , ( 1 , - 1 ) , ( - 1 , - 1 ) , ( - 1 , 0) in that order. 2. Let f be a smooth function in the domain D C R n and 1 a smooth path in D with initial point Po E D and terminal point PI E D. Find the integral of the form w = df over I · 3. a) Find the integral of the form w = dy 1\ dz + dz 1\ dx over the boundary of the standard unit cube in JR3 oriented by an outward-pointing normal. b) Exhibit a velocity field for which the form w in a) is the flux form. 4. a) Let x, y, z be Cartesian coordinates in JR n . Exhibit a velocity field for which the flux form is x dy 1\ dz + y dz 1\ dx + z dx 1\ dy w = (x 2 + y 2 + z 2 ) 3 / 2 b) Find the integral of the form w in a) over the sphere x 2 + y 2 + z 2 = R 2 oriented by the outward normal. x c) Show that the flux of the field X 2 +(y � y+, z ;)31 2 across the sphere (x - 2) 2 + y 2 + z C z 2 = 1 is zero. d) Verify that the flux of the field in c) across the torus whose parametric equations are given in Example 4 of Sect. 12. 1 is also zero.
222
13 Line and Surface Integrals
5. It is known that the pressure P, volume V, and temperature T of a given quantity of a substance are connected by an equation f (P, V, T) = 0, called the equation of state in thermodynamics. For example, for one mole of an ideal gas the equation of state is given by Clapeyron ' s formula Pi - R = 0, where R is the universal gas constant. Since P, V, T are connected by the equation of state, knowing any pair of them, one can theoretically determine the remaining one. Hence the state of any system can be characterized, for example, by points (V, P) of the plane IR. 2 with coordinates V, P. Then the evolution of the state of the system as a function of time will correspond to some path "( in this plane. Suppose the gas is located in a cylinder in which a frictionless piston can move. By changing the position of the piston, we can change the state of the gas enclosed by the piston and the cylinder walls at the cost of doing mechanical work. Con versely, by changing the state of the gas (heating it, for example) we can force the gas to do mechanical work (lifting a weight by expanding, for example) . In this problem and in Problems 6, 7, and 8 below, all processes are assumed to take place so slowly that the temperature and pressure are able to average out at each partic ular instant of time; thus at each instant of time the system satisfies the equation of state. These are the so-called quasi-static processes. a) Let "( be a path in the V P-plane corresponding to a quasi-static transition of the gas enclosed by the piston and the cylinder walls from state Vo , Po to V1 , P1 . Show that the quantity A of mechanical work performed on this path is defined by the line integral A = J P d V.
b) Find the mechanical work performed by one mole of an ideal gas in passing from the state V0 , Po to state V1 , H along each of the following paths (Fig. 13.3) : "(OL I , consisting of the isobar OL (P = Po ) followed by the isochore LI (V = V1 ) ; "(o K 1 , consisting o f the isochore 0 K (V = Vo ) followed by the isobar K I ( P = H ) ; "(OJ , consisting o f the isotherm T = canst (assuming that Po Vo = P1 V1 ) . p Po
0
L
E:JJ
Vo
Fig. 1 3 . 3 .
c) Show that the formula obtained in a) for the mechanical work performed by the gas enclosed by the piston and the cylinder walls is actually general, that is, it remains valid for the work of a gas enclosed in any deformable container. 6. The quantity of heat acquired by a system in some process of varying its states, like the mechanical work performed by the system (see Problem 5) , depends not only on the initial and final states of the system, but also on the transition path. An
13.1 The Integral of a Differential Form
223
important characteristic of a substance and the thermodynamic process performed by ( or on ) it is its heat capacity, the ratio of the heat acquired by the substance to the change in its temperature. A precise definition of heat capacity can be given as follows. Let x be a point in the plane of states F ( with coordinates V, P or V, T or P, T) and e E TFx a vector indicating the direction of displacement from the point x. Let t be a small parameter. Let us consider the displacment from the state x to the state x + te along the closed interval in the plane F whose endpoints are these states. Let LlQ(x, te) be the heat acquired by the substance in this process and LlT(x, te) the change in the temperature of the substance. The heat capacity C = C(x, e) of the substance ( or system ) corresponding to the state x and the direction e of displacement from that state is
C ( x, te )
=
te) . . LlQ(x, " hm L.l T( x, te )
t --+ O
In particular, if the system is thermally insulated, its evolution takes place with out any exchange of heat with the surrounding medium. This is a so-called adiabatic process. The curve in the plane of states F corresponding to such a process is called an adiabatic. Hence, zero heat capacity of the system corresponds to displacement from a given state x along an adiabatic. Infinite heat capacity corresponds to displacement along an isotherm T = canst . The heat capacities at constant volume Cv = C(x, ev ) and at constant pressure Cp = C(x, ep ) , which correspond respectively to displacement along an isochore V = canst and an isobar P = canst, are used particularly often. Experiment shows that in a rather wide range of states of a given mass of substance, each of the quantities Cv and Cp can be considered practically constant. The heat capacity corresponding to one mole of a given substance is customarily called the molecular heat capacity and is denoted ( in contrast to the others ) by upper case letters rather than lower case. We shall assume that we are dealing with one mole of a substance. Between the quantity LlQ of heat acquired by the substance in the process, the change LlU in its internal energy, and the mechanical work LlA it performs, the law of conservation of energy provides the connection LlQ = LlU + LlA. Thus, under a small displacement te from state x E F the heat acquired can be found as the value of the form r5Q := dU + P dV at the point x on the vector te E TFx ( for the formula P dV for the work see Problem 5c ) ) . Hence if T and V are regarded as the coordinates of the state and the displacement parameter ( in a nonisothermal direction ) is taken as T, then we can write The derivative �� determines the direction of displacment from the state x E F in the plane of states with coordinates T and V. In particular, if �� = 0 then the displacement is in the direction of the isochore V = canst, and we find that . ( In the general case V = V ( P, T) Cv = �� . If P = canst, then �� = ( ��) P=COnst is the equation of state f( P, V, T) = 0 solved for V. ) Hence
224
13 Line and Surface Integrals
where the subscripts P, V, and T on the right-hand side indicate the parameter of state that is fixed when the partial derivative is taken. Comparing the resulting expressions for Cv and Cp , we see that
By experiments on gases ( the Joule 1 -Thomson experiments ) it was established and then postulated in the model of an ideal gas that its internal energy depends only on the temperature, that is, ( ��) = 0. Thus for an ideal gas Cp - Cv = T
P ( ��) Taking account of the equation PV = RT for one mole of an ideal gas, we obtain the relation Cp - Cv = R from this, known as Mayer 's equation 2 in thermodynamics. The fact that the internal energy of a mole of gas depends only on temperature makes it possible to write the form 8Q as 8Q = au aT dT + P dV = Cv dT + P dV . To compute the quantity of heat acquired by a mole of gas when its state varies over the path "( one must consequently find the integral of the form Cv dT + P d V over 'Y · It is sometimes convenient to have this form in terms of the variables V and P. If we use the equation of state PV = RT and the relation Cp - Cv = R, we obtain p 8Q = Cp R dV + Cv v dP . R a ) Write the formula for the quantity Q of heat acquired by a mole of gas as its state varies along the path 'Y in the plane of states F. b ) Assuming the quantities Cp and Cv are constant, find the quantity Q cor responding to the paths "(O L I , "(O KI , and "(O I in Problem 5b ) . c ) Find ( following Poisson ) the equation of the adiabatic passing through the point (Vo , Po) in the plane of states F with coordinates V and P. ( Poisson found that PV C p / C v = const on an adiabatic. The quantity CpfCv is the adiabatic constant of the gas. For air CpfCv ::::: 1 .4.) Now compute the work one must do in order to confine a thermally isolated mole of air in the state (Vo , Po) to the volume V1 = � Vo . 7 . We recall that a Carnot cycle 3 of variation i n the state of the working body of a heat engine ( for example, the gas under the piston in a cylinder ) consists of the following ( Fig. 13.4) . There are two energy-storing bodies, a heater and a cooler ( for 1 G.P. Joule (1818-1889) - British physicist who discovered the law of thermal action of a current and also determined, independently of Mayer, the mechanical equivalent of heat. 2 J.P. Mayer (1814-1878) - German scholar, a physician by training; he stated the law of conservation and transformation of energy and found the mechanical equivalent of heat. 3 N.L.S. Carnot ( 1796-1832) - French engineer, one of the founders of thermody namics. P
.
13. 1 The Integral of a Differential Form p
225
1
v
Fig. 13.4.
example, a steam boiler and the atmosphere ) maintained at constant temperatures T1 and T2 respectively (T1 > T2 ) . The working body ( gas ) of this heat engine, having temperature T1 in State 1, is brought into contact with the heater, and by decreasing the external pressure along an isotherm, expands quasi-statically and moves to State 2. In the process the engine borrows a quantity of heat Q 1 from the heater and performs mechanical work A 1 2 against the external pressure. In State 2 the gas is thermally insulated and forced to expand quasi-statically to state 3, until its temperature reaches T2 , the temperature of the cooler. In this process the engine also performs a certain quantity of work A 23 against the external pressure. In State 3 the gas is brought into contact with the cooler and compressed isothermically to State 4 by increasing the pressure. In this process work is done on the gas ( the gas itself performs negative work A 3 4 ) , and the gas gives up a certain quantity of heat Q 2 to the cooler. State 4 is chosen so that it is possible to return from it to State 1 by a quasi-static compression along an adiabatic. Thus the gas is returned to State 1. In the process it is necessary to perform some work on the gas ( and the gas itself performs negative work A 41 ) . As a result of this cyclic process ( a Carnot cycle ) the internal energy of the gas ( the working body of the engine ) obviously does not change ( after all, we have returned to the initial state ) . Therefore the work performed by the engine is A = A 1 2 + A 23 + A 3 4 + A 41 = Q 1 - Q 2 . The heat Q 1 acquired from the heater went only partly to perform the work A. It is natural to call the quantity TJ = = Q'(J1Q2 the efficiency of the heat engine under consideration. a) Using the results obtained in a) and c ) of Problem 6, show that the equality � = � holds for a Carnot cycle. b ) Now prove the following theorem, the first of Carnot's two famous theorems. The efficiency of a heat engine working along a Carnot cycle depends only on the temperatures T1 and T2 of the heater and cooler. ( It is independent of the structure of the engine or the form of its working body. )
r!}-1
8. Let 'Y be a closed path in the plane of states F of the working body of an arbitrary heat engine ( see Problem 7) corresponding to a closed cycle of work performed by it. The quantity of heat that the working body ( a gas, for example ) exchanges with the surrounding medium and the temperature at which the heat exchange takes place are connected by the Clausius inequality J §!1- ::; 0. Here 8Q is the heat exchange ' form mentioned in Problem 6.
226
13 Line and Surface Integrals
a) Show that for a Carnot cycle (see Problem 7) , the Clausius inequality becomes equality. b) Show that if the work cycle "( can be run in reverse, then the Clausius inequality becomes equality. c) Let "( 1 and "(2 be the parts of the path "( on which the working body of a heat engine acquires heat from without and imparts it to the surrounding medium respectively. Let T1 be the maximal temperature of the working body on "( 1 and T2 its minimal temperature on "(2 · Finally, let Q 1 be the heat acquired on "(1 and Q 2 the heat given up on 'Y2 · Based on Clausius' inequality, show that � :::; R · d) Obtain the estimate TJ :::; T,n_T2 for the efficiency of any heat engine (see Problem 7) . This is Carnot 's second theorem. (Estimate separately the efficiency of a steam engine in which the maximal temperature of the steam is at most 150° C, that is, T1 = 423 K, and the temperature of the cooler - the surrounding medium - is of the order 20° C, that is T2 = 291 K.) e) Compare the results of Problems 7b) and 8d) and verify that a heat engine working in a Carnot cycle has the maximum possible efficiency for given values of T1 and T2 . The differential equation � = ��:? is said to have variables separable. It is usually rewritten in the form g(y) dy = f(x) dx, in which "the variables are sepa rated," then "solved" by equating the primitives I g(y) dy = I f(x) dx. Using the language of differential forms, now give a detailed mathematical explanation for this algorithm. 9.
1 3 . 2 The Volume Element . Integrals of First and Second Kind 1 3 . 2 . 1 The Mass of a Lamina
S be a lamina in Euclidean space JR.n . Assume that we know the density p (x) ( per unit area ) of the mass distribution on S. We ask how one can determine the total mass of S. Let
In order to solve this problem it is necessary first of all to take account of the fact that the surface density p (x) is the limit of the ratio .1m of the quantity of mass on a portion of the surface in a neighborhood of x to the area ..1 a of that same portion of the surface, as the neighborhood is contracted to x. By breaking S into small pieces Si and assuming that p is continuous on S, we can find the mass of Si , neglecting the variation of p within each small piece, from the relation
..1 m i ::::::: p (x i ) ..d ai , in which ..d ai is the area of the surface Si and X i E Si .
13.2 The Volume Element. Integrals of First and Second Kind
227
Summing these approximate equalities and passing to the limit as the partition is refined, we find that
m=
j s
p da .
(13. 12)
The symbol for integration over the surface S here obviously requires some clarification so that computational formulas can be derived from it . We note that the statement of the problem itself shows that the left-hand side of Eq. ( 13. 12) is independent of the orientation of S, so that the integral on the right-hand side must have the same property. At first glance this appears to contrast with the concept of an integral over a surface, which was discussed in detail in Sect . 13. 1 . The answer to the question that thus arises is concealed in the definition of the surface element da, to whose analysis we now turn. 13. 2 . 2 The Area of a Surface as the Integral of a Form
Comparing Definition 1 of Sect . 13. 1 for the integral of a form with the construction that led us to the definition of the area of a surface (Sect . 12.4) , we see that the area of a smooth k-dimensional surface S embedded in the Euclidean space JR n and given parametrically by 'P : D --+ S, is the integral of a form D, which we shall provisionally call the volume element on the surface S. It follows from relation (12. 10) of Sect . 12.4 that [l (more precisely
J
in the curvilinear coordinates 'P : D --+ S (that is, when transferred to the domain D) . Here % (t) = ( � , � ) , i,j = 1 , · :..._ · , k. To compute the area of S over a domain D in a second parametrization 0 : i5 --+ S, one must correspondingly integrate the form w
=
Z1
vdet ( g;j ) ( t)- dt A · · · A dt-k ,
( 13. 14)
- (t-) - I\ !!.f.. where g,J !!..<£._ ) . - 1 . . . ' k . at' ' aiJ ' z , J - ' We denote by 7/J the diffeomorphism
.
(13. 15) At the same time, it is obvious that (13. 16)
228
13 Line and Surface Integrals
Comparing the equalities (13. 13)-( 13. 16) , we see that 'ljJ*w = w if det '1/J' ( i) > 0 and '1/J* w = -w if det '1/J' ( i) < 0. If the forms w and w were obtained from the same form [l on S through the transfers r.p* and 0* , then we must always have the equality '1/J* ( r.p* fl) = :p• [l or, what is the same,
'1/J*w
=
w.
We thus conclude that the forms on the parametrized surface S that one must integrate in order to obtain the areas of the surface are different: they differ in sign if the parametrizations define different orientations on S; these forms are equal for parametrizations that belong to the same orientation class for the surface S. Thus the volume element [l on S must be determined not only by the surface S embedded in !R n , but also by the orienation of S. This might apprear paradoxical: in our intuition, the area of a surface should not depend on its orientation! But after all, we arrived at the definition of the area of a parametrized surface via an integral, the integral of a certain form. Hence, if the result of our computations is to be independent of the orientation of the surface, it follows that we must integrate different forms when the orientation is different. Let us now turn these considerations into precise definitions. 1 3 . 2 . 3 The Volume Element Definition 1 . If IR k is an oriented Euclidean space with inner product ( , ) , the on JR k corresponding to a particular orientation and the
volume element
inner product ( , ) is the skew-symmetric k-form that assumes the value 1 on an orthonormal frame of some orientation class.
The value of the k-form on the frame e 1 , . . . , e k obviously determines this form. We remark also that the form [l is determined not by an individual or thonormal frame, but only by its orientation class.
Proof. In fact, if e 1 , . . . , e k and e 1 , . . . , e k are two such frames in the same orientation class, then the transition matrix 0 from the second basis to the first is an orthogonal matrix with det 0 = 1 . Hence If an orthonormal basis e 1 , . . . , e k is fixed in JR k and n 1 , . . . , n k are the projections of JR k on the corresponding coordinate axes, obviously n 1 1\ · · · 1\ n k ( e1 , . . . , e k ) = 1 and
13.2 The Volume Element. Integrals of First and Second Kind
229
Thus,
��
�t
This is the oriented volume of the parallelepiped spanned by the ordered set of vectors e l ' ' e k . 0
0
0
Definition 2 . If the smooth k-dimensional oriented surface S is embedded in a Euclidean space IR.n , then each tangent plane T Bx to S has an orientation consistent with the orientation of S and an inner product induced by the the inner product in IR.n ; hence there is a volume element .f.?(x). The k-form n that arises on S in this way is the volume element on S induced by the embedding of S in IR.n . Definition 3. The area of an oriented smooth surface is the integral over the surface of the volume element corresponding to the orientation chosen for the surface.
This definition of area, stated in the language of forms and made precise, is of course in agreement with Definition 1 of Sect . 12.4, which we arrived at by consideration of a smooth k-dimensional surface S C IR.n defined in parametric form.
Proof. Indeed, the parametrization orients the surface and all its tan gent planes T Bx . If e 1 , . . . , e k is a frame of a fixed orientation class in TSx , it follows from Definitions 2 and 3 for the volume element n that .f.?(x)(e 1 , . . . , e k ) > 0. But then ( see Eq. ( 12.7) of Sect . 12.4 )
J
n(x) (e l , . . . , e k ) = det ((e i , e1 )) .
o
( 13 . 1 7)
We note that the form n(x) itself is defined on any set e 1 , . . . , e k of vectors TSx , but Eq. ( 13. 17) holds only on frames of a given orientation class in TSx . in
We further note that the volume element is defined only on an oriented surface, so that it makes no sense, for example, to talk about the volume element on a Mobius band in IR. 3 , although it does make sense to talk about the volume element of each orientable piece of this surface. Definition 4. Let S be a k-dimensional piecewise-smooth surface ( ori entable or not ) in IR.n , and 81 , . . . , Sm , . . . a finite or countable number of smooth parametrized pieces of it intersecting at most in surfaces of dimen sion not larger than k 1 and such that s = u si
The area ( or surfaces si 0
-
i
0
k-dimensional volume ) of S is the sum of the areas of the
13 Line and Surface Integrals
230
In this sense we can speak of the area of a Mobius band in JR. 3 or, what is the same, try to find its mass if it is a material surface with matter having unit density. The fact that Definition 4 is unambiguous (that the area obtained is independent of the partition 8 1 , . . . , Bm , . . . of the surface) can be verified by traditional reasoning. 1 3 . 2 . 4 Expression of the Volume Element in Cartesian Coordinates
Let S be a smooth hypersurface (of dimension n - 1) in an oriented Euclidean space JR. n endowed with a continuous field of unit normal vectors 11(x), x E S, which orients it. Let V be the n-dimensional volume in JR. n and [l the ( n - 1 ) dimensional volume element on S. If we take a frame e 1 , . . . , e n - 1 in the tangent space TBx from the orien tation class determined by the unit normal n(x) to TBx , we can obviously write the following equality:
(13. 1 8 ) Proof This fact follows from the fact that under the given hypotheses both sides are nonnegative and equal in magnitude because the volume of the parallelepiped spanned by 11, e 1 ' . . . ' e n - 1 is the area of the base il(x) (e 1 , . . . , e n _ 1 ) multiplied by the height 1 11 1 = 1 . D But,
=
n
'L) -1) i - 1 1/ (x) dx 1 i=1
1\
·
·
·
I\ dx i 1\
·
·
·
I\ dx n ( e 1 , . . . , e n - 1 )
·
Here the variables x 1 , . . . , x n are Cartesian coordinates in the orthonor mal basis e 1 , . . . , e n that defines the orientation, and the frown over the differential dx i indicates that it is to be omitted. Thus we obtain the following coordinate expression for the volume element on the oriented hypersurface S C JR. n : [l
=
n
�) -1) i - 1 1Ji (x) dx 1 . . I\ dxi i=1
1\
·
1\ . . . 1\
dx n (e u . . . ' e n - 1 ) .
(13. 1 9)
13.2 The Volume Element. Integrals of First and Second Kind
231
At this point it is worthwhile to remark that when the orientation of the susrface is reversed, the direction of the normal 'fl(x) reverses, that is, the form n is replaced by the new form - n. It follows from the same geometric considerations that for a fixed value of i E { 1 , . . . , n} (13.20) This last equality means that ( 1 3.21) For a two-dimensional surface S in JR n the volume element is most often denoted da or dS. These symbols should not be interpreted as the differen tials of some forms a and S; they are only symbols. If x, y, z are Cartesian coordinates on JR 3 , then in this notation relations (13. 19) and (13.21) can be written as follows: da = cos o: 1 dy 1\ dz + cos o: 2 dz 1\ dx + cos o: 3 dx 1\ dy , cos 0: 1 da = dy 1\ dz , (oriented areas of the projections cos 0: 2 da = dz 1\ dx , on the coordinate planes) . cos 0: 3 da = dx 1\ dy , Here (cos o: 1 , cos o: 2 , cos o: 3 ) ( x) are the direction cosines (coordinates) of the unit normal vector 'fl(x) to S at the point x E S. In these equalities (as also in (13. 19) and (13.21)) it would of course have been more correct to place the restriction sign I s on the right-hand side so as to avoid misunderstanding. But , in order not to make the formulas cumbersome, we confine ourselves to this remark. 13.2.5 Integrals of First and Second Kind
Integrals of type ( 13. 12) arise in a number of problems, a typical represen tative of which is the problem consisdered above of determining the mass of a surface whose density is known. These integrals are often called integrals over a surface or integrals of first kind. Definition 5. The
integral
integral of a function p over an oriented surface S is the
Js pD
(13.22)
of the differential form pD, where D is the volume element on S (correspond ing to the orientation of S chosen in the computation of the integral) .
232
13 Line and Surface Integrals
It is clear that the integral (13.22) so defined is independent of the orienta tion of S, since a reversal of the orientation is accompanied by a corresponding replacement of the volume element. We emphasize that it is not really a matter of integrating a function, but rather integrating a form pfl of special type over the surface S with the volume element defined on it.
Definition
6 . If S is a piecewise-smooth ( orientable or non-orientable) sur face and p is a function on S, then the integral (13.22) of p over the surface S is the sum L pfl of the integrals of p over the parametrized pieces
j
' s ,
81 , . . . , Sm , . . . of the partition of S described in Definition 4. The integral (13.22) is usually called a surface integral of first For example, the integral
kind.
(13.12), which expresses the mass of the surface
S in terms of the density p of the mass distribution over the surface, is such
an integral. To distinguish integrals of first kind, which are independent of the ori entation of the surface, we often refer to integrals of forms over an oriented surface as surface integrals of second kind. We remark that, since all skew-symmetric forms on a vector space whose degrees are equal to the dimension of the space are multiples of one an other, there is a connection w = pfl between any k-form w defined on a k-dimensional orientable surface S and the volume element [l on S. Here p is some function on S depending on w . Hence
That is, every integral of second kind can be written as a suitable integral of first kind.
Example 1.
the path
s
1
(13.2' ) of Sect . 13. 1 , which expresses the work on }Rn , can be written as the integral of first kind
The integral :
[a, b]
-+
s
j (F,e)ds , I
(13.23)
e
where is arc length on 1, d is the element of length (a 1-form) , and is a unit velocity vector containing all the information about the orientation of f. From the point of view of the physical meaning of the problem solved by the integral (13.23), it is just as informative as the integral (13.1) of Sect . 13. 1 .
13.2 The Volume Element. Integrals of First and Second Kind
n(x) j(v,n)dCJ .
233
Example 2. The flux (13.3) of Sect . 13.1 of the velocity field V across a surface S c lRn oriented by unit normals can be written as the surface integral of first kind (13.24)
'Y
n.
The information about the orientation of S here is contained in the direction of the field of normals The geometric and physical content of the integrand in (13.24) is just as transparent as the corresponding meaning of the integrand in the final computational formula (13.6) of Sect . 13.1. For the reader ' s information we note that quite frequently one encounters the notation ds := e ds and du := which introduce a vector element of length and a vector element of area. In this notation the integrals (13.23) and (13.24) have the form
ndCJ,
j (F,
ds)
j( , ) ,
and
V du
'Y
'Y
which are very convenient from the point of view of physical interpretation. For brevity the inner product (A, B ) of the vectors A and B is often written
A · B. Example 3. Faraday 's law 4
asserts that the electromotive force arising in a closed conductor r in a variable magnetic field B is proportional to the rate of variation of the flux of the magnetic field across a surface S bounded by r . Let E be the electric field intensity. A precise statement of Faraday ' s law can be given as the equality
f E · ds
['
=
-
:t J B · s
du .
The circle in the integration sign over r is an additional reminder that the integral is being taken over a closed curve. The work of the field over a closed curve is often called the circulation of the field along this curve. Thus by Faraday ' s law the circulation of the electric field intensity generated in a closed conductor by a variable magnetic field equals the rate of variation of the flux of the magnetic field across a surface S bounded by r , taken with a suirable sign.
4 M. Faraday (1791-1867) - outstanding British physicist, creator of the concept of an electromagnetic field.
234
13 Line and Surface Integrals
Example 4. Ampere 's law5
f B · ds
r
B
=
1
c: o c
2
--
/ j · du s
(where is the magnetic field intensity, j is the current density vector, and and c are dimensioning constants) asserts that the circulation of the intensity of a magnetic field generated by an electric current along a contour r is proportional to the strength of the current flowing across the surface S bounded by the contour.
c:0
We have studied integrals of first and second kind. The reader might have noticed that this terminological distinction is very artificial. In reality we know how to integrate, and we do integrate, only differential forms. No integral is ever taken of anything else (if the integral is to claim independence of the choice of the coordinate system used to compute it) . 1 3 . 2 . 6 Problems and Exercises 1.
Give a formal proof of Eqs. ( 13. 18) and (13.20) .
2.
Let "( be a smooth curve and ds the element of arc length on "Y · a) Show that
II
f(s) ds
'Y
l$I 'Y
l f(s) l ds
for any function f on "( for which both integrals are defined. b) Verify that if lf(s) l $ M on "( and l is the length of "(, then
II
l
f(x) ds $ Ml .
'Y
c) State and prove assertions analogous to a) and b) in the general case for an integral of first kind taken over a k-dimensional smooth surface. 3. a) Show that the coordinates ( x 6 , x 6 , x �) of the center of masses distributed with linear density p ( x ) along the curve "Y should be given by the relations x
�
I
p x
( ) ds =
Ii
x p x
( ) ds ,
i
=
1 , 2, 3 .
b) Write the equation of a helix in JR 3 and find the coordinates of the center of mass of a piece of this curve, assuming that the mass is distributed along the curve with constant density equal to 1 . 5
A.M. Ampere ( 1775-1836) French physicist and mathematician, one of the founders of modern electrodynamics. -
13.2 The Volume Element. Integrals of First and Second Kind
235
c) Exhibit formulas for the center of masses distributed over a surface S with surface density p and find the center of masses that are uniformly distributed over the surface of a hemisphere. d) Exhibit the formulas for the moment of inertia of a mass distributed with density p over the surface S. e) The tire on a wheel has mass 30 kg and the shape of a torus of outer diameter 1 m and inner diameter 0.5 m. When the wheel is being balanced, it is placed on a balancing lathe and rotated to a velocity corresponding to a speed of the order of 100 km/hr, then stopped by brake pads rubbing against a steel disk of diameter 40 em and width 2 em. Estimate the temperature to which the disk would be heated if all the kinetic energy of the the spinning tire went into heating the disk when the wheel was stopped. Assume that the heat capacity of steel is c = 420 Jf (kg-K) . a) Show that the gravitational force acting o n a point mass mo located at ( Xo, Yo, zo) due to a material curve "{ having linear density p is given by the formula
4.
F = Gmo
J � �3 'Y
where G is the gravitational constant and xo, y - yo, z - zo).
r
r
ds
,
is the vector with coordinates
(x -
b ) Write the corresponding formula i n the case when the mass i s distributed over a surface S. c) Find the gravitational field of a homogeneous material line. d) Find the gravitational field of a homogeneous material sphere. (Exhibit the field both outside the ball bounded by the sphere and inside the ball. ) e ) Find the gravitational field created i n space by a homogeneous material ball (consider both exterior and interior points of the ball) . f) Regarding the Earth as a liquid ball, find the pressure in it as a function of the distance from the center. (The radius of the Earth is 6400 km, and its average density is 6 g/cm3 .) 5. Let "{1 and "{2 be two closed conductors along which currents J1 and h re spectively are flowing. Let ds 1 and ds 2 be the vector elements of these conductors corresponding to the directions of current in them. Let the vector R 1 2 be directed from ds 1 to ds 2 , and R2 1 = -R 1 2 . According to the Biot-Savart law6 the force dF 1 2 with which the first element acts on the second is
where the brackets denote the vector product of the vectors and co is a dimensioning constant. a) Show that, on the level of an abstract differential form, it could happen that dF 1 2 # -dF 2 1 in the differential Biot-Savart formula, that is, "the reaction is not equal and opposite to the action."
6 Biot (1774-1862), Savart (1791-1841) - French physicists.
13 Line and Surface Integrals
236
b) Write the (integral) formulas for the total forces F 12 and F 21 for the inter action of the conductors /' 1 and /'2 and show that F 12 = -F 2 1 · 6. The co-area formula (the Kronrod-Federer formula}. Let M m and N n be smooth surfaces of dimensions m and n respectively, em bedded in a Euclidean space of high dimension ( M m and N n may also be abstract Riemannian manifolds, but that is not important at the moment) . Suppose that
m
2:
n.
Let f : M m --+ N n be a smooth mapping. When m > n , the mapping df(x) : TxM m --+ Tf (x ) N n has a nonempty kernel ker df(x) . Let us denote by Tf M m the orthogonal complement of ker df(x) , and by J(f, x) the Jacobian of the mapping df(x) : Tf M m --+ Tf ( x ) N n . If m = n, then J( J, x) is the usual Jacobian. T.l. M m Let d v k (p) denote the volume element on a k-dimensional surface at the point p. We shall assume that vo (E) = card E, where v k (E) is the k-volume of E. a) Using Fubini's theorem and the rank theorem (on the local canonical form of a smooth mapping) if necessary, prove the following formula of Kronrod and Federer: I J( J , x) dv m (x) = I Vm -n ! - 1 (y) dv n (y) .
i
Mm
(
Nn
)
b) Show that if A is a measurable subset of M m , then
I J(J, x) dvm(x) = I Vm -n (A n r 1 (y) ) dvn (y) . Nn
A
This is the general Kronrod-Federer formula. c) Prove the following strengthening of Sard's theorem (which in its simplest version asserts that the image of the set of critical points of a smooth mapping has measure zero) . (See Problem 8 of Sect. 1 1 .5.) Suppose as before that f : M m --+ N n is a smooth mapping and K is a compact set in M m on which rank df(x) < n for all x E K . Then J. Vm - n K n f - 1 (y) dv n ( Y ) = 0. Use this result to obtain in addition the simplest version of Sard 's theorem stated above. d) Verify that if f : D --+ lR and u : D --+ lR are smooth functions in a regular domain D C JR n and u has no critical points in D, then
(
)
e) Let VJ (t) be the measure (volume) of the set {x E D l f(x) function f be nonnegative and bounded in the domain D. Show that I f dv = - I t dVJ (t) = I VJ (t) dt. D
R
>
t}, and let the
00
0
f) Let r.p E c < 1 l (JR, JR + ) and r.p(O) = 0, while f E C ( 1) (D, JR) and Vi11 (t) is the measure of the set {x E D l f(x) l > t } . Verify that I r.p o f dv = J r.p' (t)VjJI (t) dt.
i
D
0
237
13.3 The Fundamental Integral Formulas of Analysis
13.3 The Fundamental Integral Formulas of Analysis The most important formula of analysis is the Newton-Leibniz formula ( fun damental theorem of calculus ) . In the present section we shall obtain the formulas of Green, Gauss-Ostrogradskii, and Stokes, which on the one hand are an extension of the Newton-Leibniz formula, and on the other hand, taken together, constitute the most-used part of the machinery of integral calculus. In the first three subsections of this section, without striving for generality in our statements, we shall obtain the three classical integral formulas of anal ysis using visualizable material. They will be reduced to one general Stokes formula in the fourth subsection, which can be read formally independently of the others. 13.3. 1 Green's Theorem
Green ' s7 theorem is the following.
Let JR2 be the plane with a fixed coordinate grid x, y, and let D be a compact domain in this plane bounded by piecewise-smooth curves. Let P and Q be smooth functions in the closed domain D. Then the following relation holds: ( - �: ) dx dy = P dx + Q Dy , ( 1 3.25)
Proposition 1.
JJ ��
J
D
aD
in which the right-hand side contains the integral over the boundary aD of the domain D oriented consistently with the orientation of the domain D itself We shall first consider the simplest version of ( 13.25) in which D is the square I = { (x, y) E JR 2 1 0 :::; x :::; 1 , 0 :::; y :::; 1 } and Q 0 in /. Then Green ' s =
theorem reduces to the equality
JJ �: I
dx d y =
-
JP
dx ,
(13.26)
81
which we shall prove.
Proof Reducing the double integral to an iterated integral and applying the fundamental theorem of calculus, we obtain 7 G. Green (1793-1841) - British mathematician and mathematical physicist. Newton's grave in Westminster Abbey is framed by five smaller gravestones with brilliant names: Faraday, Thomson ( Lord Kelvin ) , Green, Maxwell, and Dirac.
238
13 Line and Surface Integrals 1 1 dx dy = dx dy = 0 0
jj �: I5
j 1
j j �:
j
j
1
1
(P(x, 1 ) - P(x, 2)) dx = - P(x, 0) dx + P(x, 1 ) dx . 0 0 0 The proof is now finished. What remains is a matter of definitions and interpretation of the relation just obtained. The point is that the difference of the last two integrals is precisely the right-hand side of relation (13.26) . =
/'3 1 1----� /'4
/'2
0
/'1
1
Fig. 1 3 . 5 .
Indeed, the piecewise-smooth curve 81 breaks into four pieces (Fig. 13.5) , which can be regarded as parametrized curves 1'1 : [0, 1] -+ where x r2.4 (x, 0), : [0, 1 ] -+ where y A ( 1 , y) , 1'3 : [0, 1] -+ where x A (x, 1 ) , 1'4 : [0, 1] -+ where y � (0, y) , By definition of the integral of the 1-form w = P dx over a curve 1 'Y ; (P(x, y) dx) : = P(x, O) dx , P(x, y) dx : = 0 ')'1 [0 , 1 ] 1 P(x, y) dx : = ')'HP(x, y) dx) : = O dy = 0 , 0 0 ')'2 [ , 1] 1 P(x, y) dx : = 1'3 (P(x, y) dx) : = P(x, 1 ) dx , 0 0 1 ')'3 [,] 1 P(x, y) dx : = 1H P(x , y) dx) : = O dy = 0 , 0 ')'4 [0 , 1]
')'2
IR2 , IR2 , IR2 , IR2 ,
j
j
j
j
j
j
j
j
j
j
j
j
13.3 The Fundamental Integral Formulas of Analysis
239
and, in addition, by the choice of the orientation of the boundary of the domain, taking account of the orientations of 1'2 , 1'3 , 1'4 , it is obvious that
/'l ,
jw = jw + jw + J w + J w = jw + jw - jw - jw ,
BI
/1
/1
/4
/2
where -l'i is the curve l'i taken with the orientation opposite to the one defined by l'i . Thus Eq. (13.26) is now verified. D It can be verified similarly that
JJ ��
dx dy
I
Adding
=J
(13.27)
Q dy ,
BI
(13.26) and (13.27), we obtain Green ' s formula
JJ (�� - �:)
dx dy
I
=J + P dx
Q dy
(13.25')
BI
for the square I . We remark that the asymmetry of P and Q in Green ' s formula (13.25) and in Eqs. (13.26) and (13.27) comes from the asymmetry of x and y: after all, x and y are ordered, and it is that ordering that gives the orientation in IR2 and in I. In the language of forms, the relation (13.25') just proved can be rewritten as
(13.25 11 )
w
I
BI is an arbitrary smooth form on I. The integrand on the right-hand
where side here is the restriction of the form to the boundary oi of the the square I. The proof of relation (13.26) just given admits an obvious generalization: If Dy is not a square, but a "curvilinear quadrilateral" whose lateral sides are vertical closed intervals (possibly degenerating to a point) and whose other two sides are the graphs of piecewise-smooth functions cp 1 ( x) :S cp 2 ( x) over the closed interval [a, b] of the x-axis, then
w
jj �: = - j dx dy
Dy
P dx .
(13.26')
BDy
Similarly, if there is such a "quadrilateral" Dx with respect to the y-axis, that is, having two horizontal sides, then for it we have the equality
jj ��
dx dy
=j
Q dy .
(13.27')
240
13 Line and Surface Integrals
Now let us assume that the domain D can be cut into a finite number of domains of type D y (Fig. 13.6). Then a formula of the form (13.26 1 ) also holds for that region D . y
Fig. 1 3 . 6 .
Proof. In fact, by additivity, the double integral over the domain D is the sum of the integrals over the pieces of type D y into which D is divided. Formula ( 13.26') holds for each such piece, that is, the double integral over that piece equals the integral of P dx over the oriented boundary of the piece. But adjacent pieces induce opposite orientations on their common boundary, so that when the integrals over the boundaries are added, all that remains after cancellation is the integral over the boundary 8D of the domain D itself. 0
Similarly, if D admits a partition into domains of type Dx , an equality of type (13.27') holds for it . We agree to call domains that can be cut both into pieces of type Dx and into pieces of type D y elementary domains. In fact, this class is sufficiently rich for all practical applications. By writing both relations (13.26 1 ) and (13.27') for a simple domain, we obtain (13.25) by adding them. Thus, Green ' s theorem is proved for simple domains. We shall not undertake any further sharpenings of Green ' s formula at this point (on this account see Problem 2 below) , but rather demonstrate a second, very fruitful line of reasoning that one may pursue after establishing Eqs. (13.25') and (13.25 11 ). Suppose the domain C has been obtained by a smooth mapping cp : ---+ C of the square If w is a smooth 1-form on C, then
I
I.
J
c
dw :=
J I
cp * dw =
J I
dcp*w �
J
EJI
cp*w =:
J
w.
(13.28)
EJC
The exclamation point here distinguishes the equality we have already proved (see ( 13.25")); the extreme terms in these equalities are definitions
13.3 The Fundamental Integral Formulas of Analysis
241
or direct consequences of them; the remaining equality, the second from the left, results from the fact that exterior differentiation is independent of the coordinate system. Hence Green ' s formula also holds for the domain C. Finally, if it is possible to cut any oriented domain D into a finite num ber of domains of the same type as C, the considerations already described involving the mutual cancellation of the integrals over the portions of the boundaries of the Ci inside D imply that
(13.29) that is, Green ' s formula also holds for D. It can b e shown that every domain with a piecewise-smooth boundary belongs to this last class of domains, but we shall not do so, since we shall describe below (Chap. 15) a useful technical device that makes it possible to avoid such geometric complications, replacing them by an analytic problem that is comparatively easy to solve. Let us consider some examples of the use of Green ' s formula.
Example 1.
- y, Q = x in (13.25). We then obtain -y dx + x dy = 2 dx dy = 2 a( D) ,
Let us set P =
J
J
D
8D
where a (D) is the area of D. Using Green ' s formula one can thus obtain the following expression for the area of a domain on the plane in terms of line integrals over the oriented boundary of the domain:
a(D) =
� J -y dx + x dy = - J y dx = J x dy . 8D
8D
It follows in particular from this that the work
8D
A = J P dV performed "Y
by a heat engine in changing the state of its working substance over a closed cycle 'Y equals the area of the domain bounded by the curve 'Y in the PV-plane of states (see Problem 5 of Sect . 13.1).
Example 2 . Let B = { (x, y) E �2 1 x 2 + y 2
1} be the closed disk in the plane. We shall show that any smooth mapping f : B ---+ B of the closed disk into itself has at least one fixed point (that is, a point p E B such that f(p) = p). Proof. Assume that the mapping f has no fixed points. Then for every point p E B the ray with initial point f(p) passing through the point p and the point cp(p) E aB where this ray intersects the circle bounding B are uniquely �
242
13 Line and Surface Integrals
determined. Thus a mapping r.p : B ---+ 8B would arise, and it is obvious that the restriction of this mapping to the boundary would be the identity mapping. Moreover, it would have the same smoothness as the mapping f itself. We shall show that no such mapping r.p can exist . In the domain JR 2 \ 0 ( the plane with the origin omitted ) let us consider the form w = - yx���� d y that we encountered in Sect . 13.1. It can be verified immediately that dw = 0. Since 8B C 1R 2 \0 , given the mapping r.p : B ---+ 8B, one could obtain a form r.p*w on B , and dr.p*w = r.p* ( dw ) = r.p*O = 0. Hence by Green ' s formula r.p*w = dr.p*w = 0 . &B B But the restriction of r.p to 8B is the identity mapping, and so
J
J
J
r.p * w
=
J
w.
&B &B This last integral, as was verified in Example 1 of Sect . 13.1, is nonzero. This contradiction completes the proof of the assertion. 0 This assertion is of course valid for a ball of any dimension ( see Example 5 below ) . It also holds not only for smooth mappings, but for all continuous mappings f : lB\ ---+ lll\ . In this general form it is called the Brouwer fixed-point
theorem. 8
1 3 . 3 . 2 The Gauss-Ostrogradskii Formula
Just as Green's formula connects the integral over the boundary of a plane domain with a corresponding integral over the domain itself, the Gauss Ostrogradskii formula given below connects the integral over the boundary of a three-dimensional domain with an integral over the domain itself. Proposition 2. Let JR 3 be three-dimensional space with a fixed coordinate system x, y, z and D a compact domain in JR3 bounded by piecewise-smooth surfaces. Let P, Q, and R be smooth functions in the closed domain D .
Then the following relation holds: ( aP + aQ + &R ) dx dy Dz =
JJJ
ax
ay
=
az
JJ
P dy I\ dz + Q dz I\ dx + R dx I\ dy .
(13.30)
&75
8
L.E.J. Brouwer (1881-1966) - Well-known Dutch mathematician. A number of fundamental theorems of topology are associated with his name, as well as an analysis of the foundations of mathematics that leads to the philosophico mathematical concepts called intuitionism.
13.3 The Fundamental Integral Formulas of Analysis
243
The Gauss-Ostrogradskii formula (13.30) can be derived by repeating the derivation of Green ' s formula step by step with obvious modifications. So as not to do a verbatim repetition, let us begin by considering not a cube in IR3 , but the domain Dx shown in Fig. 13. 7, which is bounded by a lateral cylindrical surface 8 with generator parallel to the z-axis and two caps 81 and 82 which are the graphs of piecewise-smooth functions r.p 1 and r.p 2 defined in the same domain G C IR; Y . We shall verify that the relation
R dx 1\ d y jjj �� dx dy dz = jj av.
(13.31)
holds for Dz . z
X
Fig.
13.7.
Proof.
dx d y dz = jj d x d y j �� dz = �� jjj Dz 'Pl ( x ,y ) = jj (R(x, y, r.p 2 (x, y)) - R(x, y, r.p 1 (x, y))) dx dy = = - jj (R(x, y, r.p 1 (x, y)) dx dy + jj (R(x, y, r.p 2 (x, y)) dx dy . G
G
G
The surfaces
G
81 and 82 have the following parametrizations: 81 : (x, y ) t----+ (x, y, r.p l (x, y)) ' 82 : (x, y) t----+ (x, y, r.p 2 (x, y )) .
244
13 Line and Surface Integrals
The curvilinear coordinates (x, y) define the same orientation on S2 that is induced by the orientation of the domain D z , and the opposite orientation on S1 . Hence if S1 and S2 are regarded as pieces of the boundary of D z oriented as indicated in Prosition 2, these last two integrals can be interpreted as integrals of the form R dx 1\ dy over S1 and S2. The cylindrical surface S has a parametric representation ( t , z) H ( x( t ) , y( t ) , z ) , so that the restriction of the form R dx 1\ dy to S equals zero, and so consequently, its integral over S is also zero. Thus relation (13.31) does indeed hold for the domain D z . D If the oriented domain D can be cut into a finite number of domains of the type Dz , then, since adjacent pieces induce opposite orientations on their common boundary, the integrals over these pieces will cancel out, leaving only the integral over the boundary 8 D . Consequently, formula (13.31) also holds for domains that admit this kind of partition into domains of type Dz . Similarly, one can introduce domains D y and Dx whose cylindrical sur faces have generators parallel to the y-axis or x-axis respectively and show that if a domain D can be divided into domains of type Dy or Dx , then the relations
JJJ �� dx dy dz = JJ Q dz 1\ dx ,
(13.32) (13.33)
Thus, if D is a simple domain, that is, a domain that admits each of the three types of partitions just described into domains of types Dx , D y , and D z , then, by adding (13.31), (13.32), and (13.33), we obtain (13.30) for D . For the reasons given in the derivation of Green ' s theorem, we shall not undertake the description of the conditions for a domain to be simple or any further sharpening of what has been proved (in this connection see Problem 8 below or Example 12 in Sect. 17.5). We note, however, that i n the language of forms, the Gauss-Ostrogradskii formula can be written in coordinate-free form as follows:
(13.30') where w is a smooth 2-form in D . 0� = { (x, y, z) E Since formula (13.30') holds for the cube I = 1 ::; 1, 0 ::; y � 1, 0 � z � 1}, as we have shown, its extension to more general classes of domains can of course be carried out using the standard computations (13.28) and (13.29).
13
JR3 I
13.3 The Fundamental Integral Formulas of Analysis
245
Example 3. The law of Archimedes. Let us compute the buoyant force of a homogeneous liquid on a body D immersed in it . We choose the Cartesian coordinates x, y, z in JR 3 so that the xy-plane is the surface of the liquid and the z-axis is directed out of the liquid. A force pgzn da is acting on an element da of the surface S of D located at depth z, where p is the density of the liquid, g is the acceleration of gravity, and n is a unit outward normal to the surface at the point of the surface corresponding to da. Hence the resultant force can be expressed by the integral F
=
JJs pgzn da .
If n = e x COS O! x + e y COS O! y + ez COS O!z , then n da = ex dy 1\ dz + e y dz 1\ dx + e z dx 1\ dy ( see Subsect . 13.2.4). Using the Gauss-Ostrogradskii formula (13.30), we thus find that
F = expg JJs zdy/\dz +eypg JJs zdz/\dx +ezpg JJs z dx /\ dy = p Iff = ex pg Iff ezpg JJJ = O dx dy dz +
O dx dy dz + e y g
D
D
dx dy dz
+
pgV ez ,
D
where V is the volume of the body D. Hence P = pgV is the weight of a volume of the liquid equal to the volume occupied by the body. We have arrived at Archimedes ' law: F = Pez .
Example 4. Using the Gauss-Ostrogradskii formula (13.30), one can give the following formulas for the volume V(D) of a body D bounded by a surface aD.
V(D) =
� JJ
x dy 1\ dz + y dz 1\ dx + z dx 1\ dy =
aD
=
JJ
x dy 1\ dz
aD
=
JJ aD
y dz 1\ dx =
JJ
z dx 1\ dy .
aD
13.3.3 Stokes' Formula in JR3
Let S be an oriented piecewise-smooth compact two dimensional surface with boundary 88 embedded in a domain G IR3 , in which a smooth 1-form w = P dx + Q dy + R dz is defined. Then the following relation holds: Proposition 3.
c
246
13 Line and Surface Integrals
Jas
JsJ ( ��
��) d y A d z + 8Q 8P 8P oR - - dx A dy , + ( - - - ) dz A dx + ( {)z ox ox oy )
P dx + Q dy + R dz =
-
(13.34) where the orientation of the boundary as is chosen consistently with the orientation of the surface S. In other notation, this means that dw = w . (13.34') -
j j s
ow
Proof. If C is a standard parametrized surface i.p : I -+ C in �3 , where I is a square in � 2 , relation (13.34) folios from Eqs. (13.28) taking account of what has been proved for the square and Green ' s formula. If the orientable surface S can be cut into elementary surfaces of this type, then relation (13.34) is also valid for it, as follows from Eqs. (13.29) with D replaced by S. D As in the preceding cases, we shall not prove at this point that, for exam ple, a piecewise-smooth surface admits such a partition. Let us show what this proof of formula (13.34) would look like in coordi nate notation. To avoid expressions that are really too cumbersome, we shall write out only the first, main part of its two expressions, and with some sim plifications even in that. To be specific, let us introduce the notation x 1 , x 2 , x 3 for the coordinates of a point x E � 3 and verify only that 8P 8P 2 dx A dx 1 + -3 dx 3 A dx 1 , P(x) dx 1 = 2 8x 8x
J
as
!!s
since the other two terms on the left-hand side of (13.34) can be studied s imilarly. For simplicity we shall assume that S can be obtained by a smooth
t
D
x
=
T
Fig.
13.8.
x(t)
{3
13.3 The Fundamental Integral Formulas of Analysis
247
mapping x = x ( t ) of a domain D in the plane IR 2 of the variables t 1 , t 2 and bounded by a smooth curve "( = 8D parametrized via a mapping t = t ( T ) by the points of the closed interval a :S; T :S; (3 ( Fig. 13.8 ) . Then the boundary T = OS of the surface S can be written as X = x (t ( T )) , where T ranges OVer the closed interval [a, (3]. Using the definition of the integral over a curve, Green ' s formula for a plane domain D, and the definition of the integral over a parametrized surface, we find successively
J
r
P ( x ) dx 1 :=
=
J {3
a
P ( x ( t ( T )))
!! ( I tt:D
8P 8x 2
( 8x 1 dt 1
f)t l d T
+
8x 1 dt 2 ) dT = ot 2 dT
8x 1
8t1
=:
!! ( s
8P 2 dx A dx 1 ox 2
+
8P 3 ) dx A dx 1 . ox 3
The colon here denotes equality by definition, and the exclamation point denotes a transition that uses the Green ' s formula already proved. The rest consists of identities. Using the basic idea of the proof of formula ( 13.34' ) , we have thus verified directly ( without invoking the relation cp* d = dcp* , but essentially proving it for the case under consideration ) that formula ( 13.34 ) does indeed hold for a simple parametrized surface. We have carried out the reasoning formally only for the term P dx, but it is clear that the same thing could also be done for the other two terms in the 1-form in the integrand on the left-hand side of ( 13.34 ) .
248
13 Line and Surface Integrals
13.3.4 The General Stokes Formula
Despite the differences in the external appearance of formulas (13.25), (13.30), and (13.34), their coordinate-free expressions (13.25"), (13.29), (13.30'), and (13.34') turn out to be identical. This gives grounds for sup posing that we have been dealing with particular manifestations of a general rule, which one can now easily guess.
Let S be an oriented piecewise smooth k-dimensional com pact surface with boundary aS in the domain G �n , in which a smooth ( k - 1) -form w is defined. Then the following relation holds:
Proposition 4.
c
(13.35) in which the orientation of the boundary aS is that induced by the orientation of S. Proof. Formula (13.35) can obviously be proved by the same general com putations (13.28) and (13.29) as Stokes ' formula (13.34') provided it holds k k k for a standard k-dimensional interval J = { x = ( x 1 , . . . , x ) E �k I 0 ::; xi ::; 1, i = 1, . . . , k }. Let us verify that (13.35) does indeed hold for J . k Since a (k - I ) -form on J has the form w = L i ai (x)dx 1 1\ · · · I\ dx i 1\ · · · 1\ dx k ( summation over i = 1, . . . , k, with the differential dxi omitted ) , it suffices to prove (13.35) for each individual term. Let w = a( x) dx 1 /\ · · · 1\ dx i /\ · · · I\ dx k . Then dw = (- l) i - l g:i ( x ) dx 1 1\ · · · I\ dxi 1\ · · · I\ dx k . We now
carry out the computation:
1
dx 1 · · · dx' · · · dx k
J J ox'oa (x) dx". = - ( - l ) i - 1 J ( a ( x 1 , . . . , x i - 1 , 1 , x i +l , . . . , x k ) •-
= ( - 1) . 1
�
Jk - l
.
.
0
Jk - l
- a ( x 1 , . . . , X i - 1 , 0 , Xi +l , . . . X k ) ) dX 1 · · · dXi · · · dX k = = ( - l) i - 1 a(t l , . . . , t i - l , l, t i , . . . , t k - 1 ) dt l . . . dt k - 1 +
j
Jk - l
+
( l) i -
J a (t l , . . . , ti - l 0 ti , . . . , tk - 1 ) t l . . . tk- 1 .
Jk - l
'
'
d
d
13.3 The Fundamental Integral Formulas of Analysis
249
Here J k - 1 is the same as J k in R k , only it is a (k - I)-dimensional interval in R k - 1 . In addition, we have relabeled the variables x 1 = t 1 , . . . , x i - 1 = ti - l ,
xi + 1 = t i , . . . , x k = t k - 1 . The mappings
l k - 1 3 t = ( t 1 , . . . , t k - 1 ) ( t l , . . . , t i - l , l, ti , . . . , t k - 1 ) E l k , [ k - 1 3 t = ( t l , . . . ' t k - 1 ) t---+ ( t l , . . . ' t i - l , 0, t i ' . . . ' t k - 1 ) E [ k are parametrizations of the upper and lower faces ri l and rio of the inter val J k respectively orthogonal to the x i axis. These coordinates define the same frame e 1 . . . . , e i - 1 , e i + 1 , . . . , e k orienting the faces and differing from the frame e 1 , . . . , e k of R k in the absence of e i . On ri 1 the vector ei is the exterior normal to J k , as the vector -e i is for the face riO . The frame ei , e 1 , . . . , ei - 1 , ei + 1 , . . . , e k becomes the frame e 1 , . . . , e k after i - 1 inter changes of adjacent vectors, that is, the agreement or disagreement of the orientations of these frames is determined by the sign of ( - l) i - l . Thus, this parametrization defines an orientation on ri l consistent with the orientation of J k if taken with the corrective coefficient ( - l) i - l (that is, not changing the orientation when i is odd, but changing it when i is even) . Analogous reasoning shows that for the face rio it is necessary to take a corrective coefficient ( - 1 ) i to the orientation defined by this parametrization of the face riO . t---+
Thus, the last two integrals (together with the coefficients in front of them) can be interpreted respectively as the integrals of the form w over the faces ri l and riO of [ k with the orientation induced by the orientation of [ k . We now remark that on each of the remaining faces of J k one of the co ordinates x 1 , . . . , x i - 1 , x i + 1 , . . . , x k is constant. Hence the differential corre sponding to it is equal to zero on such a face. Thus, the form dw is identically equal to zero and its integral equals zero over all faces except riO and ri l · Hence we can interpret the sum of the integrals over these two faces as the integral of the form w over the entire boundary aJ k of the interval [ k oriented in consistency with the orientation of the interval J k itself. The formula
(13.35), is now proved. 0 formula (13.35) is a corollary of the
and along with it formula
As one can see, Newton-Leibniz formula (fundamental theorem of calculus) , Fubini's theorem, and a series of definitions of such concepts as surface, boundary of a surface, orientation, differential form, differentiation of a differential form, and transference of forms. Formulas (13.25), (13.30), and (13.34), the formulas of Green, Gauss Ostrogradskii, and Stokes respectively, are special cases of the general formula
250
13 Line and Surface Integrals
( 13.35 ) . Moreover, if we interpret a function f defined on a closed interval [a, b] C 1ft as a 0-form w , and the integral of a 0-form over an oriented point as the value of the function at that point taken with the sign of the orien tation of the point , then the Newton-Leibniz formula itself can be regarded as an elementary ( but independent ) version of ( 13.35) . Consequently, the fundamental relation (13.35) holds in all dimensions k :::: 1 . Formula ( 1 3.35) is often called the general Stokes formula. As historical information, we quote here some lines from the preface of M. Spivak to his book ( cited in the bibliography below ) : The first statement of the Theorem9 appears as a postscript to a letter, dated July 2, 1850, from Sir William Thomson ( Lord Kelvin ) to Stokes. It appeared publicly as question 8 on the Smith ' s Prize Ex amination for 1854. This competitive examination, which was taken annually by the best mathematics students at Cambridge University, was set from 1849 to 1882 by Professor Stokes; by the time of his death the result was known universally as Stokes ' theorem. At least three proofs were given by his contemporaries: Thomson published one, another appeared in Thomson and Tait ' s Treatise on Natural Philosophy, and Maxwell provided another in Electricity and Mag netism. Since this time the name of Stokes has been applied to much more general results, which have figured so prominently in the devel opment of certain parts of mathematics that Stokes ' theorem may be considered a case study in the value of generalization. We note that the modern language of differential forms originates with
E lie Cartan , 10 but the form (13 .35) for the general Stokes' formula for surfaces
in lft n seems to have been first proposed by Poincare. For domains in n dimensional space lft n Ostrogradskii already knew the formula, and Leibniz wrote down the first differential forms. Thus it is not an accident that the general Stokes formula ( 13.35) is sometimes called the Newton-Leibniz-Green-Gauss-Ostrogradski-Stokes Poincare formula. One can conclude from what has been said that this is by no means its full name. Let us use this formula to generalize the result of Example 2.
Example 5 .
ball B
C
Let us show that every smooth mapping
lftm into itself has at least one fixed point .
f:B
---+
B of a closed
Proof. If the mapping f had no fixed points, then, as in Example 2, one could construct a smooth mapping cp : B ---+ aB that is the identity on the sphere a B. In the domain lftm \ 0, we consider the vector field lrlm , where r is the radius-vector of the point = ( , m ) E lftm \ 0, and the flux form
x xl , . . . x
9 The classical Stokes theorem ( 13.34) is meant. E lie Cartan ( 1869-1951 ) - outstanding French geometer.
10
13.3 The Fundamental Integral Formulas of Analysis
251
corresponding to this field (see formula (13. 19) of Sect. 13.2) . The flux of such a field across the boundary of the ball B = { x E IR I I x I = 1 } in the direction of the outward normal to the sphere 8B is obviously equal to the area of the sphere 8B, that is, J w -j. 0. But , as one can easily verify by DB direct computation, dw = 0 in !Rm \ 0, from which, by using the general Stokes formula, as in Example 2, we find that
J J w=
r.p*w =
J
dr.p*w =
DB DB This contradiction finishes the proof. D
J
r.p* dw
=
J
r.p* O = 0 .
13.3.5 Problems and Exercises 1. a) Does Green ' s formula (13.25) change if we pass from the coordinate system x, y to the system y, x? b) Does formula (13.25") change in this case? 2. a) Prove that formula (13.25) remains valid if the functions P and Q are con tinuous in a closed square I, their partial derivatives �� and !ff: are continuous at interior points of I, and the double integrals exist, even if as improper integrals (13.25') . b) Verify that if the boundary of a compact domain D consists of piecewise smooth curves, then under assumptions analogous to those in a) , formula (13.25) remains valid.
a) Verify the proof of (13.26') in detail. b) Show that if the boundary of a compact domain D C JR. 2 consists of a finite number of smooth curves having only a finite number of points of inflection, then D is a simple domain with respect to any pair of coordinate axes. c) Is it true that if the boundary of a plane domain consists of smooth curves, then one can choose the coordinate axes in JR. 2 such that it is a simple domain relative to them? 3.
a) Show that if the functions P and Q in Green's formula are such that �� �� = 1 , then the area a ( D) of the domain D can be found using the formula
4. a
( D) = I P dx + Q dy. aD
b) Explain the geometric meaning of the integral I y dx over some (possibly 'I
nonclosed) curve in the plane with Cartesian coordinates x, y. Starting from this, give a new interpretation of the formula a ( D) = - I y dx. aD
252
13 Line and Surface Integrals c) As a check on the preceding formula, use it to find the area of the domain
{
1
2 2 D = (x, y) E lR2 xa2 + yb2
:::;
}
1 .
5 . a) Let x = x(t) be a diffeomorphism of the domain Dt C lRz onto the do main D x C JR; . Using the results of Problem 4 and the fact that a line integral is independent of the admissible change in the parametrization of the path, prove that
I dx = I lx' (t) l dt ,
where dx = dx 1 dx 2 , dt = d e d e , lx' (t) l = det x' (t) . b) From a) derive the formula
I f(x) dx = I t (x(t) ) l det x' (t) l dt
Dx
Dt
for change of variable in a double integral.
( *) 2
6. Let f(x, y, t) be a smooth function satisfying the condition � ) + =I 0 in its domain of definition. Then for each fixed value of the parameter t the equation f(x, y, t) = 0 defines a curve It in the plane JR 2 . Then a family of curves {It } depending on the parameter t arises in the plane. A smooth curve r C JR 2 defined by parametric equations x = x(t) , y = y(t) , is the envelope of the family of curves {It } if the point x(to ) , y(to ) lies on the corresponding curve Ito and the curves r and Ito are tangent at that point , for every value of to in the common domain of definition of {It } and the functions x(t) , y(t) . a) Assuming that x, y are Cartesian coordinates in the plane, show that the functions x(t) , y(t) that define the envelope must satisfy the system of equations
(
{
2
f(x, y, t) = 0 ,
%f (x, y, t) = O , and from the geometric point of view the envelope itself is the boundary of the projection (shadow) of the surface f(x, y, t) = 0 of lR(x , y , t) on the plane lR(x , y ) . b) A family of lines x cos a + y sin a p( a ) = 0 is given in the plane with Cartesian coordinates x and y. The role of the parameter is played here by the polar angle a . Give the geometric meaning of the quantity p ( a ) , and find the envelope of this family if p( a ) = c + a cos a + b sin a, where a , b, and c are constants. c) Desribe the accessible zone of a shell that can be fired from an adjustable cannon making any angle a E [0, /2] to the horizon. d) Show that if the function p ( a ) of b) is 2rr-periodic, then the corresponding envelope r is a closed curve. -
1r
13.3 The Fundamental Integral Formulas of Analysis
253
e) Using Problem 4, show that the length L of the closed curve r obtained in d) can be found by the formula 2 7r L = p (a ) da .
I 0
(Assume that p E c < 2 l .) f) Show also that the area u of the region bounded by the closed curve r obtained in d) can be computed as 2 7r p(a) = u= (p2 - P2 ) ( a ) d a ' (a) .
�I 0
��
Consider the integral I cos � . n ) ds, in which "Y is a smooth curve in ne , r is the
7.
"'
radius-vector of the point (x, y) E '"f, r = lrl = y'x 2 + y 2 , n is the unit normal vector to "Y at ( x, y) varying continuously along "Y, and ds is arc length on the curve. This integral is called Gauss ' integml. a) Write Gauss' integral in the form of a flux I (V, n ) ds of the plane vector field V across the curve "Y . b) Show that in Cartesian coordinates x and y Gauss' integral has the form ± I -��'"+.�2d y familiar to us from Example 1 of Sect. 13. 1 , where the choice of sign "'
is determined by the choice of the field of normals n . c) Compute Gauss' integral for a closed curve "Y that encircles the origin once and for a curve "Y bounding a domain that does not contain the origin. d) Show that cos � , n ) ds = dcp, where cp is the polar angle of the radius-vector r , and give the geometric meaning of the value of Gauss' integral for a closed curve and for an arbitrary curve "Y C ll�? .
In deriving the Gauss-Ostrogradskii formula we assumed that D is a simple domain and the functions P, Q, R belong to C ( ll (D, IR) . Show by improving the reasoning that formula (13.30) holds if D is a compact domain with piecewise smooth boundary, P, Q, R E C(D, IR) , �= , � ' �� E C(D, IR) , and the triple integral converges, even if it is an improper integral. 8.
9. a) If the functions P, Q, and R in formula (13.30) are such that �= + � + �� = 1 , then the volume V(D) of the domain D can b e found by the formula
V(D) =
II P dy !\ dz + Q dz !\ dx + R dx !\ dy . av
b) Let f ( x, t) be a smooth function of the variables x E D x C IR� , t E Dt C IRt' and � = ( -§!r , . . . , lfn) =/= 0. Write the system of equations that must be satisfied by the ( n - 1 )-dimensional surface in IR� that is the envelope of the family of surfaces {St } defined by the conditions f(x, t) = 0, t E Dt (see Problem 6).
13 Line and Surface Integrals
254
c ) Choosing a point on the unit sphere as the parameter t, exhibit a fmaily of planes in JR 3 depending on the parameter t whose envelope is the ellipsoid 5+ a 2
2
� + � = 1.
d ) Show that if a closed surface S is the envelope of a family of planes cos cl: l ( t ) x + cos a ( t ) y + cos a 3 ( t ) z - p ( t ) = 0 , where a1 , a 2 , a 3 are the angles formed by the normal to the plane and the coordinate axes and the parameter t is a variable point of the unit sphere S 2 C JR3 , then the area rJ of the surface S can be found by the formula rJ = J p ( t ) drJ. s2
e ) Show that the volume of the body bounded by the surface S considered in d ) can be found by the formula V = t J p ( t ) drJ. s
z2
f ) Test the formula given in e ) by finding the volume of the ellipsoid � + � +
� :::; 1 .
g ) What does the n-dimensional analogue of the formulas in d ) and e ) look like? 1 0. a ) Using the Gauss-Ostrogradskii formula, verify that the flux of the field r j r 3 ( where r is the radius-vector of the point x E JR 3 and r = l r l ) across a smooth surface S enclosing the origin and homeomorphic to a sphere equals the flux of the same field across an arbitrarily small sphere l x l = E . b ) Show that the flux in a) is 47r. c ) Interpret Gauss' integral J cos(; ,n) ds in JR 3 as the flux of the field r / r 3 across s the surface S. d ) Compute Gauss' integral over the boundary of a compact domain D C JR3 , considering both the case when D contains the origin in its interior and the case when the origin lies outside D. e ) Comparing Problems 7 and lOa) -d ) , give an n-dimensional version of Gauss ' integral and the corresponding vector field. Give an n-dimensional statement of problems a) -d ) and verify it. 1 1 . a ) Show that a closed rigid surface S C JR 3 remains in equilibrium under the action of a uniformly distributed pressure on it. ( By the principles of statics the problem reduces to verifying the equalities JJ n drJ = 0, JJ [r, nj drJ = 0, where n s s is a unit normal vector, r is the radius-vector, and [r, n] is the vector product of r and n. ) b ) A solid body of volume V is completely immersed in a liquid having specific gravity 1 . Show that the complete static effect of the pressure of the liquid on the body reduces to a single force F of magnitude V directed vertically upward and attached to the center of mass C of the solid domain occupied by the body. 1 2 . Let r : I k ---+ D be a smooth ( not necessarily homeomorphic ) mapping of an interval I k C lR k into a domain D of lR n , in which a k-form w is defined. By analogy with the one-dimensional case, we shall call a mapping r a k- cell or k-pathand by definition set J w = J r*w. Study the proof of the general Stokes formula and Jk r verify that it holds not only for k-dimensional surfaces but also for k-cells.
13.3 The Fundamental Integral Formulas of Analysis
255
13. Using the generalized Stokes formula, prove by induction the formula for change of variable in a multiple integral (the idea of the proof is shown in Problem 5a) ) .
Integration b y parts in a multiple integral. Let D be a bounded domain in JR m with a regular (smooth or piecewise smooth) boundary 8D oriented by the outward unit normal n = (n 1 , . . . , n m ) . Let J, g be smooth functions in D. a) Show that 14.
1 o;f dv 1 fni da . =
D
8D
b) Prove the following formula for integration by parts:
I (8d)g dv I fgni da - I f(oi g) dv .
D
=
8D
D
1 4 Elements of Vector Analysis
and Field Theory
14. 1 The Differential Operations of Vector Analysis 14. 1 . 1 Scalar and Vector Fields
In field theory we consider functions x t-+ T(x) that assign to each point x of a given domain D a special object T(x) called a tensor. If such a function is defined in a domain D, we say that a tensor field is defined in D. We do not intend to give the definition of a tensor at this point: that concept will be studied in algebra and differential geometry. We shall say only that numerical functions D 3 x t-+ f (x ) E R and vector-valued functions Rn :::> D 3 x t-+ V ( x) E TR� � Rn are special cases of tensor fields and are called scalar fields and vector fields respectively in D (we have used this terminology earlier) . A differential p-form w in D is a function Rn :::> D 3 x t-+ w ( x ) E C ( (Rn)P , R) which can be called a field of forms of degree p in D. This also is a special case of a tensor field. At present we are primarily interested in scalar and vector fields in do mains of the oriented Euclidean space Rn . These fields play a major role in many applications of analysis in natural science. 14. 1 . 2 Vector Fields and Forms in R3
We recall that in the Euclidean vector space R3 with inner product ( , ) there is a correspondence between linear functionals A : R3 ---+ R and vectors A E R3 consisting of the following: Each such functional has the form A( e ) = (A, e) , where A is a completely definite vector in R3 . If the space is also oriented, each skew-symmetric bilinear functional B : R3 x R3 ---+ R can be uniquely written in the form B (e 1 , e 2 ) = (B, e 1 , e 2 ) , where B is a completely definite vector in R3 and (B, e 1 , e 2 ) , as always, is the scalar triple product of the vectors B , e 1 , and e 2 , or what is the same, the value of the volume element on these vectors. Thus, in the oriented Euclidean vector space R3 one can associate with each vector a linear or bilinear form, and defining the linear or bilinear form is equivalent to defining the corresponding vector in R3 .
258
14 Elements of Vector Analysis and Field Theory
If there is an inner product in IR3 , it also arises naturally in each tan gent space TIR� consisting of the vectors attached to the point JR3 , and the orientation of IR 3 orients each space TIR� . Hence defining a 1-form w 1 (x) or a 2-form w 2 (x) in TIR� under the con ditions just listed is equivalent to defining some vector A(x) E TIR� corre sponding to the form w 1 (x) or a vector B(x) E TIR� corresponding to the form w 2 (x) . Consequently, defining a 1-form w 1 or a 2-form w 2 in a domain D of the oriented Euclidean space IR3 is equivalent to defining the vector field A or B in D corresponding to the form. In explicit form, this correspondence amounts to the following:
wl (x)( e ) ( A(x), e/ , (14.1) w� (x) (e l , e 2 ) (B(x) , e l , e 2 ) , ( 14.2) where A(x) , B(x), e, e 1 , and e 2 belong to TDx . Here we see the work form w 1 wl of the vector field A and the flux form w 2 = w� of the vector field B, which are already familiar to us. =
To a scalar field follows:
f : D --+ IR, we can assign a 0-form and a 3-form in D as w� w}
f, = f dV , =
( 14.3) ( 14.4)
where dV is the volume element in the oriented Euclidean space IR3 . In view of the correspondences ( 14. 1 )-( 14.4) , definite operations on vector and scalar fields correspond to operations on forms. This observation, as we shall soon verify, is very useful technically.
To a linear combination of forms of the same degree there corresponds a linear combination of the vector and scalar fields corresponding to them. Proof. Proposition 1 is of course obvious. However, let us write out the full
Proposition 1 .
proof, as an example, for 1-forms:
a 1 (A 1 , · ) + a (A2 , · ) = (a 1 A 1 + a 2 A 2 , · ) w;,A, +u 2 A 2 • D It is clear from the proof that a 1 and a 2 can be regarded as functions a 1 wl , + a 2 wl 2
=
=
=
(not necessarily constant) in the domain D in which the forms and fields are defined. As an abbreviation, let us agree to use, along with the symbols ( , ) and [ , ] for the inner product and the vector product of vectors A and B in IR3 , the alternative notation A · B and A x B wherever convenient .
14.1 The Differential Operations of Vector Analysis
259
If A, B, A 1 , and B 1 are vector fields in the oriented Eu clidean space JR3 , then
Proposition 2 .
1 /\ wA1 = wA, 2 xA ' ( 14.5 ) wA, 2 2 ( 14.6 ) wl l\ w� = w_i.B . In other words, the vector product A 1 x A 2 of fields A 1 and A 2 that
generate 1-forms corresponds to the exterior product of the 1-forms they generate, since it generates the 2-form that results from the product . In the same sense the inner product of the vector fields A and B that generate a 1-form wl and a 2-form w� corresponds to the exterior product of these forms.
Proof. To prove these assertions, fix an orthornormal basis in JR 3 Cartesian coordinates x 1 , x 2 , x 3 corresponding to it . In Cartesian coordinates
and the
3 3 wl (x) ( e ) = A(x) . e = L A i (x) � i = L A i (x) dx i ( e ) ' i=1 i= l that is,
( 14. 7 )
and
that is,
( 14.8 )
Therefore in Cartesian coordinates, taking account of expressions ( 14.7 ) and ( 14.8 ) , we obtain
wl , 1\ wt = (At dx1 + Ai dx 2 + Ai dx 3 ) 1\ (A� dx 1 + A� dx 2 + A� dx 3 ) = = (Ai A� - Ai A�) dx 2 1\ dx 3 + (Ai A� - At A�) dx 3 1\ dx 1 + + (At A� - AiA�) dx 1 /\ dx 2 = w� , where B = A 1 x A 2 .
Coordinates were used in this proof only to make it easier to find the vector B of the corresponding 2-form. The equality ( 14.5 ) itself, of course, is independent of the coordinate system. Similarly, multiplying Eqs. ( 14.7 ) and ( 14.8 ) , we obtain
wl l\ w� = (A 1 B 1 + A 2 B 2 + A3 B 3 ) dx1 1\ dx 2 1\ dx 3 = w2 .
14 Elements of Vector Analysis and Field Theory
260
In Cartesian coordinates dx i A dx 2 A dx 3 is the volume element in JR.3 , and the sum of the pairwise products of the coordinates of the vectors A and B , which appears in parentheses just before the 3-form, is the inner product of these vectors at the corresponding points of the domain, from which it follows that p (x) = A ( x ) · B ( x ) . D 14. 1 .3 The Differential Operators grad, curl, div, and V Definition 1 . To the exterior differentiation of 0-forms (functions) , 1-forms,
and 2-forms in oriented Euclidean space JR.3 there correspond respectively the operations of finding the gradient (grad) of a scalar field and the curl and divergence ( div) of a vector field. These operations are defined by the relations
_. dw fO -. -· dwAi -. I _ .. d wB -
I
Wgrad f ' 2 wcuri A' wd3 i v B ·
( 14.9) (14.10) ( 14. 1 1 )
B y virtue of the correspondence between forms and scalar and vector fields in JR. 3 established by Eqs. ( 14. 1)-( 14.4) , relations ( 14.9)-( 14. 1 1 ) are unambiguous definitions of the operations grad, curl, and div, performed on scalar and vector fields respectively. These operations, the operators of field theory as they are called, correspond to the single operation of exterior differentiation of forms, but applied to forms of different degree. Let us give right away the explicit form of these operators in Cartesian coordinates x i , x 2 , x 3 in JR. 3 . As we have explained, in this case
w� = f , wl = A I dx i + A 2 dx 2 + A 3 dx 3 , w� = B I dx 2 A dx 3 + B 2 dx 3 A dx i + B 3 dx i A dx 2 , w� = p dx i A dx 2 A dx3 .
( 14.3') ( 14. 7') ( 14.8') ( 14.4')
Since
I f := dwf0 = df = wgrad
8] dx I + 8] dx 2 + 8] dx3 ' 8x 2 8x 3 8x i
it follows from ( 14.7') that in these coordinates ( 14.9') where
e i , e2 , e3 is a fixed orthonormal basis of JR.3 .
14.1 The Differential Operations of Vector Analysis
261
Since
as
As an aid to memory this last relation is often written in symbolic form
( 14. 10")
curi A = Next , since
w� iv B : = dw� =
d(B 1 dx 2 A. dx3 B 2 dx 3 A. dx 1 B 3 dx 1 A. dx 2 ) = I aB 2 aB 3 l 2 3 = oB ox l ox 2 ox 3 dx A. dx A. dx ,
+
(
+
+ +
)
it follows from ( 14.4') that in Cartesian coordinates div
l
f)B 2 aB 3 B = f)B l ox + ox 2 + ox 3 .
( 14. 1 1')
One can see from the formulas ( 14.9' ) , ( 14 . 10') , and ( 14. 1 1') just obtained that grad, curl, and div are linear differential operations (operators) . The grad operator is defined on differentiable scalar fields and assigns vector fields to the scalar fields. The curl operator is also vector-valued, but is defined on differentiable vector fields. The div operator is defined on differentiable vector fields and assigns scalar fields to them. We note that in other coordinates these operators will have expressions that are in general different from those obtained above in Cartesian coordi nates. We shall discuss this point in Subsect . 14. 1 . 5 below. We remark also that the vector field curl A is sometimes called the rotation of A and written rot A.
262
14 Elements of Vector Analysis and Field Theory
As an example of the use of these operators we write out the famous 1 system of equations of Maxwell, 2 which describe the state of the components of an electromagnetic field as functions of a point x = (x 1 , x 2 , x 3 ) in space and time t.
Example 1 .
uum.)
(The Maxwell equations for an electromagnetic field
1 . div E =
.!!._ .
Eo
2. div B
=
m
a vac
0.
(14. 12) 1 8E j . + -8t EoC2 2 C Here p(x, t) is the electric charge density (the quantity of charge per unit volume) , j (x, t) is the electrical current density vector (the rate at which charge is flowing across a unit area) , E(x, t) and B(x, t) are the electric and magnetic field intensities respectively, and Eo and c are dimensioning constants (and in fact c is the speed of light in a vacuum) . 3. curl E =
-
at
as
.
4. curl B
=
I n mathematical and especially i n physical literature, along with the op erators grad, curl, and div, wide use is made of the symbolic differential operator nabla proposed by Hamilton (the Hamilton operator) 3 ( 14. 13) where { e 1 , e 2 , e 3 } is an orthonormal basis of lR 3 and x 1 , x 2 , x 3 are the corre sponding Cartesian coordinates. By definition, applying the operator \7 to a scalar field f (that is, to a function) , gives the vector field
1 On this subject the famous American physicist and mathematician R. Feynman (1918-1988) writes, with his characteristic acerbity, "From a long view of the history of mankind - seen from, say, ten thousand years from now - there can be little doubt that the most significant event of the 19th century will be judged as Maxwell's discovery of the laws of electrodynamics. The American Civil War will pale into provincial insignificance in comparison with this important scien tific event of the same decade." Richard R. Feynman, Robert B. Leighton, and Matthew Sands, The Feynman Lectures on Physics: Mainly Electromagnetism and Matter, Reading, MA: Addison-Wesley, 1964. 2 J.C. Maxwell (1831-1879) - outstanding Scottish physicist; he created the math ematical theory of the electromagnetic field and is also famous for his researach in the kinetic theory of gases, optics and mechanics. 3 W.R. Hamilton (1805-1865) - famous Irish mathematician and specialist in mechanics; he stated the variational principle (Hamilton's principle) and con structed a phenomenological theory of optic phenomena; he was the creator of the theory of quaternions and the founder of vector analysis (in fact, the term "vector" is due to him) .
14. 1 The Differential Operations of Vector Analysis
263
of + e of + e of , 'Vf = e l ox l 2 ox 2 3 ox 3 which coincides with the field ( 14.9') , that is, the nabla operator is simply the grad operator written in a different notation. Using, however, the vector form in which V' is written, Hamilton pro posed a system of formal operations with it that imitates the corresponding algebraic operations with vectors. Before we illustrate these operations, we note that in dealing with V' one must adhere to the same principles and cautionary rules as in dealing with the usual differentiation operator D = d� . For example, r.pD f equals r.p¥x and not d� ( r.p f) or f � . Thus, the operator operates on whatever is placed to the right of it; left multiplication in this case plays the role of a coefficient , that is, r.pD is the new differential operator r.pd� ' not the function � · Moreover, D 2 = D · D, that is, D 2 f = D(D f ) = d� ( d� f) = � f. If we now, following Hamilton, deal with V' as if it were a vector field defined in Cartesian coordinates, then, comparing relations (14.13) , ( 14.9') , ( 14.10" ) , and ( 14. 1 1') , we obtain grad / = 'Vf curi A = V' x A , div B = V' · B .
( 14. 14) ( 14. 15) ( 14. 16)
In this way the operators grad, curl, and div, can be written in terms of the Hamilton operator and the vector operations in JR 3 .
Example 2. Only the curl and div operators occurred in writing out the Maxwell equations ( 14. 12) . Using the principles for dealing with V' = grad, we rewrite the Maxwell equations as follows, to compensate for the absence of grad in them: p
l . Y' · E = - .
eo
2. V' · B = 0 .
oB j 3. V' x E = - at . 4. V' X B = eoc2
oE + 21 """"!:>
c ut
(14.12') •
14. 1 .4 Some Differential Formulas of Vector Analysis
In the oriented Euclidean space JR 3 we have established the connection ( 14. 1 ) (14.4) between forms on the one hand and vector and scalar fields on the other. This connection enabled us to associate corresponding operators on fields with exterior differentiation ( see formulas ( 14.5) , ( 14.6) , and (14 .9) (14. 1 1 ) ) . This correspondence can b e used t o obtain a number of basic differential formulas of vector analysis.
264
14 Elements of Vector Analysis and Field Theory
For example, the following relations hold: curl ( !A) = fcurlA - A x gradf , (14.17) div ( !A) = A · grad f + fdiv A , ( 14. 18) div (A x B) = B · curl A - A · curlB . ( 14. 19) Proof. We shall verify this last equality: w� i v A x B = dwi x a = d(w}. /\ w�) = dw}. l\ w� - w}. l\ dw� = 2 A 1\ Wa1 - WA1 1\ Wcuri 2 B = WB·curi A - WA·curi B = WB·curi A - A·CUrl B . = Wcuri first equalities two relations arealsoverified similarly. Ofdirect course,difftheerentiation verificationin ofcoordinates. allThethese can be carried out by 0 2 w = 0 for any form w, we can also If we take account of the relation d assert that the following equalities hold: curlgradf = 0 , ( 14.20) divcurlA = 0 . (14.21) Proof. Indeed: W�url grad f = dw!rad f = d(dw � ) = d 2 w � = 0 , W� i v curi A = dw�uri A = d(dw}.) = d 2 w}. = 0 . 0 In while formulas(14.20) (14. 1 7)-( 14. 19) the operators grad, curl, and div are applied once, and of(14.21) involve the second-order operatorsBesides obtained byrulessuccessive execution two of the three original operations. the in ( 14.20) and (14.21 ) , one can also consider other combinations of thesegivenoperators: grad divA, curl curl A, div grad f . ( 14.22) operator div(Delta) grad is applied, as onethecanLaplace see, tooperator a scalar4field. This op eratorThe is denoted and is called or Laplacian. It follows fromLl( 14.9') and (14. 1 1') that in Cartesian coordinates 3
3
3
( 14.23) 4 P.S. Laplace (1749-1827) - famous French astronomer, mathematician, and physicist; he made fundamental contributions to the development of celestial mechanics, the mathematical theory of probability, and experimental and math ematical physics.
265
14.1 The Differential Operations of Vector Analysis
Since the operator Ll acts on numerical functions, it can be applied com ponentwise to the coordinates of vector fields A = e 1 A 1 + e2 A 2 + e3 A 3 , where e l l e2 , and e3 are an orthonormal basis in JR 3 . In that case Taking account of this last equality, we can write the following relation for the triple of second-order operators ( 14.22) : (14.24)
curl curl A = grad div A - LlA ,
(
)
whose proof we shall not take the time to present see Problem 2 below . The equality ( 14.24) can serve as the definition of LlA in any coordinate system, not necessarily orthogonal. Using the language of vector algebra and formulas (14. 14)-( 14. 16) , we can write all the second-order operators ( 14.20)-( 14.22) in terms of the Hamilton operator V': curl grad f = V' x V' f = 0 , div curi A = V' · (V' x A = 0 , grad div A = Y' (Y' · A , curl curl A = V' x (Y' x A , div grad = Y' · 'Vf .
)
/
)
)
From the point of view of vector algebra the vanishing of the first two of these operators seems completely natural. The last equality means that the following relation holds between the Hamilton operator V' and the Laplacian Ll: Ll =
'\7 2 .
*Vector Operations in Curvilinear Coordinates a. Just as, for example, the sphere x 2 + y 2 + z 2 = a 2 has a particularly simple equation R = a in spherical coordinates, vector fields x >-+ A( x ) in JR 3 or 14. 1 . 5
(
Rn ) often assume a simpler expression in a coordinate system that is not Cartesian. For that reason we now wish to find explicit formulas from which one can find grad, curl, and div in a rather extensive class of curvilinear coordinates. But first it is necessary to be precise as to what is meant by the coordinate expression for a field A in a curvilinear coordinate system. We begin with two introductory examples of a descriptive character.
Suppose we have a fixed Cartesian coordinate system x 1 , x 2 in the Euclidean plane JR 2 . When we say that a vector field A 1 , A 2 ( x is defined
Example 3.
(
) )
266
14 Elements of Vector Analysis and Field Theory
JR.2 , we mean that some vector A(x) E TJR.� is connected with each point x = (x 1 , x 2 ) E JR.2 , and in the basis of TJR.� consisting of the unit vectors e 1 ( x), e2 ( x) in the coordinate directions we have the expansion A ( x) = A 1 (x )e 1 (x) + A2 (x )e2 (x) ( see Fig. 14. 1 ) . In this case the basis { e 1 (x), e2 (x)} of TJR.� is essentially independent of x. in
A(x) X
e,
e 1 (x)
Fig.
A(x)
L
0
el
0 Fig.
14.2.
14. 1 .
Example 4. I n the case when polar coordinates (r, cp) are defined i n the same plane JR. 2 , at each point x E JR. 2 \ 0 one can also attach unit vectors e 1 (x) = er (x), e2 = e cp (x) ( Fig. 14.2) in the coordinate directions. They also form a basis in TJR.� with respect to which one can expand the vector A(x) of the field A attached to x: A(x) = A 1 (x)e 1 (x) + A 2 (x)e2 (x). It is then natural to regard the ordered pair of functions (A 1 , A 2 ) ( x) as the expression for the field A in polar coordinates. Thus, if (A 1 , A 2 ) (x) ( 1 , 0) , this is a field of unit vectors in JR. 2 pointing radially away from the center 0. The field (A 1 , A 2 ) ( x) ( 0, 1 ) can be obtained from the preceding field by rotating each vector in it counterclockwise by the angle /2. These are not constant fields in JR. 2 , although the components of their =
=
7f
coordinate representation are constant. The point is that the basis in which the expansion is taken varies synchronously with the vector of the field in a transition from one point to another. It is clear that the components of the coordinate representation of these fields in Cartesian coordinates would not be constant at all. On the other hand, a truly constant field ( consisting of a vector translated parallel to itself to all points of the plane ) which does have constant components in a Cartesian coordinate system, would have variable components in polar coordinates. b. After these introductory considerations, let us consider more formally the
problem of defining vector fields in curvilinear coordinate systems.
14. 1 The Differential Operations of Vector Analysis
267
We recall first of all that a curvilinear coordinate system t 1 , t 2 , t 3 in a domain D c JR 3 is a diffeomorphism
The quadratic form
( 14.25) whose coefficients are the pairwise inner products of the vectors in the canon ical basis determines the inner product on TJRr completely. If such a form is defined at each point of a domain Dt C lRt , then, as is known from geometry, one says that a Riemannian metric is defined in this domain. A Riemannian metric makes it possible to introduce a Euclidean structure in each tangent space TJRr ( t E Dt) within the context of rectilinear coordinates t 1 , t 2 , t 3 in
268
14 Elements of Vector Analysis and Field Theory
JRr, corresponding to the "curved" embedding
;
what follows, for the sake of simplicity, we shall consider only triorthogonal curvilinear coordinate systems. For them, as has been noted, the quadratic form (14.25) has the following special form: ds 2 = E1 ( t ) ( dt 1 ) 2 + E ( t ) ( dt 2 ) 2 + E (t)(dt 3 ) 2 , ( 14.26)
2
Ei ( t ) = 9ii (t), i = 1 , 2, 3. Example 5. In Cartesian coordinates
3
where
( y , z ) , cylindrical coordinates (r,
x,
x
Thus, each of these coordinate systems is a triorthogonal system in its domain of definition. The vectors e 1 (t), e 2 (t), e3 (t) of the canonical basis ( 1 , 0, 0) , (0, 1 , 0) , (0, 0, 1 ) in TJR�, like the vectors e i ( ) E TJR� corre sponding to them, have the following norm5 : l e i I = ..;gii . Hence the unit vectors (in the sense of the square-norm of a vector) in the coordinate directions have the following coordinate representation for the triorthogonal system (14.26) :
x
1 ( t ) = (vk , o , o) , e2 ( t ) = (o , vk , o) , e3 ( t ) = (o , o , �) . ( 14.27) Example 6. It follows from formulas (14.27) and the results of Example 5 e
that for Cartesian, cylindrical, and spherical coordinates, the triples of unit vectors along the coordinate directions have respectively the following forms: er =
( 1 , 0, 0) , ( 1 , 0, 0) ,
eR =
(1, 0, 0) ,
ex =
(0, 1 , 0) , = o, , o ,
ey = e '"
e '" =
( � ) (o , R c�s () ' o) '
ez = ez = eo =
(0, 0, 1 ) ; (0, 0, 1 ) ;
(o , o , �) ;
( 14.27') ( 14.27") ( 14.27"')
5 In the triorthogonal system ( 14.26) we have le ; l = #; = H; , i = 1 , 2, 3. The quantities H1 , H2 , H3 are usually called the Lame coefficients or Lame parame ters. G. Lame ( 1795-1870) - French engineer, mathematician, and physicist.
14. 1 The Differential Operations of Vector Analysis
269
Examples 3 and 4 considered above assumed that the vector of the field was expanded in a basis consisting of unit vectors along the coordinate direc tions. Hence the vector A(t) E TlRt corresponding to the vector A(x) E TJR� of the field should be expanded in the basis e 1 ( t), e 2 ( t), e3 ( t) consisting of unit vectors in the coordinate directions, rather than in the canonical basis
e l (t), e z (t), e3 (t).
Thus, abstracting from the original space JR 3 , one can assume that a Riemannian metric ( 14.25) or ( 14.26) and a vector field t H A(t) are defined in the domain D t C lRt and that the coordinate representation (A 1 , A 2 , A 3 ) ( t) of A ( t) at each point t E Dt is obtained from the expansion of the vector A(t) = Ai (t)ei (t) of the field corresponding to this point with respect to unit vectors along the coordinate axes. d. Let us now investigate forms. Under the diffeomorphism ip : D t ---+ D every form in D automatically transfers to the domain Dt . This transfer, as we know, occurs at each point x E D from the space TJR� into the corresponding space TJRl. Since we have transferred the Euclidean structure into TJRl from TJR�, it follows from the definition of the transfer of vectors and forms that , for example, to a given form wl(x) = (A(x), ·) defined in TJR� there corresponds exactly the same kind of form w l(t) = (A(t), ·) in TJRl, where A(x) = ip1 (t)A(t). The same can be said of forms of the type w� and w� , to say nothing of forms w � that is, functions. After these clarifications, the rest of our study can be confined to the domain D t C Rt, abstracting from the original space JR 3 and assuming that a Riemannian metric ( 14.25) is defined in D t and that scalar fields J, p and vector fields A, B are defined in D t along with the forms w � , w l , w� , w� , which are defined at each point t E D t in accordance with the Euclidean structure on TJRt defined by the Riemannian metric. -
Example
7.
The volume element dV in curvilinear coordinates we know, has the form
t 1 , t 2 , t3 , as
For a triorthogonal system ( 14.28) In particular, in Cartesian, cylindrical, and spherical coordinates, respec tively, we obtain dV = dx 1\ dy 1\ dz = = r dr 1\ dip 1\ dz = = R 2 cos 8 dR 1\ dip 1\ d8 . What has just been said enables us to write the form w � different curvilinear coordinate systems.
( 14.28') ( 1 4.28") ( 14.28"' ) =
p
d V in
270
14 Elements of Vector Analysis and Field Theory
e. Our main problem ( now easily solvable ) is, knowing the expansion A (t) = A i (t)ei(t) for a vector A(t) E TJK[ with respect to the unit vectors ei(t) E TlK[, i = 1 , 2 , 3, of the triorthogonal coordinate system determined by the Riemannian metric ( 14.26 ) , to find the expansion of the forms wl_(t) and w�(t) in terms of the canonical 1-forms dt i and the canonical 2-forms dti 1\ dt1 respectively. Since all the reasoning applies at every given point t, we shall abbreviate the notation by suppressing the letter t that shows that the vectors and forms are attached to the tangent space at t. Thus, e 1 , e 2 , e 3 is a basis in TJK[ consisting of the unit vectors ( 14.27 ) along the coordinate directions, and A = A 1 e 1 +A 2 e2 +A 3 e3 is the expansion of A E TJK[ in that basis. We remark first of all that formula ( 14.27 ) implies that where
6} =
where 6"� =
f. Thus, if
wl : = (A, · ) = a 1 dt 1
{ {
0 ' if i oF j ' 1 ' if i = j ' 0, if (i, j ) oF
( 14.29 )
(k, l) ' ( 14.30 ) if (i, j ) = (k, l) .
1' + a2 dt2 + a3 dt3 , then on the one hand
and on the other hand, as one can see from ( 14.29 ) ,
Consequently,
ai = A\(E;, and we have found the expansion
( 14.31 ) for the form wl_ corresponding to the expansion A = A 1 e 1 + A 2 e2 + A 3 e3 of
the vector A.
Example 8 .
respectively
Since in Cartesian, spherical, and cylindrical coordinates we have A = = =
Ax ex + A y ey + Azez = Ar er + A'Pe'P + Azez AReR + A'Pe'P + A e e e ,
as follows from the results of Example 6,
14. 1 The Differential Operations of Vector Analysis
271
( 14.31') wi_ = Ax dx + Ay dy + A z dz = ( 14.31") = Ar dr + Acpr dcp + A z dz = ( 14.3 1"' ) = AR dR + AcpR cos cp dcp + Ao R dO . Now let B = B 1 e 1 + B 2 e 2 + B 3 e 3 and w� = b 1 dt 2 1\ dt 3 + b 2 dt 3 1\ dt 1 + b3 dt 1 1\ dt2 . Then, on the one hand, w�(e2 , e3 ) : = dV ( B, e2 , e3 ) = 3 B' dV(ei , e2 , e3 ) = B 1 (e 1 , e2 , e3 ) = B 1 , =� L.., i= 1
g.
.
·
where dV is the volume element in TIR� ( see ( 14.28) and ( 14.27) ) . O n the other hand, by ( 14.30) we obtain
w� (e2 , e3 ) = (b 1 dt2 1\ dt 3 + b2 dt3 1\ dt 1 + b3 dt 1 1\ dt 2 )( e 2 , e3 ) = b1 . = b 1 dt 2 1\ dt 3 (e 2 , e3 ) = E2 E3 � Comparing these results, we conclude that b 1 = B 1 v'E2 E3 . Similarly, we verify that b2 = B 2 v'E1 E3 and b3 = B 3 v'E1 E2 . Thus we have found the representation
Example 9. Using the notation introduced in Example 8 and formulas (14.26') , ( 14.26") and ( 14.26"') , we obtain in Cartesian, cylindrical, and spherical coordinates respectively
� = � � /\ � + � � /\ � + � � /\ � = = Er r dcp 1\ dz + Bcp dz 1\ dr + Bz r dr 1\ dcp = = BRR2 cos O dcp 1\ dO + BcpR dO 1\ dR + BoR cos O dR 1\ dcp .
( 14.32') ( 14.32") ( 14.32"' )
h. We add further that on the basis of ( 14.28) we can write
( 14.33) Example 1 0. In particular, for Cartesian, cylindrical, and spherical coordi nates respectively, formula ( 14.33) has the following forms:
272
14 Elements of Vector Analysis and Field Theory
w�
= p dx dy dz = = pr dr A dcp dz = = pR2 cos () dR A dcp A d() . !\
!\
(14.33') (14.3311 ) (14.33"')
!\
Now that we have obtained formulas (14.31) - (14.33), it is easy to find the coordinate representation of the operators grad, curl, and div in a triorthog onal curvilinear coordinate system using Definitions (14.9)-(14.11). Let grad f A 1 e 1 + A 2 e 2 + A 3 e 3 . Using the definitions, we write M 1 + M d 2 + 8f d 3 . 1 f dw a wgrad 82 83
=
: = t := df := Bt l dt t t t t (14.31), we conclude that 1 8f e + I"E' 1 1 8f e2 + � 8f e
From this, using formula grad f
�xample 1 1 .
= YI"E'�l 8t 1
1
Y �2 8t2
Y �3 8t3 3 .
(14.34)
In Cartesian, polar, and spherical coordinates respectively,
(14.34') f = ax8f ex + 8f8y ey + 8f8z ez = (14.3411 ) = 8!8r er + ;:1 88fcp e'P + 8f8z ez = 8f 1 8f 1 8f = 8R (14.34111 ) e R + R () cp e 'P + 2 e e . R 8() cos 8 Suppose given a field A (t) = (A 1 e 1 + A 2 e 2 + A 3 e3 ) (t) . Let us find the co ordinates B 1 , B 2 , B3 of the field curl A(t) = B(t) = (B 1 e 1 + B 2 e 2 + B 3 e 3 ) (t) . Based on the definition (14.10) and formula (14.31), we obtain W�url A : = dwl = d (A \/E\"dt 1 + A 2 .;E;"dt 2 + A 3 yiE;"dt 3 ) = ( 8A3 yE3 8A 2 � ) 2 d t dt 3 + 2 8t 3 8t ( 8A 1 v'FJ; - 8A3 yE3 ) dt3 A dt 1 + ( 8AB2 � 8A 1 v'FJ; ) 1 + d t A dt 2 . l 8t 1 8t 3 t 8t 2 grad
!\
On the basis of (14.32) we now conclude that
( 1 ( �� 1 (
8A 3 yE3 _ 8A 2 � at 3 Bt 2 J�2 �3 8A 1 y'FJ; 8A 3 yE3 8t 3 8t 1 J 3 1 8A 2 � _ 8A 1 v'FJ; 8t 1 8t 2 J�1 �2
1
_
) ) )
'
'
'
273
14. 1 The Differential Operations of Vector Analysis
that is,
�e 1 VE2e2 ,fEJe3 a a a curi A = JE1 E2 E3 at 1 l at 2 2 at3 �A VEJ2A ,f£3A 3 1
Example 1 2.
( 14.35)
In Cartesian, cylindrical, and spherical coordinates respectively
( aAz _ aAy ) ex + ( aAx aAz ) ey + ( aAy _ aAx ) ez = ( 14_35, ) ay az az ax ax ay � ( aAz aA ( aA arA"' ) aAz ) � ( arA'P _ _ r ) ez = (14 35,) r = _ r acp az er + az ar e'P + r ar acp 1_ ( aAe _ aA"' cos e ) e + _!_ ( aA n aRAe ) e"' + n R ae - R cos () acp ae aR 1 a ( A aRA"' _ n ) ee + _!_ ( 14_35,, )
curi A =
_
_
_
_
_
_
_
.
R aR cos () acp i. Now suppose given a field B ( t ) = (B 1 e 1 + B 2 e 2 + B 3 e 3 ) (t). Let us find an
expression for div B. Starting from the definition ( 14. 1 1 ) and formula ( 14.32) , we obtain
1 Wd i v B : = dw� = d (B VE2 E3 dt 2 A dt 3 + + B 2 )E3 E1 dt 3 A dt 1 + B 3 VEJf;dt 1 A dt 2 =
B 3 ) dt 1 _- ( aJEat2 El 3 B 1 + aJEat3 E2 1 B2 + a� at 3
A dt 2 A dt 3
.
On the basis of formula ( 14.33) we now conclude that
In Cartesian, cylindrical, and spherical coordinates respectively, we obtain
aBx + aBy + aBz = ( 14.36') az ay ax � ar Br + aB"' + aBz = ( 14_36,) = r ar acp az aR2 cos ()B n + aRB'P + aR cos ()Be . ( 14_36,,) 1 = 2 aR acp ae R cos e
div B =
(
(
)
)
j. Relations ( 14.34) and ( 14.36) can be used to obtain an expression for the Laplacian L1 = div grad in an arbitrary triorthogonal coordinate system:
274
14 Elements of Vector Analysis and Field Theory
Example 1 3. In particular, for Cartesian, cylindirical, and spherical coordi nates, we obtain respectively
14. 1 . 6 Problems and Exercises 1. The operators grad, curl, and div and the algebraic operations. Verify the following relations: for grad: a) 'il ( f + g ) = 'il f + 'il g, b) 'il ( f . g ) = f 'il g + g 'il J , c) 'il(A · B) = (B · 'il)A + (A · 'il)B + B x ('17 x A) + A x ('17 x B) . d) v O A2 ) = (A . v ) A + A x ('il x A) ;
for curl: e) 'il x { f A) = f 'il X A + 'il f X A, f) 'il X (A X B) = (B . 'il)A - (A . 'il)B + ('17 . B)A - ('17 . A)B; for div: g) 'il · ( f A) = 'il f · A + f 'il · A , h) 'il · (A x B) = B · ('il x A) - A · ('17 x B) and rewrite them in the symbols grad, curl, and div. (Hints. A · '17 = A 1 � + A 2 � + A 3 � ; B · 'il # 'il · B ; A x (B x C ) = B(A · C ) - C (A · B) . )
a) Write the operators (14.20)�( 14.22) in Cartesian coordinates. b) Verify relations ( 14.20) and ( 14.21) by direct computation. c) Verify formula ( 14.24) in Cartesian coordinates. d) Write formula ( 14.24) in terms of 'il and prove it, using the formulas of vector algebra. 2.
14. 1 The Differential Operations of Vector Analysis 3.
275
From the system of Maxwell equations in Example 2 deduce that 'V · j = !!If -
.
4. a) Exhibit the Lame parameters H1 , H2 , H3 of Cartesian, cylindrical, and spher ical coordinates in JR 3 . b) Rewrite formulas ( 14.28) , ( 14.34)-( 14.37) , using the Lame parameters. 5.
Write the field A = grad � ' where r = Jx 2 + y 2 + z 2 in a) Cartesian coordinates x, y, z; b) cylindrical coordinates; c) spherical coordinates. d) Find curl A and div A.
6. In cylindrical coordinates (r, cp, z) the function f has the form ln � . Write the field A = grad f in a) Cartesian coordinates; b) cylindrical coordinates; c) spherical coordinates. d) Find curl A and div A. 7. Write the formula for transformation of coordinates in a fixed tangent space TJR�, p E JR 3 , when passing from Cartesian coordinates in JR 3 to a) cylindrical coordinates; b) spherical coordinates; c) an arbitrary triorthogonal curvilinear coordinate system. d) Applying the formulas obtained in c) and formulas ( 14.34)-(14.37) , verify directly that the vector fields grad /, curl A, and the quantities div A and 11/ are invariant relative to the choice of the coordinate system in which they are computed.
The space JR 3 , being a rigid body, revolves about a certain axis with constant angular velocity w. Let v be the field of linear velocities of the points at a fixed instant of time. a) Write the field v in the corresponding cylindrical coordinates. b) Find curl v . c) Indicate how the field curl v is directed relative to the axis of rotation. d) Verify that I curl v i = 2w at each point of space. e) Interpret the geometric meaning of curl v and the geometric meaning of the constancy of this vector at all points of space for the situation in d). 8.
276
14 Elements of Vector Analysis and Field Theory
1 4 . 2 The Integral Formulas of Field Theory 14. 2 . 1 The Classical Integral Formulas in Vector Notation
w }. and w � In the preceding chapter we noted ( see Sect. 13.2, formulas (13.23) and ( 13.24) ) that the restriction of the work form w� of a field F to an oriented smooth curve ( path ) 'Y or the restriction of the flux form w� of a field V to an oriented surface S can be written respectively in the following forms:
a. Vector Notation for the Forms
w
� l , = (F, e ) ds ,
w
� l s = ( V, n ) da ,
where e is the unit vector that orients "(, codirectional with the velocity vector of the motion along "(, ds is the element ( form ) of arc length on "(, n is the unit normal vector to S that orients the surface, and da is the element ( form ) of area on S. In vector analysis we often use the vector element of length of a curve d s : = e ds and the vector element of area on a surface do- := n da. Using this notation, we can now write:
i l, w� l s w
= (A, e ) ds = (A, d s ) = A · d s ,
= (B, n ) da = (B, do- ) = B · do- .
b. The Newton-Leibniz Formula Let
D be a path in the domain D.
f E C C l l (D, IR), and let
Applied to the 0-form w � , Stokes ' formula
( 14.38) ( 14.39) 'Y
:
[a, b]
--+
means, on the one hand, the equality
which agrees with the classical formula b
J('Y(b)) - J('Y(a)) = I df ('Y(t)) a
of Newton-Leibniz ( the fundamental theorem of calculus ) . On the other hand, by definition of the gradient, it means that
I W� = I W�rad f ·
a,
'
( 14.40)
14.2 The Integral Formulas of Field Theory as
277
Thus, using relation ( 14.38 ) , we can rewrite the Newton-Leibniz formula ( 14.40 ' )
J('r(b)) - J('r(a)) = / ( grad ! ) · ds .
,
In this form it means that
the increment of a function on a path equals the work done by the gradient of the function on the path. This is a very convenient and informative notation. In addition to the obvious deduction that the work of the field grad f along a path depends only on the endpoints of the path, the formula enables us to make a somewhat more subtle observation. To be specific, motion over a level surface f = c of f takes place without any work being done by the field grad f since in this case grad f · do- = 0. Then, as the left-hand side of the formula shows, the work of the field grad f depends not even on the initial and final points of the path but only on the level surfaces of f to which they belong. 1
c. Stokes' Formula We recall that the work of a field on a closed path is called the circulation of the field on that path. To indicate that the integral is taken over a closed path, we often write f F · ds rather than the traditional
notation
£ F · ds. If
1
,
is a curve in the plane, we often use the symbols
f ,
p, in which the direction of traversal of the curve is indicated. , The term circulation is also used when speaking of the integral over some 1
and
finite set of closed curves. For example, it might be the integral over the boundary of a compact surface with boundary. Let A be a smooth vector field in a domain D of the oriented Euclidean space JR 3 and S a ( piecewise ) smooth oriented compact surface with boundary in D. Applied to the 1-form wi_, taking account of the definition of the curl of a vector field, Stokes ' formula means the equality ( 14.41 )
Using relation ( 14.39 ) , we can rewrite ( 14.41 ) as the classical Stokes for mula
f A · ds = JJ ( curl A) · do- .
as
s
( 14.41' )
278
14 Elements of Vector Analysis and Field Theory
In this notation it means that
the circulation of a vector field on the boundary of a surface equals the flux of the curl of the field across the surface.
As always, the orientation chosen on aS is the one induced by the orien tation of S. d. The Gauss-Ostrogradskii Formula Let V be a compact domain of
the oriented Euclidean space IR 3 bounded by a (piecewise-) smooth surface {)V, the boundary of V. If B is a smooth field in V, then in accordance with the definition of the divergence of a field, Stokes ' formula yields the equality ( 14.42) Using relation ( 14.39) and the notation p dV for the form w� in terms of the volume element dV in IR 3 , we can rewrite Eq. ( 14.42) as the classical Gauss-Ostrogradskii formula
!! av
B · dt7
In this form it means that
= !!! v
div B dV .
( 14.42')
the flux of a vector field across the boundary of a domain equals the integral of the divergence of the field over the domain itself.
e. Summary of the Classical Integral Formulas In sum, we have arrived
at the following vector notation for the three classical integral formulas of analysis:
J f = J ( V'!) J = J ( V' = J (V' 'Y
8-y
J
av
B · dt7
·
as
A · ds ·
v
s
d s (the Newton-Leibniz formula) , x A) · dt7
(Stokes ' formula) ,
B) dV (the Gauss-Ostrogradskii formula) .
( 14.40") ( 14.41") ( 14.42")
1 4 . 2 . 2 The Physical Interpretation of div, curl, and grad a. The Divergence Formula ( 14.42' ) can be used to explain the physical
x
x
meaning of div B ( ) - the divergence of the vector field B at a point in the domain V in which the field is defined. Let V( ) be a neighborhood of (for example, a ball) contained in V. We permit ourselves to denote the
x
x
14.2 The Integral Formulas of Field Theory
279
volume of this neighborhood by the same symbol V(x) and its diameter by the letter d. By the mean-value theorem and the formula ( 14.42') we obtain the fol lowing relation for the triple integral
JJ
B · du = div B (x ' ) V(x) ,
8V(x)
where x' is a point in the neighborhood V(x) . If d -+ 0, then x' -+ x, and since B is a smooth field, we also have div B (x') -+ div B (x) . Hence
JJ B · du av(x) . . d IV B( X ) = 1 Im d---t O -:..,V'=-,(,.---,X) -
( 14.43)
Let us regard B as the velocity field for a flow (of liquid or gas) . Then, by the law of conservation of mass, a flux of this field across the boundary of the domain V or, what is the same, a volume of the medium diverging across the boundary of the domain, can arise only when there are sinks or sources (including those associated with a change in the density of the medium) . The flux is equal to the total power of all these factors, which we shall collectively call "sources," in the domain V(x) . Hence the fraction on the right-hand side of ( 14.43) is the mean intensity (per unit volume) of sources in the domain V(x) , and the limit of that quantity, that is, div B (x) , is the specific intensity (per unit volume) of the source at the point x. But the limit of the ratio of the total amount of some quantity in the domain V(x) to the volume of that domain as d -+ 0 is customarily called the density of that quantity at x, and the density as a function of a point is usually called the density of the distribution of the given quantity in a portion of space. Thus, we can interpret the divergence div B of a vector field B as the density of the distribution of sources in the domain of the flow, that is, in the domain of definition of the field B .
Example 1 . If, in particular, div B = 0, that is, there are no sources, then the flux across the boundary of the region must be zero: the amount flowing in equals the amount flowing out . And, as formula ( 14.42' ) shows, this is indeed the case. Example 2. A point electric charge of magnitude q creates an electric field in space. Suppose the charge is located at the origin. By Coulomb 's law 6 the intensity E = E(x) of the field at the point x E JR. 3 (that is, the force acting 6 Ch.O. Coulomb (1736-1806) - French physicist. He discovered experimentally the law ( Coulomb's law ) of interaction of charges and magnetic fields using a torsion balance that he invented himself.
280
14 Elements of Vector Analysis and Field Theory
on a unit test charge at the point
x) can be written as
E-
q
--
4rrco
�
lrl3 ' and r
where c o is a dimensioning constant is the radius-vector of the point x. The field E is defined at all points different from the origin. In spherical coordinates E = 4f!eo Jb-eR, so that by formula ( 14.36111) of the preceding section, one can see immediately that div E = 0 everywhere in the domain of definition of the field E. Hence, if we take any domain V not containing the origin, then by formula ( 14.42') the flux of E across the boundary 8V of V is zero. Let us now take the sphere SR = {x E JR3 I I x l = R} of radius R with center at the origin and find the outward flux (relative to the ball bounded by the sphere) of E across this surface. Since the vector eR is itself the unit outward normal to the sphere, we find
I
Sn
E · du =
Jq
Sn
q
1
-- da = · 4rr R2 4rrco R2 4rrco R2
--
=
q
-
co
.
Thus, up to the dimensioning constant c o , which depends on the choice of the system of physical units, we have found the amount of charge in the volume bounded by the sphere. We remark that under the hypotheses of Example 2 just studied the left hand side of formula ( 14.42' ) is well-defined on the sphere 8V = SR, but the integrand on the right-hand side is defined and equal to zero everywhere in the ball V except at one point - the origin. Nevertheless, the computations show that the integral on the right-hand side of ( 14.42') cannot be interpreted as the integral of a function that is identically zero. From the formal point of view one could dismiss the need to study this situation by saying that the field E is not defined at the point 0 E V, and hence we do not have the right to speak about the equality ( 1 4.42' ) , which was proved for smooth fields defined in the entire domain V of integration. However, the physical interpretation of ( 1 4.42' ) as the law of conservation of mass shows that, when suitably interpreted, it ought to be valid always. Let us study the indeterminacy of div E at the origin in Example 2 more attentively to see what is causing it . Formally the original field E is not defined at the origin, but , if we seek div E from formula ( 14.43) , then, as Example 2 shows, we would have to assume that div E(O) = +oo. Hence the integrand on the right-hand side of ( 14.42) would be a "function" equal to zero everywhere except at one point, where it is equal to infinity. This corresponds to the fact that there are no charges at all outside the origin, and we somehow managed to put the entire charge into a space of volume zero - into the
q
14.2 The Integral Formulas of Field Theory
281
single point 0, at which the charge density naturally became infinite. Here we are encountering the so-called Dirac 7 8-function (delta-function) . The densities of physical quantities are needed ultimately so that one can find the values of the quantities themselves by integrating the density. For that reason there is no need to define the 8-function at each individual point; it is more important to define its integral. If we assume that physically the "function" 8x 0 (x) = 8(xo ; x) must correspond to the density of a distribution, for example the distribution of mass in space, for which the entire mass, equal to 1 in magnitude, is concentrated at the single point x 0 , it is natural to set
j
v
8(xo , x) dV =
{1'
when Xo E V ,
0 , when Xo tf. V .
Thus, from the point of view of a mathematical idealization of our ideas of the possible distribution of a physical quantity (mass, charge, and the like) in space, we must assume that its distribution density is the sum of an ordinary finite function corresponding to a continuous distribution of the quantity in space and a certain set of singular "functions" (of the same type as the Dirac 8-function) corresponding to a concentration of the quantity at individual points of space. Hence, starting from these positions, the results of the computations in Example 2 can be expressed as the single equality div E = !o 8(0; x) . Then, as applied to the field E, the integral on the right-hand side of (14.42') is indeed equal either to qfco or to 0, according as the domain V contains the origin (and the point charge concentrated there) or not. In this sense one can assert (following Gauss) that the flux of electric field intensity across the surface of a body equals (up to a factor depending on the units chosen) the sum of the electric charges contained in the body. In this same sense one must interpret the electric charge density p in the Maxwell equations considered in Sect. 14.1 (formula (14.12)). b.
The Curl
an example.
We begin our study of the physical meaning of the curl with
Example 3. Suppose the entire space, regarded as a rigid body, is rotating with constant angular speed w about a fixed axis (let it be the x-axis) . Let us find the curl of the field v of linear velocities of the points of space. (The field is being studied at any fixed instant of time.) In cylindrical coordinates (r, cp , z ) we have the simple expression v(r, cp , z ) = wrecp . Then by formula (14.35" ) of Sect. 14.1, we find imme diatly that curl v = 2wez. That is, curl v is a vector directed along the axis 7 P.A.M. Dirac (1902-1984) - British theoretical physicist, one of the founders of quantum mechanics. More details on the Dirac <5-function will be given in Subsects. 17.4.4 and 17.5.4.
282
14 Elements of Vector Analysis and Field Theory
of rotation. Its magnitude 2w equals the angular velocity of the rotation, up to the coefficient 2 , and the direction of the vector, taking account of the orientation of the whole space JR 3 , completely determines the direction of rotation. The field described in Example 3 in the small resembles the velocity field of a funnel (sink) or the field of the vorticial motion of air in the neighborhood of a tornado (also a sink, but one that drains upward) . Thus, the curl of a vector field at a point characterizes the degree of vorticity of the field in a neighborhood of the point. We remark that the circulation of a field over a closed contour varies in direct proportion to the magnitude of the vectors in the field, and, as one can verify using the same Example 3, it can also be used to characterize the vorticity of the field. Only now, to describe completely the vorticity of the field in a neighborhood of a point, it is necessary to compute the circulation over contours lying in three different planes. Let us now carry out this program. We take a disk Si (x) with center at the point x and lying in a plane perpendicular to the ith coordinate axis, i = 1 , 2, 3. We orient Si(x) using a normal, which we take to be the unit vector e i along this coordinate axis. Let d be the diameter of Si (x). From formula ( 14.4 1 ) for a smooth field A we find that f A · ds 8S; (x) (curl A) · e i = lim ( 14.44) d--.0 Si ( X ) where Si(x) denotes the area of the disk under discussion. Thus the circula tion of the field A over the boundary 8Si per unit area in the plane orthogonal to the ith coordinate axis characterizes the ith component of curl A. To clarify still further the meaning of the curl of a vector field, we recall that every linear transformation of space is a composition of dilations in three mutually perpendicular directions, translation of the space as a rigid body, and rotation as a rigid body. Moreover, every rotation can be realized as a rotation about some axis. Every smooth deformation of the medium (flow of a liquid or gas, sliding of the ground, bending of a steel rod) is locally linear. Taking account of what has just been said and Example 3, we can conclude that if there is a vector field that describes the motion of a medium (the velocity field of the points in the medium) , then the curl of that field at each point gives the instantaneous axis of rotation of a neighborhood of the point , the magnitude of the instantaneous angular velocity, and the direction of rotation about the instantaneous axis. That is, the curl characterizes com pletely the rotational part of the motion of the medium. This will be made slightly more precise below, where it will be shown that the curl should be regarded as a sort of density for the distribution of local rotations of the medium.
14.2 The Integral Formulas of Field Theory
283
c. The Gradient We have already said quite a bit about the gradient of a
scalar field, that is, about the gradient of a function. Hence at this point we shall merely recall the main things. Since w�rad f (e) = ( grad f , e ) = d f ( e ) = Def, where Def is the derivative of the function f with respect to the vector e, it follows that grad f is orthog onal to the level surfaces of f, and at each point it points in the direction of most rapid increase in the values of the function. Its magnitude l grad !I gives the rate of that growth ( per unit o f length i n the space i n which the argument varies ) . The significance of the gradient as a density will be discussed below. 14.2.3 Other Integral Formulas a. Vector Versions of the Gauss-Ostrogradskii Formula The interpre tation of the curl and gradient as vector densities, analogous to the interpreta tion ( 14.43) of the divergence as a density, can be obtained from the following classical formulas of vector analysis, connected with the Gauss-Ostrogradskii formula.
V' · B dV =
vJ vJ vJ
V' x A dV = V' dV =
f
avJ avJ avJ
du · B
(the divergence theorem ) (the curl theorem )
du x A du f
( the gradient theorem )
( 14.45) ( 14.46) ( 14.47)
The first of these three relations coincides with ( 14.42') up to notation and is the Gauss-Ostrogradskii formula. The vector equalities ( 14.46) and ( 14.47) follow from ( 14.45) if we apply that formula to each component of the corresponding vector field. Retaining the notation V(x) and d used in Eq. (14.43) , we obtain from formulas ( 14.45)-( 14.47) in a unified manner, "v · B ( X ) =
8VJ(xV(x) )
du · B
. 1 liD -"--:,..,. ::'-: ,--.---, d-+0
' " v X A( X ) = l liD
d-+0
V'
lim f(x) = d-+O
8VJ(x)V(x) v_--J(V(x) �x-:: ):-_--:- f
( 14.43')
du x A
----,--...,.--
a
( 14.48)
du
( 14.49)
284
14 Elements of Vector Analysis and Field Theory
The right-hand sides of ( 14.45)-( 14.47) can be interpreted respectively as the scalar flux of the vector field B , the vector flux of the vector field A, and the vector flux of the scalar field f across the surface 8V bounding the domain V. Then the quantities div B, curl A, and grad f on the left hand sides of Eqs. ( 14.43' ) , ( 14.48) , and ( 14.49) can be interpreted as the corresponding source densities of these fields. We remark that the right-hand sides of Eqs. ( 14.43' ) , ( 14.48) , and ( 14.49) are independent of the coordinate system. From these we can once again derive the invariance of the gradient , curl, and divergence. b. Vector Versions of Stokes' Formula Just as formulas ( 14.45)-( 14.47) were the result of combining the Gauss-Ostrogradskii formula with the alge braic operations on vector and scalar fields, the following triple of formulas can be obtained by combining these same operations with the classical Stokes formula (which appears as the first of the three relations) . Let 8 b e a (piecewise-) smooth compact oriented surface with a consis tently oriented boundary 88, let du be the vector element of area on 8, and ds the vector element of length on 88. Then for smooth fields A, B , and J , the following relations hold:
J j s
s
du ·
(V x A)
(du x
j s
=
V) x B =
du x
Vf =
j j j dsf
as as
ds · A ,
( 14.50)
ds x B ,
( 14.51)
.
( 14.52)
as
Formulas ( 14.51) and ( 14.52) follow from Stokes ' formula ( 14.50) . We shall not take time to give the proofs. c. Green's Formulas If 8 is a surface and n a unit normal vector to 8, then the derivative Dnf of the function f with respect to n is usually denoted � in field theory. For example, (V J , du) = (V J , n) da = (grad /, n) da = Dnf da = � da. Thus, � da is the flux of grad f across the element of surface da. In this notation we can write the following formulas of Green, which are very widely used in analysis:
J Vf V JgV2 f J(gVf) J(gV2 f - fV2g) J(gVf - fVg) v
v
·
gdV +
dV =
v
dV =
av
av
· ·
( 14.53)
du
(
du =
J (g �� - ���)
av
)
da . ( 14.54)
14.2 The Integral Formulas of Field Theory
g f=g J l 'Vf l 2 dV + J fl1fdV = J f'Vf · du ( = J f :� ) V V 8V 8V j =j = j :� ) V l1fdV 8V \lf · ( 8V
In particular, if we set respectively,
in ( 14.53) and
=
285
1 in ( 14.54) , we find d a , (14.53' )
du
do- .
(14. 54' )
This last equality is often called Gauss ' theorem. Let us prove, for exam ple, the second of Eqs. ( 14.53) and ( 14.54) :
Proof.
j (g\lf - f\lg) · = j \l · (g\lf - f\lg) = V 8V = j (\lg 'Vf + g'V2 f - 'Vf · \lg - f\l2 g) = J (g\l2 f - f\l2g) = J (gl1f - fl1g) 'V · ('PA) = \lip . A + 1p\l . du
dV
dV =
·
v
dV
v
dV .
v
In this formula we have used the Gauss-Ostrogradskii formula and the relation A. 0 14.2.4 Problems and Exercises 1. Using the Gauss-Ostrogradskii formula (14.45) , prove relations (14.46) and (14.47) . 2.
Using Stokes' formula (14.50) , prove relations (14.51) and (14.52) .
a) Verify that formulas (14.45) , ( 14.46) , and (14.47) remaind valid for an un bounded domain V if the integrands in the surface integrals are of order 0 ( �) as r ---+ oo. ( Here r = l rl , and r is the radius-vector in IR. 3 .) b ) Determine whether formulas ( 14.50) , ( 14.51 ) , and ( 14.52) remain valid for a noncompact surface S C IR. 3 if the integrands in the line integrals are of order o (;\-) as r ---+ oo. c ) Give examples showing that for unbounded surfaces and domains Stokes' formula (14.41') and the Gauss-Ostrogradskii formula (14.42') are in general not true. 3.
286
14 Elements of Vector Analysis and Field Theory
4. a) Starting from the interpretation of the divergence as a source density, explain why the second of the Maxwell equations (formula ( 14.12) of Sect. 14. 1) implies that there are no point sources in the magnetic field (that is, there are no magnetic charges) . b ) Using the Gauss-Ostrogradskii formula and the Maxwell equations (formula (14.12) of Sect . 14. 1 ) , show that no rigid configuration of test charges (for example a single charge) can be in a stable equilibrium state in the domain of an electrostatic field that is free of the (other) charges that create the field. (It is assumed that no forces except those exerted by the field act on the system.) This fact is known as Earnshaw 's theorem. 5. If an electromagnetic field is steady, that is, independent of time, then the system of Maxwell equations (formula (14.12) of Sect. 14. 1) decomposes into two independent parts - the electrostatic equations \7 · E = � , \7 x E = 0, and the , \7 · B = 0. magnetostatic equations \7 x B = h eo c The equation \7 E = p/Eo , where p is the charge density, transforms via the Gauss-Ostrogradskii formula into J E · du = Q/so , where the left-hand side is the s flux of the electric field intensity across the closed surface S and the right-hand side is the sum Q of the charges in the domain bounded by S, divided by the dimensioning constant Eo . In electrostatics this relation is usually called Gauss ' law. Using Gauss ' law, find the electric field E a) created by a uniformly charged sphere, and verify that outside the sphere it is the same as the field of a point charge of the same magnitude located at the center of the sphere; b) of a uniformly charged line; c) of a uniformly charged plane; d) of a pair of parallel planes uniformly charged with charges of opposite sign; e) of a uniformly charged ball. ·
6. a) Prove Green's formula ( 14.53) . b) Let f be a harmonic function in the bounded domain V (that is, f satisfies Laplace's equation L1f = 0 in V). Show, starting from (14.54') that the flux of the gradient of this function across the boundary of the domain V is zero. c) Verify that a harmonic function in a bounded connected domain is determined up to an additive constant by the values of its normal derivative on the boundary of the domain. d) Starting from ( 14.53' ) , prove that if a harmonic function in a bounded domain vanishes on the boundary, it is identically zero throughout the domain. e) Show that if the values of two harmonic functions are the same on the bound ary of a bounded domain, then the functions are equal in the domain. f) Starting from ( 14.53) , verify the following principle of Dirichlet. Among all continuous differentiable functions in a domain assuming prescribed values on the boundary, a harmonic function in the region is the only one that minimizes the Dirichlet integral {that is, the integral of the squared-modulus of the gradient over the domain).
287
14.2 The Integral Formulas of Field Theory
7. a) Let r(p, q) = IP - q l be the distance between the points p and q in the Euclidean space JR 3 . By fixing p, we obtain a function rp (q) of q E JR 3 . Show that Llr; 1 (q) = 47r8(p; q), where 8 is the 8-function. b) Let g be harmonic in the domain V. Setting f = 1/rp in (14.54) and taking account of the preceding result, we obtain =
47rg(p )
I (gv_!_rp - _!_vg rp ) · du . s
Prove this equality precisely. c) Deduce from the preceding equality that if S is a sphere of radius R with center at p, then g (p) = 2 g do- .
�I
47r s This is the so-called mean-value theorem for harmonic functions. d) Starting from the preceding result, show that if B is the ball bounded by the sphere S considered in part c) and V(B) is its volume, then
g (p)
=
V(� )
I g dV .
B
e) If p and q are points of the Euclidean plane lR 2 , then along with the function ...!_ considered in a) above (corresponding to the potential of a charge located at Tp p) , we now take the function ln ...!_ (corresponding to the potential of a uniformly Tp charged line in space) . Show that Ll ln ...!_ = 21r8(p; q), where 8(p; q) is now the Tp 8-function in lR 2 . f) By repeating the reasoning in a) , b) , c), and d) , obtain the mean-value the orem for functions that are harmonic in plane regions. 8. Cauchy 's multi-dimensional mean-value theorem. The classical mean-value theorem for the integral ( "Lagrange's theorem" ) as serts that if the function f : D -+ lR is continuous on a compact, measurable, and connected set D c lR n (for example, in a domain) , then there exists a point e E D such that f(x) dx = f(e) · I D I ,
I
D
where I D I is the measure (volume) of D. a) Now let J , g E C(D, JR) , that is, f and g are continuous real-valued functions in D. Show that the following theorem ( "Cauchy's theorem" ) holds: There exists e E D such that g(e) f(x) dx = J(e) g (x) dx .
I
D
I
D
b) Let D be a compact domain with smooth boundary 8D and f and smooth vector fields in D. Show that there exists a point e E D such that div g(e) · Flux f = div f(e) · Flux g, aD aD where Flux is the flux of a vector field across the surface 8D. aD
g
two
288
14 Elements of Vector Analysis and Field Theory
14.3 Potential Fields 14. 3 . 1 The Potential of a Vector Field :
--+
A
be a vector field in the domain D C Rn . A function R is called a potential of the field if grad U in D.
Definition 1. Let
U D
A A=
Definition 2 . A field that has a potential is called a
potential field.
Since the partial derivatives of a function determine the function up to an additive constant in a connected domain, the potential is unique in such a domain up to an additive constant. We briefly mentioned potentials in the first part of this course. Now we shall discuss this important concept in somewhat more detail. In connection with these definitions we note that when different force fields are studied in physics, the potential of a field F is usually defined as a function U such that F -grad U. This potential differs from the one given in Definition 1 only in sign.
=
Example 1 . At a point of space having radius-vector r the intensity F of the gravitational field due to a point mass M located at the origin can be computed from Newton ' s law as F=
r - GMr3 ,
(14.55)
where r = J r J . This is the force with which the field acts on a unit mass at this point of space. The gravitational field (14.55) is a potential field. Its potential in the sense of Definition 1 is the function
U = GM �r .
(14.56)
Example 2. At a point of space having radius-vector r the intensity E of the electric field due to a point charge q located at the origin can be computed from Coulomb ' s law q E - -- �
- 47rco r3 ·
Thus such an electrostatic field, like the gravitational field, is a potential field. Its potential cp in the sense of physical terminology is defined by the relation cp
= 47rqco r1 . -- -
14.3 Potential Fields
289
14.3.2 Necessary Condition for Existence of a Potential
In the language of differential forms the equality A = grad U means that wl_ = dw � = dU, from which it follows that dwl_ = 0 ,
( 14.57)
since d2 w � = 0. This is a necessary condition for the field A to be a potential field. In Cartesian coordinates this condition can be expressed very simply. If A = (A\ . . . , A n ) and A = grad U, then A i = g� , i = 1 , . . . , n, and if the potential U is sufficiently smooth (for example, if its second-order partial derivatives are continuous) , we must have 8A i
8A3 8x3 - 8xi
'
i, j = 1 , . . . , n ,
( 14.57' )
which simply means that the mixed partial derivatives are equal in both orders: n In Cartesian coordinates wl_ = L: A i dxi , and therefore the equalities
i=l ( 14.57) and ( 14.57') are indeed equivalent in this case.
In the case of JR3 we have dwl_ = w�uri A ' so that the necessary condition
( 14.57) can be rewritten as
curl A = O ,
which corresponds to the relation curl grad U = 0, which we already know. The field A = (x, xy, xyz) in Cartesian coordinates in JR3 cannot be a potential field, since, for example, � =f. g� .
Example 3.
Example 4.
Consider the field A = (Ax , Ay ) given by A=
( - x2 y y2 ' x2 x y2 ) ' +
+
( 14.5 8 )
defined in Cartesian coordinates at all points of the plane exc � t the origin. The necessary condition for a field to be a potential field 8�* = fixY is fulfilled in this case. However, as we shall soon verify, this field is not a potential field in its domain of definition. Thus the necessary condition ( 14. 57) , or, in Cartesian coordinates ( 14.57' ) , is in general not sufficient for a field to be a potential field.
290
14 Elements of Vector Analysis and Field Theory
14.3.3 Criterion for a Field to be Potential C
A continuous vector field A in a domain D �n is a po tential field in D if and only if its circulation {work) around every closed curve "( contained in D is zero: (14.59)
Proposition 1.
Proof. N e c e s s i t y. Suppose A = grad U. Then by the Newton-Leibniz for mula (Formula (14.40') of Sect . 14.2),
f A · ds "'
=
U ('Y (b) ) - U ('Y (a) ) ,
where "( : [a, b] -+ D. If "f(a) = "f(b), that is, when the path "( is closed, it is obvious that the right-hand side of this last equality vanishes, and hence the left-hand side does also. S u ff i c i e n c y. Suppose condition (5) holds. Then the integral over any (not necessarily closed) path in D depends only on its initial and terminal points, not on the path joining them. Indeed, if "( 1 and "(2 are two paths having the same initial and terminal points, then, traversing first 'Y l , then - "(2 (that is, traversing "(2 in the opposite direction) , we obtain a closed path 'Y whose integral, by (14.59), equals zero, but is also the difference of the integrals over 'Y l and "(2 • Hence these last two integrals really are equal. We now fix some point x0 E D and set U(x)
=
X
j
A · ds ,
(14.60)
XQ
where the integral on the right is the integral over any path in D from x 0 to x. We shall verify that the function U so defined is the required potential for the field A. For convenience, we shall assume that a Cartesian coordinate system (x l , . . . , x n ) has been chosen in � n . Then A · ds = A 1 dx 1 + · · · + A n dx n . If we move away from x along a straight line in the direction h ei , where ei is the unit vector along the x i -axis, the function U receives an increment equal to xi +hi U(x + h ei ) - U(x) = A i (x 1 , . . . , x i - l , t, x i + 1 , . . . , x n ) dt ,
J
equal to the integral of the form A · ds over this path from x to x + hei . By the continuity of A and the mean-value theorem, this last equality can be written as
14.3 Potential Fields
where 0 we find
S () S
291
1. Dividing this last equality by h and letting h tend to zero,
that is, A = grad U. 0
. au 8 . (x) = A'(x) , x•
Remark 1 . As can be seen from the proof, a sufficient condition for a field to be a potential field is that ( 14.59) hold for smooth paths or, for example, for broken lines whose links are parallel to the coordinate axes.
We now return to Example 4. Earlier ( Example 1 of Sect . 8 . 1 ) we com puted that the circulation of the field ( 14.58) over the circle x 2 + y 2 = 1 traversed once in the counterclockwise direction was 27f (# 0). Thus, by Proposition 1 we can conclude that the field ( 14.58) is not a potential field in the domain � 2 \ 0. But surely, for example, grad arctan '!!... = X
( - X2 +y y 2 , X 2 +x y2 ) ,
and it would seem that the function arctan � is a potential for ( 14.58) . What is this, a contradiction?! There is no contradiction as yet , since the only correct conclusion that one can make in this situation is that the function arctan � is not defined in the entire domain � 2 \ 0. And that is indeed the case: Take for example, the points on the y-axis. But then, you may say, we could consider the function cp( x, y ) , the polar angular coordinate of the point ( x, y ) . That is practically the same thing as arctan � , but cp( x, y ) is also defined for x = 0, provided the point ( x, y ) is not at the origin. Throughout the domain � 2 \ 0 we have d cp =
- X 2 +y y2 dx + X 2 +X y2 dy .
However, there is still no contradiction, although the situation is now more delicate. Please note that in fact cp is not a continuous single-valued function of a point in the domain � 2 \ 0. As a point encircles the origin counterclock wise, its polar angle, varying continuously, will have increased by 27f when the point returns to its starting position. That is, we arrive at the original point with a new value of the function, different from the one we began with. Consequently, we must give up either the continuity or the single-valuedness of the function cp in the domain � 2 \ 0. In a small neighborhood ( not containing the origin ) of each point of the domain � 2 \ 0 one can distinguish a continuous single-valued branch of the function cp. All such branches differ from one another by an additive constant, a multiple of 27f. That is why they all have the same differential and can all serve locally as potentials of the field ( 14.58) . Nevertheless, the field ( 14.58) has no potential in the entire domain � 2 \ 0.
292
14 Elements of Vector Analysis and Field Theory
The situation studied in Example 4 turns out to be typical in the sense that the necessary condition (14.57) or ( 14.57' ) for the field A to be a poten tial field is locally also sufficient . The following proposition holds.
If the necessary condition for a field to be a potential field holds in a ball, then the field has a potential in that ball. Proof. For the sake of intuitiveness we first carry out the proof in the case of a disk D { (x, y) E JR 2 1 x 2 + y 2 < r} in the plane JR 2 . One can arrive at the point (x, y) of the disk from the origin along two different two-link broken lines II and 12 with links parallel to the coordinate axes (see Fig. 14.3) . Since D is a convex domain, the entire rectangle I bounded by these lines is contained in D. Proposition 2.
=
B y Stokes ' formula, taking account o f condition ( 14.57) , we obtain
J wi = J dwi =
81
0.
I
Fig. 14.3. By the remark to Proposition 1 we can conclude from this that the field A is a potential field in D. Moreover, by the proof of sufficiency in Proposition 1 , the function ( 1 4.60) can again be taken as the potential, the integral being interpreted as the integral over a broken line from the center to the point in question with links parallel to the axes. In this case the independence of the choice of path 12 for such an integral followed immediately from Stokes ' formula for a rectangle. In higher dimensions it follows from Stokes ' formula for a two-dimensional rectangle that replacing two adjacent links of the broken line by two links forming the sides of a rectangle parallel to the original does not change the value of the integral over the path. Since one can pass from one broken-line path to any other broken-line path leading to the same point by a sequence of such reconstructions, the potential is unambiguously defined in the general case. D
Il l
14.3 Potential Fields
293
14.3.4 Topological Structure of a Domain and Potentials Comparing Example 4 and Proposition 2, one can conclude that when the necessary condition (14.57) for a field to be a potential field holds, the ques tion whether it is always a potential field depends on the (topological) struc ture of the domain in which the field is defined. The following considerations (here and in Subsect . 14.3.5 below) give an elementary idea as to exactly how the characteristics of the domain bring this about. It turns out that if the domain D is such that every closed path in D can be contracted to a point of the domain without going outside the domain, then the necessary condition (14.57) for a field to be a potential field in D is also sufficient . We shall call such domains simply connected below. A ball is a simply connected domain (and that is why Proposition 2 holds) . But the punctured plane JR 2 \ 0 is not simply connected, since a path that encircles the origin cannot be contracted to a point without going outside the region. This is why not every field in JR 2 \ 0 satisfying (14.57'), as we saw in Example 4, is necessarily a potential field in JR2 \ 0. We now turn from the general description to precise formulations. We begin by stating clearly what we mean we speak of deforming or contracting a path.
homotopy (or deformation) in D from a closed path 'Yo : [0,2 1] -+ D to a closed path "(1 : [0, 1] -+ D is a continuous mapping r : / -+ D of the square /2 = { W , t 2 ) E JR2 I O ::; t i ::; 1, i = 1 , 2} into D such that T(t l , O) = 'Yo W ) , T(t\ 1) = "(1 (t 1 ), and T( O , t 2 ) = T(1, t 2 ) for all t 1 , t2 E [0, 1]. Thus a homotopy is a mapping r / 2 -+ D (Fig. 14.4). If the variable t 2 is regarded as time, according to Definition 3 at each instant of time t = t 2 we have a closed path TW , t) = 'Yt (Fig. 14.4 ) . 8 The change in this path with time is such that at the initial instant t = t 2 = 0 it coincides with 'Yo and at time t = t 2 = 1 it becomes "(1 . Since the condition 'Yt (O) = T( O , t) = T(1, t) = 'Yt ( l ) , which means that the path 'Yt is closed, holds at all times t E [0, 1] , the mapping r : / 2 -+ D induces the same mappings /3o W ) : = T(t l , O) = T(t\ 1) = : !3 1 W ) on the vertical sides of the square / 2 . The mapping T is a formalization of our intuitive picture of gradually deforming 'Yo to "( 1 . It is clear that time can be allowed to run backwards, and then we obtain the path 'Yo from "(1 . Definition 3. A
:
8
Orienting arrows are shown along certain curves in Fig. 14.4. These arrows will be used a little later; for the time being the reader should not pay any attention to them.
294
14 Elements of Vector Analysis and Field Theory
Definition 4. Two closed paths are homotopic in a domain if they can be obtained from each other by a homotopy in that domain, that is a homotopy can be constructed in that domain from one to the other.
Remark 2. Since the paths we have to deal with in analysis are as a rule paths of integration, we shall consider only smooth or piecewise-smooth paths and smooth or piecewise-smooth homotopies among them, without noting this explicitly. For domains in JR.n one can verify that the presence of a continuous ho motopy of (piecewise-) smooth paths guarantees the existence of (piecewise-) smooth homotopies of these paths.
wi
If the 1-form in the domain D is such that and the closed paths /o and /I are homotopic in D, then
Proposition 3.
/2
dwi = 0,
Let r : ---+ D be a homotopy from /o to 11 (see Fig. 14.4) . If Io and h are the bases of the square and J0 and J1 its vertical sides, then by definition of a homotopy of closed paths, the restrictions of r to 10 and h coincide with /o and 11 respectively, and the restrictions of r to J0 and J1 give some paths (30 and (31 in D. Since r( O , t 2 ) ( 1 , t 2 ), the paths (30 and (31 are the same. As a result of the change of variables ( t ) , the form transfers to the square as some 1-form In the process 0. Hence, by Stokes ' formula 0, since
Proof.
12
2 wi 1 F* dw = dF*wi = dwi = dwi = J w = J dw = O . i)[ 2
[2
Fig.
14.4.
=F x=F w = F*wi.
14.3 Potential Fields
295
But 8[ 2
Io
/1
h =
Jo
J l+J l J l J l J l J l w
�
�
w -
n
w -
�
w =
�
w -
n
w .
D
m
it
Definition 5. A domain is simply connected if every closed path homotopic to a point ( that is, a constant path ) .
IS
Thus simply connected domains are those in which every closed path can be contracted to a point .
A
If a field defined in a simply connected domain D satisfies the necessary condition (14.57) or (14.57') to be a potential field, then it is a potential field in D. Proof. By Proposition 1 and Remark 1 i t suffices t o verify that Eq. (14.59) holds for every smooth path in D. The path is by hypothesis homotopic Proposition 4.
'Y
'Y
to a constant path whose support consists of a single point. The integral over such a one-point path is obviously zero. But by Proposition 3 the integral does not change under a homotopy, and so Eq. (14.59) must hold for 'Y · D
Remark 3. Proposition 4 subsumes Proposition 2. However, since we had certain applications in mind, we considered it useful to give an independent constructive proof of Proposition 2.
Remark 4. Proposition 2 was proved without invoking the possibility of a smooth homotopy of smooth paths. 14.3.5 Vector Potential. Exact and Closed Forms if the relation B
=
AA
is a vector potential for a field B in a domain D curl holds in the domain.
Definition 6. A field
c
JR3
If we recall the connection between vector fields and forms in the oriented Euclidean space JR 3 and also the definition of the curl of a vector field, the relation B = curl can be rewritten as w� = dw l . It follows from this relation that w� i v B = dw� = d2 wi_ = 0. Thus we obtain the necessary condition div B = 0 , (14.61) which the field B must satisfy in D in order to have a vector potential, that is, in order to be the curl of a vector field in that domain. A field satisfying condition (14.61) is often, especially in physics, called a
A
A
solenoidal field.
14 Elements of Vector Analysis and Field Theory
296
Example 5.
In Sect . 14. 1 we wrote out the system of Maxwell equations. The second equation of this system is exactly Eq. ( 14.61 ) . Thus, the desire naturally arises to regard a magnetic field B as the curl of some vector field - the vector potential of B. When solving the Maxwell equations, one passes to exactly such a vector potential.
A
As can be seen from Definitions 1 and 6, the questions of the scalar and vector potential of vector fields ( the latter question being posed only in JR3 ) are special cases of the general question as to when a differential p-form wP is the differential dwP - 1 of some form wP - 1 .
exact in a domain D if there exists D such that wP = dwP - 1 . If the form wP is exact in D, then dwP = d 2 wP - 1 = 0. Thus the condition
Definition 7. A differential form wP is
a form wP - 1 in
dw
=
0
( 14.62 )
is a necessary condition for the form w to be exact. As we have already seen ( Example 4 ) , not every form satisfying this con dition is exact . For that reason we make the following definition. Definition 8 . The differential form w is condition ( 14.62 ) there.
The following theorem holds. Theorem. ( Poincare ' s lemma. ) If a form
there.
closed in a domain D if it satisfies is closed in a ball, then it is exact
Here we are talking about a ball in lR11 and a form of any order, so that Proposition 2 is an elementary special case of this theorem. The Poincare lemma can also be interpreted as follows: The necessary condition ( 14.62 ) for a form to be exact is also locally sufficient, that is, every point of a domain in which ( 14.62 ) holds has a neighborhood in which w is exact. In particular, if a vector field B satisfies condition ( 14.61 ) , it follows from the Poincare lemma that at least locally it is the curl of some vector field We shall not take the time at this point to prove this important theorem ( those who wish to do so can read it in Chap. 15 ) . We prefer to conclude by explaining in general outline the connection between the problem of the exactness of closed forms and the topology of their domains of definition ( based on information about 1-forms ) .
A.
Example 6.
JR2
Consider the plane with two points p 1 and p 2 removed ( Fig. 14.5 ) , and the paths /'o , /' 1 , and 1'2 whose supports are shown in the figure. The path 1'2 can be contracted to a point inside D, and therefore if a closed form w is given in D, its integral over 1'2 is zero. The path /'o cannot
14.3 Potential Fields
Fig.
297
14.5.
be contracted to a point, but without changing the value of the integral of the form, it can be homotopically converted into the path / 1 · The integral over 1 1 obviously reduces to the integral over one cycle en closing the point p 1 clockwise and the double of the integral over a cycle enclosing p2 counterclockwise. If T1 and T2 are the integrals of the form w over small circles enclosing the points p 1 and p2 and traversed, say, counter clockwise, one can see that the integral of the form w over any closed path in D will be equal to n 1 T1 + n2T2 , where n 1 and n2 are certain integers indi cating how many times we have encircled each of the holes P 1 and P2 in the plane JR. 2 and in which direction. Circles c 1 and c2 enclosing p 1 and p2 serve as a sort of basis in which every closed path r c D has the form r = n 1 c 1 + n2c2 , up to a homotopy, which has no effect on the integral. The quantities I w = Ti are called the cyclic c;
constants or the periods of the integral. If the domain is more complicated and there are k independent elementary cycles, then in agreement with the
n 1 c 1 + · · · + n k ck , it results that I w n 1 T1 + · · · + n k Tk · It turns out that for any set T1 , . . . , Tk of numbers in such a domain one can
expansion
r
=
'"Y
=
construct a closed 1-form that will have exactly that set of periods. ( This is a special case of de Rham ' s theorem - see Chap. 15. ) For the sake of visualization, we have resorted here to considering a plane domain, but everything that has been said can be repeated for any domain D c Rn .
In an anchor ring ( the solid domain in JR. 3 enclosed by a torus ) all closed paths are obviously homotopic to a circle that encircles the hole a certain number of times. This circle serves as the unique non-constant basic cycle c.
Example 7.
298
14 Elements of Vector Analysis and Field Theory
Moreover, everything that has just been said can be repeated for paths of higher dimension. If instead of one-dimensional closed paths - mappings of a circle or, what is the same, mappings of the one-dimensional sphere - we take mappings of a k-dimensional sphere, introduce the concept of homotopy for them, and examine how many mutually nonhomotopic mappings of the k dimensional sphere into a given domain D C exist, the result is a certain characteristic of the domain D which is formalized in topology as the so called kth homotopy group of D and denoted n k (D). If all the mappings of the k-dimensional sphere into D are homotopic to a constant mapping, the group n k (D) is considered trivial. (It consists of the identity element alone.) It can happen that n 1 (D) is trivial and n2 (D) is not .
JRn
Example 8. If D
JR;3
0
is taken to be the space with the point removed, obviously every closed path in D can be contracted to a point, but a sphere enclosing the point cannot be homotopically converted to a point.
0
It turns out that the homotopy group n k (D) has less to do with the periods of a closed k-form than the so-called homology group Hk (D). (See Chap. 15.)
Example 9 . From what has been said we can conclude that, for example, in is a simply connected the domain every closed 1-form is exact domain) , but not very closed 2-form is exact. In the language of vector fields, this means that every irrotational field in is the gradient of a function, but not every source-free field B ( div B = is the curl of some field in this domain.
JR3 \ 0
(JR3 \ 0 A 0JR3) \0
Example 1 0.
To balance Example 9 we take the anchor ring. For the anchor ring the group n 1 (D) is not trivial (see Example 7) , but n2 (D) is trivial, since every mapping f : S 2 --t D of the two-sphere into D can be contracted to a constant mapping (any image of a sphere can be contracted to a point). In this domain not every irrotational field is a potential field, but every source-free field is the curl of some field. 14.3.6 Problems and Exercises 1.
Show that every central field A = f(r)r is a potential field.
2 . Let F = -grad U be a potential force field. Show that the stable equilibrium positions of a particle in such a field are the minima of the potential U of that field.
For an electrostatic field E the Maxwell equations (formula (14.12) of Sect. 14. 1 ) , already noted, reduce t o the pair o f equations \7 E = £; and \7 x E = 0 . The condition \7 x E = 0 means, at least locally, that E = -grad <.p. The field of a point charge is a potential field, and since every electric field is the sum (or integral) of such fields, it is always a potential field. Substituting E = -\l<.p in the first equation of the electrostatic field, we find that its potential satisfies 3.
as
·
14.3 Potential Fields
299
Poisson's equation 9 .1ip = ..e.. < . The potential 'P determines the field completely, so that describing E reduces too finding the function ip, the solution of the Poisson equation. Knowing the potential of a point charge (Example 2) , solve the following prob lem. a) Two charges +q and -q are located at the points (0, 0, - d/ 2 ) and (0, 0, d/2) in JR3 with Cartesian coordinates (x, y, z) . Show that at distances that are large relative to d the potential of the electrostatic field has the form 1 z 1 - 3 qd + o 3 ' 'P = 4 11"Eo r · r where r is the absolute value of the radius-vector r of the point (x, y, z). b) Moving very far away from the charges is equivalent to moving the charges together, that is, decreasing the distance d. If we now fix the quantity qd =: p and decrease d, then in the limit we obtain the function 'P = 4 1 -!:Jp in the domain 11'" € 0 r JR3 \ 0. It is convenient to introduce the vector p equal to p in absolute value and directed from - q to +q. We call the pair of charges -q and +q and the construction obtained by the limiting procedure just described a dipole , and the vector p the dipole moment. The function 'P obtained in the limit is called the dipole potential. Find the asymptotics of the dipole potential as one moves away from the dipole along a ray forming angle 8 with the direction of the dipole moment. c) Let 'Po be the potential of a unit point charge and ip 1 the dipole potential having dipole moment P 1 · Show that 4' 1 = - (p 1 · 'V )ipo . d ) We can repeat the construction with the limiting passage that we carried out for a pair of charges in obtaining the dipole for the case of four charges (more precisely, for two dipoles with moments P 1 and p 2 ) and obtain a quadrupole and a corresponding potential. In general we can obtain a multipole of order j with potential 'PJ = ( - 1 F ( PJ · Y') ( PJ - 1 · Y') · · · (p 1 · 'V)ipo = L Qi k l 8};y'f'koiJ z l , where Qi k l i+k+ l=j are the so-called components of the multipole moment. Carry out the computations and verify the formula for the potential of a multipole in the case of a quadrupole. e) Show that the main term in the asymptotics of the potential of a cluster of charges with increasing distance from the cluster is 4 1 , where Q is the total 7TCQ 9. r charge of the cluster. f) Show that the main term of the asymptotics of the potential of an electrically neutral body consisting of charges of opposite signs (for example, a molecule) at a distance that is large compared to the dimensions of the body is 4 1 . Here 1TCQ � r e r is a unit vector directed from the body to the observer; p = L q; d; , where q; is the magnitude of the ith charge and d; is its radius-vector. The origin is chosen at some point of the body. g) The potential of any cluster of charges at a great distance from the cluster can be expanded (asymptotically) in functions of multipole potential type. Show this using the example of the first two terms of such a potential (see d), e) , and f) ) . 9 S.D. Poisson (1781-1849) - French scientist , specializing i n mechanics and physics; his main work was on theoretical and celestial mechanics, mathematical physics, and probability theory. The Poisson equation arose in his research into gravitational potential and attraction by spheroids.
( )
300 4.
14 Elements of Vector Analysis and Field Theory
Determine whether the following domains are simply connected. a) the disk { (x, y) E IR 2 1 x 2 + y 2 < 1 } ;
1
b ) the disk with its center removed { (x, y ) E IR2 0 < x 2 + y2
< 1}.; c) a ball with its center removed { ( x , y, z) E IR3 1 0 < x 2 + y 2 + z 2 < 1 } ; d) a n annulus { (x, y) E IR2 1 � < x 2 + y2 < 1 }; e) a spherical annulus { (x, y, z) E JR3 1 � < x 2 + y 2 + z 2 < 1 }; f) an anchor ring i n JR3 .
a) Give the definition of homotopy of paths with endpoints fixed. b) Prove that a domain is simply connected if and only if every two paths in it having common initial and terminal points are homotopic in the sense of the definition given in part a) . 5.
6. Show that a) every continuous mapping f : 8 1 -+ 8 2 of a circle 8 1 (a one-dimensional sphere) into a two-dimensional sphere 8 2 can be contracted in 8 2 to a point (a constant mapping) ; b) every continuous mapping f : 8 2 -+ 8 1 is also homotopic to a single point; c) every mapping f : 8 1 -+ 8 1 is homotopic to a mapping
7. In the domain IR3 \ 0 (three-dimensional space with the point 0 removed) con struct: a) a closed but not exact 2-form; b) a source-free vector field that is not the curl of any vector field m that domain.
8 . a) Can there be closed, but not exact forms of degree p < n - 1 in the domain D = IR n \ 0 (the space IRn with the point 0 removed)? b) Construct a closed but not exact form of degree p = n - 1 in D = IRn \ 0. 9 . If a 1-form w is closed in a domain D C IR n , then by Proposition 2 every point x E D has a neighborhood U(x ) inside which w is exact. From now on w is assumed to be a closed form. a) Show that if two paths "{i : [0, 1] -+ D, i = 1, 2, have the same initial and terminal points and differ only on an interval [a, ,8] C [0, 1] whose image under either of the mappings "fi is contained inside the same neighborhood U(x) , then
I w = I w.
"1 1
"12
14.3 Potential Fields
301
b) Show that for every path [0, 1] 3 t >-+ 1(t) E D one can find a number 6 > 0 such that if the path ::Y has the same initial and terminal point as 1 and differs from 1 at most by 6, that is omax :o; t :o;1 I ::Y ( t ) - 1(t) l � 6, then I w = I w. ;y
1
c) Show that if two paths 1 1 and 12 with the same initial and terminal points are homotopic in D as paths with fixed endpoints, then I w = I w for any closed /1 form w in D.
12
a) It will be proved below that every continuous mapping r : I 2 -+ D of the square I 2 can be uniformly approximated with arbitrary accuracy by a smooth mapping (in fact by a mapping with polynomial components) . Deduce from this that if the paths 1 1 and 12 in the domain D are homotopic, then for every c > 0 there exist smooth mutually homotopic paths ::Y1 and ::Y2 such that omax :o; t :o;1 I::Yi ( t) - li ( t) I � c, i = 1 , 2. b) Using the results of Example 9, show now that if the integrals of a closed form in D over smooth homotopic paths are equal, then they are equal for any paths that are homotopic in this domain (regardless of the smoothness of the homotopy) . The paths themselves, of course, are assumed to be as regular as they need to be for integration over them. 10.
a) Show that if the forms w P , w P - 1 , and w P - 1 are such that w P dw P - 1 dwP 21 , then (at least locally) one can find a form wP - 2 such that wP - 1 wP - 1 + of a form have dwP - . (The fact that any two forms that differ by the differential the same differential obviously follows from the relation d 2 w 0.) 11.
=
=
=
=
b) Show that the potential t.p of an electrostatic field (Problem 3) is determined up to an additive constant, which is fixed if we require that the potential tend to zero at infinity.
12. The Maxwell equations (formula (14.12) of Sect. 14. 1) yield the following pair of magnetostatic equations: V · B = 0, V x B = - � . The first of these shows EQC that at least locally, B has a vector potential A, that is, B = V x A. a) Describe the amount of arbitrariness in the choice of the potential A of the magnetic field B (see Problem l la) ) . b ) Let x , y , z b e Cartesian coordinates i n JR3 . Find potentials A for a uniform magnetic field B directed along the z-axis, each satisfying one of the following additional requirements: the field A msut have the form (0, A y , 0) ; the field A must have the form (A x , O, O) ; the. field A must have the form (Ax , A y , O) ; the field A must be invariant under rotations about the z-axis. c) Show that the choice of the potential A satisfying the additional requirement V · A = 0 reduces to solving Poisson's equation; more precisely, to finding a scalar valued function 'lj; satisfying the equation LJ.'lj; = f for a given scalar-valued function
f.
d) Show that if the potential A of a static magnetic field B is chosen so that · A = 0, it will satisfy the vector Poisson equation LJ.A = - h . Thus, invoking EQC the potential makes it possible to reduce the problem of finding electrostatic and magnetostatic fields to solving Poisson's equation. V
14 Elements of Vector Analysis and Field Theory 1 3 . The following theorem of Helmholtz 1 0 is well known: Every smooth field F in a domain D of oriented Euclidean space JR(3 can be decomposed into a sum F = F 1 +F 2 of an irrotational field F 1 and a solenoidal field F 2 . Show that the construction of such a decomposition can be reduced to solving a certain Poisson equation. 302
14. Suppose a given mass of a certain substance passes from a state characterized thermodynamically by the parameters Vo , Po (To ) into the state V, P, (T) . Assume that the process takes place slowly (quasi-statically) and over a path 1 in the plane of states (with coordinates V, P) . It can be proved in thermodynamics that the quantity S = J 8:} , where tiQ is the heat exchange form, depends only on the "'
initial point (Vo , Po ) and the terminal point (V, P) of the path, that is, after one of these points is fixed, for example (Vo , Po) , S becomes a function of the state (V, P) of the system. This function is called the entropy of the system. a) Deduce from this that the form w = 8:} is exact, and that w = dS. b) Using the form of tiQ given in Problem 6 of Sect. 13.1 for an ideal gas, find the entropy of an ideal gas.
14.4 Examples of Applications To show the concepts we have introduced in action, and also to explain the physical meaning of the Gauss-Ostrogradskii-Stokes formula as a conserva tion law, we shall examine here some illustrative and important equations of mathematical physics. 14.4 . 1 The Heat Equation We are studying the scalar field T = T(x, y , z, t) of the temperature of a body being observed as a function of the point (x, y, z ) of the body and the time t. As a result of heat transfer between various parts of the body the field T may vary. However, this variation is not arbitrary; it is subject to a particular law which we now wish to write out explicitly. Let D be a certain three-dimensional part of the observed body bounded by a surface S. If there are no heat sources inside S, a change in the internal energy of the substance in D can occur only as the result of heat transfer, that is, in this case by the transfer of energy across the boundary S of D. By computing separately the variation in internal energy in the volume D and the flux of energy across the surface S, we can use the law of conservation of energy to equate these two quantities and obtain the needed relation. It is known that an increase in the temperature of a homogeneous mass m by LJ.T requires energy cmLJ.T, where c is the specific heat capacity of 1 0 H.L.F. Helmholtz (182 1-1894) - German physicist and mathematician; one of the first to discover the general law of conservation of energy. Actually, he was the first to make a clear distinction between the concepts of force and energy.
14.4 Examples of Applications
303
the substance under consideration. Hence if our field T changes by LJ.T = T(x, y, z, t + LJ.t) - T(x, y, z, t) over the time interval LJ.t, the internal energy in D will have changed by an amount
JJDJ
cpLJ.T dV ,
(14.63)
where p = p(x, y, z ) is the density of the substance. It is known from experiments that over a wide range of temperatures the quantity of heat flowing across a distinguished area du = n da per unit time as the result of heat transfer is proportional to the flux -grad T · du of the field -grad T across that area (the gradient is taken with respect to the spatial variables x, y, z ) . The coefficient of proportionality k depends on the substance and is called its coefficient of thermal conductivity. The negative sign in front of grad T corresponds to the fact that the energy flows from hotter parts of the body to cooler parts. Thus, the energy flux (up to terms of order o ( LJ.t)) -k grad T · du (14.64)
LJ.t JJ s
takes place across the boundary S of D in the direction of the external normal over the time interval LJ.t. Equating the quantity (14.63) to the negative of the quantity ( 14.64) , dividing by LJ.t, and passing to the limit as LJ.t -t 0, we obtain
JJJD
cp
k grad T · du . �� dV = JJ s
(14.65)
This equality is the equation for the function T. Assuming T is sufficiently smooth, we transform ( 14.65) using the Gauss-Ostrogradskii formula:
JJJD
cp
�� dV = JJJ div (k grad T) dV . D
Hence, since D is arbitrary, it follows obviously that ar cp 8t = div (k grad T) .
(14.66)
We have now obtained the differential version of the integral equation (14.65) . If there were heat sources (or sinks) in D whose intensities have density F(x, y, z, t), instead of ( 14.65) we would write the equality
JJDJ
cp
a:; dV = JsJ k grad T · du + JJDJ F d V ,
(14.65')
304
14 Elements of Vector Analysis and Field Theory
and then instead of ( 14.66) we would have the equation aT cp at
=
. (k grad T ) + F . d1v
(14.66')
If the body is assumed isotropic and homogeneous with respect to its heat conductivity, the coefficient k in (14.66) will be constant , and the equation will transform to the canonical form 8T (14.67) 8t t1T + f ,
= a2
where f = :;, and ckp is the coefficient of thermal diffusivity. The equation (14.67) is usually called the heat equation. In the case of steady-state heat transfer, in which the field T is indepen dent of time, this equation becomes Poisson 's equation
a2 =
t1T
=
'P ,
(14.68)
t1T
=
0.
(14.69)
where 'P - � f; and if in addition there are no heat sources in the body, the result is Laplace 's equation
=
The solutions of Laplace's equation, as already noted, are called har In the thermophysical interpretation, harmonic functions correspond to steady-state temperature fields in a body in which the heat flows occur without any sinks or sources inside the body itself, that is, all sources are located outside the body. For example, if we maintain a steady temperature distribution T l av = T over the boundary aV of a body, then the temperature field in the body V will eventually stabilize in the form of a harmonic function T. Such an interpretation of the solutions of the Laplace equation (14.69) enables us to predict a number of properties of harmonic functions. For example, one must presume that a harmonic function in V can not have local maxima inside the body; otherwise heat would only flow away from these hotter portions of the body, and they would cool off, contrary to the assumption that the field is stationary.
monic functions.
14.4.2 The Equation of Continuity Let p p(x, y, z, t ) be the density of a material medium that fills a space being observed and = v(x, y, z, t ) the velocity field of motion of the medium as function of the point of space (x, z ) and the time t.
=
v
y,
From the law of conservation of mass, using the Gauss-Ostrogradskii for mula, we can find an interconnection between these quantities.
14.4 Examples of Applications
305
Let D be a domain in the space being observed bounded by a surface S. Over the time interval Llt the quantity of matter in D varies by an amount
JJDJ (p(x,
y, z, t + Ll t)
- p(x, y, z, t) ) dV .
Over this small time interval Ll t, the flow of matter across the surface S in the direction of the outward normal to S is ( up to o(Llt))
Llt ·
JJ p s
If there were no sources or sinks in matter, we would have
or, in the limit as
v · du
.
D, then by the law of conservation of
JJDJ Llp
dV =
JJDJ a;:
dV = -
-
Llt
Llt -+ 0
JSJ p
JSJ p
v · du
v · du .
Applying the Gauss-Ostrogradskii formula to the right-hand side of this equality and taking account of the fact that D is an arbitrary domain, we con clude that the following relation must hold for sufficiently smooth functions p and v: (14.70) = -div (pv) ,
a;:
called the equation of continuity of a continuous medium. In vector notation the equation of continuity can be written as
ap + \7 . (pv) = 0 ' at
(14.70')
ap + v . v + p\7 . v = o , at
(14. 70")
or, in more expanded form,
P
If the medium is incompressible ( a liquid ) , the volumetric outflow of the medium across a closed surface S must be zero:
JJ s
v · du = O ,
from which ( again on the basis of the Gauss-Ostrogradskii formula ) it follows that for an incompressible medium div v = 0
.
(14.71)
306
14 Elements of Vector Analysis and Field Theory
Hence, for an incompressible medium of variable density ( a mixture of water and oil ) Eq. ( 14 . 70") becomes ( 14.72) If the medium is also homogeneous, then \7 p = 0 and therefore if
=
0.
14.4.3 The Basic Equations of the Dynamics of Continuous Media We shall now derive the equations of the dynamics of a continuous medium moving in space. Together with the functions p and v already considered, which will again denote the density and the velocity of the medium at a given point ( y , z ) of space and at a given instant t of time, we consider the pressure p = ( y, z, t) as a function of a point of space and time. In the space occupied by the medium we distinguish a domain D bounded by a surface S and consider the forces acting on the distinguished volume of the medium at a fixed instant of time. Certain force fields ( for example, gravitation ) are acting on each element p d V of mass of the medium. These fields create the so-called mass forces. Let F = ( y , z , t) be the density of the external fields of mass force. Then a force Fp dV acts on the element from the direction of these fields. If this element has an acceleration a at a given instant of time, then by Newton ' s second law, this is equivalent to the presence of another mass force called inertia, equal to -ap dV. Finally, on each element dO" = n da of the surface S there is a surface tension due to the pressure of the particles of the medium near those in D, and this surface force equals -p dlT ( where n is the outward normal to S) . By d ' Alembert ' s principle, at each instant during the motion of of any material system, all the forces applied to it , including inertia, are in mutual equilibrium, that is, the force required to balance them is zero. In our case, this means that (F - a )p dV p dlT = O . ( 14.73)
xp, x ,
F x,
JJJD
!!S
The first term in this sum is the equilibrant of the mass and inertial forces, and the second is the equilibrant of the pressure on the surface S bounding the volume. For simplicity we shall assume that we are dealing with an ideal ( nonviscous ) fluid or gas, in which the pressure on the surface dO" has the form p dlT , where the number p is independent of the orientation of the area in the space. Applying formula ( 14.47) from Sect . 14.2, we find by (14.73) that
!!!D
( F - a ) p dV -
JJDJ
grad p dv = 0 ,
14.4 Examples of Applications
307
from which, since the domain D is arbitrary, it follows that
pa = pF - grad p .
( 14.74)
In this local form the equation of motion of the medium corresponds perfectly to Newton ' s law of motion for a material particle. The acceleration a of a particle of the medium is the derivative �� of the velocity v of the particle. If x = x ( t ) , y = y ( t ) , z = z ( t ) is the law of motion of a particle in space and v = v ( x, y, z, t ) is the velocity field of the medium, then for each individual particle we obtain
av av dx av dy av dz a = dv dt = at + ax dt + ay dt + az dt or
a = av at + ( v V ) v . ·
Thus the equation of motion (14.74) assumes the following form dv 1 p = F - -grad dt
( 14.75)
p
or
av + ( v V ) v = F - -1 \7p . ( 14.76) at Equation ( 14. 76) is usually called Euler 's hydrodynamic equation . The vector equation (14.76) is equivalent to a system of three scalar equa tions for the three components of the vector v and the pair of functions p ·
P
and p. Thus, Euler ' s equation does not completely determine the motion of an ideal continuous medium. To be sure, it is natural to adjoin to it the equation of continuity ( 14.70) , but even then the system is underdetermined. To make the motion of the medium determinate one must also add to Eqs. ( 14.70) and ( 14.76) some information on the thermodynamic state of the medium (for example, the equation of state f(p, p , T) 0 and the equation for heat transfer. ) The reader may obtain some idea of what these relations can yield in the final subsection of this section. =
14.4.4
The Wave Equation
We now consider the motion of a medium corresponding to the propagation of an acoustic wave. It is clear that such a motion is also subject to Eq. ( 14.76) ; this equation can be simplified due to the specifics of the phenomenon. Sound is an alternating state of rarefaction and compression of a medium, the deviation of the pressure from its mean value in a sound wave being very
308
14 Elements of Vector Analysis and Field Theory
small - of the order of 1%. Therefore acoustic motion consists of small devi ations of the elements of volume of the medium from the equilibrium posi tion at small velocities. However, the rate of propagation of the disturbance ( wave ) through the medium is comparable with the mean velocity of motion of the molecules of the medium and usually exceeds the rate of heat trans fer between the different parts of the medium under consideration. Thus, an acoustic motion of a volume of gas can be regarded as small oscillations about the equilibrium position occurring without heat transfer ( an adiabatic process ) . Neglecting the term ( v · V' ) v in the equation of motion ( 14. 76) in view of the small size of the macroscopic velocities v, we obtain the equality P
8v 8t
=
pF - V'p .
If we neglect the term of the form 'tJi v for the same reason, the last equality reduces to the equation
[}
v ot ( p ) Applying the operator V' ( on
=
pF - V'p .
x,y, z coordinates) to it , we obtain
(V' v ) = V' . pF Llp . 8t . p Using the equation of continuity ( 14. 70') and introducing the notation V' · pF = - tl> , we arrive at the equation
[}
-
( 14.77) If we can neglect the influence of the exterior fields, Eq. ( 14. 77) reduces to the relation ( 14.78 ) between the density and pressure in the acoustic medium. Since the process is adiabatic, the equation of state f(p, p , T) = 0 reduces to a relation p = '1/J(p) , from which it follows that � = '1/J'(p) ftJ. + 'l/J"(p) ( '1Jf ( Since the pressure oscillations are small in an acoustic wave, one may assume that '1/J'(p) = 'l/J'(p0 ) , where Po is the equilibrium pressure. Then '1/J" = 0 and � � '1/J'(p) � . Taking this into account , from ( 14.78 ) we finally obtain
1
[}2 p 8t 2
= a
2 Llp ,
( 14.79)
where a = ('1/J'(po ) ) - 1 2 . This equation describes the variation in pressure in a medium in a state of acoustic motion. Equation ( 14.79) describes the
14.4 Examples of Applications
309
simplest wave process in a continuous medium. It is called the homogeneous wave equation. The quantity a has a simple physical meaning: it is the speed of propagation of an acoustic disturbance in the medium, that is, the speed of sound in it (see Problem 4). In the case of forced oscillations, when certain forces are acting on each element of volume of the medium, the three-dimensional density of whose distribution is given, Eq. (14.79) is replaced by the relation
{)2p
at 2
=
a
2 Llp + f
corresponding to Eq. ( 14.77) , which for f ¢. 0 is called the
wave equation.
( 14.80 )
inhomogeneous
14.4.5 Problems and Exercises 1. Suppose the velocity field v of a moving continuous medium is a potential field. Show that if the medium is incompressible, the potential
a ) Show that Euler's equation ( 14.76) can b e rewritten as
�: + grad (� v2 ) - v x curl v = F - � grad p
(see Problem 1 of Sect. 14. 1 ) . b ) Verify o n the basis of the equation of a ) that a n irrotational flow (curl v = 0 ) of a homogeneous incompressible liquid can occur only i n a potential field F . c ) I t turns out (Lagrange's theorem) that i f at some instant the flow i n a po tential field F = grad U is irrotational, then it always has been and always will be irrotational. Such a flow consequently is at least locally a potential flow, that is, v = grad
)
(
-
-
310
14 Elements of Vector Analysis and Field Theory
3. A flow whose velocity field has the form v = ( Vx , Vy , 0) is naturally called plane parallel or simply a planar flow. a) Show that the conditions div v = 0, curl v = 0 for a flow to be incompressible and irrotational have the following forms: avx _ avy = 0 .
ay
ax
b) Show that these equations at least locally guarantee the existence of functions
'1/J(x, y) and r.p(x, y) such that ( -vy , vx ) = grad 'I/J and (vx , Vy ) = grad r.p. c) Verify that the level curves r.p = c 1 and '1/J = c2 of these functions are or thogonal and show that in the steady-state flow the curves '1/J = coincide with the trajectories of the moving particles of the medium. It is for that reason that the function '1/J is called the current function, in contrast to the function r.p, which is the c
velocity potential. d) Show, assuming that the functions r.p and '1/J are sufficiently smooth, that they are both harmonic functions and satisfy the Cauchy-Riemann equations:
ar.p = a'!jJ ar.p ax ay ' ay Harmonic functions satisfying the Cauchy-Riemann equations are called conjugate harmonic functions. e) Verify that the function f(z) = (r.p + i'!jJ) (x, y) , where z = x + iy, is a dif ferentiable function of the complex variable z. This determines the connection of the planar problems of hydrodynamics with the theory of functions of a complex variable.
4. Consider the elementary version � = a 2 � of the wave equation (14.79) . This is the case of a plane wave in which the pressure depends only on the x-coordinate of the point (x, y, z) of space. a) By making the change of variable u = x - at, v = x + at, reduce this equation to the form aO:Cv = 0 and show that the general form of the solution of the original equation is p = f(x + at) + g(x - at) , where f and g are arbitrary functions of class
c< 2 l .
b) Interpret the solution just obtained as two waves J(x) and g(x) propagating left and right along the x-axis with velocity a. c) Assuming that the quantity a is the velocity of propagation of a dis turbance even in the general case (14. 79) , and taking account of the relation -1 12 , find, following Newton, the velocity CN of sound in air, assum a = ( '1/J'(po) ) ing that the temperature in an acoustic wave is constant, that is, assuming that the process of acoustic oscillation is isothermic. (The equation of state is p = ]ff , R = 8.31 d J 1 is the universal gas constant, and 1-£ = 28.8 __£_ 1 is the molecular eg·mo e mo e weight of air. Carry out the computation for air at a temperature of 0° C, that is, T = 273 K. Newton found that CN = 280 m/s.) d) Assuming that the process of acoustic vibrations is adiabatic, find, following Laplace, the velocity CL of sound in air, and thereby sharpen Newton's result CN . (In an adiabatic process p = cp"� . This is Poisson's formula from Problem 6 of
311
14.4 Examples of Applications
Sect. 13. 1. Show that i f CN = � ' then C L = 'Y � · For air "( � 1 . 4 . Laplace found CL = 330 m/s, which is in excellent agreement with experiment.) Using the scalar and vector potentials one can reduce the Maxwell equations (( 14.12) of Sect. 14. 1) to the wave equation (more precisely, to several wave equations of the same type) . By solving this problem, you will verify this statement. a) It follows from the equation V' · B = 0 that at least locally B = V' x A, where A is the vector potential of the field B . b ) Knowing that B = V' x A, show that the equation V' x E = �� implies that at least locally there exists a scalar function <.p such that E = - V' <.p �� c) Verify that the fields E = -V' <.p - �� and B = V' x A do not change if instead of <.p and A we take another pair of potentials cp and A such that cp = <.p � and A = A + V'"¢, where "1/J is an arbitrary function of class c <2 l . d) The equation V' · E = 1'o implies the first relation -V' 2 c.p - -ft Y' · A = /eo between the potentials <.p and A. e) The equation c2 V' x B - ft = a!o implies the second relation 5.
-
-
•
-
a j o2 A = 2 V' 2 A + c2 V'(V' · A) + EJt V'c.p + EJt 2 co between the potentials <.p and A. f) Using c) , show that by solving the auxiliary wave equation .d"¢ + f = � � ' without changing the fields E and B one can choose the potentials <.p and A so that they satisfy the additional (so-called gauge) condition V' A = � � . g) Show that if the potentials <.p and A are chosen as stated in f) , then the required inhomogeneous wave equations -c
·
-
for the potentials c.p and A follow from d) and e) . By finding <.p and A, we also find the fields E = V'c.p, B = V' x A.
1 5 *Integrat ion of D ifferent ial Forms
on Manifolds
1 5 . 1 A Brief Review of Linear Algebra 15. 1 . 1 The Algebra of Forms Let X be a vector space and p k : X k ---+ � a real-valued k-form on X . If e1 , . . . , en is a basis in X and x 1 = x i ' e i, , . . . , X k = x i k e i k is the expansion of the vectors x 1 , . . . , X k E X with respect to this basis, then by the linearity of p k , with respect to each argument (15.1) Thus, after a basis i s given i n X , one can identify the k-form p k : X k ---+ � with the set of numbers a i , ... i k = F k (e i , , . . . , e i k ). lf e1 , . . . , en is another basis in X and a1, ... Jk = F k (eh , . . . , eJk ) , then, setting e1 = cj e i , j = 1 , . . . , n, we find the ( tensor ) law ( 15.2) for transformation of the number sets a i, ... i k ' aj, ... j k corresponding to the same form p k . The set Fk := {F k : X k ---+ �} of k-forms on a vector space X is itself a vector space relative to the standard operations
(F1k + F�)(x) := Ff(x) + F�(x) , ( >-. Fk )(x) := >-. F k (x) of addition of k-forms and multiplication of a k-form by a scalar. For forms p k and F 1 of arbitrary degrees k and l the following product operation ® is defined:
( 15.3) ( 15.4)
tensor
(F k ® F1 )(x 1 , . . . , x k , X k+l , · · · , X k+ l ) := ( 15.5) = Fk (x 1 , . . . , x k )F1 (x k+l , · · · , x k+ l ) . Thus p k ® F 1 is a form p k+ l of degree k + l. The following relations are
obvious:
15 *Integration of Differential Forms on Manifolds
314
( >-. F k ) F1 = >-. (F k F1 ) , (15.6) (15.7) (Ff + F�) 0 F1 = Ff 0 F1 + F� 0 F1 , k k k (15.8) F ( F{ + F�) F F{ + F F� , k m k m 1 1 F ) F = F ( 15.9) (F (F F ) . Thus the set F = {Fk } of forms on the vector space X is a graded algebra F Q9 Fk with respect to these operations, in which the vectork space operations are carried out inside each space Fk occurring in the direct sum, and if F k E Fk , F 1 E F1 , then F k F 1 E Fk +l . Example 1 . Let X* be the dual space to X ( consisting of the linear function als on X) and e 1 , . . . , e n the basis of X* dual to the basis e 1 , . . . , e n in X, that is, e i (e1 ) = Jj . Since e i (x) e i ( xi e 1 ) xi i ( e 1 ) = xi Jj = x i , taking account of (15. 1) and ( 1 5.9) , we can write any k-form F k : X k JR. as 181
181
181
=
181
181
181
181
181
181
=
181
=
=
e
----t
(15. 10)
1 5 . 1 . 2 The Algebra of Skew-symmetric Forms Let us now consider the space w E f? k if the equality
[l k of skew-symmetric forms in Fk , that is,
w(x 1 , . . . , xi , · · · , xj , · · · , x k ) = -w(x 1 , , x1 , . . . , xi , · · · , xk ) holds for any distinct indices i , j E { 1 , . . . , n}. From any form F k E Fk one can obtain a skew-symmetric form using the operation A : F [l k of alternation, defined by the relation (15. 1 1 ) AFk ( XI ' . . . ' X k ) . - -k.1, F k ( Xi, ' . . . ' Xi k ) u l . ... .k. i k ' .
----t
•
.
s: i ,
. _
where 1,
i, ...ki k Jl...
-1, 0,
( i11 ·. ·. ·. ikk ) is even , if the permutation ( i11 ·. .· .· ikk ) is odd , if ( i � : : : �k ) is not a permutation .
i f the permutation
If F k is a skew-symmetric form, then, as one can see from (15. 1 1 ) , AF k = k F . Thus A(AF k ) AFk and Aw w if w E il k . Hence A : Fk f?k is a mapping of Fk onto [l k . =
=
Comparing Definitions ( 1 5.3) , ( 15.4) , and ( 1 5. 1 1 ) , we obtain
A(Ff + F�) AFf + AF� , A(;>..F k ) >-.AF k . =
=
----t
( 15. 12) (15. 13)
315
1 5 . 1 A Brief Review of Linear Algebra
Example 2 .
(15. 10) that
Taking account of relations (15. 12) and ( 1 5 . 1 3) , we find by
AF k =
eik ) A(ei ' so that it is of interest to find A( e i ' ® . . . ® e i k ) .
a
·
r0-. \CY
·
� l · · · �k
•
•
•
r0-. iCY
'
From Definition ( 1 5. 1 1 ) , taking account of the relation e i (x) = x i , we find A( eJ l '
·
- k!
·
x11 '
·
.
.
.
·
xj1 k
-
(15. 14)
x1k'
x1kk
The tensor product of skew-symmetric forms is in general not skew symmetric, so that we introduce the following exterior product in the class of skew-symmetric forms:
+ l)! A(w k ® w l . w k w l := (kk!l! (15. 15) Thus w k w 1 is a skew-symmetric form w k+ l of degree k + l. Example 3 . Based on the result (15. 14) of Example 2, we find by Definition 1\
1\
)
(15. 15) that
(15. 16)
Example 4. Using the equality obtained in Example 3, relation ( 1 5 . 1 4) , and the definitions ( 1 5. 1 1 ) and ( 1 5 . 1 5 ) , we can write e i i ( e i2 e i3 )(x 1 , x 2 , x3 ) = = ( 1 1+. 2 2). 1· A ( e '' ® (e '2 ® e ' 3 ) ) (x 1 , x 2 , x 3 ) = 1\
1\
1 1
. . .
A similar computation shows that
e i ' ( e i 2 e i3 ) (e i ' e i 2 ) e i 3 1\
1\
=
1\
1\
.
( 1 5 . 1 7)
316
15 *Integration of Differential Forms on Manifolds
Using the expansion of the determinant along a column, we conclude by induction that ei l
1\ · · · 1\
e ik (
x 1 , . . . , X k ) --
(15. 18)
and, as one can see from the computations just carried out, formula (15. 18) holds for any 1-forms e i 1 , . . . , e ik (not just the basis forms of the space X * ) . Taking the properties of the tensor product and the alternation operation listed above into account , we obtain the following properties of the exterior product of skew-symmetric forms: (15. 19) (wt + w�) 1\ w 1 = wt 1\ w 1 + w� 1\ w 1 , k k 1\ 1\ 1 1 (15.20) (.Xw ) w = .X(w w ) , w k 1\ w 1 = ( -1) k 1 w 1 1\ w k , (15.21) (w k 1\ w 1 ) 1\ w m = w k 1\ (w 1 1\ w m ) . (15.22) Proof. Equalities (15. 19) and (15.20) follow obviously from relations ( 15.6) ( 1 5 .8) and ( 1 5 . 12) and ( 1 5 . 1 3 ) . From relations (15. 10)-( 15. 14) and (15. 17) , for every skew-symmetric form w = a; 1 . . . ik e i 1 ® · · · ® e ik we obtain w = Aw = a'· l · · · '· k A ( e '. 1 ® · · · ® e '. k ) = -1 a'· l · · · ' k e '. 1 1\ · · · 1\ e'. k .
k!
·
Using the equalities (15. 19) and (15.20) we see that it now suffices to verify ( 1 5 . 2 1 ) and ( 15.22) for the forms e i 1 1\ · · . 1\ e ik . Associativity (15.22) for such forms was already established by (15.17). We now obtain (15.21) immediately from (15. 18) and the properties of determinants for these particular forms. 0 Along the way we have shown that every form w E fl k can be represented as W= (15.23) l "' i 1 < i 2 < · · - < ik "'n of skew-symmetric forms on the vector space =
Thus, the set fl { fl k } X relative to the linear vector-space operations ( 15.3) and ( 15.4) and the dim X
exterior multiplication ( 1 5 . 1 5 ) is a graded algebra fl = E9
fl k . The vectorspace operations on fl are carried out inside each vector space fl k , and if w k E fl k , w 1 E fl1 , then w k w 1 E fl k + l . In the direct sum E9 fl k the summation runs from zero the dimension of the space X , since the skew-symmetric forms w k : X k JR. of degree larger 1\
k =O
�
than the dimension of X are necessarily identically zero, as one can see by ( 15.21) (or from relations (15.23) and ( 15.8) ) .
15.1 A Brief Review of Linear Algebra
317
15.1.3 Linear Mappings o f Vector Spaces and the Adjoint Mappings of the Conjugate Spaces Let X and Y be vector spaces over the field JR. of real numbers (or any other field, so long as it is the same field for both X and Y ) , and let l : X -+ Y be a linear mapping of X into Y , that is, for every x, x 1 , x 2 E X and every
,\ E JR.,
+
(15 .24) l(x 1 x 2 ) = l(x i ) l(x 2 ) and l(-\x) = ,\l(x) . A linear mapping l : X -+ Y naturally generates its adjoint mapping l* : Fy -+ Fx from the set of linear functionals on Y (Fy) into the analogous set Fx. If F� is a k-form on Y, then by definition
+
( 1 5.25) It can be seen by (15.24) and (15.25) that l* F� is a k-form F� on X, that is, l* (FM C F� . Moreover, if the form F� was skew-symmetric, then ( l* FM = F� is also skew-symmetric, that is, l* ( !!� ) C !!!\: . Inside each vector space F� and !!� the mapping l* is obviously linear, that is,
+
+
l* (Ff F�) = l* Ff l* F�
and
l*(,\F k ) = -\l* p k .
( 1 5. 26)
Now comparing definition ( 1 5.25) with the definitions ( 1 5.5) , ( 1 5. 1 1 ) , and (15. 15) of the tensor product , alternation, and exterior product of forms, we conclude that ( 1 5.27) l*(FP ® Fq ) = (l *FP ) ® (l*Fq ) , ( 1 5.28) l*(AFP ) = A(l* FP ) ' P ( 1 5 . 29) l*(w w q ) = (l *wP ) (l*wq) . Example 5 . Let e 1 , . . . , e m be a basis in X , e1 , . . . , en a basis in Y , and l(e i ) = d; ej , i E { 1 , . . . , m} , j E { 1 , . . . , n} . If the k-form F� has the coordinate representation Fyk ( Y 1 , . . . , yk ) -- b)l ...)k Y)l 1 · . . . yJkk in the basis e1 , . . . , en , where bj 1 ... )k = F�( ej 1 , , e)k ), then (l*Fyk )( X 1 , . . . , X k ) = ai 1 ... i k Xil1 . . . · Xikk , where ai 1 i k = bj 1 ...j k c1,� . . . · c1: , since ai 1 . i k =: (l* F� )(ei1 , , e i k ) := F� (le i 1 , , le i k ) = = F� ( c;� e1 1 ' . . . , c1:e1 k ) = F� ( ej p . . . , ej k )d;� . . . · c;: . Example 6. Let e 1 , . . . , e m and e1 , . . . , e'' be the bases of the conjugate spaces X* and Y* dual to the bases in Example 5. Under the hypotheses of Example 5 we obtain (l* el )(x) = (l*eJ )(xi e i ) = el (x i le i ) = xi cJ (c� ek ) = = xi c� eJ (ek ) = xi c� J� = q xi = q e i (x) . 1\
1\
·
•
•
•
·
·
• • •
•
•
•
•
•
•
·
1 5 *Integration of Differential Forms on Manifolds
318
Example 7.
Retaining the notation of Example 6 and taking account of rela tions ( 15.22) and ( 15.29) , we now obtain
l* (eh
l* eJ k e:i k ) l* e:il ( d'l. kk e i k ) 2'1. 11 . . . . . d'l. kk e i 1 ( dl1-1 e i l ) dl'l. l
1\ . . . 1\
=
1\ . . . 1\
1\ . . . 1\
=
=
1\ . . . 1\
d'l. lk
eik
=
Keeping Eq. ( 1 5. 26) in mind, we can conclude from this that
l�l�ij11 <···
bJ· l ··· J. k
1 5 . 1 .4 Problems and Exercises 1.
Show by examples that in general a) p k 0 p l "I= p l 0 F k ; b) A(F k ® F 1 ) "/= AF k ® AF 1 ; c) if F k , F 1 E fl, then it is not always true that F k ® F 1
E
fl.
2. a) Show that if e 1 , . . . , e n is a basis of the vector space X and the linear func tionals e 1 , , en on X (that is elements of the conjugate space X*) are such that ei (ei ) = 8f , then e1, . . . , en is a basis in X* . b) Verify that one can always form a basis of the space Fk = Fk ( X ) from k-forms of the form ei 1 ® · · · ® ei k , and find the dimension (dim Fk ) of this space, knowing that dim X = n . c) Verify that one can always form a basis of the space fl k from forms of the form ei 1 1\ · · · 1\ ei k , and find dim fl k knowing that dim X = n . • • •
d) Show that if f1 = E9 fl k , then dim f1 = 2n.
k=n k =O=
15.1 A Brief Review of Linear Algebra
319
The exterior {Grassmann/ algebra G over a vector space X and a field P (usually denoted 1\ (X) in agreement with the symbol 1\ for the multiplication operation in G) is defined as the associative algebra with identity 1 having the following properties: 1 ° G is generated by the identity and X, that is, any subalgebra of G containing 1 and X is equal to G; 2 ° x 1\ x = 0 for every vector x E X; 3 ° dim G = 2 d i m X . a) Show that if e 1 , . . . , e n is a basis in X , then the set 1 , e 1 , . . . , e n , e 1 1\ e 2 , . . . , e n - 1 1\ e n , . . . , e 1 1\ · · · 1\ e n of elements of G of the form e;1 1\ · · · 1\ e; k = e1 , where I = { i 1 < · · · < i k } C { 1 , 2, . . . , n} , forms a basis in G. b) Starting from the result in a) one can carry out the following formal con struction of the algebra G = 1\(X). For the subsets I = {i 1 , . . . , i k } of {1, 2, . . . , n} shown in a) we form the formal elements e1 , (by identifying € { i } with e ; , and ee; with 1 ) , which we take as a basis of the vector space G over the field P. We define multiplication in G by the formula 3.
(� ) (� ) a1 e 1
bJeJ
=
f.;
ai bJ E (I, J ) e w J
,
where E(I, J) = sgn IT (j i) . Verify that the Grassmann algebra /\ (X) is iEl, jEJ obtained in this way. c) Prove the uniqueness (up to isomorphism) of the algebra /\(X) . k=n d) Show that the algebra /\ (X) is graded: /\(X) = ffi 1\ k (X) , where 1\ k (X) k =O is the linear span of the elements of the form e;1 1\ · · · I\ e; k ; here if a E /\P(X) and p b E N (X) , then a 1\ b E 1\ +q (X) . Verify that a 1\ b = ( - l ) Pq b 1\ a. -
a) Let A : X -+ Y be a linear mapping of X into Y. Show that there exists a unique homomorphism 1\ (A) : /\(X) -+ 1\(Y) from /\( X) into 1\(Y) that agrees with A on the subspace /\' (X) C 1\ (X) identified with X. b ) Show that the homomorphism A (A) maps A k (X) into A k (Y). The restriction of 1\ (A) to 1\ k (X) is denoted by 1\ k (A) . c) Let { e; : i = 1 , . . , m } be a basis in X and { e i : j = 1 , . . . , n} a basis in Y, and let the matrix ( aj ) correspond to the operator A in these bases. Show that if { e1 : I C { 1 , . . . , m } } , { e J : J C { 1 , . . . , n}} are the corresponding bases of the spaces /\ (X) and 1\ (Y) , then the matrix of the operator 1\ k (A) has the form a � = det(a; ) , i E I, j E J, where card ! = card J = k . d) Verify that if A : X -+ Y, B : Y -+ Z are linear operators, then the equality 1\ ( B o A) = 1\ ( B ) o 1\ (A) holds. 4.
.
1 H. Grassmann (1809-1877) - German mathematician, physicist and philologist; in particular, he created the first systematic theory of multidimensional and Euclidean vector spaces and gave the definition of the inner product of vectors.
320
15 *Integration of Differential Forms on Manifolds
1 5 . 2 Manifolds 1 5 . 2 . 1 Definition of a Manifold Definition 1. A Hausdorff topological space whose topology has a count able base 2 is called an n-dimensional manifold if each of its points has a neighborhood U homeomorphic either to all of JR. n or to the half-space
Hn = {x E 1Rn l x 1 :::; o}.
cp : JR.n U M (or Hn U M ) that realizes the homeomorphism of Definition 1 is a local chart of the manifold M, JR.n (or H n ) is called the parameter domain, and U the range of the chart Definition 2 . A mapping
-+
C
cp
:
-+
C
on the manifold M.
A local chart endows each point x E U with the coordinates of the point cp - 1 ( x) E JR.n corresponding to it. Thus, a local coordinate system is introduced in the region U; for that reason the mapping cp, or, in more ex panded notation, the pair (U, cp) is a map of the region U in the ordinary meaning of the term.
t
=
Definition 3. A set of charts whose ranges taken together cover the entire manifold is called an atlas of the manifold. Example 1 . The sphere 82 = { x E JR3 I I x l = 1 } is a two-dimensional mani fold. If we interpret 8 2 as the surface of the Earth, then an atlas of geograph ical maps will be an atlas of the manifold 8 2 . The one-dimensional sphere 8 1 = {x E JR 2 I Ix l = 1 } - a circle in JR 2 is obviously a one-dimensional manifold. In general, the sphere 8n = {X E JR.n + 1 l l x l = 1 } is an n-dimensional manifold. (See Sect. 1 2 . 1 . )
Remark 1 . The object (the manifold M) introduced by Definition 1 obviously does not change if we replace JR. n and H n by any parameter domains in JR. n homeomorphic to them. For example, such a domain might be the open cube In = {x E JR.n l 0 < xi < 1 , i = 1 , . . . , n} and the cube with a face attached Jn = {x E JR.n l o < x 1 :::; 1 , 0 < xi < 1 , i = 2, . . . , n} . Such standard parameter domains are used quite often. It is also not difficult to verify that the object introduced by Definition 1 does not change if we require only that each point x E M have a neighborhood U in M homeomorphic to some open subset of the half-space Hn .
Example 2. If X is an m-dimensional manifold with an atlas of charts {(U0, cp0)} and Y is an n-dimensional manifold with atlas {(V,B , �,B)}, then X x Y can be regarded as an ( m + n )-dimensional manifold with the atlas {(Wa,B, Xa,B) }, where Wa,B = Ua X V,13 and the mapping Xa,B = (cpa, �,B) maps the direct product of the domains of definition of 'Pa and �/3 into Wa ,B · 2 See Sect. 9.2 and also Remarks 2 and 3 in the present section.
15.2 Manifolds
321
In particular, the two-dimensional torus = S 1 x S 1 ( Fig. 12. 1 ) or the 1 1 n-dimensional torus Tn = 8 X · · · X 8 is a manifold of the corresponding '--v----" n factors dimension.
T2
If the ranges Ui and Uj of two charts ( Ui , 'P i ) and ( Uj , r.pj ) of a manifold M intersect, that is, Ui n Uj =f. 0, mutually inverse homeomorphisms 'Pij : Iij -+ Ij i and1 '{!j i : Ij i -+ Iij naturally arise between the sets Iij = 1'Pi 1 (Uj) and Ij i = r.pj (Ui ) · These homeomorphisms are given by 'f!i j = r.pj o 'P i l rij and '{!j i = r.pi 1 o 'Pj l rJ • . These homeomorphisms are often called changes of coordinates, since they effect a transition from one local coordinate system to another system of the same kind in their common range Ui n Uj ( Fig. 15. 1 ) .
Fig. 1 5 . 1 .
Definition 4. The number n i n Definition 1 is the
M and is usually denoted dim M.
dimension of the manifold
Definition 5 . If a point r.p - 1 (x) on the boundary 8Hn of the half-space Hn corresponds to a point x E U under the homeomorphism r.p : Hn -+ U, then x is called a boundary point of the manifold M ( and of the neighborhood U). The set of all boundary points of a manifold M is called the boundary of this manifold and is usually denoted 8M.
By the topological invariance of interior points ( Brouwer ' s theorem3 ) the concepts of dimension and boundary point of a manifold are unambiguously 3 This theorem asserts that under a homeomorphism r.p : E -t r.p(E) of a set E C IR. n onto a set r.p(E) C IR. n the interior points of E map to interior points of r.p(E) .
322
15 *Integration of Differential Forms on Manifolds
defined, that is, independent of the particular local charts used in Defini tions 4 and 5. We have not proved Brouwer ' s theorem, but the invariance of interior points under diffeomorphisms is well-known to us (a consequence of the inverse function theorem) . Since it is diffeomorphisms that we shall be dealing with, we shall not digress here to discuss Brouwer ' s theorem.
Example 3. The closed ball If' = {x E lRn [ l x l :S: I } or, as we say, the n-dimensional disk, is an n-dimensional manifold whose boundary is the (n - I )-dimensional sphere sn - l = {x E lRn [ l x l = I } . Remark 2. A manifold M having a non-empty set of boundary points is usu ally called a manifold with boundary, the term manifold (in the proper sense
of the term) being reserved for manifolds without boundary. In Definition I these cases are not distinguished.
The boundary oM of an n-dimensional manifold with boundary M is an (n - I)-dimensional manifold without boundary. Proof. Indeed, oHn = JRn - l , and the restriction to oH n of a chart of the form 'P i H n -+ Ui belonging to an atlas of M generates an atlas of oM. D Proposition 1 .
:
Example 4- Consider the planar double pendulum (Fig. I5.2) with arm a shorter than arm b, both being free to oscillate, except that the oscillations of b are limited in range by barriers. The configuration of such a system is characterized at each instant of time by the two angles o: and (3. If there were no constraints, the configuration space of the double pendulum could be identified with the two-dimensional torus = s; X s� . Under these constraints, the configuration space of the double pendulum is parametrized by the points of the cylinder s; X I� , where s; is the circle, corresponding to all possible positions of the arm a , and I� = {(3 E 1R [ I f3 1 ::::;
T2
Fig. 1 5 . 2 .
15.2 Manifolds
323
,1} is the interval within which the angle (3 may vary, characterizing the position of the arm b. In this case we obtain a manifold with boundary. The boundary of this manifold consists of the two circles S}, x { - ,1} and S}, x { ,1 } , which are the products of the circle S}, and the endpoints { -Ll} and {Ll} of the interval
IJ . Remark 3.
It can be seen from Example 4 just considered that coordinates sometimes arise naturally on M ( a and (3 in this example) , and they them selves induce a topology on M. Hence, in Definition 1 of a manifold, it is not always necessary to require in advance that M have a topology. The essence of the concept of a manifold is that the points of some set M can be parametrized by the points of a set of subdomains of JR n . A natural connec tion then arises between the coordinate systems that thereby arise on parts of M, expressed in the mappings of the corresponding domains of JR n . Hence we can assume that M is obtained from a collection of domains of JR n by exhibiting some rule for identifying their points or, figuratively speaking, ex hibiting a rule for gluing them together. Thus defining a manifold essentially means giving a set of subdomains of JR n and a rule of correspondence for the points of these subdomains. We shall not take the time to make this any more precise by formalizing the concept of gluing or identifying points, introducing a topology on M, and the like. Definition 6. A manifold is compact (resp. connected) as a topological space.
connected) if it is compact (resp.
The manifolds considered in Examples 1-4 are compact and connected. The boundary of the cylinder s; X IJ in Example 4 consists of two indepen dent circles and is a one-dimensional compact , but not connected, manifold. The boundary sn - l = 8B n of the n-dimensional disk of Example 3 is a com pact manifold, which is connected for n > 1 and disconnected (it consists of two points) if n = 1 . The space JR n itself i s obviously a connected noncompact mani fold without boundary, and the half-space H n provides the simplest example of a connected noncompact manifold with boundary. (In both cases the at las can be taken to consist of the single chart corresponding to the identity mapping. )
Example 5 .
Proposition 2.
If a manifold M is connected, it is path connected.
x0
Proof. After fixing a point E M, consider the set Exo of points of M that can be joined to by a path in M. The set Ex o , as one can easily verify from the definition of a manifold, is both open and closed in M. But that means that Ex 0 = M. D
x0
324
15 *Integration of Differential Forms on Manifolds
If to each real n x n matrix we assign the point of � n 2 whose coordinates are obtained by writing out the elements of the matrix in some fixed order, then the group GL(n, �) of nonsingular n x n matrices becomes a manifold of dimension n 2 . This manifold is noncompact (the elements of the matrices are not bounded) and nonconnected. This last fact follows from the fact that GL(n, �) contains matrices with both positive and negative determinants. The points of GL(n , �) corresponding to two such matrices cannot be joined by a path. (On such a path there would have to be a point corresponding to a matrix whose determinant is zero. )
Example 6.
The group 80 ( 2 , �) of orthogonal mappings of the plane � 2 hav. determmant · · · mg equa 1 to 1 consists of matnces of t he c10rm cos. a sin a - sm a cos a and hence can be regarded as a manifold that is identified with the circle the domain of variation of the angular parameter a . Thus 80 (2, �) is a one dimensional compact connected manifold. If we also allow reflections about lines in the plane � 2 , we obtain the group 0 ( 2 , �) of all real orthogonal 2 x 2 matrices. It can be naturally identified with two different circles, correspond ing to matrices with determinants + 1 and - 1 respectively. That is, 0(2, �) is a one-dimensional compact, but not connected manifold.
(
Example 7.
)
Let a be a vector in � 2 and Ta the group of rigid motions of the plane generated by a. The elements of Ta are translations by vectors of the form na, where n E Z. Under the action of the elements of the group Ta each point of the plane is displaced to a point ) of the form + na. The set of all points to which a given point E � 2 passes under the action of the elements of this group of transformations is called its orbit. The property of points of � 2 of belonging to the same orbit is obviously an equivalence relation on � 2 , and the orbits are the equivalence classes of this relation. A domain in � 2 containing one point from each equivalence class is called a fundamental domain of this group of automorphisms (for a more precise statement see Problem 5d) ) . In the present case we can take as a fundamental domain a strip of width lal bounded by two parallel lines orthogonal to a. We need only take into account that these lines themselves are obtained from each other through translations by a and -a respectively. Inside a strip of width less than lal and orthogonal to a there are no equivalent points, so that all orbits having representatives in that strip are endowed uniquely with the coordinates of their representatives. Thus the quotient set � 2 /Ta consisting of orbits of the group Ta becomes a manifold. From what was said above about a fundamental domain, one can easily see that this manifold is homeomorphic to the cylinder obtained by gluing the boundary lines of a strip of width lal together at equivalent points.
Example 8.
x
Ta.h
g x
Now let a and b be a pair of orthogonal vectors of the plane � 2 the group of translations generated by these vectors. In this case a
Example 9.
and
x g(x
15.2 Manifolds
325
fundamental domain is the rectangle with sides a and b. Inside this rectangle the only equivalent points are those that lie on opposite sides. After gluing the sides of this fundamental rectangle together, we verify that the resulting manifold IR 2 /Ta,b is homeomorphic to the two-dimensional torus.
Example 10. Now consider the group Ga , b of rigid motions of the plane IR 2 generated by the transformations a(x, y) = (x+ 1 , 1 -y) and b(x, y) = (x, y+ 1 ) . A fundamental domain for the group Ga , b i s the unit square whose hori zontal sides are identified at points lying on the same vertical line, but whose vertical sides are identified at points symmetric about the center. Thus the resulting manifold IR 2 /G a , b turns out to be homeomorphic to the Klein bottle (see Sect. 1 2 . 1 ) .
We shall not take time t o discuss here the useful and important examples studied in Sect. 12. 1 . 1 5 . 2 . 2 Smooth Manifolds and Smooth Mappings Definition 7. An atlas of a manifold is
smooth (of class C ( k ) or analytic)
if all the coordinate-changing functions for the atlas are smooth mappings ( diffeomorphisms) of the corresponding smoothness class. Two atlases of a given smoothness (the same smoothness for both) are
equivalent if their union is an atlas of this smoothness. Example 11. An atlas consisting of a single chart can be regarded as having1 any desired smoothness. Consider in this connection the atlas on the line IR generated by the identity mapping IR 1 x tp(x) = x E IR 1 , and a second atlas - generated by any strictly monotonic function IR 1 x ip( x) E IR 1 , mapping IR 1 onto IR 1 . The union of these atlases is an atlas having smoothness equal to the smaller of the smoothnesses of ;:p and ;p- 1 . In particular, if ip( x) = x 3 , then the atlas consisting of the two charts what has just been said, {x, x3 } is not smooth, since ;p- 1 (x) = x 1 1 3 . Using we can construct infinitely smooth atlases in IR 1 whose union is an atlas of a preassigned smoothness class C ( k ) . Definition 8. A smooth manifold (of class C ( k ) or analytic) is a manifold M with an equivalence class of atlases of the given smoothness. After this definition the following terminology is comprehensible: topolog ical manifold (of class c< 0 l ) , C ( k ) _manifold, analytic manifold. To give the entire equivalence class of atlases of a given smoothness on a manifold M it suffices to give any atlas A of this equivalence class. Thus we can assume that a smooth manifold is a pair (M, A), where M is a manifold and A an atlas of the given smoothness on M. 3
H
3
H
326
15 *Integration of Differential Forms on Manifolds
The set of equivalent atlases of a given smoothness on a manifold is often called a structure of this smoothness on the manifold. There may be differ ent smooth structures of even the same smoothness on a given topological manifold ( see Example 11 and Problem 3) . Let us consider some more examples in which our main attention is di rected to the smoothness of the coordinate changes.
Example 12. The one-dimensional manifold !RlP' 1 called the real projective line, is the pencil of lines in IR 2 passing through the origin, with the natural
notion of distance between two lines ( measured, for example, by the magni tude of the smaller angle between them ) . Each line of the pencil is uniquely determined by a nonzero direction vector ( x 1 , x 2 ), and two such vectors give the same line if and only if they are collinear. Hence !RlP' 1 can be regarded as a set of equivalence classes of ordered pairs (x 1 , x 2 ) of real numbers. Here at least one of the numbers in the pair must be nonzero, and two pairs are considered equivalent ( identified ) if they are proportional. The pairs (x 1 , x 2 ) are usually called homogeneous coordinates on !RlP' 1 . Using the interpretation of !RlP' 1 in homogeneous coordinates, it is easy to construct an atlas of two charts on !RlP' 1 . Let Ui , i = 1, 2 , be the lines ( classes of pairs (x 1 , x 2 )) in !RlP' 1 for which x i # 0. To each point ( line ) p E U1 there corresponds a unique 2 2 pair ( 1, � 1 ) determined by the number t 21 = � 1 • Similarly the points of the 1 region U2 are in one-to-one correspondence with pairs of the form ( � , 1 ) I and are determined by the number t§ = � . Thus local coordinates arise in u1 and u2 , which obviously correspond to the topology introduced above on !RlP' 1 . In the common range U1 n U2 of these local charts the coordinates they introduce are connected by the relations t§ = (t?) - 1 and t i = (t§) - 1 , which shows that the atlas is not only C ( oo ) but even analytic. It is useful to keep in mind the following interpretation of the manifold !RlP' 1 . Each line of the original pencil of lines is completely determined by its intersection with the unit circle. But there are exactly two such points, diametrically opposite to each other. Lines are near if and only if the cor responding points of the circle are near. Hence !RlP' 1 can be interpreted as a circle with diametrically opposite points identified ( glued together ) . If we take only a semicircle, there is only one pair of identified points on it , the end-points. Gluing them together, we again obtain a topological circle. Thus !RlP' 1 is homeomorphic to the circle as a topological space.
Example 13. If we now consider the pencil of lines passing through the ori gin in JR 3 , or, what is the same, the set of equivalence classes of ordered triples of points (x 1 , x 2 , x 3 ) of real numbers that are not all three zero, we obtain the real projective plane !RlP' 2 . In the regions U1 , U2 , and U3 where x 1 # 0, x 2 # 0, x 3 # 0 respectively, we introduce local coordinate systems ( 1 ' �� ' �� ) = ( 1 ' ti ' t y ) ( t r' t i ) ' ( � ' 1 ' �� ) = ( t§ ' 1 ' t � ) ( t§ ' t � ) ' and ( �31 , �21 , 1 ) = (t 1 , t�, 1) (t§, t�), which are obviously connected by the relarv
rv
rv
327
15.2 Manifolds
tions ti = (tj) - I , ti = t1 (tk) - I , which apply in the common portions of the ranges of these charts. For example, the transition from ( ti , tr) to ( t� , t�) in the domain UI n U2 is given by the formulas
3
The Jacobian of this transformation is - ( ti) and since ti �� , it is defined and nonzero at points of the set UI n u2 under consideration. Thus 1RlP'2 is a two-dimensional manifold having an analytic atlas consist ing of three charts. By the same considerations as in Example 1 2 , where we studied the projective line lRlP' I , we can interpret the projective plane 1RlP' 2 as the two- dimensional sphere S2 c JR 2 with antipodal points identified, or as a hemi sphere, with diametrically opposite points of its boundary circle identified. Projecting the hemisphere into the plane, we obtain the possibility of inter preting 1RlP'2 as a (two-dimensional) disk with diametrically opposite points of its boundary circle identified. -
,
Example 14. The set of lines in the plane JR 2 can be parititioned into two sets: U, the nonvertical lines, and V, the nonhorizontal lines. Each line in U has an equation of the form y u x + u 2 , and hence is characterized by the coordinates ( U I , u 2 ) , while each line in V has an equation x V I Y + v 2 and is determined by coordinates (V I , v2 ) . For lines in the intersection U V we I I =
i
=
n
have the coordinate transformation V I = u1I , v 2 = -u 2 u1 and U I = v1 , u 2 = -v2 v1 I · Thus this set is endowed with an analytic atlas consisting of two charts. Every line in the plane has an equation ax+by+c = and is characterized by a triple of numbers (a, b, c), proportional triples defining the same line. For that reason, it might appear that we are again dealing with the projective plane 1RlP'2 considered in Example 13. However, whereas in 1RlP' 2 we admitted any triples of numbers not all zero, now we do not admit triples of the form c) where c -=f. A single point in 1RlP'2 corresponds to the set of all such triples. Hence the manifold obtained in our present example is homeomorphic to the one obtained from 1RlP' 2 by removing one point . If we interpret 1RlP' 2 as a disk with diametrically opposite points of the boundary circle identified, then, deleting the center of the circle, we obtain, up to homeomorphism, an annulus whose outer circle is glued together at diametrically opposite points. By a simple incision one can easily show that the result is none other than the familiar Mobius band. Definition 9. Let M and N be C (k Lmanifolds. A mapping f : M -+ N is l-smooth (a c U l_mapping) if the local coordinates of the point f(x) E N are c ( lLfunctions of the local coordinates of X E M.
0
(0, 0,
0.
This definition has an unambiguous meaning (one that is independent of the choice of local coordinates) if l :::; k.
328
15 *Integration of Differential Forms on Manifolds
In particular, the smooth mappings of M into JR. 1 are smooth functions on M, and the smooth mappings of JR. 1 (or an interval of JR. 1 ) into M are smooth paths on M. Thus the degree of smoothness of a function f : M --+ N on a manifold M cannot exceed the degree of smoothness of the manifold itself. 1 5 . 2 . 3 Orientation of a Manifold and its Boundary Definition 10. Two charts of a smooth manifold are consistent if the tran sition from the local coordinates in one to the other in their common range is a diffeomorphism whose Jacobian is everywhere positive.
In particular, if the ranges of two local charts have empty intersection, they are considered consistent.
A of a smooth manifold (M, A) is an orienting atlas of M if it consists of pairwise consistent charts. Definition 1 2 . A manifold is orientable if it has an orienting atlas. Other wise it is nonorientable. Two orienting atlases of a manifold will be regarded as equivalent (in the Definition 1 1 . An atlas
sense of the question of orientation of the manifold considered just now) if their union is also an orienting atlas of the manifold. It is easy to see that this relation really is an equivalence relation. Definition 1 3 . An equivalence class of orienting atlases of a manifold in the relation just defined is called an orientation class of atlases of the manifold or an orientation of the manifold.
oriented manifold is a manifold with this class of orien tations of its atlases, that is, with a fixed orientation on the manifold.
Definition 14. An
Thus orienting the manifold means exhibiting (by some means or other) a certain orientation class of atlases on it. To do this, for example, it suffices to exhibit any specific orienting atlas from the orientation class. Various methods used in practice to define an orientation . of manifolds embedded in JR.n are described in Sects. 12.2 and 12.3.
A connected manfiold is either nonorientable or admits ex actly two orientations. Proof. Let A and A be two orienting atlases of the manifold M with diffeo morphic transitions from the local coordinates of charts of one to charts of the other. Assume that;_there is a point p0 E M and two charts of these atlases whose ranges Ui o and Ui o contain po ; and suppose the Jacobian of the change Proposition 3 .
of coordinates of the charts at points of the parameter space corresponding
329
15.2 Manifolds
Po
p
to the point is positive. We shall show that then for every point E M and any charts of the atlases A and A whose ranges contain the Jacobian of the coordinate transformation at corresponding coordinate points is also positive. We begin by making the obvious observation that if the Jacobian of the transformation is positive (resp. negative) at the point for any pair of charts containing in the atlases A and A, then it is positive (resp. negative) at for any such pair of charts, since inside each given atlas the coordinate trans formations occur with positive Jacobian, and the Jacobian of a composition of two mappings is the product of the Jacobians of the individual mappings. Now let E be the subset of M consisting of the points E M at which the coordinate transformations from the charts of one atlas to those of the other have positive Jacobian. The set E is nonempty, since E E . The set E_is open in M . Indeed, for every point E E there exist ranges Ui and Uj of certain charts of the atlases A and A containing The sets Ui and Uj are open in M, so that the set Ui n Uj is open in M. On the connected component of the set Ui n Ui containing which is open in Ui n Ui and in M, the Jacobian of the transformation cannot change sign without vanishing at some point. That is, in some neighborhood of the Jacobian remains positive, which proves that E is open. But E is also closed in M. This follows from the continuity of the Jacobian of a diffeomorphism and the fact that the Jacobian of a diffeomorphism never vanishes. Thus E is a non-empty open�losed subset of the connected set M. Hence E = M, and the atlases A and A define the same orientation on M. Replacing one coordinate, say t 1 by -t 1 in every chart of the atlas A, we obtain the orienting atlas - A belonging to a different orientation class. Since the Jacobians of the coordinate transformations from an arbitrary chart to the charts of A and - A have opposite signs, every atlas that orients M is equivalent either to A or to - A. D Definition 15. A finite sequence of charts of a given atlas will be called a chain of charts if the ranges of any pair of charts having adjacent indices have a non-empty intersection (Ui n Ui + 1 f. 0). Definition 16. A chain of charts is contradictory or disorienting if the Ja cobian of the coordinate transformation from each chart in the chain to the next is positive and the ranges of the first and last charts of the chain inter sect , but the coordinate transformation from the last to the first has negative Jacobian.
p
p
p
p
p
p0 p.
p
p,
p
manifold is orientable if and only if there does not exist a contradictory chain of charts on it. Proof Since every manifold decomposes into connected components whose
Proposition 4. A
orientations can be defined independently, it suffices to prove Proposition 4 for a connected manifold M.
330
15 *Integration of Differential Forms on Manifolds
N e c e s s i t y. Suppose the connected manifold M is orientable and A is an atlas defining an orientation. From what has been said and Proposition 3, every smooth local chart of the manifold M connected with the charts of the atlas A is either consistent with all the charts of A or consistent with all the charts of -A. This can easily be seen from Proposition 3 itself, if we resrict charts of A to the range of the chart we have taken, which can be regarded as a connected manifold oriented by one chart . It follows from this that there is no contradictory chain of charts on M. S u f f i c i en c y. It follows from Definition 1 that there exists an atlas on the manifold consisting of a finite or countable number of charts. We take such an atlas A and number its charts. Consider the chart (U1 , cp1) and any chart (Ui , 'P i ) such that U1 n Ui # 0 . Then the Jacobians of the coordinate transformations 'P l i and 'Pi l are either everywhere negative or everywhere positive in their domains of definition. The Jacobians cannot have values of different signs, since otherwise one could exhibit connected subsets U_ and u+ in ul uui where the Jacobian is negative and positive respectively, and the chain of charts (U1 , cp l ), (U+ , cp1), (Ui , 'P i ), (U_ , 'Pi ) would be contradictory. Thus, changing the sign of one coordinate if necessary in the chart ( Ui , 'Pi ), we could obtain a chart with the same range Ui and consistent with (U1 , cp1). After that procedure, two charts (Ui , 'P i ) and (Uj , 'Pj ) such that U1 n Ui # 0 , ul n uj "I 0 , ui n uj "I 0 are themselves consistent: otherwise we would have constructed a contradictory chain of three charts. Thus, all the charts of an atlas whose ranges intersect U1 can now be considered consistent with one another. Taking each of those charts now as the standard, one can adjust the charts of the atlas not covered in the first stage so that they are consistent . No contradictions arise when we do this, since by hypothesis, there are no contradictory chains on the manifold. Con tinuing this process and taking account of the connectedness of the manifold, we construct on it an atlas consisting of pairwise consistent charts, which proves the orientability of the manifold. D
This criterion for orientability of the manifold, like the considerations used in its proof, can be applied to the study of specific manifolds. Thus, the manifold IRIP' 1 studied in Example 1 2 is orientable. From the atlas shown there it is easy to obtain an orienting atlas of IRIP' 1 . To do this, it suffices to reverse the sign of the local coordinates of one of the two charts constructed there. However, the orientability of the projective line IRIP' 1 obviously also follows from the fact that the manifold IRIP' 1 is homoeomorphic to a circle. The projective plane IRIP'2 is nonorientable: every pair of charts in the atlas constructed in Example 13 is such that the coordinate transformations have domains of positivity and domains of negativity of the Jacobian. As we saw in the proof of Proposition 4, it follows from this that a contradictory chain of charts on IRIP'2 exists. For the same reason the manifold considered in Example 14 is nonori entable, which, as was noted, is homeomorphic to a Mobius band.
15.2 Manifolds
331
The boundary of an orientable smooth n-dimensional man ifold is an orientable (n - I)-dimensional manifold admitting a structure of the same smoothness as the original manifold. Proof. The proof of Proposition 5 is a verbatim repetition of the proof of the
Proposition 5 .
analogous Proposition 2 of Subsect. 12.3.2 for surfaces embedded in
JRn .
0
Definition 17. If A(M) {(Hn, 'Pi , Ui )} U { (JRn , tpj , Uj) } is an atlas that orients the manifold M, then the charts A (aM) {(JRn -1 , 'Pi itmn=�n-l , 8U; )} provide an orienting atlas for the boundary 8M of M. The orientation of the boundary defined by this atlas is called the
orientation of the boundary induced by the orientation of the manifold.
Important techniques for defining the orientation of a surface embedded in JRn and the induced orientation of its boundary, which are frequently used in practice, were described in detail in Sects. 12.2 and 12.3. 15.2.4 Partitions of Unity and the Realization of Manifolds as Surfaces in JRn
In this subsection we shall describe a special construction called a partition of unity. This construction is often the basic device for reducing global problems to local ones. Later on we shall demonstrate it in deriving Stokes ' formula on a manifold, but here we shall use the partition of unity to clarify the possibility of realizing any manifold as a surface in JRn of sufficiently high dimension.
One can construct a function f E c<= l (JR, JR) on lR such that f(x) 0 for l x l 2: 3, f(x) 1 for l x l :::; 1, and 0 < f(x) < 1 for 1 < l x l < 3 . Proof. We shall construct one such function using the familiar function -1 f x2 ) for x =f. 0 e< g(x) = 0 X = 0 '. Previously ( see Exercise 2 of Sect. 5.2) we verified that g E c<= l (JR, JR) by showing that g
{
=
c 10f
{ -(x-1 ) -2
In such a case the nonnegative function
G(x) = also belongs to
e
· e -(x+1 ) -2 for for 0
c<= l (JR, JR) , and along with it
F(x) belongs to this class, since
�
lxl < 1 , l x l 2: 1
the function
JCXJ G(t) dt ITG(t) t
-
F'(x) = G(x)
-
oo
d
j_[ G(t) dt .
15 *Integration of Differential Forms on Manifolds
332
The function F is strictly increasing on [ - 1 , 1] , = 1 for x � 1 . We can now take the required function t o be
F ( x)
F( x)
f(x) = F(x + 2) + F( -x - 2) - 1 .
=
0 for x :::; - 1 , and
D
Remark. If f : JR. -t JR. is the function constructed in the proof of the lemma, then the function defined in JR_n is such that () E C ( oo ) (JR.n , JR.), 0 :::; () :::; 1, at every point x E lR.n , ()(x) = 1 on the interval I(a) = {x E JR.n j l x i - ai l :::; 1 , i = 1 , . . . , n } , and the support supp () of the function () is contained in the interval l(a) = {x E JR. n I l x i - ai l :::; 3, i = 1 , . . . , n } . Definition 18. Let M be a cC k Lmanifold and X a subset of M. The system E = a E A} of functions E cC k l ( M, lR. is a
{ea,
ea
)
C ( k ) partition of unity
on X if 1 ° 0 :::; ea(x) :::; 1 for every function ea E E and every x E M; 2 ° each point x E X has a neighborhood U(x) in M such that all but a finite number of functions of E are identically zero on U(x); 3 ° L ea ( x) = 1 on X . ea E E
We remark that by condition 2 ° only a finite number of terms in this last sum are nonzero at each point x E X.
{o,13, j3 E B} be an open covering of X C M. We the partition of unity E = { ea, E A} on X is subordinate to the covering 0 if the support of each function in the system E is contained in at
Definition 19. Let 0 =
a
say that
least one of the sets of the system
0.
Let {(Ui , 'Pi ), i = 1, . . . , m } be a finite set of charts of some C ( k ) atlas of the manifold M, whose ranges Ui , i = 1, . . . , m, form a covering of a compact set K M . Then there exists a C ( k ) partition of unity on K subordinate to the covering {Ui , i = 1 , . . . , m } . Proof. For any point x0 E K we first carry out the following construction. We choose successively a domain Ui containing x0 corresponding to a chart 'P i : JR.n Ui (or 'Pi : H n Ui ), the point to = cpi 1 (xo) E JR.n (or Hn ), the function ()(t - t 0 ) (where ()(t) i s the function shown i n the remark t o the lemma ) , and the restriction ()t o of ()(t - t 0 ) to the parameter domain of 'P i · Let It o be the intersection of the unit cube centered at to E JR. n with the parameter domain of 'P i . Actually ()t o differs from () ( t - t0 ) and It o differs Proposition 6.
c
-t
-t
from the corresponding unit cube only when the parameter domain of the
15.2 Manifolds
333
chart 'Pi is the half-space H n . The open sets 'P i (It ) constructed at each point x E K and the point t = 'Pi 1 (x), taken for all admissible values of i = 1, 2, . . . , m, form an open covering of the compact set K. Let { 'PiJ ( lti ) , j = 1, 2, . . . , l} be a finite covering of K extracted from it. It is obvious that 'Pi)Itj ) c uij " We define on uij the function Oi (x) = Btj 0
0
·
e 2 (x) = 02 (x)(1 - 01 (x)), . . . , e 1 (x) = B; (x) (1 - B;_ l (x))
· . . . ·
(1 (x)) l - 01 form the required partition of unity. We shall verify only that I: e 1 (x) 1 j= l on K, since the system of functions { e 1 , . . . , e l } obviously satisfies the other conditions required of a partition of unity on K subordinate to the covering {Ui, , . . . , Ui } {Ui , i = 1, . . . , m } . But l 1 - I >J (x) = (1 - 01 (x)) (1 - 01 (x)) on K , j= l since each point x E K is covered by some set 'P ij Uti ) on which the corre sponding function oj is identically equal to 1. Corollary 1 . If M is a compact manifold and A a C ( k ) atlas on M, then there exists a finite partition of unity { e 1 , . . . , e 1 } on M subordinate to a covering of the manifold by the ranges of the charts of A. Proof. Since M i s compact , the atlas A can b e regarded as finite. We now have the hypotheses of Proposition 6, if we set K = M in it . 0 Corollary 2 . For every compact set K contained in a manifold M and every open set G M containing K, there exists a function f M lR with smoothness equal to that of the manifold and such that f ( x ) 1 on K and supp f G. Proof. Cover each point x E K by a neighborhood U(x) contained in G and inside the range of some chart of the manifold M. From the open covering {U(x), x E K} of the compact set K extract a finite covering, and construct a l partition of unity { e 1 , . . . , e 1 } on K subordinate to it . The function f = I: e i i= l =
c
1
· . . . ·
=
0
0
C
C
:
=
--+
is the one required. 0
Every (abstractly defined) compact smooth n-dimensional manifold M is diffeomorphic to some compact smooth surface contained in JRN of sufficiently large dimension N . Corollary 3.
334
15 *Integration of Differential Forms on Manifolds
Proof. So as not to complicate the idea of the proof with inessential details, we carry it out for the case of a compact manifold M without boundary. In that case there is a finite smooth atlas A = {'P i : I ---+ i , i = . . . , m} on M, where I is an open n-dimensional cube in JRn . We take a slightly smaller cube I' such that I' C I and the set = 'P i (I'), i = . . . , m} still forms a covering of M. Setting K = I' , G = I, and M = JRn in Corollary 2, we construct a function f E C ( 00) (lR n , JR.) such that f ( t) = for t E I' and supp f C I. We now consider the coordinate functions t} ( x), . . . , ti ( x) of the mappings rpi 1 : ---+ I, i = . . . , m, and use them to introduce the following function on M: for X E i ,
U 1,
{Uf
U1
1,
1
1,
i
=
1, . . . , m ;
U for X � Ui ,
k = 1, . . . , n .
3
f-t
At every point x E M the rank of the mapping M x y(x) = and equal to n. Indeed, (yf , . . . , y]', . . . y;,, . . 1. , y;::, ) (x) E JRm · n is maximal if x E Uf, then
o
o
f-t
o
3
o
f-t
=
o
=
o
0
For information on the general Whitney embedding theorem for an ar bitrary manifold as a surface in JRn the reader may consult the specialized geometric literature. 1 5 . 2 . 5 Problems and Exercises 1 . Verify that the object (a manifold) introduced by Definition 1 does not change if we require only that each point x E M have a neighborhood U(x) C M homeo morphic to an open subset of the half-space H n .
2. Show that a) the manifold GL(n, IR) of Example 6 is noncompact and has exactly two connected components; b) the manifold SO (n, IR) (see Example 7) is connected;
15.2 Manifolds
335
c) the manifold O ( n, IR. ) is compact and has exactly two connected components. 3. Let (M, A) and (M, A) be manifolds with smooth structures of the same degree of smoothness C ( k ) on them. The smooth manifolds (M, A) and (M, A) (smooth structures) are considered isomorphic if there exists a C ( k ) mapping f : M -+ M having a c < k l inverse f - 1 : M -+ M in the atlases A, A. a) Show that all structures of the same smoothness on IR. 1 are isomorphic. b) Verify the assertions made in Example 1 1 , and determine whether they con tradict a) . c) Show that on the circle 8 1 (the one-dimensional sphere) any two c < = l struc tures are isomorphic. We note that this assertion remains valid for spheres of dimen sion not larger than 6, but on 8 7 , as Milnor 4 has shown, there exist nonisomorphic c < = l structures.
4. Let S be a subset of an n-dimensional manifold M such that for every point xo E S there exists a chart x = rp(t) of the manifold M whose range U contains xo , and the k-dimensional surface defined by the relations t k + l = 0, . . . , t n = 0 corresponds to the set S n U in the parameter domain t = (t 1 , , t n ) of rp. In this case S is called a k-dimensional submanifold of M. a) Show that a k-dimensional manifold structure naturally arises on S, induced by the structure of M and having the same smoothness as the manifold M. b) Verify that the k-dimensional surfaces S in IR. n are precisely the k-dimensional submanifolds of IR. n . c) Show that under a smooth homeomorphic mapping f IR. 1 -+ T2 of the line 1 IR. into the torus T2 the image f(IR. 1 ) may be an everywhere dense subset of T 2 and in that case will not be a one-dimensional submanifold of the torus, although it will be an abstract one-dimensional manifold. d) Verify that the extent of the concept "submanifold" does not change if we consider S C M a k-dimensional submanifold of the n-dimensional manifold M when there exists a local chart of the manifold M whose range contains xo for every point Xo E S and some k-dimensional surface of the space IR. n corresponds to the set S n U in the parameter domain of the chart. • • .
:
5. Let X be a Hausdorff topological space (manifold) and G the group of homeo morphic transformations of X. The group G is a discrete group of transformations of X if for every two (possibly equal) points x 1 , x 2 E X there exist neighborhoods U1 and U2 of them respectively, such that the set {g E G l g(U1 ) n U2 =/= 0} is finite. a) It follows from this that the orbit {g(x) E X I g E G } of every point x E X is discrete, and the stabilizer Gx = {g E G l g(x) = x} of every point x E X is finite. b) Verify that if G is a group of isometries of a metric space, having the two properties in a) , then G is a discrete group of transformations of X. c) Introduce the natural topological space (manifold) structure on the set X/G of orbits of the discrete group G.
4 J. Milnor (b. 1931) - one of the most outstanding modern American mathemati cians; his main works are in algebraic topology and the topology of manifolds.
336
15 *Integration of Differential Forms on Manifolds
d) A closed subset F of the topological space (manifold) X with a discrete group G of transformations is a fundamental domain of the group G if it is the closure of an open subset of X and the sets g(F) , where g E G, have no interior points in common and form a locally finite covering of X. Show using Examples 8-10 how the quotient space X/G (of orbits) of the group G can be obtained from F by "gluing" certain boundary points. 6. a) Using the construction of Examples 12 and 13, construct n-dimensional pro jective space JR]pm . b) Show that IRIP'n is orientable if n is odd and nonorientable if n is even. c) Verify that the manifolds 80 (3, JR) and IRIP'3 are homeomorphic. 7. Verify that the manifold constructed in Example 14 is indeed homeomorphic to the Mobius band. 8 . a) A Lie group 5 is a group G endowed with the structure of an analytic manifold such that the mappings (9 1 , 92 ) >-+ 91 · 92 and 9 >-+ 9 - 1 are analytic mappings of G x G and G into G. Show that the manifolds in Examples 6 and 7 are Lie groups. b) A topological group (or continuous group) is a group G endowed with the structure of a topological space such that the group operations of multiplication and inversion are continuous as mappings G x G -+ G, and G -+ G in the topology of G. Using the example of the group Q of rational numbers show that not every topological group is a Lie group. c) Show that every Lie group is a topological group in the sense of the definition given in b) . d) It has been proved 6 that every topological group G that is a manifold is a Lie group (that is, as a manifold G admits an analytic structure in which the group becomes a Lie group) . Show that every group manifold (that is, every Lie group) is an orientable manifold. 9. A system of subsets of a topological space is locally finite if each point of the space has a neighborhood intersecting only a finite number of sets in the system. In particular, one may speak of a locally finite covering of a space. A system of sets is said to be a refinement of a second system if every set of the first system is contained in at least one of the sets of the second system. In particular it makes sense to speak of one covering of a set being a refinement of another. a) Show that every open covering of !R n has a locally finite refinement. b) Solve problem a) with !R n replaced by an arbitrary manifold M. c) Show that there exists a partition of unity on !R n subordinate to any preas signed open covering of !R n . d) Verify that assertion c) remains valid for an arbitrary manifold.
5 S. Lie ( 1842-1899) - outstanding Norwegian mathematician, creator of the theory of continuous groups (Lie groups) , which is now of fundamental importance in geometry, topology, and the mathematical methods of physics; one of the winners of the International Lobachevskii Prize (awarded in 1897 for his work in applying group theory to the foundations of geometry) . 6 This i s the solution t o Hilbert's fifth problem.
15.3 Differential Forms and Integration on Manifolds
337
15.3 Differential Forms and Integration on Manifolds 15.3. 1 The Tangent Space to a Manifold at a Point We recall that to each smooth path JR. 3 t 8 x(t) E JR.n (a motion in JR.n ) passing through the point x0 = x(t0) E JR.n at time t 0 we have assigned the instantaneous velocity vector � = (e ' . . . ' � n ) : �( t ) = x( t ) = (x 1 ' . . . ' x n ) ( to ) . The set of all such vectors � attached to the point x0 E JR.n is naturally identified with the arithmetic space JR.n and is denoted TlR.�0 (or Tx 0 ( JR. n ) ) . In TlR.�0 one introduces the same vector operations on elements � E TlR.�0 as on the corresponding elements of the vector space JR.n . In this way a vector space TlR.�0 arises, called the tangent space to JR.n at the point x0 E JR.n . Forgetting about motivation and introductory considerations, we can now say that formally TlR.�0 is a pair (x0, JR.n ) consisting of a point x 0 E JR.n and a copy of the vector space JR.n attached to it. Now let M be a smooth n-dimensional manifold with an atlas A of at least C ( l ) smoothness. We wish to define a tangent vector � and a tangent space TMp0 to the manifold M at a point Po E M. To do this we use the interpretation of the tangent vector as the instan taneous velocity of a motion. We take a smooth path JR.n 3 t 8 p ( t ) E M on the manifold M passing through the point p0 = p( t0) E M at time to. The parameters of charts (that is, local coordinates) of the manifold M will be denoted by the letter x here, with the subscript of the corresponding chart and a superscript giving the number of the coordinate. Thus, in the pa rameter domain of each chart ( Ui , Cf' i ) whose range Ui contains p0 , the path t ..24 cpi 1 o p(t) = x i (t) E JR.n (or H n ) corresponds to the path 'Y · This path is smooth by definition of the smooth mapping JR. 3 t 8 p(t) E M . Thus, in the parameter domain of the chart (Ui , Cf' i ), where Cf' i is a mapping p = Cf'i (xi ), there arises a point x i (to) = cpi 1 (po ) and a vector �i = x i (to) E TJR.�i ( t o ) " In another such chart (Uj, Cf'J ) these objects will be respectively the point Xj ( to ) = cpj 1 (po ) and the vector �j = Xj ( t o ) E TJR.�i ( t o ) " It is natural to regard these as the coordinate expressions in different charts of what we would like to call a tangent vector � to the manifold M at the point p0 E M . Between the coordinates Xi and Xj there are smooth mutually inverse transition mappings (15.30) as a result of which the pairs by the relations
(x i (t0), �i) , (xJ (t0), �j) turn out to be connected
Xi (to) = Cf/Ji ( Xj (to)) , �i = cpj i (xJ (to))�J ,
Xj (to) = Cf'iJ ( xi (to) ) , �j = cp�j (x i (to))�i .
( 15.31) ( 1 5.32)
338
15 *Integration of Differential Forms on Manifolds
Equality ( 1 5.32) obviously follows from the formulas
obtained from (15.30) by differentiation.
tangent vector � to the manifold M at the point p E M is defined if a vector �i is fixed in each space TJR�, tangent to JR n at the point x i corresponding to p in the parameter domain of a chart (Ui , VJi ), where Ui p, in such a way that ( 1 5.32) holds. If the elements of the Jacobian matrix IPJ i of the mapping IPi i are written
Definition 1. We shall say that a 3
out explicitly as ::� , we find the following explicit form for the connection J between the two coordinate representations of a given vector �:
k = 1 , 2, . . . , n ,
(15.33)
where the partial derivatives are computed at the point Xj =
It can be seen from formulas (15.32) and ( 1 5.33) that the vector-space structure introduced in TMp is independent of the choice of individual chart, that is, Definition 2 is unambiguous in that sense. Thus we have now defined the tangent space to a manifold. There are various interpretations of a tangent vector and the tangent space (see Problem 1 ) . For example, one such interpretation is to identify a tangent vector with a linear functional. This identification is based on the following observation, which we make in lR n . Each vector � E TlR�0 is the velocity vector corresponding to some smooth path x = x ( t ) , that is, � = x ( t ) l t = t o with x0 = x ( t0 ) . This makes it possible to define the derivative D� f ( ) of a smooth function f defined on JR n (or in a neighborhood of x0 ) with respect to the vector � E TlR�0 . To be specific,
x0
d Dd(x0 ) : = - ( ! x ) ( t ) l t -- t0 , dt o
(15.34)
15.3 Differential Forms and Integration on Manifolds
339
that is,
( 1 5 .35) Def(xo) = f'(xo)� , where f'(x0) is the tangent mapping to f (the differential of f) at a point xo. The functional De CC l ) ( JR. n , JR. ) JR. assigned to the vector � E TlR.�0 by the formulas ( 15.34) and ( 15.35) is obviously linear with respect to f. It is also clear from ( 15.35) that for a fixed function f the quantity Def(xo) is a linear function of �, that is, the sum of the corresponding linear functionals corresponds to a sum of vectors, and multiplication of a functional De by a number corresponds to multiplying the vector � by the same number. Thus there is an isomorphism between the vector space TlR.�0 and the vector space of corresponding linear functionals De. It remains only to define the linear functional De by exhibiting a set of characteristic properties of it , in order :
-+
to obtain a new interpretation of the tangent space TlR.�0 , which is of course isomorphic to the previous one. We remark that , in addition to the linearity indicated above, the func tional De possesses the following property:
De( ! · g)(xo) = Def(xo) · g(xo) + f(xo) · Deg(xo) .
( 1 5.36)
This is the law for differentiating a product . In differential algebra an additive mapping a f-t a' of a ring A satisfying the relation (a· b)' = a' · b+a·b' is called derivation (more precisely derivation of lthe ring A) Thus the functional De : C ( l ) ( JR.n , JR. ) is a derivation of the ring C ( ) (JR.n , JR. ) . But De is also linear relative to the vector-space structure of
cC l l (JR.n , JR. ) .
One can verify that a linear functional the properties
l C ( 00 ) ( JR.n , JR. ) :
-+
JR. possessing
( 15.37) l( cx f + {3g) = cx l(f) + {3l(g) , cx , {3 E JR. , ( 1 5.38) l(f · g) = l(f)g(xo) + f(xo)l(g) , has the form De, where � E TlR.�0 • Thus the tangent space TlR.�0 to JR.n at xo can be interpreted as a vector space of functionals (derivations) on cC = ) (JR.n , JR. ) satisfying conditions ( 1 5.37) and ( 15.38) . The functions De , f(xo) = �f(x) i x = x o that compute the corresponding partial derivative of the function f at x0 correspond to the basis vectors e 1 , . . . , e n of the space TlR.�0 • Thus, under the functional interpretation of TlR.�0 one can say that the functionals { £r, . . . , a�n } i x = xo form a basis of �o · If � = ( e , . . . , � n ) E TlR.�0 , then the operator De corresponding to the vector � has the form De = � k a�• . In a completely analogous manner the tangent vector � to an n dimensional C ( oo ) manifold M at a point Po E M can be interpreted (or defined) as the element of the space of derivations l on cC =l (M, JR. ) having TJR.
340
15 *Integration of Differential Forms on Manifolds
xo
Po
properties ( 1 5 .37) and (15.38) , of course being replaced by in rela tion (15.38) , so that the functional l is connected with precisely the point E M. Such a definition of the tangent vector and the tangent space TMpo does not formally require the invocation of any local coordinates, and in that sense it is obviously invariant. In coordinates . . . , x n ) of a local chart ( Ui , 'P i ) the operator l has the form + · · · + 8�7! De; . The are naturally called the coo �dinates of the tangent vector numbers (el , . . . l E T Mp0 in coordinates of the chart ( Ui , 'P i ) · By the laws of differentiation, the coordinate representations of the same functional l E T Mp0 in the charts ( Ui , 'Pi ) , ( Uj , rp1 ) are connected by the relations
Po
e
(x 1 ei, =
ef/;r
, er)
n n n a k a a a , e = = ) : f ( m I: I: I: cr I: a cr k=l ax� m=l x k= l m=l a :n axk n
•
which of course duplicate (15.33 ) .
1
1
(15.33' )
•
1 5 . 3 . 2 Differential Forms on a Manifold
Let us now consider the space T * Mp conjugate to the tangent space that is, T * Mp is the space of real-valued linear functionals on TMp.
TMP ,
T * Mp conjugate to the tangent space TMp to the M at the point p E M is called the cotangent space to M at p. If the manifold M is a c < oo l manifold, f E c < oo l ( M, IR ) , and le is the derivation corresponding to the vector E TM , then for a fixed f E C ( 00 ) ( M, lR) the mapping H l£.1 will obviously be an element of the space T * Mp. In the case M !R n we obtain H De f(p) f' (p)e, so that the Definition 3 . The space
manifold
e o e e = = resulting mapping e le f is naturally called the differential of the function f at p, and is denoted by the usual symbol df(p) . If T!R n ( ) (or TH n _ 1 when p E a M) is the space corresponding to (P ) P H
_1
the tangent space TMP in the chart ( U0 , rp0 ) on the manifold M, it is natural to regard the space T * !R n'Pcx_ 1 ( conjugate to TJR1'Pcx_ 1 as the representative of (P) P) the space T * Mp in this local chart. In coordinates of a local chart . . . , dx n } in the conjugate space corresponds to ( Uo , rp0 ) the dual basis { 8 8 } of T!Rn'Pcx. 1 (p) (or TH'Po.n 1 (p ) if p E We recall that the basis { a;:r , . . . , 8 Xa Xa 8j . The expressions for these dual bases in = i , so that dx i ( 8�j ) [)x i [) [) another chart ( Uf3 , 'P f3 may turn out to be not so simple, for {)T at x fl x fl axr"' • j dX i - � d X . 'P cx
'Pcx
dx 1 ,
n
dxi (e) e o
Bx 1
)
{3
Definition 4 . We say that a
=
_
_
(x�, . . . , x�) aM).
=
differential form w m of degree m is defined on
an n-dimensional manifold M if a skew-symmetric form lR is defined on each tangent space TMp to M, p E M.
wm (p) : (TMp ) m
--t
15.3 Differential Forms and Integration on Manifolds
341
In practice this means only that a corresponding m-form wa (x a ) , where , ) ) corresponding X a = cp� 1 (p), is defined on each space TlRn'Pa_ , ( ) (or THn_ 'Pa ( to TMo in the chart (Ua , 'Pa ) of the manifold M. The fact that two such forms wa (x a ) and w13 (xf3) are representatives of the same form w(p ) can be P
P
expressed by the relation
( 15.39)
X a and Xf3 are the representatives of the point p E M, and ( 6 ) a , . . . , ( �m ) a and ( 6 )f3, . . . , ( �m )/3 are the coordinate representations of the vectors 6 , . . . , �m E T Mp in the charts (Ua , 'Pa ), (U13 , 'P!3 ) respectively. in which
In more formal notation this means that =
( 15.31') X a = 'Pf3a (Xf3) , Xf3 'Pa f3(X a ) , = 'P (x/3) /3 = 'P� (15.32') ) (x , ' �a �a �!3 � f3 a �a where, as usual, 'Pf3 a and 'Pa /3 are respectively the functions cp�1 'P/3 and cp� 1 'Pa for the coordinate transitions, and the tangent mappings to them 'P�a =: ('Pf3a ) * , cp�13 =: ('Pa f3 ) * provide an isomorphism of the tangent spaces to !Rn (or Hn ) at the corresponding points X a and X f3 · As stated in Subsect . 1 5 . 1 .3, the adjoint mappings ('P �a ) * =: 'P� a and (cp �13 ) * =: cp� 13 provide the o
o
transfer of the forms, and the relation ( 15.39) means precisely that
( 15.391) where a and f3 are indices (which can be interchanged) . The matrix (d;, ) of the mapping cp �13 (x a ) is known: (c{ ) Thus, if ( 1 5.40) and
bj 1 , ... ,j m dx�1 1\ · · · 1\ dx�m ,
( 15.41)
then according to Example 7 of Sect. 15. 1 we find that
where �, as always, denotes the determinant of the matrix of corresponding partial derivatives.
342
15 *Integration of Differential Forms on Manifolds
Thus different coordinate expressions for the same form w can be ob tained from each other by direct substitution of the variables (expanding the corresponding differentials of the coordinates followed by algebraic transfor mations in accordance with the laws of exterior products) . If we agree to regard the form W0 as the transfer of a form w defined on a manifold to the parameter domain of the chart (U0 ,
a C ( k)
i,. ( l:<:; h <· · < i,. :<:; n
form if the coefficients ai 1
. . .
w on an n-dimensional manifold M is xa
) of its coordinate representation
are C ( k ) functions in every chart ( U0 ,
exterior differential
is the linear operator d : ilk'
DJ:'_i1 possessing the following properties: 1 ° On every function f E D� the differential d : D�
·
--t
--t DL equals the 1 usual differential df of this function. 2 ° d : (w m 1 l\ w m 2 ) = dw m 1 l\ w m 2 + ( - l ) m1 W m1 /\ dw m 2 , where w m 1 E DJ:'1 and w m 2 E D;:' 2 . 3 ° d 2 : = d 0 d = 0.
This last equality means that d ( dw ) is zero for every form w. Requirement 3 ° thus presumes that we are talking about forms whose smoothness is at least c < 2 l . In practice this means that we are considering a c < 00 ) manifold M and the operator d mapping ,a m to ,a m +I .
15.3 Differential Forms and Integration on Manifolds
343
A formula for computing the operator d in local coordinates of a specific chart (and at the same time the uniqueness of the operator d) follows from the relation d
L Ci 1 ... i m ( x ) dx i ' A · · · A dxi m ) = ( 1-<;i, <···
L dci, ... i m ( x ) A dx i ' A · · · A dx i m + 1-<;i, <··-
( 15.43)
0)
The existence of the operator d now follows from the fact that the operator defined by (15.43) in a local coordinate system satisfies conditions 1 ° , 2 ° , and 3° of Definition 6. It follows in particular from what has been said that if W a = cp�w and wp = cp{J w are the coordinate representations of the same form w , that is, Wa = 'P� p Wf3 , then dwa and dwp will also be the coordinate representations of the same form (dw ) , that is, dw a = 'P� p dwp . Thus the relation d(cp� {3 wf3 ) = 'P� p (dwp) holds, which in abstract form asserts the commutativity dcp*
= cp* d
( 1 5.44)
of the operator d and the operation cp* that transfers forms. 15.3.4 The Integral of a Form over a Manifold Definition 7. Let M be an n-dimensional smooth oriented manifold on
which the coordinates x 1 , . . . , x n and the orientation are defined by a sin gle chart 'P x : Dx ---+ M with parameter domain Dx C JR n . Then
j = J a (x) dx 1 A · · · A dxn , Dx w:
( 1 5.45)
M
where the left-hand side is the usual integral of the form w over the oriented manifold M and the right-hand side is the integral of the function f ( x ) over the domain Dx . If 'P t : D t ---+ M i s another atlas of M consisting o f a single chart defining the same orientation on M as 'P x : Dx ---+ M, then the Jacobian det cp' ( t ) of the function x = cp( t ) of the coordinate change is everywhere positive in Dt . The form cp* ( a ( x ) dx 1
A · · · A dx n ) = a ( x ( t ) ) det cp ( t ) dt 1 A · · · A dtn '
in Dt corresponds to the form w . By the theorem on change of variables in a multiple integral we have the equality
15 *Integration of Differential Forms on Manifolds
344
J a(x) dx 1 · · · dxn = J a (x(t) ) det cp' (t) dt 1 . . · dtn , D,
Dx
which shows that the left-hand side of (15.45) is independent of the coordinate system chosen in M. Thus, Definition 7 is unambiguous. Definition 8 . The support of a form w defined on a manifold M closure of the set of points x E M where w (x) =/= 0.
IS
the
The support of a form w is denoted by supp w. In the case of 0-forms, that is, functions, we have already encountered this concept. Outside the support the coordinate representation of the form in any local coordinate system is the zero form of the corresponding degree. Definition 9. A form w defined on a manifold supp w is a compact subset of M.
M is of compact support if
Definition 10. Let w be a form of degree n and compact support on an n dimensional smooth manifold M oriented by the atlas A. Let 'P i : Di ---t Ui , { (Ui , cp) , i = 1 , . . . , m } be a finite set of charts of the atlas A whose ranges U1 , . . . , Um cover supp w, and let e 1 , . . . , e k be a partition of unity subordinate to that covering on supp w. Repeating some charts several times if necessary, we can assume that m = k, and that supp e i C Ui , i = 1 , . . . , m. The integral of a form w of compact support over the oriented manifold M is the quantity
J w : = f J cp; (ei w) ,
M
(15.46)
t= I D;
where cp; ( e i w) is the coordinate representation of the form e i w I U; in the domain D i of variation of the coordinates of the corresponding local chart. Let us prove that this definition is unambiguous. Let A = { 0j : jjj ---t uj } be a second atlas defining the same smooth structure and orientation on M as the atlas A, let ul ' . . . ' Um be the cor responding covering of supp w ' and let el ' . . . ' eii, a partition of unity on supp w subordinate to this covering. We introduce the functions fij = e i ej , i = 1 , . . . , m , j = 1 , . . . , iii , and we set Wij = jijW · We remark that supp Wij c wij = ui n uj . From this and from the fact that Definition 7 of the integral over an oriented manifold given by a single chart is unambiguous it follows that
Proof.
J cp; (wij ) = J
D;
'Pil(Wij )
cpt (w ij ) =
J
0j (wij ) =
J 0j (wij ) .
Dj
15.3 Differential Forms and Integration on Manifolds
345
Summing these equalities on i from 1 to m and on j from 1 to iii , taking iii
m
account of the relation L fiJ interested int.
i= 1 = ej , jL= 1 fiJ = e i , we find the identities we are
D
15.3.5 Stokes' Formula
Let M be an oriented smooth n-dimensional manifold and w a smooth differential form of degree n - 1 and compact support on M. Then (15.47)
Theorem.
where the orientation of the boundary aM of the manifold M is induced by the orientation of the manifold M. If aM = 0 , then f dw = 0. M
Proof.
Without loss of generality we may assume that the domains of vari ation of the coordinates (parameters) of all local charts of the manifold M are either the open cube I = {x E JR n l 0 < x i < 1, i = 1, . . . , n} , or the cube i = {x E JRn l 0 < x 1 ::::; 1 /\ 0 < x i < 1, i = 1, . . . , n} with one (definite! ) face adjoined to the cube I. By the partition of unity the assertion of the theorem reduces to the case when supp w is contained in the range U of a single chart of the form r.p : I --+ U or r.p : i --+ U. In the coordinates of this chart the form w has the form n w = L a i ( x ) dx 1 1\ · · · I\ dx i 1\ · · · 1\ dxn ,
i= 1
where the frown � , as usual, means that the corresponding factor is omitted. By the linearity of the integral, it suffices to prove the assertion for one term of the sum:
wi = ai ( x ) dx 1
1\
· · · I\
dx i
The differential of such a form is the n-form
1\
···
1\
dx n
.
(
15.48 )
aai 1 n ( 15.49 ) = ( - 1)i- 1 � ux' ( X ) dX · · · dX . For a chart of the form : I --+ U both integrals in ( 15.47 ) of the cor responding forms ( 15.48 ) and ( 15.49 ) are zero: the first because supp a i I and the second for the same reason, if we take into account Fubini ' s theorem 1 and the relation J � dx i = a i (1) - a i (O) = 0. This argument also covers the 0 case when aM = 0. dW i
r.p
1\
1\
C
346
15 *Integration of Differential Forms on Manifolds
Thus it remains to verify (15.47) for a chart rp : i ---+ U. If i > 1, both integrals are also zero for such a chart, which follows from the reasoning given above. And if i 1, then
=
I I ( I aa i I ) = 1 . . . 1 1 ox i (x) dx dx 2 . . · dxn = 0I 0I 0 = 1 · · · 1 a i (1, x2 , · · · , xn ) dx2 · · · dxn = 1 WI = 1 WI . o
o
au
aM
Thus formula (15.47) is proved for n > 1. The case n 1 is merely the Newton-Leibniz formula (the fundamental theorem of calculus) , if we assume that the endpoints a and f3 of the oriented interval [a, {3] are denoted a_ and {3+ and the integral of a 0-form g (x) over such an oriented point is equal to -g(a) and + g ( f3 ) respectively. D
=
We now make some remarks on this theorem.
Remark 1 .
Nothing is said in the statement of the theorem about the smooth ness of the manifold and the form w. In such cases one usually assumes that each of them is c < = l . It is clear from the proof of the theorem, however, that formula (15.47) is also true for forms of class c < 2 l on a manifold admitting a form of this smoothness.
M
M
Remark 2. It is also clear from the proof of the theorem, as in fact it was already from the formula (15.47), that if supp w is a compact set contained strictly inside that is, supp w n 0, then f dw 0.
aM = M, = M Remark 3. If M is a compact manifold, then for every form w on M the support supp w, being a closed subset of the compact set M, is compact. Consequently in this case every form w on M is of compact support and Eq. (15.47) holds. In particular, if M is a compact manifold without boundary, then the equality J dw = 0 holds for every smooth form on M. M
Remark 4.
For arbitrary forms w (not of compact support) on a manifold that is not itself compact, formula (15.47) is in general not true. Let us consider, for example, the form w x �� ���x in a circular annulus { ( x, y ) E IR 2 1 1 ::; x2 + y2 ::; 2 } , endowed with standard Cartesian coordinates. In this case is a compact two-dimensional oriented manifold,
M=
=
M
15.3 Differential Forms and Integration on Manifolds
whose boundary aM consists of the two circles Ci = {(x, y) E i = 1 , 2. Since dw = 0 , we find by formula ( 15.47) that
where both circles
347
� 2 l x 2 +y 2 = i},
C1 and C2 are traversed counterclockwise. We know that M = M \ C1 , then aM = C2 and dw = 0 f 27r = w.
Hence, if we consider the manifold
j
M 15.3.6
j
a Ni
Problems and Exercises
a) We call two smooth paths 'Yi : JR. -+ M, i = 1 , 2 on a smooth manifold M tangent at a point p E M if "{ 1 (0) = "{2 (0) = p and the relation ( 15.50) lrp - 1 o "{ 1 (t) - rp - 1 o "{2 (t) l = o (t) as t -+ 0 holds in each local coordinate system rp : JRn -+ U (or rp : H n -+ U) whose range U contains p. Show that if (15.50) holds in one of these coordinate systems, then it holds in any other local coordinate system of the same type on the smooth manifold M. b) The property of being tangent at a point p E M is an equivalence relation on the set of smooth paths on M passing through p. We call an equivalence class a bundle of tangent paths at p E M. Establish the one-to-one correspondence ex hibited in Subsect. 15.3. 1 between vectors of T Mp and bundles of tangent paths at the point p E M. c) Show that if the paths "! 1 and "!2 are tangent at p E M and f E C (ll (M, JR) , then df 0 "{ 1 (0) = df 0 "{2 (0) . dt dt d) Show how to assign a functional [ = l � (= D� ) : cC ool (M, JR) -+ JR. possessing properties ( 15.37) and (15.38) , where Xo = p, to each vector e E TMp . A functional possessing these properties is called a derivation at the point p E M. Verify that differentiation l at the point p is a local operation, that is, if /1 , h E cC oo ) and /1 (x) = h (x) in some neighborhood of p, then l/1 = l/2 . e) Show that if x 1 , . . . , xn are local coordinates in a neighborhood of the point p, then l = E (lx i ) a� ' , where a� ' is the operation of computing the partial derivative i= l with respect to x i at the point x corresponding to p. (H i n t. Write the function 1.
15 *Integration of Differential Forms on Manifolds
348
J
l
U (p )
: M -+ JR. in local coordinates; remember that the expansion f(x)
i it= l x gi(x) holds for the function f E Bi(o) l!r Co), i 1 , ) =
=
c < = l (JR. n , JR) , where
9i
E
=
f(O) +
c < = l (JR. n , JR) and
. . . , n.
f) Verify that if M is a c < = > manifold, then the vector space of derivations at the point p E M is isomorphic to the space T Mp tangent to M at p constructed in Subsect. 15.3. 1 . a ) I f we fix a vector �(p) E TMp at each point p E M of a smooth manifold M, we say that a vector field is defined on the manifold M. Let X be a vector field on M. Since by the preceding problem every vector X (p) = � E TMp can be interpreted as differentiation at the corresponding point p, from any function f E c < = > (M, JR) one can construct a function X f(p) whose value at every point p E M can be computed by applying X (p) to J, that is, to differentiating f with respect to the vector X (p) in the field X. A field X on M is smooth ( of class c < = > ) if for every function f E c < = > (M, JR.) the function X f also belongs to c < = > (M, JR.) . Give a local coordinate expression for a vector field and the coordinate definition of a smooth c c < = > ) vector field on a smooth manifold equivalent to the one just given. b) Let X and Y be two smooth vector fields on the manifold M. For functions f E c < = > (M, JR.) we construct the functional [X, Y]f = X(Y f) - Y(X f). Verify that [X, Y] is also a smooth vector field on M. It is called the Poisson bracket of the vector fields X and Y. c) Give a Lie algebrastructure to the smooth vector fields on a manifold. 2.
3 . a) Let X and w be respectively a smooth vector field and a smooth 1-form on a smooth manifold M. Let wX denote the application of w to the vector of the field X at corresponding points of M. Show that wX is a smooth function on M. b) Taking account of Problem 2, show that the following relation holds:
where X and Y are smooth vector fields, dw 1 is the differential of the form w 1 , and dw 1 (X, Y) is the application of dw 1 to pairs of vectors of the fields X and Y attached at the same point. c) Verify that the relation
i= l + 2:: (-1) i +j w ([xi, xj ], xl , . . . , xi , . . . , xj , . . . , xm+l ) l :'Oi<j:C:: m + l
holds for the general case of a form w of order m. Here the frown � denotes an omitted term, [Xi , Xi] is the Poisson bracket of the fields Xi and Xi , and Xiw represents differentiation of the function w (X 1 , . . . , Xi, . . . , Xm+l ) with respect to the vectors of the field Xi. Since the Poisson bracket is invariantly defined, the
15.3 Differential Forms and Integration on Manifolds
349
resulting relation can be thought of as a rather complicated but invariant definition of the exterior differential operator d : n --+ n. d) Let w be a smooth m-form on a smooth n-dimensional manifold M. Let (6 , . . . , �m + l ); be vectors in !R n corresponding to the vectors 6 , . . . , �m + l E T Mp in the chart <.p; : IR n --+ U C M. We denote by II; the parallelepiped formed by the vectors (6 , . . . , �m + l ) ; in !R n , and let >... II; be the parallelepiped spanned by the vectors (>...6 , . . . , A�m + d i · We denote the images <.p; (II; ) and <.p; ( A II; ) of these parallelepipeds in M by II and >... II respectively. Show that 1 dw (p) (6 , . . . , �m + I ) = l� ).. n l W· +
I
8 ( >- II )
a) Let f : M --+ N be a smooth mapping of a smooth m-dimensional manifold M into a smooth n-dimensional manifold N. Using the interpretation of a tangent vector to a manifold as a bundle of tangent paths (see Problem 1 ) , construct the mapping f. (p) : TMp --+ TNJ ( p ) induced by f. b) Show that the mapping f. is linear and write it in corresponding local coor dinates on the manifolds M and N. Explain why f. (p) is called the differential of f at p or the mapping tangent to f at that point. Let f be a diffeomorphism. Verify that f. [X, Y] = [f. X, f. Y] . Here X and Y are vector fields on M and [· , ·] is their Poisson bracket (see Problem 2). c ) As is known from Sect. 1 5 . 1 , the tangent mapping f. (p) : TMp --+ TN = f p q () of tangent spaces generates the adjoint mapping j* (p) of the conjugate spaces and in general a mapping of k-forms defined on T Nf ( p ) and T Mp . Let w be a k-form on N. The k-form j* w on M is defined by the relation 4.
where 6 , . . . , � k E TMp. In this way a mapping j* : st k ( N ) --+ st k (M ) arises from the space st k (N ) of k-forms defined on N into the space st k (M) of k-forms defined on M. Verify the following properties of the mapping j* , assuming M and N are c < oo) manifolds: 1 ° j* is a linear mapping; 2° j* ( w 1 1\ w2 ) = j* w 1 1\ j* w 2 ; 3° d o j* = j* o d, that is d (j* w ) = j* ( dw ) ; 4° ( h o h ) * = H o f2 . d) Let M and N be smooth n-dimensional oriented manifolds and <.p : M --+ N a diffeomorphism of M onto N. Show that if w is an n-form on N with compact support, then w = e: <.p * w ,
{ 11,
I
I
M
if <.p preserves orientation , . . 1'f <.p reverses onentatwn . e) Suppose A ::::> B. The mapping i : B --+ A that assigns to each point x E B that same point as an element of A is called the canonical embedding of B in A.
where f =
-
,
15 *Integration of Differential Forms on Manifolds
350
If w is a form on a manifold M and M' is a submanifold of M, the canonical embedding i : M' -+ M generates a form i* w on M' called the restriction of w to M'. Show that the proper expression of Stokes' formula (15.47) should be
where i : 8M -+ M is the canonical embedding of 8M in M, and the orientation of 8M is induced from M. a) Let M be a smooth (C (oo ) ) oriented n-dimensional manifold and D:C: (M) the space of smooth (C (oo) ) n-forms with compact support on M. Show that there exists a unique mapping I : n;; (M) -+ lR having the following properties: 5.
M
1 ° the mapping I is linear; M
2 ° if 'P : I n -+ U C M (or 'P : Jn -+ U C M) is a chart of an atlas defining the orientation of M, supp w C U, and w = a(x) dx 1 1\ · · · I\ dx n in the local coordinates x 1 , , x n of this chart, then . . •
J w = J a(x) dx \ . . . , dxn
M
Jn
(or
J w = J a(x) dx 1 ,
M
• • •
, dx n ) ,
jn
where the right-hand side contains the Riemann integral of the function a over the corresponding cube I n (or Jn ) . b ) Can the mapping just exhibited always be extended to a mapping I M n n (M) -+ lR of all smooth n-forms on M, retaining both of these properties? c) Using the fact that every open covering of the manifold M has an at most countable locally finite refinement and the fact that there exists a partition of unity subordinate to any such covering (see Problem 9), define the integral of an n-form over an oriented smooth n-dimensional (not necessarily compact) manifold so that it has properties 1 ° and 2 ° above when applied to the forms for which the integral is finite. Show that for this integral formula ( 1 5.47) does not hold in general, and give conditions on w that are sufficient for (15.47) in the case when M = lR n and in the case when M = H n . 6 . a) Using the theorem on existence and uniqueness of the solution of the differen tial equation x = v(x) and also the smooth dependence of the solution on the initial data, show that a smooth bounded vector field v(x) E lR n can be regarded as the velocity field of a steady-state flow. More precisely, show that there exists a family of diffeomorphisms 'Pt : lR n -+ lR n depending smoothly on the parameter t (time) such that 'Pt (x) is an integral curve of the equation for each fixed value of x E lR n , that is, 8 "'Jix) = v ( 'P t (x) ) and 'Po (x) = x. The mapping 'Pt : lR n -+ lR n obviously characterizes the displacement of the particles of the medium at time t. Verify that the family of mappings 'Pt : lR n -+ lR n is a one-parameter group of diffeomorphisms, that is, ( 'Pt ) - l = 'P - t , and 'Pt 2 o 'Pt 1 = 'Pt , + t 2 .
15.3 Differential Forms and Integration on Manifolds
351
b ) Let v be a vector field on ne and 'P t a one-parameter group of diffeomor phisms of ne generated by v. Veri fy that the relation
lim ! (!( 'P t ( x )) - f( x ) ) = D v ( x) / t holds for every smooth function f E c < = ) (!R n , IR) . If we introduce the notation v(f) := Dv f, i n consistency with the notation of Problem 2, and recall that f o 'P t =: r.p; J , we can write t -+ 0
lim !t ( rp ; f - f ) ( x ) = v(f) (x) .
t-+0
c ) Differentiation of a smooth form w of any degree defined in !R n along the field v is now naturally defined. To be specific, we set
v(w) (x) : = tlim ! ( rp ; w - w) (x) . -+ 0 t
The form v(w) is called the Lie derivative of the form w along the field v and usually denoted Lvw. Define the Lie derivative Lxw of a form w along the field X on an arbitrary smooth manifold M. d ) Show that the Lie derivative on a c < = ) manifold M has the following prop erties. 1 ° Lx is a local operation, that is, if the fields X I and x2 and the forms W j and w2 are equal in a neighborhood U C M of the point x, then (Lx 1 wi ) (x) = (Lx2 W2 ) (x). 2° Lx fl k (M) c fl k (M) . 3° L x : fl k (M) --+ fl k (M) is a linear mapping for every k = 0 , 1 , 2 , . . . . 4° Lx (wi 1\ w2 ) = (Lxwi ) 1\ W2 + WI 1\ Lxw2 . 5° If f E fl 0 (M) , then Lx f = df(X) =: X f. 6° If f E fl 0 (M) , then Lxdf = d(Xf) . e ) Verify that the properties 1 ° �6 ° determine the operation Lx uniquely. Let X be a vector field and w a form of degree k on the smooth manifold M. The inner product of the field X and the form w is the (k - 1 ) -form de noted by ixw or Xj w and defined by the relation (ixw) (X I , . . . , X k - I ) := w(X, X 1 , . . . , Xk- I ) , where X 1 , . . . , Xk - I are vector fields on M. For 0-forms, that is, functions on M, we set X J f = 0. a) Show that if the form w ( more precisely, w l ) has the form 7.
"'""' L.....,
l :S i 1 < · · · < i k :S n
a ' l · · · ' k ( x ) dx i 1 ·
1\ 1\ dx i k = _!_k! a' l · · · ' k dx i 1 1\ 1\ dx i k ·
·
·
·
·
·
·
in the local coordinates x 1 , , x n of the chart r.p : !R n --+ U C M, and X = Xi 8�, , then . 1 i lX W - (k _ 1) ! x i a i ,. 2 . . . i k d X 2 1\ 1\ d X i k . • • •
_
·
·
·
b ) Verify further that if df = j!, dxi , then ix df = Xi j!, = X (f)
=
Dx f.
352
15 *Integration of Differential Forms on Manifolds
c) Let X(M) be the space of vector fields on the manifold M and Jl(M) the ring of skew-symmetric forms on M. Show that there exists only one mapping i : X (M) x Jl(M) --+ Jl(M) having the following properties: 1 ° i is a local operation, that is, if the fields X 1 and X2 and the forms W 1 and w 2 are equal in a neighborhood U of x E M , then (ix 1 w 1 ) (x) = (ix2 w2 ) (x) ; 2 ° ix ( Jl k (M ) ) C n k - 1 (M ) ; 3 ° ix : Jl k ( M) --+ n k - l ( M) is a linear mapping; 4 ° if w 1 E Jl k 1 ( M) and w2 E Jl k 2 (M) , then ix (wu'\w 2 ) = ixwu'\w 2 + ( - 1 ) k 1 wu'\ ixw 2 ; 5 ° if w E Jl 1 ( M ) , then ixw = w (X ) , and if f E Jl 0 ( M ) , then ix f = 0. Prove the following assertions. a) The operators d, ix , and Lx (see Problems 6 and 7) satisfy the so-called homotopy identity (15.51) Lx = ixd + dix , where X is any smooth vector field on the manifold. b) The Lie derivative commutes with d and i x , that is, 8.
Lx o d = d o Lx ,
Lx o ix
=
ix o Lx .
c) (Lx , iy] = i[x,Y] , [Lx , Ly] = L[x,Y) l where, as always, [ A , B] = A o B - B o A for any operators A and B for which the expression A o B - B o A is defined. In this case, all brackets ( , ] are defined. d) Lx fw = f Lxw + df A ixw, where f E Jl 0 ( M ) and w E Jl k (M ) . (H i n t. Part a ) is the main part o f the problem. It can b e verified, for example, by induction on the degree of the form on which the operators act.)
15.4 Closed and Exact Forms on Manifolds 1 5 . 4 . 1 Poincare's Theorem
In this section we shall supplement what was said about closed and exact differential forms in Sect. 14.3 in connection with the theory of vector fields in ffi.n . As before, denotes the space of smooth real-valued forms of degree p on the smooth manifold and =
f!P (M)
M J!(M) U f!P (M). Definition 1 . The form w E f!P (M) is closed if dw 0. Definition 2 . The form w E f!P (M), p > 0, is exact if there exists a form E f!P - 1 (M) such that w d . The set of closed p-forms on the manifold M will be denoted ZP (M), and the set of exact p-forms on M will be denoted B P (M). p
=
a
=
a
353
15.4 Closed and Exact Forms on Manifolds
0
The relation7 d(dw) = holds for every form w E D(M) , which shows that ZP (M) :J BP (M) . We already know from Sect. 14.3 that this inclusion is generally strict. The important question of the solvability (for a ) of the equation d a = w given the necessary condition dw = on the form w turns out to be closely connected with the topological structure of the manifold M. This statement will be deciphered more completely below.
0
Definition 3. We shall call a manifold M contractible (to the point x0 E M) or homotopic to a point if there exists a smooth mapping h : M x I --t M where I = {t E IR I t ::; 1 } such that h ( x, 1 ) = x and h ( x, O ) = x0 .
Example 1 . The h ( x, t) = t x .
0 ::;
space
!Rn can be contracted to a point by the mapping
1. (Poincare) . Every closed (p + 1)-form (p 2': on a manifold that is contractible to a point is exact. Proof. The nontrivial part of the proof consists of the following "cylindrical"
0)
Theorem
construction, which remains valid for every manifold M. Consider the "cylinder" M x I, which is the direct product of M and the closed unit interval I, and the two mappings j; : M --t M x I, where j; ( x ) = ( x, i), i = 1 , which identify M with the bases of the cylinder M x I. Then there naturally arise mappings j;* : f?P (M x I) --t f?P (M) , reducing to the replacement of the variable t in a form of f?P (M x I) by the value i ( = 1 ) , and, of course, di = We construct a linear operator K : f?P + 1 (M x I) --t f?P (M) , which we define on monomials as follows:
0,
0,
0.
K ( a ( x, t) dx i 1 A · · · A dx i v +! ) : = 1 K ( a ( x, t) dt A dx i 1 A · · · A d x i v ) := ( a ( x, t) dt) d x i 1 A · · · A d x i v . 0
0,
J
The main property of the operator K that we need is that the relation K(dw) + d(Kw) = j;w - j�w
( 15.52)
holds for every form w E f?P + 1 (M x I) . It suffices to verify this relation for monomials, since all the operators K, d, Ji , and j0 are linear.
7 Depending on the way in which the operator d is introduced this property is either proved, in which case it is called the Poincare lemma, or taken part of the definition of the operator d. as
15 *Integration of Differential Forms on Manifolds
354
dx i v + 1 , then Kw = 0, d(Kw) 0, and a(x, t) dxi 1 & . dx 'v. + l + [terms not containing dt] , dw = {) dt dx ' 1 t 1 oa dt dx '. 1 dx'v. + 1 K(dw) = {) t 0 (a(x, 1) - a(x, O)) dx i 1 . . I\ dx i v + l Jtw - j�w ,
If w
=
1\
1\ · · · 1\
(/ )
=
1\ · · · 1\
1\ · · · 1\
=
1\
=
·
=
and relation ( 15.52) is valid. If w = a(x, t) dt 1\ dx i 1 1\ · · · 1\ dx i P , then J;w = j0w = 0. Then
K (dw) = K
(
-
io
=
d(Kw)
=
d
( ( j1 a(x, t) dt) dxi 1
=
0
) 1 ( - � J 0��0 dt) dx i o •o 0 dx i v )
.0 . 1 . oa L -ox •o. dt 1\ dx' 1\ dx ' 1\ · · · 1\ dx 'v
1\ . . . 1\
1\
L {)
•o
=
1\ · · · 1\
dx iv ,
=
:io ( J0 a(x, t) dt) dxio dxi 1 1 ( J O{)aXio dt) dx'.0 •o 0 1
=
1\ · · · 1\
"' L.....
(\
dxi v
dx '. 1
=
(\ ' ' ' (\
dx 'P. .
Thus relation ( 15.52) holds in this case also.8 Now let M be a manifold that is contractible to the point x0 E M, let h : M x I ---* M be the mapping in Definition 3, and let w be a (p + 1 )-form on M. Then obviously hoj 1 : M ---* M is the identity mapping and h o j0 : M ---* x0 is the mapping of M to the point x0 , so that (J; o h* )w w and (j0 o h* )w = 0. Hence it follows from ( 15.52 ) that in this case =
K (d(h*w)) + d (K(h*w)) If in addition w is a closed form on we find by ( 15.53) that
=
( 15.53)
w.
M, then, since d(h*w) = h* (dw) = 0,
d (K(h*w)) = w .
Thus the closed form w is the exterior derivative of the form a
QP(M), that is, w is an exact form on M. 8
D
=
K(h*w)
For the justification of the differentiation of the integral with respect to this last equality, see, for example, Sect. 17. 1 .
E
xio in
15.4 Closed and Exact Forms on Manifolds
355
Example 2. Let A, B, and C be smooth real-valued functions of the variables x, y, z E JR3 . We ask how to solve the following system of equations for P, Q,
and R:
( 1 5.54)
An obvious necessary condition for the consistency of the system ( 1 5.54) is that the functions A, B, and C satisfy the relation
aA + aB + ac - 0 ax 8y az
which is equivalent to saying that the form w
_ '
= A dy A dz + B dz A dx + C dx A dy
is closed in IR 3 . The system ( 15.54) will have been solved if we find a form a = P dx + Q dy + R dz
such that d a = w . In accordance with the recipes explained in the proof of Theorem 1 , and taking account of the mapping h constructed in Example 1 , we find, after simple computations,
(/ A(tx, ty, tz)t dt ) (y dz - z dy) + 1
a = K(h*w) =
0
+ (/ B(tx, ty, tz)t dt ) (z dx - x dz) + 1
0
+
1
(/ C(tx, ty, tz)t dt) (x dy - y dx) . 0
One can also verify directly that d a = w .
Remark. The amount o f arbitrariness i n the choice o f a form a satisfying the condition da = w is usually considerable. Thus, along with a , any form a + d1J will obviously also satisfy the same equation. By Theorem 1 any two forms a and (3 on a contractible manifold M satisfying d a = d(3 = w differ by an exact form. Indeed, d( a - (3) = 0, that is, the form ( a - (3) is closed on M and hence exact , by Theorem 1 .
356
15 *Integration of Differential Forms on Manifolds
1 5 . 4 . 2 Homology and Cohomology
By Poincare ' s theorem every closed form on a manifold is locally exact . But it is by no means always possible to glue these local primitives together to obtain a single form. Whether this can be done depends on the topological structure of the manifold. For example, the closed form in the punctured plane IR 2 \ 0 given by w = -y$��-�� d y , studied in Sect . 14.3, is locally the differential of a function cp = cp(x, y) - the polar angle of the point (x, y). However, extending that function to the domain IR 2 \ 0 leads to multivaluedness if the closed path over which the extension is carried out encloses the hole - the point 0. The situation is approximately the same with forms of other degrees. "Holes" in manifolds may be of different kinds, not only missing points, but also holes such as one finds in a torus or a pretzel. The structure of manifolds of higher dimensions can be rather complicated. The connection between the structure of a manifold as a topological space and the relationship between closed and exact forms on it is described by the so-called (co )homology groups of the manifold. The closed and exact real-valued forms on a manifold M form the vector spaces ZP (M) and BP (M) respectively, and ZP (M) :J BP (M) . Definition 4. The quotient space HP (M)
:=
ZP (M)/ BP (M)
(15.55)
is called the p-dimensional cohomology group of the manifold M (with real coefficients) . Thus, two closed forms w 1 , w 2 E ZP (M) lie in the same cohomology class, or are cohomologous, if w 1 - w2 E BP (M) , that is, if they differ by an exact form. The cohomology class of the form w E ZP (M) will be denoted [w] . Since ZP (M) is the kernel of the operator dP : flP (M) ---+ [lP + 1 (M) , and BP (M) is the image of the operator dP - 1 : {}P - 1 (M) ---+ flP(M) , we often write HP (M) = Ker dP /Im dP - 1 . Computing cohomologies, as a rule, is difficult . However, certain trivial general observations can be made. It follows from Definition 4 that if p > dim M, then HP (M) = 0. It follows from Poincare ' s theorem that if M is contractible then HP (M) = 0 for p > 0. On any connected manifold M the group H 0 (M) is isomorphic to IR, since H 0 ( M) = Z 0 ( M), and if d f = 0 holds for the function f : M ---+ lR on a connected manifold M, then f = const. Thus, for example, it results for !R n that HP (JR n ) = 0 for p > 0 and 0 H (1R n ) R This assertion (up to the trivial last relation) is equivalent to Theorem 1 with M = !R n and is also called Poincare ' s theorem. The so-called homology groups have a more visualizable geometrical re lation to the manifold M. ,...,
15.4 Closed and Exact Forms on Manifolds
357
A smooth mapping c : IP --+ M of the p-dimensional cube I C JRP into the manifold M is called a singular p-cube on M.
Definition 5.
This is a direct generalization of the concept of a smooth path to the case of an arbitrary dimension p. In particular, a singular cube may consist of a mapping of the cube I to a single point . Definition 6. A p-chain (of singular cubes) on a manifold M is any finite formal linear combination I: a k c k of singular p-cubes on M with real coeffi.k cients.
Like paths, singular cubes that can be obtained from each other by a diffeomorphic change of the parametrization with positive Jacobian are re garded as equivalent and are identified. If such a change of parameter has negative Jacobian, then the corresponding oppositely oriented singular cubes c and c_ are regarded as negatives of each other, and we set c_ = - c. The p-chains on M obviously form a vector space with respect to the standard operations of addition and multiplication by a real number. We denote this space by Cp (M) .
boundary
Definition 7. The
(p - 1)-chain
8Iof the p-dimensional cube IP in
l
p
ai L L)-l) i +icij i=O j = l :=
JRP is the ( 1 5 . 56)
in JRP , where Cij : IP - l --+ JRP is the mapping of the (p - I )-dimensional cube into JRP induced by the canonical embedding of the corresponding face of IP in JRP . More precisely, if IP - 1 = {x E JRP - 1 1 0 ::::; xm ::::; 1 , m = 1 , . . . , p - 1 } , mp . t hen c,.1· ( X- ) - ( X- 1 , , X-j - 1 , z, X-H 1 , . . . , X-p- i ) E � _
•
•
.
·
It is easy to verify that this formal definition of the boundary of a cube agrees completely with the operation of taking the boundary of the standard oriented cube IP (see Sect . 12.3) .
boundary 8c of the singular p-cube c is the (p - 1 )-chain 1 p 8c L L( - l) i+ic c ii . i=O j = l The boundary of a p-chain I: a k c k on the manifold M is the k
Definition 8. The
:=
Definition 9.
(p - 1 )-chain
o
358
15 *Integration of Differential Forms on Manifolds
Thus on any space of chains Cp (M) we have defined a linear operator Using relation ( 15 .56) , one can verify the relation Consequently a o a = a2 = 0 in general.
a( a!) = 0 for the cube.
p-cycle on a manifold is a p-chain z for which az = 0. Definition 1 1 . A boundary p-cycle on a manifold is a p-chain that is the boundary of some (p + 1 ) -chain. Definition 10. A
Let Zp (M) and Bp (M) be the sets of p-cycles and boundary p-cycles on the manifold M. It is clear that Zp (M) and Bp (M) are vector spaces over the field JR. and that Zp (M) ::) Bp (M) . Definition 1 2 . The quotient space
(15.57) is the p-dimensional cients ) .
homology group of the manifold M ( with real coeffi
Thus, two cycles Z 1 , z2 E Zp ( M ) are in the same homology class, or are homologous, if z 1 - z2 E Bp (M ) , that is, they differ by the boundary of some chain. We shall denote the homology class of a cycle z E Zp (M) by [z]. As in the case of cohomology, relation (15.57) can be rewritten as
Definition 13. If c : I --+ M is a singular p-cube and w is a p-form on the manifold M, then the integral of the form w over this singular cube is
j := j c*w . w
r
(15.58)
I
Definition 14. If 2: o: k ck is a p-chain and k
w
is a p-form on the manifold
M, the integral of the form over such a chain is interpreted as the linear combination 2: O:k J w of the integrals over the corresponding singular cubes. k
Ck
It follows from Definitions 5-8 and 13-14 that Stokes ' formula c
De
(15.59)
holds for the integral over a singular cube, where c and w have dimension p and degree p - 1 respectively. If we take account of Definition 9, we conclude that Stokes ' formula ( 15.59) is valid for integrals over chains.
359
15.4 Closed and Exact Forms on Manifolds
Theorem 2 . a )
The integral of an exact form over a cycle equals zero. b) The integral of a closed form over the boundary of a chain equals zero. c) The integral of a closed form over a cycle depends only on the cohomology class of the form. d) If the closed p-forms w 1 and w2 and the p-cycles z 1 and z2 are such that [w l ] = [w2 ] and [z l ] = [z2 ], then Proof. a ) By Stokes ' formula Jz w dz = J w = 0, since fJ z = 0. 8z ' formula J w J dw = 0, since dw 0. b) By Stokes c 8c c) follows from b). d) follows from a ) . e) follows from ) and d). Corollary. The bilinear mapping DP (M) x Cp (M) ---+ � defined by (w, ) Jc w induces a bilinear mapping ZP (M) x Zv (M) ---+ � and a bilinear mapping HP (M) x Hp (M) ---+ R The latter is given by the formula =
c
=
0
([w], [zl )
c
foo-t
Jw ,
foo-t
(15.60)
z
where w E ZP (M) and z E Zv (M) . Theorem 3. (de Rham9 ) . The bilinear mapping HP (M) x Hp (M) ---+ � given by (15.60) is nondegenerate. 1 0 We shall not take the time to prove this theorem here, but we shall find some reformulations of it that will enable us to present in explicit form some corollaries of it that are used in analysis. We remark first of all that by (15.60) each cohomology class [w] E HP (M) can be interpreted as a linear function [w] ([zl ) = J w. Thus a natural mapz
ping HP (M) ---+ H; (M) arises, where H; (M) is the vector space conjugate to Hp (M) . The theorem of de Rham asserts that this mapping is an isomor phism, and in this sense HP (M) = H; (M) .
9 G. de Rham (1903-1969) - Belgian mathematician who worked mainly in alge braic topology. 10 We recall that a bilinear form L ( x, y) is nondegenerate if for every fixed nonzero value of one of the variables the resulting linear function of the other variable is not identically zero.
360
15 *Integration of Differential Forms on Manifolds
Definition 1 5 . If w is a closed p-form and
then the quantity per (z) : = w
the form over the cycle z.
J z
w
z is a p-cycle on the manifold M, is called the period ( or cyclic constant) of
I n particular, i f the cycle z i s homologous to zero, then, as follows from assertion b ) of Theorem 2, we have per (z) = 0. For that reason the following connection exists between periods:
(15.61) that is, if a linear combination of cycles is a boundary cycle, or, what is the same, is homologous to zero, then the corresponding linear combination of periods is zero. The following two theorems of de Rham hold; taken together, they are equivalent to Theorem 3. Theorem 4. ( de Rham ' s first theorem. ) A
if all its periods are zero.
closed form is exact if and only
Theorem 5. ( de Rham ' s second theorem. ) E Zp(M) M
If a number per (z) is assigned to each p-cycle z on the manifold in such a way that condition (15.61) holds, then there is a closed p-form on M such that J = per (z) for every cycle z E Zp(M) . w
z
w
15.4.3 Problems and Exercises 1. Verify by direct computation that the form a obtained in Example 2 does indeed satisfy the equation da = w.
2 . a ) Prove that every simply-connected domain in ll�? is contractible on itself to a point . b ) Show that the preceding assertion is generally not true in R3 .
Analyze the proof of Poincare's theorem and show that if the smooth mapping h : M x I ---+ M is regarded as a family of mappings ht : M ---+ M depending on the parameter t, then for every closed form w on M all the forms h;w, t E I, will be in the same cohomology class. 4. a ) Let t >--+ ht E c< oo l ( M, N) be a family of mappings of the manifold M into the manifold N depending smoothly on the parameter t E I C R. Verify that for every form w E fl(N) the following homotopy formula holds: 3.
(15.62)
15.4 Closed and Exact Forms on Manifolds
361
Here x E M, X is a vector field on N with X (x, t) E TNh, (x ) > X(x, t) is the velocity vector for the path t' >--+ ht� (x) at t' = t, and the operation ix of taking the inner product of a form and a vector field is defined in Problem 7 of the preceding section. b) Obtain the assertion of Problem 3 from formula ( 15.62) . c) Using formula ( 15.62) , prove Poincare's theorem (Theorem 1) again. d) Show that if K is a manifold that is contractible to a point, then HP ( K x M) = HP (M) for every manifold M and any integer p . e) Obtain relation (15.51) of Sect. 15.3 from formula ( 15.62) . 5. a) Show, using Theorem 4, and also by direct demonstration, that if a closed 2-form on the sphere 8 2 is such that J w = 0, then w is exact.
S2
b) Show that the group H 2 (8 2 ) is isomorphic to JR. c) Show that H 1 (8 2 ) = 0. 6. a) Let t.p : 8 2 --+ 8 2 be the mapping that assigns to each point x E 8 2 the antipodal point -x E 8 2 • Show that there is a one-to-one correspondence between forms on the projective plane IRJP>2 and forms on the sphere 8 2 that are invariant under the mapping c.p , that is, c.p * w = w. b) Let us represent IRJP>2 as the quotient space 8 2 I r, where r is the group of transformations of the sphere consisting of the identity mapping and the antipodal mapping t.p . Let 7r : 8 2 --+ IRJP>2 = 8 2 I r be the natural projection, that is 7r ( x ) = {x, -x } . Show that 1r o t.p = 1r and verify that 2 2 \17] E fJP (8 ) ( c.p * 1J = 17) -¢==} 3 w E fJ P (JRJP> ) (1r * w = 17) . c) Now show, using the result of Problem 5a) , that H 2 ( JRJP>2 ) = 0. d) Prove that if the function f E C(8 2 , IR) is such that f(x) - f ( -x) = const, then f = 0. Taking account of Problem 5c) , deduce from this that H 1 (IRJP>2 ) = 0.
a) Representing IRJP>2 as a standard rectangle II with opposite sides identified as shown by the orienting arrows in Fig. 15.3, show that all = 2c' - 2c, ac = P - Q, and ac' = p - Q. b) Deduce from the observations in the preceding part of the problem that there are no nontrivial 2-cycles on IRJP>2 • Then show by de Rham's theorem that H 2 (IRJP>2 ) = 0. 7.
Q '
c
p
p
II c
Fig. 1 5 . 3 .
Q
362
15 *Integration of Differential Forms on Manifolds
c) Show that the only nontrivial 1-cycle on JRJP'2 (up to a constant factor) is the cycle c' - c , and since c' - c = � 8II, deduce from de Rham's theorem that
H l (JRJP'2 )
=
0.
8. Find the groups H 0 (M) , H 1 (M) , and H 2 (M) if
a) M = 8 1 - the circle; b) M = T 2 - the two-dimensional torus; c) M = K 2 - the Klein bottle.
a) Prove that diffeomorphic manifolds have isomorphic (co)homology groups of the corresponding dimension. b) Using the example of JR 2 and JRJP'2 , show that the converse is generally not true. 9.
10. Let X and Y be vector spaces over the field lR and L(x, y) a nondegenerate bilinear form L : X x Y --+ JR. Consider the mapping X --+ Y* given by the correspondence X 3 x f--7 L(x, · ) E Y* . a) Prove that this mapping is injective. b) Show that for every system Y I , . . . , Y k of linearly independent vectors in Y there exist vectors x 1 , , x k such that x i (yj ) = L(x i , yj ) = 8) , where 8) = 0 if i ¥ j and 8j = 1 if i = j . c) Verify that the mapping X --+ Y* is an isomorphism of the vector spaces X and Y* . d) Show that de Rham's first and second theorems together mean that HP (M) = H; ( M) up to isomorphism. • • •
1 6 Uniform Convergence and the B asic Op erat ions of Analysis on Series and Families of Functions
16. 1 Pointwise and Uniform Convergence
16.1 . 1
Pointwise Convergence
1. We say that the sequence { !n ; n E N} offunctions fn : X ---+ IR converges at the point x E X if the sequence of values at x, Un (x); n E N} ,
Definition
converges.
Definition 2. The set of points E C X at which the sequence Un ; n E N} of functions fn : X ---+ IR converges is called the convergence set of the sequence.
{fn ; n ---+ f(x) := nlim --+ oo fn (x) . This function is called the limit function of the sequence { in ; n E N} or the limit of the sequence Un ; n E N} . Definition 4. If f E ---+ IR is the limit of the sequence { !n ; n E N } , we say that the sequence of functions converges ( or converges pointwise) to f on E. In this case we write f(x) = lim fn (x) on E or fn ---+ f on E as n ---+ oo . n --+ oo Example 1. Let X = { x E IR I x :::>: 0} and let the functions fn X ---+ IR be given by the relation fn ( x) = xn , n E N. The convergence set of this sequence of functions is obviously the closed interval I = [0, 1] and the limit function f I ---+ IR is defined by 0 , if 0 � X < 1 , f(x) =
Definition 3. On the convergence set of the sequence of functions E N} there naturally arises a function : E IR defined by the relation
f
:
:
:
{
1 , if x = 1 .
.
2
The sequence of functions fn ( x) = s m;: x on IR converges on IR to the function f : IR ---+ 0 that is identically 0. Example 3. The sequence fn ( x) = si�2'x also has the identically zero function f : IR ---+ 0 as its limit.
Example 2.
364
16 Uniform Convergence and Basic Operations of Analysis
[0 ,
0
Example 4. Consider the sequence ln (x) = 2(n + 1)x(1 - x 2 ) n on the closed interval I = 1] . Since nqn for l q l < 1 , this sequence tends to zero on -+
/.
the entire closed interval Example 5. Let m, n E N, and let lm (x) := nlim (cos ml7rx) 2 n . If m!x is an -+ao integer, then lm (x) = 1, and if m!x � Z, obviously lm (x) = We shall now consider the sequence Um ; m E N} and show that it con verges on the entire real line to the Dirichlet function V(x)
=
0.
{0'
if x � Q ,
0
1 ' if x E Q . Indeed, if x � Q, then m!x � Z, and lm (x) = for every value of m E N, so that l (x) = But if x = � · where p E Z and q E N, then m!x E Z for m ;::: q, and lm (x) = 1 for all such m, which implies l(x) = 1 . Thus lim lm (x) = V(x) . m-+oo
0.
1 6 . 1 . 2 Statement of the Fundamental Problems
Limiting passages are encountered at every step in analysis, and it is often important to know what kind of functional properties the limit function has. The most important properties for analysis are continuity, differentiability, and integrability. Hence it is important to determine whether the limit is a continuous, differentiable, or integrable function if the prelimit functions all have the corresponding property. Here it is especially important to find conditions that are sufficiently convenient in practice and which guarantee that when the functions converge, their derivatives or integrals also converge to the derivative or integral of the limit function. As the simple examples examined above show, without some additional hypotheses the relation "In -+ I on [a, b] as n -+ oo" does not in general imply either the continuity of the limit function, even when the functions In b b are continuous, or the relation I� -+ f' or J ln (x) dx -+ J l(x) dx, even when a
a
0
all these derivatives and integrals are defined. Indeed, in Example 1 the limit function is discontinuous on [ , 1] although all the prelimit functions are continuous there; in Example 2 the derivatives n cos n 2 x of the prelimit functions in general do not converge, and hence cannot converge to the derivative of the limit function, which in this case is identically zero; 1
J l(x) dx = 0
1
in Example 4 we have J In ( x) dx
0;
0
=
1 for every value of n E N, while
16.1 Pointwise and Uniform Convergence
365
in Example 5 each of the functions fm equals zero except at a finite set of points, so that
b
=
C
J fm (x) dx 0 on every closed interval [a, b] IR, while the (],
limit function V is not integrable on any closed interval of the real line.
At the same time: in Examples 2, 3, and 4 both the prelimit and the limit functions are continuous; in Example 3 the limit of the derivatives co�n x of the functions in the sequence si�;'x does equal the derivative of the limit of that sequence; in Example
1 1 1 we have J fn (x) dx --t J f(x) dx as n --t oo .
0 0 Our main purpose is to determine the cases in which the limiting passage under the integral or derivative sign is legal. In this connection, let us consider some more examples.
Example 6.
We know that for any
1 3!
.
Sill X = X - - X
3
1 +5! X
5
x E IR
- · · ·
m + ( -1)+ X 2m +l + · · · ' (2m l ) !
(16.1)
but after the examples we have just considered, we understand that the re lations
b
sin ' X
=
(-1) m x2m +l ) ' f ( m =O (2m + 1)! 00
(16.2)
b
(- l ) m x 2m + 1 dx / sin x dx """' J ' � -O (2m + 1)! =
a
m
( 16.3)
a
require verification in general. Indeed, if the equality
S(x) = a 1 (x) + a 2 (x) + · · · + am (x) + · · ·
n
is understood in the sense that S(x) = lim Sn (x) , where Sn (x) = 2:::: a m (x), n -+oo m =1 then by the linearity of differentiation and integration, the relations 00
b
S'(x) = L a� (x) , m =1
J S(x) dx a
=
00
b
L J am (x) dx
m= l
a
366
16 Uniform Convergence and Basic Operations of Analysis
are equivalent to
J b
S' (x) = lim S� (x) , n -4oo S(x) dx
a
=
J b
lim n -4oo
a
Sn (x) dx ,
which we must now look upon with caution. In this case both relations ( 16.2) and (16.3) can easily be verified, since it is known that COS X
=1
-
1 2 2!
-X
1 4!
+ -X
4
-
·
·
·
+
( - l) m 2 m + X (2m) !
---
·
·
·
.
However, suppose that Eq. ( 1 6 . 1 ) is the definition of the function sin x. After all, that was exactly the situation with the definition of the functions sin z, cos z, and e z for complex values of the argument . At that time we had to get the properties of the new function (its continuity, differentiability, and integrability) , as well as the legality of the equalities ( 16.2) and ( 16.3) directly from the fact that this function is the limit of the sequence of partial sums of this series. The main concept by means of which sufficient conditions for the legality of the limiting passages will be derived in Sect. 16.3, is the concept of uniform convergence. 1 6 . 1 . 3 Convergence and Uniform Convergence of a Family of Functions Depending on a Parameter
In our discussion of the statement of the problems above we confined our selves to consideration of the limit of a sequence of functions. A sequence of functions is the most important special case of a family of functions f (x) depending on a parameter t. It arises when t E N. Sequences of functions thus occupy the same place occupied by the theory of limit of a sequence in the theory of limits of functions. We shall discuss the limit of a sequence of functions and the connected theory of convergence of series of functions in Sect . 16.2. Here we shall discuss only those concepts involving functions depending on a parameter that are basic for everything that follows.
t
Definition 5. We call a function (x, t) t-t F(x, t) of two variables x and t defined on the set X x T a family of functions depending on the parameter t if one of the variables t E T is distinguished and called the parameter.
The set T is called the parameter set or parameter domain, and the family itself is often written in the form ft (x) or t E T} , distinguishing the parameter explicitly.
Ut i
16. 1 Pointwise and Uniform Convergence
367
As a rule, in this book we shall have to consider families of functions for which the parameter domain T is one of the sets N or JR. or C of natural numbers, real numbers, or complex numbers or subsets of these. In general, however, the set T may be a set of any nature. Thus in Examples 1 - 5 above we had T = N. In Examples 1-4 we could have assumed without loss of content that the parameter n is any positive number and the limit was taken over the base n -+ oo, n E JR. + .
Definition 6 . Let Ut : X -+ JR.; t E T} be a family of functions depending on a parameter and let B be a base in the set T of parameter values. If the limit lim ft (x) exists for a fixed value x E X, we say that the family 8
of functions converges at x. The set of points of convergence is called the convergence set of the family of functions in a given base B. Definition 7. We say that the family of functions converges on the set E X over the base B if it converges over that base at each point x E E. The function f(x) := lim ft(x) on E is called the limit function or the limit of the family of functions ft on the set E over the base B. Example 7. Let ft (x) = e - ( x / t ) 2 , x E X = JR., t E T = JR. \ 0, and let B be the base t -+ 0. This family converges on the entire set JR., and 1 , if x = O , lim ft(x) = t -+ oo O , if x tf: O . C
8
{
We now give two basic definitions.
Ut ; t E T} of functions ft : X -+ JR. converges pointwise (or simply converges) on the set E X over the base B to the function f : E -+ JR. if lim ft (x) = f(x) at every point x E E. In this case we shall often write Ut f on E). Definition 9 . The family {ft; t E T} of functions ft : X -+ JR. converges uniformly on the set E X over the base B to the function f : E -+ JR. if for every c > 0 there exists an element B in the base B such that l f(x)-ft(x) l < c at every value t E B and at every point x E E. In this case we shall frequently write Ut � f on E) . Definition 8. We say that the family 8
C
C
� 8
8
We give also the formal expression of these important definitions: � 8
Ut f on E) := Vc > 0 Vx E E ::JB E B Vt E B (if(x) - ft(x) l < c) , Ut � f on E) := Vc > 0 ::JB E B Vx E E Vt E B (if(x) - ft (x) l < c) . 8
368
16 Uniform Convergence and Basic Operations of Analysis
The relation between convergence and uniform convergence resembles the relation between continuity and uniform continuity on a set. To explain better the relationship between convergence and uniform convergence of a family of functions, we introduce the quantity L1t (x) = l f(x) - ft (x) l , which measures the deviation of the value of the function ft from the value of the function f at the point x E E. Let us consider also the quantity L1t = sup L1t (x) , which characterizes, roughly speaking, the maxi-
mum deviation ( although there may not be a maximum ) of the function ft from the corresponding values of f over all x E E. Thus, at every point x E E we have L1t (x) ::::; L1t . In this notation these definitions obviously can be written as follows: xEE
(ft --,: f on E) .- Vx E E (L1t (x) --+ 0 over B) , ( L1t --+ 0 over B) . (It :::4 f on E) 8
It is now clear that
Ut :::4 f on E) 8
===>
f on E) , Ut --+ 8
that is, if the family ft converges uniformly to f on the set E, it converges pointwise to f on that set . The converse is in general not true.
Example 8. Let us consider the family of functions ft : I --+ lR defined on the closed interval I = { x E lR I 0 ::::; x ::::; 1} and depending on the parameter t E]O, 1]. The graph of the functions y = ft (x) is shown in Fig. 16.1. It is clear that tlim ft (x) = 0 at every point x E I, that is, ft --+ f = 0 as t --+ 0. -tO At the same time L1t = sup l f(x) - ft (x) l = sup l ft (x) l = 1, that is, L1t -/+ 0 xEI xEI as t --+ 0, and hence the family converges, but not uniformly. In such cases we shall say for convenience that the family converges nonuniformly to the limit function. If the parameter t is interpreted as time, then convergence of the family of functions ft on the set E to the function f means that for any preassigned precision c > 0 one can exhibit a time tc: for each point x E E starting from which ( that is, for t > tc: ) the values of all functions ft at x will differ from f(x) by less than c . Uniform convergence means that there is a time tc: , starting from which ( that is, for t > tc: ) the relation l f(x) - ft (x) l < c holds for all x E E. The figure of a traveling bulge of large deviation depicted in Fig. 16.1 is typical for nonuniform convergence.
Example 9. The sequence of functions fn ( x) = x n - x2n defined on the closed interval 0 ::::; x ::::; 1, as one can see, converges to zero at each point as n --+ oo .
369
16. 1 Pointwise and Uniform Convergence y 1
0 t 2t
1
X
Fig. 1 6. 1 .
To determine whether this convergence i s uniform, we find the quantity Ll n = max lfn (x) l . Since f� (x) = nx n - 1 (1 - 2x n ) = 0 for x = 0 and x = 2-1/ n ,
0:5x9
it is clear that Ll n = fn (2- 11 n ) = 1/4. Thus Ll n f+ 0 as n --* oo and our sequence converges to the limit function f(x) = 0 nonuniformly.
Example 1 0.
The sequence of functions fn converges to the function
f(x)
=
{
=
x n on the interval 0 ::; x ::; 1
0 ' if O :S x < 1 , X =
1 , if
1
nonuniformly, since for each n E N
Ll n
=
sup I J (x) - fn (x) l
o::;x::; 1
=
sup l f(x) - fn (x) l
o::;x < 1
=
=
sup l fn (x) l
0 :5x< 1
=
sup l x n l
0:5 x < 1
=
1.
The sequence of functions fn (x) = s in;: x studied in Example 2 converges to zero uniformly on the entire set lR as n --* oo , since in this case 2
Example 1 1 .
lf(x) - fn (x) l
=
lfn (x) l
=
l -nn-2 x l ::; ;1 sin
'
that is, Ll n ::; 1/n, and hence Ll n --* 0 as n --* oo. 1 6 . 1 . 4 The Cauchy Criterion for Uniform Convergence
In Definition 9 we stated what it means for a family of functions ft to converge uniformly on a set to a given function on that set . Usually, when the family of functions is defined the limit function is not yet known, so that it makes sense to adopt the following definition.
370
16 Uniform Convergence and Basic Operations of Analysis
Definition 10. We shall say that the family Ut ; t E T} of functions ft X --+ � converges on the set E X uniformly over the base B if it converges C
on that set and the convergence to the resulting limit function is uniform in the sense of Definition 9. Theorem. ( Cauchy criterion for uniform convergence ) . Let {it ; t E T} be a family of functions it : X --+ � depending on a parameter t E T, and B a
base in T. A necessary and sufficient condition for the family {ft ; t E T} to converge uniformly on the set E X over the base B is that for every E > 0 there exist an element B of the base B such that I it, ( x) - ft2 ( x) I < E for every value of the parameters h , t 2 E B and every point x E E . I n formal language this means that it converges uniformly o n E over the base B {:::::::?- Vc > 0 3B E B 'rft 1 , t z E B Vx E E ( l it, (x) - it2 (x) l < c). Proof. N e c e s s i t y. The necessity of these conditions is obvious, since if f : E --+ � is the limit function and ft f on E over B, there exists an element B in the base B such that l f(x) - ft (x) l < c/2 for every t E B and every x E E. Then for every t 1 , t 2 E B and every x E E we have l it, (x) - ft2 (x) l :::; lf( x) - it, (x) l + lf( x) - it2 (x) l < c/2 + c/2 = E S u f f i c i e n c y. For each fixed value of x E E we can regard ft (x) as a function of the variable t E T. If the hypotheses of the theorem hold, then the C
�
.
hypotheses of the Cauchy convergence criterion for the existence of a limit over the base B are fulfilled. Hence, the family {it ; t E T} converges at least pointwise to some function f : E --+ � on the set E over the base B. If we now pass to the limit in the inequality I ft, ( x) - it2 ( x) I < E, which is valid for any t 1 and t 2 E B and every x E E, one can obtain the inequality l f(x) - it2 (x) l :::; E for every tz E B and every x E E, and this, up to an inessential relabeling and the change of the strict inequality to the nonstrict, coincides exactly with the definition of uniform convergence of the family Ut ; t E T} to the function f : E --+ � on the set E over the base B. D Remark 1 . The definitions of convergence and uniform convergence that we have given for families of real-valued functions ft : X --+ � of course remain valid for families of functions ft : X --+ Y with values in any metric space Y . The natural modification that one must make in the definitions in this case amounts to replacing l f(x) - it (x) l by dy f (x) , it (x) ) , where dy is the metric in Y . For normed vector spaces Y, i n particular for Y
(
y=
=
16.1 Pointwise and Uniform Convergence
371
16. 1 . 5 Problems and Exercises 1. Determine whether the sequences of functions considered in Examples 3-5 con verge uniformly. 2.
Prove Eqs. ( 16.2) and ( 16.3).
3. a) Show that the sequence of functions considered in Example 1 converges uni formly on every closed interval [0, 1 - o] C [0, 1] , but converges nonuniformly on the interval [0, 1 [. b) Show that the same is true for the sequence considered in Example 9. c) Show that family of functions ft considered in Example 8 converges uniformly as t ---+ 0 on every closed interval [o, 1] C [0, 1] but nonuniformly on [0, 1] . d) Investigate the convergence and uniform convergence of the family of func tions ft ( x) = sin( tx) as t ---+ 0 and then as t ---+ oo. 2 e) Characterize the convergence of the family of functions ft (x) = e - t x as t ---+ +oo on an arbitrary fixed set E C JR.
4. a) Verify that if a family of functions converges (resp. converges uniformly) on a set, then it also converges (resp. converges uniformly) on any subset of the set. b) Show that if the family of functions ft : X ---+ JR converges (resp. converges uniformly) on a set E over a base B and g : X ---+ JR is a bounded function, then the family g · ft : X ---+ JR also converges (resp. converges uniformly) on E over the base B. c) Prove that if the families of functions ft : X ---+ JR, 9t : X ---+ JR converge uniformly on E C X over the base B, then the family ht = aft + f3gt , where a, (3 E JR, also converges uniformly on E over B. 5. a) In the proof of the sufficiency of the Cauchy criterion we passed to the limit lim /t1 (x) = f(x) over the base B in T. But h E B, and B is a base in T, not in B . B Can we pass to this limit in such a way that t1 remains in B? b) Explain where the completeness of JR was used in the proof of the Cauchy criterion for uniform convergence of a family of functions ft : X ---+ JR. c) Notice that if all the functions of the family {ft : X ---+ JR; t E T} are constant, then the theorem proved above is precisely the Cauchy criterion for the existence of the limit of the function r.p : T ---+ JR over the base B in T.
6. Prove that if the family of continuous functions ft E C(J, JR) on the closed interval I = { x E JR I a ::; x ::; b} converges uniformly on the open interval ]a, b[, then it converges uniformly on the entire closed interval [a, b] .
16 Uniform Convergence and Basic Operations of Analysis
372
1 6 . 2 Uniform Convergence of Series of Functions 1 6 . 2 . 1 Basic Definitions and a Test for Uniform Convergence of a Series :
{an X ---+ C; n E N} be a sequence of complex-valued (in particular real-valued) functions. The series L an (x) converges or converges n= l uniformly on the set E X if the sequence { s m (x) � an (x); n E N} nl converges or converges uniformly on E. Definition 1. Let
00
C
=
=
m
s m (x) L= an (x), as in the case of numerical nl series, is called the partial sum or, more precisely, the mth partial sum of the series L= an (x) . n l Definition 3 . The sum of the series is the limit of the sequence of its partial Definition 2 . The function 00
sums.
Thus, writing =
00
s(x) L an (x) on E n= l means that s m (x) ---+ s(x) on E as m ---+ oo , and writing the series L a n (x) converges uniformly on E n= l means that s m (x) s(x) on E as m ---+ oo . 00
:::4
Investigating the pointwise convergence of a series amounts t o investigat ing the convergence of a numerical series, and we are already familiar with that .
Example 1 .
Earlier we defined the function exp exp z
:=
00
L 1n1 . z n , n =O
:
C ---+ C by the relation (16.4)
after first verifying that the series on the right converges for every value
z E C.
In the language of Definitions 1-3 one can now say that the series (16.4) n converges on the entire complex plane, and the of functions a n (z) = �z . n function exp z is its sum. By Definitions 1 and 2 just adopted a two-way connection is established between series and their sequences of partial sums: knowing the terms of the
373
16.2 Uniform Convergence of Series of Functions
series, we obtain the sequence of partial sums, and knowing the sequence of partial sums, we can recover all the terms of the series: the nature of the convergence of the series is identified with the nature of the convergence of its sequence of partial sums.
Example 2. In Example 5 of Sect . 16.1 we constructed a sequence {fm ; m E N} of functions that converge to the Dirichlet function 'D( x) on If we set a 1 (x) fi (x) and an (x) fn (x) - fn - l (x) for n > 1, we obtain a series 2::: a (x) that will converge on the entire number line, and 2:::= an (x) 'D(x). n= l n nl Example 3. It was shown in Example 9 of Sect. 16.1 that the sequence of functions fn ( x) x n -x 2n converges, but nonuniformly, to zero on the closed interval [0, 1]. Hence, setting a 1 (x) fi (x) and a n (x) = fn (x) - fn - l (x) for n > 1, we obtain a series 2:::= an (x) that converges to zero on the closed nl interval [0, 1], but converges nonuniformly. =
=
00
=
00
00
1Ft
=
=
The direct connection between series and sequences of functions makes it possible to restate every proposition about sequences of functions as a corresponding proposition about series of functions. Thus, in application to the sequence { s n : X --+
(16 . 5)
1, we obtain the following theorem. Theorem 1. ( Cauchy criterion for uniform convergence of a series ) . The series 2:::= an (x) converges uniformly on a set E if and only if for every > 0 n l N E N such that there exists (16.6) for all natural numbers m, n satisfying m � n > N and every point x E E. Proof. Indeed, setting n 1 m, n 2 n - 1 in (16.5) and assuming that sn (x) is the partial sum of the series, we obtain inequality (16.6), from which relation (16 . 5) in turn follows with the same notation and hypotheses of the From this, taking account of Definition 00
c:
=
theorem.
D
=
Remark 1 . We did not mention the range of values of the functions an (x) in the statement of Theorem 1, taking for granted that it was IR or
374
16 Uniform Convergence and Basic Operations of Analysis
Remark 2. If under the hypotheses of Theorem 1 all the functions an (x) are constant, we obtain the familiar Cauchy criterion for convergence of a 00 numerical series L a n . n= l
Corollary 1 . ( Necessary condition for uniform convergence of a series ) . A 00
necessary condition for the series L= an (x) to converge uniformly on a set n l E is that an 0 on E as n --+ Proof. This follows from the definition of uniform convergence of a sequence to zero and inequality (16.6) if we set m n in it. Example 4. The series (16.4) converges on the complex plane C nonuniformly, since sup I rh z n I for every n E N, while by the necessary condition EIC ::::+
oo .
=
0
= oo
z
for uniform convergence, the quantity sup l an (x) l must tend to zero when xE E
uniform convergence occurs. 00
L z , as we know, converges in the unit disk K = n= l n {z E q l zl < 1}. Since I z: I < � for z E K , we have z: ::::+ 0 on K as n --+ oo.
Example 5.
The series
n
The necessary condition for uniform convergence is satisfied; however, this series converges nonuniformly on K. In fact, for any fixed n E N, by the continuity of the terms of the series, if z is sufficiently close to 1, we can get the inequalities
n
I zn
+
...+
z2n
I
�
>
1.!_
+
...
+
_!_
I
>
�
.
2n 2 n 2n 4 From this we conclude by the Cauchy criterion that the series does not converge uniformly on K. 1 6 . 2 . 2 The Weierstrass M-test for Uniform Convergence of a Series Definition 4. The series
00
L a (x) n= l n
converges absolutely on the set E if the
corresponding numerical series converges absolutely at each point 00
00
x E E.
If the series L= an ( x) and L= bn ( x) are such that I an ( x) I ::; n l n l bn ( x) for every x E E and for all sufficiently large indices n E N, then the uniform convergence of the series L= bn (x) on E implies the absolute and n l uniform convergence of the series L= an (x) on the same set E. Proposition 1 .
00
00
n l
375
16.2 Uniform Convergence of Series of Functions
Proof. Under these assumptions for all sufficiently large indices n and m (let n :S m ) at each point x E E we have l an (x) + · · · + am (x) l :S l an (x) l + · · · + l am ( x ) l :S :S bn ( x ) + · · · + bm (x) = l b n ( x ) + · · · + bm (x) l .
By the Cauchy criterion and the uniform convergence of the series L: bn ( x ) , for each c > 0 we can exhibit an index N E N such that 00
n=l lbn (x) + · · · + bm ( x ) l < for all m 2: n > N and all x E E. But then it follows from the inequalities just written and the Cauchy criterion that the series L: an ( x) and L: I a n ( x) I both converge uniformly. D n=l n=l Corollary 2. (Weierstrass' M-test for uniform convergence of a series) . If for the series n=l L: an (x) one can exhibit a convergent numerical series L: Mn n=l such that sup lan (x) l :S Mn for all sufficiently large indices n E N, then the 00
00
c
00
00
00
xE E
series n=l L: an ( x) converges absolutely and uniformly on the set E. Proof. The convergent numerical series can be regarded as a series of constant functions on the set E, which by the Cauchy criterion converges uniformly on E. Hence the Weierstrass test follows from Proposition 1 if we set bn ( x ) Mn =
in it. D
The Weierstrass M-test is the simplest and at the same time the most frequently used sufficient condition for uniform convergence of a series. As an example of its application, we prove the following useful fact . 00
If a power series n=O L: en (z - z0) n converges at a point ( =I zo , then it converges absolutely and uniformly in any disk Kq {z E C l l z - zo l < ql( - zo l } , where 0 < q < 1. Proof. By the necessary condition for convergence of a numerical series it
Proposition 2.
=
00
follows from the convergence of the series L: en ( ( - zo ) n that Cn ( ( - z) n -+ 0 n=O as n -+ oo. Hence for all sufficiently large values n of n E N we have the estimates l en (z - zo ) n l = len (( - zo) n l · I (=�� I :S l en (( - zo) n l · q n < qn 00 in the disk Kq . Since the series L: qn converges for lql < 1, the estimates n=O len (z - zo) n l < qn and the Weierstrass M-test now imply Proposition 2. D
Comparing this proposition with the Cauchy-Hadamard formula for the radius of convergence of a power series (see Eq. (5.115)), we arrive at the following conclusion.
376
16 Uniform Convergence and Basic Operations of Analysis
( Nature of convergence of a power series ) . A power zo) n converges in the disk K = {z E C l l z - zo l < R}
series whose I: c (z n=O n radius of convergence1 is determined by the Cauchy-Hadamard formula1 R = ( nlim \I[CJ) - . Outside this disk the series diverges. On any closed --+ oo disk contained in the interior of the disk K of convergence of the series, a power series converges absolutely and uniformly. Remark 3. As Examples 1 and 5 show, the power series need not converge Theorem 2. 00
uniformly on the entire disk K. At the same time, it may happen that the power series does converge uniformly even on the closed disk K. 00
Example 6. The radius of convergence of the series I:= � is 1 . But if l z l :::; 1 , n1 then I� I :::; r& , and by the Weierstrass M-test this series converges absolutely and uniformly in the closed disk K = {z E C l l z l :::; 1}. n
1 6 . 2 . 3 The Abel-Dirichlet Test
The following pairs of related sufficient conditions for uniform convergence of a series are somewhat more specialized and are essentially connected with the real-valuedness of certain components of the series under consideration. But these conditions are more delicate than the Weierstrass M-test, since they make it possible to investigate series that converge, but nonabsolutely. Definition 5 . The family F of functions
on a set E for every
C
f
X if there exists a number M
f E F.
X ---+ C is uniformly bounded E lR such that xEE sup l f(x) l :::; M
:
{bn : X ---+ JR; n E N} is called non decreasing (resp. nonincreasing) on the set E X if the numerical sequence {bn (x); n E N} is nondecreasing (resp. nonincreasing) for every x E E. Non decreasing and nonincreasing sequences of functions on set are called mono tonic sequences on the set.
Definition 6 . The sequence of functions
C
We recall ( if necessary, see Subsect. 5.2.3) the following identity, called
Abel 's transformation: m m-1 2.: a k b k A m bm - A n - 1 bn 2.: A k (bk - b k + l ) , k= n k=n where a k = A k - A k - 1 , k = n, . . . , m. =
1 In the exceptional case when lim n -+ oo degenerates to the single point z0 .
+
VfCJ =
oo ,
we take R
=
(16.7)
0 and the disk K
16.2 Uniform Convergence of Series of Functions
377
is a monotonic sequence of real numbers, then, even if , +. ,1a, . . .are, bmcomplex or vectors of a normed space, one can aobtain ,n Ifanb+nthe 1 , b. n. following m estimate,numbers which we need, from the identity (16.7): m (16.8) I k=Ln akbk l :S 4 n-max 1
Proof.
In fact,
I � Ak (bk - bk- 1 ) 1 :S 1 � A I + I b I ( + I b I kl · n m n -1
I Am bm l I An- 1bn l +
=
+
max
=
L...t
n-1:5maxk:S m I Akl · ( l bm l l bn l + I bn - bm l ) :S +
=
In the equality that occurs in this computation we used the monotonicity of the numerical sequence D
bk.
Proposition 3. ( The Abel-Dirichlet test for uniform convergence ) .
oo
A suf-
ficient condition for uniform convergence on E of a series I:=1 an (x)bn (x) whose terms are products of complex-valued functions an : X n-+ C and realvalued functions bn : X -+ IR is that either of the following pairs of hypotheses be satisfied: k ) the partial sums s k (x) I: an (x) of the series I: a n (x) are unin=1 n=1 formly bounded on E; /h) the sequence of functions bn (x) tends monotonically and uniformly to zero on E; or ) the series I: an (x) converges uniformly on E; n=1 /h) the sequence of functions bn (x) is monotonic and uniformly bounded on E. Proof. The monotonicity of the sequence bn (x) allows us to to write an esti mate analogous to (16.8) for each x E E: 00
=
o: I
00
o:2
I k=�n ak (x)bk (x) l :S n- 1
where we take
4
max
sk (x) - Sn- 1 (x) as Ak (x).
max
)
378
16 Uniform Convergence and Basic Operations of Analysis
If the hypotheses o: 1 ) and (31 ) hold, then, on the one hand, there exists a constant M such that I A k ( x) I ::; M for all k E N and all x E E, while on the other hand, for any number c > 0 we have max { l bn (x) l , l bm (x) l } < 4� for all sufficiently large n and m and all x E E. Hence it follows from (16.8) that a k (x)b k (x) < c for all sufficiently large n and m and all x E E, that is,
I k�n
l
the Cauchy criterion holds for this series. In the case of hypotheses o: 2 ) and f32 ) the quantity max { l bn (x) l , l bm (x) l } is bounded. At the same time, by the uniform convergence of the series 00 I: an (x) and the Cauchy criterion, for every c > 0 we have I A k (x) l =
n= 1 S l s k (x) - n - 1 (x) l
< c for all sufficiently large n and k > n and all x E E. Taking this into account, we again conclude from (16.8) that the Cauchy criterion for uniform convergence holds for this series. D Remark 4. In the case when the functions an and bn are constants Propo sition 3 becomes the Abel-Dirichlet criterion for convergence of numerical series. Example 7. Let us consider the convergence of the series
(16.9) Since
1 1 inx l _ 1
e (16.10) - n"' ' n"' the necessary condition for uniform convergence does not hold for the series (16.9) when o: ::; 0, and it diverges for every x E R Thus we shall assume o: > 0 from now on. If o: > 1, we conclude from the Weierstrass M-test and (16.10) that the series (16.9 ) converges absolutely and uniformly on the entire real line R To study the convergence for 0 < o: ::; 1 we use the Abel-Dirichlet test, setting a n (x) = e i nx and bn (x) = n1a . Since the constant functions bn (x) are monotonic when o: > 0 and obviously tend to zero uniformly for x E JR, it 00 remains only to investigate the partial sums of the series I: ei nx .
n= 1
n
For convenience in citing results below, we shall consider the sums I: ei kx , k =O which differ from the sums of our series only in the first term, which is 1. Using the formula for the sum of a finite geometric series and Euler ' s formula, we obtain successively for x =/= 2nm, m E Z, · 1 x -2 x ei( n + 1 ) x - 1 sin n.=_+2 _ e' � . x ei - 1 Sin � e' 2 __
sin n +2 1 X . sin �
---=--,----- e ' 2 n
•
x -
n x = sin n +2 1 ( cos -x sin �
X
2
+
n i sin -x 2
)
(
16.11 )
379
16.2 Uniform Convergence of Series of Functions
Hence, for every n E
N
1_ ei k x l < _ L sm 2 I k=O n
-
X
•
(16.12)
'
from which it follows by the Abel-Dirichlet criterion that for 0 < a $ 1 the series (16.9) converges uniformly on every set E C lR on which inf I sin i I > x EE 0. I n particular the series (16.9) simply converges for every x =/= 2 rr m , m E Z. If x = 2 rrm , then ei n 2 7r m = 1, and the series (16.9) becomes the numerical 00 series 2: ..1., , which diverges for 0 < a $ 1. n= l We shall show that from what has been said, one can conclude that for 0 < a $ 1 the series (16.9) cannot converge uniformly on any set E whose closure contains points of the form 2 rr m , m E Z. For definiteness, suppose 00 0 E E. The series 2: ..1., diverges for 0 < a $ 1. By the Cauchy criterion,
n= l
there exists c > 0 such that for every N E N, no matter how large, one can find numbers m 2: n > N such that ..1., + · · · + �" > co > 0. By the continuity of the functions ei k x on JR, it follows that one can choose a point x E E close enough to 0 so that
I
I
lei nx eimx > co . + · · · + -n <>
m <>
I
But by the Cauchy criterion for uniform convergence this means that the series (16.9) cannot converge uniformly on E. To supplement what has just been said, we note that , as one can see from
(16.10), the series (16.9) converges nonabsolutely for 0 < a $ 1. Remark 5. It is useful for what follows t o remark that, separating the real and imaginary parts in (16.11), we obtain the following relations: n n+l (16.13) L: cos kx = cos !!2 x · sin 2 k=O n
L: sin kx = k=O
which hold for x =/= 2 rrm , m
E Z.
X
sin i
n+l sin !!x 2 · sin 2 X sin i
(16.14)
As another exmaple of the use of the Abel-Dirichlet test we prove the following proposition.
If a power series n=O L c ( converges at a point ( E C, then it converges uniformly on the closed interval with endpoints and ( . Proposition 4. (The so-called second Abel theorem on power series. ) 00
..
z
-
z0 ) n
zo
16 Uniform Convergence and Basic Operations of Analysis
380
Proof. We represent the points of this interval in the form zo + (( - zo)t, where 0 ::; t ::; 1 . Substituting this expression i n the power series, we obtain 00 00 the series L en (( - zo) n t n . By hypothesis, the numerical series L en (( -
=O n =O and the sequence of functions t n is monotonic andnuniformly z0) n converges,
bounded on the closed interval [0, 1] . Hence conditions a 2 ) and /32 ) in the Abel-Dirichlet test are satisfied, and the proposition is proved. D 1 6 . 2 . 4 Problems and Exercises 1 . Investigate the nature of the convergence on the sets E C lR for different values of the real parameter a in the following series:
a) E co::x . =l CXl
n
b) 2: si�:x . CXl
2.
n= l Prove that the following series converge uniformly on the indicated sets: /_ 1 \n a) I: �x n for 0 � x � 1 . n= l n b) E ( -� ) e - n x for 0 � X � +oo. = l n f _ l \n c) I: '* for 0 � x � +oo. = l n CXl
CXl
Show that if a Dirichlet series I: � converges at a point xo E JR, then it n= l converges uniformly on the set x � xo and absolutely if x > xo + 1 . 3.
CXl
= n-1 2 Verify that the series I: ( (ft2)'':' converges uniformly on JR, and the series
n= l I: ( l ;:2 ) n converges on JR, but nonuniformly. n= l 5 . a) Using the example of the series from Problem 2, show that the Weierstrass M-test is a sufficient condition but not a necessary one for the uniform convergence of a series. b) Construct a series I: a n (x) with nonnegative terms that are continuous on n= l the closed interval 0 � x � 1 and which converges uniformly on that closed interval, while the series I: Mn formed from the quantities M = maxl i a n (x) l diverges. O � :z: � n=l 4.
CXl
CXl
CXl
6. a) State the Abel-Dirichlet test for convergence of a series mentioned in Re mark 4. b) Show that the condition that {bn } be monotonic in the Abel-Dirichlet test can be weakened slightly, requiring only that the sequence {bn } be monotonic up to corrections {.Bn } forming an absolutely convergent series.
16.3 Functional Properties of a Limit Function
381
7. As a supplement to Proposition 4 show, following Abel, that if a power series converges at a boundary point of the disk of convergence, its sum has a limit in that disk when the point is approached along any direction not tangential to the boundary circle.
1 6 . 3 Functional Properties of a Limit Function 1 6 . 3 . 1 Specifics of the Problem
In this section we shall give answers to the questions posed in Sect . 16.1 as to when the limit of a family of continuous, differentiable, or integrable functions is a function having the same property, and when the limit of the derivatives or integrals of the functions equals the derivative or integral of the limiting function of the family. To explain the mathematical content of these questions, let us consider, for example, the connection between continuity and passage to the limit. Let fn (x) -+ f(x) on lR as n -+ oo , and suppose that all the functions in the sequence {fn : n E N} are continuous at the point xo E R We are interested in the continuity of the limit function f at the same point xo. To answer that question, we need to verify the equality xlim --+ x o f(x) = f(xo), which in terms of the original sequence can be rewritten as the relation or, taking account of the given continuity lim lim f = lim f x --+ xo ( n --+oo n (x)) n --+oo n (x0), of fn at x0, as the following relation, subject to verification:
--+oo fn (x)) x --t x o ( nlim lim
= nlim --too ( xlim --t x o fn (x) ) .
(16.15)
On the left-hand side here the limit is first taken over the base n -+ oo , then over the base x -+ x0, while o n the right-hand side the limits over the same bases are taken in the opposite order. When studying functions of several variables we saw that Eq. (16.15) is by no means always true. We also saw this in the examples studied in the two preceding sections, which show that the limit of a sequence of continuous functions is not always continuous. Differentiation and integration are special operations involving passage to the limit. Hence the question whether we get the same result if we first differentiate (or integrate) the functions of a family, then pass to the limit over the parameter of the family or first find the limit function of the family and then differentiate (or integrate) again reduces to verifying the possibility of changing the order of two limiting passages. 16.3.2 Conditions for Two Limiting Passages to Commute
We shall show that if at least one of two limiting passages is uniform, then the limiting passages commute.
16 Uniform Convergence and Basic Operations of Analysis
382
Let { Ft ; t E T} be a family of functions Ft : X --+ C depending on a parameter t; let Bx be a base in X and Br a base in T. If the family converges uniformly on X over the base Br to a function F : X --+ C and the limit lim Ft ( x ) = At exists for each t E T, then both repeated limits Bx lim ( lim Ft ( x )) and lim ( lim Ft ( x )) exist and the equality BT Bx Bx BT Theorem 1 .
(16.16) holds. This theorem can be conveniently written as the following diagram: Ft ( x ) ======* F ( x )
Bx 1 / At
Br / / /
__l_ Br
31
Br
(16.17)
A
in which the hypotheses are written above the diagonal and the consequences below it . Equality (16.17) means that this diagram is commutative, that is, the final result A is the same whether the operations corresponding to passage over the upper and right-hand sides are carried out or one first passes down the left-hand side and then to the right over the lower side. Let us prove this theorem.
Proof. Since Ft :::t F on X over Br, by the Cauchy criterion, for every c: > there exists Br in Br such that
0
(16.18) for every h, t 2 E Br and every x E X. Passing to the limit over Bx in this inequality, we obtain the relation
(16.19) which holds for every h , t 2 E Br . By the Cauchy criterion for existence of the limit of a function it now follows that At has a certain limit A over Br . We now verify that A = lim F ( x ) . Bx Fixing t 2 E Br, we find an element Bx in Bx such that
(16 . 20)
for all x E Bx . Keeping t 2 fixed, we pass to the limit in respect to h . We then find
(16.18) and (16.19) over Br with
383
16.3 Functional Properties of a Limit Function
(16.21) I F(x) - Ft2 (x) l :::; E , (16.22) l A - A t2 1 :::; E ' and (16.21) holds for all x E X. Comparing (16.20)-(16.22), and using the triangle inequality, we find I F(x) - A I < 3E for every x E B We have thus verified that A = lim F( x). 0 Bx Remark 1 . As the proof shows, Theorem 1 remains valid for functions Ft x.
X ---+ Y with values in any complete metric space.
If we add the requirement that the limit lim A t = A exists to the Br hypotheses of Theorem 1, then, as the proof shows, the equality lim F(x) = A Bx can be obtained even without assuming that the space Y of values of the functions Ft : X ---+ Y is complete.
Remark 2.
16.3.3 Continuity and Passage to the Limit
We shall show that if functions that are continuous at a point of a set converge uniformly on that set, then the limit function is also continuous at that point.
Let {ft; t E T} be a family of functions ft : X ---+ C depending on the parameter t; let B be a base in T. If ft f on X over the base B and the functions ft are continuous at x0 E X, then the function f : X C is also continuous at that point. Proof. In this case the diagram (16.17) assumes the following specific form:
Theorem 2.
:4
--t
Here all the limiting passages except the vertical passage on the right are defined by the hypotheses of Theorem 2 itself. The nontrivial conclusion of Theorem 1 that we need is precisely that lim f ( x) = f ( x0). 0
x -+ x o
Remark 3. We have not said anything specific as to the nature of the set X . I n fact it may b e any topological space provided the base x ---+ x0 i s defined in it. The values of the functions ft may lie in any metric space, which, as follows from Remark 2, need not even be complete.
384
16 Uniform Convergence and Basic Operations of Analysis
If a sequence of functions that are continuous on a set con verges uniformly on that set, then the limit function is continuous on the set. Corollary 2. If a series of functions that are continuous on a set converges uniformly on that set, then the sum of the series is also continuous on the set. Corollary 1 .
As an illustration of the possible use of these results, consider the follow ing.
Example 1. Abel 's method of summing series. Comparing Corollary 2 with Abel ' s second theorem ( Proposition 4 of Sect. 16.2), we draw the following conclusion. 00 Proposition 1 . If a power series :L cn ( z - z0 ) n converges at a point (, it n=O converges uniformly on the closed interval [z0 , (] from z0 to (, and the sum of the series is continuous on that interval. In particular, this means that if a numerical series :L Cn converges, then 00
n=O n the power series :L Cn X converges uniformly on the closed interval n=O 00 n of the real axis and its sum = L Cn X is continuous on that interval. n=O 00 00 Since = :L cn , we can thus assert that if the series :L Cn converges, n=O n=O 00
0 :-::; x :-::; 1
s(x)
s(1)
then the following equality holds: 00
L Cn
n=O
=
00
(16.23)
L Cn X n . x-+ 1 -0 lim
n=O
It is interesting that the right-hand side of Eq. (16.23) may have a meaning even when the series on the left diverges in its traditional sense. For example, the series 1 - 1 + 1 - · · · corresponds to the series x - x 2 + x 3 - · · · , which converges to x/(1 + x) for l x l < 1. As x --+ 1, this function has the limit 1/2. The method of summing a series known as Abel summation consists of ascribing to the left-hand side of (16.23) the value of the right-hand side if it 00 is defined. We have seen that if the series :L Cn converges in the traditional n=O sense, then its classical sum will be assigned to it by Abel summation. At the 00 same time, for example, Abel ' s method assigns to the series L ( - 1) n , which n=O
diverges in the traditional sense, the natural average value 1/2. Further questions connected with Example 1 can be found in Problems 5-8 below.
16.3 Functional Properties of a Limit Function
385
Example 2. Earlier, when discussing Taylor ' s formula, we showed that the following expansion holds: ( 1 + x)" = a a ( a - 1 ) 2 . . . a ( a - 1 ) · · · ( a - n + 1) n + . . . ( 16_24) x + + = 1+ x+ x 1! 2! n! We can verify that for a > 0 the numerical series a (a - 1 ) · · · (a - n + 1 ) + a a (a - 1 ) 1+ + +···+ ··· 1! 2! n! converges. Hence by Abel ' s theorem, if a > 0, the series ( 16.24) converges uniformly on the closed interval 0 � x � 1 . But the function ( 1 + x)" is continuous at x = 1, and so one can assert that if a > 0, then Eq. ( 16.24) holds also for x = 1 . In particular, we can assert that for a > 0 ( - 1) 4 . . . ( 1 - t2 ) a = 1 - a t2 + a a t 1! 2! . . . + ( _ 1) n . a ( a - 1 ) · · · ( a - n + 1) t 2 n + . . . ( 1 6_25) n! 2 and this series converges to ( 1 - t ) " uniformly on [- 1 , 1] . Setting a = ! and t 2 = 1 - x 2 in ( 16.25) , for l x l � 1 we find
l x l = 1 - 1!! ( 1 - x 2 ) + H !2!- 1) ( 1 - x 2 ) 2 - . . .
( 16.26) ' and the series of polynomials on the right-hand side converges to l x l uniformly on the closed interval [- 1 , 1] . Setting Pn (x) := Bn (x) - Bn (O), where Bn (x) is the nth partial sum of the series, we find that for any prescribed tolerance c > 0 there is a polynomial P(x) such that P(O) = 0 and max
- l :S x ::;!
l l x l - P(x) l < c .
( 16.27)
Let us now return to the general theory. We have shown that continuity of functions is preserved under uniform passage to the limit. The condition of uniformity in passage to the limit is, however, only a sufficient condition in order that the limit of continuous func tions also be a continuous function ( see Examples 8 and 9 of Sect. 16. 1 ) . At the same time there is a specific situation in which the convergence of con tinuous functions to a continuous function guarantees that the convergence is uniform. Proposition 2. ( Dini ' s 2 theorem ) . If a sequence of continuous functions on
a compact set converges monotonically to a continuous function, then the convergence is uniform. 2 U. Dini (1845-1918) - Italian mathematician best known for his work in the theory of functions.
386
16 Uniform Convergence and Basic Operations of Analysis
Proof. For definiteness suppose that fn is a nondecreasing sequence converg ing to f. We fix an arbitrary E > 0, and for every point x of the compact set K we find an index n x such that 0 :S f ( x) - fn x ( x) < E. Since the func tions f and fn x are continuous on K, the inequality 0 :S f(�) - fn x (�) < E holds in some neighborhood U(x) of x E K. From the covering of the compact set K by these neighborhoods one can extract a finite covering U(x i ), . . . , U(x k ) and then fix the index n(s) = max {nx , , . . . , nx k }. Then for any n > n( E ) , by the fact that the sequence { !n ; n E N} is nondecreasing, we have 0 :S f(O - fn (�) < E at every point � E K. 0 00
If the terms of the series I:= an ( x) are nonnegative functions n l set K and the series converges an : K � that are continuous on a compact to a continuous function on K, then it converges uniformly on K . n Proof. The partial sums Sn ( x) = I:= a k ( x) of this series satisfy the hypothe k l ses of Dini ' s theorem. 0 Example 3. We shall show that the sequence of functions fn (x) = n(l -x 1 1n ) tends to f(x) = ln � as n ---+ + oo uniformly on each closed interval [a, b] contained in the interval 0 < x < oo. Proof. For fixed x > 0 the function x t = et x is convex with respect to t, so that the ratio x t =� (the slope of the chord) is nonincreasing as t + 0 and tends to ln x. Hence fn (x) /' ln � for x > 0 as n ---+ + oo. By Dini ' s theorem it now follows that the convergence of fn ( x) to ln � is uniform on each closed interval [a, b] c]O, + oo[. o Corollary 3 . ---+
t
0
ln
---+
We note that the convergence is obviously not uniform on the interval
0 < x :S 1 , for example, since ln � is unbounded in that interval, while each of the functions fn ( x) is bounded (by a constant depending on n). 1 6 . 3 . 4 Integration and Passage to the Limit
We shall show that if functions that are integrable over a closed interval converge uniformly on that interval, then the limit function is also integrable and its integral over that interval equals the limit of the integrals of the original functions. ---+
Let Ut ; t E T} be a family of functions ft : [a, b] C defined on a closed interval a :S x :S b and depending on the parameter t E T, and let B be a base in T. If the functions of the family are integrable on [a, b] and ft f on [a, b] over the base B, then the limit function f : [a, b] ---+ C is also Theorem 3. :::::t
16.3 Functional Properties of a Limit Function
387
integrable on [a, b] and b
b
J f(x) dx = ffl J ft (x) x li
a
d .
a
Proof. Let p = (P, �) be a partition P of the closed interval [a, b] with dis tinguished points � = { 6 , . . . , �n }. Consider the Riemann sums Ft (p) = n 2:::: ft (�i ) iJ.x i , t E T, and F(p) = 2:::n : f(�i )iJ.x i . Let us estimate the difference
i= l i= l F(p) - Ft (p). Since ft f on [a, b] over the base B, for every E > 0 there exists an element B of B such that [f(x) - ft (x) [ < b :_ a at any t E B and any point x E [a, b]. Hence for t E B we have �
B,
and this estimate holds not only for every t E but also for every parti tion p in the set P = { (P, �)} of partitions of the closed interval [a, b] with distinguished points. Thus Ft � F on P over the base B. Now, taking the traditional base >.. ( P) -+ 0 in P, we find by Theorem 1 that the following diagram is commutative:
n i= l ft(�i )iJ.xi =: Ft (P) .2::::
>. ( P ) -+ 0
b
l
n F(p) := .2:::: f(�i )iJ.x i / i= l /
=====*
/
-
/
3
l >. (P) -+0 b
J ft(x) dx =: A t A := aJ f(x) dx a which proves Theorem 3. ()() Corollary 4. If the series .2:::: fn ( x) consisting of integrable functions on a n = l uniformly on that closed interval, then its closed interval [a, b] lR converges sum is also integrable on [a, b] and D
C
J ( � fn (x) ) dx = � � fn (x) dx . b
a
()()
b
()()
a
When we write si� x in this example, we shall assume that this ratio equals 1 when x = 0. X We have noted earlier that the function Si ( x) = J si� t dt is not an ele
Example 4-
o
mentary function. Using the theorem just proved, we can nevertheless obtain a very simple representation of this function as a power series.
388
16 Uniform Convergence and Basic Operations of Analysis
To do this, we remark that sin t
t
=
� (-1) n t 2n ' � (2n 1) !
(16.28)
+
and the series on the right-hand side converges uniformly on every closed interval [ - a , a ] C R The uniform convergence of the series follows from the Weierstrass M-test , since d��:: ) ! ::; ( 2�:1 ) ! for l t l ::; a, while the numerical
2n
series 2: 2 �+ 1 ! converges. n =O ( ) By Corollary 4 we can now write 00
Si (x) =
J0 ( nf=O (2n(-1) n1 ) 1. t2n ) t = oo oo n 2n 1 n - 2: ( J (2n(-1) 1) ! t 2n ) - 2: (2n(-1)1) !x(2n+ 1) n=O n =O X
d
+
X
0
+
dt
-
+
+
The series just obtained also turns out to converge uniformly on every closed interval of the real line, so that , for any closed interval [ a , b] of variation of the argument x and any preassigned absolute error tolerance, one can choose a polynomial - a partial sum of this series - that makes it possible to compute Si (x) with less than the given error at every point of the closed interval [a , b] . 1 6 . 3 . 5 Differentiation and Passage to the Limit
Let Ut ; t E T} be a family of functions ft : X --+ C defined on a convex bounded set X (in IR, C, or any other normed space) and depending on the parameter t E T; let B be a base in T. If the functions of the family are differentiable on X, the family of derivatives {fl; t E T} converges uniformly on X to a function : X --+ C, and the original family Ut ; t E T} converges at even one point x0 E X, then it converges uniformly on the entire set X to a differentiable function f : X --+ C, and f ' = Proof. We begin by showing that the family Ut ; t E T} converges uniformly
Theorem 4.
cp
cp .
on the set X over the base B. We use the mean-value theorem in the following estimates:
l ft, (x) - ft 2 (x) l ::; ::; I ( ft , ( x) - ft 2 ( x)) - (ft , ( Xo) - ft 2 ( Xo)) I + I ft , ( xo) - ft 2 ( xo) I ::; ::; sup l fl, (�) - fl2 (�) 1 1 x - xo l l ft , (xo) - ft 2 (xo) l = L1(x , t 1 , t 2 ) . �E [x o, x ) +
16.3 Functional Properties of a Limit Function
389
By hypothesis the family {f£; t E T} converges uniformly on X over the base B, and the quantity ft(x0) has a limit over the same base as a function of t, while l x - x0 1 is bounded for x E X. By the necessity part of the Cauchy criterion for uniform convergence of the family of functions f£ and the existence of the limit function ft (x0), for every s > 0 there exists B in l3 such that L1(x, t 1 , t 2 ) < s for any t 1 , t 2 E B and any x E X. But , by the estimates just written, this means that the family of functions {ft; t E T} satisfies the hypotheses of the Cauchy criterion and consequently converges on X over the base l3 to a function f : X -+ C. Again using the mean-value theorem, we now obtain the following esti mates:
I (ft 1 (x + h) - ft 1 (x) - 1;1 (x )h) - (ft 2 (x + h) - ft2 (x) - 1;2 (x )h ) I = = I Ut 1 - fto ) (x + h) - (fh - ft2) (x) - ( ft 1 - ft J ' (x)h l ::::; ::::; sup I Ut 1 - ft2) ' (x + B h) l l h l + I Ut 1 - ft2) ' (x) l l h l = 0 <11< 1 sup 1 1; 1 (x + B h) - J;2 (x + B h) l + 11; 1 (x) - J;2 (x) 1 ) lhl . = ( 0 <11< 1 These estimates, which are valid for x, x + h E X show, in view of the uniform convergence of the family {f£; t E T} on X, that the family { Ft ; t E T} of functions ft(x) - J; (x)h ' t (h) = ft (x + h) - lhl which we shall consider with a fixed value of x E X, converges over the base l3 uniformly with respect to all values of h =I 0 such that x + h E X. We remark that Ft(h) -+ 0 as h -+ 0 since the function ft is differentiable at the point x E X; and since ft -+ f and f£ -+ cp over the base B, we have Ft ( h ) -+ F( h ) = f(x+h) - (�� ) -
_ .
F,
1
p
�
. _
1
16 Uniform Convergence and Basic Operations of Analysis
390
=
=
2::: f� ( x) converges uniformly on X, then 2::: f ( x) also converges uniformly non= 1X, its sum is differentiable on X, and n= 1 n fn (x) ' (x) f:, (x) .
(� )
=
�
This follows from Theorem 4 and the definitions of the sum and uni form convergence of a series, together with the linearity of the operation of differentiation.
Remark 4. The proofs of Theorems 3 and 4, like the theorems themselves and their corollaries, remain valid for functions ft : X --+ Y with values in any complete normed vector space Y. For example, Y may be IR, e, !Rn , e n , C[a , b], and so on. The domain of definition X for the functions ft in Theorem 4 also may be any suitable subset of any normed vector space. In particular, X may be contained in IR, e, !R n , or e n . For real-valued functions of a real argument (under additional convergence requirements) the proofs of these theorems can be made even simpler (see Problem 11). As an illustration of the use of Theorems 2-4 we shall prove the following proposition, which is widely used in both theory and in specific computations. C
Let K e be the convergence disk for a power series n 2::: c (z - z0) . If K contains more than just the point zo, then the sum nof=Othen series f(z) is differentiable inside K and (16.29) f'(z) L ncn (z - zo) " - 1 . n= 1 Moreover, the function f ( z) : K --+ e can be integrated over any path "( : [0, 1] --+ K, and if [0, 1] t r--2-..t z(t) E K, z(O) zo, and z(1) z, then � (z - z0) n + 1 . (16 . 30) J f(z) dz � L...n =O n + 1 1 Remark 5. Here J f(z) dz J0 f ( z(t) ) z'(t) dt. In particular, if the equality f(x) 2:::=O an(x - xo) n holds on an interval - R < x - xo < R of the real n Proposition 3 . =
=
3
I
=
=
line IR, then
=
=
=
:=
I
X
J f(t )
xo
dt
=
=
L� (x - xo) n + l . n 1 + =O n
=
16.3 Functional Properties of a Limit Function
391
Since lim ---+ CXl n -� = nlim --+CXJ \lfcJ, it follows from the Cauchyn Hadamard formula (Theorem 2 of Subsect. 16.2.2 that the power series 00 I: ncn (z - z0) n - 1 obtained by termwise differentiation of the power series
Proof.
n= 1 I: ( z z0 ) , has the same convergence disk K as the original power series. n=O Cn - " But by Theorem 2 of Subsect. 16.2.2 the series I: ncn (z - z0) n - 1 converges n= 1 uniformly in any closed disk Kq contained in the interior of K. Since the series I: c n (z - z0) n obviously converges at z = z0 , Corollary 5 is applicable n=O to it, which justifies the equality (16.29). Thus it has now been shown that a 00
00
00
power series can be differentiated termwise. Let us now verify that it can also be integrated termwise. If 1 : [0, 1] --+ K is a smooth path in K, there exists a closed disk Kq such that 1 C Kq and Kq C K. On Kq the original power series converges uniformly, so that in the equality 00
f ( z(t) ) L cn ( z(t) - zo) n n =O =
the series of continuous functions on the right-hand side converges uniformly on the closed interval 0 ::::; t ::::; 1 to the continuous function f ( z(t)) . Multi plying this equality by the function z '(t), which is continuous on the closed interval [0, 1], does not violate either the equality itself nor the uniform con vergence of the series. Hence by Theorem 3 we obtain
1
1
00
J f(z(t) ) z'(t) dt = L J cn ( z(t) - zo) "z'(t) dt . 0
n=O o
But,
1
1
1 +1 J ( z(t) - z(O) ) " z'(t) dt = n +-1 J d ( z(t) - z(0) ) " = 0
0 1 + 1 1 -(z - zo) n + 1 , =n +-1 ( z(1) - z(0) ) " = n+1 and we arrive at Eq. (16.30). 0 Since it is obvious that Co = f(zo) in the expansion f(z) I: c n (z-zo) n , n=O n applying Eq. (16.29) successively, we again obtain the relation Cn = t< nl \. zo) , which shows that a power series is uniquely determined by its sum and is the =
Taylor series of the sum.
00
392
16 Uniform Convergence and Basic Operations of Analysis
Bessel function Jn (x) , n E N, is a solution of Bessel 's3 x 2 y " + xy' + (x2 - n2 )y 0 . Let us attempt to solve this equation, for example, for n 0, as a power 00 series y 2::: ck x k . Applying formula (16 . 29) successively, after elementary Example 5. equation =
The
=
=
k =O
transformations, we arrive at the relation 00
C 1 + :�:.)k 2 c k + Ck - 2 ) X k - l k =O
=
0,
from which, by the uniqueness of the power series with a given sum, we find
From this it is easy to deduce that
=
k 2, 3, . . . c2 k - l 0, k E N, =
( - 1) k ( k ! )?2 2 k . If we assume Jo(O) 1, we arrive at the solution =
k
00
2 '""" - 1) k X Jo(x) 1 + L.) (k!) 2 22 k =
k= l
and
c2 k
.
This series converges on the entire line lR (and in the entire plane C), so that all the operations carried out above in order to find its specific form are now justified.
Example 6. In Example 5 we sought a solution of an equation as a power series. But if a series is given, using formula (16.29), one can immediately check to see whether it is the solution of a given equation. Thus, by direct computation, one can verify that the function introduced by Gauss a(a + 1) · · · (a + n - 1) ,6 ( ,6 + 1) · · · (,6 + n - 1) x n F(a, j), ')', x) - 1 + � 6 ')'('/' + 1) · · · ('/' + n - 1) n= l
(the hypergeometric series) is well-defined for called hypergeometric equation
lxl
<
1 and satisfies the so
x(x - 1)y" - b - (a + ,6 - 1)x] · y' + aj) · y 0 . In conclusion we note that, in contrast to Theorems 2 and 3, the hy potheses of Theorem 4 require that the family of derivatives, rather than the original family, converge uniformly. We have already seen (Example 2 of Sect . 16.1) that the sequence of functions fn (x) � sin n2 x converges to the differ entiable function f(x) 0 uniformly, while the sequence of derivatives f� (x) does not converge to f' ( x). The point is that the derivative characterizes the =
=
3 F.W. Bessel ( 1 784-1846) - German astronomer.
=
16.3 Functional Properties of a Limit Function
393
rate of variation of the function, not the size of the values of the function. Even when the function changes by an amount that is small in absolute value, the derivative may formally change very strongly, as happens in the present case of small oscillations with large frequency. This is the circumstance that lies at the basis of Weierstrass ' example of a continuous nowhere-differentiable funcoo tion, which he gave as the series f(x) = I: an cos(bn nx) , which obviously n =O converges uniformly on the entire line lR if 0 < a < 1. Weierstrass showed that if the parameter b is chosen so as to satisfy the condition a · b > 1 + � n, then on the one hand f will be continuous, being the sum of a uniformly convergent series of continuous functions, while on the other hand, it will not have a derivative at any point x E R The rigorous verification of this last assertion is rather taxing, so that those who wish to obtain a simpler example of a continuous function having no derivative may see Problem 5 in Sect. 5 . 1 . 16.3.6 Problems and Exercises 1. Using power series, find a solution of the equation y " ( x ) - y ( x ) the conditions a) y(O) = 0, y(1) = 1 ; b ) y(O) = 1 , y ( 1 ) = 0 . 2. 3.
CXl
Find the sum of the series L nC n + l ) n= l n-1
=
0 satisfying
·
a) Verify that the function defined by the series Jn (x) =
( x ) 2 k+ n
(- 1)k
� k!(k + n) ! oo
2
is a solution of Bessel's equation of order n 2: 0 from Example 5. b) Verify that the hypergeometric series in Example 6 provides a solution of the hypergeometric equation. Obtain and justify the following expansions, which are suitable for computation, for the complete elliptic integrals of first and second kind with 0 < k < 1 .
4.
K (k) =
E (k)
=
rr/ 2
rr/ 2
/ }1 - dr.pk2 sin2 0
J V1 - k2 s m2 0
•
'P
1n r
= �
d 'P =
2
( (
(
) ) �) )
1) ! ! 2 k 2 n 1 + � (2n - ! ! (2n) � -1 n
(
� (2n - 1) ! ! 2 1-L 2 ( 2n) ! ! 2n - 1 =l
�
n
;
394 5.
16 Uniform Convergence and Basic Operations of Analysis
Find a) f r k e i k
k =O
b) f r k cos k
k =O
c) f r k sin k
k =O
Show that the following relations hold for l r l
d ) f r k e i k
<
1:
= - r 2rp r2 ' - .!2 . 1 -2 r1cos e) .!2 + 2::: r k cos k
l sin
·
·
·
n
n
Let S n = 2::: a k and O' n = l 2::: S k · The series is Cesaro4 summable, more k=1 k=1 = A. In that case we write 2::: ak precisely ( c, 1 )-summable to A, if nlim <7 n -+ = n =l A(c, 1 ) . a ) Verify that 1 - 1 + 1 - 1 + = � (c, 1 ) . 7.
·
·
·
b ) Show that O'n = f ( 1 - k � 1 ) a k .
k= 1 = c) Verify that if 2::: a k = A in the usual sense, then 2::: a k = A(c, 1 ) . k= 1 k= 1 d) The (c, 2)-sum of the series f a k is the quantity lim � ( <7 1 + + O'n ) if n--+ oo k=l this limit exists. In this way one can define the ( c, r )-sum of any order r. Show that = = if 2::: a k = A(c, r) , then 2::: a k = A(c, r + 1 ) . k= 1 k=1 = e) Prove that if 2::: a k = A(c, 1 ) , then the series is also Abel summable to A. k=1 4 E. Cesaro ( 1859-1906) - Italian mathematician who studied analysis and geom etry. =
·
·
·
16.3 Functional Properties of a Limit Function
395
8. a) A "theorem of Tauberian type " is the collective description for a class of theorems that make it possible, by introducing various extra hypotheses, to judge the behavior of certain quantities from the behavior of certain of their means. An example of such a theorem involving Cesaro summation of series is the following proposition, which you may attempt to prove following Hardy. 5
�
�
If a n = A( c, 1 ) and a n = 0 ( � ) , then the series a n converges in the n l n l ordinary sense and to the same sum. b) Tauber's6 original theorem relates to Abel summation of series and consists of the following. =
Suppose the series L
an x
n
converges for 0
<
x
<
=
1 and x �lim L l -O
an x
n
= A.
If J�� a , + 2 a 2 � · · + n a n = 0, then the series L a n converges to A in the ordinary n= l sense. n= l
=
n= l
9. It is useful to keep in mind that in relation to the limiting passage under the integral sign there exist theorems that give much freer sufficient conditions for the possibility of such a passage than those made possible by Theorem 3. These theo rems constitute one of the major achievements of the so-called Lebesgue integral. In the case when the function is Riemann integrable on a closed interval [a, b] , that is, f E R[a, b] , this function also belongs to the class .C[a, b] of Lebesgueb integrable functions, and the values of the Riemann integral (R) I f(x) dx of f and
a
b
the Lebesgue integral (L) I f(x) dx are the same. a
In general the space .C[a, b] is the completion of R[a, b] (more precisely, R[a, b] ) b with respect to the integral metric) , and the integral (L) I is the continuation of b
a
the linear functional (R) I from R[a, b] to .C[a, b] . a The definitive Lebesgue "dominated convergence" theorem asserts that if a sequence {fn ; n E N} of functions fn E .C [a, b] is such that there exists a non negative function F E .C[a, b] that majorizes the functions of the sequence, that is, l fn ( x ) l :S F(x) almost everywhere on [a, b] , then the convergence fn --t f at almost all points of the closed interval [a, b] implies that f E .C[a, b] and b b lim (L) f f (x) d x = (L) I f(x) dx . n n --+ oo a
a
a) Show by example that even if all the functions of the sequence {fn ; n E N} are bounded by the same constant M on the interval [a, b] , the conditions fn E R[a, b] , n E N and f --t f pointwise on [a, b] still do not imply that f E R[a, b] . (See n Example 5 of Sect. 16. 1.)
5 G.H. Hardy (1877-1947) - British mathematician who worked mainly in number theory and theory of functions. 6 A. Tauber (b. 1866, year of death unknown) - Austrian mathematician who worked mainly in number theory and theory of functions.
16 Uniform Convergence and Basic Operations of Analysis
396
b
b) From what has been said about the relation between the integrals (R) I a
b
and (L) J and Lebesgue's theorem show that, under the hypotheses of part a) , if a
b
b
=
it is known that f E 'R.[a, b] , then (R) I f (x ) dx lim (R) I f (x ) dx. This is a n --+ oo a n a significant strengthening of Theorem 3. c) In the context of the Riemann integral one can also state the following version of Lebesgue's monotone convergence theorem. If the sequence {fn ; n E N} of functions fn E 'R.[a, b] converges to zero mono tonically, that is, 0 � fn+1 � fn and fn -t 0 as n -t oo for every x E [a, b] , then b (R) I fn (x) dx -t 0. a
Prove this assertion, using where needed the following useful observation.
1
d) Let f E 'R.[a, b] , I f I � M, and I f (x ) dx 0
1
2:
a
>
0. Then the set
E
= {x E
[0, 1] f (x ) 2: a/2} contains a finite number of such intervals the sum of whose lengths (l) is at least a/(4M) . Prove this, using, for example, the intervals of a partition P of the closed interval 1 [0, 1] for which the lower Darboux sum s(f, P) satisfies the relation 0 � I f(x) dx 0 s(f, P) < a/4. 10. a) Show by the examples of Sect. 16. 1 , that it is not always possible to extract a subsequence that converges uniformly on a closed interval from a sequence of functions that converge pointwise on the interval. b) It is much more difficult to verify directly that it is impossible to extract a subsequence of the sequence of functions {fn ; n E N}, where fn (x ) = sin nx , that converges at every point of [0, 21r] . Prove that this is nevertheless the case. (Use 2 11" the result of Problem 9b) and the circumstance that I (sin n k x - sin n k +1 x ) 2 dx 0 21r =I 0 for n k < n k+1 · ) c) Let {in ; n E N} be a uniformly bounded sequence of functions fn E 'R.[a, b] . Let X
=
Fn (x )
= J fn (t) dt a
(a � X � b) .
Show that one can extract a subsequence of the sequence {Fn ; n E N} that converges uniformly on the closed interval [a, b] . 1 1 . a) Show that if f , fn E 'R.([a, b] , JR.) and fn � f on [a, b] as n -t oo, then for every E > 0 there exists an integer N E N such that
ju - fn ) (x ) dx b
for every n > N.
a
<
t:: ( b - a) .
16.4 *Subsets of the Space of Continuous Functions
397
b ) Let fn E cC ll ( [a, bJ , lR ) , n E N. Using the formula fn (x) = fn (xo) + J f� ( t ) dt, show that if f� :::; 'P on [a, b] and there exists a point xo E [a, b] for which X
XQ
the sequence {fn (xo); n E N} converges, then the sequence of functions { fn ; n E N} converges uniformly on [a, b] to some function f E c < ll ( [a, b] , lR) and f� :::; f' = 'P ·
16.4 * Compact and Dense Subsets of the Space of Continuous Functions The present section is devoted to more specialized questions, involving the space of continuous functions, which is ubiquitous in analysis. All these ques tions, like the metric of the space of continuous functions 7 itself, are closely connected with the concept of uniform convergence. 16.4. 1 The Arzela-Ascoli Theorem Definition 1. A family F of functions f : X --+ Y defined on a set X and assuming values in a metric space Y is uniformly bounded on X if the set of values V = {y E Y l �� E F �x E X (y = f ( x) ) } of the functions in the family is bounded in Y.
For numerical functions or for functions f : X --+ JR. n , this simply means that there exists a constant M E JR. such that I f ( x) I ::; M for all x E X and all functions f E B. Definition 1 ' . If the set V C Y of values of the functions of the family F is totally bounded (that is, for every c; > 0 there is a finite c-grid for V in Y), the family F is totally bounded. For spaces Y in which the concept of boundedness and total boundedness are the same (for example, for JR.,
X and Y be metric spaces. A family F of functions f : Y equicontinuous on X if for every > 0 there exists o > 0 such that dy ( f(x l ), f(x 2 )) < E for any function f in the family and any X 1 , X 2 E X such that dx(x 1 , x 2 ) < o. Example 1 . The family of functions { xn ; n E N} is not equicontinuous on [0, 1], but it is equicontinuous on any closed interval of the form [0, q] where O < q < l. Definition 2. Let X --+ is
c;
7 If you have not completely mastered the general concepts of Chap. 9, you may assume without any loss of content in the following that the functions discussed always map lR into lR or C into C, or lRm into lR n .
398
16 Uniform Convergence and Basic Operations of Analysis
Example 2. The family of functions {sin nx; n any nondegenerate closed interval [a, b] C R
E N} is not equicontinuous on
Example 3. If the family {!a : [a, b] -+ lR; E A} of differentiable functions fa is such that the family { !�; E A} of their derivatives is uniformly bounded by a constant, then l fa(x 2 ) - fa(xi) I :S: M l x 2 - x 1 l , as follows from the mean-value theorem, and hence the original family is equicontinuous on the closed interval [a, b]. o:
o:
The connection of these concepts with uniform convergence of continuous functions is shown by the following lemma.
Let K and Y be metric spaces, with K compact. A necessary condition for the sequence Un ; n E N} of continuous functions fn : K -+ Y to converge uniformly on K is that the family Un ; n E N} be totally bounded and equicontinuous. Proof. Let fn f on K. By Theorem 2 of Sect. 16.3, we conclude that f E C(K, Y). It follows from the uniform continuity of f on the compact set K that for every E > 0 there exists {J > 0 such that ( dK(x 1 , x 2 ) < {J dy ( f(xi), f(x 2 )) < E ) for all x 1 , x 2 E K. Given the same E > 0 we can find an index N E N such that dy (f ( x), fn ( x) ) < E for all n > N and all x E X. Combining these inequalities and using the triangle inequality, we find that dK(x 1 , x 2 ) < fJ implies dy ( fn (xi), fn (x 2 )) < 3E for every n > N and x 1 , x 2 E K. Hence the family Un ; n > N } is equicontinuous. Adjoining to this family the equicontinuous family {h , . . . , fN } consisting of a finite number of functions continuous on the compact set K, we obtain an equicon tinuous family { !n ; n E N}. Total boundededness of F follows from the inequality dy (! ( X ) ' fn ( X )) < N E, which holds for x E K and n > N , and the fact that f(K) and U fn (K) n= l are compact sets in Y and hence totally bounded in Y. 0 Lemma 1 .
==l
==::}
Actually the following general result is true. Theorem 1. (Arzela-Ascoli) . Let F be a family of functions f : K -+ Y de fined on a compact metric space K with values in a complete metric space Y. A necessary and sufficient condition for every sequence Un E F ; n E N} to contain a uniformly convergent subsequence is that the family F be totally bounded and equicontinuous. Proof. N e c e s s i t y. If F were not a totally bounded family, one could obvi ously construct a sequence {fn ; n E N} of functions fn E F that would not
be totally bounded and from which (see the lemma) one could not extract a uniformly convergence subsequence. If F is not equicontinuous, there exist a number Eo > 0, a sequence of functions Un E F; n E N}, and a sequence {(x�, x�); n E N} of pairs
16.4 *Subsets of the Space of Continuous Functions
399
(x�, x�) of points x� and x� that converge to a point x0 E K as n --+ oo , but d y ( fn (x�), fn (x�) ) � c:0 > 0. Then one could not extract a uniformly convergent subsequence from the sequence {fn ; n E N}: in fact , by Lemma 1, the functions of such a subsequence must form an equicontinuous family. S u ff i c i e n c y. We shall assume that the compact set K is infinite, since the assertion is trivial otherwise. We fix a countable dense subset E in K a sequence {x n E K; n E N}. Such a set E is easy to obtain by taking, for example, the union of the points of finite c:-grids in K obtained for c: = -
1, 1/2, . . . , 1/n, . . .. Let { in ; n E N} be an arbitrary sequence of functions of :F. The sequence Un (x 1 ); n E N} of values of these functions at the point x 1 is totally bounded in Y by hypothesis. Since Y is a complete space, it is possible to extract from it a convergent subsequence Unk (xi); k E N}. The functions of this sequence, as will be seen, can be conveniently denoted f� , n E N. The superscript 1 shows that this is the sequence constructed for the point x 1 . From this subsequence we extract a further subsequence {f� k ; k E N} which we denote {f�; n E N} such that the sequence {f� k (x 2 ); k E N} con verges. Continuing this process, we obtain a series { !!; n E N}, k 1, 2, . . . of sequences. If we now take the "diagonal" sequence {gn = f;:; n E N}, it will converge at every point of the dense set E K, as one can easily see. We shall show that the sequence { 9n ; n E N} converges at every point of K and that the convergence is uniform on K. To do this, we fix c: > 0 and choose 8 > 0 in accordance with Definition 2 of equicontinuity of the family :F. Let E1 { 6 , . . . , �k } be a finite subset of E forming a 8-grid on K. Since the sequences {gn (�i ); n E N}, i = 1, 2, . . . , k, all converge, there exists N such that d y (gm (�i ), gn (�i ) ) < c: for i = 1, 2, . . . , k and all m, n � N . For each point x E K there exists �j E E such that dK( X , �j ) < 8. By the equicontinuity of the family :F, it now follows that dy (gn (x) , gn (�j) ) < c: for every n E N. Using these inequalities, we now find that dy (gm (x) , gn (x) ) :S dy (gn (x) , gn ( �j)) + dy ( gm ( �j), gn ( �j ) ) + + dy (grn (x), gm ( �j)) < + + = 3c: for all m , n > N . But x was an arbitrary point of the compact set K, so that , by the Cauchy criterion the sequence {gn ; n E N} indeed converges uniformly on K. 0 =
C
=
C:
16.4.2
C:
C:
The Metric Space C(K, Y )
One of the most natural metrics on the set C(K, Y) of functions f : K --+ Y that are continuous on a compact set K and assume values in a complete metric space Y is the following metric of uniform convergence.
400
16 Uniform Convergence and Basic Operations of Analysis
d ( f, g )
=
max dy ( f ( x ) , g ( x )) ,
x EK
where f, g E C ( K , Y), and the maximum exists, since K is compact. The name metric comes from the obvious fact that d ( fn , f) --+ 0 <=? fn � f on K. Taking account of this last relation, by Theorem 2 of Sect. 16.3 and the Cauchy criterion for uniform convergence we can conclude that the metric space C ( K , Y) with the metric of uniform convergence is complete. We recall that a precompact subset of a metric space is a subset such that from every sequence of its points one can extract a Cauchy ( fundamental ) subsequence. If the original metric space is complete, such a sequence will even be convergent . The Arzellt-Ascoli theorem gives a description of the precompact subsets of the metric space C ( K , Y). The important theorem we are about to prove gives a description of a large variety of dense subsets of the space C ( K, Y). The natural interest of such subsets comes from the fact that one can approximate any continuous function f : K --+ Y uniformly with absolute error as small as desired by functions from these subsets.
Example 4.
The classical result of Weierstrass, to which we shall often return, and which is generalized by Stone ' s theorem below, is the following.
( Weierstrass ) . If f E C ( [a, b], C ) , there exists a sequence { Pn ; N} of polynomials Pn [a, b] --+ C such that Pn f on [a, b] . Here, if f C ( [a, b], IR ) , the polynomials can also be chosen from C ( [a, b], IR ) .
Theorem 2. nE E
:
�
In geometric language this means, for example, that the polynomials with real coefficients form an everywhere dense subset of C ( [a, b] , IR ) .
Example 5. Although Theorem 2 still requires a nontrivial proof ( given be low ) , one can at least conclude from the uniform continuity of any function f E C([a, b] , IR ) that the piecewise-linear continuous real-valued functions on the interval [a, b] are a dense subset of C ( [a, b] , IR ) . Remark 1 .
We note that if E 1 is everywhere dense in E2 and E2 is everywhere dense in E3 , then E1 is obviously everywhere dense in E:� . This means, for example, that to prove Theorem 2 it suffices to show that a piecewise linear function can be approximated arbitrarily closely by a polynomial on the given interval. 16.4.3 Stone's Theorem
Before proving the general theorem of Stone, we first give the following proof of Theorem 2 ( Weierstrass ' theorem ) for the case of real-valued functions, which is useful in helping to appreciate what is to follow.
16.4 *Subsets of the Space of Continuous Functions
401
Proof. We first remark that if f, g E C ( [a, b] , JR) , a E JR, and the functions f and g admit a uniform approximation (with arbitrary accuracy) by polyno mials, then the continuous functions f + g, f · g, and af also admit such an approximation. On the closed interval [ - 1, 1], as was shown in Example 2 of Sect. 16.3, the function lxl admits a uniform approximation by polynomials Pn (x) = n I:; ak x k . Hence, the corresponding sequence of polynomials M · Pn ( x /M) k= l
gives a uniform approximation to lxl on the closed interval lxl ::::; M. If f E C ( [a, b], JR ) and M = max l f(x)l, it follows from the inequality
l lyl - k�l ck yk l <
l
�
l
for IYI ::::; M that l f(x)l ck f k (x) < c for a ::::; x ::::; b. l k Hence if f admits a uniform approximation by polynomials on [a, b], then n I:; ck f k and lfl also admit such an approximation. c
k = l Finally, if f and g admit a uniform approximation by polynomials on the closed interval [a, b], then by what has been said, the functions max{!, g} = H U + g) + I f - g l) and min{f, g} = H U + g) - I f - g l ) also admit such an approximation. Let a ::::; 6 ::::; 6 ::::; b, f(x) 0, g6 6 (x) = �-=t , h (x) 1, Pf. 1 6 = max{f, g6 6 } , and F6 6 = min{ h, tf>6 6 } . Linear combinations of functions of the form F6 6 obviously generate the entire set of continuous piecewise-linear functions on the closed interval [a, b], from which, by Example 5, Weierstrass ' =
=
theorem follows. D
Before stating Stone ' s theorem, we define some new concepts.
A of real- (or complex-)valued functions on a set X is real (or complex) algebra of functions on X if (f + g) E A , (f · g ) E A , (af) E A when f, g E A and a E lR (or E C). Example 6. Let X C C. The polynomials P(z) = co + c 1 z + c2 z2 + · · · + cn z n , n E N, obviously form a complex algebra of functions on X. If we take X [a, b] C JR, and take only polynomials with real coefficients, we obtain a real algebra of functions on the closed interval [a, b]. Example 7. The linear combinations of functions enx , n = 0, 1, 2, . . . with coefficients in lR or C also form a (real or complex respectively) algebra on any closed interval [a, b] C JR. The same can be said of linear combinations of the functions { e i n x ; n E Z} . Definition 4. We shall say that a set S of functions on X separates points on X if for every pair of distinct points x 1 , x 2 E X there exists a function f E S such that f(xi ) =/= f(x2 ) · Definition 3. A set
called a
a
=
402
16 Uniform Convergence and Basic Operations of Analysis
Example 8.
The set of functions { e n x ; n E N}, and even each individual function in the set , separates points on JR.. At the same time, the 27!"-periodic functions {ei n x ; n E Z} separates points of a closed interval if its length is less than 271" and obviously does not separate the points of an interval of length greater than or equal to 271".
Example 9. The real polynomials together form a set of functions that sepa rates the points of every closed interval [a, b] , since the polynomial P(x) = x does that all by itself. What has just been said can be repeated for a set X C C and the set of complex polynomials on X. As a single separating function, one can take P(z) = z. :F of functions f X -+ C does not vanish on X (is nondegenerate) if for every point x0 E X there is a function fo E :F such that fo(xo) =f. 0. Example 1 0. The family :F = { 1, x, x 2 , , } does not vanish on the closed interval [0, 1], but all the functions of the family :F0 = { x, x 2 , . . , } vanish at x = O. Lemma 2 . If an algebra A of real {resp. complex) functions on X separates the points of X and does not vanish on X, then for any two distinct points X 1 , x 2 E X and any real {resp. complex) numbers c 1 , c2 there is a function f in A such that f(x l ) = c 1 and f(x 2 ) = c2 . Proof. It obviously suffices to prove the lemma when c 1 = 0, c2 = 1 and when c 1 = 1, c 2 = 0. By the symmetry of the hypotheses on x 1 and x 2 , we consider only the case c 1 = 1, c 2 = 0. We begin by remarking that A contains a special function s separating the points X 1 and x 2 that , in addition to the condition s(x 1 ) =f. s(x 2 ), also satisfies the condition s(x l ) =f. 0. Let g, h E A, g(x l ) =f. g(x 2 ), g(x l ) = 0, and h(x l ) =f. 0. There is obviously a number A E lR \ 0 such that A (h (x l ) - h(x 2 ) ) =f. g(x 2 ). The function s = g + Ah then has the required properties. Now, setting f(x) = 3;i7l-=���l=���) , we obtain a function f in the algebra A satisfying f(x l ) = 1 and f(x 2 ) = 0. Theorem 3 . (Stone8 ) . Let A be an algebra of continuous real-valued func tions defined on a compact set K. If A separates the points of K and does not vanish on K, then A is an everywhere-dense subspace of C(K, JR) . Proof. Let A be the closure of the set A C(K, JR) in C(K, JR), that is, A consists of the continuous functions f E C(K, JR) that can be approximated :
Definition 5 . The family
•
•
•
.
D
C
8
M.H. Stone (1903-1989) - American mathematician who worked mainly in topol ogy and functional analysis.
16.4 *Subsets of the Space of Continuous Functions
403
uniformly with arbitrary precision by functions of A. The theorem asserts that A = C(K, !R). Repeating the reasoning in the proof of Weierstrass ' theorem, we note that if J, g E A and o: E IR, then the functions J + g, f · g, o:J, IJI , max{f, g}, min{!, g} also belong t o A. B y induction we can verify that i n general if JI , f2 , . . . , fn E A, then max { JI , f2 , . . . , fn } and min{ JI , f2 , . . . , fn } also lie in A. We now show that for every function f E C(K, JR), every point x E K, and every number c > 0, there exists a function 9x E A such that 9x (x) = f(x) and 9x (t) > f(t) - c for every t E K. To verify this, for each point y E K we use Lemma 2 to choose a function hy E A such that hy (x) = f(x) and hy ( Y ) = f(y). By the continuity of f and hy on K, there exists an open neighborhood Uy of y such that hy (t) > f(t) - € for every t E Uy . From the covering of the compact set K by the open sets Uy we select a finite covering {Uy1 , Uy 2 , , Uy n }. Then the function function. 9x = max{ h h1 , hy2 , hyn } E A will be the desired Now taking such a function 9x for each point x E K, we remark that by the continuity of 9x and J, there exists an open neighborhood Vx of x E K such that 9x (t) < f(t) + c for every t E Vx . Since K is compact , there exists a finite covering {Vx1 , Vx 2 , , Vx m } by such neighborhoods. The function g = min {gx1 , , 9x m } belongs to A and by construction, satisfies both in equalities •
•
•
•
•
•
•
•
•
•
•
•
f(t) -
c
< g(t) <
f(t) +
c
at every point . But the number c > 0 was arbitrary, so that any function can be uniformly approximated on K by functions in A. 0
f E C(K, !R)
16.4.4 Problems and Exercises 1. A family :F of functions f : X -t Y defined on the metric space X and assuming values in the metric space Y is equicontinuous at Xo E X if for every o: > 0 there exists 8 > 0 such that dx (x, xo) < 8 implies dy ( !(x) , J (xo) ) < o: for every f E :F. a) Show that if a family :F of functions f : X -T Y is equicontinuous at Xo E X, then every function f E :F is continuous at xo , although the converse is not true. b ) Prove that if the family :F of functions f : K -t Y is equicontinuous at each point of the compact set K, then it is equicontinuous on K in the sense of Definition 2. c ) Show that if a metric space X is not compact, then equicontinuity of a family :F of functions f : X -t Y at each point x E X does not imply equicontinuity of :F on X. For this reason, if the family :F is equicontinuous on a set X in the sense of Definition 2, we often call it uniformly equicontinuous on the set. Thus, the relation between equicontinuity at a point and uniform equicontinuity of a family of
404
16 Uniform Convergence and Basic Operations of Analysis
functions on a set X is the same as that between continuity and uniform continuity of an individual function f : X --+ Y on the set X. d) Let w(f ; E) be the oscillation of the function f : X --+ Y on the set E C X, and B(x, 6) the ball of radius 6 with center at x E X. What concepts are defined by the following formulas?
Vc
>
0 :3 6
>
0 Vf E :F w ( t ; B(x, 6) ) < c ,
Vc > 0 :3 6 > 0 Vf E :F Vx E X w ( f ; B(x, 6) ) < c .
e) Show by example that the Arzela-Ascoli theorem is in general not true if K is not compact: construct a uniformly bounded and equicontinuous sequence { fn ; n E N} of functions fn (x) = rp(x + n) from which it is not possible to extract a subsequence that converges uniformly on JR. f) Using the Arzela-Ascoli theorem, solve Problem 10c) from Sect. 16.3. 2 . a) Explain in detail why every continuous piecewise-linear function on a closed interval [a , b] can be represented as a linear combination of functions of the form F�1 � 2 shown in the proof of Weierstrass' theorem. b) Prove Weierstrass' theorem for complex-valued functions f : [a , b] --+ IC.
c) The quantity Mn
J f (x)x n dx is often called the nth moment of the funcb
=
a
tion f : [a, b] --+
16.4 *Subsets of the Space of Continuous Functions i) For which location of the closed interval by rp(x) = x dense in c([a, bJ , IR ) ?
[a, b]
C
405
lR is the algebra generated
a) A complex algebra of functions A is self-adjoint if it follows from f E A that
7 E A, where f(x) is the value conjugate to f(x) . Show that if a complex algebra 4.
A is nondegenerate on X and separates the points of X, then, given that A is self adjoint, one can assert that the subalgebra AR of real-valued functions in A is also nondegenerate on X and also separates points on X . b ) Prove the following complex version o f Stone's theorem. If a complex algebra A of functions f : X --+ C is nondegenerate on X and separates the points of X, then, given that it is self-adjoint, one can assert that it is dense in C(X, C) . c) Let X = {z E tej Jzj = 1 } be the unit circle and A the algebra on X generated by the function e;"' , where rp is the polar angle of the point z E C. This algebra is nondegenerate on X and separates the points of X , but is not self-adjoint.
Prove that the equalities r f ( e;"' ) ei n
1 7 Integrals Dep ending on a Parameter
In this chapter the general theorems on families of functions depending on a parameter will be applied to the type of family most frequently encountered in analysis - integrals depending on a parameter.
1 7 . 1 Proper Integrals Depending on a Parameter
17 1 1 .
An
.
The Concept of an Integral Depending on a Parameter
integral depending on a parameter is a function of the form F(t)
=
j
f(x, t) dx ,
(17.1)
E,
where t plays the role of a parameter ranging over a set T,and to each value t E T there corresponds a set Et and a function 'Pt ( x) = f ( x, t) that is integrable over Et in the proper or improper sense. The nature of the set T may be quite varied, but of course the most important cases occur when T is a subset of � ' C, � n , or e n . If the integral (17.1) is a proper integral for each value of the parameter t E T, we say that the function F in (17.1) is a proper integral depending on
a parameter.
But if the integral in (17.1) exists only as an improper integral for some or all of the values of t E T, we usually call F an improper integral depending
on a parameter. But these are of course merely terminological conventions. When x E � m , Et � m , and m > 1, we say that we are dealing with a multiple (double, triple, and so forth) integral (17.1) depending on a param eter. C
We shall concentrate, however, on the one-dimensional case, which forms the foundation for all generalizations. Moreover, for the sake of simplicity, we shall first take Et to be intervals of the real line � independent of the parameter, and we shall assume that the integral (17.1) over these intervals exists as a proper integral.
408
17 Integrals Depending on a Parameter
1 7. 1 . 2 Continuity of an Integral Depending on a Parameter 1\
Let P = {(x, y) E JR.2 1 a ::::; x ::::; b c ::::; y ::::; d} be a rectangle in the plane IR.2 • If the function f : P --+ JR. is continuous, that is, if f E C ( P, JR.), then the function
Proposition 1.
b
F(y) = J f(x, y) dx
(17.2)
a
is continuous at every point y E [c, d] . Proof. It follows from the uniform continuity of the function f on the compact set P that 'P y (x) := f(x, y) f(x, yo) =: 'Py0 (x) on [a, b] as y --+ Yo , for y, y0 E [c, d]. For each y E [c, d] the function 'Py (x) = f(x, y) is continuous with respect to x on the closed interval [a, b] and hence integrable over that ::::t
interval. By the theorem on passage to the limit under an integral sign we can now assert that
F(yo)
=
b
b
J f(x, Yo) dx = y-tyo lim J f(x, y) dx = lim F(y) . 0 y-tyo a
a
Remark 1 . As can be seen from this proof, Proposition 1 on the continuity of the function (17.2) remains valid if we take any compact set K as the set of values of the parameter y, assuming, of course, that f E C(I x K, JR.), where I = {x E JR.I a ::::; x ::::; b}.
Hence, in particular, one can conclude that if f E C(I x D, JR.), where D is an open set in IR.n , then F E C ( D, JR.), since every point y0 E D has a compact neighborhood K C D, and the restriction of f to I x K is a continuous function on the compact set I x K. We have stated Proposition 1 for real-valued functions, but of course it and its proof remain valid for vector-valued functions, for example, for functions assuming values in c, IR. m , or e m . Example 1 . In the proof of Morse ' s lemma ( see Section 8.6, Part 1) we men tioned the following proposition, called H a d a m a r d ' s l e m m a.
If a function f belongs to the class c ( l > (u, JR.) in a neighborhood U of the point x0, then in some neighborhood of x0 it can be represented in the form (17.3) f(x) = f(xo) + cp(x)(x - xo) , where cp is a continuous function and cp(xo) f'(xo). Equality (17.3) follows easily from the Newton-Leibniz formula =
1
f(xo + h) - f(xo) = J f'(xo + t h ) dt 0
·
h
(17.4)
409
17. 1 Proper Integrals Depending on a Parameter
1 1 applied to the function F( h) J0 f'(x0 + t h) dt. All that remains is to make the substitution h x - x0 and set cp(x) F(x - x0). It is useful to remark that Eq. (17.4) holds for x0, h E m; n , where n is not restricted to the value 1. Writing out the symbol f ' in more detail, and for simplicity setting x0 0, one can write, instead of (17.4) =
and Proposition
=
=
=
and then one should set
n
cp(x)x L 'Pi(x) x i i= 1 1 in Eq. (17.3), where 'P i (x) J i1r (tx) dt. 0 =
=
17. 1 .3 Differentiation of an Integral Depending on a Parameter
Proposition 2. If the function f P IR is continuous and has a contin uous partial derivative with respect to y on the rectangle P {(x, y) E IR 2 1 a ::; X ::; b /\ c ::; y ::; d} then the integral ( 17.2 ) belongs to c ( l ) ( [ c, d], IR ) and ' ' --t
:
F'(y)
=
=
b
J �� (x, y)
(
dx .
a
17.5 )
Formula ( 17.5 ) for differentiating the proper integral ( 17.2 ) with respect to a parameter is frequently calledLeibniz ' formula or Leibniz ' rule. Proof. We shall verify directly that if y0 E [c, d], then F' (yo) can be computed by formula (17.5):
I F(yo + h) - F(yo) - ( J �� (x, Yo) dx) h i b
I j (f(x, Yo + h) - f(x, Yo) - �� (x, Yo)h) x l ::; ::; j l f (x, Yo + h) - f(x, Yo) - �� (x, Yo)h l dx ::; J 0<8< 1 l -;::;-afuy (x, Yo + ()h) - -;::;-ofuy (x, Yo) I dx J h l = cp(yo, h) · I h i . =
b
::;
a
=
b
a
d
b
a
sup
a
17 Integrals Depending on a Parameter
410
By hypothesis interval a ::; x ::;
h --+ 0.
0
%£ E C(P, JR), so that %£ (x, y) :::; %£ (x, y0) on the closed b as y --+ yo, from which it follows that tp(Yo, h) --+ 0 as
f
Remark 2. The continuity of the original function is used in the proof only as a sufficient condition for the existence of all the integrals that appear in the proof. Remark 3. The proof just given and the form of the mean-value theorm used in it show that Proposition 2 remains valid if the closed interval [c, d] is replaced by any convex compact set in any normed vector space. Here one may obviously assume as well that takes values in some complete normed vector space. In particular - and this is sometimes very useful - formula (17.5) is also applicable to complex-valued functions F of a complex variable and to func tions F(y) = F(y l , . . . , y n ) of a vector parameter y = (y 1 , . . . , yn ) E C'. In this case %£ can of course be written coordinatewise as ( *fr , . . . , 1:/n), and then (17.5) yields the corresponding partial derivatives:
f
b
of
8F = I �(x, y 1 , . . . , yn ) dx �(y) uy' uy' of the function
a
F.
7r
u( x) = J0 cos ( ntp - x sin 'P) dtp satisfies Bessel ' s equation x 2 u" + xu' + (x 2 - n 2 )u = 0. Indeed, after carrying out the differentiation with formula (17.5) and mak Example 2.
Let us verify that the function
ing simple transformations we find 7r
7r
-x 2 I in2 tp cos( ntp - x sin tp) dtp + x I sin 'P sin( n tp - x sin 'P) dtp + 0 0 + (x 2 - n2 ) I cos(ntp - x sin tp) dtp = 0 = - I ( ( x2 sin2 tp + n2 - x2 ) cos ( ntp - x sin 'P) 0 - x sin tp sin( n tp - x sin 'P)) dtp = = -( n + x cos 'P) sin ( ntp - x sin tp) I � = 0 . s
7r
7r
411
17. 1 Proper Integrals Depending on a Parameter
Example 3 .
The complete elliptic integrals
7r /2
E(k) =
IJ ()
1 - k 2 sin 2
( 1 7.6)
as functions of the parameter k, 0 < k < 1, called the modulus of the corre sponding elliptic integral, are connected by the relations dE dk
E-K k
dK dk
E K 2 k(1 - k ) k .
Let us verify, for example, the first of these. By formula ( 1 7.5) dE =dk =
7r / 2
I() 1
k
Example 4.
integral. Let
k sin 2
·
(1 - k2 sin2
K E -17r /2( 1 - k2 sm. 2
()
k
()
Formulas ( 1 7.5) sometimes make it possible even to compute the
7r / 2
F(o:) =
I()
ln(o: 2 - sin 2
(o: >
1) .
According to formula ( 17.5)
7r / 2
F ' (o:) =
I()
2o: d
from which we find F(o:) = 1r ln ( o: + .Jo: 2 - 1 ) + c. The constant c is also easy to find, if we note that, on the one hand F(o:) = 7r ln o: + 7r ln 2 + c + o( 1 ) as o: ---+ +oo, and on the other hand, from the definition of F( o:) , taking account of the equality ln( o: 2 -sin 2
2'. Suppose the function f P ---+ JR. is continuous and has a continuous partial derivative U on the rectangle P = { ( y) E JR. 2 1 a � � b c � y � d}; further suppose o:( y ) and {J(y) are continuously differentiable
Proposition 1\
:
x,
x
412
17 Integrals Depending on a Parameter
functions on [c, d] whose values lie in [a, b] for every y E [c, d] . Then the integral {3( y ) F(y) J f(x, y) dx (17.7) =
n (y)
is defined for every y E [c, d] and belongs to CC l l ( [c, d], R) , and the following formula holds: =
·
F'(y) f (f3 (y), y ) f3'(y) - f (a(y), y ) a'(y) + ·
{3( y )
J �� (x, y) dx .
(17.8)
n( y )
Proof. In accordance with the rule for differentiating an integral with respect to the limits of integration, taking account of formula (17.5), we can say that if a, j3 E [a, b] and y E [c, d], then the function cJ>(a, j3, y)
{3
J f(x, y) dx
=
has the following partial derivatives:
acJ> f(/3, y) ' 8(3
{3
8f (x, y) dx . y) J -f(a, ' aa ay ay Taking account of Proposition 1, we conclude that all the partial deriva tives of cJ> are continuous in its domain of definition. Hence cJ> is continuously differentiable. Formula (17.8) now follows from the chain rule for differentia tion of the composite function F(y) cJ> ( a(y), j3(y), y ) . 0 Example 5. Let Fn (x) ( n � 1)! J (x - t) n - l f(t) dt , 0 where n E N and f is a function that is continuous on the interval of inte gration. Let us verify that F�n ) (x) f(x). For n 1 we have F1 (x) J f(t) dt and F{(x) f(x). 0 By formula (17.8) we find for n > 1 that F� (x) = ( n � l) ! (x - x) n - l f(x) + ( n � 2 ) ! (x - t) n - 2 f(t) dt Fn - l (x) . 0 We now conclude by induction that indeed F�n ) (x) f(x) for every n E N. =
8cf>
=
8cf>
=
=
X
=
=
=
X
=
=
j X
=
=
413
17. 1 Proper Integrals Depending on a Parameter 17.1.4 Integration o f a n Integral Depending o n a Parameter -t
Proposition 3. If the function f P � is continuous in the rectangle P {( x, y) E �2 l a :::; x :::; b l\ c :::; y :::; d}, then the integral (17.2) is integrable over the closed interval [c, d] and the following equality holds: :
=
J ( ! f(x, y) dx ) dy J ( ! f(x , y) dy) dx . d
b
a
c
Proof.
=
b
d
(17.9)
c
a
From the point of view of multiple integrals, Eq. (17.9) is an elemen tary version of Fubini ' s theorem. However, we shall give a proof of (17.9) that justifies it independently of Fubini ' s theorem. Consider the functions
cp(u)
=
u
J ( J f(x, y) dx) dy , c
b
a
b
�(u) = J a
( ! f( u
c
)
x , y) dy dx .
Since f E C(P, �) , by Proposition 1 and the continuous dependence of the integral on the upper limit of integration, we conclude that cp and � belong to C([c, d], � ) . Then, by the continuity of the function (17.2), we find that =
b
=
b
cp' (u) J f(x, u) dx, and finally by formula (17.5) that � ' (u) J f(x, u) dx for u E [c, d]. Thus cp' (u) � ' (u), and hence cp(u) �(u) + C on [c, d]. But since cp(c) = �(c) = 0, we have cp(u) �(u) on [c, d], from which relation (17.9) follows for u = d. 0 a
=
=
=
a
17. 1 . 5 Problems and Exercises
a ) Explain why the function F(y) in (17.2) has the limit J 'P( x ) dx if the family a of functions 'f'y (x) = f(x, y) depending on the parameter y E Y and integrable over the closed interval a :::; x :::; b converges uniformly on that closed interval to a function 'P( x) over some base B in Y ( for example, the base y --t yo) . b ) Prove that i f E is a measurable set in JR m and the function f : E x r --t lR defined on the direct product E x I n = { (x, t) E JR m +n l x E E 1\ t E r } of the set E and the n-dimensional interval I n is continuous, then the function F defined by (17.1) for Et = E is continous on r . c ) Let P = { (x, y) E JR 2 a :::; x :::; b 1\ c :::; y :::; d}, and let f E C(P, JR) , o:, f3 E c([c, d], [a , bJ ) . Prove that i n that case the function ( 17.7) i s continuous on the closed interval [c, d]. b
1.
1
414
1 7 Integrals Depending on a Parameter
C(.IR, .IR) , then the function F(x) = 21a f f (x + t) dt is not -a only continuous, but also differentiable on JR. b) Find the derivative of this function F(x) and verify that F E C ( ll (JR, .IR) . 2. a) Prove that if f E
3. Using differentiation with respect to the parameter, show that for lrl 7r
F(r) =
<
1
I ln ( 1 - 2r cos x + r2 ) dx = 0 . 0
4. Verify that the following functions satisfy Bessel ' s equation of Example
2:
"
a) u = x n J cos(x cos
+1 b) Jn (x) = ( 2n "' � ) !!w J ( 1 - t 2 ) n - 1 / 2 l cos xt dt . -1 c) Show that the functions Jn corresponding to different values of n E N are connected by the relation Jn+1 = Jn - 1 - 2J� . 5 . Developing Example 3 and setting k show, following Legendre, that a) .fk (EK + EK - K K ) = 0. b) EK + EK - KK = rr/2.
:=
v'f"=k"2 , E(k) := E ( k ) , K(k) : = K(k) ,
6. Instead of the integral (17.2) , consider the integral b
:F(y) =
I f(x, y)g(x) dx , a
where g is a function that is integrable over the closed interval [a, b] (g E R[a, b] ) . By repeating the proofs of Propositions 1-3 above verify succesively that a) if the function f satisfies the hypotheses of Proposition 1 , then :F is contin uous on [c, d] (:F E C[c, d] ) ; b ) if f satisfies the hypotheses o f Proposition 2, then :F is continuously differ entiable on [c, d] (:F E C ( 1 ) [c, d] ) , and b
:F' (y) =
I �� (x, y) g(x) dx ; a
c) if f satisfies the hypotheses of Proposition 3, then :F is integrable over [c, d] (:F E R[c, d] ) and
1 c
:F(y) dy =
j (1 a
c
)
f (x , y)g(x) dy dx .
415
1 7.2 Improper Integrals Depending on a Parameter
7. Taylor's formula and Hadamard 's lemma.
a) Show that if f is a smooth function and f (O) = 0, then f(x) = x�(x ) , where � is a continuous function and �(0) = f ' (O) . b) Show that if f E c < nl and / (k ) (O) = 0 for k = 0, 1 , . . . , n - 1 , then f(x) = n x �(x) , where � is a continuous function and �(0) = � / ( n ) (0) . c) Let J be a c < nl function defined in a neighborhood of 0. Verify that the following version of Taylor ' s formula with the Hadamard form of the remainder holds: f(x)
= f (O) + � f' (O)x + · · · +
(
n
� 1 ) / n - 1 ) (O)x n - 1 + x n �(x) ,
where � is a funciton that is continuous on a neighborhood of zero, and �(0) = (n) � n . J (O) . d) Generalize the results of a) , b) , and c) to the case when f is a function of several variables. Write the basic Taylor formula in multi-index notation: n- 1 f (x) = L f(O)x " + L x " � a (x) , a. l <> l = n 1 <> 1 = 0 and note in addition to what was stated in a) , b) , and c) , that if f E c < n+ p ) , then �a E C ( P l .
�D"
1 7 . 2 Improper Integrals Depending on a Parameter
17.2.1 Uniform Convergence of an Improper Integral with Respect to a Parameter a. Basic Definition and Examples Suppose that the improper integral
F(y)
=
w
J f(x, y) dx a
(17.10)
over the interval [a, w] c IR converges for each value y E Y. For definiteness we shall assume that the integral (17.10) has only one singularity and that it involves the upper limit of integration (that is, either w = +oo or the function f is unbounded as a function of x in a neighborhood of w). Definition. We say that the improper integral (17.10) depending o n the parameter y E Y converges uniformly on the set E C Y if for every s > 0 there exists a neighborhood U[ a, w [(w) of w in the set [a, w[ such that the estimate
I ] f ( x, y) dx I < s b
(17.11)
416
17 Integrals Depending on a Parameter
for the remainder of the integral every y E E.
(17.10) holds for every b E U[a,w[ (w) and
If we introduce the notation
H (y)
:=
b
Ia f(x, y) x
(17.12)
d
for a proper integral approximating the improper integral ( 17.10), the basic definition of this section can be restated ( and, as will be seen in what follows, very usefully ) in a different form equivalent to the previous one: uniform convergence of the integral (17.10) on the set E C Y by definition means that (17.13) Fb ( Y ) � F(y) on E as b --+ w , b E [a, w[ . Indeed,
F(y)
=
w
I J(x, y) dx a
:=
b
lim
I f(x, y) dx
bEb--+w [a,w[ a
=
lim
b--+w bE[a,w[
Fb ( Y ) ,
(17.11) can be rewritten as (17.14) I F(y) - Fb ( Y ) I < E: . This last inequality holds for every b E U[a,b[ (w) and every y E E, as shown in (17.13). Thus, relations (17.11), (17.13), and (17.14) mean that if the integral (17.10) converges uniformly on a set E of parameter value::;, then this im proper integral (17.19) can be replaced by a certain proper integral (17.12) depending on the same parameter y with any preassigned precision, simulta neously for all y E E. Example 1 . The integral l+oo 2 dx 2 x y 1 converges uniformly on the entire set � of values of the parameter y E � ' since for every y E � and therefore relation
+
provided b >
1/E:.
Example 2 .
17.2 Improper Integrals Depending on a Parameter
The integral
417
+ oo
J e- xy dx ,
obviously converges only when every set { y IRI y 2': Yo > 0}. Indeed, if y 2': y0 > 0, then
E
0 y > 0.
Moreover it converges uniformly on
+ oo e - bYo --+ 0 0 J e - xy dx = �Y e - by ::=:; � Yo ::=:;
as
b --+ +oo.
At the same time, the convergence is not uniform on the entire set JR + = E on a set E means that
b
{ y IRI y > 0}. Indeed, negating uniform convergence of the integral (17.10)
3co > 0 VB E [a, w[ 3b E [B, w[ 3y E E
( I / f x, y x l ) (
) d > Ea
In the present case Eo can be taken as any real number, since
+ oo
J e- xy dx = � e - by --+ +oo , as y --+ +0 , b
bE
for every fixed value of [0, +oo [ . Let us consider a less trivial example, which we shall be using below.
Example 3.
Let us show that each of the integrals
+ oo
cJ> ( x ) =
J x"' y"'+f3 + 1 e -( l + x ) y dy ,
F (y) =
J x"' y"'+f3 + 1 e -( l +x ) y dx ,
0 + oo 0
in which a and (3 are fixed positive numbers, converges uniformly on the set of nonnegative values of the parameter. For the remainder of the integral cJ> ( x ) we find immediately that
0 ::::;
+ oo a J x"' y +f3+ 1 e -( l +x ) y dy = b + oo J ( xy ) "' e - xy yf3 + l e - y dy < Ma b
17 Integrals Depending on a Parameter
418
where Ma = max uae - u . Since this last integral converges, it can be made O:
0 :S
+oo
J xa ya+f3+l e -(l +x) y dx = b
= yf3 e - y Since
+oo a J ( xy) e- XY y dx = yf3 e-y J Ua e- u du . b � +oo
+oo
J ua e - u du < +oo , 0
for y 2 0 and yf3 e - Y --+ 0 as y --+ 0, for each c > 0 there obviously exists a number y0 > 0 such that for every y E [0, y0 ] the remainder of the integral will be less than c even independently of the value of b E [0 , +oo[. And if y 2 y0 > 0, taking account of the relations M13 = max yf3 e - Y <
O:'O y <+oo +oo +oo a +oo and 0 :S I u e - u du :S I u"'e - u du --+ 0 as b --+ +oo, we conclude byo by that for all sufficiently large values of b E [0, +oo [ and simultaneously for all y 2 y0 > 0 the remainder of the integral F (y) can be made less than c.
Combining the intervals [0 , y0 ] and [y0 , +oo[, we conclude that indeed for every c > 0 one can choose a number such that for every b > and every y 2 0 the corresponding remainder of the integral F(y) will be less than c .
B
B
b . The Cauchy Criterion for Uniform Convergence of an Integral
A necessary and sufficient condition for the improper integral (17.10) depending on the parameter y E Y to converge uniformly on a set E Y is that for every 0 there exist a neighborhood U[a, w [(w) of the point w such that
Proposition 1 . (Cauchy criterion) . c
c
b2
J f(x, y) dx
b,
>
(17.15)
for every b 1 , b2 E U[a, w [(w) and every y E E . Proof. Inequality ( 17.15) i s equivalent t o the relation 1Fb2 (y) - H2 (y) I < so that Proposition 1 i s an immediate corollary o f the form (17.13) for the definition of uniform convergence of the integral (17.10) and the Cauchy cri terion for uniform convergence on E of a family of functions Fb ( Y ) depending on the parameter b E [a, w [ . D c,
17.2 Improper Integrals Depending on a Parameter
419
As an illustration of the use of this Cauchy criterion, we consider the following corollary of it , which is sometimes useful.
If the function f in the integral (17.10) is continuous on the set [a, w[ x [c, d] and the integral (17.10) converges for every y E]c, d[ but diverges for y = c or y = d, then it converges nonuniformly on the interval ]c, d[ and also on any set E c]c, d[ whose closure contains the point of divergence. Proof. If the integral (17.10) diverges at y = c, then by the Cauchy criterion for convergence of an improper integral there exists > 0 such that in every neighborhood U[a , w [(w) there exist numbers b 1 , b2 for which Corollary 1.
Eo
j f(x, c) x > b2
d
bt
The proper integral
Eo
.
(17.16)
b2
J f(x, y) dx
bt
is in this case a continuous function of the parameter y on the entire closed interval [c, d] (see Proposition 1 of Sect . 17.1), so that for all values of y sufficiently close to c, the inequality
j f(x, y) dx > b2
bt
E
will hold along with the inequality (17.16). On the basis of the Cauchy criterion for uniform convergence of an im proper integral depending on a parameter, we now conclude that this integral cannot converge uniformly on any subset E c]c, d[ whose closure contains the point c. The case when the integral diverges for y = d is handled similarly. D
Example 4.
The integral
+ oo
J e-tx2 dx 0
converges for t > 0 and diverges at t = 0, hence it demonstrably converges nonuniformly on every set of positive numbers having 0 as a limit point . In particular, it converges nonuniformly on the whole set { t E lR I t > 0} of positive numbers.
420
17 Integrals Depending on a Parameter
In this case, one can easily verify these statements directly:
+ co
Jb set
2
1
e - t x dx = Vt
+ co
J 2
e - u du -+
+oo
as
t -+ +0 .
hli
We emphasize that this integral nevertheless converges uniformly on any { t E JR I t :2': t0 > 0} that is bounded away from 0, since
c. Sufficient Conditions for Uniform Convergence of an Improper Integral Depending on a Parameter
Proposition 2. ( The Weierstrass test ) . Suppose the functions f(x, y) and g(x, y) are integrable with respect to x on every closed interval [a, b] [a, w[ for each value of y E Y. If the inequality I f ( x, y) I :::; g( x, y) holds for each value of y E Y and every x E [a, w[ and the integral g(x, y) dx C
w
J a
converges uniformly on Y, then the integral w
j f(x, y) dx a
converges absolutely for each y E Y and uniformly on Y. Proof. This follows from the estimates
b2 b2 J f(x, y) dx :::; j l f(x, y) l dx :::; j g(x, y) dx b2
b,
and Cauchy ' s criterion for uniform convergence of an integral ( Proposition
D
1).
The most frequently encountered case of Proposition 2 occurs when the function g is independent of the parameter y. It is this case in which Propo sition 2) is usually called the Weierstrass M-test for uniform convergence of
an integral.
Example 5.
17.2 Improper Integrals Depending on a Parameter
The integral
00
J0
421
cos a x dx 1 + x2
converges uniformly on the the whole set < 1 and t he mtegra cos · · smce l
� of values of the parameter a,
J0oo l +dxx2 converges. I l +xax2 I l +x2, Example In view of the inequality I sin x e - tx 2 1 e - tx 2 , the integral 00 J sinxe-tx2 dx , _
�
6.
0
as follows from Proposition 2 and the results of Example 3, converges uni formly on every set of the form { t E �� t � t0 > 0}. Since the integral diverges for t = 0, on the basis of the Cauchy criterion we conclude that it cannot converge uniformly on any set having zero as a limit point.
Proposition 3. ( Abel-Dirichlet test ) . Assume that the functions f(x, y) and g(x, y) are integrable with respect to x at each y E Y on every closed interval [a, b] [a, w[. A sufficient condition for uniform convergence of the integral C
j (! g)(x, y) dx w
·
a
on the set Y is that one of the following two pairs of conditions holds: a 1 ) either there exists a constant M E � such that b
J f(x, y) dx a
<
M
for any b E [a, w[ and any y E Y and (31 ) for each y E Y the function g(x, y) is monotonic with respect to x on the interval [a, w[ and g(x, y) 0 on Y as x w, x E [a, w[, or a2 ) the integral f(x, y) dx =l
--t
j w
a
converges uniformly on the set Y and
422
17 Integrals Depending on a Parameter
/h ) for each y E Y the function g(x, y) is monotonic with respect to x on the interval [a, w[ and there exists a constant M E lR such that l g(x, Y ) l < M for every x E [a, w[ and every y E Y. Proof. Applying the second mean-value theorem for the integral, we write �
J (f g)(x, y) ·
b,
�
e
dx
J
J
= g(b 1 , y) f(x, y) dx + g(b2 , y) f(x, y) dx , b,
e
where � E [b 1 , b 2 ]. If b 1 and b2 are taken in a sufficiently small neighborhood U[a , w[ (w) of the point w, then the right-hand side of this equality can be made smaller in absolute value than any prescribed c > 0, and indeed simultane ously for all values of y E Y. In the case of the first pair of conditions a i ), /31 ) this is obvious. In the case of the second pair a 2 ), /32 ), it becomes ob vious if we use the Cauchy criterion for uniform convergence of the integral (Proposition 1 ) . Thus, again invoking the Cauchy criterion, we conclude that the original integral of the product f · g over the interval [a, w[ does indeed converge uniformly on the set Y of parameter values. D
Example
7.
The integral
+=
J1
sin x dx ' xa
as follows from the Cauchy criterion and the Abel-Dirichlet test for conver gence of improper integrals, converges only for a > 0. Setting f ( x, a) = sin x, g(x, a) = x - a , we see that the pair a 1 ), /31 ) of hypotheses of Proposition 3 holds for a ?: a0 > 0. Consequently, on every set of the form {a E JR. I a ?: a0 > 0} this integral converges uniformly. On the set {a E lR I a > 0} of positive values of the parameter the integral converges nonuniformly, since it diverges at a = 0.
Example 8 .
The integral
converges uniformly on the set
+=
-x y dx J0 --e X sin x
·
JRI
{y E y ?: 0} .
Proof. First of all, on the basis of the Cauchy criterion for convergence of the improper integral one can easily conclude that for y < 0 this integral diverges. Now assuming y ?: 0 and setting f(x, y) = s i � x , g(x, y) = e -x y , we see that the second pair a 2 ), /32 ) of hypotheses of Proposition 3 holds, from which it follows that this integral converges uniformly on the set {y E JR. y ?: 0}. D
I
17.2 Improper Integrals Depending on a Parameter
423
Thus we have introduced the concept of uniform convergence of an im proper integral depending on a parameter and indicated several of the most important tests for such convergence completely analogous to the correspond ing tests for uniform convergence of series of functions. Before passing on, we make two remarks. Remark 1. So as not to distract the reader ' s attention from the basic con cept of uniform convergence of an integral introduced here, we have assumed throughout that the discussion involves integrating real-valued functions. At the same time, as one can now easily check, these results extend to integrals of vector-valued functions, in particular to integrals of complex-valued func tions. Here one need only note that , as always, in the Cauchy criterion one must assume in addition that the corresponding vector space of values of the integrand is complete (this is the case for � ' c, � n , and e n ) ; and in the Abel-Dirichlet test , as in the corresponding test for uniform convergence of series of functions, the factor in the product f · g that is assumed to be a monotonic function, must of course be real-valued. Everything that has just been said applies equally to the main results of the following subsections in this section. Remark 2. We have considered an improper integral whose only sin gularity was at the upper limit of integration. The uniform convergence of an integral whose only singularity is at the lower limit of integration can be defined and studied similarly. If the integral has a singularity at both limits of integration, it can be represented as
(17.10)
C
W2
W2
j f (x, y) dx = j f (x, y) dx + j f (x, y) dx , c
where c E]w 1 , w2 [, and regarded as uniformly convergent on a set E C Y if both of the integrals on the right-hand side of the equality converge uniformly. It is easy to verify that this definition is unambiguous, that is, independent of the choice of the point c E]w l , w2 [. 17.2.2 Limiting Passage under the Sign of an Improper Integral and Continuity of an Improper Integral Depending on a Parameter
Let f(x, y) be a family of functions depending on a param eter y E Y that are integrable, possibly in the improper sense, on the interval a :::; < w, and let By be a base in Y. If a) for every b E [a, w[ f(x, y) ip(x) on [a, b] over the base By and Proposition 4. x
�
424
17 Integrals Depending on a Parameter
b) the integral I f( x , y) dx converges uniformly on Y, a then the limit function cp is improperly integrable on [a, w[ and the follow ing equality holds: w w lA� I f( x , y) dx I cp ( x ) dx . (17.17) w
=
a
Proof.
a
The proof reduces to checking the following diagram:
Fb ( Y )
:=
I f(x, y) dx b
a
1 ( ) By
---
I cp x dx b
a
I f(x, y) dx = : F(y) w
b""* bE[t:
__-
_j_ __- a
1( )
By
I cp x dx . w
� [a,w [
a
bE
The left vertical limiting passage follows from hypothesis a) and the the orem on passage to the limit under a proper integral sign ( see Theorem 3 of Sect. 16.3). The upper horizontal limiting passage is an expression of hypothesis b) . By the theorem on the commutativity of two limiting passages it follows from this that both limits below the diagonal exist and are equal. The right-hand vertical limit passage is what stands on the left-hand side of Eq. (17.17), and the lower horizontal limit gives by definition the improper integral on the right-hand side of (17.17).
D
The following example shows that condition a ) alone is generally insuffi cient to guarantee Eq. (17.17) in this case.
Example 9.
Let Y
=
{y E JR I y > 0} and 1/y , if 0 $ f(x, y) 0 , if y
0
=
{
Obviously, f(x, y) :4 on the interval time, for every y E Y ,
+ oo
Y
0
$
x
<
X
$
y,
< X .
+oo as y -+ +oo. At the same Y
I f(x, y) dx I f(x, y) dx I t dx 0
=
0
=
0
=
1,
( 17 .17) does not hold in this case. Using Dini ' s theorem ( Proposition 2 of Sect . 16.3), we can obtain the following sometimes useful corollary of Proposition 4. and therefore Eq.
17.2 Improper Integrals Depending on a Parameter
425
Corollary 2 . Suppose that the real-valued function f(x, y) is nonnegative at each value of the real parameter y E Y JR. and continuous on the interval a ::; x w. If a) the function f(x, y) is monotonically increasing as y increases and tends to a function �.p(x) on [a, w[, b) E C ( [a, w[, JR. ) , and c) the integral J �.p( x) dx converges, then Eq. ( 17.17) holds. Proof. It follows from Dini ' s theorem that f(x, y) ::::; �.p(x) on each closed interval [a, b] [a, w[. It follows from the inequalities 0 ::; f(x, y) ::; �.p(x) and the Weierstrass M-test for uniform convergence that the integral of f(x, y) over the interval a ::; x w converges uniformly with respect to the parameter y. Thus, both hypotheses of Proposition 4 hold, and so Eq. (17.17) holds. C
<
'P
w
a
C
<
0
Example 1 0. In Example 31 ofn Sect . 16.3 we verified that the sequence of fn (x) = n(1 - x 1 ) is monotonically increasing on the interval 0 Hence, x ::; 1 , and fn (x) / ln � as n --+ +oo. by Corollary 2 1 1 1 n lim n ( 1 - x 1 ) d x = ln � dx . n-+oo 0 0 functions <
j
j
X
5. If a) the function f(x, y) is continuous on the set {(x, y) E JR. 2 1 a ::; x w c ::; y ::; d}, and b) the integral F(y) = J f(x, y) dx converges uniformly on [c, d] , then the function F(y) is continuous on [c, d] . Proof. It follows from hypothesis a) that for any b E [a, w[ the proper integral b Fb ( Y ) f(x, y) dx
Proposition 1\
<
w
a
=
j a
is a continuous function on [c, d] (see Proposition 1 of Sect. 17.1 ) . By hypothesis b) we have Fb ( Y ) ::::; F(y) on [c, d] as b --+ w, b E [a, w[, from which it now follows that the function F(y) is continuous on [c, d] . 0
426
17 Integrals Depending on a Parameter
Example 1 1 .
It was shown in Example
F(y) =
+ oo
8 that the integral
J si� x e -xy dx 0
(17.18)
converges uniformly on the interval 0 ::; y < +oo. Hence by Proposition 5 one can conclude that F(y) is continuous on each closed interval [0, d] C [0, +oo[, that is, it is continuous on the entire interval 0 ::; y < +oo. In particular, it follows from this that . 1 1m
y ---+ + 0
+ oo
J0
-- e -xy X
sin x
(17.19)
dx =
1 7. 2 . 3 Differentiation of an Improper Integral with Respect to a Parameter
If a) the functions f ( x, y) and f� ( x, y) are continuous on the set { ( x, y) E ffi:.2 J a ::; X w A c ::; y ::; d}, b) the integral
w
a
w
w
F'(y) = J f�(x, y) dx . Proof.
By hypothesis a ) , for every
a
b E [a, w [ the function b Fb ( Y ) = J f(x, y) dx a
is defined and differentiable on the interval
b
c ::; y ::; d and by Leibniz ' rule
( H )'(y) = J f�(x, y) dx . a
17.2 Improper Integrals Depending on a Parameter
427
By hypothesis b) the family of functions (Fb )'(y) depending on the pa rameter b E [ [ converges uniformly on [c, d] to the function P( y ) as b -+ bE [ [ as b -+ b E [ [ ByIt follows hypothesis c)thisthe(seequantity Fb (Y4 oof) hasSect.a limit from Theorem 16. 3 ) that the fami ly of func tions F ( ) itself converges uniformly on [c, d] to the limiting function F(y) Y asandb -+theb equality b E [ [ the function F is differentiable on the interval c :::; y :::; d, proved. D F'(y) = P(y ) holds. But this is precisely what was to be Example For a fixed value a > 0 the integral +oo J xa e -xy dx 0 converges uniformly with respect to the parameter y on every interval of the form { y E �� y 2 Yo > 0}. This follows from the estimate 0 :::; x"'e - xy x a e - xyo e - x l!f, which holds for all sufficiently large x E Hence, by Proposition 6, the function a, w
w,
a, w .
w,
w,
a, w .
a, w ,
12.
<
R
<
+oo
F(y) = J e -xy dx 0 is infinitely differentiable for y > 0 and p ( n ) (y) = ( - I t
F(y) that = t ' and therefore p canButconclude +oo
+oo J xn e -xy dx . ()
( n ) (y )
= (- l ) n Y ���
J xn e -xy dx = � y n+l . 0
In particular, for y = 1 we obtain +oo J xn e-x dx = n! . 0 Example Let us compute the Dirichlet integral 1 3.
and consequently we
428
17 Integrals Depending on a Parameter
To do this we return to the integral (17.18), and we remark that for y > 0 + oo F (y) = - J sinxe- xy dx , (17. 20) 0 since the integral (17. 20) converges uniformly on every set of the form {y E JR. I Y 2: Yo > 0}. (17. 20)is that is easily computed from the primitive of the inte grand,Theandintegral the result 1 F (y) -1 +-y2 for y > 0 , from which it follows that F(y) = - arctany + c for y > 0 . (17. 2 1) We have F(y) --+(17.21) 0 as ythat --+ +oo, as can be seen from relation (17.18), so that itthatfollows from c = 1r /2. It now results from (17.19) and (17. 2 1) F(O) = 1r /2. Thus, + oo (17. 2 2) J0 sinxX dx �2 . We2 2) remark thatimmediate the relation F(y)of--+Proposition 0 as y --+ +oo" used in deriving (17. is not an corollary 4, since si� x e - xy 0 ofon theintervals form {xof theE JRIform x ;:::0: x0 >x 0},x while as yconvergence --+ +oo onlyis onnot intervals the uniform 0 : for s i n x e - xy --+ 1 as x --+ 0. But for xo > 0 we have xo + oo oo si n x si n x -xy xy J0 X e dx - J0 X e dx + xJo sinX x e-xy dx and, and given > 0 we firstxochoose x0 so closexoto 0 that sin x 2: 0 for x E [0 , x0] 0 J sinX x e - xy dx J sinX x dx �2 0 0 for every y > 0. Then, after fixing x0 , on theoverbasis[x of Proposition 4, by letting +oo, we can make the integral yabsolute tend tovalue. 0 , +oo[ also less than c:/2 in '
'
=
=
"
<
X
--
E
_
<
--
--
<
<
<
=l
17.2 Improper Integrals Depending on a Parameter
429
17.2.4 Integration of an Improper Integral with Respect to a Parameter
If the function f(x, y) is continuous on the set {(x, y) E IR 2 1 a :S x w 1\ c :S y :S d} and b) the integral F(y) = J f(x, y) dx converges uniformly on the closed interval [c, d], then the function F is integrable on [c, d] and the following equality holds:
Proposition 7. a)
<
w
a
( 17.23) J dy J f(x, y) dx = J dx J f(x, y) dy . Proof. For b E [ [ by hypothesis a) and Proposition 3 of Sect. 17.1 for improper integrals one can write w
d
c
w
a
d
c
a
a, w ,
d
b
d
b
J dy J f(x, y) dx = J dx j f(x, y) dy . ( 17.24) Usingan integral hypothesissign,b) weandcarry Theorem 3 of Sect. 16.3 on passage to the limit under out a limiting passage on the left-hand side ofthe(17.24) as b -+ b E [ [ and obtain the left-hand side of (17.23). By very definition of an improper integral, the right-hand side of (17.23) is the limit of the right-hand si d e of (17.24) as b -+ b E [ [ Thus, by hypothesis b) we obtain ( 17.23) from (17.24) as b -+ b E [ , [. D Theof followi ng example shows that,integrals, in contrast to thea)reversibility of the order integration with two proper condition alone is in general not sufficient to guarantee (17.23). Example 14. Consider the function f(x, y) = (2 - xy)xye - xy on the set {(x, y) E IR 2 1 0 :S x +oo 1\ 0 :S y :S 1}. Using the primitive u 2 e - u of the function (2 - u)ue-u, it is easy to compute directly that + oo + oo 0 J dy J (2 - xy)xye - xy dx =f. J dx J (2 - xy)xye - xy dy 1 . c
a
w,
a
c
a, w
w,
a, w .
w,
=
<
1
0
0
a w
1
()
=
0
If a ) the function f ( x, y) is continuous on the set a :S x w 1\ c :S y :S d} and b) nonnegative on P, and Corollary 3. <
P
{(x, y) E JR 2 1
430
17 Integrals Depending on a Parameter w
c) the integral F (y) = J f(x, y) dx is continuous on the closed interval [c, d] as a function of y, then Eq. (17. 2 3) holds. Proof. It follows from hypothesis a) that for every b E [a, w[ the integral a
b Fb ( Y ) = J f(x, y) dx is continuous with b)respect to ,y(y)on theF closed interval [c, d]. ItByfollows from that F � b2 (y) for b 1 � b2 . b ' s theorem and hypothesis c) we now conclude that Fb F on Dini [c, d] as b -+ w, b E [a, w[. the hypotheses Proposition (17.Thus 2 3) indeed holds in theof present case.7 are satisfied and consequently Eq. Corollary 3 shows that Example 14 results from the fact that the function f ( x, y) is not of constant sign. gralsIntoconclusion commute.we now prove a sufficient condition for two improper inte a
=l
D
If a) the function f(x, y) is continuous on the set {(x, y) E IR2 1 a � x < w 1\ c � y < w}, b) both integrals
Proposition 8.
w
w
lf> (x) = j f(x, y) dy
F(y) = j f(x, y) dx ,
c
a
c
converge uniformly, the first with respect to y on any closed interval [c, d] [c, w], the second with respect to x on any closed interval [a, b] [a, w[, and c) at least one of the iterated integrals C
w
w
w
w
j dy j l f l (x, y), dx , J dx J l f l (x, y) dy c
a
a
c
converges, then the following equality holds: w
w
w
w
J dy J f(x, y) dx = J dx J f(x, y) dy . c
a
a
c
(17. 2 5)
17.2 Improper Integrals Depending on a Parameter
431
For definiteness suppose that the second of the two iterated integrals inProof. c)Byexists. a)holdsandforthethefirstfunction conditionf forin b)everyonedcanE [c,sayw[.by Proposition 7 thatIfEq.wecondition (17. 2 3) that the right-hand side of (17. 23) tends to the right-hand side ofthe(17.left-hand 25)show as dside -+ w, d E [c, w[, then Eq. (17.25) will have been proved, since thendefinition also existofandan improper be the limitintegral. of the left-hand side of Eq. (17. 2 3) by thewillvery Let us set q] (x) := j f(x, y) dy; . function is defined for each fixed d E [c, w[ and, since f is contin uous,The is continuous on the interval ::; x By theclosed secondinterval of hypotheses b) we have q] (x) q](x) as d -+ w, d E [c, w[ on each [ , b] [ [ Since l q]
d
c
q)d
qJ d
< w.
a
a
w
d
C a, w .
�
w
a
c
w
a
w
d -tw d E [c,w]
d
a
1 5.
1 <
JR.
7r 4
A
A
A
A
432
17 Integrals Depending on a Parameter
For a > 0 and f3 > 0 the iterated integral + co + co + co + co y x l l y J dy J x<> y<> +.B- e -( + ) dx = J y.B e - dy J (xy) <>e-(xy) y dx 0 0 0 0 of a nonnegative continuous function+ coexists, as this identity shows: it equals zero for y = 0 and J0+ co y.Be-Y dy J0 u<> e - u du for y > 0. Thus, in this case a) and c) of Proposition 8 hold. The fact that both conditions of bwehypotheses ) hold for this integral was verified in Example 3. Hence by Proposition 8 have the equality Example 1 6.
·
+ co
+ co
+ co
+ co
J dx J x<> y<> +.B+ l e ( l +x) y dy . 0 0 0 0 Justcorollary as Corollary followed from lowing from 3Proposition 8. Proposition 7, we can deduce the fol J dy J x<> y<> +.B+ l e-( l +x) y dx =
-
If a) the function f(x, y) is continuous on the set P = {(x, y) E lR2 i a � x < w A c � y � w} , and b) is nonnegative on P, and c) the two integrals
Corollary 4.
w
F(y) = j f(x, y) dx , a
w
cJ>(x) = j f(x, y) dy c
are continuous functions on [a, w[ and [c, w[ respectively, and d) at least one of the iterated integrals w
w
w
w
j dy j f(x, y) dx , j dx j f (x, y) dy , a
c
a
c
exists, then the other iterated integral also exists and their values are the same. Reasoning as in the proof of Corollary 3, we conclude from hypotheses ainProof. ) , b ) , and c ) and Dini ' s theorem that hypothesis b ) of Proposition 8 holds this case. Since f 2:: 0, hypothesis d ) here is the same as hypothesis c) of
Proposition 8. Thus all the hypotheses of Proposition 8 are satisfied, and so Eq. (17. 24) holds. D
17.2 Improper Integrals Depending on a Parameter
433
Remark As pointed out in Remark 2, an integral having singularities at both limits of integration reduces to themakes sum itofpossible two integrals, eachthe ofproposi which has a singularity at only one limit. This to apply tions and corollaries just proved to integrals over intervalson] wclosed Here 1 , w2 [ intervals naturally the hypotheses that were satisfied previously [a ,b] [a,w [ must now hold on closed intervals [a , b] c ] w1 ,w2 [. letExample us show thatBy changing the =order of integration in two improper integrals, f0 e_x2 dx � J7r . (17.26) This is the famous Euler-Poisson integral. Proof. We first observe that for y > 0 += += 2 J e -u du y J e - (xy) 2 dx , 0 0 and that the value interval of the integral inor(17.the2 6)openis the same]O,whether it is taken over the half-open [ 0 , +oo[ interval +oo[ . Thus, += += += += J ye-Y2 dy J e- (xy) 2 dx J e_Y2 dy J e - u2 du = .:72 ' 0 0 0 0 y extends over the interval ] O, +oo [ . andAsweweassume that the integration on permissible to reverse the order of integration over this verify, iteratedit isintegral, and therefore x and y inshall dx 7r 1 + x2 4 ' fromLetwhiuschnowEq.justify (17. 2 6)reversing follows. the order of integration. The function += 1-2 J ye - ( l+x 2 ) y2 dy = �2 1 + x 0 is continuous for x 0, and the function += J ye- ( l+x2 ) y2 dx e_Y2 3.
C
C
1 7.
=
J
:=
=
=
�
0
=
.J
R
17 Integrals Depending on a Parameter
434
is continuous for > 0. Taking account of the general Remark 3, we now conclude indeed legal.from DCorollary 4 that this reversal in the order of integration is y
1 7 . 2 . 5 Problems and Exercises
a = ao < a1 < · · · < a n < < w. We represent the integral (17. 10) as the oo an sum of the series L 'P n ( y ) , where 'P n ( Y ) = J f ( x , y ) dx . Prove that the integral n= l an - 1 converges uniformly on the set E C Y if and only if to each sequence { a n } of this form there corresponds a series L
·
·
·
00
2 . a ) In accordance with Remark 1 carry out all the constructions in Subsect . 17.2. 1 for the case of a complex-valued integrand f. b ) Verify the assertions in Remark 2. 3 . Verify that the function
� y' + y = 0.
Jo ( x ) =
+ oo
J
4. a Starting from the equality
)
( 2n-3 ) !! 1 ( 2n - 2 ) !! . x 2n - 1
•
0
x
1
J c� dt satisfies Bessel ' s equation y" + 1 - t2 0
v
2�y
7r
2"
2
2n 3 !! dy ) n 2 (( 2n -- 2 )) !! 1 + ( y 2 /n ) ( 1 + ( y 2 j n )) - n \., e - y 2 on lR + oo
b) Verify that I 0
c ) Show that
.!. 7r
(
.!!:
+ oo
. l 1m
n ->+ oo
I 0
+ oo
a) J e 0
2 x
( 2n - 3 ) ! ! lim -+ oo ( 2n - 2 ) ! ! n
(17.26) , show that
cos 2xy dx =
� yTie-y 2 .
2 2 b ) J e - x sin 2xy dx = e - y +oo 0
n.
as
dy ( 1 + ( y 2 j n )) n -
d ) Obtain the following formula of Wallis: 5 . Taking account of Eq.
. r,;; V
Y
J et 2 dt. 0
n -+ +oo and that
+ oo
I 0
2 e - y dY
= _y7i1_ .
·
6. Assuming t
17.2 Improper Integrals Depending on a Parameter
>
0, prove the identity +
= e� t
x
I 1 + x2 0
dx
=
+
I t
= sin(x -
x
435
t) dx '
using the fact that both of these integrals, as functions of the parameter t, satisfy the equation jj + y 1/t and tend to zero as t --+ +oo.
=
7. Show that
where K(k)
1
rr / 2
I K(k) dk = I sin 0
0
=
rr / 2
J 0
>
'P
is the complete elliptic integral of first kind.
d .
V 1 - k 2 sm 2 r.p
8. a) Assuming that a
_5!_ dip
0 and b > 0 and using the equality
0
a
0
compute this last integral. b) For a > 0 and b > 0 compute the integral +
I 0
=e
�x a
_ X
e�b
x cos x dx .
-----
c) Using the Dirichlet integral (17.22) and the equality +
b
I �I 0
=
compute this last integral. 9. a) Prove that for
+
I 0
a
sin xy dy
= I+ = cos ax - cos bx dx , X
0
2
k>0
= � kt . e sm t dt
+
I 0
= � tu 2 e du
=
I du I e�(k+u2)t sintdt . + 0
=
+
=
0
b) Show that the preceding equality remains valid for the value k c) Using the Euler�Poisson integral (17.26) , verify that
1
vt
2 + =e �tu 2 du .
fi
I 0
= 0.
17 Integrals Depending on a Parameter
436
d) Using this last equality and the relations
l Sill. + oo
X
2
0
l sinVtt dt , l cos x dx = 1 l COS../it dt , + oo
1
dx = 2
+ oo
+ oo
2
0
2
0
obtain the value ( 4 �) for the Fresnel integrals
0
I sin x dx , I cos x dx . + oo
+ oo
2
10.
2
0
0
a) Use the equality
I sin x dx I e - xy dy + oo
+ oo
0
0
0 0 {zr. I 0, 0 0,
and, by justifying a reversal in the order of integration in the iterated integral, obtain once again the value of the Dirichlet integral (17.22) found in Example 13. b) Show that for a > and j3 > + oo
4
sin ax cos j3x dx = --
""
x
0
if /3 < a ,
2 '
, if
j3 = a ,
if /3 > a . This integral is often called the Dirichlet discontinuous factor. c) Assuming a > and j3 > verify the equality
I
+ oo
sin ax sin /3x
o
X
X
dx =
{
!r.2 j3 '
j3 '5:. Q ' if Q '5:. j3 .
if
2a , 1r
d) Prove that if the numbers a, a 1 , . . . , a n are positive and a > L: a; , then + oo
I 0
11.
sin ax sin a 1 x
sin
On X -- --- · · · ---
X
X
X
n
dx =
1r
i= l
- Q 1 02 · · · Q n .
2
Consider the integral
:F(y) = I f(x, y)g(x) dx , a w
g is a locally integrable function [a, w[ (that is, for each b E [a, w[ g[[a ,bJ E R[a, b] ) . Let the function f satisfy the various hypotheses a) of Propositions 58. If the integrand f(x, y) is replaced by f(x, y) g(x) in the other hypotheses of these propositions, the results are hypotheses under which, by using Problem 6 of where
·
17.3 The Eulerian Integrals
437
Sect. 17. 1 and repeating verbatim the proofs of Propositions 5-8, one can conclude respectively that a) :F E C[c, d] ; b ) :F E C (l ) [c, dj , and
� �� (x, y)g(x) dx ; w
:F' (y) =
a
c ) :F E R[c, d] and
j :F(y) dy = ] ( j f(x, y)g(x) dy) dx ; c
c
a
c ) :F is improperly integrable on [c, w[, and
J :F(y) dy = ] (J f(x, y)g(x) dy) dx .
Verify this.
c
c
a
17.3 The Eulerian Integrals
In this section we shallintegrals illustrateof theimportance applicationin analysis of the theory developed aboveandtothesomenextspecific that depend on a parameter. Following Legendre, we define the Eulerian integrals of first and second kinds respectively as the two special functions that follow: 1 (17. 2 7) B(a,/3) J x"' - 1 (1 - x),B - 1 dx , :=
0
+ oo
(17.28) T(a) · - J x"' - 1 e - x dx . 0 The first of these is called the beta function and the second the gamma function. 17.3.1 The Beta Function
necessary and sufficient condition for the con vergence of the integral (17.27) limit is that a > 0. Similarly, convergence at 1 occurs if and onlyat theif (3 lower > 0.
a. Domain of definition A
438
17 Integrals Depending on a Parameter
Thus the function B( a, /3) is defined when both of the following conditions hold simultaneously: a > 0 and f3 > 0 . Remark. We are regarding a and f3 as real numbers here. However, it should beand keptgamma in mindfunctions that theandmostthe complete picture applications of the properties of theinvolve beta most profound of them their extension into the complex parameter domain. b. Symmetry Let us verify that B(a,/3) = B(f3,a) . (17 . 29) Proof. It suffices to make the change of variable x = 1 - t in the integral (17. 2 7). D c. The Reduction Formula If a > 1, the following equality holds: B(a,/3) = a +a -f3 -1 1 B(a - 1, /3). (17. 30) forProof.a >Integrating 1 and f3 > 0,bywepartsobtainand carrying out some identity transformations 1 B(a, /3) = - � xa - 1 (1 - x) f3 1 � + a ; 1 · j x"'- 2 (1 - x) f3 dx = 0 1 = a ; 1 j xa - 2 ((1 - x) f3- 1 - (1 - x) f3- 1 x) dx = 0 a --B(a 1 a-1 =13 - 1, /3) - -13-B(a, /3) , from which the reduction formula (17. 30) follows. D Taking account of formula (17. 29), we can now write the reduction formula B(a,/3) = a +/3 f3- -1 1 B(a,/3 - 1) (17. 30') on the parameter /3,immediately assuming, from of course, that f3 > of1. the beta function that It can be seen the definition B(a, 1) = � ' and so for n E N we obtain n-1 n-2 n - (n - 1 ) B(a,n) = --a +_n_-'-..,.-(n_.. --'-....,.1). B (a ' 1) = a+n-1 a+n-2 _ (n - 1)! (17. 3 1 ) a(a + 1) · . . . · (a + n - 1) · In particular, for m, n E N - 1)! B(m, n) = (m( m- +1)!(n (17.32) n - 1)!
17.3 The Eulerian Integrals d. Another Integral Representation of the Beta Function
lowing representation of the beta function is sometimes useful:
439
The fol
(17. 3 3) Proof. This representation can be obtained from (17. 27) by the change of van. ables x - 1 y +Y .
D
17.3.2 The Gamma Function
It canconverges be seen byatformula the inte gral defining the gamma function zero only(17.for28)athat > 0, while it converges atfactor infinitye - xfor. all values of a E IR, due to the presence of the rapidly decreasing Thus the gamma function is defined for a > 0. b. Smoothness and the Formula for the Derivatives The gamma func tion is infinitely differentiable, and (17. 34) r ( n ) ( a ) = J x"' - 1 lnn xe - x dx . 0 Proof.toWethefirstparameter verify thatontheeachintegral (17. 34) converges uniformly withfixedre spect for each closed interval [a, b] c] +oo[ valueIf 0of n aE :::;N.a, then (asince 2 lnn x 0 as x 0)O, there exists > 0 Cn x"'l + such that a. Domain of Definition
+ oo
<
�
�
forconclude 0 x that :::; en .theHence by the Weierstrass M-test for uniform convergence we integral J x"' - 1 lnn x e- x dx 0 converges withforrespect If a :::; uniformly b then x 1,to a on the interval [a, +oo[. <
c,
<
+oo,
?:
440
17 Integrals Depending on a Parameter
and we conclude similarly that the integral +oo J x"' 1 lnn xe- x dx converges uniformly with respect weto afindon that the interval ] O, b] .(17. 34) converges Combining these conclusions, the integral unifBut ormlyunder on every closed intervaldifferentiation [a, b] c ] O, +oo[.under the integral sign in these conditions (17. 2 7)open is justified. Hence, ontheanygamma such function closed interval, and hence on the entire interval 0 a, is infinitely differentiable and formula (17. 34) holds. D c . The Reduction Formula The relation r(a + 1) = ar(a) (17. 35) holds. It is known as the reduction formula for the gamma function. Proof. Integrating by parts, we find that for a > 0 +oo +oo oo r(a + 1) := J x"'e- x dx = -x"'e- x i � + a J x"' - 1 e - x dx = -
Cn
<
0
0
+oo
= a J x"' - 1 e- x dx = aF(a) . D 0
Since F(1) = +ooJ0 e - x dx = 1, we conclude that for n E N r(n + 1) = n! . (17. 3 6) Thus the gamma function number-theoretic function n!. turns out to be closely connected with the d . The Euler-Gauss Formula This is the name usually given to the fol lowing equality: r(a) = nli---+moo n"' . a(a + 1) (n· . . -. · 1)!(a + n - 1) (17.37) Proof. To prove this formula, we make the change of variable x = ln � in the integral (17. 2 8), resulting in the following integral representation of the gamma function: 1 r(a) = J ln"' 1 (�) du . (17. 38) -
0
441
17.3 The Eulerian Integrals
It was shown in Example 3 of Sect. 16. 3 that the sequence of functions = n(1 u 1 f n ) increases monotonically and converges to ln ( � ) on the interval 010
-t oo.
=
·
=
2
2
·
2
442
17 Integrals Depending on a Parameter
Hence for 0 a 1 12. r(a)F(1 - a) = -a1 n=II00l -(17. 4 1) 1-a But the following expansion is classical: 00 ( 1 - �22 ) . sin 1ra = 1ra n=l (17.42) II (We shallasnota simple take theexample time toofprove thisofformula just now, sincewhenit willwe bestudyobtained the use the general theory Fourier series. 6 of Sect. 18. 2 . ) Comparing relationsSee (17Example . 4 1) and (17. 4 2), we obtain (17.40). D It follows in particular from (17.40) that (17.43) We observe that r G ) = J x - 112 e- x dx = 2 f e- u2 du , 0 0 and thus we again arrive at the Euler-Poisson integral <
<
Ti2
+ oo
+ oo
1 7. 3 . 3 Connection Between the Beta and Gamma Functions
Comparing formulas (17. 3 2) and (17. 3 6), one may suspect the following connection: r(a) . r({3) ( B a, {3) = (17.44) r(a + {3) between the beta and gamma functions. Let us prove this formula. Proof. We remark that for > 0 r(a) = Ya J Xa - J e -xy dx , y
+ oo
0
443
17.3 The Eulerian Integrals
and therefore the following equality also holds:
using which, taking account of (17. 3 3), we obtain +oo a l . T(a {3) B( a , {3) = J r (1( a y{3))a+y f3- dy = +
+
+
0
+oo 1 a ( J y J xa +f3- l e - ( l+y) x dx) dy � +oo +oo � J ( J y a - l x a+ f3 - l e -( l+ y ) x dy ) dx = +oo +oo l ( J xf3- e-x J (xy) a - l e -xY x dy) dx = +oo +oo J (xf3- l e -x J ua - l e - u du) dx = r (a ) r ({3) +oo 0
0
0
0
0
0
0
0
0
0
Allpoint. that But remains isis toexactly explainwhatthe was equality distinguished by oftheSect. exclama tion that done in Example 16 17.2. D 17.3.4 Examples
In conclusion us considerB and a small group of interconnected examples in which the speciallet functions r introduced here occur. Example 7r /2 (17. 4 5) J sina - l cpcosf3- l cpdcp = � B (� , �) . 0 Proof. To prove this, it suffices to make the change of variable sin2 cp = x in the integral. D Usingfunction. formula In(17.particular, 44), we cantaking expressaccount the integral in terms of the gamma of (17. 4(17.3), 4we5) obtain 7r /2 7r /2 a (17. 4 6) J sin - l cpdcp = j cosa - l cpdcp .fii2 rr( a( �+l) ) . 2 0 0 1.
=
17 Integrals Depending on a Parameter
444
A one-dimensional ball of radius r is simply an open interval and itsExample ( one-dimensional ) volume V1 (r) is the length (2r) of that interval. Thus V1 (r) = 2r . If we assume that ther is((expressed n - ! )-dimensional ) volume of the ( n - ! ) dimensional ball of radius by the formula Vn _ 1 (r) = Cn - 1 T n - t , then, integrating over sections (see Example 3 of Sect. 11. 4 ), we obtain 2.
Vn (r) =
r
J
-r
n --1 Cn - 1 (r 2 X 2 ) -2 -
(
dx =
7f /2
Cn - 1
J
- 7f / 2
COSn
) cpdcp
7f /2 Cn = 2Cn- 1 COSn
J cpdcp . By relations (17. 46) we can rewrite this last equality as 0
r(�) C , Cn - Vir r ( � ) n- 1
so that
r(3) r ( n+ 1 ) r( !!: ) en = ( Vir) n - 1 r ( ) . r ( 2 ) . . . . . r ( ,_2 ) · c 1 � �
or, more briefly, But C1 = 2, and
�
- r(� Cn = n 2 1 r ( n )2 C 1 . t) r (� ) = �r(� ) = � fo, n cn = -;--: r ( �..) .,.-,-7T
so that
7T 2
Consequently,
or, what is the same,
n Vn (r) = r r n . � (�)
(17.47) Example It is clear from geometric considerations that d Vn ( r) Sn - 1 ( r) dr, where Sn - 1 ( r) is the ( n 1 ) -dimensional surface area of the sphere the n-dimensional ball of radius r in JR.n . Thusbounding Sn _ 1 (r) = ddrn (r) , and, taking account of (17. 4 7), we obtain 3.
7T 2
-
17.3
The Eulerian Integrals
445
17.3.5 Problems and Exercises 1.
Show that a) B(1 /2 , 1 /2)
= 1r; b) B(o:, 1 - o:) = I dx; 1 c ) �� ( a: , ;3) = I x" - 1 (1 - x l 1 ln x dx; 00
0
d)
e) f)
� ( a +bx q ) r
I Ioo Ioo d
+ oo 0
+
0
+
0
0
x" _ , 1+x
-
=
a-r q
( ) B ( z±l r - p + 1 ) · £ b
E.±.!_ q
q
'
'
q
"- ____ * '. �l +x n _Q_£_ -
n sin 21r
x _ . 1 + x 3 - 3 .,/:3 '
h) Ioo x " -l '+Inx +
0
n
dx =
x
do: n ( sm. 1ro: )
--'.!..::_
"-
-
(0
<
0:
<
1)·'
i ) the length of the curve defined in polar coordinates by the equation rn nip, where n E N and a > 0, is aB ( � , � ) . 2
an cos
Show that a) F ( 1 ) = F( 2) ; b) the derivative F' of F is zero at some point Xo E ] 1, 2[; c ) the function r' is monotonically increasing on the interval ] O, +oo[; d ) the function r is monotonically decreasing on ]0, xo] and monotonically in creasing on [xo , +oo[; 2.
1
1
e ) the integral I ( In � ) x - In In � du equals zero if x = xo ; 0
F(o:) ± as o: --+ +O; + n g) n--+ lim Ioo e - dx = 1. oo f)
"' 0
3.
x
Euler's formula E := ff r ( �)
k= 1 a) Show that E 2 = X( r ( � ) r ( n ;: k ) . n1 b) Verify that E 2 = sin n" sin 2 :;;n · ...- · s in n - 1 ""n · ( )
17 Integrals Depending on a Parameter -1 2 c) Starting from the identity z;_-/ = nTI ( z - e i �" , let z tend to 1 to obtain ) k=1 n- 1 · 2 k 1r n = II ( 1 - e' " ) , k=1 and from this relation derive the relation n- 1 . n = 2 n - 1 II s1n k7r . n k=1 446
d) Using this last equality, obtain Euler's formula. 4.
Legendre 's formula r(a)r ( a: + � ) = 2 2'{Ji 1 r(2o:) . 1 /2 2 a- 1 dx. a) Show that B(o:, o:) = 2 I (t - ( � - x ) ) 0
b) By a change of variable in this last integral, prove that B(o:, o:) 2;_ 2 1 B ( � , a: ) c) Now obtain Legendre's formula. ·
5. Retaining the notation of Problem 5 of Sect. 17. 1 , show a route by which the second, more delicate part of the problem can be carried out using the Euler inte grals. a) Observe that k = k for k = � and
?r / 2
E=E=
I J1 � sin2 cp dcp , -
0
b) After a suitable change of variable these integrals can be brought into a form from which it follows that for k = 1 I .;2 1 1 K = rn B(114, 112) and 2E - K = rn B(3I4, 11 2 ) . 2v 2 2v 2 c) It now results that for k = 1 I .J2
EK + EK - KK = 7rl2 .
1 6. Raabe 's 1 integral I ln r(x) dx. 0
Show that 1 1 a) I ln r(x) dx = I ln r( 1 - x) dx. 0
0
1 • /2 b) I ln r(x) dx = � ln 1r - � I ln sin x dx. 0
0
1 J.L. Raabe (1801-1859) - Swiss mathematician and physicist.
n/2
17.3 The Eulerian Integrals
n/2
447
c) I0 ln sin x dx = I0 ln sin 2x dx - � ln 2. n/2
d) I0 ln sin x dx = - � ln 2. e) I0 ln r ( x ) dx = ln .;21r.
1
7.
Using the equality
_!,_ X8
_
1_ r(s)
_
I y s - 1 e - x y dy + oo 0
and justifying the reversal in the order of the corresponding integrations, verify that + oo n a "' - 1 a) I0 � x a dx = 2 r ( a ) co s ¥ (0 < a < 1) ·' s i n bx dx = b) I0 --;;r r + oo
n b/3 - 1 2 r ( {3 ) s i n ii,j
( 0 < {3 < 2).
c) Now obtain once again the value of the Dirichlet integral I0 s i: x dx and the + oo
+ oo
+ oo
value of the Fresnel integrals I0 cos x 2 dx and I0 sin x 2 dx. ·
8.
Show that for a > 1 + oo
I 0
x "'- 1 dx = r(a) . ((a) ' ex - 1
where ((a) = L n1"' is the Riemann zeta function. n= l 00
9.
Gauss ' formula. In Example 6 of Sect. 16.3 we exhibited the function � a( a + 1) · · · (a + n - 1){3({3 + 1) · · · ({3 + n - 1) x n F(a, f3, 'Y, x) ·. - 1 + � ' n!'Y('Y + 1 ) · · · ('Y + n - 1 )
which was introduced by Gauss and is the sum of this hypergeometric series. It turns out that the following formula of Gauss holds: . F(a, {3, 'Y, 1 ) = r('Y) r('Y. - a - {3) . r('Y - a) F('Y - {3) a) By developing the function (1 - t x) - 13 in a series, show that for a > 0, 'Y - a > 0, and 0 < x < 1 the integral P(x) =
1
I t "' 1 ( 1 - t ) 0
-
-y - a -
1 ( 1 - t x) - !3 dt
448
17 Integrals Depending on a Parameter
can be represented as P(x) =
00
Pn · X n , L n=O
'y a w h ere pn - !3(13+ 1 ) ..n!·(/H n - 1 ) · r ( a+n)·r r ( 'Y+n( ) - ) b) Show that r("y - a: ) . a: ( a: + 1) ( a: + n - 1) ,8 ( ,8 + 1) (,B + n - 1) Pn = r(o:) r("y) �----�--��--���--�--�------� n !"Yh + 1) ("y + n - 1) c) Now prove that for a: > 0, "Y - a: > 0, and 0 < x < 1 r(o:) . r("y - a: ) P(x) = · F(o:, ,B, "'f, X) . T("y) d) Under the additional condition "Y - a: - ,B > 0 justify the possibility of passing to the limit as x ---+ 1 - 0 on both sides of the last equality, and show that r(o:) . r("y - a: - ,B) = r(o:) . r("y - a:) F(o:, ,B, "'f, 1) , r("y - ,B) r("y) from which Gauss' formula follows. 10. Stirling 's2 formula. Show that 2 m for lxl < 1 '· x_ a) ln 11 -+ x = 2x "' L..J _ 2m+l _
0
0
0 0 0
0 0
0 0 •
00
x m=O b) ( n + �) In ( 1 + � ) = 1 + L 2n : 1 ) 2 + t ( 2n : 1 ) 4 + L2n:1 ) 5 + . . · ; c) 1 < ( n + �) ln ( 1 + �) < 1 + 12n (�+1 ) for n E N ; ( 1 + n ) n + 1 /2 eiTn1 d) 1 <
_l
<
e
e 1 2(n + l )
e) a n = n < ;:-;;7 2 ) is a monotonically decreasing sequence; f) b n = a n e - 1 21 n is a monotonically increasing sequence; g) n ' = cn n+1 1 2 e- n+ � , where 0 < (}n < 1 , and c = nlim a n = nlim bn , --+ oc;. --+ CXJ h) the relation sin 1rx = 1rx ( 1 - S ) with x = 1/2 implies Wallis' formula ·
·
J:t
. (n !?2 2n . 1r,:; ; ,fir = n-+ hm oo ( 2n ) '. y n i) Stirling 's formula holds: n ! = � ; e � ' 0 < (jn < 1 ; ••
(r
y'2;;; ( � r as X ---+ + oo. Show that r(x) f= ( �!�n � + I e- 1 e-t dt. This relation makes it possible 1 n=O
j) T(x + 1) 11.
rv
=
°
to define r(z) for complex z E
2 J . Stirling (1692-1 770) - Scottish mathematician.
17.4 Convolution and Generalized Functions
449
17.4 Convolution of Functions and Elementary Facts about Generalized Functions 17.4. 1 Convolution in Physical Problems (Introductory Considerations)
AcarryvariOEt ety oftheirdevices and systems in theto living and nonliving natural world functions responding a stimulus f with an appropriate signal f. In other words, each such device or system is an operator A that transforms the incoming signal f into the outgoing signal ! = A f . Naturally, each such operator has its own domain ofthem perceivable signals (domain of def inition) and i t s own form of response to (range of values). A convenient mathematical a large class of actual processes and machines is a linear operatormodel A thatforpreserves translations. Definition 1 . Let A be a linear operator acting on a vector space of real- or complex-valued functionsacting definedon theon JR.sameWespace denoteaccording by Tt0 theto the shift operator or translation operator rule (Tt0 f)(t) := f(t - to) . The operator A is translation-invariant (or preserves translations) if for every function f in the domain of definition of the operator A. If t is time, the properties relation Aof theTt0device can be interpreted as the = Tt0A areA time-invari assumption that the a nt: the reaction ofto the devcenothing to the signals f ( t) and f ( t - t0) differ only in a shift by the amount inFortime, more. every device --!f ofthethefollowi ng totwoanfundamental problems arise: first, to predict the �action device arbitrary input f; second, knowing the Atoutput f, to determine, if possible, the input signal f. this point shall solve the afirstnt linear of theseoperator two problems in application to awetranslation-invari A. � is aheuristically simple, but very important fact that in order to describe the response f of such a device A to any input signal f, it suffices to know the response E of A to a pulse o. Definition 2. The response E(t) of the device A to a unit pulse o is called the of theengineering). device (in optics) or the transient pulse function of thesystem devicefunction (in electrical As a rule,going we shall usedetailthejust brieferyet,term "system function. " can be imi Without into we shall say that a pulse tated, for example, by the function 00(t) shown in Fig. 17.1, and this imitation is assumed to become closer as the duration a of the "pulse" gets shorter, o
o
450
17 Integrals Depending on a Parameter
�L C¥ 1
I I I I I I I I I I
0
t
a
Fig. 1 7 . 1 .
preserving theusingrelation � 1. Instead of step functions, one may imi tate a pulse smooth functions ( Fig. 17. 2 ) while preserving the natural conditions: 1 1Q ( t) dt 1 , I fet (t) dt 1 as 0 , whereTheU(O) is an ofarbitrary neighborhood of the point t 0. response the device A to an ideal unit pulse ( denoted, following Dirac, byofthetheletter 8) should be regarded as a function E(t) to which the response device to an continuity input approximating 8 tends as the imitation improves. Naturally a Acertain of the operator A is assumed ( not made preciseunderas yeta continuous ) , that is, continuity of the change in the response J of the For device change in {..:1the (t)input f. example, if we take a sequence n } of step functions ..:1 n (t) 81; n (t) ( Fig. 17.1), then, setting A..:1 n En , we obtain A8 E li--+oo m En n lim A..:1 n . n --+oo Let usfunction now consider the input signal f in Fig. 17. 3 andf astheh piecewise constant lh (t) 'L f( Ti ) 8h (t - Ti ) h. Since lh 0, one i must assume that lh Alh Af f as h 0 . a ·
=
--+
=
lR
U ( O)
a --+
=
:=
=:
=
=
Fig. 1 7 . 2 .
--+
=
--+
--+
Fig. 1 7 . 3 .
:= =
=
--+
17.4 Convolution and Generalized Functions
451
But if the operator A is linear and preserves translates, then lh ( t) = L ! h )Eh (t - Ti )h i where Eh = Aoh . Thus, as h --+ 0 we finally obtain J( t) = I f (r ) E ( t - r ) dr . (17. 48) Formula (17.48) solves the first of the two problems indicated above. It represents the response j( t) of the device A in the form of a special integral depending on the parameter t. This integral is completely determined by the input signal f(t) and the system function E ( t) of the device A. From the mathematical point of view the device A and the integral (17. 4 8) are simply identical. that the problem of determining the input signal fromWethenoteoutputincidentally J now reduces to solving the integral equation (17. 4 8) for f. Definition 3. The convolution of th e functions u : lR --+ C and v : lR --+ C is the function u * v lR --+ C defined by the relation (17. 4 9) (u * v)(x) := I u(y)v(x - y) dy , provided this improper integral exists for all x E JR. Thus formula (17.to48)anasserts that theby response of a flinear device A that preserves translates input given the function is the convolution f * E of the function f and the system function E of the device A. '
R
:
R
17.4.2 General Properties of Convolution
Now pointletof usview.consider the basic properties of convolution from a mathematical a. Sufficient Conditions for Existence We first recall certain definitions andLetnotation. a real- or complex-valued function defined on an open set The G cffunction R: G --+ Cf beis locally on G if every point x E G has a neigh borhood U(x) G in whichintegrable the function fl u ( x ) is integrable. In particular, ifequiGv=alentJR, the condition of local integrability of theclosed function f is obviously to the relation ! I [a,b] E R[a, b] for every interval [a, b]. support of the function f ( denoted supp f) is the closure in G of the set The {x E G l f(x) =f- 0}. C
17 Integrals Depending on a Parameter
452
function f is of compact support (in G) if its support is a compact set. The set of functions f : G --+ C having continuous derivatives in G up to order m (0 :::; m :::; oo) inclusive, is usually denoted C (m) (G) andm) the subset of it consisting of functions of compact support is denoted Cr� (G) . In the case when G = � ' minstead ofmC (m ) (�) and c6m) (�) it is customary to use the Weabbreviation C ( ) and c6 l respectively. most frequently encounteredwithout cases ofdifficulty. convolution of functions,nowinexhibit which ithets existence can be established A
Each of the conditions listed below is sufficient for the ex istence of the convolution u * v of locally integrable functions u : � --+ C and v : � -+ C. 1) The functions l u l 2 and l v l 2 are integrable on R 2) One of the functions l u i , l v l is integrable on � and the other is bounded on R 3) One of the functions u and v is of compact support. Proof. 1) By the Cauchy-Bunyakovskii inequality Proposition 1 .
( j l u (y ) v( x - y)l dyr :::; j l u l 2 (y ) dy j l v l 2 (x - y) dy , IR
IR
IR
from which it follows that the integral (17. 49) exists, since + oo
+ oo
J l v l 2 (x - y ) dy = J l v l 2 (y ) dy .
- oo
- oo
2) If, for example, l u i is integrable on � and l v l :::; M on � ' then J l u (y ) v (x - y )l dy :::; M J l u l (y ) dy < +oo . 3) Suppose supp u [a, b] Then obviously IR
IR
C
C
R
b
J u (y)v(x - y) dy = J u(y) v (x - y) dy . u and v are locally integrable, this last integral exists for every value of xSince E case whenby thethe change functionofofvariable compact support is v reduces to the one justTheconsidered x - y = z. D IR
R
a
17.4 Convolution and Generalized Functions
453
b. Symmetry
If the convolution u * v exists, then the convolution v * u also exists, and the following equality holds:
Proposition 2.
(17. 50)
Making the change of variable x - y z in (17. 49), we obtain u * v(x) := J u(y)v(x - y) dy = J v(z)u(x - z) dz v * u(x) . D <X) <X) invariance c. Translation Invariance Suppose, as above, that Tx 0 is the shift operator, that is, (Tx0 )f(x) = f(x - xo). =
Proof.
+=
+=
-
=:
-
If the convolution u * v of the functions u and v exists, then the following equalities hold: (17. 5 1) Tx 0 ( U * V ) Tx 0 U * V U * Tx 0 V . Proof. If we recall the physical meaning of formula (17. 48 ) , the first of these
Proposition 3.
=
=
equalities becomes obvious,Nevertheless, and the second can then be obtained fromof the symmetry of convolution. let us give a formal verification the first equality: (Tx 0 )(u * v)(x) := (u * v)(x - xo) += j u(y)v(x - x o - y) dy = :=
=
-
CXl
=
+=
j u(y - xo)v(x - y) dy
-
+=
CXl
J (Tx0 u)(y)v(x - y) dy
-=
=:
=
( (Tx 0 U) * v) (x) .
D
The convolutionof ofit functions is an integral depending on a parameter, and differentiation can be carried out in accordance with thehypotheses general ruleshold.for differentiating such integrals, provided of course suitable The conditions under which the convolution (17.49) of the functions u and v is continuously differentiable are demonstrably satisfied if, for example, u issupport. continuous and v is a smooth function and one of the two is of compact Proof. Indeed, if we confine the variation of the parameter to any finite in terval, then under theseclosed hypotheses theindependent entire integral (17. 4 9) reduces to the integral over a finite interval of x. Such an integral can beruledifferentiated respect to a parameter in accordance with the classical of Leibniz. with D d. Differentiation o f a Convolution
454
17 Integrals Depending on a Parameter
In general the following proposition holds: Proposition 4. If u is a locally integrable function and v is a c6 m l function of compact support (0 ::::; m ::::; +oo), then ( u * v ) E C ( m ) , and? (17. 52) Proof. When u is a continuous funcction, the proposition follows immediately from what was just proved above. In its general form it can be obtained if we also keep in mind the observation made in Problem 6 of Sect. 17.1. D Remark In view of the commutativity of convolution (formula (17. 50)) Proposition of course remains valid if u and v are interchanged, preserving the Formula left-hand4(17.side of Eq. (17.52). 5 2)it commutes shows that with convolution commutes with(17.the5 1)). differentiation operator, just as translation (formula But while (in17.51) is symmetric in u and v, one cannot in general interchange u and v fail to have the corresponding the right-hand sidethatof the(17.convolution 5 2), since u may derivative. The fact u * v, as one can see by (17. 5 2), may turn out to 4bearea differentiable might suggest that the hypotheses ofstillProposition sufficient, butfunction, not necessary for differentiability of the convolution. Example Let f be a locally integrable function and ba the "step" function shown in Fig. 17 .1. Then 1.
1.
+=
X
(17. 53) J f ( y) ba (x - y) dy = �x-Ja f(y) dy , and consequently point of (smoothing) continuity ofproperty f the convolution f * ba is differentiable, dueatto every the averaging of the integral. The4 conditions for completely differentiability of theforconvolution stated incasesPropo sition are, however, sufficient practically all the one encounters in which formula (17. 5 2) is applied. For that reason we shall not attempt to refine them any further, preferring to illustrate some beautiful new possibilities just discovered.that open up as a result of the smoothing action of convolution (J * ba ) (x) =
-=
17.4.3 Approximate Identities and the Weierstrass Approximation Theorem
We that the integral in (17. 5 3) gives the average value of the func tionremark f on the interval [x x] , and therefore, if f is continuous at x, the ·-
a,
3 Here D is differentiation, and, as usual D k v = v ( k ) .
17.4 Convolution and Generalized Functions
455
relation ( ! * 8a ) (xconsiderations ) f (x) obviously holds17.4.1 as that0.gave In accordance with the introductory of Subsect. a picture of 8-function, we would like to write this last relation as the limiting equalitythe ( ! * 8) (x ) f(x) , if f is continuous at x . (17.54) This equalityelement showswith thatrespect the 8-function can be Equality interpreted(17.54) as thecanidenbe tity (neutral) to convolution. regarded astomaking perfect sense if it same is shown that every family of fami functions converging the 8-function has the property as the special ly 8 a of (17.53). Let us now pass to precise statements and introduce the following useful definition. 4. The family { Lla ; E A} of functions Lla lR lR depending onDefinition the parameter E A forms an approximate identity over a base in A if the following three conditions hold: a) all the functions in the family are nonnegative (Lla 2: 0); b) for every function Lla in the family, IRI Lla (x) dx 1; c) for every neighborhood U of 0 E JR, liBm UI Lla(x) dx 1. Takivalent ng account the firstlimtwoI conditions, we see that this last condition is equi to the ofrelation Lla(x) dx 0. B IR\U The original family of "step" functions 8a considered in Example 1 of Subsect. 17.4.1 is an approximate identity as 0. We shall now give other examples of approximate identities. Example 2. Let lR lR be an arbitrary nonnegative function of compact support that is integrable over lR and satisfies IRI c.p(x) dx 1. For > 0 we ) construct antheapproximate functions Lla(x) obviously identity as� e.p ( � . +0The(seefamiFig.ly of17.2).these functions is Example 3. Consider the sequence of functions for l xl :S 1 , for l x l > 1 . To that establish that c)thisof fami ly is an4 holds approximate identity weandneedb). only verify condition Definition in addition to a) But for every E]O, 1] we have ----+
a ----+
=
:
a
a
=
----+
B
=
=
a ----+
c
:
----+
:=
=
a ----+
a
17 Integrals Depending on a Parameter
456
0 :S J (1 - x2 t dx :S J (1 - c-2 t dx 1
1
At the same time,
=
1- . /(0 1 - x2 ) n dx > /(1 - x t dx n +1 0 Therefore condition c) holds. Example 4- Let Lln (x) cos2n (x) / -''P1rj 2 cos2n (x) dx for l x l :S 7r/2 , for l x l > 7r /2 . 0 firstAsof ialln Example that 3, it remains only to verify condition c) here. We remark 7r/ 2 r ( ) > __L r( ) J0 cos2n xdx -B21 (n + -21 , -)21 -21 r (nF(+n) ) __L n 2n On the other hand, for E]O, / 2 [ 7r / 2 7r /2 J cos 2n xdx :S J cos2n c- dx � (cos c- ) 2n . Combining the two inequalities just obtained, we conclude that for every E]O, / 2] , 7r / 2 J Lln (x) dx -+ 0 as n -+ from which it follows that condition c) of Definition 4 holds. Definition 5 . The function f : G -+
1
=
{
=
=
1
2ri:
=
E
·
1
1
·
1r
<
E
7r
oo ,
C
E
< E
=
17.4 Convolution and Generalized Functions
457
We now prove a fundamental proposition. Proposition 5 . Let f : IR ---+
�
C
w; .
(f * Ll "' )(x) f(x) on E as a ---+ Thus it is asserted that the family of functions f * Ll"' convergesif Euniformly toof only f on a set E on which it is uniformly continuous. In particular, consists one point, the condition of uniform continuity of f on E reduces to that f be continuous at x, and we find that U * Ll<>)(x) ---+ f(x) astheacondition ---+ Previously this fact served as our motivation for writing relation w.
(17.Let54).us now prove Proposition 5. Supposewithl f(x)Definition l M on R Given a number E > 0, we choose > 0 iProof. nU(O).accordance 5 and denote the p-neighborhood of 0 in IR by ng account the symmetry of convolution, twoTaki estimates, whichofhold simultaneously for all x E weE: obtain the following p
:::;;
= I J f(x - y)Ll"' (y) dy - f(x) l I J (f(x - y) - f(x) )Ll"' (y) dy l :::;; J l f(x - y) - f(x) IL1"' (y) dy J l f(x - y) - f(x) I Ll"' (y) dy J Ll"' (y) dy J Ll<> (Y) dy J Ll"' (y) dy :::;;
j (f * Ll "' )(x) - f(x) j
=
=
IR
:::;;
IR
<
+
U(O)
IR\U(O)
<
As
a
---+
E + 2M
+ 2M
E
U(O)
w,
IR\U(O)
0
IR\U{O)
this last integral tends to zero, so that the inequality l (f * Ll "' (x) - f(x) l
<
2E
holds for all5.x ED E from some point a6 on. This completes the proof of Proposition Corollary 1 . Every continuous function of compact support on IR can be uniformly approximated by infinitely differentiable functions. Proof. Let verify that cg= ) is everywhere dense in Co in this sense.
458
17 Integrals Depending on a Parameter
We let, for example,
{
exp ( - 1 _lx2 ) for lxl 1 , for lxl 1 , 0 where k is chosen so that J r.p(x ) dx = 1. Thethefunction r.p is of compact support and infinitely differentiable. In that case, fami l y of infinitely differentiable functions Ll0+0. If�fr.p E( �C) , asitobserved inthatExample 2, is an approximate identity as 0, is clear f * Ll0 E C0. Moreover, by Proposition 4 we have f * Ll0 E C0. Finally, it follows from Proposition 5 that f * Ll0 f on lR as +0. D Remark If the function f E Co belongs to C�m ) , then for every value n E {0, 1, . . . , m} we can guarantee that ( ! * Ll a ) ( n ) f ( n ) on lR as +0. Proof. Indeed, in this case ( ! * Ll a ) ( n ) f ( n ) * Ll0 (see Proposition 4 and Remark 1). All that now remains is to cite Corollary 1. D Corollary 2. (The Weierstrass approximation theorem). Every continuous r.p( x )
=
<
k·
�
IR
:::::t
2.
=
a -+
a -+
:::::t
=
a -+
function on a closed interval can be uniformly approximated on that interval by an algebraic polynomial. Proof. Since polynomials map to polynomials under a linear change of vari
able preserved, while the continuity and uniformity of the2 onapproximation of functions are it suffices to verify Corollary any convenient interval we shall assume 0 a b 1 and let [a ,=b] min{a, 1For- b}.thatWereason continue the given function f E C[a, b] to a func tion F that is continuous on lR by setting F(x) 0 for x E IR\]0, 1 [ and, for example, letting F be a linear function going from 0 to f(a) and from f(b) to 0Ifonwethenow intervals , a] and [b, 1] identity respectively. [0approximate take the Ll of Example 3, we can conclude from Propositionconsisting 5 that ofF *theLlnfunctions f = F ln on [a , b] as n oo. But for x E [a , b] [0 , 1] we have 1 00 F * Ll n (x) : = J F ( y )Ll n (x - y) d y = J F(y ) Ll n (x - y) d y p
C
R
=
-+
1
-
<
<
<
:::::t
C
= j F(y)pn 0
=
0
oo
·
(
1-
(x - y) 2 r
[a , b]
1
dy = j F(y) ( L=O ak (y)xk ) dy 0
2n
k
2n
1
=
)
= L (j F ( y )a k (y) dy x k . k =O o
is a polynomial P2n (x ) of degree 2n and we have shownThisthatlastP2expression n f on [a, b] as n oo. D :::::t
-+
17.4 Convolution and Generalized Functions
459
Remark By a slight extension of this reasoning one can show that Weier strass compacttheorem subset remains of R valid if the interval [a, b] is replaced by an arbitrary Remark 4. It is also not difficult to verify that for every open set G in lR and every function f E c< m ) ( G ) there exists a sequence { Pk } of polynomials such n ) ( that P� f n) on every compact set K G for each n E {0, 1, . . . , m} as k ---+If in addition the set G is bounded and f E c<m l (G) , then one can even get P�n) f ( n) on G as k ---+ oo. Remark Just as the approximate identity of Example 3 was used in the proof of211'Corollary 2, one canonuselR can the besequence fromapproximated Example 4 tobyprove that every periodic function uniformly trigono metric polynomials of the form n Tn (x) = L a k cos kx + bk sin kx . k=O Wesupport have used onlyHowever, approximate identities madein mind up of that functions of com pact above. it should be kept approximate identities of functions not oftwocompact in many cases. We shallthatgiveareonly examples.support play an important role Example The family of functions Ll y (x) = � x2 �Y2 is an approximate identity on lR as y ---+ +0, since Lly > 0 for y > 0, f00 Lly (x) dx = ..!:_11' arctan (�)y l +oo= oo = 1 , x - oo and for every > 0 we have f Lly (x) dx = -11'2 arctan -y ---+ 1 , whenIf fy ---+lR +0.---+ lR is a bounded continuous function then the function , +oo (17. 5 5) u(x, y) = :;;:1 f (x -f(�)y 02 + y 2 d� , - oo whiAsch isonethecanconvolution f * Lly , is defined for all x E lR and y > 0. easily verify the Weierstrass M-test, the integral (17.55), which is called the Poisson using integral for the half-plane, is a bounded infinitely 3.
C
�
00 .
�
5.
·
5.
p
p
-p
:
p
460
17 Integrals Depending on a Parameter
2 differentiable in thesign, half-plane tiating it underfunction the integral we verifyIR� that{(x,fory)y E> IR0 1 y > 0}. Differen =
that is, u is a harmonic function. By Proposition 5 one can also guarantee that u(x, y) converges to f(x) asbounded y -+ 0. Thus, the integral (17. 5 5) solves the problem of constructing a function values that isf harmonic scribed boundary on oiR� . in the half-plane IR� and assumes pre Example The family of functions Llt 2 � e - is an approximate identity on IR as t -+ +0. Indeed, we certainly have Llt > 0 and +ooJ Llt(x) 1, since -+ooJoo e - v dv ,j1i ( the Euler-Poisson integral) . Finally, for every > 0 we have e - v dv -+ 1 , as t -+ +0 . 6.
2 �.
=
=
2
p
=
2
If f is a continuous and, for example, bounded function on IR, then the function +oo 1 - d� , (17. 56) u ( x , t) 2v nt J f(�) e which is the convolution f * Llt , is obviously infinitely differentiable for t > 0. By differentiating under the integral sign for t > 0, we find that =
c;
(x - 0 2
-
oc
4t
that is,condition the function u satisifes the one-dimensional heat equation with the initial u(x, 0) as the5. which should followsbefrominterpreted Proposition limiting relation u (x , t) -+f(xf(x)). This as t -+last+0,equality =
17.4.4 *Elementary Concepts Involving Distributions of Generalized Functions In Subsect. 17. 4 .1 of this section wea.us Definition derived the formula (17.48)ofona linear the heuristic level. ThisA toequation enabled to determine the response transformation an input signal f given that we know the system function E of the device A. In determining
17.4 Convolution and Generalized Functions
461
the system function ofaction a device wethe made essentialthatusedescribes of a certain intuitive idea of a unit pulse and £�" function it. It issenseclear,of however, that the £�" function is really not a function i n the classical the term, since it must have the following properties, which contradict the classical point of view: J(x) 0 on JR; J(x) = 0 for x =f. 0, J J(x) dx = 1. Thethe concepts connectedof with linearacquire operators, convolution, the £�"-function, and system function a device a precise mathematical descrip tion in the so-called theory of generalized functions or the theory of distribu tions.everWemore are now explain theofbasic principles and the elementary, but widelygoingusedtotechniques this theory. Example Consider a point mass m that can move along the axis and is attached to oneelastic end ofconstant an elasticof the springspring. whoseSuppose other endthatis fixed at the origin; letforcek bef(t)the a time-dependent begins to act on the point resting at the origin, moving it along the axis. By Newton ' s law, (17. 5 7) mx + kx = f ' where x( t)atistime the coordinate of the point (its displacement from its equilibrium position) t. Underf,these conditions thex(t)function x(t) is uniquely determined by the function and the solution of the differential (17. 5 7)with is obvi ously a linear function of the right-hand side f. Thusequation we are dealing the linear operator f � x inverse to the differential operator x � f (where k) that connects x(t) and f(t) by the relation Bx = f. Since the B = m/;xA+obviously operator commutes with translations over time, it follows from (17. 4 8) that i n order to find the response x(t) of this mechanical system to the function suffices tofundamental find its response J, that is, it suffices to knowf(t),theit so-called solutionto aEunit of thepulseequation (17. 58) mE + kE = J . RelationHowever (17. 5 8)Eq.would not raise anyclear. problems if J actually denoted as function. (17. 5 8) is not yet But being formally unclear i quiteexplain a differentthe thing fromofbeing actually false. In the present case one need only meaning (17. 5 8). to such anidentity explanation is already familiar to us:andwe can interpretof Jclassical asOnean route approximate imitating the delta-function consisting functions Lla(t); we interpret E as the limit to which the solution Ea(t) of the equation (17. 5 7') tends as the parameter changes suitably. �
7.
a
IR
462
17 Integrals Depending on a Parameter
seconda fundamental approach to this problem,ofonethethatideahasof asignificant is toAmake enlargement function. advantages, It proceeds from the remark that in general objects of observation are characterized bya their interaction with other ("test") objects. Thus we propose regarding function not onas other a set (test) of valuesobjects at different points,manner. but rather astryantoobject that can act in a certain Let us make this statement, which as of now is too general, more specific. Example Let f E C(JR, JR) . As our test functions, we choose functions in C0 (continuous functions of compact support on JR) . A function f generates the following functional, which acts on C : 8.
0
(17.59) approximate identities0consisting of functions of compact support, oneUsing can easily see that (!, cp) on C0 if and only if f(x) 0 on R Thus, each function f E C(JR, JR) generates via (17. 59) a linear functional At : C0 lR and, we emphasize, different functionals A and A correspond to different functions(17.h59)andestablishes h. an embedding (injective mapping) of Hence formula the setconsequently of functionsevery C ( JR, JR ) into the set .C ( C0 ; JR) of linear functionals on C0 , and function f E C (JR, JR) can be interpreted a certain functional At E .C(Co ; JR) . IfC (weJR, JRconsider the class of locally integrable functions on lR instead of the setmapping ) of continuous functions, we obtain by the same formula (17. 5 9) a of this set into the space .C(C0 ; JR) . Moreover ( (!, cp) 0 on Co ) {::} ( f(x) = 0 at all points of continuity of f on JR, that is, f(x) = 0 almost everywhere on lR) . Hence in this case we obtain an embedding of equivalence classes of functions i n to .C ( C0 ; lR ) if each equivalence class contains locally integrable functions that differ functions only on a fsetonoflRmeasure zero. equivalence Thus, the locally integrable (more precisely, classes of such functions) can be interpreted via (17.59) as linear functionals At E .C(Co ; lR) . The mapping f H At = (!, ·) provided by (17.59) of locally integrable functions intofunctions .C(C0 ; JR) is not a mapping onto all of .C(C0 ; JR) . Therefore, interpreting as elements of .C(interpreted C0 ; JR ) (that is, as func tionals) we obtain, besides the classical functions asnofunctionals ofin the form (17.59), also new functions (functionals) that have pre-image the classical functions. Example The functional E .C ( C0 ; JR) is defined by the relation (17. 60) (J, cp) := J(cp) := cp(O) ' which must hold for every function E C0 . (!, cp) : = J f(x)cp(x) dx . IR
=
--t
h
=
h
as
=
9.
c5
'P
17.4 Convolution and Generalized Functions
463
can verify (see Problemo in7)thethatformno locally integrable function f on IR canWe represent the functional (17.59). Thus we have embedded the set of classical locallyfunctionals integrable arefunctions into a larger set of linear functionals. These linear called generalized functions or distributions (a precise definition is given below). The widely used term "distribution" has its origin in physics. Example Suppose a unit mass (or unit charge) is distributed on If this distribution sufficientlydensity regular,p(x)in the it has, offortheexample, continuous oris integrable on IR,sensethe that interaction mass Ma with other objects described by functions 'Po E Cboo) can be defined as a functional M( tp) J p(x)tp(x) dx . Ifat thea single distribution is singular, for example, the whole mass M is concentrated point, then by "smearing" the mass and intepreting thedistribu limiting point situation using an approximate identity made up of regular tions, we find the interaction mass with the other objects mentioned abovethatshould be expressedofbythea formula M( tp ) tp (O) , which shows(17.that60) such o-function on R.a mass distribution on IR should be identified with the These preliminary considerations give some sense to the following general definition. Definition 6. Let P be a vector space of functions, which will be called the space of test functions from now on, on which there is defined a notion of convergence. TheP'space of generalized functions or distributions on P is the vector space of continuous (realor complex-valued) linear functionals on P. Here it is assumed that each element f E P generates a certain functional A (!, ·) E P' and that the mapping f A is a continuous embedding ofconvergence P into P'ofiffunctionals, the convergence that is,in P' is introduced as weak ("pointwise") 1 0.
JR.
=
lR
Jl;f
=
f =
f--t
f
Let us make this definition more precise in the particular case when P is the vectorin space Cboo) ( G, C) of infinitely differentiable functions of compact support G, where G is an arbitrary open subset of IR (possibly IR itself). Definition 7 . (The spaces V and V') . We introduce convergence in Cb00 ( G, C) as follows: A sequence { 'Pn } of functions 'Pn E Cb00 \ G) con verges to 'P E C1�oo) ( G, C) if there exists a compact set K G that contains )
C
17 Integrals Depending on a Parameter
464
the supports of all the functions of the sequence { cpn } and cp�m ) cp ( m ) on K (and hence also on G) as forwayall mwith= 0,this1, 2,convergence . . .. The vector space obtained i n this is usually denoted V(G), and when G = JR. , simply V. We denote the space of generalized functions (distributions) corresponding to this space of basic (test) functions by V' (G) or V' respectively. In thisother sectionthanandthetheelements one following we(G)shall notintroduced. consider For any that generalized functions of V' just reason weof V'shall(G)usewithout the termsayingdistribution or generalized function to refer to elements so explicitly. asDefinition 8 . A distribution F E V' (G) is regular if it can be represented F(cp) = J f(x)cp(x) dx , cp E V(G) whereNonregular f is a locally integrable function in G. distributions will be called singular distributions or singular =t
n --+ oo
,
G
generalized functions.
In accordance with this definition the 8-function of Example 9 is a singular generalized function. The action of a generalized function (distribution) F on a test function cp, that is, the pairing of F and cp will be denoted, as before, by either of the equivalent expressions F(cp) or ( F, cp) . Before passing to the technicalformachinery connected withthatgeneralized functions, which was our motive defining them, we note the con cept of a generalized function, like the majority of mathematical concepts, had aofcertain periodof ofmathematicians. gestation, during which it developed implicitly in the workPhysicists, a number Dirac,1930smadeandactive use ofwiththe singular 8-functiongeneralized as early asfunctions the latewithout 1920sfollowing and early operated worrying about the absence of the necessary mathematical theory. Thelaididea ofmathematical a generalizedfoundations function wasof thestatedtheory explicitly by S. L . Sobolev4, who the of generalized functions intionsthewas mid-1930s. The current state of the machinery of the theory of 5 What has just been said distribu largely the work of L. Schwartz explainsto why, for example, the space V' of generalized functions is often referred as the Sobolev-Schwartz space of generalized functions. 4 S L . Sobolev (1908-1989) - one of the most prominent Soviet mathematicians. 5 L . Schwartz ( 1915-2002) - well-known French mathematician. He was awarded .
the Fields medal, a prize for young mathematicians, at the International Congress of Mathematicians in 1950 for the abovementioned work.
17.4 Convolution and Generalized Functions
465
We shall nowTheexplain certain elements of theofmachinery ofthisthemachin theory oferydistributions. development and extension the use of even today, mainlythein equations connectionofwith the requirements the theorycontinues of differential equations, mathematical physics,offunc tionalTo analysis, andnotation their applications. simplify the weproperties, shall consider below onlyfrom generalized functions inandV',proofs, although all of their as will be seen their definitions remain validof for distributions of any class V' (G), where G is an arbitrary open subset Operations withvalidistributions arefunctions, defined that by starting with thegeneralized integral relations that are d for classical is, for regular functions. b. Multiplication of a Distribution by a Function If f is a locally integrable function on lR and g E C ( oo ) , then for any function rp E Cboo ) , oo ) onequality the one hand grp E Cb and, on the other hand, we have the obvious J U · g)(x) rp (x) dx = j f(x)(g · rp)(x) dx or, in other notation (f · g, rp) = (f,g · rp) . Thibasiss relation, which nisg vali d for regular generalized functions, provides the for the followi definition of the distribution F · g obtained by multiplying the distribution F E V' by the function g E C ( 00 l : (17. 6 1) (F · g , rp) := (F, g · cp) . right-handFsideonof any Eq. function (17. 6 1) is defined, and thus defines the value ofitselftheTheisfunctional E V, that is, the functional F · g · g rp defined. Example Let us see how the distribution · g acts, where g E C ( oo ) . In accordance with the definition (17. 6 1) and the definition of we obtain ( o . g, rp ) : = ( o, g . cp) : = ( g . rp )( o ) = g ( o ) . rp ( o ) . c. Differentiation of Generalized Functions If f E C ( l ) and rp E Cb oo ) , integration by parts yields the equality J f'(x) rp (x) dx = - J f(x) rp'(x) dx . (17. 6 2) Thisofequality is the point of departure for the following fundamental def inition differentiation of a generalized function F E V': (17. 6 3) ( F', cp) := - ( F, cp') . R
�
�
11.
0
�
�
0,
466
17 Integrals Depending on a Parameter
Example 1 2. If f E C ( l ) , the derivative of f in the classical sense equals its
derivative inwith the distribution sense ( provided, naturally, the classical function isfollows identified the regular generalized function corresponding totheit ) .right This from a comparison of relations (17. 6 2) and (17. 6 3), in which hand sides are equal if the distribution F is generated by the function f. Example Take the Heaviside6 function { 0 for 0 , H( ) 1 for 0 , sometimes called theH unit step. Regarding it as a generalized function, let us find the derivative ' of this function, which is discontinuous in the classical sense.From the definition of the regular generalized function H corresponding to the Heaviside function and relation (17. 6 3) we find + oo + oo H( )tp'( ) d (H', 'P) -(H, tp') I tp'( ) d tp(O) , I 0 - oo 00 00 since H' = 'P8. E c6 Thus ( H', 'P) ( 8, 'P), for every function 'P E c6 Hence Example Let us compute ( 8', tp) : - (8, tp1 ) -tp'(O) . ( 8', 'P) It is natural that in the theory of generalized functions, as in the the ory of classical functions, the higher-order derivatives are defined by setting p ( n+ l ) ( F ( nJ ) ' . Comparing the results of the last two examples, one can consequently write (H", tp) -tp'(O) . Example Let us show that ( 8 ( n l , tp) (-1) n 'P ( n ) (O). Proof. For 0 this is the definition of the 8-function. We seen init Example 14 thatassuming this equality for established 1. for We have now prove by induction, that itholds has been a fixed value E N. Using definition (17. 63), we find ( 8 ( n + l ) , 'P) (( 8 ( n ) )', 'P) ( 8 ( n ) , tp') n n nl nl 1 3.
x
x
=
x
:= -
:=
).
14.
x
x
<
�
x
= -
x
x
=
).
=
:=
=
:=
=
1 5.
n = n
6
:=
:=
_
=
=
n =
=
-(- l) ('P') ( ) (O) (-1) + 'P ( + l (O) . =
0
0 . Heaviside ( 185Q-1925) - British physicist and engineer, who developed on the symbolic level the important mathematical machinery known as the operational calculus.
17.4 Convolution and Generalized Functions
467
Example Suppose the function f lR C is continuously differentiable for x function 0 and forexistx >at0,0.andWe suppose thequantity one-sidedf(limits f(f(-0) and f(saltus +0) ofor the denote the +0) the -0), jump ofofthef infunction at 0, by If(O), and bythef'distribution and { !'} respectively the derivative the distribution sense and defined by the function equal to isthenotusualdefined, derivative of fisfornotx important 0 and x for> 0.theAtintegral x=0 this last function but that through which it1 defines thethat regular distribution { !'}. In Example we noted if f E C ( l ) , then f' = { ! '}. We shall show that in general this is not the case, but rather the following important formula holds: (17. 64) f' = { !'} + If(O) · 8 . Proof. Indeed, + oo (!', IP) = - (!, tp' ) = - J f(x) tp'(x) dx = -+oooo = - ( j + j ) (f(x) tp'(x)) dx = - ( U · tp (x) l �= - oo - oo + oo - J f'(x) tp (x) dx + (f · tp)(x) l � 00 - J f'(x) tp (x) dx) = - oo + oo = (f(+O) - f(-O)) tp (O) + j j'(x) tp (x)dx = - 00 = (If(O) · 8, tp) + ( { !'}, tp) If all xderivatives up> 0,to and ordertheym are of thecontinuous functionandf have lR C exist on the iatntervals 0 and x x = 0, then, repeating the reasoning used to derive (17. 64),one-sided we obtainlimits j ( m ) = {j( m )} + Ij(O) . 8 ( m - l ) + Ij' (0) . 8 ( m - 2 ) + . . . · · · +It< m - I l (o) · 8 . (17. 6 5) We now exhibit some properties of the operation of differentiation of gen eralized functions. Proposition 6. ) Every generalized function F E is infinitely differen tiable. b) The differentiation operation D : is linear. <
:
1 6.
--+
<
0
0
0
0
:
<
a
D' --+ D'
D'
--+
. D
17 Integrals Depending on a Parameter
468 c
)
holds:
If F E V' and g E c ( = l , then (F · g) E V', and the Leibniz formula ( F( k ) . g (m - k ) . (F . g) ( m ) =
� �)
d) The differentiation operation D V' V' is continuous. e) If the series I:00= fk (x) S(x) formed from locally integrable functions k l uniformly on each compact subset of IR, then it can fk IR C converges be differentiated termwise any number of times in the sense of generalized functions, and the series so obtained will converge in V'. Proof. a) (F ( m l , cp) - (F ( m - l ) , cp') ( - l) m (F, cp ( m ) ). =
--+
:
:
:=
b) Obvious. c) Let us verify the formula for :=
:=
--+
:=
m =
1: =
=
( (F · g)', cp) -(Fg, cp') - (F, g · cp') -(F, (g · cp)' - g ' · cp) (F', gcp) + (F, g' · cp) (F' · g, cp) + (F · g', cp) (F' · g + F · g', cp) . =
=
=
In the general case we can obtain the formula by induction. Fm F in V' as V (Fd) , Let cp) (F, cp) as oo. Then oo, that is, for every function cp E m
--+
--+
m --+ :=
m --+
--+
=:
-(Fm , cp') -(F, cp') (F', cp) . e) Under these conditions the sum S(x) of them series, being the :uniform limit of locally integrable functions Sm (x) kI:= l fk (x) on compact sets, is locally integrable. It remains to observe that for every function cp E V (that (F:,, cp)
=
is, of compact support and infinitely differentiable) we have the relation (Sm , cp) J Sm (x)cp(x) dx --+ J S(x)cp(x) dx (S, cp) . Weoo.nowDconclude on the basis of what was proved in d) that s:, S' as Wemostseeimportant that the operation of differentiation of generalizedwhile functions retainsa the properties of classical differentiation acquiring of remarkable new properties that opencase up abecause great deal of presence freedom ofofnumber operation, which di d not exist i n the classical of the nondifferentiable functions the instability (lack of continuity) of classical differentiation under there limitingandprocesses. =
m --+
IR
IR
=
--+
17.4 Convolution and Generalized Functions
469
d. Fundamental Solutions and Convolution We began this subsection with intuitive7 ideas of the unit pulse and themechanical system function of thenaturally device. Ingenerates Example we exhibited an elementary system that a linear operator preserving time shifts. Studying it, we arrived at Eq.We(17.58), which thethissystem functionbyEreturning of that operator mustto satisfy. shall conclude subsection once again these ques tiscription ons, butinnowthe with the goal of illustrating an adequate mathematical de language of generalized functions. We begin by making sense of Eq. (17.58). On its right-hand side is the gen eralized function 8, so that relation (17.58) should be interpreted as equality oferalized generalized functions. Sinceoperations we know theon operations of differentiating gen functions and linear distributions, it follows that the left-hand side of Eq. (17.58) is now also comprehensible, even if interpreted in theLetsense of generalized functions. us now attempt to solve Eq. (17.58). At times t 0 the system was in a state of rest. At t 0 the point received at >unit0 there pulse,arethereby acquiring aacting velocityon thev =system, v(O) such that mv 1. For no external forces and its law of mostion x x( t ) is subject to the usual differential equation <
=
=
=
(17.66) which are to be solved with the initial conditions x(O) = 0, ±(0) = v = 1/m. mx + kx = 0 ,
Such a solution is unique and can be written out immediately: 1 . [£ , t 0 . x( t ) = v. rr=: m km sm -t thatSince in the present case theH system is at rest for t 0, we can conclude , tE�, (17.67) E(t) v� km sin y{kt -;;;, whereLetHusisnowtheverify, Heaviside function (seeforExample 13). using the rules differentiating generalizedE tfunctions and the results of the examples studied above, that the function ( ) defined by Eq. (17.67) satisfies Eq. (17.58). To simplify the writing we shall verify that the function (x) H ( x ) sinwx (17.68) w satisfies (in the sense of distribution theory) the equation d22 + w2 ) = 8 . (17.69) ( dx �
<
=
e
=
e
4 70
1 7 Integrals Depending on a Parameter
Indeed, (d2 + w) e - d2 (H -sinwx ) + w2 (H -sinwx ) dx2 sin wwx w dx2 • = H -+ w + 2H coswx - wH (. x) smwx + wH(x) sinwx = 8,sinwx -w + 215 coswx . Further, for every function
_
1
11
1) ,
,
-
1
:
1 7.
m
0
--+
17.4
Convolution and Generalized Functions
471
17.4.5 Problems and Exercises
a) Verify that convolution is associative: u * ( v * w ) = ( u * v ) * w. b) Suppose, as always, that r(a) is the Euler gamma function and H(x) is the Heaviside function. We set 1.
Hf (x )
a- 1
:= H (x ) ; (a) e >. x , where a > 0 , and .X E C .
Show that Hf * Hf = Hf + f3 . c ) Verify that the function F = H(x) (':,':._�� 1 e >. x is the nth convolution power of f = H (x )e >. , that is, F = f * f * · · · * f. '--v--"
"'
n
The function Gu (x) = " � e- 2.;2" , a > 0, defines the probability density function for the Gaussian normal distribution. a) Draw the graph of Gu (x) for different values of the parameter a. b) Verify that the mathematical expectation (mean value) of a random variable ,with the probability distribution Gu is zero, that is, J xGu (x) dx = 0. .,2
2.
lR
c) Verify that the standard deviation of x (the square root of the variance of x)
is a , that is
( [ x2 Gu (x) dx)
1
1
2
= a.
d) It is proved in probability theory that the probability density of the sum of two independent random variables is the convolution of the densities of the individual variables. Verify that G a * G {3 = G a2 {32 • J + e) Show that the sum of n independent identically distributed random variables (for example, n independent measurements of the same object) , all distributed according to the normal law Gu , is distributed according to the law G u .,;n · From this it follows in particular that the expected order of errors for the average of n such measurements when taken as the value of the measured quantity, equals a / y'ri, where a is the probable error of an individual measurement.
We recall that the function A(x) = L an x n is called the generating function of n =O the sequence ao , a 1 , . . . . Suppose given two sequences {a k } and {bk } · If we assume that a k = b k = 0 for k < 0, then the convolution of the sequences {a k } and {b k } can be naturally defined as the sequence C k = � a m bk- m . Show that the generating function of the convolution of two sequences equals the product of the generating functions of these sequences. 00
3.
{
}
4. a) Verify that if the convolution u * v is defined and one of the functions u and v is periodic with period T, then u * v is also a function of period T.
472
17 Integrals Depending on a Parameter
b) Prove the Weierstrass theorem on approximation of a continuous periodic function by a trigonometric polynomial (see Remark 5) . c) Prove the strengthened versions of the Weierstrass approximation theorem given in Example 4.
5. a) Suppose the interior of the compact set K C lR contains the closure E of the set E in Proposition 5. Show that in that case I f(y)L1 k (x - y) dy � f ( x ) on E. K
b) From the expansion ( 1 - z) - 1 = 1 + z + z 2 + · · · deduce that g(p, O) 1+pe16 2 ( 1 - pe'(j ) = .!2 + pei9 + p2 ei 2 9 + . . . for 0 -< p < 1 . c) Verify that if 0 � p < 1 and Pp(O) : = Re g(p , 0) = 21 + p cos O + p2 cos 20 + , ·
then the function
·
·
Pp(O) has the form
1 - p2 (0) = ! 2 1 - 2p cos 0 + p2 and is called the Poisson kernel for the disk. d) Show that the family of functions Pp(O) depending on the parameter p has . 2 71" - e 2 1r the following set of properties: Pp(O) ;::: 0, � I Pp(O) dO = 1 , I Pp(O) dO � 0 as 0 •>0 p � 1 - 0. e) Prove that if f E C[O, 211'] , then the function 2 71' u(p, 0) = Pp(O - t )f(t ) dt P.P
;I 0
is a harmonic function in the disk p < 1 and u(p, 0) � f(O) as p � 1 - 0. Thus, the Poisson kernel makes it possible to construct a function harmonic in the disk having prescribed values on the boundary circle. f) For locally integrable functions u and v that are periodic with the same period T, one can give an unambiguous definition of the convolution (convolution over a period) as follows: (u � v ) (x) : =
a+ T
I u (y)v(x - y) dy . a
The periodic functions on lR can be intepreted as functions defined on the circle, so that this operation can naturally be regarded as the convolution of two functions defined on a circle. Show that if f(O) is a locally integrable 211'-periodic function on lR (or, what is the same, f is a function on a circle) , and the family Pp(O) of functions depending on the parameter p has the properties of the Poisson kernel enumerated in d) , then (f 2*,.. Pp ) (0) � f(O) as p � 1 - 0 at each point of continuity of f.
17.4 Convolution and Generalized Functions
c,1 L 1 )
473
6. a) Suppose cp(x) := a exp for l xl < 1 and cp(x) : = 0 for l x l 2 1 . Let the constant a be chosen so that J cp(x) dx = 1. Verify that the family of functions lR
Cx 1L 1 )
for l x l < 1 and cp(x) : = 0 for l x l 2 1 . Show that a) Let cp(x) := exp J f(x)cpe (x) dx --+ 0 as c --+ +0 for every function f that is locally integrable on R, 7.
IR
where cpe (x) = cp ( � ) . b) Taking account of the preceding result and the fact that (8, cpe ) = cp(O) =f. 0, prove that the generalized function 8 is not regular. c) Show that there exists a sequence of regular generalized functions (even corresponding to functions of class C� 00 ) ) that converges in D' to the generalized function 8. (In fact every generalized function is the limit of regular generalized functions corresponding to functions in D = C� 00 ) ) In this sense the regular gener alized functions form an everywhere dense set in D' , just as the rational numbers Q are everywhere dense in the real numbers R.) .
a) Compute the value (F, cp) of the generalized function F E D' on the function cp E D if F = sin x8; F = 2 cos x8; F = (1 + x 2 )8. b) Verify that the operation F >-+ 'tj;F of multiplication by the function 1/J E c < oo l is a continuous operation in D' . c) Verify that linear operations on generalized functions are continuous in D' .
8.
474
17 Integrals Depending on a Parameter
that if F is the regular distribution generated by the function f (x) = { x0a) fforShow x -< 0 t hen F' = H , where H t he d 1stnbutwn correspond mg to t he ' ' ' or x > 0 ,
9.
· IS
'
·
Heaviside function. b) Compute the derivative of the distribution corresponding to the function lxl .
a) Verify that the following limiting passages in D' are correct: ax = 1rXu� ; lim � = ln x . a = 1r8 ,· 1 1m . lim l l <> -++O a 2 + x 2 a -++ 0 x 2 + a 2 a -++O x + a b) Show that if f = f ( x) is a locally integrable function on lP1. and fe = f ( x + c) , then fe --+ f i n D' as c --+ 0. c) Prove that if { Ll a } is an approximate identity consisting of smooth functions X as a --+ 0, then Fa = I Ll a (t) dt --+ H as a --+ 0, where H is the generalized oo function corresponding to the Heaviside function. 1 0.
-
1 1 . a) The symbol 8(x - a) usually denotes the 8-function shifted to the point a, that is, the generalized function acting on a function c.p E D according to the rule (8(x - a) , c.p) = c.p(a) . Show that the series 2:= 8(x - k) converges in D' .
kEZ
b) Find the derivative of the function [x] (the integer part of x) . c) A 21r-periodic function on lP1. is defined in the interval ]O, 21r] by the formula f i J 0 , 2 11" ] (x) = ! - 2x1l" . Show that f' = - 2� + 2:= 8(x - 21rk) . kEZ
d) Verify that 8(x - c) --+ 8(x) as c --+ 0. e) As before, denoting the 8-function shifted to the point c by 8(x - c), show by direct computation that � ( 8(x - c) - 8(x) ) --+ -8' (x) = -8'. f) Starting from the preceding limiting passage, interpret -8' as the distribution of charges corresponding to a dipole with eletric moment + 1 located at the point x = 0. Verify that (-8' , 1) = 0 (the total charge of a dipole is zero) and that (-8' , x) = 1 (its moment is indeed 1 ) . g ) A n important property o f the 8-function i s its homogeneity: 8(>-.x) = r 1 8(x) . Prove this equality. a) For the generalized function F defined as (F, c.p) = I y'xc.p(x) dx, verify the 0 following equalities: += c.p(x) . (F ' <.p ) = dx , , 2 v'x 0 += c.p(x) - c.p(O) dx ,. (F" , c.p) 4 x3/ 2 0 3 + = c.p(x) - c.p(O) - xc.p' (O) dx ,. (F "' , c.p) 8 x5 / 2 0 00
12.
�I -� I I
17.4 Convolution and Generalized Functions
()
( F n , 'P )
= X
( - 1 ) n - 1 (2n - 3) ! ! X 2n 'P (x) - 'fJ(O) - X'f! 1 (0) -
/+ oo 0
X
· ·
�
·
475
� 'P ( n - 2 ) ( 0) dX .
-
2
b ) Show that if n - 1 < p < n and the generalized function x:;:P is defined by the relation + oo 'P ( X ) - 'fJ(O) - X'f!' (0) - . . . - (�"_-22) ' 'P ( n - 2 ) (0) (x:;:P , 'fJ ) := dx ,
/
.
xP
0
Then its derivative is the function -px _:;: < P + 1 ) defined by the relation 1 + (p + )
( -px 13.
' tn) .,-
= -p
/+ oo 'P (x) - 'fJ (O) - X'f!' (0)xP-+ 1. . . - (�"_-11') 'P ( n 1 ) (0) dx . -
.
0
The generalized function defined by the equality
is denoted P � . Show that
\P� , 'P) +r
=
X
y-+
I
+
X
X
lY
14. Some difficulties may arise with the definition of the multiplication of gener alized functions: for example, the function lxl - 2/ 3 is absolutely ( improperly ) inte grable on lR; it generates a corresponding generalized function += J l x i 2 ! 3 'P (x) dx, oo but its square lxl - 4 / 3 is no longer an integrable function, even in the improper sense. The answers to the following questions show that it is theoretically impossi ble to define a natural associative and commutative operation of multiplication for any generalized functions. a) Show that f (x) o = f (O) o for every function f E c < = l . b ) Verify that xP � = 1 in D' . c ) If the operation of multiplication were extended to to all pairs of generalized functions, it would at least not be associative and commutative. Otherwise, -
1
1
1
(
1
0 = OP - = (xo(x))P- = (o(x)x)P - = o(x) xP X X X X
)
=
o (x) 1
= lO (x) = o .
476
17 Integrals Depending on a Parameter
1 5 . a) Show that a fundamental solution E for the linear operator A : D' ---+ D' is in general ambiguously defined, up to any solution of the homogeneous equation Af = 0 . b) Consider the differential operator d : = dn d n- 1 x, (x) a + ! dx n- 1 + . . . + a n (x) . dx n dx p
( )
Show that if uo = ua (x) is a solution of the equation P ( x , d: ) uo = 0 that satisfies the initial conditions ua (O) = · · · = u6n -2 ) (0) = 0 and u6n - I ) (O) = 1, then the function E(x) = H(x)ua (x) (where H ( X ) is the Heaviside function) is a fundamental solution for the operator P ( x, ddx ) . c) Use this method to find the fundamental solutions for the following operators:
mEN. d) Using these reulsts and the convolution, find solutions of the equations �:,;: d J , ( d x + a ) = J, where f E C(ffi., ffi.) .
m
=
1 7. 5 Multiple Integrals Depending on a Parameter
In the first two subsections of the present section we shall exhibit proper ties oftotalproper andof these improper multipleis integrals depending on a ofparameter. The result subsections that the basic properties multiple integrals depending on a parameter do not differ essentially from the corre sponding properties of one-dimensional integrals depending on a parameter studied above. Insingularity the thirditself subsection weonshalla parameter, study the case of isanimportant improper integral whose depends which in applications. Finally, in the fourthandsubsection we shall study the convolu tion of functions of several variables some specifically multi-dimensional on generalized closely connected integrals depending onquestions a parameter and the functions classical integral formulas ofwithanalysis. 1 7 . 5 . 1 Proper Multiple Integrals Depending on a Parameter
Let X be a measurable subset of JR.n , for example, a bounded domain with smooth or piecewise-smooth boundary, and let Y be a subset of JR.n . Consider the following integral depending on a parameter: (17. 72) F ( y) = J f(x, y) dx , where function is assumed on X fortheeach fixedfvalue of y E toY .be defined on the set X x Y and integrable X
17.5 Multiple Integrals Depending on a Parameter
The following propositions hold. Proposition
1.
then F E C(Y) .
477
If X x Y is a compact subset of JRn + m and f E C(X x Y),
2.
If Y is a domain in JRm , f E C(X x Y), and *!r E C(X x Y), then ithe function F is differentiable with respect to yi in Y, where y (y l , . . . , y , . . . , ym ) and oF of 0 . (y) = J 0 . (x, y) dx . (17.73) y' Y' Proposition
=
X
If X and Y are measurable compact subsets of JRn and JRm respectively, while f E C(X x Y), then F E C(Y) R(Y), and J F(y) dy := j dy j f(x, y) dx j dx j f(x, y) dy . (17.74)
Proposition 3.
Y
Y
=
X
C
X
Y
Wespace note that the values of the function f here may lie in any normed vector Z. The most important special cases occur when Z is JR, C, JR n , n . In these cases the verification of Propositions 1-3 obviously reduce to ortheecase ofverbatim their proofrepetitions for Z Rof theButproof for Zof thelR thecorresponding proofs of Propositions 1 and 2 are propositions for a one-dimensional 17.1), and(Sect.Proposition corollary of Propositionintegral 1 and (see FubiniSect.' s theorem 11. 4 ). 3 is a simple =
17.5.2
=
Improper Multiple Integrals Depending on a Parameter
If theis understood set X JRn asor the function fimproper ( x, y) in the integral ( 17. 72) is unbounded, itexhaustion the limit of integrals over sets of a suitable of X. In studying multiple improper integrals depending on a parameter, as a rule, oneone-dimensional is interested incase.particular exhaustions liwith ke those that we studied in the In complete accord the one-dimensional case, we remove the .: : neighborhood of the singularities/ integrals overintegrals the remaining parts Xc: of X and then find the limit offindtheIfthethis values of the over Xc: as -+ +0. passageintegral is uniform with respect to the parameter y E Y, we say thatlimiting the improper ( 17. 72 ) converges uniformly on Y. Example The integral F().. ) JJ e - .\ ( x 2 + y 2 ) dx d y C
c:
1.
=
JR2
7 That is, the points in every neighborhood of which the function f is unbounded. If the set X is also unbounded, we remove a neighborhood of infinity from it.
4 78
1 7 Integrals Depending on a Parameter
results from the limiting passage JJR2J e- A (x2 +y2 ) dxdy := e!i�o 2 +Jy2J� 1 je2 e- A(x2 +Y2 ) dxdy x and, as one canit converges easily verifyuniformly using polar coordinates, it converges for A > 0. Furthermore, on the set EAo = { A E IR J A 2: Ao > 0} , since for A E EAo > 0<
and last integral tends to 0 as e -+ 0 (the original integral F(A) converges at A this = A o > 0) . Example Suppose, as always, that B(a, r) { x E !Rn J ix - ai < r} is the ball of radius r with center at a E !Rn , and let y E !Rn . Consider the integral Yl dx := lim J l x - y l dx . F(y) = J (1l x -xi) e-HO (1 - l x l )<> l O< B ( O, l ) B ( O, l - e ) Passing to polar coordinates i n !Rn , we verify that this integral converges only for < to1. theIf theparameter value y
2.
a
a
C
1 7 . 5 . 3 Improper Integrals with a Variable Singularity
Example 3. As is known, the potential of a unit charge located at the point x E JR3 is expressed by the formula U(x, y) = l x �y l , where y is a variable point of JR3 . If the charge is now distributed in a bounded region X JR3 C
479
17.5 Multiple Integrals Depending on a Parameter
with a distributed bounded density J.Lway ( x ) ( equal to zero outside X), the potential of a charge in this can be written (by virtue of the additivity of potential) as U (y ) I U ( x, y) J.L ( x ) dx I �;x2 �� . (17.75) X of the parameter in this last integral is played by the variable pointThey role E JR3 . If the point y lies in the exterior of the set X, the integral (17. 75) is a proper integral; but if y E X, then l x - Yl 0 as X x y, and y becomes a singularity of the integral. As y varies, this singularity thus moves. Since U (y) c--t+O lim Uc (y) , where I l xJ.L (-x)y l dx ' =
=
R3
-+
3
-+
=
X\B ( y , c )
it is naturalconverges to consider, as before, thatset theY ifintegral (17.75) with a variable singularity uniformly on the U ) U ( y ) on Y as c +0. We have assumed that IJ.L (x) l :::; M E lR on X,c (Yand therefore 2 I XnBI(y ,c) J.Ll x(x-) dxYl l -< MB (Iy ,c) � l x - Yl 2rrMc . Thi s estimate shows that y E JR 3 , that I U ( y ) - Uc ( y ) I 271" Mc 2 for every 3 converges uniformly on theU set isY continuous JR . is, theIn particular, integral (17.75) with re if we verify that the function c (Y) spect to y, we wi l l then be able to deduce from general considerations that the potential U ( y ) is continuous. But the continuity of Uc ( Y ) does not follow formally froma parameter, Propositionsince1 onin thethe continuity of thean improper integral de pending on present case domain of integration X \ B ( y, c) changes when y changes. For that reason, we need to examine the question of the that continuity We remark for IY -ofYUco I (:::;y )c,more closely. J.L (x) dx + J.L (x ) dx · Uc ( Y ) I I x y l X\B ,c nB o, c l x - y l X\B ( y0 , 2 c ) l ( ( y )) ( y 2 ) The first of these two integrals is continuous with respect to y assuming IY - Yo I < c, being a proper integral with a fixed domain of integration. The absolute value of the second does not exceed Mdx 8rrMc2 . x-y I B ( yo, 2 c ) 1 -1 ::4
-+
=
:::;
=
=
=
480
17 Integrals Depending on a Parameter
Hence the inequality IUo: ( Y) - Uo: ( Yo) l c + 1 6 Mc2 holds for all values of y sufficiently close to y0 , which establishes that Uo:(Y) is continuous at the pointThusYo EweJR.3.have shown that the potential U(y) is a continuous function in the whole space JR.3. These examples provide the basis for adopting the following definition. Definition 1. Suppose the integral ( 1 7.72) is an improper integral that con of singularities the set X obtained verges forfrom each Xy theE Y.c-neighborhood by Let X" be theof portion removing the set of of the inte gral, 8 and let Fo: (Y) = J f(x , y ) dx . We shall say that the integral (17.72) converges uniformly on the set Y if Fo:(Y) F (y) on Y as +0. Theandfollowing useful proposition an immediate inition considerations similar toisthose illustratedconsequence in Exampleof3. this def Proposition 4 . If the function f in the integral (17. 72) admits the estimate l f (x , y) l :::; l x �l " , where M E JR. , x E X JR.n , y E Y JR.n , and n, then the integral converges uniformly on Y . Example 4 . In particular, we conclude on the basis of Proposition 4 that the integral ( y ) = I fJ (x )( x i - y i ) dX ' Ix - y13 obtained by formal differentiation of the potential ( 1 7. 75) with respect to the variable yi (i = 1 , 2, 3) converges uniformly on Y = JR.3 , since I J.L(��(x�;;/l I :::; l x -yAsl 2 . in Example 3, it follows from this that the function V;(y) is continuous on JR.3.Let us now verify that the function U(y) - the potential ( 17.75) - really g� and that g� (y) = V;(y). doesTohave a partial derivativesuffices do this it obviously to verify that <
7r
x.
::::+
C
t r. v,
c --+
C
X
M
Ia V; (yl , y2 , y3) dyi = U(yl , y2 , y3 ) � �' =a . b
8
See the footnote on page 477.
a <
17.5 Multiple Integrals Depending on a Parameter
481
But in fact, b b i f p, (x)(xi - yi ) i ! \I;(Y) dY = j dY I x - y 1 3 dx = ba( 1 ) i b (xi - yi ) i = J p, (x) dx J l x - Yl 3 dy = j p, (x) dx j [)yi l x - Yl dy = = ( / p,l x(x)- dxY l ) l b = U(y ) j b . The only nontrivial pointin order in thistocomputation is theofreversal of the order ofit suffices integration. In general, reverse the order improper integrals, multiple This integral that converges absolutely with respect tothetheinterchange wholeto sethaveisofajustified. variables. condition holds in the present case, so that Of course, it could also be justified directly due to the simplicity of the function involved. Thus,inweJRhave shown that the potential U(y) generated by a charge dis 3 with a bounded density is continuously differentiable in the tributed entireThespace. and reasoning used in inExamples 3 and 4 enable us to studyLetthetechnique::; following more general situation a very similar way. F(y) = j K(y - cp(x)) 7J; (x, y) dx , ( 1 7. 76) wheretheX domain is a bounded measurable domaincp Xin -+!Rn !Rm , the parameter y ranges over is a smooth mapping Y !Rm , with n satisfying rankcp'(x) = n, and 0, that is, cp defines an n cp '(x) l 2: l dimensional parametrized surface, or, more precisely, an n-path in !Rm . Here C(!Rm \ 0, IR), that is, the function K(z) is continuous everywhere in K!RmEexcept at continuous z = 0, near which it may be unbounded; and 7j; X x Y -+ lR isintegral a bounded function. We shall assume that for each y E Y the ( 1 7. 76) (which in general is an improper integral) exists. In the integral ( 1 7. 75) that we considered above, in particular, we had n = cp(x) = x , 7/J (x, y) = p, (x) , K(z) = l z l - 1 . It is not difficult to verify that underof thetheseintegral restrictions on the function convergence ( 1 7. 76) means that for cp,everyDefinition0 one1 ofcanuniform choose E 0 such that a
t
a
X
a
X
a
X
y' =a
y' = a ·
X
X
C
::;;
m,
:
c
>
:
m ,
o:
>
>
482
17 Integrals Depending on a Parameter
I J K (y IY (x) I <E -
(x) ) 'I/J(x , y) dx
i
< a ,
( 17.77)
whereThethefollowi integral is taken overholdthe forset9the{xintegral E X I IY - (x) l E } . ng propositions ( 1 7. 76) . Proposition 5. If the integral ( 1 7. 76) converges uniformly on Y under the hypotheses described above on the functions '1/!, and K, then F E C(Y,IR). Proposition 6 . If it is known in addition that the function '1/J in the integral ( 1 7. 76) is independent of the parameter y (that is, '1/!(x, y) = '1/!(x)) and K E C ( 1 )(JRm \ 0, IR), then if the integral ! 8K IJ i ( y -
<
X
y
converges uniformly on the set y E Y, one can say that the function F has a continuous partial derivative g:, , and
aF j aK IJyi (y) = IJyi ( y - (x) ) 'l/!(x) dx .
X
( 1 7. 78)
Thein proofs of these propositions, as stated, are completely analogous to thoseWe Examples 3 and 4, and so we shall not take the time to give them . noteexhaustion) only that implies the convergence of convergence. an improper Inintegral (under3 andan arbitrary its absolute Examples 4 the hypothesis of absolute convergence was used in the estimates and in reversing an illustration possible uses Propositionsthe 5order and 6,ofletintegration. us considerAsanother example offromthepotential theory.of Suppose a chargev(x).is distributed onofa such smooth compact surface SExample IR3 with surface density The potential a charge distribution isintegral called a single-layer potential and is obviously represented by the surface (x) da(x) c
5.
U(y) =
J
v
( 17. 79) lx - yl . Suppose is a bounded function. Then for y 'f. S this integral is proper, andButthe iffunction U(y) is infinitely differentiable outside S . y E S, the integral has an integrable singularity at the point y. v
s
The singularity is integrable because the surface S is smooth and differs by 9
Here we are assuming that the set X itself is bounded in IE. n . Otherwise one must supplement inequality ( 17. 77) with the analogous inequality in which the integral is taken over the set {x E X l lxl > 1/e:}.
17.5 Multiple Integrals Depending on a Parameter
483
little from aofpiece of1 /thea isplane �2 nearin the point ify E S; 2.andUsing we know that a singularity type integrable the plane Proposition r 5, we can turn this general consideration into a formal proof. If we represent S locally in a neighborhood Vy of the point y E S in the form x
lit
=
C
=
=
Vy
V,
=
17.5.4 *Convolution, the Fundamental Solution, and Generalized Functions in the Multidimensional Case
�n Definition 2. The convolution u * v of real- or complex-valued functions u and v defined in �n is defined by the relation ( 1 7.80) (u * v) (x) J u(y)v(x - y) dy . Example Comparing formulas (17. 75 ) and ( 1 7.80) , we can conclude, for example, that the potential U of a charge distributed in �3 with density J.l(x) is the convolution (J.l * E) of the function and the potential E of a unitRelation charge located at the origin of �3. ( 1 7.80) is a direct generalization of the definition of convolution given infor17.4.theForcasethatn reason, all the properties of the convolution considered in�n 17.4 1 and their proofs remain valid if � is replaced by .An approximate identity in �n is defined j ust as in � with � replaced by �n The and U(O) understood to becontinuity a neighborhood of the point 0 E � n in � n . concept of uniform of a function f G C on a set E G, and with it the basic Proposition 5 of Sect. 17.4 on convergence of the convolution dimensional case.f * L1a to f, also carry over in all its details to the multi a. Convolution in
:=
6.
JFt n
J.l
=
c
:
---+
484
17 Integrals Depending on a Parameter
We note only that in Example 3 and in the proof of Corollary 1 of Sect. must minor be replaced in the definition of the functions . Only changesbyarel x l needed in the approximate identity.dngi(x)venandin Exampleof4periodic of Sect. functions 1 7.4 for the proof of the Weierstrass theorem on approxi mation by trignometric polynomials. In this case it is a question of approximating a function f (x1 , . . . , xn ) that is continuous and pe riodicThewithassertion periodsamounts T1 , T2 , . . . , Tn respectively in the variables x 1 , x 2 , . . . , xn . to the statement thatwiforth theeveryrespective > 0 one can exhibit a trigonometric polynomial in n vari a bles periods T1 , T2 , . . . , Tn that approximates f on lRn within We confine ourselves to these remarks. An independent verification of the properties ( 1 7.80) for n E N, which were proved for the case n = 1 ofintheSect.convolution 1 7.4, will be an easy but useful exercise for the reader, helping to promote an adequate understanding of what was said in Sect. 17.4. b. Generalized Functions of Several Variables We now take up cer tain multi-dimensional aspects ofintheSect.concepts connected with generalized functions, which were introduced 1 7.4. As before, let cC 00 l (G) and c6oo \G) denote respectively the sets of in finitely differentiable in thesupport domaininGG. IfJRnG and the set of infinitely differentiable functionsfunctions of compact = JRn , we shall use the respective abbreviations C ( oo ) and c6oo l . Let := ( , n ) be a multi index and ( {) ) m1 . . . . . ( {) ) mn 'P . 'P ( m ) ·. {)xn - [)xl In c6oo l (G) we introduce convergence ooof functions. As in Definition 7 of Sect. we consider that 'Pk c.p in c6 l (G) as k oo if the supports of all the1 7.4, functions of the sequence { 'Pk } are contained in one compact subset m m ( ) offunctions G and c.p� ) c.p on G for every multi-index as k oo, that is, the uniformly, and songdodefinition. all of their partial derivatives. Given converge this, we adopt the followi Definition 3. The vector space c6oo l (G) with this convergence is denoted V( G) ( and simply V if G = JRn ) and is called the space of fundamental or test functions. Continuous linear functionals on V( G) are called generalized functions or distributions. They form the vector space of generalized functions, denoted V'(G) ( or V' when G = lRn ) . Convergence in V' (G), as in the one-dimensional case, is defined as weak ( pointwise ) convergence of functionals ( see Definition 6 of Sect. 17.4) . The definition of a regular generalized function carries over verbatim to the multi-dimensional case. 17.4 x c.p (x)
E
E.
m
::::l
-+
C
m1 , . . .
m
-+
m
-+
17.5 Multiple Integrals Depending on a Parameter
485
The (denoted definitiono (ofxo),theoro-function andbutthenoto-function shiftedo (tox-xthe))point xremain more often, always happily, 0 also 0 E G the same. Now let us consider some examples. Example Set lxl2 ' 1 L1t (x) := (2a..firi)n e 4;;"2t where > 0, t > 0, x E JR.n . We shall show that these functions, regarded as regular distributions in JR.n , converge tothatthetheo-function on functions JR.n as t --+ + 0. For the proof it suffices to verify fami l y of L1t is an approxi m ate identity in JR.n as t --+ + 0. Usingand a change of variable, reduction of theintegral, multipleweintegral integral, the value of the Euler-Poisson find to an iterated 7.
a
Next, for any fixed value of > 0 we have J L1t (x) dx = ( �)n J e - 1 � 1 2 d.; --+ 1 , r ) ( ' 2 aVt as tFinally, --+ +0. taking account of the fact that L1 (x) is nonnegative, we conclude t that these functions indeed constitute an approximate identity in JR.n . generalization of theino-function (corresponding, for example, to aOsExample unit(corresponding chargeAlocated at the origin JR.n ) is the following generalized function to a distribution of charge over a piecewise-smooth surface S with a distribution of unit surface density). The effect of Os on the function cp E V is defined by the relation ( os, cp) := j cp(x) da . s Like the distribution o, the distribution Os is not a regular generalized function. of a distribution as inMultiplication the one-dimensional case. by a function in V is defined in JR.n just Example If E V, then J-lOs is a generalized function acting according to the rule ( 1 7.81) ( J-LOs, cp) = J cp(x)J-L(x) da . s r
B(O,r)
8.
9.
J-1
B o
486
1 7 Integrals Depending on a Parameter
If the function J-t(x) were defined only on the surface S, Eq. ( 1 7.81 ) could beanalogy, regarded as the definition of theintroduced generalizedin this function Js.calledBy anatural the generalized function way is single layer on the surface S with density J-t. Differentiation ofprinciple generalizedas infunctions in the multi-dimensional casea fewis defined by the same the one-dimensional case, but has peculiarities. If F E V' (G) and G !Rn , the generalized function g;: is defined by the relation J-t
C
It follows that ( 17.82) where m ( m 1 , . . . , mk ) is a multi-index and l m l Ln m; . is naturalof theto verify the relation �:f:x �:bx But that follows from the Itequality right-hand sides in the relations =
=
i
8
=
8
i= l
'.
which follows from the classical equality : lf : 't , which holds for every function cp E Example Now consider an operator D L where m = are numer ( m 1 , . . . , m n ) is a multi-index, D ( � ) ·. . .· ( � ) coefficients,operator. and the sum extends over a finite set of multi-indices. This isicalaThe differential transpose or adjoint of D is the operator usually denoted t D or D* and defined by the relation 1 0.
2 8 , xi
D.
m
=
8 1
m,
=
( DF, cp) ( F, t Dcp) , cp E V F E V'.
which musttheholdexplicit for allformulaand now write
rn
=:
2 EJ i x '
m
am D m ,
EJ n
mn
, am
Starting from Eq. ( 1 7.82) , we can
for the adjoint of the differential operator D.
17.5 Multiple Integrals Depending on a Parameter
487
In particular, if all the values of l m l are even, the operator D is self It is clearof dithatfferentiation the operation of differentiation in V' (JR.n ) preserves all the properties in V' (JR.) . However, let us consider the following important example, which is specific to the multi-dimensional case. Let S be a smooth (n - 1 )-dimensional submanifold of JR.n , that is,isExample S is a smooth hypersurface. Assume that the function f defined on JR.n \ S infinitely differentiable and that all its partial derivatives have a limit at every point x E S under one-sided approach to x from either (local) side of S. the The surfacedifference between these two atlimits will bex corresponding the jump J {Nrto ofa par the partial derivative under consideration the point ticular direction of passageis across the The surfacejumpS atcanx. thus The besignregarded of the jump changes if that direction reversed. as a function defined on an oriented surface if, for example, we make the con vention that the direction of passage is given by an orienting normal to the surface. Theby function {Nr is defined, continuous, and locally bounded outside S, and the assumptions just made f is locally ultimately bounded upon S itself. Since S is a submanifold of JR.n , no mat approach to the surface ter how wediscontinuities complete theondefinition of {Nr on S, we obtain a function with possible only S, and hence locally integrable in JR.n . But integrable functions without that diffworrying er on a about set of themeasure zero have equal inte S, we may assume grals, and therefore, values on that {Nr generates some regular generalized function { {Nr } according to the rule adjoint, that is, t D = D. 11.
We shallimportant now showformula that iffholdsis regarded as aofgeneralized function, then the following in the sense differentiation of generalized functions: 8f 8f ( 1 7.83) 8xi = { 8xi } + (If) s cos O:i bs whereofthethelastfunction term isf understood in the sense of Eq. ( 1 7.81 ) , (If ) s is the jump at x E S correpsonding to either of the two possible unit normal directions onto the xiof-axisthe (that is, = (costoo:S1 , at. . . x,, cosandak )cos) . o:i is the projection of Proof. Formula ( 1 7.83) generalizes Eq. ( 1 7.64) , which we use to derive it. '
n
n
n
488
17 Integrals Depending on a Parameter
For definiteness we consider the case i = 1 . Then
J :;I dx + J · · · J (Jf) cp dx2 . . . dxn . Here thethrough jump Iftheofsurface f is taken at thepointpointin thex direction (xi , x2 , , xn) E S as one-axis. passesThe at that of(I f)the positi ve value Hence, of the function in computing the product is taken XatI the same point. this last integral can be written as a surface integral of first kind j (Jf) cp cos dO" , where isStheat angle between the direction of the positive XI-axis and the normal to x , direct so that in passing through x in the direction of that normal0. Ittheremains functiononlyf hasto remark preciselythattheifjump I f. This means only that cos we choose thecosineotherwould direction I for the normal, the sign of the jump and the sign of the reverse simultaneously; hence the product (If) cos I does not change. bothD Remark As can be seen from this proof, formula ( 17.83) holds once the jump (I f) of f is defined at each point x E S, and a locally integrable partial derivative jb exists outside of S in !Rn , perhaps as an improper integral generating a regular generalized function { jb } . Remark 2. At points x E S at which the direction of the x i -axis is not transversal totheS, jump that is,f itin isthetangent to S, difficulties maybeariseseeninfrom the definition of given direction. But i t can I ( 17.83) that its last term is obtained from the integral =
JR n
=
s
ai
a
:z:2 • • . :z: n
.
.
•
ai
�
1.
a
8
! ! ···
:z:2 . , . :z: n
(If) cp dx 2 · · · dx n .
The projections of the set E on x2 , . . . ,x n -hyperplane has (n - I) dimensional measure zero andthetherefore has no effect on the value of the integral. Hence we can regard form ( 17.83) as having meaning and being valid always if (If) cos ai is given the value 0 when cos ai 0. 8
=
17.5 Multiple Integrals Depending on a Parameter
489
Remark Similar considerations make it possible to neglect sets of area zero; therefore one can regard formula ( 17.83) as proved for piecewise-smooth surfaces. As aourcannextbe example wedirectly shall show howthe thedifferential classical relation Gauss-Ostrogradskii formul obtained from ( 1 7.83) , and ininformed a form thethatreader is maximally free of the extra analytic requirements that we of previously. Example in JR.n field bounded bycontinuous a piecewise-smooth surface S. LetLetA =G (Abe1 a, . finite ben a vector that is in G and . . , An)domain . such that the function divA = L ��: is defined in G and integrable on G, possibly inregard the improper sense. If we the field A as Szeroof theoutside G, then the jump of this field at each poi n t x of the boundary domain G when leaving G is -A(x). Assuming that n is a unit outward normal vector to S, applying formula ( 17.83) to each component A i of the field A and summing these equalities, we arrive at the relation ( 17.84) divA = {divA} - (A · n) 6s , inpointwhichx EA·S.n is the inner product of the vectors A and n at the corresponding Relation ( 17.84) is equality of generalized functions. Let us apply it to the 00 l function E c6beenequal to 1 onmoreG (the existence and construction ofevery such afunction functionr.p1/J Ehas discussed than once previously). Since for V ( 17.85 ) (divA, r.p) = - J (A · 'Vr.p) dx (which follows immediately fromAtheanddefinition of the1/Jderivative of a have gen eralized function), for the field the function we obviously (divA, 'lj;) = 0. But, when we take account of Eq. ( 17.84 ) this gives the relation 0 = ( {div A},'lj; ) - ( (A · n)6s,1/J ) , which in classical notation 0 = J divA dx - J (A · n) dcr ( 17.86 ) is the same as the Gauss-Ostrogradskii formula. now consider several important examples connected with differenta tionLetof usgeneralized functions. 3.
1 2.
i= l
]R n
G
S
490
17 Integrals Depending on a Parameter
We consider3 the vector field A = fxP- defined in JR3 \ 0 and show that in the space V' (JR ) of generalized functions we have the equality div � �3 = 4m5 . ( 1 7.87) We remark first that for x =1- 0 we have div fxP- = 0 in the classical sense. Now, the definition of divAdivifxPn the= form ( 1 7.85) , the definition using of ansuccessively improper integral, the equality 0 for x =1- 0, the Gauss-Ostrogradskii port, we obtain formula ( 1 7.86) , and the fact that tp has compact sup Example
1 3.
\ div � �3 , tp) = - I ( 1 �3 · \7tp(x)) dx = JR3 = lim - I ( 1 � 3 · \7tp(x)) dx = tp ( x3 ) ) d = = lim d" c I V I lxl c: --+ + 0
c: < l x l < l /c:
X
c: --+ +0
For the operator A : V' (G) V' ( g ) , as before, we define a fundamental solution to be a generalized function E V' (G) for which A (E) = c5. Example We verify that the regular generalized function E(x)2 = - 2j 3 in V' (JR2 ) is a fundamental solution of the Laplacian Ll = ( � ) + ( � ) + ( a�3 ) · Indeed, Ll = div grad, and gradE(x) = f for x =1- 0, and therefore the equality divExample gradE 1= , c5onefollows from relation ( 1 7.87) . As i n can verify that for any n E N, n 2, we have the 3 following relation in JRn : ---+
14.
c
8 1
8 2
4 1r x l
4 7r x l 3
:2':
( 1 7.87' )
where an = f(r�:�) is the area of the unit sphere in JR" . Hence we can conclude upon taking account of the relation Ll = div grad that Llln l X I = 27rc5 in JR2 and 1 Ll1 1 n- 2 = - (n - 2)an c5 in JRn , n > 2 . X
491
17.5 Multiple Integrals Depending on a Parameter
Let us verify that the function H;, e - s; , E(x, t) = (2a nt) n where x E JRn , t E JR, and H is the Heaviside function (that is, we set E(x, t) = 0 when t 0) satisfies the equation ( :t - a2 L1 ) E = J . Here .1 is the Laplacian with respect to x in JRn , and J = J ( x, t) is the iS-function in JR� x lRt = JRn+l . When t > 0, we have E E C ( oo ) (JR n+l ) and by direct differentiation we verify that ( :t - a2 L1) E = O when t > O . Takifunction ng this factt.p EintoV(JRaccount along with the result of Example 7, we obtain n+l for any Example
1 5.
<
)
+oo
J0 dt J E(x, t) ( �� + a2 .1t.p) dx = +oo = - �� J dt J E(x, t) ( �� + a 2 .1t.p ) dx = = -
� ,i 0 V
�
�
2'.".'
l�
o
c:
JRn
o
Tdt1 ( ';;:
R (x, £ )1"(x, D) dx +
R (x, £ ) I" ( X , 0) dx +
) d]
- hiE l" x
1 R (x, £) (l" (x, £) - l" (x, D)) dx]
�
lim+0 J E(x, ) t.p (x, 0) dx = t.p(O, 0) = (J, t.p) . Example Let us show that the function 1 E(x, t) = H ( at lxl ) , 2a where equationa > 0, x E JR�, t E JR�[), 2and H [)is2 the Heaviside function, satisfies the ( fJt 2 - a2 fJx2 ) E = J ' in which J = J(x, t) is the iS-function in the space V'(JR� x JRD = V'(JR2 ). =
1 6.
c: -+
JRn
-
E
-
492
17 Integrals Depending on a Parameter
Let 'P E V(�2 ). Using the abbreviation Da := .g;, - a2 l;r , we find ( D a E ,
�
-
a
-
u
1 7.
1 8.
u.
17.5 Multiple Integrals Depending on a Parameter
493
offunction the equation �� - .du = f, for example, under the assumption that the f is continuous and bounded, which guarantees the existence of the convolution *E. We note that these assumptions are made only for example, and are far ffrom obligatory. Thus, from the point of view of generalized functions one could pose the question of the solution of the equation �� .du = f taking as f(x, t) the generalized function cp(x) · o(t), where cp E V(JRn ) andThe o E V'(JR). formal substitution of such a function f under the integral sign leads to the relation � ) ed� . u(x, t) = J [2acp(.Jii; t] n Applying thethatrulethisfor function differentiating an integral depending on��a parameter one can verify is a solution of the equation a.du = 0 for t > 0. We note that u(x, t) cp(x) as t +0. This follows from the result of Example 7, where it was established that the family of functions encountered here is an approximate identity. Example recalling the fundamental solution of the Laplace oper ator obtainedFinally, in Example 14, we find the solution u (x ) = J !(� ) d� ! x -� ! 2 4a
rr
JR n
-+
-+
t
-
1 9.
JRn
lx - ��
ofis thethe same Poissonas theequation .du = - 47r f, which up to notation and relabeling potential ( 1 7.75) for a charge distributed with density J , whiIfchthewe function consideredf isearlier. takenintoas v(x)os, whereleads S is toa piecewise smooth surface in JR3 , formal substitution the integral the function u (x ) = v(l�x) -da(� � ) ' � s
J
as wedistributed know, is aoversingle-layer the potential ofwhich, a charge the surfacepotential; S JR3 more with precisely, surface density v(x). C
17.5.5 Problems and Exercises 1. a) Reasoning as in Example 3, where the continuity of the three-dimensional potential ( 17.75) was established, show that the single-layer potential ( 17.79) is continuous. b) Verify the full proof of Propositions 4 and 5.
494
17 Integrals Depending on a Parameter
2. a ) Show that for every set M C ll�n and every E > 0 one can construct a function f of class c (ool (R n , R) satisfying the following three conditions simultaneously: 'Vx E R n (0 :::; f(x) :::; 1 ) ; 'Vx E M (f(x) = 1 ) ; supp f C M, , where M, is the e:-blowup ( that is, the €-neighborhood ) of the set M. b ) Prove that for every closed set M in R n there exists a nonnegative function f E C (oo) (R n , R ) such that ( !(x) = 0) {o} (x E M). 3 . a ) Solve Problems 6 and 7 of Sect. 17.4 in the context of a space R n of arbitrary dimension. b ) Show that the generalized function 8s ( single layer ) is not regular.
4. Using convolution, prove the following versions of the Weierstrass approximation theorem. a) Any continuous function f : I -+ R on a compact n-dimensional interval I C R n can be uniformly approximated by an algebraic polynomial in n variables. b ) The preceding assertion remains valid even if I is replaced by an arbitrary compact set K C R and we assume that f E C(K, C) . c ) For every open set G C R n and every function f E C ( rn l ( G, R ) there exists a sequence { Pk } of algebraic polynomials in n variables such that P� a) :::; f ( a) on each compact set K C G as k -+ oo for every multi-index a = (at , . . . , a n ) such that l a l S m. d ) If G is a bounded open subset of R n and f E c (oo) (G, R ) , there exists a sequence {Pk } of algebraic polynomials in n variables such that P� a ) :::; f ( a) for every a = ( a t , . . . , a n ) as k -+ oo . e ) Every periodic function f E C ( R n , R ) with periods Tt , T2 , . . . , Tn in the variables x 1 , . . . , x n , can be uniformly approximated in R n by trigonometric poly nomials in n variables having the same periods Tt , T2 , . . . , Tn in the corresponding variables. 5. This problem contains further information on the averaging action of convolu tion. a) Previously we obtained the integral Minkowski inequality
(I
l a(x) + b(x) I P dx
) 1/p ( I <
X
l a i P (x) dx
) 1/p ( I +
X
IW (x) dx
) 1/p
for p 2: 1 on the basis of this numerical Minkowski inequality. The integral inequality in turn enables us to predict the following generalized integral Minkowski inequality:
(I I I
p ) 1/p ( l :::; I I
f(x, y) dy dx
I J I P (x, y) dx
) 1/p
dy .
Prove this inequality, assuming that p 2: 1 , that X and Y are measurable subsets example, intervals in R"' and R n respectively ) , and that the right-hand side of the inequality is finite. ( for
17.5 Multiple Integrals Depending on a Parameter
495
b) By applying the generalized Minkowski inequality to the convolution f * g, show that the relation ll f * g ii P :'::: ll f ll 1 · II9II P holds for p 2': 1 , where, as always, 1 1P dx) . lluii P =
(J,: l u i P (x)
c�oo ) (JRn , JR) with 0 ::::: <.p(x) ::::: 1 on lRn and I <.p(x) dx 1 . Assume that <.p,(x) := � 'P ( � ) and f, := f 'P• for E > 0. Show that if f E Rp(JR n ) (that is, if the integral I IJI P (x) dx exists) , then f, E c ( oo ) (JR n , JR) and ll f, ll p ::::: IIJII P " c) Let 'P E
=
JRn
*
JRn
We note that the function f, is often called the average of the function f with kernel 'P• · d) Preserving the preceding notation, verify that the relation
ll f, - f lip,[ :'::: sup• li Th ! - f lip,[ , Ihi<
(
)1 P
holds on every interval I C lR n , where llull p,I = [ i uiP(x) dx / and Th j (x) = f(x - h). e) Show that if f E Rp(JR n ), then II Th f - f lip, I -+ 0 as h -+ 0. f) Prove that ll f, II P :'::: IIJII P and ll f, - f li P -+ 0 as E -+ +0 for every function f E Rp(JR n ), p 2': 1 . g ) Let Rp(G) b e the normed vector space o f functions that are absolutely inte grable on the open set G C lR n with the norm I llp, G · Show that the functions of class C ( oo ) (G) n Rp(G) form an everywhere-dense subset of Rp(G) and the same is true for the set C� oo ) (G) n Rp(G). h) The following proposition can be compared with the case p = oo in the preceding problem: Every continuous function on G can be uniformly approximated on G by functions of class c ( oo ) (G). i) If f is a T-periodic locally absolutely integrable function on JR, then, setting 1 a T IJI P (x) dx) / P , we shall denote the vector space with this norm by ll f ll p .T = R� . Prove that ll f, - f ll p, T -+ 0 as E -+ +0 . j) Using the fact that the convolution of two functions, one of which is periodic, is itself periodic, show that the smooth periodic functions of class c(oo ) are everywhere dense in R� .
(l
6. a) Preserving the notation of Example 1 1 and using formula (17.83) , verify that if f E C ( l ) (JE.n \ S) , then fJj 82 ! = 82 ! f) i fJx fJxJ fJxi fJxJ + fJxJ f)s cos aiOs) + fJx i ) 8 cos Oj Os .
{
}
(J
( (J
(J
(J �) (
b) Show that the sum £: fxlr) s cos ai equals the jump s of the normal ,= 1 derivative of the function f at the corresponding point x E S, this jump being independent of the direction of the normal and equal to the sum §t + I£; ) (x) of the normal derivatives of f at the point x from the two sides of the surface S.
17 Integrals Depending on a Parameter
496
c) Verify the relation
(
) (
(Jf)
5 is where :n is the normal derivative, that is, :n F,
f
under the assumption that G is a finite domain in JRn bounded by a piecewise smooth surface S; and
f
f
7. a) The function 1 � 1 is the potential of the electric field intensity A created in JR3 by a unit charge located at the origin. We also know that
div
( �) -
�
3
=
47r8 ,
div
( �f3 ) -
�
=
47rq8 ,
div grad
C!l )
=
-�
4m5 .
Starting from this, explain why it was necessary to assume that the function U(x) = J ���:�� must satisfy the equation L1 U = -47rp,. Verify that it does indeed JR3 satisfy the Poisson equation written here. b) A physical corollary of the Gauss-Ostrogradskii formula, known in electro magnetic field theory as Gauss ' theorem, is that the flux across a closed surface S of the intensity of the electric field created by charges distributed in JR3 equals Q/co (see p. 279, where Q is the total charge in the region bounded by the surface S. Prove this theorem of Gauss. 8. Verify the following equalities, understood in the sense of the theory of general ized functions. a) L1E = 8, if for x E JR2 , 2� In E(x) - 2 7r nr(/ 2 nn)- 2) -( n - 2) for X E lRn , n > 2 .
{
(
lxl lxl
17.5 Multiple Integrals Depending on a Parameter
497
x e-ik l x b ) ( Ll + k 2 ) E = 8, .1f E(x ) = - ei4 rrk ll x ll or 1f. E(x ) = - 47iTXT"l and x E JR.3 . 2 2 c) D a E = 8, where D a = � a 2 [ ( -/;;r ) + + ( a �n ) ] , and E = H ( at - l x l) H 2 4 7r a t 8s at = 2 7r( t)a 8(a 2 t 2 - l x l 2 ) for x E IR.3 ' 2 tr a y'a 2 2 l l 2 for x E IR. or E = ..J!!!:d, -
·
t -x
·
·
-
t E R Here H( t ) is the Heaviside function, Sat = {x E IR.3 I I x l = at} is a sphere, and a > 0. d) Using the preceding results, present the solution of the equation Au = f for the corresponding differential operator A in the form of the convolution f * E and verify, for example, assuming the function f continuous, that the integrals depending on a parameter that you have obtained indeed satisfy the equation Au = f. Differentiation of an integral over a liquid volume. Space is filled with a moving substance (a liquid) . Let v = v( t, x) and p = p ( t, x) be respectively the velocity of displacement and the density of the substance at time t at the point x. We observe the motion of a portion of the substance filling the domain flo at the initial moment of time. a) Express the mass of the substance filling the domain flt obtained from flo at time t and write the law of conservation of mass. b) By differentiating the integral F(t) = I j( t , x ) dw with variable domain of 9.
n,
flt (the volume of liquid) , show that F' (t) = I %f dw + I f(v, ) da, where flt , an� , dw, da, v, ( ' ) are respectively the domain, its boundary, the integration
n,
n,
a n,
n
element of volume, the element of area, the unit outward normal, the flow velocity at time t at corresponding points, and the inner product . c) Show that F' ( t) in problem b) can be represented in the form F' ( t) = I ( %f + div ( Jv) ) dw.
n,
d) Comparing the results of problems a) , b) , and c) , obtain the equation of continuity � + div (pv) = 0. (In this connection, see also Subsect. 14.4.2.) e) Let l flt I be the volume of the domain flt . Show that diZr 1 = I div v dw . n,
f) Show that the velocity field v of the flow of an incompressible liquid is divergence-free ( div v = 0) and that this condition is the mathematical expression of the incompressibility (conservation of volume) of any portion of the evolving medium. g) The phase velocity field (p, q) of a Hamiltonian system of classical mechanics satisfies the Hamilton equations p = �� , q = �� , where H = H(p, q) is the Hamiltonian of the system. Following Liouville, show that a Hamiltonian flow pre serves the phase volume. Verify also that the Hamiltonian H (energy) is constant along the streamlines (trajectories) . -
1 8 Fourier Series and t he Fourier Transform
18. 1 Basic General Concepts Connected with Fourier Series
18. 1 . 1 Orthogonal Systems of Functions
Expansion of a vector in a vector space During this course of analysis wea.spaces haveinmentioned several times thatarithmetic certain classes of functions formexample, vector relation to the standard operations. Such, for are the basicreal-,classes of analysis, which consist of smooth, continuous, or integrable complex-, or vector-valued functions on a domain X JR. n . From the point of view of algebra the equality C
where j, h , . . . , fn are functions of the given class and O:i are coefficients from or C, simply means that the vector f is a linear combination of the vectors h , . . . , fn of the vector space under consideration. a rule, it ofis the necessary tions"In analysis, -- series ofasfunctions form to consider "infinite linear combina JR.
(18.1)
The definition of the sum of inthetheseriesvectorrequires that some topology (init particular, a metric) be defined space in question, making possiblen to judge whether the difference f - Sn tends to zero or not, where Sn = 2:.::: O: k fk · k=l The mainisdevice usedsomein classical analysis toor introduce a metric on a vector space to define norm of a vector inner product of vectors in that space.nowSect. 10. 1 was devoted to a discussion of these concepts. We are going toshall consider only ,spaces endowed with an inner product (which, as before, we denote ) ) . In such spaces one can speak of ( orthogonal orthogonal systemsEuclidean of vectors,spaceandfamiliar orthogonalfrombases, just asgeometry. in the casevectors, of three-dimensional analytic
500
18 Fourier Series and the Fourier Transform
Definition 1 . The vectors x and y in a vector space endowed with an inner product ( , ) are orthogonal ( with respect to that inner product ) if ( x, y ) = 0. Definition 2 . The system of vectors { x k ; k E K} is orthogonal if the vectors in it corresponding to different values of the index k are pairwise orthogonal. Definition 3 . The system of vectors {e k ; k E K} is orthonormalized ( or orthonormal) if ( e i , e J ) = bi,J for every pair of indices i, j E K, where bi ,J is 1 if i = j the Kronecker symbol, that is, bi,J = { 0 if i =1- j . Definition 4. A finite system of vectors x 1 , . . . , X n is linearly independent if the equality a 1 x 1 + et2 X2 + + Ctn Xn = 0 is possible only when et 1 = a2 = = Ct n = 0 ( in the first equality 0 is the zero vector and in the second it is the Anzeroarbitrary of the coefficient fieldvectors ). system of of a vector space is a system of linearly independent vectors if every finite subsystem of it is linearly independent. The main questionsystem thatofwilllinearly interestindependent us now is thevectors. question of expanding a vector in a given Having in mind later applications to spaces of functions (which may be infinite-dimensional as well ) we must reckon with the fact that such an ex pansion may, inenters particular, leadstudy to a series offundamental the type ( 18.1). That is precisely where analysis into the of the and essentially alge braicAsquestion we have posed. is known fromhaveanalytic geometry, expansions over in orthogonal andin orar thonormal systems many technical advantages expansions bitrary linearly independent systems. ( The coefficients of the expansion are easy to compute; i t is easy to compute the and innersoproduct of two vectors from theirIt coefficients in an orthonormal basis, on. ) is for systems. that reasonIn function that we shall bethese mainlywiinterested in expansions in or thonormal spaces l l be expansions in orthogonal systems of functions or Fourie? series, to the study of which this chapter is devoted. '
'
'
· · ·
· · ·
1 J.-B.J. Fourier ( 1768-1830) - French mathematician. His most important work Theorie analytique de la chaleur (1822) contained the heat equation derived by Fourier and the method of separation of variables (the Fourier method) of solving it (see p. 516) . The key element in the Fourier method is the expansion of a function in a trigonometric (Fourier) series. Many outstanding mathematicians later undertook the study of the possibility of such a representation. This, in particular, led to the creation of the theory of functions of a real variable and set theory, and it helped to promote the development of the very concept of a function.
18. 1 Basic General Concepts Connected with Fourier Series
501
Extending Example of Sect. 10.1, we introduce an inner product ( 18. 2 ) (!, g) := l (f · g)(x) dx on thelocally vectorsquare-integrable space 'R2 (X, C)(asconsisting of improper functions integrals). on the set X JR.n that are proper or Since I,f · 9unambiguously. 1 :S ! (1/1 2 + l g l 2 ) , the integral in ( 18. 2 ) converges and hence defines ) (!areg discussing real-valued functions, relation ( 18.2) in the real space If we n2 (x, JR.) reduces to the equality ( 18.3) (!, g) := l u · g)(x) dx . Relyi ng anon inner properties of thelistedintegral, one10.1can areeasilysatisfied verify inthatthisallcase, the axi o ms for product in Sect. provided zero. we identify two functions that differ onlytexton portion a set of ofn-dimensional measure Throughout the following, in the 2 ) and inner products of functions will be understood in the sense of Eqs.the( 18.section, b. Examples of Orthogonal Systems of Functions 12
X
C
X
( 18.3) .
We recall that for integers m and n if m # n , x x n . m i -i e e ( 18.4) dx = I 2 if m = n ; rr , r if m # n , I cosmx cosnxdx = r2rrrr , , ifif mm == nn #= OO ;, ( 18.5) ( 18.6) I cosmxsinnxdx = 0 ; . sin mx sin nx dx = ' if m # n , ( 18 . 7) I if m = n # O . rr , r E Z} is an orthogonal system of Thesein relations show'R that rr],{ eiCnx ; nrelative vectors the space to the inner product ( 18. 2 ) [-rr, ) 2( and the trigonometric system { 1 , cos nx, sin nx; n E N} is orthogonal in 'R2 ( [-rr, rr] , JR.) . If we regard the trigonometric system as a set of vectors incomplex 'R2 ([- rr, rr], C ) , that is, if we allow linear combinations of them to have coefficients, then by Euler ' s formulas einx = cos nx + i sin nx, Example
1.
7r
- ?r
7r
- ?r
7r
- ?r
- ?r
502
18 Fourier Series and the Fourier Transform
cos nx -- 2 ( ei n x + e -i nx ) ' sin nx - 2 i ( ei nx - e -inx ) ' we see that these two systems canequivalent. be expressedForlinearly in terms ofexponential each other,system that is,{ ethey inx ; nareE Z}algebraically that reason the issystem also called · the trigonometric system or more precisely the trigonometric in complex notation. Relationswhile (18.4)-(18. 7) show{ that these systems are orthogonal, but not normalized, the systems k einx ; n E Z} and { 1 1 cos nx, 1 sin nx; n E N} 2 7f 7f 7f are Iforthonormal. the closed interval [ of] vari is replaced bycananobtain arbitrary closed interval [-l, l] { JR, then by a change a ble one the analogous sys tems ei f nx ; n E Z} and { 1, cos fnx, sin fnx; n E N} , which are orthogonal in the spacessystems R2 ( [-l, l] , C ) and R2 ([-l, l] , JR) and also the corresponding orthonormal 1 7f 1 . 7f { 1 ei -" nx n E '7/ } and { 1 vrz V'ii V'ii ' cos T nx, vrz sm T nx; n E N } ' Example Let Ix be an interval in JRm and Iy an interval in JR n , and let {fi (x)} be an orthogonal system of functions in R.2 (Ix , lR) and {gj (y)} an orthogonal from Fubini ' s theorem, thesystem systemof offunctions functionsin R{u2i(Ij (x,y , JR).y) Then,/j (x)gas jfollows (y) is orthogonal in R2 (lx X fy , JR). Example We remark that for a f. (3 l J Sill. ax Sill. (3X dX = 21 ( sin(aa --(3(3)l sin(aa ++/3(3)l ) = _- cos al cos (3l · (3 tan aal2 -- /3a2tan (3l . Hence, if a and (3 are such that ta� = ta�f3l , the original integral equals zero. Consequently, if 6 6 · · · �n · · · is a sequence of roots of the equation tan �l c�, where c is an arbitrary constant, then the system of functions ( �n x) ; n E N} is orthogonal on the interval [0, l] . In particular, for c = 0, {wesinobtain the familiar system { sin ( fnx) ; n E N}. Example 4. Consider the equation ( d�2 + q(x) ) u(x) = Au(x) , where E C ( oo ) ( [a, b] , JR) and A is a numerical coefficient. Let us assume that 2 ) ( the functions u1 , u2 , . . . are of class C ( [a, b], JR) and vanish at the endpoints .l.
!
;;;; rrc , v v
;;;; v
- 1r , 1r
C
I
j
ILl
2.
3.
-
-
0
=
q
<
<
<
ad
<
18. 1 Basic General Concepts Connected with Fourier Series
503
ofwiththeparticular closed interval [a, b] and that each of them satisfies the given equation values A 1 , A2 , . . . of the coefficient A. We shall show that if Ai =J Aj , then the functions u i and Uj are orthogonal on [a, b]. Indeed, integrating by parts, we find that J [ ( d�2 + q (x ) ) ui (x )] uJ (x ) dx = J ui (x ) [ ( d�2 + q (x ) ) uj (x )] dx . According to the equation, we obtain from this the relation b
b
a
a
and, since Ai =J Aj , we now0conclude that[a,(ub]i , Uj= )[0,= 0.] we again find that the on [a, b] and In particular, if ( system { sin nx ; n EqN}x ) is orthogonal on [0, ] Further examples, including examples of orthogonal systemsat theof impor tance in mathematical physics, will be found in the problems end of this section. Orthogonalization It is well-known that in a finite-dimensional Eu clidean space,waystarting with a linearly independent system of vectors, there is aofcanonical of constructing an orthogonal and even orthonormal system 2 -Schmidt3 orthog vectors equi v alent to the given system, using the Gram onalization process. By thesystem same method one'lj;can, 'lj; obviously orthonormalize any linearly independent of vectors , . . . in any vector space 1 2 having an inner product. We recall, that the orthogonalization process leading to the orthonormal system
1r .
1r
c.
'1/J
-
nL1 ('1/Jn ,
�---------
--------
-
2 J.P. Gram (1850-1916) - Danish mathematician who continued the research of P.L. Chebyshev and exhibited the connection between orthogonal series expan sions and the problem of best least-squares approximation ( see Fourier series below ) . It was in these investigations that the orthogonalization process and the famous Gram matrix arose ( see p. 185 and the system ( 18. 18) on p. 510) . 3 E. Schmidt (1876-1959) - German mathematician who studied the geometry of Hilbert space in connection with integral equations and described it in the language of Euclidean geometry.
504
18 Fourier Series and the Fourier Transform
The process of orthogonalizing the linearly independent system leads to the system of orthogonal polyno mials known as the Legendre polynomials. We note that the name Legendre polynomials is often given not to the orthonormal system, but to a system ofcanpolynomials proportional to these polynomials. The proportionality factor be chosen from various considerations, for example, requiring the leading coefficient to be 1 orof the requiring theis polynomial to these have therequirements, value 1 at xbut= in1. The orthogonality system unaffected by general orthonormality is lost. the standard Legendre polynomials, which We have already encountered are defined by Rodrigues ' formula Example
5.
{ 1, x, x 2 , . . . , } in R2 ( [ - 1, 1], �)
= 1. Let us write out the first few Legendre For these polynomials polynomials, normalizedPnby(1)requiring the leading coefficient to be 1: p3 ( X ) = X - -3 X . Po(x) 1 , 5 The orthonormalized Legendre polynomials have the form -
=
3
P�n (x) = y� �-2- Pn (x) ,
whereOne can= 0,verify 1, 2, . . . . by direct computation that these polynomials are orthogo nal on the closed interval [-1, 1]. Taking Rodrigues ' formula as the definition ofalsthe{P polynomial Pn ( x), let us verify that the system of Legendre polynomi 1]. To do this, it suffices n (x)} is orthogonal on the closed intervaln[-1, toof degree verify that Pn (x) is orthogonal to 1, x, . . . , x - l , since all polynomials Pk k are linear combinations of these. Integrating by parts for k we indeed find that n
< n
< n,
Awillcertain picture of thelast origin of orthogonal systemsandofinfunctions in analat be given i n the subsection of this section the problems ysis the end ofconnected the section.wiAtth present we shallofreturn to thein terms fundamental general problems the expansion a vector of vectors of a given system of vectors in a vector space with inner product.
18. 1 Basic General Concepts Connected with Fourier Series
505
d. Continuity of the Inner Product and the Pythagorean Theorem
We shallsums have(series to work not only with finite sums of vectors but also with infinite ) . In this connection we note that the inner product is a function, enabling us oftoseries. extend the ordinary algebraic properties ofcontinuous theLetinner product to the case X be a vector space with an inner product ( , ) and the norm it induces 00 ll x ll := y'(x,X} ( see Sect. 10. 1 ) . Convergence of a series iI:= l X i = x of vectors X i E X to the vector x E X will be understood in the sense of convergence in this norm. Lemma 1. ( Continuity of the inner product ) . Let ( , ) : X ---* C be an inner product in the complex vector space X. Then 1--+
(x, y) (x, y ) is continuous jointly in the two variables; 00 b) if x = I: x i , then (x, y) = I: (x i , y); i= l i= l 00 ) if e 1 , e 2 , . . . , is an orthonormal system of vectors in X and x = I: x i ei i= l 00 00 i i and y = I: y e i , then (x, y) = I: x fi . i= l i= l a) the function 00
c
inequality (see Sect. Assertion a) follows from2 the Cauchy-Bunyakovskii l (x - xo, Y - Yo) l :::; ll x - xo ll 2 · IIY - Yo ll 2 · Assertion b) follows from a) , since
Proof. 10. 1 ) :
and i=I:00n+l Xi ---* 0 as n ---* oo. Assertion ) follows by repeated application of b) , taking account of the relation (x, y) = (y, x). D The following result is an immediate consequence of the lemma. Theorem 1. ( Pythagoras4 ) . c
a) If {x i } is a system of mutually orthogonal vectors and x =
ll x ll 2 = I:i ll xi ll 2 •
I:i xi, then
4 Pythagoras of Samos (conjectured to be 580-500 BCE) - famous ancient Greek
mathematician and idealist philosopher, founder of the Pythagorean school, which, in particular, made the discovery that the side and diagonal of a square are incommensurable, a discovery that disturbed the ancients. The classical Pythagorean theorem itself was known in a number of countries long before Pythagoras (possibly without proof, to be sure) .
506
18 Fourier Series and the Fourier Transform
b) If { ei } is an orthonormalized system of vectors and x = L:i xi e i , ll x ll 2 = L:i I xi 1 2
then
•
1 8 . 1 . 2 Fourier Coefficients and Fourier Series
a. Definition of the Fourier Coefficients and the Fourier Series
Let
orthonormal system( ,and) . { li } an orthogonal system of vectors in i } be Xan with a{ espace inner product Suppose that x = L:i xi zi . The coefficients xi in this expansion of the vector x can be found directly: X i = (l(x,i , llii )) . If li = e i , the expression becomes even simpler: We remark that the formulas for x i make sense and are completely deter mined if the vector x itself and the orthogonal system { l i } (or { e i } ) are given.i The to compute x equality x = L:i xi zi (or x = L:i xi e i ) is no longer needed from these formulas. Definition 5 . The numbers { (�·t� } are the Fourier coefficients of the vector x E X in the orthogonal system { l i }. If the system { e i } is orthonormal, the Fourier coefficients have the form {(x, e i )}. Fromx EtheX geometric point ofof view the ithinFourier coefficient (x, e i ) of the vector is the projection that vector the direction of the unit vector e i . In the familiar case of three-dimensional Euclidean ispace E3 with a given orthonormal frame eof1 , ethe2 , e3vector the Fourier coefficients x = (x, e i ), i = 1 , 2, 3, are the coordinates x in the basis e , 1 e 2 , e 3 appearing in the expansion x = x 1 e 1 + x 2 e 2 + x3 e 3 . If we were given only the two vectors e 1 and e 2 instead of allnotthreebe e 1 , e 2 , e 3 , the expansion x = x 1 e 1 +x 2 e 2 in this system would certainly valid for all vectors x E E3 . Nevertheless, the Fourier coefficients xi 2= (x, e i ), 1 i = 1 , 2, would be defined in this case and the vector Xe = x e 1 of+ xthee 2vectors would bee and the orthogonal projection of the vector x onto the plane L e 2 . Among all the vectors in that plane, the vector Xe is distinguished by1 E being closest to x in the sense that ll x - Yll llx - Xe ll for any vector L. This is the remarkable extremal property of the Fourier coefficients, to which we shall return below in the general situation. y
�
18. 1 Basic General Concepts Connected with Fourier Series
507
Ifis anX orthogonal is a vectorsystem spaceof with innervectors product , ) andfor ( then l , . . . , l , . . . nonzero in X , each2 vectorn x E X one can form the series (18. 8 ) X k�=l ((lx,k , llkk)) lk . This series is the Fourier series of x in the orthogonal system {lk }. IfIn thethe system {lank }orthonormal is finite, the system Fourier {series reduces toseries its finiteof asum. the Fourier vector case of e } k x E X has a particularly simple expression: 00 (18. 8 ') x rv L ( x, e k ) e k . k=l Example Let X = 'R. 2 ( [ -1r, 7r] , � ) . Consider the orthogonal system {1,coskx,sinkx; k E N} ofseriesExample 1. To the function f E 'R-2 ( [ - 1r, 1r], �) there corresponds a Fourier f rv ao;n + kf=l ak (f) cos kx + bk (f) sin kx in this system. The tocoefficient � is included in the zeroth term so as to give a unified appearance the following formulas, which follow from the definition of the Fourier coefficients: Definition 6.
h,
"'
�
-
6.
.,..
(18. 9 ) ak (f) = � � f(x) coskxdx , k = 0, 1, 2, . . . , k = 1, 2, . . . . (18.10) bk (f) = ;1 I f(x) smkxdx . = ( -1) k + l f , Let us set f(x) = x. Then a 0, k = 0, 1, 2, . . . , and b = k k k = 1, 2, . . .. Hence in this case we obtain 00 k+ l k2 sin kx . f(x) = X rv L( -1) k=l Example Let us consider the orthogonal system { e i k x ; k E Z} of Example 1 in5 and the space 'R.2 ( [ - 7r , 1r] , C). Let f E 'R.2 ( [ - 7r , 7r] , C ) . According to Definition relations (18. 4by), the Fourier coefficients {eikx } are expressed the following formula: {ck (f)} of f in the system kx (18.11) Ck (f) = 21�,. I f(x)e -i kx dx ( ((f(x),e kei x , eiikx )) ) .,..
- 71"
7.
.,..
- 71"
_
_
·
508
18 Fourier Series and the Fourier Transform
Comparing Eqs. ( 18.9 ) , ( 18. 10 ) , and ( 18. 1 1 ) and taking account of Euler ' s formula eicp = cos cp + i sin cp, we obtain the following relations between the Fourier coefficients of a given function in the trigonometric systems written in real and complex forms: if k 2: 0 , if k < O .
( 18. 12 )
In order that the case k = 0 not be an exception in formulas ( 18.9 ) and a 0 to denote not the Fourier coefficient itself, but rather its double, as was done above. ( 1 8 . 1 2 ) , it is customary to use
b. Basic General Properties of Fourier Coefficients and Series The following geometric observation is key in this section.
Lemma ( Orthogonal complement ) . Let {l k } be a finite or countable system of nonzero pairwise orthogonal vectors in X , and suppose the Fourier series of x E X in the system { l k } converges to X! E X . Then in the representation x = X! + h the vector h is orthogonal to x 1 ; moreover, h is orthogonal to the entire linear subspace generated by the system of vectors { l k } , and also to its closure in X.
Proof. Taking account of the properties of the inner product , we see that it suffices to verify that ( h , lm) = 0 for every lm E {l k } · We are given that
(x, l k ) l k · � h = X - X! = X - " (lk , lk ) k
-
Hence
Geometrically this lemma is transparent , and we have already essentially pointed it out when we considered a system of two orthogonal vectors in three-dimensional Euclidean space in Subsect . 18. 1 . 2a. On the basis of this lemma we can draw a number of important general conclusions on the properties of Fourier coefficients and Fourier series. Bessel's inequality
Taking account of the orthogonality of the vectors X! and h in the de composition x = X! + h , we find by the Pythagorean theorem that ll x ll 2 = ll xd l 2 + ll h ll 2 2: ll x d l 2 ( the hypotenuse is never smaller than the leg ) . This relation, written in terms of Fourier coefficients, is called Bessel ' s inequality.
18.1 Basic General Concepts Connected with Fourier Series
Let us write it out. By the Pythagorean theorem
509
(18. 13)
Hence
"'"""' l(x,(l llk )l) 2 - I I X 1 2 • (18. 14) k kk This is Bessel orthonormal system's inequality. of vectorsIt{ ehask }: a particularly simply appearance for an ( 18 . 15) L l(x, e kW l x l 2 . k In terms of the Fourier coefficients O:k themselves Bessel ' s inequality ( 18. 14) can be written as I: l o: k l 2 l l k l 2 :::; l x l 2 , which in the case of an orthonormal system reduces kto I:k l o: k l 2 :::; l x l 2 . We allowing have included thevectors absolutespaces valueX.sigInn this in thecaseFourier coefficient, since wemayare complex the Fourier coefficient assume complex values. Bessel ' s inequality we made use of the assump We note that in deriving tion that the vector x1 exists and that Eq. (18. 13) holds. But if the system { l k } is finite, there is no doubt that the vector Xt does exist (that is, that the Fouri er seriesof {lconverges in X). Hence inequality (18. 14) holds for every finite subsystem },k and then it must hold for the whole system as well. Example For the trigonometric system (see formulas (18.9) and ( 18. 10)) 's inequality has the form Bessel l ao � W + f l a k ( J W + l bk (f) l 2 � J l f l 2 (x) dx · ( 18. 16) k=l ikx ; k E Z} (see formula ( 1 8. 1 1 ) ) Bessel ' s inequality can {e For the system be written in a particularly elegant form: L...t
<
:S:
8.
:S:
71"
- 11"
(18. 1 7) Convergence of Fourier series in a complete space
Suppose I:k xk ek = l:k (x, ek ) ek is the Fourier series of the vector x E X ' inI: thexk 2orthonormal system {ek }. By Bessel theorems inequality (18.15) the series lk l converges. By the Pythagorean
18 Fourier Series and the Fourier Transform
510
By the Cauchy convergence criterion for a series, the right-hand side of this equality becomes less than any e > 0 for all sufficiently large values of m and n > m. Hence we then have
ll xm e m + · · + xn e n ll < ,fi · Consequently, the Fourier series E x k e k satisfies the hypotheses of the k Cauchy convergence criterion for series and therefore converges provided the original space X is complete in the metric induced by the norm ll x ll = y' (x, x). ·
To simplify the writing we have carried out the reasoning for a Fourier series in an orthonormal system. But everything can be repeated for a Fourier series in any orthogonal system. The extremal property of the Fourier coefficients We shall show that if the Fourier series E x k e k =
E ((ekx, ,eekk)) e k of the k k vector x E X in the orthonormal systsem { e k } converges to a vector x 1 E X, then the vector X! is precisely the one that gives the best approximation of x 00 among all vectors = E o: k e k of the space L spanned by {e k }, that is, for k= l every E L, ll x - x d l � ll x - Yll , and equality holds only for = x 1 • Indeed, by the orthogonal complement lemma and the Pythagorean the y
y
y
orem,
ll x - Yll 2 = ll (x - X!) + (x l - Y ) ll 2 = llh + (xl - Y ) ll 2 = = llhll 2 + ll x 1 - Yll 2 � ll hll 2 = ll x - x 1 ll 2 · Example 9. Digressing slightly from our main purpose, which is the study of expansions in orthogonal systems, let us assume that we have an arbitrary system of linearly independent vectors x1 , . • . , X n in X and are seeking the n best approximation of a given vector x E X by linear combinations E O: k X k k =l o f vectors o f the system. Since we can use the orthogonalization process to construct an orthonormal system e 1 , . . . , e n that generates the same space L that is generated by the vectors X 1 , . . . , X n , we can conclude from the extremal property of the Fourier coefficients that there exists a unique vector X ! E L such that ll x - x d l = inf ll x - Y ll · Since the vector h = x - X ! is orthogonal to yE L the space L, from the equality x 1 + h = x we obtain the system of equations � (x ) ( · · x,) , + . ( · .. .. .. ( 18. 18) . (x l , Xn )o: l + + (x n , Xn )O: n - (x, Xn )
{�
� :: � �� �:)�� · · ·
:�
·
.
18.1 Basic General Concepts Connected with Fourier Series
511
n O: of the unknown = l k Xk k system x 1 , . . • , X n· The existence
for the coefficients a: 1 , . . . , O: n of the expansion X !
=
I:
vector X! in terms of the vectors of the and uniqueness of the solution of this system follow from the existence and uniqueness of the vector x 1 • In particular, it follows from this by Cramer ' s theorem that the determinant of this system is nonzero. In other words, we have shown as a by-product that the Gram determinant of a system of linearly independent vectors is nonzero. This approximation problem and the system of equations ( 18. 18) corre sponding to it arise, as we have already noted, for example, in processing experimental data by Gauss ' least-squares method. (See also Problem 1 . )
c.
Complete Orthogonal Systems and Parseval's Equality
Definition 7. The system {x0; a: E A} of vectors of a normed space X is complete with respect to the set E C X (or complete in E) if every vector x E E can be approximated with arbitrary accuracy in the sense of the norm of X by finite linear combinations of vectors of the system.
If we denote by L {xa} the linear span in X of the vectors of the system (that is, the set of all finite linear combinations of vectors of the system) , Definition 7 can be restated as follows: The system {X a } is complete with respect to the set E C X if E is con tained in the closure L{ X a} of the linear span of the vectors of the system.
Example 1 0. If X = E3 and e� , e 2 , e 3 is a basis in E3 , then the system { e 1 , e 2 , e 2 } is complete in X . The system { e 1 , e 2 } is not complete in X , but it is complete relative to the set L{ e 1 , e 2 } or any subset E of it.
Example 1 1 . Let us regard the sequence of functions 1, x, x 2 , • • . as a system of vectors {x k ; k = 0, 1 , 2, . . . } in the space R2 ( [a, b] , JR.) or R2 ( [a, b] , C) . If C[a, b] is a subspace of the continuous functions, then this system is complete with respect to the set C[a, b]. Proof Indeed, for any function f E C[a, b] and for every number e > 0, the Weierstrass approximation theorem implies that there exists an algebraic polynomial P(x) such that max l f(x) - P(x) l < e. But then .
x E [a , b]
II ! - P ll :=
b
J I f - P l 2 (x) dx < evb - a a
and hence one can approximate the function r in the sense of the norm of the space R2 ( [a, bl ) with arbitrary accuracy. D
512
18 Fourier Series and the Fourier Transform
We note that, in contrast to the situation in Example 9, in the present case not every continuous function on the closed interval [a, b] is a finite linear combination of the functions of this system; rather, such a function can only be approximated by such linear combinations. Thus C[a, b] C L{ x n } in the sense of the norm of the space R2 [a, b].
Example 1 2. If we remove one function, for example the function 1 , from the system { 1 , cos kx, sin kx; k E N}, the remaining system { cos kx, sin kx; k E N} is no longer complete in R2 ( [-1r, 1r] , C ) or R2 ( [ - 1r, 1r], 1R.) . Proof. Indeed, by Lemma 3 the best approximation of the function f(x) among all the finite linear combinations
=
1
n Tn (x) = 2_)a k cos kx + bk sin kx ) k=l
of any length n is given by the trigonometric polynomial Tn (x)in which a k and b k are the Fourier coefficients of the function 1 with respect to the orthogonal system { cos kx, sin kx; k E N}. But by relations ( 18.5), such a polynomial of best approximation must be zero. Hence we always have
and it is impossible to approximate 1 more closely than /2ii by linear com binations of functions of this system. D Theorem ( Completeness conditions for an orthogonal system ) . Let X be a vector space with inner product ( , ) , and h , l 2 , . . . , ln , . . . a finite or count able system of nonzero pairwise orthogonal vectors in X . Then the following conditions are equivalent: a) the system { l k } is complete with respect to the set!' E c X; b) for every vector x E E c X the following (Fourier series) expansion holds: (x, l k ) k ; (18. 19) X=" L...., k (lk , lk )
c ) for every vector x E E C X Parseval 's6 equality holds:
(18. 20) 5 The set E may, in particular, consist of a single vector that is of interest for one reason or another. 6 M.A. Parseval (1755-1836) - French mathematician who discovered this relation for the trigonometric system in 1799.
18.1 Basic General Concepts Connected with Fourier Series
513
Equations ( 18. 19) and ( 18.20) have a particularly simple form in the case of an orthonormal system { e k } . In that case ( 18. 19' ) and
ll x ll = L l (x, e k ) l 2 · k
( 18.20' )
Thus the important Parseval equality (18.20) or ( 18.20') is the Pythagorean theorem written in terms of the Fourier coefficients. Let us now prove this theorem.
Proof. a ) :::} b) by virtue of the extremal property of Fourier coefficients; b) :::} c) by the Pythagorean theorem; c) :::} a ) since by the lemma on the orthogonal complement (see Subsub sect . b) above) the Pythagorean theorem implies
Remark. We note that Parseval ' s equality implies the following simple nec essary condition for completeness of an orthogonal system with respect to a set E C X: E does not contain a nonzero vector orthogonal to all the vectors in the system. As a useful supplement to this theorem and the remark just made, we prove the following general proposition. Proposition. Let X be a vector space with an inner· product and x 1 , x 2 , . . . a system of linearly independent vectors in X . In order for· the system { x k } to be complete in X , a ) a necessary condition is that there be n o nonzero vector i n X orthogonal to all the vectors in the system ; b) if X is a complete (Hilbert) space, it suffices that X contain no nonzero vector orthogonal to all the vectors in the system. Proof. a) If the vector h is orthogonal to all the vectors in the system {x k }, we conclude by the Pythagorean theorem that no linear combination of vectors in the system can differ from h by less than llhll · Hence, if the system is complete, then llhll = 0. b) By the orthogonalization process we can obtain an orthonormal system { e k } whose linear span L{ e k } is the same as the linear span L{ x k } of the original system. We now take an arbitrary vector x E X. Since the space X is complete, the Fourier series of x in the system { e k } converges to a vector Xe E X. By the
514
18 Fourier Series and the Fourier Transform
lemma on the orthogonal complement, the vector h = x - Xe is orthogonal to the space L{ek } = L{xk } . By hypothesis h = 0, so that x = Xe , and the Fourier series converges to the vector x itself. Thus the vector x can be approximated arbitrarily closely by finite linear combinations of vectors of the system { e k } and hence also by finite linear combinations of the vectors of the system {xk } . 0 The hypothesis of completeness in part b ) of this proposition is essential, as the following example shows.
Example 1 3. Consider the space l 2 ( see Sect. 10.1) of real sequences a = 00 1 2 (a , a , . . . ) for which j=I: (ai) 2 oo . We define the inner product of the l vectors a = (a 1 , a 2 , . . . ) and b = (b 1 , b 2 , . . . ) in l 2 in the standard way: (a, b) := 00 2:: aibi. j= l Now consider the orthonormal system e k (0, . . . , 0, 1, 0, 0, . . . ) k k = 1, 2, . . . . The vector e 0 = (1, 0, 0, . . . ) does not belong to this system. We now add to the system {e k ; k E N} the vector e = (1, 1/ 2 , 1/ 2 2 , 1/ 2 3 , . . . ) and consider the linear span L{ e, e 1 , e 2 , . . . , } of these vectors. We can regard this linear span as a vector space X ( a subspace of l 2 ) with the inner product from l 2 . We note that the vector e0 = (1, 0, 0, . . . ) obviously cannot be ob tained as a finite linear combination of vectors in the system e, e 1 , e 2 , . . . , and therefore it does not belong to X . At the same time, it can be ap proximated as closely as desired in l 2 by such linear combinations, since e - I:=n -dt e k = (1, 0, . . . , 0, 2 n\ 1 , 2 n\2 , · · · ) · k l Hence we have established simultaneously that X is not closed in l 2 ( and therefore X , in contrast to l 2 , is not a complete metric space ) and that the closure of X in l 2 coincides with l 2 , since the system e0, e 1 , e 2 , . . . generates the entire space l 2 . We now observe that in X = L{ e, e 1 , e 2 , . . . } there is no nonzero vector orthogonal to all the vectors e 1 , e 2 , . . . . n Indeed, let x E X , that is, x = ae + 2:: a k e k , and let (x, e k ) 0, k = k= 1, 2, . . . . Then (x, e n+l ) = 2 n� ' = 0, that is, al = 0. But then O k = (x, e k ) = 0, k = 1, . . . , n. Hence we have constructed the required example: the orthogonal system e 1 , e 2 , . . . is not complete in X , since it is not complete in the closure of X, which coincides with l 2 . This example is of course typically infinite-dimensional. Fig. 18.1 repre <
�
=
sents an attempt to illustrate what is going on.
18. 1 Basic General Concepts Connected with Fourier Series
515
Fig. 1 8 . 1 .
We note that in the infinite-dimensional case (which is so characteristic of analysis) the possibility of approximating a vector arbitrarily closely by linear combinations of vectors of a system and the possibility of expanding the vector in a series of vectors of the system are in general different properties of the system. A discussion of this problem and the concluding Example 14 will clarify the particular role of orthogonal systems and Fourier series for which these properties hold or do not hold simultaneously (as the theorem proved above shows) . Definition 8 . The system x 1 , x 2 , . . . , X n , . . . of vectors of a normed vector space X is a basis of X if every finite subsystem of it consists of linearly inde pendent vectors and every vector x E X can be represented as x = 2::: O: k X k l
k
where o: k are coefficients from the scalar field of X and the convergence (when the sum is infinite) is understood in the sense of the norm on X . How i s the completeness of a system of vectors related t o the property of being a basis? In a finite-dimensional space X completeness of a system of vectors in X, as follows from considerations of compactness and continuity, is obviously equivalent to being a basis in X . In the infinite-dimensional case that is in general not so.
Example 14. Consider the set C ( [ - 1, 1 ] , 1R ) of real-valued functions that are continuous on [ - 1, 1] as a vector space over the field lR with the standard inner product defined by (18.3). We denote this space by C2 ( [ - 1, 1 ] , 1R ) and consider the system of linearly independent vectors 1, x, x 2 , . . . in it . This system is complete in C2 ( [ - 1, 1], 1R ) (see Example 11), but is not a basis. ()()
2::: o: k x k converges in C2 ( [ - 1, 1] , 1R ) , k=O that is, in the mean-square sense on [ - 1, 1], then, regarded as a power series, it converges pointwise on the open interval ] - 1, 1 [. Proof. \Ve first show that if the series
516
18 Fourier Series and the Fourier Transform
Indeed, by the necessary condition for convergence of a series, we have
ll a k x k II --+ 0 as k --+ oo. But
1 2 k ll a k x ll = J ( a k x k ) 2 dx = a� 2k � 1 . -1 Hence l a k l J2k + 1 for all sufficiently large values of k. In that case 00 the power series I: a k x k definitely converges on the interval ] - 1 , 1 [. k =O We now denote the sum of this power series on ] - 1, 1 [ by We remark that on every closed interval [a, b] c] - 1, 1 [ the power series converges uni <
cp .
formly to cp I [a , b] " Consequently it also converges in the sense of mean-square deviation. It now follows that if a continuous function f is the sum of this series in the sense of convergence in C2 ( [ - 1, 1], then f and cp are equal on ] - 1, 1[. But the function cp is infinitely differentiable. Hence if we take any function in C2 ( [ - 1, 1], lR) that is not infinitely differentiable on ] - 1, 1 [ it cannot be expanded in a series in the system { x k ; k = 0, 1, . . . } . D
IR),
Thus, if we take, for example, the function x = l x l and the sequence of numbers {en = * ; E N } , we can construct a sequence {Pn (x); E N} of finite linear combinations Pn (x) = a0 + a 1 x + · · · + a n x n of elements of the system {x k ; k E N} such that II ! - Pn ll < * ' that is, Pn f as If necessary, one could assume that in each such linear combination Pn (x) the coefficients can be assumed to have been chosen in the unique best-possible 00 way ( see Example 9). Nevertheless, the expansion f = I: a k x k will not arise
n
--+
n n --+ oo.
k =O
since in passing from Pn ( x) to Pn+l ( x), not only the coefficient an+l changes, but also possibly the coefficients ao , . , a n . If the system is orthogonal, this does not happen ( no , . . . , an do not change ) because of the extremal property of Fourier coefficients. For example, one could pass from the system of monomials {x k } to the or thogonal system of Legendre polynomials and expand f(x) = l x l in a Fourier series in that system.
..
1 8 . 1 . 3 *An Important Source of Orthogonal Systems of Functions in Analysis
We now give an idea as to how various orthogonal systems of functions and Fourier series in those systems arise in specific problems.
Example 1 5. The Fourier method. Let us regard the closed interval [ 0, l ] as the equilibrium position of a homogeneous elastic string fastened at the endpoints of this interval, but
18. 1 Basic General Concepts Connected with Fourier Series
517
otherwise free and capable of making small transverse oscillations about this equilibrium position. Let u(x, t) be a function that describes these oscilla tions, that is, at each fixed instant of time t = t0 the graph of the function u(x, t0 ) over the closed interval 0 :::; x :::; l gives the shape of the string at time t0 . This in particular, means that u(O, t) = u(l, t) = 0 at every instant t, since the ends of the string are clamped. It is known (see for example Sect . 14.4) that the function u(x, t) satisfies the equation EPu = a2 82 u , ( 18. 2 1)
8t 2
8x 2
where the positive coefficient a depends on the density and elastic constant of the string. The equation ( 18.21) alone is of course insufficient to determine the func tion u(x, t). From experiment we know that the motion u(x, t) is uniquely determined if, for example, we prescribe the position u(x, 0) = cp(x) of the string at some time t = 0 ( which we shall call the initial instant) and the velocity �� (x, 0) = 1./J(x) of the points of the string at that time. Thus, if we stretch the string into the shape cp(x) and let it go, then 'lj;(x) = 0 . Hence the problem of free oscillations of the string7 that is fixed at the ends of the closed interval [0, l] has been reduced to finding a solution u(x, t) of Eq. ( 18.21) together with the boundary conditions
u(O, t) = u(l, t) = 0
( 18. 22 )
and the initial conditions
u(x, 0) = cp(x) ,
(18.23)
To solve such problems there exists a very natural procedure called the method of separation of variables or the Fourier method in mathematics. It consists of the following. The solution u(x, t) is sought in the form of a series 00 I: Xn (x)Tn (t) whose terms X(x)T(t) are solutions of an equation of special
n= l
form (with variables separated) and satisfy the boundary conditions. In the present case, as we see, this is equivalent to expanding the oscillations u(x, t) into a sum of simple harmonic oscillations (more precisely a sum of standing waves) . Indeed, if the function X(x)T(t) satisfies Eq. (18.2 1 ) , then X(x)T" (t) = a2 X"(x)T(t), that is, T" ( t) X"(x) ( 18. 2 4) ' 2
a T(t)
X(x) In Eq. (18. 2 4 ) the independent variables x and t are on opposite sides of
the equation (they have been separated) , and therefore both sides actually 7 We note that the foundations of the mathematical investigation of the oscillations of a string were laid by Brook Taylor.
518
18 Fourier Series and the Fourier Transform
represent the same constant .\. If we also take into account the boundary conditions X(O)T(t) = X(l)T(t) = 0 that the solution of stationary type must satisfy, we see that finding such a solution reduces to solving simultaneously the two equations
T"(t) = .\ a 2 T(t) , X" (x) = .\X(x) under the condition that X (O) = X (l) = 0.
(18. 25) (18.26)
It is easy to write the general solution of each of these equations individ ually:
T(t) = A cos V>..at + B sin V>..at , X(x) = C cos J>..x + D sin J>..x .
(18. 2 7) (18.28)
If we attempt to satisfy the conditions X (O) = X (l) = 0, we find that for .\ =/= 0 we must have C = 0, and, rejecting the trivial solution D = 0, we find that sin ../5.l = 0, from which we find ../). = ±mr /l , n E N. Thus it turns out that the number .\ in Eqs. ( 18.25) and (18.26) can be chosen only among a certain special series of numbers (the so-called eigenval ues of the problem) , An = (mr/l) 2 , where n E N. Substituting these values of .\ into the expressions ( 18.27) and (18. 2 8) , we obtain a series of special solutions 1r
(
)
na t + B sm . na u11 (x, t) = sm n y x An cos n l n nlt , .
(18. 2 9)
satisfying the boundary conditions u n (O, t) = u n (l , t) = 0 (and describing a standing wave of the form
(18.30)
18. 1 Basic General Concepts Connected with Fourier Series
and
519
( 18.31) '1/J ( x) = n"L.= l n 7 Bn sinn y x · Thus the problem has been reduced to finding the coefficients An and Bn , which up to now have been free, or, what is the same, to expanding the Fourier [0,series in the system { sinnfx; n E N } , which functions is orthogonalandon '1/Jtheininterval l] . to remark that the functions { sin nfx; n E N } , which arose fromIt Eq.is useful ( 18.26) can be regarded as eigenvectors of the linear operator A = � corresponding to the eigenvalues A n = nf, which in turn arose from the condition that the operator A acts on the space of functions in c< 2 ) [0, l] that vanish at the endpoints of the closed interval [0, l] . Hence Eqs. (18.30) and ( 18.31) can be interpreted as expansions in eigenvectors of this linear operator. linearofoperators connected with particularin analysis. problems are one of the mainThe sources orthogonal systems of functions nowsystems recall another fact known from algebra, which reveals the reason whyWe such are orthogonal. Let Z equal be a vector space with inner product ( , ) , and let E be a subspace (possibly to Z itself) that is dense in Z. A linear operator A : E --+ Z iseigenvectors symmetric if ( Ax, y ) = ( x, Ay ) for every pair of vectors x, y E E. Then: of a symmetric operator corresponding to different eigenvalues 00
cp
are orthogonal.
Proof.
Indeed, if Au = au and Av = (3v , and a =/= (3, then a (u, v) = (Au, v) = (u, Av) = (3 (u, v) ,
from which it follows that (u, v) = 0. 0 It is now useful to look at Example 3 from this point of view. There we were essentially considering the eigenfunctions of2the operator A = ( � onclosedthe interval space of[a,functions in C ( ) [a, b] that vanish at the q(x)) operating endpoints of the b] . Through integration by parts one can verify thatinnerthisproduct operator( 18.4)is symmetric on this space (with respect to the standard ) , so that the result of Example 4 is a particular manifestation of this algebraic fact. In particular, when q(x) 0 the operator A becomes d�2 , which for [a, b] = [0, l] occurred in the last example (Example 15) . We note and also that(seeinrelations this example theandquestion reduced to expanding the functions (18.30) ( 18.31)) in a series of eigenfunc '1/J A = d� Here of course the question arises whether tions of the operator 2• itequiisvtheoretically possible to formtosuchthe anquestion expansion, this question alent, as we now understand, of theandcompleteness of theis +
=
cp
18 Fourier Series and the Fourier Transform
520
system of eigenfunctions for the operator in question in the given space of functions. The completeness of the trigonometric system (and certain other particu lar systems of orthogonal functions) in R2 [-1r, 1r] seems to have been stated explicitly for the first time by Lyapunov.8 The completeness of the trigono metric system in paraticular was implicitly present in the work of Dirich let devoted to studying the convergence of trigonometric series. Parseval ' s equality, which is equivalent to completeness for the trigonometric system, as already noted, was discovered by Parseval at the turn of the nineteenth cen tury. In its general form, the question of completeness of orthogonal systems and their application in the problems of mathematical physics were one of the main subjects of the research of Steklov,9 who introduced the very con cept of completeness ( closedness) of an orthogonal system into mathematics. In studying completeness problems, by the way, he made active use of the method of integral averaging (smoothing) of a function (see Sects. 17.4 and 17.5) , which for that reason is often called the Steklov averaging method. 1 8 . 1 .4 Problems and Exercises
The method of least squares. The dependence y = j ( x 1 , . . . , xn ) of the quantity on the quantities x 1 , . . . , Xn is studied experimentally. As a result of m (� n ) experiments, a table was obtained 1.
y
X1
X2
Xn
I Y
. .�L . � � . . . . . . . . _t_t�. I b l -
-
each of whose rows contains a set ( ai , a�, . . . , a�) of values of the parameters and the value b' of the quantity y corresponding to them, measured by some device with a certain precision. From these experimental data we would n like to obtain an empirical formula of the form y = 2::: a;x; convenient for com i= l putation. The coefficients a 1 , 0: 2 , . . . , an of the required linear function are to be X 1 , X 2 , . . . , Xn
(
)
chosen so as to minimize the quantity f bk - f= a;a7 , which is the meank= l k= l square deviation of the data obtained using the empirical formula from the results obtained in the experiments. 8
2
A . M. Lyapunov (1857-1918) - Russian mathematician and specialist in mechan ics, a brilliant representative of the Chebyshev school, creator of the theory of stability of motion. He successfully studied various areas of mathematics and mechanics. 9 V.A. Steklov ( 1864-1926) - Russian/Soviet mathematician, a representative of the Petersburg mathematical school founded by Chebyshev and founder of the school of mathematical physics in the USSR. The Mathematical Institute of the Russian Academy of Sciences bears his name.
18.1 Basic General Concepts Connected with Fourier Series
521
Interpret this problem as the problem of best approximation of the vector (b 1 , . . . , brn ) be linear combinations of the vectors (a} , . . . , ai ) , i = 1 , . . . , n and show that the question reduces to solving a system of linear equations of the same type as Eq. {18. 18) . 2. a) Let O[a, b] be the vector space of functions that are continuous on the closed interval [a, b] with the metric of uniform convergence and 02 [a, b] the same vector space but with the metric of mean-square deviation on that closed interval (that is, b
J I f - g l 2 (x) dx) . Show that if functions converge in O[a, b] , they also converge in 02 [a, b] , but not conversely, and that the space 02 [a, b] is not complete, in contrast to O[a, b] . b) Explain why the system of functions { 1 , x, x 2 , . . . , } is linearly independent and complete in 02 [a, b] ' but is not a basis in that space. c) Explain why the Legendre polynomials are a complete orthogonal system and also a basis in 02 [- 1 , 1] . d) Find the first four terms of the Fourier expansion of the function sin JrX on the interval [ - 1 , 1] in the system of Legendre polynomials. e) Show that the square of the norm I I Pn ll in 02 [- 1 , 1] of the nth Legendre polynomial is d(f, g) =
2 2n + 1
(
�
2) · · · 2n (n + 1) = (-1t 2�n
j
)
(x 2 - 1 t dx .
-1 f) Prove that among all polynomials of given degree n with leading coefficient 1 , the Legendre polynomial Pn (x) is the one closest to zero on the interval [- 1 , 1] . g) Explain why the equality
where {Po , P1 , . . . } is the system of Legendre polynomials, necessarily holds for every function f E 02 ( [- 1 , 1] ,
3. a) Show that if the system {x 1 , x 2 , . . . , } of vectors is complete in the space X and X is an everywhere-dense subset of Y, then {x 1 , x 2 , . . . , } is also complete in Y. b) Prove that the vector space O[a, b] of functions that are continuous on the closed interval [a, b] is everywhere dense in the space R 2 [a, b] . (It was asserted in Problem 5g of Sect. 17.5 that this is true even for infinitely differentiable functions of compact support on [a, b] .) c) Using the Weierstrass approximation theorem, prove that the trigonometric system { 1 , cos kx, sin kx; k E N} is complete in R 2 [-1r, 1r] . d ) Show that the systems { 1 , x , x 2 , . . . } and { 1 , cos kx, sin kx; k E N} are both complete in R 2 [ -Jr, 1r] , but the first is not a basis in this space and the second is.
522
18 Fourier Series and the Fourier Transform e) Explain why Parseval's equality 1 :;
rr
J
- rr
holds, where the numbers
ao 2
l -l + � 2 2 I f I 2 (x) dx = L_., l a k I + l b k l 2 k= l oo
a k and bk are defined by (18.9) and (18. 10) .
f) Using the result of Example 8, now show that I:: � = 00
rr 2 6
•
n= l 4. Orhogonality with a weight function. a) Let po, . . . , Pn be continuous functions that are positive in the domain D . Verify that the formula Pl ,
J P (x)f(k)(x)g(k)(x)dx
k=O D k defines an inner product in C ( n ) (D, C) . (!, g) = t
b) Show that when functions that differ only on sets of measure zero are iden tified, the inner product
(!, g) =
J p( x) f( x)g( x) dx ,
D
involving a positive continuous function p can be introduced in the space R(D, C) . The function p here is called a weight function, and if (!, g, ) = 0, we say that the functions f and g are orthogonal with weight p. c) Let 'P : D -+ G be a diffeomorphism of the domain D C R n onto the domain G C JR n , and let { u k (y); k E N} be a system of functions in G that is orthogonal with respect to the standard inner product ( 18.2 ) or (18.3 ) . Construct a system of functions that are orthogonal in D with weight p( x) = I det i.p1 ( x) I and also a system of functions that are orthogonal in D in the sense of the standard inner product. d) Show that the system of functions {e rn , n (x, y) = e i(rnx+n y ) ; m, n E N} is orthogonal on the square I = {(x, y) E R 2 l l x l :::; 1r 1\ IYI :S: 1r } . e) Construct a system of functions orthogonal on the two-dimensional torus T2 c R2 defined by the parametric equations given in Example 4 of Sect. 12. 1 . The inner product o f functions f and g o n the torus i s understood as the surface integral J fg da. T2
5 . a) It is known from algebra (and we have also proved it in the course of discussing constrained extremal problems) that every symmetric operator A : E n -+ E n on n-dimensional Euclidean space E n has nonzero eigenvectors. In the infinite dimensional case this is generally not so. Show that the linear operator f ( x) >-+ x f ( x) of multiplication by the independent variable is symmetric in C2 ( [a, b], R ) , but has no nonzero eigenvectors.
18. 1 Basic General Concepts Connected with Fourier Series
523
b) A Sturm 10 -Liouville problem that often arises in the equations of mathemati cal physics is to find a nonzero solution of an equation u"(x)+ [q(x)+Ap(x)]u(x) = 0 on the interval [a, b] satisfying certain boundary conditions, for example u(a) = u(b) = 0. Here it is assumed that the functions p(x) and q(x) are known and continuous on the interval [a, b] in question and that p(x) > 0 on [a, b]. We have encountered such a problem in Example 15, where it was necessary to solve Eq. (18.26) under the condition X (O) = X(l) = 0. In this case we had q(x) = 0, p(x) = 1 , and [a, b] = [0, l] . We have verified that a Sturm-Liouville problem may in general turn out to be solvable only for certain special values of the parameter A, which are therefore called the eigenvalues of the corresponding Sturm -Liouville problem. Show that if the functions f and g are solutions of a Sturm-Liouville problem corresponding to eigenvalues AJ f. A9, then the equality d: (g' f - !'g) = (AJ A9 )pf g holds on [a, b] and the functions f and g are orthogonal on [a, b] with weight p. c) It is known (see Sect. 14.4) that the small oscillations of an inhomogeneous string fastened at the ends of the closed interval [a, b] are described by the equation (pu�)� = pu�� ' where u = u(x, t) is the function that gives the shape of the string at each time t, p = p(x) is the linear density, and p = p(x) is the elastic constant at the point x E [a, b] . The clamping conditions mean that u(a, t) = u(b, t) = 0. Show that if we seek the solution of this equation in the form X(x)T(t), the question reduces to a system T" = AT, (pX')' = ApX, in which A is the same number in both equations. Thus a Sturm-Liouville problem arises for the function X(x) on the closed interval [a, b], which is solvable only for particular values of the parameter A (the eigenvalues) . (Assuming that p ( x ) > 0 on [a, b] and that p E C ( l l [a, b] we can obviously bring the equation (pX' )' = ApX into a form in which the first derivative is missing by the change of variable x = f !ch . ) a
d) Verify that the operator S(u) = (p(x)u'(x))' - q(x)u(x) on the space of functions in c< 2 l [a, b] that satisfy the condition u(a) = u(b) = 0 is symmetric on this space. (That is, (Su, v) = (u, Sv), where ( , ) is the standard inner produt of real-valued functions.) Verify also that the eigenfunctions of the operator S corre sponding to different eigenvalues are orthogonal. e) Show that the solutions X 1 and X2 of the equation (pX')' = ApX corre sponding to different values A 1 and A 2 of the parameter A and vanishing at the endpoints of the closed interval [a, b] are orthogonal on [a, b] with weight p(x) . 6. The Legendre polynomials as eigenfunctions. a) Using the expression of the Legendre polynomials Pn ( x ) given in Example 5 and the equality (x 2 - 1 ) n = (x - 1t(x + 1) n , show that Pn (1) = 1 .
1 0 J.Ch.F. Sturm (1803-1855) French mathematician (and, as it happens, an honorary foreign member of the Petersburg Academy of Sciences) ; his main work was in the solution of boundary-value problems for the equations of mathematical physics. -
18
524
Fourier Series and the Fourier Transform
b) By differentiating the identity (x 2 - l ) d';, (x 2 - l ) n = 2nx(x 2 - l) n , show that Pn ( x) satisfies the equation (x 2 - 1) P:; (x) + 2x P� (x) - n(n + l)Pn (x) = 0 . ·
·
c) Verify that the operator
d + 2x d d [ ( x 2 - 1)d] - 1)dx 2 dx dx dx is symmetric in the space c< 2 l [-1, 1] c R 2 [-1, 1]. Then, starting from the relation A : = ( x2
2
=
A(Pn ) = n(n + l)Pn , explain why the Legendre polynomials are orthogonal. d) Using the COmpleteness of the system { 1, X, x 2 , . . . } in C ( 2) [ -1, 1], show that the dimension of the eigenspace of the operator A corresponding to the eigenvalue n(n + 1) cannot be larger than 1. e ) Prove that the operator A = d';, [ ( x 2 - 1) d';, ] cannot have eigenfunctions in the space c< 2) [ -1, 1] except those in the system {Po ( X ) , pl ( X ) , } of Legendre poly nomials, nor any eigenvalues different from the numbers {n(n + 1); n = 0, 1, 2, . . . } . 7. Spherical functions. a ) I n solving various problems i n IR3 (for example, problems of potential theory connected with Laplace's equation Llu = 0) the solutions are sought in the form of a series of solutions of a special form. Such solutions are taken to be homogeneous polynomials Sn (x, y, z ) of degree n satisfying the equation Llu = 0 . Such polynomials are called harmonic polynomials. In spherical coordinates ( r,
0 0
·
Pn (cos B) , Pn ,rn (cos B) cos m
18. 1 Basic General Concepts Connected with Fourier Series
525
a) Show that Ho (x) = 1 , H1 (x) = 2x, H2 (x) = 4x 2 - 2. b) Prove that Hn (x) is a polynomial of degree n. The system of functions { Ho ( x), H1 ( x), . . . } is called the system of Hermite polynomials. c) Verify that the function Hn (x) satisfies the equation H;: (x) - 2xH� (x) + 2nHn (x) = 0. 2 d) The functions '1/J n (x) = e -x Hn (x) are called the Hermite functions. Show that '1/J� (x) + (2n + 1 - x 2 )'1/J n (x) = 0, and that '1/J n (x) --+ 0 as x --+ oo. + oo
e) Verify that J '1/Jn 'I/Jm dx = 0 if m i- n. oo
2 f) Show that the Hermite polynomials are orthogonal on lR with weight e -x . 9. The Chebyshev-Laguerre 1 1 polynomials {L n (x) ; n = 0, 1 , 2 , . . . } can be defined ). by the formula L n ( x ) : = e x d ( xdn e-"' xn Verify that a) L n (x) is a polynomial of degree n; b) the function L n (x) satisfies the equation xL � (x) + ( 1 - x)L� (x) + nL n (x) = 0 ; -
c) the system {L n ; n = 0, 1 , 2, . . . } of Chebyshev-Laguerre polynomials is or thogonal with weight e - x on the half-line (0, +oo[. 10. The Chebyshev polynomials {To (x) = 1, Tn (x) = 2 1 - n cos n(arccos x); n E N} for l xl < 1 can be defined by the formula dn (-2) n n! � ,., ( ) = 1! - x 2� n ( 1 - X 2 ) n - ! . .L n X y (2n) ! dx Show that: a) Tn (x) is a polynomial of degree n; b) Tn (x) satisfies the equation ( 1 - x 2 )T;: (x) - xT� (x) + n 2 Tn (x)
=
0;
c) the system {Tn ; n = 0, 1 , 2, . . . } of Chebyshev polynomials is orthogonal with on the interval ] - 1 , 1 [. weight p(x) = yr!---:; 1 - x2 1 1 . a) In probability theory and theory of functions one encounters the following system of Rademacher1 2 functions : { '1/J n (x) =
1 1 E.N. Laguerre ( 1834-1886) - French mathematician. 1 2 H.A. Rademacher (1892-1969) - German mathematician (American after 1936) . 1 3 A. Haar (1885-1933) - Hungarian mathematician.
526
{ -� ,
18 Fourier Series and the Fourier Transform
Xn , k (x) =
1'
if ��+r < x < ��:jJ ,
if ��'fl < X < 2 �i1 ,
at all other points of [0, 1 ) . Verify that Haar system is orthogonal on the closed interval [0, 1 ) .
1 2 . a) Show that every n-dimensional vector space with an inner product is iso metrically isomorphic to the arithmetic Euclidean space JRn of the same dimension. b) We recall that a metric space is called separable if it contains a countable everywhere-dense subset. Prove that if a vector space with inner product is sepa rable as a metric space with the metric induced by the inner product, then it has a countable orthonormal basis. c) Let X be a separable Hilbert space ( that is, X is a separable and complete metric space with the metric induced by the inner product in X). Taking an or thonormal basis { e; ; i E N} in X, we construct the mapping X 3 x f-t ( c 1 , c2 , . . . ) , where c; = (x, e; ) are the Fourier coefficients of the expansion of the vector in the basis {e; } . Show that this mapping is a bijective, linear, and isometric mapping of X onto the space l 2 considered in Example 14. d) Using Fig. 18. 1 , exhibit the basic idea of the construction of Example 14, and explain why it comes about precisely because the space in question is infinite dimensional. e) Explain how to construct an analogous example in the space of functions C [a , b] C 'R.2 [a, b] .
1 8 . 2 Trigonometric Fourier Series 1 8. 2 . 1 Basic Types of Convergence of Classical Fourier Series
a. Trigonometric Series and Trigonometric Fourier Series A classical trigonometric series is a series of the form 1 4
�0 + L= ak cos kx + bk sin kx , 00
k l
( 18.32)
obtained on the basis of the trigonometric system { 1 , cos kx, sin kx; k E N}. The coefficients {a 0 , a k , b k ; k E N} here are real or complex numbers. The partial sums of the trigonometric series ( 18.32) are the trigonometric polyno mials n 0 Tn (x) = + L a k cos kx + bk sin kx (18.33)
n.
�
k= l
corresponding to degree 1 4 Writing the constant term in the form ao/2, which is convenient for Fourier series, is not obligatory here.
527
18.2 Trigonometric Fourier Series
If the series ( 18.32) converges pointwise on JR, its sum f(x) is obviously a function of period 21r on JR. It is completely determined by its restriction to any closed interval of length 27r. Conversely, given a function of period 21r on lR (oscillations, a signal, and the like) that we wish to expand into a sum of certain canonical periodic func tions, the first claimants for such canonical status are the simplest functions of period 27r, namely { 1 , cos kx, sin kx; k E N} , which are simple harmonic oscillations of corresponding degree n, with frequencies that are multiples of the lowest frequency. Suppose we have succeeded in representing a continuous function as the sum ao 00 . ( 18.34) f(x) = 2 + "' L a k cos kx + b k sm kx = k l of a trigonometric series that converges uniformly to it . Then the coefficients of the expansion ( 18.34) can be easily and uniquely found. Multiplying Eq. ( 18.34) successively by each function of the system { 1 , cos ks, sin kx; k E N} , using the fact that termwise integration is possible in the resulting uniformly convergent series, and taking account of the relations 7r
7r
7r
J 1 2 dx = 27r ,
- 7r
J cos mx cos nx dx = J sin mx sin nx dx 7r
7r
- 7r
- 7r
=
0 for m =I n , m, n E N ,
J cos2 nx dx = J sin2 nx dx = 7r ,
we find the coefficients ak bk
= ak (f) =
bk (f)
7r
� J f(x) cos kx dx , 1 J f(x) sm. kx dx , - 7r
7r
:;
- 7r
nEN,
k = 0, 1 , . . .
'
(18.35)
k = 1 , 2, . . . .
( 18.36)
of the expansion ( 18.34) of the function f in a trigonometric series. We have arrived at the same coefficients that we would have had if we had regarded ( 18.34) as the expansion of the vector f E R2 [ - 1r , 1r] in the orthogonal system { 1, cos kx, sin kx; k E N}. This is not surprising, since the
528
18 Fourier Series and the Fourier Transform
uniform convergence of the series ( 18.34) of course implies convergence in the mean on the closed interval [-1r, 1r] , and then the coefficients of (18.34) must be the Fourier coefficients of the function f in the given orthogonal system (see Sect . 18. 1 ) . Definition 1 . If the integrals (18.35) and (18.36) have meaning for a func tion f , then the trigonometric series
f "'
� + kf:= l ak (f) cos kx + bk (f) sin kx
ao f)
( 18.37)
assigned to f is called the trigonometric Fourier series of f. Since there will be no Fourier series in this section except trigonometric Fourier series, we shall occasionally allow ourselves to drop the word "trigono metric" and speak of just "the Fourier series of f." In the main we shall be dealing with functions of class 'R ( [-7r, 1r] , C) , or, slightly more generally, with functions whose squared absolute values are integrable (possibly in the improper sense) on the open interval J - 1r, 1r[. We retain our previous notation 'R2 [-1r, 1r] to denote the vector space of such functions with the standard inner product
(!, g) =
7r
j fg dx .
( 18.38)
Bessel ' s inequality
l ao �W + f: l a k UW + l bk UW ::::; � j l f l 2 (x) dx , k= l 7r
(18.39)
- 7r
which holds for every function f E 'R2 ([ - 7!" , 1r] , C) , shows that by no means every trigonometric series ( 18.3 2 ) can be the Fourier series of some function f E 'R2 [-1r, 1r] . Example 1 . The trigonometric series
f: sin kx k= l
v'k '
IR,
as we already know (see Example 7 of Sect . 16. 2 ) converges on but is not 2 the Fourier series of any function f E 'R2 [ -7r, 1r] , since the series f: ( �)
k=l
diverges. Thus, arbitrary trigonometric series ( 18.32) will not be studied here, only Fourier series (18.37) of functions in 'R2 [-1r, 1r] , and functions that are abso lutely integrable on J - 1r, 1r [.
18.2 Trigonometric Fourier Series
529
b. Mean Convergence of a Trigonometric Fourier Series Let
Sn (x)
=
; + t= ak (f) cos kx + bk (f) sin kx
ao f )
( 18.40)
k l be the nth partial sum of the Fourier series of the function f E R 2 [ - 71" , 1r] . The deviation of Sn from f can be measured both in the natural metric of the space R2 [-1r, 1r] induced by the inner product (18.38) , that is, in the sense of the mean-square deviation I I / - Sn ll
=
11"
J If
- 11"
-
Sn l 2 (x) dx
(18.41)
of Sn from f on the interval [-1r, 1r] , and in the sense of pointwise convergence on that interval. The first of these two kinds of convergence was studied for an arbitrary series in Sect. 18. 1 . Making the results obtained there specific in the context of a trigonometric Fourier series involves first of all noting that the trigonometric system { 1 , cos kx, sin kx; k E N} is complete in R 2 [-1r, 1r] . (This has already been noted in Sect . 18. 1 and will be proved independently in Subsect . 18.2.4 of the present section. ) Hence, the fundamental theorem of Sect. 18. 1 enables us t o say i n the present case that the following theorem is true. Theorem 1. (Mean convergence of a trigonometric Fourier series) . The Fourier series ( 18.37) of any function f E R 2 ( [-1r, 1r] , C) converges to the function in the mean ( 18.41 ) , that is,
ao ( f )
. f (x) ;: -- + � L...t a k (f) cos kx + bk (f) sm kx , 2 2 k= l and Parseval 's equality holds: ( 18.42) We shall often use the more compact complex notation for trigonometric polynomials and trigonometric series, based on the Euler formulas ei x cos x + i sin x, cos x = � ( ei x + e - i x ) , sin x = � ( e i x - e - i x ) . Using them, we can write the partial sum (18.40) of the Fourier series as =
Sn (x) =
n
L
k= -n
Ck e i kx ,
( 18.40' )
530
18 Fourier Series and the Fourier Transform
and the series ( 18.37) itself as
_
where
Ck -
{
(18.37') � (a k - ib k ) , 1 2 ao ,
if k
Ck
0,
if k = 0 ,
� (a -k + ib_ k ) , if k
that is,
>
1f
f(x)e - i kx dx , = ck (f) = 2_ 27r J
<
( 18.43)
0, kEZ,
(18.44)
and hence the numbers Ck are simply the Fourier coefficients of f in the system {ei kx ; k E Z} . We call attention to the fact that summation of the Fourier series ( 18.37') is understood in the sense of the convergence of the sums ( 18.40') . In complex notation Theorem 1 means that for every function f E
R2 ( [ - n, n] , C )
and
00
2� 11 ! 11 2 = f00 i ck UW -
·
( 18.45)
c. Pointwise Convergence of a Trigonometric Fourier Series Theorem 1 gives a complete solution to the problem of mean convergence of a Fourier series ( 18.37) , that is, convergence in the norm of the space R2 [-n, n]. The remainder of this section will be mainly devoted to studying the conditions for and the nature of pointwise convegence of a trigonometric series. We shall consider only the simplest aspects of this problem. The study of pointwise convergence of a trigonometric series, as a rule, is such a delicate matter that , despite the traditional central position occupied by Fourier series after the work of Euler, Fourier, and Riemann, there is still no intrinsic descrip tion of the class of functions that can be represented by trigonometric series converging to them at every point (the Riemann problem). Until recently it was not even known whether the Fourier series of a continuous function must converge to it almost everywhere (it was known that convergence need not occur at every point ) . Previously A.N. Kolmogorov 1 5 had even given an 1 5 A.N. Kolmogorov ( 1903-1987) - outstanding Soviet scholar, who worked in probability theory, mathematical statistics, theory of functions, functional analysis, topology, logic, differential equations, and applied mathematics.
18.2 Trigonometric Fourier Series
531
example of a function f E L[ - n, n] whose Fourier series diverged everywhere ( where L[ -n, n] is the space of Lebesgue-integrable functions on the interval [-n, n] , obtainable as the metric completion of the space R[ - n, n]) , and D.E. Men ' shov 1 6 constructed a trigonometric series ( 18.32) with coefficients not all zero that nevertheless converged to zero almost everywhere ( Men 'shov 's null-series ) . The problem posed by Luzin 1 7 ( Luzin's problem ) of deter mining whether the Fourier series of every function f E £ 2 [ - n, n] converges almost everywhere ( where £ 2 [-n, n] is the metric completion of R2 [ - n, n]) was answered in the affirmative only in 1966 by L. Carleson. 1 8 It follows in particular from Carleson ' s result that the Fourier series of every function f E R2 [-n, n] ( for example a continuous function ) must converge at almost all points of the closed interval [ -n, n] .
N. N .
18.2.2 Investigation of Pointwise Convergence of a Trigonometric Fourier Series a. Integral Representation of the Partial Sum of a Fourier Series Let us now turn our attention to the partial sum ( 18.40) of the Fourier series ( 18.37) and, using the complex notation ( 18.40') for the expression ( 18.44) for the Fourier coefficients, we make the following transformations:
7r
�J
2
- 1r
f(t)
( kL=n- n eik (x -t) ) dt .
( 18.46)
But u
Dn ( )
:=
n
ei (n+ l )u - e - i n u ei. k u = ---,-.---L ei u - 1 k= - n
ei (n+ ! )u - e - i (n+ ! )u ei ! u e - i ! u
and, as can be seen from the very definition, D n ( u ) Hence sin ( n + � ) u Dn ( U ) = . 21 u ' sm
_
=
( 2n + 1) if e i u
( 1 8.47) =
1.
( 18.48)
1 6 D.E. Men'shov (1892-1988) - Soviet mathematician, one of the greatest special ists in the theory of functions of a real variable. 1 7 N.N. Luzin (1883-1950) - Russian / Soviet mathematician, one of the greatest specialists in the theory of functions of a real variable, founder of the large Moscow mathematical school ( "Lusitania" ) . 1 8 L. Carleson ( b. 1928) - outstanding Swedish mathematician whose main works are in various areas of modern analysis.
532
18 Fourier Series and the Fourier Transform
where the ratio is regarded as 2n + 1 when the denominator of the fraction becomes zero. Continuing the computation ( 18.46) , we now have
Sn ( x )
=
� J f (t) Dn (X - t) t . 71"
2
( 18.49)
d
- 71"
We have thus represented Sn ( x ) as the convolution of the function f with the function ( 18.48) , which is called the Dirichlet kernel. As can be seen from the original definition ( 18.47) of the function Dn (u), the Dirichlet kernel is of period 2rr and even, and, in addition
� J Dn (u) du � J Dn (u) du = 1 . 71"
2
=
- 71"
71"
( 18.50)
0
Assuming the function f is of period 2rr on R or is extended from [ -rr, rr] to R so as to have period 2rr, and making a change of variable in ( 18.49) , we obtain 71" 71" sin ( + .!2 ) t 1 1 (x dt . (18.51) t) Dn (t) dt = - f (x - t) Sn ( x ) = f . 1 2 1r 2 1r sm 2t
!
n
!
- 7r
- 7r
In carrying out the change of variable here, we used the fact that the integral of a periodic function is the same over every interval whose length equals a period. Taking account of the fact that Dn (t) is an even function, we can rewrite Eq. (18.51) as
Sn ( x )
= =
� J ( / (x - t) 1 (f(x - t) 2 J 71"
2
0
1r
71"
+ f(x + t)) Dn (t) d t + f(x + t))
sin
0
=
n
( + .!2 ) t t. . 1 sm 2t d
( 18.5 2)
The Riemann-Lebesgue Lemma and the Localization Principle The representation ( 18.52) for the partial sum of a trigonometric :Fourier series, together with an observation of Riemann stated below, forms the basis for studying the pointwise convergence of a trigonometric Fourier series. Lemma 1 . ( Riemann-Lebesgue ) . If a locally integmble function f ]wb w 2 [---+ R is absolutely integmble (perhaps in the improper sense) on an open interval ]w 1 , w2 [, then W2
J f(x)ei>..x x ---+ 0 as A ---+ oo ,
WI
d
AER.
( 18.53)
18.2 Trigonometric Fourier Series
533
Proof. If ]w 1 , w2 [ is a finite interval and f(x) = 1 , then Eq. ( 18.53) can be verified by direct integration and passage to the limit. We shall reduce the general case to this simplest one. Fixing an arbitrary E: > 0, we first choose an interval [a, b] c]w 1 , w2 [ such that for every >. E � W2
IJ
f(x)e i>.x dx -
J f(x)ei>.x x l b
< E:
d
a
.
( 18.54)
In view of the estimates
W2
IJ w1
f(x)ei >.x dx a
:::; J W1
J f(x)ei>.x x l b
d
a
l f(x)e i>.x l dx +
:::; w2
J b
l f(x)ei>.x l dx =
a
w2
J l fl (x) dx J lfl (x) dx +
W1
b
and the absolute integrability of f on ]w 1 , w2 [, there does of course exist such a closed interval [a, b] . Since f E R([a, b] , � ) (more precisely !I [a,b] E R( [a, bl ) ) , there exists a n lower Darboux sum 2.:: mj LlXj , where mj = inf f(x) , such that
j= l
x E [Xj - 1 ,Xj )
b
0<
J f(x) dx - j=Z: mj LlXj < E . n
l
a
Now introducing the piecewise constant function g(x) we find that g(x) :::; f(x) on [a, b] and
[xj - ! , Xj ] , j = 1 ,
. . . , n,
I / f(x)ei>.x dx - J g(x)ei>.x dx l :::; :::; J lf(x) - g(x) l l ei>.x l x J {J(x) - g(x)) b
mj for x E
b
0 :::;
a b
a
d
a
=
b
( 18.55)
dx < E .
a
but b
J a
g(x)ei >.x dx
= =
8n J mj ei>.x dx Xj
J-
Xj - 1
=
;_ � (mJ· ei >.x ) l Xxj3 - 1 L.....,; I A j= l
--+
0 as >. --+
oo ,
>. E � ( 18.56) .
534
18 Fourier Series and the Fourier Transform
Comparing relations ( 18.53)-( 18.56) , we find what was asserted. D
Remark
1.
Separating the real and imaginary parts in ( 18.53) , we find that
W2
j f ( x) cos Ax dx --7 0
Wl
and
--7 oo,
W2
j f(x) sin Ax dx --7 0
Wl
(18.57)
as A A E R If the function f in the preceding integrals had been complex, then, separating the real and imaginary parts in them, we would have found that relations (18.57) , and consequently (18.53) , would actually be valid for complex-valued functions f : ] w 1 , w 2 [ -7 C.
Remark 2. If it is known that f E R2 [ -7r, 7r ] , then by Bessel ' s inequality ( 18.39) we can conclude immediately that 71"
J f(x) nx x --7 0 cos
d
71"
J f(x) nx x --7 0
and
- 71"
- 71"
n --7 oo, n
sin
d
as E N. Theoretically, we could have gotten by with just this discrete version of the Riemann-Lebesgue lemma for the elementary investi gations of the classical Fourier series that will be carried out here. Returning now to the integral representation ( 18.52) of the partial sum of a Fourier series, we remark that if the function f satisfies the hypotheses of the Riemann-Lebesgue lemma, then, since sin �t 2: sin � 8 > 0 for 0 < 8 :::; t :::; 71" , we can use ( 18.57) to write
Sn (x) =
1 2 7r
8
j (f (x - t) + f (x + t) ) sin (n + 2 ) .
1
sm 2 t
0
! t
d t +o ( 1 ) as
n --7 oo . (18.58)
The important conclusion one can deduce from ( 18.58) is that the con vergence or divergence of a Fourier series at a point is completely determined by the behavior of the function in an arbitrarily small neighborhood of the point . We state this principle as the following proposition.
Theorem 2. (Localization principle ) . Let f and g be real- or complex-valued locally integrable functions on ] - 71" , 7r [ and absolutely integrable on the whole interval (possibly in the improper sense). If the functions f and g are equal in any (arbitrarily small} neighborhood of the point xo E] - 71", 71" [ , then their Fourier series +oo
f(x) "' L Ck ( f ) eikx ' - oo
+oo
g(x) "' L ck ( g ) eikx - oo
18.2 Trigonometric Fourier Series
535
either both converge or both diverge at xo . When they converge, their limits are equal. 1 9 Remark 3. As can be seen from the reasoning used in deriving Eqs. ( 18.52) and ( 18.58) , the point x0 in the localization principle may also be an endpoint of the closed interval [ - 1r , 1r ] , but then (and this is essential!) in order for the periodic extensions of the functions f and g from the closed interval [ - 7r , 1r ] to lR to be equal on a neighborhood of x0 it is necessary (and sufficient) that the original functions be equal on a neighborhood of both endpoints of the closed interval [ - 1r , 1r ] . c. Sufficient Conditions for a Fourier Series to Converge at a Point Definition 2 . A function f : U C defined on a deleted neighborhood of a point x E lR satisfies the Dini conditions at x if a) both one-sided limits
--+
lim0 f(x - t) , f(x_ ) = t --++
exist at x; b) the integral
J +O
lim0 f(x + t) f(x + ) = t --++
(f(x - t ) - f(x _ )) : (f(x + t ) - f(x+ )) dt
converges absolutely. 2 0 Example 2. If f is a continuous function in U(x) satisfying the Holder con dition l f(x + t ) - f(x) l ::; M I W , 0 < a ::; 1 , then, since the estimate f(x + t ) - f(x) < __J!_ - I W-"' t now holds, the function f satisfies the Dini conditions at x. It is also clear that if a continuous function f defined in a deleted neighbor hood U(x) of x has one-sided limits f(x_ ) and f(x + ) and satisfies one-sided Holder conditions l f(x + t ) - f(x+ ) l :S M t a , l f(x - t ) - f(x - ) 1 :S M t a ,
I
I
where t > 0, 0 < a ::; 1 , and M is a positive constant , then Dini conditions for the same reason as above.
19 Although the limit need not be J(xo)
20
What is meant is that the integral
f0
f will satisfy the
g(xo ) . converges absolutely for some value E > 0. =
18 Fourier Series and the Fourier Transform
536
Definition 3. We shall call a real- or complex-valued function f piecewise continuous on the closed interval [a, b] if there is a finite set of points a = x0 < x 1 < · · · < Xn = b in this interval such that f is defined on and has one-sided limits on approach to each interval ] x3 _ 1 , x3 [, j = 1 , its endpoints.
. . . , n,
Definition 4 . A function having a piecewise-continuous derivative on a closed interval is piecewise continuously differentiable on that interval.
Example 3. If a function is piecewise continuously differentiable on a closed interval, then it satisfies the Holder conditions with exponent a = 1 at every point of the interval, as follows from Lagrange ' s finite-increment (mean-value) theorem. Hence, by Example 1 , such a function satisfies Dini ' s conditions at every point of the interval. At the endpoints of the interval, of course only the corresponding one-sided pair of Dini ' s conditions needs to be verified. Example 4 . The function f( x ) point x E even at zero.
IR,
=
sgn x satisfies Dini ' s conditions at every
Theorem 3. (Sufficient conditions for convergence of a Fourier series at a point ) . Let f : lR -+ C be a function of period 2rr that is absolutely integrable on the closed interval [ - rr , rr ] . If f satisfies the Dini conditions at a point xE then its Fourier series converges at that point, and
IR,
� c (f) ei x = f ( x_ ) ; f(x -
oo
k
k
+
)
(18.59)
Proof. By relations ( 18.52) and ( 18.50)
Sn ( x )
_
f (x ) + f (x+ ) = 2 7r _- -1 { J(x - t) - f( x ) ) + { J(x + t) - f ( x + ) ) sm. ( + 1 ) t dt . 1 2 rr 2 sin 2t _
J 0
-
n
-
Since 2 sin ! t "' t as t -+ + 0 , by the Dini conditions and the Riemann Lebesgue lemma we see that this last integral tends to zero as 0
n -+ oo.
Remark 4. In connection with the theorem just proved and the localization principle, we note that changing the value of the function at a point has no influence on the Fourier coefficients or the Fourier series or the partial sums of the Fourier series. Therefore the convergence and the sum of such a series at a point is determined not by the particular value of the function at the point, but by the integral mean of its values in an arbitrarily small neighborhood of the point. It is this fact that is reflected in Theorem 3.
537
18.2 Trigonometric Fourier Series
Example
5.
In Example 6 of Sect . 18. 1 we found the Fourier series x
"'
00 ( - 1 ) k + l sin kx 2
2::::
( 1 8.60)
k
k=1
for the function f(x) = x on the closed interval [-1r, 1r] . Extending the func tion f(x) periodically from the interval ] - 1r, 1r [ to the whole real line, we may assume that the series (18.60) is the Fourier series of this extended function. Then, on the basis of Theorem 3 we find that
oo ( - 1 ) k + l 2:::: 2 k sin kx = k =1
{X,
if l x l < 1r ,
0 ' if l x l = 1r .
In particular, for x = � it follows from this relation that
� (-l) n = � . L..n = O 2n + 1 4
Example 6. Let a E lR and l a l < 1 . Consider the 21r-periodic function f(x) defined on the closed interval [-1r, 1r] by the formula f (x) = cos a x . B y formulas (18.3 5 ) and (18.36) we find its Fourier coefficients
( -1) n sin 1r a J cos a x cos nx dx = ; 7r 1
bn (f) =
7r
2a
- 7r
7r
� J cos ax sin nx dx = 0 . - 7r
By Theorem 3 the following equality holds at each point x 2 sin 7ra 1 � (-1) n cos nx cos a x = a + L..- a 2 - n 2 2 a 2 n=1 1r
(
)
E [-1r , 1r] :
When x = 1r, this relation implies 2 a 00 1 1 cot 1r a - - = - ""' L..- a 2 - n 2 1r n=1 7r a
--=----=-
I
I
(18.61)
1 < a0 < 1 ' then a 2-n2 < n2 -1 a g ' and hence the series on the If l a l right-hand side of Eq. (18.61) converges uniformly with respect to a on every closed interval l a l :s; a0 < 1 . Hence it is legitimate to integrate it termwise, that is,
538
18 Fourier Series and the Fourier Transform 7r
J o
00 2::: ( cot 7rcY - 2_ ) dx = :
7r
J � - n2 1
n= o
?TO'
and
a
a a ,
d
00 sm ?TX 2::: : ln 1 - x2 , l n -- 1TX 1T 2 n= 1
yielding
.
(
_
)
oo sin ?TX IJ x2 when l x l < 1 . ( 18.62) = 1 -2 n ?TX n= 1 We have thus proved relation ( 18.62) , which we mentioned earlier when deriving the complement formula for Euler ' s function T(x) (Sect. 1 7.3) .
and finally,
)
(
d . Fejer's Theorem. Let us now consider the sequence of functions
O"n (x) := So(x) +n· +· · 1+ Sn (x)
,
that are the arithmetic means of the corresponding partial sums S0(x), . . . , Sn (x) of the trigonometric Fourier series ( 18.37) of a function f : JR. � C of period 21r. From the integral representation (18.51) of the partial sum of the Fourier series we have 7r
J(x - t)Fn (t) dt , O"n (x) = _]:_ 21T
J
- 7r
where
1 ( Do (t) + · · · + Dn (t) ) . Fn (t) = n+1 Recalling the explicit form (18.48) of the Dirichlet kernel and taking ac count of the relation n sin2 ( n+2 1 ) t 1 ) -1 n I: sin k + -21 ) t = -21 sin -t L ( cos kt - cos(k + 1)t ) = . -1 t , 2 sm 2 k� k�
(
we find
(
sin2 n+2 l t . Fn (t) = (n + 1 ) sin2 21 t The function Fn is called the Fejer kernel, more precisely the nth Fejer kernel. 2 1 21 L. Fe}§ r ( 1880-1956) - well-known Hungarian mathematician.
18.2 Trigonometric Fourier Series
539
account of the original definition ( 1 8.47) of the Dirichlet kernel Dwhose ,n Taking one can conclude that the Fejer kernel is a smooth function of period 2n value equals ( n + 1) where the denominator of this last fraction equals zero. The properties of the Fejer and Dirichlet kernels are similar in many ways, but in contrast to the Dirichlet kernel, the Fejer kernel is also nonnegative, so that we have the following lemma. Lemma 2. The sequence of functions
L1n (x) = { 2�F0n (x), , ifif l xx l 7r , l l > 7r :S:
is an approximate identity on JR.
L1n (x)
Proof. The nonnnegativity of is clear. Equality ( 18.50) enables us to conclude that CXJ
7r
- CXJ
- 7[
7r
J L1n (x) dx = J .dn (x) dx = � J Fn (x) dx = 2
15
- 7[
Finally, for every > 0 7r
7r
J L1n (x) dx = J L1n (x) dx =JL1n (x) dx ::::; 2n ( n + 1 ) J sm. �x2xl -t 0 as n -t + oo. D Theorem 4. (Fejer) . Let f : IR -t C be a function of period 2n that zs absolutely integrable on the closed interval [ n] . If - li
0 :S:
- CXJ
a)
+ CXJ
li
1
li
li
-n,
f is uniformly continuous on the set E c IR, then
O"n (x) f(x) on E as n -t oo ;
b) f E C(IR, C) , then
::4
O"n (x) f(x) on 1R as n -t oo ; c) f is continuous at the point x E IR, then O"n (x) -t f(x) as n -t oo . ::4
540
18 Fourier Series and the Fourier Transform
Proof. Statements b) and c) are special cases of a) . Statement a) itself is a special case of the general Proposition 5 of Sect. 17.4 on the convergence of a convolution, since
an (x)
=
� 2rr
71"
J f(x - t):Fn (t) t d
- 71"
=
( ! * Ll n )(x) . D
Corollary 1 . (Weierstrass ' theorem on approximation by trigonometric polynomials) . If a function f : [-rr, rr] C is continuous on the closed interval [-rr, rr] and f ( -rr) = f(rr) , then this function can be approximated uniformly on the closed interval [-rr, rr] with arbitrary precision by trigono metric polynomials. Proof. Extending f as a function of period 2 rr, we obtain a continuous 2rr periodic function on JR, to which the trigonometric polynomials an (x) con verge uniformly by Fejer ' s theorem. D Corollary 2 . If f is continuous at x, its Fourier series either diverges at x or converges to f (x) . Proof Only the case of convergence requires formal verification. If the se quence Sn ( x ) has a limit as then the sequence an ( x ) = So (x) ��i Sn (x) has that same limit. But by Fejer ' s theorem an (x ) f ( x ) as and hence Sn (x ) f ( x ) also whenever the limit Sn (x) exists as D Remark 5. We note that the Fourier series of a continuous function really can diverge at some points.
�
�
n � oo,
�
n �n oo.� oo,
1 8 . 2 . 3 Smoothness of a Function and the Rate of Decrease of the Fourier Coefficients
a. An Estimate of the Fourier Coefficients of a Smooth Function We begin with a simple, yet important and useful lemma. Lemma 3 . (Differentiation of a Fourier series) . If a continuous function f E C ( [-rr, rr] , C) assuming equal values at the endpoints of the closed inter val [ -rr, rr] is piecewise continuously differentiable on [ -rr, rr] , then the Fourier series of its derivative
00
-
oo
can be obtained by differentiating formally the Fourier series
00 -
of the function itself, that is, c k (f ' )
=
oo
i kck (f ) ,
kEZ.
(18.63)
18.2 Trigonometric Fourier Series
541
Proof. Starting from the definition of the Fourier coefficients (18.44) , we find through integration by parts that 7r
7r
- 7[
- 7[
ck (f') = 27r Jf'(x)e - 1' kx dx = 217r f(x) e - 1' kx 1 7r- 1r + 2ik7r Jf(x)e - 1' k x dx = ikck (f), 1
since
f(1r) e - i k1f - f(- 7r ) e i k7r = 0.
D
Proposition 1. (Connection between smoothness of a function and the rate of decrease of its Fourier coefficients) . Let f E C ( m - ll ( [ - 7r , 7r ] , IC ) and f (j ) ( -1r) = fUl ( 7r ), j = 0, 1, . . . , m - 1 . If the function f has a piecewise continuous derivative f (rn ) of order m on the closed interval [ - 1r, 1r] , then
(18.64) and as moreover,
k --+ oo , k E 7l ;
(18.65)
00
L "'(� < oo. - oo
Proof. Relation ( 18.64 ) follows from an m-fold application of Eq. ( 18.63) : Setting "'/k
= l c k (f ( m l) l and using Bessel ' s inequality
we obtain ( 18.65 ) from ( 18.64 ) . D
Remark 6. In the proposition just proved, as in Lemma 3, instead of asssum ing the conditions f ( j l ( - 1r) = fUl ( 7r ), we could have assumed that f is a function of period 21r on the entire line.
Remark 7. If a trigonometric Fourier series were written in the form ( 18.37 ) , rather than in the complex form ( 18.37' ) , it would be necessary to replace the simple relations ( 18.64 ) by noticeably more cumbersome equalities, whose meaning, however, would be the same: under these hypotheses a Fourier series can be differentiated termwise whichever from it is written in, ( 18.37) or ( 18.37') . As for the estimates of the Fourier coefficients, a k (f) and b k (f) of ( 18.37 ) , since a k (f) = c k (f) + c_ k ( J) and b k (f) = i(c k (f) - c_ k (f)), (see formulas ( 18.43 ) ) it follows from ( 18.65 ) that if a function f satisfies the hypotheses of the proposition, then
542
18 Fourier Series and the Fourier Transform
(18.64') 00
where I: a: � < = '"Yk + '"Y-kk ·l
oo and kI:00= l (3� < oo, and we can assume that a:k = fJk =
b. Smoothness of a Function and the Rate of Convergence of its Fourier Series
-+
Theorem 5. If the function f : [ - n , n] C is such that a) f E C ( m - l ) [ - n , n] , m E N, b) f Ul ( - n ) = f Ul ( n ) , j = 0, 1 , . . . , m - 1 , c) f has a piecewise continuous mth derivative f ( m ) on [ - n , n ] , m :2: 1 , then the Fourier series of f converges absolutely and uniformly o n [ - n , n] to f, and the deviation of the nth partial sum Sn (x) of the Fourier series from f ( x) has the following estimate on the entire interval:
where
{En } is a sequence of positive numbers tending to zero.
Proof. We write the partial sum ( 1 8.40) of the Fourier series in the compact notation ( 18.40'):
n -n
According to the assumptions on the function f and Proposition 1 we have
l c k (f) l '"Yk / l k l m , and L '"Yk / l kml m < oo : since 0 :::; '"Yk / l k l m :::; � ('"'(� + 1/k2m ) and m :2: 1 , we have L '"Yk / l k l < oo. Hence the sequence Sn (x) converges =
uniformly on [ - n , n] ( by the Weierstrass M-test for series and the Cauchy criterion for sequences ) . By Theorem 3 the limit S(x) of Sn (x) equals f(x), since the function f satisfies the Dini conditions at each point of the closed interval [ - n , n] ( see Example 3) . And, since f ( - n ) = f (n) , the function f can be extended to lR as a periodic function with the Dini conditions holding at each point x E JR. Now, using relation (18.63) , we can proceed to obtain an estimate:
l f(x) - Sn (x) l = I S(x) - Sn (x) l = 00
± k=n+ l
I ± kf==n+ l ck ( f ) ikx l :::; e
00
± k =n+ l
18.2 Trigonometric Fourier Series
543
The first factor on the right-hand side of the Cauchy-Bunyakovskii inoo equality here tends to zero as -+ since L:; 'Y� <
n oo,
oo.
Next (see Fig. 18.2) 00
'"' 1/ L..... k =n + l
00 12m k -< J xd2mx = 2m1- 1 . _ 2m n -1 .
n
We thus obtain the assertion of Theorem 5. D
n n+ l
Fig. 1 8 . 2 .
X
In connection with these results we now make a number of useful remarks.
Remark 8. One can now easily obtain again the Weierstrass approximation theorem stated in Corollary 1 from Theorem 5 (and Theorem 3, of which essential use was made in the proof of Theorem 5), independently of Fejer ' s theorem. Proof It suffices to prove this result for real-valued functions. Using the uni form continuity of f on [-rr, rr] , we approximate f on this closed interval uniformly within c/2 by a piecewise-linear continuous function cp(x) assum ing the same values as f at the endpoints, that is, cp( -rr) = cp(rr) = f (rr) (Fig. 18.3) . By Theorem 5 the Fourier series of cp converges to cp uniformly on the closed interval [-rr, rr] . Taking a partial sum of this series that dify
Fig. 18.3.
X
544
18 Fourier Series and the Fourier Transform
fers from 'P(x) by less than c/2, we obtain a trigonometric polynomial that approximates the original function f within F: on the whole interval [ -1r, 1r] . D
Remark 9. Let us assume that we have succeeded in representing a function f having a jump singularity as the sum f = 'P '1/J of a certain smooth function '1/J and a simple function 'P having the same singularity as f (Fig. 18.4 a, c, b) . Then the Fourier series of f is the sum of the Fourier series of '1/J, which converges rapidly and uniformly by Theorem 5, and the Fourier series of the function 'P· The latter can be regarded as known, if we take the standard function 'P (shown in the figure as 'P( x) = -1r - x for -1r < x < 0 and 'P(x) = 1r - x for 0 < x < 1r) .
+
c.
Fig. 18.4.
This observation can be used both in theoretical and computational prob lems connected with series (it is Krylov ' s 22 method of separating singularities and improving the convergence of series) and in the theory of trigonometric Fourier series itself (see for example the Gibbs2 3 phenomenon, described be low in Problem 1 1 ) .
Remark 1 0. (Integration of a Fourier series) . By Theorem 5 we can state and prove the following complement to Lemma 3 on differentiation of a Fourier series. Proposition 2 . If the function f : [-1r, 1r] --+ C is piecewise continuous, 00 then after integration the correspondence f (x) "' E ck ( f ) ei k x becomes the oo equality -
X
j f ( t ) dt = c U) x + 2.:::oo , ckiV) ( ei kx - 1 ) , 0
o
00
-
22 A.N. Krylov ( 1863-1945) - Russian/Soviet specialist in mechanics and mathe matics, who made a large contribution to computational mathematics, especially in methods of computing the elements of ship::; . 2 3 J.W. Gibbs (1839-1903) - American physicist and mathematician, one of the founders of thermodynamics and statistical mechanics.
18.2 Trigonometric Fourier Series
545
where the prime indicates that the term with index k = 0 is omitted from the n sum; the summation is the limit of the symmetric partial sums 'L: , and the -n
series converges uniformly on the closed interval [ -1r, 1r] . Proof. Consider the auxiliary function F (x )
X
= j f(t) dt - co (J)x 0
on the interval [-1r, 1r] . Obviously F E C[-1r, 1r] . Also F( -1r)
F(1r) - F(-1r) =
7r
= F(1r) , since
J f(t)dt - 2Jrco(J) = 0 ,
as follows from the definition of c0 ( ! ) . Since the derivative F' (x) = f(x) 00 c0 (f) of the function F is piecewise continuous, its Fourier series I:: c k ( F )e i k x -
oo
converges uniformly to F on the interval [-1r, 1r] by Theorem 5. By Lemma 3 we have c k (F) = c k ;[') for k -1- 0. But c k (F') = c k (F) if k -1- 0. Now writing 00 the equality F(x) = I:: c k (F)ei k x in terms of the function f and noting that
F(O)
oo
= 0, we obtain the assertion of the proposition. -
0
18.2.4 Completeness of the Trigonometric System
a. The Completeness Theorem In conclusion we return once again from
pointwise convergence of the Fourier series to its mean convergence ( 1 8.41 ) . More precisely, using the facts we have accumulated on the nature of point wise convergence of the Fourier series, we now give a proof of the completeness of the trigonometric system { 1 , cos kx, sin kx; k E N} in R2 ( [-7r, 1r] , IR) inde pendent of the proof already given in the problems. In doing so, as in Subsect. 18.2. 1 , we take R2 ( [-1r, 1r] , IR) or R2 ( [-1r, 1r] ,
546
18 Fourier Series and the Fourier Transform
Every function can be approximated arbitrarily closely in mean a ) by functions of compact support in ] - 7!", K[ that are Riemann integrable over the closed interval [ - 7!", K]; b) by piecewise-constant functions of compact support on the closed inter val [ - K, K]; c ) by piecewise-linear continuous functions of compact support on the closed interval [-7!", K]; d) by trigonometric polynomials.
Theorem 6. (Completeness of the trigonometric system) .
f
E R2 [ -7!", 7!"]
Proof. Since it obviously suffices to prove the theorem for real-valued func tions, we confine ourselves to this case. a) It follows from the definition of the improper integral that
�
�-8
J� f2 (x) x = 8-r +O �j8 f2 (x) x lim
d
Hence, for every
E
d .
-+
> 0 there exists b > 0 such that the function
fo ( x )
=
{
l x l < 7l" - b , if 7l" - b :::; l x l :::; 7l" ,
f (x) , if 0
E,
[ - 7!", 7!"] by less than since � � - �+8 2 2 j (f - fo) (x) dx = j f (x ) dx + j f2 (x) dx . �-8 -� -� b) It suffices to verify that every function of the form fo can be approxi mated in R 2 ( [ - 7!", K], JR ) by piecewise-constant functions of compact support in [ - 7!", K]. But the function fo is Riemann-integrable on [ - K +b, 7!" - b]. Hence it is bounded there by a constant M and moreover there exists a partition - 7!" + b = x0 < x1 < · · · < Xn = 7!" -n b of this closed interval such that the corresponding lower Darboux sum 2::::: mu1x i of the function fo differs from i= l the integral of fo over [ - 7!" + J, 7!" - J ] by less than > 0.
will differ in mean from f on
Now setting
g(x) = we obtain
{ mi , 0,
E
x E]x i _ 1 , x i [ , i = 1 , . . . , n , at all other points of [ - 7!", K] , if
18.2 Trigonometric Fourier Series
547
11" 11" 2 11"- 0 ju, - g) (x) dx ::; jl f, -g)l l fo +gl (x ) dx ::; 2M j ( f, - g) (x ) dx ::; 2Mc , -11" - 11" - 11" +0 and hence fo really can be approximated arbitrarily closely in the mean on [-n , n ] by piecewise-constant functions on the interval that vanish in a
neighborhood of the endpoints of the interval. c) It now suffices to learn how to approximate the functions in b) in mean. Let be such a function. All of its points of discontinuity X I , . . . , X n lie in the open interval ] There are only finitely many of them, so that for every E > 0 one can choose 5 > 0 so small that the 5-neighborhoods of the points XI , . . . , X n are disjoint and contained strictly inside the interval ] , [ , and 25nM < E, where M = sup Now replacing the function l x l�" on - 5, i = 1 , . . . , n, by the linear interpolation between the values 5) that assumes at the endpoints of this interval, we 5) and obtain a piecewise linear continuous function that is of compact support in -1r, 1r] . By construction I M on [ -1r, 1r] , so that
g
- n , n[.
-n n l g (x) l. gg(x -[xi xi +g(x5] , + g i i g0 [ l g0 ( x) ::; 11" 2 11" j (g - g0 ) (x) dx ::; 2M j l g - gol(x) dx = -11" x; o - 11" n + = 2M L j l g - gol(x) dx ::; 2M . (2M . 25) . n < 4Mc , t = I x; - o
and the possibility of the approximation is now proved. d) It remains only to show that one can approximate any function of class ] by a trigonometric polynomial. But for every E > 0 c) in mean on and every function of type Theorem · 5 enables us to find a trigonometric polynomial Tn that approximates uniformly within E on the closed interval and the possibility of an arbitrarily Tn ) 2 Hence J <
[-n, n 0 g , g0 [-n, n]. -11"11" (g0 - dx 2nc2 , precise approximation in mean by trigonometric polynomials on [ 1r] for any function of class c) is now established. By the triangle inequality in Rd - n , 1r] we now conclude that all of The orem 6 on the completeness of these classes in R 2 [-1r , n ] is also proved. D b. The Inner Product and Parseval's Equality Now that the complete ness of the trigonometric system in R2 ([- n , n] , C) has been proved, we can use Theorem 1 to assert that ( 18.66 ) ak ( f) cos kx + bk ( f) sin kx f = ao�f) + f k =I for every function f E R 2 ([-n, n] ,C), or, in complex notation, - 7r ,
548
18 Fourier Series and the Fourier Transform 00 -
(18.67) oo
where the convergence is understood as convergence in the norm of R2 [ -1r, 1r] , that is, as mean convergence, and the limiting passage in (18.67) is the limit n of sums of the form Sn ( x ) = E as -n If we rewrite Eqs. (18.66) and ( 18.67) as
ck ( f) ei kx n --+ oo.
(18.66') (18.67') then the right-hand sides contain series in the orthonormal systems
{
vk-ei kx;
1
1 1 r:;; cos kx, r:;; sin kx; k E N } 7r v 7r v 7r
2 , �
y
k E Z} . Hence by the general rule for computing the inner and { product of vectors from their coordinates in an orthonormal basis, we can assert that the equality
� ( !, g) = ao ( f �ao (g) + kf=l ak ( f) ak (g) + bk ( f) bk (g) holds for functions f and g in R 2 ( [ -7r , 1r ] , C ) , or, in other notation, 00 1 , = (f g) 'L00 ck ( f) ck (g) , 7r 2
( 18.68)
( 18.69)
-
where, as always,
7r
(f,g) J f (x ) g (x ) dx . In particular, if f = g, we obtain the classical Parseval equality from (18.68) and (18.69) in two equivalent forms: 2 (18.70) � 11 ! 11 2 = l ao �)l + kf=l l ak ( f)l 2 + l bk (J W ' ( 18.71) � 11 ! 11 2 = f l ck ( f)l 2 =
2
- (X)
·
18.2 Trigonometric Fourier Series
549
We have already noted that from the geometric point of view Parseval ' s equality can be regaded as an infinite-dimensional version of the Pythagorean theorem. Parseval ' s relation provides the basis for the following useful proposition. Proposition 3. (Uniqueness of Fourier series) . Let f and g be two functions in 'R. 2 [-1r, 1r] . Then a ) if the trigonometric series
converges in mean to f on the interval [-1r, 1r] it is the Fourier series of f ; b) if the functions f and g have the same Fourier series, they are equal almost everywhere on [1r , ?r] , that is, f = g in 'R.2 [-7r, 7r] . Proof. Assertion a) is actually a special case of the general fact that the expansion of a vector in an orthogonal system is unique. The inner product , as we know (see Lemma lb) shows immediately that the coefficients of such an expansion are the Fourier coefficients and no others. Assertion b) can be obtained from Parseval ' s equality taking account of the completeness of the trigonometric system in R. 2 ( [-7r, ?r] , C ) , which was just proved. Since the difference (! - g) has a zero Fourier series, it follows from Par seval ' s equality that II / - g ll n2 = 0. Hence the functions f and g are equal at all points of continuity, that is, almost everywhere. D Remark
11.
f:=O t<�!( a) ( x - a ) n we noted previn 00
When studying Taylor series
ously that different functions of class C ( ) (IR, lR) can have the same Taylor series (at some points a E lR). This contrast with the uniqueness theorem just proved for the Fourier series should not be taken too seriously, since every uniqueness theorem is a relative one in the sense that it involves a particular space and a particular type of convergence. For example, in the space of analytic functions (that is, functions that can 00 be represented as power series 2:: an ( z - z0 ) n converging to them pointwise) ,
n=O
two different functions have distinct Taylor series about every point . If, in turn, in studying trigonometric series we abandon the space 'R.2 [ -?r, 1r] and study pointwise convergence of a trigonometric series, then, as already noted (p. 531) one can construct a trigonometric series not all of whose coefficients are zero, which nevertheless converges to zero almost everywhere. According to Proposition 3 such a null-series of course does not converge to zero in the mean-square sense.
550
18 Fourier Series and the Fourier Transform
In conclusion, we illustrate the use of the properties of trigonometric Fourier series obtained here by studying the following derivation, due to Hur witz, 2 4 of the classical isoperimetric inequality in the two-dimensional case. In order to avoid cumbersome expressions and accidental technical difficulties, we shall use complex notation. Example 7. Between the volume V of a domain in the Euclidean space En , n ?: 2 , and the ( n - 1 )-dimensional surface area F of the hypersurface that bounds it , the following relation holds:
( 18. 72 ) called the isoperimetric inequality. Here Vn is the volume of the n-dimensional unit ball in E n . Equality in the isoperimetric inequality ( 18. 72) holds only for the ball. The name "isoperimetric" comes from the classical problem of finding the closed plane curve of a given length L that encloses the largest area S. In this case inequality (18. 72) means that
(18. 73) It is this inequality that we shall now prove, assuming that the curve in question is smooth and is defined parametrically as x = cp(s) , y = '1/J(s) , where s is arc length along the curve and cp and '1/J belong to c< l ) [0, L] . The condition that the curve be closed means that cp(O) = cp(L) and '1/J(O) = '1/J(L) . We now pass from the parameter s t o the parameter t = 27r f - 1r, which ranges from -7f to 1r, and we shall assume that our curve is defined paramet rically as y = y(t) , x = x(t) , - 7f � t � 7f , ( 18.74) with y( -7r) = y(7r) . (18.75) We write (18.74) as a single complex-valued function
z
= z(t) ,
-7f
�
t
�
where z(t) = x(t) + i y(t) and by (18.75) z ( -1r) We remark that
l z ' (tW
7f ,
= z (1r) .
(18.74 ' )
= (x ' (t)) 2 + (y' (t)) 2 = (�;f ,
and hence under our choice of parameter
( 18. 76) 2 4 A. Hurwitz (1859-1919) - German mathematician, a student of F. Klein.
551
18.2 Trigonometric Fourier Series
Next , taking into account the relations zz' = ( x - iy ) ( x' iy' ) = ( xx' yy ' ) i ( xy' - x'y ) , and using Eqs. ( 18.75) , we write the formula for the area of the region bounded by the closed curve (18.74) :
+
+
+
S=
� j( IT
xy ' - yx ' ) ( t ) dt
= ;i
j IT
z ' ( t ) z ( t ) dt .
(18.77)
We now write the Fourier series expansion of the function (18.74') : (X) z ( t ) = L c k ei k t . (X) Then (X) z ' ( t ) "' L ik ck ei k t . (X) Equalities (18.76) and (18.77) mean in particular that -
-
�2 ll z' ll 2 = 2� ] l z' ( t ) l 2 dt = 4�22 , - IT
and
2_ 2n ( z ' , z )
= 2_ 2n
IT
j
z ' ( t ) z ( t ) dt
= !:_ S . 1r
In terms of Fourier coefficients, as follows from Eqs. (18.69) and (18.71 ) , these relations assume the form (X) L 2 = 4n 2 L l kck l 2 , (X) -
Thus,
(X) L2 - 4n S = 4n 2 L ( 2 - ) l l 2 . (X) The right-hand side of this equality is obviously nonnegative and vanishes only if 0 for all E Z except = 0 and 1. Thus, inequality (18.73) is proved, and at the same time we have found the equation z (t) - 7r ::::: t ::::: 7r ' of the curve for which it becomes equality. This is the complex form of the parametric equation of a circle with center at in the complex plane and radius -
ck =
l c1l ·
k
k = co + c1 e'" t ,
k k ck k= c0
552
18
Fourier Series and the Fourier Transform
1 8 . 2 . 5 Problems and Exercises 1.
a) Show that
00 """' sm nx 1r - x for 0 < x < 21r , L.. 2 n =l n and find the sum of the series at all other points of JR. Using the preceding expansion and the rules for operating with trigonometric Fourier series, now show that the following equalities are true.
b) � L..J
.
_
_
si n 2 k x - for 0 < X < 7r . k=l 2 k = 4 2 s i n( 2kk -ll ) x 4 for O < X < 7r . c) � L..J 2k=l 00 ( d ) I; � sin nx = � for lxl < n =l e) x 2 = i + 4 f= < -n't cos nx for l x l < n =l 4 � cos( 2 k - l ) x rr f) x - 2 - -; L..J ( 2 k - l 2 for 0 � x � ) k=l 2!:
=
'E.
2!:
)n- 1
1r .
_
1r . 1r .
h) Sketch the graph of the sums of the trigonometric series here over the entire real line JR. Using the results obtained, find the sums of the following numerical series: oo 00 00
(-1) n
1
(-1t
' n2 . =O 2n + 1 ' nL =l n 2 nL: =l nL
Show that: a) if f : [ - 1r , 1r ] -+ IC is an odd (resp. even) function, then its Fourier coefficients have the following property: a k (f) = 0 ( resp. bk (f) 0) for k = 0 , 1, 2, . . . ; b) if f : lR -+ IC has period 27r/m then its Fourier coefficients c k (f) can be nonzero only when k is a multiple of m; c) if f : [-1r, 1r] -+ lR is real-valued, then c k (f) c _ k ( J ) for all k E N; d) la k (f) l � 2 sup l f(x) l , l bk (f) l � 2 sup l f(x) l , lck (f) l � sup l f(x) l . 2.
=
=
l x l
l x l
l x l
3. a) Show that each of the systems {cos kx; k = 0, 1, . . . } and {sin kx; k E N} is complete in the space R 2 [a, a + 1r] for any value of a E JR. b) Expand the function f(x) = x in the interval [0, 1r] with respect to each of these two systems. c) Draw the graphs of the sums of the series just found over the entire real line. d) Exhibit the trigonometric Fourier series of the function f(x) = l x l on the closed interval [-1r, 1r] and determine whether it converges uniformly to this function on the entire closed interval [ -7r, 1r] .
I8.2 Trigonometric Fourier Series 4.
553
The Fourier series f C k (f)ei k t of a function f can be regarded as a special case
of a power series f Ck Z k - (X)
(
=
E Ck Z k + f0 Ck Z k ) , in which z is restricted to the
- (X)
unit circle in the complex plane, that is, z = ei t . Show that if the Fourier coefficients Ck (f) of the function f : [ 1r] -+
-?T
-
-
.
. t he expans10ns cos z = e ) usmg verify that
+ I)x f ) - I t cos(2n (2n + I ) ! + I)x I) - I t sin(2n (2n + I ) ! f:( - I t cos 2nx (2n) ! f:( - I t sin(2n!)2nx
n=O
n=O
n =O
n=O
z2 21
00
00 ·
-I-
,
·
·
z• + 4T -
· ·
·
. z=z and sm
-
z3 3T
zs + 5T
- , · ·
·
sin( cos x) cosh(sin x) , sin( cos x) sinh(sin x) , cos( cos x) cosh(sin x) , cos( cos x) sinh(sin x) .
Verify that a) the systems { I , cos k 2; x, sin k � x ; k E N } and { ei k 2T x ; k E Z } are orthog onal in the space R2 ( [a, a + T] ,
18 Fourier Series and the Fourier Transform
554
c) if C k (f) and
Ck (g) are the Fourier coefficients of T-periodic functions f and a+ oo 1 iT (x) dx = ck ( j ) ck ( g) ; f(x) L g T oo a d) the Fourier coefficients c k (h) normalized by the factor t of the "convolution " T h(x) = � I f(x - t)g(t) dt
g, then
0
of T-periodic smooth functions f and g and the coefficients c k (f) and functions themselves are related by C k (h) c k (f)c k (g), k E Z. 6. Prove that if a is incommensurable with 7!', then N " i kt .. e dt · a) N-t lim ..!.. L e i k ( x+n a ) ...!. J N 2 7r oo n =I b) for every continuous 27r-periodic function f : JR. --+ IC 1 1 N lim f(x + na ) = j (t) dt . N� oo N L 27!' n= l
=
=
-
ck (g) of the
'
7T
" I
- 7r
7. Prove the following propositions. a) If the function f JR. --+ IC is absolutely integrable on JR., then
:
1 1 f(x)eiAx dx l l l f (x + �) - j (x) l dx . -
S
(X)
- CX)
b) If the functions f : JR. --+ IC and g : JR. --+ IC are absolutely integrable on JR. and g is bounded in absolute value on JR., then I j (x + t)g(t) e''A t dt =:
oo .
c) If f : JR. --+ IC is a 27r-periodic function that is absolutely integrable over a period, then the remainder Sn (x) - j (x) of its trigonometric Fourier series can be represented as 7r
Sn (x) - j(x) = � I( L1? f)(x , t)Dn (t) dt , 0
where D n is the nth Dirichlet kernel, and ( L1 2 j )(x, t) = f(x + t) - 2 j (x) + f(x - t). d ) For every 8 E]O, 7r[ the formula for the remainder just obtained can be brought into the form 0
sin nt (L1 2 f)(x, t) dt o (1) , Sn (x) - f(x) = - I -+ t 1
7l'
0
where o ( 1 ) tends to zero as n --+ oo , and uniformly on each closed interval [a, b] on which f is bounded.
18.2 Trigonometric Fourier Series
555
e) If the function f : [-1r, 1r] -+ C satisfies the Holder condition i f(xi ) - J(x 2 ) l :S Mlx 1 - x 2 l"' on [-1r, 1r] (where M and a are positive numbers) and in addition f( -1r) = j(1r) , then the Fourier series of f converges to it uniformly on the entire interval. 8. a) Prove that if f : R -+ R is a 21r-periodic function having a piecewise contin uous derivative f ( m ) of order m (m E N) , then f can be represented as
f(x) where Bm (u)
=
�0 + � I Bm (t - x)f ( m ) (t) dt , .,.
_ .,.
L
00 cos(ku+ 2� ) , k '"'
m E N. n= l b) Using the Fourier series expansion obtained in Problem 1 for the function " ;"' on the interval [0 , 21r] , prove that B (u) is a polynomial of degree 1 and Bm (u) 1 is a polynomial of degree m on the interval [0 , 27r] . These polynomials are called the Bernoulli polynomials. 2 11" c) Verify that J Bm (u) du = 0 for every m E N. 9.
a) Let Xm
=
=
0
;;_;;_ , m = 0, 1, . . . , 2n. Verify that
2n 2 cos kxm cos lxm = 8kl , L 2n + 1 m=O 2n 2 '""' sin kxm sin lxm = 8kl , 2n + 1 m� =O 2n sin kxm cos lxm = 0 , L m =O where k and l are nonnegative integers, 8kl = 0 for k #- l, and 8kl = 1 for k = l. b) Let f : R -+ R be a 27r-periodic function that is absolutely integrable over a period. Let us partition the closed interval [0, 27r] into 2n + 1 equal parts by the points Xm = ;;_;;_ , m = 0, 1 , . . . , 2n. Let us compute the integrals ---
a k (f)
=
2 11"
� I f ( x) cos kx dx , 0
bk (f)
=
2 11"
� I f ( x) sin kx dx 0
approximately using the rectangular method corresponding to this partition of the interval (0, 21r] . We obtain the quantities 2n 2 cos kx m , L 2n + 1 m =O f(xm) 2n 2 '""' f(xm) sin kxm , 2n + 1 m� =O
--
556
18 Fourier Series and the Fourier Transform
which we place in the nth partial sum Sn (f, x) of the Fourier series of f instead of the respective coefficients a k (f) and b k (f). Prove that when this is done the result is a trigonometric polynomial Sn (f, x) of order n that interpolates the function f at the nodes Xm , m = 0, 1 , . . . , 2n, that is, at these points f(x m ) = Sn (x, m ) . 1 0 . a) Suppose the function f : [a, b] --+ R is continuous and piecewise differ entiable, and suppose that the square of its derivative f' is integrable over the interval ]a, b[. Using Parseval's equality, prove the following: a) if [a, b] = [0, 1rj , then either of the conditions f(O) implies Steklov 's inequality
=
j ( 1r )
=
0 or I f(x) dx = 0 7r
0
I f2 (x) dx ::; l (f') 2 (x) dx , 7r
7r
0
0
in which equality is possible only for f(x)
=
a cos x;
b) if [a, b] = [ - 1r , 1r] and the conditions f( - 1r ) hold, then Wirtinger 's inequality holds: 7r
=
j ( 1r ) and I f(x) dx = 0 both 7r
7r
I l (x) dx ::; I (J' ) 2 (x) dx , where equality is possible only if f(x)
=
a cos x + b sin x.
1 1 . The Gibbs phenomenon is the behavioral property of the partial sums of a trigonometric Fourier series described below, first observed by Wilbraham (1848) and later (1898) rediscovered by Gibbs ( Mathematical Encyclopedia, Vol. 1, Moscow, 1977) . a) Show that
ao sin(2k - 1)x for x i < 7r . sgn x = -7r4 L I k= l 2 k - 1 b) Verify that the function Sn (x) = � £: s i n ��k_-/ ) x has a maximum at x = 27\"n k= l and that as n --+ oo 7r 2 ...f-, sin(2k - 1 ) 271"n -1r 2 sin x Sn = - � --+ - -dx � 1 . 179 . 2n 1r (2k - 1) n 1r x 2n k= l 0 Thus the oscillation of Sn ( x) near x = 0 as n --+ oo exceeds the jump of the function sgn x itself at that point by approximately 18% (the jump of Sn (x) "due to inertia" ) . c ) Describe the limit of the graphs of the functions Sn (x) i n problem b) . Now suppose that Sn (f, x) is the nth partial sum of the trigonometric Fourier series of a function f and suppose that Sn (f, x) --+ f(x) in a deleted neighborhood
( ) -
7r
·
11\"
18.2 Trigonometric Fourier Series
557
0 < ix - .;1 < J of the point .; as n --+ oo and that f has one-sided limits f(.; _ ) and f(.;+ ) at .;. For definiteness we shall assume that f (.; - ) :::; f(.;+ ) · We say that Gibbs ' phenomenon occurs for the sums Sn (!, x) at the point .; if lim Sn (f, x) < f(.; - ) ::; f(.; + ) < lim S (f, x) . n-->oo n n-->oo d) Using Remark 9 show that Gibbs' phenomenon occurs at the point .; for every function of the form tp(x) + c sgn (x - .;) , where c oj 0, 1.;1 < 1r, and tp E CC l l [-7r, 7r] . 12.
Multiple trigonometric Fourier series. a) Verify that the system of functions
e i k x , where k = (k 1 , . . . , k n ) , X = (x 1 , . . . , X n ) , kx = k 1 X 1 + + k n X n , and ( 2 1r �n ; 2 k 1 , . . . , kn E /Z, is orthonormal on any n-dimensional cube I = { x E lR n I aj :'::: Xj :'::: · ·
ai + 27r, j = 1 , 2, . . . , n}.
b) To a function f that is integrable over I we assign the sum f
"'
·
f: Ck (f) e i k x ,
- oo which is called the Fourier series of f in the system { 2 " �n 1 2 ei k x }, if C k (f) = - i k x dx. The numbers C k (f) are called the (Fourier coefficients of f 2( " �n ; 2 II f (x)e · the system { 27r 1n / 2 ei kx } . tn ( ) In the multidimensional case the Fourier series is often summed via the partial sums SN (x) = L Ck (f) e i k x , i k i :S N where lkl :'::: N means that N = (N1 , . . . , Nk ) and l ki l :'::: Nj , j = 1 , . . . , n. Show that for every function f(x) = f(x 1 , . . . , x n ) that is 21r-periodic in each variable
where D Nj (u) is the Njth one-dimensional Dirichlet kernel. c) Show that the Fejer sum
of a function f(x) represented as
f(x l , . . . , X n ) that is 27r-periodic in each variable can be
n where P N (u) = IT :FNj (ui ) and :FNj is the Njth one-dimensional Fejer kernel. j=l d) Now extend Fejer's theorem to the n-dimensional case. e) Show that if a function f that is 21r-periodic in each variable is absolutely integrable over a period I, possibly in the improper sense, then I if(x + u) I f(x) l dx --+ 0 as u --+ 0 and I i f - aN i (x) dx --+ 0 as N --+ oo . -
I
558
18
Fourier Series and the Fourier Transform
f) Prove that two functions f and g that are absolutely integrable over the cube
I can have equal Fourier series (that is, C k (f) = C k g) for every multi-index k) only if f(x) = g(x) almost everywhere on I. This is a strengthening of Proposition 3 on
uniqueness of Fourier series. g) Verify that the original orthonormal system { (27r�n / 2 e i k x } is complete in R2 ( I) , so that the Fourier series of every function f E R2 ( I ) converges to f in the mean on I . h) Let f be a function in c C =l (JR n ) of period 21r in each variable. Verify that ck (f ( "' ) ) = i l "' l k "' c k (f) , where a = (a 1 , . . . , a n ), k = (k 1 , . . . , kn ), l a l = l a 1 l + + k';: n , and aj are nonnegative integers. I a n I , k"' = kr' i) Let f be a function of class c C mn l (JR n ) of period 21r in each variable. Show that if the estimate 1 - ("') 2 (x) dx < M 2 (27r) n IJ I · · ·
·
.
.
.
·
/
holds for each multi-index j = 1, . . . , n ) , then
I
a = (a 1 , . . . , a n )
such that
aj
1s 0 or
m
(for every
IJ (x) - Sn (x) l :::: NCM m- 2 where N = min{N1 , . . . , Nn } and C is a constant depending on m but not on N or -1 ,
E I. j) Notice that if a sequence of continuous functions converges in mean on the interval I to a function f and simultaneously converges uniformly to '{), then f(x) = 'P(x) on I. Using this observation, prove that if a function f JR n ---+ C of period 27r in each variable belongs to c C l l (JR n , C), then the trigonometric Fourier series of f converges to f uniformly on the entire space JR. n .
X
:
:
1 3 . Fourier series of generalized functions. Every 21r-periodic function f lR ---+ C can be regarded as a function f ( s) of a point on the unit circle r (the point is fixed by the value of the arc-length parameter s, 0 :::: s :::: 21r) . Preserving the notation of Sect. 17 . 4, we consider the space V(r) on r consist ing of functions in c C = l ( r) and the space V' ( r) of generalized functions, that is, continuous linear functionals on V(r) . The value of the functional F E V' (r) on the function 'P E V( r) will be denoted F ( 'P) , so as to avoid the symbol (F, 'P) used in the present chapter to denote the Hermitian inner product (18.38). Each function f that is integrable on r can be regarded as an element of V' ( r) (a regular generalized function) acting on the function 'P E V(r) according to the formula 71"
2
f('P)
=
j f(s)'{J(s) ds . 0
Convergence of a sequence { Fn } of generalized functions in V' ( r) to a general ized function F E V' ( r) , as usual, means that for every function 'P E V( r)
18.2 Trigonometric Fourier Series
559
a) Using the fact that by Theorem 5 the relation rp( s ) = f: Ck ( rp )e i k x holds for 00
f: ck (rp) , show that in the -oo sense of convergence in the space of generalized functions V' (r) we have n 1 """" - e i k s -+ 8 as n -+ oo . L..- 27r k=n every function rp E c
-
Here 8 is the element of V' ( r) whose effect on the function rp E V( F) is defined by 8(rp) = rp(O) . b ) If f E R( F), the Fourier coefficients of the function f i n the system { e i k s } defined in the standard manner, can be written as 2 1f ck (f) = 2 f ( s )e - i k x dx = 2 f (e - i k s ) .
�
�I 0
By analogy we now define the Fourier coefficients c k (F) of any generalized function F E V' (r) by the formula Ck (F) = 2� F(e - i k • ) , which makes sense because e - i k s E V(r) . Thus to every generalized function F E V' ( r) we assign the Fourier series - oo 00 Show that 8 "' I:; 2� e i ks . - oo c) Prove the following fact, which is remarkable for its simplicity and the free dom of action that it provides: the Fourier series of every generalized function F E V' ( r ) converges to F (in the sense of convergence in the space V' ( r )) . d) Show that the Fourier series of a function F E V' ( r ) (like the function F itself, and like every convergent series of generalized functions) can be differentiated termwise any number of times. e) Starting from the equality 8 = f: 2� e i k s , find the Fourier series of 8' . - oo f) Let us now return from the circle r to the line lR and study the functions e i k s as regular generalized functions in V' (.IR) (that is, as continuous linear functionals on the space V(.IR) of functions in the class c6 oo) (.IR) of infinitely differentiable functions of compact support in .IR) . Every locally integrable function f can be regarded as an element of V' (.IR) (a regular generalized function in V' (.IR) ) whose effect on the function rp E c6 oo) (.IR, iC) is given by the rule f(rp) = J f(x)rp(x) dx. Convergence in V' (.IR) is defined in the IR standard way:
( nlim�� Fn = F) := Vrp E V(.IR) ( nlim�� Fn (rp) = F(rp) )
560
18 Fourier Series and the Fourier Transform Show that the equality
� f ei kx f 8(x - 27rk)
2
-
ex:>
=
-
oo
holds in the sense of convergence in V' (IR) . In both sides of this equality a n limiting passage is assumed as n --+ oo over symmetric partial sums 2: , and -n 8(x - xo ) , as always, denotes the 8-function of V' (R) shifted to the point xo , that is, 8(x - xo) (cp) = cp(xo) .
1 8 . 3 The Fourier Transform 18.3. 1 Representation of a Function by Means of a Fourier Integral
The Spectrum and Harmonic Analysis of a Function Let f(t) be aa.function T-periodic function, for assume examplethata periodic signalf iswithabsolutely frequencyintegrable � as a of time. We shall the function over period.theExpanding f in a Fourier series (when f is sufficiently regular, as wea know, Fourier series converges to f) and transforming that series, f(t) = ao�f) + f. a k (f) cos kwot + bk (f) sin kw t k= l = :�::>k (f)ei kwa t = Co + 2 L00 i ck i cos(kwot + arg ck ) , ( 18.78) k= l wemeanobtain a representation of f as a sum of a constant term � = co - the value of f over a period and sinusoidal components with frequencies v0and=so� on.(theInfundamental frequency), 2 vo (the second harmonic frequency), general the kth harmonic component 2jc i cos (k � t + arg c ) of the signal has frequency kvo = � ' cyclic frequency kwk 0 = 2rrkvo = �kk, amplitude 2ic k i = Ja� + b�, and phase arg c k = - arctan � The expansion of isa called periodicthefunction (signal) into a sum of simple harmonic oscillations harmonic analysis of f. The numbers {ck (f); k E Z} or {a0 (f), a k (f), bk (f); k E N} are called the spectrum of the function (signal) f. A periodic function thus has a discrete spectrum. Let us now set out (on a heuristic level) what happens to the expansion ( 18. 78) when the period T of the signal increases without bound. Simplifying the notation by writing l = f and O:k = kf, we rewrite the expansion 00 f(t) = L Ck ei k f t o
-
oo
-
-
oo
=
18.3 The Fourier Transform
as follows: f(t)
where
= f= (ck � ) e i k f t y , -oo
561
( 18.79)
l
Ck = ;l j f (t) e- i o:k t dt -l
and hence
l
Ck -l = -1 J f (t) e -IO:k t dt . 21f
1f
l
.
-l
Assuming that in the limit as --+ +oo we arrive at an arbitrary function f that is absolutely integrable over �. we introduce the auxiliary function
c(o:)
= 2�
00
j f (t) e- io:t dt ,
( 18.80)
-oo
whose values at points o: = O: k differ only slightly from the quantities formula (18.79) . In that case f (t)
�
kf
2:.:: c ( o: k ) eic� k t y ' -oo 00
f.
Ck � in ( 18.81)
where O: k = and O: k+ l - O: k = This last integral resembles a Riemann sum, and as the partition is refined, which occurs as --+ oo , we obtain
00
f(t) =
j c(o:)eio:t do: .
-
l
( 18.82)
00
Thus, following Fourier, we have arrived at the expansion of the function
f into a continuous linear combination of harmonics of variable frequency
and phase. The integral ( 18.82) will be called the Fourier integral below. It is the continuous equivalent of a Fourier series. The function c( o:) in it is the analog of the Fourier coefficient , and will be called the Fourier transform of the function f ( defined on the entire line � ) . Formula (18.80) for the Fourier transform is completely equivalent to the formula for the Fourier coefficients. It is natural to regard the function c(o:) as the spectrum of the function ( signal ) f . In contrast to the case of a periodic signal f considered above and the discrete spectrum ( of Fourier coefficients ) corresponding to it , the spectrum c(o:) of an arbitrary signal may be nonzero on whole intervals and even on the entire line ( continuous spectrum) .
562
18 Fourier Series and the Fourier Transform
{
Example 1 . Let us find the function having the following spectrum of compact support: h ' if l a l � a , c (a) = (18.83) 0 , if l a l a . Proof. By formula ( 18.82) we find, for t =f. 0
f( t ) and when t t -+ 0. D
=
a
I heiat da
-a
=
>
ei h a t � e i at t
=
2h
�
si at
'
(18.84)
= 0, we obtain f(O) = 2 ha , which equals the limit of 2 h "i� a t as
The representation of a function in the form ( 18.82) is called its Fourier integral representation. We shall discuss below the conditions under which such a representation is possible. Right now, we consider another example.
Example 2. Let P be a device having the following properties: it is a linear signal transform, that is, P 2;:: aj fJ = 2;:: aj P(fJ ) , and it preserves the
(
)
J
J
periodicity of a signal, that is, P(eiwt) = p(w) e iwt , where the coefficient p(w) depends on the frequency w of the periodic signal e iwt . We use the compact complex notation, although of course everything could be rewritten in terms of cos wt and sin wt. The function p(w) = : R(w) ei cp ( w ) is called the spectral characteristic of the device P. Its absolute value R( w ) is usually called the frequency characteristic and its argument 'P ( w ) the phase characteristic of the device P. A signal eiwt , after passing through the device, emerges transformed into the signal R (w ) ei (wt + cp (w )) , its amplitude changed as a result of the factor R (w ) and its phase shifted due to the presence of the term 'P ( w ) . Let us assume that we know the spectral characteristic p( w ) of the device P and the signal j ( t ) that enters the device; we ask how to find the signal x ( t ) = P(f) (t) that emerges from the device. Representing the signal f(t) as the Fourier integral ( 18.82) and using the linearity of the device and the integral, we find x (t)
=
P(f) (t)
In particular, if
p (w ) =
{
=
00
I c(w)p(w)eiwt dw . oo
-
� fl , l w l > fl '
1 for l w l
0 for
( 18.85)
18.3 The Fourier Transform
then
x(t)
=
n
J
-n
563
c(w)eiw t dw
and, as one can see from the spectral characteristics of the device, for lw l � n , for lw l > n
0
A device P with the spectral characteristic ( 18.85) transmits (filters) fre quencies not greater than n without distortion and truncates all of the signal involved with higher frequencies (larger than il ) . For that reason, such a de vice is called an ideal low-frequency filter (with upper frequency limit n) in radio technology. Let us now turn to the mathematical side of the matter and to a more careful study of the concepts that arise. b. Definition of the Fourier Transform and the Fourier Integral In accordance with formulas ( 18.80) and ( 18.82) we make the following defini tion. Definition 1 . The function
F[ f ] ( � )
00
2_ J f x
:=
21f
- oo
( ) e - i f; x dx
( 18.86)
is the Fourier transform of the function f : lR -+ C. The integral here is understood in the sense of the principal value
oo
Joo f(x)e-if;x x
-
d
:=
A
J f(x)e-if;x x
A -lim t +oo A -
d ,
and we assume that it exists. If f : 1R -+ C is absolutely integrable on JR, then, since l f (x)e - i x f; l = l f(x) l for x, � E JR, the Fourier transform ( 18.86) is defined, and the integral ( 1 8.86) converges absolutely and uniformly with respect to � on the entire line JR.
Definition 2. If c(�) = F[ f ] ( � ) is the Fourier transform of f : lR -+ C, then the integral assigned to f, 00
f (x)
rv
Joo
-
c(�)e i x f; d� ,
understood as a principal value, is called the Fourier integral of f.
( 1 8.87)
564
18 Fourier Series and the Fourier Transform
The Fourier coefficients and the Fourier series of a periodic function are thus the discrete analog of the Fourier transform and the Fourier integral respectively. Definition 3. The following integrals, understood at> principal values,
00 � J f(x) cos �x dx , oo 00 . 1 T. [J] (�) : = ; J f(x) sm �x dx , 00 Fe [!] (�) : =
( 18.88)
-
( 18.89)
-
are called respectively the Fourier cosine transform and the Fourier sine transform of the function f. Setting c(�) = F[/] (0 , a(�) = Fc [J] (O , and b(O = Fs [/] (0 , we obtain the relation that is already partly familiar to us from Fourier series 1 . c(O = 2 (a(�) - zb(�) ) .
As can be seen from relations ( 18.88 ) and ( 18.89 ) , a(-0 = a(�) ,
b( -0 = -b(�) .
(18.90) ( 18.91 )
Formulas ( 18.90 ) and ( 18.91 ) show that Fourier transforms are completely determined on the entire real line lR if they are known for nonnegative values of the argument. From the physical point of view this is a completely natural fact - the spectrum of a signal needs to be known for frequencies w ;:::: 0; the negative frequencies o: in ( 18.80 ) and ( 18.82 ) - result from the form in which they are written. Indeed, A
J
-A
A
A
0
c(Oei x � d� =
( J J) +
0
-A
c(�)ei x � d� =
J
( c(�)ei x � + c( -0e - i x � ) d� =
0
A
=
J
(a(�) cos x� + b(O sin x�) d� ,
0
and hence the Fourier integral ( 18.87 ) can be represented as
00 J (a(O cos x� + b(�) sin x�) d� , 0
(18.87')
18.3 The Fourier Transform
565
which is in complete agreement itwith classical form (of1 8.90) a Fourier Ifthatthe function f is real-valued, followsthe from formulas and ( series. 18.91) ( 1 8.92) sincefromin this casedefinitions a(�) and(18.88) functions on JR., as one can b(�) areandreal-valued see their ( 18.89 ) On the other hand, under the assumption( 18.86) f(x) =off(x), Eq. ( 18.92 ) can be obtained immediately from the definition the Fourier transform, if we take into account that the conjugation sign can be moved under the integral sign. This last observation allows us to conclude that ( 18.93 ) F[/]( -�) = F[f] (�) for Iteveryis alsofunction ftoJR.note -+ IC. useful that if f is a real-valued even function, that is, f(x) = f(x) = f( -x), then Fc [J] (�) = Fe[J] (� ) , Fs[J] ( �) 0 , ( 18.94 ) F[f] (�) = F[f] ( 0 = F[f] ( - � ) ; and if f is a real-valued odd function, that is, f(x) = f(x) = -f( -x), then Fe[J] (�) 0 , Fs[J] (�) = Fs[J] (� ) , ( 18.95 ) F[f] (�) = - F[f] (� ) = F[f] ( - �) ; and if f is a purely imaginary function, that is, f(x) = -f(x), then ( 18.96 ) F[/] ( - �) = - F[f] (�) . that asif f is a real-valued function, its Fourier integral ( 18.87' ) canWealsoremark be written 00 00 j y'a2 (�) + b2 (�) cos(x� + cp(�)) d� = 2 j l c(�) I cos(x� + cp(�)) d� , :
=
=
0
0
where cp(�) = - arctan !i�� = argc(�). Example Let us find the Fourier transform of f(t) f(O) = a E JR.). 3.
si� at
(assuming
18 Fourier Series and the Fourier Transform
566
A
F[f] (a)
= A-t+oo lim __!___27f J sint at e� ia �A
__!___
= A-t+oo lim 27f
d
dt =
A
+oo 2_ sin at cos at dt = J sin at cos at dt = J t
�A
27f
0
t
( sin (a + a)t + sin ( a - a)t ) dt = J 27f t t oe sin u { � sgn a ' if I a I :::; Ia I ' 1 ( =sgn (a + a) + sgn ( a - a)) J - du = 27r u 0, if lal > l a l , since we know the value of the Dirichlet integral 00 sinu d 7r ( 18 9 7 ) J -2 u Hence if we assume a ;:::: 0 and take the function f(t) = 2h s i �a t of Eq. ( 18.84) , we find, as we should have expected, that the Fourier transform is the spectrum of this function exhibited in relations (18.83). The transform function fhasin discontinuities. Example 3 is notThatabsolutely integrable on ofJR,anandabsoits Fourier the Fourier transform lutely integrable function has no discontinuities is attested by the following lemma. Lemma 1 . If the function f lR -t
+oo 0
0
U - - .
.
_
0
:
-t oo .
R
�A
�A
18.3 The Fourier Transform
establishes that the integral A
567
;7r J f(x)e -ix� dx , with respect to �; and the uniform convergence of this integral asis continuous A --+ +oo enables us to conclude that F [ / ] E C(JR., C) . 0 Example 4. Let us find the Fourier transform of the function f(t) = e- t 2 1 2 : +oo +oo 2 / t t a 2 -i e- t2 / 2 cosatdt . dt = e e J F[/](a) = J oo - oo Differentiating integrating by parts,thiswelastfindintegral that with respect to the parameter a and then d:�J (a) + aF [f] (a) = 0 , or d ln F[/](a) = -a . da - <>2 / 2 , where c is a constant which, using It follows that [ (a) = ce f ] F the Euler-Poisson integral (see Example 17 of Sect. 17.2) we find from the relation + oo c = F[/] (0) = j e - t2 1 2 dt = V2/ff . - 00 Thus we have found that F2 / 2[/] (a) = ,;2iie - <> 2 / 2 , and simultaneously shown that Fc[f] (a) = ,;2iie - <> and Fs [/] (a) 0. Normalization of the Fourier Transform We obtained the Fourier transform (18.80) and the Fourier integral (18.82) as the natural continuous analogs of the Fourier coefficients ck = 2� J11" f(x)e -i kx dx and the Fourier series I:00oo ck ei kx of a periodic function f in the trigonometric syskx ; k E Z} . This system is not orthonormal, and only the ease of tern {ei writing a trigonometric in it has caused kx ; k tradition E Z } . In ally instead of the moreFourier naturalseries orthonormal systemit00{ tokbeeiused this normalized system the Fourier series has the form -I:00 k ei kx , and the Fourier coefficients are defined by the formulas 2,., = k -J11"11" f(x)e -i kx dx. -A
-
:::=
c.
-
2,.,
568
18 Fourier Series and the Fourier Transform
The series continuous of suchtransform natural Fourier coefficients and such a Fourier would analogs be the Fourier 00 ( 18.98) f( � ) : = vk I f(x)e-i x f. dx oo and the Fourier integral 00 ( 18.99) f(x) = vk Ioo f(�)eixf. d� , whichIn the differsymmetric from thoseformulas considered(18.98) aboveandonly( 1 8.99) in thethenormalizing coefficient. Fourier "coefficient" and the Fourier "series"onlypractically coalesce,ofandthesointegral in the future we shall es sentially be interested in the properties transform ( 18.98) , calling itthetheFourier normalized Fourier transform or, where no confusion can arise, simply transform of the function f . In general the name integral operator or integral transform is customarily given to an operator A that acts on a function f according to a rule -
-
A(f) (y)
= I K(x, y)f(x) dx , X
where K(x, y) is a given function called the kernel of the integral operator, and X JR. n is the set over which the integration extends and on which the integrands are assumed to be defined. Since y is a free parameter in some set Y, it follows that A( ! ) is a function on Y. In mathematics there are many important integral transforms, and among them the Fourier transform occupies one of the most key positions. The reasons for this situation go very deep and i n volve the remarkable properties ofillustrate the transformation ( 1 8 .98) , which we shall to some extent describe and in action in the partFourier of this transform section. (18.98) . Thus, we shall study theremaining or;_ rmalized Alongthewithnotation the notation f for the normalized Fourier transform, we in troduce 00 I f(x)eif.x dx , �) = - ( 18. 100) J( : vk C
-
oo
thatFormulas is, J(�) =( 1f(8 .98)-�)and . (18.99) say that -::::.
i= f = f ,
(18.101)
18.3 The Fourier Transform
569
that other. is, the Hence integralif (18.98) transforms (18.98) and (18.99) are mutually inverse to each is the Fourier transform, then it is natural to call the Weintegral operator (18. 100) the inverse Fourier transform. discussandinjustidetail belowForcertain Fourier shall transform fy them. exampleremarkable properties of the f n l ( O = ( iO n [(o , (
r;g 11111
= y'2;f . g ' = ll f ll .
That is,of multiplication the Fourier transform maps the operator of differentiation into the operator by the independent variable; the Fourier transform ofFouritheerconvolution of functionsthe amounts to multiplying transforms; the ' s equality),the and transform preserves norm (Parseval is therefore an isometry of thebegin corresponding functionformula space. ( 1 8 . 1 01 ) . But we shall with the inversion another convenient normalization of the Fourier transform see Prob lemFor10 below. d. Sufficient Conditions for a Function to be Representable as a Fourier Integral We shall now prove a theorem that is completely analo gous inFourier both form andat content toTothepreserve theoremtheonfamiliar convergence of a trigono metric series a point. appearance of our earlier formulas and transformations to the maximum extent, we shall use the nonnormalized Fourier transform c (O in the present part of this subsec tion, together with itswhen ratherstudying cumbersome but sometimes convenient notation Afterwards, the integral Fourier transform as such, [ ] ( . f O .F wetionshalf. l as a rule work with the normalized Fourier transform 1 of the func Theorem 1 . (Convergence of the Fourier integral at a point). Let f : lR --+ C be an absolutely integrable function that is piecewise continuous on each finite
closed interval of the real axis R If the function f satisfies the Dini conditions at a point x E IR, then its Fourier integral ( ( 18.82) , (18.87) , (18.87' ) , (18.99) ) converges at that point to the value � ( f ( x _ ) + f ( x + ) ) , equal to half the sum of the left and right-hand limits of the function at that point.
By Lemma 1 the Fourier transform c(�) = .F [ f ]( � ) of the function f isweProof. continuous on lR and hence integrable on every interval [-A, A] . Just as transformed the partialofsum of the Fouri following transformations the partial Fourierer series, integral:we now carry out the
570
18 Fourier Series and the Fourier Transform
A
A x SA (x) = J c( () ei � d( = J ( 2� J f(t) e -it� dt) eix� d( = -A -A A x - t ) � d( ) dt = � J J(t) ei ( x - t )� - e -i( x - t ) A dt i( e ( = � J(t) 2n 1(x - t) 2n A 00 00 .!_ .!_ sin(x t)A = f(x + u) sinuAu du = J f(t) x - t dt oo
- oo
oo
oo
J
J
- oo
1r
- oo
=
- oo
=
J
1r
- oo
sinAu du . J ( f(x - u) + f(x + u) ) -u The ofchange in the order isoflegal. integration atintheviewsecond equality fromcontinuity the be ginning the computation In fact, of the piecewise of J , for every finite B > 0 we have the equality A A t x J ( 2� J j( t) e -i � dt) ei � d( = 2� J f(t) ( J ei(x - t � d() dt , -A -A from which as B -+ +oo, taking account of the uniform convergence of the integral J f(x) e-it� dt with respect to ( , we obtain the equality we need. We now use the value of the Dirichlet integral (18.97) and complete our transformation: 1
00
= 7r
B
B
0
B
-B
-B
)
-B
.!_
+ (f(x - u) - f(x_)) + (f(x + u) - f(x )) + sinAudu . oo
J u The resulting integral tends to zero as A -+ oo. We shall explain this and thereby finish the proof of the theorem. We represent this integral as the sum of the integrals over the interval ]0, 1] and over the interval [ 1 , +oo[. The first of these two integrals tends to zero as iAndexananameRiemann, -+ +oo in view of the BDini. Theconditions and the Riemann-Lebesgue lemma. second integral is the sum of four integralsThecorresponding to the four terms f(x - u), f(x + u), f(x_) and f(x + )· Riemann-Lebesgue lemma applies to the first two of these four integrals, and the last two can be brought into the following form, up to a constant factor: + sin Au J u du = AJ si: v dv . =
7r
0
oo
+ oo
18.3 The Fourier Transform
571
But as A +oo this last integral tends to zero, since the Dirichlet integral (18.97) converges. D Remark 1 . In the proof of Theorem 1 we have actually studied the conver gence of the integral as a principal value. But if we compare the notations (18.87) and (18.87'), it becomes obvious that it is precisely this interpretation of the integral that corresponds to convergence of the integral (18.87'). --+
From this theorem we obtain in particular
Corollary 1 . Let f : JR; --+ C be a continuous absolutely integrable function. If the function f is differentiable at each point x E JR; or has finite one sided derivatives or satisfies a Holder condition, then it is represented by its Fourier integral.
Hence for functions of these classes both equalities (18.80) and (18.82) or (18.98) and (18.99) hold, and we have thus proved the inversion formula for
the LetFouriuserconsider transformseveral for such functions. examples. Example Assume that the signal v (t) = P(f) (t) emerging from the device P considered in Example 2 is known, and we wish to find the input signal f (t) entering the device P. In Example 2 we have shown that f and v are connected by the relation 00 v( t) = J c(w)p(w) e iw t dw , -oo where c(w) F[f](w) is the spectrum of the signal F (the nonnormalized Fouri e r transform of the function f) and p is the spectral characteristic of the device P. Assuming all these functions are sufficiently regular, from the theorem just proved we conclude that then 5.
=
c(w)p(w) F[v] (w) . find c( w)(18.87). = F[f] ( w). Knowing c( w), we find usingFromthe this Fouriweer integral Example 6. Let a 0 and =
>
for x 0 , for x :::; 0 . >
Then
+oo _.!_ _1 . F[f] (�) = 21r J e - ax e il; x dx = _.!_ 21r a + �� -
0
_ _
the signal f
572
18 Fourier Series and the Fourier Transform
In adiscussing theits definition of the Fourier transform, we havesubsection. already noted number of obvious properties in Part b of the present _ (x) := f(-x), then F[f- ] (�) = F[f](-�). This is We note furtherchange that ifofJvariable an elementary - a linx l =thef(x)integral. We now take the function e + f( -x) = :
()()
r= 1
2 '
r·"
0
0
a +�
18.3 The Fourier Transform
573
Example On the basis of Example 4 it is easy to find (by an elementary change of variable) that if 2 e2 1 4,;2" . e then ! (f. ) = f ( x) = e - a 2 x , v'2a Itfunctions is very instructi ve to trace the simultaneous evolution of the graphs of the f and j as the parameter a varies from 1/v'2 to 0. The more "concentrated"is one of theconnected functionswith is, thethemore "smeared" the otherprinciple is. This circumstance closely Heisenberg uncertainty in quantum mechanics. (In this connection see Problems 6 and 7.) Remark In completing the discussion of the question of the possibility of representing a function by a Fourier integral, we note that, as Examples 1 and 3 show, the conditions on f stated in Theorem 1 and its corollary are sufficient but not necessary for such a representation to be possible. 7.
�
2.
18.3.2 The Connection of the Differential and Asymptotic Properties of a Function and its Fourier Transform a. Smoothness of a Function and the Rate of Decrease of its Fourier Transform lR 1
It follows fromintegrable the Riemann-Lebesgue lemma that theat infinity. Fourier transform of any absolutely function on tends to zero ThisthehasFouri already been notedthein smoother Lemma the proved above.theWefaster now show that, like e r coefficients, function, i t s Fourier transform tends to itszero.Fourier The dual fact is that the faster a function tends to zero,Wethebegin smoother transform. with the following auxiliary proposition.
Lemma 2. Let f : lR -+ C be a continuous function having a locally piecewise continuous derivative f' on JR. Given this, a ) if the function f' is integrable on then f (x) has a limit both as x -+ -oo and as x -+ +oo; b) if the functions f and f' are integrable on then f x) -+ 0 as x -+ oo .
IR,
IR, ( Proof. Under these restrictions on the functions f and f ' the Newton-Leibniz formula holds f(x) = f (O) + J f ' (t) dt . In conditions a) the right-hand side of this equality has a limit both as x -+ +oo and as x -+ -oo. function having a belimitzero.at infinity is integrable on IR, then both of theseIf alimits mustfobviously D X
0
574
18 Fourier Series and the Fourier Transform
We now prove Proposition 1 . (Connection between the smoothness of a function and the rate of decrease of its Fourier transform. ) If f E C( k) (JR , q (k = 0, 1 , . . . ) and all the functions f, f', . . . , f( k ) are absolutely integrable on JR, then a ) for every E {0, 1 , . . . , k} n
f (n) (0 = (iO n !(�) '
(18. 102)
�) = o( �\ ) as � ---+ 0 . Proof. If k = 0, then a) holds trivially and b) follows from the Riemann Lebesgue lemma. Let k > 0. By Lemma 2 the functions f, f', . . . , f( k - l) tend to zero as x ---+ Taking this into account, we integrate by parts, b) i(
oo .
= vk (
-
oo
f ( k - l) x - � l ::'_ oo +
( )e i x
(i�) J f (k-l) ( x) e-i�x dx) = . . . 00
00
= % J f(x)e -i�x dx = (i�) k no 00 Thus return Eq. ( 1 8 . 102) is established. This is a very important relation, and we We shall to it. �e shown that i( �) = (i�) k f ( k l (O , but by the Riemann-Lebesgue lemma f ( k) (0 ---+ 0 as � ---+ 0 and hence b) is also proved. 0 b. The Rate of Decrease of a Function and the Smoothness of its Fourier Transform In view of the nearly complete identity of the direct and inverse Fourier transforms the following proposition, dual to Proposition 1 , holds. Proposition 2. (The connection between the rate of decrease of a function and the smoothness of its Fourier transform). If a locally integrable function k 0
0
0
0
-
-
f : lR ---+ C is such that the function x f(x) is absolutely integrable on JR, then a ) the Fourier transform of f belongs to C( k ) (JR, q . b ) the following equality holds: ( 18. 103 )
18.3 The Fourier Transform
575
Proof. For k = 0 relation (18. 103) holds trivially, and the continuity of f( � ) has alreadyn been proved in Lemma 1 . If k > 0, then for < k we have the estimate l x f(x) l ::::; l x k f(x) l nat infinity,x from nwhich it follows that x n f(x) is absolutely integrable. But l x ofj(x)thesee-i�integrals 1 = l x f(x) l , which enables us to in voke the uniform convergence with respect to the parameter � and successively differentiate them under the integral sign: 00 f( � ) vb J f(x) e-i� x dx ' n
- oo
00
f (O � J00 xf(x) e - i�x dx , -
- oo
By Lemma 1 this last integral is continuous on the entire real line. Hence indeed f E C k l ( JR. , C ) . (
0
c. The Space of Rapidly Decreasing Functions
We denote the set of functions f E c C 00 l (JR., C) satisfying the condition sup l x,a j C"' l (x) l < for all nonnegative and (3 by S(JR., C) or more briefly by S. Such functions are called integers rapidly decreasing functions (as x ) Thethesetstandard of rapidlyoperations decreasingof addition functionsofobviously forms a vector spaceof under functions and multiplication a function by a complex number. Example The function e - 2 or, for example, all functions of compact sup port in c600 (JR., q belong to s. Definition 4.
oo
x E IR o:
8.
)
--+ oo .
x
Lemma 3. The restriction of the Fourier transform to automorphism of S.
S is a vector-space
We first show that ( ! E S) ([ E S). ToWedothenthisremark we firstthat remark that by Proposition 2a we have f E c C oo l (JR., C) . the operation of multiplication by x"' ( 0) and the operation D of differentiation do not lead outside the class of rapidly decreasing functions. Hence, for any nonnegative integers and (3 the relation f E S implies that the function D,a(x "' f(x)) belongs to the space S. Its Proof.
=?
o:
o:
2:
576
18 Fourier Series and the Fourier Transform
Fourier transform tends to zero at infinity by the Riemann-Lebesgue lemma. But by formulas ( 18. 102) and (18. 103) Df3 ( x�)) (�) = io + f3eJCal (�) , �!3 jCa) ( �) --+ 0 � --+ oo ,
and we have shown that as that is, j E S. nowwholeshowspacethatS.S = S, that is, that the Fourier transform maps S ontoWe the Wesimple recallrelation that thef(�)direct and inverse Fourier transforms are connected by the = [( -�) . Reversing the sign of the argument of the function obviouslytransform is an operation thatS into mapsitself. the set S into itself. Hence the inverse Fourier also maps Finally, if f is an arbitrary function in S, then by what has been proved
We havefmade study above of the Fourier transform of a function : JR; aCrather defineddetailed on the real line. In particular, we have clarified the connection that exists between the regularity properties of a function and the corresponding properties of its Fourier transform. Now thattransform this question hassufficiently been theoretically answered,so weas toshallexhibit studythethefundamental Fourier only of regular functions technical properties of the Fourier transform inwe concentrated form and without technical complications. In compensation shalltransform consider and not derive only one-dimensional but practically also the multi-dimensional Fourier its basic properties independently of whatThose was discussed above. confine themselves to the one-dimensional case may assume thatwishing1 tobelow. Definition 5 . Suppose f : JR; n C is a locally integrable function on JR;n . The function f(�) := (2 7r1) n / 2 J f(x) e-i((, x ) d x ( 18. 104) is called the Fourier transform of the function f. Here we mean that x (x1 , . . . , xn ) , � = (6 , . . . , �n ), (�, x) 6 x1 + · · · + �n X n , and the integral is regarded as convergent in the following sense of principal value: a. Definitions, Notation, Examples
n
=
--+
--+
JR n
=
=
A
A
-A
-A
J rp(x1 , . . . , Xn ) dx1 · · · dxn : = A--flim+oo J · · · J rp(x1 , . . . , Xn ) dx1 · · · dxn .
JR n
18.3 The Fourier Transform
577
In thisas ncaseone-dimensional the multidimensional Fourier transform ( 18. 104) can be re garded Fourier transforms carried out with respect to eachThen, of thewhen variables x 1 , . . . , Xn . the function f is absolutely integrable, the question of the senseLetin whi= c(ha the, . . integral ( 18. 104) is to be understood does not arise at all. . , a and (3 = ((31 , . . . , f3n ) be multi-indices consisting of ) 1 n nonnegative integers (31 , j = 1, . . . , n, and suppose, as always, that va denotes the differentiation operator ax"11 8 1.��x"'nn of order I a I := a 1 + · · · + n and x�' .· = x�'1 1 . . . xnl'n . Definition 6. We denote the set of functions f E c< = ) (IR. n , C) satisfying the condition sup l x,B v a f(x) l < n EJR x for all nonnegative multi-indice8 and (3 by the f:iymbol S(IR.n , q, or by S where can arise. Such functions are said to be rapidly decreasing (aD x -+no confusion ) The setofS awithfunction the algebraic operations of functions tiplication by a complex numberof addition is obviously a vectorand space.mul Example The function e l x l 2 , where l x l 2 xi + · · · + x;;, and all the function8 in c6= ) (IR.n , IC) of compact support belong to S. Ifunif fEormly S, then integral in relation ( 18. 104) obviouslyn converges absolutely and withrulesrespect tointegral space !R. . Moreover, if f E S, � on thecanentire then by 8tandard this be di ff erentiated as desired=wil (JR.,th q.respect to any of the variables 6 , . . . , �n · Thusas many if f E times S, then f E c< Example Let us find the Fourier transform of the function exp ( - lxl 2 /2) . 's When integrating rapidly decreasing functions one can obviously use Fubini theorem and if necessary change the order of improper integrations without difficulty. In the present case, using Fubini ' s theorem and Example 4, we find 1 j e l x l 2 / 2 . e - i � , x ) dx = (27r ) n / 2 a
a1 ,
•
a
•
00
a
oo .
9.
=
-
1 0.
JRn
-
(
= n = IT _1_ J e
n d xj = IT e -�J /2 = e- 1 � 1 2 /2 . j= 1 j= 1 V2if (X) We nowassuming, state andsoprove the dbasic structural propertiesthatof thethe Fourier Fourier transform, as to avoi technical complications, transform i8 beingtoapplied to(compute) functionswith of class S. This is approximately the same as learning operate rational number8 rather than the entire space JR. all at once. The process of completion is of the same type. On this account, see Problem 5 . -
-
x] f 2 e i �j X j -
578
18 Fourier Series and the Fourier Transform
b. Linearity The linearity of the Fourier transform is obvious; it follows from the linearity of the integral. c. The Relation between Differentiation and the Fourier Transform The following formulas hold
�( 0 = (i) l<>l�<> i(�) '
(x�)) (� ) = ( i ) l<>l v<> i( � ) .
( 18. 105) ( 18. 106)
Proof. The first of these can be obtained, like formula ( 18. 102), via integra tion with a preliminary use of Fubini's theorem in the ncourse, caseFormula ofby aparts space( 18.(of!R106) of generalizes dimension > 1). relationto (the18.103) and is obtained by direct differentiation of ( 18.104) with respect parameters 6 , . . . , �n · 0 Remark In view of the obvious estimate l f (�) l :S: ( 2n1) n/2 J l f (x ) l dx < +oo , it follows from ( 18. 105) that i(O 0 as � oo for every function f E S, since D <> f E S. Next,that the simultaneous use of formulas ( 18. 105) and ( 18.106) enables us to write v !3 (x�) ) ( � ) = (i) I <> I+I/Jiev,f(o , from that if f E S, then for any nonnegative multi-indices and f3which we haveit follows �/3D<> f(� ) 0 when � oo in !Rn . Thus we have shown that n
3.
�
JRn
-+
-+
-+
(f E
d. The Inversion Formula
S)
-+
=?
(f E
a
S) .
Definition 7. The operator defined (together with its notation) by the equality ( 18. 107)
is called the inverse Fourier transform. The following Fourier inversion formula holds: �
f=f=f,
or in the form of the Fourier integral: f(x) ( 2n1) n/2 J f( � ) ei x � ) d� . =
JRn
(
,
( 18. 108) ( 18. 109)
579
18.3 The Fourier Transform
Using Fubini 's theorem one( 1 8can immediately obtain formula (18. 108) from the corresponding formula . 1 0 1 ) for the one-dimensional Fourier trans form, but, as promised, we shall give a brief independent proof of the formula. Proof. We first show that
J g(�)j(�) ei(x ,e) d� = J g(�)f(x + y) dy for any functions J, g E S(IR, C) . Both integrals are defined, since J, g E S and so by Remark we also have f, g E S. ( 1 8 . 1 10)
3 Let us transform the integral on the left-hand side of the equality to be proved:
= Jg(�) ( (27r) n/2 jJ (y)e- i (e , y) dy) ei(x ,eJ d� = = (27r) n/2 J ( Jg(�)e- i (e ,y -x) d�) f(y)dy = = J g(y - x)f(y) dy = J g(y)f(x + y) dy . 1
Rn
Rn
1
Rn
Rn
Rn
Rn
There is since no doubtandas toaretherapidly legitimacy of thefunctions. reversal Thus in the( 1 8order of integration, decreasing . 1 10) is g f nowWeverified. now remark that for every c > 0 so that, by Eq. (18. 1 10)
J g(r::�)i(�)ei (x,e) d� = J r:: -ng(yjr::) J(x + y) dy = J g(u)f(x + cu) du .
Rn
Rn
Rn
g accountintegrals of the absolute convergence withasrespect c ofTakithe nextreme in the lastandchainuniform of equalities, we find, c to g(O) J i(�)ei(x ,e) d� = f(x) J g(u) du . -+
0,
580
18 Fourier Series and the Fourier Transform
Here we set g (x ) = e - l x l 2 /2• In Example 10 we saw that g(u) = e- lul 2 /2 • Recalling the Euler-Poisson integral -Ioo e- x2 dx = V7f and u�ing Fubini ' � theorem, we conclude that I e-lul 2 12 du = (27r ) n/2 , and a� a result, we obtain Eq. ( 18. 109) . D Remark 4. In contr�t to the single equality (}8. 109) , which means that J = f, relations ( 18. 108) al�o contain the equality 1 = f . But thi� relation follows immediately from the one proved, since J(f.) = [(-f.) and � = {(;). Remark We have already seen (see Remark 3) that if f E S, then J E S, and henc� 1 �S also, that is, S S and S S. We now conclude from the relations 1 = J = f that S = S = S. e. Parseval's Equality Thi� is the name given to the relation 00
JRn
5.
C
C
( ! , g ) = ([, g) '
(18. 1 1 1 )
which in expanded form means that
I f(x)g (x) dx = I [(f. g f. df. . ) ( )
( 18. 1 1 1' )
It follows in particular from (18. 1 1 1 ) that ( 18. 1 12) 11 ! 11 2 = ( ! , f) = ([, [) = 11 111 2 . From thepreserves geometric point of view,between Eq. (18.functions 1 1 1 ) means that the Fourier transform the i n ner product (vectors of the space S), The and name hence "Parseval is an isometry of S. ' s equality" is also sometimes given to the relation I [(f,)g(f.) df. = I f(x)g(x) dx , ( 18. 1 13) which is obtained from ( 1 8 . 1 10) by setting x = 0. The main Parseval equality (18. 1 1} ) is obtained from ( 18. 1 13) by replacing g with g and using the fact that (g) = g, since � = 0 and � = g . f. The Fourier Transform and Convolution The following important relations hold ( J;9 ) = (2 7r ) n f 2 J · g ' (18.1 14) (h) = (211") - n/2 [ g (18. 1 15) *
18.3 The Fourier Transform
581
( sometimes called Borel 's formulas) , which connect the operations of convo lution multiplication of functions through the Fourier transform. Letandus prove these formulas:
Proof.
( j;g ) ( O
= ( 27r) n/2 l u g)(x)e-i(�,x) dx = = ( 27rn/2 I (I f(x - y)g(y) dy)e-i(�,x) dx = = (27r) n/2 1g(y)e-i(�,y) (lj (x - y)e- i(�,x- y) dx) dy = = ( 27r) n/2 I g(y)e-i(�,y) ( I f(u) e-i(�,u) du) dy = = I g(y)e-i(�,y) f(o dy = ( 27r) n/2 !(�)§(�) . 1
*
]Rn
1
-
-
1
1
]Rn
]Rn
]Rn
]Rn
]Rn
]Rn
]Rn
Thef,glegitimacy of the change in order of integration is not in doubt, given thatFormula E S. 1 15) can be obtained by a similar computation if we use the inversion(18.formula (18. 109) . However, Eq. ( 1 8 . 1 15) can be derived from relation (18.1 14) already proved if we recall that f = f = J , f = J, f = J, and that u · = · u = D Remark If we set J and g in place of f and g in formulas ( 1 8 . 1 14) and (18.1 15) and apply the inverse Fourier transform to both sides of the resulting equalities, we arrive at the relations J:9 = ( 2 7r ) - nf 2 (J g) , ( 1 8 . 1 14') ( 1 8 . 1 1 5') j;g = ( 2 7r ) n f 2 (J . g) . v
6.
u v,
*
v
u
*
v.
*
18.3.4 Examples of Applications
Let the Fourier transform (and some of the machinery of Fouriuser now seriesillustrate ) in action. The Wave Equation The successful use of the Fourier transform in the equations of mathematical physics is bound up ( in replaces its mathematical aspectof) primarily with the fact that the Fourier transform the operation differentiation with the algebraic operation of multiplication. a.
582
18 Fourier Series and the Fourier Transform
For example, suppose we are seeking a function u lR lR satisfying the equation where a0Fourier , . . . , antransform are constant coefficients andthisf equation is a known(assuming function.thatApply ing the to both si d es of the functions u and f are sufficiently regular), by relation (18.105) we obtain the algebraic equation :
---+
for u. After finding u(.;) = t��\ from the equation, we obtain u(x) by apply ing the inverse Fourier transform. now apply thiswave ideaequation to the search for a function u u(x, t) satisfying the Weone-dimensional (a > 0) and the initial conditions au (x, 0) = g(x) u(x, 0) = f(x) , at in lRHere x and computations in the next example weasshall notittake the time to tojustify the intermediate because, a rule, is easier simply find theto required function and verify directly that it solves the problem posed than justify and overcome all functions, the technicalwhichdifficulties that arise along the way.playAs itanhappens, generalized have already been mentioned, essential role in thet astheoretical struggle withoutthesea Fouri difficulties. Thus, regarding a parameter, we carry erhand transform on x onentiation both sides of the equation. Then, assuming on the one that differ respect to theonparameter integral sign is permitted and usingwith formula (18.105) the other under hand, the we obtain =
JR.
from which we find
t) = A (,;) cosa.;t + B(.;) sina.;t . By the initial conditions, we have u (,;,
u� ( .; , O)
u(.; , o ) = =
!(.;) = A ( .; ) ,
(u D ( .; , O)
=
a B(.;) .
g(.;) = ,;
18.3 The Fourier Transform
Thus,
583
Multiplying this equality by v�27r eix� and integrating with respect to � the inverse Fourier transform - and using formula ( 18. 105) wein short, obtaintaking immediately t 1 1 u(x, t ) = 2 (f(x - at ) + f( x + at)) + 2 I (g(x - aT ) + g(x + a r ) ) dr . b. The Heat Equation Another element of the machinery of Fourier trans forms (specifically, formulas (example, 1 8 . 1 14') and (18. 1 15')) which remained in the background in the preceding itself quite clearly when we seek a function u = u(x, t), x E �R_n , t manifests 0, that satisfies the heat equation 0
2::
au
2 at = a .1u (a > 0) !R.n . u(x, 0) = f (x) vx1 + . . . + uxn ·
and the initial condition 82 82 on all of Here, as always .1 Carrying out a Fourier transform with respect to the variable x E �R_n , we find by (18.105) the ordinary equation = =
from which it follows that
�
u(� , t) c(�)e- a21 � 1 2 t ' 2 wewherefind1 � 1 �� + · · · + �;. Taking into account the relation U(� , 0) = f(�), Now applying the inverse Fourier transform, taking account of ( 1 8. 1 14') , we obtain =
=
I
u(x, t ) = (2rr) - n / 2 f(y)Eo (Y - x, t) dy ,
where Eo (x, t) is the function whose Fourier transform with respect to x is inverse Fourier respect to � of10.theMaking functionan ee--a21a21 �� 11 22 tt . isTheessentially already transform known to with us from Example
584
18 Fourier Series and the Fourier Thansform
obvious change of variable, we find
( )e Setting we find the fundamental solution E(x, t) (2a v'1rt) - n e - � (t > 0) , ofSect.the17.4) heat, and equation, which was already familiar to us (see Example 15 of the formula -..fo n - � n/ 2 (211") av'i E(x, t) = (211") - n f 2 E0 (x, t), Eo (x, t)
1
=
4a t
•
lx12
=
u(x, t) =
(f E)(x, t) *
for the solution satisfying the initial condition u(x, 0) = f (x) . c . The Poisson Summation Formula This is the name given to the following relation � L
:
00
=
-t
00
00
rp(n)einx ' ( 18 . 1 17)
L (x + 211" ) = L n = - oo m = - oo
R
=
ck (f)
:=
00
2�
d
�J
=
00
+
00
f(x )e - i k x x = L n = - oo 0 � n+l) 2 ( 1
2�
� J
+
oo
d
211"n)e - i kx x =
J
n =2:: - oo � 2 � n
- oo
18.3 The Fourier Transform
585
f is a smooth 2n-periodic function and so its Fourier series converges to itButat every point E R Hence, at every point E JR. we have the relation = = = i nx = M:: 1 cp(n) einx . D 2:: ( + 2nn) = f(x) = 2:: en ( ! ) eM:: 2:: 2 2 1f 1f = = = n -= n -= n -= As can beforseenfunctions from theof proof, relations ( 1 8 . 1 16) and ( 1 8 . 1 1 7) by notoRemark means hold only class S. But if does happen to belong S, then Eq. ( 18. 1 1 7) can be differentiated termwise with respect to any number of times, yielding as a corollary new relations between , and . x
x
x
V
7.
V
1
cp
•
•
x
•
d. Kotel'nikov's Theorem (Whittaker-Shannon Sampling Theo rem)25
This example,series basedandlikethetheFourier preceding one hason aa direct beautifulrelation combito nation of the Fourier integral, the theoryappearing of information transmission in abecause communication channel.capabilities To keep itof from artificial, we recall that of the limited our sense For organs, we aretheableearto"hears" perceivein signals onlyfrom in a20certain rangekHz.of frequencies. example, the range Hz to 20 Thus, no matter what the signals are, we, like a filter (see Subsect. 18.3. 1 ) cut out only(having a bounded part ofspectrum). their spectra and perceive them as band-limited signals a bounded For that signal reason,f (t)we (where shall assume from the outset that the transmitted orthereceived t is time, < t < ) is band-limited, spectrum being nonzero only for frequencies whose magnitudes do not exceed a certainfunction criticalthevalue a > 0. Thus J(w) 0 for l w I > a, and so for a band-limited representation = f (t) vk J J(w)eiwt dw -= reduces to the integral over just the interval [ - a, a]: ( 1 8 . 1 18) f (t) vk J J(w)eiwt dw . - oo
=
oo
=
=
a
-a
2 5 V.A. Kotel'nikov ( b. 1908) - Soviet scholar, a well-known specialist in the theory of radio communication. J.M. Whittaker ( 1905-1984) - British mathematician who worked mainly in complex analysis. C.E. Shannon ( 1916-2001) - American mathematician and engineer, one of the founders of information theory and inventor of the term "bit" as an abbreviation of "binary digit."
18 Fourier Series and the Fourier Transform
586
On the closed interval [ - a , a] we expand the function J(w) in a Fourier senes J(w) L ck (j) ei rraw k (18.1 19) - oo { ei rraw k ; k E Z} which is orthogonal and complete in that iinterval. n the system Taking account of formula ( 1 8 . 1 18), we find the following simple expression for the coefficients a ck (f) of this series: J2;2a f ( - -ka ) (18. 120) ck ( f) 2a1 f f (w ) e rrw k dw Substituting the series ( 1 8. 1 19) into the integral (18. 1 18) , taking account of relations ( 18. 120) , we find a 1 f J2; j (t) _ ( 2a f J ( �a k ) ei w t - i rrak w ) dw = J2; k= -oo 1 oo = L f ( � k ) f eiw ( t-%k ) dw . 2a = k -oo Calculating these elementary integrals, we arrive at Kotel 'nikov 's formula oo -a �k ) . (18.121) f ( t ) =Loo f ( a k) sina(t a ( t - -7r k ) k (18.121) shows that, in order to reconstruct a message described byquency aFormula band-limited function f ( t ) whose spectrum is concentrated in the fre range l w l a, it suffices to transmit over the channel only the values marker values) of the function at equal time intervals /a. f ( kThis) (called proposition, together with formula (18.121) is due to V.A. Ko tel ' nikov and is called Kotel 'nikov 's theorem or the sampling theorem. Remark The interpolation formula (18.121) itself was known in mathe ' nikov ' s 1933 paper, but this paper was the first to point matics before Kotel out the fundamental significance of theoverexpansion (18.121) for the theory of transmission of continuous messages a communication of the derivation of formula ( 1 8 . 121) given above is also duechannel. to KotelThe' nikidea ov. In reality transmission and receiving time ofwea communication isRemark also limited, so thattheinstead of the entire series (18.121) take one of its partial sums I: . Special research has been devoted to estimating the errors that thereby arise. =
�
=
:= -
�
-a
00
=
-· 'a
-a
7r
a
-a
=
:::;
iJ.
8.
9.
N
-N
�
iJ.
=
1r
18.3 The Fourier Transform
587
Remark Ifcommunication it is known howf(t)muchovertimea given is required to transmitchannel, one sample value of the communication it is easy to estimate the number of such communications that can be transmitted in parallel over that channel. In other words, the( and possibility arisesas ofa function estimat ing the transmitting capability of the channel moreover, ofspectrum the informational of the communications that will affect the of the signalsaturation f(t ) ) . 1 0.
18.3.5
Problems and Exercises
a) Write out the proof of relations ( 18.93)-( 18.96) in detail. b) Regarding the Fourier transform as a mapping f >-+ j, show that it has the following frequently used properties: 1.
f(at) >-+ �1 f-( -;;w )
( the change of scale rule ) ;
f(t - to) >-+ f(w) e - i w to
{
( time shift of the input signal - the Fourier preimage - or the translation theorem)
f(w)2 cos wto , [f(t + to) ± f(t - to)] >-+ f(w )2 sin wto ; ±i 0t f(t) e w >-+ f(w ± wo) ( frequency shift of the Fourier transform ) ; 1 -
- + wo )] , f(t) cos wot >-+ 2 [f(w - wo) + f(w - + wo)] f( t) sin wot >-+ 21 [f(w - wo) - f(w
( amplitude modulation of a harmonic signal ) ; wot >-+ 1 - f(w - - wo) sin 2 2
f(t)
4 [2f(w) -
{
- + wo)] . - f(w
c ) Find the Fourier transforms ( or, as we say, the Fourier images) of the fol lowing functions: 1 for l t l :S A , 2A
JIA (t) =
( the
rectangular pulse) ;
IlA
Q
for l t l > A
(t) cos wot
18
588
Fourier Series and the Fourier Transform
(a harmonic signal modulated by a rectangular pulse) ; Ih (t + 2A) + Ih (t - 2A) (two rectangular pulses of the same polarity) ; lh (t - A) - liA (t + A) (two rectangular pulses of opposite polarity) ; � (1 - � ) for l t l � A , AA (t) = for l t l > A 0 (a triangular pulse) ; cos ae and sin ae ( a > 0) ; IW ! and IW ! e - a i t i ( a > 0) . d) Find the Fourier preimages of the following functions: wA , 2 I. -- sm 2 w A 2 . 2 wA . smc smc - ,
{
•
7r wA where sine ;- := � is the sample function ( cardinal sine) . e ) Using the preceding results, find the values o f the following integrals, which we have already encountered: 00 oo 00 I I sin 2 x 2 x , d cos x dx , sin x 2 d x . X2
si x
7r
l
- oo
- oo
- oo
f) Verify that the Fourier integral of a function f(t) can be written in any of the following forms: 00
f(t) "'
I - oo
f(w) e i t w dw = 2�
00
00
I - oo
dw
I - oo
i x t dx =
f(x) e - w (
- )
00
=
� I dw 0
00
I
f(x) cos 2w(x - t) dx .
- oo
Let f = f(x, y) be a solution of the two-dimensional Laplace equation � + = 0 in the half-plane y 2: 0 satisfying the conditions f(x, 0) = g(x) and f(x, y) -+ 0 as y -+ +oo for every x E JR. a) Verify that the Fourier transform [(t;., y) of f on the variable x has the form 2.
�
g( t;,) e- yi{ i .
b) Find the Fourier preimage of the function e- Y i el on the variable t;.. c) Now obtain the representation of the function f as a Poisson integral
1
f(x, y) = ;
00
I - oo
y ) dt;. , (x - t;.) 2 + y 2 g( t;,
which we have met in Example 5 of Sect.
17.4.
18.3 The Fourier Transform
589
We recall that the nth moment of the function f : lR -+
3.
-
oo
bution, that is, f(x) 2 0 and J f(x:) dx = 1 , then xo = M1 (f) is the matheoo matical expectation of a random variable x with the distribution f and the varioo ance CJ 2 : = J ( x - x0) 2 f ( x) dx of this random variable can be represented as oo CJ 2 = M2 (f) - M[ (f) . Consider the Fourier transform -
-
j f(x)e- i�x dx 00
f(EJ =
of the function f. By expanding e - i � x in a series, show that n a) f(f.) = f; ( - i) n"f. n U ) C if, for example, f E S. =O n b) Mn (f) = (i) n j\ n l (0) , n = 0, 1, . . . . c) Now let f be real-valued, and let f(f.) = A(E,)ei'P ( O , where A( f.) is the absolute value of f(EJ and cp(EJ is its argument; then A( f.) = A( -EJ and cp( - f.) = -cp(E,) . To 00 normalize the problem, assume that J f(x) dx = 1 . Verify that in that case oo -
and 4. a) Verify that the function e- a lxl ( a > 0) , like all its derivatives, which are defined for x =/= 0 , decreases at infinity faster than any negative power of lx l and yet this function does not belong to the class S. b) Verify that the Fourier transform of this function is infinitely differentiable on JR, but does not belong to S (and all because e- a lx l is not differentiable at x = 0) . 5. a) Show that the functions of class S are dense in the space R 2 (1R n ,
590
18 Fourier Series and the Fourier Transform
c) Show that a vector-space structure and an inner product can be introduced in a natural way on L 2 , and in these structures the Fourier transform L 2 �L 2 turns out to be a linear isometry of L 2 onto itself. d) Using the example of the function f(x) = Vb l+ x 2 , one can see that if f E R 2 (JR., q we do not necessarily have f E R(JR., C). Nevertheless, if f E R2 (JR., q , then, since f i s locally integrable, one can consider the function
A
fA (�) =
vk I f(x) e - i�x dx . -A
Verify that fA E C(JR., C) and fA E R 2 (lR., C). e) Prove that fA converges in L 2 to some element f E L 2 and I fA ll as A --7 +oo (this is Plancherel's theorem 2 6 )
--7
ll fll = ll f ll
6. The uncertainty principle. Let cp(x) and '1/J(p) be functions of class 8 (or elements DO DO of the space L 2 of Problem 5), with 'ljJ = ip and J l cp l 2 ( x ) dx = J l 'l/JI 2 (p) dp = 1. I n this case the functions I'PI 2 and l 'l/JI 2 can be regarded as probability densitie:s for random variables x and p respectively. a) Show that by a shift in the argument of
00
I l axcp(x) + cp' (x) l 2 dx � 0
-
DO
and, using Parseval's equality and the formula ip' (p) = ipip(p) , show that a 2 M2 (
M.
Plancherel ( 1885-1967) - Swiss mathematician.
18.3 The Fourier Transform
591
(called Heisenberg 's2 7 uncertainty principle) , is mathematically the same as the relation between M2 ( cp ) and M2 ('l/J) found above. The next three problems give an elementary picture of the Fourier transform of generalized functions. 7.
{
a) Using Example 1 , find the spectrum of the signal expressed by the functions
L1a (t)
=
1 2a for l t l S a ,
O
for lt l
>a.
b) Examine the variation of the function .d, ( t) and its spectrum as a -+ +0 and tell what, in your opinion, should be regarded as the spectrum of a unit pulse, expressed by the 8-function. c) Using Example 2, now find the signal cp ( t ) emerging from an ideal low frequency filter (with upper frequency limit a ) in response to a unit pulse o(t). d) Using the result just obtained, now explain the physical meaning of the terms in the Kotel'nikov series ( 18. 121 ) and propose a theoretical scheme for transmitting a band-limited signal f(t) , based on Kotel'nikov's formula ( 18. 121 ) .
The space of L. Schwartz. Verify that a) If cp E S and P is a polynomial, then (P cp) E S; b) if cp E S, then D"'cp E S and D f3 (PD"cp) E S, where a and (3 are nonnegative multi-indices and P is a polynomial. c) We introduce the following notion of convergence in S. A sequence { cp k } of functions 'P k E S converges to zero if for all nonnegative multi-indices a and (3 the sequence of functions { x 13 D"cp(x) } converges uniformly to zero on lRn . The relation 'P k -+ cp E S will mean that ( cp - 'P k ) -+ 0 in S. The vector space S of rapidly decreasing functions with this convergence is called the Schwartz space. Show that if 'P k -+ cp in S, then (j)k -+ (j5 in S as k -+ oo. Thus the Fourier transform is a continuous linear operator on the Schwartz space. 8.
·
9. The space S' of tempered distributions. The continuous linear functionals defined on the space S of rapidly decreasing functions are called tempered distributions. The vector space of such functionals (the conjugate of S) is denoted S' . The value of the functional F E S' on a function cp E S will be denoted F( cp). a) Let P : lRn -+ C be a polynomial in n variables and f : lRn -+ C a locally integrable function admitting the estimate l f(x) l S I P(x) l at infinity (that is, it may increase as x -+ oo, but only moderately: not faster than power growth) . Show that f can then be regarded as a (regular) element of S' if we set
f(cp)
=
J f(x)cp(x) dx
lll. n
(cp E S) .
2 7 W. Heisenberg ( 1901-1976 ) - German physicist , one of the founders of quantum mechanics.
592
18 Fourier Series and the Fourier Transform
b) Multiplication of a tempered distribution F E S' by an ordinary function f : JR n -t C is defined, as always, by the relation (JF) (�) := F(f�) . Verify that for tempered distributions multiplication is well defined, not only by functions f E S, but also by polynomials P : JR n -t C. c) Differentiation of tempered distributions F E S' is defined in the traditional way : (D a F) (�) : = ( - 1 ) 1 a i F(D a �). Show that this is correctly defined, that is, if F E S', then D a F E S' for every nonnegative integer multi-index a = ( a 1 , . . . , a n ) d) If f and � are sufficiently regular functions (for example, functions in S) , then, as relation (18.1 13) shows, the following equality holds:
f( �) =
IIRn f(x)�(x) dx JRnI f(x) ij5(x) dx =
=
J ( i/5) .
This equality (Parseval's equality) is made the basis of the definition of the Fourier transform F of a tempered distribution F E S'. By definition we set F( �) : = F( i/5) . Due to the invariance of S under the Fourier transform, this definition is correct for every element F E S' . Show that i t i s not correct for generalized functions i n V' (JRn ) mapping the space V(JR n ) of smooth functions of compact support. This fact explains the role of the Schwartz space S in the theory of the Fourier transform and its application to generalized functions. e) In Problem 7 we acquired a preliminary idea of the Fourier transform of the 8-function. The Fourier transform of the 8-function could have been sought directly from the definition of the Fourier transform of a regular function. In that case we would have found that ?( � ) -
u
_
( 27r ) n / 2 lll.In 1
'(X) e -i ( E. x ) dx - ( 27r1) n 2 . _
u
/
Now show that when we seek the Fourier transform of the tempered distribution 8 E S' (JR n ) correctly, that is, starting from the equality 8( � ) = 8( ij5) , the result (still the same) is that 8(ij5) = i/5 ( 0 ) = 2 rr �n / 2 • (One can renormalize the Fourier ( transform so that this constant equals 1; see Problem 10) . f) Convergence in S' , as always in generalized functions, is understood in the following sense: (Fn -t F) in S' as n -t oo : = (V� E S (Fn (�) -t F(�) as n -t oo )) . Verify the Fourier inversion formula (the Fourier integral formula) for the 8function: A A 8(x)
=
lim
A -+ + oo
(
1
(
21r ) n 12 1 . . . 1 8(� ) ei x .o d� .
_
-A
-A
g) Let 8(x - x0) , as usual, denote the shift of the 8-function to the point x0 , that is, 8(x - Xo ) (�) = �(xo ) . Verify that the series
"L 8(x - n ) 00
n = - cx:> n converges in S' (JR ). (Here 8 E S' (JR n ) and
n
E Z.)
18.3 The Fourier Transform
593
h) Using the possibility of differentiating a convergent series of generalized func tions termwise and taking account of the equality from Problem 13f of Sect. 18.2, DO show that if F = 2:.::: 8(x - n) , then n = - oo
F = y'2;
DO
L
)
8(x - 21r n .
n = - oo
i) Using the relation F( VJ ) = F(ij5) , obtain the Poisson summation formula from the preceding result. j) Prove the following relation (the 0-formula) DO
L
2
e-tn =
n= - oo
�1r L e DO
-
t
,.-
2
- -n
'
2
(t
>
0) '
n = - oo
which plays an important role in the theory of elliptic functions and the theory of heat conduction. 10.
If the Fourier transform F[f] of a function f : JR. --+ C is defined by the formulas DO
] ( v) := F[f] (v) := J f ( t) e- 2rrivt dt , - DO
many of the formulas relating to the Fourier transform become particularly simple and elegant. a) Verify that i(u) =
vh:-1 ( 2: ) .
b) Show that .f[F[f]] (t) = f( -t), that is, DO
f(t) =
J ](v) e2rrivt d v .
- DO
This is the most natural form of the expansion of f in harmonics of different frequencies and in this expansion is the frequency spectrum of f. c) Verify that J = 1 and i = 8. d) Verify that the Poisson summation formula (18.1 16) now assumes the par ticularly elegant form
v,
] ( v)
L VJ ( n ) = L
n = - oo
n = - oo
(j> (n) .
1 9 Asymptotic Expansions
The majority ofbyphenomena thatofwenumerical have to parameters deal with canhaving be characterized mathematically a certain set rather com plicated interrelations. However the description of a phenomenon as a ruleor becomes significantly simpler if it is known that some of these parameters some combination of them is very large or, contrariwise, very small. Example In describing relative motions occurring with speeds v that are much than the speed of lig3htof( lSect. v l c) we may use, instead of the Lorentzsmaller transformations (Example 1 .3) 1.
�
t - (�)x ' t' = Jl - (�) 2
x' = -vr:xl=--==;:;vt( �;:;:):;;<::< 2 '
the Galilean transformation since vjc Example
x' = x - vt ,
�
2.
t' = t '
0.
The period
of oscillations of a pendulum is connected with the maximal deviation 2 = sinangle2 !t'fof (see from its equilibrium position via the parameter k Sect. 6.4) . If the oscillations are smalll, that is, 0, we obtain the simple formula cp 0
cp 0
T�
�
2rr{f
for the period of such oscillations. Supposeposition a restoring a particle toisthereturning toExample its equilibrium and thatforcetheacting force isonproportional displaceit ment (a spring with spring constant k, for example). Suppose also that the 3.
m
596
19 Asymptotic Expansions
resisting force ofoftheproportionality medium is proportional to the square of the velocity (with coefficient a). The equation of motion in that case has the following form (see Sect. 5.6): m!i + ax 2 + kx = 0 . If the mediumby"rarities", a 0 and one may assume that the motion is approximated the motionthendescribed by the equation m!i + kx = 0 (harmonic oscillations with frequencey {f), and if the medium "condenses", then a and, dividing by a, we find in the limit the equation :i;2 = 0, that is, x(t) const. Example 4 . If 1r(x) is the number of primes not larger than x E JR., then, as is known for large x the quantity 1r(x) can be found with small relative(see errorSect.by the3.2),formula 7r(x) � -1nx- . Example It would be difficult to find more trivial, yet nevertheless impor tant relations than sinx � x or ln(l + x) � x , inThese whichrelations the relative smallerifasdesired, x approaches can beerrormadebecomes more precise namely0 (see Sect. 5.3) . -+
-+ oo ,
=
X
5.
1 . � x - -x smx 3! by adjoining one or more of the following terms obtained from the Taylor series. Thus theofproblem is to findbeinga clear, convenient, andspecifics essentially correct description a phenomenon studied using the of the situa tion that arises when some parameter (or combination of parameters) that characterizes the phenomenon is small (tends to zero) or, contrariwise, large (tendsHence, to infinity). weofarethisoncetypeagaiaren essentially discussing the theory of limits. Problems called asymptotic problems. They arise, as one can The see, solution in practically allasymptotic areas of mathematics and consists natural sciofethence.following of an problem usually stages: passing to thesimplified limit anddescription finding theof(main term of the)estimating asymptotics, that is, a convenient the phenomenon; the error that arises i n using the asymptotic formula so found, and determining its range of applicability; then sharpening the main term of the asymptotics, 3
19.1 Asymptotic Formulas and Asymptotic Series
597
to theequally processalgorithmic of adjoiningin the nextcase).term in Taylor 's formula (but faranalogous from being every methods solving asymptotic problemsof(called asymptotic methods) are The usually closelyofconnected with the specfics a problem. Among the few rather general and at the same time elementary asymptotic methods finds 's formula, one of the most important relations in differential one Taylor calculus. The present chaptermethods should ofgivanalysis. e the reader a beginning picture of the elementary asymptotic the firsttosection we shall introducemethods; the general concepts andshalldefini tithem onsInrelating elementary asymptotic in the second we use 's method of constructing the asymptotic expan in discussing Laplace sion of Laplace transforms. This method, which was discovered by Laplace incomponent his research on saddle-point the limit theorems oflater probability theory, is an important of the method developed by Riemann, usually discussed in a course of complex analysis. Further i n formation on various asymptotic methodsTheseof anlysis can becontain foundaninextensive the specialized books cited in the bibli o graphy. books also bibliography on this circle of questions. 19. 1 Asymptotic Formulas and Asymptotic Series 19. 1 . 1 Basic Definitions
a. Asymptotic Estimates and Asymptotic Equalities For the sake of completeness we begin with some recollections and clarifications. Definition 1. Let f X -+ Y and g X -+ Y be real- or complex- or in general Then thevector-valued relations functions defined on a set X and let be a base in X. xEX f = O(g) or f(x) = O (g(x) ) f = O(g) or f(x) = O (g(x) ) over the base f = o(g) or f(x) = o (g(x) ) over the base mean by a:(x) definition that in thebounded equalityonl f(x)X,l ultimately a:(x) l g(x) l , the real-valued function is respectively bounded over the base and infinitesimal over the base Theserelation relations are usually called asymptotic estimates (of f). The f "' g or f(x) "' g(x) over the base :
:
B
B
B,
B.
B
=
B ,
598
19 Asymptotic Expansions
which bycalled definition means that f(x) = g(x) + o (g(x)) over the base B, is usually asymptotic equivalence or asymptotic equality 1 of the functions over the base B. Asymptotic estimates and asymptotic equalities unite in the term asymp totic formulas. Wherevernotations it is not important to indicate the argument of a function the abbreviated f = o(g), f = O(g), or f g are used, and we shall makeIf fsystematic use of this abbreviation. = O(g) and simultaneously g = 0( ! ), we write f ::::: g and say that f andIng are quantities of the same order over the given base. what we are going to be doing below, Y =
C
3
COS X z
z
z
3
X
z
z
C
z
1.
o C:x )
1
x
---t 0 as x ---t +oo . Thiscomputational circumstance,importance as we shallwhensee one below,considers leads totheasymptotic seriesbutthat have relative error not the absolute error; for that reason these series are often divergent, in contrast to classicalbeingseries,approximated for which theandabsolute value of the difference between theto function the nth partial sum of the series tends zero as n ---t +oo. ( l:x )
1 It is also useful to keep in mind the symbol equivalence.
�
often used to denote asymptotic
19. 1 Asymptotic Formulas and Asymptotic Series
599
Let us consider some examples of ways of obtaining asymptotic formulas. Example The labor involved in computing the values of n! or ln n! increase asunder n E that N increases. We shall use the fact that n is large, however, and obtain assumption a convenient asymptotic formula for computing ln n! approximately. It follows from the obvious relations n n+1 n k n n k+1 J ln x dx = L J ln x dx < L lnk < L J ln x dx = J ln x dx k = 2k - 1 k =1 k=2 k 1 2 that n n+1 2 0 < ln n! - J ln x dx < J ln x dx + J ln x dx < ln2 ( n + 1 ) . n 1 1 But n J ln x dx = n ( ln n - 1 ) + 1 = n ln n - (n - 1 ) , 1 and therefore as n --+ oo n Inn! = J ln x dx + O ( ln2 ( n + 1 ) ) = 1 = nlnn - ( n - 1 ) + O ( lnn ) = nlnn + O ( n) . = (ton lnzeron) when --+ +oo, the relative error of the formula ln n!Sincen ln0 n(n)tends as n --+n +oo. Example We shall show that as x --+ +oo the function x t en dt ( n E JR.) fn (x) = J t 1 is+ooasymptotically equivalent to the function 9n (x) x - n ex . Since 9n (x) --+ as x --+ +oo, applying L ' Hopital ' s rule we find - n ex- n 1 = 1 . x fn (x) = lim f� (x) = lim lim x --++ = 9n (x) x--+ + = g� (x) x --+ cxo x - ne x - nx - e x Example Let us find the asymptotic behavior of the function x t e dt f(x) = J t 6.
o
�
7.
=
8.
1
600
19 Asymptotic Expansions
more precisely. It differs from the exponential integral x Ei (x) = J tet dt oo onlyIntegrating by a constant term. we obtain by parts, x et x 2et t t t x x e ( e ) 1 e j j dt = + - + - dt = f(x) = l + -
t t2 1 1 t3 X t t t x ( e e 3! et dt = 1! e ) 2! j = -t + 2t + 3t l 1 + 4 1 t X ( n - 1 ) ! ) x j n! et ( 0! 1! 2! t . . = e t + -t 2 + -t 3 + + t n l 1 + -n + 1 dt . t 1 This last integral, as shown in Example 7, is O(x -( nn+ 1 l ex ) as x ---+ +oo. Including in the term O(x -(n+ 1 l ex ) the constant -e k2::= 1 (k - 1 ) ! obtained when t = 1 is substituted, we find that t 1 1 t2
-
-
-
·
-
f(x) = ex t (k �k 1 ) ! + a c:: 1 ) as X ---+ +oo . k= 1 The error 0 ( x��r ) in the approximate equality f(x) t (k X-k 1 ) ! ex k= 1 �
is asymptotically infinitesimal compared with each term of the sum, including the last. As the same time, as x ---+ + oo each successive term of the sum is infinitesimal its predecessor; therefore it is natural the continuallycompared sharper with sequence of such formulas as a series generatedto bywritef: !( x ) _ eX � (k X-k 1 ) ! . k= 1 this series obviously diverges for every value of x E JR, so thatWewenote cannotthatwrite f ( X ) = ex � ( k -k 1 ) ! roo.J
�
� k= 1 x
19. 1 Asymptotic Formulas and Asymptotic Series
601
Thus we ofareadealing here with ina new and clearly useful asymptotic in terpretation series connected, contrast with the classical case, with the relative rather than the absolute error of approximation of the function. The sopartial sums of such a series, in contrast tofunction the classical case, points are usedas not much to approximate the values of the at specific to(which describe collective passage in question in thetheirpresent examplebehavior occursunder as x the+oo)limiting . -+
b. Asymptotic Sequences and Asymptotic Series Definition 2.
A sequence of asymptotic formulas
f(x) f(x)
7/Jo(x) + o (7/Jo(x)) , 7/Jo(x) + ¢1 (x) + o (¢1 (x)) ,
f(x)
7/lo(x) + 7/J 1 (x) + · · · + 7/Jn (x) + o ( 7/Jn (x) )
that areasvalithed over a base in the set X where the functions are defined, is written relation B
j ( X ) � 7/Jo ( X ) + 7j; 1 ( X ) + · · · + 7/Jn ( X ) + . . .
or, more briefly, as f (x) f in the given base B.
�
CXJ
7/J (x). It is called an asymptotic expansion of k =O k 2:.::
expansions we always haveIt is clearo 7/Jfrom this= 7/Jdefinition that in asymptotic ( n (x)) n + l (x) + o (¢n + l (x)) over the base and hence for any n = 0, 1 , 2, . . . we have 7/Jn + l ( x) = o (7/Jn ( x)) over the base is, each successive term of thein comparison expansion contributes its correction, which isthatasymptotically more precise with its predecessor. Asymptotic expansions usually arise in the form of a linear combination B,
B,
c0tp0(x) + c 1 tp 1 (x) + . . . + Cn iPn (x) + . . . of functions of some sequence { IPn ( x)} that is convenient for the specific
problem.
19 Asymptotic Expansions
602
Let X be a set with a base B defined on it. The sequence {
Definition 3.
identically zero on any element of Remark The condition that (
8
2.
B
9.
C
•
•
X
•
oo
00 ;
· · ·
B,
f(x) � Co<po(x) + CI
isrespect calledtoantheasymptotic expansion or asymptotic series of the function f with asymptotic sequence {
3.
19. 1 . 2 General Facts about Asymptotic Series
When we speakonly of thein asymptotic behavior of a function over a base we are interested the naturefunctions of the limiting behavior of the function, so that if two generally different f and g are equal on some element of the base they have the same asymptotic behavior over and should be considered equal in the Moreover, asymptotic sense. if we fix toin carry advanceout some asymptoticexpansion, sequencewe{
B
B,
19. 1 Asymptotic Formulas and Asymptotic Series
603
with the there limitedwillpossibilities of that any such system of functions { 'Pn }. To be specific, be functions are infinitesimal with respect to every term 'Pn of the given asymptotic system. Example Let 'Pn (x) = x1n , n = 0, 1, . . . ; then e - x = o ( rpn (x)) as x +oo. For that reason it is natural to adopt the following definitions. Definition 5. If { 'Pn ( x)} is an asymptotic sequence over the base B, a func tion f such that f(x) = o ( rpn (x) ) over B for each n = 0, 1 , . . . is called an asymptotic zero with respect to { 'Pn ( x)}. Definition 6 . Functions f and are asymptotically equal over the base B respect to a sequence of functions { 'Pn } that is asymptotic over B if the diwithfference f - is an asymptotic zero with respect to { 'Pn }. Proposition 1 . (Uniqueness of an asymptotic expansion). Let { 'Pn } be an asymptotic sequence of functions over a base B . ) If a function f admits an asymptotic expansion with respect to the sequence { 'Pn } over B, then that expansion is unique. b) If the functions f and admit an asymptotic expansion in the system { 'Pn }, then these expansions are the same if and only if the functions f and are asymptotically equal over B with respect to { 'Pn } Proof. a) Suppose the function rp is not identically zero on any element of B. We shall show that if f(x) = o ( rp(x) ) over B, and at the same time f(x) = crp(x) + o ( rp(x) ) over B, then c 0. Indeed, l f(x) l l crp(x) l - i o (rp(x)) I = l c l l rp(x) l - o (l rp(x) l) over B, and soButif ifl eif(x) > 0, there exists B1 E B at each point of which l f(x) l � l rp(x) l . o ( rp(x) ) over B, then there exists B2 E B at each point of which1 l1f(x) l :::; l�l l rp(x) l . Hence at each point x E B1 n B2 we would have to � l rp(x) l :::; ¥ 1 rp(x) l or, assuming l ei =/:- 0, 3 l rp(x) l :::; 2 l rp(x) l . But this is have =1- 0 attheeven one pointexpansion of B1 n Bof2a. function f with respect impossible ifusrp(x) Now let consider asymptotic to theLetsequence { 'Pn }. rpo(x) and corp f(x) = corpo(x) +o ( ) f(x) = (x) +o ( rpo(x) ) over B. Sub tracting the second equality from the first, we find that 0 = (co - co)rpo(x) + o ( rp0(x) ) over B. But 0 = o ( rp0(x) ) over B and so, by what has been proved, co - co = 0. we havef proved that c0{ = c},0 , then . . . , Cn - 1 = Cn - 1 for two expansions of the Iffunction in the system by the equalities 'Pn f(x) = Corpo(x) + · · · + Cn - 1 'Pn - 1 (x) + Cn 'Pn (x) + o ( rpn (x)) , --t
1 0.
g
g
a
g
g
�
=
=
�
f(x) = corpo(x) + · · · + Cn - 1 'Pn - 1 (x) + Cn 'Pn (x) + o ( rpn (x)) we find in the same way that Cn = Cn·
604
19 Asymptotic Expansions
By induction we now conclude that a) is true. b) If f(x) Co<po(x) + · · · + Cn
B
B,
=
B.
=
f(x) = f(O) + 2.1! f'(O)x + · · · + ..!:_n! f ( n ) (O)xn + o(xn ) asthexsystem --t 0. But even the function x 1 1 2 cannot be expanded asymptotically in 1, x, x2 , Thus one must not identify an asymptotic sequence
and asymptotic an asymptoticin it.expansion withmanyanymore canonical basetypesandoftheasymptotic expansionbeof any There are possible havior thanofcanthebeasymptotic described behavior by any fixed asymptoticis notsequence, soanthatexpan the description of a function so much n in termsOneofcannot, a preassigned asymptotic system as it theis theindefinite search forintegral such aofsiosystem. for example, when computing anofelementary function, functions, require in advance that thenotresultbe anbe elementary a composi tion certain elementary because it may function at all. The search for asymptotic formulas, like the computation of indefinite interest onlythan to thetheextent the result is simpler and more integrals, accessible istoofinvestigation originalthatexpression. b. Admissible Operations with Asymptotic Formulas The elementary arithmetic properties of the symbols o and 0 ( such properties as o(g) + o(g) = o(g), o(g) +O(g) = O(g) +O(g) = O(g), and the like ) have been studied along with the theory of limits ( Proposition 4 of Sect. 3.2). The following obvious proposition follows from these properties and the definition of an asymptotic expansion. Proposition 2 . ( Linearity of asymptotic expansions ) . If the functions f and .
•
• •
00
00
g admit asymptotic expansions f L=O an iPn and g L=O bn
c::'.
c::'.
00
19.1 Asymptotic Formulas and Asymptotic Series
605
Further expansionscases. and asymptotic formulas in general will properties involve moreof asymptotic and more specialized Proposition 3 . ( Integration of asymptotic equalities ) . Let f be a continuous
function on the interval I = [a, w[ (or I =]w, a]). a) If the function g( x) is continuous and nonnegative on I and the integral w I g( x) dx diverges, then the relations a
f(x) = O(g (x) ) , f(x) = o(g (x) ) , f(x) '"'" g(x)
as I
imply respectively that
F(x) = O(G(x) ) , F(x) = o(G(x)) , where
X
F(x) = J f(t) d t a
and
and
3
x -+ w
F(x) '"'" G(x) ,
X
G(x) = J g(t) d t . a
b) If the functions 'Pn ( x), n = 0, 1 , . . . , which are continuous and positive on I = [a, w[ form an asymptotic sequence as I 3 x -+ w and the integrals w
X
expansion f ( x) c:::o 2::: Cn 'Pn ( x) as I 3 x -+ w with respect to the asymptotic n =O sequence { 'Pn ( x)} of b) , then :F( x) has the asymptotic expansion :F( x) c:::o 00
"E
P . n=O Cn n (x) Proof. a) If f(x) = O ( g (x) ) as I x -+ w, there exists x0 E I and a constant M such that l f (x) l .S Mg (x) for x E [xo , w[. It follows that I f f(t) dt l .S I :? f(t) dt l + M J g(t) dt = o( J g(t) dt) . To prove the other two relations one can usex L 'Hopital 's rule (as in Example 7) , taking account of the relation G(x) = I g (t) dt -+ as I x -+ w. 3
a
a
�
a
oo
a
As a result, we find lim F(x) lim F'(x) = lim f(x) . I3 x -+w G (x) I3 x -+w G' ( ) I3 x -+w g( ) X
X
3
606
19 Asymptotic Expansions
1 , . . . ) applying L ' Hopital ' s rule (x) as I 3 x ----t w ( again,b) Since we findtPnthat tP (x) 'P (x) tf>� (x) I3limx -+w lf>nn+(IX ) I3limx-+w tf>�+(lX ) I3limx -+w 'Pnn+(lX ) c) The function rn (x) in the relation f(x) cotpo(x) + CI 'P I (x) + · · · + Cn 'Pn (x) + rn (x) , being the difference of continuousw functions on I, is itself continuous on I, and we obviously have Rn (x) J rn (t) dt ----t as I 3 x ----t w. But rn (x) o ( cpn (x) ) as I 3 x w and tPn (x) ----t as I 3 x ----t w. Therefore, again by L'Hopital ' s rule, it follows that in the equality :F(x) co lf>o(x) + c 1 tf> 1 (x) + · · · + Cn lf>n (x) + Rn (x) the quantity Rn (x) is o (tPn (x)) as I 3 x ----t w . D Remark Differentiation of asymptotic equalities and asymptotic series is generally not legitimate. The function f(x) = e- x sin(ex ) is continuously differentiable onasExample lR and is an asymptotic zero with respect to the asymptotic sequence { xln } x ----t +oo. The derivatives of the functions }n , up to a constant factor, agai nonlyhavefaithels toform . However thezero;function f'(x) = -ex sin( ex ) + cos( ex ) x\ asymptotic ' not be an it doesn expansion with respect to the sequence } } as xt ----teven+oo.have an asymptotic ----t 0
n =
0,
=
= 0 .
=
=
=
----t
X
0
0
=
=
5.
11.
{ n
19.1.3 Asymptotic Power Series
Intheyconclusion, let us examine asymptotic powerinseries in some detail, form, since are encountered relatively often, although a rather generalized as was the case in expansions Example 8.with respect to the sequence { xn ; 1 , . . . } , We shall study which is asymptotic as x ----t and with respect to { ln ; = 1 , . . . } , which ischange asymptotic as x ----t oo. Since these are both thex same object up to the of variable x � , we state the next proposition only for expansions with respect to the firstin sequence andexpansions then notewith the specifics ofthecertain of the formulations given the case of respect to second sequence. =
Proposition 4.
0
n
0,
n =
Let 0 be a limit point of E and let
f(x) '::::' ao + a 1 x + a2 x2 + · · ·
,
as E 3 x ----t O .
0,
19.1 Asymptotic Formulas and Asymptotic Series
607
x ---+ 0, a) (a f + f3g) '::::' L=O (aan + f3bn )xn n b) (f · g)(x) '::::' L=O Cn Xn , where Cn = aobn + a 1 bn - 1 + · · · + an bo , = n 0, 1, c) if bo -=/:- 0, then ( i ) (x) L; dn x n , where the coefficients d n can be =O n found from the recurrence relations n an = 2..: b k dn - k , . . . k =O d) if E is a deleted neighborhood or one-sided neighborhood of 0 and f is Then as E
3
00
;
00
0
0
0
;
c::::
n
00
continuous on E, then X
J0 f( t ) dt
c::::
a 1 x2 + . . . + --x an - 1 n + . . . a0x + 2 n
e) if in addition to the assumptions of d) we also have f E then
a� = (
n
+ 1)an + I ,
n
f ' ( x)
c::::
a� + a� x + · · ·
= 0, 1, . . . .
,
C ( l ) (E)
and
a) Thistheis properties a special case of Proposition 2. b) Using of o( ) (see Proposition 4 of Sect. 3. 2 ), we find that Proof.
(f . g)(x) = f(x) . g(x) = = (ao + a 1 x + · · · + an x n + o(xn ))(bo + b 1 x + · · · + bn x n + o(xn )) = = (aobo) + (aob l + a 1 bo)x + · · · + (aobn + a 1 bn - l + · · · + an bo)xn + o(xn ) as Ec) Ifx b---+ -=1-0. 0, then g(x) -=1- 0 for close to zero, and therefore we can x 0 consider the ratio ;(�j = h(x). Let us verify that nif the coefficients d0, . . . , dn inin the representation h( x) = do + d 1 x + ·n· · + dn x + rn ( x) have been chosen accordance with c), then rn (x) = o(x ) as E x ---+ 0. From the identity f (x) = g(x)h(x), we find that ao + a 1 x + · · · + an x n + o(xn ) = = (bo + b 1 x + · · · + bn xn + o(xn))(do + d 1 x + · · · + dn x n + rn (x)) = = (bodo) + (bod l + b 1 do )x + · · · + (bodn + b 1 dn - l + · · · + bn do)xn + + born (x) + o ( rn (x) ) + o(x n ) , 3
3
608
19 Asymptotic Expansions
from which it follows that o(xn ) born (x)+o(rn (x))+o(xn ), or rn (x) o(xn ) as Ed) This x -+ 0, since bo -1=- 0. follows 0 from part c ) of Proposition 3 if we set = 0 there and recall that I f(t) dt I0 f(t) dt . e) Since the function f'(x) is continuous on ]0, x] (or [x, 0[) and bounded ( it tends to a� as x -+ 0 ) , the integal I f'(t) dt exists. Obviously f (x) = a0 + 0 I0 f' (t) dt , since f(x) -+ a0 as x -+ 0. Substituting the asymptotic expansion of f'(x) into this equality and using what was proved in d) , we find that a� - 1 xn + . . . a� x2 + . . . + -f(x) -::::: a0 + a0' x + 2 It now follows from the uniqueness of asymptotic expansions ( Proposition 1) that a� ( + 1)an , 0, 1 , . . . . D Corollary 1 . If U is a neighborhood (or one-sided neighborhood) of infinity in � and the function f is continuous in U and has the asymptotic expansion =
3
-
w
X
=
X
=
X
X
=
n
n
n =
a1 + a2 + · · · + an + f(x) -::::: a0 + 2 xn x x
then the integral F
00
(x) =
over an interval contained in expansion:
j ( (t) X
!
-
ao
·
-
·
as
·
3
U x -+
oo ,
�1 ) dt
U converges and has the following asymptotic
The convergence of the integral is obvious, since a a f ( t) - ao -1 "' -22 as U t -+ t t It remains only to integrate the asymptotic expansion a a a a f ( t) - a0 - -1 -::::: -22 + 33 + + nn . as U t -+ t t t t citing, for example, Proposition 3d ) . D Proof.
oo .
3
-
�
·
·
·
-
·
3
·
oo ,
If in addition to the hypotheses of Corollary 1 it is known that and f' admits the asymptotic expansion
Corollary 2 .
f E cC l l (U) f
,
a� + a� + . . . + -a' + . . . (x) -::::: a0 + x x2 1
n
as
3
U x -+
00 ,
19.1 Asymptotic Formulas and Asymptotic Series
609
then this expansion can be obtained by formally differentiating the expansion of the function f, and a� = - ( n - 1)an- 1 ,
n=
2 , 3, . . .
Proof. Since J' (x) = a� + � + 0( 1/x 2 ) as f(x) = f(xo) +
X
U
and a� = a� = 0 . ::7
x -+ oo , we have
J f' (t) dt = a�x + a� ln x + 0 ( 1 ) xo
U
as ::7 x -+ oo; and since f(x) � a 0 + 7" + � + · · · and the sequence x, ln x, 1 , � ' � ' . . . is an asymptotic sequence as ::7 x -+ oo , Proposition 1 enables us to conclude that a� = a� = 0. Now, integrating the expansion J ' (x) � � + � + · · · , by Corollary 1 we obtain the expansion of f(x) , and by the uniqueness of the expansion we arrive at the relations a� = - ( n - 1)an _ 1 for n = 2 , 3, . . . . D
U
19. 1 .4 Problems and Exercises
a) Let h(z) = I: a n Z- n for lz l > R, z E C. Show that then h(z) � I: a n Z- n n=O n=O as C 3 z -+ oo. b) Assuming that the required solution y(x) of the equation y' (x) + y 2 (x) = sin -fx has an asymptotic expansion y(x) � f C n X - n as x -+ oo, find the first three n=O terms of this expansion. c) Prove that if f(z) = f a n z n for lz l < r, z E C, and g(z) � b 1 z + b2 z 2 + · · as n=O C 3 z -+ 0, then the function f o g is defined in some deleted neighborhood of 0 E C and (f o g) (z) � c0 + c1z + c 2 z 2 + as C 3 z -+ 0, where the coefficients c0 , c 1 , . . . can be obtained by substituting the series in the series, just as for convergent power series. =
=
1.
·
·
2.
·
·
Show the following. a) If f is a continuous, positive, monotonic function for x
t f( k ) k =O b) c)
k
�
0, then
= ]O f(x) dx + o (f(n) ) + 0(1) as n -+ oo ;
£: f: = ln n + c + o(1) as n -+ oo. £:= l k" (ln k) /3 n "+��� n ) �' as n -+ oo for a > - 1 .
k =l
2:
19 Asymptotic Expansions
610
3. Through integration by parts find the asymptotic expansions of the following functions as x -+ +oo: + oo a) r. (x) = I t • - l e - t dt - the incomplete gamma function;
b) erf (x)
00 I e-x 00
-
2
dx =
c) F(x)
=
X
=
Jrr
2 I e - t dt - the probability error function (we recall that X
-x V7f is the Euler-Poisson integral) . +oo " t
I �� dt if a > 0.
X
4. Using the result of the preceding problem, find the asymptotic expansions of the following functions as x -+ +oo: 00 a) Si (x) = f si t dt - the sine integral (we recall that I s i � x dx � is the
0
Dirichlet integral) . b) C(x)
=
�
J0 cos � e dt , S(x) J0 sin � e dt - the Fresnel integrals (we recall 00 2 =
that I cos x 2 dx = I sin x dx + oo
0
=
0
0
=
� yif) .
The following generalization of the concept of an expansion in an asymptotic sequence { 'P n (x) } introduced by Poincare and studied above is due to Erdelyi? Let X be a set, B a base in X, { 'P n (x) } an asymptotic sequence of functions on X. If the functions f(x) , '1/Jo (x) , 'I/J 1 (x) , 'I/J2 (x) , . . . are such that the equality 5.
f(x) = t 'I/Jk (x) + o ( cp n (x) ) over the base k =O holds for every n = 0, 1 , . . . , we write
B
00
{ 'P n (x) } over the base B , n =O and we say that we have the asymptotic expansion of the function f over the base B in the sense of Erdelyi. a) Please note that in Problem 4 you obtained the asymptotic expansion in the sense of Erdelyi if you assume 'P n (x) = x - n , n = 0, 1 , . . . . b) Show that asymptotic expansions in the sense of Erdelyi do not have the property of uniqueness (the functions '1/J n can be changed) . c ) Show that i f a set X, a base B i n X, a function f o n X, and sequences {J.t n (x) } and { 'P n (x) } , the second of which is asymptotic over the base B, are given, then the expansion f(x)
f(x)
�
�
L '1/Jn (x) ,
00
,L: a n J.t n (x) , {cpn (x) } over the base B ,
n =O
where a n are numerical coefficients, is either impossible or unique. 2 A. Erdelyi (1908- 1977 ) - Hungarian/British mathematician.
19.1 Asymptotic Formulas and Asymptotic Series
61 1
6. Uniform asymptotic estimates. Let X be a set and l3x a base in X, and let and g(x, y) be ( vector-valued ) functions defined on X and depending on the parameter y E Y. Set 1/(x, y) l = a(x, y) i g (x, y)l. We say that the asymptotic relations
f(x, y)
f(x, y) = o(g (x, y) ) , f(x, y) = o (g(x, y) ) , f(x, y) "' g(x, y) are uniform with respect to the parameter y on the set Y if ( respectively ) a( x, y) � 0 on Y over the base Bx ; a(x, y) is ultimately bounded over the base !3, uniformly with respect to y E Y; and finally f = a · g + o ( g ), where a(x, y) � 1 on Y over the base l3x . Show that if we introduce the base l3 = {!3, x Y} in X x Y whose elements are the direct products of the elements B, of the base Bx and the set Y, then these definitions are equivalent respectively to the following:
f(x, y) o(g (x, y) ) , J(x, y) o (g (x, y) ) , J(x, y) "' g(x, y) =
=
over the base !3.
7. Uniform asymptotic expansions. The asymptotic expansion 00
f(x, y) l: an (y)
over the base Bx in X holds uniformly on Y in the equalities
n f(x, y) 2: a k (y)
n =
�
00
[ J (x, y) dy � ([ an (Y) )
dy
over the base Bx .
b ) Let Y = [c, dJ C JR. Assume that the function f(x, y) is continuously differ entiable with respect to y on the closed interval Y for each fixed x E X and for some Yo E Y admits the asymptotic expansion
f(x, yo) l: a,. (yo)
over the base Bx .
Prove that if the asymptotic expansion
8f a n (Y)
00
over the base Bx
612
19 Asymptotic Expansions
holds uniformly with respect to y E Y with coefficients a n (Y) that are continuous in y, n = 0, 1 , . . . , then the original function f(x, y ) has an asymptotic expansion j(x, y) � L a n (y)cp n (x) over the base Bx that is uniform with respect to y E Y, n =O its coefficients a n (y) , n = 0, 1, . . . are smooth functions of y on the interval Y and � ( y ) = On (y) . oc
Let p(x) be a smooth function that is positive on the closed interval c :::; x :::; d. a ) Solve the equation � (x, .X) = .\ 2 p(x)u(x, .X) in the case when p(x) = 1 on [c, d] . b ) Let 0 < m :::; p(x) :::; M < +oo on [c, d] and let u(c, .X) = 1 , �� (c, .X) = 0. Estimate the quantity u(x, .X) from above and below for x E [c, d] . c ) Assuming that ln u(x, .X) � f; cn (x) .\ 1 -n as .X --+ + oo , where eo (x) , c1 (x), . . . n =O are smooth functions and, using the fact that ( �) 1 = :!!__ ( .!!.... ) 2 show that � 2 (x) = p(x) and ( c� - 1 + 'f c� · c� - k ) (x) = 0. k =O 8.
II
u
-
1
u
'
1 9 . 2 The Asymptotics of Integrals (Laplace's Method) 1 9 . 2 . 1 The Idea of Laplace's Method
In this subsection we shall discuss Laplace ' s method - one of the few reason ably general methods of constructing the asymptotics of an integral depend ing on a parameter. We confine our attention to integrals of the form b
F(>-.) J f(x)e>. S (x) dx , =
a
(19.1)
where S(x) is a real-valued function and ).. is a parameter. Such integrals are usually called Laplace integrals.
Example
1.
The Laplace transform +co
L(f) (�)
=
J f(x)e -t;x dx 0
is a special case of a Laplace integral.
Example b
2.
Laplace himself applied his method to integrals of the form
J f(x)cpn (x) dx, where
a
n
E N and cp(x) > 0 on ]a, b[. Such an integral is also a
special case of a general Laplace integral (19. 1 ) , since cpn (x) = exp ( n ln cp(x)).
19.2 The Asymptotics of Integrals (Laplace's Method}
613
We shall be interested in the asymptotics of the integral ( 19. 1 ) for large values of the parameter A, more precisely as A -+ +oo, A E So as not to become distracted with secondary issues when describing the basic idea of Laplace ' s method, we shall assume that [a, b] = I is a finite closed interval in the integral ( 19. 1 ) , that the functions f(x) and S(x) are smooth on I, and that S(x) has a unique, strict maximum S(x0) at the point xo E I. Then the function exp (AS(x)) also has a strict maximum at xo, which rises higher above the other values of this function on the interval I as the value of the parameter A increases. As a result , if f(x) ¢. 0 in a neighborhood of xo, the entire integral ( 19. 1) can be replaced by the integral over an arbitrarily small neighborhood of x0, thereby admitting a relative error that tends to zero as A -+ +oo. This observation is called the localization principle. Reversing the historical sequence of events, one might say that this localization principle for Laplace integrals resembles the principal of local action of approximate identities and the 8-function. Now that the integral is being taken over only a small neighborhood of x0, the functions f(x) and S(x) can be replaced by the main terms of their Taylor expansions as I 3 x -+ Xo. It remains to find the asymptotics of the resulting canonical integral, which can be done without any particular difficulty. It is in the sequential execution of these steps that the essence of Laplace ' s method of finding the asymptotics of an integral is to be found.
�-
Example 3. Let x0 = a, S'(a) =/=- 0, and f(a) =/=- 0, which happens, for example, when the function S(x) is monotonically decreasing on [a, b]. Under these conditions f(x) = f(a) + o(1) and S(x) = S(a) + (x - a)S'(a) + o(1), as I 3 x -+ a. Carrying out the idea of Laplace ' s method, for a small c > 0 and A -+ +oo, we find that
F(A)
a+c
rv
J f(x) e>.S(x ) dx a
g
rv
f(a) e >.S( a ) J e >. t S ' ( a ) dt = 0 !( a ) e >.S( a ) >.S ' ( a ) c ) . = - AB ' (a) ( 1 - e
S'(a) < 0, it follows that in the case in question e >.S( a ) as A -+ +oo . ( 19.2) F(A) "' - f(a) AB' (a) Example 4 - Let a < xo < b. Then S'(xo) = 0, and we assume that S"(xo) =f. 0, that is, S"(x0) < 0, since xo is a maximum. Using the expansions f(x) = f(xo) + o(x - xo) and S(x) = S(xo) + �S"(x0)(x - x0) 2 + o((x - x0) 2 ), which hold as x -+ x0, we find that for small Since
19 Asymptotic Expansions
614
e > 0 and
A
---+
+oo
F(A) "'
��
g
J f(x) e>.S(x ) dx "' f(xo) e>.S(xo ) J e � >.S" (xo ) t2 dt .
xo - c
-g Making the change of variable �AS"(x0) t 2 = -u2 ( since S"(x0) < 0) , we obtain cp (>. , g ) g t2 xo Jg e � >.S" ( ) dt = AS'�xo) - cp (>. , g ) >.S"(-2-x 0-e) +oo as A +oo. where cp( A, e ) = V - ---+
f
---+
Taking account of the equality
+oo
J e - u2 du
- oo
=
.Jii ,
we now find the principal term of the asymptotics of the Laplace integral in this case: 27r (19.3) F(A) ,...., xo ) e >.S( xo ) as A\ ---+ +oo .
AB"(xo) J( If x0 = a , but S'(x0) = 0 and S"(x0) < 0, then, reasoning as in
Example 5. Example 4, we find this time that
a+g
g xo x ) ) >.S( >.S( J e �>.S" (xo) t2 dt ' F(A) ,...., J f(x) e dx ,...., f(xo) e 0 a and so
AS" (xo) J(xo ) 2 7f
F(A) ,...., -21
e
>.S( xo )
as
A ---+ \
+oo .
(19.4)
We have now obtained on a heuristic level the three very useful formulas ( 19.2) - ( 1 9.4) involving the asymptotics of the Laplace integral (19. 1 ) . It i s clear from these considerations that Laplace ' s method can b e used successfully in the study of the asymptotics of any integral
j f(x, A) x
d as
X
A
---+
+oo
(19.5)
provided ( a ) the localization principle holds for the integral, that is, the in tegral can be replaced by one equivalent to it as A ---+ +oo extending over
19.2 The Asymptotics of Integrals (Laplace's Method)
615
arbitrarily small neighborhoods of the distinguished points, and ( b ) the in tegrand in the localized integral can be replaced by a simpler one for which the asymptotics is on the one hand the same as that of the integral being investigated and on the other hand easy to find. If, for example, the function S(x) in the integral ( 1 9 . 1 ) has several local maxima x0, X I , . . . , Xn on the closed interval [a, b], then, using the additivity of the integral, we replace it with small relative error by the sum of similar integrals taken over neighborhoods U(x1 ) of the maxima x0, x i , . . . , X n so small that each contains only one such point . The asymptotic behavior of the integral f ( x )e >- S ( x ) dx as A --+ + oo ,
J
U ( xj )
as already mentioned, is independent of the size of the neighborhood U(x1 ) itself, and hence the asymptotic expansion of this integral as A --+ + oo is denoted F(A, x1 ) and called the contribution of the point Xj to the asymptotics of the integral ( 19. 1 ) . In its general formulation the localization principle thus means that the asymptotic behavior of the integral ( 19.5) is obtained as the sum 2::: F(A, Xj )
j
of the contributions of all the points of the integrand that are critical in some respect . For the integral ( 19. 1) these points are the maxima of the function S(x), and, as one can see from formulas ( 19.2)-( 19.4) , the main contribution comes entirely from the local maximum points at which the absolute maximum of S(x) on [a, b] is attained. In the following subsections of this section we shall develop the gen eral considerations stated here and then consider some useful applications of Laplace ' s method. For many applications what we have already discussed is sufficient. It will also be shown below how to obtain not only the main term of the asymptotics, but also the entire asymptotic series. 19.2.2 The Localization Principle for a Laplace Integral Lemma 1. ( Exponential estimate ) .
Let M = sup
a<x
S(x ) < oo,
and suppose
that for some value Ao > 0 the integral ( 19. 1) converges absolutely. Then it converges absolutely for every A ;:::: Ao and the following estimate holds for such values of A : b
I F(A) I ::; J i f ( x )e >- S where A E JR.
a
(x)
l dx ::; Ae >-M ,
( 19.6)
616
19 Asymptotic Expansions
Proof. Indeed, for A 2: Ao ,
II
l II
b
I F(A) I =
l
b
f(x) e>.S (x) dx =
a
a
b
:::; e ( >. - >. o) M
f(x) e>. o S (x) e ( >. - >. o) S (x) dx :::;
I
b
(
lf(x) e>. o S (x) I dx = e->. o M
a
I lf(x)e>.o S (x) I dx) e>.M . D a
Suppose the integral ( 19. 1 ) converges absolutely for some value A = Ao , and suppose that in the interior or on the boundary of the interval I there is a point x0 at which S(xo) = sup S(x) = M . If f(x) and S(x) are continuous at xo and a< x 0 and every sufficiently small neighborhood UI (xo) of xo in I we have the estimate
Lemma 2 . (Estimate of the contribution of a maximum point).
II
l
f( x) e>.S (x) dx :=:: Be>. ( S (xo) - e: )
U1 (xo)
(19.7)
with a constant B > 0, valid for A 2: max{..\0 , 0} . Proof. For a fixed c > 0 let us take any neighborhood U1 (x0) inside which l f (x) l 2: � l f (xo) l and S(xo ) - c :::; S(x) :::; S(xo ) . Assuming that f is real valued, we can now conclude that f is of constant sign inside U1 (x) . This enables us to write for A 2: max{ Ao , 0}
II
l I
f( x) e>.S (x) dx =
lf( x) l e>.S (x) dx
UI (xo)
U1 (xo)
2:
I
UI (xo)
�
2:
l f(xo ) l e>. ( S (xo) - e: ) dx
=
Be>. ( S (x o ) - e: ) . D
Suppose the integral ( 19. 1) con verges absolutely for a value A = Ao , and suppose that inside or on the bound ary of the interval I of integration the function S(x) has a unique point x0 of absolute maximum, that is, outside every neighborhood U(x0) of the point xo we have sup S(x) < S(xo) . I\ U (xo) If the functions f (x) and S(x) are continuous at xo and f(x0) :/=: 0, then Proposition 1 . (Localization principle) .
F(A)
=
Fu1 (xo) (A) ( 1 + O ( r = ) ) as A ---7 + oo ,
(19.8)
19.2 The Asymptotics of Integrals (Laplace's Method) where
617
Ur(x0) is an arbitrary neighborhood of x0 in I, Fui (x 0 ) (>.)
:=
J
UI (xo)
f (x) e >. S (x) dx ,
and 0 ().. - oo ) denotes a function that is o ( ).. - n ) as )..
� +oo for every E N . Proof. It follows from Lemma 2 that if the neighborhood U1(x0) is sufficiently small, then the following inequality holds ultimately as ).. � +oo for every E>O x0
n
I FU1 (xo ) (>.) [ > e>. ( S (xo) - c: ) · (19.9) At the same time, by Lemma 1 for every neighborhood U(x0) of the point we have the estimate
J
I\ U (xo)
l f (x) l e >. S (x) dx � Ae >.Jl
as )..
� +oo ,
(19. 10)
11 =
sup S(x) < S(x0). x EI\U (xo) Comparing this estimate with inequality ( 19.9) , it is easy to conclude that inequality ( 19.9) holds ultimately as ).. +oo for every neighborhood Ur(x0) of xa. It now remains only t o write
where A > 0 and
�
and, citing estimates (19.9) and (19. 10) , conclude that ( 19.8) holds. D Thus it is now established that with a relative error of the order 0 (>. - oo ) as +oo when estimating the asymptotic behavior of the Laplace integral, one can replace it by the integral over an arbitrarily small neighborhood Ur(x0) of the point x0 where the absolute maximum of S(x) occurs on the interval I of integration.
).. �
19.2.3 Canonical Integrals and their Asymptotics Lemma 3. ( Canonical form of the function in the neighborhood of a critical point ) . If the real-valued function S(x) has smoothness cC n+ k ) in a neighbor hood (or one-sided neighborhood) of a point x 0 E � ' and
S'(xo) = · · · = s C n - l ) (xo)
=
0,
s C n l (xo) =/=- 0 ,
and k E N or k = oo, then there exist neighborhoods (or one-sided neighbor hoods) Ix of x o and Iy of O in � and a diffeomorphism r.p E C k ) (Iy , Ix ) such that
618
19 Asymptotic Expansions S (
= S(x0) +
Here
s
y n , when y E Iy and
= Xo
and
1
s
= sgn S ( n l (x0) .
(0) = ( I S (nn!l (xo ) )
1/ n
l Proof. Using Taylor ' s formula with the integral form of the remainder,
S(x)
=
1
� j J s(n) (xo + t(x - Xo ) ) ( 1 - t) n 1 dt '
( -x n S(xo ) + �, n-
-
0
we represent the difference S(x) - S(x0 ) in the form
S(x) - S(xo )
= (x - xo )nr(x) ,
where the function 1
r(x)
= ( n � 1) ! J S(n) (xo + t(x - Xo ) ) ( 1 - t) n- 1 dt , 0
by virtue of the theorem on differentiation of an integral with respect to n the parameter x, belongs to class C (k) , and r(x0) = � n . S( l (x0 ) -1- 0. Hence the function y = 'l/;(x) = (x - x0 ) v' l r(x) l also belongs to C( k) in some neighborhood (or one-sided neighborhood) Ix of x0 and is even monotonic, since s ( xo ) n '1/J' (x o ) = v' lr( xo ) l = c n�; l f / -1- 0 0
In this case the function '1/J on Ix has an inverse '1/J - 1 =
Remark 1 . The cases n ones of most interest.
= 1 or n = 2 and k = 1 or k = oo are usually the
Suppose the interval of integration I = [a, b] in the integral (19. 1 ) is finite and the following conditions hold: a ) f, S E C(I, lR) ; b) max S ( x) is attained only at the one point x0 E I; xE I n c ) S E C( ) (UI (x0 ) , JR) in some neighborhood U1 (x0 ) of x0 {inside the interval I); d) sC n l (x0) -1- 0 and if 1 < n, then S(ll (x0 ) = . . . = sC n - 1 l (x0 ) = 0.
Proposition 2 . (Reduction) .
19.2 The Asymptotics of Integrals (Laplace's Method)
619
Then as A --+ +oo the integral (19. 1 ) can be replaced by an integral of the form n R(A) = r ( y ) e - y dy
e>.S(xo) I
>.
Iy
with a relative error defined by the localization principle ( 19.8) , where Iy = [ - c-, c-] or Iy = [O , c-], c is an arbitrarily small positive number, and the func tion r has the same degree of smoothness Iy that f has in a neighborhood of
xo.
Proof Using the localization principle, we replace the integral ( 1 9 . 1 ) with the integral over a neighborhood I = UI (xo) of xo in which the hypotheses of Lemma 3 hold. Making the change of variable x = cp( y ), we obtain
x
I f(x) e>.S(x) dx = ( I f ( cp( y ) ) cp (y ) e- >.yn dy)e>.S(xo) . '
Iy
Ix
(19. 1 1 )
The negative sign i n the exponent ( - A Y n ) comes from the fact that by hypothesis x0 = cp( O ) is a maximum. D
The asymptotic behavior of the canonical integrals to which the Laplace integral ( 19 . 1 ) reduces in the main cases is given by the following lemma.
Let a > 0, (3 > 0, 0 < a � oo, and f E C ( [O, a] , IR ) . Then with respect to the asymptotics of the integral
Lemma 4. (Watson 3 ) .
a
W(A) = I xf3 - l f(x) e ->. x "' dx 0 as A --+ +oo, the following assertions hold: a) The main term of the asymptotics of ( 19.12 ) has the form
1 .x- a + O ( A - --u# +-I ) , W(A) = -f(O)F((3/a) if it is known that f(x) = f(O) + O(x) as x --+ 0 . b) If f(x) = ao + a 1 x + · · · + an x n + O(x n + l ) as x --+ 0, then W(A) = � t a k r ( k : f3 ) .x - � + O (.x- n +g+ 1 ) . k =O Q
3
{3
G.H. Watson (1886- 1965) - British mathematician.
( 19. 12 )
( 19. 13 )
( 19.14 )
620
19 Asymptotic Expansions
c) If f is infinitely differentiable at x = 0, then the following asymptotic expansion holds: + f3) � f C kl ( ) r , (19. 15) W(A) ':::' �a � a L.., k =O which can be differentiated any number of times with respect to A. Proof. We represent the integral ( 19. 12) as a sum of integrals over the interval ]0, c-] and [c-, a[, where c is an arbitrarily small positive number. By Lemma 1
k! o r (
I J xf3-l f (x )e- >-.xa dx l a
E
and therefore
:<::::
k
Ae - >-. .:a =
O ( r00 )
E
W(A) = J xf3- l f ( x )e - >-.xa dx + O ( r = )
as
as
A ---+ +oo ,
A ---+ +oo .
0
n In case b ) we have f ( x ) I: a k x k + rn ( x ) , where rn E C[O, c] and k =O i rn ( x ) i :<:::: Cxn + l on the interval [O, c-]. Hence E E n W(A) = L a k x k +f3- l e ->-.xa dx + c(A) xn+ f3 e - >-.xa dx + o(A - ) , k =O 0 0
J
where c(A) is bounded as A ---+ +oo. By Lemma 1 , as A ---+ +oo, E + k + >-.xa l f3 x k + f3 - l e ->-.xa dx + O(A - 00 ) dx = x - e
oo J
J
.
0
0
But
oo
J
oo k J x +f3- l e - >-.xa dx = � r ( k : {3 ) A - � , + 0
from which formula ( 19. 14) and the special case of it, formula ( 19. 13) , now follow. The expansion ( 19. 15) now follows from ( 19. 14) and Taylor ' s formula. The possibility of differentiating ( 19. 15) with respect to A follows from the fact that the derivative of the integral ( 19. 12) with respect to the parameter A is an integral of the same type as ( 19. 12) and for W' ( A ) one can use formula ( 19. 15) to present explicitly an asymptotic expansion as A ---+ +oo that is the same as the one obtained by formal differentiation of the original expansion (19.15). D
621
19.2 The Asymptotics of Integrals (Laplace's Method) Example
6.
Consider the Laplace transform
F(>.) =
+oo
j0 f(x) e- >-x dx ,
which we have already encountered in Example 1. If this integral converges absolutely for some value ,\ = ).. 0 and the function f is infinitely differentiable at x = 0, then by formula (19.15) we find that
00 f ( k l r < k + I ) F(>.) L (o) =O k c::::
as
>. � +oo .
19.2.4 The Principal Term of the Asymptotics of a Laplace Integral Theorem 1 . (A typical principal term of the asymptotics ) .
Suppose the in terval of integration I = [a, b] in the integral (19.1) is finite, f, S E C(I, JR) , and max S ( x) is attained only at one point x0 E I . x EI Suppose it is also known that f(x0) -=/:- 0, f(x) = f(xo) + O(x - xo) for I 3 x � x0, and the function S belongs to C ( k ) in a neighborhood of x0 . The following statements hold. a) If xo = a, k = 2, and S'(x0) -=f. 0 (that is, S'(x0 ) < 0}, then
S( xo r 1 0 r 1 F(>.) = -Sf(xo) ' (xo) e>- l [1 + ( )]
b) if a < xo
as
,\ � + oo ;
(19.2')
< b, k = 3, and S"(xo) -=f. 0 (that is, S"(xo) < 0}, then
) if
x0 = a, k = 3, S'(a) = 0, and S"(a) -=f. 0 (that is, S"(a) < 0}, then F(,\) = - 2S"(xo ) f(x0 ) e>-S ( xo l r 1 1 2 [1 + 0 (>.- 1 1 2 )] as ,\ � +oo (19 4') c
7r
·
·
Proof Using the localization principle and making the change of variable
x = cp(y) shown in Lemma 3, according to the reduction in Proposition 2, we
arrive at the following relations: c
a) F(,\) = e>- S( xo ) ( (f o cp)(y)cp'(y) e->- Y dy + O( r 00 ) ) ; 0
j
19 Asymptotic Expansions
622
E:
b) F(>.. ) = e>.s(xo) ( l(f o
- E:
0
E:
F(>..) = e>.s(xo) ( l (f o
)
0
D
Now let us consider some examples o f the application o f this theorem. Example 7. The asymptotics of the gamma function. The function +oo dt > =
r(>.. + 1) 1 t )..e - t
(>.. -1)
0
can be represented as a Laplace integral +oo e e
r(>.. + 1) = I -t >. n t dt , 0
and if for integral
I
>.. > 0 we make the change of variable t = >..x , we arrive at the +oo + l >.,A + r(>.. 1) = I e - >.(x -ln x) d ' 0
x
which can be studied using the methods of the theorem. The function S x ) ln x - x has a unique maximum x on the interval 0, oo , and S" ( 1 ) == By the localization principle (Proposition and assertion b) of Theorem we conclude that
( = =1 1) -1.1, r(>.. + 1) = � (�) ).. [ 1 + 0 (>.. - 112)] as >.. ---+ + In particular, recalling that F(n + 1) = n! for n E N, we obtain the classical Stirling 's formula4 n! = �(n/e)n[1 + O(n-112)] as n ---+ nEN. ---] + [
oo
oo ,
4 See also Problem 10 of Sect. 7.3.
0
19.2 The Asymptotics of Integrals (Laplace's Method) Example
8.
623
The asymptotics of the Bessel function
11" In (x) = � J ex cos e cos nO dO , 0 where n E N. Here f ( O ) = cos nO, S(O) = cos O, omax :<;; x :<;; 7r S(O) = S(O) = 1, S'(O) = 0, and S"(O) = - 1, so that by assertion c ) of Theorem 1 ex [1 + O(x - 1 1 2 )] as x -t +oo . In (x) = y� 21TX Example 9. Let f E C{ l l ( [a, b], IR ) , S E c< 2 l ( [a, b], IR ) , with S(x) > 0 on [a, b], and a:<;;max S(x) is attained only at the one point x0 E [a, b]. If f (xo) of- 0, x :<;; b S'(x0) = 0, and S"(x0) of- 0, then, rewriting the integral .
b
.F (A) = j f (x) [S(x) f dx a
in the form of a Laplace integral b
n
( )
.F (A) = J f (x) e>. i S x dx ,
a on the basis of assertions b ) and c ) of Theorem
1, we find that as A -t +oo [S(xo)]>.+1 / 2 A - 1 / 2 [1 + O(A - 1 / 2 )] '
.F (A) = f (xo) -S�'�xo) = 1 if a < x0 < b and = 1/2 if x0 = a or x0 = b. c
where
c
Example
1 0.
c
The asymptotics of the Legendre polynomials
11"
j
Pn (x) = � (x + � cos O ) n dO 0 in the domain x > 1 as n -t oo, n E N , can be obtained as a special case of the preceding example when f 1, S(O) = x + � cos O , 0max :<;; 6 :<;; 11" S(O) = S(O) = x + � , S'(O) = 0 , 8"(0) = - � . =
Thus,
624
19 Asymptotic Expansions
1 9 . 2 . 5 *Asymptotic Expansions of Laplace Integrals
Theorem 1 gives only the principal terms of the characteristic asymptotics of a Laplace integral (19.1) and even that under the condition that f(x0) -:/- 0. On the whole this is of course a typical situation, and for that reason Theorem 1 is undoubtedly a valuable result . However, Watson ' s lemma shows that the asymptotics of a Laplace integral can sometimes be brought to an asymptotic expansion. Such a possibility is especially important when f(xo) = 0 and Theorem 1 gives no result . It is naturally impossible to get rid of the hypothesis f(x0) -:/- 0 com pletely, without replacing it with anything, while remaining within the limits of Laplace ' s method: after all, if f(x) = 0 in a neighborhood of a maximum x0 of the function S(x) or if f (x) tends to zero very rapidly as x -+ xo , then x0 may not be responsible for the asymptotics of the integral. Now that we have arrived at a certain type of asymptotic sequence { e>. c A -pk } , (Po < p 1 < · · · ) as A -+ + oo as a result of the considerations we have studied, we can speak of an asymptotic zero in relation to such a sequence and, without assum ing that f(x0) -:/- 0, we can state the localization principle as follows: Up to an asymptotic zero with respect to the asymptotic sequence { e>.S (xo) A -pk } (Po < p 1 < · · · ) the asymptotic behavior of the Laplace integral (19.1) as A -+ +oo equals the asymptotics of the portion of this integral taken over an arbitrarily small neighborhood of the point x0 , provided that this point is the unique maximum of the function S(x) on the interval of integration. However, we shall not go back and re-examine these questions in order to sharpen them. Rather, assuming f and S belong to cC = l , we shall give a derivation of the corresponding asymptotic expansions using Lemma 1 on the exponential estimate, Lemma 3 on the change of variable, and Watson ' s lemma (Lemma 4) .
Let I = [a, b] be a finite interval, f, S E C(J, �), and assume max S(x) is attained only at the point x0 E I xE I and f , S E C ( oo ) ( UI (x0 ) , �) in some neighborhood U1 (x0) of x0 . Then in re lation to the asymptotics of the integral (19.1) the following assertions hold. a) If x0 = a, sC m l (a) -:/- 0, sUl (a) = 0 for 1 ::; j < m, then
Theorem 2 . (Asymptotic expansion) .
00
F(A) ::::::' A - l fm e>.S ( a ) L a k A - k / m as A -+
k =O
where
+ oo ,
(- 1) k+ l mk r k + 1 h (x , a ) d k ( f(x) h (x , a )) l ( � ) ( ldxl ) x=a ' k! h (x, a) = ( S( a ) S(x) ) - / m j S' (x) . -
(19.16)
19.2 The Asymptotics of Integrals (Laplace's Method) b)
625
If a < Xo < b, S ( 2m l (xo) =1- 0, and sUl (xo) = 0 for 1 � j < 2m, then (19.17) F(.>.. ) ::::: ).. - l / 2m e.X S( xo ) L Ck A - k / m as ).. ---+ +oo ' k =O CXJ
where
2 k + l 2 k 2k + 1 d 2k Ck = 2 (-1) (2k)( !2m ) r ( � ) ( h(x, xo) dx ) ( f(x)h(x, xo)) i x = xo ' h (x, xo) = ( S(xo) - S(x)) 1 - 2!, /S'(x) . ( n l (xo)(x - xo) n as x ---+ x0, then the c) If f ( n l (xo) =f. 0 and f(x) �f . n main term of the asymptotics in the cases a) and b) respectively has the form rv
F(.>.. ) = ]:_m .>.. - � e.X S( a ) r ( n m+ 1 ) ( I Smm!(a) l ) � x X [ � f ( n l (a) + 0 (.>.. - � )] , (19.18) ! + 1 ) ( (2m) ! ) w X F(.>.. ) = ]:__m ).. - W e .X S( xo ) r ( n2m I S2m (xo) l X [ � f ( n ) (xo) + O (.>.. - W )] . (19.19) ! The expansions (19.16) and (19.17) can be differentiated with respect
d) to ).. any number of times.
Proof. It follows from Lemma 1 that under these hypotheses the integral (19.1) can be replaced by an integral over an arbitrarily small neighborhood of xo up to a quantity of the form e .X S( xo ) Q(.>.. - =) as ).. ---+ oo. Making the change o f variable x =
e -.X S ( xo )
ju
o
(19.20)
Iy
where Iy = [O , E], a = m if x0 = a, and Iy = [ - E, E] , a = 2m if a < x0 < b. The neighborhood in which the change of variable x =
626
19 Asymptotic Expansions If Iy =
[
]
- c- , c- ,
that is, in the case
( 19.20 ) into the form
c
a < xo <
b, we bring the integral
e>.S( xo ) ![(!
(19.21)
o
o
and, once again applying Watson ' s lemma, we obtain the expansion (19.17). The possibility of differentiating the expansions (19.16) and (19.17) fol lows from the fact that under our assumptions the integral (19.1) can be differentiated with respect to >. to yield a new integral satisfying the hy potheses of the theorem. We write out the expansions (19.16) and (19.17) for it , and we can verify immediately that these expansions really are the same as those obtained by formal differentiation of the expansions (19.16) and ( 19.17) for the original integrals. We now take up the formulas for the coefficients a k and c k . By Watson ' s lemma a k = k !� �:f (O) r � , where if> (y) = (f o
( )
S(
d k iJ>
d k k+ 1 dy k (0) = ( -m) ( h (x, a) dx ) ( f(x) h(x, a) ) l x =a , 1 where h (x, a) = ( S(a) - S(x)) - � /S'(x). Formulas for the coefficients C k can be obtained similarly by applying Watson ' s lemma to the integral ( 19.21). Setting '1/J(y) = f (
f
=
asymptotic expansion can be rewritten as
c / '1/J( y) e - >.y 2"' dy 0
�
1 � 'I/J ( 2k) (O) r ( 2k + 1 ) >. _ 2�;:; 1 _ 2m 2m � k =O (2k) !
·
19.2 The Asymptotics of Integrals (Laplace's Method)
627
It remains only to note that 1/1 ( 2 k ) (O) = 2tJ> ( 2 k ) (O), where tl>( y ) = f (
( ! o
n:�
C xo) (
and
(n) ( ! o
n!
..I..
a
•
It now remains only to substitute these expressions into the integrals
( 19.20 ) and 19.21 ) respectively and use formula (19.13 ) from Watson ' s
lemma. D
(
n
Remark 2. We again get formula ( 19. 2') from formula (19. 18) when = and m = 1 . Similarly, when = 0 and m = 1 formula (19. 19) yields ( 19.3' ) again. Finally, Eq. ( 19.4' ) comes from Eq. (19. 18) with = 0 and m = 2 . All of this, of course, assumes the hypotheses of Theorem 2.
n
0
n
Remark 3. Theorem 2 applies to the case where the function S(x) has a unique maximum on the interval I = [a, b] . If there are several such points x� , . . . , Xn , the integral (19. 1 ) is partitioned into a sum of such integrals, each of whose asymptotics is described by Theorem 2. That is, in this case the n asymptotic behavior is obtained as the sum L: F(>., Xj ) of the contributions
j= l
of these maximum points. It is easy to see that when this happens, some or even all of the terms may cancel one another.
Example
11.
If s E
c C = l (IR, IR) and S(x) --+ - oo as X --+ oo, then
F(>.)
=
00
J S'(x) e>-.S (x) dx
-
oo
=
0
for
)..
>0.
Hence, in this case such an interference of the contributions must necessarily occur. From the formal point of view this example may seem unconvincing,
628
19 Asymptotic Expansions
since previously we had been considering the case of a finite interval of in tegration. However, those doubts are removed by the following important remark.
Remark 4. To simplify what were already very cumbersome statements in Theorems 1 and 2 we assumed that the interval of integration I was finite and that the integral (19. 1 ) was a proper integral. In fact , however, if the inequality sup S(x) < S(x 0 ) holds outside an interval U(x0 ) of the maxiI\ U (xo ) mum point x 0 E I, then Lemma 1 enables us to conclude that the integrals over intervals strictly outside of U ( x 0 ) are exponentially small in comparison with e >- S (xo ) as A -+ + oo (naturally, under the assumption that the integral ( 19. 1 ) converges absolutely for at least one value A = A 0 ) . Thus both Theorem 1 and Theorem 2 are also applicable to improper integrals if the conditions just mentioned are met. Remark 5. Due to their cumbersome nature, the formulas for the coefficients obtained in Theorem 2 can normally be used only for obtaining the first few terms of the asymptotics needed in specific computations. It is extremely rare that one can obtain the general form of the asymptotic expansion of even a simpler function than appears in Theorem 2 from these formulas for the coefficients a k and C k . Nevertheless, such situations do arise. To clarify the formulas themselves, let us consider the following examples. Example
12.
The asymptotic behavior of the function Erf ( x )
as
+oo
=
I X
- 2 e u
du
x -+ + oo is easy to obtain through integration by parts: X
X
from which, after obvious estimates, it follows that Erf ( x )
c::::
e- x 2 00 ( - 1 ) k (2k - 1 ) ! !
-- �
2x k=O L...J
2k
x -2 k as x -+ + oo .
Let us now obtain this expansion from Theorem 2. By the change of variable u = xt we arrive at the representation +oo
Erf (x) = x
I 1
2 2 e- x t dt .
( 19.22)
19.2 The Asymptotics of Integrals (Laplace's Method)
629
Setting >. = x 2 here and denoting the variable of integration, as in Theorem 2, by x, we reduce the problem to finding the asymptotic behavior of the integral
F(>.)
=
j e - Ax2 dx , 00
(19.23)
1
since Erf (x) = xF(x 2 ). When Remark 4 is taken into account, the integral (19.23) satisfies the hypotheses of Theorem 2: S(x) = -x 2 , S'(x) = - x < 0 for 1 � x < + oo ,
-2, 8(1) Thus, xo
8'(1)
=
=
2
- 1. 1, m 1, f(x) 1, h(x, ) -�x , h(x, ) d� -�x d� . Hence, ( _�x d� ) 0 ( - 2� ) - 2� = ( - ! ) x - 1 , .! 2 1 _£_ 1 ...!.. _ ...!.. _.!!_ ( - ...!.. (_ - 2x d x ) ( - 2x ) - 2x dx 2x ) - ( - 2 ) (-1)x - 3 ' ( _�x d� ) 2 ( - 2� ) ( - 2� d� ) 1 ( (- ! ) 2 (-1)x - 3 ) ( - ! ) 3 ( - 1)( - 3)x - 5 , = a =
=
a
=
=
=
a
=
-
=
=
=
_ ( 2k - 1 ) !! x -( 2k + 1 ) · ( _-12x_ A_ dx ) k ( ...!.2x . ) - � Setting x 1, we find that k 1 ak ( - 1)k ! + r ( k + 1 ) ( ( 2k2 k-+ 11) !! ) (- 1 ) k (2k2 k-+ 11) !! . Now writing out the asymptotic expansion (19.16) for the integral (19.23) taking account of the relations Erf (x) = xF(x 2 ), we obtain the expansion (19.22) for the function Erf (x) as x -+ + oo . _
=
Example
_
=
1 3.
_
=
In Example 7, starting from the representation
+ oo 1 A+ F(>. + 1) .). J e - A ( x - Jn x ) dx ' 0 =
(19. 24 )
we obtained the principal term of the asymptotics of the function r ( >. + 1) as >. -+ +oo . Let us now sharpen the formula obtained earlier, using Theorem 2b) . To simplify the notation a bit, let us replace x by x + 1 in the integral (19. 24 ). We then find that
+ oo 1 A A+ .). e J e A ( Jn ( l + x )- x ) d x F(>. + 1) -1 =
19 Asymptotic Expansions
630
and the question reduces to studying the asymptotics of the integral +oo
F(>.) = J e >. (in(l + x) -x) dx
( 19.25)
-1 as ).. +oo. Here S(x) = ln ( 1 + x) - x, S'(x) = 1 �x - 1 , S'(O) = 0, that is, X o = 0, S"(x) = - 1 ;x )2 , S"(O) = - 1 -/=- 0. That is, taking account of ( Remark 4, we see that the hypotheses b ) of Theorem 2 are satisfied, where we must also set f(x) 1 and m = 1 , since S"(O) -/=- 0. In this case the function h(x, x0) = h (x) has the following form: 1+X - ln ( 1 + x)) 1 2 . h(x) = - --(x X ---*
=
I
If we wish to find the first two terms of the asymptotics, we need to compute the following at x = 0:
(h (x) dd) 0 (h (x) ) = h(x) , (h(x) dxd ) 1 (h(x)) = h(x) ddxh (x) , (h(x) dxd ) 2 (h(x)) = (h (x) dxd ) (h (x) ddxh (x) ) = dh d2 h = h (x) [ ( ) 2 (x) + h(x) 2 (x)] . dx dx
This computation, as one can see, is easily done if we find the values h(O), h'(O), h"(O), which in turn can be obtained from the Taylor expansion of h (x), x ;:::: 0 in a neighborhood of 0: h (x) = - 1 +X x [x - (x - �2 x2 + �3 x3 - �4 x2 + O(x 5 ))] 1 1 2 = 1+x � 2 � 3 � 4 = _ [ x - 3 x + 4 x + O (x5 )] 1 / 2 = X 2 1+x 1 = - J2 [1 - � x + � x 2 O(x 3 )] / 2 = 3 4 1+x = - J2 (1 - � x + 2._ x 2 + O (x 3 ) ) = 3 36 J2 5 1 = - J2 - 3 x + J2 x 2 + 0 ( x 3 ) . 36 Thus, h(O) = - �, h' (O) = - :{f , h "(O) = 1 � , 8 +
(h (x) dxd ) o (h (x) ) I x =O = - J21 ,
631
19.2 The Asymptotics of Integrals (Laplace's Method) 1 ( h (x) dd 1 ( h (x)) l x =O = - 3 , ( h (x) d ) 2 ( h (x)) l x =O = ' 12
�
)
�(
co = - 2r( ) - �)
(
=
�
v'2ii ,
22 3 1 1 1 1 ..tii r . ) r = = 4 ( ) c1 = -2 ( ) · 2! 2 2 2 12 J2 12 1 2 J2 Hence, as A -+ oo,
that is, as
A -+ +oo, (19.26)
It is useful to keep in mind that the asymptotic expansions (19. 16) and ( 19. 17) can also be found by following the proof of Theorem 2 without invok ing the expressions for the coefficients shown in the statement of Theorem 2. As an example, we once again obtain the asymptotics of the integral (19.25) , but in a slightly different way. Using the localization principle and making a change of variable x = r.p(y) in a neighborhood of zero such that 0 = r.p(O), S ( r.p( y)) = ln( 1 + r.p(y)) - r.p(y) = - y 2 , we reduce the problem to studying the asymptotics of the integral E:
j
- E:
r.p'( y) e - >. y 2 dy =
E:
j '¢(y) e - >.y2 dy , 0
where '1/J(y) = r.p' (y) + r.p' ( -y) . The asymptotic expansion of this last integral can be obtained from Watson ' s lemma
( )
E:
oo 'ljJ( k l (O) k + 1 - k+l / A ( ) 2 as A -+ + oo ' r -'1/J(y) e - >. y 2 dy � -21 '"' k! 2 � k =O 0 which by the relations '¢ ( 2 k+l ) (0) = 0, 'ljJ( 2 k ) (0) = 2 r.p ( 2 k+l ) (0) yields the
J
asymptotic series
( )
00 k+l '"' r.p ( 2 ) (0) r k + -1 A - ( k+l/ 2) 2 =O (2k! ) k�
()
00 r.p ( 2 k+l ) (0) A- k = A - 1 1 2 r -21 '"' . � k!22 k k=O
632
19 Asymptotic Expansions
Thus for the integral (19.25) we obtain the following asymptotic expansion
00 'P ( 2k + l ) (0) .x - k (19.27) F(.A) �- .x - 1 / 2 Y11f" "' ' 6 k =O k!2 2k where x = cp(y) is a smooth function such that x - ln ( 1 + x) = y 2 in a neighborhood of zero ( for both x and y). If we wish to know the first two terms of the asymptotics, we must put the specific values cp ' (0) and cp ( 3 ) (0) into formula (19.27).
It may be of some use to illustrate the following device for computing these values, which can be used generally to obtain the Taylor expansion of an inverse function from the expansion of the direct function. Assuming x > 0 and y > 0, from the relation
x - ln(1 + x) = y2
we obtain successively
But x "' V'i,y as y ---+ 0 (x ---+ 0) , and therefore, using the representation of x already found, one can continue this computation and find that as y ---+ 0 V2 y ( ..;2y + 3 V2 yx + O(y3 ) ) - V2 y(..;2y) 2 + O(y4 ) = x = V'i,y + 3 12 V2 Y3 + O(y4 ) = = .t.y + 32 y 2 + g2 Y2 X - (j 3 = .,f2y + �3 y 2 + �9 y 2 ( .,f2y) V2 6 y + O(y4 ) = V2 y3 + O(y4 ) . = V'i,y + 32 y 2 + 18 Thus for the quantities cp'(O) and cpC 3 l (O) of interest to us we find the following values: cp' (0) = V'i, , 'P ( 3 ) (0) = '{; . Substituting them into formula ( 19.27), we find that F(.A) = .x - I / 2 v'27f(1 + 112 ..\ - l + O(.A - 2 ) ) as .A ---t + oo , from which we again obtain formula (19.26). r.; y
_
19.2 The Asymptotics of Integrals (Laplace's Method)
633
In conclusion we shall make two more remarks on the problems discussed in this section.
Remark 6. (Laplace ' s method in the multidimensional case) . We note that Laplace ' s method can also be successfully applied in studying the asymptotics of multiple Laplace integrals F( >.. ) =
J f(x)e>-.S (x) dx ,
X
in which x E lR n , X is a domain in JR n , and f and S are real-valued functions in X. Lemma 1 on the exponential estimate holds for such integrals, and by this lemma the study of the asymptotics of such an integral reduces to studying the asymptotics of a part of it
J
f(x)e>-. S (x) dx ,
U (x o ) taken over a neighborhood of a maximum point x 0 of the function S(x) . If this is a nondegenerate maximum, that is, S"(x 0 ) 0, then by Morse ' s lemma (see Sect. 8.6 of Part 1) there exists a change of variable x =
=f.
j (f o
which in the case of smooth functions f and S can be studied by applying Fubini ' s theorem and using Watson ' s lemma proved above (see Problems 8-1 1 in this connection) .
Remark 7. (The stationary phase method) . In a wider interpretation, Laplace ' s method, as we have already noted, consists of the following: 1° a certain localization principle (Lemma 1 on the exponential estimate) , 2 ° a method of locally reducing an integral to canonical form (Morse ' s lemma) , and 3 ° a description of the asymptotics of canonical integrals (Watson ' s lemma) . We have met the idea of localization previously in our study of approxi mate identities, and also in studying Fourier series and the Fourier transform (the Riemann-Lebesgue lemma, smoothness of a function and the rate at which its Fourier transform decreases, convergence of Fourier series and in tegrals) .
634
19 Asymptotic Expansions Integrals of the form F(>.)
=
J f(x) ei>.S (x) dx ,
X
where x E JR. n , called Fourier integrals, occupy an important place in mathe matics and its applications. A Fourier integral differs from a Laplace integral only in the modest factor i in the exponent. This leads, however, to the relation l ei>.S ( x ) I = 1 when >. and S(x) are real, and hence the idea of a domi nant maximum is not applicable to the study of the asymptotics of a Fourier integral. Let X = [a, b] C JR.\ f E Cb oo ) ( [a, b] , JR.) , ( that is, f is of compact support on [a, b] ) , S E C ( oo l ( [a, b] , JR) and S'(x) =/=- 0 on [a, b] . Integrating by parts and using the Riemann-Lebesgue lemma ( see Prob lem 12) , we find that b
J a
f(x) ei>.S ( x ) dx = _.!._ i>.
b
J Sf'(x)(x) dei>.S (x) = a
= _ ;_ J � (L ) (x)e i>.S ( x ) dx = b
IA
= �A
b
J a
a
dx
ft (x)e i >.S ( x ) dx
S'
=
.
..
=� >, n
b
J fn (x) ei>.S(x ) dx = a
= o(>. - n ) as >. -t oo .
Thus if S'(O) =/=- 0 on the closed interval [a, b] , then because of the con stantly increasing frequencies of oscillation of the function ei>.S ( x ) as >. --+ oo, the Fourier integral over the closed interval [a, b] turns out to be a quantity of type 0 ( >. - oo ) . The function S(x) in the Fourier integral is called the phase function. Thus the Fourier integral has its own localization principle called the stationary phase principle. According to this principle, the asymptotic behavior of the Fourier integral as >. --+ oo ( when f E Cb 00 ) ) is the same as the asymptotics of the Fourier integral taken over a neighborhood U(x 0 ) of a stationary point x0 of the phase function ( that is, a point x 0 at which S'(x0 ) 0) up to a quantity 0 ( >. - oo ) . After this, by a change of variable the question reduces to the canonical integral =
E ( >. )
=
E
J J(x) ei>.xa dx 0
19.2 The Asymptotics of Integrals ( Laplace's Method )
635
whose asymptotic behavior is described by a special lemma of Erdelyi, which plays the same role for the Fourier integral that Watson ' s lemma plays for the Laplace integral. This scheme for investigating the asymptotics of a Fourier integral is called the stationary phase method. The nature of the localization principle in the stationary phase method is completely different from its nature in the case of the Laplace integral, but the general scheme of Laplace ' s method, as one can see, remains applicable even here. Certain details relating to the stationary phase method will be found in Problems 12-17. 19.2.6 Problems and Exercises
Laplace 's method in the one-dimensional case. 1 . a) For a > 0 the function h ( x ) = e - >. x" attains its maximum when x = 0. Here h ( x ) is a quantity of order 1 in a 8-neighborhood of x = 0 of size 8 = 0(>. - l / " ) .
Using Lemma 1 , show that i f 0 < 8 < 1 , then the integral
W (>.)
=
a
J
x(3 - I f ( x ) e- >. x" dx ,
c( >. ,o)
<> 6-1
where c(.>., 8 ) = >. has order O ( e - A >. 6 ) as >. -+ + oo , A being a positive constant. b ) Prove that if the function f is continuous at x = 0, then
W (>.) = a - 1 F(/3 / a) [f (O) + o(1)] >. - (3 /a as >. -+ + oo .
c ) In Theorem 1 a ) , the hypothesis f ( x ) = f ( xo ) + O ( x - xo ) can be weakened and replaced by the condition that f be continuous at xo . Show that when this is done the same principal term of the asymptotics is obtained, but in general not Eq. ( 1 9 . 2 ) itself, in which O ( x - xo ) is now replaced by o( 1). 2.
'
a) The Bernoulli numbers B2 k are defined by the relations
lt l < 2 7r . It is known that
Show that
( r'r )
( r') r
(x)
=
ln x +
1-) t !"" ( .!.t - 1 - e- t e d t . - x
0
1
- ( x ) � ln x - - 2x
oo B2 k x -2 k L k= 2k O
as
x
-+ + oo .
19 Asymptotic Expansions
636
b) Prove that as x --+ +oo
This asymptotic expansion is called Stirling 's series. c) Using Stirling's series, obtain the first two terms of the asymptotics of r(x+1) as x --+ +oo and compare your result with what was obtained in Example 13. d) Following the method of Example 13 and independently of it using Stirling's series, show that
r(x + 1 )
=
�(; r ( 1 + 1 �x + 28�x 2 + o(:3))
(
)
(
as x --+ +oo .
)
a) Let f E c [o, aJ, � , S E C ( ll [o, aJ, � , S(x) > 0 on [O , a] , and suppose S(x) attains its maximum at x = 0, with S' (O) =I 0. Show that if f(O) =I 0, then
3.
a
I (A) : = I f(x)S >.. (x) dx rv 0
)..�\��) s>-- +1 (0) as )..
--+
+oo .
b) Obtain the asymptotic expansion
00 2:::a k ).. - ( " +1 ) as ).. k =O if it is known in addition that J, S c< 00 ) ( [0, a], �) . I (A) s>-- + 1 (0) �
--+
+oo ,
E
4.
a) Show that
7r / 2
I sinn 0
t
dt =
�( 1 + ( - )) as O
n
1
n --+
+oo .
b) Express this integral in terms of Eulerian integrals and show that for n E N "t equa ls ( 2(n2 n-)1! )! ! ! · 27r .
1
.
-
)
2 !! 1 ( (2(2n) c) Obtam Walhs. , formula 1r - nhm . _ n . . n 1 ) " --+ oo d) Find the second term in the asymptotic expansion of the original integral as
n --+
_
·
+oo.
a) Show that J ( 1 - x 2 ) n dx rv � as n --+ +oo. 1
5.
-1
b) Find the next term in the asymptotics of this integral. 6. Show that if a
>
0, then as x --+ +oo
l
+ oo
0
t
( )
a ..!. -at t dt rv vfhx _l_ 2 ;,:;- a exp -; x a x
19.2 The Asymptotics of Integrals ( Laplace's Method )
637
a) Find the principal term of the asymptotics of the integral + oo (1 + t f e - n t dt as n -+ + oo .
7.
I 0
b ) Using this result and the identity k!n - k
=
+ oo n t k J e - t dt, show that 0
Laplace 's method in the multidimensional case. The exponential estimate lemma. Let M = sup S(x) , and suppose that for some x ED .X = .Xo the integral 8.
F(.X)
=
I f(x)e>.S( x ) dx
D c JR n
converges absolutely. Show that it then conveges absolutely for .X ?: .Xo and
lf(.X) I
� I l f (x) e>. s( x ) l dx � Ae>.M
(.X ?: .Xo) ,
D
where A is a positive constant.
9. Morse 's lemma. Let xo be a nondegenerate critical point of the function X E IR. n , defined and belonging to class in a neighborhood of xo . Then
S(x) , there exist neighborhoods U and V of x = xo and y = 0 and a diffeomorphism
c
s (
=
:
S(xo) + � t vi ( Yi ) 2
,
j=l
where det
•
.
Asymptotics of a canonical integral. a) Let t = (h , . . . , t n ) , V = { t E IR. n l l t i l � o, j = 1 , 2, . . . , n } , and a E c
where t'
=
(t 2 , . . . , t n ) and
!11
>
0. Show that
.X -+ + oo . This expansion is uniform in t' E V' = c
and a k E
I; a k (t')_x- ( k + 2 ) as k =O {t' E IR. n- l lt i l � o, j = 2, . . . , n } Ft (.X, t ' )
�
l
00
1
638
19 Asymptotic Expansions
-
b) Multiplying F1 (>.. , t') by e 9 t � and justifying the termwise integration of the corresponding asymptotic expansion, obtain the asymptotic expansion of the function
F2 ( >.. , t " )
=
8 J- 8 F1 (>.. , t')e -
� t2 2 2
d t 2 as >.. --+ +oo ,
where t " = (h , . . . , t n ) , v2 > 0. c) Prove that for the function
A ( >.. ) where Vj
>
=
8 8 !- 8 -8! a(t 1 , . . . , tn ) e ·
=
·
-�
f: Vj t�
J='
dt1 . . . t n ,
0, j = 1 , . . . , n , the following asymptotic expansion holds:
A ( >.. ) where a o
·
�
>.. - n /2 L a k >.. - k as >.. --+ +oo , k=O
V v;2.� ):n a(O) .
1 1 . The asymptotics of the Laplace integral in the multidimensional case. a) Let D be a closed bounded domain in ne , J, S E C(D, JR.), and suppose max S(x) xED i s attained only at one interior point x 0 of D. Let f and S b e c C oo ) i n some neighborhood of xo and det S " (xo) =P 0. Prove that if the integral ( * ) converges absolutely for some value >.. = >.. o , then
F ( >.. )
�
e.\ S(xo ) >.. - n /2 L a k >.. - k as >.. --+ +oo ' k=O
and this expansion can be differentiated with respect to >.. any number of times, and its principal term has the form
F ( >.. ) = e .\ S(xo ) >.. -n /2
(27r) n ( (xo) + O ( >.. - 1 )) I det S"(xo) l f
·
b) Verify that if instead of the relation J, S E c C oo ) all we know is that f E C and s E c C 3 ) in a neighborhood of Xo, then the principal term of the asymptotics as >.. --+ +oo remains the same with 0 ( >.. - 1 ) replaced by o(1) as >.. --+ +oo.
The stationary phase method in the one-dimensional case. 1 2 . Generalization of the Riemann-Lebesgue lemma. a) Prove the following gener alization of the Riemann-Lebesgue lemma. Let S E C (l ) ( la, bJ , lR ) and S'(x) # 0 on [a, b] = : I . Then for every function f that is absolutely integrable on the interval I the following relation holds:
F ( >.. )
=
b
J f(x ei.\ S (x) dx --+ 0 as >.. --+ oo , )
>.. E lR .
19.2 The Asymptotics of Integrals (Laplace's Method) b) Verify that if it is known in addition that f c
=
)
(
k � ( i >.) - ( k + 1 ) -1_ j_ f(x) L....S ' (x) dx S ' (x) k=O
lb
c< n +1 l (J, JR) and S
E
+
639 E
o(>. - ( n + 1 ) ) .
a
c) Write out the principal term of the asymptotics of the function F(>.) as >. -+ oo, >. E JR. E c< 2 l [c, b] , but d) Show that if S E c<=l (I, JR) and t E c< 2 l [a, c] , t [a , � �.� f (j_ c< 2 l [a, b] , then the function F(>.) is not necessarily o (>. - 1 ) as >. -+ oo . e) Prove that when j, s E c<=J (I, JR) , the function F(>.) admits an asymptotic series expansion as >. -+ oo . f) Find asymptotic expansions as >. -+ oo , >. E JR, for the following integrals: e J(1 + x) - " '1/!i (x, >.) dx, j = 1 , 2, 3, if a > 0 and '1/! 1 = e L\ x , '1/!2 = cos >.x, and 0 '1/!3 = sin >.x.
l
l
13. The localization principle. a) Let I = [a, b] c JR, f c<=l (I, JR) , and S' (x) i- 0 on I. Prove that in this case
F(>.)
:=
b
I f(x)ei>. S ( x) dx a
=
o l >- 1 - oo
(
)
E
ca =l (I, JR) , S
as >. -+ 00
E
.
b) Suppose f E ca =l (J, JR) , s E c<=l (J, JR), and X 1 , , X m is a finite set of stationary points of S(x) outside which S' (x) i- 0 on I. We denote by F(>., xi ) the integral of the function f(x)e i >. S ( x ) over a neighborhood U(xi ) of the point Xj , j = 1 , . . . , m, not containing any other critical points in its closure. Prove that . • •
F(>.)
=
f F(>., Xj ) o ( l >- 1 -oo ) as >. -+ +
j =1
00 .
14. Asymptotics of the Fourier integml in the one-dimensional case. a) In a reason ably general situation finding the asymptotics of a one-dimensional Fourier integral can be reduced through the localization principle to describing the asymptotics of the canonical integral a E(>.) = x{J - 1 f(x)e i >. x"' dx ,
I 0
for which the following lemma holds. E r d e l y i's l e m m a. Let a :::0: 1 , (J k = 0, 1, 2, . . . . Then E(>.)
c:::
00
>
0, f
E
(
c<=l [o , aJ , lR and j< k l (a)
L a k >. - .!!:.±.1! "' as >. -+ + oo .
k=O
)
=
0,
19 Asymptotic Expansions
640
)
(
where
k + f] i � -"¥- t < k l ( o ) ak = ! r e , a a k! and this expansion can be differentiated any number of times with respect to A. Using Erdelyi's lemma, prove the following assertion. Let I = [ x o - t5, xo + .5] be a finite closed interval, let f, S E c<=l (J, ffi.) with f E Co (J, ffi.) , and let S have a unique stationary point xo on I, where S' (xo) = 0 but S" (xo) /=- 0. Then as A --+ + oo x o+J , -k F- ( "'' x 0 ) .J( x ) ei >. S(x) dx - ei �sg n S 11 (x o ) ei >. S ( x oh" - � � L a k" k=O xo - 8 and the principal term of the asymptotics has the form
I
·-
F(A, xo)
�
2 71'
=
A I S " (xo ) l
e i( � sg n S " (x o ) +>- S (x o )) ( ! ( xo ) + O(A - 1 ) )
·
b) Consider the Bessel function of integer order n � 0: 7r
Jn (x) Show that
=
� I cos(x sin ip -
n
ip) dip .
0
f2[ (
ln (x) = y � cos x -
n1r
1r
2 - 4
)
+
]
O(x - 1 ) as x --+ +oo
.
The stationary phase method in the multidimensional case 1 5 . The localization principle. a) Prove the following assertion. Let D be a domain in ffi.n , f E c6 = l (D, ffi.) , S E c<=l (D, ffi.) , grad S(x) /=- 0 for x E supp f, and
F(A) :=
I f ( x )ei>. S( x) dx .
D
Then for every k E N there exists a positive constant A(k) such that the estimate IF(A) I ::; A(k)A - k holds for A � 1 , and hence F(A) = O(A - = ) as A --+ + oo . b) Suppose as before that f E c6 =l (D, ffi.) , S E c<=l (D, ffi.) , but S has a finite number of critical points x 1 , . . . , Xm outside which grad S(x) /=- 0. We denote by F(A, XJ ) the integral of the function f(x) e i >. S (x) over a neighborhood U( x J ) of Xj whose closure contains no critical points except Xj . Prove that F(A)
=
L F(A, XJ ) j=l
+
O(A -= )
as
A --+ + oo
.
19.2 The Asymptotics of Integrals (Laplace's Method)
641
16. Reduction to a canonical integral. If xo is a nondegenerate cricial point of the function S E cC=) (D, JR) defined in a domain D c JRn , then by Morse's lemma (see Problem 9) there exists a local change of variable x = tp(y) such that Xo = tp(O) , s ( tp(y) ) = S(xo) + ! Ej (yl ) 2 ' where Ej = ± 1 , y = (y 1 ' ' y n ) , and det tp' (y) > j l 0. Using the localization principle (Problem 15) , now show that if f E c6 =) ( D, JR) , S E cC=) (D, lR) , and S has at most a finite number of critical points in D, all nondegenerate, then the study of the asymptotics of the integral ( * * ) reduces to studying the asymptotics of the special integral
�
,
8
• . .
8
I I
-8
. , .
-8
'1/J(y 1 , . . . , y n )e
!f .
z: "J (y1 ) 2 n
j=I
.
dy 1 . . . dy n .
17. Asymptotics of a Fourier integral in the multidimensional case. a) Using Erdelyi's lemma (Problem 14a) ) and the scenario described in Problem 10, prove that if D is a domain in lR n , f, S E cC=) (D, JR) , supp f is a compact subset of D, and Xo is the only critical point of S in D and is nondegenerate, then for the integral ( * * ) the following asymptotic expansion holds as A ---t +oo:
= i \ A) :::' A - n / 2 e i >. S ( x o ) L a k A - k ' k=O which can be differentiated any number of times with respect to A. The main term of the asymptotics has the form F(A)
=
c;)
n/2
[
exp iAS(xo) + i: sgn s " (xo) x
]
X
l det S" (xo) l - 1 1 2 [/(xo) + O(A - 1 )] as A ---t +oo .
Here S" (x) is the symmetric and by hypothesis nonsingular matrix of second derivatives of the function S at xo (the Hessian) , and sgn S" (xo ) is the signature of this matrix (or the quadratic form corresponding to it) , that is, the difference v+ - v_ between the number of positive and negative eigenvalues of the matrix S" (xo ) .
Topics and Q uestions for Midterm Examinat ions
1 . Series and Integrals Depending on a Parameter 1. The Cauchy criterion for convergence of a series. The comparison theorem and the basic sufficient conditions for convergence (majorant , integral, Abelco Dirichlet) . The series ((s) = L n-8 • n= l 2. Uniform convergence of families and series of functions. The Cauchy criterion and the basic sufficient conditions for uniform convergence of a series of functions (M-test , Abel-Dirichlet) . 3 . Sufficient conditions for two limiting passages to commute. Continuity, integration, and differentiation and passage to the limit . 4. The region of convergence and the nature of convergence of a power series. The Cauchy-Hadamard formula. Abel ' s (second) theorem. Taylor ex pansions of the basic elementary functions. Euler ' s formula. Differentiation and integration of a power series.
5. Improper integrals. The Cauchy criterion and the basic sufficient con ditions for convergence (M-test , Abel-Dirichlet) . 6 . Uniform convergence of an improper integral depending on a param eter. The Cauchy criterion and the basic sufficient conditions for uniform convergence (majorant , Abel-Dirichlet) . 7 . Continuity, differentiation, and integration of a proper integral depend ing on a parameter.
8. Continuity, differentiation, and integration of an improper integral de pending on a parameter. The Dirichlet integral. 9. The Eulerian integrals. Domains of definition, differential properties, reduction formulas, various representations, interconnections. The Poisson integral.
10. Approximate identities. The theorem on convergence of the convolu tion. The classical Weierstrass theorem on uniform approximation of a con tinuous function by an algebraic polynomial.
644
Topics and Questions for Midterm Examinations
2 . Problems Recommended as Midterm Questions Problem
1. P is a polynomial. Compute (et 1.: ) P( x).
tAxo is a solution of the Veri f y that the vector-valued function e Cauchy x = Ax, x(O) = xo. (Here x = Ax is a system of equations defined byproblem the matrix A.) Problem 3 . Find up to order o(1/n3) the asymptotics of the positive roots A 1 < A2 < · · · < An < · · · of the equation sinx + 1/x = 0 as n -+ Problem 4. a) Show that ln 2 = 1 - 1/2 + 1/3 - · · ·. How many terms of this series must be taken to determine ln 2 within w - 3? b) Verify that ! ln ��� = t + �t 3 + � t 5 + · · ·. Using this expansion it becomes convenient to compute ln x by setting x = ���. c) Setting t = 1/3 in b), obtain the equality 1 1 1 (1) 1 (1)5 2 ln 2 = 3 + 3 3 3 + 5 3 + . . . How this manywithtermstheofresult this series must one take to find ln 2 within w-3? Com pare of a). This is one of the methods of improving convergence. Problem 5 . Verify that in the sense of Abel summation a) 1 - 1 + 1 · · · = ! · b) 2::: sinktp = ! · ! 'P, 'P � 21rn, n E Z. c) ! + 2::: cosktp = 0, 'P � 27rn, n E Z. Problem 6 . Prove Hadamard ' s lemma: a) If f E C( 1 l (U(xo)), then f(x) f(xo) + cp(x)(x - x0), where 'P E C(U(xo)) and cp(xo) = f'(xa ). b) If f E C(n ) (U(xo)), then f(x) = f(xo) + �1. f'(xo)(x - xo) + · · · + + (n 1 1)! f( n - I )(xo)(x - xa ) n - I + cp(x)(x - x0) n , where 'P E C(U(xo)) and cp(xo) = ,h J Cn l(xo). Problem 2 .
oo .
00
k= l
00
k= l
_
2. Problems Recommended as Midterm Questions
645
c) What do these relations look like in coordinate form, when x = (x 1 , x n ) , that is, when f is a function of n variables? .
.
.
•
Problem 7. a) Verify that the function
Jo (x) =
.!_ 7r
1
J
cos xt
0 v'f=t2
dt
y" �y' y
satisfies Bessel ' s equation + + = 0. b) Tty to solve this equation using power series. c) Find the power-series expansion of the function J0 (x) .
+Joo +Joo
Problem 8. Verify that the following asymptotic expansions hold a)
r (a ' x) : =
b)
Erf (x) : =
as x -+ +oo.
X
X
oo r (a) � r (a oo
t O!. - l e - t dt � e -01. """"'
k= l
k + 1)
1 '- x 2 t2 e - dt � 2 v ne - """"'
x a -k '
1
� r (3/2 k)x 2 k- l
k= l
_
Problem 9. a) Following Euler, obtain the result that the series 1 - 1 !x +
2!x 2 - 3!x 3 + · · · is connected with the function S(x) : =
+Joo 0
e- t dt . 1 + xt
--
b ) Does this series converge? c) Does it give the asymptotic expansion of S(x) as x -+ 0? Problem 10. a) A linear device A whose characteristics are constant over time responds to a signal J(t) in the form of a <>-function by giving out the signal (function) E(t) . What will the response of this device be to an input signal f(t) , -oo < t < +oo? b) Can the input signal f always be recovered uniquely from the trans formed signal f := Af?
646
Topics and Questions for Midterm Examinations
3 . Integral Calculus ( Several Variables) 1. Riemann integral on an n-dimensional interval. Lebesgue criterion for ex istence of the integral. 2 . Darboux criterion for existence of the integral of a real-valued function on an n-dimensional interval. 3. Integral over a set . Jordan measure (content) of a set and its geomet ric meaning. Lebesgue criterion for existence of the integral over a Jordan measurable set . Linearity and additivity of the integral. 4. Estimates of the integral. 5 . Reduction of a multiple integral to an iterated integral. Fubini ' s theo rem and its most important corollaries. 6. Formula for change of variables in a multiple integral. Invariance of measure and the integral. 7. Improper multiple integrals: basic definitions, majorant criterion for convergence, canonical integrals. Computation of the Euler�Poisson integral. 8 . Surfaces of dimension k in JR n and basic methods of defining them. Abstract k-dimen�ional manifolds. Boundary of a k-dimensional manifold as a (k - I )-dimensional manifold without boundary. 9. Orientable and nonorientable manifolds. Methods of defining the ori entation of an abstract manifold and a (hyper )surface in JRn . Orientability of the boundary of an orientable manifold. Orientation in duced on the boundary from the manifold. 10. Tangent vectors and the tangent space to a manifold at a point. Interpretation of a tangent vector as a differential operator. 1 1 . Differential forms in a region D C JR n . Examples: differential of a function, work form, flux form. Coordinate expression of a differential form. Exterior derivative operator. 1 2 . Mapping of objects and the adjoint mapping of function� on the�e ob jects. Transformation of points and vectors of tangent spaces at these points under a smooth mapping. Transfer of functions and differential forms under a smooth mapping. A recipe for carrying out the transfer of forms in coordinate form. 13. Commutation of transfer of differential form� with exterior multipli cation and differentiation. Differential forms on a manifold. Invariance ( un ambiguou� nature) of operations on differential forms. 14. A scheme for computing work and flux. Integral of a k-form over a k-dimensional smooth oriented surface, taking account of orientation. Inde pendence of the integral of the choice of parametrization. General definition
4. Problems Recommended for Studying the Midterm Topics
647
of the integral of a differential k-form over a k-dimensional compact oriented manifold. 15. Green ' s formula on a square, its derivation, interpretation, and ex pression in the language of integrals of the corresponding differential forms. The general Stokes formula. Reduction to a k-dimensional interval and proof for a k-dimensional interval. The classical integral formulas of analysis as particular versions of the general Stokes formula. 16. The volume element on JR. n and on a surface. Dependence of the volume element on orientation. The integral of first kind and its independence of orientation. Area and mass of a material surface as an integral of first kind. Expression of the volume element of a k-dimensional surface S k C JR. n in local parameters and the expression of the volume element of a hypersurface s n - l c JR. n in Cartesian coordinates of the ambient space. 1 7. Basic differential operators of field theory ( grad, curl, div ) and their connection with the exterior derivative operator d in oriented Euclidean space
JR_3 .
18. Expression of work and flux of a field as integrals of first kind. The basic integral formulas of field theory in JR. 3 as the vector expression of the classical integral formulas of analysis. 19. A potential field and its potential. Exact and closed forms. A necessary differential condition for a form to be exact and for a vector field to be a potential field. Its sufficiency in a simply connected domain. Integral criterion for exactness of 1-forms and vector fields.
20. Local exactness of a closed form ( the Poincare lemma ) . Global anal ysis. Homology and cohomology. De Rham ' s theorem ( statement ) . 2 1 . Examples of the application of the Stokes ( Gauss-Ostrogradskii ) for mula: derivation of the basic equations of the mechanics of continuous media. Physical meaning of the gradient, curl, and divergence. 22. Hamilton ' s nabla operator and work with it . The gradient , curl, and divergence in triorthogonal curvilinear coordinates.
4. Problems Recommended for Studying the Midterm Topics The numbers followed by closing parentheses below refer to the topics 1�-22 just listed. The closing parentheses dashes are followed by section numbers ( for example 13.4 means Sect . 4 of Chapt . 13 ) , which in turn are separated by a dash from the numbers of the problems from the section related to the topic from the list above.
648
Topics and Questions for Midterm Examinations
1) 11.1-2,3; 2) 11.1-4; 3) 11.2-1,3,4; 4) 11.3-1,2,3,4; 5) 11.4-6,7 and 13.2-6; 6) 11.5-9 and 12.5-5,6; 7) 11.6-1,5,7; 8) 12.1-2,3 and 12.4-1,4; 9) 12.2-1,2,3,4 and 12.5-11; 10) 15.3-1,2; 11) 12.5-9 and 15.3-3; 12) 15.3-4; 13) 12.5-8,10; 14) 13.1-3,4,5,9; 15) 13.1-1,10,13,14; 16) 12.4-10 and 13.2-5; 17) 14.1-1,2; 18) 14.2-1,2,3,4,8; 19) 14.3-7,13,14; 20 ) 14.311,12; 21) 13.3-1 and 14.1-8; 22) 14.1-4,5,6.
Examinat ion Topics
1. Series and Integrals Depending on a Parameter 1. Cauchy criterion for convergence of a series. Comparison theorem and the basic sufficient conditions for convergence (majorant , integral, Abeloo Dirichlet) . The series ((s) = I: n-8 •
n=l
2 . Uniform convergence of families and series of functions. Cauchy crite rion and the basic sufficient conditions for uniform convergence of a series of functions (M-test , Abel-Dirichlet) . 3 . Sufficient conditions for commutativity of two limiting passages. Con tinuity, integration, and differentiation and passage to the limit . 4. Region of convergence and the nature of convergence of a power series. Cauchy-Hadamard formula. Abel ' s (second) theorem. Taylor expansions of the basic elementary functions. Euler ' s formula. Differentiation and integra tion of a power series. 5. Improper integrals. Cauchy criterion and the basic sufficient conditions for convergence (majorant , Abel-Dirichlet) . 6 . Uniform convergence of an improper integral depending on a parame ter. Cauchy criterion and the basic sufficient conditions for uniform conver gence ( M-test , A bel-Dirichlet) . 7. Continuity, differentiation, and integration of a proper integral depend ing on a parameter. 8. Continuity, differentiation, and integration of an improper integral de pending on a parameter. Dirichlet integral. 9. Eulerian integrals. Domains of definition, differential properties, reduc tion formulas, various representations, interconnections. Poisson integral. 10. Approximate identities. Theorem on convergence of the convolution. Classical Weierstrass theorem on uniform approximation of a continuous function by an algebraic polynomial.
650
Examination Topics
1 1 . Vector spaces with an inner product . Continuity of the inner product and algebraic properties connected with it . Orthogonal and orthonormal sys tems of vectors. Pythagorean theorem. Fourier coefficients and Fourier series. Examples of inner products and orthogonal systems in spaces of functions. 12. Orthogonal complement. Extremal property of Fourier coefficients. Bessel ' s inequality and convergence of the Fourier series. Conditions for com pleteness of an orthonormal system. Method of least squares. 13. Classical (trigonometric) Fourier series in real and complex form. Riemann-Lebesgue lemma. Localization principle and convergence of a Fourier series at a point. Example: expansion of cos( a x) in a Fourier series and the expansion of sin(7rx) /7TX in an infinite product . 14. Smoothness of a function, rate of decrease of its Fourier coefficients, and rate of convergence of its Fourier series. 1 5 . Completeness of the trigonometric system and mean convergence of a trigonometric Fourier series. 16. Fourier transform and the Fourier integral (the inversion formula) . Example: computation of f for f(x) : = exp( -a 2 x 2 ) . 17. Fourier transform and the derivative operator. Smoothness of a func tion and the rate of decrease of its Fourier transform. Parseval ' s equality. The Fourier transform as an isometry of the space of rapidly decreasing functions. 18. Fourier transform and convolution. Solution of the one-dimensional heat equation. 19. Recovery of a transmitted signal from the spectral function of a device and the signal received. Kotel ' nikov ' s formula. 20. Asymptotic sequences and asymptotic series. Example: asymptotic expansion of Ei ( x). Difference between convergent and asymptotic series. Asymptotic Laplace integral (principal term) . Stirling ' s formula.
2. Integral Calculus (Several Variables)
651
2. Integral Calculus ( Several Variables) 1 . Riemann integral on an n-dimensional interval. Lebesgue criterion for ex istence of the integral. 2 . Darboux criterion for the existence of the integral of a real-valued function on an n-dimensional interval. 3. Integral over a set. Jordan measure (content) of a set and its geomet ric meaning. Lebesgue criterion for existence of the integral over a Jordan measurable set . Linearity and additivity of the integral. 4. Estimates of the integral. 5. Reduction of a multiple integral to an iterated integral. Fubini ' s theo rem and its most important corollaries. 6. Formula for change of variables in a multiple integral. Invariance of measure and the integral. 7. Improper multiple integrals: basic definitions, the majorant criterion for convergence, canonical integrals. Computation of the Euler-Poisson integral. 8. Surfaces of dimension k in !R n and the basic methods of defining them. Abstract k-dimensional manifolds. Boundary of a k-dimensional manifold as a (k - I)-dimensional manifold without boundary. 9. Orientable and nonorientable manifolds. Methods of defining the ori entation of an abstract manifold and a (hyper )surface in !Rn . Orientability of the boundary of an orientable manifold. Orientation on the boundary induced from the manifold. 10. Tangent vectors and the tangent space to a manifold at a point . Interpretation of a tangent vector as a differential operator. 1 1 . Differential forms in a region D C !R n . Examples: differential of a function, work form, flux form. Coordinate expression of a differential form. Exterior derivative operator. 12. Mapping of objects and the adjoint mapping of functions on these ob jects. Transformation of points and vectors of tangent spaces at these points under a smooth mapping. Transfer of functions and differential forms under a smooth mapping. A recipe for carrying out the transfer of forms in coordinate form. 13. Commutation of the transfer of differential forms with exterior mul tiplication and differentiation. Differential forms on a manifold. lnvariance (unambiguous nature) of operations on differential forms. 14. A scheme for computing work and flux. Integral of a k-form over a k-dimensional smooth oriented surface. Taking account of orientation. Inde pendence of the integral of the choice of parametrization. General definition
652
Examination Topics
of the integral of a differential k-form over a k-dimensional compact oriented manifold. 1 5 . Green's formula on a square, its derivation, interpretation, and ex pression in the language of integrals of the corresponding differential forms. General Stokes formula. Reduction to a k-dimensional interval and proof for a k-dimensional interval. Classical integral formulas of analysis as particular versions of the general Stokes formula. 1 6 . Volume element on JR n and on a surface. Dependence of volume ele ment on orientation. The integral of first kind and its independence of ori entation. Area and mass of a material surface as an integral of first kind. Expression of volume element of a k-dimensional surface S k c JRn in local parameters and expression of volume element of a hypersurface s n - l c ]Rn in Cartesian coordinates of the ambient space.
17. Basic differential operators of field theory ( grad, curl, div ) and their connection with the exterior derivative operator d in oriented Euclidean space JR3 0
18. Expression of work and flux of a field as integrals of first kind. Basic integral formulas of field theory in JR3 as the vector expression of the classical integral formulas of analysis. 19. A potential field and its potential. Exact and closed forms. A necessary differential condition for a form to be exact and for a vector field to be a potential field. Its sufficiency in a simply connected domain. Integral criterion for exactness of 1-forms and vector fields. 20. Examples of the application of the Stokes ( Gauss-Ostrogradskii ) for mula: derivation of the basic equations of the mechanics of continuous media. Physical meaning of the gradient, curl, and divergence.
References
1 . Classic Works 1 . 1 . Primary Sources
Newton, 1 . : - a. ( 1687): Philosophire Naturalis Principia Mathematica. Jussu Societatis Regire ac typis Josephi Streati, London. English translation from the 3rd edition (1726): University of California Press, Berkeley, CA (1999) . - b. (1967-1981): The Mathematical Papers of Isaac Newton, D. T. Whiteside, ed. , Cambridge University Press. Leibniz, G. W. (1971): Mathematische Schriften. C. I. Gerhardt , ed. , G. Olms, Hildesheim. 1 . 2 . Major Comprehensive Expository Works
Euler, L. - a. (1748) : Introductio in Analysin Infinitorum. M. M. Bousquet, Lausanne. English translation: Springer-Verlag, Berlin - Heidelberg - New York (1988-
1990) .
- b. (1755) : Institutiones Calculi Differentialis. Impensis Academire lmperialis Scientiarum, Petropoli. English translation: Springer, Berlin - Heidelberg New York (2000) . - c. (1768-1770): Institutionum Calculi Integralis. Impensis Academire Impe rialis Scientiarum, Petropoli. Cauchy, A.-L. - a. (1989) : Analyse Algebrique. Jacques Gabay, Sceaux. - b. (1840- 1844) : Lec;ons de Calcul Differential et de Calcul Integral. Bachelier, Paris. 1 . 3 . Classical courses of analysis from the first half of the twentieth century
Courant, R. (1988) : Differential and Integral Calculus. Translated from the Ger man. Vol. 1 reprint of the second edition (1937) . Vol. 2 reprint of the 1936 original. Wiley Classics Library. A Wiley-Intercience Publication. John Wiley & Sons, Inc., New York.
654
References
de la Vallee Poussin, Ch.-J. ( 1954, 1957 ) : Cours d'Analyse Infinitesimale. ( Tome 1 1 1 ed. , Tome 2 9 ed. , revue et augmentee avec la collaboration de Fernand Simonart. ) Librairie universitaire, Lou vain. English translation of an earlier edition, Dover Publications, New York ( 1946 ) . Goursat, E. ( 1992 ) : Cours d'Analyse Mathematiques. ( Vol. 1 reprint of the 4th ed 1924, Vol. 2 reprint of the 4th ed 1925 ) Les Grands Classiques Gauthier-Villars. Jacques Gabay, Sceaux. English translation, Dover Publ. Inc., New York ( 1959 ) .
2 . Textbooks
5
Apostol, T. M. ( 1974 ) : Mathematical Analysis. 2nd ed. World Student Series Edi tion. Addison-Wesley Publishing Co. , Reading, Mass.-London-Don Mills, Ont. Courant, R. , John F. ( 1999 ) : Introduction to Calculus and Analysis. Vol. I. Reprint of the 1989 edition. Classics in Mathematics Springer-Verlag, Berlin. Courant, R., John F. ( 1989 ) : Introduction to Calculus and Analysis. Vol. II. With the assistance of Albert A. Blank and Alan Solomon. Reprint of the 1974 edi tion. Springer-Verlag, New York. Nikolskii, S. M. ( 1990 ) : A Course of Mathematical Analysis. Vols. 1, 2. Nauka, Moscow. English translation of an earlier adition: Mir, Moscow ( 1985 ) . Rudin, W. ( 1976 ) : Principals of Mathematical Analysis. McGraw-Hill, New York. Rudin, W. ( 1987 ) : Real and Complex Analysis. 3rd ed. , McGraw-Hill, New York. Spivak, M. ( 1965 ) : Calculus on Manifolds: a Modern Approach to the Classical Theorems of Advanced Calculus. W. A. Benjamin, New York. Whittaker, E. T., Watson, J. N. ( 1979 ) : A Course of Modern Analysis. AMS Press, New York.
3. C lassroom Materials Biler, P. , Witkowski, A. ( 1990 ) : Problems in Mathematical Analysis. Monographs and Textbooks in Pure and Applied Mathematics, 132. Marcel Dekker, New York. Demidovich, B. P. ( 1990 ) : A Collection of Problems and Exercises in Mathematical Analysis. Nauka, Moscow. English translation of an earlier edition, Gordon and Breach, New York - London - Paris ( 1969 ) . Gelbaum, B. ( 1982 ) : Problems in Analysis. Problem Books in Mathemetics. Springer-Verlag, New York-Berlin, 1982. Gelbaum, B., Olmsted, J. ( 1964 ) : Counterexamples in Analysis. Holden-Day, San Francisco. Makarov, B. M., Goluzina, M. G., Lodkin, A. A., Podkorytov, A. N. ( 1992 ) : Selected Problems in Real Analysis. Nauka, Moscow. English translation: Translations of Mathematical Monographs 107, American Mathematical Society, Providence ( 1992 ) . 5
For the convenience of the Western reader the bibliography of the English edition has substantially been revised.
4. Further Reading
655
P6lya, G., Szego, G. ( 1970/71): Aufgaben und Lehrsatze aus der Analysis. Springer Verlag, Berlin - Heidelberg - New York. English translation: Springer-Verlag, Berlin - Heidelberg - New York ( 1972-76).
4. Further Reading Arnol'd, V. I. - a. (1989a) : Huygens and Barrow, Newton and Hooke: Pioneers in Mathemat ical Analysis and Catastrophe Theory, from Evolvents to Quasicrystals. Nauka, Moscow. English translation: Birkhauser, Boston (1990) . - b. (1989b) : Mathematical Methods of Classical Mechanics. Nauka, Moscow. English translation: Springer-Verlag, Berlin - Heidelberg - New York (1997) . Avez, A. ( 1986) : Differential Calculus. Translated from the French. A Wiley Intercience Publication. John Wiley & Sons, Ltd., Chichister. Bourbaki, N. ( 1969): Elements d'Histoire des Mathematiques. 2e edition revue, cor rigee, augmentee. Hermann, Paris. English translation: Springer-Verlag, Berlin - Heidelberg - New York (1994) . Cartan, H. ( 1977) : Cours de Calcul Differentiel. Hermann, Paris. English translation of an earlier edition: Differential Calculus. Exercised by C.Buttin, F.Riedeau and J.L.Verley. Houghton Miffiin Co. , Boston, Mass. (1971 ) . de Bruijn, N. G. (1958) : Asymptotic methods i n analysis. North-Holland, Amster dam. Dieudonne, J. (1969) : Foundations of Modern Analysis. Enlarged and corrected printing. Academic Press, New York. Dubrovin, B. A., Novikov, S. P. , Fomenko, A. T. (1986) : Modern Geometry - Meth ods and Applications. Nauka, Moscow. English translation: Springer-Verlag, Berlin - Heidelberg - New York (1992) . Einstein, A. (1982) : Ideas and Opinions. Three Rivers Press, New York. Contains translations of the papers "Principles of Research" (original German title: "Mo tive des Forschens" ) , pp. 224-227, and "Physics and Reality," pp. 290-323. Evgrafov, M. A. (1979) : Asymptotic Estimates and Entire Functions. 3rd ed. Nauka, Moscow. English transalation from the first Russian edition: Gordon & Breach, New York ( 1961) . Fedoryuk, M . V . (1977) The Saddle-Point Method. Nauka, Moscow (Russian) . Feynman, R. , Leighton, R. , Sands, M. (1963-1965): The Feynman Lectures on Physics, Vol. 1: Modern Natural Science. The Laws of Mechanics. Addison Wesley, Reading, Massachusetts. Gel'fand, I. M. ( 1998) : Lectures on Linear Algebra. Dobrosvet, Moscow. English translation of an earlier edition: Dover, New York (1989) . Halmos, P. ( 1974) : Finite-dimensional Vector Spaces. Springer-Verlag, Berlin - Hei delberg -- New York. Jost, J. (2003) : Postmodern Analysis. 2nd ed. Universitext. Berlin: Springer. Klein, F. ( 1926) : Vorlesungen iiber die Entwicklung der Mathematik im 19 Jahrhun dert. Springer-Verlag, Berlin Kolmogorov, A. N., Fomin, S. V. (1989) : Elements of the Theory of Functions and Functional Analysis. 6th ed. , revised, Nauka, Moscow. English translation of an earlier edition: Graylock Press, Rochester, New York (1957) .
656
References
Kostrikin, A. l., Manin, Yu. l. ( 1986 ) : Linear Algebra and Geometry. Nauka, Moscow. English translation: Gordon and Breach, New York ( 1989 ) . Landau, L. D., Lifshits, E. M. ( 1988 ) : Field Theory. 7th ed. , revised, Nauka, Moscow. English translation of an earlier edition: Pergamon Press, Oxford-New York ( 1975 ) . Lax, P. D., Burstein S. Z., Lax A. ( 1972 ) : Calculus with Applications and Comput ing. Vol. I. Notes based on a course given at New York University. Courant Institute of Mathematical Sciences, New York University, New York. Manin, Yu. l. ( 1979 ) : Mathematics and Physics. Znanie, Moscow. English translation: Birkhauser, Boston ( 1979 ) . Milnor, J. ( 1963 ) : Morse Theory. Princeton University Press. Narasimhan, R. ( 1968 ) : Analysis on Real and Complex Manifolds. Masson, Paris. Olver, F. W. J. ( 1997 ) : Asymptotics and Special Functions. Reprint. AKP Classics. Wellesley, MA: A K Peters. Pham, F. ( 1992 ) : Geometrie et calcul differentiel sur les varietes. [Course, studies and exercises for Masters in mathematics] Inter Editions, Paris. Poincare, H. ( 1982 ) : The Foundations of Science. Authorized translation by George Bruce Halstead, and an introduction by Josiah Royce, preface to the UPA edition by L. Pearce Williams. University Press of America, Washington, DC. Pontryagin, L. S. ( 1974 ) : Ordinary Differential Equations. Nauka, Moscow. En glish translation of an earlier edition: Addison-Wesley, Reading, Massachusetts ( 1962 ) . Shilov, G. E. - a. ( 1996 ) : Elementary Real and Complex Analysis. Revised English edition translated from Russian and edited by Richard A. Silverman. Corrected reprint of the 1973 English edition. Dover Publications Inc., Mineola, NY. - b. ( 1969 ) : Mathematical Analysis. Functions of one variable. Pts. 1 and 2. Moscow: Nauka, ( Russian ) . - c. ( 1972 ) : Mathematical Analysis. Functions of several variables ( 3 pts. ) . Moscow: N auka, ( Russian ) . Schwartz, L. ( 1998 ) : Analyse. Hermann, Paris. Weyl, H. ( 1926 ) : Die heutige Erkenntnislage in der Mathematik. Weltkreis-Verlag, Erlangen. Russian translations of eighteen of Weyl's essays, with an essay on Weyl: Mathematical Thought. Nauka, Moscow ( 1989 ) . Zel'dovich, Ya. B., Myshkis, A. D. ( 1967 ) : Elements of Applied Mathematics. Nauka, Moscow. English translation: Mir, Moscow ( 1976 ) .
Index of B asic Not at ion
LOGICAL SYMBOLS ==} - logical consequence (implication) � - Logical equivalence
·-
.-
}
equality by definition; colon on the side of the object defined
SETS
E - closure of the set E 8E - boundary of the set E E E \ 8E - interior of the set E B(x, r) - ball of radius r with center at x S(x, r) - sphere of radius r with center at x 0
:=
SPACES (X, d) - metric space with metric d (X, T ) - topological space with system T of open sets ]Rn (C n ) n-dimensional real (complex) space JR 1 = lR (C 1 = q - set of real (complex) numbers x = ( x 1 , . . . , xn ) - coordinate expression of a point of n-dimensional space C ( X , Y) - set (space) of continuous functions on X with values in Y C[a, b] - abbreviation for C([a, b] , JR) or C([a, b] , C ) C ( k ) (X, Y) - set of mappings from X into Y that are k times continu ously differentiable C ( k ) [a, b] - abbreviation for C ( k ) ([a, b] , JR ) or C ( k ) ([a, b] , C) . Cp[a, b] - space C[a, b] endowed with norm II/Il P C2 [a, b] - space C[a, b] with Hermitian inner product (!, g) of functions or mean-square deviation norm
658
Index of Basic Notation
R ( E ) - set (space) of functions that are Riemann integrable over the set E R[a, b] - space R ( E ) when E = [a, b] R(E) - space of classes of Riemann integrable functions on E that are equal almost everywhere on E Rp(E) (Rp(E)) - space R(E) endowed with norm R2 (E) (R2 (E)) - space R(E) endowed with Hermitian inner product ( ! , g ) or mean-square deviation norm Rp [a, b], R2 [a, b] - spaces Rp ( E ) and R2 ( E ) when E = [a, b] .C(X; Y ) , (.C(X1 , . . . , Xn ; Y)) - space of linear (n-linear) mappings from X (from (X1 X · · · X Xn )) into Y TMP or TM(p) , TpM, Tp(M) - tangent space to the surface (manifold) M at the point p E M S - Schwartz space of rapidly decreasing functions V( G) - space of fundamental functions of compact support in the do main ·G V' (G) - space of generalized functions on the domain G V - an abbreviation for V( G) when G = JRn V' - an abbreviation for V' (G) when G = JRn
I JI P
M ETRICS , NORM S , I N N E R PRO D U CTS
1
d (x , x 2 ) - distance between points x 1 and x 2 in the metric space (X, d) lx l , l l x l l - absolute value (norm) of a vector x E X in a normed vector space - norm of the linear (multilinear) operator , p :2:: 1 - integral norm of the function dx := ( J
A I AI 1 P I J I P I J I P (x) ) 1 l f l 2 -- mean-square deviation norm ( I / I l P when = 2) E
p
(a, b) - Hermitian inner product of the vectors a and b
f
(!, g) := EJ(! · g) (x) dx - Hermitian inner product of the functions f and g
a · b - inner product of a and b in JR 3 a x b -- vector (cross) product of vectors a and b in JR3 (a, b , c ) - scalar triple product of vectors a, b, c in JR 3
F U NCTIONS
g o f - composition of functions f and g
f - 1 - inverse of the function f f (x) - value of the function f at the points x; a function of x
Index of Basic Notation
659
f(x l , . . . , xn ) � value of the function f at the point x = (x 1 , . . . , xn ) E X in the n-dimensional space X ; a function depending on n variables x l , . . . , xn . supp f � support of the function f J f (x) -- jump of the function f at the point x {ft : t E T} � a family of functions depending on the parameter t E T {!n ; n E N} or {!n } � a sequence of functions ft --+ f on E convergence of the family of functions Ut ; t E T} to the B function f on the set E over the base B in T ft =4 f on E uniform convergence of the family of functions Ut ; t E T} B to the function f on the set E over the base B in T �
�
f = o(g) over B f = O(g) over B j g or j � g over "'
}
B
asymptotic formulas (the symbols of comparative asymptotic behavior of the functions f and g over the base
B
00
f(x) 2::: 'Pn (x) over B � expansion in an asymptotic series n= l V(x) � Dirichlet function �
exp(A) � exponential of a linear operator A B(o:, /3) � Euler beta function T(o:) � Euler gamma function XE characteristic function of the set E �
DIFFERENTIAL CALCULUS
f'(x) , fx (x) , df(x) , D f(x) � tangent mapping to f (differential of f) at the point x . *fr , a;j(x) , D;j(x) � partial derivative (partial differential) of a function f depending on variables x 1 , . . . , x n at the point x = (x 1 , . . . , x n ) with respect to the variable x i Dv f (x) � derivative of the function f with respect to the vector v at the point x
'V
� Hamilton ' s nabla operator grad f � gradient of the function f div A � divergence of the vector field A curl B � curl of the vector field B INTEGRAL CALCULUS
f.L(E) � measure of the set E
Index of Basic Notation
660
I J(x) dx E
I f(x 1 , . . . , x n ) dx 1 . . . dx n E
I . . . I f(x l ' . . . ' x n ) dx l . . . dx n E
,}
integral of the function f over the set E C JRn
I dy I f(x, y) dx - iterated integral
y
X
I P dx + Q dy + R dz
1
I F . ds , I (F , ds) 1
1
curvilinear integral (of second kind) or the work of the field F = (P, Q, R) along the path '"'(
,}
I f ds - curvilinear integral (of first kind) of the function f along the curve '"Y
1
II P dy A dz + Q dz A dx + R dx A dy s
II F · du , s
II (F, du) , x
integral (of second kind) over the surface S in JR3 ; flux of the field F = (P, Q, R) across the surface S
III f da - surface integral (of first kind) of f over the surface s
D I F F ERENTIAL FORMS
w (wP) - a differential form (of degree p) wP A wq - exterior product of forms wP and wq dw - (exterior) derivative of the form w I w - integral of the form w over the surface (manifold) M
M
w} : = (F, · ) - work form w{r : = (V, · ) - flux form
S
Subject Index
8-function, 281 8-neighborhood, 5, 1 1 c-grid, 16 k-dimensional volume, 185 k-path, 254 n-dimensional disk, 1 79, 322 nth moment, 404 p-- cycle, 358 p-- form, 197 TJ space, 14 T2 space, 14 0-formula, 593
A Abel summation, 394 Abel's transformation, 376 Abel-Dirichlet test, 376, 377, 379 Absolute convergence - of a series, 374-376 - of an improper integral, 420 - of functions, 375 Adiabatic, 223 Adiabatic constant, 224 Adjoint mapping, 317 Admissible set, 117 Alexander horned sphere, 162 Algebra - exterior, 319 - graded, 314 - Grassmann, 319 - Lie, 72 - of forms, 313 -- skew-symmetric, 314 - of functions, 401 - - complex, 401 real, 401 self-adjoint, 405 separating points, 401
Almost everywhere, 1 1 1 Alternation, 314 Amplitude, 560 Amplitude modulation, 587 Analysis, harmonic, 560 Angular velocity, 69-70 Approximate identity, 455 Area - as the integral of a form, 227-230 - Minkowski outer, 194 - of a k-dimensional surface, 187, 229 - of a piecewise-smooth surface, 229 - of a sphere in lie, 192, 444 Asymptotic equality, 598 Asymptotic equivalence, 598 Asymptotic estimate, 597 - uniform, 6 1 1 Asymptotic expansion - in the sense of Erdelyi, 610 - uniform, 6 1 1 Asymptotic formula, 598 Asymptotic methods, 597 Asymptotic problem, 596 Asymptotic sequence, 601 Asymptotic series, 601 , 602 - general, 601 - in the sense of Erdelyi, 610 - in the sense of Poincare, 602 - power, 606-609 Asymptotic zero, 603 Asymptotics, 596, 597 - of a Bessel function, 640 - of a Laplace integral, 638 - of canonical integral, 633 - of Legendre polynomials, 623 - of the Fourier integral, 635, 639 - of the gamma function, 622, 629
662
Subject Index
- of the Laplace integral, 635 - of the probability error function, 628 Atlas analytic, 325 of a surface (manifold) , 162, 320 - of class c ( k ) ' 325 - orienting, 174, 328 - smooth, 325 Averaging of a function using a kernel, 494 B
Ball - closed, 5 - in JR n , volume, 444 - in a metric space, 5 Band-limited signal, 585 Base - in the set of partitions, 108 - of a topology, 10 Basis of a vector space, 500 Bernoulli integral, 309 Bernoulli numbers, 635 - generating function, 635 Bernoulli polynomials, 555 Bessel function, 392, 410 - asymptotics, 640 Bessel's equation, 392, 410, 414, 434 Bessel's inequality, 508-509 - for trigonometric system, 534 Beta function, 437-439 Bicompact set, 15 Borel's formulas, 581 Boundary - of a p-dimensional cube, 357 of a half-space, 1 78 - of a manifold, 321 - of a surface, 178 Boundary cycle, 358 Boundary point, 6, 13, 321 Boundedness - total, 397 - uniform, 397 - uniform, of a family of functions, 371 Brachistochrone, 92-94 Bracket, Poisson, 348 Bundle of tangent paths, 347
c
Canonical embedding, 349 Canonical integral - asymptotics, 633 Cardinal sine, 588 Carnot cycle, 226 Category of a set, 28 Cauchy integral, 309 Cauchy-Bunyakovskii inequality, 452, 505 Cauchy-Riemann equations, 310 Chain - of charts, 329 - - contradictory, 329 - - disorienting, 329 - of singular cubes, 357 Change of variable in an integral, 135-143, 156-158 Characteristic - frequency, 562 - phase, 562 - spectral, 562 Chart - local, 161, 320 - parameter domain, 161 - range, 161 - range of, 320 Charts, consistent, 17 4, 328 Chebyshev metric, 3 Chebyshev polynomials, 525 Chebyshev-Laguerre polynomials, 525 Circulation of a field along a curve, 233, 277 Class - orientation, 328 Closed ball, 5 Closed form, 352 Closed set, 13 - in a metric space, 5, 6 - in a topological space, 13 Closure, 6, 13 Coefficient - of thermal conductivity, 303 - of thermal diffusivity, 304 Coefficients - Fourier, 506-508, 510, 513, 519, 526, 531, 536, 541, 551 , 553, 557
Subject Index - - extremal property, 506 -- Lame, 268, 275 Cohomology group, 356 Compact set, 15 - elementary, 159 -- in a metric space, 16 Companion trihedral, 73 Complete system of vectors, 51 1-516 Completeness of the trigonometric system, 545 Completion of a space, 24-27 Condition - necessary, for convergence, 374 Conditions - Dini, 535, 570 Connected set, 19 Consistent charts, 174, 328 Constant, cyclic, 297, 360 Content of a set (Jordan) , 1 19 Continuity - and passage to the limit, 383 - of an improper integral depending on a parameter, 423-426 - of an integral depending on a parameter, 408-409 Continuous group, 72, 336 Contribution - of a maximum point, 616 Contribution of a point to asymptotics, 615 Convergence - absolute, 375 - - of an improper integral, 420 - in mean, 528, 545 - of a family of functions - - pointwise, 363, 367 - - uniform, 367 - of a series of vectors, 505 - of an improper integral, 153 -- Cauchy principal value, 155 -- of distributions, 463 - of generalized functions, 463 - of linear functionals - - strong (norm) , 58 - of test functions, 463 - uniform, 397 - - Cauchy criterion, 369-370 - weak, 463
663
Convergence set, 363, 367 Convolution, 449-470 - differentiation, 453 - in lRn , 483 - multidimensional, 483-493 - symmetry, 453 - translation-invariance, 453 Coordinate parallelepiped, 107 Coordinates - Cartesian, 259, 261 - curvilinear, 161, 265 - cylindrical, 268-273 - of a tangent vector, 340 - polar, 166 - spherical, 166, 268-273 - triorthogonal, 268-273 Cotangent space to a manifold, 340 Covering - locally finite, 336 - refinement of another covering, 336 Criterion - Cauchy, 422 for uniform convergence, 369-370, 375 for uniform convergence of a series, 378 for uniform convergence of an integral, 418-420 - Darboux, 1 14-1 16, 1 29 - for a field to have a potential, 290 - for compactness in a metric space, 17 - for continuity of a mapping, 31 - Lebesgue, 1 1 1-114, 1 17, 1 19, 138, 145 Critical point, 149 Cube - singular, 357 - - boundary of, 357 Curl, 203, 260, 274 - physical interpretation, 281-282 Current function, 310 Curvature of a curve, 73 Cycle - boundary, 358 Carnot, 226 of dimension p, 358
664
Subject Index
Cycles, homologous, 358 Cyclic constant, 297, 360 Cyclic frequency, 560 Cylinder, 167 D
Darboux integral - lower, 1 14 - upper, 1 14 Darboux sum - lower, 1 14, 396 - upper, 1 14 Deformation (of a closed path) , 293 Degree (order) of a differential form, 195 Delta function (8-function) , 281 , 449, 455, 461 , 464, 469, 473, 485, 496, 559, 591 , 593 - shifted, 485 Derivation - of a ring, 352 Derivative - Lie, 352 - of a mapping, 61, 62 - of order n, 80 - partial, 7Q-71 - second, 80, 83 - with respect to a vector, 81 Derivative mapping, 62 Deviation - mean-square, 529 Diffeomorphism, elementary, 140 Differential - exterior, 342, 349 - exterior, of a form, 201 , 342, 349 - of a mapping, 61 - of order n, 80 - partial, 70, 71 - second, 80, 83 - total, 71 Differential equation with variables separable, 226 Differential form, 197 closed, 296, 352 - exact, 296, 352 - flux, 199 - of class c ( k ) ' 342 - of compact support , 344 -
- of order zero, 201 - on a manifold, 340 - on a smooth surface, 207 - restriction to a submanifold, 350 - work, 198 Differential operator, 486 - adjoint, 486 - self-adjoint, 487 - transpose, 486 Differentiation, 61 - at a point of a manifold, 347 - of a family of functions depending on a parameter, 388-393 - of a Fourier series, 544 - of a generalized function, 465-468 - of a power series, 391 - of a series, 391 - of an integral - - over a liquid volume, 497 - of an integral depending on a parameter, 409-412, 483 - on a manifold, 347, 351 - with respect to a parameter, 426-428 Dimension of a manifold, 320 Dini conditions, 535 Dipole, 299, 496 Dipole moment, 299, 496 Dipole potential, 299 Direct product of metric spaces, 7 Direction of motion along a curve, 176 Dirichlet discontinuous factor, 436 Dirichlet integral, 436, 447, 570, 610 Dirichlet kernel, 532, 557 Discontinuous factor, Dirichlet, 436 Discrete group of transformations, 335 Discrete metric, 2, 8 Disk, n-dimensional, 179, 322 Distribution, 460 - regular, 464 - singular, 464 - tempered, 591 Divergence, 203, 260, 274 - physical interpretation, 278-281 Domain - elementary, 240
Subject Index - fundamental, of a group of automorphisms, 324, 336 - of parameters of a chart, 320 - parameter, 366 - simply connected, 293 Double layer, 496
E Efficiency of a heat engine, 225 Eigenvalue, 518 - of a Sturm-Liouville problem, 518 Eigenvector of an operator, 519 Element of volume, 227 Elementary diffeomorphism, 140 Elliptic integral, 393 - complete - - of first kind, 393, 4 1 1 - - o f second kind, 393, 4 1 1 - modulus, 4 1 1 Embedding - canonical, 349 Entropy, 302 Envelope of a family of curves, 252 Equality - asymptotic, 598 - Parseval's, 520, 529, 547, 569, 592 Equation - Bessel's, 392, 410, 414, 434 - differential, 34, 226 - Euler's - - hydrodynamic, 307 - Euler-Lagrange, 91 - heat, 302, 583 - hypergeometric, 393 - Laplace's, 304, 524 - Mayer's, 224 - of an adiabatic, 224 - of continuity, 304-306 - of state, 222 - Poisson's, 298, 304, 493 - wave, 307-310, 581 - - homogeneous, 309 - - inhomogeneous, 309 Equations - Cauchy-Riemann, 310 - electrostatic, 286 - magnetostatic, 286
665
- Maxwell, 262, 263, 275, 281 , 298, 311 Equicontinuity, 397 Equivalence, asymptotic, 598 Equivalent atlases - with respect to orientation, 175, 328 - with respect to smoothness, 325 Erdelyi's lemma, 635 Error function - asymptotics, 628 Estimate, asymptotic, 597 uniform, 6 1 1 Euler's formula, 446 Euler's hydrodynamic equation, 307 Euler-Gauss formula, 440 Euler-Lagrange equation, 91 Euler-Poisson integral, 433, 442, 567, 580, 610 Eulerian integral, 437-448 Exact form, 352 Exhaustion of a set, 150 Expansion, asymptotic - in the sense of Erdelyi, 610 - uniform, 61 1 - uniqueness, 602 Exponential - of an operator, 71-73 Exponential integral, 600 Exponential system, 502 Exterior algebra, 319 Exterior differential, 201 , 342, 349 of a form, 342, 349 Exterior point, 6, 13 Exterior product , 195, 315, 342 Extremum of a function - necessary condition, 87 - sufficient condition, 87 - with constraint, 106
-
-
F Family of functions, 366 - equicontinuous, 397, 398 - - at a point, 403-404 - separating points, 401 - totally bounded, 397 - uniformly bounded, 397 Fejer kernel, 557 Field
666
Subject Index
- of forms, 257 - of linear forms, 197 - potential, 288 - scalar, 257 - solenoidal, 295 - tensor, 257 - vector, 257, 348 - - smooth, 348 Filter, low-frequency, 563 Flow - planar, 310 - plane-parallel, 310 Flux across a surface, 213-2 18, 233, 278, 302, 496 Force - mass, 306 Form - anti-symmetric, 195 - differential closed, 296, 352 exact, 296, 352 flux, 199 of class c k ) ' 342 of compact support, 344 on a manifold, 340 restriction to a submanifold, 350 work, 198 - Hermitian, 45 - - nondegenerate, 45 - - nonnegative, 45 - on a surface in lRn , 227 - semidefinite, 87 - skew-symmetric, 195, 314-316 - volume in lR k , 227 Formula - asymptotic, 598 - Cauchy-Hadamard, 375 - co-area, 236 - complement, for the gamma function, 441 Euler's, 446 - Euler-Gauss, 440 - for change of variable in an integral, 136 - Fourier inversion, 571, 592 - Frenet, 73 - Gauss' , 447 -
- Gauss-Ostrogradskii, 242-245, 278, 303, 489, 496 - - in vector analysis, 278 - Green's, 496 - homotopy, 360 - Kotel'nikov's, 586 - Kronrod-Federer, 236 - Legendre's, 446 - Leibniz ', 409 - Newton-Leibniz, 237, 278, 408, 573 - Poisson summation, 593 - reduction - - for the beta function, 438 - - for the gamma function, 440 - Stirling 's, 448, 622 - Stokes' , 237, 276, 278, 345-347, 358 general, 248-251 , 345 - - in JR3 , 245-247 - - in vector analysis, 277 - Taylor's, 94, 415 - Wallis', 434, 448, 636 Formulas - Borel's, 581 - differential, of field theory, 263-265 - Green's, 284, 285 - integral, of vector analysis, 278 Fourier coefficients - extremal property, 506 Fourier cosine transform, 564 Fourier integral, 634 - asymptotics, 635, 639 - multiple, 641 Fourier inversion formula, 571, 592 Fourier series, 499-526 - in a general orthogonal system, 499-500 - multiple, 557 - of generalized functions, 558 - partial sum - - integral representation, 531 - pointwise convergence, 527 rate of convergence and smoothness, 540 Fourier sine transform, 564
Subject Index Fourier transform, 561 , 563, 566, 571, 574, 579, 583 - asymptotic properties, 573-576 - frequency shift, 587 - in £ 2 , 589 - inverse, 583 - multidimensional, 577 - normalized, 568 - of a convolution, 569 - of generalized functions, 592 - rate of decrease and smoothness, 573 - time shift, 587 Frame - Frenet, 73 - orienting, 172 Frequencies, natural, 518 Frequency, 560 -- cyclic, 560 - fundamental, 560 - harmonic, 560 Frequency characteristic, 562 Frequency spectrum, 593 Fresnel integral, 436, 447, 610 Function - band-limited, 585 - Bessel, 392, 410 -- - asymptotics, 640 - beta, 437-439 - cardinal sine, 588 - change of coordinates, 321 - current, 310 - delta, 449, 455, 461 , 464, 469, 473 - Dirichlet, 364 - exponential integral, 600 - fundamental, 484 - gamma, 439-442 - - asymptotics, 622, 629 _:_ - incomplete, 610 - generalized, 460 - - differentiation, 465-468 of several variables, 484 - - regular, 464 - - singular, 464 - generating, 471 - Green 's, 470 - harmonic, 286, 304 - -- conjugage, 310
667
- Heaviside, 466, 469, 491 , 497 - limit, 363, 367 linear, 49 locally integrable, 452 multilinear, 49 of compact support, 136, 452 - phase, 634 piecewise continuous, 536 piecewise continuously differentiable, 536 - probability error, 610 -- asymptotics, 628 - rapidly decreasing, 575 - Riemann integrable - - over a set, 1 18 - - over an interval, 109 - sample, 588 sine integral, 610 spectrum of, 560 spherical, 524 support, 451 system, 449, 460 test, 463, 484 - transient pulse, 449 - uniformly continuous, 456 - unit step, 466 - zeta, 447 Functional - linear, 49 - multilinear, 49 Functions, asymptotically equal, 598 Functions, asymptotically equivalent, 598 Fundamental domain of a group of automorphisms, 324, 336 Fundamental frequency, 560 Fundamental sequence, 2 1 Fundamental solution, 469-470, 490 Fundamental tone, 518 G
Galilean transformation, 595 Gamma function, 439-442 - asymptotics, 622, 629 - incomplete, 610 Gauge condition, 3 1 1 Gauss' formula, 447 Gauss' theorem, 496
668
Subject Index
Gauss-Ostrogradskii formula, 237, 242-245, 489, 496 - in vector analysis, 278 Generalized function, 460 - differentiation, 465-468 - of several variables, 484 - regular, 464 - singular, 464 Generating function of a sequence, 471 Gibbs' phenomenon, 544, 556 Graded algebra, 314 Gradient, 203, 260, 274, 282 - physical interpretation, 283 Gram matrix, 185 Grassmann algebra, 319 Green's formula, 496 Green's function, 470 Green's theorem, 237-242 Group - cohomology, 356 continuous, 72, 336 - discrete, of transformations, 335 homology, 298, 356, 358 - - p-dimensional, 358 homotopy, 298 Lie, 72, 336 of automorphisms, 324 one-parameter, 350 - topological, 72, 336 H
Haar system, 525 Hamilton operator (nabla) , 262, 265 Harmonic analysis, 560 Harmonic frequency, 560 Harmonic function, 286, 304 - conjugate, 310 Harmonic polynomials, 524 Hausdorff space, 12, 14 Heat capacity, 223 - molecular, 223 Heat engine, 224 Heat equation, 302, 583 Heaviside function, 469, 491, 497 Hermite polynomials, 524 Hermitian form, 45 - nondegenerate, 45 - nonnegative, 45
Hilbert's fifth problem, 336 Homeomorphism, 31 Homologous cycles, 358 Homology, 358 Homology group, 298, 356 Homotopic paths, 294 Homotopy, 293 Homotopy formula, 360 Homotopy group, 298 Homotopy identity, 352 Hypergeometric equation, 393 Hypergeometric series, 392 I
Identity - approximate, 455 - homotopy, 352 - Jacobi, 72 Improper integral, 151 - depending on a parameter, 407 - with variable singularity, 478-483 Improper integral depending on a parameter - Abel-Dirichlet test, 421 - Cauchy criterion, 422 - continuity, 423-426 - limiting passage, 423-426 - uniform convergence, 431 - Weierstrass' M-test, 431 Induced orientation on the boundary of a surface, 181 Inequality - Bessel ' s, 508-509 - - for trigonometric system, 534 - Brunn-Minkowski, 121 - Cauchy-Bunyakovskii, 46, 452, 505 - Clausius, 225 - Holder's, 126 - isoperimetric, 194, 550-551 - Minkowski 's, 44, 126, 494 - - generalized, 494 - Steklov's, 556 - triangle, 1 - Wirtinger's, 556 Inertia, 306 Inner product, 45-47, 351 - of a field and a form, 351 Instantaneous axis of rotation, 70
Subject Index Integral, 108 - Bernoulli, 309 - canonical - - asymptotics, 633 - Cauchy, 309 - Darboux - - lower, 1 14 - - upper, 1 14 - depending on a parameter - - continuity, 408-409 - - differentiation, 409-412 -- integration, 413 - depending on a paramter, 407-415 - Dirichlet, 286, 436, 447, 570, 610 - double, 109 - elliptic, 393 - - complete, of first kind, 393, 41 1 , 435 - - complete, of second kind, 393, 411 - Euler-Poisson, 433, 442, 567, 580, 610 - Eulerian, 437-448 - - first kind, 437 - - second kind, 437 - Fourier, 561 , 564, 567, 571 , 578, 585, 588, 634 - - asymptotics, 635, 639 - - multiple, 641 - Fresnel, 436, 447, 610 - Gauss' , 253 - improper - - differentiation with respect to a parameter, 426-428 - - integration with respect to a parameter, 429-434 iterated, 127-129 - Laplace, 572, 612, 634 - - asymptotics, 635, 638 - Lebesgue, 395 - line, 212 - multiple, 109 - - depending on a parameter, 476-497 - - with variable singularity, 481 - of a differential form - - over a surface, 215, 218 - of a form on a manifold, 343-345 -
669
- of a function over a surface, 227, 231 - over a chain, 358 - over a set, 1 17 - over a singular cube, 358 - Poisson, 459, 588 - Raabe's, 446 - Riemann, 395 - - over a set, 1 17 - - over an interval, 108 - surface - - of first kind, 231 - - of second kind, 232 - triple, 109 Integral metric, 4 Integral operator, 568 Integral representation of the partial sum of a Fourier series, 531 Integral transform, 568 Integration by parts in a multiple integral, 255 Integration of an integral depending on a parameter, 413 Integration with respect to a parameter, 429-434 Interchange - of differentiation and passage to the limit, 388-393 - of integrals, 127 - - improper, 429-434 - - proper, 127 - of integration and passage to the limit, 386-388 - of limiting passages, 381 - of summation and differentiation of a series, 391 Interior point , 6, 13 Interval in R n , 107 Isobar, 222 Isochore, 222 Isometry of metric spaces, 24 Isomorphism - of normed vector spaces, 60 - of smooth structures, 335 Isoperimetric inequality, 194, 550-551 Isotherm, 222 Iterated integral, 127-129
670
Subject Index
J
Jacobi identity, 72 Jacobian of a coordinate change - cylindrical coordinates, 269 general polar, 166 spherical coordinates, 269 triorthogonal coordinates, 269 Jordan measure, 1 19 K
Kernel - Dirichlet, 532, 557 - Fejer, 557 - Poisson, 472 Klein bottle, 169, 325 Kotel'nikov's formula, 586 L
Lagrange's theorem, 309 Laplace integral, 572, 612, 634 - asymptotics, 635, 638 Laplace transform, 612 Laplace's equation, 304, 524 Laplace's method, 612-615 - multidimensional, 633 Laplacian, 264, 265, 273, 493 Law - Ampere's, 234 - Archimedes', 245 - Biot-Savart , 235 - Coulomb's, 279 - Faraday's, 233 - Gauss', 286 - Newton's, 288, 306, 307 - normal distribution, 471 - of conservation of mass, 279 Layer - double, 496 - single, 493 Legendre polynomials, 504, 516, 623 Legendre's formula, 446 Lemma - Erdelyi's, 635, 640, 641 - exponential estimate, 615, 624, 637 - Hadamard's, 415 - Morse's, 149, 408, 633, 637, 641 - nested ball, 27
- on continuity of the inner product, 505 - on finite c:-grids, 17 - on orthogonal complement, 508 - Poincare's, 296 - Riemann-Lebesgue, 532, 570, 638 - Sard's, 149 - Watson's, 631 , 635 Lie algebra, 72, 348 Lie derivative, 352 Lie group, 72, 336 Limit, 2 1 , 28 - of a family of continuous functions, 383-386 - of a family of functions, 370 - of a mapping, 28 - of a sequence, 21 - of a sequence of functions, 363 Limit function, 363, 367 Limit point, 5, 7, 13 Limiting passage - interchange, 381 - under a differentiation sign, 388-393 - under an integral sign, 386-388 Linear transformation, 49 Local chart, 320 Local maximum, 87 Local minimum, 87 Localization principle - for a Fourier series, 532 - for a Laplace integral, 615 Locally integrable function, 452 Lorentz transformation, 595 Low-frequency filter, 563 M
M-test for convergence, 374 Manifold, 320-325 -- analytic, 325 - compact, 323 - connected, 323 - contractible, 353 embedded in lR n , 161 nonorientable, 328 - of class c< k ) , 325 - orientable, 328 - oriented, 328 - smooth, 325
Subject Index - topological, 325 - with boundary, 322 - without boundary, 322 Mapping - adjoint, 317 - bounded, 29 - continuous, 30-33 - - at a point, 30 - continuously differentiable, 75 contraction, 34 derivative, 62 - - of order n , 80 - differentiable at a point, 61-62 - differentiable on a set, 62 - homeomorphic, 31 - linear, 49 - multilinear, 49 - of class c ( k ) , 325 - partial derivative, 70 - smooth, 325 - tangent, 61, 339, 349 - ultimately bounded, 29 - uniformly continuous, 32 Mass force, 306 Maximum, local, 87 Mean convergence and completeness, 545, 547 Mean value over a period, 560 Mean-square deviation, 3, 529 Measure - of a set (Jordan) , 1 19 - of an interval, 110 Measure zero, 110-1 1 1 , 1 17 Method - asymptotic, 597 - Fourier, 517-520 - Laplace's, 612-615 - - multidimensional, 633 - of Lagrange multipliers, 106 - of least squares (Gauss') , 520 - of separating singularities (Krylov's) , 544 - of tangents (Newton's), 38 - of tangents, modified (Newton-Kantorovich) , 39 - separation of variables, 517-520 - stationary phase, 633, 635 - - multidimensional, 640 -
671
one-dimensional, 638 - Steklov's averaging, 520 Metric, 1 - Chebyshev, 3 - discrete, 2, 8 - integral, 4 - of mean-square deviation, 3, 4 - of uniform convergence, 4, 397, 399 - Riemannian, 267 Metric space, 1 - separable, 15 Metric spaces, direct product, 7 Minimum, local, 87 Mobius band, 168, 175, 180, 184, 327, 330 Modulus - of an elliptic integral, 4 1 1 Moment - dipole, 299, 496 - multipole, 299 - of a function, 404 Morse's lemma, 633, 637, 641 Multilinear transformation, 49 Multiple integral - depending on a parameter, 476-497 - improper, 151 - integration by parts, 255 - iterated, 127-1 29 - with variable singularity, 476, 478 Multiplication of generalized functions, 475 Multipole, 299 Multipole moment, 299 Multipole potential, 299 N
Nabla (Hamilton operator) , 262, 265 Natural frequencies, 518 Natural oscillations, 518 Natural parametrization, 73 Necessary condition for uniform convergence, 374 Neighborhood, 13 - in a metric space, 5, 6, 9 - in a topological space, 1 1 - of a germ of functions, 1 1 Newton-Leibniz formula, 573
672
Subject Index
Norm - in a vector space, 42, 589 - of a transformation, 52 - of a vector, 42, 589 Null-series, Men'shov's, 531 Numbers, Bernoulli, 635 - generating function, 635 0
One-parameter group, 350 Open set - in a metric space, 5 - in a topological space, 9 Operational calculus, 466 Operator - differential, 486 adjoint, 486 - - self-adjoint, 487 - - transpose, 486 - Hamilton (nabla) , 262, 265 - integral, 568 - Laplace, 264, 265, 273, 493 - nilpotent, 71 - of field theory, 260 - - in curvilinear coordinates, 265-274 - symmetric, 519 - translation, 449 - translation-invariant, 449 Orbit of a point, 324, 335 Order (degree) of a differential form, 195 Orientation - induced on the boundary of a manifold, 331 of a domain of space, 172 of a manifold, 328 of a surface, 171-177, 181 of the boundary of a surface, 181 opposite to a given orientation, 172 Orientation class, 328 - of atlases - - of a surface, 175 - of coordinate systems, 171, 172 - of frames, 171 Oriented space, 171 Orienting frame, 172 Orthogonal vectors, 499, 5 19
Orthogonality with a weight, 522 Orthogonalization, 503-504 Oscillation of a mapping, 30 - at a point, 32 Oscillations, natural, 518 Overtones, 518 p
Parallelepiped, coordinate, 107 Parameter domain, 161, 320, 366 Parameter set, 366 Parameters - Lame, 268, 275 Parseval's equality, 520, 529, 569, 592 Partition - locally finite, 189 - of an interval, 108 - - with distinguished points, 108 - of unity, 148, 331-333 - - k-smooth, 332 - - subordinate to a covering, 332 Paths - homotopic, 294 - tangent, 347 Period of an integral, 297 Period, over a cycle, 360 Phase, 560, 634 - stationary, 634 Phase characteristic, 562 Phase function, 634 Piecewise continuous function, 536 Piecewise continuously differentiable function, 536 Planar flow, 310 Plancherel's theorem, 590 Plane - projective, 326 Plane-parallel flow, 310 Point - boundary, 6 - - in a topological space, 13 - boundary (of a manifold) , 321 - boundary (of a surface) , 178 - critical, 149 - exterior, 6 - - in a topological space, 13 interior, 6, 13 - - in a topological space, 13 -·
Subject Index - limit, 5, 7, 13 - of a metric space, 1 Poisson bracket, 348 Poisson integral, 459, 588 Poisson kernel, 472 Poisson's equation, 298, 304, 493 Polar coordinates, 166-167 Polynomial - trigonometric, 526 Polynomials - Bernoulli, 555 - Chebyshev, 525 - Chebyshev-Laguerre, 525 - harmonic, 524 Hermite, 524 Legendre, 504, 516, 623 Positive direction of circuit, 176 Potential - dipole, 299 - multipole, 299 - of a field, 288 - quadrupole, 299 - scalar, 296 - single-layer, 493 - vector, 295, 296 - - of a magnetic field, 296 - velocity, 309 Potential field, 288 Power series, 375 - differentiation of, 391 Principal value (Cauchy) of an integral, 155 Principle - Cavalieri's, 132 - contraction mapping, 36 - d'Alembert 's, 306 - Dirichlet 's, 286 - fixed-point, 34, 38 - localization - - for a Fourier series, 532 - - for a Laplace integral, 615 - Picard-Banach, 34, 38 - stationary phase, 633, 634, 640 - uncertainty, 590 Probability error function, 610 - asymptotics, 628 Problem - asymptotic, 596
673
- brachistochrone, 92, 94 - curve of most rapid descent, 92 - Luzin 's, 531 - Riemann, 530 - shortest-time, 92 - Sturm-Liouville, 523 Process - adiabatic, 223 - quasi-static, 222 Product - exterior, 3 15, 342 - inner, 45-47, 351 - - of a field and a form, 351 -- of functions, 522 - of generalized functions, 4 75 - of manifolds, 320 - of topological spaces, 13 - tensor, 313 Projective line, 326 Projective plane, 326 - real, 326 Pulse - rectangular, 587 - triangular, 588 Q
Quadrupole, 299 Quadrupole potential, 299 Quantities of the same order, 598 R
Raabe's integral, 446 Rademacher system, 525 Range of a chart, 161, 320 Rapidly decreasing function, 575 Rectangular pulse, 587 Restriction of a form to a submanifold, 207, 350 Riemann sum, 108 Riemann-Lebesgue lemma, 532, 638 Riemannian metric, 267 Rule, Leibniz, 134, 409, 453 s
Sample function, 588 Sard's lemma, 149, 150 Sard's theorem, 236 Scalar potential, 296 Schwartz space, 592
674
Subject Index
Schwarz boot, 194 Separable metric space, 15 Separation of points by functions, 401 Separation of variables, 517-520 Sequence - asymptotic, 601 - Cauchy, 21 - convergent, 21 - - uniformly, 368 - convergent at a point, 363 - convergent on a set, 363 - fundamental, 21 - monotonic, of functions, 376 - nondecreasing, of functions, 376 - nonincreasing, of functions, 376 Series - asymptotic, 601 , 602 general, 601 in the sense of Erdelyi, 610 in the sense of Poincare, 602 power, 606-609 - continuity of sum, 384 - Dirichlet, 380 - Fourier, 499-526 in a general orthogonal system, 499-500 multiple, 557 of generalized functions, 558 partial sum, 531 pointwise convergence, 527 rate of convergence and smoothness, 540 - hypergeometric, 392 - of functions, 366 - power, 375 - Stirling ' s, 636 - trigonometric, 526-560 - uniformly convergent - - Cauchy criterion, 378 Set - admissible, 1 1 7 - bicompact, 15 - Cantor, 23 - closed - - in a metric space, 5, 6 - - in a topological space, 13 - compact, 15
-
conditionally compact, 18 everywhere dense, 12 Jordan measurable, 1 19 nowhere dense, 28 of area zero, 190 of content zero, 120 of convergence, 363, 367 of first category, 28 of measure zero (Jordan) , 120 of measure zero (Lebesgue) , 1 17, 120, 121 - of second category, 28 - of volume zero, 120 - open - - in a metric space, 5 - - in a topological space, 9 -- parameter, 366 - relatively compact, 18 - totally bounded, 18 Signal - band-limited, 585 - spectrum of, 560 Simply connected domain, 293 Sine integral, 610 Sine, cardinal, 588 Single layer, 482 Single-layer potential, 493 Singular cube, 357 - boundary of, 357 Smooth structure, 326 Smooth structures, isomorphic, 335 Solenoidal field, 295 Solution - fundamental, 469-470, 490 - - of the Laplacian, 490 Space - Banach, 43 - complete, 21 - connected, 19 - cotangent to a manifold, :340 - Euclidean, 4 7 - Hausdorff, 12, 14 - Hermitian (unitary) , 47 - Hilbert, 47 - locally compact, 18 - locally connected, 20 - metric, 1 - - separable, 15
Subject Index - normed affine, 62 - normed vector, 42 - - complete, 43 - of distributions, 463 of fundamental functions, 484 of generalized functions, 463, 484 of tempered distributions, 591 of test functions, 463, 484 path connected, 20, 34 pre-Hilbert, 47 Schwartz, 591 , 592 Sobolev-Schwartz, 464 - tangent, 61 - tangent to JR n , 337 - tangent to a manifold, 337, 338 - topological, 9, 10 in the strong sense, 14 - - Tl , 14 - - T2 , 14 Spectral characteristic, 562 Spectrum - bounded, 585 - discrete, 560 - frequency, 593 Sphere, Alexander horned, 162 Spherical functions, 524 Stabilizer of a point, 335 Stationary phase method, 633, 635 - multidimensional, 640 - one-dimensional, 638 Stationary phase principle, 634 Steklov's inequality, 556 Stirling's formula, 448, 622 Stirling's series, 636 Stokes' formula, 237, 345-347, 358 - general, 248-251 , 345 -- in JR 3 , 245-247 Structure - smooth, 326 Sturm-Liouville problem, 523 Submanifold, 335 Subset, everywhere dense in c([a, bJ) , 400 Subspace - of a metric space, 7 - of a topological space, 13 Sum - Darboux
675
- - lower, 1 1 4 - - upper, 1 14 - Riemann, 108 Summation method - Abel, 384-385, 394 - Cesaro, 394 Support - of a differential form, 344 - of a function, 136, 451 Surface - elementary, 162 - k-dimensional, 161 - - smooth, 1 73 - nonorientable, 1 74 - of dimension k, 178 - of measure (area) zero, 190 - one-sided, 176 - orientable, 1 75 - oriented, 175 - piecewise smooth, 183 - - orientable, 184 two--sided, 176 - with boundary, 178 - without boundary, 1 78 - zero-dimensional, 183 Surface integral - of first kind, 231 - of second kind, 232 Symmetric operator, 519 System - exponential, 502 - Haar, 525 - Rademacher, 525 - trigonometric, 501 - - completeness, 545 - - in complex notation, 502 System function, 449, 460 System of sets - locally finite, 336 - refinement of another system, 336 System of vectors - complete, 51 1-516 - condition for completeness, 512 - linearly independent, 500 - orthogonal, 499-506 - orthonormal, 500 - orthonormalized, 500
676 T
Subject Index
Tangent mapping, 339, 349 Tangent paths, 347 Tangent space - to JR. n at a point, 337 - to a manifold, 337, 338 Tangent vector to a manifold, 338, 339, 349 Tempered distribution, 591 Tensor product, 313 Test - Abel-Dirichlet , 376 - Weierstrass', 374-376 - - for integrals, 425 Test function, 463 Theorem - Abel's, 379 - Arzela-Ascoli, 397-399 - Brouwer - - fixed-point , 242 - - invariance of domain, 321 Carnot 's, 225 - Cauchy's multidimensional mean-value, 287 - curl, 283 - Darboux', 1 15 - de Rham's, 359 - Dini's, 386, 430 divergence, 283 Earnshaw's, 286 Fejer's, 538, 557 finite-increment, 73-79 - Fubini's, 127-129, 580 Gauss', 285, 496 gradient , 283 - Green's, 237-242 - Hardy's, 395 Helmholtz', 302 implicit function, 96-106 inverse function, 105 - Kotel'nikov's, 585 - Lagrange's, 309 - Lebesgue's - - dominated convergence, 395 - - monotone convergence, 396 - mean-value for harmonic functions, 287 - - for the integral, 125
- on asymptotics of a Laplace integral, 621 - on existence of solutions of differential equations, 36-37 - on interchange of limiting passages, 382 - on the principal term of asymptotics of an integral, 621 - on uniform continuity, 32 - Plancherel 's, 590 - Poincare's, 353, 356 - Pythagorean, 505 - - for measures of arbitrary dimension, 193 - sampling, 585 - Sard's, 236 - Stone's, 40Q--403 - - for complex algebras, 405 - Tauberian, 395 - translation, 587 - Weierstrass' on approximation by polynomials, 400, 472, 494 on approximation by trigonometric polynomials, 484, 540 - Whitney 's, 169, 334 Whittaker-Shannon, 585 Timbre, 518 Tone, fundamental, 518 Topological group, 72, 336 Topological space, 9, 10 - base of, 10 - in the strong sense, 14 - weight of, 1 1 Topological spaces, product of, 13 Topology - base of, 10 - induced on a subspace, 13 - on a set, 9 - stronger, 14 Torsion of a curve, 73 Torus, 167 Total boundedness, 397 Transform - Fourier asymptotic properties, 573-576 - frequency shift, 587
-
Subject Index - - in L2 , 589 multidimensional, 577 of a convolution, 569 of generalized functions, 592 rate of decrease and smoothness, 573 time shift, 587 - Fourier cosine, 564 - Fourier sine, 564 - integral, 568 - Laplace, 612 Transformation - Abel's, 376 - bilinear, 49 - bounded, 53 - continuous - - multilinear, 56 - Galilean, 595 - linear, 49 - Lorentz, 595 - multilinear, 49, 52, 56 - trilinear, 49 Transient pulse function, 449 Translation operator, 449 Translation theorem, 587 Translation-invariant operator, 449 Triangle inequality, 1 Triangular pulse, 588 Trigonometric polynomial, 526 Trigonometric series, 526-560 Trigonometric system, 501 - completeness, 545 - in complex notation, 502 Trihedral, companion, 73 Triorthorgonal coordinates, 268 u
Uncertainty principle, 590 Uniform boundedness, 371, 397 - of a family of functions, 376 Uniform convergence, 397 - Cauchy criterion, 369-370
677
Uniformly convergent series - Cauchy criterion, 378 Unit step, 466 v
Value (Cauchy principal) of an improper integral, 155 Vector - angular velocity, 70 - tangent to a manifold, 338, 339, 349 Vector field - central, 213 - on a manifold, 348 - - smooth, 348 Vector potential, 295, 296 of a magnetic field, 296 Vector space, tangent to a manifold, 337, 338 Vectors - orthogonal, 499, 519 Volume - of a ball in R n , 192, 444 - of a set (Jordan), 1 19, 245 - of an interval, 107 Volume element, 227 -
w
Wallis' formula, 434, 448, 636 Watson's lemma, 63 1 , 635 Wave equation, 307-310, 581 - homogeneous, 309 - inhomogeneous, 309 Weak convergence, 463 Weierstrass M-test, 374 Weight of a topological space, 11 Wirtinger's inequality, 556 Work of a field, 2 1 1 z
Zeta function, 447
Name Index
A Abel, N. , 376, 377, 379, 384, 42 1 Alexander, J., 162 Ampere, A., 234 Archimedes ( ' ApXLJlifJOTJ') , 245 Arzela, C., 398, 404 Ascoli, G . , 398, 404 B
Banach, S., 34 Bernoulli, D., 309 Bernoulli, Jacob, 555, 635 Bernoulli, Johann, 92 Bessel, F. , 392, 410, 414, 434, 508 Biot , .J . , 235 Borel, E. , 581 Brouwer, 1 . , 178, 242 , 321 Brunn, H., 121 Bunyakovskii, V. , 46, 452 , 505 c
Cantor, G., 23, 32, 1 1 1 Carleson, 1 . , 531 Carnot, S., 226 Cart an, E. , 250 Cauchy, A. , 2 1 , 29, 46, 155, 287, 309, 310, 369, 375, 378, 422, 452, 505 Cavalieri, B., 132 Cesaro, E., 394 Chebyshev, P. , 3, 525 Clapeyron, E., 222 Clausius, R. , 225 Coulomb, Ch. , 279, 288 D
d'Alembert , J., 306 Darboux, G., 1 14- 1 16, 120, 129, 396
de Rham, G . , 359 Dini, U., 386, 430, 535, 570 Dirac, P. , 237, 281 , 464 Dirichlet , P. , 286, 364, 377, 379, 380, 42 1 , 436, 447, 520, 532, 557, 570, 610
E Earnshaw, S . , 286 Erdelyi, A., 610, 635 Euler, 1 . , 91, 307, 433, 437, 440, 442, 446, 530, 567, 580, 610 F Faraday, M . , 233, 237 Federer, H., 236 Fejer, 1 . , 538 Feynman, R. , 262 Fourier, J., 499-593, 634, 635 Frechet , M . , 1 1 Frenet , J . , 73 Fresnel, A., 436, 447, 610 Fubini, G., 127, 580 G
Galilei, Galileo, 595 Gauss, C . , 237, 242 , 249, 278, 285, 392, 440, 447, 471, 489, 496 Gibbs, J . , 544, 556 Gram, J . , 185, 503, 5 1 1 Grassmann, H . , 319 Green, G . , 237, 284, 470, 496 H
Haar, A., 525 Hadamard, J. , 375, 415 Hamilton, W. , 262 , 265 Hardy, G . , 395 Hausdorff, F., 1 1 , 13
680
Name Index
Heaviside, 0 . , 466, 469, 491 , 497 Heisenberg, W., 573 Helmholtz, H., 302 Hermite, Ch. , 45, 524 Hilbert, D., 47, 336 Holder, 0 . , 126 Hurwitz, A., 550 J
Jacobi, C., 72 Jordan, C . , 1 19-12 1 , 126, 130-132, 137, 144 Joule, G., 224 K
Kantorovich, L., 39 Kelvin, Lord (W. Thomson ) , 224, 237, 250 Klein, F . , 169, 325, 550 Kolmogorov, A., 530 Kotel'nikov, V., 585 Kronrod, A., 236 Krylov, A . , 544 L
Lagrange, J . , 39, 91, 106, 287 Laguerre, E., 525 Lame, G . , 268 Laplace, P. , 264, 265, 273, 304, 310, 493, 524, 572, 612, 634, 635, 638 Lebesgue, H., 23, 1 1 1-1 14, 1 17, 120, 1 2 1 , 125, 129, 136, 145, 146, 149, 395, 532, 570, 638 Legendre, A., 414, 446, 504, 516, 623 Leibniz, G., 134, 237, 278, 408, 453, 573 Lie, S . , 72, 336, 348 Liouville, J., 523 Lorentz, H., 595 Luzin, N., 531 Lyapunov, A., 520 M
Maxwell, J . , 237, 262, 263, 275, 281, 286, 298, 311 Mayer, J., 224 Men'shov, D., 531
Milnor, J . , 335 Minkowski, H., 121, 126, 194, 494 Mobius, A., 168, 327, 330 Morse, A., 149 Morse, M., 633, 637, 641 N
Newton, I., 38, 237, 278, 288, 306, 307, 408, 573 0
Ostrogradskii, M., 237, 242, 249, 278, 286, 489, 496 p
Parseval, M., 520, 529, 547, 569, 592 Picard, E . , 34, 38 Plancherel, M., 590 Poincare, H., 250, 296, 353, 356, 602, 610 Poisson, S., 224, 298, 304, 348, 433, 442, 459, 472, 493, 567, 580, 588, 593, 610 Pythagoras (IIv8o:'"'(6po:c;) , 193, 505 R
Raabe, J . , 446 Rademacher, H., 525 Riemann, B., 23, 107-109, 310, 395, 447, 530, 532, 638 s
Sard, A., 149, 236 Savart, F., 235 Schmidt, E., 503 Schwartz, L., 464, 591 , 592 Schwarz, H., 193 Shannon, C., 585 Sobolev, S . , 464 Steklov, V., 520, 556 Stirling, J., 448, 622, 636 Stokes, G., 237, 249, 345-347, 358 Stone, M., 400 Sturm, Ch. , 523 T
Tauber, A., 395
Name Index Taylor, B., 94, 415, 517 Thomson, W. (Lord Kelvin) , 224, 237, 250 w
Wallis, J . , 434, 448, 636
681
Watson, J., 631 , 635 Weierstrass, K., 374, 393, 400, 425, 431 , 472, 484, 494, 540 Whitney, H., 149, 169, 334 Whittaker, J . , 585 Wirtinger, W., 556
Universitext Aksoy, A.; Khamsi, M. A.: Methods in Fixed
Booss, B.; Bleecker, D. D. :
Point Theory
Analysis
Alevras, D.; Padberg M. W.:
Linear Opti
mization a nd Extensions
Andersson, M.: Topics in Complex Analysis Aoki, M.: State Space Modeling of Time Se ries
Audin, M.: Geometry
Borkar, V. S.: Prob ability Theory
Carleson, L.; Gamelin, T. W.: Complex Dy
namics
Cecil, T. E.: Lie Sphere Geometry : With Ap
plications of Sub ma nifolds
Chae, S. B.: Lebesgue Integration Chandrasekharan, K.:
Aupetit, B.: A Primer on Spectral Theory
Tra nsform
Duality
Ma nifolds
Bachem, A.; Kern, W.: Linear Programming
Topology a nd
Classical Fourier
Charlap, L. S.: Bieb erbach Groups and Flat
Bachmann, G.; Narici, L.; Beckenstein, E.:
Chern, S.: Complex Ma nifolds without Po
Fourier a nd Wavelet Analysis
tential Theory
Badescu, L.: Algebraic Surfaces Balakrishnan, R.; Ranganathan, K.: A Text b ook of Graph Theory
Balser, W.: Formal Power Series a nd Linear
Systems ofMeromorphic Ordinary Differen tial Equations
Bapat, R.B.: Linear Algebra a nd Linear Mod els
Benedetti, R.; Petronio, C.: Lectures on Hy perbolic Geometry
Benth, F. E.: Option Theory with Stochastic
Analysis
Chorin, A. f.; Marsden, f. E.: Mathematical
Introduction to Fluid Mechanics
Cohn, H.: A Classical Invitation to Algebraic Numb ers a nd Class Fields
Curtis, M. L.: Abstract Linear Algebra Curtis, M. L.: Matrix Groups
Cyganowski, S.; Kloeden, P.; Ombach, f.:
From Elementary Probability to Stochastic Differential Equations with MAPLE
Dalen, D. van: Logic a nd Structure Das, A.: The Special Theory of Relativity : A Mathematical Exposition
Debarre, 0.: Higher-Dimensional Algebraic
Berberian, S. K.: Fundamentals of Real Anal
ysis
Deitmar, A.: A First Course in Harmonic
Berger, M.: Geometry I, a nd II
Analysis
Blowey, f. F.; Coleman, f. P.; Craig, A. W. (Eds.): Theory a nd Numerics of Differential
phes
Bliedtner, f.; Hansen, W.: Potential Theory
Equations
Borger, E.; Gradel, E.; Gurevich, Y.: The Clas
sical Decision Problem
Bottcher, A; Silbermann, B.: Introduction to Large Truncated Toeplitz Matrices
Boltyanski, V.; Martini, H.; So/tan, P. S.: Ex cursions into Combinatorial Geometry
Boltyanskii, V. G.; Efremovich, V. A.: Intu itive Combinatorial Topology
Geometry
Demazure, M.: Bifurcations a nd Catastro Devlin, K. f.: Fundamentals of Contempo
rary Set Theory
DiBenedetto,
E.:
Degenerate
Parab olic
Equations
Diener, F.; Diener, M. (Eds.): Nonstandard Analysis in Practice
Dimca, A.: Singularities a nd Topology of Hy persurfaces
DoCarmo, M. P.: Differential Forms a nd Ap
plications
Du istermaat, f. f.; Kolk, f. A. C.: Lie Groups
Edwards, R. E.: A Formal Background to
Hlawka, E.; Schoiflengeier, J.; Taschner, R.:
Higher Mathematics Ia, and Ib
Geometric and Analytic Numb er Theory
Edwards, R. E.: A Formal Background to
Holmgren, R. A.: A First Course in Discrete
Higher Mathematics Ila, and lib
Dynamical Systems
Emery, M.: Stochastic Calculus in Manifolds
Howe, R., Tan, E. Ch.: Non-Abelian Har
Endler, 0.: Valuation Theory
Erez, B. : Galois Modules in Arithmetic
Everest, G.; Ward, T.: Heights of Polynomials and Entropy in Algeb raic Dynamics
Farenick, D. R.: Algebras of Linear Transfor mations
Foulds, L. R.: Graph Theory Applications Frauenthal, J. C.: Mathematical Modeling in Epidemiology
Friedman, R.: Algeb raic Surfaces and Holo morphic Vector Bundles
Fuks, D. B.; Rokhlin, V. A.: Beginner's Course in Topology
Fuhrmann, P. A.: A Polynomial Approach to Linear Algebra
Gallot, S.; Hulin, D.; Lafontaine, J.: Rieman nian Geometry
Gardiner, C. F.: A First Course in Group The
ory
Gtlrding, L.; Tambour, T.: Algebra for Com
puter Science
Godbillon, C.: Dynamical Systems on Sur faces
Godement, R.: Analysis I Goldblatt, R.: Orthogonality and Spacetime Geometry
monic Analysis
Howes, N. R.: Modern Analysis a nd Topol ogy
Hsieh, P. -F.; Sibuya, Y. (Eds.): Basic Theory of Ordinary Differential Equations
Humi, M., Miller, W.: Second Course in Or
dinary Differential Equations for Scientists and Engineers
Hurwitz, A.; Kritikos, N.: Lectures on Num b er Theory
Iversen, B.: Cohomology of Sheaves
Jacod, J.; Protter, P.: Probab ility Essentials
Jennings, G. A.: Modern Geometry with Ap plications
Jones, A.; Morris, S. A.; Pearson, K. R.: Ab stract Algebra and Famous Inpossibilities
Jost, J.: Compact Riemann Surfaces Jost, J.: Postmodern Analysis Jost, J.: Riemannian Geometry and Geomet ric Analysis
Kac, V.; Cheung, P.: Quantum Calculus
Kannan, R.; Krueger, C. K.: Advanced Anal
ysis on the Real Line
Kelly, P.; Matthews, G.: The Non-Euclidean Hyperbolic Plane
Kempf, G.: Complex Ab elian Varieties a nd Theta Functions
Gouvea, F. Q.: p-Adic Numb ers
Gustafson, K. E.; Rao, D. K. M.: Numerical
Kitchens, B. P.: Symbolic Dynamics
Range. The Field of Values of Linear Opera
From Elementary Probab ility to Stochastic
tors and Matrices
Kloeden, P.; Ombach, J.; Cyganowski, S.: Differential Equations with MAPLE
Gustafson, S. J.; Sigal, I. M.: Mathematical
Concepts of Quantum Mechanics
merical Solution of SDE Through Computer
Hahn, A. J.: Quadratic Algebras,Clifford Al
Experiments
geb ras,and Arithmetic Witt Groups
Kloeden, P. E.; Platen; E.; Schurz, H.: Nu
Kostrikin, A. I.: Introduction to Algebra
Hajek, P.; Havranek, T.: Mechanizing Hy pothesis Formation
Krasnoselskii, M. A.; Pokrovskii, A. V.: Sys
Heinonen, J.: Lectures on Analysis on Metric
Luecking, D. H., Rubel, L. A.: Complex Anal
Spaces
ysis. A Functional Analysis Approach
tems with Hysteresis
Ma, Zhi-Ming; Roeckner, M.: Introduction to the Theory of (non-symmetric) Dirichlet Forms
Mac Lane, S.; Moerdijk, I.: Sheaves in Geom
etry and Logic
Marcus, D. A.: Number Fields
Rey, W. ]. ].:
Introduction to Robust and
Quasi-Robust Statistical Methods
Ribenboim, P.: Classical Theory of Algebraic Numb ers
R ickart, C. E.: Natural Function Algebras
Rotman, ]. ].: Galois Theory
Martinez, A.: An Introduction to Semiclas
Rubel, L. A.: Entire and Meromorphic Func
sical and Microlocal Analysis
tions
Matousek, ].: Using the Borsuk-Ula m Theo rem
Matsuki, K.: Introduction to the Mori Pro gram
Me Carthy, P. ].: Introduction to Arithmeti cal Functions
Meyer, R. M.: Essential Mathematics for Ap plied Field
Meyer-Nieberg, P.: Banach Lattices
Mikosch, T.: Non-Life Insurance Mathemat
ics
Mines, R.; Richman, F.; Ruitenburg, W.: A
Course in Constructive Algebra
Moise, E. E.: Introductory Problem Courses in Analysis and Topology
Montesinos-Amilibia, ]. M.: Classical Tessel lations and Three Manifolds
Morris, P.: Introduction to Game Theory
Nikulin, V. V.; Shafarevich, I. R.: Geometries
and Groups
Oden, ]. ].; Reddy, ]. N.: Variational Methods in Theoretical Mechanics
0ksendal, B.: Stochastic Differential Equa tions
Poizat, B.: A Course in Model Theory Polster, B.: A Geometrical Picture Book
Rybakowski, K. P.: The Homotopy Index and Partial Differential Equations
Sagan, H.: Space-Filling Curves Samelson, H. : Notes on Lie Algebras Schiff, ]. L.: Normal Families
Sengupta, ]. K.:
Optimal Decisions under
Uncertainty
Seroul, R.: Progra mming for Mathemati cians
Seydel, R.: Tools for Computational Finance
Shafarevich, I. R.: Discourses on Algebra
Shapiro, ]. H.: Composition Operators and Classical Function Theory
Simonnet, M.: Measures and Probabilities
Smith, K. E.; Kahanpitii, L.; Kekiiliiinen, P.; Traves, W.: An Invitation to Algebraic Ge
ometry
Sm ith, K. T.: Power Series from a Computa
tional Point of View
Smorynski, C.: Logical Numb er Theory I. An Introduction
Stichtenoth, H.: Algebraic Function Fields and Codes
Stillwell, ].: Geometry of Surfaces
Stroock, D. W.: An Introduction to the The
ory of Large Deviations
Absolutes of Hausdorff Spaces
Sunder, V. S.: An Invitation to von Neumann
angularization
ogy
Porter, ]. R.; Woods, R. G.: Extensions and Radjavi, H.; Rosenthal, P.: Simultaneous Tri
Ramsay, A.; Richtmeyer, R. D.: Introduction to Hyperbolic Geometry
Rees, E. G.: Notes on Geometry Reise/, R. B.: Elementary Theory of Metric Spaces
Algebras
Tamme, G.: Introduction to Etale Cohomol Tondeur, P.: Foliations on Riemannian Man
ifolds
Verhulst, F.: Nonlinear Differential Equa
tions and Dy na mical Systems
Wong, M. W.: Wey l Transforms
Xamb6-Descamps,
S.:
Block
Error-Cor
recting Codes
Zaanen, A. C.: Continuity, Integration and Fourier Theory
Zhang, F.: Matrix Theory
Zong, C.: Sphere Packings Zong, C.: Strange Phenomena in Convex and Discrete Geometry
Zorich, V. A.: Mathematical Analysis I
Zorich, V. A.: Mathematical Analysis II
Vladimir A. Zorich is professor of mathematics at Moscow State University. His areas of specialization are analysis, conformal geometry, quasiconformal mappings, and, most recently, mathe matical aspects of thermodynamics. He solved the problem of global homeomorphism for space quasiconformal mappings, for whicH he was awarded the National Prize for Young Mathe maticians. He also holds a patent in the technology of mechanical engineering. Vladimir A. Zorich was one of the organizers of advanced experi mental courses at the Department of Mechanics and Mathematics of Moscow State University. He has designed and taught modern analysis courses in several universities and is the author of numerous research mathematical publications.
Mathematica l Ana lysis I I This two-volume work presents a thorough first course in analysis, leading from real numbers to such advanced topics as differential forms on manifolds, asymptotic methods, integral transforms, and distributions. Especially notable in the course is the clearly expressed orientation toward the natural sciences and its informal exploration of the essence and the roots of the basic concepts and theorems of calculus. Clarity of exposition is matched by a wealth of in structive exercises, problems and fresh applications to areas seldom touched on in real analysis books. This second volume expounds classical analysis as it is today, as a part of uni fied mathematics, and its interactions with other modern mathematical courses such as algebra, differential geometry, differential equations, complex and func tional analysis. The book provides a firm foundation for advanced work in any of these directions. " . . . The textbook of Zorich seems to me the most successful of the available comprehensive textbooks of analysis for mathematicians and physicists. It differs from the traditional expo sition in two major ways: on the one hand in its closer relation to natural-science applications (primarily to physics and mechanics) and on the other hand in a greater-than-usual use of the ideas and methods of modern mathematics, that is, algebra, geometry, and topology. The course is unusually rich in ideas and shows clearly the power of the ideas and methods of modern mathematics in the study of particular problems. Especially unusual is the second volume, which includes vector analysis, the theory of differential forms on manifolds, an introduction to the theory of generalized functions and potential theory, Fourier series and the Fourier transform, and the elements of the theory of asymptotic expansions. At present such a way of structuring the course must be considered innovative. It was normal in the time of Goursat, but the tendency toward specialized courses, noticeable over the past half century, has emasculated the course of analysis, almost reducing it to mere logical justifications. The need to return to more substantive courses of analysis now seems obvious, especially in connection with the applied character of the future activity of the majority of students. . . . In my opinion, this course is the best of the existing modern courses of analysis." From a review by Vladimir 1. Arnold
ISBN 3-540-40633-6
9
I I II I 783540 406334
) springeronline.com