BASIC LINEAR PARTIAL DIFFERENTIAL EQUATIONS
This is a volume in PURE AND APPLIED MATHEMATICS
A Series of Monographs and Textbooks Editors: SAMUEL EILENBERG AND HYMAN BASS A list of recent titles in this series appears at the end of this volume:
1’1 I’
BASIC LINEAR PARTIAL DIFFERENTIAL EQUATIONS FranGois Treves DEPARTMENT OF MATHEMATICS RUTGERS UNIVERSITY N E W B R U N S W I C K , N E W JERSEY
1975
ACADEMIC PRESS, INC. (Hartour! Brace lovanovlch, Publishers)
Orlando San Diego San Francisco New York London Toronto Montreal Sydney Tokyo Sao Paul0
COPYRIGHT 0 1975, BY ACADEMIC PRESS, INC. ALL RIGHTS RESERVED. NO PART OF THIS PUBLICATION MAY BE REPRODUCED OR TRANSMITTED IN ANY FORM OR BY ANY MEANS, ELECTRONIC OR MECHANICAL, INCLUDING PHOTOCOPY, RECORDING, OR ANY INFORMATION STORAGE AND RETRIEVAL SYSTEM, WITHOUT PERMISSION IN WRITING FROM THE PUBLISHER.
ACADEMIC P R E S S , Orlando, Florida 32887
INC.
United Kingdom Edition published by ACADEMIC PRESS, I N C . (LONDON) LTD.
24/28 Oval Road, London NWl
7DX
Library of Congress Cataloging in Publication Data Treves, Fransois, (date) Basic linear partial differential equations. (Pure and applied mathematics v. 62) Bibliography: p. Includes indexes. 1. Differential equations, Partial. 2. Differen1. Title. 11. Series: tial equations, Linear. Pure and applied mathematics; a series of monographs and textbooks; v. 62 QA3.P8 [QA3741 5 15’.353 74-10206 ISBN 0-12-699440-4
AMS (MOS) 1970 Subject Classifications: 35-02, 35505, 35K05,35L05 PRINTED IN THE UNITED STATES OF AMERICA
838485
9 8 7 6 5 4
Contents
ix ...
Preface Notation
xi11
CHAPTER I The Basic Examples of Linear PDEs and Their Fundamental Solutions 1. The Basic Examples of Linear PDEs 2 . Existence and Smoothness of Solutions Not Submitted to Side Conditions 3. Analyticity of Solutions 4. Fundamental Solutions of Ordinary Differential Equations 5. Fundamental Solutions of the Cauchy-Riemann Operator 6. Fundamental Solutions of the Heat and of the Schrodinger Equations 7. Fundamental Solutions of the Wave Equation 8. More on the Supports and Singular Supports of the Fundamental Solutions of the Wave Equation Appendix. Explicit Formulas for E , in Space Dimensions Two and Three 9. Fundamental Solutions of the Laplace Equation Appendix. Computation of the Area of the Unit Sphere 10. Green’s Formula. The Mean Value Theorem and the Maximum Principle for Harmonic Functions. The Poisson Formula. Harnack’s Inequalities V
3 14 22 26 34 41
47 59
62 68 74
77
vi
CONTENTS
CHAPTER I1 T h e Cauchy Problem 11. The Cauchy Problem for Linear Ordinary Differential Equations 12. The Cauchy Problem for Linear Partial Differential Equations. Preliminary Observations 13. The Global Cauchy Problem for the Wave Equation. Existence and Uniqueness of the Solutions 14. Domain of Influence, Propagation of Singularities, Conservation of Energy 15. Hyperbolic First-Order Systems with Constant Coefficients 16. Strongly Hyperbolic First-Order Systems in One Space Dimension 17. The Cauchy-Kovalevska Theorem. The Classical and Abstract Versions 18. Reduction of Higher Order Systems to First-Order Systems 19. Characteristics. Invariant Form of the Cauchy-Kovalevska Theorem Appendix. Bicharacteristics and the Integration of the Characteristic Equation 20. The Abstract Version of the Holmgren Theorem 21. The Holmgren Theorem
89 96 102 111 119 132 142 156 161 167 174 181
CHAPTER 111 Boundary Value Problems 22. The Dirichlet Problem. The Variational Form 23. Solution of the Weak Problem. Coercive Forms. Uniform Ellipticity 24. A More Systematic Study of the Sobolev Spaces Appendix. The Sobolev Inequalities 25. Further Properties of the Spaces H" 26. Traces in H"(Q) Appendix. Extension to R" of Elements of H". p(Q) 27. Back to the Dirichlet Problem. Regularity up to the Boundary 28. A Weak Maximum Principle 29. Application : Solution of the Classical Dirichlet Problem 30. Theory of the Laplace Equation : Superharmonic Functions and Potentials 31. Laplace Equation and the Brownian Motion
189 20 1 210 217 224 237 245 249 259 268 278 294
CONTENTS
32. Dirichlet Problems in the Plane. Conformal Mappings 33. Approximation of Harmonic Functions by Harmonic Polynomials in Three Space. Spherical Harmonics 34. Spectral Properties and Eigenfunction Expansions 35. Approximate Solutions to the Dirichlet Problem. The Finite Difference Method 36. Girding’s Inequality. Dirichlet Problem for Higher Order Elliptic Equations 37. Neumann Problem and Other Boundary Value Problems (Variational Form) 38. Indications on the General Lopatinski Conditions
vi i
306 314 322 332 347 354 367
CHAPTER IV Mixed Problems and Evolution Equations 38 I 39 1
39. Functions and Distributions Valued in Banach Spaces 40. Mixed Problems. Weak Form 41. Energy Inequalities. Proof of Theorem 40.1 : Existence and Uniqueness of the Weak Solution to the Parabolic Mixed Problem 42. Regularity of the Weak Solution with Respect to the Time Variable 43. The Laplace Transform 44. Application of the Laplace Transform to the Solution of Parabolic Mixed Problems 45. Rudiments of Continuous Semigroup Theory 46. Application of Eigenfunction Expansion to Parabolic and to Hyperbolic Mixed Problems 47. An Abstract Existence and Uniqueness Theorem for a Class of Hyperbolic Mixed Problems. Energy Inequalities
458
Bibliography
465
Index
467
40 I 408 416 424 436 449
This Page Intentionally Left Blank
Preface The discrepancy between what is taught in a standard course on partial differential equations and what is needed to understand recent developments in the theory is now very wide. It is a fact that only a relatively small number of specialists, in a few universities, are able, these days, to teach a course that is truly introductory to those developments. Perhaps this is not much different from what has been happening in all active areas of mathematics. But it is also true, speaking of the best graduate students, as well as of professional mathematicians, that when they are said to be conversant in all aspects of mathematics, this often excludes substantial portions of analysis and most of partial differential equations. The complementary facet of such a state of affairs is that many up-to-date expositions fail, frequently because of lack of time, to show the link with the older results, and give the erroneous impression that the modern theories have no roots and are cut off from a rich past. The truth, of course, is that progress comes not only from pushing further and further into new territory but also from frequent returns to the familiar grounds, from seeking an ever-deeper understanding of their nature, and finding there new inspiration and guidance. The archetypes of linear partial differential equations (Laplace’s, the wave and the heat equations) and the traditional problems (Dirichlet’s and Cauchy’s) are the main topic of this book. Most of the basic classical results can be found here. But the methods by which these are arrived at are definitely not traditional ; the methods are, in practically every instance, applications of those now in favor at a higher level of abstraction. The aim of this approach is twofold: it is, on one hand, that of recalling the classical material to the modern analyst, in a language he can understand; on the other hand, that of exploiting the same material, with the wealth of examples it provides, as an introduction to the modern theories. Developments toward greater generality have not been avoided when it was felt that they represented the natural “next step” and afforded a meaningful opening to the more advanced stages of the theory-provided, also, that they did not require more machinery than had been made available up ix
X
PREFACE
to that point. Thus the reader will find a discussion of the Cauchy problem for first-order systems of hyperbolic equations with constant coefficients in Section 15, following the study of the same problem for the wave equation. Similarly, GArding’s inequality for strongly elliptic equations of any (even) order is established in Section 36, and, in a somewhat more philosophical vein, the meaning of the Lopatinski boundary conditions is explained in Section 38. The approach to the classical Dirichlet problem calls for some comment. Because 1 felt committed to describe the classical results, it was out of the question to limit the discussion to the weak solution, or variational, methodeven strengthened by the proof of regularity up to the boundary, when the latter is sufficiently smooth. After all, one might want to have the solution to the Dirichlet problem in a cube when the boundary value is continuous. Thus I was resigned to the dichotomy between the variational methods within the framework of the Sobolev spaces, and the Perron-Brelot method, tied to potential theory, until Guido Stampacchia indicated to me how to make the transition from the former to the latter, by way of his weak maximum principle (Section 28). I have followed his advice and adapted the argument of his article [2] (where, needless to say, more general second-order elliptic equations than Laplace’s are studied). From there on, the classical potential theory can easily take off, as is succinctly indicated in Sections 29 and 30. Like potential theory, many other important topics are very lightly touched upon : for example, the Dirac equations, random walks, the finite difference method, and continuous semigroups of operators. Here the book is truly introductory; its sole ambition is to give an idea of what these topics are all about and a taste for learning more. Thus it is not a treatise. Nor is it a classroom text, due to its size and the quantity of its contents, although it is true that it began as a set of lecture notes used at the University of Miami and at Rutgers University. Some readers might find that the writing shows too little regard for concision-for which 1 apologize. I have made a point, rather, of explicitly formulating some of the many thoughts that usually go unformulated while writing, especially while writing mathematics; and I hope not to have totally failed in this. Today, distributions are the language of linear PDE theory, and I am certainly not of the school that would like to do without them. But knowing that not all students are seriously exposed to distributions, I have limited their use to their more mechanical aspects-convergence of sequences, differentiation, convolution ; sometimes, but not often, the local representation of a distribution as a finite sum of derivatives of continuous functions is used to advantage. Fourier transformation of distributions, however, is used systematically ; the student genuinely interested in PDE must make an
PREFACE
xi
effort to learn it. Not that much effort is needed, for it is such a smooth and simple theory: Excellent expositions are found in Chapter VII of L. Schwartz' book [TD], or in Chapter I of Hormander's book [LPDO] (and in many other texts). In particular, the reader should be familiar with the Plancherel theorem and with the Paley-Wiener(-Schwartz) theorem. As far as linear functional analysis is concerned, the basic facts about Hilbert and Banach spaces must be known, but nothing much deeper-although, from the middle of the book on, an ever greater use is made of functions (and, later, distributions) with values in Banach spaces. Finally, it is presumed that the student has a fairly good knowledge of holomorphic functions of one complex variable, of real variable theory, mainly Lebesgue integration, and a smattering of measure theory. A bit of linear algebra will be of help, here and there. There are 390 exercises, and several contain detailed information which should enable the reader to reconstruct the proofs of some important results: for example, the hypoellipticity of elliptic equations-of any order-with C" coefficients, in Exercises 36.4 and 36.7, or the theorem of supports-in one variable-in Exercises 43.4, 43.5, and 43.6. Other exercises are simple variants or straightforward applications of the results and the methods in the text.
This Page Intentionally Left Blank
Notation R" product of n copies of the real line R x', . . . , X coordinates in R"; also y ' , . . . , y", etc. x = (XI, . . ., x") the variable in R";also y. etc.
R, dual of R" 1;,, .. . , 5, coordinates in R,; also ql, .. . , q , , etc. 5 = (tl,. . . , 5,) the variable in R,; also q, etc. x . 5 = x't, + ... + x"5, the scalar product between a vector in R" and a covector in R,; also (x, 5)
Z" product of n copies of the set Z of integers (of all signs) Z'!+ product of n copies of the set Z, of nonnegative integers C" product of n copies of the complex plane C z', . . . , z" complex coordinates in c" z = (z', . . . , z") the variable in C" Re z = (Re z ' , . . ., Re z") the real part of the complex vector z Im z = (Im zl, ..., Im z") the imaginary part of z C, dual ofC" Cl, . . . , C, coordinates in C, c = . . . , in) the variable in C, z * C = z'Cl + . . . + z"Cn the (real) scalar product between z E C" and C E C,; also (z, C> 1x1 = {(x')' + ... + (x")'}"' the Euclidean norm on R" I 5 1, z 1, 15 I the Euclidean norms on R, , C",C, , respectively Ia J = at+ - - + a, the length of the n-tuple a E Z: a!=a,!-.. a,,!,(E) = (E:) . . . (El), (); = aj!/pj!(aj- pi)! where a,p E Z; and aj 2 pj for every j = 1 , . . ., n d(x, A ) Euclidean distance from the point x to the set A B,(x) open ball centered at x , having radius r sup supremum, or least upper bound, of a set of real numbers inf injimum, or greatest lower bound, of a set of real numbers ch A conuex hull of a set A (contained in a linear space) R an open subset of R" [a, b] a closed interval, with limit points a and b in the real line R' (also when either a = - co and/or b = + cm)
(cl,
I
xiii
xiv
NOTATION
[a, b[ semiclosed interval a < t < b ; ] a ,b ] = { t E R' ; u < t Ib}
au axj
partial derivative of u with respect to x j ; also u,, , a,,u; also u' if there is only one variable
- a
DJ. = -J-l-;also
D,,
axj
. .. ( a / a x " y ,
(a/ax)ol = ( a / a x l p Da = DY . .. 0:"
(a
r+)
... dx" the Lebesgue measure on R"; d( = d t , - * . d t , the analog in R, (both measures dx and dt assign the volume l to the unit cube) dz' the line measure in the complex plane (oriented counterclockwise) dz = dz' * dz" the product of n line measures in C"
dx = dx'
-
( f * g)(x) =
a(t) =
I
I"
I
R"
f ( x - y)g(y) dy the value at x of the convolution o f f and g
e - i X ' 5 u ( xdx ) Fourier transform of u ( i =
fi)
Fu also Fourier transform of u , F ' u ( x ) = (2n)-"/ e'"'%(t) d t
inverse Fourier transform of u
R,
P(x, d/ax) =
1.1 s m
c , ( x ) ( a / d x ~ a linear partial differential operator of order
m in R (generally with C" coefficients c,)
'P(x, d/dx) =
1
1. s m
P(x, ax)*
=
( - d/Jx)ol c,(x)
-
1 ( -a/dx)' ca(x) lalsm
the (formal) adjoint of P(x, a/dx)
($)+ .. . + (k)the Laplace operator (or Laplacian) in n vari2
A=
the (formal) transpose of P ( x , a/ax)
ables
xv
NOTATION
0 = ($)2
- A the wave operator (or d’dlembertian) in n space variables
(r always denotes the time variable) E, F, . . . Hilbert spaces or Banach spaces over the complex numbers (sometimes over the real numbers) x, e, . . . elements of the Banach space E llelle norm, in the Banach space E, of the element e L(E; F) space of bounded linear operators of E into F, equipped with the operator norm
IIAll
= SUP II~ellFillelle OfesE
E‘, E* topological dual of the Banach space E, the space of continuous linear functionals on E [equipped with the dual norm, which is the operator norm when one recalls that E’ = L(E; C)] (e*, e) or (e, e*) the duality bracket between e E E and e* E E* E’ antidual of the Banach space E,i.e., the space of continuous antilinear functionals on E (a mapping u : E + F is antilinear if u ( l x + py) = w 4+ P4Yh vx, Y E E, VA, p E C) the bracket of the antiduality between E and E’ (x*, x)-
Main Spaces of Functions and Distributions space of complex-valued functions, defined and continuous in the open set R, equipped with the topology of uniform convergence on the compact subsets of R space of complex continuous functions on the closure K (supposed to be compact), equipped with the maximum norm space of complex functions, defined and m times continuously differentiable in R, equipped with the topology of uniform convergence on every compact subset of R, of the functions and of each one of their derivatives of order
+
-=
+
NOTATION
subspace of Cm(R)consisting of the functions having compact support; elements of C:(R) are often referred to as test functions in R subspace of Cr(R") whose support is contained in the compact set K , equipped with the topology induced by Cm(R") Lebesgue space of measurable functionsjsuch that the pth power of the absolute value is integrable over R ( 1 < p < +a), equipped with the norm I(f = f ( x ) l dx)l/" (actually f represents an equivalence class of functions equal almost everywhere) Lebesgue space of (classes of) measurable functionsf in R which are essentially bounded, equipped with the norm ]lfllLm(n), the essential supremum off space of locally-LP functions f in R [i.e., if K is any compact subset of R, the function f K equal to f on K and to zero in R\K belongs to Lp(R)] space of distributions in R space of distributions with compact support in R space of functions u in R such that Dau E Lp(R) for all n-tuples c1 E 2, , I a1 5 m (D" denotes the distribution derivative); H"*P(R)is the Soboleu space, equipped with the norm
If
closure in H", "(a)of C:(R) space of distributions u in R which can be written as finite sums of derivatives of order s m of functions belonging to Lp(R) (m E Z + ) Y(R")or Y space of C" functions u in R" such that, for any pair of nonnegative integers k, M ,
{ + 1x1
pk,M(u)= sup (1 xoR"
2)k
la1 5 M
}<+
I D"u(x)l
equipped with the topology defined by the seminorms pk,M (9 is the space of C" functions in R" rapidly decaying at infinity) Y'(R")or 9' the dual of 9, also the space of tempered distributions in R" H"(R")or H s the Sobolev space of order s E R in R", i.e., the space of tempered distributions u in R" whose Fourier transform 0 is a measurable function such that
equipped with the Hilbert space structure defined by the norm
I1
IIS
NOTATION
xvii
subspace of H' consisting of the elements having their support contained in the compact set K Hf(W space of distributions in R which belong to the space H " ( K )for some choice of the compact subset K of R Hl",,(R) space of distributions u in R such that c(u E H" for every CI E Cp(i2) &'(a) space of Radon measures in R[A'(R) is the dual of C;(O)] 9'+(R') or 9'+ space of distributions on the real line which vanish identically in the open negative half-line HYK)
Note: When R etc.
=
R",(R")will often be omitted, e.g., as in C", 9', H", Lp,
This Page Intentionally Left Blank
BASIC LINEAR PARTIAL DIFFERENTIAL EQUATIONS
This Page Intentionally Left Blank
CHAPTER I
The Basic Examples of Linear PDEs and Their Fundamental Solutions
This Page Intentionally Left Blank
The Basic Examples of Linear PDEs The theory of linear PDEs stems from the intensive study of a few special equations, whose importance was recognized in the eighteenth and nineteenth centuries. These were the basic equations in mathematical physics (gravitation, electromagnetism, sound propagation, heat transfer, and quantum mechanics). After their introduction in applied mathematics, they were shown to play important roles in pure mathematics: For instance, the Laplace equation was first studied as the basic equation in the theory of Newton’s potential and in electrostatics; later, suitably reinterpreted, it was used to study the geometry and topology of Riemannian manifolds. Similarly, the heat equation was studied by Fourier in the context of heat transfer. Later it was shown to be related to probability theory. One of the basic examples, which we describe below, does not seem to have originated in applications to physics: the CauchyRiemann operator, which is used to define analytic functions of a complex variable. But to my knowledge, all the remaining ones have their origin in applied mathematics. At any rate, the general theory of linear PDEs is an elaboration of the respective theories of these special operators. During the twentieth century it was recognized that many properties which had seemed to be the prerogative of the Laplace equation or of the wave equation could in fact be extended t o wide classes of equations. These properties usually center around a question or a problem that only makes sense for one or the other equation : for instance, around the Dirichlet problem, which makes sense for the Laplace equation but not really for the wave equation, or the Cauchy problem, which is well posed for the latter but not for the former. The purpose of this introductory course is to help the student to understand some of these problems and some of their solutions-but always by staying very close to the special equation for which they were originally considered. It is therefore necessary that we have the nature of the basic examples clearly in mind. 3
4
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
1.1 The Laplace Equation in n > 1 Variables Let us denote by x = (x', . . . , x") the variables in the Euclidean space R". Usually the Laplace operator is
A = ( -a )+'
... + ( -a ). 2
ax.
ax1
Some people call the Laplace operator that which in our notation would be - A . They have very good reasons to do this; it is a pity that historical custom is not on their side, but they are gaining ground. Indeed, - A is a positive operator; its Fourier transform is the square of the norm of the variable in R, , I 5 I '. The latter remark underlines the close relationship between the Laplace operator and the Euclidean norm, the spheres in the Euclidean space, the orthogonal transformations, and so on. Indeed, A is invariant under orthogonal transformations; that is, if T is any such transformation in R" and f any infinitely differentiable function of x , then (1.1)
( A f ) ( T x )= A{f(Tx)},
xE
R".
This, of course, is a crucial symmetry property of the Laplace operator and is part of the reason for its role in the description of many phenomena in isotropic media. As a matter of fact, any linear transformation T of R" such that (1.1) holds for all C" functionsfmust be orthogonal :The orthogonal transformations are exactly those which leave A invariant (i.e., which commute with A). The functions that satisfy the homogeneous Laplace equation Ah = 0
(1.2) are called harmonic junctions.
1.2 The Wave Equation For reasons which will become clear when we begin using the Fourier transformation, it is convenient to replace the partial differentiations d/axi by purely imaginary variables J-1t j ( j = 1, . . . , n). Thus the operator - A becomes (1.3)
1512
=
r: + ... +
which is a positive-definite quadratic form. Its signature is (n, 0 ) : It has n positive eigenvalues and no nonpositive ones. We may also look at quadratic forms with different signatures. An important case is the form with all
Sect. 11
THE BASIC EXAMPLES OF LINEAR PDES
5
eigenvalues strictly positive except one which is strictly negative. For various reasons it is convenient to consider such a form on an ( n + 1)-dimensional space R,+lwhere the variables are denoted by (tl.. . ., c, , T). It is essentially the form
l(J' - T 2 = r; + . - * + (; - T 2
(1.4)
corresponding to the partial differential operator in R"" (where the variables are denoted by xL,. . . , x", t ) :
This is the wave operator (sometimes called the d'Alembertian): The xi's are called the space variables, and f is the time variable. It is the operator used t o describe oscillatory phenomena and wave propagation. If we are interested in those linear transformations of Rn+'which commute with 0, we will have no trouble in determining what they are. Of course, they are the same as the linear transformations in (the dual space) R,+l which leave the quadratic form (1.4) invariant. They form a group much used in physics since the advent of relativity: the Lorentz group. The solutions of the wave equation
og=o
(1.5)
have properties that are radically different from those of the Laplace equation, as will become clear when we take a closer look at them.
1.3 The Heat Equation The examples given in $1.1 and $1.2 are both homogeneous second-order differential operators, that is, differential operators which involve secondorder partial differentiations and none of order f 2 . The heat operafor in RII+1 9
a
- - Ax,
at
is not of this type. It is used to describe various transfer phenomena, like the transfer of heat in isotropic media. At first glance the heat and the wave equations look alike, and indeed they have some properties in common. But there are also very deep differences. No wave propagation phenomena are associated with the solutions of the heat equation; phenomena of the diffusion type are. As a matter of fact, there is some similarity with the Laplace equation. It should not come as a surprise: The Zeading terms in the heat equation,
6
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
that is, the second-order partial derivatives, are the same as in the Laplace equation in space variables. We have just seen what are probably the most important examples of linear partial differential equations. The Laplace equation is the archetype of a large class of equations, called the elliptic PDEs. The reason for this is obvious : If we look at the quadratic form a,(:
+ t: ,
a, > 0, a2 > 0,
it is equal, up to a change of scale, to the symbol (1.3) of the Laplace operator in two variables. It is also the function in R2 whose level curves are ellipses. Similarly, the wave equation is the archetype of the hyperbolic PDEs: The level curves of the function t 2- z2 in R2 are the standard hyperbola. The heat equation is the archetype of the parabolic PDEs: Its symbol can be defined as being the function t2 - z in R, whose level curves are the standard parabola. As a matter of fact, again in view of our use of the Fourier transformation, we prefer to define its symbol as ] tI2 + iz, replacing a/& by iz rather than by -T. This has been the classical way of categorizing partial differential equations, when only those of first and second order were studied by mathematicians. It is quite inadequate to classify systems of PDEs, higher order equations, or equations with complex coefficients. It turns out that some of the essential properties of the Laplace equation follow from the fact that its symbol (1.3) only vanishes at the origin-and not from the fact that it is a positive-definite quadratic form. In other words, these properties subsist in other equations which partake of the former characteristic but not of the latter. This is the case of the equation we study next.
1.4 The Cauchy-Riemann Equation Let x, y denote the variables in the plane R2. The homogeneous CauchyRiemann equation reads
af -+ ax
-
J-1-=
af aY
0.
Here f = u + iv is a complex-valued differentiable function (u, v are real). Equation (1.7) is equivalent to the system
au
-=-
ax
+
av
ay
,
aU ax
-=--
aU
ay'
Let us set z = x iy, Z = x - iy, or, equivalently, x = f ( z + Z), y = (1/2i)(z - 2). Thus any function such as f ( x , y ) in a subset of R2can also be
Sect. 11
THE BASIC EXAMPLES OF LINEAR PDES
7
viewed as a function of (z, 2). Equation (1.7) can then be rewritten (by the chain rule of differentiation) as
af = 0. az
We have set
Roughly speaking, (1.9) tells us thatfis “independent of 2 ’ ’ ; more precisely, it states that,f(supposed to be sufficiently smooth) is an analytic function of z, i.e., has a complex derivative [at every point where (1.9) holds]. It is convenient to introduce also the “ anti-Cauchy-Riemann ” operator
Note that (1.10) the Laplace operator in two variables. The identity (1.10) points to strong relations between the Laplace and the Cauchy-Riemann equations. These will be confirmed when we study them. The symbol of 8/82 is (1.11)
i
(5 + irl)
(we have denoted by t, q the variables in the dual plane R2).Note that, like the symbol of the Laplace operator, it only vanishes at the origin. This property will have important consequences. Because of it, in the modern terminology, the Cauchy-Riemann operator is also said to be elliptic.
1.5 The Schrodinger Equation In the study of partial differential equations one is quickly taught to expect important and deep implications to follow from merely formal differences. This is confirmed by everything that follows and is well exemplified by the theories of the heat and of the Schrodinger equations. The Schrodinger operator with constant coeficients in n-space variables is (1.12)
-i _a _ Axi at
8
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
The only difference with the heat operator is the presence of the factor i-' in front of ajat. Yet the solutions of the two equations exhibit very different kinds of behavior, as will be seen later. The Schrodinger equation was originally introduced to describe the behavior of the electron and other elementary particles. It has the defect of not being Lorentz-invariant and therefore of not fitting in the relativistic formulation of quantum mechanics. It is still used as an approximation, but in a more rigorous setup, it has been replaced by Dirac's equations. So far we have only looked at examples of a single, or scafar, linear partial differential equation. But there are many important (for mathematics and for physics) examples of systems of equations. This means that we are given N , N , linear partial differential operators Pjk ( j = 1, . . . , N , , k = 1, . . . , N , ) and that we consider the N , equations in N , unknown functions uk Pjkuk=j-,,
(1.13)
j = I , ..., N , .
k= 1
The system (1.13) is said to be determined if N , = N , , that is, if there are exactly as many equations as there are unknowns; overdetermined if N , > N , , that is, if there are strictly more equations than unknowns; and underdetermined if there are strictly fewer equations than unknowns. The theory of systems is more difficult than the theory of single equations, especially the theory of overdetermined systems. At this stage we shall content ourselves with some examples. The Maxwell equations, on which classical electromagnetism is based, constitute an example of a determined system, as are the Dirac equations, alluded to above. Both are hyperbolic systems. Without getting into the technicalities of the definition, let us say that hyperbolic systems have formal and nonformal properties closely related to those of the wave equation. We next give some examples of systems of linear PDEs which are not determined.
1.6 The Gradient Let n > 1 denote the number of independent variables
xj.
The gradient.
(1.14) is an overdetermined system of differential operators : The number N , above is equal to n, N , = 1. The system of equations (1.13) reads here (1.15)
gau= f ; : ,
j=l,
..., n.
Sect. 13
THE BASIC EXAMPLES OF LINEAR PDES
9
The study of overdetermined systems is beset with difficulties which are absent in the study of single equations; they are algebraic in nature and show up even in cases as simple as ( I . 15). Indeed, if the equations (1.15) hold (assuming that the functions u, fi, . . . , ffl are sufficiently smooth), we derive from them
In other words, if we regard (f,, . . . ,ffl) as a vector-valued function f, we see that (1.15) implies
(1.16)
curl f = 0.
Equation (1.16) is a system of n(n - 1)/2 equations, which are called the compatibility conditions for the system (1.15). It is not difficult to see that if (1.16) holds, one can in fact solve (1.15)-at least locally. The gradient is an elliptic (nonscalar) differential operator. We cannot go into the exact meaning of this, but it points to a certain analogy with the Laplace operator (see below).
1.7 The Divergence The divergence operator acts on functions which are defined in a subset of
R" and are valued in C" (most of the time, we shall deal with complexvalued scalar functions). The system (1.13) defined by the divergence operator reads
(1.17) and N , = 1 , N , = n. If we set u = (u', . . . , u"), the left-hand side of (1.17) is usually denoted by div u. Reasoning very formally we see at once that (1.17) admits many solutions. F i x j arbitrarily and take ui to be any primitive offwith respect t o x j while taking uk r-0for k # j ; this gives us a solution. This kind of procedure works for all underdetermined systems : by setting N, - N , unknowns equal to zero, we reduce it to a determined system. This is a familiar device of elementary linear algebra; it shows that underdetermined systems are much easier to study than overdetermined ones (and in most cases, than determined ones also); they are also much less interesting. Let u = (u', . . , , u") denote a C" function defined in the open subset Q of
10
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
R" and valued in C", v a Cm function with compact support in valued. By integration by parts we see at once that (1.18)
J(div u)v dx
= -
[Chap. I
a, complex-
(u, grad v > dx
where (u, grad v ) = I uk av/d.x? is the standard scalar product between vectors (more precisely, between vectors and covectors). Formula (1.18) can be stated by saying that the operators -div and grad are the transpose of each other. Another well-known formula is (1.19)
div grad
= A,
which underlines the relationship between gradient, divergence, and Laplace operator. There are other examples of overdetermined systems of differential operators which are important in mathematics and its applications: We have come across one of them above, the curl. Another one is the complex gradient a:
a = (a/azl, .. . ,aiaz"). The system of homogeneous equations (1.20)
au = 0
is called the Cauchy-Riemann equations in several variables. The solutions of (1.20) (where u is supposed to be once continuously differentiable) are the holomorphic functions of the n complex variables zl, ... , 9,whose theory has been vigorously developed in recent years. In addition to the preceding examples we shall often refer to ordinary diferential equations (ODEs) as a particular case of PDEs; of course, this is simply the case where the number of independent variables is n = 1. The theory of linear ODEs is known in much greater detail than that of PDEs. Some of the properties of ODEs can be generalized to PDEs; other properties can be exploited to construct solutions of certain PDEs. This will be seen in the subsequent discussions.
Exercises 1.1. Using polar coordinates, write the expression of the Laplace operator in two variables and that of the Cauchy-Riemann operator. 1.2. Write the expression of the Laplace operator in three variables in spherical and in cylindrical coordinates.
Sect. 13
II
THE BASIC EXAMPLES OF LINEAR PDES
1.3. Consider a linear partial differential operator of order 1, with constant complex coefficients, in n variables:
Prove the following statements If the complex vector a = (a', , . . , a")is of the form za where a is a real vector and z is some complex number, we may perform a linear change of variables x + y in R" such that the expression of L in the y coordinates becomes z ajay'. Suppose now that the vectors Re a and Im a are linearly independent (this demands n > 1); show that we may now perform a linear change of variables x + y in R" such that the expression of L in the y coordinates becomes
1.4. Consider a linear partial differential operator of order I , with C" real coefficients in a neighborhood of the origin in Rn: n
L=
a
C d ( x ) -.ax'
j=l
Suppose that at least one of the coefficients ai does not vanish at the origin. Show that there is a C" change of variables x -+ y in a neighborhood of the origin such that the expression of L in the y cootdinates is w ( y ) a/dy' with w(0) # 0. Is it always possible to choose the w r d i n a t e s y so as to have w(y) = 1 near y = O ? 1.5. Let = (az/axz)- (d2/ay2)be the wave operator in the plane. Show that there is a linear change of variables in R2 transforming 0 into 4 az/dx' ay'. Use this fact to prove that all the solutions of the wave equation y ) g(x - y). in the plane, Ou(x, y ) = 0, are of the form u(x, y ) = f ( x where f and g are functions on the real line (the student may assume that u is twice continuously differentiable). Comparing with the property of the Laplacian in the plane, expressed in formula (1. lo), is it true that every harmonic function in two variables is of the form u(x, y ) = f ( x + iy) + g(x - iy), where f and g are holornorphic functions of one complex variable? 1.6. Prove that, i f f is a C' function, defined in the closed unit ball B, = { x E R"; 1x1 2 I}, valued in C", which satisfies (l.16), the system (1.15) always has a C' solution u, defined in B, (and valued in C).Show by an example that this is not so if n = 2 and if one replaces the closed unit disk
+ +
12
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
+
B, c R2 by the semiclosed annulus 0 < x2 y 2 5 1. Relate this fact with the fact that the anti-Cauchy-Riemann equation au/dz =f does not always have a solution u which is a holomorphic function of z in that annulus, when fitself is holomorphic (say in the complement of the origin in the plane). 1.7. Consider the heat equation in one space variable: (1.21)
au
a2u
at
ax2
-=-
(in R2).
Describe all solutions of the form u(x, t ) = v(x)w(t)(the student may assume that v and w are infinitely differentiable). Apply the same method t o the wave equation and to the Schrodinger equation (also in one space variable) and compare the results in the threecases. 1.8. Consider the wave equation in the plane: (1.22) Let uo(x), u l ( x ) be two C" functions in the real line, vanishing outside the interval 1x1 2 1. By exploiting the description of all the solutions of (1.22) given in Exercise I .5, show that there is a unique solution u of (1.22) such that (1.23)
u,(x, 0 ) = u1(x).
u(x, 0) = uo(x),
Suppose that u1 E 0 and that uo(x) > 0 for I x I < 1. What is the support of the solution u ? What is the region in the plane in which the solution u(x, t ) > O ? 1.9. Let uo(x) be a C" periodic function on the real line. One may then write +m
(1.24) p= - m
where w = 2n/T, T the period of u,, , with Fourier coefficients uo, whose absolute value tends to zero, as IpI -,co, faster than any power of l/lpl. Admitting these facts (if he does not know them) the student should try to exploit the method of separation of variables indicated in Exercise 1.7, in order to solve the initial value problem for (1.21), that is, find a solution u of that equation, which satisfies u(x, 0) = uo(x). 1.10. Give necessary and sufficient conditions on the function uo(x), defined and C" in the open interval I x I < 1, in order that there be a solution u(x, t ) t o the initial value problem: (1.25) (1.26)
au
. au ax
-= I at
u(x, 0) = uo(x)
in the region in the interval
x2
+ t 2 < 1, 1 X I < 1.
Sect. 11
THE BASIC EXAMPLES OF LINEAR PDES
13
1.11. Let R be an open subset of R", P(x, d/dx) a linear partial differential operator with complex coefficients, defined and C" in R, C a space of distributions in R [that is, a linear subspace of 9'(n),equipped with a locally convex topology finer than the one induced by 9'(Q)]. Prove that the linear subspace Ker, P consisting of the distributions, belonging to C, which satisfy the homogeneous equation P ( x , d/dx)h = 0 in R, is closed in C. 1.12. Let R and P(x, d j d x ) be as in Exercise 1.1 1 and write
P ( x , d/dx) =
c c,(x)(~/W,
la1 S m
where (d/dx)n = (J/Jx')al ... (djdx"),. and I a1 = 01, point in R. Prove the following statement:
+ ... + a n . Let
xo be a
LEMMA 1 . 1 . If there is an n-tuple LY with length I 01 I = m such that c,(xo) # 0, the homogeneous equation P(x, d/dx)h = 0 in R does not have any nonzero distribution solution whose support consists of the single point xo. [Hint: Use the fact that any distribution u whose support is { x o } is a finite
linear combination of derivatives of the Dirac measure at xo : u=
2 181 < m '
1
US P ( X - xo).
2 Existence and Smoothness of Solutions Not Submitted to Side Conditions Since we now take a closer look at linear partial differential operators, it is time to adopt convenient notation and terminology. As it is now universally accepted, we adopt the multi-index notation. A linear partial differential operator in n independent variables x', ..., x", with complex coefficients defined in an open subset R of R",is a polynomial in the partial differentiations and has the form (2.1)
P(x, a p x ) =
c c,(x)(d/dx)".
la1 S m
Here a is a multi-index, that is, an n-tuple of integers a j 2 0; l a / denotes its length a1 + ... + a,. Also (d/i?x)" = (d/dxl)al * . - (a/dx")".. The integer m is usually the order of the operator; this assumes that for some multi-index a with length I a I = m, the coefficient c,(x) is not identically equal to zero. If the coefficients c, are constant throughout R, we write P(d/dx) instead of P(x, djdx). Because of the usefulness of the Fourier transformation, one often prefers to deal with the elementary operators D~ =
-4-1 ajaxj,
j = 1,
. .. , n,
rather than d/dx'. We consider then differential operators of the form P(x, D) =
1 C,(X)D"
I4 5 m
rather than of the form (2.1). A typical differential operator with constant coefficients is denoted by P ( D ) . When passing from the study of ordinary differential equations to partial differential equations, the first novelty one notices is that all the examples of 14
Sect. 2)
SOLUTIONS NOT SUBMITTED TO SIDE CONDITIONS
15
PDEs one encounters have infinitely many linearly independent solutions. Take, for instance, a homogeneous PDE with constant coefficients (2.3)
P(D)u = 0.
For suitable complex vectors [, we may take u = exp(i(c, x ) ) , where
(C,
x ) =(,XI
+ * . . + in?.
Indeed, p(~)~i(S= . x p(c)&S. > x>
and it suffices to require i to be a zero of the polynomial P, i.e.,
P(C) = 0.
(2.4)
But a polynomial on C" always has infinitely many distinct zeros, provided that n > 1 , and if ( # c', the functions exp(i([, x ) ) and exp(i([', x ) ) are linearly independent. Let us give another example, the equation (with variable coefficients) in the plane (2.5)
au - y au = 0.
x-
ax
ay
If we switch to polar coordinates, r, 0, (2.5) can be written
au -- 0. _ ae
We get a solution of (2.6) by taking u to be any (once continuously differentiable) function of r alone, that is, rotation-invariant. Here again there are many linearly independent solutions. Among the wealth of solutions of a partial differential equation, the mathematician will select a restricted number, often only one, which satisfy certain " side conditions." Let us briefly describe two of the most important examples :
Example 2.1. Let 52 be an open subset of R3,bounded, whose boundary is a smooth surface S. The Laplace equation Ah = 0 in R has infinitely many linearly independent solutions. But we may consider only those which take preassigned values on S . Given a function f defined on S (and not necessarily anywhere else), we may seek a harmonic function h in R which is equal tof on S. Under suitable conditions, there will be one and only one such function. Example 2.2. Consider the wave equation (2.7)
(a/ax)2u- (ajay)*U = o
16
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
in the plane R2.Any function u(x, y ) = f ( x + y ) + g ( x - y ) is a solution of (2.7), but we may want to look only at those solutions such that u and (d/dy)u take preassigned values when y = 0, say uo(x), u , ( x ) , respectively. Under suitable assumptions of smoothness, we may write
(2.8)
f ( 4+ g ( x ) = U O W ,
(2.9)
f ’ ( x > - 9’W = U I ( X ) .
If we denote by U , ( x ) any primitive of u , ( x ) , we derive from (2.9):
whence, by combining with ( 2 . Q
It is verified at once that the value of u ( x , y ) thus obtained is independent of the choice of the primitive U , of u l . We have therefore obtained a solution of (2.7) satisfying our side conditions. One can show, as we do eventually, that this solution is unique. This type of problem with side conditions leads immediately to problems of existence and uniqueness: Does there exist a solution to Ah = 0 in $2equal tof on S? And if there is at least one solution, how many are there? We have only looked at homogeneous equations, i.e., at equations where the right-hand side is equal to zero, but exactly the same considerations apply to inhomogeneous equations, where the right-hand side is a given function or distribution. But then it is only natural that before tackling the much more difficult problems with side conditions, we wish to know more about the solutions of the equation when no side conditions are imposed. For instance, there are linear partial differential equations with variable coefficients which do not have solutions at all (another difference with ordinary differential equations !). In such cases, it makes little sense to discuss problems with side conditions. It is thus somewhat more urgent that we answer the question: Does the equation Pu=f
have solutions? Unfortunately, this is a very difficult question to answer in general, althoughit might not be too difficult to answer in particular cases or for particular classes of equations (once it has been more precisely formulated). One such class for which a fairly satisfactory answer can be given is that of linear PDEs with constant coefficients.
Sect. 21
SOLUTIONS NOT SUBMITTED TO SIDE CONDITIONS
17
2.1 Existence of Solutions of Linear PDEs with Constant Coefficients in Bounded Sets We could also discuss the problem in unbounded sets but it would then require some machinery that is not too elementary. The discussion in bounded sets, on the contrary, can be carried through with relatively elementary machinery (assuming that the student has some basic knowledge of distribution theory). We consider the equation (2.1 1)
P ( D)u =f
in a bounded set Q c R“. As stated before, P ( ( ) is a polynomial in n variables with complex coefficients; we have substituted D j = d/dx’ for the variable t j . We assume that there is a distribution E in R” such that
-4-1
(2.12)
P ( D ) E = 6,
the Dirac distribution at the origin.
First of all, there is atheorem which states that this is indeed the case, whatever the polynomial P (except in the uninteresting case where all its coefficients are equal to zero). Secondly, in all the cases we discuss, we shall effectively construct, i.e., give an explicit expression of E. The distribution E is called a fundamental solution of P ( D ) ; a linear PDE in general possesses many fundamental solutions. For take arbitrarily one of them, E , , and add to it any solution h of the homogeneous equation (2.3): Eo + h is again a fundamental solution (fundamental solutions form a linear manifold, not passing through the origin, of the space of all distributions in R”). Let us now return to (2. I I). We shall make the assumption that if we extend J’by setting it equal to zero outside of Q, we obtain a distribution, which we shall denote by], in the whole space R”. It is not true that we can always do this. For instance, if n = I and Q is the open interval 10, 1[, take
f = exp(
+ l/x).
Then the extension offwhich we defined above does not yield a distribution. If our assumption aboutfwere to be eliminated, it would not be possible to handle arbitrary bounded open sets, although it would still be possible to solve our problem for some of them (but by different methods). How inconvenient is the assumption? In practice, not much: Any f belonging to a space LP(Q), 1 2 p 5 + co,is “extendable” in the way described-as a matter of fact, any distribution in C2 which does not grow at the boundary dC2 of R (“grow” in a suitable sense, adapted to distributions) faster than some power of the inverse of the distance to dR is extendable. We use both hypotheses-the boundedness of R and the extendability of f-via the property that f has compact support in R” (the support off is
18
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
contained in the closure a of R). Now we may convolve two distributions, one of which, at least, has compact support-such as E andf. This yields a distribution E *f in R" which is not, in general, compactly supported. Now, in order to differentiate a convolution like E * f , we are free to differentiate any one of the factors. Thus, P ( D ) ( E * f ) = [ P ( D ) E ]* f = 6
*f = f .
In particular, if we call u the restriction to R of the distribution in R",E *f: we see that (2.1 1) is satisfied. If we can content ourselves with solving (2.1 1) in a smaller set than R, more precisely in a relatively compact open subset 0' of R,we can dispense with the extendability assumption about the right-hand side J We arbitrarily select a C" function g with compact support in R, equal to one in R' and take u = E * ( g f ) . It is easily checked that (2.1 1) is satisfied in R'. Once fundamental solutions have been introduced, for example in the process of solving (2.1 l), it is soon discovered that they can be put to several uses. One of these is related to the smoothness of the solutions of (2.11). Let us take another look at ordinary differential equations, this time with constant coefficients. Suppose, in fact, that (2.11) represents such an equation. Suppose moreover that the right-hand sidef is a C" function in R. Then, every solution u of (2.11) is a C" function. This is a striking property and we may ask whether it persists when P ( D ) is a linear partial differential operator. The answer in general is that it does not. In order to see this, it suffices to return to Example 2.2: The right-hand side, the function zero, is certainly C". But this obviously is not the case for u(x, y ) =f (x + y ) + g ( x - y ) when either f or g is not a C" function of one variable. Could it happen, however, for certain special equations? Now the answer is yes. It is, in fact, true in three of our basic examples : the Laplace, Cauchy-Riemann, and heat equations. Let us tackle the problem by means of the fundamental solution, but first let us introduce a definition: Definition 2.1. The linear partial differential operator P in R is said to be hypoelliptic if, given any open subset U of R and any distribution u in U, u is a C" function in U i f this is true of Pu.
Suppose that P has constant coefficients, and let E be any fundamental solution of P = P ( D ) . By (2.12) we see that P(D)E = 0 in the complement of the origin in R",R"\{O}. Therefore, if P ( D ) is hypoelliptic, E must be a C" function in R"\{O}. But a classical theorem of L. Schwartz states that the converse is true :
THEOREM 2.1. If there is one fundamental solution E of P ( D ) which is a C" function in R"\{O}, P ( D ) is hypoelliptic (in R").
Sect. 21
SOLUTIONS NOT SUBMITTED TO SIDE CONDITIONS
19
Theorem 2.1 provides us with a useful criterion to test whether a PDE with constant coefficients is hypoelliptic. We shall exploit it in subsequent sections. Proof of Theorem 2.1. Let U be an arbitrary open subset of R",u a distribution in U such that f = P(D)u is a C" function in U.Let xo be an arbitrary point of U. It will suffice to show that, under our hypothesis, u is a C" function in some open neighborhood of xo . Let U' be a relatively compact open subset of U containing xo and let g be a cutoff function of the kind already encountered, that is, g E C:(U), g = 1 in U'. We have P(D)(gu) = gP(D)u
+ u = gf + u.
By applying the standard Leibniz formula for differentiation of a product, we see that u is a linear combination of derivatives of g of order #O, hence v = 0 where the derivatives of g vanish, in particular in U' and outside the support of g. Using the fundamental solution E in the statement of Theorem 2. I we may write
E * P(D)(gu) = [P(D)EI * (gu)= gu, hence gu = E
* (gf)+ E * U .
But g f ~C: and the convolution of any distribution with any C" function with compact support is a C" function, hence everything is reduced to showing that E * u is a C" function in an open neighborhood of xo:for then this would also be true of gu, which is equal to u in U' 3 xo . Let us select a number E > 0 such that the set V, = {X E R";d(x, R"\U') > E }
is a neighborhood of xo . Let i,(x), another cutoff function, be equal to one 8 for I X I < 4 2 , to zero for I > E ; 5, E C"(R"). We write
XI
* u = ( ( , E ) * v + [(l - CJE] * U . - ( , ) E l * v is a C" function everywhere, since by
E
The second term [(I our hypothesis, (1 - 5,)E is a C" function and since the convolution of a distribution with compact support, here u, with any C" function is a C" function. On the other hand, by the standard properties of convolution,
= SUPP(t;,E ) + SUPP 0. we see that the support of ( 5 , E ) * u is contained
SUPPKC, E ) *
UI
By our choice of t;, in the order E neighborhood of supp u. We have already seen that u = 0 in U'. We reach the conclusion that ( L E E )* D vanishes in V, and, consequently, that E * u is a C" function in V, . Q.E.D.
20
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
We have seen two important applications of fundamental solutions. We study a third one in the next section-it is closely related to the application in Theorem 2.1. In the sections following the next one, we explicitly compute certain remarkable fundamental solutions in the basic examples of Sect. 1.
Exercises
2.1. Consider the following equation in the plane au au x - - y - =f(x2 ay ax
(2.13)
+ y2),
where f ( t ) is a C" function of the real variable t such that if
f(t)=O
t
2,
f(+)=I.
Show that Eq. (2.13) has no distribution solution in R2\{O}. [Hint: Rewrite
JJ If(x2 +yZ)12d x d y by introducing u, assumed to satisfy (2.13).] 2.2. Let R be a nonempty open subset of R", n > 1, L a first-order linear partial differential operator with real C" coefficients in Q. Using Exercise 1.4, show that L cannot be hypoelliptic in f2. 2.3. Consider the following equation :
(2.14)
(ajax)'u - (a/ay)2u
+ ~ ( xy)u, = 0,
where 1 is a C" function in R2. Show that (2.14) is not hypoelliptic in R2. 2.4. Let P ( D ) denote a linear partial differential operator with constant coefficients in R",E a distribution in R" such that h = P(D)E - 6
is a C" function in R". By suitably modifying the proof of Theorem 2.1 show that, if E is a C" function in the complement of the origin, P ( D ) is hypoelliptic in R". 2.5. Extending Definition 2.1 to systems (whether determined or not), show that the gradient ($1.6) is hypoelliptic (in any open subset of R")whereas the divergence (01.7) is not, unless n = 1. 2.6. Construct all the fundamental solutions of d/dx, d/dx - 1,d2/dx2- I (in R', I E C). Apply this, together with the change of variables in Exercise 1.5, to find a fundamental solution of a2/ax2- a2/ay2(in R2). 2.7. Show that we may obtain a set of solutions of the homogeneous Schrodinger equation ($1 S), (2.15)
1
au
--=i at
a2u
a2
(in R2),
Sect. 21
SOLUTIONS NOT SUBMITTED TO SIDE CONDITIONS
21
by taking
j
+m
(2.16)
u(x, t ) =
exp[i(r2x - ~t)]w(r)d?.
-m
By choosing w appropriately, exhibit a solution of (2.15) which is not a C" function in R2. 2.8. Show that there is an integral representation analogous to (2.16) for certain solutions of the wave equation in R"" (n > 0 arbitrary): (2.17) Show that, for any n > 0, Eq. (2.17) cannot be hypoelliptic. 2.9. Let P ( D ) , Q ( D ) be two linear partial differential operators with constant coefficients in R".Show that the product P ( D ) Q ( D ) is hypoelliptic if and only if both P ( D ) and Q(D)are. What could we have stated if P and Q had variable coefficients? 2.10. Let P ( D ) be a differential operator with constant coefficients in R", having a fundamental solution E which is C" in the complement of the origin. Let R be an open subset of R" and denote by N , the space of distribution solutions of the homogeneous equation P(D)h = 0 [by Theorem 2.1 A",c Cm(S2)]. Prove that the following topologies on N , are identical: (i) the C" topology (uniform convergence of the functions and all their derivatives on every compact subset of R); (ii) the Co topology (uniform convergence of the functions on every compact subset of 0);(iii) the topology induced by 9'(!2) (the functions f , E N , converge if the integrals jf, 6 dx converge for any test function 4 E C:(Q), uniformly on bounded subsets of C:(SZ): it suffices to consider sequencesf , and thus require the convergence of the integrals for each individual 41. 2.11. Derive from the result stated in Exercise 2.10 the following (we use the same notation as in Exercise 2.10 and form the same hypotheses): If P(D) is hypoelliptic, and if {fi} is a collection of solutions, in 0,of the homogeneous equation P( D ) f = 0, bounded (independently of cc) on every compact subset of R, it contains a subsequence which converges in Cm(R). Is the conclusion true if we assume, instead of boundedness on every compact subset of 0, that to every test function 4 E CF(R) there is a constant C(4) > 0 such that, for all indices cx,
3 Analyticity of Solutions Let us return, once more, to a linear ordinary differential equation with constant coefficients. We have already pointed out that if the right-hand side f is a smooth, i.e., a C", function, this is also true of all the solutions. But there is more: Iff is an analytic function, all the solutions will also be analytic. We recall that a complex-valued function u in an open set R c R"is analytic if its Taylor expansion about any point of R converges to the function in a full neighborhood of the point (cf. Definition 3.2 below). An equivalent definition is that u can be extended to an open neighborhood of R in C" as an analytic function of the complex variables zl, . . ., z", or, as we shall often say, as a holomorphic function. Obviously, this property that all the solutions are analytic when the righthand side is an analytic function does not hold for an arbitrary linear PDE with constant coefficients: It does not hold for the wave equations (cf. Example 2.2); it does not hold for the heat equation either, as we show below (Example 3.1). But it holds for the Laplace equation and for the CauchyRiemann equation. As a matter of fact, the linear PDEs with constant coefficients which do possess it are exactly known. They are the elliptic equations. However, there are nonelliptic equations with variable coefficients which have the property. Without going into the meaning of ellipticity, the preceding property warrants a new definition: Definition 3.1. (Cf. Definition 2. I ) A linear partial differential operator P in R is said to be analytic-hypoelliptic i f , given any open subset U of R and any distribution u in U , u is an analytic function in U fi this is true of Pu. Example 3.1. The heat equation is not analytic-hypoelliptic.
It suffices to look at the case where there is only one space variable:
22
Sect. 31
ANALYTICITY OF SOLUTIONS
23
We define the following function F(x, t ) in the set U = {(x, t ) E R2;x # O}: when t > 0, when t 4 0 ,
t-''* exp( - x 2 / 4 t )
F(x, t )
=
It is checked at once that (i) F(x, t) is a C" function in the whole set U, (ii) L F = 0 in U , (iii) F is not an analytic function in U . Claim (iii) is obvious if we recall that an analytic function equal to zero in an open subset U , of its domain of definition, here the part of U where t < 0, vanishes in every connected component of U whose intersection with U , is not empty: In our case, the union of these components is U itself. Suppose now that P ( D ) is an analytic-hypoelliptic linear partial differential operator with constant coefficients and let E be any one of its fundamental solutions; since P(D)E = 0 in R"\{O}, E must be an analytic function there (see Definition 3.2). In fact, the converse is true, and this is the analog of Theorem 2.1 : THEOREM 3 . 1 . rf there is a fundamental solution E of P(D) which is an analytic function in R"\{O), P ( D ) is analytic-hypoelliptic in R". Proof By Theorem 2.1 we know that P ( D ) is hypoelliptic (thus, every linear partial diferentiul operator with constant coefficients which is unulytic-hypoelliptic is hypoelliptic). It suffices to prove that if U is any open subset of R", u any C" function in U such that P(D)u =f is analytic in U,then u is also analytic in U . We reason in the neighborhood of an arbitrary point x o of U. Although it is not really necessary, we shallsimplifytheargument byexploiting a consequence of the Cauchy-Kovalevska theorem (on the latter, see Sect. 17; the student unfamiliar with the Cauchy-Kovalevska theorem may want to accept the consequence of it as applied below), namely, that if the open neighborhood W t U of x , is sufficiently small, there is an analytic function h in W satisfying there the equation P(D)h =$ Thus we have P(D)(u- h) = 0 in Wand it is enough to show that u - h is analytic in a neighborhood of x, . Let U' be an open neighborhood of xo whose closure is compact and contained in W, and let g E C r ( W ) be equal to one in U'.We set = P ( D ) [ g ( u- h)l,
from which: g(u - h) = E
* V.
We must show that E * v is analytic in the neighborhood of x o , noting that v = 0 in U'. We use the same cutoff function 5, as in the proof of Theorem 2.1, and write
E*v
= ([,E)
* u + [(I
- (,)El
* U.
24
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
The first term vanishes in the neighborhood of x o , V, = {x; d(x, R"\U') > E } ( E > 0 is very small); it will be enough to show that the second term is analytic in V, . We use the following characterization of analytic functions: Dejnition 3.2. A C" function cp in an open subset 0 of R" is analytic in 0 if to every compact subset K of 0 there is a constant rK > 0 such that
(3.1)
1 sup - rkl ~ " c p ( x1 )< U!
I
+ 03,
where the supremum is computed over all points x of K and all n-tuples nonnegative integers.
Since v is a C" function with compact support, w = [(l - (,)El function. We have D"w= { D a [ ( l- (,)El}
*v
=
[(l - [,)DUE]* v
c(
of
* u is a C"
+ T* V .
By the Leibniz formula, the support of Tis contained in the support of the gradient of (, , therefore T * v = 0 in V,. On the other hand, ( I - (,)D"E is a C" function in the whole space R".Therefore, when x E V , ,
J
D"w(x) = 11 - 5,(Y)lD"E(Y)V(X- Y>dY.
The integration can be restricted to the set K = { ~ E R "l ;
~ L] E / ~ , YV,-SUPPV}. E
Now K is clearly a compact subset of R"\{O} where E is analytic; we may therefore apply (3.1) with E instead of (and y instead of x). We obtain
where x is any point in V, and C = 1 + sup,, once that w is analytic in V,.
I[,[.
From this we conclude at Q.E.D.
In the forthcoming sections we shall compute certain fundamental solutions of the ordinary differential equations with constant coefficients, and of the Laplace and of the Cauchy-Riemann equations. It will be seen that these fundamental solutions are all analytic in the complement of the origin. We shall thus reach the conclusion that these equations are all analytic-hypoelliptic. The result for the Laplace equation has been known for a long time as Weyl's lemma.
Sect. 31
ANALYTICITY OF SOLUTIONS
25
Exercises
3.1. By using the fact that the Cauchy-Riemann operator in RZ is analytic-hypoelliptic, derive that this is also true of the operator in R2
where c(x, y ) is an analytic function in the plane. 3.2. Let P(D) denote a linear partial differential operator with constant coefficients in R", E a distribution in R" such that h =P(D)E - 6 is an analytic function in R". By suitably modifying the proof of Theorem 3.1 show that, i f E is an analyticfunction in the complement ofthe origin, P ( D ) is analytic-hypoelliptic in R". 3.3. Prove that any C" function u(x, y ) which satisfies the homogeneous wave equation in the plane, u,, - uyy= 0, is analytic with respect to (x, y ) in R2 if (and only if) u(0, y ) and u,(O, y ) are analytic functions of y in R'. 3.4. For each positive number d, introduce the following definition (cf. Definition 3.2): Definition 3.3. A C" function cp in an open subset 8 of R" is said to belong to the dth Geurey class in 8 , and one writes cp E G,(O), i f to every compact subset K of 0 there is rK > 0 such that (3.2) By adapting the proof of Theorem 3.1, prove the following statement: THEOREM 3.2. Suppose that P(D) has a fundamental solution E which belongs to G,(R"\(O)). Then, given any open subset U of R" and any distribution u in U, such that P( D)u = 0 in U, we must have u E Gd(U ) .
4 Fundamental Solutions of Ordinary Differential Equations This section is devoted to a review of certain facts concerning ordinary differential equations and their interpretation in the light of PDE theory. We limit ourselves to ODES with constant coefficients. Later we shall consider ODES with variable.coefficients. The simplest nontrivial ordinary differential operators are the first-order ones, of the kind d L=--a , aEC. dx We seek the solutions of the ODE
dF - - aF = 6 . dx The answer is well known when a = 0. It is one of the first examples of differentiation of distributions that the derivative of the Heauiside function H(x), equal to one when x > 0 and to zero when x < 0, is the Dirac distribution. In order to get all the solutions of F‘ = 6 , it suffices to add to H any constant function. Now, the solutions of (4.1) are transformed into the solutions of F’ = 6 by the mapping F H e-(IXF. By performing the inverse transformation, we see that all the solutions of (4.1) are given by
(4.217
F
=E
+ Ceax,
t These fundamental solutions are analytic in the complement of the origin; therefore, by Theorem 3.1, the differential operator L is analytic-hypoelliptic. In particular, all the distribution solutions of the homogeneous equation Lh = 0 are “classical” solutions, and (4.2) gives all the distribution solutions of (4.1). 26
Sect. 41
SOLUTIONS OF ORDINARY DIFFERENTIAL EQUATIONS
27
where C is an arbitrary complex constant and E = H(x)e"". Note that when C = 0, F = E is the unique fundamental solution of L with support in the nonnegative half-line. When C = - 1, Fis the unique fundamental solution of L with support in the nonpositive half-line. When C = +, we obtain the " symmetric" fundamental solution F
=
-5 sgn(x)e"",
where sgn(x) denotes the function signum of x, equal to + 1 for x > 0 and to - 1 for x 0 (as a distribution, it need not be defined when x = 0). Let us also observe that W = eax is the unique solution of the initial value problem
-=
(4.3) and that we have E=HW.
Although this example may seem overly simple, it will only need straightforward elaboration to enable us to construct all the fundamental solutions of ODEs, as well as important fundamental solutions of many PDEs. First of all, let us point out that the preceding argument extends directly to first-order determined systems of ODEs. Suppose that we deal with a system o f p equations i n p unknowns. This means that, instead of L, we study the matrix ordinary differential operator
d L=I--A, dx where I is the p x p identity matrix and A any p x p matrix (with complex entries). Thus L acts now on functions valued in Cp, the space of complex p-vectors, and transforms them into functions (or distributions) also valued in CP. Keep in mind, however, that all these functions and distributions are defined in intervals of the real line (where we continue to denote the variable by x). The problem (4.3) must be suitably generalized: U denotes now a matrix-valued function, the solution of the problem (4-4)
LU
= 0, /x=O
= I,
the p x p identity matrix.
The unique solution of (4.4) is found at once: U
= e"A.
28
[Chap. I
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
Now let K denote an arbitrary distribution with values in the space of p x p matrices or, equivalently, a p x p matrix whose entries are distributions. We have, by the Leibniz formula, L ( U K ) = ( L U ) K + UK'
=
UK'.
Thus, if we wish to solve L ( U K ) = SI, which corresponds to finding a rightfundamental solution of L,we must solve K'
= 6u-I =
= 61.
Thus, everything reduces to finding the solution of K'
(4.5)
= 61.
In the scalar case, i.e., when p = 1, the solutions of (4.5) are well known as we have pointed out: They are the distributions of the form H(x) C, where H is the Heaviside function and C an arbitrary constant. This is what motivated our choice of the fundamental solution E of L and the expressions (4.2). When p is arbitrary, the same considerations apply: All the solutions K of (4.5)are of the form K = H(x)I C
+
+
where C is now a constant p x p matrix. Note that H(x)Z is the p x p diagonal matrix whose diagonal entries are all equal to the Heaviside function. We have thus obtained all the right-fundamental solutions of L. They are the matrix-valued distributions of the form (4.6)
F
=E
+ eXAC,
where
(4.7)
E
= H(x)eXA
is the unique fundamental solution of L with support in the nonnegative half-li ne. We have used the names fundamental solution and right-fundamental solution. The reason is that systems of PDEs with constant coefficients have both right- and left-fundamental solutions. For instance, a left-fundamental solution of L will be a distribution G such that (4.8)
G'
- G A = 6,
whereas the distributions F given by (4.6) satisfy F' - AF = 6. (4.9) If F is a right-fundamental solution and T any distribution with compact
support, we have
L(F * T ) = T,
Sect. 41
SOLUTIONS OF ORDINARY DIFFERENTIAL EQUATIONS
29
whereas if G is a left-fundamental solution of L , we have
G*LT=T All left-fundamental solutions of L are given by G =E
(4.10)
+ CeXA,
where E is given by (4.7) and C is an arbitrary constant p x p matrix. There are two-sided fundamental solutions of L , for instance E in (4.7) and, in general, many more (see Exercise 4.1). The next step is to extend the preceding argument to higher order ODES with constant coefficients. Consider therefore the operator L
d"
(jm-1
=-
dx"
+ a, dx"-' -+ . . . + a,
We shall begin by transforming the equation Lu =finto a first-order system, in the customary manner, by setting
(4.11) The equation Lu = f i s then rewritten as
(4.12)
u:,+a,u,+a,u,-,
+ * . *
+a,u, =J
To this we adjoin the equations
(4.13)
U; = uk+l,
k
= 1,
. . . ,WI - 1,
which follow from (4.11). Let us then denote by M the rn x rn matrix 0 ... 0 I M=( 0 0 -am
-am-l
-am-2
...
Last, we denote by u the vector (i.e., the one-column matrix) with components u,, . . . , urn,and by f the vector with componentsf, = 0, . . . , f m - , = 0,s" =f. Equations (4.12) and (4.13) together can be written in the system form:
(4.14)
du
- _- M u + f . dx
Suppose now that for our given f we find a solution u to (4.14). Explicitly stated this means that (4.12) and (4.13) hold. But substituting then (4.13), in the equivalent form (4.11) where u = u1 defines u, into (4.12) yields at once Lu=f.
30
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
If we seek a fundamental solution E to L it is only natural to try to solve fi = 0 f o r j < m andf, = 6, the Dirac distribution. We proceed more or less as we did in dealing with a general system. We set
(4.14) when
u
= ex%,
which transforms (4.14) into (4.15)
In order to find out the value of the right-hand side in (4.15), it suffices to observe that f = de,, where em is the vector with all components equal to zero, except the mth one, which is equal to one. Then e-XMf= 6(e-xMe,)
Thus (4.15) reduces to
dv _ -- de,. dx
(4.16)
But all the solutions of (4.16) are easily found; they are
+ c,
v = H(x)e,
(4.17)
where c is a constant rn-vector. Therefore
+
u = H(x)exMe, exMc.
(4.18)
The last term in the expression of u corresponds to the solution of the system of homogeneous equations
dw _
(4.19)
dx
-- Mw.
A particular case of such a solution w is eXMem: As a matter of fact, it is the only solution such that
w j= O
(4.20)
if j < m ;
w, = 1
when x = O .
Let us set U ( x ) = wl(x). If we recall that wk = ( w ~ ) ( ~ - ' k) , = 2, . . ., m,we see that U(x) is the unique solution of LU
(4.21)
= 0,
such that (4.22)
U ( j )= 0
if j < m - 1;
U(,-')
=
1
when x
= 0.
Sect. 41
SOLUTIONS OF ORDINARY DIFFERENTIAL EQUATIONS
31
The fundamental solutions of L are all obtained by taking the first components u1 of the vector-valued functions (4.18). In other words, they are all of the form
F=E+h,
(4.23) where Lh
=0
and
(4.24)
E
= HU,
We have now come around full circle and obtained the unique fundamental solution of L with support in the nonnegative half-line as the product of the Heaviside function with a remarkable solution of the homogeneous equation, namely the solution satisfying the initial conditions (4.22)-just as in the case of a single first-order equation. Notice that all the fundamental solutions we have constructed are analytic in R'\{O}: All the equations considered in this section are analytic-hypoelliptic. Notice also that in the preceding discussion, we had no need to investigate the nature of the eigenvalues of the matrices A or M. We should also stress the link we have encountered between the construction of fundamental solutions and the initial value problem (4.21F(4.22). This relationship dominates the theory of certain linear PDEs, such as the wave equation.
Exercises 4.1. Let Idenote thep x p identity matrix, A a p x p matrix with complex entries. Describe all the two-sided fundamental solutions of the system of ordinary differential operators L = f d/dx - A . 4.2. Let I, A , , . . . , A , denote m + 1 p x p matrices, Idenoting the identity matrix. Describe all the fundamental solutions of the operator
(3 (X'+ . . . +
L=I-
+A, -
A,.
4.3. Describe all the two-sided fundamental solutions of the differential operator L in Exercise 4.2. 4.4. Can one solve the problem in Exercise 4.2 by transforming L into a determined system of first-order ordinary differential operators ? If you think this is the case, describe the procedure. Otherwise, explain why it is not possible. 4.5. Let L denote the operator in Exercise 4.1. Give necessary and sufficient conditions on the matrix A in order that the fundamental solution of L with support in the nonnegative half-line be (1) a bounded function; (2) a function rapidly decaying at infinity.
32
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
4.6. Give necessary and sufficient conditions on the matrix A in order that the operator L in Exercise 4.1 have a fundamental solution which is rapidly decaying at infinity on the real line. 4.7. Give the expression of the fundamental solution of
whose support is the nonnegative half-line. Do the same when k is replaced by J-1k . 4.8. Let U be the solution of the problem (4.21)-(4.22), H the Heaviside function, and L the ordinary differential operator in (4.21). Show by direct computation that L ( H U ) = 6. 4.9. Let L denote the operator in Exercise 4.1, xA(z)= det(z1- A ) the characteristic polynomial of A . Show that the solutions of the homogeneous equation Lu = 0 are vector-valued functions whose components u i satisfy the equation xA(d/dx)ui= 0. Conversely, let u be any vector-valued function with this property; show that there is a p x p system of ordinary differential equations with constant coefficients, of order p - 1, M , such that Mu is a solution of Lv = 0. Exactly describe M in the general case. 4.10. Let L denote the operator in Exercise 4.1. Let r be an invertible p x p matrix. There is a natural linear isomorphism between the spaces of solutions of Lh = 0 and L,h = dhjdx - TAT-'h = 0. Describe this isomorphism. Next write A = S + N where S is a diagonalizable p x p matrix, N a nil-potent p x p matrix (i.e., such that N P = 0) such that SN = N S . Study the equation Lh = 0 in the two separate cases S = 0 and N = 0: (1) Show that, when N = 0, this equation is equivalent (in a sense to be made precise) with the p scalar first-order ODES
dw - - l i w =0, dx
15 i S p ,
where the liare the eigenvalues of A (repeated according to their multiplicity); (2) when S = 0, show that all the solutions of the equation just given are of the form g(x), where g is a vector-valued polynomial of degree < p . Describe precisely how g is determined by its value at x = 0, g(0). Use the preceding discussion to describe exactly all the solutions of Lh = 0 in the general case (when neither S nor N is necessarily zero). 4.11. Let 9'+denote the space of distributions on the real line which vanish for the strictly negative values of the variable (denoted by t). Show that the convolution of any two distributions S, T E9'+always makes sense [we may
Sect. 41
SOLUTIONS OF ORDINARY DIFFERENTIAL EQUATIONS
define the convolution S (4.25)
(S
* T , 4)
33
* T by the formula
=
( T , w + t)>>,
<ss9
4 E C(R91.
Show that 9'+ is a commutative convolution algebra with a unit, the Dirac measure 6. Let L be an ordinary differential operator of order rn, with constant coefficients (not all of which are equal to zero). Show that Lb has an inverse in the convolution algebra 9;. Compute this inverse. Derive f r o q this that the equation LU = F has a unique solution U E 9 i for every F E 9'+ and write U in terms of F. 4.12. Let L be the operator in Exercise 4.1. Define the transpose ' L of L by the formula
1
+m
(4.26)
--co
(Lu,u ) dx =
1
+m
( u , 'Lu) dx,
u , u E C,"(R'; C p ) .
-m
Give the expression of ' L and relate the right- and left-fundamental solutions of ' L to those of L.
5 Fundamental Solutions of the Cauchy-Riemann Operator Consider the equation in the (x, y ) plane R2:
-dE+ + - =. asE. ax ay We shall transform it into an ordinary differential equation by means of the Fourier transformation with respect to the variabley. Iff is a function of x , y we set (5.2)
j ( x , q ) = J e - iyY(x,Y ) dY.
This only makes sense for suitable functionsf. Here we shall use the Fourier transformation in the distribution sense, that is, acting on tempered distributions with respect to y . We recall that these are the distributions equal to finite sums of derivatives (with respect to y ) of continuous functions of y which grow at infinity slower than some power IyJ'. On the space 9; of tempered distributions, the Fouriertransformation isalinear isomorphism. Its inverse extends the transformation defined on functions by theclassical Fourier inversionformula. On functions of the type we consider here, this reads
(5.3)
1
I
. f k Y ) = g eiY%
rl) 4.
Note that the right-hand side of (5.1) is the Dirac distribution 6(x, y ) at the origin in R2.Its Fourier transformation with respect to y is equal to S(x)l(q), where l(q) stands for the function of q identically equal to one. For brevity we shall write 6(x) instead of 6(x)l(q). If we Fourier transform both sides of (5.1), we obtain (5.4)
aE
- - qE = 6(x).
ax
34
Sect. 51
SOLUTIONS OF THE CAUCHY-RIEMANN OPERATOR
35
This is a first-order differential equation of the kind (4.1). We know ali its fundamental solutions-they are given by (4.2). Here, however, the constant C need not be independent of q . We thus obtain
B = ( ~ ( x+) C(q))eqx.
(5.5)
At this point there arises a serious problem: We are interested in E , not in its Fourier transform with respect to y , l?. We wish, therefore, to compute the inverse Fourier transform of E. But this is only possible if E is tempered with respect to q. For x > 0, esx is not tempered when q --+ + 0 0 ; for x < 0, it is not tempered when q --t - 00. Fortunately, we have the “constant” C(q) which we can use to correct the growth of H(x)esx.This is done in the following manner. We choose when q > O , when q < 0. This means that H( -x)esX
(5.7)
when q > 0, when q < O .
Observe that, when x# 0, E is a very rapidly decreasing function of q at infinity. We may compute its inverse Fourier transform by the formula (5.3):
-x
1
+ iy‘
This is valid when x # 0. But we observe that the function z-’ is locally integrable in the plane (z = x + iy). Indeed, the only question concerns its integrability in a neighborhood of the origin. But using polar coordinates r, 0 shows that we must check the (local) integrability of (l/r)e-ie with respect to r dr do; and this is obvious. Furthermore, z-’ goes to zero at infinity. Therefore, it is certainly a tempered distribution whose Fourier transform with respect to y is equal to (27t)E. We have reached the conclusion that 1/2nz is a solution of (5.1). Recalling that the Cauchy-Riemann operator is
a
(- + );
i a 2 ax
-=-
a5
i
9
36
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
we may state
THEOREM 5.1, The locally integraHe function I/nz is a fundamental solution of the Cauchy-Riemann operator d/dF. It is now very easy to describe all the fundamental solutions of a/dF. Indeed, (nz)-’ is an analytic function of ( x , y ) in RZ\{O}; hence, by Theorem 3.1, a/& is analytic-hypoelliptic. Consequently, all the distribution solutions of the homogeneous Cauchy-Riemann equations are “classical ” solutions, in fact C“ functions, and therefore holomorphic functions. Thus COROLLARY 5.1. Every fundamental solution ofdld5 is of the form llnz + IT, where h is an entirefunction. We recall that an entire function of z E C is a function which is holomorphic in the whole plane. We now derive the homogeneous and inhomogeneous Cauchy formulas. Let R be a bounded open subset of C. We shall assume that the boundary of R is a finite disjoint union of Jordan curves, i.e., of closed rectifiable curves that are not self-intersecting. Moreover, we assume that, locally, R lies on one side of its boundary, dR: Every point of an has an open neighborhood U such that U\dR consists of two open connected components, only one of which is contained in R. This excludes the possibility, for instance, that R be the union of two half-disks such as { z ; IzI < 1 , Im z > 0}, { z ; IzI < 1 , Im z < 0). Suppose then that
in an open neighborhood of Ti, where both u and f are defined and, for instance, once continuously differentiable (or even C” : the formula which we shall obtain will clearly extend to a larger class of functions). Let then x = zn(x, y ) denote the characteristic function of R, equal to one in R and to zero everywhere else. By the Leibniz formula we derive from (5.8): (5.9) Since both sides of (5.9) have compact support we may write
(5.10) Let us compute (d/d5)xn. We know that it vanishes in R and off a, hence it is a distribution supported by the boundary 2R. For obvious reasons, it is “uniformly distributed”; i.e., given any two points of dR, it must be equal in
Sect. 51
SOLUTIONS OF THE CAUCHY-RIEMANN
OPERATOR
37
a neighborhood of the first one (in 80)to what it is in a similar neighborhood of the second one. But this does not tell us what it is. In an obvious sense, xn is a measure depending continuously on the set R, and 8/82 is a continuous linear operator when acting on distributions. We may therefore approximate R by open sets which are the union of finitely many, pairwise disjoint, simple sets, like triangles or disks. Since xn is an additive functional of R, it suffices to prove the formula when R is a disk. We may take the center of the disk as the origin. The equation of dR is now r = R , a constant greater than zero. For any 4 E Cp(R), we have
using the expression of 8/85 in polar coordinates. We may compute [setting $(r, 0) = 4 ( r cos 6, r sin 6)]
J:neie& dB = - i
$(r, 8)e" d 6 ,
whence ((d/d2)xn,
4) = -fR
J'
Zn
eie&R, 6) d6.
0
This can be rewritten in the following manner 1 . .
(5.1 1)
Formula (5.11) clearly makes sense for arbitrary open subsets R of R2 of the type we considered at the start: The boundary dR is a finite union of disjoint Jordan curves and, locally, R lies on one side of it. But we must state precisely what the notation $ stands for: I t cannot mean counterclockwise orientation, as the example R = { z E C; 1 < 1 z 1 < 2) readily shows. Because of our assumptions, whenever the boundary is a smooth curve, we may look at its interior normal N : Its unit vector, v , is orthogonal to the line Ttangent to 2R and points inside R. Then the orientation on 8R is such that the angle (T, v ) is + 4 2 (not - 4 2 ; T is the unit tangent vector to the oriented curve dR at the same point where we consider v). For boundaries which are not smooth (i.e., C") we go to the limit over smooth approximations. Roughly speaking, $ indicates that the orientation on SR is counterclockwise when we /ook,from inside R.
38
[Chap. I
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
Substituting (5.1 1) into (5.10) yields the inhomogeneous Cauchy formula
There is a more “symmetric” way of writing (5.12) by systematically using the two complex variables z, Z. If we interpret the integration of a function h(x, y ) over a (measurable) subset of the plane as the integration of the two-form h(x, y ) dx A dy, and the integration of h(x, y ) with respect to dx + i dy over a curve as the integration, over the same curve, of the oneform h(x, y ) dz, noting furthermore that dz
A
d.? = - 2 i d x
A
dy,
we see that (5.12) can be rewritten as (5.12‘)
1
U(X,
IJ-auaz
y) =-
2ni
(x’, y ’ )
dz‘
A
d2‘
1
dz‘ u(x’, Y’) - (in Q). z’-z
When f = 0 in R, (5.8) implies that u is a holomorphk function in R ; we denote it then by u(z). I n this case, formula (5.12‘) [or (5.1211 is simplified into the standard Cauchy formula: (5.13) Remark 5.1. Formula (5.10) can be regarded as the extension of the inhomogeneous Cauchy formula [(5.12) or (5.12’)] to arbitrary bounded open sets R (without the condition that their boundary be a finite union of disjoint Jordan curves).
Exercises 5.1.
Show by direct computation that, for any cp
E
CF(R2),
1 dqo d x d y cp(0) = - ; - -d2 x + iy
Jj
[Hint: Switch to polar coordinates.] 5.2. Suppose thatf(x, y ) E Cm(R2) vanishes for 1x1 > 1 and is periodic with respect to y , with period 2n. By using the Fourier expansion off with
Sect. 51
OPERATOR
SOLUTIONS OF THE CAUCHY-RIEMANN
39
respect to y , show that the Cauchy-Riemann equation (5.14)
inR2,
k(g+i$)=f
has a solution which is periodic with respect t o y (with period 271).Show also that there is a distribution Efx, y ) in R2,periodic with respect to y , such that the following is a solution of (5.14):
5.3. Let D, denote the disk IzI < r ( r > 0) in the plane. Let L be a straight line through the origin, f a continuous (complex-valued) function in the closure D,,holomorphic in the complement of D, n L in D,. By applying the formula for the distribution derivative of a function g(t) of one real variable, which is smooth for f > 0 and t c 0, and has a finite jump at t = 0, prove that f is holomorphic in the whole of D, . 5.4. Let D, be as in Exercise 5.3. By using the assertion made there, prove the Schwarz - rejection principle, namely that if h is any continuous function in 0: = { z = x + iy E Dr; y 2 0}, holomorphic for y > 0, and real for y = 0, it has a unique holomorphic extention to the whole of D, . 5.5. Let p be a distribution with compact support, in the plane R2, u any distribution in R2 satisfying au
-=p
(5.16)
az
in R2.
Prove that if h is any entire function in C and y any closed simple rectifiable curve containing the support of p in its interior, we have (5.17)
$ u(z)h(z) dz = 2 J y ( p ,
h).
Y
5.6. Let fi,. . . , f. be n functions belonging to C;(Rzn) and assume n > 1. Assume that the following compatibility conditions [cf. (1.16)] are satisfied:
(5.18)
afj -_ -a ! , _ a?, azj
j,k=l,
..., n.
By exploiting the fact that l/nz, is a fundamental solution of d/dZ,, show that there is a Cmfunction u in Rz” which satisfies the (overdetermined) system of equations (5.19)
au
azj =fi,
-
j = 1,
..., n,
40
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
in the whole of R2" and which, furthermore, has compact support. [Hint: Take I z2 I + . . . + 1 z, I -, + co in the inhomogeneous Cauchy formula with respect to zl.] 5.7. Let K be a compact subset of R2" whose complement is connected (n > 1) and let h be a holomorphic function of z = (zl, . . . , z,) in C"\K [which means that h is, say, C', and satisfies the Cauchy-Riemann equations (d/aFj)h = 0, j = 1, . . ., n, there]. Derive from the result in Exercise 5.6 that there is an entire function h in C" which is equal to h in C"\K. [Hint: Use a cutoff function g E C:(R2"), equal to one in a neighborhood of K, and solve (5.19) where fi = ( d / d F j ) { ( l - g)h}, j = 1, . . . , n.] Show that the same result would not be true if n = 1. (The preceding extension result in dimension n > 1 is a classical theorem due to Hartogs.) 5.8. Let R be an arbitrary open subset of R2. Let Cm(R) denote, as usual, the space of complex C" functions in R, equipped with the topology 9 of uniform convergence of the functions and of each one of their derivatives on the compact subsets of R. Let F'denote the topology on C" in which convergence of functions means that the functions only converge uniformly on compact subsets of R and that their first Z derivatives converge in the sense of the natural topology 9. By using the inhomogeneous Cauchy formula (5.12), show that the topologies 9 and 5' are the same. 5.9. Show that there exists a real-analytic, but no complex-analytic (i.e., holomorphic) function u in C\{O} such that
(5.20)
au
1
- = - i n C\(O}. a2 z
Let f ( z )be a holomorphic function in C\{O}. By using its Laurent expansion, prove that the equation
(5.21)
av aZ
- =f in C\{O}
always has a solution, and that all its (distribution) solutions are real-analytic in C\{O}. 5.10. Let P(z) be a polynomial in one variable, with complex coefficients. Describe all solutions in R2 of the partial differential equation
(5.22) and construct a fundamental solution of P ( a / Z ) .
Fundamental Solutions of the Heat and of the SchrBdinger Equations If we go back to our basic examples in Sect. 1, we see that both the heat equation and Schrodinger's equation are first order with respect to the time variable t . It is only natural to perform a Fourier transformation with respect to the space variables x, according to the formula ii(& t ) =
e - i ( x . r > u ( x ,t ) d x R"
where (x, 5 ) = x'tl + ... + ~ " 5. ,On the other hand, the wave equation is second order with respect to all variables, and this is also true of the Laplace equation. Their treatment is therefore more difficult. Constructing a fundamental solution for the heat equation and for Schrodinger's equation is similar to constructing one for the Cauchy-Riemann equations (see pp. 34-35).
6.1 Heat Equation We must find the solutions of the equation (6 -2)
dE
- - Ax E = 6 at
in Rnf'.
As announced, we perform a Fourier transformation with respect to x; (6.2)
is transformed into
(6.3)
aE
at
+ I tI2E = h(t).
A remarkable fundamental solution of (6.3) is given by
B(t, t ) = H ( t ) exp( - t I t 1)'. 41
42
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
It is obviously tempered with respect to 5, in fact it decays rapidly at infinity in R. provided that t # 0. We may perform its inverse Fourier transformation: according to the formula
Suppose t > 0 and take i((, t) =I?((, t)
= exp( - tI
(I2).
We obtain
Let z be the complex variable in the plane C' and consider the integral
I
I = exp( - t z 2 ) dz, Y
where y is any horizontal line Im z = const = c. We contend that I is independent of c. To see this it suffices to apply Cauchy's integral theorem, taking as the contour of integration the rectangle boundary shown in Fig. 6.1. The contributions to the integral of exp(-fz2) coming from the two
FIG.6.1
vertical segments at Re z = R rapidly go to zero as R + + m. Consequently, at the limit, the sum of the integrals on the two horizontal lines, with the orientations as indicated, equals zero. This proves that
s,
+m
I=
exp[ - t(Re z)'] d(Re z)
and, finally, that E(x, f)
=( 2 J i i - n
exp(-
=
!$I.
Sect. 61
HEAT AND SCHRODINGER EQUATIONS
43
Therefore, for t # 0, the inverse Fourier transform of E(t, t ) (with respect to x and <) is given by
As a matter of fact, if we look at things in the right way, we see easily that E(x, t ) is the inverse Fourier transform of E(t, t ) for all t . Indeed, E(t, t ) is a tempered distribution, in fact a bounded function, of E R,, with values in the space of bounded measurable functions o f t [these form a Banach space, L"(R:)]. As such, it has an inverse Fourier transform, which is precisely (6.5). All other fundamental solutions of the heat equation are obtained by adding to E , given by (6.5), a solution of the homogeneous heat equation. Let us point out two important properties of the fundamental solution E in (6.5):
<
E(x, t ) is invariant under the space rotations, i.e., it depends only on the square norm 1 xi' as far as the space variables are concerned; (2) E(x, t ) is a C" function of (x, t ) in the complement of the origin, R"+'\(O). In order to see this, it suffices to reason in the neighborhood of a point x = x o , t = 0. One must show that all the partial derivatives of (1)
tend to zero when t > 0 converges to zero. This is not difficult. Notice that E is nowhere zero in the half-space t > 0, and everywhere zero in the half-space t < 0. Consequently, E cannot be analytic in R"+'\(O} (though it is an analytic function in the region t # 0) and the heat operator, as already pointed out, is hypoelliptic but not analytic-hypoelliptic (Definitions 2.I and 3.1).
6.2 The Schrodinger Equation Let us write the analogs of (6.2) and (6.3) in the present case: 1 dE i at
A X E = 6 in R"",
1 aE i dt
--
+ It 1 2 8 = q t > .
We obtain at once a solution of (6.7) by taking
E = iH(t) exp( - it I t I').
44
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
Here also E can be looked upon as a tempered distribution, or a bounded function of 5 E R, valued in the space Lm(R:). We must compute its inverse Fourier transform with respect to 5. We observe that, when E > 0 tends to zero,
EE= iH(t) exp( - ( E
+ it) I t I')
converges, in the distribution sense, to E. As a consequence, the inverse Fourier transform of Ee converges to the one of E. The same argument applied to the function u(x, t ) given by (6.4) shows that v(x, t )
= (2n)-"
J-R"
exp(i(x,
5)
+ if) I t 1 ') d t
- (E
is equal to
At this point we observe that when t > 0 and E has a limit, namely
--f
0 ( E > 0), the function (6.8)
(6.9) where C is equal to the nth power of the Fresnel integral +W
J=
exp(-il')
dl.
J - W
We have +m
+m
J = J , - iJ,,
J, =
-m
cos 1 ' dl,
J,
= -m
sin 1 ' dl.
There are various ways of computing J , ,.I1 (differentiation with respect to a parameter, integration by the residue methods, etc.); they show that
Since (1 -
i)/a
J,
= exp(-in/4),
= .I1 = Jn/2.
we see that (6.9) equals
and, therefore, that (6.10)
exp( -
g).
Sect. 61
45
HEAT A N D SCHRODINGEREQUATIONS
Now, a few remarks about the distribution E are in order:
( I ) E is rotation-invariant with respect to the space variables; (2) E is a C", and even an anafytic function when t # 0 ; it is not a C" function in any neighborhood of a point x = x o , t = 0. As a matter of fact, it is not even an integrable function in such a neighborhood, unless n = 1. For instance, when n = 2, E is a pseudofunction with respect to t , of the type Pf H( t)t - * . The singularities of E on the hyperplane t = 0 get worse as n increases. At a n y rate, the Schrodinger operator is nor hypoelliptic.
Exercises 6.1. Suppose n = 2. Prove directly that E , given by (6.5), is a fundamental solution of the heat operator, i.e., show that, given any test function cp E CP(R3), E(x, t )
Cp@) = -
- Ax.)
dx dt.
R3
[Hint: Use the polar coordinates r, 0 in the plane of the space variables x.] 6.2. Same question as in 6.1, but replace formula (6.5) by (6.10) and the heat operator by Schrodinger's. 6.3. Let E be the distribution defined by (6.5). Prove that it is a locally integrable function. As a matter of fact, prove that its integral in the slab {(x, t ) E R"+l; x E R", 0
< t < tl}
(tl <
+ co)
is bounded. 6.4. Let E be given by (6.5). For t > 0, we regard E as a distribution i n the space variables, depending on the parameter t . Prove that, as r + +0, E(x, t ) + 6(x), the Dirac distribution. 6.5. Let E be given by (6.5). Assuming n 2 3, compute E(x, t ) dt,
F ( x )= 0
and show, by using the result in Exercise 6.4, that (6.1 1)
-AF
=6
(in R").
6.6. Prove that the fundamental solution of the heat equation, E , given by (6.5), belongs to G,(R"f'\{O}) (Definition 3.3, Exercise 3.4) and derive that the same is true of a n y fundamental solution of the heat equation.
46
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
6.7. Let T” denote the n-dimensional torus, 8 = (O1, . . . , 13,) the variable in it (each Oj is an angle and varies from 0 to 27~).Construct theanalogE’(8, t) of the fundamental solution E(x, t) given in (6.5), for the “periodic heat equation ”
write the Fourier series expansion of E’(8, t). What happens if t is also interpreted as an angle and made to vary on the one-dimensional torus ?
7 Fundamental Solutions of the Wave Equation The choice of a time variable is unambiguouslyt suggested to us in the wave equation
It is not so in the case of the Laplace equation (see Sect. 9). As in the study of the heat and of Schrodinger's equations, we call n the number of space variables: x = (x', . . ., x"). We now construct a remarkable fundamental solution of (7.1). The case n = 1 is totally elementary and we begin by describing it.
Case of a Single Space Variable We denote by x the space variable. We wish to solve the equation a2E
a2E
at2
aX2
= h(t) 6(x).
It is convenient to change variables and set (7.3)
s = ~ - x ,
y=t+x.
As we are going to see, we may find a solution E of (7.2) which is a locally integrable function, also denoted by E(t, x). Let cp be any element of Cp(R2)
t Up to a Lorentz transformation. 47
48
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
and let q(t, x ) [resp. cp'(s, y ) ] be its expression in the coordinates t, x (resp. s, y). Of course, we have
Also
On the other hand,
Thus the distribution E is defined, in the coordinates s, y , by the function
-)
s+y y-s E'(s, y ) = E ( -, 2 2 2
.
We must therefore find a locally integrable function E', solution of
(7.4) and set, once this is done,
E(t, X )
(7.5)
= 2Eli(t - X ,
t
+ x).
We immediately find many solutions of (7.4), suitable linear combinations of (7.6)
E:
= $ff(s)H(y),
Ei
=
E\
-&H(s)H(-y),
=
-$H(-s)H(y),
El:= $H( - s)H( - y). By (7.5) the first of these functions leads to the fundamental solution El of the wave operator in R2 which, in the coordinates t , x,is defined by the locally integrable function
(7.7)
E,(t, x) = +H(t - x)H(t
+ x).
Sect. 71
FUNDAMENTAL SOLUTIONS OF THE WAVE EQUATION
49
Note that E , ( t , X)
= +fl(t)H(t2 - x').
The support of El is equal to the following sector (which should be called the forward light-cone in the plane): (7.8)
t+x20,
t-xzo.
Moreover, the function E,(t, x) is constant (equal to +) in the interior of the set (7.8).Consequently, its singular support, i.e., the smallest closed set outside of which it is a C" function, is exactly equal to the boundary of the set (7.8), that is, to the union of the two rays (7.9)
x+t=o,
t2O;
x-t=o,
tzo.
We shall eventually prove that El is the unique fundamental solution of the wave equation with support in (7.8). Similar considerations apply to the transforms E , , E , , E4 of E ; , E : , E i , respectively. The sector (7.8) must be replaced by different sectors. For instance, in the case of E,, the role of (7.8) is played by its symmetric image with respect to the origin: (7.10)
x+ts0,
t-xso.
Analogous phenomena will appear in the case of n > 1 space variables, with important differences.
The General Case After a Fourier transformation with respect to the space variables x , the equation we have t o solve reads (7.1 1) We readily obtain the solution of (7.11) with support in the half-line t 2 0 by means of formula (4.24). Indeed, it is not difficult to find what W is, in our case:
hence (7.12)
50
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
The solution of (7.1 1) with support in the half-line t S 0 is likewise given by (7.13)
We obtain further solutions of (7.11) by taking expressions of the form a +B
4t)E+ + B(t)E- ,
(7.14)
= 1,
with a, j?,say, bounded measurable functions of 5 E R, or even tempered distributions in R,. The first fact to be noted is that all the distributions (7.14) are then tempered with respect to t; their inverse Fourier transforms in these variables 5 are fundamental solutions of the wave equation. In the sequel, we limit ourselves to studying t) and its inverse Fourier transform with respect to 4, E + ( t , x). Let cp = q ( x ) denote an arbitrary test function in R". We observe that E + ( t , t) is a function of t with values in the space of bounded measurable functions of 5. By definition of the Fourier transformation of tempered distributions, we have
e+(t,
( E + ( t , 4, d x ) > = ( E + ( t , 4, (F9-1q)(x))
0, (F-lcp)(t)),
= (E+(t,
where the first two brackets are those of the duality between distributions and test functions in the variable x, whereas the third one applies to the variable t. We have also denoted by 9and 9-' the Fourier transformation and its inverse, respectively; the first one transforms distributions in t into distributions in x; 9 - l transforms distributions in x into distributions in 5. In summary, by using formula (6.4),
We cannot interchange the two integrations and write E + ( t , x) = (2n)-" l e i ( x ' C ) E + ( tt) , dt,
(7.15)
because E is not integrable with respect to t (therefore, we cannot apply Fubini's theorem). But we can introduce a convergence factor, like exp( -E I 5 I), and interpret (7.15) as meaning (7.16)
E + ( t , x)
= (2n)-"
lim Jexp(i(x, &-++O
t) - c l t l ) E + ( t , t) dt.
Sect. 71
FUNDAMENTAL SOLUTIONS OF THE WAVE EQUATION
51
We may rewrite (7.15) in the following manner: E + ( t , XI
= H(t)*(t,
4,
where (7.17)
where the integrals should be understood in the same sense as the one in ( 7 . 1 9 , with a hidden convergence factor. The distribution a(?, x) will play an important role in the theory of the Cauchy problem for the wave equation. Taking the Fourier transform of E + ( t , x) with respect to the variable x has one great defect: It breaks the Lorentz invariance, which is one of the most important features of the wave equation. In order to recover this invariance, we must also perform a Fourier transformation with respect to t . The Fourier transform of E + ( t , x ) with respect to ( t , x) is, of course, equal to the Fourier transform of E + ( t , () with respect to t :
This is a divergent integral. We must introduce a convergence factor: A
1 -j” 2l5li
{exp[-it(7-k-
I<()]-exp[-it(.r--iie+
0
Thus the Fourier transform we seek is given by (7.18)
l(l)]}dt
52
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
Let q(t, x) be an arbitrary test function in R"" and set @(t,x ) = cp( - t , -x). By definition of the Fourier transformation on tempered distributions, we have
Let us first look at the integration with respect to T on the right-hand side of (7.19). We observe that the integrand is a holomorphic function of r regarded as a complex variable, for Im T < 6. On every horizontal line Im T = const < E , the integrand decays faster than any power of I Re T I when the latter tends to zero. We may therefore apply Cauchy's integral theorem as we have done in 56.1. We find that the integral under consideration is equal to
-'
(7.20) where u is any number greater than zero. But in the integral (7.20) we can safely go to the limit as E > 0 goes to zero. We obtain
The integral on the right-hand side of (7.21) may be viewed as an integral over an n-dimensional real submanifold of C" = R2".But notice that we have gone into the nonreal space only along the T direction; this still gives a privileged role to the time variable, as we have chosen it: this is not Lorentzinvariant. We are now going to deform the contour of integration with respect to the variables . . . , (.(each time, by applying the Cauchy integral theorem) so as to reestablish this invariance. We begin with the integral (7.22) We can move the integration to a horizontal line (llE C' ; = t1 - ib,, E R'} (b, real) provided that the denominator does not vanish in a slab -b, - c < Im il < +c, c > 0. We have (T
- iu)' -
+ iqJ2 - I('/'
We have used the notation to vanish, we should have T'-
5'
=
= 2'
-
1
(t2,. . . , (.).
1(('=a2-q:,
- a2
+ q: - 2i(uz + qltl).
If the quantity just given were ~ ~ + q , =t O, .
In other words, the vectors ( T , 5) and (a, q l , 0, . . . , 0) should be orthogonal while lying on the same level surface of the Lorentz quadratic form. This is
Sect. 71
F U N D A M E N T A L SOLUTIONS OF THE WAVE EQUATION
53
impossible if we require that a 2 - r ] : > 0 : In the interior of the forward [ight-cone, there are not two vectors which are orthogonal. Furthermore, if we require a2 - b: > 0,
we will also have a' - q i > 0 for all q1 such that - b, - c < qI < c for some c > 0 (depending on a). We conclude that the integral (7.22) is equal to
s
+m
4 ( -~ ia, 5, (t - ia)' -
- ib,,
5') dt,
(tl - ib,)2 - / < ' I 2 '
Next we deform into the complex plane the contour of integration with respect to t 2 , . . . , 5. successively, by using exactly the same reasoning. We move these contours to horizontal lines
{ l jE C' ; l j = t j - ib,, 5, E R'},
b j real,
2 5 j 5 n.
We may do this by the Cauchy integral theorem provided that b = ( b l , . . ., 6,) satisfies the condition a' - lb12 > 0. For in this case, there is no real n-vector q such that -b, - c < q j < c, and such that proved
t2 -
It l2
j
=
I , . . ., n,
for some c > 0,
I I ',((t, t), (a, r ] ) )
= a' - r ]
= 0.
Finally we have
PROPOSITION 7.1. Let ( a , b ) E R"" be such that 1 bI2 < a*, a > 0. Then, given any cp E C;(R"+l),
(7.23)
( E + , @)
=
$(T
-(2n)-n-' sS(T
((17
-
- ja)'
We have used the following notation: I f . . . > lJ>
(1)'
=
ia,5 - ib) -
(t - ib)2dr dt.
l is a complex n-vector,
[=
+ ... + (iJ2.
We shall derive from Proposition 7.1 that E , is invariant under a remarkable subgroup of the Lorentz group. But we must first specify what we mean by a distribution being invariant under a linear transformation of the Euclidean space. Let the space be RN,y the variable in it. If the distribution under study, u,is a function, also denoted by u, and if Tis any linear transformation of RN,the transform of u under T is the function ~ ' ( y =) u(T-'y).
54
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
Viewing u and uT as distributions this means that, for every cp E Cr(RN),
=
ldet TI (u, c p T - ' ) .
This suggests the definition of uT when u is an arbitrary distribution:
(7.24)
(u', cp) = (det TI (u, cpT-'),
cp E Cp(RN).
We now ask, what is the Fourier transform of ur? Here again we restrict ourselves to the case where u is a function, in fact a C" function rapidly decaying at infinity. The formula we obtain will extend at once to tempered distributions, either by using the density of 9' in 9''or by the definition of the Fourier transforms of such distributions. We have (u*)(q) = [ e i ( y * n ) u ( T - ldy y ) = I det T 1 [ei(ry.n)u(y) dy = I det
TI /exp(i(y, 'Tq))u(y)dy = I det TI u('Ty),
where 'T is the transpose of T. In other words A
(7.25)
uT
=
(det
The Lorentz group in Rn+' is the group of linear transformations of R"" which preserves the quadratic form t Z - I XI'. Among the Lorentz transformations we isolate those which preserve the forward light-cone
r+= {(x, t ) E R ~ + L1x12 ; 6 t2,
t
2 01.
These form a subgroup 2'+of the Lorentz group. A Lorentz transformation belongs to it if, given an arbitrary point (x, t ) such that x2 c t2 and t > 0, its transform (x', t ' ) satisfies t' > 0. The subgroup contains the connected componentof the identityin the Lorentz group,Y+ AllLorentz transformations have determinants equal to +_ 1 ;the ones in 9 , have determinants equal to + 1. Note also that T H IT-' is an isomorphism (for the topological group structure) of 9,onto itself (Exercise 7.10). To say that u is invariant under Y , is to say that uT = u for all T E 44,.
PROPOSITION 7.2. The distribution E , is invariant under 44, . Proof. We must prove that, for any T EY , , (E, , cp) = (E, , cpr-'>. We might in fact replace cp(x) by $(x) = cp( -x) and note that ($)*-I = (cpT-')'. We use (7.23) and (7.25), the latter with cp in the place of u. We see that
Sect. 71
55
FUNDAMENTAL SOLUTIONS OF THE WAVE EQUATION
We change variables in the integral, setting (r’, 5’) = ‘T(t, 5). We write (a‘, b’) = ‘T(U, b). Then ( ~ - i u ) ~ - ( ( r - i i b ) ~ = ( r ~1-t 1 2 ) - ( a 2 - lb12)-2i(ar-(b,t)) = ( T ’~ 15’1 ’) - (a” - I b’12)- 2i(a’z’ - (b’, t’)), for any linear transformation which preserves the quadratic form r 2 - I t I must also preserve the associated bilinear form (ar - (b, 5)). We see therefore that
But (u’, b’) belongs to the interior of follows therefore from Proposition 7.1.
r+since
‘TEY + .Proposition 7.2 Q.E.D.
We conclude this section by deriving an important consequence of Proposition 7.2:
7.3. E + ( t , x) vanishes identically in the complement of the PROPOSITION forward light-cone
r+= { ( t , x) E R ” + I; I
112
t2,
t
5 01.
Proof. We know that E + ( t ,x) vanishes for t < 0. Let Tbe some transformation belonging to 9+ ; ET , being equal to E , , vanishes for t < 0, and therefore E+ must vanish on the image under T-‘ of the half-space t < 0. Let then 9 be an n-dimensional plane in R“” whose intersection with I-+ is reduced to the vertex of r +, the origin. It suffices to show that there is a transformation belonging to Y + transforming 9 into the hyperplane t = 0. The line L, orthogonal to 9 (in the sense of the Euclidean norm on R“”), is a straight line, half of which is in the interior of r+. It suffices to show that there is a transformation in 2+ which maps this half-line on the positive t-axis. In order to show this, one performs a space rotation, that is, a rotation affecting only the variables x, which transforms the line L into a line lying in the two-dimensional plane (1, x ‘ ) . This is clearly possible. We are thus reduced to a two-dimensional problem: We must find a Lorentz transformation involving only the two variables 1, x’, therefore preserving the quadratic form t 2 - (XI)’, and preserving the forward light-cone, mapping a straight line with equation
(7.26)
x1 = I t ,
111 < 1,
onto the t-axis. I t is easily seen (Exercise 7.10) that all Lorentz transformations in R2 are of the form ( E cash 0 E sinh 0) (7.27) qsinhd q c o s h e ’
56
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
YJ'
[Chap. I
where 8, E , YJ are real numbers and E' = = 1. The ones which preserve the forward light-cone correspond to the values E = q = 1. If we choose the argument 8 so as to have
+
1 + tanh 8 = 0, which is possible since I ;1I < 1, the matrix (7.27) transforms the line (7.26) into the t-axis. Q.E.D. Let us return to the distribution q ( t , x ) defined in (7.17). If we look at its Fourier transform with respect to x, sin( I 5 I t)/ I 5 1, we see at once that it is continuous, and in fact a C" function of t , with values in the space 9;of tempered distributions in the 5 variables. Moreover, this function vanishes at the origin. In order to see the latter, it suffices to show that, given any function (p E V:(R,),
which is obvious. We see therefore that E+(t, x ) is a continuous function of real t, valued in the space of distributions in x , when t > O , when t SO. From this and the fact that Q( - t , x ) = - * ( t , x ) , we see that Proposition 7.3 implies
COROLLARY 7.1. light-cone
The distribution %(t, x ) vanishes identically outside the 1x1' 5 t 2 > .
r = {(x, r) E R"";
Exercises
7.1. Set u = t Z - r2 and consider a twice continuously differentiable function on R"+',f(t, x ) = F(t2 - r 2 ) (r = 1x1). (i) Show that there is a differential operator L in the single variable u such that
Give the expression of L.
Sect. 71
FUNDAMENTAL SOLUTIONS OF THE WAVE EQUATION
57
(ii) Let T be the distribution on R"+l defined by the function f ( t , x ) . Show that there is a differential operator M in u such that the distribution
is defined by the function MF(u). Give the expression of M and compare with L in (i).
7.2. If you were to solve (7.2) as if E were a function, not a distribution, and if you were t o make the change of variables (7.3), solve the transformed equation, and revert to the coordinates t , x , what is the value of E that you would find? 7.3. What is the inverse Fourier transform, with respect to (7,t), of
4(t - ia, t), and that of
(R2
+ (t - i ~ + )1 [ 1 2~) k $ ( ~
- ia,
t),
where 4 is the Fourier transform of cp E C,"(R"+'), a any real number and k any nonnegative integer? 7.4. Suppose a > 1. (i) Show that, for all
(T,
t) E R"",
I ( ~ - i a ) ~(t(21 ? a ( 2 + [t12+a2)1/2.
(ii) Show that if k > 4 2 ,
is bounded by a constant independent of a > 1. Using the results in Exercises 7.3 and 7.4 show that if k is an integer greater than 4 2 , there is a constant C independent of a > 1 such that 7.5.
Derive from this inequality that the linear functional 4 + ( E , , +), defined by (7.21), is indeed a distribution on R"" (ignore the earlier definitions of E,).
58
[Chap. I
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
7.6. Derive from Exercise 7.5 that, for all 4
E
Cp(R"+l),
with a constant independent of a > 1 [A,,x stands for the Laplacian in the (n + 1) variables ( t , x ) ] . Deduce from inequality (7.29) that, if the support of 4 lies in the space t < 0, we have ( E , , 4) = 0; in other words, deduce that the support of E , is contained in the half-space t 2 0. 7.7. Construct explicitely a fundamental solution of the differential operator in R2,(a/&)' - (d/dx)' + A, where /I is an arbitrary complex number. 7.8. Prove the analog of Proposition 7.3 for the differential operator (sometimes called the Klein-Gordon operator)
(7.30)
K,
a2
= - - Ax at2
+ m2.
7.9. Let V be a nonzero real number. Prove the analog of Proposition 7.3 for the operator
(7.31) What is the equation of the light-cone in this case? 7.10. Prove that every Lorentz transformation in R2 is of the form (7.27). Identify any 2 x 2 real matrix with a point in R4 by regarding its four entries as coordinates. Describe the subset of R4 corresponding to the Lorentz group. Show that it consists of four connected components. 7.11. Let 64 denote the Lorentz group in R"", n 2 1. (i) Prove that T H IT-' is an isomorphism (for the group structure) of 8 onto itself, and of 9,onto itself. (ii) Prove that, if T E 64, and if we write
N Y ,Y ' ) = Y n + IY: (iii)
+ 1 - Y1Yi - * * * - Y n Y:
then B(Ty, Ty') = B(y, y'). Prove that, for any T E 9, det T =
9
Y , Y'
E
R"+
'9
1.
7.12. Using the result in Exercise 7.10 and the properties of the group of rotations in R",prove that the Lorentz group 8 in R"" consists exactly of four connected components. 7.13. Let us call an orbit of 8 (resp., of 64,) any subset of R"" which is theimageof Y(resp., of .Y+)under a mapping T H Tx0,for some xo E R"". Describe all the orbits of 64 (resp., of 64,) and point out any difference there might be between the case n = 1 and the cases n > 1.
More on the Supports and Singular Supports of the Fundamental Solutions of the Wave Equation It is not easy to obtain an explicit formula for 42 and E , (see Sect. 7) in the coordinates ( t , x), out of their Fourier integral representation (7.17), (7.19), or (7.21). Explicit formulas in the case of two and three space variables are derived in the Appendix to this section, but much valuable information can easily be extracted from the Fourier integrals, as we show here. We are going to prove two important properties of E , ,one concerning its support (the smallest closed set outside which it vanishes) and the other concerning its singular support (the smallest closed set outside which it is a C" function). We begin with the property of its support (classically known as the Huyghens principle). THEOREM 8.1. Ifn is odd aHd n > 1 , the support of E+ is exactly equal to the boundary of the forward fight-cone, i.e., to the set {(t, x ) ; t2 - I x ( ' = 0, t 2 0). For all other ualues of n, it is exactry equal to the forward light-cone itself Proof. (1) Case n = 1. In this case, we show that E , is equal to the distribution E , in the plane defined by (7.7). We have, by virtue of (7.12) and (7.19,
E,
= (2n)-'H(t)
+m
sin(((x
0
+ t ) ) - sin(t(x - t ) )
r
dt.
Direct computation of the integral would prove our contention but it suffices to observe that E , - El is a solution of the homogeneous wave equation in the plane. All the solutions of this equation are known: They are of the form S(t - x) + T(t x), where S and T are distributions in one
+
59
60
[Chap. I
BASIC EXAMPLES A N D FUNDAMENTAL SOLUTIONS
variable. Unless they are identically zero, their support cannot be contained in the half-plane t >= 0. ( 2 ) Case n > 1, odd. It suffices to prove that E , = 0 in the interior r+of the forward light-cone r +: For the support of E + , which we know (by Proposition 7 . 2 ) consists of orbits of 8 , and (by Proposition 7 . 3 )is contained in r + ,then must be equal to the boundary of r +(it cannot justobe {O}!). In E+ is the particular, t will always be assumed to be greater than zero. In r+, limit, as E + 0, of the distributions. --f
(8.1)
E:
=(
2 ~ ) ~ " lo+mexp[-(E - ir(8, w ) ) p ] sin(pt)p"-' d p dw, s*-1
where p , w (resp. r, 0) denote the spherical coordinates in the space R, (resp. R"); w (resp. 8) is the variable on the unit sphere Sn-l (resp. Sn-') [see (7.16)]. Now, clearly
(27c)"E; = (- 1)"-2
a n-2 (%)
[
Ie(s,1 ) dw,
S,-
I
where s = r ( 0 , w ) and
We note that (- 1)"-2(d/d~)'-2(~- a)-'
= (n - 2)!(~ - a)'-",
from which
Let now cp = cp(t) E C,"(R') with support in the positivehalf-linet cp will be made to depend also on x. For all k > 0, +m
( k - I)!
( t - a)-kcp(t) dt = ( k - 2 ) ! 0
s
> 0. Later
+m
(t
- a)-k+lrp'(t)d t .
0
We see therefore that (8.4)
(274"
j E:(t,
x)cp(t) d t
This formula is valid whatever the parity of n. Let us assume now that n is odd (la = 3, 5, ...). We notice that the integral with respect to w over of the function gE(s,t ) = ( t - s - ie)-' - ( t s + i E ) - l
+
Sect. 81
MORE ON SUPPORTS AND SINGULAR SUPPORTS
is equal to the integral of its even part, g:(s, t ) = -)[g,(s, t ) have g:(s, t ) = h&(t- s)
+ gE(-s,
61 t ) ] . We
+ h,(t + s),
where
h , ( ~=) t [ ( r- i s ) -
- (r
+ is)-']
is
=T2
+ E2
'
Consequently,
and, as E +
+O,
lo
+m
g:(s, t)cp("-')(t) dt + incp("-2'(ls)).
At this point, we take cp
=
cp(t, x ) E C:(R"+') with support in the open cone
(8.5)
t
> 1x1.
From (8.4) and from the preceding computation we reach the conclusion that
J J E + ( t ,x)cp(t, X ) dt dx
=
-*(2ni)'-"
But Is1 5 r = 1x1, therefore the point ( I s ( , x ) does not belong to the set (8.5) and cp( I s 1, x ) = 0. (3) Case n even. We return to (8.4), taking cp = cp(r, x ) as above and noting, as before, that Is1 < t on the support of 9.We can go to the limit right away in (8.4) as E --* + O and conclude that (8.6)
llE+(t,
x)cp(t, x) dt d x
= - (2ni)-"
1..
Js
t
(i)
n-2
"-, t 2 - s2
cp(t, x ) d t d x do.
This means that, in the interior of the forward light-cone, E+ is defined by the function t do t 2 - r 2 ( o , e)2
62
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
It is clear that (8.7) is an analytic function of ( t , r'), hence of (t, x) in the set (8.5). If it were to vanish in any open subset of (8.5), it would vanish in the whole of (8.5), since this set is connected. If this were true, the function
should be a polynomial with respect to t (of degree < n - 2). But in every cone r < I t , A < 1, it tends to zero as t + 0 0 ; therefore, if it were a polynomial in t, it should be identically zero. It is not, since it is 2 t-'lS,,-ll, Q.E.D. with IS,=,l = Area of S n - l .
+
We have also proved the second result we had in mind, that concerning the singular support of E , : THEOREM 8.2. Offthe boundary of the forward light-cone, E+(t, x) is an analytic function of ( t , x). The assertion is evident when n = 1, for in this case E+ is a constant off the boundary of the forward light-cone. When n is odd and n > 1, E , = 0 o f fthat boundary. When n is even, we have shown that E+ is defined in the interior of the forward light-cone by the function (8.7) which, as we have already said, is an analytic function of ( t , r 2 ) .
Appendix
Explicit Formulas for E , in Space Dimensions Two and Three
Our starting point will be (8.3).
E + in two space dimensions In this case, (8.3) reads
(8.9)
E:
H(t)
I,
=S
1
( t - s - ie
+ t + s + ie) d o .
We recall that s = r (8, o),where 8 is the variable point in S' and r = I X I . But any integral of the form jsn-, f ((0, a)) d o is independent of 8 E S"-' ; we are therefore free to choose 8 as we like. We choose it to be the unit vector along the first coordinate axis. Therefore (8, o>= cos p if we agree to write o = (cos cp, sin cp). We have
Sect. 81
63
MORE ON SUPPORTS AND SINGULAR SUPPORTS
Let us look at the integral
dv
1
+ ic). Then *(t + ic)Je+= ( h i ) - ’
dz
Let us set a = r/(t
s,z,=l+ az2
dz 22 + a ‘
Let us first consider the case where r > t 2 0. If E > 0 is sufficiently small, we have 1 a I > 1. In fact, let us set
+
t ie a-1 = --b+ir],
q +O when roots: -+
E
-+
t b=-
I , = -b - ir] + ai(l - (b + iq)2)1/2,
Modulo q 2 , we have Ia = -b - iq
a = +I.
+ ai(1 - b2)’l2(1 - -
1y;z)l’2
- ir] + cti(1 - b2)’l2+ abq(1 - b2)-’/2 = -b{1 - aq(1 - b 2 ) - ” 2 }+ ai((1 - b2)’/’ - q},
= -b
’= [ - b
+ ai(l - bz)”2](1- aq(l - b2)-1/2}.
We derive from this that, for q > 0 sufficiently small, (8.10)
lIal # 1 - aq(l - b2)-”2. Similarly, the polynomial f - ( z ) = z2 - 2(G)-’z + 1 = z2 - 2(b - iq)z + 1 has two roots: pa = b - ir] ai(1 - (b - i ~ ) ~ ) ’ / ~ such that, for small r] > 0,
+
/ p a /# 1 - ~ / ( l b2)-1/2 (notice that one goes from f + to f - by substituting -b for 6). The roots (8.1 1)
A,
64
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. 1
+
and pa belong to the disk I zI < 1 when a = 1 and lie outside of the closure of this disk when a = - 1. The residue off;' at z = 1,' is equal to 1 R , = (1 - (b i ~ ) ~ ) - " ~ ,
+
at z = 2,' is equal to R -
the one off:'
- iq)2)-'/2. Thus
= (1/2i)(l - (b
converges to zero when v] --t +0, i.e., when E + +O. We have thus proved that E , = 0 when r > t, which we knew already by Proposition 7.3. Consider now the case where r 5 t ; for every E > 0, we have then l a ( < 1 . The polynomialf, given above now has the roots
1 # --(I a
-a)-a-.
U
2 If a = - 1 , AU # -(2/u)(l - a2/4) lies outside of the disk I zI 5 1, whereas if a = + 1, AU belongs to the open disk 1 z 1 < 1 (for E > 0 sufficiently small). Since one passes fromf, tof- by substituting -5 for u, a similar conclusion applies to the root p a off- : It belongs to the disk I z I < 1 when a = 1 , and to the exterior 1 z I > 1 when t~ = - 1. The residue off; at A , equals
'
R
J;
U
- - (1 - a21-112,
+-2
at pL+' equals R -
the one off:'
+ JY
+
=
whence
-(&/2)(1 -
+ i ~ ) - ' ( l- a*)-"' + (t - i ~ ) - ' ( l= 2 Re[(t + i ~ ) - ' ( l- u ~ ) - ' /= ~ 2] Re[(t + i~)' - r 2 ] - 1 / 2 = (t
(the square root always stands for the branch that is greater than zero on the positive real numbers). Finally we observe that the function H(t2 - r2)(t2 - r 2 ) - 1 / 2 is locally integrable in R3 (where the measure is r dr do dt). Indeed,
f
f=tl
r=(fl
f Jt=ro that J > + JY
Jr=O
We see thus words,
( t 2 - r2)-'j2r
dr
= ('It1
dt.
fo
converges to 2H(t2 - r2)(t2 - r 2 ) - 1 / 2 .In other
(8.12) (0
otherwise.
Sect. 81
65
MORE ON SUPPORTS AND SINGULAR SUPPORTS
E , in three space dimensions When n = 3, (8.3) reads
(8.13) - E :
1
= j(2ni)-3H(t)
s1
(( 1 t - s - i&)2 ( t
+ s + i&)' ) d m
We use spherical coordinates in the 5 variables: 51
=P
cos (PI cos 402
52
7
=p
sin 'PI cos ( P 2
53
3
= P sin
(P2 9
and we choose 8, in s = r(0, a),to be the unit vector in the direction of the coordinate x3, i.e., (0, w ) = sin ( p 2 . We use the fact that dw = 5 I cos (P2 I 4%
if we allow both
Is
f(sin
'pl
5
and cp2 to vary between 0 and 2n, and observe that 1
(P2)
d40,
do
=j
2
11 2n
2n
0
0
!(sin
4021
I cos 402 I
d'P, d(P2
nl2
= 271
jo[f(sin
= 2n
J"m+f(-u)l
(P2)
+ f (- sin c p 2 ) l
cos v2 d v 2
du,
which combines with (8.13) to yield (8.14)
1
-Ee --(2n)-'H(t) +-2
1
-
+
1
1
-
t+ru+ie
t-rru+is
)du
66
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
Let now cp be an arbitrary element of CF(R4) and set (cf. Sect. 9) ( P h ( t , I-) = (4n)-’
S‘
V ( t , r, 9) d9.
By virtue of (8.14) we have
In the last expression, the integral with respect to t is equal to
For r > 0, &Jeconverges to -mpQ(r, r ) . We reach the conclusion that
In distributions notation, this can be rewritten as 1 E+ = (47c)-’ - b(t - r ) . (8.16) r Remarks. 8.1. When n = 2, the support of E + , as seen in (8.12), is identical with the set r 5 t ; when n = 3, (8.16) shows that it is the set r = t . Of course, this agrees with Theorem 8.1. 8.2. In both cases n = 2 and n = 3, we find that E , is a Radon measure; as a matter of fact, when n = 2, E , is absolutely continuous with respect to the Lebesgue measure: It is of the form h(x, t) dx dt, where h is locally integrable (but not bounded; compare with the case n = 1). When n = 3, E+ is a measure carried by the surface r = t of the forward light-cone, as we see in (8.15). As n 7 03, E + becomes less and less “regular ”: For n = 4, it ceases to be a measure; it is the distribution derivative of a measure carried by the forward light-cone (the increasing singularity of E+ with the number of variables is already visible when n = 1, 2, 3).
+
Exercises
8.1. Let m be a real number not equal to zero, E+(m) the fundamental solution of the Klein-Gordon operator (7.30) with support in the forward light-cone. State and prove the analogs of Theorems 8.1 and 8.2; in particular, show what becomes of the Huyghens principle.
Sect. 81
MORE ON SUPPORTS AND SINGULAR SUPPORTS
67
8.2. Compute the fundamental solution E , of the wave operator in any odd space dimension n 2 5 (cf. the second part of the Appendix to Sect. 8). 8.3. Set s = (t2 - 1 ~ 1 ~ ) ' when ' ~ t 2 1x1 (i.e., in the forward light-cone) and s = 0 otherwise. Show that
(8.17)
(F(4,4 ) = JJsV(2, X I dt dx,
4 E C,m(R"+'),
defines a holomorphic function of CI E C, Re a > -2, valued in the space of distributions in R"", .9'(R"+'). Prove that, for all complex numbers CI such that Re a > 0, we have
(8.18)
n F ( a ) = CI(CI - 1 + n)F(a - 2),
where 0 is the d' Alembertian, (d/Jt)2 - A x . [Hint: Use spherical coordinates in space, and cf. Proposition 9.3.1 8.4. Let F(m) be the distribution defined in (8.17) and set
(8.19) where r is the Euler gamma function (the distributions 2, have been introduced by Marcel Riesz and are called Riesz potentials). Prove that OZ, = Z a - * (0is the wave operator on R"") and thatZ, can be extended as an entire function of CI in C, valued in 9'(R"+'). Show that the support of Z,, for all complex a, is contained in the forward light-cone. 8.5. Let 2, be the Riesz potential (8.19). Compute 2, when n = 2 and derive from this the value of Z , . Compute 2, for n = 3 and derive from this the values of Z , and Z , . 8.6. Suppose n odd > I . Admitting the fact that 2, = 6, derive an explicit formula for the fundamental solution E , of the wave operator in R"" (the one having its support in the forward light-cone) making it obvious that its support is supported by the surface of the forward light-cone. 8.7. Let 2, be the Riesz potential (8.19). Prove that +m
(8.20)
w * =c ~ p z 2 p + 2 p=o
defines an entire function of the complex variable d, valued in 9'(R"+'), which is a fundamental solution of 0 - A. (The student may admit that 2, is the Dirac measure.)
Fundamental Solutions of the Laplace Equation In this section we are concerned with solving the equation in R", AE = 6.
(9.1)
We recall that
We could proceed as in the preceding sections and reduce the problem to solving an ordinarydifferential equation with respect to one of the variables x , say 2,after having performed a Fourier transformation with respect to the remaining ones. Equation (9.1) would be transformed into
(9.3)
(d/dx")2E - (<:
+ . . + <;*
$7
= d(x").
But under the present circumstances this procedure is strikingly unnatural. By selecting one of the variables, we violate the basic symmetry of the situation. The Laplace operator enters in many partial differential equations of physics which purport to describe phenomena taking place in isotropic media. It is therefore more natural, and more convenient, to use a different method-a method that does not disregard but instead fully exploits the fact that the Laplace operator A is rotation-invariant. Actually, the new method, like the one used in Sects. 5 to 7, reduces the problem to solving a linear ordinary differential equation. But the variable in the latter will be the radial variable, r=
IX I
=
[(XI)'
+ ... + (x")~]"~,
and the equation will not have constant coefficients. Nevertheless we shall beable to solve it without much difficulty. [When n 2 3 a different, and perhaps quicker, method of solving (9.1) is indicated in Exercise 6.5.1 68
Sect. 91
SOLUTIONS OF THE LAPLACE EQUATION
69
We must begin with some general considerations following from the invariance under rotations of the Laplace operator. As a matter of fact, we shall exploit the invariance of A, not only under rotations but under the full orthogonalgroup O(n). This is the group of linear transformations of R" which leave the quadratic form I XI' invariant. PROPOSITION 9.1. Let T be any orthogonaj transformation of R", u any C" junction with compact support in R". We have (9.4)
( A u )= ~ A(u').
Proof. It suffices to apply the considerations about the action on functions of linear transformations, and their " reflection " on the Fourier transforms (see pp. 53-54). We observe that, if T E O(n),det T = f 1, hence
But if T E O(n), its contragredient ' T - ' belongs also to O(n). By Fourier transformation, Eq. (9.4) is equivalent to A
(9.5)
A
( A U ) ' ~ -=' A(u').
This in turn reads (9.6) Recalling that U ' ~ - ' ( = Q v('TS),(9.6) follows from l1T5I2=
which is evident.
1512, Q.E.D.
As a matter of fact, we have also proved that if a linear transjormation T leaves A invariant, it is an orthogonal transformation. Formula (9.4) remains valid for an arbitrary distribution u, as we see readily by using the definition (7.24) of the transform of a distribution under a linear transformation of R". We introduce the average of a function f on a sphere of radius r = 1x1 centered at the origin:
Here idenotes the variable on the unit sphere S " - ' ; IS"-'l stands for the area of S"-' (for a computation of the area of the unit sphere, see the
70
BASIC 'EXAMPLES A N D FUNDAMENTAL SOLUTIONS
[Chap. I
Appendix to this section). Note thatfh(x), although a function of r alone, is not a function on R: ; it is a function on R". Nevertheless we shall denote it sometimes by f , ( r ) .
PROPOSITION 9.2. Given any C" function f in R",we have (9.7)
Proof. The proof we give here is not elementary; the student who is not acquainted with the theory of the Haar measure on locally compact groups may skip it and try to do Exercise 9.2. We denote by dTthe Haar measure on the compact group O(n) normalized so as to have dT = 1, and we set
SO(-)
The averagef' is O(n)-invariant :
f b ( T i l ~ ) = Sf(7Ti1x)dT= Wn)
1
f(Tx)dT
O(n)
[for dTis left- and right-invariant :O(n),being compact, is unimodular]. Consequently, the average off' on the sphere of radius 1x1 is equal to f '(x): (9.9)
(f">, =f h.
On the other hand,f,(Tx) =&(x), hence
t.e., (9.10) But by virtue of Fubini's theorem, we have the right to interchange the integrations over S"-' and over O(n). We derive
(f"@= ( f X ? hence, by (9.9) and (9.10), (9.1 1)
f h=fh*
Finally, by the rule of differentiation under the integral sign, we have
Sect. 91
SOLUTIONS OF THE LAPLACE EQUATION
71
which is equal, in view of Proposition 9.1, to
I.e.,
A ( f 9 = (MY.
(9.12)
Q.E.D.
Combination of (9.11) and (9.12) yields (9.7).
PROPOSITION 9.3. When acting on a function cp E C:(R") which depends only o n r = 1x1, (9.13)
Proof. Letf(r) be a function of r alone. Then
hence
1 =-f'(r)
r' 1
=-f'(r)
r
'(:
+ x~ - f r ( r ) ) (X')Z + -f"(r) rz
I
xi
r
(X')Z
- -f'(r).
r3
By summing with respect t o j = 1, . . . , n, we obtain
1.
A f ( r ) = (f (xj)')Y(r) +f"(r). r r3 j = l
Q.E.D.
Now let u be a distribution in R" which is defined by a locally integrable function depending on r alone, which we denote by f ( r ) . We have, given any cp E C,"(R"),
72
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
where (somewhat improperly) q , ( r ) = q h ( x )with r = 1x1 [i.e., here q , ( r ) is a function of the single real variable r E R + , whereas before cp,(x) denoted a function on R", constant on the spheres centered at the origin]. Let us replace cp by A q in (9.14):
=
1 S"-' 1 S t l f ( r ) A((pb)(r)rn-'dr
(by Proposition 9.2)
0
(by Proposition 9.3) = IS"-' I
+m
0
d f ( r ) ;i;{r"-'qb(r)}dr.
We assume that the derivative offwith respect to r is also locally integrable in R";we may integrate by parts:
Recalling that n > 1 and that cp has compact support, we see that the integrated term vanishes and therefore that we have (9.15) At this point, we note that
2
-jo+m dr = q,(O) = q(0). Therefore, if we could determine the locally integrable functionf, depending only on r, so as to have (9.16) we would have (9.17)
(4A.cp> = (A% 9) = q(0);
in other words u would be a fundamental solution of the Laplace operator. Moreover, u would be a rotation-invariant fundamental solution of A.
Sect. 91
73
SOLUTIONS OF THE LAPLACE EQUATION
Solving (9.16) is immediate. A particular solution is obtained by taking
f= F with (9.18)
F(r) =
I
1
when n = 2 ,
2;;logr
'
1 I-(n-2)ISn-1]
1 rn-2
when n > 2.
Since the element of volume in R" is r n - l dr do, we see at once that f(r) given by (9.18) is indeed locally integrable and that its derivative with respect to r is also locally integrable [the latter is in fact evident by (9.16)]. We may apply Theorem 3.1 and obtain Weyl's lemma:
THEOREM 9.1. The Laplace operator is analytic-hypoelliptic. This in turn enables us to defermine all the rotation-invariant fundamental solutions of A. Each one of them differs from the one defined by the function F in (9.18) by a solution u of the homogeneous Laplace equation Au = 0. In view of Theorem 9.1, u is defined by an analytic function in R"; it is necessarily rotation-invariant, and so must be the function defining u [denoted as beforef(r)]. Note, first, that
df . a r -= x~ -f ( r > dr j = l ad
is an analytic function of x, and that if k 2 1, (d/dr)(rkdf/dr) is a bounded function of r 2 0 on any finite interval [0, R],analytic for r > 0. According to (9.15) we must have, whatever the test function cp in R",
hence, by our previous considerations,
+
This means that df/dr= C r l - " , hence , f = [Cl(2 - n)]r2-" C , when n > 2, f = C log r + C, when n = 2. In all cases,fcan be bounded in intervals [0, R ] , as it should be, only if C = 0. In at1 cases, we find f = C , = const. Remark 9.1. We shall eventually prove that the value of a harmonic function at the center of a sphere equals its average on the sphere. This implies at once that the only harmonic functions which are rotation-invariant are the constant functions.
Let us summarize what we have obtained so far.
74
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
THEOREM 9.2. The function F(r) in (9.18) defines a rotation-invariant fundamental solution of the Laplace operator A. Every rotation-invariantfundamental solution of A is dejned by a locally integrablefunction of the form F(r) + const. Let us denote by E the fundamental solution of A defined by F(r) of (9.18). If we apply (9.14) to u = E, we obtain q&r) r log r dr (9.19)
Appendix
( E , cp> =
I
-
1 I t m q , ( r ) r dr
n-2
if n = 2, if n > 2.
0
Computation of the Area of the Unit Sphere
Assume n 2 1 and set I,, = la. exp( - I XI') dx. Clearly I,, = Zy and, by going to polar coordinates, Z, = IS"-'] exp( -r2) r"-' dr (if n = I , we take IS"-'[ = 2, assigning "area" 1 to each one of the two points x = 1 and x = - 1 which make up So). Thus Zz = 2n 5: exp( -rZ) r dr = n, and consequently, Zl= Assuming now n 1 2 and setting r z = s, we obtain
+
&.
+m
21, =
J"
l~n-11
e-sS-~+n/2
ds = I S"-' I T(n/2),
0
where T(z) = Jim e-'s2-' ds (Re z > 0) is the Euler gamma function. Thus we obtain, for all n = 1, 2, ..., IS"-'
I
= 2n"/2/r(n/2).
It is easy to derive more explicit values from this. If n = 2p, we have
I s 2 p - ' I = 2 n q p - l)! . I f n = 2p + 1, T(n/2)= r(p+ +) = ( p - 4) -.. +r(+), and we have seen that r(+)= 2Z1/I SoI = Zl= A, from which I S Z P l = 2P+'nP/[1. 3 . . * (2P - 1)l.
Exercises [In the exercises below A always stands for the Laplace operator in R", (d/dxj)', and r for the Euclidean norm, 1x1. We assume n 2 2.1 9.1. For each complex number a, let V, denote the two-dimensional linear space, over the complex numbers, having the basis formed by the two
Sect. 91
75
SOLUTIONS OF THE LAPLACE EQUATION
functions (in R"\{O)), r', ra log r. Show that A induces a linear map V, .+ V a - z and write down the 2 x 2 matrix representing it. Describe completely the injectivity properties of this mapping, for the various values of a. Now viewing V , as a linear space of distributions on R", describe AVZ-. . 9.2. Let m be an arbitrary integer, m 2 1. Using the notation and the results of Exercise 9.1, construct a sequence of distributions in R", Ek , k = 1, . . ., m, having the following properties:
(9.20)
= Ek,
(9.21)
EkE
k
. ,, m - 1, k = 1, ..., m .
= 1,.
VZk-,,,
AEI = 6;
Show that, for a suitable choice of the constant C,,,,,, the following is true:
(9.22)
if 2m - n is not an even integer 20, Em = Cm,,rZm-" i s a fundamental s o h i o n of A"';
(9.23)
i f 2m - n is an even integer 20, Em = Cm,, r Z m - "logr is a fundamental solution of A"'.
9.3. Derive from (9.22)that, if n is odd (and I > 0), a fundamental solution of I - A is given by
c~~+~,,(Jir)~~,
+m
F1 -- - r 2 - "
(9.24)
m=O
whereas if n is even, we may take p-
(9.25) F ,
=
-r2-"
i
1 C m + , , n ( J ~ r ) 2-mrZ-"log r m=O
+m
Cm+l,n(J~r)zm, m=p-1
where p = n / 2 (the student must show that the series above converge). Compute the coefficients in (9.24) when n = 3. 9.4. Show that, when n = 3 (and A 2 0),
F = - - e 1471 r
J l r
is a fundamental solution of il- A. How do you reconcile this fact with the result in Exercise 9.3? 9.5. Let 1 be a number greater than zero. Denote by C, the inverse Fourier transform of (I 51' I ) - ' . Prove that G, is a rotation-invariant fundamental solution of I - A. the Let p = ( p , , . . ., p,) be an arbitrary n-tuple of integers 20, Fourier transform of x P G AProve . that there is a constant C , > 0 such that
+
(9.26)
lG,p(5)l 5 CP(1+ l t l ) - ~ p ~ - z ,
v5 E R,.
Derive from (9.26)that the function G,(x) in R"\O goes to zero faster than any power of l/r when r -+ co.
+
76
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
9.6. Let G, be the distribution so denoted in Exercise 9.5. Derive from (9.26) that r2MG is continuously differentiable as many times as we wish, provided that we choose the integer M large enough. Derive from this that G , is C" in the complement of the origin, and consequently, that 1 - A is hypoelliptic. 9.7. Let G, be the distribution so denoted in Exercises 9.5 and 9.6. Prove that, if n is odd, n 2 3, or if n is even, n 2 4, we have the asymptotic equivalence, as r 0,
-
(9.27)
Gn(x)
-
(n - 2)
___
I
-
I S T F2'
whereas, when n = 2, we have (9.28)
G,(x)
--
1 2rc
- log r
(the student must state precisely the meaning of these equivalences, by estimating the size of the difference between the two members in a neighborhood of the origin). 9.8. Let E denote the fundamental solution of A in R" given by (9.19). Prove that u H E * u is a continuous linear map C:(R") + C'(R"). Hint: Take advantage of the fact that
(9.29)
9.9. Let A be a number greater than or equal to 0. Prove that, iff is any continuous function with compact support in R", every distribution solution of the equation (9.30)
(A
- A)u = f
belongs to C1(R"). 9.10. Let E denote the fundamental solution of A given by (9.19). Let R be any number greater than zero, y any point in R" such that I y 1 < R . Prove that the average of E(x - y ) over the sphere I X I = R is equal to F(R), given by (9.18). [Hint: Prove that the average of E(x - y ) over the sphere I x 1 = R is a solution of the Laplace equation, in the variable y , in the interior of that sphere, and that it is rotation-invariant; then apply the assertions in Remark 9.1.]
I0 Green's Formula. The Mean Value Theorem and the Maximum Principle for Harmonic Functions.
The Poisson Formula. Harnack's Inequalities We are now going to derive, for the Laplace operator, the analog of the inhomogeneous Cauchy formula (5.12) for the Cauchy-Riemann operator. It is useful to put things i n the proper perspective. We are dealing with a finite-dimensional real vector space, equipped with a positive-definite quadratic form; the operator we are studying is the one canonically associated with this quadratic form (via transfer of the quadratic form to the dual by the Riesz representation theorem and via Fourier transformation): the space is R", the form is the square of the norm, Ix12, the operator is Laplace's. Rephrasing this slightly we may say that, associated with the operator under study, we are given a Riemannian structure on R". Now let R be a bounded connected open set in R" whose boundary dR is a smooth, say C 2 , hypersurface. We make the hypothesis that R lies 012 one side of its boundary. Because of the Riemannian structure on R" we may talk about the normal to this hypersurface. On the other hand, dR divides R" into two regions: an inner region, R, and an outer one, the interior of R"\R. The former is relatively compact; but the latter is not. The normal line to JR at some point consists of two well-defined half-lines : the exterior normal, and the interior one (in other words, the normal to dR is naturally orientated and so is the tangent hyperplane to an).We may also define the unit iwtovfield v at each point of dR, normal to dR and pointing into R"\n. 77
78
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
Let xo be an arbitrary point of a n . If U is a sufficiently small open neighborhood of xo in an, we may find (n - 1) functions y', . . .,y"-', defined and Czin U,such that, at every point of U,the vectors grad fl( j = 1, . . . , n - 1) define a frame in the tangent hyperplane to a n , having the "correct " orientation. Then U is the image of an open subset Y of R"-' under the Cz diffeomorphism y' = (y', . ..,y''-I)x = xQ').It is standard to set
where ( ,) stands for the inner product in R". We may define the area element in U as the transfer from V of the (oriented) measure dy'. It is easy to check that this definition is independent of the choice of the local coordinates y' (having the right orientation). It coincides with the usual definition when U is a piece of hyperplane. By covering 8 0 with coordinate patches such as U we define a positive (oriented) measure on a n , canonically associated with the Riemannian structure and with the orientation of dR (both induced by those of R"), which we shall denote by da.
LEMMA10.1. Let cpl, ..., cp" be n functions, defined and C' in an open neighborhood 0 of SZ and valued in C ; set cp(x) = (cp'(x), . .., cp"(x)).We h u e (10.1)
div cp dx =
(cp, v ) do.
In
an
ProoJ By using a partition of unity, we see that it suffices to prove (10.1) when the support of cp is compact and contained in a small open neighborhood JV (in R")of an arbitrary point xo of 8R. We may assume that JV n aR = U is of the kind considered above, and that we have a system of coordinates (y', . .. ,y"-') in U, exactly as before; we add to this system a coordinate y" along the axis parallel to the normal to aR at xo. There is a C 2 function f(y') (in V) such that y"
+
af af (-a. ..., --
1)
af-1'
is normal to U and points into R"\n.Consequently the right-hand side of (10.1) reads (10.3)
Sect. lOJ
79
FORMULAS, FUNCTIONS, AND INEQUALITIES
where cp; (y') = cpj b', .. . , yn-', f(y')], j
= 1,
... , n. The left-hand side is
(10.4)
In (10.4) we set zj = y j if j < n, z" = y" -f ( y ' ) , @(z)= cp' (z', .. .,f-', z" f b ' ) ). This turns (10.4)into
+
which, because of the nature of supp cp, is equal to (10.3).
Q.E.D.
COROLLARY 10.1. k t u E Cz(@).We have (10.6)
Proof. Apply (10.1)with cp = grad u.
Q.E.D.
We have denoted by du/Jv the partial differentiation in the direction of the outer normal to ail: au
1 h
- ( x ) = lim - [u(x + hv) - u(x)]. av
k-.n
h#O
COROLLARY 10.2. Let cp E c'(O), v E C'(O; Cn).We have
(10.7) COROLLARY 10.3. Letf, g E C2(0). We have
(10.8) Equation (10.8)is Green'sformula.
Proof of Corollary 10.3. In (10.7)choose v = grad
(10.9)
I
(cp
A$
+ (grad cp, grad $)I
dx =
In this identity take first cp =f,$ = g, then cp = g, obtains (10.8).
+, $ E C2(0).We obtain
an
cp
a* do. 5
=f.By subtraction one Q.E.D.
80
BASIC EXAMPLES AKD NNDAMEKTAL SOLUTlOh’S
[Chap. I
We now derive the analog, for the Laplace equation Au =f
(in the neighborhood 6 of
a),
of the inhomogeneous Cauchy formula [see (5.10), (5.12)]. We shall assume that u E C2(8)and f E Co(f?). Let ln denote the characteristic function of the (bounded) open set R. By the Leibniz formula we have
Since both members have compact support, we may apply E obtain (10.10)
znu = E
* (xnf)
* to them; we
+ 2E *
where E is any fundamental solution of the Laplace operator. We need to know what the following distributions are:
Let cp be an arbitrary test function in R”,in fact having its (compact) support in 0. We have
dU
= -IZRq
do
(by Corollary 10.2).
In other words, (10.11)
A = -7-
?U
-do, 2v
the measure carried by dQ with density (with respect to do) - ? u / ~ vOn . the other hand,
by Corollary 10.1. Thus B = - A
+ B,, where
(10.13) The distribution B, is a distribution carried by tQ but it is not a measure carried by do. It is called a “double layer.”
Sect. 101
FORMULAS, FUNCTIONS, A N D INEQUALITIES
81
We take (10.11) and (10.12) into account in (lO.lO), but restrict the latter to
R. Notice that, when x E R, E(x - x ' ) is a C" function of x' in a neighborhood of an. Consequently we may evaluate A and B, as distributions in the variable x', on this function. On the other hand, E is a locally integrable function and
therefore E have
* (xnf)
(10.14) u(x0) =
1
can be computed by integration over R. Thus, if xo E R, we
E ( x 0 - x ) ~ ( xdx )
n
In this formula it makes no difference which fundamental solution E of A we choose to use. Customarily one likes to use the fundamental solution defined by (9.18). This is particularly convenient in the special case where R is anopen ball { x E R"; I x - xo I < R } centered at an arbitrary point xo and we wish to evaluate u(xo). It is natural to switch to spherical coordinates at x o , i.e., set x = xo + r i , where iis the variable point on the unit sphere S"-'. With the notation of (9.18) we have ( 1 0.1 5) ,I-
-
I
F ( r ) f ( x o+ r i ) rn- dr d i
I
R"-'F(R)
au
-(xo + Ri) dk
s n - ~ar
+ R"-'F'(R)Ss"-lu(xo + R 1 ) dk, since, in the present situation, do = R"-' d i . We have also used the fact that (d/dv)E(xo- x ) = (d/dr)F(r).If we apply (10.6) once more, we see that
whence, by (lO.l5),
(10.16)
u ( x ~= ) J { F ( l x l ) - F ( R ) ) f ( x )dx n
+ F ( R )J-a 0U ( X ) do,
assuming that R is the open ball of radius R centered at xo . Formula (10.16) is very important. One of its first consequences is the following striking property of harmonic functions, the so-called mean value theorem :
82
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[chap. I
THEOREM 10.1. If u is a harmonic function in an open neighborhood of the closed baN { x E R"; I x - xo 1 S R ) , we have (10.17) In other words, the value of a harmonic function at the center of a sphere is equal to its average on the sphere.
Proof. If Au = 0 for I x - x o I < R
+ E , E > 0, we have, by (10.16),
u(x0)= F'(R)R"-'
Js"- u(x0 + R i ) d i . I
We have, of course, IS"-' 1 R"-'F'(R)= 1 [cf. (9.16) and 9.18)]. COROLLARY 10.4. Under the hypothesis of Theorem 10.1, we haze (10.18)
Proof. One gets (10.18) at once by applying the elementary mean value theorem to (10.17). Corollary 10.4 implies the celebrated maximum principle: THEOREM 10.2. Let R be a connected open subset of R",u a harmonicfunction in R. Unless u is constant in R, we haze, for ecery xo E R, (10.19)
Proof. Suppose that I u(xo)1 = sup, I u(x)1 for some xo E R. Let R be any number greater than zero such that the ball ( x ; I x - xo I S R } is contained in R. Suppose. for instance, that u(xo) 2 0 [otherwise replace u by ii(x,)u]. By (10.17) we have necessarily u(xo) = sup3Esm-I I u(xo + Rj)I and therefore "-1
I sup 1 u(xo + R j ) I - Re u(xo + Ri)'I d i = 0. \yES"-l
The integrand is a continuous function, everywhere 20, on the unit sphere; hence it must be identically zero. Thus u(x) = u(xo) for any x such that I x - xoI = R ; and we may take any R < d ( x o , 2Q). This proves that u(x) = u(xo) in a full neighborhood of xo in R, therefore in the whole of R: Both functions are analytic in R or, if one prefers, the set where they are equal, Q.E.D. which is obviously closed, has just been shown to be open. Next we derive the Poisson formula for an open ball.
Sect. 101
83
FORMULAS, FUNCTIONS, A N D INEQUALITIES
First let us consider the case of a bounded open subset R of R",with smooth boundary aR and lying on one side of this boundary-otherwise arbitrary. We apply the identity (10.10) once more, taking (10.11) and (10.12) into account. We evaluate both sides of (10.10) at a point x1 lying in the complement of B. We obtain [cf. (10.14)] (10.20)
I
E(x, - x ) f ( x ) dx =
n
au a (E(x, - x ) - ( x ) - u(x) - E(x, - x ) ] do, an av av
recalling that f = Au in R (we may assume that u is a twice continuously differentiable function in In particular, if u is harmonic,
a).
Let us now assume that R is the open ball 1x1 < R and let x, be a point of
R different from its center, i.e., xo # 0. We shall apply (10.21) with Efx) the fundamental solution of A defined by (9.18) and with
x1 = ( R / I xo I )'XO ( x , is the point obtained from xo by "reflection" with respect to the sphere 1x1 = R). We have
(10.22)
[ X - i i I
= ( R / I x ~ I ) / x - x ~ .I , V X , 1x1 = R .
In view of this we have, according to (9.18),
) , =R, (10.23) E(xl - x ) = ( ~ x ~ ~ / R ) ' - ' E ( ~ ~ - x1x1 (10.24) E(xl - x ) = E(xo - x )
+ const,
if n 2 3 ;
if n = 2 .
1x1 = R ,
If we substitute this into the left-hand side of (10.21) and take into account the fact that the surface integral of du/av over aR, when u E C'@) is harmonic in Q, is equal to zero (Exercise 10.6), we see that, for any n 2 2, (10.25)
1
au E(xo - x ) - do = ( R / I xo I )n-' 1x1 = R ar
1
a
u(x) - E(x, - x ) do. IxI=R ar
We may take this into account in (10.14) (wheref- 0). We obtain
We note that if z # x (and r (10.27)
=
I x 1 ), then
84
BASIC EXAMPLES AND FUNDAMENTAL SOLUTIONS
[Chap. I
If we use (10.27) in the right-hand side of (10.26), we see that u(xo) is equal to the integral, over dR, of u(x) multiplied by
If we go back to the definition of x1 and to (10.22), we obtain at once the Poisson formula
It will be noticed that this formula, which was proved under the assumption that x , # 0, is in fact also valid when x, = 0. It then reduces to the mean value formula (10.17). In (10.28) let us assume that u(x) 2 0 everywhere and let us write ro = I xo 1. Noting that R - ro < I x - x , 1 < R r , if I x 1 = R , and applying the mean value theorem (Theorem 10.1) we obtain at once Harnack's inequalities
+
(10.29)
By solving the problem in Exercise 10.7 the student may check that the two constants in (10.29) are the best possible. From (10.29) we derive the classical result, also known as Harnack's inequalities : THEOREM 10.3. Let R be a connected open subset of R". K any compact subset of R. There is a constant cK > 0 such that erery nonnegatice harmonic function u in R satisfies the inequality : (10.29')
C K U(J')
5 U ( X ) 5 ,C
U(J'),
V X , y E K.
The proof is a simple application of (10.29) and is left to the student.
Exercises
(In the exercises below, R denotes an open subset of R".) 10.1. Suppose that R is bounded. Let u be a continuous real-valued functwice continuously differentiable in R, and such that Au 2 0 in R.
tion in
a,
Sect. 101
FORMULAS, FUNCTIONS, AND INEQUALITIES
Prove that for any x,
(10.30)
85
and r < d(x, , CR),
ER
u(x0)
5 1 sn-'
"- 1
+ ri) d i ,
Derive from (10.30) the maximum principle for u : (10.31)
vx,
E R,
u(xo) <= sup u(x). xean
What is the correct statement of (10.31) when R is not bounded? 10.2. Let u E C'(R), x , E R arbitrary. By using the Taylor expansion of order 2 or u about x, , prove that (10.32)
-Au(xo)=
lim n- I
Derive from this that, if u satisfies (10.30) for any x, E R, Au 2 0 everywhere in R. 10.3. Let I be a number greater than or equal to zero. Prove the maximum principle (10.31) for the solutions of the (metahurmonic) equation
(-A+I)u=O
(10.33)
inR.
10.4. Exhibit a solution of the equation (10.34)
d'u dx2
__
u=
-1
in the unit interval
IX I
< 1,
which does not obey the maximum principle. 10.5. Let I be a number less than zero. Construct an example in which a solution of Eq. (10.33) does not satisfy the maximum principle. 10.6. Assume that 98 is an open ball in R" and that u E C ' ( B ) is harmonic in a. Prove that we then have (10.35) Conversely, show that if M E C'(R) satifises (10.35) for any ball closure contained in R, u is harmonic in R. 10.7. Prove that the function
with
u(x) = ( I - Ix12)/lzo - ZI",
where zo is a point on the sphere S"- I , is a harmonic function in the interior 1x1 < 1 of that sphere (n 2 2). Show that there are points x o , I x, I = ro < 1, where one of the two inequalities in (10.29) (with R = 1) is in fact an equality. 10.8. Let u be a real-valued harmonic function in R". Prove that either u is constant or else u maps R" onto R.
86
BASIC EXAMPLES AND FUSDAMESTAL SOLUTIOh'S
[Chap. I
10.9. Let f denote a measurable function on the sphere 1 X I = R ( R > 0) whose values are either (finite) real numbers or else +a.Let us set [cf. (10.28)1
(10.36)
(the so-called Poisson integral off ). Suppose that f is lower semicontinuous on the sphere I x I = R, which means that, given any y o , I yo I = R , and any z < / ( y o ) , there is an open neighborhood Ua of yo on the sphere such that f(J) > 2,
V Y ua. ~
Prove that, given any y o , 1 y o I = R, the lower limit of I f ( x ) as x , 1 x I < R, tends to y o , is larger than or equal to f (yo). Derive from this that iff is continuous on the sphere 1x1 = R, I , is continuous on the closed ball 1x1 2 R and equal to f on its boundary. 10.10. Let R be an open subset of R".Derive from the Poisson formula (10.28)that if a sequence of harmonic functions in R converges uniformly on every compact subset of R, then they also converge in C=(R). 10.11. Let R be an open subset of R".Derive from the Poisson formula (10.28) that if a sequence of harmonic functions in R converges in the distribution sense, then they also converge in Cx(R). (Cf. Exercise 2.10.)
CHAPTER IT
The Cauchy Problem
This Page Intentionally Left Blank
I1
The Cauchy Problem for Linear Ordinary Differential Equations Consider a first-order linear differential equation in an open interval - T < t < T (T > 0) of the real line: du
(11.1)
- - a(t)u
dt
=f.
Suppose that both the coefficient a(t) and the right-hand sidef(t) are continuous functions of t , I t I < 2". We obtain at once a solution of (11.1) by taking (11.2)
u( t ) = eA(',')uO
+
Sd
eA("'"f(t'>dt',
where uo is an arbitrary constant and A(t, t ' )
=
f
1 tl < T ,
a(s) ds,
1 t'l < T.
I'
As a matter of fact, all the solutions of (1 1 .l) are obtained in this manner: They are all of the form (1 1.2). It is clear that u(t) is completely determined by the choice of uo . Observe that
(11.3)
u(0) = 2.40.
In other words, the problem (11.1)<11.3) has a unique solution, given by (1 1.2). Notice, incidentally, that this solution is once continuously dzfferentiable in 3 - T, T [ . We now go to first-order systems of linear ODES: (1 1.4) (11.5)
dU
dt
M(t)u = f,
u(0) 89
= uo
.
90
THE CAUCHY PROBLEM
[Chap. I1
Here u, f, uo are valued in C” and M ( t ) is an m x m matrix with continuous functions as entries. If all these entries were constant functions, i.e., M(t) = M, a complex m x m matrix, the theory of (11.4H11.5) would be a straightforward generalization of the theory of (11.1H11.3). The solution would be given by
+
(11.6)
~ ( t= ) etzHu0
Id
e(t-t’)Mf(r‘)dr‘.
But the situation is radically different when M ( t ) is nonconstant. In the constant coefficients case, set U(t, 1’) = exp((t - t’)M).What lies at the root of formula (1 1.6) is the fact that
dU - -- M U , Ul =Z (the m x m identity matrix). dt t = t’ But this ceases to be true, in general, when M is nonconstant. The thing to do, therefore, is to lay aside the exponential and concentrate on the solution of (1 1.7). It should be noted that problem (1 1.7) is of the same kind as (1 1.4)(11.5): I t is a system of first-order linear ODES. This time there are m2 equations in m 2 unknowns, submitted to rn2 conditions at time t = t’. It follows at once from the fundamental theorem on ordinary differential equations that (11.7) has a unique solution, which furthermore is a C’ function o f t , t’ for Irl < T, It’] < T . The correct generalization of (1 1.2) is then (11.7)
(11.8)
u(t) = U(t, O)u0
+
J-:
U(t, t’)f(t’) dt’.
The function U(t, 1’) is sometimes called the Riemann funcrion of the problem (11.4H11.5).
PROPOSITION 11.1. If
t , t’, t“
U(t, t’)U(t’,t “ ) = U(t, r”),
(11.9) (11.10)
are three points in rhe interval ] - T, T[, we haue
U(t, 1) = I ,
U(t, t’)U(t’,r ) = I.
Proof: The first identity in (1 1.10) is part of the definition of U(t, f‘). By differentiating it (with respect to t), we obtain
dU dt
-+-=O
dU dr’
when
t=t‘,
when
t = t’.
whence, by combining this with (11.7), (1 1.1Oa)
dU - -M(t’) _ dr’
Sect. 111
LINEAR ORDINARY DIFFERENTIAL EQUATIONS
91
On the other hand, if we differentiate the first equation in (1 1.7) with respect to t’, we obtain (1 1. lob)
The solution of the Cauchy problem (1 l.lOa)-(ll.lOb) is given by
dU dt, -- - U(t, ow’),
(11.1Oc)
as can be checked directly. Set now V(t, t ’ ) = U(t, t’)U(t‘, t). We have, by (11.7) and (ll.lOc),
dV dU _ -- dU -(t, t’)U(t’, t ) + U(t, t ’ ) - ( f ’ , t) dt’ dt‘ dt = - U(t, t’)M(t‘)U(t’,t )
+ U(t, t’)M(r’)U(t’,r) = 0 ,
hence V(t) = V(0) = I. Now set W(t) = U(t, t‘)U(t’,t ” ) . We have
dW dU - - _ - (t, t‘)U(t’,t ” ) = M(t)U(t, t’)U(f’,t “ ) dt dt
=MW
On the other hand, W = I when t = t“. By the uniqueness of the solution of (1 1.7) we conclude that W = U(t, t“). Q.E.D. Consider now an mth-order differential equation (11.11)
d“u
dm-1U
Lu = - + a,(t) -+ * . . + a,(t)u =f, dt” dt”-’
where the coefficients a j and the right-hand sidef are continuous functions in ] - T, T[. By the standard procedure, already described (see p. 29) we may transform it into an equation of the kind (11.4). This means that, here, M(t) =
(
0
0
-a,(t)
1 0
-~,,,-~(r)
and the components uj of u are given by (11.12)
0 1 -a,-z(t)
... ... * * -
-al(?)
92
[Chap. I1
THE CAUCHY PROBLEM
We see the meaning here of the initial condition ( I 1.5): Let us denote by uo,o . uo, . . . uo, the components of uo . Condition ( I 1.5) is equivalent. in the present context. to u‘”(0) = u ~ , ~ . 0
(11.13)
5 k 2 rn
- 1.
Condition ( I I . 13) is thus what generalizes ( I 1.3) to higher order equations. In accordance with the terminology for partial differential equations. w e shall refer to the problem ( 1 1.1 1 )-( 1 1.13) as the Cauchj,problem for the differential operator L. The next question is related to what generalizes formula ( I 1.2) in the higher order case. This can be ansuered by means of formula ( 1 1.8) relative to the equivalent system ( I l.4)+11.5) One must recall that ffr) = f ( r ) e , . We denote by ej(l S j 5 rn) the vector uhose components all vanish. except the jth one. which is equal to one. Observe then that m- 1
u(t,O)uo = k = O uo. u(t.O h , Let us call Uj(r. 1‘) thefirsr component of U ( t . r ‘ ) e j T , In . viebb of (1 1.12) we derive from (1 1.8)
From this identity we derive that U J t . 1 ’ ) is the solution of the Cauchy problem (11.15) (11.16)
LV=O.
at r
= r’.
d’V 10 -dt’ \I
if j # k (Ozjsrn-I). if j = k
In general, it is not easy to compute the functions Uj(r. r’). It is therefore convenient to give an equivalent statement of (11.14) in which all c’, have been eliminated but one. This can be done as follows. Introduce the function (1 1.17)
Clearly iio(t) satisfies the initial conditions (1 1.13). Let us set r(r) = u ( t ) - ijo(r).
We have (11.18) (11.19)
LL. =f - LEO.
r‘k’(0)= 0,
0 5 k 5 rn - 1.
Sect. 11 ]
LINEAR ORDINARY DIFFERENTIAL EQUATIONS
93
We may therefore apply (1 1.14) with v instead of u : u(t) =
I‘ 0
U,,r-l(t. r’)( f( t ’ ) - Lu,(t’)) dt‘,
whence
In the first integral, on the right-hand side of (11.20), we may integrate by parts and use the explicit expression of LC,. This leads to a more explicit formula for u(t), involving I Y , , - ~ (t ’~) ,and its derivatives. We leave it to the student to compute it, if he wishes to do so. In the case where the coefficients a j are constant, U m - l ( t .t’) is not difficult to compute; it is the first component of e(r - t ‘ ) M em 3
We have U m - l ( t ,t’) = U m - , ( t - t ’ , 0 ) = U(t - t ’ ) where U ( x ) is the function so denoted on page 30. With the notation of Sect. 4, we have E = H U where H is the Heaviside function and E is the fundamental solution of L with support in the nonnegative half-line. Formula (1 1.20) in this case can be rewritten as (11.21)
u =Go
+ ( H U ) * { H ( f - LEO)}- ( B U ) * {A(f- LEO)},
where B(r) = H( - t). The support of the first convolution on the right-hand side of (1 1.21) is contained in the half-line t 2 0, the one of the second convolution is contained in the half-line t 5 0. Let us go back, for some final remarks, to the general case of variable coefficients. Concerning the system (1 1.4) and its solution (1 1.8) we may note that if the coefficients M ( t ) and the right-hand side f(t) are N times continuously differentiable functions of t . 1 t / < T , the Riemann function U(t, t’) is an ( N + ])-times continuously differentiable function off, t’ in the same interval. Consequently, u(t) is CNf’in 1- T , T [ . Let us now return to the mth-order equation (I 1.11 ) via the equivalent first-order system ( I 1.4). Assume now that the a j and f are C N functions in ] - T , T[. Because of the relations (1 1.12) we reach the conclusion that u ( ~ - ’is) a CN+’function in 1-T, T [ . We may state
PROPOSITION 11.2. If the coeficients a j (1 5 j 5 m ) and the right-hand side f of Eq. (11.11) are C Nfunctions o f t , I t [ < T , the solution u given by (11.14) is a CNt” function in the interval ] - T, T[. COROLLARY 11. I . I f the coeficients a j and the right-hand side ,f are C“ functions in the interval ] - T , T [ , the same is true of the solution u.
94
THE CAUCHY PROBLEM
[Chap. I1
In this corollary one may replace C" everywhere by analytic. This follows at once from the fact that if the coefficient M ( r ) in ( I 1.4) is analytic with respect to t , the Riemann function U(t. r') is analytic with respect to t. r' (in the interval 1- T. TD.
Exercises 11.1. Suppose that for any two points 1. t' in the interval 1- 7.T [ the m x rn matrices M(r) and M ( r ' ) commute. In this case. give a simple expression of the Riemann function C(t. t ' ) of the problem ( 1 1.4H11.5). 11.2. Suppose that. in (1 1.4), the matrix M IS constant. Suppose furthermore that M is negative definite. i.e.. (Mv. v) < 0 for all v E C". v # 0 [we have denoted by ( . ) the Hermitian inner product In Cm].Let U(r) denote the solution of the problem (1 1.7). Shorn that there is a constant c > 0 such that the matrix norm of U ( t ) decreases at least as fast as e-" when t + + x. 11.3 Using Exercise 4.10. gike a complete description of the solution U of problem (1 1.7) when M is an arbitrary m x m matrix. 11.4. Give a complete description of the function U,,-l(t. t ' ) in ( I 1.14) in the case where the coefficients a, of L are constant (cf. Exercises 4.10 and 11.3). 11.5. Let L be the differential operator defined in (11.11). Suppose that the coefficients a,(r) and the right-hand side f ( t ) are C" functions in the interval 1- T, T [ .Suppose furthermore that f ( t ) = 0 if t < 0 and that I( is a distribution in ] - T. T [ of the form
with g a continuous function of t. t < T, vanishing when t < 0. Use the fact that Lu = j t o prove that u is a C" function in 1- T. T [ . 11.6. We assume that the coefficients uj (.j = 1. . . . . rn) of the operator L and the second member f in (1 1.1 1) are (complex-valued) C" functions in the real line. It follows at once from formula (11.14) that the problem (1 1.1 1)(1 1.13) has a unique solution I( which is also a C" function in R' (the uniqueness follows from the fundamental theorem about ODES). Let then u'(t) be the function equal to u ( t ) for r 2 0 and to zero for t < 0. Compute the distribution Lu'. In particular, compute the distributions Lu' when u = Uk(t,0). k = 0, .. . , m - 1 [see (11.14) to (11.16)]. 11.7. Suppose that the differential operator L in (11.11) has constant coefficients and let u' denote the function defined in Exercise 11.6. Prove that if c is any distribution in the real line. vanishing for t < 0, i.e., L' E P ;
Sect. 111
95
LTNEAR ORDINARY DIFFERENTIAL EQUATIONS
(cf. Exercise 4.1 1) such that LD= Lii, then necessarily we must have u = E. Is this fact still true when 1. has variable (but Cm)coefficients? 11.8. Give an example of a homogeneous first-order ODE with analytic coefficients (in the whole real line) whose solutions all vanish of infinite order at the origin. 11.9. Analyze completely the possibility of solving the “initial value problem ”
(1 1.22)
xu”
+
U l 24’
+ ut xu = 0, u(0) = uo.
(1 1.23)
Show that, in general (this is to be made precise), the Cauchy problem, obtained by adjoining the condition (11.24)
u’(0) = 241,
cannot be solved. Is it true that, when it can be solved, its solution is unique? 11.10. Let a, A and u be three nonnegative continuous functions on the closed interval [0, TI, uo a nonnegative constant. By comparing u to a solution of (1 l . l ) , prove that the inequality (11.25)
v ( t ) 5 uo
+
f(s) ds
+ Iza(s)u(r) ds, 0
0 5 t 5 T,
implies the Gronwall inequality (11.26)
u(t)s
(u,+Jb’f(s)ds)exp(~~u(s)ds), O s t j T .
What inequality, bearing on u, would have followed from (11.27)
Iu‘(f)l
5 u 1 + f(s) ds
assuming that a(s) is monotone?
+ Izu(s)u(s) ds, 0
0
t
5 T,
I2 The Cauchy Problem for Linear Partial Differential Equations. Preliminary 0bservations
In Example 2.2 we showed, using slightly different notation and under suitable assumptions of smoothness about the data uo , u,, that the problem 2
(12.1)
(i)'u-($)
(12.2)
when t
u=O,
= 0,
XER,
u = uo(x),
It1
au
-= ~,(x), dt
has a solution, given by (12.3)
~ ( tX), = t [ u o ( t
+ X) +
UO(X
- t)]
+ i[U,(t +
X)
- U,(X - t ) ] ,
where U , is an arbitrarily chosen primitive of u l. We have also stated that this is the only solution of (12.1)-(12.2). Now, clearly, this problem is the natural generalization of the Cauchy problem (1 1 . 1 I)-( 1 1.13) when the right-hand side f is identically zero. In addition to the time variable t , we now have the " transversal " or space variable x; the coefficients 'in (1 2.1 ) are no longer functions of t , acting as multipliers, but rather differential operators with respect to x. But we know how to revert to the multipliers' case: We perform a Fourier transformation with respect to x. Equation (1 2,.1) is transformed into (12.4) which is of the kind (1 1.1 1); as a matter of fact, its coefficients are constants depending on the parameter 5. 96
Sect. 121
LINEAR PARTIAL DIFFERENTIAL EQUATIONS
97
This method may even work for the inhomogeneous equation
(k) 2
(12.5)
(')'u
u = f ( t , x),
-
x E R,
It1 < T,
provided that the Fourier transform offwith respect to x,f(;(t,t),makes sense, i.e., is a tempered distribution. It suggests a general way of approaching problems of the kind (12. I)-( 12.5) and may give us valuable information about the possibility of solving them. There are obvious restrictions on the kind of data we are allowed to handle: They should all be tempered with respect to x. In addition to the fact that these are rather lax restrictions, we may also hope to get rid of them, once we know how to handle the problem when they are fulfilled. The crux of the matter is that the Riemann function of the ordinary differential equation, obtained by performing a Fourier transformation with respect to x in the partial differential equation under study, should be a tempered distribution with respect to t. We may put our basic examples to test along these lines. Take for instance the homogeneous Cauchy-Riemann equation. By Fourier transformation, it leads to the Cauchy problem (12.6)
aii
- - 56 =o,
=iio(5).
iil
at
t=O
Its solution is given by (12.7)
ii(t, 5)
= ii,(<)e".
When is this solution tempered with respect to speaking, it is when (12.8)
5 for all t , I t 1 < T? Roughly
Co(~)eT1" is tempered.
If T' < T. (12.8) implies that 6 0 ( ~ ) e T ' ~isc ' rapidly decreasing when [ 51 -+ +a.Then write
I
uo(x)= (25r)-" ekc-uo(5) &. It is clear that we may replace x by z
=x
+ iy provided that Jyl < T :
Moreover, if [ y1 < T, we may differentiate with respect to Z under the integral sign; and the result is obviously zero: uo(z) is a holomorphic function of z in the strip IIm zI < T. Consequently, uo(x) must be an analytic function in R.
[Chap. II
THE CAUCHY PROBLEM
98
This must come to us as a reminder-a of
au .au ax
(12.9)
-at+ I - = o
3
reminder that (I 2.6) is the transform
u/,;o = uo(x)*
In view of the first condition (12.9), u(t, x ) must be a holomorphic function of t + ix. Therefore uo(x) = u(0, x) must be analytic. As soon as we make this observation, we notice that a similar one will apply to all analytic-hypoelliptic differential operators. Let us state it with some generality:
PROPOSITION 12.1. Let R be an open subset of R",P a linear partialdifferential operator with respect to the variables t , x, with analytic coeficients, of order m > 0, in the product set { t E R' ; I t 1 < T ) x 0.If P is analytic-hypoelliptic and ifthe function f ( t , x) is analytic in this product set, the problem Pu=f,
(12.10) i
(12.11)
(i)u=uj(x),
XER,
It]
j = o ,..., m - 1 ,
when
t=o,
x E 0 ,
has no (distribution) solution unless the initial data uj are analytic functions in R. This observation settles also the case of the Laplace operator: The Cauchy problem (to give it its name) (12.10)-(12.11) will not have a solution for general data uj E Cm(12),j = 0, 1, when P = A. Let us look now at the heat equation. In this case, the transform equation reads (12.12)
If we take f
= 0, all solutions of (12.12) are of the form
W,0=no(O exp(-tltI2).
(12.13)
This is not tempered with respect to t when -T < t < 0 unless for any T' < T, Go(() exp(T'1
=-
Sect. 121
LINEAR PARTIALDIFFERENTIAL EQUATIONS
99
observation applies only to the existence of solutions and will be confirmed by a more rigorous analysis. As for the uniqueness of the solution, we already have a hint that it is not going to hold. For this we refer to Example 3.1 : Take R to be any open subset of the complement of the origin in R' ; the function denoted F(x, t ) in Example 3.1 is a C" function of ( x , t ) in Q x R', vanishes identically for t < 0, and hence satisfies (12.14)
dF F=-=O at
when t =0, X E R ,
and is a solution of the homogeneous heat equation (12.1 5 )
aF - = A x F in R x R'. dt
If instead of the heat equation, we look a t the Schrodinger equation, we face a different situation. The transform equation then reads (12.16)
The Riemann function of (1 2.16) is (12.17)
which is tempered for all real numbers t , t'. This suggests that solutions to the two-sided Cauchy problem will always exist-in any case, that they exist when the Cauchy datum uo(x), at t = 0, is tempered. Uniqueness, without additional restrictions, does not hold (Exercise 12.3). In connection with this, notice that the heat and the Schrodinger equations have an important feature in common, namely, the order with respect to t of both is 1, whereas their total order is 2. This lies at the heart of the matter, as the main theorem on uniqueness, Holmgren's theorem (Sect. 21), will show (see also Exercises 12.1 to 12.3). Last but not least, there is the waue equation. The Riemann function of the transformed equation (1 2.18)
is the function (12.1 9)
sin( 1 (1 (t - t ' ) )
ltl
which is tempered for any I, t' E R'. Furthermore, the order of the waveequation with respect to t is 2, the same as its total order. We are in a situation where we can expect both the existence and the uniqueness of the solution to
100
THE CAUCHY PROBLEM
[Chap. I1
the Cauchy problem. This is borne out by what we know in the case of a single space variable, and will now be confirmed (for any number of space variables) by a rigorous analysis, based on the information already gathered in Sects. 7 and 8.
Exercises Let 0 be a number such that $ < 8 < 1. Prove that -+ m (12.20) exp[ -i.rt - (iT)1’2x- (i~)’] dT F(t, x ) = 12.1.
-m
defines a C“ function in R2 which is a solution of the homogeneous heat equation (12.21)
aF - a2F _ dt
-dX2
(in R’).
[Note: The branches of the fractional powers za are always chosen in the “natural w a y ” : they are the ones that are positive on the positive half of the real axis.] 12.2. By applying the Cauchy integral theorem, show that the function defined in (12.20) is also equal to (12.22)
s,:,::
exp( -zt - z1’2x- z’) dz
(complex integration performed over the vertical straight line Re z = - a , where a is an arbitrary positive number). Derive from this that there is a constant C > 0, independent of a > 0, such that (12.23)
I F(t, x) I
5 c exp(at + Cue>
(C is allowed to’ depend on x but remains bounded as long as x remains bounded). Derive from (12.23) that F(t, x ) = 0 if t < 0. 12.3. By modifying the argument indicated in Exercises 12.1 and 12.2, construct a solution Fl(t, x ) of the Schrodinger equation,
(12.24)
1 aF1
a‘F,
i at
ax2
--=-
in R2,
which is a C“ function of ( t , x) in the whole plane, and vanishes identically in the half-plane t < 0. 12.4. Fix t > 0 in (12.20). What is the behavior of F(t, x ) as x tends to + a or to - a ?
Sect. 121
LINEAR PARTIAL DIFFERENTIAL EQUATIONS
101
12.5. Let P(5, T) be a polynomial of degree m in two variables, with complex coefficients. Assume that P(0, T) is of degree s m - 1 [whereas at least one partial derivative of P(5, T) of order m is nonzero]. By using the Puiseux theorem and generalizing the argument in Exercises 12.1 and 12.2, show that there is a C" function of (t, x) in R2,F(t, x), vanishing identically for t < 0, and satisfying, in the entire plane, the homogeneous equation (12.25) 12.6. Let A be a complex m x m matrix and consider the Cauchy problem in R2, au _ -- A
(1 2.26)
dt
au
ax
-3
u/
= uo(x),
t=O
where boldface indicates complex m-vectors. Prove that, if A is self-adjoint, i.e., A = A * ='A, the problem (12.26) has a unique solution which is a C" function o f t (in R') with values in the space YSp:(Cm) of tempered distributions with respect to x, valued in C" (i.e., of m-vectors whose components are tempered distributions in x) for every u,, E 9:(Cm). 12.7. Suppose that the matrix A in Exercise 12.6 has distinct eigenvalues. Prove that ifnone of these eigenvalues is real, the Cauchy problem (1 2.26) has a solution only if uo(x) is an analytic function of x. 12.8. As in Exercise 12.7, suppose that the eigenvalues of the matrix A are all distinct. Prove that, if all these eigenvalues are real, then the Cauchy problem (12.26) always has a solution, whatever the continuous function uo in R'. Give an explicit expression of the solution and use this expression to describe the support of the solution when we assume that uo(x) > 0 for 1x1 < 1 and uo(x) = O for 1x1 2 1. 12.9. Consider the Cauchy problem (12.27)
(34
- = At A x u
dt
in R n f i
ui:=o = uo(x) in Rn,
where uo(x) E L2(R") (we could have taken uo as well to be a tempered distribution). Show that if A > 0, problem (12.27) has a solution, which is a distribution in Rn+l,and in fact an analytic function for t # 0.
I3 The Global Cauchy Problem for the Wave Equation. Existence and Uniqueness of the Solutions
In this section we study the Cauchy problem (13.1) (13.2)
a2u -at2
when t = 0,
A x u =f ( t , x ) in R"", u = uo(x),
au
- = u l ( x ) in R". at
We shall prove the existence and uniqueness of the solution u(t, x ) for a very general class of data f and u,, , ul. For the moment, we make a very restrictive assumption on the data : (13.3)
f ( t , x ) is a C" function of (1, x ) in R"" whose support is contained in sume cylinder {(x, t ) ; I x I 5 R} ;
(13.4)
uo , u1 are C" functions of x in R", having a compact support.
Under these circumstances, we may perform a Fourier transformation with respect to x in both (13.1) and (13.2). We obtain (13.5) (13.6)
I?,,+
lt121?=.r(t,5)i n R ' x R,,
W, 0= fio(O,
4(0,
0= ill(()
in R,,
where the subscripts mean differentiation. In order to express u in terms off and uo , u1 we shall apply formula (1 1.14). We have already computed U l ( t , 1 ' ) in Sect. 7. It is given by (13.7) 102
Sect. 131
WAVE EQUATION. EXISTENCE AND UNIQUENESS
103
Since [see (1 1.19, (1 1.16)] Uo(t,0) is the solution of
we immediately obtain
Uo(C 0) = cos( I 5 I 0.
(13.9)
By (1 1.14) we have
First take t > 0. Then we may write
Observe that the first term, on the right-hand side of (13.11) is equal to
In view of this, (13.11) can be rewritten, for t > 0, as
(13.12)
wt,
5) = q
o a E+@,5) + 4(5)E+(44)
+ W(t)f(t,4)) * E+(4 51, (t)
where E, means that the convolution should only be computed with respect to t . For t < 0, we have (13.13)
w,5 ) = - Eo(0 a mt,r) - W ) E - ( t , 5) + W(-
t m ,0
3 * E-(4 5). (0
In these formulas, E+ (resp. E - ) is the fundamental solution of the wave equation with support in the forward (resp. backward) light-cone, E, (resp. E - ) its Fourier transform with respect to x [recall that E+ - E- =
104
[Chap. I1
THE CAUCHY PROBLEM
sin( I 5 I t ) / I 5I 1. If we now perform an inverse Fourier transformation with respect to x, we obtain
+ { H ( t ) f ( t ,x)> * E+ + { H ( - t ) f ( t , XI> * E- . ( t . x)
(1,
x)
Once this formula is written down, it is apparent that much of our restricting conditions (13.3)-(13.4) on the data can be lifted. Indeed, the convolutions in (13.14) make sense when uo(x), ul(x) are arbitrary distributions in R" and whenf(t, x) is a function of t , say continuous, with values in the space 9'(RZ) of distributions with respect to x. This is due to the fact that, for each t , the supports of E , and E- as distributions in x are contained in the ball I X I 5 1 t 1. As for the convolutions with respect to t , they make sense because both the support of E+ and of H ( t ) f ( t , x),are contained in the half-space t 2 0, whereas those of E- and H( - t ) f ( t , x) are both contained in the halfspace t 5 0 (we recall that, on the real line, the distributions with support in the half-line t >= 0 form a convolution algebra). Thus formula (13.14) extends to very general data [the only restriction is thatf(t, x ) should be a continuous function o f t with values in 93 But we may ask whether u(t, x) given by (13.14) satisfies (13.1) and (13.2). That it satisfies (13.1) is obvious. Let us examine (13.2). Let R denote a bounded open subset of R". If I t I < T, the supports of
E-
.
E+ - E - ,
viewed as distributions in x, are contained in the open ball 1x1 < T. Let us denote by R, the set of points x E R" whose distance to R does not exceed T. Let c1 E C,"(R"),c1 = 1 i n R,, a = 0 off R,,, ( E > 0). By well-known properties of convolution, we see that (13.15)
if
It1
< T,
XER,
(E+- E - ) )
+ {a(x)uI(x)} * ( E , - E - ) (1)
+ (ff(t)a(x)f(4
x)}
*
E,
(1, x )
+ (4- t ) W f ( t , x)> * E - . (1,
x)
But the right-hand side in this formula is somewhat similar to the right-hand side of (13.14) under assumptions (13.3) and (13.4). It is true that now cl(x)uo(x), a(x)u,(x), a(x)f(t, x) are not necessarily smooth functions. But their supports with respect to x are contained in a fixed compact set. Therefore, by the Paley-Wiener theorem, their Fourier transforms with respect to
Sect. 131
WAVE EQUATION. EXISTENCE AND UNIQUENESS
105
5 . A formula of the kind (13.10) is also valid here: One must of course replace u j ( x ) by v,(x) = cr(x)uj(x), j = 0, 1, f ( t , x) by g(t, x) = cr(x)f(t,x), and also u(t, x) by v ( f , x), equal to the right-hand side of (13.15). Suppose thatfis a continuous function of t valued in 9 ; . Then g is a continuous function o f t with values in the space of tempered distributions in x, Y:, 2nd J ( t , 5) is a continuous function of t valued in 9;. By virtue of (13.10) we see that, when t 0, G ( t , 5) converges to G o ( ( ) in 9':, while x are analytic functions of
--f
(1 3.16)
k ( t , 5) = - co(5)I5 I sin( I t I t ) + GI(() cos( 15 I t ) +J'B(t', 0
5) cos( I 5 I ( t - t')) dt'
converges to Gl(t).This implies that, when t + 0, u(t, x) converges to uo(x) and u,(t, x) converges to ul(x) in the sense of distributions in R. As t2 was an arbitrary bounded open subset of R", this convergence is also valid in R". Moreover, (13.16) shows that ut(t, x) is a C' function of t with values in g'(R"), hence (cf. Proposition 11.2) u(t, x) is a C 2 function of t valued in 9'(R"). The reader will notice that we are here dealing with differentiable functions valued in (infinite-dimensional) locally convex spaces, such as 9'(R"). On the subject of differentiable functions valued in Banach spaces, see Sect. 39. If E is a locally convex Hausdorff space, it is in particular a topological space, and we know therefore the meaning of a continuous functionf, defined in another topological space X , and valued in E. If X is an open subset of the real line, or the Euclidean space R", we may define the deriuative off, or the gradient off, at a point of X . Consider for instance the case X = R' and denote by t the variable in R'. Then f ' ( t o ) = lim h - ' [ f ( t , + h) -f(t0)] as h # 0 converges to zero. The only novelty, compared to the standard situation, is that the convergence takes place in E and that the latter can be equipped, in a natural way, with various topologies, for instance the strong (or initial) topology or the weak, u(E. E'), topology. Throughout this book, we shall most of the time use the strong topology-in a Banach space, this means the topology defined by the norm. In spaces such as 9' or Y' (the space of tempered distributions), it does not make any difference: for sequences, or for filters with countable bases (which is what is involved in the definition of the derivative), it is equivalent to the weak topology. Thus to say that a distribution T(t), depending on the real variable t (say, in some interval ]to, t l [ ) is continuously differentiable, resp. Ck(0 5 k 5 +m), is the same as saying that, given any test function 4 E Cp, the scalar function (T(t), 4 ) is continuously differentiable, resp. Ck.This is also valid when T(t)is tempered and regarded as valued in 9".One fact that will be used, and should perhaps be recalled, is that a functionf, defined and C" in an open subset of R", X ,
106
THE CAUCHY PROBLEM
[Chap. I1
valued in the space of C" functions in an open subset Y of another space R", is the same thing as a C" function in the product X x Y. Consider now a distribution h(t, x) in R"" which is a C2 function of t valued in .9'(R:) and which satisfies h,, - A, h = 0 in R"",
(13.17) (1 3.1 8)
h = h, = O
in R"
when t =O.
Let us set h,(t, x ) = H(t)h(t, x). The Leibniz formula and the initial conditions (13.18) imply a t once that
hence (13.19)
(;)'h+
- Axh+ = 0 in R"".
Since the support of h, is contained in the half-space t 2 0, the preceding considerations about convolutions in formula (13.14) apply. We may now write 0 = E , * ( O h , ) = (OE,) * h+ = h+ where 0 stands for the wave operator. This proves the uniqueness of the solution u(t, x) whose existence was proved above. Summarizing: THEOREM 13.1. Let uo, uI be any two distributions in R",f( t , x ) any continuous function of t E R' valued in 9'(R:). There is a unique solution u(t, x) of the Cauchy problem (l3.l)-(Z3.2) which is a C2 function of t with values in .9'(R:). It is given by formula (13.14). Formulas (13.5) and (13.16) show also that when the data are C" functions, the same is true of the solution: THEOREM 13.2. Let u o , u1 be any two C" functions in R", f any C" function in R"". The solution u of problem (13.I)-(13.2)given by formula (13.14) is a C" function in R"". It should be understood that Theorem 13.2 is not a t all a hypoellipticity result: It is not a statement of regularity about every solution of (13.1) but only about that unique solution of (13.1) which satisfies (13.2) also. Rather than looking for C" solutions to the Cauchy problem with C" data, one may want to use data with limited regularity of some kind and obtain solutions with as much regularity as possible. A good method of measuring regularity with respect to x is provided by the so-called Sobolev
Sect. 131
WAVE EQUATION. EXISTENCE AND UNIQUENESS
107
spaces in R".They measure it by taking square-integrability as the zero-level regularity. Then the degree of regularity can be any real number s: If s is a positive integer, to say that u is regular of degree s is to say that all derivatives of u of order 5 s are L2 functions; if s is a negative integer, it is to say that u is a sum of distribution derivatives of order $s of L2 functions. The great advantage of this approach lies in the fact that the Fourier transformation works so well in the space L2(R"):One can fully exploit the Plancherel theorem. This is what enables one to define the degrees of regularity when they are not integers. The formal definition of the sth Sobolev space H"(R")is as follows: Definition 13.1. A tempered distribution u in R" is said to belong to H"(R") u is a square-integrable function with respect to the measure (1 + 15 12)5 d t .
if the Fourier transform 0 of
We repeat that here s can be any real number. One may turn H" = H"(R") into a Hilbert space by setting the norm to be ( 1 3.20)
Denoting by .F the Fourier transformation in R" and by may introduce the operator (1 3.21)
T"u = $-'(I
9 - l
its inverse we
+ 1t12)"'29~.
Then we see that T" is an isometry of H" onto Ho = L2: via T",H" is a copy of L 2 . Let Y denote the space of C" functions in R", cp, such that I D"cp( decreases at infinity faster than I x ( - k , whatever the multi-index ct andtheinteger k. The Fourier transformation is an isomorphism of Y x(equipped with its on the other hand, it is obvious that multiplicanatural topology) onto 9,; tion by (1 + 15 I2)s/' is an isomorphism of 9, onto itself. We conclude that T" is an isomorphism of 9,onto itself. In fact, when s varies, the operators T"form a one-parameter group of automorphisms of 9,. In particular, T-" maps Y c L2onto Y c H". Since it is an isometry and Y is dense in L2, we reach the following conclusion, used below:
PROPOSITION 13.1. The space Y of C" functions in R", rapidly decaying at injinity, is dense in every space H" ( s real). Since the space of test functions C : is dense in 9, it is also dense in every H" (for more on the Sobolev spaces, see Sects. 24 and 25). Let us now return to formula (13.10). Let us multiply both members by (1 lt12)"'2 and observe that
+
108
THE CAUCHY PROBLEM
[Chap. I1
hence, by virtue of (1 3.20), (13.22)
IMt,
.>IIs
5
+ 2 SUP(l7 l ~ l ~ l l ~ l l l s - l
II~OllS
Because of the density of Cp in H" this shows that formula (13.10) remains valid when uo E H",u1 E H ' - ' , and f is a continuous function of t valued in HS-1 . It is not true, however, that u is then a C 2 function of t with values in HS: for instance, if we differentiate (13.10) once with respect to t , a new factor shows up, as in (13.6). Thus, if the hypotheses stated above hold, we see that u, is a continuous function oft with values in H S - ' . We have
One can push this argument one step further and prove THEOREM 13.3. Suppose that u,, E H S , u1 E H S - ' and that f is a continuous function of t E R' valued in H S - ' . Then the solution u of (13.1)-(f3.2)is a C kfunction o f t valued in Hs-k, k = 0, 1, 2. More generally,
THEOREM 13.4. Suppose that uo E H', u1 E H'-' and that f is a C"'function of t valued in IT"-'. Then the solution u of (13.1)-(13.2)is a Ckfunction of t valued in H s P kk ,= 0,, . . , m + 2. At this stage, it is traditional to discuss the dependence on the data of the solution u whose existence and uniqueness are asserted in the preceding theorems. This can easily be done by means of the closed graph theorem, after having observed that formula ( I 3.14) has the following obvious implication: THEOREM 13.5. The mapping (13.24)
(uo 9
%,f)H u,
where u is given by (13.14),is a continuous linear map of the triple product 9 ' ( R " ) x W(R") x C0($ ; 9'(R;)) into C2(R:; B'(R;)).
Proof. It suffices to apply formula (13.15), which reduces the proof to the case where uo , u1 have compact support while the projection on the space R" of the x variables of suppfis compact. We may then make a Fourier transformation with respect to x and verify the statement on formula (13,10), where it is obvious. Q.E.D
Sect. 131
WAVE EQUATION. EXISTENCE AND UNIQUENESS
I09
Then one obtains at once the following results: THEOREM 13.6. The mapping (13.24)is a continuous linear map of
C-;(R") x C"(R") x C'(R"+') into C%(Rflf1).
Indeed, the graph of this mapping is easily seen to be closed (cf. Remark 15.1), and hence continuous by the closed graph theorem. Similarly, one proves the following extension of Theorem 13.3:
THEOREM 13.7. The mapping (13.24) is a continuous linear map of H"(R") x HS-'(R") x Co(R:; H'-'(R")) into
n
c~(R: ; ~ 5 - - k ( ~ n ) ) .
k=O.1,2
We recall that if R is an open subset of R" and X a Banach space, the space Ck(R; A') is equipped with the topology of uniform convergence, on every compact subset, of the functions and of all their derivatives of order s k . We also recall that the topology of an intersection A n B is defined as follows: A set is the neighborhood of a point in A n B if this is true for the topology induced on A n B by A or for the one induced by B.
Exercises
13.1. In formula (12.3) suppose that both uo and u1 have compact support. Prove that (12.3) is equivalent to (13.10) in which 17 = 1 a n d f - 0. 13.2. Let 8 denote the angular variable on the unit circumference r (thus varies from 0 to 27r). By using Fourier series expansions, solve the Cauchy problem (13.25) (13.26)
u = uo(e),
dU 2t
- = ul(0)
when
t = 0, 0 5 6 5 27r,
where uo and u1 are two C" functions of e (on r).Do the same when uo and u1 are any two distributions on r. Compare with (12.3) and (13.10). 13.3. Using the same notation as in Exercise 13.2, state the analog of Theorem 13.4 for the Cauchy problem (13.25)-(13.26) (in particular the student must define the Sobolev spaces H k on the unit circumference r).
110
THE CAUCHY PROBLEM
[Chap. I1
13.4. Using the same notation as in Exercise 13.2, solve explicitly the problem (13.25)-(13.26) when uo = 0, u1 = a(@), the Dirac distribution on the unit circumference r (i.e., the mass 1 at 8 = 0). Consider then the periodic distribution 6 on the real line, equal to the Dirac measure in the interval -27-c < t < 211. Solve the Cauchy problem (12.1)-(12.2) when uo = 0, u1 = 8 by applying both formulas (12.3) and (13.10). 13.5. Consider the Cauchy problem (13.1)-(13.2) in the case n 2 2. Suppose that the data f, u o , u1 are rotation-invariant functions of x E R", i.e., are functions of 1 x Jalone. Prove that the solution u is then also a function of I x ( alone (for fixed time t ) . Show that formula (13.14) can be simplified to involve integrals, and more generally expressions where only the variables t and 1x1 intervene. Write it down explicitly in the cases n = 2, 3, using the expression of E , computed in the Appendix to Sect. 8. 13.6. Consider the Klein-Gordon operator
+
(13.27)
a2
K,,, = - - Ax at2
+ m2.
Give the formulas that are the analogs of (13.7) and (13.9) when m is real, m # 0. Extend Theorem 13.1 to this case by verifying that the proof can indeed be extended. Is Theorem 13.1 still true when rn is any complex number?
I4 Domain of Influence, Propagation of Singularities, Conservation of Energy
The solution 14 of the Cauchy problem (1 3. I)-( 13.2), whose existence and uniqueness are stated in Theorem 13.1, is the sum u w of the solution v of the problem
+
a2u
(14.1)
-at2
(14.2)
when t = 0, u = u&),
and of the solution (14.3)
(14-4)t
A,u = 0 in R"",
14' of
au
- = ul(x) at
in R",
the following problem:
-aZ _w at2
Ax w = f ( t , x ) in R"",
aw when t = 0 , w = 0 , - -- 0 at
inR".
One moment of reflection easily shows the meaning of each one of these problems. In the first one, an oscillatory phenomenon is generated at time zero, with a certain initial intensity and a certain initial velocity of propagation: It then propagates, both in the future and in the past, through a medium where no other disturbance is created. The function u measures, at any given point in space and time, the amplitude and the phase of the oscillation. In problem (14.3)-(14.4) there is no "initial impulsion ": but a "field" is
t Throughout this section, unless otherwise specified, uo and u1 will be two distributions in R" and f ( t , x ) will be a continuous function of t in R' valued in 9:(R"). 111
112
THE CAUCHY PROBLEM
[Chap. I1
generated in the medium, in certain zones (where " sources " and " sinks " are located), in a manner that varies with time; w measures the resulting oscillation. It is convenient to consider the two problems separately. We shall look mainly at the first, (14.1)-(14.2). Its solution is given by (13.14):
Let us look, for instance, at v in the half-space t > 0. There we have
(14.6)
THEOREM 14.1. Let R be any open subset of R",t any number greater than zero. The values of v(t, x ) in R depend only on the values of uo(x) and u I ( x ) in the set
R, = { x E R"; dist(x, R) < t}. Proof. It suffices to show that if both uo and u, vanish in R,, v(t, x) vanishes in R. We have, in such a case,
where supp, stands for the support of distributions in the x variables ( t is fixed); A B stands for the vector addition of the subsets A and B of R". Recalling that the support of E + ( x , t ) as a distribution in x is contained in the ball 1x1 5 t , we see that
+
A similar conclusion holds for u1 * E + .
Q.E.D.
(X)
The preceding statement is usually referred to as the theorem on the domain of iaguence. The open set R, can be constructed in the following manner: View R as a subset of R"", contained in the hyperplane corresponding to time t , i.e., identify R with 6 = ( ( x , t ) ; x E Q}. Let R,- denote the union of all the translates of the backward light-cone with vertices at the points of 6. Then R, is the intersection of R, with the hyperplane corresponding to time zero (see Fig. 14.1). Suppose for a moment that uo and u1 are C" functions in R". Then the preceding considerations about the domain of influence can be made punctual: For fixed t , the value of v ( t , x ) at xo (v is now a C" function in R"") depends
Sect. 141
INFLUENCE, PROPAGATION, AND CONSERVATION
FIG.
113
14.1
only on the values of uo and u1 in the ball I x - xo I 5 t . This can be checked directly on the convolutions in (14.6) or else one may apply Theorem 14.1 and make R converge to the point xo . As a matter of fact, for this argument to work, uo and u1 need not be C“ : They must have sufficient regularity so that v be a continuous function of (f, x). The above considerations can be reverted in time: Rather than asking where the phenomena observed in a certain region of space, at a given time, did originate, one might ask what regions will be influenced by something happening in some set at time zero. More rigorously, this is to ask what is the support of u(t, x), knowing that the supports of uo and u1 are contained in K? Of course, in the half-space t > 0, (14.7)
supp v c K
+ supp E, ,
where K has been identified with the set of points (x,0), x E K. Because of the Huyghens principle, there is now a significant difference between the case n > 1, n odd, and the other cases. This is best seen by taking K to be a point, say K = ( 0 ) [for instance, one may then take uo = S(x), u1 = 0; more generally, one may take uo , u1 to be arbitrary linear combinations of the Dirac measure and of its derivatives, with respect to XI. This would correspond to something like flashing a light at some point in space, at time t = 0, with an “infinite” intensity and turning it off at time t > 0. If n = 3 or, more generally, if n >= 3 and odd, at an arbitrary time f, > 0, the oscillation, measured by v(x, l ) , is concentrated on the sphere 1 x - xoI = t o . We may rephrase this as follows: Suppose that an observer is located at some point in space, at a distance r > 0 from 0: the oscillation is coming toward him as t goes from 0 to r , reaches him at time t = r , and leaves him behind as t > r increases. The fact that it reaches the observer at time t exactly equal to r is simply due to our choice of the unities measuring time and space; we have assumed throughout that the velocity of propagation is equal to one. The general case, where the velocity is any number V > 0, is easy: The oscillation reaches the observer at time r / V .
114
THE CAUCHY PROBLEM
[Chap. I1
When n = 1 or when n is even, it is also true that the observer is reached by the wave at time r = r but the difference with the case n = 3, 5 , 7, ... , is that now he will continue to perceive the oscillation even at times t > r (if he has at his disposal instruments which are sufficiently sensitive). The comparison with the second problem, (14.3)-(14.4), is also instructive. Its solution is given by (14.8) w(t, X) = { H ( t ) f ( t ,x)) * E+ + { H ( - t ) f ( t , x ) } * E - . Here * means convolution with respect to all the variables, t and x . In the half-space r > 0 we have (14.9) w ={H(t)f}* E , . It is worth noticing that the only right-hand side contributions representing the effect of sources and sinks lying in the medium (and variable in time), which can influence events a t time t originates between zero and t (the influence of events that occurred in negative times is suppressed by the imposition of values at zero time). This is in agreement with the causality principle. Of course, (14.10)
supp w c supp H ( t ) f + supp E+ .
This is the analog of (14.7). But in (14.7) K is a compact subset of the hyperplane t = 0, whereas here supp(H(t)f) can be any subset of the closed halfspace t 2 0. The difference can be best seen by looking at punctual sources. Suppose, for instance, that f ( t , x) = l ( t ) 6(x). Then H(t)f(r, x) = H(t) 6(x) would describe something like turning on a light, at a given point of space (taken as the origin of space), and keeping it on forever after. The support of H(t)f(t, x ) is the half-line t 2 0, x = 0. The right-hand side of (14.iO) is equal to the forward light-cone r+ regardless of the parity of the space dimension n. This agrees with the customary experience that if a source keeps broadcasting, an observer located anywhere in space will keep receiving the broadcast. Rather than sources (of light, of sound, of radio waves, etc.), one may consider sinks or obstacles, or any combinations of sources and sinks [this corresponds to the sign of the right-hand sidef(t, x), assumed to be a real-valued function-or, more generally, a real-valued measure]. With obstacles one would of course encounter shadows and light absorption, and so on. We study now the so-called propagation of singularities. We assume that uo and u1 are C" functions outside of a certain closed subset S of R",in other words, we assume that the singular supports of uo and u1 are contained in S and we try to locate the singularities of the solution v of (14.1)-(14.2). The analogous question can be raised about problem (14.3)-(14.5). In both cases, the answer can be obtained by looking at a convolution (14.1 1)
g=E*h
Sect. 141
INFLUENCE, PROPAGATION, AND
CONSERVATION
115
where E , g , h are distributions in R"", the support of E is contained in r+, and the supports ofg and h are contained in the half-space t 2 0. Observe that the two terms in the expression (14.6) of v are indeed of this kind: For instance, the second one is obtained by taking E = E , and h(t,x) = b(t)u,(x). LEMMA14.1. Under the preceding hypotheses, (14.12)
sing supp g c sing supp E
+ sing supp h.
Proof. Let a (resp. 8) be a C" function in R"" whose support lies in the neighborhood of order E > 0 of A = sing supp E (resp. of B = sing supp h) and which is equal to one in the neighborhood of order ~ / of 2 A (resp. B). We have E = aE (1 - a)E, h = /?h (1 - P)h and both (1 - a)E, (1 - B)h are C" functions; hence 9 - (W * (Bh)
+
+
+ +
is a C" function. But supp[(aE) * (Bh)] c supp a supp fl is contained in the neighborhood of order 26 of A B. By taking E -+ 0, we reach the conclusion that g is a C" function in the complement of A B. Q.E.D.
+
In applying this to our situation, we have E = E , or E = aE,/at, h = G(t)uj(x) ( j = 0, 1) or h = H(t)f ( t , x). We may therefore state THEOREM 14.2. Suppose that uo and u1 are C" functions in R"\S (where S is a closed subset of R"). Given any t > 0 , the distribution with respect to x, u ( t , x), the solution of (14.1)-(14.2), is a C" function in the complement with respect to R" of the set S(t, = { X E R";3y E S , ( X - y ( = t ] . There is an analogous statement concerning the solution w of (14.3)-(14.4) which we leave to the student. The physical interpretation of Theorem 14.2 is not difficult. Since support and singular support of E , coincide when n = 3, 5 , 7, . . . , it is best seen when n = 1 or n is even. Take, for instance, n = 2: In concrete terms, this corresponds to a two-dimensional propagation medium, like the surface of a lake or a sea. In the center of the lake, at time zero, there takes place a very brief but very intense stormlike event ("very" means of course "infinitely"); from thereon, a wave, something like a tidal wave, propagates. For a fixed observer on the lake, at a distance r from the center, nothing happens until he is reached by the wave, at time t = r . At that moment, he is likely to find himself in a very violent situation: all the energy of the initial "explosion " or storm seems to be concentrated at the wave front. If our observer survives, once the wave front passes him by, he will find himself in a smooth " swell," whose amplitude, as measured from the level of the lake at rest, decreases to zero as time goes to a.
+
1I6
[Chap. I1
THE CAUCHY PROBLEM
It should be observed that the set S(,)has a certain thickness, determined by the thickness of the set S containing the singularities of uo and u l . If S is compact, the thickness of S(f,will be finite: Here one could think of a star which has exploded and is at the center of an expanding shell of burning matter: S(,, is the shell (at time t ) . We may consider the outer boundary of S(,);it is the set S(,) = {x E R"; dist(x, S ) = t } . The S(,) form a one-parameter family of hypersurfaces; if t , < t 2 , is contained in the interior of If we assume that these hypersurfaces are sufficiently smooth, say C', we may introduce their orthogonal trajectories. These are usually called light rays: Their interpretation in optics needs no elaboration. When suitably generalized, the concept of light ray plays an important role in the general theory of partial differential equations. Returning once more to problem (14. I)-(14.2) we introduce the following quantity
s(f2).
called the total energy of the system. The integration is performed over the whole space R" (t is regarded as a parameter). The integral makes sense when u, is square-integrable with respect to x, and so is grad, v. We may apply Theorem 13.3, noting that heref- 0 and taking s = 1. We conclude that, if uo E H'(R"), u1 E H o = L2(R"), u is a Co function of t valued in H' (hence grad, u is a Co function of t valued in L 2 ) whereas u, is a Co function of t valued in L2. In this case, E ( t ) is well defined. THEOREM 14.3. Let uo belong to H'(R"), u1 to H o = Lz(R"). The total energy E is constant, i.e.,for all t E R',
(14.13)
E ( t ) =/{I.'(X)I'
+ lgrad uo(x)l2}dx.
Proof. Suppose first that both uo and u1 are C" functions with compact support in R". Then u is a C" function in R"+' and, for each t , the support of x H u(t, x) is compact. We have E'
=2
Re -(vff6,+ (grad, v, grad, fit))
J
dx.
In the second term, we may integrate by parts with respect to x: E'
=2
J
Re (v,, - Axu)6,dx = 0
This proves (14.13) when u o , u1 E Cp.
by (14.1).
Sect. 141
INFLUENCE, PROPAGATION, A N D CONSERVATION
117
When uo E H' and u1 E HO, for each fixed t , ut and grad, u belong to Ho and, by Theorem 13.7, the mapping (uo U l ) H (0, grad, 0) from H' x H o into (L2)n+'is linear and continuous. On the other hand, E(t) is clearly a continuous (quadratic) function on its range. Consequently, viewed as a function of (uo , u , ) it is quadratic and continuous, and so is the right-hand side of (14.13). If the latter coincides with E(t) when ( u o , ul) belongs to the dense linear subspace C: x C ,: as it does, it must coincide with E(t) for all (uo , u l ) . Q.E.D. I
5
Exercises 14.1. Does Theorem 14.1 remain true if we replace the wave operator by the Klein-Gordon operator (1 3.27)? 14.2. Does Theorem 14.2 remain true if we replace the wave operator by the Klein-Gordon operator (13.27)? 14.3. Let u be a solution of the Cauchy problem a2u --
(14.14)
A,u
at2
(14.15)
+ l u =0 au
u = uo(x),
-=
at
in R"",
uI(x)
at t
= 0,
where uo and u1 are two smooth functions with compact support in R". What should be the definition of the total energy E(t) of the system (at time t ) if Theorem 14.3 is to be extended to all real values of A.? 14.4. Let K be a compact subset of R", u the solution of (14.1)-(14.2) where the Cauchy data u o , u1 will be assumed to be C" functions with compact support. Let us introduce the energy of K (at time t ) :
EK(t)= ( / u , / '
(14.16)
K
+ Jgrad, ul')
dx.
Under the hypothesis that the number n of space variables equals one, prove that there is a number T 2 0 such that EK(t)= 0 for all t > T. 14.5. Use the same notation as in Exercise 14.4. Suppose now that the number of space variables, n, is an odd integer 2 3 . Prove that EK(t) [defined in (14.16)] vanishes for all t > T, T > 0 large enough. 14.6. Let us use the same notation as in Exercise 14.4 and also introduce the distribution U(t, x ) in R"" such that U,, - A, U = 0 in R"+l, (14.1 7) (14.18)
=0,
Ul t=O
=0
= d(x),
the Dirac measure in R".
118
[Chap. I1
THE CALJCHY PROBLEM
We. know by Theorem 13.4 that there is a unique C" function of t in R', valued in Q'(R"), satisfying (14.17)414.18). What is the Fourier transform of U? Prove that the energy density E ( X , t ) = I U,12 I grad, UI2 is a smooth (and, in fact, an analytic) function of ( x , t ) in the region lxI2 # t2. 14.7. Using the same notation as in Exercise 14.6, suppose that n = 2. Compute the energy EK(t), defined in (14.16), corresponding to v = U ( U defined in Exercise 14.6) and to
+
K = ( x E R " ; 1x1 S R } ,
R>0,
for values of time t > R. From the result of this calculation derive that, when u is any solution of (14.1)-(14.2) with compactly supported initial data (and when n = 2), EK(t) does not vanish identically for large values of t, contrary to what happens for odd values of n (Exercises 14.4 and 14.5). 14.8. Let us introduce the following
DEFINITION 14.1. Let u be any distribution in an open subset R of R".The smallest closed subset of R, in the complement of which u is an analytic function, is called the analytic singular support of u and will be denoted by sing supp, u. Prove the analog of Lemma 14.1, replacing sing supp everywhere by sing supp, (cf. Definition 3.2). 14.9. Use the result stated in Exercise 14.8 (i.e., the analog of Lemma 14.1 for analytic singular supports) to prove the following analog of Theorem 14.2:
THEOREM 14.4. Let S be a closed subset of R", u o , R" which are analytic functions in R"\S.
u1 two distributions in
Given any t > 0 , the solution v(x, t ) of (14.1)-(14.2) is an analytic function of x in the complement of S(r)= { y E R";3y' E S , I y - y' I = t}.
I5 Hyperbolic First-Order Systems with Constant Coefficients In Sects. 13 and 14 we studied the Cauchy problem (in the large, i.e., in the whole space-time R"+')for the wave equation, and encountered remarkable properties. One may say that the wave equation and closely related equations such as Klein-Gordon's are the best suited for such a study. In particular, a solution to the Cauchy problem always exists, under the weakest assumptions about the Cauchy data, is unique, and is a continuous linear function of the data. It is the tradition to summarize these properties by saying that the Cauchy problem is well posed for the wave equation (see Definition 15.1). In Sect. 12 we saw that this is not so for our other basic examples (CauchyRiemann, Laplace, heat, Schrodinger). However, there is a wide class of PDEs for which the Cauchy problem is well posed: the hyperbolic equations (the wave equation is hyperbolic). In the present section we study the hyperbolic first-order systems with constant coefficients. They are systems of linear partial differential operators of the form (15.1)
sj
where the Aj's (0 S n ) are m x m matrices with complex coefficients. It is obviously convenient to perform a Fourier transformation with respect to x , which leads to the system of ordinary differential operators (1 5.2)
where
a
L = - - iA(r) - A , at
120
[Chap. I1
THE CAUCHY PROBLEM
We want to study the Cauchy problem: (15.3) (15.4)
Lu=f
ult=o
inR"+',
= uo(x) in R",
where the values of uo and f are complex m-vectors. Dejinition 15.1. We shall say that the Cauchy problem (15.3)-(15.4) is well posed if, given any data uo E 9k(R"), f E Cp(9L(R")), it has a unique solution u E C:(9k(R")), and this solution is a C" function whenever this is true of the data. Remark 15.1. If we extend this terminology to higher order equations and systems (bringing then to the right level the required differentiability of the solution), we see that Theorems 13.1 and 13.2 state that the Cauchy problem is well posed for the wave equation. Remark 15.2. We have posed the Cauchy problem in the most global terms: Eq. (15.3) should be verified in R"", condition (15.4) in R". One can of course state it also in local terms. When the coefficients are variable, one cannot in general avoid it. In the local statement, we are given an open set R c R" and a number T > 0; f is a continuous function of t, I t 1 < with values in 9;(R) while uo belongs to 9L(R). The solution u is required to be a C' function of t , I t I < T, valued in Qk(l-2). Remark 15.3. Of course, one may somewhat vary the regularity conditions with respect to t in Definition 15.1. But it ought to be noticed that, because of (15.4), it must be made clear what one means by the value of u(t, x ) when t = 0. For instance, if u(t, x ) were only L' with respect to t , it is not clear what we would mean by it. On the other hand, if both u and f are continuous in t (with values in 9:), Eq. (15.3), rewritten as u, = f +
c n
j= 1
AjU,,
+ Aou,
shows that u, would also be continuous with respect to
t.
Remark 15.4. In our definition of well-posed Cauchy problems we have not included the continuous dependence of the solution on the data. This is because it automatically follows from the existence and uniqueness of the solution, by virtue of the closed graph theorem, as alluded to at the end of Sect. 13. Indeed, the closed graph theorem applies to spaces like 9; and C:(9:). Suppose then that nets u: -+ uo in 91, f v +f and uv + u in Cp(9;) and in C:(9:), respectively. Since L is a continuous linear operator on the
Sect. 151
HYPERBOLIC FIRST-ORDER SYSTEMS
121
latter space, if Lu' =f v for every Y, we must also have Lu = J On the other hand, if ~"(0,x) = u;(x> for every v , it is trivially clear that u(0, x) = uo(x). The same remark applies if we replace 9; by Cp and Cf(9:) by CTx. This latter observation leads to interesting developments, as we shall now see.
If one recalls the definition of the C" topology, one sees that the continuity of the mapping (uo ,f)H u can be expressed as follows: (15.5)
sup
To every compact subset K of R"", to every integer M 2 0 , there are another compact set K ' c R"' a compact subset KO of R", two integers M ' , M , 2 0, and a constant C > 0 such that, f o r all u E Cm(Rnfl),
',
c
Io:o,"u(t, x)l
(t,x)eK k+la[6M
- +UP XE
1
KO J U l j M o
I D:u(o,
x>l
+ t t sup 1 I ~ : wX4 Nl]. , x ) ~ K 'k + / a l S M '
It is tempting to try to get information out of this inequality. Let us simplify the situation a little, by taking f = Lu = 0. We notice first that if uo(x) = u(0, x) vanishes identically in a neighborhood of K O ,u(t, x) vanishes identically in K. We may rephrase this by saying that the value ofu(t, x) in the neighborhood of K depends only on the uo(x) in the neighborhood of K O . This is the domain of influence phenomenon, already encountered in the study of the wave equation, where, however, the relation between KO and K was very precisely established (Theorem 14.1). Next, still keeping Lu = 0, we are going to make a special choice of u,. We take uo(x) = e i ( * * ~ ) u 0 ,
where v, is a fixed nonzero complex m-vector and 5 any vector in R".If we write u(t, x) = e'("' r)u(t, x, {), we see that u is the solution of the problem (15.6)
av
_at
1A , - {iA(t) + A,}v ax' n
a0
= 0,
j=l
We may take u independent of x, hence, by (15.2), the unique solution of (15.7) which is
[Chap. I1
THE CAUCHY PROBLEM
122
Let us apply (15.5), choosing K to be an interval {(x, t ) ; x T > 0. We derive, taking M = 0,
= 0,
I rl 5 T},
Let us choose arbitrarily a unit vector 4 and set A = A ( [ ) . Let f ( z , denote the characteristic polynomial of A - k A O : f(z,
E)
= det(z1- A
E)
+ ieAo).
If we view E as a real variable, it is not difficult to see that there are M continuous functions of E , I j ( & ) , j= 1, . . . ,m, with complex values, which, at each point E , represent them roots off(z, E ) (the proof of this assertion is left to the student). Suppose then that A has an eigenvalue I , with nunzero imaginary part. There is > 0 such that, for all E , I E I < A - i&Aohas an eigenvalue Is verifying 1 Im I,1 > c > 0. From the preceding considerations we derive that we may choose 1, to be a continuous function of E . From now on let us restrict the variation of E to the open interval 10, eo[. For each such E we may select an eigenvector u, of A - ieA, corresponding to the eigenvalue I , . Let us take I u s ] = 1. Let us substitute 27, for u, in (15.8) and choose l = E - ' [ . We have exp[t{iA(<) + Ao)]v, = exp[ite-'(A - i ~ A , ) ] v= , exp(ite-'I,)u,, from which, by (1 5.8), (15.9)
sup {exp[- t(Im @-']} 1'1 ST
2 C(1 +
Because of the continuity of I, with respect to sign for E c E, . In (15.9) we take t =
We obtain exp(cTE-') 2 C(l to 0. We have thus proved
E,
Im 1, will keep the same
-T(sign of I&).
+
which is absurd if
E
is to converge
PROPOSITION 15.1. If the Cauchy problem (15.3H15.4)is well posed, the eigenvalues of the matrix A(C;) are real. Remark 15.5. Take n = 1, A , = I / i , A , = 0 : L = a/at + i d/ax. We see, by a different method than the one-used in Sect. 12, that the Cauchy problem is not well posed for the Cauchy-Riemann operator. Remark 15.6. Suppose that we had taken K t o be the interval {(x. t ) ; x = 0, 0 5 t 6 T } (rather than 1 t 1 S T). Then the condition derived from (1 5.9)
would have been the following one: (15.10)
the imaginaryparts of the eigenualues o f A ( ( ) are 50.
Sect. 151
HYPERBOLIC FIRST-ORDER SYSTEMS
123
This observation might serve as the starting point for a study of the forward Cauchy problem. A more precise result than Proposition 15.1 can be proved, namely THEOREM 15.1. If the Cauchy problem (15.3)-(15.4) is well posed, the following is true: (15.1 1) there is a constant C > 0 such that, given any 5 E R, and any eigenualue 2 of A ( 5 ) - iA,, we have Ilm A ) 5 C. The proof of this statement is not elementary; it is based on the SeidenbergTarski theorem and we do not give it here. However, the underlying idea is not difficult to grasp. Indeed, one derives from (15.8) that the absolute values of the imaginary parts of the eigenvalues of A ( t ) - iA, are bounded by C, log(1 + 1 ti), where C , > 0 does not depend on 5. However, these eigenvalues are the roots of the polynomial in z , det(zI - A ( 5 ) - A,), whose coefficients are polynomials with respect to 4. If these roots or their imaginary parts must grow as 15 I -,+ 00 and tend to infinity, they should not do so logarithmically; they do it like fractional powers of 15 1 . One can check this easily on simple examples and the Seidenberg-Tarski theorem enables one to prove it in general. Consequently, if the absolute values of the imaginary parts of the eigenvalues are bounded by C , log(1 It]),this means that they are bounded independently of 5 . The main result of this section, Theorem 15.2, states that (15.11) implies that the Cauchy problem (153415.4) is well posed. Thus (15.11) is both necessary and sufficient in order that this be true. This constitutes the celebrated Gdrding’s theorem. It motivates the introduction of the following terminology :
+
Definition 15.2. The system (15.1) is called hyperbolic if(15.11) holds. Before proving the converse of Theorem 15.I we give two criteria that are useful when checking if a given system is hyperbolic. PROPOSITION 15.2. The system (15.2) is hyperbolic whenever one of the following two conditions is fulfilled:
5 E R,, the matrix A(<) is self-adjoint; (15.13) for any 5 E R,, # 0, the eigenualues of A ( ( ) are real and
(15.12) for any
distinct. Proof. Suppose that the following is true: (15.14) given any point toE R, , I to1 = 1, there is a neighborhood U, of to and an invertible m x m matrix r(5) whose entries are continuous functions in U,, such that S(<)= r(t)-’A(t)r(t) is self-adjoint, whatever 5 in U , .
I24
[Chap. 11
THE CAUCHY PROBLEM
We contend that (15.14) implies that the system (15.1) is hyperbolic. Let us set B , ( [ ) = r(t)-lA,r(P), t E uo, it1 = 1, and take 5 = p [ . We observe that if A is an eigenvalue of A(5) - i A , , it is also an eigenvalue of pS([) - iBo([).Therefore there is a unit vector u in C'" such that p ( ~ ( [ ) u 0) , - i(Bo([)u,u) = I(a, u) = A. We have denoted by ( , ) the Hermitian product in C". By hypothesis, S ( [ ) is self-adjoint, therefore Im I which implies IIm (1 5.1 5 )
= - Re@,(&,
u),
A1 S IlB,([)jj (11 jl stands for the matrix norm), hence IIm 21
5 IIr(0lllluo-l II
IIAoll.
But IIr(t)IIand (Ir([)-'II are continuous functions in U,. Therefore we may find a subneighborhood Uh c U , of to where these two functions are bounded. We may then cover the unit sphere S,-, c R, with a finite collection of such neighborhoods U& and we conclude easily that the numbers I Im A I are bounded independently of 5. In the case (15.12), condition (15.14) is trivially satisfied. Let us show that it is also satisfied in the case (15.13). Let A,([) < * < A,([) denote the eigenvalues of A ( [ ) and set d = int(dj+,([) - A j ( [ ) ) where the infimum is taken overj = 1, . . . , m - I and [ E S,-l. Let c j be the circle in the complex plane, centered at A,([) and with radius d/2. The circle c j varies continuously with g. Let us set
P j ( [ ) = (2in)-'
f
j
(zZ - A ( [ ) ) - ' dz,
=
1, . . . , rn.
c,
The following properties are standard and, at any rate, easy to check:
Pi([)'
= Pi([),
A ( [ ) P j ( @ )= Aj(P)Pj(4),
I
=Pl([)
+ + P'"([). * * *
For each j = 1, . . . , m, let vy be an eigenvector of A(<') corresponding to Aj(tO), say with unit length. The vectors vy, . . . , v: are linearly independent and therefore form a basis of C". Let us set vj([) = P j ( f ) v g .If [ is close enough to lo,the vj([) also form a basis of C'". Let ek be the vector whose components are all zero, except the kth one, which is equal to one. We set
The matrix I-([) = ( $ ( [ ) ) l j j , k j , the student can easily ascertain.
has the properties required in (15.14), as Q.E.D.
Sect. 151
125
HYPERBOLIC FIRST-ORDER SYSTEMS
The following terminology is often used:
Dejinition 15.3. The system (15.1) is called strongly (or strictly) hyperbolic
if condition (15.13) is satisjied.
We now prove the main result of this section. THEOREM 15.2. If the system L is hyperbolic, the Cauchyproblem (15.3)-(15.4) is well posed.
Proof. The theorem follows from the study of the matrix-valued distribution U ( t , x), which is the inverse Fourier transform with respect to x of the distribution
U ( t , 5) = exp{it[/l(()
-i~~]}.
We recall that A(5) is linear with respect to t. We may extend at once U(t, t) to the complex values o f t , denoted by c, and we immediately have that
(I 5.16)
there are two constants B, C > 0 such rhat, for all real t and all complex vectors iE C, ,
11 U ( t , c)li 5 Ceelrl'(1. Thus, U(t, 5) is an entire function of exponential type with respect to stands for the matrix norm).
c (11
(1
Q.E.D.
Next we prove the following assertion:
(I 5.17)
There are two constants B, , C , > 0 such that, for all real t and all real vectors 5 E R, ,
II 0(t7011 5 CO(1 + I5 I)" exp(B0 I t I). Thus o(t, 5 ) is a function of 5 in R","slowly growing" at infinity. Property (15.17) will follow from certain properties of matrices, which we now describe.
LEMMA15.1. Let M be any m x m matrix. There is a constant C , > 0, depending only on the degree m, such that, for all complex numbers z whose distance to the spectrum of M (i.e., the set of eigenvalues of M ) is at least one, (1 5.1 8)
Il(ZZ-
M)-y I C,(I
+
(21
+ llMll)"-'.
Proof. Let us write N = z 1 - M . Then by the theorem of Hamilton-Cayley, and denoting by a,, . . . , a,,,the coefficients of the characteristic polynomial of N , we have N(N"-'
+ a,Nrn-' + . + am-,) + a,Z * *
= 0,
126
[Chap. I1
THE CAUCHY PROBLEM
whereas IIN-lII 5 IarnI-'IINIIrn-' + Ial/arnI IINIIm-2 +
*
a
*
+
Iarn-1/arnl.
But IJNII5 IzI + IlMll, and the ratios of symmetric functions aj/am are bounded by universal constants and so is a; : indeed, these are the symmetric functions of the eigenvalues of N-', which have absolute value 1. Q.E.D.
LEMMA15.2. Let M be any m x m matrix. There is a constant CA > 0, depending only on rn, such that
I[eiMI( 5 CL( I + I(Ml()me'(M),
(15.19)
where I ( M ) is the largest absolute value of the imaginaryparts of the eigenvalues of M .
Proof. Let y denote the contour of integration shown in Fig. 15.1. Since all 41m I
FIG.15.1
the eigenvalues of M have absolute value 5 llMll, the distance from y to the spectrum of M is at least one (in fact, it is one). On the other hand, if z E y, IzI <= 2(1 IlMll). We have e i M - (274-l f e"(zZ - h 4 - I dz,
+
7.
from which, by Lemma 15.1,
IleiMlj5 c m ej- I m Z ( 1
+ IZI + I I M I I ) ~ - ~ I ~ ~ I
Y
5 3Cm(1+ llMII)"-' e (length of y) As the length of y is 1 4 ( 1
+ IIMII), we obtain (15.19).
Proofof(15.17). It suffices to take M = [ ( A ( ( ) - iA,) in (15.19) and use the fact that, by our hyperbolicity hypothesis, the imaginary parts of the eigenvalues of A ( ( ) - iA, are bounded independently of 5. Also keep in mind that A ( ( ) is linear with respect to 5. Q.E.D.
Sect. 151
127
HYPERBOLIC FIRST-ORDER SYSTEMS
We exploit properties (1 5.16) and (15.17) by applying the Paley-Wiener (-Schwartz) theorem: I f a function of 5 E R, can be extended to the complex space C, as an entire function of exponential type and i f , on R,, it grows at most like apolynornial, it is the Fourier transform of a distribution with compact support in R”.We have in fact additional information, to the extent that the “exponential type” of the Fourier transform is related to the convex hull of the support of the distribution. Indeed, we may conclude from (15.16) that (15.20) The support of U(t, x), regarded as a distribution in x (t is fixed), is contained in the ball { x E R”;1 x 1 5 B I t I }. Now, if in our Cauchy problem (15.3)-(15.4) the data f and uo were C“ functions with compact support, we would have the right to transform the problem by a Fourier transformation in x . The transformed problem, in R’ x R,,
5)
(1 5.21)
Zii = A t ,
(15.22)
G(0, 5) = G 0 ( t )
in R,;
has the unique solution (15.23)
G(t,
t) =
w,
OGO(t)
+ j 0h t - t’, 5)J(t‘>t) dt‘,
which is the Fourier transform of (15.24)
u(t, x) = U ( t , x)
* u o ( x ) + f U ( t - t’, x) 0
(X)
* f ( t ’ ,x ) dt’, (X)
i.e., (15.25)
u(t, X)
=
U(t, X )
* u ~ ( x+) ( H ( t ) f ( t ,x)} *
E+(t,X )
( t .x)
(X)
+ W ( - t ) f ( t , 4 1 * E-(t, 4, (tax)
where E+(t,X )
= H(t)U(t,x),
E-(t,
X)
= -H(
- t)U(t, x).
There is no need to stress the analogy between formulas (15.25) and (13.14). From there the argument proceeds exactly in the same fashion as in Sect. 13: Formula (1 5.24) [or (I 5.291 makes sense whatever the distributions uo ,S, since, for each fixed, the support of U(t, x) is compact [by (15.20)]. Furthermore, u is a C” function. Example 15.1. We suppose n
=
3 and consider the following 4 x 4 matrices:
128
[Chap. 11
THE CAUCHY PROBLEM
where the aj are the following 2 x 2 matrices, called Pauli matrices in physics texts (in K, I stands for the 2 x 2 identity matrix): 61=
(O1 0l),
130
cr2
=i( - 1 O 0I),
cr3 = ( I0
- 1O ) *
[Chap. I1
THE CAUCHY PROBLEM
Give an example of a linear transformation T on R, such that Q(Tt) = all c E R4. 15.5. We denote by M,(K) the algebra of m x m matrices with entries in the field K = R or C, by LG(m, K) the group of these matrices which are invertible, by SL(m, I() the subgroup formed by those matrices having a determinant equal to + 1. If M is a complex matrix, we denote by M* its adjoint, i.e., the transpose of the complex conjugate of M. Let us denote by eo the 2 x 2 identity matrix, and by ej the Pauli matrix introduced in Example 15.1 under the notation cj ( j = 1, 2, 3). Show that - Q(5) for
1
(15.35) is a linear isomorphism of C4 onto M2(C), transforming complex conjugate into adjoint, i.e., M ( % )= M([)* and, consequently, mapping the real space R, onto & , the space of all self-adjoint 2 x 2 matrices. Establish the following formulas:
(15.36)
f Tr M(5) = l o
(15.37)
(Tr M is the truce, that is to say, the sum of the diagonal entries of the matrix M). Give the expression of a generic matrix corresponding, via (15.35), to a point of the hyperplane to= 0 in R4. Remark 15.7. The reader will check at once that the multiplication table of the e j ( j' = O , 1, 2, 3) is that shown in Table 15.1.
ie
e4
e0
-ie,
e2
ie3
e0
-ie,
e3
-ie2
,
e0
ie
Sect. 151
HYPERBOLIC FIRST-ORDER SYSTEMS
129
Then let h(z) be an entire function in C , valued in C". Consider the operator (15.29)
+m
(
exp t A - h =
Show that there is an invertible matrix that, if we set (15.30)
S, w(x)
=
tk
- Akh(k)(x).
k=Ok!
r (m x m, with complex entries) such
r - * exp
(t~ - rw, :XI
then the components of the vector S, w are related to those of w by the formula (15.31)
(S,w)j(x) = wj(x + r A j ) ,
j = 1, . . . , n.
Derive from this that, if all the eigenvalues of A are real, S,can be extended as a continuous linear operator acting on the continuous functions of x in R' and the Cauchy problem for L is then well posed (Definition 15.1). 15.2. Let L denote the same operator as in Exercise 15.1. Suppose that (15.28) holds but that one of the eigenvalues of A is not real. By using the matrix introduced in Exercise 15.1 show that the Cauchy problem is not well posed (Definition 15.1) for L. 15.3. Consider an operator L = I d / & - A a/dx on R2, where I is the m x m identity matrix and A any m x m complex matrix. By writing A in its Jordan canonical form, show that the Cauchy problem for L is well posed if and only if all the eigenvalues of A are real. In this case, give the expression of a function u(t, x ) E Cco(R2),satisfying Lu = 0 in R2, in terms of u(0, x) E Cm(R1). 15.4. Let A ( j = 1, 2, 3) be the matrices so denoted in Example 15.1, P4 the real linear space formed by the 4 x 4 matrices (1 5.32)
Consider a linear transformation T of R, such that there is a 4 x 4 matrix S , satisfying (1 5.33)
M(Tt) = S T M ( t ) ,
Writing Q ( t ) = ti - (t:
(1 5.34)
v<
R4.
+ l: + tg), show that we have
Q(T5)'
=
Q(O',
v t E R4
9
for some number K, What is the relation between K and S T ? Show that the set of matrices T such that the above holds true with K = 1 forms a subgroup C of the group of 4 x 4 invertible matrices, with real entries, that is, of the fourth linear group LG(4, R). Is the mapping T w S T of G into LG(4, R) injective? What is its image?
THE CAUCHY PROBLEM
130
[Chap. I1
Give an example of a linear transformation T on R, such that Q(T't) =
- Q(T) for all < E R, .
15.5. We denote by M,(K) the algebra of m x m matrices with entries in the field K = R or C, by LG(m, K) the group of these matrices which are invertible, by SL(m, K) the subgroup formed by those matrices having a determinant equal to 1. If M is a complex matrix, we denote by M* its adjoint, i.e., the transpose of the complex conjugate of M. Let us denote by eo the 2 x 2 identity matrix, and by e j the Pauli matrix introduced in Example 15.1 under the notation oj ( j = 1, 2, 3). Show that
+
(15.35)
5 = ([O
7
[ I , (2
9
c3lH
M(c) =
3 j=O
c j ej
is a linear isomorphism of C4 onto M2(C),transforming complex conjugate into adjoint, i.e., M ( [ ) = M(C)* and, consequently, mapping the real space R, onto X 2 , the space of all self-adjoint 2 x 2 matrices. Establish the following formulas : (15.36)
e,
e2
e3
e4
e0
-ie3
e2
ie3
e0
-ie,
e3
-iez
,
e0
ie
ie
Use the same notation as in Exercise 15.5. Given any matrix M2(C)define a transformation in R, , T A , by the formula
15.6.
A
E
( 15.38)
AM(t)A
* = M(TA0, t E R4.
Sect. 1-51
HYPERBOLIC FIRST-ORDER SYSTEMS
131
Prove, by using (1 5.36), that A H T Ais a group homomorphism of the special linear group SL(2, C) into the Lorentz group Y (on R4). What is the kernel of this homomorphism? 15.7. Let U(2) denote the 2 x 2 unitary group [ U EU(2) if UU* = I ] and set, for U in U(2), (15.38') UM(t;)U-' = M(T,t;), t; E R,. Prove that U M U - ' preserves the determinant and the trace. Use then formulas (15.36)-(15.37)to prove that, if U E U(2), T , preserves every hyperplane to= const, in R,, and induces an orthogonal transformation on R, identified with the hyperplane to = 0. Show that U++T , is a group homomorphism of U(2)into O(3), the group of 3 x 3 orthogonal matrices, whose kernel is equal to the subgroup of diagonal matrices of the form e'"Z, 0 2 w 5 277. 15.8. Use the same notation as in the preceding exercises. Let U + ( 2 ) denote the subgroup of U(2) consisting of the unitary matrices whose determinant is exactly equal to + 1 ; U'(2) is also a subgroup of SL(2, C)and it is obvious that the definitions (15.38) and (15.38') of T , are the same. We have seen, in Exercise 15.7, that UH T , is a group homomorphism of U'(2) into (not onto!) O(3). What is its kernel? Let us set cos 8 sin 8 V ( 4 )= R(8) = -sin 8 cos 8 Prove that every matrix U E U'(2) can be written
(g
u = V ( 4 / 2 ) R ( WV($/2), (15.39) where 4, $, 0 are angles (i.e., vary on the torus TI). Conclude from this that U f ( 2 )is a connected three-dimensional compact (real analytic) manifold and that, consequently, U H T , is a group homomorphism of U'(2) onto the group of rotations in R,, O'(3) [this is the connected component of the identity in O(3), also the intersection of O(3) with SL(3, R)]. Compute T , in each one of the following cases: U = V(4/2), U = R(8/2), and derive from this and from (1 5.39) the general formula for T , . 15.9. Using the same notation as in Exercise 15.6, show that every matrix A E SL(2, C ) such that the restriction of T A to the plane to= t3 = 0 is the identity is necessarily of the form (15.40)
By combining this fact with the main result in Exercise 15.8 [namely that UHT, maps U'(2) onto O'(3)] derive that A - T , is a group homomorphism of SL(2, C ) onto the connected component of the identity in the Lorentz group, Y , , and thus that SL(2, C ) is a covering group of order 2 of 2+.
I6 Strongly Hyperbolic First-Order Systems in One Space Dimension Some of the results in the preceding sections, in particular those concerning the existence and uniqueness of the solutions to the Cauchy problem, can be extended to certain partial differential equations and systems with variable coefficients, generally called hyperbolic. Here we shall limit ourselves to firstorder systems with variable coefficients, when the number of space variables is equal to one. These systems are of the form
a
a
at
ax
L = - - A(x, t ) -
- B(x, t).
Here A(x, t ) and B(x, t ) are m x m matrices whose entries are smooth functions, say C' functions of (x, t ) in R x 1-T, T [ (R c R" open, T > 0); of course, a/& stands for I d / a t where Z is the m x m identity matrix. We shall assume that the system L is strongly hyperbolic (cf. Definition 15.3):
(1 6.1)
For every xo E 0,to E ] - T, T [ , the eigenvalues of A@,, are real and distinct.
to)
LEMMA16.1. Zf(16.1)holds, therearem C'functionsof(x, t ) i n 0 x 1-T, T [ , 11,. . . , A,,,, representing at each point ( x , t ) the eigenvalues of A(x, t ) . Prooj Since the eigenvalues of A(x, t ) are real and distinct, we may'choose their indices so as to have I , < I 2 < * * . < I , everywhere in R x 1 - T, T [ . Let ( x o , t o ) be an arbitrary point in the latter set, I: thejth eigenvalue of A ( x o , to). Let us denote by f(I; x , t ) the characteristic polynomial of A(x, t ) : f ( A ; x,t ) = det(II - A(x, t ) ) . We have
f(2: ;xo
I
fxq; xo , t o ) # 0.
t o ) = 0,
We may therefore apply the implicit function theorem and conclude that there is a C' function A(x, t ) in a neighborhood of ( x o , t o ) satisfying there 132
Sect. 161
133
STRONGLY HYPERBOLIC FIRST-ORDER SYSTEMS
f (A(x, t ) ; x , t ) = 0 and equal to A; when x have l ( x , t ) = Aj(x,t).
= xo , t = t o . We
must necessarily Q.E.D.
Remark 16.1. The lemma would still be true if the eigenvalues ceased t o be real, provided that they remain distinct. In general, if multiple eigenvalues are allowed, the lemma is not valid. A simple example is provided by
W e now set P j ( x , t ) = (2ni)-'
'
( z l - A(x, t))-' dz,
7,
where y j is a circle in the complex plane o n which n o eigenvalue of A(x, t ) lies and whose interior contains one and only one eigenvalue of A ( x , t ) , the j t h one, Aj(x, t ) . It is clear that P j ( x , t ) is a C' function of ( x , t ) ; it is t h e j t h spectralprojector of A(x, 1 ) . The range g j ( x , t ) of the matrix Pj(x,t)actingon C" is one-dimensional: It is the j t h eigenspace of A(x, t ) (which is onedimensional). We have I = P , +.**+P,. This implies that, for each (x, t ) , Y , ( Xr),, . . . , Y J x , t ) span the whole space C". Of course, we have Pi' = Pi for every j .
LEMMA 16.2. Every point ( x o , t o ) of 0 x ] - T, T [ has an open neighborhood Bo in this set where a matrix-valued function T ( x , t ) is dejined, having the following properties : (16.2)
r ( x , t ) is C' in Soand invertible at every point of So;
(16.3)
we have,for every (x, t ) E Bo,
r-l(x,t)A(x, t ) q x , t ) =
i
U x , r)
0
o
...
0
0
: 1.
A,(x, t )
Proof: F o r each j , we arbitrarily select a nonzero vector v: in T j ( x 0 ,t o ) a n d we set v,(x, t )
=Pj(X,
t)V?.
When x = x o , t = t o , we have vj = v?. hence the vj then form a basis of C" and this will remain true in a small neighborhood of ( x o , to), which we take to be Bo . Let us write rn
134
[Chap. I1
THE CAUCHY PROBLEM
where (el, . . ., em) is the canonical basis of C"; i.e., for each k,all the components of ek are zero, except the kth one, which is equal to one. The matrix ($(x, t))ljj,kjm will be our matrix T(x, t ) . We leave to the student the verification that it possesses the properties (1 6.2) and (16.3). Q.E.D. We are now in a position to study the following local Cauchy problem. In what follows, U will be a small open subset of R, 6 a small number such that 0 < 6 < T . The Cauchy problem is stated as follows : (16.4)
Lu=f(x,t),
(16.5)
XEU,
u(x, 0) = u,(x),
It(
x E u.
About the data, we assume thatf(x, t) is a C' function of (x, t) in R x ] - T, T [ while uo(x) is a C' function in Q. We transform the problem by setting
where T(x, t) is the matrix in Lemma 16.2 when we apply it to (xo , t o ) where xo is a point in U and t o = 0. From now on, until the contrary is specified, U and 6 will be assumed to be sufficiently small so that U x ] - 6,6[ is contained in Bo . We have
where we have set (16.7)
B, = B - rI+ A r , .
Problem (16.4)-(16.5) is thus transformed into (16.8) (16.9)
V , - T - ~ A T V , - B , V = ~ ( ~ , ~ ) ,X E U , V(X, 0) = V0(X),
It1
<6,
x E u.
The advantage of the new system lies in the fact that T-'AT is diagonal [cf. (16.3)]. This advantage will be exploited fdly in the solution of the Cauchy problem. It is instructive to consider first the case where m = 1, i.e., L is a first-order scalar linear partial differential operator.
Sect. 161
STRONGLY HYPERBOLIC FIRST-ORDER SYSTEMS
135
The Scalar Case In this case A and B are complex-valued functions. We shall write A(x, t), instead of A(x, t). Hypothesis (1 6.1) demands that 1be real. We notice that the principal part of L,
a
L - - - A(& O - at
t)
a ax -3
can easily be transformed, by a change of variables, into a/&. It should be emphasized, however, that this transformation is only local, that is, valid in a small open set (which we may identify with the set goabove). Indeed, we solve the nonlinear ordinary differential equation dx ds
- = -A(x,
( 16.10)
s),
with initial condition
-
(16.11)
-y,
where y is a point sufficiently near xo . For small s, (16.10) has a unique solution, which we denote by x = x(y, s). Consider then the change of variables ( 16.12)
t = s.
x = x(y, s),
By virtue of (16.1I), the derivative of x with respect to y does not vanish if s is close to zero (for s = 0, it is equal to one). Consequently, (16.12) is a true C' change of variables. We have
a
a
as
at
a ax
1(x, t ) - = Lo
-=--
The operator L is transformed into
a
L" = - - B ( x ( ~s), , as
S)
(rather, one should say that this is the expression of L in the new coordinates y, s). Since the lines s = 0 and t = 0 coincide and (16.1 1) holds, the problem (16.5)-(16.5) is transformed into (1 6.13)
(16.14)
L # u = m y , s), s),
u, Y E u.
x(y, s) E
u(y, 0) = uo(y>,
Is1
< 6,
136
THE CAUCHY PROBLEM
[Chap. I1
But because of the expression of L’, this latter problem can be viewed as a Cauchy problem (depending on the parameter y ) for a first-order linear ordinary differential equation, and we know how to solve it. It suffices to recall the considerations at the beginning of Sec. 11. We have
We have set (16.16)
P(y, s) = fB(x(y, s’), s’) ds’. 0
Formula (16.15) can be interpreted in an interesting geometrical manner. Notice first that y , s are the new coordinates of the point (x, t). In particular, s = t. On the other hand, when s’varies from 0 to s = t , the point (x(y, s’), s’) ranges over an arc of curve joining ( y , 0) to (x, t ) . The curve in question is the integral curve of (16.10)-(16.11). If it passes through (x, t ) , we shall denote it by C(x, t). At the point (x, t ) , Lo is a partial differentiation in the direction of its tangent. This defines an orientation on C(x, t ) , which is the same as the one defined by taking time as a parameter (we go forward if time increases). By the fundamental theorem o n ordinary differential equations, applied to (16.10)-(16.1 I), through every point ( x , t ) there is one and only one arc of integral curve of (16.10)-(16.11). Of course, C ( x , t ) = C(x,, tl) means that ( x , t ) and (xi, tl) lie on one and the same such integral curve. If U and 6 are sufficiently small, U x 1-6, 6[ will be “trivially fibered” by these curves C(x, t ) , in fact by the curves C(x, 0) as x ranges over U. This means that the effect of (16.12) is to “straighten up” the curves C(x, 0) and transform them into segments of vertical lines x = const. Now, in this picture, y is completely determined by the fact that ( y , 0) is the intersection of C(x, t ) with the straight line t = 0. As for P(y, s) it is equal to the integral of 3 along C(x, I), with respect to the measure dt, from ( y , 0) to (x, t). We shall write B,(x, t ) instead of P(y, s). We have
B,(x, t ) =
f
B(x’, t’) dt‘,
C(x, 1 )
keeping in mind that the bounds are 0 and t ; (x’, t ‘ ) is the variable point on C(x, t). As for (16.5), it reads
(16.17)
keeping in mind that y is dejned by the fact that ( y , 0) is the intersection of C(x, t ) with the axis t = 0; here also the integration is performed from t’ = 0 to t‘ = t even when t < 0.
Sect. 161
137
STRONGLY HYPERBOLIC FIRST-ORDER SYSTEMS
Remark 16.2. Suppose that f = 0. Then the value of u at ( x , t ) depends only on the value of uo at y . In a sense, we may say that the “influence” of the initial data propagates along the curves C(x, t ) .
The General Case We return now to the general (but “ diagonalized ”) problem (16.8)-( 16.9). To solve it we shall now combine the considerations in the scalar case with Picards interation method. But first let us simplify (16.8)-(16.9) by substituting u(x, t ) - uo(x) for u(x, t ) . This amounts to the same as assuming that uo = 0 (note that g must then be replaced by g + T-’ATu,, + B , uo). Next we define a sequence of functions d k )(k = 0, 1, 2, . . .) as follows: ( 16.1 8)
$)
-
r- l A r U ( k )= B X
#
“(k-1)
+ g,
dJj
r=o
= 0,
agreeing that = 0. We shall show that, if U and 6 are sufficiently small, the functions d k )converge uniformly in U x ] - 6, 6[ to a C’ function u(x, t) which is a solution of (16.8)-(16.9); grad d k ) will converge uniformly to grad u. We set w(k)
= u(k)
- u(k
- 1 ),
k = 0 , 1 , ....
Then we define +m
u
=
1 dk),
k=O
and we shall show ,that the sequence {d”}is summable. We derive from (16.18), by subtraction (16.19) (16.20)
wjO)
w;k) -
- r - 1 A r wx (0)= g,
d y x , 0 ) = 0,
r-lArwkk) = B ~ ( ~, - ~w ‘’~ ) ( x0 ,) = 0,
k > 0.
Equations (16.19) and (16.20) are easy to solve. They are essentially the conjunction of m scalar equations. Indeed, let us call d k ) j thejth component of wtk),( B , dk))j that of B , d k !and , g’ that of g. Equation (16.19) is the conjunction of the following equations (when j = 1, . . . , m): (16.21)j
~ . ~ ( o=) jj J g,
w y x , 0 ) = 0,
where L j = d / d t - Aj(x, t) d/dx, whereas (16.20) is the conjunction of the equations (16.22),
LJ . W (=~{ )B ~ #~
( ~ - ‘ ) } j ,
w ( ~ ) ~ (0) x= , 0,
k > 0.
[Chap. I1
THE CAUCHY PROBLEM
138
We call Cj(x, t ) the segment of integral curve of the vector field LJ which joins the axis t = 0 to the point (x, t). We apply formula (16.17): (16.23)
dk-')}j(X', t') dt',
w ( ~ ) ~ (tx) ,=
k > 0,
whereas (1 6.24)
W(O)j(X,
t) =
We introduce now the following notation: Notation 16.1. We denote by 9(x, t ) the smallest compact set containing the point ( x , t ) and hauing the following property: If(x', t') E 9 ( x , t ) , every curue segment Cj(x', t') ( j = 1, . . . , m ) is contained in 9(x, t ) .
We then set, for any vector-valued continuous function p(x, t), IIIcp(x, t>lll = SUP 1 4G', t') I . P(x. 1)
We derive at once from (16.23), IIIw(~)(x, t)lll
S CI t I I I I w ( ~ - ~ ) ( x ,
k > 0,
t>lll,
hence, by iteration, IIIw(~)(x, t)III
5 ~ ~ l ~ l ~ k l 01l11,l ~ ~ ko>~ 0.~ ~ ,
Since, on the other hand, (16.24) implies IIIw'o'(x, t)lIl
5 CI t I
Illdx, t)lII,
we finally obtain (1 6.25)
IIIW(~'(X,
t)lll
S
(eltl)k+'Ills(x, t)lIl,
',
1
k = 0, 1, .-..
Consequently, for I t I < C - the series wCk)converges uniformly. An inequality analogous t o (16.25) can be derived for grad d k )instead of d k ) (g then also has to be modified). To see this, one first estimates w$k) using formula (16.23). This is not as difficult as it may seem, if one first straightens up the curve Cj as indicated in the scalar case. To estimate wik) one uses the estimates of d k and ) wik)in combination with (16.21) and (16.22) (expressing wj') in terms of the other quantities). In these latter estimates, one must take advantage of the fact that the eigenvalues Aj(x, t ) are C' functions of (x, t) [this is then true for the curves Cj(x, t ) ] and of the assumption that the original data f(x, t), uo(x) are also once continuously differentiable. We leave the details to the student. Finally we have obtained an expression of u of the form u = ( I - T)-'g
Sect. 161
STRONGLY HYPERBOLIC FIRST-ORDER SYSTEMS
139
where T is an integral operator. This integral operator T is such that, as shown by formula (16.25), the value of u a t the point ( x , t ) depends only on the values of g in the set 9 ( x , t ) (we are assuming here that uo = 0). We encounter once more the " domain of influence" phenomenon (cf. Theorem 14.1) : In the present case, the influence propagates along the curves Cj(x, t ) .
Example 16.1. Consider the Cauchy problem for the wave equation in one space variable : (16.26)
urr- u,,
=f,
~ ( 0X,) = u ~ ( x ) ,
~ ~ (X0) = , u,(x).
Set u1 = u, u2 = u, - u, and call u the vector with components u l , u2, f the one with components 0, J and uo the one with components u 0 , u1 - uox. The problem (16.26) is then transformed into the following Cauchy problem :
+
Here L , = a/at - a / a x , L, = a/at a l a x ; c,(x, t ) is the curve X I = --t' + (x + t ) , and C,(x, t) is the curve x' = t' + ( x - t). Supposing, for instance, that t is greater than zero, the set 9 ( x , t ) is the portion of the (closed) backward light-cone, with vertex at (x, t ) , which lies in the half-plane t 2 0.
Exercises
16.1. Here x = ( x ' , . . . , x"), n 2 1, and (16.28)
a
L
a 1 Uj(X, t) ad' n
=-at
where the coeffcients aj are real-valued C" functions in an open subset 0 of R"+l. Show that every point ( x o , t o ) of Q has an open neighborhood B0c z2 where the system of ordinary differential equations
(16.29) has a unique solution x
(16.30)
= x(y, s)
x'(
such that
=yj,
j = 1 , ...,
s=ro
where y is a point R",close to xo . Prove that
(16.31)
x = x ( y , s),
t = s,
defines a Cmchange of variables near ( x o , to).
[Chap. I1
THE CAUCHY PROBLEM
140
Apply these facts to the solution of the Cauchy problem (16.32) where u,, is a sufficiently regular function of x in some neighborhood of xoin R". Solve the same problem with Lu =f(instead of Lu = O),fbeing a smooth function in an open neighborhood of ( x o , to) in R"". In particular, let ui be the solution of (16.32) when uo(x)= xj. Relate u = ( u ' , . . . , u") to the transformation ( I 6.31). 16.2. Use the notation of Exercise 16.1. We assume now that u j ( x , s) = uj(x), independent of s ( j = 1, . . . ,n). Denote by y(x, t ) the function obtained by inverting with respect to y the function x = x(y, t ) in (16.31). Set
y ( x , t)
(16.33)
= T,x.
Prove that, for sufficiently small values of I t 1, small neighborhood of xo , we have (16.34)
To = Z,
the identity;
I t'I
and for x in a suitably
TtT, ,= T,,,. .
Compute T, in the following cases: (16.35) the u j are all constant; (16.36)
L = -a- - a at ax
(n = 1);
(16.37) 16.3. Using the same notation and hypothesis as in Exercise 16.2, give an example in the case where n = 1 in which the operator T , , defined in (16.33), does not exist for all real values of t. 16.4. Use the notation as in Exercise 16.1, with R = R"". We suppose here that there is a constant K > 0 such that
(16.38)
Igrad,ujI
IrC inthewholeofR"",
j = 1, ... ,n.
Prove that the solution of (16.29)-(16.30) exists for all y E R" and all t E R. 16.5. Consider L = I a / d t - A , where I is the identity m x m matrix and (16.39) where the A j ( x ) are rn x rn matrices with entries that are analytic functions of x in some open neighborhood of the origin in R". Let h be an arbitrary
Sect. 161
STRONGLY HYPERBOLIC FIRST-ORDER SYSTEMS
141
function in R",valued in C". which can be extended to C" as an entire function. Prove that the series at the right of +m
(1 6.40)
(erAh)(x)=
.k
2- k5! (Akh)(x)
k=o
converges, provided that 1 x [ and I t I are sufficiently small, and defines an analytic function of x in a suitably small open neighborhood of the origin. 16.6. We use the same notation as in Exercise 16.5, except we assume now that m = I (thus the coefficients A j are scalar functions). Compute the operator e f A , defined in (16.40), in each one of the cases (16.35), (16.361, (16.37), and also in the case (16.41)
a
L=-dr
"z 2
( n = I).
Compare with the operator T , defined in (16.33). 16.7. Consider L = a/at - I ( x ) iijdx (we are in the case of one space variable, x ; i.e., n = I ) and suppose that I is an analytic function of x in a neighborhood of zero. Give an interpretation of the operator (16.40) in this case as a "motion" in the complex plane (near the origin) and derive from it that the Cauchy problem is well posed for L if and only if I is real-valued.
The Cauchy-Kovalevska Theorem. The Classical and Abstract Versions
The Cauchy problem, as we have seen, is well posed for hyperbolic equations and systems and only for these. Our definition of well-posed Cauchy problems involves the assumption that the data, the Cauchy data and the right-hand side, are arbitrary distributions (or arbitrary C" functions). In practice one might be willing to settle for much less and content oneself with restricted data-restricted either from the viewpoint of smoothness or from the viewpoint of growth at infinity. In this case the classes of equations for which the solution to the Cauchy problem exists and is unique (within the chosen framework) may be considerably larger than the hyperbolic. The most classical example of this phenomenon is provided by the CauchyKovalevska theorem. In its most common form it applies to the local Cauchy problem: The data are required to be analytic functions and one seeks an analytic solution. The class of equations for which it is possible to find such a solution, which is then unique, includes essentially all equations with analytic coefficients (as we shall see, there is one restrictive condition, concerning the choice of the time variable t ) . Until now we have handled the PDEs under consideration by reducing them to ODES via a Fourier transformation with respect to the space variables. This method can be modified to cover more cases, if we generalize the kind of ODE we are willing to study: We must allow the coefficients of the ODE to be not only complex- or matrix-valued functions, as before, but also linear operators on certain function spaces. For instance, we may view the heat operator a / d t - Ax as an ordinary differential operator (with respect to time); -Ax is then its zero-order coefficient. It turns out to be a linear operator acting on functions and distributions in the x variables. A similar 142
Sect. 171
THE CAUCHY-KOVALEVSKA THEOREM
143
interpretation will apply to all first-order systems considered in Sects. 15 and 16, and to many others. Generally speaking, they are of the form (17.1)
L
0
= - - A(t), at
where A ( t ) is an operator-valued function of t . Notice that this covers also the case of the operators we have often denoted by L , resulting from a Fourier transformation with respect to x. In the operators the coefficients were linear operators acting on spaces of functions of the variables 5 E R, which happened to act as multipliers: multiplication by smooth functions of 5. The advantage lies in the fact that multipliers are particularly easy to handle. We are now going to move in a different direction, looking toward the CauchyKovalevska theorem, and deal with operators A ( ? ) of the kind (17.2) acting on spaces of analytic functions of x ; the coefficients Aj(x, t) will be mx matrices with entries which are analytic functions of x, continuous with respect to t (we will have to be more precise than this). Here x will vary in an open set SZ of R" while t varies in an open interval 3-T, T [ ( T > 0). We are given two functionsf(x, t ) , uo(x) analytic with respect to x in R; moreover, we shall assume thatf(x, t ) is continuous with respect to t , I tl < T. We look then at the Cauchy problem (17.3) (1 7.4) Our version of the Cauchy-Kovalevska theorem states (roughly speaking) that if 0' is any relatively compact open subset of R and if T',0 < T' 5 T, is sufficiently small (depending on the choice of U),there is a unique function u(x, t ) , analytic with respect to x, once continuously differentiable with respect to t , which satisfies (17.3) when X E SZ' and I t )
144
THE CAUCHY PROBLEM
[Chap. I1
These various results must be stated in precise and rigorous terms. To do so we shall have to be more technical. I t is advantageous to extend the variable x = (xl, . . . , x") to the complex domain-where we denote it then by z = (z', . . . , 2")-and everywhere replace "analytic" by " holomorphic." An analytic function in SZ c R" is a function which can be extended holomorphically to some open neighborhood (depending on the function) of i2 in C". For our purpose it is convenient t o consider those functions which can be extended holomorphically to afixed open neighborhood 6 of R in C" and, as a matter of fact, which can be extended as continuous functions to the closure 6 of 6 (assumed to be bounded). In what follows we may forget that 6 originated as a neighborhood of 0 and regard it simply as an arbitrary bounded open subset of C". Definition 17.1. Let 6 be a bounded open subset of C". We denote by H ( @ ) the space of continuous functions in 8 which are holomorphic in the set 0. We make a normed space of H ( B ) by equipping it with the maximum norm : (17.5)
PROPOSITION 17.1. The normed space H ( b ) is complete, i.e., is a Banach space. Proof. A Cauchy sequence ( u j } ( j = 1, 2, . . .) in H ( 6 ) converges uniformly on B to a continuous function u in 8. I n 0, we have (d/dZk)uj= 0, k = 1, . . . ,n, for everyj = 1, 2, . . . ; hence (d/d?'))u = 0, k = 1, . . . , n, since the u j converge Q.E.D. to u in the distribution sense. Thus u is holomorphic in 0. We may also look at the continuous functions in 6,holomorphic in 0, which are valued in C".We denote the space of such functions by H ( 8 ; Cm) and equip it also with the maximum norm (which turns it into a Banach space). We wish now to study the action of an operator of the kind (17.2) (with z substituted for x) on the elements of H ( a ; C"). Clearly we might run into trouble if we look at the result of this action at the boundary of 0 but not if we stay away from the boundary. Let us take a closer look at this fact. Let 6,be a second bounded open subset of C" such that
B
c 0,.
Let d denote the distance from 0 to the boundary of 0,. Observe then that the partial differentiation a / d z j is a linear operator from H ( B , ; C") into
IT(@ : C"). PROPOSITION 17.2. The partial dijjierentiation d/azj (1 5 j 5 n ) defines a bounded linear operator H(8,; C") -+ H ( 8 ; Cm)with norm I d - ' .
Sect. 171
THE CAUCHY-KOVALEVSKA THEOREM
145
ProoJ: Let zo be an arbitrary point of 8, f any element of H(0, ; Cm).We have (Cauchy’s inequalities)
which implies at once
from which we obtain Proposition 17.2.
Q.E.D.
Let M m ( C denote ) the Banach space of rn x m matrices with complex entries, equipped with the matrix norm 11 11. PROPOSITION 17.3. Let M ( z ) belong to H(U; Mm(C)).Then f H M f is a bounded linear operator of H ( 8 ; C m )into itself, with norm (17.6)
The proof of Proposition 17.3 is very easy and we leave it to the student. Let us now consider a matrix-valued function M(z, t ) which satisfies the following condition: (17.7)
t H M ( x , t ) is a continuous function of valued in H ( b ; Mm(Cm)).
t,
1 t / < T (T > O),
) This means that, for each 1, It1 < T, M ( z , t ) belongs to H ( b ; M m ( C ) [i.e., its entries belong to H ( B ) ] ,and furthermore
(17.8) to every
E
> 0 there is an g > 0 such that, for every t‘, I t’I < T,
I t - f ’ I < q implies 11 M( . , t ) - M ( , t’) < E . *
(Of course, in general
y~ depends
on
t.)
PROPOSITION 17.4. r f (17.7) holds, t ~ (HMf ( . , t ) f ) is a continuous function of t, 1 t 1 < T , valued in the Banach space of bounded linear operators of H ( a ; C m )into itself. For each t , I tI < T, the operator norm o f , f w M ( . , t )f is 5IIM(., t)lln. This follows at once from Proposition 17.3, (17.7), and (17.8). We look now at the operator (17.2) where x has been replaced by z. Combining Propositions 17.2 and 17.4 yields
146
[Chap. I1
THE CAUCHY PROBLEM
PROPOSITION 17.5. Let Aj(z, t) ( j = 0, 1, . . . , n) be n tions o f t , 1 t I < T, valued in H ( b ; M,(C)), and set
c Aj(Z, r azJ +Ao(z, t). n
A(?)=
+ 1 continuous func-
3
t)
j=1
Then A ( ? )is a continuousfunction o f t , 1 t ) < T, valued in the space of bounded linear operators of H(8, ; C'") into H ( 8 ; C'"). For each t, I t I < T, the norm of A(?) is not larger than
Thus, the bound for the norm of A(?) grows like l/d when d +O. This observation is crucial in the proof of the Cauchy-Kovalevska theorem. In our version of this important result we shall deal with the following objects: (17.9)
n + 1 continuous functions of t , 1 tl < T, Aj(z, t) ( j = 0 , 1, .. . , n), valued in H ( 8 , ; M,,,(C));
(17.10) a continuousfunction o f t , I tl < T , f ( z , t ) , valued in H(b,; C"); (17.11) an arbitrary member of H(Q,; C'"), uo(z). We shall then study the following Cauchy problem : (17.12)
au " au = Aj(Z, t ) -.
at
j=l
aZJ+ A , ( z , t)u +f(z,
t),
(17.13) We will not be able to require that (17.12) be verified for all z E 9,and I t 1 < T and that (17.13) holds for all z E 9,. Rather THEOREM 17.1. (Cauchy-Kovalevska). Let Oo be a nonempty connected open subset of C" whose closure is contained in 8,. There is a number do, 0 < 6 , 5 T , such that the following is true: There is a unique C' function of t , I t < do, u(z, t ) , valued in H(6,; C'"), satisfying (17.12) f o r all z E Oo , t , I t I < d o , and (17.13)for all z E 9,.
I
Let do be the distance between 0, and the boundary of 8,.There is no loss of generality if we decrease 9,and assume that
0,= { z E C"; dist(z, O0)< do). We introduce a one-parameter family of open sets, intermediary between 0, and 0,and all connected: 0,= {z E C"; dist(z, @ < sd,), lo) 0 5 s S 1.
THE CAUCHY-KOVALEVSKA THEOREM
Sect. 171
147
Notice that dist(0,. , Cn\O,) = ( s - s’)do,
(17.14)
0 5 s’ < s 5 1.
We shall exploit (17.14) in relation to Proposition 17.5. We shall transform our Cauchy problem (17.12)-( 17.13) into the abstract problem ( I 7.15 )
u, = 4 t b + f f t ) ,
(17.16)
uit=o= U o .
We view uo as an element of the Banach space El = H(6,; Cm)and f ( t ) as a continuous function of t , It 1 < T , valued in El-which they are. More generally let us write ( 17.17)
E,
= H(6,; Cm),
0
s
5 1.
It is clear that we have (17.18) Ifs’ < s, E, c E,. and the norm in E, is not smaller than the norm induced on E, by E,.. Of course, the natural injection E, G E,. is obtained by taking functions belonging to If((?,; C m )and restricting them to (?,,. We may also reinterpret Proposition 17.5 in the following manner [replacing by 0, and 0 by 0 , and taking (17.14) into account]: (17.19)
us’
< s,A(t) isacontinuousfinction of t , 1 tl < T , valuedin the Banach space of bounded linear operators E, + E,. .
For each t , I t 1 < T, the operator norm or A(t) : E, + E,. is not greater than C(t)/(s- s’), where 1 ” c(t)= IIAj(*, t)IIo, + IIAo(., t>IIo,.
jF,
The important thing about C(t) is that it is bounded on any closed subinterval of ] - T, T [ . Thus, if we slightly decrease T > 0, we may assume C ( t ) is bounded by a constant C > 0 and therefore that
I
(17.20) For each t , t 1 < T, the operator norm of A(t) : E, -+ ESP (0 5 s‘ < s 5 1) is not greater than C/(s - s’) where thepositive constant C does not depend on s, s’ nor on t .
Also note that, as soon as we have decreased T (however slightly), we may assume that f is a continuous function in the closed interval [ -T, T ] with values in E,. Notice also that (17.20) is not modified if we increase C. For technical reasons it is convenient to assume that (17.21)
(Ce)-’ 5 T.
148
[Chap. I1
THE CAUCHY PROBLEM
We are now in a position to apply the following result, which can be regarded as the abstract version of the Cauchy-Kovalevska theorem.
THEOREM 17.2. Under the preceding hypotheses, given any uo E E, and any continuous function o f t , I t I T,S, valued in El, the following is true: (I) There is a function u in the interval I tI < (Ce)-', valued in E o , which, for any s, 0 5 s < 1, is a C' function o f t , It1 < (Ce)-'(l - s), valued in E, and which,furthermore, satisjies ( 17.15)when I t I < (Ce)-', together with (17.16). (11) If,for some number T ' , 0 < T' 5 T, and for some s, 0 < s 5 I , there are two C' functions oft, I t 1 < T , valued in E, and satiflying (17.15) in this interval, together with (f7.16),they must be equal. Proof: ( I ) Existence of the solution. We define a sequence v,(t) ( k = 0, 1, . . .) of continuous functions of t, [ f 1 5 T, valued in E, (for any value of s, 0 I s < I), as follows:
By (17.19) we know that, if uk(t ) is a continuous function of t , I t I < T, valued in E,, A(t)vk(t)is a continuous function in the same interval, valued in E S T , for any s' < s, and the same is true of v k + ( t ) . As s is arbitrarily close to one, the same is true of s', from which arises our assertion that each vk is a continuous function of f , I t I < T , valued in any E,, 0 5 s < 1. Next we set Wo=Uo,
if k > 0 .
Wk=Vk-Uk-l
We have W,+,(t) = J / f ( t ' ) b ' k ( t ' )
dt',
k
= 0,
1, . . . ,
Each Wk is a continuous function of t , 1 t ) < T, valued in any E,, s < 1. By induction on k = 0, 1, . . . , we shall prove the following inequality : (17.22) where N, denotes the norm in Es and where
Sect. 171
THE CAUCHY-KOVALEVSKA THEOREM
149
Observe that M ( t ) is a nondecreasing function of I t I on the half-intervals [0, TI and [ - T , 01, respectively. Inequality (17.22) trivially holds when k = 0. Suppose that it holds up to k and let us prove it for k + 1. We return to the definition of w ~ and . ~ apply ( 1 7 . 2 0 ) . 1 f O 5 s ’ < s < 1,
We now choose s = s’ + ( I - s’)/(k + I), so that 1 - s = [ k / ( k+ 1)]( 1 - s’). We obtain
and its suffices to observe that ( 1 + l / k ) k2 e. Recalling that f is continuous on the closed interval 1 t 1 M ( t ) 5 M < + G O . From (17.22) we derive that the series
5 T, we see that
converges absolutely in E,, uniformly in every closed subinterval of the open interval It1 < (Ce)-’(l - s ) : Its sum u ( t ) will be the sought solution of (17.15)-( 17.16). Of course, u is the limit of the u k as k + + co [in E, ,uniformly on compact subsets of the interval I t I < (Ce)-’(l - s)], and from the definition cf the u k we immediately obtain
Let now E be such that 0 < E < I - s. We know that u is a continuous function in the interval I t I < (Ce)-’( 1 - s - E ) with values in E,,, . Therefore, by virtue of (17.19), A(t)u(t) is a continuous function in the same interval but with values in Es . From (17.23) we derive that u is a C’ function in the same interval with values in E , . By taking E -+ +O we derive that u is a C’ function of t, / t i < (Ce)-’(l -s), valued in E,. That it satisfies (17.15)-(17.16) is obvious from (17.23). (11) t,
Uniqueness of the solution. We must show that if a C’ function of valued in E,, satisfies
I t I < T‘ (0 < s 5 I),
(17.24)
150
THE CAUCHY PROBLEM
[Chap. I1
it must vanish identically in that interval. The set of points where h vanishes is not empty, as it contains t = 0, and is obviously closed; we are going to show that it is open in 1-T, T[, which will imply what we want. Let to be any point such that h(to) = 0. Note that we have h(t) = /tA(t’)h(r’)dr’.
(17.25)
fo
Let s’ be any number such that 0 5 s’ < s. We shall prove, by induction on k = 0, 1, ...,that (17.26)
N,,(h(t)) 5 M,(t)(s - s‘)-k(Ce)k( t - to/&,
where MlW = SUP N,(h(r’)), the supremum being taken over the segment joining r and t o . Inequality (17.26) is trivial when k = 0. By (17.20) and (17.25) we have
+
+
where E = (s - s’)/(k 1). This immediately yields (17.26) with k 1 substituted for k. Now (17.26) shows that if It - t o ( < (Ce)-’(s - s’), then N,.(h(t)) = 0. But since E, is injected in E,. this implies that N,(h(t)) = 0. Q.E.D. Let us indicate right away that a “ holomorphic version” of Theorem 17.2 is valid. In this version, t is a complex variable, varying in the disk I tI 5 T. Hypothesis (17.19) is replaced by (17.27) Ifs‘ < s, A ( t ) is a holomorphicfunction o f t , I tI < T, valued in the Banach space of bounded linear operators E, + E,. . Hypothesis (17.20) stands unchanged, of course. But now we assume that f is a continuous function of t , It I 5 T, valued in E l , holomorphic when I t I < T. The proof of Theorem 17.2 remains valid: One must simply keep in mind that t is now a complex variable; the integrations are performed on straight line segmentsjoining the limits of integration. Since uniform limits (over open sets) of holomorphic functions are holomorphic, the statement of parts (I) and (11) in Theorem 17.2 can be kept unchanged, but then C’ must be interpreted in the complex sense, which is identical with holomorphic. The classical holomorphic version of the Cauchy-Kovalevska theorem follows at once from the holomorphic version of Theorem 17.2.
Sect. 173
151
THE CAUCHY-KOVALEVSKA THEOREM
Exercises
17.1. Denote -y CUR ( R > 0) the space of C" functionsf ( x ) in R" such that (17.28)
Rial sup sup --ja:f(x)\ U E Z + "x s R "
a!
< +a,
and equip '21Rwith the norm defined by the left-hand sidein(17.28). Provethat CUR is a Banach space and that every element f of Bt, can be extended as a holomorphic function of z = x + iy in the slab {x E R";( y ( < R}. Let 0 < Ro < R 1< + OC) and set E, = s 5 1. Prove that -s)Ro+sR,, 0 conditions (17,18), (l7.19), and (17.20) are satisfied when (17.29) where the coefficients A , ( j = 0, . . . ,n) are continuous functions of t , 1 t 1 < T , valued in CUR,. 17.2. Let g ( r ) be a continuous function in R,, real-valued, and denote by Re the space of measurable functionsfin R, such that e y e Lz.
(17.30)
Prove that R9, equipped with the norm
is a Hiibert space in which C:(R,) is dense. Let $(R") denote the space of functions in R"whose Fourier transform belongs to C,"(R,) (by the Paley-Wiener theorem, = Y n Exp, the space of the C" functions rapidly decaying at injkity which can be extended to c" as entirefunctions of exponenrial lype). Denote by K g the completion of G(R") for the norm (17.31)
ll4RS
= IIQlln..
Prove that K gcan be canonically identified to the dual of K g . What is the canonical antilinear isometry of the Hilbert space Ke onto its dual, E e ? Prove that if, for some constant C > 0,
(17.32)
-g(O
5 C log(\
+ I
VE, E R n
7
K e can be continuously embedded in a Sobolev space H s for some suitable real number s (see Definition 13.1).
THE CAUCHY PROBLEM
152
[Chap. I1
Prove that if (17.33)
g(t)/Iog(l
+ 151)
tends to f a
as
+
+co,
then K g is continuously embedded in C"(R") and all the Sobolev spaces H", s E R,are continuously embedded in its dual, K - g . 17.3. Let K g be the space defined in Exercise 17.2. Prove the following result : PROPOSITION 17.6. Suppose that (17.34) where d is a number strictly positive and a is a real number. Then any element u of K g can be identijed with a C" function in R" such that (1 7.3 5)
17.4.
Let K g be the space defined in Exercise 17.2. We shall take
(17.36)
g ( 0 =SP(t),
s E R,
where p is a function in R, , continuous, p > 0. Then let A ( t ) be an arbitrary continuous function in R, and set (17.37)
A(Dx)u(x)= (27c)-" Jei(x,c)A(t)&2(t) dt.
Prove the following result: PROPOSITION 17.7. Suppose that, for some constant C , > 0, (17.38)
IA(t) I 5 Co ~ ( t ) , t E Rn.
Then, for any pair of real numbers s' < s, the operator A(D,) [in (17.37)] dejines a bounded linear operator K S g+ KS'gwith norm 5 C, e - ' ( s - s')-'.
Derive from this that the Cauchy problem (17.39)
au
- - A(D,)u =f, at
(17.40) where u, E KsoP,f E C o ( [ - T , TI;Ksop),has a unique solution u which is a C' function of t in some interval 1 t I < 6, 5 T, valued in Kslp, where s1 < so.
Sect. 171
THE CAUCHY-KOVALEVSKA THEOREM
153
Can you give an upper bound for the interval radius 6, in terms of Co and (so - s,)? 17.5. Show that the Cauchy problem for the heat equation
(17.41) is well posed if we restrict ourselves to data and solutions which belong to suitable spaces K g where g satisfies (17.34) for an appropriate value of d (and, say, a = 0). What is this value of d? [Hint: Use the result established in Exercise 17.4.1 17.6. Consider the Cauchy problem (in one space variable)
Show that if the dataf, u o , u1 are valued in suitable Gevrey classes (Definition 3.3) with respect to the variable x, problem (17.41') has a unique solution valued also in such a class. [Hint: Apply the results of Exercises 17.3 and 17.4.1 17.7. Let 4 ( r ) be a continuous function of r > 0, nondecreasing, 4(r) > 0 for all r, B a complex Banach space. We recall that a function h in C is called an entire analytic function with values in B if it is a C' function RZ + B satisfying everywhere the Cauchy-Riemann equations. Denote by %+(I?) the space of such functions which satisfy
(17.42) where 11 ( I B is the norm in B. Prove that if we equip %+(B) with the norm defined by the left-hand side of (17.42), it becomes a Banach space. For 0 5 s 5 1, set E, = BI,(B) where +(r) = 40((l- s/2)r) for a given 4o (continuous, greater than zero, and nondecreasing). Prove the following: PROPOSITION 17.8. Given 0 5 s' < s 5 1, a/dz defines a bounded linear operator E, -+ E,. with norm not exceeding [exp b0(l)]/(s- s'). Apply this to the Cauchy problem
(17.43) where A is a bounded linear operator on B and the data f and uo are valued in %+(B) for a suitable choice of 4 (and f depends continuously on t in 1-T, TI).
154
[Chap. I1
THE CAUCHY PROBLEM
Can we modify the choice of the functions C#J so as to obtain results about the equation
au aZU -at= + + A
(1 7.44)
analogous to those described in Exercise 17.5? 17.8. Consider a first-order system with constant coefficients:
(17.45)
a
L=Z--
at
a. - A 0 ; C A j -azJ n
j=l
the coefficients Z, A j (0 < = j5 n) are m x m complex matrices (Zis the identity). Show that we can apply Theorem 17.1 choosing, as domain Oo , an arbitrary open ball B,(zo) = {zE C";I z - zo I < R } (R > 0). Obtain an upper bound for the radius 6, of the interval (or disk) in the t-line (or t-plane) in which the solution of the following problem exists:
(17.46)
Lu = f(r, z),
UI t=o
= u&),
where f is an entire function in C"" and uo an entire function in C",both valued in C". Show that 6, can be chosen independently of zo E C".Derive from this fact that the problem (17.46)has a unique solution u( t, z) which is an entire analytic function in Cn'', valued in C". 17.9. Let uo(z) be an entire function in the complex plane (valued in C). Show that if u = u(t, z) is an entire function in C2 satisfying
(17.47) we must have u(t, z) = uo(z/(l - zf)). Derive from this that the Cauchy problem (17.47) does not have, in general, an entire function as its solution (cf. the conclusion in Exercise 17.8). 17.10. Let H o , H' be two complex Hilbert spaces, H' ci Ho, with continuous injection. Let A be a bounded linear operator H i+ H o such that, for some constant co > 0,
(17.48)
c o ~ ~ uI I ~(Au, f u ) ~ , Vu E H'.
[The inner product in H o is denoted by (-,)o, the norm by 11 /lo .} Prove that, whatever the number t 2 0, (Z + tA/m)-" defines a bounded operator on H o , having a limit as m -,+ 00 in the sense of the strong convergence of bounded linear operators on Ho. Let us denote by e - r A this limit. Show that it has the semigroup properties
(17.49)
e-sAe-rA
- e-(s+r)A
7
e - j t=o
= I.
Sect. 171
THE CAUCHY-KOVALEVSKA THEOREM
155
For s 2 0, let E, be the image of H o = Eo under the map eCsAequipped with the norm (17.50)
Ile-SAhlls= llhllo *
Prove that E, is a Hilbert space for this norm, and that E, is dense in H o for all s 2 0. For s < 0 define E, as the completion of H o for the norm (lesAhl(o. Prove the following statements: (17.51) Given any two real numbers s > s’,E, can be identified with a dense linear subspace of E,. ; the natural injection E, 4 E,. is continuous and has norm 5 1. (17.52) If s > s‘, A defines a bounded linear operator E, + E,. with norm S e - ’ / ( s - s‘). Exploit these facts in order to solve the Cauchy problem (17.53)
du
- - n(t)Aiu =L dt
= uo,
u1 t=O
where II is an arbitrary complex-valued continuous function on an interval I tl < T,f a continuous function in that same interval, valued in E,, , and uo is an element of E,, (so is an arbitrary real number). Prove also that, whatever the real number s, E, can be “canonically” identified with the dual of E - , .What is the canonical antilinear isometry of E, onto its dual, E - , ?
Reduction of Higher Order Systems to First-Order Systems We have proved the Cauchy-Kovalevska theorem for first-order systems of linear PDEs and this may seem t o lack sufficient generality. We are going to show, however, that this is not so-precisely, that every determined system of linear PDEs of arbitrary order, in which the time variable plays a privileged role, can be transformed into a system of the kind (17.3), in such a way that any significant statement concerning the Cauchy problem for the former is equivalent to the analogous statement about the latter (for instance, as to the existence or the uniqueness of the solution). However, let us remark that it is quite easy t o modify the proof of the abstract Cauchy-Kovalevska theorem (Theorem 17.2) so as to have an analogous result for abstract differential equations in t whose order is 2 1. The modification of the proof that is required is not difficult and we advise the student to try to do it. We deal with a system of linear PDEs of order m > 1, of the following kind :
with Cauchy conditions
(18.2)
Oful r=o
= uk(x),
k =0, . . . , m - 1.
It is not true that every system of linear PDEs can be put into the form (18.1): The restriction 1 ct0 1 < rn in the summation on the right-hand side assigns a privileged role to the t variable. We return to this question later, in the case of a single (i.e., scalar) equation. Let us point out for the moment that the order of the system with respect t o t is equal t o m, no less, and that the coefficient of 07 in the system is the N x Nidentity matrix. The other coefficients C ~ ~ , ~t () Xare , N x N matrices; N is some positive integer. Their entries 156
Sect. 181
157
REDUCTION OF HIGHER-ORDER SYSTEMS
are functions of (x,t ) in some open subset SZ of R"" (or C"", or C" x R', if we wish t o consider complex variables.) For simplicity weshallalwaysassume that they are C" functions in Q. We also assume thatf(x, t ) is a C" function of (x, t ) in R with values in CN and that the uk(x)are C" functions of x in the x projection of Q, also valued in CN. We are going to perform a change of "unknown in (18.1)-(18.2). We set "
(1 8.3)
(1 J S
u j = D,,u
u0 = U ,
5 n),
unfl = D t u ,
and call U the vector ( u o , u l , . . . , u,, u , , + ~which ) is valued in C" with N' = N ( n 2). We make the substitutions (18.3) i n (18.1). We obtain an equation of the form
+
x-
n+ 1
(18.4)
D F - ' U , , + ~=
c ~ ~ , ~t )(Dxp, D t u j
+.f.
j = O ao+laldm-l uojm-2
There are several ways of achieving this, as one can easily see. We choose any one of these ways. Observe, on the other hand, that by differentiating m - 1 times the first n + 1 equations of (18.3) we obtain (1 8.5)
D,"-'u,
=D~-*U,,+,,
(18.6)
D,"-'uj
=
D~-ZDx,~,+ j
=
1, . . ., n .
We may combine Eqs. (18.4), (18.5), and (18.6) and write
DY-'U
(1 8.7)
C
=
ao+ la1 ( m -
C u o , a (t>D:ODoS: ~, U
+F
1
aojm-2
where the C,,,, are N' x N ' matrices and F is the A''-vector whose components are all zero except the last N ones, which are equal to$ The restriction ci0 < rn - 2 (which is all important) was made possible by the fact that the same restriction held in the right-hand sides of (18.4). (18.5), and (18.6). Next we must transform the Cauchy conditions ( I 8.2). We have
(18.9) (18.10)
D:uj
D:u,+,
= D,,uk(x),
k
= 0 , . . . , m - 2, .j =
ILO
It=*
= uk+'(x),
k =0,
. . . , m - 2.
These equations can be summarized as (18.1 I )
0:Ul
= t=O
Vk(x),
k =0, . . . , m - 2.
1, . . . , n,
158
THE CAUCHY PROBLEM
[Chap. I1
We have transformed the Cauchy problem (18.1)-(18.2), which is of order m, into the Cauchy problem (18.7)-(18.11), which is of order m - 1. We may pursue this reduction step by step until we obtain a Cauchy problem of order 1 (beyond which we cannot pursue it further). We raise now the question of the equivalence of statements about uniqueness and/or existence of solutions to the original problem (1 8. ])-( 18.2) and analogous statements about the transformed problem (18.7)-(18.11). The latter must be reinterpreted, however, in full generality; that is, the data F, V kmust be allowed to take all values in C" and not only the special kind originating in (18.1)-(18.2). (1) Suppose first that, given any choice of F and V k (0 5 k 5 m - 2), the Cauchy problem (18.7)-(18.11) possesses at feast one solution (we do not want and need not be specific about the regularity of this solution). Then this is in particular true when (18.12) k I; = (0,. . . , OJ), V k = (uk, DXIu , . . ., D,,uk, u k + ' ) (0 5 k 5 m - 2). We obtain a solution to (18.1)-(18.2) by taking u equal to the first N components of any solution U of (18.7)-(18.11) in the case (18.12) (what we have denoted uo above). (2) Suppose next that the problem (18.7)<18.11) admits two different solutions for some choice of the data [not necessarily submitted to (18.12)]. By subtraction we see that the homogeneous problem (18.7)-( 18.11) (where all data, F and the V k ,are identically zero) admits a nontrivial solution U . By the argument used in (I), the first N components uo of U constitute a solution of the homogeneous problem (18. I)-( 18.2). Suppose that uo were to vanish identically. By the equations (18.5) we could conclude that DP-*u,+, = 0, hence that u,+, is a polynomial with respect to t of degree Srn - 3. But from (18.10), where uk+' = 0, we derive that all the derivatives of order S m - 2 of with respect to t , vanish at t = 0. Thus u , , + ~= 0. Also, by (18.6) we see that D,"-'uj = 0 (1 2.j 5 n), hence u j is a polynomial with respect to t of degree s m - 2; the derivatives of order Srn - 2 of this polynomial at t = 0 are all zero, by (18.9), therefore the polynomial itself is identically zero. We have reached the conclusion that all the components of U are zero, contrary to our assumption. Thus u = uo cannot vanish identically and the homogeneous problem (18.1)-( 18.2) has a nontrivial solution. (3) If the homogeneous problem (18.1)-(18.2) (i.e., when f and the uk vanish identically) has a nontrivial solution u, it is clear that the homogeneous problem (18.7)-(18.11) also does. (4) We assume at last that the problem (18.1)-(18.2) has a solution for any choice of the dataf, uk and prove that the same is true of the problem (18.7)-(18.11) [without the restriction (18.12)].
Sect. 181
REDUCTION OF HIGHER-ORDER SYSTEMS
159
Observe first that we may assume the Cauchy data V k to be all identically zero. Indeed, it suffices to substitute
for U ( x , t ) . Of course, this requires a redefinition of F. Once this is done, we perform a further transformation on the " unknown " U, in the following manner. Let us write F = ( F o , F,, .. . , F , , F,, F,,,) where each Fj (0 s j 5 n + 1) is valued in CN.For j 2 n, we call wj the (unique) solution of the problem D , " - ~ ~ ~ = FD~: w, j
Lo
=0,
k = O ,..., m - 2 .
Then we substitute 0 = U - ( w o ,w l , . .. , w., 0) for U. We leave it to the student to check that (18.13)
(18.14)
=0,
k = O ,..., m - 2 ,
t=O
where F = (0, . . . , 0, F n + J But problem (18.13)-(18.14) is the transform of problem (18.1)-(18.2) where we make the choices f = F,+,, vk = 0 for all k . We know that it has a solution. We may retrace our steps and reconstruct the solution U of the problem (18.7)<18.11). The sought equivalence has been completely proved.
Remark 18.1. If we follow through our changes of unknowns from u to U and vice versa, we notice that certain basic features have been preserved: For instance, if the original system (18.1) has constant (resp.?polynomial, analytic, C", etc.) coefficients, the same is true of the "final" system (18.7) and if u is C" (resp. analytic, a polynomial, etc.) with respect to ( x , t ) , the same is true of the "end product " U ; and this is also true if we go from U to u.
Exercises 18.1. Apply the reduction procedure, from second-order equations to first-order systems, to the Laplace equation in n variables, and to the wave equation in n space variables (n 2 1).
160
[Chap. 11
THE CAUCHY PROBLEM
~ the system (18.1) be constant. We may 18.2 Let the coefficients c , ~ , in then define the determinant of its symbol, as the determinant of the matrix
(18.15)
IT"
-
c
Cao,uTQo(a.
ao+laljm ao>m
Prove that the determinant of the symbol of the transformed system (18.7) is equal to the determinant of (18.15) multiplied by a power o f t . 18.3 Let P(<,T) be a polynomial with complex coefficients, of degree m, in n + 1 variables, = (tl, .. . , <,) and T, of the form
<
P(<,T) = T"
(1 8.16)
-
1s m
ao+ 1.1 aosm
~ u o ~ u ~ u o < u (C,o,a
E
c).
By using the result in Exercise 18.2, prove the following: In order that the operator P(D,, D,) be transformable, by the procedure described in Sect. 18, into a hyperbolic first-order system (Definition 15.2), it is necessary and sufficient that it satisfy the property (18.17) there is a positive constant C such that, for every T E
C,i f P ( ( , z)
= 0,
5 E R,
and every
I
then necessarily Im T 1 5 C .
(In connection with this, we mention the important Definition 18.1. The operator P(D,, 0,)is called hyperbolic with respect to t if(18.17)holds [assuming that P(5, T) has the form (18.16); cf. Definition 15.21.) 18.4 Prove that the polynomial P ( t , T) of Exercise 18.3 has property (18.17) if its principal symbol (Definition 19.1) has the following property:
(18.1 8)
V< E R,,
4 # 0, the roots of Pm(<,.) are real and distinct.
[If (18.18) holds, P(D,, 0,) is called strongly, or strictly, hyperbolic; cf. Definition 15.3.1
19 Characteristics. Invariant Form of the Cauchy-Kovalevska Theorem As we have said, not every system of linear PDEs, not even every single equation, can be put into the form ( 1 8.1). We would like to understand what it means that it can. Obviously this is a feature of the principal part of the equation, that is, of its terms of order m. Let us write P ( x , t, D, , Q) = DY -
uo+
c c 1.
G0,u(X,
t>DP"DZ>
Qm
ao<m
Pm(x, t , D,, Dt) = 0," -
t)W'o;.
c ~ ~ , ~ ( x ,
uo+lul=m ua<m
We shall restrict ourselves to the scalar case, i.e., where the coefficients c ~ are complex-valued. The general case of systems is more difficult to analyze. We observe that, given any (x, t ) in the region where the equation is studied, we have Pm(x,t , 0 , 1) = 1.
More generally, suppose we were dealing with a differential operator P ( x , t , D x Dr) = 9
C
a u , , a(X7
t)DP"D:
C
auo,a(X,
t)D?DZ.
ao+Ialsm
with principal part Pm(x,
t , Dx Dr) = 9
+ la1 = m Assume that the coefficients are smooth complex-valued functions in an open set R c R"+l.We may "reduce" the equation uo
(19.1)
P(X>
t , D,, Dt)u = 9
to the form (18.1) if we know that (19.2)
P,(X,
t , 0, 1) # 0 , 161
( x , t ) E Q.
~
,
~
162
THE CAUCHY PROBLEM
[Chap. I1
Indeed, (19.2) means that the coefficient a,,,, does not vanish at any point of Jz and it suffices to take, in (1 8. l), Go,a = -a,, ,a / % ,
0
,
f = 91am.o.
One expresses the condition (19.2) by saying that the covector (0, 1) or, equivalently, the t direction, is not characteristic for P(x, t , D,, 0,)at any point of R. Let us introduce this notion of characteristic covectors in a somewhat more "invariant" setting. Assume that we deal with an open subset 0 of the Euclidean space R" where the variable is denoted by y ; we denote by q the variable in the dual space RN . We consider a linear partial differential operator
whose coefficients are complex C" functions in 0 (the requirement that they be C" is obviously too much; at the present stage they could as well be merely continuous). Set P J Y , q) =
C
I"] = m
aa(~h"9
where q is the variable in R, . Notice that P,,,(y, q ) is a homogeneous polynomial of degree M with respect to q whose coefficients are C" functions of y in 0. Definition 19.1. The function P,(y, q ) on 0 x RN is called the principal symbol of the differential operator P(y, Dy).
O f course, the differential operator P,,,(y, D,,) corresponding to P,(y, q ) is the principal part of P(y, Dy). Fixing y in 0, we consider the set of zeros of P,(y, q) as a function of q : C A Y ) = {V
E
RN; Pm(Y, V ) = 01.
Since P,(y, q ) is homogeneous (of degree m) with respect to q, for each y the set C,(y) is a cone with vertex at the origin. Definition 19.2. The set C,(y) is called the characteristic cone of P(y, DJ at the point y E 9. Every covector q E Cp(y)different from zero is said ro be characteristic with respect to P(y, D,) at the point y.
In the case of the differential operator P(x, t , D, , D,) considered above we see that for no ( x , t ) E R did the covector (0, 1) belong to the characteristic cone at ( x , t). We consider now a C' hypersurface S c 0.By this we mean a subset S of 0 defined as follows: Every point y o of 0 has an open neighborhood Uo such
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA
THEOREM
163
that there is a C' function cp(y) in U , with the following properties: grad cp does not vanish anywhere in U o ; S n U , is exactly the set of points y E U , such that cp(y) = 0. We may consider the normal direction to S at any point y E S n U o : It is the straight line in R,, passing through the origin, spanned by (grad cp)(y).Any nonzero convector q belonging to this line will be called a covector normal to S at the point y .
Definition 19.3. The hypersurface S E O is said to be characteristic at the point y E S with respect to P ( y , D,,) ifany normal covector q to S at the point y is characteristic with respect to P(y, D,) at that point, i.e., belongs to C,(y). Of course, if one covector q, normal to S at y , belongs to C,(y), the same is true of all the others. A hypersurface S is said to be characteristic if it is characteristic at every one of its points (this terminology is somewhat confusing but it is traditionally accepted). In our example above, the pieces of hyperplanes {(x, t ) E R ; f = const} are nowhere characteristic with respect to P(x, t , D, , D c ) .Furthermore, if a hypersurface S c 0 is noncharacteristic at some point y o [with respect to P(y, D,,)], we may choose new coordinates x = (XI,. . . , x"), t (where now n = N - 1) in an open neighborhood U , of y o such that S n U , is exactly the set of points (x, 0). Furthermore, in the new coordinates, if P(x, t , D, ,Dt)is the expression of the operator P(y, Dy)and if U , is sufficiently small, we will have Pm(x,t , 0, 1) # 0 for every (x, t ) E U , . The equation P(y, D,)u = g will be transformable into (18.1).
Example 19.1. The principal symbol of the Cauchy-Riemann operator +(d/dx + i d/dy) on R2 is .)(< - iq). The one of the Laplace operator Ax = (djax') + * . . + (d/aX")2on R"is - I ( 12. For none of these operators are there any (nonzero) characteristic covectors. In relation to Example 19.1 we introduce the following important definition:
Definition 19.4. The differential operator P(y, Dy) in 0 is said to be elliptic at the point yo E 0 if C,(yo) = {0},i.e., i f there are no characteristic covectors at y o . The operator P(y, D,) is said to be elliptic in a subset 0, of 0 if it is elliptic at every point of 0,. The set of points where a given operator is elliptic is obviously open. The Cauchy-Riemann operator on R2 and the Laplace's on R" are everywhere elliptic.
Example 19.2. The principal symbol of the heat operator ajar - A, on R"+l is Its characteristic cone (at any point) is the one-dimensional linear subspace of the (t, T) space R,+ I defined by 5 = 0. The same holds for the Schrodinger operator.
/
164
THE CAUCHY PROBLEM
[Chap. I1
Example 19.3. The (principal) symbol of the wave operator a 2 / d t 2 - Ax on R"+I is 1 ( I 2 - 72. Its characteristic cone (at any point) is the surface of the light-cone 15 1' = 72. Note, however, that if, instead of the form of the wave operator above, we consider the form
with V > 0 but not necessarily Y = 1, the characteristic cone is defined by the equation
vz I fy= T 2 .
(19.3)
Now, unless V = 1 , this is not the equation of the surface of the light-cone. It is much more preferable to consider the light-cone as a subset of R"", not of the dual space R,,, (after all, remember that the light-cone is meant to contain the supports of the remarkable fundamental solutions E , and E of the wave equation-and these are distributions of R"", not of R,+l). Then the equation of the surface of the light-cone is (19.4)
1x12 = V 2 t 2 .
The relation between (19.3) and (19.4) is not difficult to find: Any normal direction to the characteristic cone [given by (1 9.3)], where such a direction is defined, that is, at every point of the cone which is not its vertex, can be regarded as a straight line in R"" passing through the origin; the union of all these straight lines is the surface of the light-cone [given by (19.4)]. Note that the covector given by 5 = 0 , 7 = 1 does not belong to the characteristic cone of the wave operator. This means that the hyperplanes t = const are nowhere Characteristic. Example 19.4. Consider, in R2, the first-order operator
L
= ajat
-
a(x, t ) a p x
where a(x, t ) is a real-valued C" function. The characteristic cone of L at the point (xo , to) is the straight line in the plane R2 of the variables (t, 7)defined by T = a(xo,t o ) ( . Consider now the curve x = x ( t ) in the (x, t ) space defined by the equation dx - = -a(x, t ) , dr
=xo I
= fo
The tangent direction to this curve at the point ( x o , t o ) is spanned by ( -a(xo, to), 1). The normal t o the curve at ( x g, to) is therefore characteristic with respect to L. This is of course true at every point of the curve. We reach the conclusion that any curve dejned by conditions (19.5) (possibly with dif-
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA THEOREM
165
ferent x , , t o j is characteristic with respect to L (at every one of its points; we view here a C’ curve in R2 as a hypersurface in RZ,which it is). Conversely, let y be any C’ curve in R 2 everywhere characteristic with respect to L. Let ( x , , to) be any one of its points and let cp(x, t ) = 0 be an equation of y in an open neighborhood U , of ( x o , to). We know that grad cp = (cp,, cpt) does not vanish near (x, , t o ) , say in I/, , and furthermore, that cpr = acp, in U, n y. In particular this requires that cpx never vanishes in a neighborhood of U, n y , which we can take t o be U, itself. The vector field ( - q , / q x , I ) is tangent t o Uo n y at every point, and we have just seen that it is equal to ( - a , 1). This means that the conditions (19.5) define y in U , . Thus, every characteristic curue of L is (locally) dejined by conditions (19.5). We may summarize this in the following way: The characteristie curves of L (viewed as a differential operator) are exactly the integral curoes of L (viewed as a vector field). It should be pointed out that this statement only makes sense when the number of independent variables is equal to two: Otherwise “characteristiccurves should be replaced by characteristic hypersurfaces which cannot be identified with “ integral curves (a meaningful concept in all dimensions). There is, however, a way to salvage the statement above, but this requires the notion of bicharacteristic curue (see the Appendix to the present section).
”
“
”
”
We conclude this section by stating an invariant version of the CauchyKovalevska theorem. By invariant we mean here coordinates independent : in other words, there are no more time or space variables, and hyperplanes (such as the one on which f = 0) should not have any privileged role. We deal with an open subset % of R N and with a linear partial differential operator P(y. D y ) of order m > 0 in 12/. whose coefficients are analytic functions in a. Furthermore, we are given an anal.kltic hypersurface C in % which is nowhere characteristic with respect to P(y, Dy)and whose exterior normal at every point is well defined. “
”
THEOREM 19.1. Let f be an analytic function in a,u, (0 6 j 5 m - 1) rn analytic functions in C. There is an open neigiiborhood V of X in and a unique analytic function u in V such that P ( y . D J u =f
(19.6) (19.7)
for euery j
= 0. . . . , m -
in Y ,
1, (d/av)ju = u j in C.
We have denoted by d/dv the partial differentiation in the exterior normal direction to C.
Proof. It suffices to prove that every point y o of C has an open neighborhood V ( y o ) where there is a unique analytic function u(j3; y o ) satisfying P(y, Dy)u = 0 in $‘-(yo) and (2/Sv)ju = t i j (0 = < j< m)i n Y ( y o ) n Z. Indeed, if Y ( y , ) n V ( y , ) # 0, by the uniqueness we would have u ( y ; y o ) = U(J>; y l ) in
166
[Chap. II
THE CAUCHY PROBLEM
this intersection and there is an analytic function u in the union -Y of all the Y(y,), yo E C,whose restriction to each Y(y,) is equal to u(y; yo). If the neighborhood Y ( y o )is sufficiently small, we may perform a change of variables so as to “flatten” the hypersurface C n Y(y,): In the new coordinates, say xl, . . .,x“,t (n = N - l), this pieceof hypersurfacebecomesa piece of the hyperplane t = 0. As pointed out at the beginning of this section, the fact that C is nowhere characteristic implies that the expression of P(y, Dy) in the new coordinates is of the form 4 x 9
t)D;“ +
m
C j=
Qj(x, 1
t, D X ” - ’ ,
where a(x, t ) and the coefficients of the Q,(x, t, D,) are analytic functions in Y(y,), a(x, r) vanishes at no point of this set, and the degree of Q,(x, 1, with respect to 5 is not greater thanj. After division by a(x, t ) we may transform our equation into a first-order system of the kind (17.3) and the Cauchy conditions into conditions of the kind (17.4). After extending the coefficients and the data to the complex domain we may apply Theorem 17.1 (or rather the version of Theorem 17.1 where the coefficients and the data are assumed to be holomorphic with respect to t , see the end of Sect. 17). We easily reach the desired conclusion, namely that our local Cauchy problem has a unique Q.E.D. analytic solution.
r)
Remark 19.1. One could call Theorem 19.1 the “ real-analytic” version of the Cauchy-Kovalevska theorem. There is a “ holomorphic version of the same, which in a sense is to be preferred to the real-analytic one. In the holomorphic version we must assume that % is an open subset of C” and C is an analytic submanifold of codimension 1 of %: By this we mean that every point z,, of 4 has an open neighborhood U , where there is defined a holomorphic function cp,(z) such that C n U , is exactly the set of points z of Uo satisfying the equation cp,(z) = 0 and such, moreover, that grad cp does not vanish at any point of U,. We deal then with a differential operator P(z, 8/82) of order m,with holomorphic coefficients in %. We assume that E is nowhere characteristic with respect to P(z, djdz) which means that, in any neighborhood U , such as the one above, ”
P,(z, grad cp,(z)) # 0
if
2E
U, ,
where cp, is a holomorphic function defining U , n C as before. We also suppose given a holomorphic vector field d/dv in an open neighborhood %’ c % of & normal to E at each one of its points, that is, a linear combination
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA
167
THEOREM
with coefficients yi which are holomorphic functions in 4’. Furthermore, on Uo n Z, where U , is the neighborhood of zo considered above, we have
where 1 is a holomorphic function, nowhere vanishing, in U , n X dependent ofj). Then we may state
(A is in-
THEOREM 19.2. There is an open neighborhood Y of Z in 42 such that, to every holomorphic function f in 4 and to every set of rn holomorphic functions on C , there is a unique holomorphicfunction u in Y such that uo , . . . ,
(
(19.8) (19.9)
a z>
P z,for every j
u=f
= 0,. . ., m
-
in-Y,
1, (d/az)ju
= uj
in C .
Not only does Theorem 19.2 imply (trivially) Theorem 19.1, but it adds precision to it: Indeed, the defect of Theorem 19.1 is that the neighborhood Y of Z in its statement depends not only on the hypersurface Z and on the operator P(y, 0,) but also on the dataf, uj . This is not so in Theorem 19.2! As a consequence, Theorem 19.2 enables us to specify on what properties of the dataf, uj , the choice of Y in Theorem 19.1 does depend. It depends on the “size” of the subsets of the complex space to which these data can be holomorphically continued.
Appendix
Bicharacteristics and the Integration of the Characteristic Equation
In this Appendix, we consider a linear partial differential operator P(y, D,,) of order m, with coefficients defined and C“ in an open subset of RN, 0. We assume that its principal symbol P,(y, q) is real. We then associate with this principal symbol the following system of 2N equations : ( 19.1 0)
to which we adjoin the (19.1 1)
“
initial
y =yo,
”
conditions :
q =qo
when t = O .
Here yo is an arbitrary point of 0,q0 an arbitrary point of RN\{O}. By the fundamental theorem about ordinary differential equations, the problem
168
[Chap. I1
THE CAUCHY PROBLEM
(19.10)-(19.11) has a unique solution, at least for small values of I t ] . The solution (19.12)
Y = m o q 7
0 9
9,
It
= S(Y0 7
V01
t)
is smooth with respect to all the arguments; for t fixed, and sufficiently close to zero, it defines a C" diffeomorphism, ( y o ,qo) H( y , q), in sufficiently small subsets of 0 x (RN\{O}). The equations (19.10)-( 19.11) are known as the Hamilton-Jacobi equations for P,. If the (total) gradient of P, does not vanish in a full neighborhood of ( y o , qo), the solution ( y , q), given by (19.12), describes a true curve through ( y o , qo). The tangential differentiation along this curve is given by the Hamiltonian jield of P, , (19.13) This is a vector field over 0 x RN which, it ought to be pointed out, is welldefined even when the solution of (19.10)-(19.11) is not a curve (i.e., is a single point), or even, for that matter, when P, is complex (in which case there might not be any solution at all!), The integral curves of H,_ are obviously curves on which the function P, remains constant, since HP,Pm = 0. Therefore, if any one of them meets the characteristic set of P ( y , D,),i.e., the set ( 19.14)
{(Y, V ) E 0 x (RN\{Ol); Pm(y7 V ) = 01,
it is entirely contained in this set. Such an integral curve of H P mcontained , in (19.14), is called a bicharacteristic strip of P,; its projection in the y space, that is, in 0, is called a bicharacteristic curue (or simply a bicharacteristic). Of course, it might reduce to a single point. It is a true curve, in 0, when grad, P, does not vanish on it. The preceding notions are important in various aspects of PDE theory, in particular in the matter of solving the characteristic equation : (19.15)
P,(y, grad w) = 0,
in some open neighborhood CJ of y o . Usually, one seeks a reaf-valued solution w, defined and C" in U, moreover satisfying a condition
(19.16)
grad w
=qo Y =Yo
In order that (19.15) and (19.16) be compatible we must have (19.17)
P m t y o yo> = 0, 7
in other words ( y o , qo) must belong to the characteristic set (19.14),
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA THEOREM
169
It is not always possible to find a (smooth) solution w to (19.15)-(19.16). We are going to show that, under certain favorable circumstances, it can be done. We are going to make the following assumption: (grad, Pm)(yo 1') z 0.
( 19.18)
9
Then let 0 be a vector (in R") not orthogonal to grad, P,(yo, qo). We may choose the coordinates in R" in such a way that 0 becomes the unit vector to the yN-axis.This implies at once that (19.19)
apm
-does not vanish in some neighborhoodN o f ( y o ,qo).
as N
We may therefore apply the implicit functions theorem and write
(I 9.20)
p , ( y ,q) =
Q(Y7
V)[~N
qi,
. ..
t
V N -111
inx,
provided that is small enough. Both Q and 2 are C" functions in N ,and Q does not vanish at any point of N.Consequently, the characteristic set (19.14) is defined, in M , by the equation (19.21)
)?N
='(y>
?1?
* . . )q N - 1 ) '
On this set [which is a piece of hypersurface in 0 x (RN\{O})], the Hamiltonian field of P, is equal t o that of the factor q N - A(y, ql, . .., q H multiplied by Q; hence its integral curves, that is, the bicharacteristic strips of P,, can be defined by the Hamilton-Jacobi equations for that factor; that is (19.22)
_- - -an
ayj
at
aqj
9
j = l , ...,N - 1 ;
aY
-=l;
at
Several observations can be made, on these equations: First of all, the last equation in (19.22) shows that we can take the variable y N as the parameter along the bicharacteristic strip. Second, all the right-hand sides are independent of q N ,which must be given by (19.21), since the strip lies entirely on the hypersurface defined by (19.21)-as we have observed earlier. It is therefore convenient t o change notation and to write t instead of y N ; we shall write xj instead ofy'and tiinstead of q j i f j < N . In fact, we shall write n = N - 1, to further underline the analogy with a decomposition of the space RN into a space hyperplane (where x varies) and a time axis, where the variable is t . The system (19.22)-(19.23) reads, in the new notation, (19.24)
[Chap. I1
THE CAUCHY PROBLEM
170
where the subscripts mean differentiations. We may further simplify the picture by choosing the coordinates (x,t) in Rnfl so as to have t = 0 at the point y o , and those in the dual R,+' so as to have 7 = 0 at the point qo (T is the " covariable " corresponding to t ;in the new coordinates, qo will be represented by (to,0); we cannot have to= O since we do not have qo = O ) . We shall obtain a solution of (19.1 5)-( 19.16) if we solve (19.25) THEOREM 19.3. If the principal symbol P,(y, q) is real and if (19.17) and (19.18)hold, the problem (19.15)-(19.I6)has a smooth solution in a suficiently small neighborhood of yo . The solution cannot be unique (unless N = I , and even then uniqueness can only be understood modulo constants), since every solution of (19.25) (which is the problem we are going to solve) is a solution of (19.15)-(19.16), and the choice of the time direction t could have been made otherwise. Note also that the "initial " condition in (19.25) could have becn chosen differently, e.g.,
o=tl'w
=t0+0(~x-xo~~).
Proof. This proof is based on the following observation. Let (19.26)
x =F(xi, t , 5'h
t = G(x1, f , t'),
represent the solution of (19.24) equal to (xl,5 ' ) when t = 0. We look at the restriction of w,(x, r) to a bicharacteristic of P, , say to the bicharacteristic obtained by projecting into x space the curve defined by (19.26). In (x, t ) space (i.e., in 0)this bicharacteristic goes through the point x = xlrt = 0. Let us write O ( t ) = WX(F(XI7 t , t'), t ) .
We have
dB - = w, f', dt
+ w,,
= u x , t , e)
=w ,,
F, + &(x, t , 0) f &(x, t , 0)w,,
+ w,,{~~(x, t , 6) - l ~ xt,, G)).
We observe that, by the second equation (19.24), this is satisfied if 0(t) = G(x,, t, 5'). But since 0(0) = to,we must have 5' = 5'. Thus (19.27)
w,(F(xi, t , to),r)
= G(x1, t ,
5').
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA THEOREM
171
The solution of our problem follows easily from (19.27). We are interested in w,(x, t ) and therefore we must express x1 in terms of x = f(x,,t , 5") : x 1 = x l ( x , t , to),and (19.28) w,(x, t ) = G ( x l ( x ,t , 5'1, t , 5')If we put this into (19.25), we obtain
(19.29)
w(x, 0
=(x,
to)+ fA(x, s, G(x,(x, 3, 5'1, 0
s, to)> ds.
In order to check that (19.29) indeed defines a solution of (19.25), it suffices to prove that (19.28) holds, and for this it suffices to show that
ax ax, A&x, t , G) - -ax ax, at vanishes, which follows from the fact that F(xl(x, t , to),t , to)= x , that is,
= - A&F,
t , G)
aF ax, +ax, at
Q.E.D.
Under hypothesis (19.18), if w is a solution, in an open neighborhood U of y o , of (19.15)-( l9.16), the pair ( y , grad w(y)), regarded as a point in U x R N , varies on a smooth hypersurface, the characteristic set of P, , (19.14). This set is fibered by the integral curves of the Hamiltonian field H p , , (19.13). When y varies along a bicharacteristic of P, contained in U , ( y , grad w(y)) varies along a bicharacteristic strip of P, contained in U x R, . Exercises 19.1. Consider the operator in
RN ( N 2 2)
(19.30) Describe its characteristic set [see (19.14)], its bicharacteristic strips, and its bicharacteristics through the origin in R"' (see the Appendix to this section).
172
[Chap. I1
THE CAUCHY PROBLEM
19.2. Consider a first-order operator in RN, L=
c .'(yj N
.
j=1
a
-.,
dy'
with real coefficients d,defined and C" in an open subset of RN,0 , under the hypothesis that N
(19.31) Show that there is identity between the integral curves of the vector field L and the bicharacteristics of L. Redo the proof of Theorem 19.3 in the case where P,(y, 0,) = 1 L and indicate the analogy with Exercise 16.1. 19.3. Suppose that the principal symbol P,(y, q ) has analytic coefficients (in some open subset 0 of R"')and extend it as a holomorphic function of ( y , r,~) in 6x C,, where 8 is an open neighborhood of 0 in C,. Prove that the characteristic equation (19.15) can be integrated in the complex sense, i.e., that we may find a solution w , holomorphic in a (complex) neighborhood of yo E 8,satisfying (19.16) where now q o is any nonzero complex N-vectorprovided that we make the hypothesis (1 9.18). Describe the complex characteristic set of the Laplacian in N variables and its complex bicharacteristics. Solve (19.15)-( 19.16) when P,(y, q ) = - I q l2 and qo is an element of CN\(O} (take y o = 0). 19.4. Suppose that the principal symbol P, has constant coefficients and write then P, =P,,,(q). Under assumption (19.18) show that Q and d in (19.20) can be chosen independently of y . In this case, solve problem (19.25) first directly, then by following, step by step, the proof of Theorem 19.3. 19.5. Let S be a C" hypersurface in RN, passing through the origin, and characteristic at that point with respect to the operator (19.30) (Definition 19.3). Let To denote the bicharacteristic strip of this operator passing through (0, q0) where qo is the covector normal to S a t 0. Prove that the corresponding bicharacteristic (i.e., the projection in RN of the strip r,) is necessarily tangent to S at the origin. Give an example of such a hypersurface S which is tangent at the origin to more than one bicharacteristic of (19.30). 19.6. Consider the following operator in C 2 :
-6
(19.32) Describe all the solutions h = h(z,, z 2 )of the homogeneous equation (19.33)
LIZ
=o,
which are holomorphic in the polydisk (19.34)
{(zit 22)
E
C2; /z1I < ri,
1221
<~21,
ri, r2 > 0.
Sect. 191
CHARACTERISTICS. CAUCHY-KOVALEVSKA
THEOREM
173
If /I(z,, z 2 )is such a solution, prove that liO(zl) = h(z,, 0) can be extended as a holomorphic function in the disk ( 19.3 5 )
{zl E
+
c1; IZI I < Y,
r2).
Derive from this that the Cauchy problem for (19.33) with Cauchy condition h = hO(z,)
(19.36)
when
I z1 I
< r l , z 2 = 0,
does not have, in general, a holomorphic solution in (19.34). Describe the maximum set to which one can extend holomorphically all the solutions of (19.33) defined and holomorphic in the polydisk (19.34). 19.7. Let P ( D J denote the differential operator (19.30) and consider the Cauchy problem (19.37) P( Dy)h = 0,
2h
(19.38) / i ( y ‘ ,0) = hO(y’),
J.”
- (I,’,0 ) = h,(y’), ayN
1
= (11
, . . . ,yN-’).
Let B‘ be the open unit ,ball in RN-’. Describe the maximum open subset S of R N to which the solution / I of (19.37)-(19.38) can be extended as a C“ function-wliatever the Cauchy data h O .h, belonging to C “ ( B ’ ) .What is the relation of S to the bicharacteristics of P ( D y ) ? 19.8. Let P ( y , D y ) be a linear partial differential operator of order m 2 0, with coefficients defined and C“ in an open subset 0 of RN.Let PJy. q ) be its principal symbol. Let y ~ y =‘ $(y) be a C“ mapping of 0 onto another open subset, 0’, of RN,which is bijective and whose Jacobian determinant does not vanish at any point of 0. Define the transform P“(y’, D y e )of P(y, Dy)by the formula u E CYO’). P%’, Dy,)u(y’)= [P(Y,D,)u($(y))l,= 2 (,,’) Show that P@(y’,D,,) is a linear partial differential operator of order m with coefficients defined and C“ in 9’, whose principal symbol is related to that of P ( y , D,,) by the formula
(19.39)
1
-1
(19.40)
P,%Y’, 4) = P,(
4 (Y’), ‘J&’)q’),
Y’ E O r , 9’ E R N 1
where ‘J,(y’) is the transpose of the Jacobian matrix of
4
evaluated at the
-1
point 4 (y’). 19.9. Apply formula (19.40) in the case where 0 = 0’ = R2, the diffeomorphism 4 is represented by the change of coordinates (19.41)
y = X,
s =t
+ :x*,
and the operator under study is P = dZ/dx2- 8/32. Give the total expression of the transformed operator P“, and derive from this that the lower order terms are not invariant [whereas the principal symbol is, in the sense of transforming according to the law (1 9.40)]. “
”
20
The Abstract Version of the Holmgren Theorem In this section we study certain uniqueness results which follow from the dual form of the abstract Cauchy-Kovalevska theorem (Theorem 17.2) and which are going to be used (in Sect. 21) to derive Holmgren’s theorem. We deal with a scale of Banach spaces {E,} (0 5 s 5 1) as in Sect. 17; they satisfy condition (17.18). We are given an operator A ( t ) satisfying (17.19) and (17.20), but now we add the following hypothesis: (20.1)
If 0 4 s’ < s 5 1, E, isdense in E,. .
It implies that the transpose of the natural injection of E, into E,. is an injective continuous linear map of the dual of E,. , E,’., into the one of E, , EL . We shall refer to the latter map as the natural injection of EL. into Ej . Since transposition preserves the norm of the operators, this injection has nom 4 1. As a matter of fact, we set 0 5 s 5 1, F, = (El -,)’, and we see that the Banach spaces F, form a scale analogous to the scale {E,}. Let ‘A(?)denote the transpose of the operator P ( t ) ; it is seen at once that properties (17.19) and (17.20) hold if we substitute F, for E, (for each s) and ‘A(t)for A ( t ) (for each t). This means that Theorem 17.2 is valid with these substitutions. As we shall use it, let us stress the uniqueness part.
LEMMA 20.1. Suppose that (17.18), (17.19), and (17.20) hold. Suppose also that (20.1) holds. Let v(t) be a C’ function oft, I t I < T , valued in Ei for some s > 0 and satisfymg, for I t 1 < T, the homogeneous equation dv - = ‘A(t)v. (20.2) dt If v(0) = 0, the function v vanishes identically in the interval ] - T, T[. 174
Sect. 2G]
ABSTRACT VERSION OF HOLMGREN THEOREM
175
We wish to take a look at distribution solutions of (20.2) (that is, distributions in the variable t : no other variable is involved here; on the subject of distributions valued in a Banach space, we refer the reader to Sect. 39, although only the most elementary information will be needed-and provided-here). For this to make sense we must strengthen our smoothness requirements on the operator 'A(t), that is, on A ( t ) . Incidentally we use the following lemma, whose proof we leave to the student: LEMMA 20.2. Suppose that the following holds:
(20.3)
O ~ s ' l,A(t)isaC"functionof < s ~ t,Itl
~
Then the analogous property holds for the transpose 'A(?):
(20.4)
i f 0 5 s' < s S 1, ' A ( I ) is a C" function o f t , I t ] < T, valued in the Banach space of bounded linear operators E,'. -+El.
As a matter of fact, the same is true if we substitute C"' with m < + co for C" or if we substitute analytic (or holomorphic when t is a complex variable). We suppose in the sequel that (20.4) holds. Note that E; is the smallest of the duals Ei (it has the largest norm). If u is a distribution in the interval 1-T, T [ with values in E J , 'A(t)v is a distribution in the same interval, with values in E,' for any s > 0. We shall be interested in distributions which can be written in the form
(20.5)
a
v=a:g,
d ' -dt' --
where g is a continuous function of t , I t I < T , valued in E; , and vanishing for t < 0. In this simple case, the multiplicative product 'A(t)v is particularly easy to define (although it is very easy also,in the general case; see 539.4). Indeed, we have, by the Leibniz formula, k
(20.6)
'A(t)a:g =
1( - i ) Q f [ ~ ( ~ - ~ ) ( t ) ~ ] .
I=O
Since 'A satisfies (20.4), the g1 = (- l)'[d:-'('A)]g are continuous functions of t , I t I < T , valued in E,' (for an s > 0). We shall exploit the following regularity result : LEMMA 20.3. Suppose that (17.18), (20.3),and (17.20) hold, and also (20.1). Let u be the distribution given by (20.5). If u, - 'A(t)u is a C" function o f t , I t I < T , valued in E; , then u itself is a C" function o f t , I t 1 < T , valued in E I , for any s > 0.
176 ProoJ
[Chap. I1
THE CAUCHY PROBLEM
By (20.6) we have
c k
j-= v, - 'A(t)u = a:+1g -
afgl.
1=0
But we may write
f = dF+lf,,
g r = dF-r+lh1 ,
with h, (0 5 I =< k ) andf, vanishing for t < 0 (this determines them uniquely). Note furthermore that fl is a C" function of t , I t I < T , valued in EA and that, if g is a C" function of t, I t I < T , valued in every E l , s > 0, h = h, is a C"" such function (recall the definition of g l ) . We have a:+'(g - h
-A)
= 0,
which means that g - h -fl is a polynomial o f degree But since it must vanish identically for t < 0, we have
sk (valued in E,').
g = h +fl.
We know that g is Co (with values in E l , s > 0), hence h is C', henceg is C1, hence h is Cz, and so on. We reach the conclusion that g is a C" function oft, I t I < T , valued in E,' (s > 0), and Lemma 20.3 is proved. COROLLARY 20.1. Using the same hypotheses as in Lemma 20.3, let u be given by (20.5). I f v, = 'A(t)u in ] - T , T [ ,we have u = 0 in this same interval. ProoJ: By Lemma 20.3 we know that u is a C" function o f t , I t 1 < T, valued in E,' for any s > 0. By Lemma 20.1 we may then conclude that u = 0. Q.E.D. We describe now the situation to which we are going to apply the preceding considerations. We introduce a one-parameter family of potydisks 8, (0 5 s 5 1) in C": 0 , = { z ~ C " ; l z j l < r o + s d o , j = l ,..., n }
(rO,do>O).
Notice that
(20.7)
If 0 I s' < s 5 1, 8,.c 0,and dist(0,. , do,)
= do(s - s').
We are going to choose (20.8)
LEMMA 20.4.
E,
= H(B,;
Cm)
(see Definition 17.1).
With the choice (20.8), condition (20.1) holds.
ProoJ If we reason componentwise, it suffices t o prove that if s'<s, H(d,) is dense in H(B,.). Let h be an arbitraryelement of H(b,.) and set h,(z) = h[(l - d)z], 0 < 6 < 1. As h is uniformly continuous in a,,, h, converges to h
Sect. 201
177
ABSTRACT VERSION OF HOLMGREN THEOREM
uniformly on 8,. when 6 + +O. But / I , can be extended to (1 + S)B,. as a continuous function, holomorphic in the interior of this polydisk. The Taylor expansion of h , converges uniformly to h, on every compact subset of this interior, in particular on as,.By the diagonal process we obtain a sequence of polynomials in z converging to h in H(8,.). Q.E.D. We shall beinterested in distributions u of the kind (20.5) defined bydistributions with respect to (x, t ) whose support has an x projection contained in a fixed compact subset K of 0, n R". How does this definition work? By possibly decreasing T , we may restrict ourselves to distributions 24 =
z a,a0o:
fao,
a(& t ) ,
ao+laljk
where thef,,,. are continuous functions of ( x , t) valued in C" (k is an integer at least zero). We assume that u vanishes identically when x 4 K c c 0, n R". We may then choose the functions f,,, a vanishing for x 4 K', where K' is an arbitrary compact neighborhood of K contained in 0, n R".Let us then set gao(x, 2 )
=
C
X f a o , a ( x , t).
la1 S k - ao
This distribution gaodefines a continuous function t , the dual of H(8,; C"), by the formula
I t f < T, with values in
(20.9) This makes good sense, since the integration can be restricted to the compact subset K' of 0, n R". Let us denote by Gao(t)the linear functional (20.9).By Cauchy's inequalities we have sup Io;h(x)I 5 a ! d-l"l sup lh(z)l, z E 80
XEK'
where d = dist(K', 130,). This shows that (20.9) is continuous on H(8,; C"). Likewise,
which implies at once that G,,(t) is a continuous function of t , in H(8,; C")'. We write then
Observe that if the G,, vanish for t 5 0, we may write G
a0
=
8k-Q I
GO?
It )
< T , valued
178
THE CAUCHY PROBLEM
[Chap. I1
where G:o is also continuous in 1-T, T [ [with values in H(0,; Cm)']and vanishes for t < 0. Setting then
we obtain the expression (20.5). Thus with the distribution u in the (x, t ) variable we have associated the distribution u with respect to t , with values in H(Q,; Crn)'. As we shall seek a uniqueness theorem, we want t o know whether the fact that u vanishes identically implies that u also does. We may evaluate v on test functions of the form hcp(t) where h is an arbitrary element of H(8,; Cm) and cp an arbitrary C" with compact support in ] - T , T [(and complex values). From our definition we have (0,
h d t ) ) = ( u , h(x)cp(t)),
where the brackets express the appropriate dualities. Thus if u = 0, u vanishes on all products of the form h(x)cp(t)where cp E Cp(] - T , T [ )and h is the restriction to 0, n R" of an arbitrary element of H ( a 0 ; Cm),in particular when h is an arbitrary polynomial on R". But linear combinations of products of the form h(x)cp(t) are dense in CF(R" x ] - T , T [ ) (this is a variant of the Stone-Weierstrass theorem), hence u must vanish identically. Let us summarize by stating that (20.10)
the correspondence u H u is injective
Remark 20.1. We have considered distributions in ] - T , T [ which vanish for t < 0; but the origin could obviously have been replaced by any other point to of the open interval 3 - T , T [ .Also we could have derived analogous results about distributions vanishing for t > 0.
Consider now a differential operator (20.11) where the A j (0 5 j 5 n ) are m x m matrices with entries which are holomorphic functions of (z, t ) in an open neighborhood, in C"", of some closed polydisk containing 0 , x ] - T , TI (we momentarily regard t as a complex variable). It is clear that when z = x E R" and t is real, (20.1 1) acts on distributions like u considered above. Let us denote by A ( t ) the action of (2.1 1) transferred to the distribution in t , v, associated with u. In other words, A(t)v is associated, in the way described earlier, with 2l(t)u, where
a C A j ( x , t ) -ax' . + A,(x, t ) . n
%(t) =
j= 1
Sect. 201
ABSTRACT VERSION OF HOLMGREN THEOREM
a
n
(20.13)
A’(t) = -
”
C ‘ A ~ ( zt ,) + ‘ A ~ ( zt ), - C azJ
j= 1
j= 1
179
a‘Aj
azj (z, t)
(‘Aj is the transpose of the matrix A j ) . This follows by integration by parts. Now, it is clear (cf. Sect. 17) that A’(t) defines a bounded linear operator H(0,; Cm)-+ff(fi,,;Cm).What formula (20.12) tells us is that A ( t ) is equal to the transpose of A’(t). It is clear that (20.3) and (20.4) are satisfied if we substitute A’ for A and we choose E, according to (20.8). By virtue of (20.7) we have the analog of Proposition 17.5 with A’ substituted for A . Then (17.20) holds. In view of Lemma 20.4 all the hypotheses in Lemma 20.3 are fulfilled. We may therefore apply Corollary 20.1.We reach the following conclusion, which as we shall see is one of the versions of Holmgren’s theorem:
THEOREM 20.1, Let u be a distribution in (0, n R”)x 1-T, T [ vanishing when x does not belong to a certain compact subset I( of 0, n R”. Suppose that du - = 9l(t)u (20.14) dt and that u = O for t < 0. Then u = O in (0,n R”) x 1-T, T [ .
The reader might object that u is not necessarily a finite sum of derivatives of continuous functions in (0, n R”)x 1-T, T [ as we have supposed in the preceding argument. But u equals such a sum if we replace 1-T, T [ by any subinterval 1-T‘, T’[,T’ < T . We have seen that u = 0 if I tl < T’, hence if I t I < T as one sees by taking T’ --+ T.
Exercises 20.1. By modifying (slightly) the proof of the “uniqueness” in Theorem 17.2, prove the following stronger version of part (11) of Theorem 17.2:
THEOREM 20.2. Let {E,} (0 5 s 5 1) be a scale of Banach spaces satisfying the condition
(20.15) i f 0 s’ 5 s 5 1, E, E,. has norm 5 1.
4
E,. and the natural injection of E, into
I80
[Chap. TI
THE CAUCHY PROBLEM
Let then u(t) be any C' function of t , I t I < T , valued in E l , satisfving, for every pair of numbers 0 5 s' < s 5 1, the inequality
,.
(20.16) where C is apositive constant, independent of s,s', t . I f u vanishes at some point of the interval ] - T. T [ , it vanishes throughout this interval. 20.2. We use the notation and the concepts introduced in Exercise 17.10. Let u(t) be a continuous function valued in H I , once continuously differentiable when regarded as being valued i n Ho. such that (20.17)
Ilu'(t)llo
5 const
IIAu(t)ll,
ItI
< T.
Prove that if u(0) = 0, then u(t) = 0 far all t, I t I < T . 20.3. Consider the partial differential operator in R", (20.18) where the coefficients c P ( t ) are continuous functions of t , I t I < T , with complex values. Prove that, if u is a C' function of t, I t I < T, valued in some space Hs(R') (s E R), satisfying (20.19) (20.20)
dU
- = P ( t , Dx)u at
in R" x 1-T, T[,
IL0
= 0,
then u vanishes identically in R" x 1-T, T [ .
21
The Holmgren Theorem Holmgren’s theorem is a stronger version of Theorem 20.1. Let R be an open subset of R“. Consider the first-order system of linear partial differential equations au -=
(21.1)
at
c A j ( X , t ) au + A,(x, n
j=l
7
t)u,
OXJ
about which we assume that (21.2) theentries of them x m matrices A j ( x , t ) (0 5 j 5 n)are analytic functions of (x, t ) in Q x ] - T , T [ . THEOREM 21 .l. Suppose that (21.2) holds. There is an open neighborhood @ of R x {0} in R x ] - T , T [ such that every distribution u in R x ] - T , T [ , satisfying (21.1) in this set and vanishing when - T < t < 0 , must also vanish in @. The main difference between this statement and Theorem 20.1 is that, in Theorem 21.1, u is not required to have a compact support with respect to x. Before proving Theorem 21.1 let us show how it implies the following “classical version of Holmgren’s theorem: ”
COROLLARY 21.1. Let u be a C’ function satisfying (21.1) in R x ] - T , T [ . u(x, 0 ) = 0 for all x E R, then u = 0 in the neighborhood G2 of R x (0).
If
Proof. Consider the function z?(x, t ) = H(t)u(x, t ) , where H is Heaviside’s function, equal t o one on the positive half-line and to zero on the negative half-line. Since u(x, 0) = 0, we have (d/at)ii = H(t)(d/dt)u,hence z? also satisfies (21.1) and vanishes, of course, for t < 0. It suffices then to apply Theorem 21.1. Q.E.D.
Proof of Theorem 21.1. It suffices to prove that u = 0 in an open neighborhood, which does not depend on u, of an arbitrary point (xo, 0), xo E R. For 181
182
[Chap. I1
THE CAUCHY PROBLEM
simplicity, let us take xo = 0. We may thus assume that R is some neighborhood of the origin in R" which we have the right to shrink a finite number of times, as we wish; we shall also decrease T when needed. We make the following change of variables x=y,
t=s-
IyIZ,
y=x,
s=t+
1x12.
I.e.,
We have
.a
a
a
a9
ayj
-=-
+2y'-
8s
(1 SjSn),
a -=-*
at
a as
The system (21.1) is transformed into (21.3) where B j ( y ,s) = A j ( y , s - IyI2) (0 s j
I n) and n
M ( y , s) = I - 2
1yjBj(y,s).
j= 1
If R is small enough, and y E R, M ( y , s) is invertible; of course, its inverse is also analytic with respect t o ( y , s). Equation (21.3) is equivalent t o (21.4) where C j = M - ' B j . The system is of the same lund as (21.1). The only difference in our assumptions is that the solution u now vanishes when s < \ y [ * . We could have made this assumption to start with: Reverting to (21.1) and to the coordinates (x, t ) , we may assume that u = 0 when t < 1x1'. If T is sufficiently small, the projection into the x space of the region {(x, t ) ; x E R, I r I < T , 1x1' 2 r}, which contains supp u, is contained in a compact subset K of R. Finally we contract the whole configuration so that R is contained in a polydisk 0, in C", as the polydisk in Theorem 20.1, and so as to be able to assume that the coefficients A can be extended as holomorphic functions to an open neighborhood of the polydisk
-
{ ( Z , ~ ) E C " + ~ ; Z E UI t~( ,S T } . It suffices to apply Theorem 20.1 : We conclude that u = 0 in R x 1-T, T [ . Since the choice of R and T > 0 was made independently of u, this is the Q.E.D. desired conclusion.
Sect. 211
THE HOLMGREN THEOREM
183
We conclude this section by stating an invariant form of Holrngren's theorem. As we have done for the Cauchy-Kovalevska theorem, we do so only in the case of a single equation, that is, in the case of a scalar differential operator of order m > 0, P(y, Dy) [ y = ( y ' , . . . , y") is the variable in an open subset % of R"]. The principal symbol of P(y, D,,) will be denoted, as usual, by Pm(Y9 4. THEOREM 21.2. Suppose that the coeficients of the differential operator P(y, Dy)are analytic in the open set %. Let C be a C' hypersurface in 42, subdividing % into two parts, %+ and % - , and nowhere characteristic with respect to P(y, Dy). There is an open neighborhood of X in %, N , such that every distribution u in 42, satisfying there P(y, D,)u = 0 and vanishing on one side of E (that is, in % + or in % -), also vanishes in N . That X is a C' hypersurface in % means that every point of % has an open neighborhood 8 where there i s defined a real-valued C' function cp(y), with nowhere vanishing gradient (in O), such that X n 0 is exactly the set of points in 0 where cp = 0. ThatC subdivides%into two parts %+, 42-, means that % = X LJ 02+ LJ % where %+ and u%I- are disjoint connected open sets which do not intersect C. For the meaning of characteristic, see Definition 19.3.
Proof of Theorem 21.2. It suffices t o reason in the neighborhood of an arbitrary point of C, which we take to be the origin in RN.Let cp be a C' function in an open neighborhood Y of 0 such that X n Y is the set of y E ^Ysuch that q ( y ) = O; we assume, of course, that grad cp does not vanish at any point of Y . Suppose that we have chosen the coordinates y j so as to have q ( y ) = y N + o(ly1) in V . Setting y' = ( y ' , . . . , y"-'), there is a number 6 > 0 such that Y += { y E Y f ; q ( y ) > 0 ) is contained in the set (21.5)
yEY,
y">
-61y'l.
Note also that, as we shrink Y about the origin, we may decrease 6 > 0. Let E be a number greater than zero and call W Ethe open subset of R" defined by
I y'I For
E
< 2&6,
I y"
sufficientIy small, W ec Y . Set then
< 4a2E2.
184
[Chap. I1
THE CAUCHY PROBLEM
Let Y, denote the intersection of the paraboloid $ = 0 with W , . We make the following three assertions (assuming that 6 is sufficiently small): (21.6)
Pm(y,grad Y) does not vanish anywhere in W Ein ; particular
YE is nowhere characteristic; (21.7)
i f y belongs to the boundary of Y Ewe , haveyN < -61y'I ;
(21.8)
x = y ' , t = Y ( y )defines a diffeomorphism of W Eonto a neighborhood of the origin in RN [as usual, we have set n = N - 1, x = (XI,
. . . ,Y)].
The first assertion, (21.6), follows from the fact that
..., 0, 1)1
IgradY -(O,
<26,
and that Pm[O,(0,. . . , 0, l)] # 0. Assertion (21.7) follows from the fact that if y belongs to the boundary of Y Ewe , have I y' I = 286 and yN
=
- 4 & P = -261y'I.
Assertion (21.8) is quite obvious. We are in the situation described in Fig. 21.1.
t
FIG.21.1
If we make the change of variables contemplated in (21.8), the expression of the operator becomes m
P ( x ,t ,
9
+1
0,) = a ( x , t)o,"
j= 1
Qj(x,
t,
ox)or-',
where a(x, t ) and the coefficients of the Q j are analytic functions of ( x , t ) in a neighborhood of the origin; moreover, a(x, t ) does not vanish anywhere near the origin and the Q j ( x , t , t) are polynomials with respect to t of degree l j , respectively. -
Sect. 211
THE HOLMGREN THEOREM
185
Let u be a distribution in W e ,satisfying there the homogeneous equation P(y, D,)u = Oandvanishingfor cp(y) < 0. I n the new coordinates ( x , t ) this implies the following: We have P ( x , t , D,, D,)u = 0 in a set W = { ( x , t ) E R"" ; 1x1 < r , -to < t < t , } (where to and t , are greater than zero); there is r', 0 < r' < r , such that u = 0 if I X I > r', - t o < t < t , ; and finally, there is t , , 0 < t , < t o , such that u = 0 if 1x1 < r , - t o < t < - t , . After transformation of the equation P ( x , 1 , D, , D,)u = 0 into a first-order system, according to the method described in Sect. 18, we find ourselves in a position to apply Theorem 20.1. We reach the conclusion that u = 0 in the whole set W . Exercises 21.1. Let P(x, D) be an elliptic differential operator of order m, with analytic coefficients in a connected open subset R of R" (Definition 19.4). Let S be a piece of C" hypersurface contained in R, h an analytic function in R, satisfying P(x, D)h = 0 there. Prove that if all derivatives of order <m of h vanish on S , then necessarily h = 0 identically in R. 21.2. Let P ( D ) be a differential operator with constant coeficients on R", p an arbitrary distribution with compact support in R". By applying the Holmgren theorem (Theorem 21.2) prove that the support of p is necessarily contained in the convex hull of (i.e., the smallest convex set containing) the support of P ( D ) p . 21.3. By using the Holmgren theorem, prove that the fundamental solution of the wave operator vanishing for t < 0 must have its support in the forward light-cone. 21.4. Let P([) be a homogeneous polynomial in n variables (with complex coefficients), of degree m 2 1. Suppose that, for some toE R,, to# 0 , P ( t o )= 0. Construct a C" function h(x) in R" with h(x) = 0 if ( x , l o )5 0 and h(x) # 0 if ( x , to) > 0, such that P(D,)h = 0 in R". 21.5. Let P ( t ) be a polynomial of degree m 2 1 on R,, P,([) its homogeneous part of degree m. By using the result in Exercise 12.5 show that if to# 0 satisfies P,(to) = 0, there is a C" function h in R" such that P(D,)h = 0 and, furthermore, such that h(x) = 0 if (x, l o )5 0, h(x) # 0 if ( x , to) > 0. 21.6. Let P(D ) be a differential operator with constant coefficients in R". Prove that every distribution u such that P(D)u = 0 in R", whose support is contained in a half-space whose boundary (which is a hyperplane in R") is noncharacteristic with respect to P(D),must vanish identically in R". 21.7. Let be an open convex cone with vertex xo in R", P ( D ) a differential operator with constant coefficients in R" having the following property: Every characteristic hyperplane through the origin intersects l-. Prove then that every distribution u in which vanishes outside a bounded subset of r, and satisfies P(D)u = 0 in r, vanishes identically in r.
THE CAUCHY PROBLEM
186
[Chap. 11
21.8. Consider the operator L = d / d t + t d/dx in R2.Show that the x = 0 is characteristic (with respect to L) at the origin and that, nevertheless, every distribution u satisfying Lu = 0 in some open disk n centered at the origin, whose support lies in the half-disk { ( x , t ) E Q; x 5 0}, vanishes identically in R. 21.9. Let
C aJ'(x)-.ax'a + ao(x) n
L
=
j=l
be a first-order differential operator with C" coefficients in an open subset R of R". Assume that the coffiecients of the principal part of L, aj,j = 1, . . . ,n, are real and that they never vanish all together at any point of n. Let S be a C" hypersurface contained in n, characteristic with respect to L at each one of its points. Let xo be an arbitrary point of S. Prove that there is an open neighborhood U of x,, in R, exactly subdivided into two parts by S, and a C" function h in U , satisfying Lh = 0 in U , vanishing identically on one side of S but at no point on the other side of S (and of course vanishing on S n U).
C H A P T E R I11
Boundary Value Problems
This Page Intentionally Left Blank
22 The Dirichlet Problem. The Variational Form We begin by describing, in rather loose terms, the kind of problems with which we shall be concerned in most of this chapter. They apply to a large class of elliptic (Definition 19.4) linear partial differential equations, a class whose prototype is the Laplace equation or (even better, in a sense) the metaharnionic equation (-1 + A)u =J A > 0. Most of the time we deal with the latter equation, but we frequently explore the possibility of extending the results obtained to more general second-order (elliptic) operators (22.1)
L
=
d2 a c ajk(x)-ax’axk + C b’(x) 2 + c(x).t n
n
j . k= 1
j=l
The coefficients are complex functions, defined in an open subset R of R”, and usually endowed with some degree of regularity (depending on the purpose of the study: in some theories the coefficients are merely assumed to be measurable and bounded). Part of the problem will be to solve the equation (22.2)
Lu
in R.
=f
But we shall not be content with any solution; we shall demand that the solution satisfy certain bounrfury conditions. In view of this, the boundary of R will play an important role. We denote it by dR. In the most important problem of the kind studied in this chapter, the boundary condition reads (22.3)
u=g
onan.
This is what we shall call the Dirichlet problem (it is often called the inhomogeneous Dirichlet problem and some people like to reserve the name of Dirichlet problem to the same, but where the right-hand side in (22.2) is
t At the end of this Chapter, in Sects. 36 and 38, we shall examine boundary value problems for higher order elliptic equations. 189
190
[Chap. 111
BOUNDARY VALUE PROBLEMS
zero). Admittedly it is a very vague statement of the problem. In particular, the meaning of condition (22.3) is unclear. It bears on the values of the solution u at the boundary dR, and thus presumes that u can be extended to the closure 0 of R. The possibility of extending u, in a natural way, up to the boundary, depends very much on the nature of the boundary, and also, of course, on the regularity properties of the data, f and g, as well as on those of the coefficients of L,aik,bj (1 s j , k 5 n), and c. The purpose of this section is almost solely introductory. We shall therefore make a number of simplifying assumptions, so as to get some hold on the problem. Later on we are going to show how to relax many of the restrictions imposed here. First of all, we limit ourselves (for the time being) to the operator 2
(22.4)
L=-A+I,
AZO,
A=f:(-&) j= 1
We are going to assume, throughout the section, that (22.5)
the open set i2 is bounded.
It will be seen, in the following sections, that condition (22.5) is to a large extent superfluous when the zero-order coefficient I is greater than zero, but not when I = 0, in the all important case of the Laplace equation. In many applications R is not a bounded set: for instance, R may happen to be the complement of a bounded set (e.g , when studying the distribution of the electrostatic potential in the region surrounding a charged body). In this case, extra conditions at infinity are needed. We do not concern ourselves with such a problem here. We shall also assume, momentarily, that (22.6)
the boundary dR of R is a (compact) connected C' hypersurface.
To this we adjoin the requirement that JR subdivide the whole space R" into two regions (an extension of Jordan's curve theorem) and that R is the bounded connected component of the complement of an. Thus we may talk about the normal to 8R at each one of its points and orient this normal, say, positively toward the exterior. What kind of right-hand sidesf should we consider? Our choice should be strict enough to imply the regularity of the solution needed for a reasonable interpretation of the boundary condition (22.3), yet not so restrictive as to exclude the cases of interest in the applications. It turns out that, from both these viewpoints, (22.7) is an excellent requirement.
fE L 2 W
Sect. 221
DIRICHLET PROBLEM. VARIATIONAL FORM
191
We shall be a bit more restrictive in what concerns the boundary value g. We shall suppose that
cl(aR).
(22.8)
The fact that g is defined only on dR is somewhat of a nuisance. We shall avail ourselves of properties (22.6) and (22.8) to extend g off do. This can be done in a number of ways. One simple procedure is the following. Cover aR with a finite number of open subsets 01,. . . , 0, of R" such that, for eachj , there is a C' diffeomorphism q j of Q j onto the open unit ball B of R" mapping Q j n R onto the open half-ball B + = { x E B ; x" > 0). This is possible by virtue of (22.6). For each j = 1, . . . , r, let us set -1
h j ( ~= ) g(qj
( ~ ' 3
. . . *yn-l?O ) ) ,
Y E B,
and gj(x) = hj(qj(x)),
XE
0j.
Clearly g j E C1(0,), g j = g on B j n a n . Let then il,. .. , c, be r functions such that cj E Cz(Oj) for each j and
C cj = 1
in a neighborhood of
an.
j= 1
At last we set g # = l j g j . We see that g # E C.!(R") and that gx = g on aQ. As a matter of fact we shall only make use of the restriction of g# to Q, which we denote by Q. Condition (22.3) can be rewritten as (22.9)
u-tj=O
onaR.
This suggests that we change the unknown function (or distribution) and consider
u=u-g. Our problem has now become that of finding (22.10) (22.1 1)
(-A
+ 1)U = F U=O
(I such
that
in R, on aR.
We have used the notation F = f + A g - 1s.
Observe that if we only form the hypothesis (22.8), we cannot assert that AQ is a function (in Q). We merely know that it is a sum of first-order derivatives of continuous functions, as a matter of fact of functions which are continuous on a:
192
BOUNDARY VALUE PROBLEMS
[Chap. I11
As a consequence, we cannot expect that the solution U , if it exists, will be very smooth. But we may still hope that U will be smooth enough to enable us to assign a reasonable meaning to (22.1 1). Let us go back to (22.10). As the right-hand side F is now a distribution, we shall want to solve (22.10) in the sense of distributions. This means that, given any cp E C$(Q), we require
(u,( - A + ;i>~o>= ( F , v>.
(22.12)
But of course F is not just any kind of distribution: It is continuous on C:(Q) for some special seminorms (or norms). Indeed,
( F , cp> =
j(f- wcp dx + Jkrad $7)
*
(grad cp) dx.
In view of our hypothesis (22.7) onf and the properties of 0 (namely, that 0 and grad 0 are continuous in a),we obtain at once (22.13)
I ( E cp)l
5 const
IIcpI11,
where
This defines a norm on Cp(Q) which is going to play a very important role in the forthcoming [notice that, by virtue of the Plancherel theorem, it is equal to the norm (13.20) when s = 11. In order to get some idea of this role, it is helpful to study first the case d = I . We begin by assuming that there exists a solution U to Eq. (22.12) which belongs to Lz(Q)and whose first-order derivatives (in the distribution sense) also belong to Lz(Q). In this case, we may integrate by parts once in the left-hand side of (22.12). After exchange of cp and Cp, the result can be written as (22.14) where ( ,
( U , cph = ( F , cp),
cp E C,m(Q)?
is the Hermitian product associated with the norm (1 [I1:
The new equation (22.14) strongly suggests that we try to take advantage of the Riesz representation theorem. Indeed, the right-hand side defines a continuous antilinear functional on C$(Q) for the norm 11 [I1. The latter is obviously a Hilbert space norm. For the moment, let us call 2 the completion of C;(Q) for this norm. By the canonical isomorphism of a Hilbert space onto its antidual, we know that there is a unique element I/ in 2 which satisfies (22.14). In a sense we have solved this equation but the defect of our method
Sect. 221
DIRICHLET PROBLEM. VARIATIONAL FORM
193
is that the solution we have found lies in an abstract space, the completion of C:(R) for a certain norm. It would be helpful if we could "concretize" the solution U as a function-in other words, if we could identify the Hilbert space X with a functions space. We have derived (22.14) from (22.12) by virtue of the hypothesis that U and grad U are square-integrable in R. This suggests exactly what must be done, and leads to the following very important definition: Definition 22.1. The space of functions w E L2(R) whose first derivatives (in the sense of distributions) belong to L2(Q)is denoted by H'(R).
H'(R) is the first, and probably most important, example of Sobolev space. It is closely related to the space H" encountered in the study of the Cauchy problem (Definition 13.1). It is equipped with the Hermitian product ( , )1 and the norm I( Ill.
PROPOSITION 22.1. H'(R) is a Hilbert space. Proof. Let {wJ') be a Cauchy sequence in H'(R). It is, a fortiori, a Cauchy sequence in LZ(R), and the same is true of { d w v / d x j }( j = 1, . . . , n). These Cauchy sequences have limits in L*(R)-w and w j , respectively. But (a/dxj) is a continuous linear operator on 9'(R), the space of distributions in R, therefore (d/dxi)wV converges to (d/dxi)w in 9'(Q); but it also converges there to w j , hence its two limits must be equal. Q.E.D.
Now the completion 2 of C:(R) closure of CF(R) in H'(R).
for the norm
Definition 22.2. The closure of C:(Q)
JI )I can be identified with the
in H'(R) is denoted by Hh(R).
We have seen that Eq. (22.14) has a unique solution U E X . We may now interpret U as an element of Hh(R). Observe that this means that U is the limit, for the norm 11 Ill, of C" functions vanishing identically in a neighborhood of the boundary of R. This fact can be interpreted as meaning that, in a sense, U vanishes on dR. Of course, that sense is not quite the same as that applied to continuous functions. Yet the two are related, as shown by the next result.
PROPOSITION 22.2. Assume that (22.5)and (22.6)hold. If U E C 0 ( Q n H'(R), the following two properties are equivalent: (a) U = O on 8R; (b) UEHA(R).
194
BOUNDARY VALUE PROBLEMS
[Chap. I11
Proof. We shall sketch it, leaving the details to the student, as an exercise about Sobolev spaces. We begin by extending U to the whole space R" by setting it equal to zero in the complement of R. We use the covering {Oj}lsjsr of dR and the functions Cj (1 S j 5 r ) introduced on page 191. By slightly shrinking each set O j we may assume that the Jacobian matrix of the diffeomorphism 'pi and -1
that of its inverse, c p j , are both bounded uniformly in their domains of definition. This ensures that w Hw 0 'piis an isomorphism (for the structure of topological vector space) of H1(S') onto H1(Oj n Q), whose inverse is -1
u 'pi. Clearly, U = 0 on dR if and only if each Ci U vanishes on O j n an. On the other hand, if U E HA@), each Ci U belongs to HA(Oj n R): Indeed, it suffices to consider a sequencef, E Cz(R) converging to U in the sense of the norm 11 [I1; then the C j f,E Cz(Oj n R) and converge to ti li in H'(Oj n R). Conversely, suppose that every C j U belongs to HA(Q). Then so does U. Indeed, it is very easy to check that U , = (1 - il - ... - C,)U belongs in any case to HA(R), because U E H'(R) and the support of U , is a compact subset of R. It suffices to make use of the standard mollifiers p e : U H
0
p,(x) = E-"P(x/E),
p
E
CXR"),
P
2 0,
Then, for E > 0 sufficiently small, pE* Uo E C:(R). That it converges to U , in H'(R) or, which is exactly the same, in the space H'(R") (see Definition 13.1), is easy to check. To summarize, it suffices to prove the equivalence when O j n R is substituted for R and Cj U for U ( j = 1, . . ., r). Thus let us f i x j once and for all. Set -1
V ( y ) = (cj U ) ( q j (y)),
y
E B,
the open unit ball in R".
We know that V E C o ( z ) n H'(B+) and, furthermore, that the support of V is a compact subset of 3.We must prove the equivalence of the following two properties : (a') (b')
V =0 when y" = 0, V E HA(B+).
(I) (a') (b'). It suffices to prove that, if (a') holds, Vnot only belongs to H'(Bf) but even belongs to H'(R") (we recall that V = 0 in the complement of B). For then the translates V"(y)= V ( y ' , .. . , y"-', y" - E ) also belong to H'(R"). But for E > 0 sufficiently small, they have their support compact and contained in B + , hence they belong to HA(Bf) (as one sees by regularization;
Sect. 221
DIRICHLET PROBLEM. VARIATIONAL FORM
195
cf. the argument about Uo above). We know that V belongs to Lz(R"). We must show that the gradient of V , in the sense of distributions on R", also belongs to L2. The assertion is trivial for each (a/axi)V if j < n, and needs only to be proved whenj = n. Let cp E C:(R") be arbitrary. We have
In the last member, the first limit is equal to zero since V(y', . . . , y"-', E ) converges to zero uniformly with respect to y', . . . , y"-' as E -+ +O. The second limit is equal to
av Jyn>o
- cp dY,
ayn
which proves what we wanted. (11) (b') * (a'). Under our various hypotheses on V we have, for almost all y' = ( y l , . . . , y"-') and every E , 0 < E < 1,
Let {w'} be a sequence in C:(B+) converging to V in H'(B+). We have
since the norm of the integral does not exceed the integral of the norm [here, this is the norm in L2(R"-')]. If we apply the Cauchy-Schwarz inequality to the last member, we see that it is at most equal to 11 V - wv)I1.We may choose v so large that this last quantity will not exceed an arbitrarily preassigned number q > 0. We keep v fixed at this value and choose E > 0 so small that w'(y', K ) = 0 for all y'. We thus obtain
jI W ' ,4 l2 dy' 5 q2.
196
BOUNDARY VALUE PROBLEMS
[Chap. I11
At the limit we see that I V ( y ’ , +0)l2 dy’ = 0. Since V(y’, +0) is a conQ.E.D. tinuous function of y’, this means that it must vanish identically. Proposition 22.2 strongly suggests that we replace condition (22.11) by
and try to find such a function U which satisfies (22.10) (until now we have done it only in the case 1 = 1). In the next section, we describe the solution of this problem completely. Let us say, right now, that this solution does not answer all our questions. For instance, the restrictions on the kind of open set R we can handle, i.e., conditions (22.5) and (22.6), are quite narrow: squares in the plane are excluded. In our solution of problem (22.10)-(22.15) the second condition, (22.6), will be quite superfluous. But it should be emphasized that, once we drop it, the relationship between our original boundary condition (22.11) and our new condition (22.15) becomes unclear. For example, what is the relation when R is a square in the plane, and U E Co(n)?How do we obtain a solution of the Dirichlet problem (22.2)-(22.3) when the dataf and g are continuous-a solution, moreover, which is acceptable from a classical viewpoint ? We discuss these and other related questions in the forthcoming sections. As a conclusion to this section, let us recall that the reinterpretation of the Dirichlet problem (22.2)-(22.3) in the context of the Sobolev spaces, precisely in the form (22.10)-(22.19, is often referred to as the variational form of the Dirichlet problem. The reason for this is not hard to find. Let us introduce the following notation, which is used throughout the sequel : (22.16) al(U, V ) =
(grad U)* (grad
n
V) dx + 1 j
U V d x , U, V E H’(R).
R
Observe that, if cp E C:(Q), ( F , (P)
=
J f (P dx - a,G,
cp).
R
Thus our weak problem (22.10)-(22.15) can be restated as follows : Problem 22.1. Find U E
such that the antilinear functional on C:(Q),
vanishes identically.
Because of the properties of $7 the form (22.17) is continuous for the norm
11 Ill, as already pointed out, and can therefore be extended, in a unique manner, to HA@).
Sect. 221
197
DIRICHLET PROBLEM. VARIATIONAL FORM
Let us now restrict ourselves, for the sake of simplicity, to real-valued functions and distributions (the complex case can be settled by handling the real and imaginary parts separately). Then the functional (22.17) is the FrPchet derivatice, at the point U of HA(R), of the (nonlinear) functional (22.18)
Q( V ) = faA( V, V )
+ u,(J, V ) -
f V dx. R
Indeed, the value of (22.17) at cp is equal to (22.19) The vanishing of (22.19) is a necessary condition for Q to have an extremum at Uo . Let t be any real number, M' any element of HA(R). We have Q(U
+ tct.) = fa,(U,
U)
+ a,&?, V ) - Jnf U dx
i.e., by virtue of the vanishing of (22.17), (22.20)
Q(U
t2 + fM') - Q ( V ) = a,(ct., w), 2
which, I claim, is strictly positive if tw # 0. The claim is evident when I. > 0. When A. = 0, a,(w, M,) = 0 is equivalent to grad w = 0, i.e., M' = const, which, by Proposition 22.2, is incompatible with w E HA(i2) unless, of course, u = 0. Thus, if the solution U E HA(R) to (A - A)U = F exists, it is a strict minimum of the functional Q ( V ) . Our weak problem (22.10)-(22.15) is thus seen to be a problem in the calculus of variations.
Exercises
22.1. Let R be a bounded open interval in the real line R'. Prove that every function u E HA(R) is an absolutely continuous function vanishing at the boundary points of 0. 22.2. Let R be as in Exercise 22.1. Prove that H'(R) is equal to the direct sum of HA(i2) and the space of affine functions h(x) = Ax B ( A , B complex constants) restricted to R. 22.3. Let R be an open disk in the plane, centered at the origin, and with radius R > 0. Prove that H'(i2) is the space of functions
+
f o U
(22.2 I )
u(x,y) =
C
m=-a)
cm(r)eime
(x
+ iy = re")
198
BOUNDARY VALUE PROBLEMS
[Chap. I11
whose Fourier coefficients have the following properties : (22.22) for each m E Z, c,(r) is a locally integrable function of r in the open interval 0 < r < R, and so is its distribution derivative c i ( r ) ; (22.23) (22.24) 22.4. Using the same notation as in Exercise 22.3, the Fourier representation (22.21), and the properties (22.22) to (22.24), show that the space Cm(n) (of C" functions in R2 restricted to 0) is dense in H'(R) and that H'(R) is the image of H'(R2) under the restriction mapping to Q. 22.5. Use the same notation as in Exercise 22.3. Let u E CF(R) be given by (22.21). Derive from the conditions (22.23) and (22.24) that we have, if 0 < ro 5 R ,
and conclude from this that, given any element u of HA(R), the coefficients c, (r)(m E Z) are continuous functions of r in the semiclosed interval 10, R], converging to zero as r + R. 22.6. Use the results in Exercises 22.5 and 22.6 to show that, if u is an arbitrary function in H'(R), its Fourier coefficients c,(r) are continuous functions in the semiopen interval 10, R], and that u E HA(R) if and only if cm(R)= 0 for all m E Z. 22.7. Use the same notation as in Exercise 22.3. Let u E CF(Q) be given by (22.21). Derive from (22.24) that, if 0 5 ro 5 R ,
Conclude from this that, for every u E HA(R), if m # 0, c,(r) is a continuous function in the closed interval [0, R] and (cf. Exercise 22.6) that this is also true when u E H'(R). Suppose R < 1 and take u(x) =[log(l/r)]". Prove that, if 0 < a < t , u belongs to H'(R). 22.8. Derive from Exercise 22.7 that, for some constant C > 0, for all r, R/2 r 5 R, and all u E H1(!2),
Sect. 221
DIRICHLET PROBLEM. VARIATIONAL FORM
199
22.9. Use the same notation as in Exercise 22.3. Let u E N'(R) be given by (22.21). Prove that
c (1 + m2)'12I cm(R)
+m
(22.28)
m=
- cm(r)I2 - 0
-m
as r -+ R (0 < r < R )
and derive from this that, if we write u#(r, 0) = u(x, y ) ,
[ H i n t : Prove that (22.28) is true when u E Cm(Tz) and then exploit the density of C"(n) in H'(R), and the inequality (22.27).] 22.10. Let S1 denote the unit circumference, 0 the variable in S', H'/2(S') the space of functions
(22.30) such that
c (1 +
+ W
(22.31)
llgll:/2 =
m2)1'2(cm12<
+a.
m=-co
Prove that if H112(S1)were contained in the space Co(S') of continuous functions on S' (equipped with the maximum norm), then the space of Radon measures on S', A ( S 1 ) , would be contained in the space of distributions on S', +m
(22.32)
2
u=
umeime,
m= -m
such that +m
(22.33)
=
C
(1
m= - m
+ m2)-1/21vm12 < +a
[space denoted by H-'/2(S')]. Prove that the Dirac measure 6 at the origin, in S', does not satisfy (22.33). 22.11. Let g E H'"(S') (we are using the notions and notation of Exercise 22.10) be given by (22.30). Set (22.34) u = CO
+ C (c,z"' + c-,Z"'), m= 1
z =x
+ iy,
Z = x - iy.
By using the criterion established in Exercise 22.3 [and condition (22.31)] prove that u E H'(R) and satisfies Au = 0 in R (taking here R = 1, i.e., taking 12 to be the unit disk in the plane). Derive from this, and from the
200
BOUNDARY VALUE PROBLEMS
[Chap. 111
preceding exercises, that H112(S1)is exactly the space of '' boundary values " of functions belonging to I f 1 @ ) , and that not all of these boundary values are continuous functions on S . 22.12. Let IIE HA(SZ) be real-valued (SZ is an arbitrary bounded open subset of R"),8 an arbitrary number greater than zero. Show that the function x H sup(u(x) - 8,O) belongs to HA(SZ). Derive from this fact that the function Iu I belongs to H @ ) .
23 Solution of the Weak Problem. Coercive Forms.
Uniform Ellipticity Let R denote an arbitrary open subset of R".The natural injection CXQ) 4
Hm)
can be transposed ; since it has dense image, its transpose is a continous linear injection
(H;(R))'
4 g'(Q).
We may therefore identify the dual of HA@) with a certain space of distributions in R. More precisely: PROPOSITION 23.1. The dual of H,!,(Q) is the space H -'(Q) of the distributions in R which are equal to a (finite) sum of derivatives of order 5 1 of functions belonging to Lz(R).
ProoJ A distribution T in R defines a continuous linear functional on HA(R) if and only if the linear form (PH ( T , q) is continuous on CF(Q) for the norm (I Ill. If T=fo this is clearly so. Conversely, suppose that the linear functional (P H ( T , cp) can be extended continuously to HA(R). By the Riesz representation theorem we know that there is a unique element V of HA(R) such that ((P7
V ) , = ( T , CP), 20 1
q E CXQ).
202
BOUNDARY VALUE PROBLEMS
[Chap. I11
The left-hand side can be rewritten as ((1 - A) K cp) from which we conclude that (23.1)
T = (1
- A)V.
v
d/dx'(d v / d x i ) is a sum of firstIt suffices then to observe that A = Q.E.D. order derivatives of L2functions (the d Vax').
PROPOSITION 23.2. The canonical antilinear isometry of H#2) onto its dual, H-'(Cl), is the mapping V w ( 1 - A)V. This is essentially the meaning of (23.1). We leave the details of the proof to the student. Proposition 23.2 has the following consequence : COROLLARY 23.1.
C:(Cl) is dense in H-'(Q).
Indeed, Cp(i2) is dense in Rh(0) and is mapped into itself by (1 - A). Corollary 23.1 shows that H-'(Cl) is the closure of CF(i2) in H-'(R"). Remark 23.1. If we go back, momentarily, to the hypotheses (22.5) through (22.8), we see that the right-hand side
P =f - (I. - A)ij in Eq. (22.10) belongs to H-'(Q). The problem we wish to solve, in the present stage, is the following: Problem 23.1. . Given any F E H-'(Q)find U E H A ( 0 ) such that
(23.2)
(-A
+ Jv)U= F.
Equation (23.2) is equivalent to [cf. (22.16)] (23.3)
a,(Li, V) = (F,
v),
V E Hi@).
In order to solve Problem 23.1 we shall combine the Riesz representation theorem with the following results; PROPOSITION 23.3. If A > 0, a,(U, V ) is a positive-definite Hermitian form on H'(0). The topological vector space structure defined on H'(Q) by a,( U, V ) is independent of A. In particular, it is the same as that defined by al( U , V ) = (U,V),. PROPOSITION23.4. Suppose Q is bounded. Then a,(Li, V ) =
1n(grad U) (grad 7 )d x
Sect. 231
WEAK PROBLEM. FORMS. ELLIPTICITY
203
is a positive-definite Hermitian form on HA(R), defining a topological vector space structure on HA(R) equivalent to that defined by ( U , V),.
Proof of Proposition 23.3. If I > p > 0, a,(U,
u) = a d U , u) s ( O 4 a P ( u ,u),
from which we arrive at our assertion. Proof of Proposition 23.4. Obviously ao(U , U ) 5 ( U , U ) ,. Let then cp E C:(R) be arbitrarily. We have
whence
by applying the Cauchy-Schwarz inequality and assuming that the x" projection of R is contained in the (compact) interval [a, b]. Finally by integrating with respect to X" from a to b the squares of the extreme members, we get
(23.4) We derive from this
II 9I10 s C(Q)IIgrad cp II 0
(23.5) where (1
(lo
denotes the norm in L*(R");(23.5) implies, in turn,
IIcpII: 5 [1 + C2(Q>lao(%cp). This can be clearly extended to all q E HA@), and the proposition is proved. Remark 23.2. In the proof of Proposition 23.4 we have only used the fact that i2 was bounded in the x" direction-that is, in view of the rotation invariance of the whole situation, in some direction. Iis strictly positive, Remark 23.3. Proposition 23.3 implies that, when , a,(U, V ) defines on HA(i2) a topological linear space structure identical to that defined by ( U , V),.
From Proposition 23.3 and Remark 23.3, on one hand, and from Proposition 23.4 on the other, we derive that the dual of HA(i2) when this space is
204
BOUNDARY VALUE PROBLEMS
[Chap. I11
equipped with the structure defined by aA(U,V ) (assuming that S2 is bounded when I = 0) is identical to H-'(R). It is checked at once that the canonical antilinear isometry of the Hilbert space (HA(Q), aJ is nothing else but the mapping VH (A - A) V. We thus obtain THEOREM 23.1. Let ;1be strictly positive. The diferential operator A-A defines an isomorphism of HA(S2) onto H-'(R). I f S 2 is bounded, this is also true of the Laplace operator A.
By virtue of Remark 23.1 we see that we have proved that Problem 22.1, that is, the problem (22.10)-(22.15) has a unique solution. It is easy to see that Theorem 23.1 extends to a large class of operators L of the form (22.1). Indeed, whether it can be extended depends on the properties of the sesquilinear form which ought to play the role of u1 in the proof of Theorem 23.1. In order to define the form in question, let us rewrite the operator L in the following manner: (23.6) where we have set
It is then natural to define the form associated with L by the formula
(23.7) a(u, u ) = j,k=l
+f
j=1
s
au a6 ajk(x)-( x ) -( x ) d x
ax)
au
R
axk
P ( x ) 7(x)fi(x)d x ax
+
c(x)u(x)fi(x)d x . R
Clearly, if we suppose that ujk, bj, c all belong to L"(S2), a(u, u) will be a continuous sesquilinear form on H'(R) x H'(Q); i.e., it will satisfy, for some c > 0, (23.8)
]a(u,v ) ]
5 Cllulllllulll
for all u, u E H ' ( 0 ) .
The crucial fact, in the proof of Theorem 23.1, was that, for some c1 > 0, we had a&, u) 2 c1 Ilull:
for all u E H;(R).
Sect. 231
WEAK
PROBLEM. FORMS. ELLIPTICITY
205
We need the analog for our more general form a(u, v). As a matter of fact it suffices to have (23.9)
la(u, u)l 2 cI Ilull:
for all u E ffh(R) (cl > 0).
Because of the far-reaching consequences of the argument, it is worthwhile to make it a bit more abstract. Let us consider a Hilbert space E, with inner product ( , )E and norm I/ fIE, and a continuous sesquilinear form a(u, v) on E x E. The map u Ha(u, .)
defines a bounded linear operator A of E onto its antidual E' (the space of continuous antilinear functionals on E). The importance of condition (23.9) justifies the following
Dejinition 23.1. The form a(u, v) is called coercive i f there is a constant such that
c1 > 0
(23.10)t
1 a(u, u) 1
2 cI I(ullg
for all u E E.
We have then the following simple result, known as the Lax-Milgram theorem: LEMMA 23.1. If the form a(u, v) is coercive, the operator A is an isomorphism of E onto E'. We shall denote by (u', v)- the antiduality bracket between E and E'. By definition of A we have a(u, v) = (Au, v ) - , u, v E E. Let A * denote the antitranspose of A : (Au, v)- = (A*v, u)-.
Proof of Lemma 23.1. From (23.10) we derive that, for every u E E, (23. I I )
llullE
5~
~ i ~ ~ A u ~ / E1IuIIE r ?
5 CY'IIA*ullE',
which implies at once the assertion (the first estimate shows that A is injective and has a closed image, the second one that this image is dense). Q.E.D. Let us return to the special case of the form a(u, v) given by (23.7). We may identify H-'(R), the dual of Hh(R), with its antidual, by the formula {u', 0 ) - = ( u ' , ,)U u' E H-'(R), D E Hh(R), U being the complex conjugate of u. Observe that, under our hypothesis that aik, 6', c E L"(R), L acts continuously from Hh(R) into H-'(R): It suffices to look at its expression (23.6)
+
t Many authors call coercive the forms a(u,v) such that Re a(u, u) ~ollull&2 e l l l u l l ~ , where xo is a nonnegative constant and IIuiIH a norm on E which is weaker than ]lullE [for instance, the Lz norm on H ' @ ) = El. Definition 23.1 is simpler and perfectly adequate for all our needs.
BOUNDARY VALUE PROBLEMS
206
[Chap. 111
to see this. If we restrict a(u, u) to u, u E C$(Q), we check at once that the operator A : Hh(R) -+ H-'(Q) associated with a(u, u) is identical with L. We have thus obtained
THEOREM 23.2. Suppose that the coeflcients ajk,d', c of L in the expression (23.6) aZZ belong to L"(Q) and that the form (23.7) is coercive on HA@). Then L is an isomorphism of HA(Q) onto H-'(Q). Often one refers to the expression (23.6) of L as the variational form of L ; the justification for this lies, of course, in a remark like that at the end of Sect. 22. We conclude this section by showing that the coercivity hypothesis on the form a(u, v) has an important consequence concerning the leading terms of the differential operator L . Indeed, let us take
5 E R, ,
u(x) = qD(x)eiX'rr
If we go back to (23.7), we derive for large
cp E C$(Q).
1
On the other hand,
If we apply (23.9) and make 15 I go to
+ co, we have (setting e = t/ I 5 I)
(23.12)
It is not difficult to see that this, which has to be true for any cp E C$(Q), implies c1 5
I
ajk(x)ej j,k= 1
that is, by homogeneity:
Indeed, set
4,
I
almost everywhere in Q,
Sect. 231
WEAK PROBLEM. FORMS. ELLIPTICITY
207
Fixing ( arbitrarily on the unit sphere of R,, we can find a sequence of functions in L"(R) which converge to a(., in this space, and which have the following property: For each v, a, is continuous in the complement of a closed subset of measure zero of Q, N, . From (23.12) we derive that, to every E > 0, there is V ( E ) such that v > V ( E ) implies, for all cp E C;(Q),
r)
(23.14)
Fix v > v(E);let xo be an arbitrary point of Q\N,, which is open; take cp with support in R\N, and let I cpIz converge to the Dirac measure 6(x,) at xo. We derive at once from (23.14): (23.15)
(c1
- E)
5 I +o)
I,
vx,
E
Q\Nv.
In the complement of the unidn N of all the N,'s, the Q, converge (for the essential supremum norm) to a(., 5). Since meas(N) = 0, we easily obtain what we want. Condition (23.13) is very important and thus qualifies for a definition: Definition 23.2. The differential operator L. is said to be uniformly elliptic in R i f (23.13) holds. Remark 23.4. Condition (23.13) is not sufficient to ensure that the sesquilinear form a(u, v ) be coercive. Take L = - A 2, with A < 0 and I A I very large. The corresponding al(u, u ) will not be coercive on HA(R). However, we have :
+
Remark 23.5. - When the aJkmake up a matrix which is Hermitian, i.e., when aki = aik,the function we have denoted by a(x, () will always be real (for E R,,). If it does not vanish in R,\(O}, unless n = 1 it must keep the same sign in R, (indeed, unless n = 1, R,\{O} is connected). It follows from this that a(x, t), as a quadratic form on R,, will be either positive or negative definite.
r
PROPOSITION 23.5. Let aik = (1 s j , k 2 n) belong to L"(R); let c E L"(Q) be real-valued. Suppose furthermore that (23.16)
L=
a . a 1" a'"x) - + c ( x ) axk ad
j,k=
is uniformly elliptic in R and the matrix {ajk>is negative definite a.e. in $2. Then i f c(x) > co > 0 for almost all x E R, the form (23.7) is coercive on HA(R). The same is true when SZ is bounded and c(x) >= 0.
208 Proof:
BOUNDARY VALUE PROBLEMS
[Chap. 111
From (23.13) we derive, for u E HA@) real-valued,
1 a j k ( x )ax'au- - , au n
c,Igrad uI2 5 -
j , k= 1
axk
whence, by integration over R, (23.17) But since the matrix {dk)is Hermitian, a(u + iu, u + iu) = a(u, u) + a(u, u) when u, u E HA(R) are real-valued. Thus (23.17) holds also for complex-valued u E H$(R). In order to obtain Proposition 23.5, it suffices to apply Propositions 23.3 and 23.4.
Exercises (Unless otherwise specified, R will be a bounded open subset of R".)
23.1. Prove that there is a number I , < 0 such that the sesquilinear form aA, defined in (22.16), is coercive on HA(R) for all A > I,. Compute the value of A. when n = 1 and R is an open interval, of finite length I, in the real line. 23.2. Let /Iobe the number so denoted in Exercise 23.1. Prove that there is a function u belonging to HA@) such that Au = I. u in R. Compute I. when R is a product of bounded intervals, R = { x E R";a j < x j < b j ,j = 1 , . . . , n } . 23.3. Let R be the open unit disk in RZ,and let I , be the number so denoted in Exercise 23.1, relative to R. Prove that A,, = -so, where so is the smallest zero of the functions Jo(&) (s >= 0) where we have used the standard notation for the functions (23.18)
Jo(z) =
z;1;
2n 0
cos (z sin 0) do.
Prove that there is a positive number x such that J,(x) = 0. 23.4. Prove that, if n 2 2, the restrictions to R of the harmonic functions in R" form an infinite-dimensional linear subspace of H'(i-2). Derive from this that HA(R) has a topological supplementary in H'(R) whose dimension is infinite. If n = 1 and R is a finite interval, what is the analogous statement? 23.5. We consider a complex Hilbert space E ( E could as well be real: the argument would be essentially the same) and a linear subspace CD of E on which is defined a norm, ))I I)),larger than the one induced by E. We are given a sesquilinear form on the product E x CD, having the property that (23.19) for every
4 E CD, u ~ a ( u4) , is a continuous linear form on E.
Sect. 231
209
WEAK PROBLEM. FORMS. ELLIPTICITY
Prove the following (useful) generalization of the Lax-Milgram lemma (Lemma 23.1): LEMMA23.2. Suppose that there is a constant co > 0 such that
(23.20)
coII141112 5 I44?4)l
forall 4 E Q
Then, given any antilinear form on @, p , continuous for the norm is an element u of E such that (23.21)
a(u, 4 ) = p ( 4 )
for all
111 111, there
4 E a.
23.6. Let R be a bounded open subset of R". Prove that A - A (A 2 0) maps H-"'(Cl) onto H-"-*(Cl) for every m = 0, I , . . . . Describe the image and the cokernel of the transpose mapping. 23.7. Let R; denote the open half-space x" > 0 in R" (n 1 2 ) . Prove that the sesquilinear form jR,,(grad u ) . (grad V ) dx is not coercive on H h ( R ; ) . 23.8. Let 8 be the open subset of R3 defined by
z > (1
+ f(X2 +
y2))l'Z
[the variable in R 3 is denoted by (x, y , z)]. Prove the following facts:
(i) the sesquilinear form j F (grad u) . (grad C) dx is not coercive on Hh(O); (ii) there are no harmonic functions in 0 belonging to - Hh(8); there is a harmonic function in 0, continuous in 0, vanishing on the (iii) boundary of 0 (we are not including the points at infinity in this boundary).
24 A More Systematic Study of the Sobolev Spaces If we wish to study the regularity of the solutions to the weak Dirichlet problem (as we strengthen the hypotheses on the regularity of the data), it is convenient to introduce the higher order Sobolev spaces, and also the Sobolev spaces constructed on L p for any p , I 5 p 5 + 00. DeJinition 24.1. Let R be any open subset of R". We denote by Hm'P(R)the space offunctions u E Lp(R) such that (d/ax>"u E LP(R)forall a = (a1, . . . , a,,) E Z:, la1 = a , + * . . + a , s m .
The partial differentiations (d/dx)L1= (d/ax'yl . . - (13ldx")". must be understood in the distribution sense. One usually equips H'"-p(R)with the norm
PROPOSITION 24.0. H m 3"(n) is a Banach space. If 1 < p < rejexive Banach space. I f p = 2, it is a Hilbert space.
+
00,
it is a
Proof. The first assertion, i.e., about the completeness of HmrP(C2),is proved like Proposition 22.1. We leave it to the student. Let N(m,n) be the
number of n-tuples a such that I a1 5 m. There is a natural injection of H'",P(SZ) onto a closed linear subspace of (Lp((n))Ncm7 "), namely (24.1)
u H ((~/ax)bu),U,6 m .
We know that Lp(R) is reflexive for 1 < p < + co, that a finite product of reflexive Banach spaces and a closed linear subspace of a reflexive Banach 210
Sect. 241
21 1
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
space are reflexive Banach spaces, from which we obtain the second assertion. The third assertion is evident. Let us point out that, for p = 2, one writes, customarily, H"(R) instead of H m 32(Q).
In H"(R), the norm (1 (I,
= 11
(u, u), =
Ilm
is associated with the Hermitian product
cI
1a16m
(a/ax)=u(a/ax)'v dx,
R
which, when m = 1, coincides with the product (u, u)~.
Q.E.D.
PROPOSITION 24.1. If 1 5 p < +a,C"(R) n Hm.p(R)is dense in HmtP(R). Proof Let { a v }( v = 0, 1, . . .) be a sequence of relatively compact open subsets of R, whose union is equal to Q and such that, for each v = 0, 1, . . . , O , c R , + , . Set Q; = R , and for v > 1, Q:=Q, -TiY-2. Clearly the R: ( v = 1, 2, . . .) form an open covering of R. Note that at most two R: ever have a nonempty intersection. Let now {cv}( v = I , 2, . . .) be a C" partition of .unity subordinate to the covering {Q:} (the supports of the i,are compact). Let p E C?(R"), p 2 0, SRn p dx = + 1 and set, as usual, for E > 0, p,(x) = E - " ~ ( x / E ) . For each v we select a number E , > 0 such that the neighborhood of order E , of supp [, has its closure contained in a:. We write then u, = pa, * (c, u ) ; of course, v, E CF(Q4). We shall now apply the following lemma:
LEMMA 24.1. L E 1~5 p < + co. r f f E LP(R"),pE* f converges to f in LP(R") -+ +O. Proof. By Holder's inequalities for convolution (TVS, D&K, Theorem 26.1) we have as E
llPE
* fllLP 5
IIPEIIL1llf
IILP.
) ~S ,pe(x)dx = S p ( x ) dx = 1. This shows that, as E varies from 1 But I ~ P ~ I= to 0 (excluding 0), the convolutions pE* form an equicontinuous (i.e., bounded) set of linear operators on Lp. Now, for such a set, the pointwise convergence on LP and that on any dense subset of L p are identical. Thus it suffices to prove that pE* f converges to f in L p for each f in a suitable dense subset of LP. But the assertion is evident if we take this subset to be, e.g., C:(R") [or CYR")l. Q.E.D. In view of Lemma 24.1, given any have, for every n-tuple, c1, I ct I 5 m,
E
> 0 we may choose
ll(~/~x)Ol{(ivu) - UV}/ILP = Il(a/ax)"(ivu) - Pe,
from which it follows that we may choose lli"U
E,
E,
so small as to
* (~/~x)l(c,u)llLP 5 E2-,,
so small as to have
- vvllm,p5 &2-"-l.
212
[Chap. 111
BOUNDARY VALUE PROBLEMS
Let us set u = uv . Since, for each v > 1, the support of u, meets at most only those of u , - ~ and u Y t l r u clearly belongs to Cm(R). Moreover, if K is any compact subset of R, there is v ( K ) such that (, u and v, vanish identically in a neighborhood of K for v 2 v(K). Therefore
6
C
IICvu - uvIIrn,p
v
=< E.
This proves everything we wanted: first of all that u - u, hence u belongs to H"*P(R)-and then, that u is the limit in HrnaP(R) of a sequence of functions such as u. Q.E.D. Remark 24.0. Proposition 24.1 shows that HmqP(R)is a "concrete" realization of the completion of Cm(R)n HmpP(R) for the norm I( I(,, (when p is finite). Remark 24.1. If we inspect the proof of Proposition 24.1 and make now the assumption that the support of the element u of H'", "(n) is compact, we see that this is true also of the support of the approximating element, u E C,"(R). Finally let us point out that Proposition 24.1 is evidently false forp = It is then false when rn = 0.
+ OD :
Definition 24.2. The closure of C:(R) in H", "(n) is denoted by H t , "(n) [by Ht(R) when p = 21. Let us point out the obvious inclusion relations (which give rise to the natural injections-bounded linear, with norm 5 1):
H",
"(n) c
H"'. "Q),
HtyP(Q)c H$,P(R)
if
rn > m'.
When R is bounded, we have LP(R) ci Lq(Q) if p > q. Hence, in this case, Hm.P(R)ci Hrn*q(R) and Ht7P(R)cj HE*q(R)(however, one should be careful and not think that the norms of these injections are 5 I). The case p = 00 is not often used-this is because L"(R) has some unpleasant properties, as is well known: Not only is it not reflexive [this is also true of L'(a)], but it is not separable [which L'(R) is]. Let us also point out that Ht*"(R) consists of C" functions in which vanish at the boundary of R (or at infinity, if R = R"), together with all their derivatives of order s m . In any case, the injection Cp(R) 4 Hr.p(R) has a dense image, therefore its transpose
+
a
(Ht*P(R))'-+ W(R)
Sect. 241
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
213
is injective and we may identify the dual of H:.P(Q) with the image of this transpose, which is a space of distributions in R. When p < + co the description of the latter is very simple: PROPOSITION 24.2. r f I p < + co,the dual of H:* "(Q) is the space H-", "'(R) of distributions in R of the form (24.2) where p'
=p / ( p -
1).
Proof. That every distribution T of the form (24.2) defines a continuous linear functional on H:* "(Q) is evident. Conversely, consider the mapping (24.1): It is an isomorphism of H;vP(R) onto a closed linear subspace of (Lp(Q))Ncm,n). By virtue of the Hahn-Banach theorem, its transpose is a continuous linear surjection of the dual of the latter onto that of H;*"(R). Of course, canonically, {(L"Q))N(m.n)}'
= (p'(Q))Ncm.
n),
can be identified with a i.e., a continuous linear functional on (Lp(Q))N(m*n) collection (fa)lul s m of functions belonging to Lp'(R),and the linear functional it induces on H:*p(R) can be defined as
If we want to know precisely what the distribution T i n R is equal to, we restrict this formula to u E C:(R). We obtain Q.E.D.
There is a natural norm on H - m . "'(Q). If p' <
where the injrrzum is taken over all families that (24.2) is true. When p' = + co, it is
+ co, it is
(fa)lul5m
in (Lp'(Q))Ncm,n) such
where inf has the same meaning as in the case p' < + co. If we identify the dual of H;*"(R) with H-m*P'(R), we have the right to ask whether the norm 11 11 -m, p g is equal to the dual norm on H-"* "'(R). That it is so can easily be seen by inspection of the proof of Proposition 24.2. We leave this verification to the student.
214
[Chap. 111
BOUNDARY VALUE PROBLEMS
We shall need the following result, which is an immediate consequence of Remark 24.1 (and which is obviously false when p = + 0 0 ) : PROPOSITION 24.3. If 1 5 p < + 00, every element u of H", support in R belongs to Hr>"(R). PROPOSITION 24.4. Let 1
"(a)with compact
s p 5 + co be arbitrary. Then the multiplication (A U ) H f U
(24.3)
is a continuous bilinear map of H", "(Q) x H"* "(R)into H". "(SZ).
The proof is left to the student. PROPOSITION 24.5. I f p < + co, the mapping (24.3) is a continuous bilinear map of H". "(R) x H t - "(R) into H r * "(R).
Proof. Suppose that u E Hm3P(R)has a compact support contained in R; then the same is true of f u , whateverf E H", "(R). It suffices therefore to comQ.E.D. bine Propositions 24.3 and 24.4. Remark 24.2. Proposition 24.5 is false when p
=
+ co.
Let us now introduce a second open subset R' of R" and a mapping $ : R R' with the following properties:
II/ is bijective;
(24.4)
(24.5) $ = ($I, . . . , $") and I/-'= (XI,. . . , x") are m times continuously differentiable (that is, their components are) in R and R', respectively; (24.6) for every j = I , . . . , every n-tuple a, 0 < in R, (d/dx)"xjis bounded in R'.
I tl I 5 m, (d/dx)ol$jis bounded
We are supposing m > 0. Then PROPOSITION 24.6. Under the preceding hypotheses, whatever 1 5 p 2 + co, $ is an isomorphism of H"*P(R') onto H"*'(R). Its inverse is
UHU
0
V H V
0
*-I.
The proof, a simple application of the chain rule and of the mean value theorem. is left to the student. PROPOSITION 24.7. The hypotheses are the same as in Proposition 24.6 except that p must be finite. Then U H U $ is an isomorphism of HZ*P(R')onto 0
Htg "(R).
Sect. 241
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
215
Proof. In view of Propositions 24.3 and 24.6, it suffices to show that if the support of u E H", "(a') is a compact subset of R', that of u 0 1c/ is a compact subset of Q, which is obvious, since the latter is the image of the former under $-'. Q.E.D.
We consider now the case where R = R" and p = 2. We recall the definition of the spaces H" for s real arbitrary (Definition 13.1): H" is the space of tempered distributions u in R" whose Fourier transform is square-integrable with respect to the measure (27C)-"(1
+ lt12)"dt.
The norm of ti in the space L2 with respect to this measure is, by definition, the norm of u in H", which we have denoted by IIulls. Now, if we go back to Definition 24.1 and perform a Fourier transformais the space of L2 functions u in R" tion, we see that H"(R") = Hmg2(Rn) whose Fourier transform ti is such that (24.7) belongs to L2(R,) and, again by virtue of the Plancherel theorem, I I U I I ~ , ~ is equal to the L2 norm of (24.7) divided by (27~)"". We conclude easily that the space H"(R") defined according to Definition 24.1 is identical to the one defined according to Definition 13.1. However, the norms II Ilm and II llm.2 are not equal, unless m = 0 or 1 ; they are merely equivalent. We are now going to consider H" for arbitrary, possibly nonintegral s. It is clear that we have H" c H"' for s' < s and that the injection is continuous and has norm 5 1. On the other hand, Proposition 13.1 shows that, when R = R", H"(R) = H:(R). Let in fact s be any real number and introduce the isometry of H" onto H o = L2 defined in (13.21): T"u= F - ' { ( l
+ 1 t12)"/2a},
u E H".
By transposing the sequence
c,m
natural injection
~
s
-
LZ, 2
-
and by identifying, as usual, L2 with its own dual, we obtain 'T=
L2 -+ (H")'
natural injection
9'(R"),
where the second arrow is injective. If we identify (H")' with its image in 9', by restricting '(T")to functions belonging to C: (or to 9) we see at once that it is equal to T" itself. But the transpose of a surjective isometry is a surjective isometry, and T" is such an isometry from L2 onto H-'. If the student thinks a little about all this, he will see that we have proved
216
[Chap. I11
BOUNDARY VALUE PROBLEMS
PROPOSITION 24.8. The dual of H" is canonically isomorphic to H-". The canonical antilinear isometry of the Hilbert space H" onto its dual is transformed, by this isomorphism, into the mapping
Tz": U H % - ' ( ( ~
+ 1512)"%U),
'
where 9 denotes the Fourier transformation and %- its inverse. It is checked at once, again by using Fourier transformation, that Proposition 24.8 does not contradict Proposition 24.2 (when s = m and p = p' = 2): H-",'(R) = H-"(R") when R = R". However, it should be pointed out that the norm )I 11 -", we have defined on page 213 is not equal to the norm 11 11" defined in (13.20) when s = -m, except in the case where m = 0 or 1. Otherwise they are merely equivalent. The spaces H" for nonintegral s intervene in a natural way in several problems of PDE theory-we have encountered them in the study of the Cauchy problem. They will intervene once more, in an unavoidable manner, when we study the traces, or boundary values, of functions belonging to H"(L2) (Sect. 26). We shall then take a closer look at their properties. We conclude this section by extending the property that H"(R") = HZ(R") to the H", spaces for p # 2 : PROPOSITION 24.9. Whatever the integer m 2 0, whatever p , 1 5 p < C:(R") is dense in H"3P(R").
+ co,
Proof: Let CEC:(R"), ((x) = 1 for 1x1 < 1, ((x) = O for 1x1 > 2; set (,(x) = C(v-'x), v = 1, 2, . . . . It is checked at once that, if u E HmVP(R"), cvuconverges to u in H", P(R")when v + + 00. But the support of each l v u is a compact subset of R", hence, by Proposition 24.3, (, u E HE, p(R")for every v. This proves our assertion. Remark 24.3. One should not think that R" is the only open set such that, for some m > 0 and some p , 1 g p < + co, HmSP(!2)= HEsP(!2). An interesting case is that of p = 2. One has the following result (for a proof, see [I, Expos6 151): THEOREM.Let R be a nonempty open subset of R", m an integer greater than zero. Then H"(R) = HE@) ifand only ifR"\R is m-polar, i.e., has the following property: (24.8) a distribution u in R" whose compact support is contained in R"\a cannot belong to H-"(R") unless u = 0. In relation with this, let us note that we have PROPOSITION 24.10. Let R be a bounded subset of R", m an integer greater than zero. Then HEpP(Q)# H"*P(R).
Sect. 241
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
217
Proof. If R is bounded, the constant functions belong to H", "(R); it suffices to show that the function identically equal to one, 1, does not belong to H;*"(R). It suffices to show that 1 does not belong to HA*'(Q), the largest of all those spaces. Let cp E C:(Q) be arbitrary. We have
On the other hand, there is a constant C > 0, depending only on R, such that
from which 111-d1,1
Zme a s(Q)-(C-
2 measG-4
- (C
1)IIgradcpl dx
- 1>111- (PII1.1,
that is, meas(Q> 5 CII1 - (Pll1.1, which shows that the distance in H1*'(R) between the function 1 and the subspace C:(R) is bounded away from zero. Q.E.D.
Appendix
The Sobolev Inequalities
In this section we have looked at the regularity of functions and distributions as measured by the scale of spaces H", or H s and, actually, throughout this text, we have been and shall be using, almost exclusively, the nOtion of regularity defined and measured by the spaces H s (i.e., with " base" L'). But in numerous problems one needs to have a more precise definition, or at any rate a different measure of the regularity. For instance, an often-used notion of regularity is that of Holder continuity. It so happens that there is a link between Holder continuity and regularity in the sense of the Sobolev spaces, and although we shall not explore this aspect, we shall give in this Appendix the simplest and most celebrated results in such a direction, linking Sobolev regularity to continuity. These are the Sobolev inequalities. The reason for stating and proving them here is not that they will be used again in this book, but only to have them at the disposal of the reader who needs them, or is curious regarding their statement (or proof). For further facts, and the relationship with Holder continuity, we refer the reader to [F, Part I]. We begin by proving the following global (that is, in R")result:
218
BOUNDARY VALUE PROBLEMS
[Chap. I11
LEMMA 24.2. Suppose 1 S p < n and define p* by (24.9)
Then there is a constant C
= C(n, p )
> 0 such that
Proof. If u E C:(R"), we have
and a similar inequality with integration performed from xi to which follows with obvious notation,
+
00,
from
and therefore (24.1 1)
We integrate both sides of (24.1 1) with respect to dx" and apply the standard Holder's inequalities (24.12)
where 1 6 tLk 6 +00 and a;' are assuming n 2 2), a1 = *
a
*
+ ... +a;' = M, = n -
= 1. Here we take r 1, and
=n
- 1 (we
We obtain
Next we integrate with respect to dx"-l, applying once more (24.12) with r = n - 1, but choosing this time
uk =
(/Is1
( n - 1) -
dxk dx"}
(n-
1
,
k
un=(/lgldx")
.
Sect. 241
219
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
We repeat this operation until we obtain (24.13)
In order to obtain the estimate (24.10) when p elementary inequality (24.14)
1 a , ... a , i l ( a , n.
+...+a,,)",
=
1, it suffices to apply the
a j 2 0 (1 S j j n ) .
Assume now p > 1 . We observe that, if u E C",R"), the function I u/C(n-l)'nlpL belongs to Ca(R"), since [(n - l)/n]p* = [(n - I)/(n - p ) ] p > 1. We also observe that the estimate (24.13) extends to functions belonging to C;(R"). We substitute in it ~u~[("-')'"lp* for u, and use the fact that
We observe that q([(n - I)/n]p* - I } = p * and that (n - l ) / n - l / q = l/p*. We obtain at once (24.15)
from which we obtain (24.10) for p > 1 by applying (24.14) once again. Q.E.D.
We come now to the Sobolev inequalities valid for functions in a bounded open subset n of R".We shall subdivide their study into two cases, and make different assumptions about R in each of the cases. Our first assumption, which we make for values of the integration exponent p close to one, will be that elements of HI. "(Q) can be extended into elements of H' * p(R") by a continuous linear operator: (24.16) The mapping "restriction to R", rR, maps H'pP(R")onto H'.P(R), and ir has a continuous right inverse, zn : H'.p(R) +. H'*P(R")[which means that roen is the identity on H"P(R)].
It will be shown in the Appendix to Sect. 26 (Theorem 26.A.3), that (24.16) is satisfied whenever the boundary 8R of R is a C' hypersurface, with R lying on one side of it.
THEOREM 24.1. Suppose that (24.16) holds and that (24.17)
1 s p < n/m
( m is an integer 2 0 ) .
220
[Chap. I11
BOUNDARY VALUE PROBLEMS
-l ---l _ r_ n
(24.18)
P'
P
n'
then H". "(R) c Lp'(Q),and there is a constant C > 0 such that
(24.19) 11 u IILP'(R) 5 CII IIH**P(R) vu H"' In the second case, corresponding to " large " values of p , we shall assume that Q has the cone property. Let A be a subset of the unit sphere in R"; let us denote by T ( A , h) the set of vectors u in R" such that 0 < lul < h and such that u/l u I E A . We shall assume that there is a number h > 0 such that the following holds : 9
(24.20) Given any point x E R, there is an open subset 0, of S"-', whose (surface) measure is no less than h, and such that
x + l-(Ox, h) c R. If the boundary of is a C' hypersurface, and R lies on one side of it, or if R is convex, then Q has the cone property (Exercise 24.5); this also holds in other cases (Exercise 24.6).
THEOREM 24.2. Suppose thaf (24.20)holds and that n -
(24.21)
rnzl.
rn
Then H"pP(R)c Co(Q)n L"(Q) and there is a constant C > 0 such that
(24.22)
SUPIU(X)I 5 CII~IIH*.*(R)7
Vu E H"* "(Q).
R
Proof of Theorem 24.1. We consider first the case rn = 1. Then p', defined in (24.18), is equal to p*, defined in (24.9). If we combine Lemma 24.2 with Proposition 24.9, we see that H'."(R") c LP*(R")and that we have, for a suitable constant C > 0, (24.23)
IIVJIp*(p>s
Vu E
CIIUJIH1+P(p),
(R").
We then use the extension mapping E * , whose existence (and continuity) is postulated in (24.16). We get, for some C1 > 0 and all u E H ' l P ( Q ) ,
5 C1
IIEQuIIH~.P(R~)
IIuIIH~.P(R)-
Since obviously I I u I I ~ ~ * (5~ ) l l ~ n ~ l l L p * ( R - ) we , get (24.19) in this case. Let now rn be arbitrary and greater than one. To say that u E H", "(R) is equivalent to saying that Dau E HlVP(R)for all c1, I cx I tn - 1. By applying (24.19), where m = 1, to Dau, I u I rn - 1, instead of u, we obtain IIUIIHm-1.P*(n)
5 CZIIUIIH*.~(~) 9
Vu E H".
"(a).
Sect. 241
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
22 1
We may iterate this inequality m times, observing that the operation p ~ p * , Q.E.D. applied rn times to p , yields exactly p' given by (24.18). Proof of Theorem 24.2. Let x be an arbitrary point of R. We may assume that it is the origin and use polar coordinates r , 8 centered at this point. Let us call I- the truncated cone x + I'(Ox, h) in condition (24.20). Let g denote a nonnegative C" function on the real line, g(t) = 1 for t < h/2, g(t) = 0 for t 2 3h/4. Let u E Cm(R) n H m S P ( Rfor ) ; 8 E O,,
(-1)" -
(rn - I)! ~
h
r"-"{(a/ar)"[g(r)ul(r,O)])r"-' dr,
0
after m - 1 integrations by parts, and writing ul(r, 8) for the expression of u in the coordinates r , 8. We integrate now with respect to 8 over 0, whose measure is Lh. We obtain
by Holder's inequalities [q = p / ( p - l)]. If p = 1, in which case q = +a, we have rn > n. I f p > 1, q(rn - n) + n - 1 > - 1 because m > n/p. In order to conclude the proof, it suffices to observe that (a/&)"' is a linear combination of the ( d / d ~ ) ~I p, 1 = nz, with coefficients belonging to L"(T). The fact that r c R implies at once (24.22) for u E Cm(R) n Hm9"(R)and Q.E.D. Proposition 24.1 implies the full conclusion of Theorem 24.2.
Exercises 24.1. Let E be the fundamental solution of the Laplace operator A in R" defined in (9.19). Prove that, whatever the bounded open set Q, the restriction of E to 52 belongs to H ' * P ( Q )for all p < n/(n - 1) (we suppose n 2 2). Derive from this fact that the Dirac measure 13,~at some point xo of R belongs to H - * "(Q) for those same values of p . 24.2. Let R be a bounded open subset of R".We suppose n 2 2. Let p be an arbitrary Radon measure in R", positive and carried by R (this means that the complement of Q is of p-measure zero). Let E be the fundamental
222
BOUNDARY VALUE PROBLEMS
[Chap. I11
solution of the Laplace operator considered in Exercise 24.1. Prove that, for all p c n/(n - l), the restriction to R of the convolution E * p belongs to H'vP(SZ). [Hint: Write (24.24) limiting the variation of x to R, regard E(x - y) as a function of y in R, valued in the space H'*P(R)with respect to x , by using the result in Exercise 24.1 applied to translates of E. Use then the fact that the norm, here in H ' s P ( R ) , of the integral, is not greater than the integral of the norm, together with the fact that
Derive from the above that all the Radon measures in R, whose total mass is finite, belong to H-"P(SZ), p < n/(n - 1). Derive the same result from Theorem 24.2, when R has the cone property. 24.3. Let m be an integer, 0 < m c n, Em the distribution defined in Exercise 9.2. Prove that if p < n/(n - m), and if R is an arbitrary bounded open subset of R", Em E H", "(R). 24.4. Let R be a bounded open subset of R". Let m be an integer, 0 < m < n, p a positive number, 1 p < nlm. Let R be a positive number, so large that Si + is contained in the open ball I x I < R (we shall denote by zR the characteristic function of this ball). Let Em be the distribution so denoted in Exercise 24.3. Prove that (24.25) IIo""(Em * u)llLycq 6 I ) ~ ' u J I L p l l ~DB&l Vu E HZ*P(Q), R J, , if lcll m , m, q < n/(n - m ) , and l / r = l / p + l/q - 1 . Derive from this fact that u EL' for all r such that 1 5 r < np/(n - m).Compare this with the conclusion of Theorem 24.1. 24.5. Let R be a bounded open subset of R". Prove that R has the cone property (24.20) in the following cases: (1) R is convex; (2) Q is a C' hypersurface and 0 lies on one side of it. 24.6. Give an example in R2 of a bounded open set which does not have the cone property. Using the fact that any finite union of sets having the cone property has the cone property, give an example of an open subset of RZ which is not convex, whose boundary has " corners," and which does have the cone property. 24.7. Let R be a bounded open subset of R", having the cone property (24.20). Given any integer m, 0 6 m < +a,denote by 9T"(SZ) the space of complex C" functions in Q whose derivatives, of order g m , all belong to L"(Q). Suppose that we have (24.26) l S p S + a ~ , m>k+n/p. Prove then that H'"? '(R) is continuously embedded in @(a).
s
s
s
Sect. 241
SYSTEMATIC STUDY OF THE SOBOLEV SPACES
223
24.8. Prove that ifp > n, there is a constant C > 0 such that
[Hint: Write
where R is the set of points z such that Iz--xI S I x - y l , I z - y [ S I x - y I. Estimate then each integral on the right by changing to polar coordinates, centered either at x or at y , and by applying Holder's inequalities.] 24.9. Let Q be any open subset of R",a a number such that 0 2 a 5 1. We shall denote by g'(R) the space of bounded continuous functions u in CI such that
(24.28) [Functions having the property (24.28) are called uniformly Holder continuous, with Hiilder exponent a if 0 < a 5 1 ; when a = 0, they are simply the functions which are continuous and bounded; when a = 1, they are called uniformly Lipschitz continuous.] Suppose R is bounded and show that the left-hand side of (24.28) defines a norm on B'(Q)which turns it into a Banach space. Prove that @(a)is exactly equal to the space of continuous functions in R whose distribution derivatives belong to L"(R). 24.10. Let R be a bounded open subset of R" having the extension property (24.16). Let 0 < a < 1 and denote by ~ " + " ( R the ) space of C" functions in R whose derivatives of order rn belong to g"(f.2)(see Exercise 24.9). Derive from the result in Exercise 24.8 that H"9P(R)is continuously embedded in Bm-l+u(R)if p = n/(l - a). Suppose now that m > n/p (1 5 p < m), n/p not an integer. Prove then that H", p(t2) is continuously embedded in GP"'"/p(Q).
+
25 Further Properties of the Spaces H" The study of boundary values, or traces on 8R, of the functions belonging to H"(R) requires a deeper study of the global Sobolev spaces H" = H"(R") for nonintegral values of s.? This section is devoted to such a study. We begin by looking at some elementary operations on H", and first of all, at multiplication by smooth functions. We recall that Y = Y(R") is the space of C" functions cp in R" whose derivatives of all order decay a t infinity faster than any power of 1/ I x I.
PROPOSITION 25.1. The multiplication (cp, into H", such that (25.1)
Ilcpulls 6
We have denoted by
U)H
cpu is a bilinear map of Y x H"
ll~Il"J(1+ lvl)'"'I@(v)l 4-
4 the Fourier transform of cp: @(<)= e - '<"'~)cp(x) dx.
It is well known that the Fourier transformation is an isomorphism of Y onto itself. Proof. It suffices to prove the assertion when s 2 0. Indeed, in H - " , the multiplication by cpg.9 is the transpose of the same operation in H".We have
Ilcpull:
=
J (1 + I 5 12)s I J4j(v)fi(5 - 'I)4 I2 d5,
But, as easily seen, (25.2)
(1
+ 1512)"'2 6 (1 + Ivl>"l + 15 - )112)s'2?
t In the subsequent discussions, unless otherwise specified, s will be an arbitrary real number. 224
Sect. 251
FURTHER PROPERTIES OF THE SPACES
H"
225
and therefore
We obtain (25.1) by applying the classical Holder inequality: llf*g11Lp 5 IlfllL111g11u9
f e L ' , g E L P (1 SL'S +a).
Q.E.D.
Remark 25.2. The estimate (25.l), although sufficient in many applications, is not the best possible. For instance, when s = m, a nonnegative integer, we have the better estimate:
(25.3)
IlCPUllm
S const
I l ~ l l m SUP
bllm
l/Da~/lL~*
Generalization of (25.3) to H s for [ sI 2 m is possible but less easy to prove. Note also that, when s = m (or s = -m), the multiplication cpu, u E H", is defined for far more functions cp than just those belonging to 9. In particular, it is defined for cp E Bm,the space of C" functions whose derivatives of all order are bounded in R". For such cp the estimate (25.3) is valid, and can be extended to Is1 5 m. We go on, next, to look at convolution. In a sense, this is a neater operation than multiplication on the Sobolev spaces H". Let us denote by A" the space of tempered distributions u in R" whose Fourier transform ti is a (measurable) function with the property that (1
+ [ 5 lyti(t)E L".
PROPOSITION 25.2. The convolution (u, u ) w u * v is a bilinear mapping of H" x A' into H"", such that (25.4)
/ / u * u/I,+t
S IIuIIsII(1 +
I
t12)"2fj/ILm.
The assertion is practically evident. An important case is that where 3 is a polynomial P ( t ) of degree 5 m ; note that, then, u E H - " , m. As for the convolution with u, it is the (linear) partial differential operator with constant coefficients usually denoted by P( D), D = (- i dldx', . . . , - i d/dx"). We thus see
COROLLARY 25.1. Let P(5) be a polynomial of degree 5 m on R,. Then u w P ( D ) u is a continuous linear map of H" into H"-". We may take advantage of Propositions 25.1 and 25.2 to add some precision to the statement that Y is dense in H s (Proposition 13.1). First of all, let c E Cp(R") be equal to one in the open unit ball, and set cy(x)= [(x/v), v = 1, 2, . . , . We have [,(<) = v"g(v5). and it follows at once from (25.1)
226
[Chap. I11
BOUNDARY VALUE PROBLEMS
that the mappings U I - + [ , U form an equicontinuous set of linear mappings H" -+ H". For such a set, pointwise convergence in H" and that on a dense subset of H" are one and the same thing. Since when u E 9, [,u converges to u in H" (as a matter of fact, in Y ) ,we reach the following conclusion:
c,
PROPOSITION 25.1'. If the functions are as described, for each u E H", the cutoffs u converge to u in H" as v + 00.
+
c,
E
Let now p belong to C:(R"), > 0. Observe that be(<)= @(E().
p ( x ) dx
=
1 and set p,(x) = E - " ~ ( x / E for )
25.2'. If u E H", pE * u converges to u in H" as PROPOSITION
E +
+O.
ProoJ: We see that pe E A' and, by (25.4),U H p, * u form an equicontinuous set of linear mappings of H" into itself. Since the assertion is well known when u E C:, it is true for any u E H". Q.E.D.
By combining Propositions 25.1' and 25.2' we see that every u E H" is the limit of the p, * ([,,u) when E + v - l -+ +O. l n particular, each u is the limit of functions u, (k = 1, 2, . .) belonging to CP(R") whose supports converge to that of u, in the sense that, given any compact set K c R", K n (supp uk) converges to K n (supp u). We denote by C: the space of m-times continuously differentiable functions in R" whose derivatives of order all tend to zero at infinity.
sm
PROPOSITION 25.3. If s > m
+ 4 2 , H" c C:.
Proof. In view of Corollary 25.1, it suffices to prove the assertion when m = 0. Let u E H". There is a function v E L2 such that
fi
= (1
+
1<12)-"a.
If s > 4 2 , (1 + 15 12) - " belongs also to L2, hence li E L'. It suffices then to apply a classical theorem of Lebesgue. Q.E.D. PROPOSITION 25.4. The Dirac measure 6 belongs to H - n 1 2 -for E any E > 0. No distribution with support contained in a finire set of points belongs to H-"" (unless it is identical to zero).
+
ProoJ: The Fourier transform of 6 is one ;and (1 I ( I')s/' E L2 if and only if s -n/2. This proves the first assertion. If u E H s has its support contained in a finite set of points, there is a function q E Cr such that q ~ uis not identical to zero (unless u is!) and supp(qu) is one point. Since the spaces H" are easily seen to be invariant under translations, we may suppose that this set is the origin. Now, all distributions with support
-=
Sect. 251
FURTHER PROPERTIES OF THE SPACES H"
227
at the origin are of the form P( D)6, where P ( c ) is a polynomial. The Fourier transform of P( D)6 is precisely P(5). Suppose that the degree of P is m. Then (1 + I t12)"'2P(()belongs to L2 if and only if s + m < - 4 2 , from which we Q.E.D. obtain the last part of the statement. Let K now denote a compact subset of R". We denote by H s ( K ) the subspace of H" consisting of those u which vanish in R"\K, equipped with the norm I( ( I s . It is a closed linear subspace of Hs,hence a Hilbert space for the induced structure. We shall now compare the norms 11 /Is and 11 JIsI when restricted to Hs((K), for s' < s.
PROPOSITION 25.5. Let s, s' be any two real numbers such that s' < s and let K be any compact subset of R". The natural injection of Hs((K) into H" is compact. Proof: We must show that, if a sequence {u,} converges weakly to zero in H"(K), it converges strongly to zero in H"'. For distributions, strong and weak convergence of sequences are one and the same thing. Form
a,(()
= (u,,
e - i ( x * c>,)
CEC".
From the easy part of the Paley-Wiener theorem we know that the i,() are entire functions of exponential type in C". When ( remains in a bounded subset of C", the exponentials exp(-i(x, ()) form a bounded subset of C"(R"). We know that, since they have their supports in a fixed compact set, the u, converge to zero uniformly on such subsets of C"(R") and therefore the ti, converge to zero uniformly on every bounded subset of C". On the other hand, we know by the "principle of uniform boundedness," that the norms IIu,II, are bounded. We have
We may choose R independently of v, and so big that the first term in the last member is 5 ~ 1 2then , v so large as to make this true also for the second term. Q.E.D.
PROPOSITION 25.6. Let s, s' be any two real numbers such that s' < s and s 2 -n/2. To every E > 0 there is 6 > 0 such that, if the diameter of K is 56, (25.5)
IIulls. 5
EIIuII,
for every u E H'(K).
[Chap. I11
BOUNDARY VALUE PROBLEMS
228
ProoJ Suppose that the assertion in Proposition 25.6 were not true. After a translation, we could find a sequence of elements {u,} in H" with supp u, c {x E R";1x1 < v - I } , such that IJu,II, 5 I , [[u,lls,> c > 0 for all v. From the first inequality we conclude that a subsequence of the u, converges weakly in H', hence, by Proposition 25.5, it converges strongly to some element u E H s in H", and from the second inequality we conclude that u # 0. But the support of u must be {0}, and this contradicts Proposition 25.4. Q.E.D.
If R is any open subset of R",we write
H:(R)
=
u K
H"(K),
where K ranges over the collection of all compact subsets of R. When R = R", one often writes HE = HE(R"). We denote by Hf,, (R) the space distributions u in R such that cpu E H s for all cp E C,"(Q). One usually equips Hi,,(Q) with the topology defined by the seminorms U H llcpulls as cp ranges over Cp(R). It is easy to find a sequence {cp,} (v = 1,2, . . .) in C,"(Q) which converges to the function identically equal to one in a strong manner: If K , is the (compact) set of points x E R such that cp(x) = 1 , K , is contained, for each v , in the interior of K , , and R is equal to the union of the K,'s. Then the seminorms u t,IIcpvullsform a basis of continuous seminorms on Hfoc(R),which shows that this space is metrizable. By using the fact that H" is complete, one checks at once that H ~ , , ( Q )is also complete. Thus the latter is a Frtchet space. By using the reflexivity of H s (which is a Hilbert space!), one can easily check that HY,,(Q) is also reflexive. We have Cr(R) c Hi,,(Q) and the natural injection is continuous and has a dense image; its transpose is a continuous linear injection of the dual of HS,,(Q) into 9'(Q), and enables us to identify the dual of HT,,(R> with the image of this transpose. By using Proposition 24.9, one sees at once that this image is nothing else but H;'(Q). We wish now to prove the inoariance of the spaces H,S,,(Q) under a diffeomorphism. For that, we shall need a new norm on H", equivalent to the norm II 11s.
LEMMA25.1. If 0 5 s S 1,
ProoJ
The proof follows at once from the obvious inequality (for 0 5 s 5 1) (1
+ IC;12)"51 + 1q2"2(1 + 15I2)s,
(ER".
Sect. 251
FURTHER PROPERTIES OF THE SPACES
HS
229
LEMMA25.2. If0 < s < 1, we have, for all u E C:(R"),
where C, depends only on n and s. Proof: By Parseval's formula we have
Indeed. the Fourier transform of u(x + y ) with respect to y is equal to O(<) exp(i(x, f ) ) . Note that, since s is smaller than one, the integral
is convergent. Let R be a rotation i n R,. We have
=
j1 e i ( x ' . c )- 1 l 2 [ x ' (
-n-2s
dx'
setting x'
= 'R-'x.
Thus I ( R f ) = I ( f ) , which shows that Z(f) depends only on the norm the other hand,
I f I. On
This shows that f ( f ) is a positively homogeneous function of f of degree 2s, hence I ( ( ) = C,I (I2" for a suitable constant C, > 0. This easily implies what we wanted. Q.E.D.
Remark 25.2. By combining Lemmas 25.1 and 25.2 we see that the square root of the right-hand side of ( 2 5 . 7 ) defines on H" a norm equivalent to I1 ]Is. When s is an arbitrary positive number, possibly greater than one, but in
230
[Chap. I11
BOUNDARY VALUE PROBLEMS
any case different from an integer, the following is the square of a norm on H" equivalent to 11 11":
where [s]denotes the largest integer ing only on s and n (C > 0).
5 s and C, is a suitable constant depend-
Remark 25.3. Let us go back to the case where 0 < s < 1. We may then define H" as the space of functions u E L2(Rn)such that
I x - y (-"-""[u(x)
- ~ ( y )E ]L2(RZn).
Consider now a C" diffeomorphism of R onto another open subset R' of R", x. If . U E C:(R'), we set (u x)(x) = u(x(x)), X E R ;of course, v x E C;(n). If now u E 9'(Q'), we set 0
0
We have denoted by dxldx the Jacobian determinant of
x.
PROPOSITION 25.7. Let K' be an arbitrary compact subset of R', s any real number. There is a constant C,(K') such that (25.10)
IIu
0
5 CS(K')IIuIIs for every
u E H"(K').
Proof It suffices to prove (25.10) when u E C:(K'). Indeed, let K ; be a compact neighborhood of K' in R'. We can then assert that u H u 0 x is a continuous linear map of C:(K;), equipped with the norm 11 (IE, into H" and hence extends to the closure of C:(K'J in H'. This closure contains H"((K') as we see by regularization (Proposition 25.2'). Case I : 0 < s < 1. We apply Lemmas 25.1 and 25.2. We write x' = ~ ( x ) ,y' = ~ ( y )We . have J J I U ( 4
- u(y')121x - Y l =
-n-2s
d x dy
jJI u(x!) - u(y')
12
I x'
- y'l-"-2" w(x', y ' ) dx' dy',
where
and where dxldx' and dyldy' stand for the Jacobian determinants. Clearly w is bounded on K' x K ' .
Sect. 251
FURTHER PROPERTIES OF THE SPACES
Case ZZ: s = 0, 1, . . ,
.
HS
23 1
This is a particular case of Proposition 24.6.
Case ZZZ:s > 0 arbitrary. Let [s] denote the largest integer 5.q and set 8 = s - [s]. Now, that u belongs to H S means that D"u E H e for all CI, I a ] 5 [s]. And llulls is equivalent to
By combining this remark with the result relative to 8, we easily obtain the result for s. Case ZV: s < 0.
We have
Ilu
O
xlls =
SUP /Ju(x(x))O(x)dxl
where the supremum is taken over v E C: such that llvll - s = 1. As a matter of fact, we may limit ourselves to those Y whose support lies in ,j'(K;), K ; a compact neighborhood of K' in R'. Now, if we set x' = ~ ( x ) we , see that
The absolute value of the Jacobian determinant I dxldx' I is a C" function in an open neighborhood of K' (in fact, in a), therefore multiplication by this function defines a bounded linear operator on H-"(K'). On the other hand, we may apply (25.10) with u substituted for u and 2' for x, when s is replaced by -s. We obtain IIu O
xlls
5 const
IIVII - s l l u l l s = const
llulls.
Q.E.D.
We are now in a position to prove the announced invariance:
PROPOSITION 25.8. Let x be a difeomorphism of R onto R'. Then u H u 0 x is an isomorphism of HS,,(R') onto H~,,(R).Its inverse is the mapping v H Y 0 2'. Proof: Let cp E C;(sZ) be arbitrary. Set cp'
d u
O
= cp
0
1'. We have
x) = (9'4
O x 9
hence IIcp(u x)lls 5 C(K)IIcp'ull, by Proposition 25.7 [ C ( K ) > 0 depends on the compact set K = supp cp]. This shows that the mapping u H u o x is continuous from HfJR') into H~,,(R), which, of course, is enough. Q.E.D. 0
We are now able to define H'(9I) when )131 is a compact C" manifold (without boundary). This means that 9I is a compact Hausdorff topological
232
[Chap. I11
BOUNDARY VALUE PROBLEMS
space, carrying a C" structure. The latter can be described as follows: We are given an open covering {U,) (a E A ) of W and, for each a E A , a homeomorphism cp, of U, onto an open subset of R" (the dimension n is independent of a and is called the dimension of the manifold m). These homeomorphisms cp, are submitted to the following condition. Suppose that U, n U, = U,, # fzI; the restrictions of cp, and p ' , to Uas must have the property that (25.11)
cp,Cp;
is a C" diffeomorphism of cp,(U,,) onto (p,(U,,).
This gives a meaning to the statement that a function f is C" in W:It means that, given any a E A , f 0 Cp: is a C" function in cp,( U,). The dual of C"(W) is the space g'(9Jl)of distributions in 9Jl. Clearly u H u cp, is an isomorphism of C?(cp,(U,)) onto C:(U,); its transpose is an isomorphism of Q'(U,) onto 9'(cpa( U,)) and we may denote by Hfo,(U,) the preimage of Hf,,(cp,( U,)) under this transpose. 0
Dejnition 25.1. We denote by Hs(W) the space of distributions u in W such that, for every a, the restriction of u to U, belongs to HSo,(U,).
That this is a correct definition follows from (25.11) and from Proposition 25.8. Remark 25.4. We have exploited the compactness of W only insofar as we have adopted the notation Hs(W) instead of Hf,,(W) : In the case ,of a noncompact manifold, the latter should stand for the former-everything being otherwise equal. The compactness of 9Jl is now going to be exploited in a more substantial way: It will enable us to equip H"(W) with a Hilbert space structure-which, however, is not canonical (in general). As W is compact, we may select a finite open subcovering U,,, ..., Umr in the covering {U,}. Let then g l , . . . , gr be a C" partition of unity subordinated to the subcovering U,,, . . . , Uar:For each j = 1, . . . ,r, g j E C;( U,,); g1 + * . gr = 1 on 9Jl. We may then define a Hilbert space structure on H"(W) by setting
+
r
(u,u)s
=
1( ( g j u )
j= I
0
qlj, ( g j u )
0
Cp-LtjIs
where, on the right-hand side, ( , ), denotes the Hermitian product in Hs(R"). It follows at once from (25.1 1) and from Proposition 25.7, that if we change our choice of the finite subcovering {U,,} or that of the partition of unity { g j } , we replace the Hilbert space structure of H"(9X) by an equivalent one (i.e., the underlying locally convex topological vector is unchanged).
Sect. 251
FURTHER PROPERTIES OF THE SPACES
Hs
233
Exercises 25.1. We use the concepts and notation of Exercise 17.10, in particular the "semigroup" e V r At, 2 0. For any 0 E [0, I], let H e denote the image of H' under the mapping u ~ A ' - ~ Prove u . that H e is a Hilbert space for the norm IIAeuIIHo. Let uo be an arbitrary element of HI/' and write u(t) = e - t A u o , t 2 0. Prove that we have
(25.12) u ( t ) E L'(0,
+ G O ; H ' ) n Co([O, + o0[;Ho),
u'(t) E
L2(0, + 03 ; H').
Prove that, conversely, given any function u(t) satisfying (25.12) we have u(0) E H'". Let Tbe a continuous linear map H o H o , whose restriction to H' defines a continuous linear map HI --+ H I . By considering v ( t ) = Tu(t), with u(t) satisfying (25.12). prove that T defines a continuous linear map H"' + H I / ' . 25.2. Let denote the space of C" functions in R" whose derivatives of all orders are bounded in the whole of R". Prove that, given any function 4 E Lh9m, the multiplication mapping u H 4 u is continuous from H" to H s for every integer s ( 2 0 or
2n
c, = Jo e-""f(O)
d0
(where the integral is merely a notation for the duality bracket between distributions and C" functions on S'). Prove that the distributionfbelongs to H"(R') if and only if +m
(25.13)
1 ( I + m')SIc,IZ
< +oo
-m
(s is any real number). 25.4. Using the result in Exercise 25.3, prove that Hs(T") (T": the n-
dimensional torus) is the space of distributions u on T" whose Fourier coefficients c, (a E Z") satisfy (25.14) (s is any real number).
234
[Chap. 111
BOUNDARY VALUE PROBLEMS
25.5. Let mZ be a compact C" manifold, and suppose that H0(1132), H'(mZ) are both equipped with a Hilbert space structure. Show that there is a linear isometry of H'(W) onto Ho(mZ), A , which is positive, i.e., such that for some co > 0, (25.15)
(Au, u)o 2 collull~,
vu E H1(mZ).
Prove that we have then necessarily, not just (25.15), but in fact (25.16)
(Au, 4
0
2 2 c1 llull1,2
v u € H1(W),
7
for a suitable c1 > 0. Prove that A - : Ho(mZ)+ H'(W) can be regarded as a compact operator (necessarily self-adjoint) of Ho(9Jl) into itself. 25.6. Use the same concepts and notation as in Exercise 25.5. Let x1 2 x2 2 . - . be the sequence of eigenvalues of A-' : HO(1132)+ Ho(mZ), with repetitions according to multiplicity, and let { c $ ~ }( j = 1, 2, . . .) be a complete orthonormal system in HO(%l)such that, for eachj, 4 j is an eigenfunction of A-' corresponding to the eigenvalue x i . Prove that, whatever the real numbers, Hs(W) is the space of distributions u in mZ, which can be written +m
(25.17)
=
C
Uj4j5
j= I
with coefficients uj satisfying
c x,:2sIuj(2 < +oo.
+m
(25.18)
j= I
1;
I
.
Let '33 be a compact C" manifold, ds2 = j = gijdx' dx' (n = dim mZ) a Riemannian metric on mZ. Show that ds2 defines a Hilbert space structure on every H" (s E R) and a second-order differential operator A on 1132 which, in turn, defines an isometry of H2(mZ) onto Ho(mZ). Prove that A must be elliptic (Definition 19.4). 25.8. Let s2 be an open subset of R".Prove that, for any s > 0, Co(Q) is not contained in Hi,,,(s2). 25.9. Let Sk denote the k-dimensional unit sphere (k = 0, 1 , . . .); regard Skas the "equator" in Ski', and thus, for k S n - 2, as a subset of S"-' and therefore as one of R".Let dqk be the canonical measure on Sk,regarded as a Radon measure on R",precisely the measure 25.7.
C,m(R")3 9 H
J
S"
(9 I Sk> d%
*
What is the lowest upper bound of the real numbers s such that dfsk (viewed as a distribution on R")belongs to HS(R")?
Sect. 251
FWRTHER PROPERTIES OF THE SPACES
H"
235
25.10. Prove that, given any integer k 2 0 or <0, the mapping "restriction to the open subset R of distributions in R", " r n , is a continuous linear map of Hk(R") into Hk(R). Show that this mapping has a dense image if k 5 0. Show that there is a natural injection of H:(R) (m 2 0) onto a closed linear subspace of H"(R"), and that H-"'(R) is canonically isomorphic to a closed linear subspace of H-"(R"). Describe the latter and relate its orthogonal, in H-"(R"), to the restriction map rR above. Prove that, if R is bounded, and if k < m ( k 2 0 or
LEMMA25.3. Let s, s', S" be three real numbers such that s' < s. Let x, be an arbitrary point of R, E any number greater than zero. There is an open neighborhood U c R o f x o and a constant C > 0, depending on s, s', s", E , and xo, such that
[Hint: Assume that for fixed s, s', S", and E as in Lemma 25.3, no neighborhood U or constant C exists. Show that this implies that there is a sequence of functions u, E Cp(R), with diam(supp u,) + O and whose support converges to {x,}, such that
(25.20)
I I ~ Y l l S ,
'1,
whereas
(25.21)
IlP(x, D)u,II,,,
5 I/v,
l l ~ " l l S5 & - I .
By applying Proposition 25.5, derive from this that the u, converge, for the norm in H"', to a non2ero distribution u whose support is reduced to the single point {x,}. Then apply Lemma 1 . 1 , Exercise 1.12.1 Show that the conclusion in Lemma 25.3 is not valid if we remove the hypothesis that the principal symbol of P ( x , D) does not vanish identically at any point of R. 25.12. Let p Edenote the mollifiers used in Proposition 25.2' (pl = p). Let L(5) be an arbitrary linear form on R, . Prove that there is a constant C > 0 such that, for all (, q E R,,
125.22)
I UOP(&5)- L(Vl)P(EdI s c 1 5 - rl I .
236
BOUNDARY VALUE PROBLEMS
Derive from this that, given any function c(x) E C:(R"), defined by
(25.23)
T&u = J
[Chap. 111 the operators
Y N P &* (4- C(P, * 4 )
form (as E varies on 10, 1[) an equicontinuous set of linear mappings H s + H s (s E R arbitrary). Derive from this Friedrich's lemma:
LEMMA25.4. Let P(x, 0)denote a linear partial d$erential operator of order m 2 0, with C" coefficients in an open subset R of R". Let s be an arbitrary real number. Given any distribution u E HZ(R), the differences
(25.24)
P(X, N P , * 4 - P& * P(X, D)u
converge to zero in H;':"+'(R)
as E
-+
+O.
[Hint: In the proof of (25.22) it is enough to consider the case n = 1 and L(() = t. Observe then that
The proof of the equicontinuity of the operators T, is analogous to that of Proposition 25.1 .]
26 Traces in Throughout this section, R will be a bounded open subset of R" whose boundary LX2 = r is a C" hypersurface. For simplicity, we may assume that R is connected and lies entirely "on one side" of r (i.e., R is one of the bounded components of R"\T). But R could as well be a j n i f e union of such connected bounded open sets. The definition of the spaces Hs(!JJl) given at the end of the preceding section applies to the case where Bnl = r. We shall prove the following results: THEOREM 26.1. Let m be any nonnegative integer. The restrictions to R of the functions belonging to Cm(R")form a dense subspace of H m ( 0 ) . i As can be seen, this dense subspace is nothing else but C"(Q), the space of C" functions in R whose derivatives of all order extend to the closure of R as continuous functions. For such functions we may clearly define their values at the boundary r, what is called their trace on r, and which is simply their restriction to r.
THEOREM 26.2. Let rn be any integer strictly greater than zero. The trace on r, defined at first on C"(n), can be extended (in a unique manner) as a continuous linear map of H"(S2) onto H"-'/'(r). In the proofs of both these statements (as in those of many similar ones), the first step consists of a localization and aflattening of the boundary. This is done as follows. We choose a finite covering { V , , . . . , V,} of r by open subsets of R" such that, for each j = I , . . . , r, there is a C" diffeomorphism q j of V j onto the open unit ball of R", B = { x E R"; [ x 1 < 11, such that
(26.1)
the image of V j n R under p j is the open upper half-ball B + = { x E R " ; 1x1 < l , x " > O } .
-f Theorem 26.1 is well known when m = 0. In the remainder of this section, we shall assume that m 2 1.
237
238
BOUNDARY VALUE PROBLEMS
[Chap. 111
This is possible, as one sees by using local coordinates in sufficiently small open subsets of r (forming a covering of r)-and extending these coordinates to open subsets of R" by using the normal direction to r. We leave the details to the student. Let V , be a relatively compact open subset of $2, containing r
K O
=Q\
U (vj n Q),
j= 1
+
and let l o , 11, . . . , [, denote r 1 elements of C:(R") such that C0 + + ... + [, = 1 in a neighborhood of and that, for each j = 0, 1, . , . , r , supp i j c vj. Let then u be an arbitrary element of H"(R). Suppose that, for each j , 0 g j r, we find a sequence of functions u j , ,E Cr( V j ) which converge to u in H"( V j n a). We then claim that the uj, ,converge to Ti u in H"(R): This is simply due to the fact that l j u and the u j , ,vanish identically outside of a compact subset of V j, K j , hence in a neighborhood of
a,
rj
a( V j n Q \ K j
n
r.
From this it would follow at once that the restrictions to R of the functions u, = CS= u j , converge to u in H"(R) and would prove Theorem 26.1. Suppose now that u is the restriction to R of a C" function in R" and let us denote by yu its trace on r, that is, its restriction to r. In order to prove Theorem 26.2, we wish to prove that, for some C > 0 independent of u, l l ~ 11wu vqr) 5 Cllu I I H ~ R ) As seen at once, this is equivalent to proving that, for e a c h j = 1, . . ., r, we have, for a suitable constant C j > 0 independent of u,
(26.2)
(26.3)
l I ~ ~ T j ~ l l l5~C~j 115juIIHm(n,* -~~~(r)
Of course, we are using on Hm-1/2(r)one of the Hilbert structures defined at the end of Sect. 25, for instance the one defined by means of the local charts (Vj n r, 'pi I n r). If we combine (26.2) with Theorem 26.1, we see that y can be extended as a continuous linear map H"(L2) + H"-"'(r).It remains to show that this mapping is onto. Let v be an arbitrary element of Hm-1/2(r).It is clear what we mean by t jv (1 _ I j 5 r ) : It is the product of u with the restriction to r of t i . Suppose then that we have proved the following: There is uj E H"(Vj n a), whose support is contained in a compact subset of V j and which, consequently, defines an element (also denoted by u j ) of H"(R), such that the trace on r of this element of H"(R) is equal to t j u . Then, of course, the trace on r of u1 + * * + u, will be equal to v.
Sect. 261
TRACES IN
H"(n)
239
This concludes the argument and shows that, in order to prove Theorems 26.1 and 26.2, we may reason in the sets V j , V j n GI, V j n r (with all the precautions suggested above). This is what we have called the localization. The next step is theflattening of the boundary. It is done by transferring the whole situation to the unit ball B by means of the C" diffeomorphisms 'pi (1 r ) . We know (Proposition 24.6) that U M u o 'pi defines an isomorphism of Hm(Van 0)onto H"(B+) and, by Proposition 25.8, that
sjs -1
u H u o $ j (where i,9j is the restriction of 'pi to V j n r) defines an isomorphism of HI",;I/'( Vi n r) onto HI",; '/2(C), with C = { X E R " ;( X I < l , x " = O } . At this point we should remember that all the elements of H m ( V j n GI) and Hm-1/2 ( V j n r) used in the earlier localization argument had their supports ,oc contained in compact subsets of V j . The analog will be true for their images under the isomorphisms above : These images will have supports contained in compact subsets of the open unit ball B . In view of this, it will suffice for us to prove the following assertions : (26.4) Let u E H"(B+) have its support contained in a compact subset of B. Then u is the limit, in H"(Bf), of a sequence of functions that are the restrictions to B+ of elements of C:(B). (26.5) Let u E CF(B),yu its trace on C. There is a constant C > 0, independent ofu, such that 1/Y~I1".'-1/2(Rn-I)
5 CIIUIIHm(B+).
We have identified R"-' with the hyperplane X" = 0 in R". Assertions (26.4) and (26.5) enable us to extend the trace y on C to all elements u E H"(B+) whose support is contained in a compact subset of B. The third assertion will then be (26.6) To every u E H"- '/'(R"- ') with compact support contained in C there is u E H"(B+) with support contained in a compact subset of B such that yu = u. As a matter of fact, because of the condition that the supports of u and u be contained in compact subsets of B and C , respectively, B becomes somewhat irrelevant and can be replaced everywhere by R" itself (this is equivalent to extending u and u by zero outside their supports). This leads us to replace C by R"-' and B + by R; = {x E R";X" > O } . We shall then prove the following results:
THEOREM 26.3. The restrictions to R; of thefunctions belonging to CF(R") are dense in H"(R1).
240
BOUNDARY VALUE PROBLEMS
[Chap. 111
In the statement below, we identify R"-' with the hyperplane x" = 0. We may define the trace map y on the restrictions to R: of the functions f E Cr(R") : y(f) is the restriction off to that hyperplane.
THEOREM 26.4. The trace y can be extended as a continuous linear map of H"(R;) onto Hm-1/2(Rn-1). These two statements easily imply our three assertions above, (26.4), (26.5), and (26.6). It suffices to use cutoff functions (' E C:(B) equal to one in neighborhoods (in B ) of the supports of u and u. Observe that we have the multiplicative property : (26.7)
Y ( V 4 = Y(cp)Y(u),
v E C,m(R"),
u EH"(W.
Indeed, (26.7) is true when u itself is the restriction of an element of CF(R"), hence for all u E H"(R;) by virtue of Theorems 26.3 and 26.4. Theorems 26.3 and 26.4 represent the last stage in the transformation of our "trace" problem. They are not difficult to prove if we are willing to make the situation a little bit more abstract. The main point will be to interpret H"(R") [resp. H"(R;)] as a space of functions on the real line R' (resp., on the positive half-line R:), where the variable is x", functions which take their values in suitable spaces of functions of the first n - 1 variables, x' = (XI,. . . , x"-'). More precisely, we shall regard u E H"(R") as an L2 function on R' with values in H"(R"-'), with the following additional properties : (26.8)
for each j = 0, 1, .. . , m,
0:. E LZ(R'; H"-'(R"-')).
-Jz
We have set, as usual, D: = d/ax". We use the notation L 2 ( X ;E ) to mean the space of L2functions in the measure space (X,dx) with values in the Hilbert space E . That (26.8) constitutes an equivalent definition of H"(R") is obvious. Similarly, u E H"(R;) can be regarded as an L2 function on R \ such that (26.9)
for each j
= 0,
1,
. . . , m,
Diu
E
L2(R: ; H"-j(R"-')).
Also the norm on H"(R")-any one of the norms on this space we have considered so far-is equivalent to the following one:
We have an analogous statement with R; substituted for R" and R: for R'.
Sect. 261
TRACES IN
ff"(Q)
24 1
Proof of Theorem 26.3 Let U E Hm(R;). We shall make use of the characterization (26.8). By cutting and regularizing u with respect to the "transversal variables " x' = ( X I , . . . , x " - ' ) we see that u is the limit in Hm(R;) of a sequence of functions v having the following property: (26.10) There is a compact subset K' of R"-' (depending on u) such that, for every s real and every j = 0, 1, . . . , m,
Di u E Lz(R: ;Hs((K')). We have stated (26.10) in order to avoid using the spaces L2(R:; CF(K')) which might have put off the student; (26.10) is interesting, from our viewpoint. when applied for s large, s + 03. It has the advantage of involving only one Hilbert space of values, E = H'(K) (at least, for any given choice of s). Let us denote by t the variable in R' orin R: . We know that v(j)E L2(R: ;E ) for every j = 0, 1, . . . , m. Then d j ) must be absolutely continuous in R: provided that j < m ; in fact, for j < m, v ( j ) can be extended as a continuous function to the closure of R: in R', and we may talk about the elements v ( j ) ( + O ) of E ( j = 0, . . . , m - I). Let [ ( t ) E CP(R1) (say with real values), equal to one in a neighborhood of the origin. and let us define a new function V(t) on the whole real line R', as follows:
-
The (distribution) derivatives of V ( t ) of order <m are continuous functions; V c m )belongs t o L2(R1; E ) and is equal to utm) for t > 0. Let then p E CP(R1), [ p dt = + I , and set p,(t) = ~ - ' p ( t / ~ E) ,> 0. When E -, +0, whatever j = 0, ..., m, ($)j(p,
* V)
tends to
V(') in LZ(R'; E ) ;
consequently,
21
j=l
*
-d
q ; dt -,0 .
t>O
We know that the pE * V are C functions valued in E = Hs((K). But s is arbitrarily large and our definition of V is independent of s, hence the p E * Y are C" functions of t valucd in C P ( K ) . By multiplying them with cutoff
242
BOUNDARY VALUE PROBLEMS
[Chap. 111
functions of the variable t = x" (C" functions with compact support in R', equal to one on ever larger intervals), we obtain a sequence of functions belonging to C:(R") whose restrictions to R: converge to u in H"(R:). Q.E.D.
Proof of Theorem 26.4 We shall begin by proving the following: (26.11) There is a constant C > 0 such that, i f u is the restriction to R ', of any function belonging to C:(R"), l l Y ~ I I H m - ' / ~ ( a ~5 - ' )w 4 l H m ( l l + n )
(26.12) Let (xl,
ti(t', x") denote
the Fourier transform of u ( x ) with respect to x' =
.. . , Y-'). We have
(26.13)
lii(t', 0)" = -2 Re
+m
ati
-(<', t)d(<', t ) dt.
ax"
+
We multiply both sides of (26.13) by (1 l<'\2)'"-' 2 and integrate with respect to 5' over R,- By the Cauchy-Schwarz inequality we have
'.
+
x (/o+m(l 1<'l2)"Ia(t',
~~)"d~'dx'']''~.
If we use the remark at the bottom of page 240 on the norms on H"(R;), we see at once that the combination of (26.13) and (26.14) implies (26.12). We are now going to prove that y maps Hm(R:) onto Hm-1/2(Rn-1) by constructing a right inverse K to y, in fact a continuous linear map K of H"-'"(R"-l) into H"(R;) such that y(Ku) = v for all u E H m - l I 2 . I t suffices to construct Ku for u E Y(R"-'). Let 8(t') denote the Fourier transform of 0. Then Kv(x) will be the inverse Fourier transform with respect to c' of the function (26.15)
O(<') exp(-(l
+ I 5 I 2)1122)f
In other words, (26.16) Ku(x) = (27r)-"+' lexp[i(x', 5') - (1
+ It'12)''2x"]O(t')
d<'.
Sect. 261
TRACES IN
H"(0)
243
It is clear that, for x" 2 0, the function (26.15) belongs to Y(R,- l). Furthermore, if j < m,
2
= f(lUJIHrn-(,+l)/z(R"-*)
5 f 1 J u I I 2w - 1 / 2 ( R n - 1 ) ,
which shows that K : H"-'/'(R"obvious in view of (26.16).
')
-+
H"(R?+)is continuous. That y(Ku) = u is Q.E.D.
Remark 26.1. From (26.16) we derive that
hence Kv is a solution of the following Dirichlet problem in R: (1 - A)Kv = O ,
(26.17)
y(Kv) = 0.
(26.18)
We may now define on H'"(R:) all the traces of order j < m. Indeed, into H"-j(R?+) and the trace y maps the latter onto Hm- j - 11.2(R"-'). We set (a/dx")j maps H"(R;)
(26.19)
yj(u)
= )J((a/axn)ju).
Thus y j is a continuous linear map Hm(R:) -+ Hm-j-l/*(Rn- 1). Let us now go back to our bounded open set R with smooth boundary r. If we keep in mind what the localization and boundary flattening meant, we see that the partial differentiation in the direction normal to r, a/&, must now replace ajdx". Of course, the vector field a/& may not be well defined far away from the boundary, but when we consider the traces on r, what happens far away from the boundary is irrelevant. It therefore makes sense to set (26.20)
yj(u) = y((a/av)ju),
u E H"(R),
O 5 j < m.
THEOREM 26.5. Let R be a bounded open subset of R" with a smooth boundary r, lying on one side of r. I f 0 5 j < m, the jth trace y j is a continuous linear map of Hm(R)onto Hm-j-'12 (r-1. It is, of course, the "onto" part of this statement that we must prove. By localization and flattening of the boundary, we are easily reduced to prove the analogous statement in R; :
244
[Chap. 111
BOUNDARY VALUE PROBLEMS
THEOREM 26.6. If 0 S j < m , y j maps H"(RY) onto H m - i - " 2 (R"- 1). We have once more identified R"-' with the hyperplane Proof. Let u
E
X" = 0.
Hm-j-ll2(R"-'). Set
(26.21)
Clearly, (d/dx")iKju = u when x" = 0. That K j u E H"(R'!+)can be seen by the same method used in the case j = 0 on page 243. Q.E.D. We conclude this section by proving a few illuminating properties of H"(R) and &(Q). Without further mention, we assume that Q is bounded, has a smooth boundary, and lies on one side of it. THEOREM 26.7. The restriction to R maps H"(R") onto H"(Q). Theorem 26.7 is of course true also when m = 0, although in the proof we shall assume m > 0. It states that every f E H"(R) can be extended to R" as an element of H"(R"). By localization and flattening of the boundary, the proof reduces to that of the analogous statement in R; : THEOREM 26.8. The restriction to R; maps Hm(R")onto Hm(R;). Proof. We shall content ourselves with sketching it. Let u E H"(R:), and yj(u) be its traces of order j = 0, . . . , m - 1. Let u be any element of Hm(RY) such that yj(u) = yj(v) for every j = 0, . . . , m - 1 (of course, R 1 = ( x E R"; x" < 0); here trace means trace on the hyperplane X" = 0). Let u" denote the function equal to u for X" > 0 and to u for X" < 0. If we view u" as a function of xnE R' valued in the space of distributions with respect to x' E R"-', we can easily check that it has ( m - 1) continuous derivatives and that, for any k , 0 5 k 5 m, (d/dxn)ku"is the function equal to (d/axn)kufor x" > 0 and to (d/dx")kufor x" < 0. We conclude from this that (d/dxn)ku"is an Lz function of x" with values in Hm-k(R:; '). Q.E.D. Remark 26.2. If we know that y j ( u ) = 0 for every j < m, in the preceding proof, we may choose u = 0. Then u" is the extension of u by zero in R1.
Let
7 be the mapping U H ( ~ ' ( U ) , . . . , y"-'(u))
of H"(Q) into
THEOREM 26.9. The kernel of 7 in Hm(R)is exactly equal to HZ(R).
Sect. 261
TRACES IN
H"(R)
245
Since Cp(R) c Ker y", we certainly have H;(R) c Ker f. We must prove the converse inclusion. By localization and flattening of the boundary, we are reduced to proving the analogous statement for R; :
THEOREM 26.10. The kernel o f f in H"(R:) i s exactly H!(R;). Proof. Let u E Ker By virtue of Remark 26.2, the extension ti of u to R" by setting ii = 0 in the complement of R; belongs to H"(R"). On the other hand, in H"(R:), u is the limit, as E 4 +0, of the functions u, defined by u,(x) = ii(x'. xn - E ) . Since the supports of the latter stay at a distance 2 E from the boundary of R; , they belong to Hr(R;) and therefore so does u.
v.
Q.E.D.
Remark 26.3. Let u E H"(R) and let ii denote the extension of u to R" by zero in the complement of R. Naturally ii E Lz(R").But, in general, ii does not belong to H"(R"). We have seen that it does when u E Ht(R). Actually, this is the only case where it does (Exercise 26.1).
Appendix Extension to R of Elements of H".P(Q) Although the trace mapping theorems of this section (Theorems 26.2, 26.4 to 26.6, 26.9, and 26.10) have no simple analogs for spaces H m 3 "with p # 2, some of the other results, such as Theorem 26.1 for instance, remain valid for p # 2. We shall briefly indicate how to prove them. Throughout this Appendix, p will be any positive number, 1 5 p < + 00, m any integer, m 2 0. It is convenient to state the results in a slightly more general form than those proved in this section. More precisely, we shall not assume that r = dR is a C" hypersurface, but only that it is C". This could also have been done in this section, as inspection of the proofs will easily convince the reader (we have not done it, for the sake of simplicity and also in order to have statements that would be valid for any m).
THEOREM 26.A.1. Let m be a nonnegative integer, p a number such that 1 5 p < + co,R a bounded open subset of R" whose boundary dR is a C" hypersurface. Then the restrictions to R of the functions belonging to Cr(R") are dense in H". "(0). Proof: By localization and flattening of the boundary, we are reduced to proving a statement of the kind (26.4). It must be kept in mind that, in order to flatten the boundary, we can use diffeomorphisms which are merely C" and not, in general, C". The analog of (26.4) reads: (26.A.1) Let u E H " , P ( B + ) have its support contained in a compact subset of B. Then u is the limit, in HmXP(B+), of a sequence of functions which are the restrictions to B + of elements of C:(B).
246
[Chap. 111
BOUNDARY VALUE PROBLEMS
Of course, u may be viewed as an element of Hm*P(R:) whose support is contained in a compact subset of B. By duplicating the proof of Theorem 26.3 we can prove the following statement, which obviously implies (26.A.1): THEOREM 26.A.2. Let m be any nonnegative integer, p any number such that 1 g p < + m . The restrictions to R; of functions belonging to Cp(R") are dense in HmSP(R:). The proof of Theorem 26.A.2 is identical to that of Theorem 26.3, except that we must replace L2 everywhere by Lp. In particular, we must replace the space Hs((K') by HS*"(K')= (w E H"p(R"-l); supp w c K'}. Now s is an arbitrarily large integer; H". P(K') is a separable Banach space. Otherwise nothing is to be changed in the argument. The restrictions to R of functions belonging to C;(R") clearly belong to Cm(a),the space of C" functions in R whose derivatives of order z m can be extended as continuous functions to the closure of R. As a matter of fact, the restriction mapping to R, r,, maps Cr(R") onto C"(Q)-provided that the hypotheses of Theorem 26.A.1 are satisfied. Furthermore, it is not too difficult, then, to construct a right inverse E, : Cm@)-+ C;(R") of r,, which is continuous when the first space is equipped with the norm of HmTP(R) and the second one, with that of H".P(R"). Let cp be an arbitrary element of Cm(D). After localizing and flattening the boundary, we may assume that R = B + and that the support of cp is contained in a compact subset of B. We set, for x" < 0, m
@(x)=
1 Ljcp(x', - p j x " ) ,
p j > 0, j = 0, . . . , m,
j=O
and choose the real numbers Aj, p j in such a way that each trace of order k, 0 5 k 5 m, of (7, on the hyperplane x" = 0 is equal to the trace of the same order of cp. This will be true if we require m
(26.A.2)
1 &p(i =(-I)],
.j = 0 , . . . , m.
k=O
The advantage over the extension by means of the Taylor expansion of cp is that we may now estimate the norm of (7, in H m q P ( R Yin) terms of the norm of cp in H". p(R",. Indeed, for some constant C depending only on n, p, and on the choice of the iljand p j , we have Il$lIHmsL'(R")
=< C I I V I I H m . P ( R + n ) .
We shall then set Q, = cp for X" > 0, Q, = (7, for x" < 0. We know then that the support of Q, is a compact subset of B, that Q, E C:(B), and that ll@'ll#fm.P(R")
5 C11911Hmd(R+")
'
Sect. 261
TRACES IN
H"(n)
247
We may now revert to the initially given open set R. After some patching up, we obtain the following:
THEOREM 26.A.3. Let m , p , R be a s in Theorem 26.A.I. Let R, be any open subset of R" containing the closure of R. There is a linear map En : C"(W) -+
c;(R,)
such that rnE, = I, the identity mapping of Cm(Q),which is continuous when C"(Q) and CF(R,) carry the norms of H", "(0) and H", "(no), respectiuely.
From Theorem 26.A.1 it follows that C"@) is dense in H"*P(R).Consequently,
COROLLARY 26.A. 1 . The mapping E, in Theorem 26.A.3 can be extended as a continuous linear map H"',"(a) + Hg ' "(a,). The map r n , the restriction to R, maps H!pP(R,) onto H"7P(R). Remark 26.A.I. Inspection of the manner in which theextension mapping is constructed shows that it may be chosen independently o f p , 1 s p < + co, and so as to be continuous from Hk."(S2)into Hk,,"(RO)for every k = 0, 1,
..., m. Exercises 26.1. Let R be a bounded open subset of R" with C" boundary r; we assume that R lies on one side of I-. Let m be an arbitrary integer at least zero. Prove that H;(R) is the image, under the restriction mapping r, (restriction to R of distributions in R"),of the subspace of Hm(R")consisting of the functions in this space whose support is contained in 0. 26.2. Let R be a bounded subset of R",open and star-shaped about the origin (this means that if x, is any point in 0, the closed straight line segment joining 0 to xo is contained in R). Prove that the restrictions to R of C" functions in R" are dense in H"(R). 26.3. Let R = { ( x , y ) E R2; x 2 + y 2 l}, Ro = R\{O}. Prove that the mapping restriction to R, maps H"(R) onto a closed linear subspace of H"(R0). 26.4. Use the same notation as in Exercise 26.3. Equip H1(Ro) with the inner product D(u, u) = jn(grad u ) * (grad i7) d x (this defines a Hilbert space structure equivalent to the standard one). Show that if u E H'(R,) is orthogonal [for D(u, u)] to all the restrictions of elements of H'(R), we must have
-=
u = Q( D)(log r )
+ h,
248
BOUNDARY VALUE PROBLEMS
[Chap. I11
where h is harmonic in the unit disk R, and Q ( D ) is a differential operator with constant coefficients in R2. Conclude from this that u must be identically zero, and that the restriction to R, maps H'(R), and therefore also H'(R"), onto H1(R,). 26.5. Use the same notation as in Exercise 26.4. Prove that the natural injection of C:(Ro) into C,"(R) can be extended as an isometry of H;(R,) onto H@). Derive from this that there are C" functions, belonging to HA@,), which are identically one in a neighborhood of the origin. 26.6. Let E be a complex Hilbert space. Show that there is a bounded linear operator E : H1(R: ; E) + H'(R'; E) such that the restriction to R: of any EU, u E H'(R: ; E), is equal to u (apply the method in the proof of Theorem 26.A.3). Let then 51' be an open subset of R"-'. Apply the preceding result to obtain that there is a continuous linear extension mapping of H'(R' x R:) into H'(R' x R') (i.e., a continuous right inverse of the restriction mapping). Let (R,)" denote the product of n copies of the open positive half-line. Prove that there is a continuous extension mapping of H'((R+)") into H'(R"). Let R be a convex polygon, i.e., the interior of the convex hull of a finite set of points. Prove that there is a continuous extension mapping H'(R) + H'(R"). 26.7. Let R be a bounded open subset of R" such that the mapping " restriction to R of functions in R"," r , , regarded as a continuous linear map H"(R") + H"(O), has a continuous right inverse (see Theorem 26.A.3). Prove then that, whatever the integer k < m (k could be negative), the natural injection ofH"(R) into Hk(G)is compact (this is one of the standard versions of the classical Rellich's lemma; another version is Proposition 25.5).
27 Back to the Dirichlet Problem. Regularity up to the Boundary Let R be a bounded open subset of R" whose boundary is a C" hypersurface r, and which lies on one side of r. With the insight gained in Sect. 26 we may now give a neater formulation of the weak form of the (inhomogeneous) Dirichlet problem : (27.1) Given any f
E
H-'(Q) and any g E H"'(I-)$nd u E H'(R) such that:
(27.2) (27.3) We recall that y is the trace of u on I-. By virtue of Theorem 26.2 we know that there is v E H'(R) such that y(v) = u. Hence, setting w = u - v , we transform our problem into that of solving [in H'(R)] (27.4)
(A - A)w
=F
in R,
y(w) = 0,
where F = I - Iv + Au. By Theorem 26.9 we know that y(w) = 0 is equivalent to w E Hh(s1). If we apply Theorem 23.1, we see that, for real nonnegative I , (27.4) has a unique solution, as does problem (27.2)-(27.3). We may state
THEOREM 27.1. Let s1 be a bounded open subset of R",with boundary a C" hypersurface r, lying on one side of I-. Let I be any nonnegative number. The mapping (27.5)
u I-+ (Au - AM,~ ( u ) )
is an isomorphism of H'(R) onto H-l(s1) x We are now going to study the following question. Suppose that we strengthen the regularity requirements on the data A g in problem (27.1); what can we say about the regularity of the solution u? As a partial answer to this question, we shall prove the following result: 249
250
BOUNDARY VALUE PROBLEMS
[Chap. I11
THEOREM 27.2. Under the same hypotheses as in Theorem 27. I , let m be any integer strictly greater than zero. The mapping (27.5) is an isomorphism of H"(R) onto H"-'(sZ) x H'"-'12 (J3. Proof. All we have to do is to prove that iff E H"-'(Q), g E H"-'/'(r), then the solution u of (27.2)-(27.3), which we know to belong to H'(R), does in fact belong to H"(R). It suffices to prove this when g = 0 as we can always replace u by u - u with u E H"(R) such that y(u) = g (by Theorem 26.2). In order to prove our contention we shall use localization and flattening of the boundary. I n fact, we shall use the same open covering of a, Vo, V,, . . . , V,, and the same partition of unity, Co, C1, . . ., C,, introduced at the beginning of Sect. 26. Note that, for e a c h j = 0, 1, , .. r ,
.
(27.6),
(1 - A)(Cju) = A in R,
where (27.7)
fj = ij f + 2(grad ij, grad u ) + (ACj)u.
Suppose that we have proved that u E Hk(Q), 1 S k < m. T h e n 4 will belong to Hk-'(R). We are going to show that this implies u E Hk+'(Q).Since our hypothesis that u E Hk(Q)is verified when k = 1, our assertion will follow by induction on k. The case j = 0 is easily settled by the following lemma :
LEMMA27.1. Let u E Ht(R")(k 2 0) such that u E Hk+'(Rn).
(A - A)u E Hk-'(R").Then
Proof. We have (1 - A)u = (1 - A)u + (1 - A)u E Hk-l(R") and 1 - A induces an isometry of HS+'(R")onto HS-'(Rn)(whatever the real numbers).
From now on we focus our attention on the cases j > 0. This means that V j intersects the boundary hypersurface r. Note that we can choose each V j as small as we wish (this might of course force us to increase the number r of elements in our covering { V j } and also to enlarge the "central" element Vo-but these facts will not affect the argument below). Thus we shall study the equatiou (27.6), for] > 0. As a matter of fact, we shall drop any subscriptj. We shall write u instead of i j u , also Vinstead of V j ,f instead of&, and so on. We have then (27.8)
(A -. A)u =f in
I/ n R,
with u E HA( V A Q). We must keep in mind, moreover, that now (27.9) the closures of sppp u, supp f in V are compact subsets of V.
Sect. 271
BACK TO THE DIRICHLET PROBLEM
25 1
The statement which we must prove, and which will imply what we want, is the following one : (27.10) i f 1 5 k < m and i f u E Hk(V n a), J E Hk-'(Q n V ) , then u E Hk+'( V n Q). We note, once more, that (1 - A)u =f+ (1 - A)u E Hk-'(V n Q) if the hypotheses in (27.10) are satisfied. This means that we may assume I to be equal to one. We shall begin by flattening the boundary. Such an operation can be performed in any way one wishes, but we shall perform it by taking advantage of the particular properties of the Laplace operator (in particular, of its rotation invariance). This will simplify our exposition, but is by no means essential. The same reasoning applies to all second-order elliptic operators with C" coefficients, whose principal part is a Hermitian quadratic form. First of all, we may assume that the point xo is the origin in R".Second, possibly after performing a rotation in R", we may assume that the normal to r at xo is the x" coordinate axis. In fact, we may then assume that V n r is . . . , x"-'). We defined by an equation x" = cp(x') where, as usual, x' = (XI, then set y' = x', y" = x" - cp(x'). As we may eventually want to shrink V (at most, finitely many times!), we may point out that cp is defined and C" in a fixed open neighborhood W ' of the x' projection of i? The change of variables above transforms our operator 1 - A, into
It should be noted that (27.1 1)
grad cp(0)
= 0.
Let then U , Fdenote the expressions of u, f in the new coordinates y . We have (27.12)
QU
=F
in V n R,
and clearly U E HA( V n R). As a matter of fact, both supp U and supp F are contained in compact subsets of V , hence U can be regarded as an element of HA(R:) while F can be regarded as an element of Hk-'(R:). We must show that if U E Hk(R;), then, in fact, U E Hk+'(R;). We shall write Q=(l -A)+R, with
We shall make Q and R act on functions with support contained in compact subsets of V . Thus the fact that R is not defined outside of V will not matter.
252
[Chap. I11
BOUNDARY VALUE PROBLEMS
Nevertheless, in order to avoid undue complications in the exposition, we shall extend cp as a CF function in the whole space R" (in any fashion we wish) and thus R will be defined everywhere. Let us indicate how we intend to prove our assertion (27.10). We know (Theorem 23.1) that 1 - A is an isomorphism of HA(R;) onto H-'(R?). We shall give an explicit description of its inverse, which we call Go. This expression will enable us to prove the following statements : (27.13) giuen any k = 0, 1, . . . , Go induces a bounded linear operator Hk-'(R:) + Hk+'(R'+); (27.14) given any k = 1, 2, - .., and any E > 0, there is 6 > 0 such that, if the diameter of V is less than 6, then, f o r every w E Hk-'(R'!+) whose support is contained in V , llRGO
WllHk-l(R+")
5 EllWIIHk-'(R+").
These properties of Go will imply what we want. Indeed, set
u1= G,(Z + RG,) - 'I? We know that F E Hk-'(R:) and that supp F c V. I n (27.14) we choose < 1 and accordingly take the diameter of Y small (
E
QU,
= (1
- A)G,(Z
+ RG,)-'F + RG,(Z + RG,)-'F
= F.
But the uniqueness of the solution to (27.12) [which follows from the uniqueness of the solution to (27.2)-(27.3)] demands that U = U,.
Construction of the Operator Go Given any v E H-'(R;)
we must have Gou E HA(R:)
and
(I - A ) G O U = U .
(27.15)
It is natural to perform a Fourier transformation with respect to y'
= (y',
. .. ,
y"-'). Let B denote the corresponding Fourier transform of u. For simplicity
let us write p
= (1
+ (q'(2)1'2,
q' E R,-,,
Equation (27.15) can be rewritten as (27.16)
($-
P)(+
+ P ) G = - 0.
Sect. 271
253
BACK TO THE DIRlCHLET PROBLEM
We are reduced to solving an ordinary differential equation for y" > 0, depending on the parameter q'. We have two basic requirements on the solution: First it must vanish when y" = 0; this corresponds to the fact that Gou must belong to HA(R"). Second, it must be tempered with respect to q', A
which is necessary if Go u is to be the Fourier transform with respect to y' of a As we shall see, it is possible to comply with both demands. distribution in W+. It is convenient to invert separately the two factors
keeping in mind what we have just said. Note that it suffices to define Go on a dense subset of H-'(R?). Thus we may assume that u is C" and vanishes for y" large. In this case we may set
-j
+ W
E - ~ ( q 'y, ) =
exp[p(y" - t ) ] ~ ( q 't,) dt.
Y"
It is clear that L- E - v = u and that E - is tempered with respect to q'. We may also set
and then L+ E + D = 3; E + D is tempered with respect to q'. Finally we set A
G ~ =u-E+ E- 0
(27.17)
Of course, (27.1 8)
Go u(y) = ( 2 ~ )-It'
s
A
ei(Y'."')G 0 uh'> Y">dv'.
Proof that Go Maps H
-
(W+) into H
+
@+)
Let us call b+ u the inverse Fourier transform with respect to q' of E , 8 ; similarly, call € - u that of E - 0 . It suffices to prove that €+ and b- map Hk-'(R'!+) into Hk(R'!+).We shall give the proof for b+ only; the treatment of 8- is analogous. Note that + W
E + ~ ( q 'y") , =
J0
exp[-p(y" - t ) l ~ ( y " t ) ~ ( q 'r,) dt
254
BOUNDARY VALUE PROBLEMS
[Chap. 111
where H ( s ) stands for the Heaviside function. From the standard Holder's inequalities for convolution we obtain
If we multiply both extreme sides by pZkand integrate them with respect to q' over R,,-l, we obtain
Note, incidentally, that k need not be an integer-it could be any real number. Next we exploit the fact that (d/dy")E+ G = pE+ 8
+ 8,
or, equivalently, with obvious notation, (d/dy")L?+ u = (1
(27.20)
-
u
+ U.
Suppose that we know that u E Hk-'(R;). This means that given any j = 0, .. . , k - 1 [we may assume k 2 1 here, as we know already that Go maps H-'(R;) onto Hh(R'+)], (d/dy")ju is an L2 function of y" > 0 valued in Hk-1-1 (R"-').Suppose then that we have already proved that (d/dy")'(&+ u) E L2(R; ;Hk-j(R"-l)).
(27.21) This implies
(1 - A')"'(d/dy")j(L?+
v ) E L2(R; ; Hk-j-'(R"-')).
If we apply (dldy")' to both members in (27.20), we obtain that (d/dy")j"(&, v ) EL'(R; ; Hk-("')(R"-')).
Since (27.21) has been proved for j = 0, our induction argument works and proves it for a l l j = 0, 1, ... ,k. But then it means that 8, u belongs to Hk(R7) (cf. p. 240).
Proof of (27.14) In what follows w will always denote an arbitrary element of Hk-'(R!,.) with support in V . Let V , be an open neighborhood of Vand 5 E CF(Vl) be equal to one in a neighborhood of i? We have
(I - AM1
- 5)Go w ) = 2(grad L grad(G, w)> + ( A W 0 w.
Sect. 271
255
BACK TO THE DIRICHLET PROBLEM
The right-hand side, which we denote by h, belongs to Hk(R:). On the other hand, we know that Gow belongs t o Hh(RT) and therefore the same is true of (1 - <)Gow . By the uniqueness part of Theorem 23.1 we conclude that (1 - i)G0 w = Go h E Hkf2(R"+).Moreover there is a constant C > 0, depending on but not on w , such that
c
(27.22)
11(1
- {)GO W I I H * + 2 ( R + n ) 5 c I I w I I H k -
1(R+").
Note that
RG, w
= R[(Go W ]
and that R : Hk''(R?) -Hk-'(R;). following:
+ R[(l - i)Go w] Suppose that we have proved the
(27.23) to every E~ > 0 there is q > 0 such that qdiam Vl < I], then, f o r every u E H"+'(R;) with support in Vl, llRUIlH*-'(R+n)
If we apply this with v IIRGO W I / H * - 1 ( g + n ) 5
= [Go u,
5 E1~~U~/H*fl(R+n)'
we obtain
+ cll/(l
&lllcGO WIIHk+'(R+n)
-
WIIffk+l(R+n)
5 E i I I G o w l l ~ * + I c ~ + +n )( C I +El)II(1 - ~ ) G O W I I H ~ + * ( R + ~ ) 5 E ~ C ~ \ I W \'(R+") I H * -+ C ~ I I W I I H * - ~ ( R + ~ ) by virtue of (27.22) applied with k - 1 substituted for k t . It is important to note that whereas C, depends on ( and hence on q, C , is simply the norm of the bounded linear operator Go : Hk-l(R:) + Hk+'(R;). We shall eventually prove the following statement: diam V < 6, then, f o r
(27.24) to every E~ > 0 there is 6 > 0 such that, every w E Hk-'(R;)with support in V , I\wl\ff*- 2 ( R + n )
11
&2
IIHk- ' ( I t + " )
.
Let now E > 0 be arbitrary. Choose I] in (27.23) corresponding to E , = c/2CO. From now on we keep i unchanged. This enables us to find the constant C , (which depends on q ) . Select then 6 in (27.24) corresponding to E, = &/2C2.We see that we have thus proved (27.14). All that is left for us to do is to prove the two statements (27.23) and (27.24). This is done by standard Sobolev space techniques. We recall that
t We recall that, in the present argument, k is an integer strictly greater than zero.
256
BOUNDARY VALUE PROBLEMS
[Chap. 111
k 2 1 : This is going to be important. We begin by proving (27.23). For this we go back to the expression of R on page 25 1. We derive from it:
+ 11 lgrad VIZullHk+l(R+") + CIIuIIHk(R+")* We are going to avail ourselves of (27.1 1). We shall prove the following:
LEMMA27.2. Let I be any nonnegative integer. Let x E CF(R") such that ~ ( 0=) 0. Given any E > 0 there is S > 0 such that, ifu E H'(R;) has its support contained in the open ball {x E R"; 1x1 < S}, (27.25)
~ ~ X ' ~ ~ H z ( R + " )EllUIIHI(R+") *
Proof. There is a constant C' > 0 such that, for all v E H'(R",,
We select 6 > 0 such that the maximum of I ~ ( xI )for I x I 5 S does not exceed 42C'. Then Lemma 27.2 will follow from LEMMA 27.3. Given any E > 0 there is 6 > 0 such that, if v E H'(R;) has its support in {x E R"; I x I < S}, (27.26)
11 11 H I - '(R+") 5 Ell 11 H I ( R +")'
Proof: Suppose first that 1 = 0. Then CF(R;) is dense in both H'(R;) = Lz(R;) and H-'(R;) (the latter by Corollary 23.1). We may therefore take u E CF(R;). But then the assertion is a particular case of Proposition 25.6. Suppose now that I is greater than zero. We write, for 1' = 1 or I' = I - 1,
But if we denote by E any one of the spaces HS(Rn-l),
by the Cauchy-Schwarz inequality. If we integrate both extreme sides with respect to x" from 0 to 6, we obtain
Sect. 271
BACK TO THE DIRICHLET PROBLEM
257
Substituting (a/ax")jv for u in (27.27) and making the appropriate choice for E yields at once what we wanted. End of the Proof of Theorem 27.2. Lemma 27.3 not only enables us to complete the proof of Lemma 27.2 but also implies (27.24). Furthermore, if we return to the estimate of I(RvIIHkwhich precedes the statement of Lemma 27.2, we see, by applying Lemma 27.2 to the first two terms on the righthand side and Lemma 27.3 to the third one, that we have proved the assertion (27.23). Thus the proof of Theorem 27.2 is complete. COROLLARY 27.1. We make the same hypotheses as in Theorems 27.1 and 27.2. I f f E Cw(Q and g E Cm(r), the solution u to (27.2)-(27.3) belongs to CcO(K).
This simply follows from the fact that . .
m=O
Exercises
27.1. Let R = {(x, y ) E R2; x 2 + y 2 < l}. Use Fourier series expansion for functions defined (almost everywhere) in R,
+" (27.29)
u(r, 0) =
C
c,(r)eime.
m=-w
State and prove necessary and sufficient conditions on the Fourier coefficients cm(r)in order that u(r, 0) E Rk(R)( k a positive integer). Using the characterization of H"(T')(s E R) in Exercise 25.4, and the found necessary and sufficient conditions. prove the following assertions: given any k >= 1 and any u E Hk(R), if 0 s j < k , the trace of (a/dr)ju on aR belongs to H k - j - ' I 2 ( 3 0 ) ; (2) given any k 2 1 and any element g of Hk-'"(R), there is a unique harmonic function in R, belonging to Hk(R), of which g is the trace on aR. (1)
27.2. Let R be a bounded open subset of R", having a C" boundary an and lying on one side of it. Let m be an integer, m 1 1, and denote byN"(Q) the space of harmonic functions in R which belong to H"(0). Prove that H"(R) is the topological direct sum ofMm(R)and Nm(R)n HA(R). 27.3. Let 51 be the unit disk x 2 + y 2 < 1 in R2. Give an example of a function u E H,@) n H2(R) whose extension ii to R2, obtained by setting u" = 0 in R2\R, is such that Aii $ L2(R2).
258
BOUNDARY VALUE PROBLEMS
[Chap. 111
27.4. Let R c R" be open, bounded, have a smooth boundary, and lie on one side of it. Show that given any set of functions gj E Hz(k-j)-3/2(Q) ( j = 0,. . . , k - l), f E H-yQ),
there is a unique function u E H2k-1(R)such that (27.30) (27.31)
Aku =f i n n ,
y(Aiu)=gj,
j=O
,..., k -
1,
where y is the trace on 82. Give a reasonable condition on the polynomial P ( X ) in one variable X , with real coefficients, in order that the same conclusion be valid after replacing Eq. (27.30) by (27.32)
P(A)u =f i n n .
28 A Weak Maximum Principle In this section we take the first step in the transition from the L2 theory of the weak, or generalized, Dirichlet problem, to the socalled classical theory. In the latter, the regularity assumptions on the data, the open set Q, its boundary r = dQ, the right-hand sidef, and the boundary value g are relaxed or, more accurately, are different from what they were in the L2 theory. For instance, we must be able to handle the case where the open set R is a square in the plane, or a cube in R3.Another feature will be that the regularity of the boundary value g will not be describable by means of the space H1'2(r). For instance, g could merely be a continuous function. Similarly, the right-hand side f will not necessarily belong to H-'(Q), and so on. For the time being, we drop any smoothness requirement on dQ. We shall only assume that R is a bounded open subset of R".As a matter of fact, in many statements, even the boundedness assumption can be dropped. In this section, all functions and distributions will be real-valued. This is a technical difference with little bearing on the conclusions : The operator under study, - A, A 2 0, transforms real functions into real functions, and the complex function (A - A)u =f can be decomposed into two real equations of the same kind. The spaces H"(R), C"(Q), etc., will all consist of real functions and distributions; H"(C2) will be regarded as a real Hilbert space. We begin by introducing a new notion of (partial) ordering on elements of H 1 ( Q ) . We recall that C"(C2) n H'(C2) is dense in H'(R) (Proposition 24.1).
a,
Definition 28.1. Let E be any subset of u any element of H'(Q). W e say that u is nonnegative on E in the sense of H'(Q) if there is a sequence {u,} in Cm(Q n H1(R)which converges to u in H ' ( Q ) and such that the following is true, for each v, (28.1)
there is an open neighborhood U, of E in R" such that u, > 0 in U, n R.
W e shall then write u
> 0 on E. 259
260
[Chap. I11
BOUNDARY VALUE PROBLEMS
<
If - u 3 0 on E, we write u 0 on E ; we say then that u is nonpositive on E in the sense of If'@). If we have both u 3 0 and u 0 on E, we write u w 0 on E. We may then compare any two elements u, v of H'(R) on E by checking whether we have u - v 3 0 or u - v 0 on E. Then, of course, we write u 3 v or u v on E, respectively, and so forth. We say that u is bounded on E in the sense of H'(R) if u 4 M on E for some real number M . The infimum of those numbers M will be called the maximum of u on E and denoted by max, u. We can also define the minimum of u on E ; we shall denote it by min, u. Let us also observe that the set of functions u E H'(R) such that u 3 0 on E is a closed convex cone in H'(R). One may wish to compare the notion " U 0 on E" with the more customary one, " U 2 0 almost everywhere (a.e.) in E." The following statement is easy to prove and will be used in the sequel:
<
<
+
PROPOSITION 28.1. r f u >= 0 a.e. in R, then u 3 0 on 0. I f E is asubset of R and i f u 3 0 on E, then u 2 0 a.e. in E. Proof. The proof of the first assertion is practically evident if we go back to the proof of Proposition 24.1: The functions v, and v constructed in that proof are 2 0 in R, and u + E, which is > 0 in R, converges to u in H'(R) as E goes to +O. Next we prove the second assertion. Let u, E Cm(R)n H'(R) converge to u in H'(R) and for each v, let U,be an open neighborhood of E in R in which u, is $0. Let Fdenote the intersection of all the U,,'s;E c F a n d F is measurable. Since the u, converge to u in L2(R)there is a subsequence of the u,'s which converges to u a.e. in R. As the elements of this subsequence are all >O on F, we reach the conclusion that u itself must be 2 0 a.e. in F. Q.E.D.
Let us now consider a (real-valued) function G(t) on the real line, which is uniformly Lipschitz continuous, i.e., such that, for some constant K > 0,
(28.2)
IG(t) - G(t')I
Klt - t'I
for all t , t
' R'. ~
It is clear that the distribution derivative G' of G belongs to L"(R') and that its L" norm is bounded by K . Indeed, given any cp E C r ( R 1 ) ,
(G', cp)
=
- JG(t)cp'(t) dt
= lim
1
/ G ( t ) [cp(t) - cp(t
h+O
+ h)] dt
h#O
[G(t)- G(t - h ) ] q ( t )dt, h
hence, by (28.2), I (G', cp)l 5 KJlcpll,,, which proves our assertion [note that the converse is also true: if G' E L"(R'), then (28.2) holds with K = / ~ G ~ ~ L m ] . We have
Sect. 281
A WEAK MAXIMUM PRINCIPLE
26 1
LEMMA 28.1. Let C be uniformly Lipschitz in R' and let G' denote a bounded representative of its distribution derivative. Then U H C(u) maps H'(R) into itseg, and (28.3)
(d/dxj)G(u) = G'(u)(d/axj)u,
j = 1, . . . , n,
with the agreement that the right-hand side vanishes wherever any one of its factors does. Ifmoreover G(0) = 0, there ut+G(u) maps Hh(R) into itse[f: Proof. Let (u,} be a sequence in Cm(Q) n H ' ( 0 ) converging to u in H ' ( Q ) . We shall apply (28.2) repeatedly: First of all, for each v, IC(u,(x))I 5 I G(u,(x,)) I + KI u,(xo) I + KI u,(x) 1, where x is a variable point in R whereas xo is a fixed, arbitrarily chosen point of R. This shows that each G(u,) belongs to L Z ( R )(observe that all these functions are continuous in a). Next we have IIG(u,) - G(u,,)llLz(n)5 K(Ju,- u ~ I I ~ ~ ( which ~ ) , shows that the C(u,) form a Cauchy sequence in LZ(R). Next, for every x E R, I C(u(x))- G(u,(x)) 1 5 KI u,(x) - u(x) 1, which shows that G(u) belongs to LZ(Q)and that
(28.4)
llG(u,) - G(u)llLz(n) 2 Kllu, - 4 I L 2 ( R ) ;
thus the G(u,) converge to G(u) in L2(R). Formula (28.3) is true, as seen by direct computation [of the distribution derivatives of G(u)].In particular, it implies, for every v, (28.5)
Il(dldxi)C(u,)11,2(*, 5 Kll~vlll.
It follows from (28.4) and (28.5) that the G(u,) form a bounded sequence in H'(R); hence there is a subsequence of it which converges weakly i n H'(R), necessarily to G(u), which thus belongs to H'(R). Assume now that G(0) = 0 and that u E Hh(Q). We may take the sequence {u,} given above in CP(R). Then the G(u,) have compact support contained in 52, and therefore belong to Hh(Q) (Proposition 24.3). On the other hand, we have seen that C(u)belongs to the weakly closed convex hull of the G(u,). By a standard consequence of the Hahn-Banach theorem, this is the same as their strongly closed convex hull, which is contained in the closed linear Q.E.D. subspace HA(R).
PROPOSITION 28.2. Let u befong to H'(R) [resp., to Hh(R)]. Then I u l , u+ = sup(u, 0), u- = inf (u, 0) also do. Let v be another element of H'(R) [resp., of HA(Q)]. Then sup(u, v) and inf(u, v) also belong to H'(R) [resp., to Hh(Q)]. Proof. For the statement about IuI it suffices to apply Lemma 28.1 with G ( t ) = I t [ . Then u + = f ( u + lul), u - = u - u + , sup(u, u ) = u + ( o - u ) + , inf(u, v ) = u + ( v - u ) - . Q.E.D.
262
BOUNDARY VALUE PROBLEMS
[Chap. I11
We are going to need an extension of the Lax-Milgram theorem (Lemma 23.1). Let N be a real Hilbert space [for us, H will be HA(R) or, in Corollary 28.1,H = H'(Q)]; let K be a dosed convex subset of H . We denote by H' the dual of H , and by ( , ) the duality bracket between H and H'. We set, for any u E K, (28.6)
K,, = {v E H ;3p > 0 such that u
+ pv E K } .
When K is an afine subvariety (i.e., the parallel of a linear subspace), we have K,, = K - u, which is a linear subspace. In general, Ku is a convex cone; we have K,, = H if and only if u belongs to the interior of K . LEMMA 28.2. Let a(u, v ) be a continuous bilinear form on H x H . Suppose that a is coercive on H (Definition 23. I), symmetric, and that the associated quadratic form a(v, v ) is nonnegative. Given any f E H' there is a unique element u of K such that (28.7)
a(u, v) 2 (f, v )
for every v E K,,.
We obtain the Lax-Milgram theorem (Lemma 23.1), in the case of real Hilbert spaces, by taking K = H in Lemma 28.2. In this case, K,, = H and v E K,, if and only if - u E K,,, hence we must have simultaneously a(u, v) 2 and S ( L v ) . This argument remains valid whenever Kuis a union of whole straight lines (and not merely of half-lines), e.g., when K,, is a linear subspace (i.e., when K is an affine subvariety). Proof of Lemma 28.2. Set
1(u)= a(u, u) - 2(L u ) . We are going to show that there is a number d > - k such that Z(u) 2 d for every u E K and, for a unique u E K , 1(u)= d. The argument is the same as that proving the existence of an orthogonal projection into a dosed convex set in a Hilbert space of which, in fact, Lemma 28.2 is a restatement. First of all, by the coerciveness of a(u, v ) ,
hence d = inf,,, 1(u)> - co. For each,j = 1, 2, . . . , let K j = {w E K ;1(w)5 d + l/j]. If w l ,w 2 belong to K j , we have t(wl w2) E K since K is convex. Moreover,
+
= Z(w,)
+ Z(w2) - 21 ( W ' ; w 2 ) $ 2 ( d + ; ) - 2 d = l . ~
2
Sect. 281
263
A W E A K MAXIMUM PRINCIPLE
If we use the coerciveness of a(u, v ) once more, we see that the diameter of K j is 52/(cJj). Since K is closed, the intersection of the K j consists of a single point, u E K. Of course, i ( u ) = d. Now let v E H be such that u + pv E K for some p > 0. Since K is convex, we have u + tv E K for all t, 0 2 t 5 p, and we know that the function I(u tv) - f ( u ) = t2a(v,u) + t{a(u,u) - ( h u)> 2 0 on (0, p). This is possible if and only if (28.7) holds. Suppose that (28.7) were also true with u1 E K in place of u. Observe that we have u = u1 - u E K,, whereas -v E K,,,. We derive from (28.7):
+
a(u, v )
L
(f, v>,
-4u1,
4 2 --,
from which, by adding, a(v, v) 5 0. Since a(v, u) >= czllvlj&, we must have u = u,. Q.E.D. COROLLARY 28.1. Let a(u, v ) be a continuous symmetric bilinear functional on H x H , coercive on a closed linear subspace H , of H . Let K be a closed convex subset of H such that,for some ho E H , K c H , + ho. Then there exists a unique element u of K such that (28.8)
a(u, v ) 2 0
for every v
E
K,,.
Proof. Set KO = K - ho.Clearly KO is a closed convex subset of H , . Furthermore, u E K e u - ho E KO and u-ho+pvEKo
c>
u+~vEK,
hence K,, = {v E H , ; 3p > 0 such that u - ho + pv E KO}. We may apply Lemma 28.2 with H o substituted for H , KO for K, and withf : hr-, -a(ho, h). We conclude that there is a unique u - ho E KO such that
a(u - ho, v) 2 -a(ho, v)
for every u
E
K,,.
Dejinition 28.2. W e say that a distribution in R, u, is a subsolution of I - A i f ( A - I ) u is a positive Radon measure in R. W e say that u is a supersolution if - u is a subsolution. When A = 0, i.e., when the operator under study is minus the Laplace operator, subsolutions are called subharmonic distributions (or functions), supersolutions are called superharmonic distributions (thus u is subharmonic if Au is a positive Radon measure). I t is a classical theorem of L. Schwartz that any positive distribution is a positive Radon measure; therefore the claim that u is a subsolution of I - A means that (28.9)
((2 - 4 4 cp> = ( u , (A - A>cp> 5 0 for every cp E C:(R) such that cp 2 0 in R.
264
[Chap. I11
BOUNDARY VALUE PROBLEMS
If we know furthermore that u E H’(R), we may restate (28.9) in the following manner: (28.10)
a,(u, cp) = 1
ucp d x R
+
n
j=l
au
acp
--d x 5 0 nax~ax~ for all cp E C:(R),
THEOREM 28.1. Let u, v E H’(R) be two subsolutions of - A sup(u, v) is also a subsolution of - A + 1. Proof. (28.11)
Set w
= sup(u, v )
and let K denote the set off
f<w
onR,
E
cp
2 0.
+ 1. Then
H’(R) such that
f-wEHA(R).
As the intersection of a closed convex cone and of a closed affine subvariety of H1(R), K is a closed convex subset of H’(Q). By virtue of Corollary 28.1, applied to a,(u, v), we obtain that there is a unique q E K such that (28.12)
a,(q, g) 2 0
for all g
E
K,.
At this point we observe that K,, contains all functions cp E C:(Q) such that cp 5 0 in R. Indeed, if q w, we also have q + pcp w whatever p >O, and if q - w belongs to HA(Q), so does q + pcp - w. If we replace g by such a function cp in (28.12), we obtain that q is a subsolution of -A + A. Let us now set 5 = sup(u, q ) = u + ( q - u)’ = q - (q - u ) - . Clearly (cf. Proposition 28.2), 5 w on a. We know that q - w is the limit in H’(R) of a sequence {cp,} of elements of C.: If one goes back to the definition of the supremum of two elements of H’(R) (cf. Proposition 28.1), one derives that 5 - w belongs to the closed convex hull of the set of functions sup(u - w,q,), and the latter vanishes outside of the support of cp, (since u 5 w everywhere), hence has a compact support and therefore belongs to Hk(R). As a consequence, we see that 5 - w E H@). In summary, 5 E K and, of course, 5 - q E K,, . From (28.12) we derive
<
<
<
5 - v ) 2 0. From our definition of 5 it follows that (5 - u)([ - q ) = - ( q - u)’(q - u)-
(28.13)
QAV,
vanishes identically in R. On the other hand, the first-order distribution derivatives of (5 - u) and of (5 - q) are locally integrable functions. It follows from this that (28.14)
(we recall that, in our terminology, a function is always viewed as a distribution and we say that it vanishes “identically” when any one-and every one-
Sect. 281
265
A WEAK MAXIMUM PRINCIPLE
of its representatives vanishes almost everywhere). From (28.14) it follows at once that a,([ - u, i- I') = 0 , i.e., a,(L 5 - 'I)= a,(% i- 'I). We shall now use the fact that 5 - v] E and that 5 2 'I in R. By cutting and regularizing we see that 5 - v] is the limit, in H'(n),of a sequence of nonnegative elements of C,"(R), hence, by (28.10), we have a,(u, i- I') 5 0. Consequently, (28.15)
By subtracting (28.13) from (28.15) we see that a,(( - v ] , i- r ] ) 5 0 hence, since is coercive, 5 = 'I. Thus we have u 5 v] 5 w almost everywhere. Similarly, we have v 5 v] 5 w almost everywhere. By modifying, or rather choosing appropriately the representatives of u, v, and 'I, we may assume that those inequalities are valid everywhere. Then, for any given x E 0, if w(x) = u(x), we have v](x)= w(x); and the same conclusion is valid if w(x) = v ( x ) .We reach the conclusion that w = v ] . As we have already shown that 'I is a subsolution, the proof of Theorem 28.1 is complete.
a,
COROLLARY 28.2. sup(u, v\.
If
COROLLARY 28.3.
If u E H'(R) is subharmonic, the same is true of
u,
v
E
H'(R) are subharmonic, the same is true of
WtI = SUP(% whatever the real number 8. In particular, u+
= sup(u, 0)
is subharmonic.
COROLLARY 28.4. Let A be 2 0 , u E H'(R) be a subsolution of - A given any number 8 5 0, { u } is ~ a subsolution of - A + 1.
+ A. Then,
Indeed, (A - 4 0 = -A8 2 0. We come now to the main result of this section: THEOREM 28.2. Let u E H'(R) be a subsolution of -A almost everywhere, we have (28.16)
+ 1(A 2 0). Zf
u20
max u S max u. R
an
Remark 28.1. By virtue of Proposition 28.2, the inequality (28.16) means that u S maxaRu almost everywhere in R.
266
[Chap. 111
BOUNDARY VALUE PROBLEMS
Proof of Theorem 28.2. We assume M = maXaRu < following lemma.
+ co. We
need the
LEMMA 28.3. Let u E H'(R) be bounded on dR in the sense of H'(R). I f 0 is any real number strictly greater than maxan u, (u - 0)' E HA(R). Proof. By hypothesis we may find a sequence (40,) in Cm(R) n H'(R) converging to u in H'(R) and such that, for each v, (p, < 0 on a set U,n R, where U,is an open neighborhood of dR in R".But (cf. proof of Lemma 28.1) (u - 0)' belongs to the closed convex hull in H'(R) of the set of functions (cp, - 0)'; for each v, (cp, - 6)' vanishes in U, n R, and therefore belongs to HA(R) (Proposition 24.3) and so does (u - 0)'. Q.E.D.
s
Suppose now that u 2 0 a.e. in R. If we had - 00 M < 0, we would derive from Lemma 28.3 that ( u - 0)' = u 10 I E H ~ ( Rfor ) any 0, M e 0<0. But this is incompatible, as the student may easily ascertain, with the fact that u 101 2 101 > 0 a.e. in R. Thus we must have M 2 0. Suppose now, in addition to u 2 0 a.e. in R, that u is a subsolution of -A I. Let then t? be any number greater than M. Clearly, u - 6 is a subsolution of -A I, and so is ( u - 0)' by Corollary 28.4. Consequently, for all cp E Cr(R), cp 2 0 in 0,
+
+
+
+
(28.17)
a,((u -
fa+, cp) s 0.
But (28.17) extends by continuity to all 40 E Hk(R) such that cp 2 0 a.e. in $2. By Lemma 28.3 we may therefore take cp = (u - 0)+, and by the coercivity of a, we conclude that (u - 0)' = 0, i.e., u 5 6 a.e. This, of course, implies u M a.e. Q.E.D.
s
COROLLARY 28.5. Let u E H1(R) be a subsolution of -A
+ I(A 2 0). Then
max u max .'u n an Indeed, u 'u in R, and we may apply (28.16) to u' substituted for u (by virtue of Corollary 28.4).
(28.18)
s
COROLLARY 28.6. Let u E H'(R) be subharmonic. Then max u 5 max u. n an Proof. Call M the right-hand side of (28.19), assumed to be < +a, and let 0 be a real number strictly greater than M. Then u - 0 is obviously subharmonic (here I =0), hence by (28.18) we know that u - 0 =< maxdn(u- 0)' a.e. in R. But (Lemma 28.3) (u- 0)'EHh(R) hence maxdn(u - 0)' = 0. Thus u 5 0 a.e. in R and therefore u 5 M a.e. in R. Q.E.D.'
(28.19)
Sect. 281
267
A WEAK MAXIMUM PRINCIPLE
COROLLARY 28.7. Let u E H'(R) be a supersolution of -A almost everywhere in R. Then
+ 1(1 2 0), u 5 0
min u S min u.
(28.20)
an
Proof. u is a subsolution of - A valid for - u replacing u.
R
+ A, a.e. nonnegative, therefore (28.16) is Q.E.D.
COROLLARY 28.8. Let u E H'(R) be such that ( - A
+ I)u = 0 in R. Then
(28.21) Proof. Note that u and - u are subsolutions of -A + I , therefore this is also true of u+ and of - u p = sup( -u, 0). We may apply (28.16) to u+ and Q.E.D. to -u-. We immediately obtain (28.21). COROLLARY 28.9. Let u E H'(R) be a subsolution of -A + I , h E H1(R) such that ( - A + I)h = 0 in R. If u - h E HA@), u 5 h a.e. in R. ProoJ Clearly u - h is a subsolution of ( - A + I ) , and so is (u - h)'. The latter belongs to H,@) by Proposition 28.1, hence, by virtue of Theorem 28.2, ( u - h)+ 5 0 a x . in R. Q.E.D.
COROLLARY 28.10. Let u E HA(R) be a subsolution of - A a.e. in 0.
+ I . Then u 5 0
Remark 28.2. The entire argument in this section would have worked equally well, had we been dealing with more general second-order elliptic operators than - A 1.We could have applied it to any operator L of the form (23.6) whose associated bilinear form a(u, v ) [see (23.7)1 is symmetric, continuous on H ' ( 0 ) x H'(R) and coercive on HA(R). The symmetry requirement could even be dropped-but then we need a further generalization of the Lax-Milgram theorem: In Lemma 28.2, the condition that a(u, v ) be symmetric should be shown to be superfluous. This can be done, but the corresponding proof is considerably more difficult than that of Lemma 28.2. When dealing with general operators L of the form (23.6) one might ask whether there is an analog to the results which concern the Laplacian A, such as Corollaries 28.2, 28.3, and 28.6. Inspection of their proofs shows that the only fact which was used was that the constant functions where subsolutions (of - A). Therefore, analogous statements will be valid whenever the zero-order term c ( x ) in (23.6) is identically zero. In general, in the proofs of the statements about subsolutions when c(x) is not identically zero, it is necessary that the nonpositive constant functions be subsolutions : This demands that c(x) be 2 0 a.e. in R.
+
29 Application: Solution of the Classical Dirichlet Problem We use the same notation as in Sect. 28: R is a bounded open subset of R", where we study A - A, with A 2 0. Unless otherwise specified, all functions and distributions are supposed to be real-valued. Let g be any continuous function on the boundary dR. We can extend g as a continuous function with compact support in R",and construct a sequence of C" functions, say polynomials, in R",which converge to g uniformly on its support. Let then { g j } ( j = 0, 1, . . .) be their restrictions to a. For each j , there is a unique function uj E Hi@) such that
(A - A)uj = 0 in R,
(29.1) (29.2)
~j
- gj E HA(R).
By the weak maximum principle (Corollary 28.8) we have (29.3)
maxIuj - uj.l n
5 maxluj - uj.l an
= maxIgj - gj.l an
This implies a t once that the uj converge uniformly in R; let u E Co(R) be their limit. Of course, we have (A - A)u = 0 in R. I t is also clear that u is independent of the choice of the sequence {gj} (and therefore of the sequence uj), as one checks by comparing u to the limit of a different such sequence. Definition 29.1. The function u just given is called the generalized solution of the classical Dirichlef problem, (29.4) (29.5)
(A - A)u = 0
in R,
u = g ondR,
and will be denoted by H , ( g ) . Suppose that there is u E Co(n) satisfying (29.4)-(29.5) or else that g can be extended to $2 in such a way as to belong to Co((n) n H'(R), and that there is 268
Sect. 291
SOLUTION OF CLASSICAL DIRICHLET PROBLEM
269
u E H’(R) satisfying (29.4) and such that u - g E HA(R). In both those cases, u = H,(g) (Exercise 29.1). Let us denote by Bo(R) the space of bounded continuous functions in R, equipped with the norm sup, 1 f 1. From (29.3) and the fact that maxn I u j l 2 maxpR 1 gj 1, we derive
PROPOSITJON 29. I .
H, defines a continuous linear map of Co(dR) into B0(R).
Let then x be a fixed. but arbitrary, point of R. The linear map
g++H,(g)(x)
(29.6)
is continuous on Co(dQ) and therefore defines a Radon measure m X on
an:
(29.7) The case A = 0 deserves special mention. In this case, the measure rn, is called the harmonic measure on dR. Because the unique harmonic function in R, identically equal to one on dR, is the function identically equal to one in R, we see that the total mass of m, is one. Because of the “minimum principle,” we have
whatever the nonnegative continuous function g on dR, i.e., m, is a positive Radon measure on d o . In summary, we see that m, is a probability on dR. This fact leads to a new interpretation of the Dirichlet problem (for the Laplace equation), in the context of probability theory. A brief description of this approach (or, rather, of its first steps) can be found in Sect. 31. It is also clear that the integration theory for Radon measures enables us now to extend the domain of definition of the functional HA:
Definition 29.2. A real-valued function g in dR (not necessarily continuous) is said to be resolutive if it is integrable with respect to m, for every x in R. Formula (29.7) defines also in this case the generalized solution of the Dirichlet problem (29.4)-(29.5). We may also define the generalized solution of the inhomogeneous Dirichlet problem :
( A - A)u
(29.8)
u=g
(29.9) fo r f E
=f
c0(Ti), g E CO(dR).
in R, ondR,
270
BOUNDARY VALUE PROBLEMS
[Chap. I11
Letfbe equal tofin and to zero in R"\n. Let E be any fundamental solution of 2 - A. We shall apply the following: LEMMA 29.1. The convolution E * f is Lipschitz continuous in R" and to every compact subset K of R" there is a constant C > 0 such that (29.10) (29.11)
SUP x,x'eK
scIx-x'III~II~~.
I(~*f)(x)-(E*f)(x')I
Proof. Since it does not matter what fundamental solution of I - A we deal with, we may take E = G,, the distribution considered in Exercise 9.7. It follows at once from (9.27) and (9.28) that E and grad E are locally integrable in R" and that, consequently, ft+ E * J f-grad E * f a r e continuous linear mappings of L"(R) into Lgc(R";R) and into L,",,(R";R"),respectively. Q.E.D.
Let v then denote the restriction of E +o!t R, Ij its restriction to dR. Both restrictions are continuous. We may then take, in (29.8)-(29.9), (29.12)
u =0
+ H l ( g - Ij).
It is interesting to consider the case where g = 0. Then (29.12) can be written more explicitly, taking (29.7) into account:
that is, (29.14)
u(x) =
1
n
G ( x , x')f(x') dx',
where (29.15)
~ ( xx') , = E ( X - x') -
Jan E(Y - x') dm,(y).
It is evident that G ( x , x') is a distribution in R x R; it is a C" function in the complement of the diagonal. In fact, G(x, x') - E(x - x') is a Cm function in R x R. It is the generalized solution of (29.4)-(29.5) when g(x) = E ( x - x'). The function G(x, x') does not depend on the choice of the fundamental solution E. Indeed, any other fundamental solution differs from E by a function which is a solution of ( A - A)h = 0 in the whole space. But then h(x - x') = Iflnh(y- x') dmx(y),
X , X' E Ti.
Sect. 291
SOLUTION OF CLASSICAL DIRICHLET PROBLEM
27 1
Definition 29.3. The distribution G(x, x’) in .R x R is called the Green function of the Dirichlet problem for A - A in R. We have
(29.16)
( A - Ax)C(x, x’)
= ( 2 - Ax,)G(x,x’) = 6(x - x’)
C(X, x’) = 0
(29.17)
in R x R,
for X E ~ R (x‘ER),
in a generalized sense akin to that of Definition 29.1. For “ general f and g, the generalized solution of (29.8)-(29.9), given in (29.12), can be written ”
(29.18)
u( x) = J G(x,x’)f(x’)dx‘+j R
g(y)dm,(y),
XER.
aR
I t is important t o underline the fact that, even when g is a continuous function in dR, and xo an arbitrary point of dR, u(x) does not necessarily converge to g ( x o )as x E R tends to xo. We wish now to take a look at the points where this occurs [that u(x) + g(xo)as x + xo].
Definition 29.4. A point xo of the boundary dR is called regular $, whatever the continuous function g on dR, H,(g)(x) converges to g(xo) as x E R converges to xo . It is not difficult to obtain a necessary condition for a point to be regular. First of all,
PROPOSITION 29.2. Let xo E dR be regular for A - A. Then, whatever the point x’ in Q, G(x, x‘) tends to zero us x E R tends to xo, and whatever the continuous function f in 0,Jn G(x, x‘)f (x’) dx’ -+0. This is evident by (29.13) through (29.15).
PROPOSITION 29.3. Let xo E dR be regular for A. - A. There is a function
b E H’(R) having the following properties: (29.19) (A-A)/?= (29.20)
-1
inn;
b < 0 in R and, whatever y
E
dR, y # xo, l z b(x) < 0; Rsx-ty
(29.21) b(x) -P 0 us x
E
R tends to xo .
Proof. Consider w(x) = 1 - exp(M1x - xoI2). It is easily checked that (A - A)w > A 2 0 as soon as 2nM > 2. Let b be the unique solution of (29.19) in H’(C2) such that p - w E HA(R). Since b is a subsolution and w a supersolution, we derive from Corollary 28.10 that b < w in R, whence (29.20). By
272
BOUNDARY VALUE PROBLEMS
[Chap. I11
Definition 29.1, we have p(x) = -fn G(x, x') dx' +fan w(y) dmx(y),and by the hypothesis that xo is regular, taking into account Proposition 29.2, we get that p(x) -+ w(xo)= 0 as x E fl tends to xo . Q.E.D. We shall prove a kind of converse to Proposition 29.3. First we introduce the classical definition : Dejnition 29.5. Let xo be apoint in the boundary an. A continuousfunction p in I]. will be called a barrier at xo(for 1 - A) i f p is a subsolution of 1 - A and satisfies (29.20) and (29.21).
Before stating and proving the main result of this section (Theorem 29.1) let us observe that the following function in 22, (29.22)
P*(y) = 1%
P(X),
Y E an,
nsx-y
is upper semicontinuous. In particular, it reaches its maximum on any compact subset of 30. THEOREM 29.1. For apoint xo E an to be regular, it is necessary and suficient that there be a barrier p at xo which belongs to HI@). Proof. The necessity follows from Proposition 29.3 and we shall now prove the sufficiency. Let g be an arbitrary continuous function on an and set u = H , ( g ) . We are going to show that given any E > 0, there is an open neighborhood U(xo)of xo in R" such that
(29.23)
I u(x) - g(xo) 1
<E
for every x E U(xoj n il.
If we go back to the definition of H,(g) at the beginning of this section (by means of the sequencesgj and u j j , we see that it suffices to prove our assertion when g can be extended as a C" function in R"; we continue to denote the extension by g. Let q, z be two numbers greater than zero ( q will be small, z large). Let h+(resp. h - ) be a function in R" satisfying (A - A)h, = 0 and such that h,(xo) = g(x0) f q.
We set v=h-
+ ~ p , w=h+
-ZB;
u is a subsolution, w a supersolution of 1 - A, We are going to show that u < g < w in n, provided that z is large enough. Indeed, in a sufficiently small neighborhood N(xo) of xo in R", we have h- < g < h + ,
Sect. 291
SOLUTION OF CLASSICAL DIRICHLET PROBLEM
273
hence, since /3 < 0 in R, (29.24)
Vx
v ( x ) < g(x) < w ( x ) ,
E
N ( x , ) n R.
Let us assume that N ( x , ) is open and let robe the complement of N(x,) n 8R with respect to 80.According to our preliminary remark, if p* is the function on di2 defined in (29.22), there is a number c > 0 such that p*(y) < -c for all y E T o . But this implies at once that, if we decrease c a little bit, we will have
p < -c
in R\N(x,).
Of course this means that we may choose 7 so large as to have
v < i nfg
(29.25)
-
sup g
I],
+ I] < w
in Q\N(x,).
The conjunction of (29.24) and (29.25) implies t) < g < w in R. Set V = v - u ; V obviously belongs to H'(i2) and is a subsolution of A - A in R. Similarly, W = w - u E H'(R) and it is a supersolution in R. We have, by Corollary 28.5, (29.26)
max V j max V + . R
PR
Observe that
v + = (u
-g
- ( u - g))' = sup(v - g,
u - g) - ( u - g).
We know u - g E H;(R)). Let {$j), j = 1, 2, . . ., be a sequence of functions in C,"(Q) which converge to u - g in H'(R). We know that sup(v - g, u - g) belongs to the closed convex hull spanned by the functions sup(v -9, $ j ) (cf. the proof of Lemma 28.1) and since v < g i n R, we have sup(v - g, + j ) = 0 where $ j = 0. We conclude that every function sup(u - g, $ j ) belongs to HA(R) and that this is also true of sup(u - g, u - g ) and therefore of V ' . But then maxi'R V' = 0 and, by virtue of (29.26), V 5 0 a.e. in Q. But v is continuous and u analytic in R, hence u 5 u everywhere in R. Similarly, u 5 w in 0. By using the definition of u, w and the continuity of h , together with the property (29.21), we see that we may choose g and U(x,) sufficiently small, so as to satisfy (29.23). Q.E.D. We shall conclude this section by giving a sufficient condition in order that there be a barrier at a given point x, of the boundary do. This criterion is classical; it should be noted that it is independent of A 2 0. PROPOSITION 29.4.
If there is a closed ball B in R'such that
(29.27)
{x,} = Ti n B,
then there exisrs a distribution
(29.28) (29.29)
fl
in R" such that
Ap = 1 in the complement of the center of B ; B(x,) = 0, p(x) < O f o r all x E x # xo
a,
274
[Chap. 111
BOUNDARY VALUE PROBLEMS
The condition (29.27) is often expressed by saying that there is an osculating sphere to Q at xo . Proof. We may assume that the center of the ball B is the origin; let R be its radius. We take p to be a function of s = I xI2 alone; then
AD = 4s(~?/as)~/? + 2n(a/as)P. 1 p(x) = - (1x1' - R 2 ) 2n
+ C( I x I 2 - " - R 2 - " )
P(x) =+(Ix12 - R 2 ) - Clog(lxl/R)
if n > 2, if n
= 2.
Whatever the constant C > 0, AP = 1 in R"\{O} (in fact, Ap = 1 + C'6 in R"). Furthermore, p ( x ) = 0 for 1x1 = R, in particular p(xo) = 0. It remains to choose C large enough so as to have, for every x E x # xo ,
a,
1x1' - R2 < 2nC(R2-" -
Ix('-")
Ixl2 - R Z < 2C log( I x ( ~ / R ~ )
if n > 2, if n = 2.
Since R is bounded it suffices to verify this when 1x1 = R small, in which case it is practically evident.
+ E,
with
E >0 Q.E.D.
We may say that the classical Dirichlet problem f o r II - A is well posed in R YE Co(Q) and to every g E CO(dR)there is u E Co(Q) satisfying (A - A)u = f i n R, u = g in i3R. By Proposition 29.3 we see that this is the case if and only if there is a barrier belonging to If'@) at every point of 8R.By Proposition 29.4 we see that large classes of open sets have this property. For instance: (i) all convex sets; (ii) all sets R whose boundary is a C' hypersurface (and which lie on one side of it). It is easy to make up more examples. if to every
Exercises
29.1. Prove the assertions immediately following Definition 29.1. 29.2. Give an example of a bounded open subset R of the plane R2 which is not convex, whose boundary is not a C' curve, and which nevertheless has the property that, given any point xo in its boundary, there is a closed disk in RZ intersecting only a t xo (cf. Proposition 29.4). 29.3. Prove that the set of points xo in the boundary i3R of a bounded open subset R of R", such that there is an open ball B in R"\Q having the property (29.27), is dense in i3R.
a
Sect. 291
275
SOLUTION OF CLASSICAL DlRlCHLET PROBLEM
29.4. Let R be any bounded open subset of R",G(x, x') the Green function of the operator 1 - A in R (2 2 0). Prove that G is a symmetric distribution in R x R [this means that
I
(29.30) 29.5.
when n
n
G(x, x')~(x')dx'
=
G(x', x)&x') dx',
V
4 E C:(R)].
Let R and G(x, x') be as in Exercise 29.4. Prove that, in Q x R,
2 3,
(29.31)
as
whereas, when n
J x- X'I
-
0,
= 2,
Give a precise meaning to these equivalences. 29.6. Prove that the harmonic measure, relative to the open ball 1x1 < R in R", is equal to (29.33) where do,, is the canonical area measure on the sphere I yl = R. Let us now denote by x* the inverse of x with respect to the sphere 1 y [ = R :
Prove that the Green function for the Laplace operator, relative to the open ball I x I < R , is equal, when n 2 3, to
where C,
= (n - 2)/1 S"-'
I, whereas, when n = 2, it is equal to
(29.35) 29.7. Let R be a bounded open subset of R" whose boundary is C', and which lies on one side of it. Let G(x, x') be the Green function of A - A (A > 0) relative to R, and dm,(y) the "metaharmonic" measure, defined in (29.7). Derive from Green's formula (10.8) that
(29.36)
276
[Chap. 111
BOUNDARY VALUE PROBLEMS
where dldv is the normal derivative, with respect to the y variable, in the interior normal direction to dR. 29.8. Check formula (29.36) when 1 = 0 and R is the open ball of radius R > 0, centered at the origin. 29.9. Let R be a bounded open subset of R",G(x, x') the Green function of 1 - A in R. Prove that the mapping
(
C,"(R) 3 f$H
G(x, x')&x') dx'
X H
Jn
)
can be extended as an isomorphism of H-'(R) onto HA(R). 29.10. Let us denote by x, y , z the coordinates in R3, and set r 2 = x2 + y 2 . Denote by l2 the open set in R3 defined by (29.37)
r 2 + z2 < 1,
r > exp(- 1/2z)
if z > 0.
Describe R and show that every point of its boundary different from zero is regular (Definition 29.4). Let p be the positive Radon measure 4 H1; 4(0, 0, z)z dz and call u the restriction to R of the convolution E * p, where E = (4n)-'(x2 + y 2 + z')-'/~. Show that, up to a constant factor, (29.38)
+ z2)'l2 + zIogl(r2 + z2)1/2+ zI I(r2 + ( z - I)~)''~ + i - zI
u = (r2
+ (z -
- (r2
- 22 log r.
Prove that u is harmonic in R and that the restriction of u to the boundary dR is continuous on 132. Compute the limit of u as (x, y , z) -+ 0 along the surfaces
r
= exp( -a/2z),
z > 0,
for various values of M , 0 < ct < 1. Conclude from this that the origin is not regular (relatively to the Laplace equation and to the open set R ; a point of dR like the origin in this exercise is called a Lebesgue spine). 29.11. Let R be a bounded open subset in the plane R2, zo a point in its boundary such that there is a straight line segment [zo, z l ] entirely contained in R2\R (zo # zl). Prove that zo is then regular for the Laplace equation in R. [Hint: Consider a branch of log(z - zo) adapted to the situation and study the function -Re{lllog(z - zo)}.] 29.12. Let a < b be two (finite) real numbers; set L = b - a. Show that, given any complex number R which is not one of the numbers -k27c2/L2,k an integer 2 1, there is a Green function G,(x, x') for the differential operator 1- d2/dx2in the open interval ]a, b[. Setting
Sect. 291
SOLUTION OF CLASSICAL DIRICHLET PROBLEM
277
where Ji is the branch of the square root which is positive for z real positive, give the explicit expression of G,(x, x ’ ) - E,(x
- x’).
Describe what happens when a tends to -a or when b tends to +a,and also what happens when 1 tends to one of the critical values - k 2 x Z / L 2 , k = 1, 2, . . . (while a and b remain finite).
30 Theory of the Laplace Equation: Superharmonic Functions and Potentials
In the preceding two sections, we “ solved ” the Dirichlet problem when the data were merely continuous by adapting the classical Perron’s method and combining it with the use of the weak maximum (see Sect. 28). The advantages of this approach are twofold: first of all, it is a natural extension of the variational method, which has led to the weak solution; second, and more important, although we have applied it only to the typical operator A - A, it extends to a very wide class of second-order elliptic equations, equations whose coefficients are not even required to be continuous (see [2]).It should be noted, however, that in the study of the Laplace operator proper, -A, we dispose of more refined information than that available in the more general situation. Specifically, we have used subsolutions (and supersolutions) which obey the maximum (or the minimum) principle. In the case of the Laplace equation, these are subharmonic (or superharmonic) functions, which satisfy a more precise inequality, namely that the value of the function at the center of a sphere is at most equal to its average on the sphere itself (at least equal, if we deal with superharmonic functions). This fact has deep implications on their regularity, as we shall see in this section. The importance of superharmonic functions is that they include the socalled potentials (as we are going to see, roughly speaking every superharmonic function can be represented as the sum of a potential and of a harmonic function). Potentials are of course very important in the applications to gravitation theory, electrostatics and magnetism, heat transfer, and so on. Historically (in the eighteenth and nineteenth centuries), the Laplace equation came to play a fundamental role in the process by which these theories were established on a firm mathematical basis. The type of phenomena it has 278
Sect. 301
SUPERHARMONIC FUNCTIONS. POTENTIALS
279
been used to describe conforms to a well-defined and fairly simple pattern : They relate to the “ states” of a homogeneous medium (which fills a certain region 9 in space), and they are stationary, i.e., they do not vary with time. There is afield in the medium, which for us means that we are given a vector field X in 9. The field has two basic properties: There is no circulation, and the flux is conserved. In common language, the first one means that there are no eddies. Mathematically speaking, this means that curl X = 0, which also means (at least locally) that X is the gradient of a scalar function U,the potential. The flux of X across a piece of surface 0 is mathematically defined as the integral over 0 of the normal component of X. Its conservation means that the incoming flux, at one cross-sectional end of a tube of force, equals the outgoing one, at the other end. Again, in mathematical terms, this means that the field Xis divergence free: div X = 0. If we combine this with the fact that X = grad U,we obtain precisely that AU = 0. However, the field must be created by something: the somethings are the “ charges ” or the “ masses (depending on which theory we are considering). Their presence in a (compact) region of space, R, is detected by the fact that the flux of the field X across any closed surface C enclosing R is not zero; this flux does not depend on the surface, since div X = 0 in the complement of R (provided of course that C is not too far away from 8). Its value can be taken as a measure of the total charge, or mass, carried by R. By virtue of the conventions agreed upon by physicists, in what concerns gravitation and electricity, it is the custom to equate this total charge to the incoming flux across C, that is, to ”
where v is the outer normal to Z. By formula (10.6), this is equal to the integral of -AUover the interior of C, that is, over R. If we assume that there is “creation” inside C (as opposed to absorption), we see that the total mass of -AU is at least zero. If we push this argument to the infinitesimal level, we see that -AUmust be a positive (Radon) measure; i.e., U must be a superharmonic distribution. Therein lies much of the motivation for our taking a closer look a t this kind of distribution. All functions and distributions dealt with in this section are real-valued, unless otherwise specified. Given a function f defined on the closed ball 1 x - x I 5 r (and sufficiently regular there), we introduce its average over the sphere I x - xoI = r,
280
BOUNDARY VALUE PROBLEMS
[Chap. 111
and also its average over the ball, (30.2) where B" is the unit ball in R" and 1 B" I is its "volume." Let R be an open subset of R". In what follows we deal with functions, defined in Q, whose values are either (finite) real numbers or else f M. Such a function is said to be lower semicontinuous at a point xo of R if to every (finite) real number CI CI for all x in U,; f is lower semicontinuous in a subset A of R iff is lower semicontinuous at every point of A . It is easily seen that, for a functionfto be lower semicontinuous in R, it is necessary and sufficient that f be the upper envelope of a family of continuous functions in Q. Definition 30.1. A function f in R is called hyperharmonic if it is lower semicontinuous, i f f (x) > - co f o r every x E 0, and if
(30.3)
f '(x; r ) Sf(x),
V x E R,
Vr < d ( x , CR).
By integrating both sides of (30.3) with respect to r we immediately obtain (30.3')
f@(x;r ) 2f(x),
V x E R,
Vr < d ( x , CR).
The terminology " hyperharmonic " is justified by the following fact, whose proof we leave as an exercise to the student: (30.4) I f f is hyperharmonic and u is harmonic in R, and $ whatever the point y of the boundary of R (or at injinity, if0 is unbounded),
lim (f(x) - u(x)) 2 0, x-Y
then f 2 u in R.
We have denoted by lim the lower limit. In particular, if B is any open ball whose closure is contained in R, and if I : denotes the Poisson integral o f f with respect to B [defined in formula (10.36): the integration over 8B can obviously be extended to semicontinuous functions], then f 2 1: everywhere in B. The hyperharmonic functions in R form a convex cone: Any finite linear combination of such functions with positive coefficients is again such a function. Two other important, but almost evident, properties of hyperharmonic functions are the following ones: (30.5) The upper envelope of any increasing oriented set of hyperharmonic functions in R is hyperharmonic in Q
Sect. 301
SUPERHARMONIC FUNCTIONS. POTENTIALS
28 1
[,' increasing oriented" means that, given any two elements f, g in the set, there is a third one, h, such that h 2 sup(f, g)].
(30.6) Iff, g are hyperharmonic in R, so is inf(f, g).
The next property is slightly less obvious:
PROPOSITION 30.1. Let f be a hyperharmonic function in 0. In each connected component of R, either f is identically equal to co or else it is locally Lebesgueintegrable.
+
Prooj The subset A of Q consisting of points in a neighborhood of which f is integrable is obviously open. Let xo E Q\A. By lower semicontinuity, there is a number, r, 0 < r < d(x,, CR),and a finite constant M such thatf (x)+ M>= 0 for every x E Br(xo),i.e., for every x such that Ix - xoI < r (we are using here the property that f is nowhere equal to - co). If I x1 - xo I < r/2, then xo E BrI2(x1)c Q, andf(x) + M cannot be integrable over BrI2(x1),otherwise f itself would be integrable over a neighborhood of x,, . By (30.3') we have ( f + M)(xl) 2 ( f + M ) @ ( x l ,r / 2 ) = +a.This shows that Q\A is also open and that f = + co in this set. Q.E.D.
A function g in R is called hypoharmonic if -g is hyperharmonic. By the mean value theorem (Theorem 10.1) we see that harmonic functions are both hyperharmonic and hypoharmonic; as a matter of fact, they are the only such functions (Corollary 30.1 below). Let us introduce now the classical terminology: DeJinition 30.2. Any hyperharmonic function which is locally integrable in R is called superharmonic.
A function f is called subharmonic if -fis superharmonic. I f f is harmonic in R, - I f I is superharmonic. An important class of superharmonic functions is the following:
Example 30.1 Newton's Potential This is the function (30.7)
where n is assumed to be >= 3 and r = I x 1. If we go back to the definition of the fundamental solution E of the Laplace equation in Sect. 9, we see that
(30.8)
-AG = 6, the Dirac measure, in R".
282
BOUNDARY VALUE PROBLEMS
[Chap. 111
When n = 2, we take G to be the fogarithmicpotential:
(30.9)
1 1 G ( x ) = - log - 2n r
It is clear that G is locally integrable in R";we shall regard i t as a "true" function, not as an equivalence class of functions a.e. equal; we shall agree that G(0) = +co. Let us then introduce the standard mollifiers P,(E > 0) : pe(x) = & - " p ( x / ~ ) ; p E C:(R"), p 2 0 everywhere, p dx = + 1. Then the regularizations G * pe are solutions of -AT = p e , hence are C" functions in R", which satisfy (Exercise 10.1) (G * pe)#(xo;r ) 2 ( G * pJ(x,) for all x , E R",r > 0 . By going to the limit as E + +0, we obtain
5
(30.10)
G # ( x o ;r ) 2 G(x,),
Vx, E R", Vr > 0
(noting that the restriction of G to any sphere is integrable over that sphere). Of course, we then also have
(30.11)
G@(x,; r ) S G(x,),
Vx, E R", r > 0.
It is clear that G(x) is lower semicontinuous.
Example 30.2
Potentials in R"
Definition 30.3. Let p be a distribution with compact support in R", G the Newton potential (30.7)i f n 2 3, the logarithmic potential (30.9)i f n = 2. The convolution C * p is called the potential of p and denoted by U p .
Potentials of the kind defined above are " global," that is, they are defined in the whole Euclidean space. Later on we shall introduce the Green potentials, which are defined in a given open subset R of R". The potential U' of the distribution p E 8' satisfies the inhomogeneous Laplace equation
(30.12)
-AUp=p
in R".
We recall (Weyl's lemma, Theorem 9.1) that U' is an analytic function in the complement of supp 1.1.
PROPOSITION 30.2. r f p E b', its potential U p is the unique solution of (30.12) which tends to zero at infnity. Proof. The uniqueness is obvious, since if there were another solution of (30.12) decaying a t infinity, V', the difference U p- V' would be a harmonic function in the whole space, decaying at infinity, hence identically zero by the
Sect. 301
283
SUPERHARMONIC FUNCTIONS. POTENTIALS
maximum principle. We must prove that U pitself decays at infinity. It suffices to note that, given some bounded open subset Q of R” containing the support of p , as d(x, Q) -, + co,y - G ( x - y ) converges to zero in Cm(Q)and thereQ.E.D. fore U”(x)= (py, G(x - y ) ) + 0. In the sequel we shall limit our attention to the case where p is a positive Radon measure, at first with compact support. The convolution of such a measure with a lower semicontinuous function such as G is lower semicontinuous. Moreover,
PROPOSITION 30.3. I f p is a positive Radon measure with compact support in R”, its potential U’ is locally Lebesgue-integrable in R”. Proof. Let us first consider the case n 2 3. Letfbe the characteristic function of an arbitrary compact subset of R”.By Fubini’s theorem for Radon measures and by a straightforward application of Lemma 29. I, we obtain
I
U’f dx
= J(G
*f)dp S const sup 1 (G* f ) ( x )1
-= +
00.
x E SUPP Ir
In more than two dimensions, G 2 0 everywhere, and therefore U’ 2 0 everywhere. This implies at once what we wanted. Suppose now n = 2. Since the support of p is compact, given any compact subset K of the plane, there is a number R > 0 such that
U p= ( z RG)* p
= ( z RG
+ 51) * p - 7
dp in K ,
where xR stands for the characteristic function of the disk 1x1 < R , 1 for the constant function equal to one, 7 for a large positive number. It suffices to show that (zRG + 71) * p is locally L ’ . For T large enough, xRG + ~l 2 0 everywhere, and the same argument as in the case n 2 3 applies. Q.E.D. Since y~ G(x - y ) is lower semicontinuous for every x in R“,we may form the potential of an arbitrary positive Radon measure p (not necessarily with compact support), (30.13)
U”(x)= G(x - Y ) dp(y).
Of course U p ( x )can be infinite. Noting that p is the (weak) limit of an increasing sequence of positive Radon measures pa with compact support [for instance, the measures xR(x)pas R -+ + co], we see that U p is equal to the upper envelope of an increasing sequence of lower semicontinuous functions (at least when n 2 3; when n = 2 one must add suitable constant functions, as in the proof of Proposition 30.3), hence it is lower semicontinuous.
284
BOUNDARY VALUE PROBLEMS
[Chap. 111
We may compute the average of both members of (30.13) (on the right-hand side, under the integral sign) over any sphere I x - xo1 = r. The inequality (30.10) at once implies
(30.14)
( U p ) # ( x , ;r )
5 Up(xo),
Vx,
E
R", Vr > 0.
We may summarize our findings as follows:
PROPOSITION 30.4. Let p be a positive Radon measure in R". Its potential U p is a hyperharmonic function in R". If the support of p is compact, U p is a superharmonic function in R". It is obvious that U p might be superharmonic, that is, locally Lebesgueintegrable, even in cases where the support of p is not compact. We recall that a superharmonic distribution in R is a distribution Tsuch that -AT is a positive Radon measure in R.
PROPOSITION 30.5. Let T be a superharmonic distribution in R. Given any relatively compact open subset R' of R, there is a positive Radon measure p with compact support in R" such that T - U p is a harmonic function in Q'. Proof. Any positive Radon measure p with compact support equal to -AT in R' satisfies the requirements of the lemma. The next statement justifies the introduction of Definitions 30.1 and 30.2:
PROPOSITION 30.6. For any distribution T in R the following properties are equivalent:
(30.15)
T is a superharmonic distribution in R;
(30.16) T is a function (in the sense of distribution theory) and one, and only one, of its representatives is a superharmonicfunction in R.
Proof: I . (30.15) implies (30.16). This follows a t once from Proposition 30.5 [the uniqueness of the representative in (30.16) is a consequence of the fact that two superharmonic functions which are a.e. equal are everywhere equal].
II. (30.16) implies (30.15). For d > 0, let R, denote the subset of R consisting of the points x such that d ( x , cC2) > d. It is clear that, i f f € L&JR), ( P , * f ) ( x )=
I
PAX
- Y ) f ( Y )dY
is well defined, and C", in R, . Let us take for f the representative o f T which
Sect. 301
285
SUPERHARMONIC FUNCTIONS. POTENTIALS
is a superharmonic function in R. For d > E , xo E Q d , and r < d - E , we derive from (30.3), ( P , *f)”(xo; r ) 5 ( P , * f ) ( x o ) .
* Af +O, then as
We apply formula (10.32) to u = pE * f and conclude that -Au = - p c
2 0 in ad. It suffices then to go to the limit, first as
E -+
d + +O.
Q.E.D.
COROLLARY 30.1. Any function in Q which is both hyperharmonic and hypoharmonic is harmonic. Indeed, f is continuous and by Proposition 30.6, Af
=0
in Q.
Example 30.3 Green’s Potentials As usual, let R denote a bounded open subset of R”, and let G(x, x’) denote the Green function of - A in Q. We have
where dm, is the harmonic measure on aR and we may take G(x) to be either Newton’s potential if n 2 3 (30.7), or else the logarithmic potential (30.9) if n = 2 [see (29.15)]. The first term on the right-hand side of (30.17) is a superharmonic function in R”; the second term is a harmonic function in R. Therefore, if p is a positive Radon measure in 0, we may form (30.18)
U’(x)
=
J
G(x, x’) dp(x’),
R
which is called the Green potential of p. Observe that the potential of p in R”, (30.19)
V ( x )=
J G(x - x ’ ) dp(x’), R
possibly infinite (notice that p does not necessarily extend as a measure on R”), is certainly hyperharmonic. We note that
is harmonic, though possibly constant and equal to + 00 in some connected component of Q. In a generalized sense, it takes the value V’ at the boundary.
286
[Chap. 111
BOUNDARY VALUE PROBLEMS
From the fact that the total mass of the harmonic measure m, is one, we derive easily that janV P ( y )dm,(y) <= V P ( x )for every x E R. We have (30.20)
U’(X) = V P ( x )-
Jan V”(y)dm,(y),
x E R.
If then v is another positive Radon measure in R, by Fubini’s theorem for Radon measures we obtain the reciprocity formula (30.21)
JnU Pdv = JnU’ dp.
We know (by Proposition 30.1) that, in each connected component of R, the Green potential U’ is either identically equal to +co or else it is a superharmonic function. A sufficient condition for the latter to happen is easy to obtain :
PROPOSITION 30.7. If the total mass of the positive Radon measure p in R is finite, its Green potential UPis a superharmonic function in R. Proof. To say that the total mass of p is finite is equivalent to saying that p extends as a continuous linear functional on .?tYo(R), the space of bounded continuous functions in R. Whatever x E R.
is a bounded function of x’ in R, according to Proposition 29.1, and therefore In h(x, x’) dp(x‘) < + co. On the other hand, p can be extended as a positive Radon measure ii on R”,setting $ = 0 in R“\n; and then we know that G * p, which is the potential of in R”,is equal to jnG ( x - x’) dp(x’); it is a superharmonic function. Q.E.D. Remark 30.1. To say that the total mass of the positive Radon measure p is finite is equivalent to saying that p ‘can be extended as a Radon measure on R”,supported by Ti (or “carried” by R, since the boundary of R is necessarily of measure zero, in the sense of the extended measure). In these circumstances the notation U p is ambiguous, for we must distinguish between the Green potential of p (in R) and the potential of the extended measure in R“.For our limited intent here, this ambiguity will not be harmful.
Of course, one can easily define the Green potential of an arbitrary Radon measure p in R. By linearity, we write p = p + - p-, where p’ and p- are positive measures, and set UP = UP+ -
up-,
Sect. 301
287
SUPERHARMONIC FUNCTIONS. POTENTIALS
Potentials with Finite Energy Definition 30.4. Let ji be a Radon measure in the bounded open set R, U” its Green potential. We say that ji and U” have jinite energy $ Ur is locally integrable in R and if its Dirichlet integral,
jRI grad U p ( x )1
(30.22)
dx
isfinite. It is then called the energy of U” (or of p). The energy of U p , (30.22), will be denoted by I U” I:, another Radon measure in R, we set (30.23)
also by
I ji I f . If v is
(grad U p ). (grad U v )dx.
(p,v ) ~= ( U ” , Uv)e= R
Similar definitions can be introduced when R = R”. We restrict ourselves, in the sequel, to the case where R is bounded. We know then that - A defines an isomorphism of HA(R) onto H-’(R). Its inverse, when restricted to C,“(R), or even to C:(R) c H-’(R), is identical with the Green kernel mapping:
4H
(30.24)
1
G( ., x’)&x’) dx‘
R
(Exercise 29.9). Let ji be a Radon measure belonging to H-’(R). By cutting and mollifying we may find a sequence of functions 4 j E CP(R) which converges to ji in H-’(R), such that G(x, X ‘ ) ~ ~ ( Xdx‘ ’ ) -+ U”(x)
for Each
x E R.
JR
This implies that U p is the pointwise limit of the latter converge to ( -A)-’ji in HA(R). This means that the equivalence class of U” (in the sense of Lebesgue integration theory) belongs to Hh(R). In the language of distribution theory we may state
PROPOSITION 30.8. The potentials withfinite energy in R belong to H,!JR). The potentials with finite energy form a dense, but nor closed, subset of HA(R): Indeed, on one hand their set contains the preimage under (-A) of the dense subspace C,“(R) of H-’(R); on the other hand, recalling that n > 1, there are elements in H - ’ ( R ) which are not Radon measures. Noting, however, that any positive distribution is a positive Radon measure and that thelimit of a sequence of such distributions is also a positive Radon measure, we derive
288
BOUNDARY VALUE PROBLEMS
[Chap. I11
PROPOSITION 30.9. The cone of the Green potentials of positive Radon measures in R, withfinite energy, is closed in H;(R). It is identical to the set of all nonnegative superharmonic functions belonging to HA(R). Now taking Proposition 30.8 into account and the duality between HA(Q) and H-’(R), we see that, if p and v are two Radon measures in R, belonging to H-’m, (30.25)
(p, v),
5
(UW, UV),= J U’ dv. n
Of course, IIpllf = jn U p dp. The quantity (30.25) is called the mutual energy of the measures (or of the “charges,” or of the “masses”) p, v. Thus, under the present hypotheses (of finiteness of the energy), the reciprocity formula (30.21) merely expresses the symmetry of the inner product (p, Y),. Note that, since R is bounded, the energy norm U”J1, is equivalent to the norm IIU’I(l on HA(R), and 11pl1, is equivalent to the norm llp(l-l on H-’(R). Thus the inner product ( , ), defines, on the set of Radon measures belonging to H-’(R), a structure of pre-Hilbert space (Hausdorff but not complete). Note also that if p and v are positive Radon measures, their mutual energy is 2 0 . From this it follows that, if p 2 v, then 11p11~2 llvll:. The preceding argument does not extend to R”,as - A is not an isomorphism of HA = HI onto H-’.
The Riesz Representation of Superharmonic Functions Letfbe a superharmonic function in an open subset R of R”and suppose that there is at least one subharmonic function v in R,u(x) > - 00 for every x E Q, such that u sfthroughout R. Let f * denote the upper envelope of all such subharmonic functions u. We contend that f * is harmonic in R. Indeed, if u 5 f is subharmonic and if B is any open ball with closure contained in R, the function 6 equal to u in R\B and to the Poisson integral If in B is also subharmonic [as one sees by checking that E(x) 5 B # ( x ; r ) for every x E R and every r sufficiently small] and smaller than u, hence, thanf, and therefore f* is equal to the upper envelope of these functions 5, which is harmonic in B. The function f * is called the greatest harmonic minorant off. Suppose now that R is bounded and f 2 0. Given any E > 0 we denote by R, the set of x E R such that d(x, CR) > E . The function X H ~ @ ( X ; E ) is well defined, and obviously superharmonic in RE.Moreover,f@(x; E ) S f ( x ) there andf@(x; E ) is a continuous function of x E Q, . When E + +O,f@(x;E ) .+f (x). For every E , let Rf be a relatively compact open subset of R,, containing
Sect. 301
SUPERHARMONIC FUNCTIONS. POTENTIALS
289
Rz,, whose boundary is a
C" hypersurface and such that R: lies on one side of it (locally). Let h, denote the generalized solution of the homogeneous Dirichlet problem for - A in the open set Cl\RL, with boundary data f @(.;E ) on aR: and zero on &. Since all the points of are regular, the function equal to f @(.;E ) in R: and to h, in R\Rf is continuous in the whole of R. We shall denote it by f,; it is superharmonic, and p, = - A A has compact support (contained in q).Moreover f,5 f and f, converges pointwise to f (in fact, if 2~ < d,f, 2 f,.). If Up6denotes the Green potential of p, (in R), we note that f , - U p c is harmonic in R and tends to zero at dR, in a generalized sense: i.e., it is the uniform limit (over R) of a sequence of functions uj E H'(R), harmonic in R, such that uj - g j E Hh(R), with g j E Co(a) n H'(R) tending to zero uniformly on We conclude easily from this that it vanishes identically. Inotherwords,
a.
(30.26)
f" = U"
But the measure p, is equal to p side in (30.26) is not less than
=
I
n
=
G(., x') dp,(x').
-Af in R: and therefore the right-hand
which converges to U" as E -+ + O [indeed, G ( x , x ' ) is integrable with respect to dp(x') and nonnegative]. By going to the limit as E -+ +0, we conclude that U p 5f Of course we know thatf - U p is harmonic in R (note that it is 2 0 and therefore locally integrable). Sincef 2 0 it has a greatest harmonic minorant, f *, which is also 20. We may apply the preceding argument to f -f * instead o f f ; U p remains the same, since p does. We conclude that f - UN- f * =f f is harmonic and 2 0 in R. But t h en f * +f is harmonic, and 5 f in R. In view of the maximal character o f f *, we conclude that .f = 0, and we obtain the representation formula for superharmonic functions, due to Riesz:
sf,
T
(30.27) where p = -Af and f * is the greatest harmonic minorant off (which, we recall, is nonnegative). We derive from (30.27): PROPOSITION 3 0 . 1 0 . Let R be a bounded open subset of R". In order that a nonnegative superharmonic function in R be the Green potential of a positive Radon measure in R, it is necessary and suficient that its greatest harmonic minorant be identically zero.
290
BOUNDARY VALUE PROBLEMS
[Chap. I11
Let F denote the space of locally integrable functionsf in R whose Dirichlet integral is finite (i.e., such that grad f is square-integrable). To say that Ilfll, = 0 is equivalent to saying that f is locally constant in R. Let $ denote the quotient o f f modulo the locally constant functions in R, and Ilfll, the norm on @ associated with the energy norm. It is a Hilbert norm, but of course $ is not complete. However, HA@) can be regarded as a closed subspace of $ (using again the fact that R is bounded) and the cone of Green potentials of positive Radon measures in R with finite energy can be regarded as a (closed convex) cone in $, which we denote by r.Although 4 is not complete, in view of the fact that r is, we may introduce the orthogonal projection [for the inner product ( , ),I of $ onto r. Let n denote this projection. Let f be any nonnegative superharmonic function in Q, f its class modulo locally constant functions. We know that f - nf is orthogonal, for ( , ) e , to the potentials U ' E HA@) where v is a positive Radon measure; but then this is also true when v is not necessarily positive. We know that the U' form a dense linear subspace of H;(R), and therefore f* =f- nf is orthogonal to the latter subspace. This implies that it is harmonic in a. But of course, -A(& = -Af = p, i.e., nf= U w , and there is a representative f * off* in 9 such that (30.28) f = U'+ f*, which is the representation formula (30.27) under the present particular circumstances. Thus (30.27) can be viewed as a generalization of an orthogonal decomposition formula, associated with the energy norm.
Capacity Potential and Capacity Let R be a bounded open subset of R", K a compact subset of R. We shall denote by W, the lower envelope (i.e., the infimum) of all superharmonic functions u in R, such that v 2 0 in R and u S 1 on K. It is seen at once that W,(x) 2 W,"(x; r), 0 < r < d(x, R"\R). However, W , might fail to be a superharmonic function, because it might fail to be lower semicontinuous. We denote then by V, the upper envelope (i.e., the supremum) of all lower semicontinuous functions w in R such that w 5 W , . The following facts are easy to ascertain :
5
(30.29)
0 5 V,
(30.30)
V, = W , = 1 in the interior of K;
(30.31)
V,
=
W,
W,
1 in R;
in R\K and both are harmonic in R\K.
PROPOSITION 30.11. The function V, is the Green potential (in R) of a positive measure p, carried by K .
Sect. 301
SUPERHARMONIC FUNCTIONS. POTENTIALS
29 1
Proof. It is easy to derive from its definition that V, is superharmonic in R (it is of course lower semicontinuous and all one has to check is that i t majorizes its average on spheres, which is easy). Suppose, for the sake of simplicity, that R is connected (otherwise reason in each connected component separately). Fix xo E R arbitrarily; the Green function G(x, x,,) is a potential [that of the Dirac measure 6 ( x - xo)] and is greater than zero everywhere in R, therefore it has a minimum c > 0 on K , hence 0 V, 2 c-’G(., xo). Thegreatest harmonic minorant of cV, is a nonnegative harmonic minorant of G(., xo). We apply Proposition 30.10 twice: once to conclude that, since G(., x o ) is a potential, the greatest harmonic minorant of V , is zero, the second time to draw from this that V, itself is a potential. Since V, is harmonic in R\K, pr = -AV, is carried by K. Q.E.D. Definition 30.5. The potential V, is called the capacity potential of K, the measure p, is called the capacity distribution of K, and the total mass of p,, which is p K ( K ) , is the capacity of K (relative to R). The capacity of K is denoted by W K ) . We leave the proof of the next two assertions as an exercise to the student: (30.32) The capacity potential V, is the greatest Green potential U p of positive Radon measures p, supported by K , such that U p 5 1 in R. (30.33) The capacity V ( K ) is the supremum of the total masses p ( K ) , as p ranges over the set of all positive Radon measures supported by K whose Green potential does not exceed 1 in R. The notion of capacity plays a truly fundamental role in the modern potential theory and that is why we have decided to include its definition and basic properties here (see below), even though we do not give the proofs. As can be guessed, it originated with electrostatics : It is a measure of the maximum charge that can be loaded, at equilibrium, on a conductor (occupying the compact set K ) surrounded by a medium void of charges (here, R\K) and itself bounded by a grounded surface (here dR: “grounded” means that the potential V, on it is maintained at zero). The requirement that V, not exceed one in fl is a “normalizing” condition, enabling us to compare capacities. PROPOSITION 30.12. Let R be a bounded open subset of R”,V ( K )the capacity with respect to R of an arbitrary compact subset of R. The following is true:
(30.34) V ( 0 ) = 0 (30.35) W(Kl u K 2 )
and Kl c K2 =.V(Kl) 5 V ( K 2 ) ;
+ W(Kl n K 2 ) 5 V ( K , ) + V ( K 2 ) ;
(30.36) i f { K j } ( j = 1,2, . . . ) is a decreasing sequence of compact subsets of R, with intersection K, V ( K )= l i m j + + m % ( K j ) .
292
BOUNDARY VALUE PROBLEMS
[Chap. 111
For a proof, see [ H , Theorem 7.201. Based on the properties (30.34) to (30.36) one can extend the capacity (regarded as a function of sets), somewhat in the manner one extends a Radon measure. One can define the inner capacity of an arbitrary subset A of R as the supremum of %(K),when K ranges over the collection of compact sets contained in A , and the outer capacity as the infimum of the inner capacities of all open sets containing A . A set is then said to be capacitable if its inner and outer capacities are equal (they are then called its capacity) and so on. An interesting result, due to H. Cartan, states that there is identity between polar sets (see Exercise 30.6, Definition 30.6) and sets with zero capacity (the surrounding R is here irrelevant).
Exercises ~
30.1. Letfbe a superharmonic function > - 00 in a n open neighborhood of a closed ball 1 x - xo1 S R (xo E R", R > 0). Derive from the remark which follows (30.4) and from the Poisson integral formula (10.36) that the restriction off to the sphere l x - xol = R is integrable (with respect to the area measure). 30.2. Let 4 be a conuex mapping of an interval ]a,b[( - 00 a < b + co) of the real line, into R [convex means that (b(at, P t 2 ) 5 a4(tl) + P(b(t2) if a p = 1, a,p 2 01. Let R be an open subset of R", h a harmonic function in R such that h(x) E ]a, b[ for all x E R. Prove that (b h is subharmonic in R. Show, moreover, that if 4 is monotone increasing, then (b 0 f is subharmonic in R whatever the subharmonic functionfin Q whose image lies in the interval ]a, b[. 30.3. Letfbe a superharmonic function in R",f@(.; r ) its average on the ball of radius r > 0 (centered at the point under consideration). Show that fa(.;r ) is a continuous superharmonic function in R". What is f @ when f i s the fundamental solution of -A, given in (30.7) or (30.9)? For this choice of f, compute the spherical average f ' [see (30.1) and (30.2)]. Derive from the property o f f @above that, given any integer m 2 0, f is the pointwise limit of an increasing sequence of C" superharmonic functions in R". 30.4. Let f be a superharmonic function in an open subset R of R". By using the ball averagesf@(.;r ) defined in (30.2),show thatfis the pointwise limit of an increasing sequence of Cz superharmonic functions in R. 30.5. Let Q be a bounded open subset of R", C(x, x') the Green function of - A in Q. Let pf be the measuref(x) dx in R, wherefEL1(R). Prove that the Green potential of pf [see (30.20)]is a continuous function in Q. [Hint:
+
+
0
Sect. 301
293
SUPERHARMONIC FUNCTIONS. POTENTIALS
Reduce the proof to the case where the support off is contained in a closed ball B c Q and then exploit the properties of the Green function for the ball, (29.34) or (29.35).] 30.6. This exercise concerns the following important notion :
Dejinition 30.6. A subset S of R" is called polar if there is an open neighborhood U of S and a superharmonicfunction f in U which is equal to + 00 on s. Prove that a polar set has (Lebesgue) measure zero and that its intersection with an arbitrary sphere I x - x, I = R has (area) measure zero. 30.7. Let (x, y , z ) denote the variable point in R3 and J , the segment x = y = 0,O 5 z 5 1. Consider the measure p : 4-j: +(O, 0, z ) dz (carried by J ) , and its potential in R3, U p (Definition 30.3). Show that U p (0, 0, z ) = + co if 0 5 z 5 1 and conclude from this that every straight line segment in R", n 2 3, is polar (Definition 30.6). 30.8. Let r be a positive number, and denote by II, the piece of hyperplane (30.37)
(xi(< r
( I = < j s n - I),
xn=O,
in R". Suppose that there were an open neighborhood of II,, U, and a superharmonic function f in U , equal to + 00 on n,. By rotation and translations (which leave superharmonicity invariant), show that there would be a number r', 0 < r' < r, a function F, defined and superharmonic in an open neighborhood of the hypercube {x E R"; 1 x j ( < r', j = 1, . . . , n ) , equal to + co on the boundary of this hypercube. Derive from the minimum principle for superharmonic functions that this is not possible and conclude that 17, cannot be polar (Definition 30.6). 30.9. Let R be the open ball of radius R > 0 in R" ( n 2 2), centered at the origin, K the closed ball of radius R',0 < R' < R, also centered at the origin. Compute the capacity (Definition 30.5) of K with respect to R. What are the capacity potential and distribution of K relative to Q ? What are the answers to the same questions if we now take K to be the sphere 1x1 = R'? 30.10. Prove the following result, due to Hartog: Let {uk) (k = 1, 2, . . .) be a sequence of superharmonic functions in an open set s1 c R",such that:
(i) given any compact subset K of Q, there is a constant M , > 0 such that uk(x) 2 - M , for every index k and every x E K ; (ii) there exists a constant C > 0 such that, whatever x E R, the lower limit of the numbers uk(x), as k + + 00, is 2 - C . Then, given any compact subset K of R and any number index k, such that
(30.38)
vk(x)2
- C - E,
Vk 2 k , ,
V x E K.
E
2 0, there is an
31 Laplace Equation and the Brownian Motion 31.1 The Discrete Case Let ZNdenote the lattice of points x = (XI, . ..,x N) in RN whose coordinates xi are integers (20or
(31.1) p(1, x, y ) = 1/2N
if x and y are neighbors, 294
0 otherwise.
Sect. 311
LAPLACE EQUATION AND THE BROWNIAN MOTION
295
The function B + + p ( n x, , B) defines a positive measure, with total mass one, on ZN:It is indeed a probability. If the distance between x and B exceeds n, we have p(n, x , B) = 0; in other words, y ~ p ( nx,, y ) has compact (finite) support (easy to describe). Let 52 be a subset of ZN.We shall assume R to be connected, which means that any two points in Cl can be joined by a path entirely contained in R. A point of R will be called an interior point if all its neighbors also belong to R, a boundary point if this is not so. A (say, vector-valued) function f in R is said to be harmonic if its value a t any interior point x E R is equal to its average over the set of neighbors of x : (3 1.2) Note that the right-hand side of (31.2) is nothing else but
The operator f HPf is often called the (one-step) shift or averaging operator. A real-valued function f is called superharmonic if f ( x ) 2 Pf(x) at every interior point x ; nonnegative superharmonic functions are called excessive functions in the theory of Markoff processes. It is obvious that the maximum principle holds for complex-valued harmonic functions in Q. Also, any harmonic function in the whole lattice ZN which is bounded must be a constant. We may set
A
(31.4)
=
-2(I - P ) ,
where P is the shift operator (31.3) (and I the identity); A is the discrete Laplacian. By definition, a function f is harmonic if Af = 0. Suppose now that the set R is bounded, and let aR denote its boundary. Let g be a real-valued function defined in an. The Dirichlet problem in the discrete case is the problem of solving (31.5) (31.6)
Au
=0
in the interior points of R, ~
=
gonan.
This problem has a remarkable probabilistic interpretation, and so does its unique solution! Indeed, we may interpret g as a p a y o f : If our particle, starting from a point x in the interior, hits the boundary for the first time at the pointy, we have winnings or losses equal to g ( y ) [whether it is a gain or a loss depends on the sign of g ( y ) ] .The value of the solution u at x will then be the value of the expected winnings (or fosses) at x , as we are now going to show.
296
BOUNDARY VALUE PROBLEMS
[Chap. I11
Let B be an arbitrary subset of the boundary dR. If x is an interior point, we define m,(B) as the probability that the random walk of the particle, starting at x , hits the boundary for the first time at a point y belonging to the set B. It is immediately seen that m, defines a positive measure on dR, with total mass 5 1. Let u s fix B and consider the nonnegative function x Hm,(B); it is defined at the interior points of R but we may extend its definition to boundary points x by setting it equal there to one if x E B , to zero otherwise. It is clear that we have, if x is an interior point,
which shows that m,(B) is harmonic. Then take B =aR: When x is a boundary point, we have m,(aR) = 1, and therefore rn,(dR) is a harmonic function in R, identically equal to one on the boundary. By the maximum principle, it must be identically equal to one throughout R. This implies that the total mass of the measure M, on 8R is one, for every x E R. Now, in order to obtain the solution u of (31.5)-(31.6) for arbitrary g, it suffices to take (3 1.8) Indeed, u is harmonic as a finite sum of harmonic functions, and it takes the value g(y) at a boundary point y , because of the properties of the harmonic measure m, [cf. (29.7)]. Because of our definition of m,({y}) (the probability that the particle first hits the boundary at the point y ) , (31.8) defines u(x) exactly as the mathematical expectation at the point x of the payoff g. We may interpret the values of any function f on ZN as coefficients of a Fourier series. Let us set, indeed, (31.9) where fj is the variable in the N-dimensional torus T N : 8 = (:<el, . . . ON), 0 5 Bj 5 2n for every j , and 8 .y = 0,y’ + . . . + 8,yN. The series (31.9) certainly converges iffhas finite support (and also, of course, under broader conditions). We have then (31.10) We may apply this to the function p(n7x , y ) for fixed x and n. We note that (31.11)
p(0, x, O) = eie‘ ,
Sect. 311
LAPLACE EQUATION A N D THE BROWNIAN MOTION
297
and on the other hand, by the total probability formula, for n 2 1, (31.12) p(n, x, y )
=
1
1
p(1,
X,
x ' M n - 1, x ' , Y ) = -
2N
Ix-x'I = 1
1
~ x - x '=~I
P(n - 1 %x'. Y L
whence 1 p"(n, x, 0) = -
(31.13)
p"(n - I , x', e).
2"x-,q=1
Taking n = 1 and applying (3 I . 1 1) yields, since I x - x ' ( x.8fBjforsomej, 1 S j S N ,
=1
implies x' * 6 =
p( I , X, e) = eie "a@),
(31.14)
'
where we have set (31.15)
If we return to (31.13) and apply induction on n obtain
=
1, 2, . .., we immediately
p(n, X, 6) = e i e .xw(e),
(3 1.16 )
and (31.11 ) shows that this is also true for n = 0. Applying now the Fourier inversion formula (31 .lo) yields
Let us then consider the sum of all the powers W,n = 0, 1, . . . , ( I - @)- ; it is clear that it has a pole at 8 = 0; the order of (1 - 0(0))-' at that pole is 2 , in the sense that (31.18) for 8 - 0 ,
[(el =(0:+...+e,Yi2] 2 3, thefunction (1 - a)-' on the torus
(1 - @ ( 0 ) ) - ' - 2 N 1 8 1 ~
We derive from this that, provided N T Nis integrable, and therefore the function t m
(31.19)
G(z) =
p ( n , z , 0 ) = ( 2 7 1 - ~ e""(1 - @(e))-' d0 T"
n=O
is bounded. If we apply the total probability formula (3 I . 12), we immediately obtain
(3120)
- +AG(x)
=6
(~),
BOUNDARY VALUE PROBLEMS
298
[Chap. I11
where 6 is the analog of the Dirac measure: It is equal to one if x = 0 and to zero otherwise. Thus C is harmonic in the complement of the origin. Let us also denote by G the convolution operator (31.21)
=cG(X-Y)P(Y)
G:p-G*W
Y
(defined, say, on functions-which are the same thing as measures, on ZNwith compact support). Then (3 1.20) can be rewritten in the form (3 1.22)
( I - P)G = I,
the identity operator,
i.e., G = ( I - P)-'. If we look back at (31.15) and (31.19), we see that @(8) is the Fourier transform of the shift operator P, and (1 - @(O))-' is the one of G. The usefulness of (31.19) is that it shows readily that G(z) is a bounded function when N 2 3. One can also easily derive from it that (31.23)
when
1x1
- +oo,
G(x) - c N I x I 2 - "
[we leave the derivation as an exercise for the student: The basic idea, of course, is to exploit (31.18) and to make a change of variable 8 = IzI -'8' in the integral in (31.19)]. The properties just given show that G is the analog of the Newton potential (see Example 30.1). The presence of the factor 3 in (3 1.20) is merely due to our presently adopting a different convention (which, actually, is the one commonly adopted in potential and probability theories). We may of course define the potential in Z" of a measure (i.e., function) with compact support, p, by the formula (3 1.24)
U '
=G
*p
(cf. Definition 30.3).
It is seen at once (cf. Proposition 30.2) that U p is the unique solution of the inhomogeneous Laplace equation (31.25)
-+AU' = p
(in Z")
which tends to zero at infinity. If we rewrite (31.24) in the following fashion: (31.26) we obtain the probabilistic interpretation of the potential. Again, we must regard the given function p ( y ) as a payoff. The nth term, (31.27)
1 Y
x, Y)P(Y),
is the mathematical expectation of winnings .uat the nth trial. Equation (31.26) says therefore that U'(x) is the total mathematical expectation (or, equiva-
Sect. 311
LAPLACE EQUATION AND THE BROWNIAN MOTION
299
lently, the mean value) of the payoff during a random walk of the particle starting at the point x. We conclude the first half of this section by computing the quantity (3 I .27) when p ( y ) = I y l’, the square of the Euclidean norm of y E ZN.The result will give us the mean value of the square of the distance (from the origin) reached by a particle in a random walk started at x. Let us denote it by d’(n, x). For n 2 1, we have (3 1.28) Of course, we have d’(0, x) = 1 x 1.’ By induction on n, we derive from (31.28) that
d’(n, x)
(31.29)
=
+ n.
1x1’
Indeed, (31.28) implies
d’(n, x)
=n -
I
+ (2N)-’
2
IX’(2
[x-x’l= 1
But because of the obvious symmetry, (x-x’l=
I x’ 1’
= N(
I x + el l2
1
I
3- x
- el 1)’
= 2N(
where el is the unit vector along the x’-axis. In $3 1.2, we shall make use of (3 1.29) or rather of its consequence (which is also a direct consequence of the central limit theorem) that the mean value of the distance reached at the nth step by a particle in a random walk is of the order of (for large n).
fi
31.2 The Continuous Case Instead of dealing with the lattice ZNin $3.1, we could have dealt with any one of its homothetical images hZN (the points of hZN are obtained by multiplying those of ZN by h). Then, by going to the limit as h + +0, we should arrive at the concept of a continuous random walk. However in the process of going to the limit, we should not lose sight of the fact that the notion of distance covered during the random walk must not be allowed to dissolve! Specifically, the distance covered during a random walk in ZN must be essentially the same as the one covered in a walk “of same duration” in hZN. But we cannot, really, talk of the distance covered during a random walk, we can only talk of its mean value which, by the remark concluding Thus, if we wish to divide the length of $31.1, we know to be of the order of the steps by h-’, we must divide the “duration” of each step by h - ’ ; i.e.,
&.
300
BOUNDARY VALUE PROBLEMS
[Chap. I11
we must multiply the number of steps effected in a given time by h-’. The switch to a “temporal” terminology is not accidental: the variable n, the number of steps, in $31.1, is now going to be replaced by the time variable t , i.e., the duration of the walk. For reasons of convenience in the notation, we shall divide the duration of each step, not by h-’, but by Nh-’. Thus we return to the total probability formula (3l.12), but now in the lattice hZN and according to the preceding remarks, we rewrite it as follows: (3 1.30) [thus we have written t instead of (n - l)h2/N]. Let us denote by e j the unit vector along the xj-axis ( , j = I , . . . , N ) . We subtract p(t. x, y ) from both members in (31.30) and divide the result by h’/N. We obtain
Now we go to the limit as h + +O. We easily obtain (3 1.32) We see that the transition probability p(t, x, y ) is a solution of the heat equation, in the half-space t > 0 (with +A substituted for A). For fixed t > 0, we must have p(t, x, y ) -+ 0 as I x - y I .+ + 00, since this is true in the discrete case. Furthermore, (3 1.33) p(0, x, y ) = 6(x - y ) ,
the Dirac measure (on the diagonal).
These conditions determine p(t, x, y ) uniquely (see $6.I , also Example 44. I). We have p(t, x, y ) = p ( t , x - y ) , where (31.34)
p(t, z) = ( 2 n ~ ) - ”exp( ~ - 1 zI2/2t).
Here t > 0; when t -,+0, p(t, z ) converges (in the distributions, or in the measures sense) to the Dirac measure at the origin 6. If B is a Lebesgue measurable subset of R”, the probability that the particle, presently (i-e., at some given time which we take to be the origin) at the position x, will be, at time t > 0, at some point belonging to B is equal to (31.35)
Sect. 311
LAPLACE EQUATION A N D THE BROWNIAN MOTION
30 1
A probability distribution of the kind (3 1.34) is called normal or Gaussian. It is spherically symmetric, indicating the perfect isotropy of the continuous random walk (which is called Brownian motion. in honor of the botanist Robert Brown who observed the rapid oscillatory motion” of pollen grains suspended in water, in 1827). The graph of p ( t , z ) (for fixed r > 0) is the usual bell-shaped surface. with maximum at z = 0. The fact that, for arbitrary x, the probability distribution is equal to p ( t , x - y ) emphasizes the fact that the motion is translation-inuariant. Note also that, according to (31.35), past events have no influence on the future motion. One notable difference with the discrete case is that the probability that the particle finds itself at any given individual point. at some time t > 0, is always zero. The particle, presently at the point x, will at random pick up a path-according to the law (31.35). Paths are simply continuous mappings ofR, = ( t E R ; t 2 0 ) into R N ; with every “starting point” x we may associate a probability distribution. i.e., a positive measure with total mass one, on the space of all paths originating at x, but we do not go into this here. “
The probabilistic approach to the Dirichlet problem can now easily be guessed. Let R be a bounded open set in RN,dR its boundary. We introduce the time offirst exit T of a particle starting at x E R : T is the minimum of all numbers t such that x(t) E RN\R.Next we define the escape probability m,: It is a measure on the boundary dR. To every (say, Borel) subset A of 22 it assigns the number which is the probability that X ( T ) E A . If now g is an arbitrary continuous function on dR, and if we set (3 1.36) we shall have (31.37)
AM = O
(31.38)
u =g
i nR, on dR.
A few facts have to be established and the meaning of (31.38) must be made clear. First of all, we must show that the time of escape is finite, or rather (more in keeping with the standpoint adopted here) that the particle escapes from the set R with probability I . Actually, it suffices to prove that the particle escapes from an arbitrary ball, B R ( x ) ,centered at x, with probability 1. Let us show that the probability that the particle remains inside the ball forever is zero. Suppose it remains inside BR(x) until time n E Z , ; the increments x ( j ) - x ( j - I ) , j = I , . . . , n, must all have norm 5 2 R . Because of the probability law presiding over the motion, we have
P[I x ( j ) - x ( j - I ) \
s 2R] = (2n)-N’2J,
zI < 2 R
e x p ( - I ~ 1 ~ / 2 ) d z = u<~1,
302
[Chap. 111
BOUNDARY VALUE PROBLEMS
and by the formula of total probability,
n n
P,[r>n]I
P[lx(j)-x(j-l)l
52R]=a;
j= I
+
which converges to zero as n + 03. (We have denoted by P [el the probability of the event e.) A very simple probabilistic argument shows that the function (31.36) is harmonic. Let B,(x) be any open ball, centered at x , whose closure is contained in R. Before hitting the boundary the particle starting at x must hit the boundary S,(x) of the ball. But then, if we want to compute the probability of hitting (at the time of first exit from R, T ) some subset of aR, we may as well forget what happened before the time of first exit from the ball B,(x), 7 ’ . In other words, we may assume that the particle starts at time 7’ from a point of S,(x). But we must specify the probability that the particle finds itself (at the time 7 ’ ) in a given subset of S,(x), A’. This must be necessarily equal to the probability that the particle hit A’ at T’ when coming from x. The latter cannot be anything but uniform on S,(x), by virtue of the symmetry of the Gaussian law. In mathematical terms, the analog of the measure m, for B,(x) is the area measure on the sphere S,(x), normalized, i.e., divided by the total area of S,(x). What we have just said means exactly that (31.39)
m,(A) = I S n - ’ [ -*
I
s,,-
rnx+,*(A) d i
[cf. (30.1)],
I
where A is a Bore1 subset of aR. In other words, the measure m, is harmonic (its measurability with respect to x is easy to verify) and this implies (31.37). Let us now look at the boundary condition (3 1.38). We know, by the considerations of Sect. 29, which led us to the notion of regular point (Definition 29.4), that things are not simple. Suppose, for instance, that R is the complement of the origin in the unit disk in the plane R2.Then it can be shown, whatever the point x in R, that is, however near x is to 0, that the probability that the particle, starting from x , will exit for the first time from R at the origin is equal to zero. The notion of regularity of a boundary point has the following probabilistic interpretation: If we denote by 7 the time o f j r s t exit from R of a particle starting at x , a point xo of dR is regular if and only if (31.40)
lim P,[t > h]
= 0,
for every h > 0.
x+xo
(in standard terminology, t converges to zero in probability as x + xo). We shall not prove this assertion (the proof is not very difficult and the student might try to find it). It is a consequence of the following assertion: xo is regular if and only if (31.41)
Vr > 0,
lim P,[x(T) E aR n Br(xo)] = 1. x-xo
Sect. 311
LAPLACE EQUATION AND THE BROWNIAN MOTION
303
Exercises N
31.1. Let C(z) be the potential in Z N given by (31.19). Show that, when = 2,
but that G(0) < + co if N 2 3. [Hint: In the case N = 2, observe that p(2k + 1, 0, 0) = 0 for all k = 0, 1, . . . .] 31.2. This exercise deals with the important concept introduced by Definition 31.1. A subset B of ZN is called recurrent iJ given any point x of ZN, the probability that a particle, starting at x, eventually hits B is equal to I . If x E ZN,denote by nB(x)the probability that a particle, starting at x, hits B at least once during its random walk, by &x) the probability that it hits B infinitely often. Prove the following assertion : (31.42) i f B is recurrent, nz(x) = 1 ; i f B is nonrecurrent, rc*,(x) = 0. [Hint: Express n,(x) as the sum x:zoqB(n, x , y ) , where qs(n, x, y ) is the probability that the particle, starting at x , hits the set B for the first time on the nth step, at the point y E B , and show that n f ( x ) = nB(x)nt(x).] 31.3. Prove that if N 5 2, every subset of ZNis recurrent (Definition 31. l), whereas, if N 2 3, every bounded subset ofZNis nonrecurrent. Prove that the “plane” x 3 = O in Z3is recurrent. 31.4. Prove the Riesz decomposition formula [see (30.27)]in the discrete case. That is, prove that given any excessive function f in ZN (i.e., any nonnegative superharmonic function), we have (3 1.43)
f
=
up4- f * ,
where U p = G * p is the potential of p = -$AA and f * is a harmonic function in ZN.Show thatf* is the greatest harmonic minorant off. 31.5. Let B E ZN,and nB, n: be the functions so denoted in Exercise 3 1.2. Prove that nB is excessive and that nt is the greatest harmonic minorant of IT*.Prove that p B = - +AnBis supported by B and that B is nonrecurrent if and only if 7~~ is a potential. 31.6. Suppose that B is nonrecurrent (see Definition 31.1). Prove that p B = -$AnB is the greatest charge carried by B whose potential is I1. [Then the total mass %(B) = p B ( y )is called the (discrete) capacity of B.1 31.7. Prove that, if N 1 3 , the capacity of a set B consisting of k points (k < + co) is equal to k/G(O), G(z) being the potential (31.19), and that the capacity of any infinite nonrecurrent set is infinite.
xyeB
304
BOUNDARY VALUE PROBLEMS
[Chap. 111
31.8. Let R be a bounded open subset of RN. We shall denote by a the first strictly positive time of exit from 51 of a particle starting at x. (Note that if XECR, the first time of exit z is obviously zero, but if x belongs to the boundary of R, the particle might reenter R at once and then reexit from R at a later time.) Let h be an arbitrary number greater than zero. Show that, given any point x, (31.44)
P,[z > h] 5 P,[a > h].
Denote then by A(to) the event that, in the time interval t o 5 t 5 h ( t o > 0), the trajectory of the particle lies entirely in R, and denote by P,[A(t,)] the probability of A(?,) for a particle starting at x. Prove that P,[A(t,)], for fixed t o , 0 < to < h, is a continuous function of x in RN. ( H i n t : Prove that (31.45) Show that (3 1.46)
PX[o> h] = lim P,[A(t,)], bL+O
and conclude from this that P,[a > h] is upper semicontinuous with respect to x , and that, by using the necessary and sufficient condition (3 1.40) for the regularity of xo E dR, together with (3 1.44), xo is regular if Px,[a > 01 = 0.
(3 I .47)
31.9. Use the same notation as in Exercise 31.8. Let h, t , be two numbers such that 0 < h < t, and let now A ( ? , ) denote the event that, in the time interval h 2 t 5 t , , the trajectory of the particle lies entirely inside R. Prove that, if x is any point in R”, (3 1.48)
Px[a> 0, 4 f l ) I = P,[a > OlP,[A(tl)l,
i.e., for h > 0, the events a > 0 and A(?,) are independent (observe that the occurrence of a > 0 is decided in an infinitely small time interval 0 < t < E ) . Derive from (31.48) that (31.49)
P,[O
> 01 = (P,[a > 0])2,
whence the zero-one law: P,[a > 01 is either equal to zero or to one. 31.10. Let R be a bounded open subset of RN, xo a point of its boundary. Suppose that there is a truncated spherical cone r, contained in RN\R, with vertex at xo . By combining the sufficient condition of regularity (31.47) with the zero-one law (31.49), show that if xo were irregular, there would be to > 0 such that the probability that the trajectory lie entirely inside R during the time interval 0 < t < t o would be 1. Thus for 0 < t < t o , the trajectory
Sect. 311
LAPLACE EQUATION A N D THE BROWNIAN MOTION
305
would certainly lie outside the cone r. Use then the rotation invariance of the Gaussian law to conclude that, during the open interval 0 < t < t o , the trajectory would lie outside an open ball, with strictly positive radius, centered at x o , and derive from this that xo must be regular, contrary to the assumption. 31.11. Derive from the Gaussian law the mean value of the distance covered, during a time interval 0 5 t 5 T, in a continuous random walk.
32 Dirichlet Problems in the Plane. Conformal Mappings In this section we are concerned with the relationship between harmonic functions of two variables (x, y) in the plane R2 (or in open subsets of R2)and analytic functions of the complex variable z = x + iy ( i = The relationship is based on the fact that
n).
(32.1) where d / d z = +(a/ax + i d/dy) is the Cauchy-Riemann operator (cf. Sect. 5 ) and d/az its “ complex conjugate.” The first consequence of (32.1) is that the Laplace equation is invariant under a large class of transformations of the plane, well beyond the orthogonal group in two variables. For if Z H Z = Z(z)is a holomorphism of an open subset n of R2 onto another such set R‘ [Z(z)is holomorphic in R,Z’(z)does not vanish at any point of R, and the mapping Z H Z is a bijection of i2 onto Q‘], then ~ ( Z ) H h(Z(z))constitutes a bijection (and in fact, an isomorphism for the natural topologies) of the space of harmonic functions in 51’ onto that of harmonic functions in R. This is simply because
(32.2)
=
P’I2A X , U >
where X = Re Z, Y = Im Z . The fact that the Laplace equation in the plane is invariant under holomorphisms, or conformal mappings as they are more often called, enables us to transfer the Dirichlet problem from an open set onto one of its biholomorphic images, preferably a simpler set. Perhaps the simplest of all bounded open subsets of C is the unit disk ( z ; J z I < 11, which we shall denote by 9. If Q is a bounded domain in C (that is, a bounded open connected and simply connected set), we know, by the Riemann mapping theorem, that there is 306
Sect. 321
PROBLEMS IN THE PLANE. CONFORMAL MAPPINGS
307
a holomorphism Z H w = w(z) of R onto 9, Moreover, given any point zo of R, we may select the function w so as to have w(zo) =O. Let us furthermore assume that the boundary dR is a C' curve and that the mapping Z H w extends as a C' homeomorphism (meaning that its derivative exists, is continuous, and nowhere vanishes) of aR onto the unit circumference 3 9 . It is then easy to obtain an expression of the harmonic measure on 8R. We are here looking at the homogeneous Dirichlet problem :
(32.3)
Au = O
(32.4)
u=g
inn,
on 82,
under reasonable regularity assumptions on the boundary datum g. Let us set = w(z). Thus f is a function of w, I wI = 1. Consider then the solution u of the Dirichlet problem
f(w) = g(z) with w (32.5)
Av = O
in 9,
(32.6)
u=f
ona9.
The mean value formula (10.17) yields
Let us then set (32.7) recalling that the choice of w depends on that of zo . We obtain
(32.8)
In particular, we may apply this formula when R is the unit disk itself, and Z H w is a holomorphism of 9 onto itself, extending into a C' homeomorphism of the unit circumference onto itself, and mapping zo onto zero. It is standard to take (32.9)
w
= (2 - zJ(1
- ZOZ),
and an easy computation shows that if I z 1
(32.lo)
= 1,
308
BOUNDARY VALUE PROBLEMS
[Chap. 111
If we put this into (32.8), we obtain the Poisson formula (10.28) in the present particular circumstances :
(32.1 1)
1
u ( z o ) = 2 7 ~ g(eie)
jo
2=
(1 - r-6) d0 1 - 2r, cos(0 - 0,)
+ r;’ zo = ro exp(
ie,),
r0 < 1.
We continue to assume that R is a bounded domain. Iffis a holomorphic function in R, it is harmonic and therefore so are its real and imaginary parts. Conversely, let u be a real-valued harmonic function in Q. The differential form
is closed in R, and therefore (since R is simply connected) i t is equal to the differential dv of a function in 0.We may take
=I (X,Y)
u(x, Y>
w,
(X0,YO)
where ( x o , y o ) is an arbitrary (but fixed) point of Q, and the integral is performed over any reasonably smooth (e.g., piecewise linear) path in R, joining ( x o , y o ) to ( x , y ) (since w is closed, the choice of the path is immaterial). Since w = dv, we have the Cauchy-Riemann equations:
(32.12)
a0 -_ - - au ,
ax
ay
au
au - --ay ax’
which imply that u also is harmonic in R. It is customary to say that u and u are conjugate harmonic functions. Any other function satisfying the relations (32.12) differs from u by a constant; and this of course corresponds to a change in the choice of the point ( x o , y o ) in the integral defining u. An interesting remark is that the relations (32.12) imply
(32.13)
(grad u) * (grad u )
= 0.
This orthogonality has an important interpretation. Suppose that R represents an isotropic (and uniform) medium and that u is the temperature inside that medium; suppose that the values of u are maintained constant (in time) at each point of the boundary of R. Then the curves u = const in R are the isothermic lines. It should be noted that they are not necessarily true “curves”: for instance, if the value of the temperature at the boundary is constant, say equal to one at every point of aR, then u = 1 in R, and there is only one isothermic line, identical with the set R itself. But let us assume, for the sake of the argument, that the isothermic lines form a bona fide one-
Sect. 321
PROBLEMS IN THE PLANE. CONFORMAL MAPPINGS
309
parameter family of smooth curves. Then the orthogonal trajectories of these curves are the level curves of v : When u is the temperature, these curves would be the lines of heat f l o w : the vector grad u is tangent to the curves v = const and grad u is proportional to the flux. I t is also clear that if we perform a conformal mapping of R onto another domain in C, the whole configuration will be conformally transformed: The isothermic lines will be mapped into isothermic lines and the lines of heat flow into lines of heat flow, i n the new domain. Let f be a continuous function in the closed unit disk G, holomorphic in the interior 9, and setf = u + iv. Noting that, if IzI = 1,
we have, by (32.8) and (32.10), u(zo) =
1 -.$ 2x1
z
IzI=l
+ zo
u(z) Re[-) z - z o
.
dz z --?
and therefore
since the integral at the right is a harmonic conjugate of u(zo) (both being regarded as functions of zo in R). We conclude that
But it is clear that we may apply this formula with --substituted v = Re( - if) for u, whence
f(zo) =
4,
1
zl=l
forfand
z + zo dz iv(z) -- + C'. z-zo z
By adding these two formulas and modifying the definition of the constant C we obtain 1
4
2f(zo) = 27[i ~
z
+ zo dz + c.
f ( z ) -~ l z = z0~ z
Taking zo = O in the preceding expression and applying the mean value theorem shows that
310
[Chap. 111
BOUNDARY VALUE PROBLEMS
hence that
which shows that Cauchy's formula in the unit disk [cf. (5.13)] is a consequence of the Poisson formula (32.1 1). This is not unexpected and it should be noted that the Poisson formula is a consequence of the mean value theorem, itself an immediate consequence of Cauchy's formula. We continue to consider the Dirichlet problem (32.3)-(32.4) in the case where R = 9,the unit disk. We assume that the boundary values g are sufficiently regular, which could mean here g E L z , or g E Lp(l 5 p 6 + 00). As a matter of fact, most of what we are now going to say remains valid if we only assume that g is a distribution on the unit circumference 8 9 . The important thing, in what follows, is that g should have a Fourier expansion: +m
(32.14)
g(e") = C - gmeim". m=-w
In the case where all the Fourier coefficients gmvanish for m < 0 the solution of (32.3)-(32.4) (in some suitably generalized sense) is readily obtained: It is a holomorphic function of z, I z I < 1 ; its Taylor expansion is simply +m
(32.15) If all the coefficients gm for m > 0 were to vanish, the solution would be given by
c
--m
(32.16)
m=
gmrm.
0
In the general case, therefore, we may take (32.17)
u(x, y ) = U + ( 4
+ u - ( 4 - go
9
where u+ and u - are the functions (32.15) and (32.16), respectively. As one easily checks by means of the Cauchy formula applied to u+ and of the antiCauchy formula applied to u - , the solution u coincides (under the appropriate assumptions on g ) with that given by the Poisson formula. It should also be noted that the Poisson kernel in (32.11) can be regarded as a convolution kernel on the unit circumference (identified with the group of rotations in the plane). For ro < 1, it is Cm and, as a consequence, we have the right to apply it to any distribution on the unit circumference: The result is then " the solution" of the homogeneous Dirichlet problem in the unit disk, having that distribution as boundary value. One can even show that
Sect. 321
PROBLEMS IN THE PLANE. CONFORMAL MAPPINGS
31 1
the solution does indeed “ take” the boundary values: Indeed, u is a smooth function of ro > 0, valued in the space of distributions on the circumference I z ( = r,; as ro /* I, u(roe“), regarded as a distribution in the angular variable 8, converges to g(eie).This assertion can be checked at once, for instance on the expression of u given by (32.17) (using Fourier series). Finally we note that the Fourier series method can be used in any annulus R, < I z ( < R , (0 < R, < R 2 ) (see Exercise 32.2). Many “doubly” connected open sets can be mapped onto such an annulus by means of a holomorphism; the solution of the Dirichlet problem in the annulus can then be transferred to such an open set. One may want to solve the Dirichlet problem in an unbounded open set, for instance in the exterior of the unit circumference. It is then necessary to impose “ conditions at infinity,” for instance the rate of decay of the solution one seeks, and therein lies a certain amount of ambiguity (cf. Exercises 32.4 to 32.7).
Exercises
32.1. Suppose that the function of 8,0 5 8 6 277, g(eie)is square-integrable and let u be the function in the open unit disk given by the Poisson formula (32.11). Prove that, as ro > 0 converges to 1,
JO
[ H i n t : Use the Fourier series expression of u.] 32.2. Let gj be a distribution on the circumference IzI = r j ( j = 1, 2; 0 < rl < r2). By the Fourier series method, show that there is a harmonic function u(r, 8) in the annulus rl < r < r, having the following property: Given any C“ function 4(eie) on the unit circumference,
as r -,r j ( j = 1, 2). 32.3. Prove that the harmonic function u in Exercise 32.2 is unique. [ H i n t : Show that, in the case where the boundary data gl,g2 vanish identically, every convolution ~;.u(r, do - w(ei0) do,
4 E c~m([o, 241,
is also a solution, and that it is continuous up to the boundary.]
312
[Chap. 111
BOUNDARY VALUE PROBLEMS
32.4. Let g(eie)be a continuous function on the unit circumference. What is the condition on g in order that there be a holomorphic function f(z) in the region I zI > 1, equal to g when I z I = 1 ? Show that, iff exists, it is not unique. Show that there is only one such holomorphic functionfif we impose the additional condition that it vanish at infinity. Derive from this that, for an arbitrary continuous function g(eie), there is a unique harmonic function in the region 1 zI > 1, equal to g when 1 zI = 1 , which tends to zero at infinity. 32.5. Let g(eie) be an arbitrary continuous function on the unit circumference, u(x, y ) the unique harmonic function in the region x2 + y z > 1, equal to g when xz + y 2 = 1, and tending to zero at infinity. Show that u can be expressed as a convolution on the unit circumference, similar to the Poisson formula (32.11) (but now with ro > 1). Compute the kernel acting on g(eie) in this convolution. [Hint: Perform an inversion on the variable zo in the Poisson formula.] 32.6. Let g(t) denote an arbitrary continuous function, with compact support, in the real line. Let us set
g(z) = (2ni)-' [ + " g ( t ) ( t - z)-' dt, u(x, y ) = g(z) - g(L)
(z =x
+ iy,
i=
4-1).
Writing
show that the kernel K ( x , y, t ) (often called the Poisson kernel relative to the upper half-plane) can be regarded as a distribution in ( x , t ) depending smoothly (i.e., in a C" fashion) on the parameter y > 0 and satisfying: lim K ( x , y , t ) = 6(x
(32.18)
- t)
(6, the Dirac measure).
y++o
Prove that u is the unique harmonic function in the upper half-plane which satisfies both
(32.19) lim u(x, y ) = g(x), uniformly with respect to x in compact subsets of y++o
the real line;
and
(32.20)
lim u(x, y )
= 0,
z-m
but that it is not the only one which satisfies (32.19) alone.
Sect. 321
PROBLEMS IN THE PLANE. CONFORMAL MAPPINGS
313
32.7. Let g(t) be a continuous function on the real line, periodic with period 2rc. Show that there is a unique harmonic function u in the open upper half-plane, satisfying (32.21)
lim u ( x , p)
= g(x),
y-+o
S,
2n
lim u ( x , p) = (27c1-1
(32.22)
Y++a;
g ( t ) nt,
where the limits are uniform with respect to x in R’. Can u be expressed in terms of g by means of the Poisson kernel for the upper half-plane (see Exercise 32.6)? 32.8. Consider the complex differential equation (32.23)
dw dz
- = (1 - ~ ~ ) - ” ~ ( 1k 2 z 2 ) - ” 2
(0 < k < I),
where the radicals are assumed to be equal to + 1 when z = 0. Show that there is a unique holomorphic solution w(z) of (32.23) for Re z > 0, equal to zero when z = 0 (it is clear that w will be holomorphic in a neighborhood of the origin). Show that ZH w ( z ) is a biholomorphic mapping of the open upper half-plane onto the rectangle (32.24)
0 < Re z < K,
0 < Im z < K ’ ,
where K and K ’ are two real constants, which the student should prove to be equal to w( 1) and to ( I / i ) w ( I / k ) - w( I). By using the inverse mapping W H z = sn(w) (sn is Jacobi’s elliptic function) and the fact that (32.25)
z-a
ZH u = eieZ-ti
(0 5 6 5 2z,
c(
E
C, Im a > 0)
is a conformal mapping of the half-plane Im z > 0 onto the open unit disk, give the expression of the harmonic measure relative to the rectangle (32.24). 32.9. Solve the weak Dirichlet problem in the annulus 0 < IzI < 1 by means of Fourier series. Explain what you find and compare with what one seeks in the classical Dirichlet problem. 32.10. Let u and u be conjugate harmonics in the unit disk IzI < 1. Writing z = re“, consider the Fourier expansions of u and of u with respect to 8 and state the relation between the Fourier coefficients u,,,(r) and u,(r) of u and u, respectively (rn E Z).
33 Approximation of Harmonic Functions by Harmonic Polynomials in Three Space. Spherical.Harmonics
Let us look provisionally at the Laplace equation in two independent variables, x and y, and try to determine all polynomials P(x, y) which are solutions of it. Let us write P(x, y) = P,(x, y ) + P,,-l(x, y ) + ..-,where Pd(x, y) is homogeneous of degree d. Since the Pj(x, y) are linearly independent, each one of them must be a solution of the Laplace equation and we may therefore assume that P = P , is homogeneous of degree m. By going to polar coordinates r, 8, we may write ...
1- m eveive. ,
P,(x, y ) = r m
(33.1)
v=
But clearly the exponentials exp(iv0) are also linearly independent, and thereforeeachtermrmeive must be harmonic. By using the expression of the Laplace operator in polar coordinates, we see that we must have (33.2) which implies at once m2 = vz, i.e., m = f v . In other words, (33.3) P,(x, y ) = A,?? + B F . Let now f ( x , y) be an arbitrary harmonic function in the disk r < R, say continuous up to the boundary. By using the Fourier series of its boundary value, we have seen that it can be represented by a Fourier series whose coefficients are multiples of powers of r, specifically: +oo
(33.4)
f ( x , y) =
C
CvrVeiVe = Co + f + ( z )+ f - ( z ) ,
v=-m
314
Sect. 331
SPHERICAL HARMONICS IN THREE SPACE
315
where f and f - are both holomorphic in the disk r < R and vanish at the origin. Thus we see that the harmonic polynomials are dense in the space of harmonic functions in the disk (this remains true if the disk is replaced by any domain in R2). This approximation property has many applications. Foremost among them is the possibility of solving the Dirichlet problem in disks and annuli, as shown in Sect. 32, by exploiting the series expansions (33.4). It is therefore natural to ask whether a similar approximation property is valid in higher dimensions and whether it leads to reasonably simple series representations which would enable us to solve " explicitly " the Dirichlet problem in balls (or in "shells"). It should be pointed out, however, that we cannot expect the same rewards in dimension 2 3 as those reaped in the plane: Precisely, we shall not have at our disposal the analog of the Riemann mapping theorem, which tells us that we may transfer the Dirichlet problem from any bounded domain in RZ to the unit disk, and therefore we shall not be able to transfer the Dirichlet problem to the ball (from even simple open subsets of R", n 2 3). If R is an open subset of R" whose complement has no bounded connected components, it is true that every harmonic function in Q is the limit (say for the uniform convergence on compact subsets of Q) of harmonic polynomials. We shall not prove this theorem in the present book. We shall only look at the case of dimension 3 and show that relatively simple expansions, akin to (33.4), indeed exist. Furthermore, they exhibit spherical symmetries which make them very handy if one desires to solve the Dirichlet problem in balls. As in the case of two variables, we shall look at a homogeneous polynomial of degree m,here P,,,(x,y , z), and switch to spherical coordinates r, 4, 0 (4 is the longitude, 8 the colatitude) : +
(33.5)
x = r cos
4 sin 8,
y = r sin 4 sin 8,
z = r cos 0.
An easy computation (cf. Exercise 33.1) shows that, in these coordinates,
(33.6)
a + 2r - +(sin 8 ) - 2 A + , B r ar2 ar a2
r2A =r2 -
where
(33.7)
a + (sin 8)' ae + (cos 8) sin e -. ae a2
Here takes place an automatic separation of variables, since P,,,(x, y , z ) = ~"'P,,,(cos 4 sin 8, sin 4 sin 8, cos 0 ) ; hence we must have
(33.8) (33.9)
(r2
-$+
2r "r"' dr
=k P ,
+ k sin2 e)h(+, 0 ) = 0,
316
BOUNDARY VALUE PROBLEMS
[Chap. I11
representing P,,,(cos 4 sin 0. sin # sin 8, cos 8) as a linear combination of linearly independent functions h ( 4 , 0). The first equation immediately gives k
(33.10)
= m(m
+ 1).
On the other hand, because of the linear independence of the exponentials exp[i(c$ + PO)] we may also use separation of variables in (33.9). This gives, if we write h(4, 0) = v(4)u(0),
(33.1 1) (33.12)
U”
d2u
(sin 0)’ do2
+ k,v = 0,
du + (cos O)(sin 0) + k(sin Q2u - k , u = 0, d0
and (33.1 I ) shows at once that k , = n2, where n is an integer (positive, negative, or zero). Thus (33.13) = A ~ W+ J B~-W. We must now concentrate on (33.12). It is convenient to make the change of variables t = cos 0. It transforms (33.12) into
d -((I dt
(33.14) When n
= 0,
- t2)
z] + (m(m + 1) - u =o. 1 - t2
du
(33.14) reduces to the Legendre equation,
(33.15) The solutions of (33.15) are called the Legendre functions. Here we shall only consider the case of nz (and also n ) taking nonnegative integral values, but the theory of these equations has been developed for all complex values of m and n. As we shall see later, the interesting solutions of (33.14) can be expressed in terms of the (interesting) solutions of (33.15). We shall therefore look first at the latter. Of course there is no need to restrict the variation of t to the interval [ - 1, + I ] [although we were led to (33.14) by setting t = cos 8 in (33.12)]. In order to solve (33.15) it is natural to try a power series expansion +m . u(t) =
1ujtJ.
j=O
This leads to the following recursion formula for the coefficients u j :
(33.16)
(j
+ 2)(j + 1 ) u j + 2 = -(m
-j)(m
+j + I ) u j .
Formula (33.16) shows at once the existence of a polynomial solution to (33.15) : If rn is even, we take all uj with j odd equal to zero, and all uj with j even > m also equal to zero; (33.16) enables us then to determine the remaining
Sect. 331
317
SPHERICAL HARMONICS IN THREE SPACE
u j withJ even in terms of the first one of them. The same argument holds when m is odd-except that one must exchange even for odd throughout. When m = 0, this polynomial solution is a constant. For m = 1, it is of the form Ct with C an arbitrary constant. For any m = 0, 1, . . . , the polynomial solutions are all multiples of one of them, i.e.. span a one-dimensional linear space. I t is not difficult to find a generator of this linear space. Let us set u = D"v ( D = d/dt) and integrate (33.15) from 0 to t . We get
(33.17) ( I - t 2 ) D m + ' v+ m(m + I)D"-'v= D"[(I - t2)Do+ Zmtv] = O . We obtain at once a solution 11 of (33.17) by taking v polynomial solutions of (33.15) are all of the form (33.18)
U(t) =
c
= (1 -
r2)".
Thus the
(3"
(1 - t2)".
-
Integration by parts shows at once that
(1-t2)"dr=0
(33.19)
if h < m ,
which implies that any two functions (33.18), corresponding to two different values of m, are necessarily orthogonal in L2(] - 1, + 1 [). On the other hand,
Dejnition 33.1. The polynomials
are called Legendre's polynomials. We have shown that (33.21)
(m
+ ))'/'(m' + + ) ' I 2
1
1
Pm(t)Pm.(t) dr = drn,",
-1
(Kronecker's index).
318
[Chap. 111
BOUNDARY VALUE PROBLEMS
THEOREM 33.1. The sequence of functions
(m + $ ) 1 ’ 2 f m ( t ) ,
(33.22)
m
= 0, 1,
...,
forms a Hilbert basis ( i x . , a complete orthonormal system) in L 2 ( ] - 1, -t1 [). ProoJ In view of (33.21) it suffices to prove that the Legendre polynomials span a dense linear subspace of L 2 ( ]- I , + I [). As a matter of fact, Po(t), . .., P,(t) constitutes a basis of the linear space of polynomials in one variable of degree S m . The proof of the latter is by induction on m, using the observation that (2m) ! Q.E.D. is of degree m - 1. Pm(r)- 2-” -tm (m!)2
We know that the solutions of (33.19, in the open interval two-dimensional linear space. For m = 0 we have
1tI
< 1, form a
du
(33.23)
dt
The constant solutions of (33.23) are multiples of Po(?) = 1. The “other” solutions are multiples of 1
I+r I-?
Qo(t) = - log -.
(33.24)
2
It is easy to show that the general solution of (33.19, for any m
= 0, 1,
. . . , is
1 I + t + ... + a m - l P m - l ( t ) +} Afm(t)-log+ BPm(t), 2 I-t
(33.25) A{aoPo(t)
where the aI)s ( j = 0, . . . , m - 1) are well-determined numbers and A , B are arbitrary constants. The presence of the logarithmic term excludes that solutions (33.25) with A # 0 enter in the expression of the spherical harmonics (which we are seeking) as we know’ that the latter are polynomials with respect to t = cos 0 and sin 0 = (1 - t 2 ) ’ / ’ . We shall therefore restrict ourselves to multiples of the Legendre polynomials. With this in mind we look now at Eq. (33.14) for n # 0. A change of unknown function u(t) = (1 - t2>”’2u(t)
transforms it into (33.26)
(1 - t2)u” - 2(n
+ 1)tu’ + [m(m + 1) - n(n + I)]u
= 0.
On the other hand, if we differentiate n times the Legendre equation (33.15) for h, we obtain precisely (33.26) for u = h(”).In view of the preceding considera-
Sect. 331
319
SPHERICAL HARMONICS IN THREE SPACE
tions about the Legendre equation, we see that we obtain a solution of (33.14) for all n = 0, 1, 2, . . . , by taking u equal to
(33.27) or any scalar multiple of this function. The functions (33.27) are often called the associatedLegendre functions (of the first kind) and denoted by P$(t). When n is odd, the square root in (33.27) must be understood as the positive square root (for I t I < 1). According to (33.20) we have
(33.28) P;(t)
(- I ) n + n 2"m !
(m, n = 0,1, . . .>.
= -(1 - t 2 ~ ~ ~ ( ~ ) ' " + ' ' t (2 )~m
Observe that Pi E 0 if n > m.
Q.E.D.
The proof of the next statement is very similar to that of Theorem 33. I and we leave it t o the reader:
THEOREM 33.2. For each fixed n = 0, 1, (33.29)
( m + $)'I2
[-(m ](m -+
. . . , the sequence offunctions
n)!
P$(t),
n)!
forms a Hilbert basis in L 2 ( ]- I ,
m
= n,
n + 1, . . . ,
+ I [).
We return now to our original problem, that of determining the harmonic polynomials in three variables. According to (33.28) we have
(33.30)
P;(COS e) =
C
crk,,(sin
B)"-lk
.
2kjm-n
We derive from this :
(33.31) rmeindP$(cos 0) =
C
2kSm-n
a:. k ( x
+ iy)nzm-"-2k(x2 + y 2 + Z2)k,
which means that the left-hand side is a homogeneous polynomial of degree m in the variables x. y , z. We have shown that it is harmonic. Since n can only take the values 0. I , . . . , m, and since we can obviously substitute - 4 for $C (which is equivalent to replacing x + iy by x - iy) in (33.31), we have thus obtained 2m + 1 linearly independent harmonic polynomials which are homogeneous of degree m (in x , y , z). the space of homogeneous polynomials of degree Let ,Pio,,denote ,, d 2 0 in N variables. It has dimension ( N + d - l)!/(N - l)! d ! . We identify it with its own dual by means of the duality bracket
(33.32)
320
[Chap. 111
BOUNDARY VALUE PROBLEMS
where ci = (E,, . ..,uN) is an N-tuple of nonnegative integers. Let us denote by d , j = 1, . . . , N, the variables and set D j = d/axi. Then D j is a linear map of N . We have 9io,, into Pi!,;:, N ; its transpose ' D j maps Pi!,::. into 9'iOm,
which shows that the transpose of d/axj is multiplication by xj. Now let P ( D ) = P(D,, . . ., D N )be a homogeneous polynomial in the D j , of degree m. We may state:
LEMMA 33.1.
The transpose of the linear mapping
(33.33)
p(D)
9'fom,
N
is the multiplication of elements of 9 : ; :
s&:
N
by P(x).
COROLLARY 33.1. The mapping (33.33) is surjective and the dimension of its kernel is equal to (N+d-l)! -(N+d-m-I)! (N-l)!d! (N-l)!(d-m)!*
(33.34)
I f we apply this to the Laplace operator (thus m see that the following is true:
= 2) in
N
= 3 variables,
we
THEOREM 33.3. The linear space of homogeneous harmonic polynomials of degree m 2 0 in three variables x, y , z, has dimension exactly equal to 2m + 1. The polynomials (33.31), together with their complex conjugates,form a linear basis of this space. It is also clear how we can exploit Theorem 33.3 and the properties established earlier, for the Legendre functions, in order to solve the Dirichlet problem in the open ball
B,
= {(x,
y , Z ) E R3; X'
+ y 2 + z2 < R 2 }
( R > 0).
As a matter of fact, because of radial homogeneity, we may as well reason in the case R = 1 ; B, is then the unit ball, B3, and its surface is the unit sphere, S 2 . Suppose we are given any square-integrable function (for the Lesbegue measure on the unit sphere), g(4, e), on S 2 . We may of course expand it in a Fourier series with respect to 4:
(33.35 )
g ( 4 , e) = go(@
+
+m
{g,'(e)ein+ n= 1
+ gn-(d)e-+}.
By applying now Theorem 33.2 we may expand each one of the coefficients g o , 9,' in terms of the Legendre functions with respect to cos 0 [we view the
Sect. 331
SPHERICAL HARMONICS IN THREE SPACE
32 1
Fouriercoefficients in (33.35) as functions of cos 8: we recall that 8has to vary only from 0 to 7r, since q5 varies from 0 to 2x1. Finally we obtain an expansion in L2(S2): +m
(33.36)
g ( 4 , e ) ==
1gm,
m=O
i m
0 Pm(cos
0)
+w
+ 1 C (g,:
+
B)ein4 g,,
P;(COS
n = l m=n
P;(COS O)e-inb}.
It is now very easy to construct the harmonic function u in the open ball B , which takes (in some suitable sense) the boundary value g on S 2 . Let us denote by S,,,, ,,(x,y, z ) the spherical harmonic (33.3 1). The solution u is then given by
where
S denotes the complex conjugate of S .
Exercises 33.1. Let r , 8,,. . . , On-, denote spherical coordinates in R". Write the expression of the Laplace operator A in these coordinates. 33.2. Show that, given any integer n = 0, 1, . . . , if a polynomial in one variable x , of degree Zn,is orthogonal in L2(]- 1, + 1 [) to every polynomial of degree Sn - 1, it is necessarily a multiple of a Legendre polynomial. (Do not use computation, only linear algebra.) If P,,(x) denotes the nth Legendre polynomial, show that (33.38)
(1 - 2xt
+ t2)-"2
c P,(x)t"*
+m
=
fl=O
Prove the following formulas : (33.39)
1 ( w 2 - 1)" 2"Pfl(x)= -. f dw 27rz Y ( w - x ) " + l
(Schlafli's formula),
where y is a circle centered at x ; (33.40) P,(x) (33.41) (33.42)
33.3.
= - Jr(x
n o
(n
+ (x2-
cos 8)'' d8
(Laplace's formula);
+ l)P,+, - (2n + l)xP,, + nPn-l = 0,
nP, = xP; - PA-
n P n P 1= PA - x P :,-,.
Let P,(x) denote the nth Legendre polynomial. Prove that the roots of
P,,(which, of course, number n) are all real, pairwise distinct, and that they all belong to the interval - 1 < x < + 1.
Spectral Properties and Eigenfunction Expansions We return now to a second-order differential operator A = A(x, a/ax) whose coefficients (when it is written in the variational form below) are L" functions in the open set R c R":
As we have said, we assume that ajk, b' (1 5 j 2 n), c belong to Lm(R). In addition to this, we are going to assume that A is formally seljraa'joint:
(34.2)
vu, u E H;(Q),
(Au, 6) = (2.4, AD),
where ( , ) is the bracket of the duality between HA@) and H-'(R). It is not difficult to state necessary and sufficient conditions in order that (34.2) holds. The principal part of A ,
must be (formally) self-adjoint, and this is so if and only if
(34.3)
-
d k ( x )= aki(x)
for almost every x E 0, 1 5 j , k
=< n.
The part of order S 1 in A must also be self-adjoint, which implies
(34.4) (34.5)
-
bj(x) = -bj(x),
a.e. in R,
1g
j 5 n,
- " ab' c(x) - c(x) = C -. ( x ) a.e. in 0. j=
ax]
In particular, the divergence of the vector (b', . . . , b") must belong to Lm(R). 322
Sect. 341
EIGENFUNCTION EXPANSIONS
323
Condition (34.2) is equivalent to the property that the sesquilinear form on HA(R) x Hh(iz) associated with A .
au ao is Hermitian, i.e., a(v, u) = a(u, v). We shall also make the strong ellipticity assumption : (34.6)
f o r suitable co > 0, all
5 E c",and almost all x E iz,
As usual, this leads to the coercivity of the form
1
In 1 u12 dx + a(u, u),
provided that the real number 1is sufficiently large: (34.7) (34.8)
There is 1, E R such that, f o r all 1, > A,, and all u E HA@), ((21 + 4 4
c> 2 c1 I I 4 I L ( R ) .
The constant c1 in (34.8) is strictly greater than zero and independent of u (but depends on A). The coercivity inequality (34.8) implies that, for L > A,, LZ + A defines an isomorphism of HA@) onto H-'(R). Let us denote by G(1) the inverse isomorphism. Throughout this section we make the hypothesis that (34.9)
R is bounded.
Then if J denotes the natural injection of H;(R) into H-'(R), we know, for instance by Proposition 25.5 (cf. also Exercise 25.10), that J is compact. I t follows from this : PROPOSITION 34.1. I f 2 > A, and if0 is bounded. G(2)J is a compact operator ofHA(R) into itsew Thus, under the hypothesis (34.9), we may apply the classical theorem, due to Riesz, on the spectral decomposition of a positive compact operator, in a Hilbert space. Its proof can be found in any text on the subject (see, e.g., [TVS, D&K, Theorem 48.11). An operator of this kind has a discrete spectrum, which means that every point in the spectrum is isoiated and is an eigenvalue of the operator. Furthermore, the eigenspace corresponding to any given eigenvalue 1, which we may denote by V, , is finite-dimensional. If 1,x' are two distinct eigenvalues, the corresponding eigenspaces V, , V,. are
324
BOUNDARY VALUE PROBLEMS
[Chap. I11
orthogonal. The eigenvalues, which are (of course) strictly positive numbers, form a decreasing sequence converging to zero. The orthogonal sum of the eigenspaces V,, as x ranges over the spectrum, is dense in the orthogonal of the kernel of the operator, hence in the whole Hilbert space if the operator happens to be injective, as is the case for G(1)J when 1> A0. Thus, in the injective case, we may find a complete orthonormal system, in our Hilbert space, consisting of eigenfunctions. Let h be an eigenfunction of G(1)Jin @(a),corresponding to the eigenvalue x ; then x is strictly greater than zero. We have (34.10)
Ah
=
(x-' - 1)h,
which shows the relation between the spectrum of A and the one of G(L)., Since 1is somewhat arbitrary, we shall in the sequel focus on the spectrum of A , S ( A , Q): I t consists of a sequence of real numbers converging to + co, which we may order as an increasing sequence (34.1 1) with repetitions according to the multiplicity of each eigenvalue. We may view the resolvent of A , (34.12)
( z l - A)-'
= (Z
- A)-'G(1)(G(1)
- ( z - A)-'l)-',
as a holomorphic function of z in C\S(A, Q) valued in the space of bounded linear operators H-'(O)-+H,!JQ). Note that it enables us to solve the Dirichlet problem : ( A - zZ)u
(34.13)
=f
u =g
(34.14)
in !2,
on 132,
whenever z $ S ( A , Q), for sufficiently regularfand g:fmust belong to H-'(Q) and g must be the trace on 130 of an element w of H'(R) (assuming that the boundary 8Q is regular enough, so that the trace of w can be defined). Then the solution u of (34.13)-(34.14) is given by (34.15)
u =w
- (ZI-
A)-"f-(A
- zl)w].
Suppose now that z is an eigenvalue of A . The range 9 of A - zl is canonically isomorphic to the orthogonal A of its kernel X in HA(Q), since A is self-adjoint (and therefore z is real). We may regard A - zlas an isomorphism of A! onto W ,whose inverse we may still denote by ( A - z l ) - ' . With such notation, and provided that f - ( A - zl)w E 92, (34.15) still gives a solution of (34.13)-(34.14), but now we may obtain a whole affine space of solutions, whose dimension is 2 1 , by adding to the right-hand side in (34.15) an arbitrary element of X . If one wishes to select in a unique manner
Sect. 341
325
EIGENFUNCTION EXPANSIONS
one solution among these, one must adjoin a sufficient number of conditions -a familiar situation, which goes with the Fredholm alternative. We are now going to look at Hilbert space bases, i.e., complete orthonormal systems, consisting of eigenfunctions of A . It is convenient to redefine the inner product in H'(Q), in a manner better suited to the study of A . We shall set ( ( u , 0)) = a(u, v)
(34.16)
+ K JQu6 dx,
with K > A, [see (34.7)]. We may then select a sequence of eigenfunctions of A , { U j } ( j = I , 2, . . .) belonging to HA(R), such that [cf. (34.1 I)]:
AUj=AjUi,
(34.17)
(34.18) ((Ui, U j ) ) = hi,
j = 1 , 2 ,...,
(Kronecker's index),
i, j
=
1, 2, . . . .
By the Riesz theorem we know that the U j form a Hilbert space basis of Hi(R). Noting that K + i j > 0, let us now set (34.19)
Ej
= (ti
+ ij)li2Uj
( j = 1, 2, .. .).
PROPOSITION 34.2. Suppose R is bounded. Then the functions Ej [resp. ( K + Aj)1'2Ej] form a Hilbert space basis in L2(R)[vesp. in H-'(R), equipped with the Hilbert space structure dual of the one defined by (( , )) on HA(Q)]. A distribution u in R belongs to HA(R) [resp. to L2(Q),resp. to H-'(R)] ifand only if +m
u = zujEi,
(34.20)
j=1
where the series converges in 9'(R) and where +li
(34.21)
resp., where (34.22)
resp., where +on
(34.23)
s(K+Aj)-11uj/2< j = 1
1
J
+a .
326
[Chap. 111
BOUNDARY VALUE PROBLEMS
Proof. We have, by virtue of (34.16), (34.18), and (34.19),
s,Ei
E , dx
=((A
+ K ) G ( K ) J E E,) ~,
= ( ( G ( K ) J E E,)) ~,
On the other hand,
(Ei, Ej)&-l ( n ) = (Ei, G(K)JE,) = ( K
+ 2,)-'
n
EiE j dx,
from which the first part of the statement follows. As for the last part, it follows from
LEMMA 34.1. Let X be a Hilbert space of distributions in R, {ti} ( j = 1, 2, . . .) a Hilbert space basis in X . For a distribution u in R to belong to X it is necessary and suflcient that there be a sequence ( c j ) in I' such that the series cjcj converges to u in 9'(R). Then the sequence ( c j ) is unique.
xi
We recall that the space I' is the Hilbert space of square-summable sequences of complex numbers. Proof: The mapping ( c ~ ) ~ = ~ , ~ , , c. j,c+j ,+ where ~~ the convergence is to be understood in the sense of X , is an isometry of 1' onto X . Let $ denote the space of distributions u which can be represented as series c j c j converging in 9'(R) [with (c,) E 1'1. There is a natural mapping of $ onto &': To each series as above we assign the same series, but now converging in X . Since the topology of X is finer than the one induced by 9'(R), this mapping must be the identity. Q.E.D.
x,
Remark 34.1. The analogy with the global situation, where R = R" and A = -A, is worth stressing. Instead of the eigenfunction expansion (34.20) we have, in the global situation, the Fourier inversion formula: exp( - ix . t;) is obviously an eigenfunction of - A (corresponding to the eigenvalue I 5 I which varies over R, ; the difference, of course, is that the spectrum is not discrete; note also that theeigenfunctions do not belong to If'). The " weight" (K A j ) should be replaced by ( K It; l'), where K is any number greater than zero (usually one takes K = 1). The analog of Proposition 34.2 would then state that u E HJ(R")( j = 1, 0, - 1) if and only if
+
+
which is the definition of the Sobolev spaces (via the Plancherel formula).
Sect. 341
EIGENFUNCTION EXPANSIONS
327
Remark 34.2. The eigenfunction expansion (34.20) enables us to ‘L insert” Hk(R), Ho(R), and H-’(R) into a one-parameter family of distribution spaces, which we could denote by l?:(Q; A ) (s E R): a distribution u in R belongs to the latter space if and only if it has a series expansion (34.20) with coefficients uj satisfying +m
(34.24)
(K
-k
Aj)sIUj12
j= 1
<
-I- CO.
In particular, for I s I 5 1, we obtain a family of interpolation spaces between Hk(Q) and H - ’ ( Q ) .
Example 34.1 We shall determine the eigenfunctions Ej ( j = 1, 2, . . .) in the case of one space variable, denoted by x, when A = - ( d / d ~ )and ~ when R is a finite interval a < x b. We have
-=
-E!’J (34.25) and since Ejmust belong to HA(R),
(34.26)
= A.E. I
1’
Ej(a) = Ej(b)= 0.
The general solution of (34.25) is given by
E~ = u cos(xJ&)
+ u sin(x&),
and the boundary conditions (34.26) require that the vector (u, u) E R2 be orthogonal to the two straight lines through the origin whose angles with the u-axis are aAj‘2 and bl:’’, respectively. Since u2 + u2 # 0, this is only possible if the two lines in question are identical, that is, setting L = b - a, (34.27)
~ 1 ;= ’~ jrr,
j = 1,2, . . . .
This condition determines the eigenvalues of -(d/dx)’ in our present setup; observe that these eigenvalues must be strictly positive, by Proposition 23.4. They are the numbers (34.28)
lj = (jn/L)2,
j = 1, 2,
....
By taking into account (34.26) and the fact that the Ej must form an orthonormal system in L’(n),we immediately obtain (34.29) Observe that the multiplicity of each eigenvalue is one.
328
BOUNDARY VALUE PROBLEMS
[Chap. I11
Example 34.2 We continue to look at A = -A, but this time in two independent variables, denoted by x and y . We take R to be the open disk x2 + y 2 < R 2 , R > 0. It is convenient, therefore, to switch to polar coordinates r, 0 and to expand the prospective solution of (A + I)h = 0 into a Fourier series (in 0): +m
(34.30)
h(r,
e) = h,(r) + C (hm(r)eime+ h-m(r)e-ime). m= 1
We use the expression of the Laplacian in polar coordinates:
Because of the fact that any two functions v(r)eime,w(r)eim" with m # m', are orthogonal in L2(R), in order that h, given by (34.30), be a solution of (A + I ) h = 0, it is necessary and sufficient that each term hm(r)eime, m E Z, be one. We must therefore study the equation
hm=O,
r
m=0, + I , +2 ,....
We require h E HA(R), hence (34.32)
hm(R)= 0,
Vm E Z.
Notice that we know a priori that there will be nontrivial solutions to (34.31)(34.32) only if ;1 is an eigenvalue of the problem, and then there must be solutions only for finitely many m [otherwise the space of the functions h E HA(R) satisfying (A + i ) h = 0 would be infinite-dimensional]. If we set s = $ r in (34.31) and denote now by primes the differentiations with respect to s, that equation is transformed into the mth Bessel equation (34.33)
h"+-h'+ S
( - -3h =o. 1
The space of the solutions that are regular at the origin is one-dimensional, it is spanned by the rnth Besseffunction (supposing now that m >= 0):
where denotes the Euler gamma function. The way to find the expression (34.34) is straightforward : One writes that the solution h in (34.33) is a power series in the variable s and determines the coefficients of this series by the
Sect. 341
329
EIGENFUNCTION EXPANSIONS
relations between them derived from (34.33). If m is a negative integer one defines J,(s) as (- I ) - " ' J - m ( ~ ) it; is obviously also a solution of (34.33)! The question, of course, is whether one can also achieve
J,(JA R ) = 0.
(34.35)
In answering this question we invoke classical results of the theory of Bessel functions (to be found, e.g., in [W]). We may as well suppose m >= 0 (most of the statements below are valid even when m is not an integer, but we are solely interested in this case, of course).
(34.36) For any m 2 0, J,(s) has only real zeros. (34.37) Let z,, ( j = I , 2, . . .) denote the positive zeros of J , , arranged in increasing order. Then (interlacing of the zeros of the Bessel functions) O
< * * ' .
Property (34.37) makes it important to know something about the distribution of zeros of J , : (34.38) Jo has an injinity of positive zeros; they lie in the intervals [(k + $)Tc, (k
+ +)XI,
k
= 0,
I , . .. .
Finally, let us emphasize the fact [implicit in (34.37)] that any two functions J , (m = 0, 1, . . .) have no common zeros except the origin (when Jo is not one of them). If we take all this into account, we conclude that we have a (double) sequence of eigenvalues : (34.39)
A=(z,,~/R)',
m = 0 , 1 , ..., j = 1 , 2 ,...,
and that the eigenspace corresponding to each one of them is two-dimensional -it is spanned by Jm(z,, j r / R ) e i m and e by Jm(zm,j r / R ) e - i " e [except form = 0, in which case the eigenspace is one-dimensional and spanned by J o ( z o , r / R ) ] .
Exercises 34.1. This exercise concerns the meaning of the smallest eigenvalue. Let V q H be two complex Hilbert spaces and assume that V is dense in H and that its injection into H is compact. By composing the injections of V into H
v'
and that of H into the antidual of V (we are identifying H with its own antidual) we obtain an injection J of V into V'. Show that this enables us to introduce positive continuous linear mappings of V into V', and if A is a positive isomorphism of V onto we may define its square root, A'/'. Prove
v',
330
BOUNDARY VALUE PROBLEMS
[Chap. 111
that A'/' is an isomorphism of Vonto H.Let a(#, u) = (A1l2u,A ~ / ' u )Show ~. that the smallest eigenvalue of A is equal to the largest number c > 0 such that
(34.40)
cllull;
5 a(u, u),
v u E V.
34.2. Apply the abstract description in Exercise 34.1 to the case where V = HA@), H = L2(Q) [hence V' = H-'(Q)] when Q is the open subset of R" defined by (34.41)
-m
j = I , . . ., n.
< aj < x i < bj < +m,
Taking A to be minus the Laplace operator, -A, compute its smallest eigenvalue. 34.3. Let J J z ) be the mth Bessel function, defined in (34.34) (m E Z). Show that ,iz s i n 0
(34.42)
-
J,,,(z)eimO. m=-w
Derive from this that, for all real x, 1
(34.43)
IJfAX)'
3
if m ZO,
IJo(x)I S 1 ;
Prove the following formulas :
(34.44)
(34.46)
JJz)
cos(m8 - z sin 8) dB
=
Jm-l
+ Jm+l =-2mZ J , ,
(Bessel's formula);
Jm-l - Jm+l = 2JA;
+oo
(34.47)
Jm(Y
+ Z ) =v =1 -w
JA~)Jrn-v(z)-
Explain the relation between (34.47) and the properties of Fourier series related to convolution (in 2 and on the torus T). 34.4. Formula (34.34) is used to define the Bessel function Jm(s)for any complex number m which is not an integer
(34.48)
a([) = pi-"/'
+W
u"(r)Jn12-l(rp)rn/2 dr.
Sect. 341
EIGENFUNCTION EXPANSIONS
33 1
34.5. Prove that the Laplace operator A can be regarded as an unbounded self-adjoint operator (which we shall denote here by -A) on Lz(Rn) with domain Hz(Rn).Prove that the spectrum of A is exactly equal to R, but that there is no eigenfunction of A belonging to Lz(Rn), hence no eigenvalue of A. Prove that given any number 12 0, there is a functionf, E L"(R") A Cw(R") such that -A fL = Af, (f,not identically zero). 34.6. Let 1 be an arbitrary number 20. Let A be the Laplace operator on R2. Prove that all the tempered distributions u which satisfy the homogeneous equation
(A + A)u = O
(34.49)
in RZ,
are given by a series expansion
(34.50)
i m
u(z) =
cmJm(fir)eime ( z = re"),
m=-m
where (c,),,,,~ is a sequence of complex numbers growing slowly at infinity [i.e., 3C > 0 such that I c, I 5 C(l + Im I)', Vm E 21. Prove that the series on the right in (34.50)converges in Cm(R")(J, stands for the mth Bessel function). 34.7. How do you reconcile the fact that, given a bounded open subset of R",for suitable values of A > 0 the homogeneous equation (A + A)h = 0 has nontrivial solutions in Hh(R), with the fact that no linear PDE with constant coefficients can have compactly supported distribution solutions different from zero (the latter assertion can be established by means of Fourier transformation, applying the Paley-Wiener theorem) ?
35 Approximate Solutions to the
Dirichlet Problem. The Finite Difference Method Conformal mappings and eigenfunction expansions provide us with the means of approximating the solution to the Dirichlet problem
(35.1)
( A - A)u =f
(35.2)
u =g
(in f2, an open subset of R"),
on 8R
(the boundary of
a).
Unfortunately, their use is clearly limited to highly " symmetric " configurations: That of conformal mappings presumes that we are essentially in a twodimensional situation, and the use of spherical harmonics, of cylindrical, ellipsoidal, etc., harmonics, is possible only when the domain R exhibits spherical, cylindrical, ellipsoidal, etc., symmetries, respectively. In practice these are excessive limitations. Thus quite early (at the beginning of this century), physicists and engineers were led to devise a simple and practical method of approximating the solution of (35.1)<35.2). This method is closely related to the notion of discrete harmonic functions described in Sect. 31. It goes under the name offinite diflerence method. A crucial advantage of this method is that it is particularly well adapted to treatment by computer. Another method, which has been in favor recently, is the Galerkin method; it has a certain similarity to eigenfunction expansion, but much greater flexibility. Both the finite difference and the Galerkin method are based on the approximation of Eq. (35.1) by linear equations in finite-dimensional spaces. One solves the latter and shows that their solutions converge to that of (35.1) as the finite-dimensional spaces suitably " increase." We give here a very brief description of the two methods, from the standpoint of the variational approach, currently the standpoint which is most often adopted. 332
Sect. 351
333
THE FINITE DIFFERENCE METHOD
It pays to look a t a setup which is general enough to cover both the finite differenceand Galerkin's method. We shall deal with a separable Hilbert space V , i.e., with a Hilbert space having a countable dense subset or, if one prefers, a countable orthonormal complete system. In the application given here V will be H;(R), and the separability assumption is certainly satisfiedas it always is, in practice [usually, V is a closed subspace of H"(R)]. We shall now describe an external approximation of the Hilbert space V. At first this concept may seem a bit too abstract, but the examples will help, we hope, to understand the reason for the abstraction. We shall deal with spaces and mappings indexed by a parameter h; h will run over a sequence (or sometimes over a net with a countable basis-like an interval in the real line) which we assume to converge; the limits with respect to h will always be meant as h tends to the limit of that'sequence. But first of all we introduce a monomorphism of V into another Hilbert space F, J, i.e., a continuous linear injection whose image is closed but not necessarily all of F (by the closed graph theorem, this implies that J is a homeomorphism into). For each h, we introduce a Hilbert space v h which, in the applications, will be finite-dimensional, and a pair of continuous linear mappings:
(35.3)
rh V + V/,,
Ph
: vh+F
(the rh are often called the restriction mappings, the extension mappings).
Ph
the prolongation or
Definition 35.1. The system (Vh, ph, rh, F, J ) is called a convergent external approximation of V if both following conditions are satisfied:
(35.4) Vu E V, P h rhu converges to Ju in F
(for the norm of F ) ;
(35.5) suppose that, for each h, we are given an element v h of vh such that P h v h converges weakly in F to some v ; then necessarily v = Ju, u E V. Furthermore the approximation is said to be stable if the norms of the mappings rh and P h are bounded independently of h. Example 35.1. Let V = F and J be the identity mapping of V . Let h vary over the sequence 1, 1/2, . . ., 1/ k , . . . , and let { vh} be an increasing sequence of finite-dimensional subspaces of V whose union is dense in V (recalling that Vis separable). For instance, we may be given a complete orthonormal system { e l , . . .,e,,, .. .) in V and let V, be the subspace spanned by el, . . . , e, where k = l/h. We may then take for ph the natural injection of Vh into V and for rh, the orthogonal projection of V onto vh.The approximation thus obtained is convergent and stable; it is called a Galerkin approximation of V. We shall simply denote it by {vh}.
Let us momentarily go back to Sects. 22, 23, and 26 and recall that, under suitable assumptions on R, on dR, and on the boundary datum g , we may
334
[Chap. 111
BOUNDARY VALUE PROBLEMS
reduce the problem (35.1)-(35.2) to the same one, but where g = 0. We must now assume that f E H-'(Q) and the variational approach consists in seeking u in HA@) satisfying, for all u E HA@), the equation
4%0)
(35.6)
= (f, fi),
where ( , ) is the bracket of the duality between H;(Q) and H-'(Q), and
a(u, u ) = A
(35.7)
1u5 n
dx
+
n j=1
au I axJ
afi dx. axJ
(We assume throughout that A is 20; when R is unbounded, we assume Iz > 0.) In the abstract situation, we assume that we are given a continuous sesquilinear form a(u, v ) on V x V which is coerciue (Definition 23.1) and a continuous antifinear form 4 on V; we want to approximate the solution u, which belongs to V (Lemma 23.1), of the equation (35.8)
a(u, u) = +(u)
for all u E V.
[When dealing with real-valued functions and distributions, hence with real Hilbert spaces V, H , and V', the form a is assumed to be bilinear, and the form 4 linear. Note that we do not assume the form a to be Hermitian-or symmetric, in the real case, as it is in the particular case (35.7).] Let us consider an external approximation of V, (Vh ,p , , r, , F, J), as in Definition 35.1, and assume that we are given, for each h, a continuous sesquilinear form a, on v h x vh and a continuous antilinear functional 4, on vh, submitted to the following conditions:
(35.9)
there is a constant co > 0, independent of h, such that
Iah(uh
9
uh)
I 2 11uhll Y2 h cO
7
vuh E
v h;
(35.10) there is a constant C > 0, independent of h, such that
11 4 h 11Y,,'5 c. Of course, we must somehow relate the data a , , 4 h to a and 4. This is done by requiring that a, converge t o a and 4 h to 4 in a reasonable sense, which is made precise by the following consistency conditions:
(35.11) for all v, w E V , if the sequence P h O h , Ju in F, then: (35.12)
lim ah(Uh, rhw) = a(v, w), h
(35.13)
lim 4h(uh) = 4(v)* h
Uh
E v h , converges
lim a,(r,w, uh) = a(w, u ) ; h
weakly to
Sect. 351
335
THE FINITE DIFFERENCE METHOD
Clearly, when ah is Hermitian for every h, one of the two conditions (35.12) is redundant. Example 35.2. Let { Vh}be a Galerkin approximation of V (Example 35.1). If we take for ah the restriction of a to vh x vh and for (bh the restriction of 4 to vh, we see at once that conditions (35.9) to (35.11) are satisfied.
By virtue of the Lax-Milgram lemma (Lemma 23.1), for every h the equation (35.14),
7
has a unique solution uh in
= &h(uh)>
uh)
vh.
vuh
vh
1
We may state and prove:
35.1. Let (Vh, p h r rh, F, J ) be a convergent and stable external THEOREM approximation of the Hilbert space V. Let a(u, v) be a continuous coercive sesquilinearform on V x V , & a continuous antilinear form on V. For every h, let ah be a continuous sesquilinear coercive form on vh x v h and f$h a continuous antilinear form on v h , such that (35.9), (35.10), and (35.11) hold. If u E V denotes the unique solution of (35.8) and uh E vh ,for each h, the one of (35.14), ,P h t+, converges to Ju in F (for the norm). Proof. By applying (35.14), with uh = uh and applying (35.9) and (35.10) we immediately obtain
(35.15)
I f sup,
/IUhllV,,
jlj+,)I = M
5 c/cO-
+ co (since the approximation is stable, Definition 35.1),
(35.16)
IIPhUhl l F
5 MC/cO-
We may extract a subsequence, {Ph‘ u,,,), which converges weakly in F, to some element, of the form Ju,, u1 E V, by (35.5). Let us arbitrarily fix u E V and apply now (34.14), with vh = rhu.We apply (35.4) and see that the sequence { O h } is of the kind considered in (35.11). We derive from (35.13) that #h(Uh) converges to 4(u). On the other hand, let Ch = uh, if h = h’ belongs to the set of indices of the weakly convergent subsequence Ph’ uh‘ above, and & = rhu1 otherwise. We see that Ph i i h converges weakly to Ju, in F, hence, by (35.12), ah(uh, vh) converges to a@,, v). But ah, (+, v,,,) = &h’(Uh’) 4(v), hence u1 satisfies (35.8), which demands that u1 = u. This shows that --+
(35.17)
{Phuh}
converges weakly to Ju in F.
Let us then consider T = ah(uh - rh u, uh - rh u ) = ah(uh 9 uh)
+ ah(rh u, rh u) - adUh,
rh
u,
- ah(rh
u, uh)*
336
[Chap. 111
BOUNDARY VALUE PROBLEMS
By (35.9) we have
(35.18)
- rhullY, = < c -0’ T .
Il‘h
If we apply (35.12) [with uh = ‘ h u and recall (35.4)], we see that a h ( r h u, r h u),
ah(uh
9
r h u),
fih(rh #, u h )
converge to a(u, u).
On the other hand, by (35.13), ah(uh
9
uh)
converges to $(u)
= $h(uh)
= a(u, u).
Combining these assertions yields that T converges to zero, and therefore
(35.19)
lim
lluh
- r h u l ( y , =0.
h
By using the fact that the norms of the P h are bounded independently of h, and also (35.4), we conclude that P h u h converges to Ju for the norm in F. Q.E.D.
The property (35.19) is sometimes expressed by saying that the u h converge discretely to u (from which arises the notion of discrete convergence). Inspection of the proof of Theorem 35.1 yields an estimate of the error. Actually there are several notions of error: the error between u and uh ,which is the value of IJJu- p hu,,llF(thus this error depends on the choice of J ) ; (2) the discrete error between u and u h , which is the value of I)uh- r h ullY,; (3) the cutof error on u, IJJu- P h rh ullF.
(1)
It is on the size of the latter that we might best be able to have an influence, as it depends solely on the choice of the approximation ( v h , p h, rh, F, J ) and not on the sesquilinear form a(u, u). We shall determine an upper bound for the error between u and u h (l), in terms of the cutoff error (3). First we shall determine an upper bound for the discrete error (2), in terms of an “extension” a of a to F x F, that is, of
(35.20) a continuous sesquilinear form a on F x F such that ~ ( J u Jw) , a(v, w ) for all u, w E V .
=
Then let u be the solution of (35.8). We define the following continuous linear functional on F :
(35.21)
@W= a W , g),
It is verified at once that @(v)
= $(Jv) for
g E F.
all u E
V .
Remark 35.1. Observing that F = ( J V ) 0 (JV)’ [(JV)’ is the orthogonal ofJV i n F ] ,we see that forms such as a always exist: We may take a to be the
Sect. 351
337
THE FINITE DIFFERENCE METHOD
only extension of a such that a ( f , g) = 0 for all f E F,g E (JV)’. But other extensions might be more suited to the problem under study, as shown in the following: Example 35.3. Let V = H’(R) [or HA(Q)] and take F
5 ). ax.
av
(35.22)
= (L2(R))”+’,
We know that J is an isometry of Vinto F. The extension of the form (35.7) is evident :
Observe that (35.23) is certainly different from the extension described in Remark 35.1. Indeed, the latter is never coercive when J is not onto, whereas (35.23) is obviously coercive (when A > 0) although J is certainly not onto. We introduce now the following two quantities :
(35.24) (35.25)
&h(#)
= \ltf)h@ - $h\lVhr
3
qh(u) = sup )a(Ju,Phvh)- a h ( r h u , o # U h Evh
vh)l/\IvhllV,.
Concerning (35.24), note that Ph is a continuous linear map ‘ph is a continuous linear map F’ + VL .
vh
-+I;, hence
THEOREM 35.2. Use the same hypotheses and same notation as in Theorem 35.1. We have (35.26)
11 uh
- rh
11 V h 2 co
[&h(#) f
qh(u)l>
where co is the positive coercivity constant in (35.9). Proof. It suffices to reexamine the quantity T introduced at the end of the proof of Theorem 35.1, and rewrite it as follows: T = ah(uh, uh - r,,u ) - ah(rh u, uh - rh u ) = $h(uh - rh u ) - a(Ju, ph(uh - rh u)>
+ a(Ju,ph(uh - rh u>)
Q.E.D.
338
[Chap. 111
BOUNDARY VALUE PROBLEMS
COROLLARY 35.1. We have the same hypotheses as in Theorem 35.2. Let M be an upper bound for the norms of the extension operators P h . Then
The Finite Difference Method In this second part of the section we apply the general considerations of the first part and describe the approximation of the solution to the Dirichlet problem one obtains by the finite difference method. We assume throughout that the open set n is bounded. We deal with diagonal matrices h whose diagonal entries hl, - ..,h, are strictly greater than zero (and converge to zero). If x E R",hx denotes the vector (h,x', . . ., h,x"); hZ" is the image of Z", the lattice in R" of points with integral coordinates ( > 0 or 50), under the mapping X I + hx. In what follows a matrix h will be our parameter (so denoted in the first part) and will range over a sequence of (strictly positive) diagonal matrices converging to the zero matrix. The sequence will not be specified and will simply be denoted by $ (of course, in actual computations one must make a choice of the sequence $). Given xo E R" and h E $, we write
n n
(35.29)
Oh(X0)
=
[x',
j=1
- thj
7
+thj[;
(35.30) i f r is a nonnegative integer, uh(xO
7
r) =
u
LIE
Oh(x0 + $ha).
Z"
14 i r
Three examples of sets O h ( X 0 , r ) in the plane (i.e., when n Fig. 35.1.
= 2) can be found in
We denote by w,,, xa the characteristic function of the set oh(x0) and by &, .. ., n) the finite difference operator
( j = 1,
(35.31)
1
hh,j F ( x ) = - [F(x + $hiej) - F(x - +hiej)], hi
where ej stands for the unit vector along the $-axis. Returning to the open set R, we shall use the notation
Sect. 351
THE FINITE DIFFERENCE METHOD I----
339
-- -7
I
I
I
I I
I
I
I
!
FIG. 35.1
When dealing with second-order elliptic operators, one makes use of these sets only for r = 0, 1. If the order of the operator under study is 2m, one will customarily make use of the same sets but for r = 0, .. . , m. We describe now the external approximation (or dtfference scheme) we are going to use. As we have said, Y = Hh(R). We take F = (LZ(R))”+’and J t o be the isometric injection (35.22). For each h E I), the space V, will be the linear span of the characteristic functions w , , as ~ y ranges over the set (35.32) where r = 1, i.e., V,, will be the space of step functions:
(35.33)
=
c
uh(Y)wh.
Ythhl
The notation is consistent since w,,, w,,,,= 0 whenever y , y’ are different elements of 0;. Observe that the dimension of V, is exactly equal to the number of such elements. We equip V, with the inner product
The extension operator p , will be the mapping
(35.35)
Uh-(uh?
4,lUh,
.. ., 8 h , “ % ) ,
which is obviously an isometry of V, into F.
340
BOUNDARY VALUE PROBLEMS
[Chap. I11
As for the restriction operator rh : V - t vh, it assigns to every v E HA(C2) the step function (35.33) defined by
(35.37)
I ch(y)I
= volume O f c h ( y )= h,
. .. h, .
We are going to prove that the difference scheme described above is stable and convergent (Definition 35. I), but under a restrictive condition involving the open set R and the sequence $.
De$nition 35.2. Let @ be a sequence of strictly positive diagonal matrices converging to zero. W e say that the open set R c R" is $-regular fi there is a Jinite collection of vectors in R", 8,, . . . , 8,, and a constant C > 0 , independent of j = 1 , . . . , n, and of h E b, such that the following is true:
(35.38) given any point y E hi such that either y + hiej or y - hiej does nor belong 'to 6;, to every z E c,(y) there is p, 0 < p < C, such that, for someq, 1 ~ q 5 p , z + p h j B , $ R .
THEOREM 35.3. Let Ij be a sequence of strictly positive diagonal matrices, converging to zero, R a bounded open subset of R". I f f 2 is @-regular,the external approximation (vh,P h , rh, F, J ) of V = H,$2) described above is stable and convergent. Proof. Since the mappings ph are isometries, in order to prove that the approximation is stable, it suffices t o prove that
(3 5.39)
llrhUllVh
5 const
llullY
for some constant independent of h and of v E V. Of course it suffices to prove (35.39) for u in a dense subset of V ; we shall take v E Cz(R). First of all, recalling that the sets ah(y),y E hZ", are pairwise disjoint, we see that
'
"CTh,,)
I v(z)I
dz
(by Cauchy-Schwarz)
Sect. 351
341
THE FINITE DIFFERENCE METHOD
where all summations are performed with respect t o y E 6,'[cf. (35.32)].Thus,
(35.40)
11 uh 11 L'(R) 2 11 uII Lz(R) .
In order to show that the difference scheme ( v h , P h , rh, F, J ) is stable, we must estimate the L' norm of a h , u,, in terms of the H' norm of u. Let us f i x j , 1 2 j S n, and let us denote, for any point x, x- =x-'hz
x+ =x++hjej,
e i. .
~
We have uh(x+)
=
c
=
Ilh(Ylwh, y ( x + )
c1
~ o h ( y - ~ ( Z +d)z wh, y - ( x ) '
ah(y-)I
Y-
that is,
(35.41)
Wh,y-(x)/uh(y-)
" ' h n ) - ' x
v(z') dz,
Y-
the summation being performed over the image of
6,'under the translation
y ~ y - Similarly, .
(35.42)
vh(x-)
=
1
' ' '
wh, y +
Juh(y')
Y+
d Z - )dz.
It follows from the definition of h,'that y + and y - belong to R as long as y remains in A,'. Note also that, if y' = y - h j e j , then y" = y - . If there are two points y , y' in 6: such that y" = y - , we shall denote the latter point by y o . Otherwise we write y*; thus y* = y + (resp. y - ) whenever
(35.43)
y
+ h j ej (resp. y - h, e j ) does not belong to 6;.
We derive from (35.41)-(35.42):
(35.44)
(hl
'
.
hn)[Uh(X+)
+
-
uh(x-)l
kw h , y * ( X ) Y*
We have [cf. proof of (35.40)]
I
sh(Y*)
U(Z*)
dz.
342
[Chap. 111
BOUNDARY VALUE PROBLEMS
In the terms corresponding t o points yo we use the fact that
I u(z+) - u ( z - ) I
(35.46)
As for the terms corresponding to points y*, we observe that, if y* = y + for some y E the corresponding term in (35.45) is
a:,
Iu(z-)l2 d z =
(35.47) JOh,,+,
Id,,Iu(z)~*
dz.
At this point we exploit the t)-regularity of R (Definition 35.2) and property (35.43) of y. For each z E b h ( y ) we may select p = p(z), 0 < p < C, such that z p h j 8, 4 R for some q, 1 q p . Since u E C:(Q), we have
+
s s I u ( z ) l z = ( u ( z + ph,8,) - u(z)12
(35.48)
CC:h;
11 C
P
(grad u(z
0 q=l
+ thjOq)I2d t ,
where C, = sup, I 041. Since the sets a&) and b h ( y " ) are all contained in R [by (35.32)] we derive from (35.45), (35.46), and (35.48):
+ CC:
f I 1 ]grad u(z + thjO,)) C
q=1
0
n
dz dt.
After obvious changes of variables in the z integrals we obtain (35.49)
s
Id,,, u(x) 1' dx 5 (1
s
+ pC2C:) 1 grad u(x) l2 dx,
which, together with ( 3 5 4 , implies (35.39), and hence the stability of the scheme we are looking at. In order to complete the proof of Theorem 35.3 we must show that it is convergent. First we show that, for each u E v,P h r h u converges to Ju in F. By the Banach-Steinhaus theorem (i.e., the principle of uniform boundedness), thanks to the fact that the norms ofph and r h are bounded independently of h, it suffices to prove the assertion when u E CF(R). Then it suffices to show that + u and a h , j u -,(d/dxj)u in L*(R), which is quite obvious. , uh be an element of v h such that the sequence Next, for each h ~ t )let { p h Oh} converges weakly in F. This implies that the Uh and the d h , Vh converge
Sect. 351
343
THE FINITE DIFFERENCE METHOD
in the distribution sense, and that their limits, respectively u and (a/axj)u, belong to Lz(R), i.e., u E H'(ZZ). But since the supports of the vh and of the d h , uh are compact subsets of 0, it is not difficult to see that u E H;(R). Q.E.D.
From Theorem 35.3 we can derive an approximation of the weak solution u to the Dirichlet problem
(A - A)u =f E H-'(R),
(35.50)
u E HA(R),
that is, the solution u of Eq. (35.8) where a is given by (35.7) and qqu) = (f, C),
(35.51)
uE
v = HA@).
We introduce the following approximations of a and
4, respectively:
(35.52) (35.53) assuming that u,,,
Wh
E
Vh and that we have
(35.54) [If we have at our disposal additional information about f, for instance that f€L2(R), the solution of Eq. (35.54) can become very easy; for instance, f,=f, = 0 for j > 0.1 Condition (35.9) is satisfied with co = inf(1, A); if 1 = 0, we may still be able to satisfy (35.9) by modifying the inner product in Vh [usually given by (35.34)]. Condition (35.10) is satisfied by
It is seen at once that condition (35. I 1) is satisfied (taking into account what has been established in the proof of Theorem 35.3). We consider the approximate problem (35.55)
9
Oh)
= $h(uh)t
vuh
Vh
*
It sufficesto verify (35.55) when o h ranges over a basis of V,, in particular when = w h , y , y E hi, We may set, for any two points y , y' in hi,
(35.56)
a"h(Y,
v') = ah(Wh,
y
9
w h , y')'
In order to find the solution of (35.55), (35.57)
$h(Y)
= +dWh,
y).
344
[Chap. I11
BOUNDARY VALUE PROBLEMS
we must solve the system of linear equations (35.58)
$.
Zh(L’3
y’)uh(Y> = $h(y‘),
y’
hL.
Yeah’
Calling n(h) the number of points in hi, i.e., n(h) =dim vh, we see that (35.58) is a system of n(h) linear equations, in n(h) unknowns, of the complex numbers uh(y). By Theorems 35.1 and 35.3, we know that, when h tends to zero along I), the solution u h of (35.55), given by (35.57), converges to the solution u of (35.50). The requirement that ST be lj-regular depends, of course, on the choice of lj. Let us assume that, when h E Ij tends to zero, the “components” h, remain comparable, that is, that there is a constant y > 0 such that
(35.59) y
s hj/h, 5
7-1
for all h e lj and all j , k = 1 , . . ., n.
Then, by choosing enough vectors O4 (for instance, enough to ensure that every convex cone with solid angle greater than a given value wo > 0 will contain at least one of them), one can easily see that every convex bounded open set R is ?-regular, or that every bounded open set ST whose boundary is C’ (and which lies on one side of 8R) is lj-regular. Finally, we wish to underline the fact that the application of the finite difference method presented above is probably the most primitive one can give, and that the method is amenable to great elaboration and diversification. It can be applied to a variety of problems besides the Dirichlet problem, for instance to the Neumann problem, the oblique derivative problem, (cf. Sect. 37), and also to higher order equations (cf. Sects. 36 and 38). One may obtain, under suitable circumstances, convergence properties other than the one contemplated in Theorem 35.1, for instance convergence in the L“ norm. Also, rather than using simple sets such as the b h ( y ) introduced in (35.29) and the associated a,(y, r), fi;, it might be convenient to use more complicated, curvilinear sets. Also the method can be extended to elliptic equations with coefficients which are not smooth, and to nonlinear equations. On all of this we refer to the texts on approximation of solutions to elliptic boundary value problems, such as [A] and [FS]. The range of application of the finite difference method is not limited to elliptic boundary value problems. With appropriate modifications it has been successfully applied to a number of hyperbolic and parabolic evolution equations and mixed problems (on these problems, see Chapter IV; a very simple application of the Galerkin method to hyperbolic evolution equations can be found in the proof of Theorem 47.1). Exercises All the functions considered in this exercise will be real-valued. Let be the linear space of step functions (35.33, f+, any element of V,, M an
35.1. Vh
Sect. 351
345
THE FINITE DIFFERENCE METHOD
arbitrary number at least zero. Prove that ( u h - M ) + = sup(t+, - M , 0) also belongs to Vh. Let f €Lrn(R) and consider the h e a r functional on v h [cf. (35.53)] faf u h dx. Let A be strictly greater than zero and choose M > 0 such that IlfllLm(n) AM. Let q,(t.+,wh) be the bilinear form (35.52) and u h the only element of Vh such that
s
(35.60)
7
uh)
=IQfuh
Derive from (35.60) that if
wh
= u,,
ah(Uh
(35.61)
uh(wh
9
wh+)
=
dx,
vuh E
vh
*
-M,
In (f( x )
- AM)wh+(x)dx.
Prove that (35.62)
(a,,
jw,'(X))'
5 ah,jwh(x)
and derive from (35.61) that (35.63)
ah,
jw:(x),
In (w;(x))'
uh(x)
v x ER",
dx = 0, hence that
5M,
VX E
R.
[Similarly one would obtain uh(x) >= - M , Vx E R.] Since Lao@)is the dual of L'(R), conclude that there is a subsequence of the uh which converges weakly, i.e., for the weak dual topology on L"(R), a(Lm(R), (L'R)), to u, and that we have
35.2.
1
u E L"(%
(35.64) Let Rh =
IlUllL-(n)
5 2 IIfIILm(n).
in the notation of (35.32). Call Vhthe space of functions
(35.65) where w h , y is the characteristic function of a h ( y ) . Let P h denote the natural injection of Vh into F =L2(R) and r h the mapping of V = L Z ( n )into vh which assigns the functions (35.65) to u E V such that (35.66)
Uh(y)
= ( h , ' ' . An)-'
j
U(X)
dx
( y E nh).
4 Y )
Prove that (V,,, ph, rh , F ) is a stable and convergent external approximation of L'(R). 35.3. Let Ri be the set of points y E hZ such that a&) intersects Q; define Vh as the space of functions (35.67)
=
1
YEnh'
uh(Y)wh,y(x)
ICf.(35.33)l
346
BOUNDARY VALUE PROBLEMS
[Chap. 111
equipped with the inner product (35.34). Take F = (L*(R))"+' and the operator P h
(35.68)
UhH(Uh?
8h.lUh,
*..?
6/1,nv/I)lQ
where In denotes, as usual, the restriction (of each component) to R. Suppose that R is bounded and regular enough so that there is a continuous extension mapping E,: H'(R) + H'(R") (cf. Appendix to Sect. 26). Take r,v, u E H'(R), to be the step function (35.67) such that
Prove that (V, , P h , r,,, F ) is a stable and convergent external approximation of v = H'(R). 35.4. Let R be a bounded open subset of R", lying on one side of its boundary r; the latter is assumed t o be a C' hypersurface. Exploit the approximation of H'(R) described in Exercise 35.4 to obtain an approximation of the solution of the Neumann problem in R, for -A + I [A > 0; see (37.1H37.2)and 437.11.
36 Garding's Inequality. Dirichlet Problem for Higher Order Elliptic Equations
In this section we show how the variational method can be used to obtain weak solutions to the Dirichlet problem for higher order strongly elliptic equations. Basic ingredients will be an open subset R of the Euclidean space R",which need not be bounded but which we shall suppose so, in order to streamline the statements, and an elliptic differential operator in R, of even order 2m > 0. (36.1) Here a = (aI, . . . , a,) is an n-tuple of integers 2 0; I CI I = a, + * . * + a,, and D" = D;' . - .D:, D j = -J-1 d/axj. We assume that the coefficients a, are C" functions in R, and that the operator P(x, D ) is uniformly strongly elliptic in R. This is a condition bearing on its principal symbol (Definition 19.l),
(36.2) The condition is that, for a suitable constant c, > 0, (36.3)
Re P2,,,(x, 0 1 c, 1 t I 2m,
Vx E R,
V5
E
R,
[such a positivity condition requires that the order of P(x, D ) be even]. We shall also make the following hypothesis:
(36.4) The coeficients a , and all their derivatives of order s m are bounded functions in the whole of R. 347
348
[Chap. 111
BOUNDARY VALUE PROBLEMS
We shall now state and prove the main result of this section, the celebrated Gdrding's inequality : THEOREM 36.1. Under the preceding hypotheses, there are two positive constants C , , C , such that v u E C,m(d). Ilulli 5 C , Re(f'(x, D)u, u)o + CiIIuII~-r We have denoted by ( , ) k and (1 Ilk the inner product and the norm in the
(36.5)
9
Sobolev space Hk(R") [or in the Sobolev space Hk(d)]. Proof.
(36.6)
Let x, be an arbitrary point of d.By virtue of (36.3) we have
~olr12rnI~(~112 s R@2rn(x,, <)l~(<)lZI7
where ii stands for the Fourier transform of u E C:(Q). from (36.6) and from Plancherel's formula, that
(36.7)
We derive at once
~ , s ~ l l : s Re(Pzm(x0, D)u, u)o + c o l l u l l L ~
Suppose now that the support of u lies in the open ball B , ( x o ) = { x ~ R " ;Ix-xoI < r } . There is a constant B, > 0, depending only on the mth first partial derivatives of the coefficients a,, I a I = 2m, such that
(36.8)
1 (f'2rn(~,D)u, U ) O - (f'Zrn(X0
3
D)u, U)OI
5 B o r l l ~ I+ l ~B o l l U l l m l l ~ l l m - ~ ~
In order to prove (36.8) it suffices to consider the case where PZm(x,<) = a(x)(", I a I = 2m. It is clear that
(36.9)
+ B',C8"D8'aa.,str(~)D8",
a(x)Da = Da'a(x)DU"
where [ a'I = Ia"l = M ,and where the summation ranges over a set of pairs of n-tuples j',j"of length at most m and m - 1, respectively. Consequently,
(36.10) I({u(x) - U ( X , > ) D ~u>~f , ({a(x) - a(xo))Du"u,Da'u)j 5 B1 IIuIImIIuIIm-1, where the constant B, depends solely on the aa.,sas, which are suitable derivatives of a(x). But if the gradient of a(x) is bounded in a, we see that, for a suitable constant B, depending only on the bound of grad a in d,we have
(36.11)
I({a(x) -a(xo)ID"u, u)I S BzrIIuII:
which implies what we want, that is, (36.8).
+ BlIIuIImIIuIIrn-1
9
Sect. 361
349
GARDING’S INEQUALITY
If we combine (36.7)and (36.8),and choose r 6 c0/2B0,we obtain (36-12)
&o
Ilulli 2 Re(P,m(x, D)u, u)o + B,
vu E CCm(Br(XcJ)*
llullm IIullm- 1,
In view of the hypothesis (36.4)and by an argument similar to that which led to (36.10),we see that there is a constant B4 > 0 such that
I((P(x, D)-Pzm(x, D))u, u)ol 2 B4 IIUIIm I I u I I m - 1 2
V U E cCm(Q>.
If we combine this with (36.12) and choose the constant B , appropriately, we obtain (36.13)
~0
IIuIIi ~ 2 R e ( P ( x , ~ ) u , + ~ )B5o I I u I I m I I u I I m - i y
VuEC:(Br(xo))*
It should be emphasized that the constants co and B, are independent of xo. The last step consists of “patching together” the local estimates (36.13). In order to do this, we cover the closure of R, with a finite number of open balls Br(xi), i = 1, ..., J, and introduce an equal number of nonnegative test functions 4i (i = 1 , . . . , J ) with the following properties: For each i, supp Cpi c Br(xi); 4: + * . . + 4: = 1 in R (thus 0 5 bi 5 1 for every i). We consider then an arbitrary test function u in R and apply (36.13) to each 4i u (in place of u). We add the corresponding inequalities and observe that
a,
I(P(x, D)(4iu) -#iP(x, D)u, 4iu)O I S B ~ I l ~ I I m - ~ l I ~ I l m ~ Finally, by using the fact that, givqn any 9 > 0 ,
and by choosing q small enough, one easily obtains (36.5).
Q.E.D.
Remark 36.1. The hypotheses in Theorem 34.1 could have been relaxed in several ways. For instance, the result remains valid even when the open set R is unbounded. In this case we must exploit the uniformity in the hypotheses (36.3)-(36.4) with greater care: We must construct a partition of unity, {&}, as in the end of the proof, but now injinite, though having uniform properties (for instance, consisting of the translates at the points of a suitable lattice of a fixed test function). Also the regularity requirements on the coefficients of P ( x , D)could have been made less stringent: By inspection of the proof one sees easily that all one needs is that the coefficient of D“ in P(x, D)( 101) 5 2m) have sup( I a 1 - m, 0 ) derivatives in L“(0). Remark 36.2. We have stated and proved a global version of Gkrding’s inequalities which is well suited for the study of boundary value problems. But it is clear that we could have as well stated and proved a local version: If
350
[Chap. 111
BOUNDARY VALUE PROBLEMS
we drop the uniformity requirements in (36.3)-(36.4), we could have proved (36.5) for all u with support in any given relatively compact open subset of 0,R', and with constants C,, C, depending on R'. on the right-hand Remark 36.3. We could have replaced the norm 11 side of (36.5) by any norm II 1) j , j < m, in particular by 11 [lo. It suffices to observe that, given any q > 0, there is a constant C(q) > 0 such that (1
+ I5l2)"-' 6 q(1 + 15IZ)" + C(?),
VY E R,,
hence (36.14)
Ilullf-1
6 qllullt + c(v)ll~Il;,
vu
E
c.:
By choosing q 5 1/2C, and increasing C, we derive from (36.5) and (36.14): (36.15)
llulld 5 C, Re(P(x, D)u, u)o
+ C;llulI~,
V u E Cp(R).
We shall now make the additional hypothesis that (36.15) is valid with
c; =o: (36.16)
Ilull~6 C, Re(P(x, D)u, u ) ~ ,
Vu E CCm(R).
Remark 36.4. If P(x, D ) satisfies (36.15), P(x, D ) + 1 satisfies (36.16) as soon as Re I is sufficiently large.
We observe then that (36.17)
(W,
m4 v)o
can be extended as a sesquilinear form on HZ(R) x H;(R) and that (36.16) implies that this form is coercive (Definition 23.1). We may apply the LaxMilgram lemma (Lemma 23.1) and derive (cf. Theorem 23.2) THEOREM 36.2. If (36.16)holds, the operator P(x, D ) defines an isomorphism of HZ(f2) onto H-"(R). Remark 36.5. It is clear that we could have assumed the coercivity of (36.17) instead of formulating the hypothesis (36.16). But "in practice" it is easier to check whether (36.16) holds, rather than whether the form under study is coercive.
Theorem 36.2 enables us to show that the (inhomogeneous) Dirichlet problem, relative to R and P(x, D), has a unique solution. But let us first state this problem. As usual we shall make rather stringent regularity assumptions on R: We shall assume that its boundary r is a smooth (that is, Cm) hypersurface and that R lies on one side of it. We refer now t o the theory of traces developed
CARDING'S
Sect. 361
35 1
INEQUALITY
at the end of Sect. 26. We introduce the traces of orderj = 0,1, . . .,m - 1, on r, defined in (26.20), and the " multiple" trace 7 considered in Theorem 26.9: .
r" : U b ( Y 0 ( U ) , . . . , y"-l(u)),
aju
yJ(u) = - Ir avI
The inhomogeneous Dirichlet problem we consider here is that of finding u in H"(R) satisfying
(36.18) P(x, D)u = f H-"(R) ~
(in R),
(36.19) yj(u) = g ' ~Hm-j-1'2(r)
(onr),
j = O , ..., m - 1.
It suffices to combine Theorem 36.2 with Theorems 26.5 and 26.9 to obtain THEOREM 36.3. If(36.16)holds, (36.20)
u k+
(W,m, P(4)
is an isomorphism of H"(R) onto H-"(R) x n y Z J H m - i - l t Z
Exercises 36.1. Let Pz,(x, 5 ) be a homogeneous polynomial of degree 2m in the variables tl, . . . , 5, , with coefficients in C"@) [cf. (36.2)]. Let N be an arbitrary nonzero vector in R, and let ITN be a hyperplane of R, supplementary to N . Give a necessary and sufficient condition on the roots of the polynomial in the variable z, PZm(x,8 zN ), as x ranges over and 8 ranges over l l N, in order that (36.3) be verified. 36.2. Let R be a bounded open subset of R", P(x, 0)a uniformly strongly elliptic differential operator of order 2m in R [thus (36.3) holds]. Assume, furthermore, that all the coefficients of P(x, 0) belong to B'"(R) (i.e., they are complex C" functions in R, and all their derivatives, of any order, are bounded in R). Prove that, given any nonnegative integer s, there are two positive constants C,(s), C,(s) such that (36.21)
+
w
l I ~ l l S~ Co(s) + ~ Re(P(x, D)u,
+ Ci(s)llullf,
VU E c$(Q).
36.3. Let R and P(x, D)be as in Exercise 36.2, but suppose now that all the coefficients of P(x, D) can be extended as C" functions, say with compact support, in R".Prove that, for a suitable choice of C,(s), C,(s), the estimate (36.21) can be extended to all real values of s. 36.4. Let R be an arbitrary bounded open subset of R",P(x, D ) an elliptic operator of order m (Definition 19.4) with C" coefficients in Q. Prove that
352
[Chap. I11
BOUNDARY VALUE PROBLEMS
every point xo E R has an open neighborhood U c Q such that, for some constant C > 0, (36.22)
IlUllm
5 CIIP(x, D)ullo
7
V U ECCm(u).
Derive from the analogous estimate for the transpose of P(x, D) that the equation (in V)
P(x, D)u =f E H - " ( U )
(36.23)
has a solution u E L2(cr) and, in fact, that u can be taken to depend linearly and continuously on f. 36.5. Let R and P(x, D)be as in Exercise 36.4. Prove that, for all positive real numbers s, there is an open neighborhood Usc R of xo and a constant C, > 0 such that (36.24)
IIullrn+s
5 CsIIP(x, D k l l s r
Vu~C:(us)*
From the estimate analogous to (36.24) for the transpose of P(x, D)derive that, given any point xo of R and any real number s, there is an open neighborhood V c Q of xo such that, for every f E Hs( V ) , there is a u E ,'+"'( V ) satisfying P(x, D)u =fin V . 36.6. Let R and P(x, 0)be as in Exercise 36.4. Prove that, given any real number s and any compact subset K of R, there is a constant C > 0 such that (36.25)
IIullm+s
5 C(IlP(x, D1uII.s
+ IIullrn-l+s),
VU E CCm(K).
By combining this with Lemma 25.3 (Exercise 25.11) prove that, for an appropriate choice of Usand C, ,the inequality (36.24) is also valid when s is a negative real number. 36.7. Let R and P(x, D)be as in Exercise 36.4. Let u be a distribution with compact support in R, such that f = P(x, D)u E Hso (so E R). Let s be a real number such that u E H" and that s - m 5 so. Apply the Friedrichs lemma (Lemma 25.4, Exercise 25.12) to derive from the inequality (36.25) (where s + 1 is substituted for s + m) that the functions p, * u converge weakly in HS+',and therefore that u E Hs0+"'. By using cutoff functions derive from this the following result : THEOREM 36.4. Let P(x, 0)be an elliptic operator of order m, with C" coeflcients in the open subser Q of R". Euery distribution u in R such that P(x, D)u E Hs,",(SZ) belongs to HS:OC+'"(R). COROLLARY 36.1. Every elliptic linear partial differential operator with C" coeflcients in R is hypoelliptic in R (Definition 2.1).
36.8. Let P(x, 0)be an elliptic operator of order m, with C" coefficients in the open set R c R".Let Kbe any compact subset of R. Prove, by exploiting
Sect. 361
GARDING'S
353
INEQUALITY
the estimate (36.25), that the set of C" functions h with support contained in K , satisfying the homogeneous equation P(x, D)h = 0, forms a finite-dimensional linear space. 36.9. Let P(x, D)be a uniformly strongly elliptic differential operator of order 2m, with C" coefficients, in a bounded open subset R of R".Assume that the coefficients of P(x, D) and all their derivatives are bounded in R. Prove that, given any complex number c, the following is true: (36.26) the kernel o f P ( x , 0)-t[ in HE(R) is afinite (perhaps, zero) dimensional linear subspace V, , and P(x, D ) + 5 induces an isomorphism of the quotient space H:(S2)/Vo onto a subspace V' 0 f H - ~ ( t 2 )whose codimension isJinite (and possibly zero); V' is the orthogonal (for the duality) of the kernel of the adjoint operator P(x, D)* [.
+
Prove furthermore that the values of c for which dim V, or codim V' are at least one form a sequence converging to infinity in the complex plane, and all lying in a half-plane Re [ 5 C < + 03. [Hint: Use Rellich's lemma (see Exercise 25.10) and the basic properties of compact operators (see, e.g., [Y, Chap. X, 45, Theorem 2]).]
37 Neumann Problem and Other Boundary Value Problems (Variational Form) Throughout this section R is a bounded open subset of R”,with a smooth boundary and lying on one side of it; the boundary is denoted by r. Boundary value problems, other than the Dirichlet one, occur frequently in applications. For instance, one may want to study the Neumann problem, say for the equation (37.1)
(A. - A)u =f
(in R),
which is to say, the problem where the following boundary condition is imposed on the solution: (37.2)
au
-=g av
onr.
As before dlav stands for the partial differentiation along the normal to R, outwardly oriented. The reasons for studying the Neumann problem are obvious: For instance, one might want to study the distribution oftemperature inside the body R when one only knows the heat flow through the boundary surface r, rather than the temperatures on r, as in the Dirichlet problem. One might also want to study “mixtures” of the Dirichlet and Neumann problems, such as the following one. Let r be represented as the union of two disjoint subsets To, rl (which we assume to be fairly regular). We may seek a solution of the “hybrid ” problem:
(37.3) (37.4)
(A - A)u =f in R, 8U
u =go onr,,
-=gl
av 354
onr,.
Sect. 371
355
NEUMANN AND OTHER PROBLEMS
In this section we are going to show that, as long as we are satisfied with weak solutions and select our problems from a relatively restricted list, a unified treatment is available to us, generalizing the one described in Sect. 23 and based on application of the Lax-Milgram lemma (Lemma 23.1). Predictably, it requires that the problem be reformulated in terms of a suitable coercive sesquilinear form on the " right " Hilbert space, which in general, but not always (see 37.lb), is a closed linear subspace of H'(R), intermediate between the latter and HA(R)-or between H"(R) and H:(R) if we deal with a differential operator of order 2m. Before embarking on the discussion of these variational boundary value problems, we must emphasize the fact that many kinds of boundary value problems of great importance are out of reach of the method we use here: nonvariational boundary value problems, which enter in the Lopatinski class of which a brief description is given in Sect. 38; problems like the oblique derivative one, to be found in 937.5, but where the oblique derivative is allowed to become tangent to the boundary at some point; or problems for certain operators, or systems of operators, which are not strongly elliptic, and are not stable by small perturbation of the same order. The foremost example of the latter is the so-called 8-Neumann problem, which is a boundary value problem for the overdetermined system au/aFj =A. ( j = 1, ... , n), of fundamental importance in the theory of holomorphic functions, of which nothing more will be said in this book.
37.la
The (Weak) Neumann Problem for - A
+ 1, 1 > 0
The sesquilinear form we are going to use is the same as in the Dirichlet problem : a,(u, v)
=
J
n
(A(u6)
+ (grad u ) . (grad 6)}
dx.
Since we assume A > 0, a,(u, v) is coercive (Definition 23.1) on H'(R) [and therefore also on any closed linear subspace of H'(R)]. Let then f E L2(12), g E L2(R). We note that (37.5) defines a continuous antilinear functional on H'(l2) [in the integral over the boundary r, v stands for y(v), the trace of v on r; see Sect. 261. The LaxMilgram lemma implies that there is a unique element u of H ' ( Q such that (37.6)
a,(u, v) =
f 6 dx R
+ Jr g6 dc,
Vv E H'(R).
356
[Chap. I11
BOUNDARY VALUE PROBLEMS
If we take u E C"(n), we may integrate by parts in the left-hand side of (37.6) and rewrite it as (37.7)
u(1
- A)i dx
av d a + Jr u av
[the definition of the surface measure do on r is precisely such that (37.7) is the result of the integration by parts of a,(u, u ) ] . By equating (37.7) to the right-hand side of (37.6) and choosing u in C:(R), since the surface integral in (37.7) is then zero, we obtain (37.1). We return to arbitrary u in Cm(n), now talung (37.1) into account. Actually, we may proceed as follows. Let (0 < E < 1) be a one-parameter family of relatively compact open subsets of Q, with smooth boundaries r,, say monotone increasing and exhausting R. We have
{n,}
(37.8)
a,(u, u ) = lim e++O
I
R.
(nu6
+ (grad u ) . (grad a)}
dx.
Since f €L2(R) we know (and it is not difficult to show) that u belongs to Hkc(Q). We may therefore integrate by parts in the integrals over R, [in (37.8)] and thus obtain (37.9)
al(u, u ) = lim e-t + O
(JQ.f6 + Ir. aua v fi d a ) . dx
-
In view of (37.1) this means that
(37.10) This can be regarded as the meaning of (37.2). In fact, if we choose the open sets Q, carefully, we can find a diffeomorphism 4e of r, onto r and transfer the normal derivatives of u on T, into normal derivatives, on r, of a one-parameter family of functions u, E H2(R) which converge t o u, say in H1(Q), and whose normal derivatives on r converge to g . At any rate, we may consider that we have obtained a weak solution of the Neumann problem (37.1)-(37.2) for A > 0. If we strengthened the regularity requirements on the data f and g, we would be able to establish better regularity properties for the solution u and, if these were good enough, conclude that the boundary condition (37.2) is satisfied in a " classical " sense.
Sect. 371
357
NEUMANN AND OTHER PROBLEMS
37.1b
The Weak Neumann Problem for the Laplace Equation
It is clear that, when 1 = 0, we cannot expect that the solution u of (37.1)(37.2), if it exists, will be unique: The constant functions are evidently solutions of the homogeneous problem. The Dirichlet sesquilinear form a(u, u) =
I (grad
u) . (grad U) dx
R
is not coercive on H'(R). The associated seminorm is the Dirichlet integral seminorm : (37.1 1) whose null space (or kernel) is the linear subspace of H'(R) consisting of the locally constant functions in R. Let us assume that R is connected, for the sake of simplicity. Then D(u) = 0 => u = const. Let us go to the quotient fi'(R) of H'(R) modulo the constant functions (which, of course, form a one-dimensional linear subspace, identifiable with C); we shall denote by ti, 6, etc., the elements of this quotient space. By definition, so to speak, c- D(a) is a norm of fi'(R)and turns it into a Hilbert space; we denote by a(ti, 6) the associated sesquilinear form, which is of course the inner product on fi'(R), and also the transfer of a(u, u) to the quotient. Needless to say, a(u, 6) is coercive, and is therefore amenable to the Lax-Milgram treatment. Now let f be an arbitrary function belonging to L2(R), g one belonging to L2(R). We may ask under which conditions it is true that the continuous antilinear functional (37.5) defines a continuous linear functional on the quotient space fi'(R). The answer is obvious: It must be orthogonal to the constant functions; that is, it must satisfy (37.12)
Jafdx+J gdo=O. r
If this is satisfied, we may proceed as before (in $37.la) and conclude that there exists a function u E H'(R), unique up to constants (if R is connected), satisfying (37.1)-(37.2) when I = 0, in the weak sense described in 537.la.
37.2
The
"
Hybrid " Problem for - A
+ A(1 > 0 )
We refer here to the problem (37.3)-(37.4). We continue to deal with the sesquilinear form al(u, 0). The important step is the choice of the subspace V
358
[Chap. 111
BOUNDARY VALUE PROBLEMS
of H’(R) on which we consider it. It is defined as follows: It is the closure in H’(R) of the space of functions C#J E C”(n) which vanish in a neighborhood of To (in a). In this case V is the space of functions belonging to H’(R)which “vanish on ro”[when To = T, V = HA(R)]. Let f~ L2(R), g1 E L2(rl). By applying once again the Lax-Milgram lemma we see that there is a unique element u of V which satisfies (37.13)
aJu, U) =
f6 dx
n
+
I,
g16da,
VV E V.
1
By a reasoning quite analogous to the one used for the Neumann problem we may interpret (37.13) as meaning that (37.14) (37.15)
(A - A)u =f in R,
au -= av
g1 o n r , .
The fact that u belongs to V means that u = 0 on To-in the usual generalized sense. If we wish to satisfy an inhomogeneous condition (37.16)
u=go
onr,,
it is convenient to make some rather strong regularity assumptions about go : For instance, we may assume that go is the restriction to To of a function
For then we may find h E H2(R) which is on r which belongs to H3l2(r). equal to go on To (precisely, whose trace on r is equal to go on To). Let then w be the unique solution in V of the problem: (37.17) (37.18)
(A - A)w =f - ( A - A)h in R, aw ah _ -- g1 - on rl.
av
av
It is clear that u = w + h is a solution of (37.14)-(37.15)-(37.16). The question of its uniqueness is more delicate, since we must state precisely within what framework we ask this question. For instance, if g1 is sufficiently reguIar, say g1 is the restriction to rl of an element of H1”(T), we may hope to prove that the solution u belongs to HZ(R)(at least in some cases), and we might ask whether such a solution is then unique-in H2(R). It will of course suffice to prove that any element h of HZ(R) which satisfies h = O on To actually belongs to V, i.e., can be approximated in H’(R) by functions which are C“ in the closure of R and vanish identically in a neighborhood of To (in This will certainly be possible in a number of cases (recognizable by sufficiently strong regularity assumptions on To and rl).t
n).
-
f On the subject of the problem (37.3)-(37.4), it is suggested that the reader look at Exercises 37.2 to 37.4 (where the case h 0 is also discussed).
Sect. 371
NEUMANN AND OTHER PROBLEMS
359
37.3 Weak Neumann Problems for More General Second-Order Elliptic Equations We might want to know how the Neumann problem should be posed for elliptic equations more general than E. - A, and first of all for a differential operator of the kind (23.6), that is, (37.19)
L=-
k4
j , k= I
ax
a
ax'
a + 1bj(x) ax' + c(x), n
-
j=l
assuming that the associated sesquilinear form,
is coercive on H'(R). If we proceed exactly as in the case of the operator E. - A, we reach the conclusion that, given any f E L2(n)and any g E L2(Q), there is a unique function u in I f 1 @ ) satisfying (37.21)
a(u, u) =
Jn,fb dx + S, g b do,
V v E HI@).
It remains to interpret this relation. The implication concerning the interior is clear [and is derived by choosing u E C:(n)]:
Lu =f i n n .
(37.22)
The implication concerning the boundary is better understood if we assume that the coefficients ajk are real-valued and the matrix A = ( d k ) is symmetric. It is then positive definite because of the hypothesis that a(u, u) is coercive. By taking u in C"(Q) and integrating by parts in the " leading part " 6(u, u) of the form a(u, v), we obtain Z(u, u) =
1 (grad u) . (A grad
b) dx
n
= JR
u div
div{u(A grad b)} dx -
(A grad 6 ) dx,
R
hence, by the standard Stokes-like formula (Lemma lO.l), (37.23)
-
a(u, v ) = n
uL*u dx
+
u{v
. (A grad b)} do,
where L* denotes the formal adjoint of L,and v the unit vector along the outer normal to r. We have av
v * A grad u = ('Av) . grad u = -,
av:
360
BOUNDARY VALUE PROBLEMS
[Chap. 111
where we use the notation (37.24) Let us remark that if u were sufficiently differentiable, up to the boundary, we could have integrated by parts “the other way,” and obtained (37.25)
a(#, v ) =
Jn ( L U )dx~ +
where now we use the notation (37.26) In (37.24) and in (37.26) it is understood that x lies on the boundary
a/av, is the normal differentiation on r, defined by the metric
r. Then
ajk(x) dxj dxk j,k
(this metric is defined on a).In the case where the operator (37.19) is equal to -A, with A the Laplace operator in n variables, the matrix A = ( d k ) is equal to Z,the n x n identity matrix. In the general case, we recall that, by the coercivity hypothesis on the form a(u, v), the matrix A is positive definite. By reasoning as in the case of the Neumann problem, we derive from (37.22) and (37.23) that
au _ -g
(37.27)
onr,
ava
in a weak sense (and in a classical sense, if sufficiently strong regularity hypotheses are made). The conjunction of (37.22) and (37.27) is the Neumann problem naturally associated with the differential operator L.
Weak Neumann Problems for Higher Order Elliptic Equations
37.4
We shall now describe briefly how the Neumann problem is posed for elliptic equations of order 2m, with m arbitrary. We shall assume that the differential operator we propose to study is given in the variational form, that is, in the form (37.28)
Sect. 371
NEUMANN AND OTHER PROBLEMS
361
where dP = ( d / d ~ ' ) ~. ' * ( d / ~ ? x " ) ~I p~I, = p1 + . * * + pn , and where the ) to L"(R) (and might be required to have stronger coefficients a p s q ( xbelong regularity properties). In the form (37.28) the operator P(x, D)is naturally associated with a continuous sesquilinear functional on Hm(R)x H"(R), namely (37.29) and we formulate the hypothesis that a(u, u) is coercive [on H"(Q)]. We consider then the following continuous linear functional on Hm(R): (37.30)
rn - 1). By virtue of the Laxwhere, say, YEL2(R), g j E LZ(r)(OS j I Milgram lemma we obtain that there is a unique u in Hm(R) satisfying (37.31)
a(u, u) = F(6)
for every u E H"(R).
The problem for us now is to interpret Eq. (37.31). In the interior we have, obviously, (37.32)
P ( x , D)u =f
(in a).
As for the boundary conditions it is convenient to assume as much regularity on the solution u as we need: We may take u in H2"(R) [and u, say in C"(n)]. Of course there remains the question of finding out when the assumption is satisfied; we do not discuss it here, however. In order to write down explicitly the boundary conditions implied by (37.31) we shall, as before, integrate by parts-which amounts to using a Green formula for the differential operator P(x, 0 ) [given in the variational form (37.28)] and the m - 1 first normal derivatives on u. Let zn denote the characteristic function of the set R (which, we recall, is bounded). Let us extend u (in any admissible manner) as a C" function in R". Then
362
[Chap. I11
BOUNDARY VALUE PROBLEMS
where we have used the notation r 5 q to mean r j S qj for every j = 1, . ..,n. Now, (37.33)
where c,(x) is a C" function on r (cf. Exercise 37.6) and 6 , stands for the surface Dirac measure " associated with r,
"
(37.34)
<ar
9
4)
=
J"r 4 do,
4 E c:(R")-
Applying once more Leibniz's formula, we obtain (37.35)
a(u, u) =
1fiP(x,D)u dx + mi'((z)f i ] N j ( x ,D)u da, a
R
j=O
j
r
where the N j are differential operators, defined in the neighborhood of r, of orders 52m - 1 -j , respectively, and which can be easily calculated (cf. Exercise 37.6). If we return then to the expression (37.30) of F(u), we see that the boundary conditions implied by the equation (37.31) are
Nj(x,D)u = g j on r,
(37.36)
j = 0,.. . , m - 1.
In the absence of sufficient regularity (up to the boundary) on the part of the solution u, we can only say that the boundary conditions (37,12) are satisfied in a weak sense. Remark 37.1. It is very important to observe that the boundary operators N j in (37.35) and (37.36) do not only depend on r and on P(x, D) but also on the variational form of P(x, D) used, in our case (37.28). Several choices of the variational form are possible for a given differential operator P(x, D), and they may lead to different boundary conditions. This remark applies also to the second-order Neumann problems described in 537.3, and even to simple operators such as A - A (cf. Exercise 37.5). Remark 37.2. Underlying the treatment of the Neumann problem in the present section is the generalized Green formula, (37.37)
1uP(x, R
2m- 1
D)*u dx =
{ P ( x , D)u}fi d x
n
+
j=O
I {Nj(x, r
D)u}(L)jfi do,
valid for u and u, say, in H2"(R). In (37.37), P(x, D)* stands for theformaf adjoint of P ( x , D) [of which that equation is, in fact, a definition, if we take u and u in C,"(R)], while the N j ( j = 0, . . .,2m - 1) are 2m operators defined in a neighborhood of r (and which can be computed by integration by parts).
Sect. 371
363
NEUMANN AND OTHER PROBLEMS
37.5 Radiation Problems and ObliqueDerivative Problems for - A + A Another interesting problem which is amenable to the variational method is that of solving the equation
(A - A)u =f in 52,
(37.38)
under a " radiation " boundary condition :
au -+au=g av
(37.39)
onr,
where a is a nonnegative function belonging to Lm(T)(fand g can be taken as in the Neumann problem). In this case we consider the sesquilinear form (37.40)
a(u, v) =
n
{iuij
+ (grad u ) . (grad 0)) dx +
I
r
a uU do,
on H'(R). Since the trace of an element of H'(R) belongs to (r)(Theorem 26.2), a(u, u) is clearly continuous on H'(R) x H'(R). Since a 2 0 it is coercive, from which arise the existence and uniqueness of a solution t o the variational problem (37.41)
a(u, u)
=
/ f dx + fr n
ij
gij do.
Assuming sufficient regularity on u (or otherwise interpreting it in a weak sense), we obtain, by Green formula,
which shows that we have solved (37.38)-(37.39)--in a weak sense. Next we consider a certain type of oblique-derivative problems. We consider a smooth, real vector field T o n the boundary r, also a smooth, real function a on r, which never vanishes in r. We shall make the following assumption : (37.43) a-'T
+ T*a-'is a smooth function, everywhere nonnegatiue on I-.
Here T* means the adjoint of T for the natural Hilbert space structure on L 2 ( r )(that defined by means of the measure do or, if one prefers, by the metric induced by the Euclidean metric on R"). Under this hypothesis the variational method applies t o the oblique-derivative problem
.(A - A)u =f in R,
(37.44) (37.45)
(a:
+T)u=g
onr.
364
[Chap. 111
BOUNDARY VALUE PROBLEMS
First of all, after division of Eq. (37.45) by a we may assume that a = 1. We consider then the following sesquilinear form. (37.46)
a(u, v) =
/ {ui + (grad R
u ) . (grad C)} dx
+ (Tu, C)r,
where ( , ),-stands for the duality bracket between H"'(T) (to which the trace of v belongs) and H-'/'(r)(to which Tu belongs). It is seen at once (by virtue of Theorem 26.2) that (37.46) is continuous on H'(R) x H'(R), and it follows from (37.43) (where now a = 1) that it is coercive on H'(R). The Lax-Milgram lemma implies that the problem (37.44)-(37.45) has one, and only one, (weak) solution. By taking T to be not a vector field but a first-order differential operator on I-, submitted to (37.43), condition (37.45) would generalize the radiation condition (37.39), as well as the preceding oblique-derivative boundary condition.
Exercises 37.1. Let R be a bounded domain (i.e., connected and simply connected open set) in the plane R2 whose boundary is a C" curve r. Let us denote by aldt the unit vector field tangent to r oriented clockwise. Let u be a C" function in harmonic in R. Show that if v is another C" function in such that u and v are conjugate harmonics [see (32.12)] in R, then v is necessarily a solution of the Neumann problem
n,
a,
(37.47)
Av=O
inn,
av
au
---
av
at
on
r
(the normal differentiation d/dv is oriented outward). 37.2. Let Q be a bounded open set in R", n 2 2, with C" boundary r, lying on one side of r. Let xo be an arbitrary point of Prove that the set of C" functions in which vanish in some neighborhood of xo is dense in HI@). [Hint: Let B,(xo) be an open ball centered at xo with very large radius R ; show, by a duality argument, that Hh(BR(xo)\{xo})= Hi(BR(xo)) and derive from this the sought result.] Deduce from the preceding result that, when To reduces to a single point, the hybrid problem ( 3 7 3 4 3 7 . 4 ) reduces to the Neumann problem. 37.3. Let B , denote the open ball centered at the origin, with radius R > 0, in the space R"(n 2 2). Write x' = (x', . . . , x " - ' ) and let g(x') be an L" function in R"-'. Prove that
a.
C V R )3
4 H JR" g w # J ( x ' 0) , dx'
extends as a continuous linear functional on HA(B,).
Sect. 371
365
NEUMANN AND OTHER PROBLEMS
Next, consider an open set as in Exercise 37.2 and let To be an open nonempty subset of its boundary r. Prove that there is a continuous linear functional on H1(R) which does not vanish on the subspace of the constant functions, but which does vanish on any C" functions in which is identically zero in some neighborhood, in of To. [ H i n t : Use a continuous extension mapping (cf. Appendix to Sect. 26) of H I @ ) into H;(B, + xo), where xo E To and R is sufficiently large, and exploit the result proved in the first part of the present exercise.] 37.4. Let R and To be as in Exercise 37.3. Call V the closure in H'(i2) of the space of C" functions in which vanish in a neighborhood of To. Let
a,
rl = r\ro.
n
Derive from the conclusion of Exercise 37.3 that the Dirichlet sesquilinear functional, Jn (grad u ) . (grad 17)dx, is coercive on V(cf. $37.2b),and that, as a consequence, the problem (37.3)-(37.4) when I = 0, has a unique solution u in V for every choice of data, f~ L2(R), g1 eL2(T),with both go and g1 identically zero in To. Explain how to obtain a solution of the same problem, but this time with go not identically zero in To. 37.5. Let R be a bounded open subset of the plane R2, lying on one side of its boundary r, which is a smooth curve. Let c(x) be a real-valued C' function in Describe the boundary value problem associated with the sesquilinear form
a.
(37.48) - + - - + c -au - - c -av -
ax1 ax1 ax2 ax2
au aii
ax1 ax2
au
av +-7v-----v) ac au
aX2ax1
ax2
ax
ac
au
dx.
axi ax2
What is the restriction of (37.48) to u, v E H;(R)? 37.6. Let R be the following elliptic region in the plane,
and let xn denote its characteristic function. Given any two integers p , q 2 0, compute (a/dx)q((a/ay)pxn[cf.(37.33)]. Give the expressions of the Neumann boundary operators .N, [defined in (37.35)] relativetoaand to thesesquilinear form
37.7. Let r, 8 be the polar coordinates in R2, g(eie)a sufficiently regular function on the unit circumference. Give a complete description, by means of
366
BOUNDARY VALUE PROBLEMS
[Chap. 111
Fourier series, of the solutions (when they exist) of the boundary value problem (37.49) A u = O
au
if r < l ,
au
when r = l ,
a-++-++u=g
ae
ar
where a, B, y are complex numbers. In the remaining exercises, R denotes a bounded open subset of R",with smooth boundary r, lying on one side of r.
37.8. Show that, whatever the complex number A, the sesquilinear form a(u, v ) =
s
{Auij + (Au)(Aij)} dx
n
is not coercive on H2(Q)(A is the Laplace operator on R", n 2 2). Describe the boundary value problem formarry associated with the equation a(u, v ) =
fij
n
Vv E H2(Q),
dx,
wherefc L'(R) [the student may assume that u E H4(R)]. 37.9. Let Vdenote the subspace of H&l) consisting of the functions u such that grad (Av) E L2(R; C"), equipped with the Hilbert norm (IlvllLcn, + Ilgrad(Av)ll,4 2
11.2
-
Show that the sesquilinear form a(u, u) =
I {(grad
u ) (grad ij)
+ (grad Au) - (grad Aij)} dx
R
is continuous and coercive on I/ and that, as a consequence, given any
f E H&?), there is a unique element u of V such that (37.50)
a(u, v ) =
n
(grad f) . (grad 5) dx,
V5 E Y
From the fact (Theorem 23.1) that A defines an isomorphism of HA@) onto H-'(SZ), derive that we have (37.5 1) (37.52)
A(Au)
a
+ u =f
-(A#) av
=O
in R, on r.
Show that, in addition to (37.52), u satisfies another, nontrivial boundary condition.
Indications on the General Lopatinski Conditions Throughout this section, C2 is a bounded open subset of R”whose boundary is a C“ hypersurface; we shall assume that R lies on one side of 8 0 . We shall consider an elliptic operator of even order m = 2mo,P(x, D), whose coefficients are complex functions, defined and C“ in an open neighborhood of We assume that we are given J boundary differential operators, Bj(x, D ) ( j = 1, . . . , J): These are differential operators with coefficients defined and C“ in an open neighborhood of the boundary dR, and they determine the boundary conditions in the problem we propose to study, namely that of “finding” and studying the solution of the problem:
a.
(38.1)
P(x, D)u =f
(38.2)
B ~ ( xD)U , =gj
in s1, on aR ( j = 1, . . . ,J ) ,
where f and the gj are, say, smooth functions in Pi and as1, respectively. After having encountered a wealth of particular cases, as we have in the earlier sections, it might seem reasonable to ask the following question: Given the operator P ( x , D), which are the sets of boundary operators {Bj(x, D)} (j= 1, . . . , J ) such that the problem (38.1)-(38.2) is well posed? Of course we must state in precise terms what we mean by “well posed.” Our choice of the meaning is dictated by the experience acquired on the subject. Without going into the details let us say that the “natural” meaning will not do: The natural meaning is that the problem is well posed if ( J gl,. . . ,g,) H u is a bijection of C“(Q) x {Cm(dsl)}Jonto C“(n). We need much stronger properties than this. Indeed, we have two basic requirements : Our definition should be invariant under diffeomorphism (this is true of the preceding one); it should be stable under “small” perturbations of the operators P(x, D), Bj(x, D)(1 S j 5 J ) . By small we mean that we may modify each one of these operators by adding 361
368
BOUNDARY VALUE PROBLEMS
[Chap. IT1
to it an operator of the same order, provided that the absolute values of the coefficients, in the latter, and of their derivatives (up to a certain order) are sufficiently small. These two requirements are natural, but it should be said that the reason for setting them is that, without them, the analysis could not be carried through. Now we observe that, if we deal with isomorphisms of a FrCchet space into another one, we cannot be sure of their stability under small perturbations (in a reasonable sense of the phrase). But this is true if we deal with isomorphisms between Banach spaces. In PDE theory, whenever faced with similar circumstances, one has the recourse of representing the space C" as the union of a family of Banach or of Hilbert spaces, ordered into a scale; the Sobolev spaces H s are usually the best suited for such an approach. We shall do it again here. We observe that if mj is the order of Bj(x, D), u H Bj(x, D)uI is a continuous linear map of HS(Z;Z)into H s - m j - 1 1 2(an)(here s is an integer L m j 1).
+
Dejinition 38.1. We say that the boundary problem (38.1)-(38.2) is well posed if, given any s >= sup(m, m, 1, . . . , m, l),
+
+
(38.3) is an isomorphism of Hs(Q) onto
(38.4)
H"-"(Q) x
n J
j= 1
H s - m j - 112
(an).
" Isomorphism," in the preceding statement, refers to the locally convex space (not the Hilbert space) structures. The fact that (38.3) is an isomorphism of HS(Q) onto (38.4) is equivalent to a pair of inequalities, the first of which reads as follows:
J
+ C IIY[Bj(X, j= 1
I
D)uIlIHs-mj-'/2(JR) 9
U E Hs(Q)*
We have denoted by y the trace on (Sect. 26).The second estimate applies to what is called the adjoint of the problem (38.1)-(38.2). Our purpose here is merely to provide an idea of how the analysis goes, and of the results to which it leads, without getting too involved with technicalities, and certainly without giving the proofs. We shall therefore not describe the adjoint problem (cf. Exercise 38.9), nor state what the adjoint estimate is. Observe that the estimate (38.5) leads to the uniqueness and the regularity of the solution, whereas the adjoint inequality leads to its existence.
Sect. 381
369
GENERAL LOPATINSKI CONDITIONS
In principle, we would like to find necessary and sufficient conditions, bearing on the coefficients of all the intervening operators, P(x, D) and the Bj(x, D), and on the open set R, in order that the problem (38.1)-(38.2) be well posed. But a moment of reflection shows that this is somewhat hopeless. Think of the extremely simple case where P(x, D ) = I - A, J = 1, Bl(x, D ) = Z, the identity. What are the values of the (complex) number I for which the corresponding problem (38.1)-(38.2), that is, the Dirichlet problem in R for I - A, is well posed? A precise answer to such a question is beyond our reach, except in very primitive cases, since it would require the precise knowledge of the eigenvalues of the problem. Thus we shall settle for somewhat less. Note that the dual of the space (38.4) can be identified with
(38.6) On the other hand, since C"(Q is dense in H"-"(R), the elements of the dual of the latter space can be identified with distributions in (regarded as a manifold with boundary), and of course it makes sense to speak of the elements of (38.6) which are C" functions: It means that they are ( J + 1)-tuples (f,gl,. . . , g J ) wherefE C"(n) and g j E Cm(8!2) for everyj. Definition 38.2. We say that the problem (38.1)<38.2) is elliptic if the kernel of the mapping (38.3) and its cokernel, i.e., the kernel of its transpose consist solely of C" functions. Since the kernel of (38.3) is closed, and contained in C"(Q), the topology induced on it by C"(n) is identical with that induced by H"(i2) [or with that induced by Hs+k(R), whatever the integer k 2 01. From this and from Rellich's lemma (Theorem 26.11) it follows that this kernel is a locally compact, hence finite-dimensional linear space. A similar argument applies to the transpose of the mapping (38.3):
a
38.1. If the problem (38.1)-(38.2) is elliptic, the mapping (38.3) has a finite-dimensional kernel and cokernel. In particular, it is a Fredholm operator (i.e., if N is its kernel and M its cokernel, its index, dim N - dim M , is finite; the ellipticity implies that M and N do not depend on s). The ellipticity of the problem (38.1)-(38.2) is also equivalent to a pair of inequalities, one bearing on the mapping (38.3) and the other on its adjoint. The estimate relative to (38.3) is obtained by a slight modification of (38.5): PROPOSITION
v u E HS(R),
BOUNDARY VALUE PROBLEMS
370
[Chap. 111
where we may take for s' any nonnegative integer < s, for instance s' = s - 1 (but of course the constant C depends on the choice of s'). We note that the estimate (38.7) is invariant under diffeomorphism, and under small perturbations. The latter now has the following meaning: We may add to P(x, D ) or to each one of the Bj(x, D)a differential operator of the same order (m and mi, respectively) whose principal part has sufficiently small coefficients; the coefficients of the lower order terms are only required to belong to C"(n). The most important new feature of (38.7), absent in (38.5), is that such an estimate is localizable: Suppose that every point xo E R has an open neighborhood U in R" such that the inequality in (38.7) is valid for every u E H"(R) whose support (as a distribution in R") is contained in U ; then it is valid whatever u E HS(R). For by using a C" partition of unity in a neighborhood of 0, whose elements have sufficiently small supports, one can piece the local estimates together (the "error terms" resulting from the partition of unity can be absorbed into the term const IIuJIHs- by increasing the constant). The preceding remarks enable us to considerably reduce the difficulties of the situation. First of all, we may localize it completely in an open neighborhood of a point xo of the boundary. Indeed, in the neighborhood of any point in the interior the estimate (38.7) (or rather, its localization) does not involve the boundary operators Bi(x, D)at all, and it is then a trivial consequence of GBrding's inequality (34.9, applied to P(x, D)*(l - AX)S-"'P(x,D)
instead of P(x, D)[ P ( x , D)* is the formal adjoint of P(x, D);see p. xiv]. Second, we may assume that the neighborhood U of xo E dR is an open ball centered at xo, and that U n dR lies in the hyperplane x" = X,, as we have done systematically in Sect. 27. Third, by choosing U sufficiently small and by perturbing P(x, 0)and the Bi(x, D),we may assume that, in U, P(x, 0)= p ( D ) , Bj(x, D) = bj(D), where p ( < ) and the b,(<) are homogeneous polynomials in n variables, of degrees m and mi, respectively. For the sake of simplicity let us assume that xo = 0 and that R is the upper half-ball { x E U ; X" > 0). The argument is of course closely related to, and in fact an elaboration of, the one in Sect. 27. Here we are concerned with the validity of the estimate I
vu E H"(R), supp u
c u.
Sect. 381
GENERAL LOPATINSKI CONDITIONS
37 1
Just as (38.7) corresponds to the problem (38.1)-(38.2), the estimate (38.8) is associated with the following boundary value problem:
(38.9)
p(D)u=f
(38.10)
bj(D)u = g j
inR, in U n dR,
j = 1,
. .. ,J .
Here f E H"-"(Q), suppf C U , and g j E Hs-mJ-1'2 (an), supp g j E U n d Q (in other words, f and the g j vanish identically in a neighborhood of the sphere which is the boundary of U ) . What we want is that the kernel and cokernel of the mapping
(38.11)
f4 H
(P(D)U,Y ( b l ( D ) 4 ,* * * r(b,(D)u)) 3
consist exclusively of C" functions in ? [recalling l that u E H"(n)]. Here y stands for the trace on the (n - 1)-dimensional open ball U n dR. As we have done in Sect. 27, we shall avail ourselves of a Fourier transformation with respect to the tangential variables. The problem (38.9)-(38.10) is thus transformed into an " initial value " problem for an ordinary differential equation (in the variable t = X" normal to the flat part of the boundary of Q), depending on the parameters t' = (tl,. . . , 5,-
=f
2 0,
(38.12)
p ( t ' , D,)C
(38.13)
b j ( ( ' , D,)Ei = J j ( j = 1, . . . ,J )
when
t
when t = 0.
We are going to look at the solutions of (38.12) when f = g1 = * . . = g, = 0 identically. Since all solutions of a linear ordinary differential equation with constant coefficients (here, depending on the parameters t') are easy to construct by means of the "characteristic roots," let us introduce the latter: We shall subdivide them into two sets, the first, I,, . . . , A,, made up of those whose imaginary part is greater than zero, the second, A,+ 1, . . . , A,, made up of those having a strictly negative imaginary part. Since the operator P ( x , D ) is elliptic and p ( D ) = P,(X,, D ) is its principal part, frozen at a point x, of the boundary, no characteristic root is ever real provided that 5' remains different from zero, which we are assuming throughout the present argument (when 5' = 0 all characteristic roots " collapse " to the value zero).
PROPOSITION 38.2. If the number n of independent variables is 23, the number of characteristic roots lying in the upper hav-plane is exactly equal to m, = m / 2 (and therefore to the number of roots lying in the lower half-plane). Proof. If n 7 2, the complement of the origin in Rn- is connected and we can find a smooth simple curve y joining t' # 0 to - t'. Along y the roots of p ( t ' , T), as a polynomial in T, can be represented as continuous functions of <'.Because of the homogeneity of p , it is clear that the number p of roots in
372
[Chap. 111
BOUNDARY VALUE PROBLEMS
the upper half-plane, at the point t‘, must be equal to the number v of roots in the lower half-plane, at the point -t‘. But p must be constant over y, otherwise some root would have to be real at some point of y. Thus p = v and pi-v=m. Q.E.D.
From now on we assume that (38.14) The number of roots of p ( t ’ , .) whose imaginary part is greater than zero, for every 5’ E R“-’\{O}, is exactly equal to mo = m/2. Why we require this to be true also when n = 2 (or n = 1) will be clear in a moment. Let us mention that the operators P ( x , D ) which satisfy this condition (38.14) at every point of 0 are often called strongly elliptic. The CauchyRiemann operator in the plane is not strongly elliptic. The importance of the characteristic roots I , , . . . , A,, belonging to the upper half-plane lies in the fact that, though every one of the functions exp(iS(5’)t) is a solution of the equation
~ ( l D,)h ‘ , = 0,
(38.15)
t > 0,
only those such that Im A,(t’) > 0 can be regarded as Fourier transforms, in the variables x’, of smooth functions (or even, for that matter, of distributions). Indeed, the other ones grow much too fast as 1 l’\ 4 + 00, as soon as t > 0. Let us assume for the moment that the Aj(t’) ( j = 1, . . . , m,) are pairwise distinct, for every t’ # 0. This restriction will be removed later. In this case all solutions of (38.15) are of the form
(38.16)
C j ( t ’ ) exp(iAj(t’)t).
h(t’, t ) = j= 1
If we want this function to satisfy the homogeneous boundary conditions
(38.17)
bj(t‘,D,)h = 0
j = 1, . . . , J,
when t = 0,
we must have
(38.18)
bj(5‘,A,(t’))C,(t’) = 0 ,
k= 1
j = 1, . . .,J .
Suppose that (38.18) had nontrivial solutions C(t’) = (Cl(t’), . . ., Cmo(t’)). Then we would be able to choose the latter to be any function of [’ we wished, provided that it be valued in the null space of the matrix
(38.19)
(bj(t’, ‘k(t‘))j= 1 ,
...,J , & =
1,
...,mo *
For instance, by multiplying it by a suitable scalar a(t’), we could achieve easily that h(t’, 0) not be of rapid decay as 1 [‘I -, co,and thus not be the
+
Sect. 381
GENERAL LOPATINSKI CONDITIONS
373
Fourier transform of a smooth function of x’, contrary to our requirement. We must therefore require that the matrix (38.19) define an injective mapping of Cmointo C J . Similar considerations concerning the cokernel of (38.12)(38.13), i.e., concerning the kernel of the transpose problem, lead us to the requirement that also the transpose of (38.19)must be injective. Both conditions can be combined and rephrased as follows:
(38.20) J
= m,
and the determinant of (38.19) is different from zero for every
5‘ E R“-’\(O). This formulation has the defect of assuming that the Aj(l S j S m,) are distinct. Rather than removing this restriction at the present stage, we shall avail ourselves of it in the construction of the solution of (38.15), that is, of (38.12) when the right-hand side f is identically zero, satisfying the boundary conditions (38.13). In fact, let us denote by Cj(t’,t) the solution of (38.12)(38.13) when f and all & vanish identically, except B j . We may write
(38.21)
o j ( t ’ , t ) = B j ( t ’ ) f j ( < ’ , t)7
where E j is the same as Dj except that Sj = 1 in its definition. The function E j must have the form (38.16), with constants C j satisfying the system of linear equations
where d j j . is Kronecker’s index. Cramer’s rule yields at once
(38.22)
f j ( t ’ 3
t ) = Bj(tf>t)/Nt’),
where
(38.23)
B(5’) = det[bj(t’, A k ( t ’ ) ) l >
i.e. B((’) is the determinant of (38.19), while Bj(t’7t ) is the same determinant, except that the row (bj(5‘, Al(t’)), .. . bj(5’, A,, , ((’))} has been replaced by {expWl(t’)t), . . . exP(i~mo(<’)~)l~ The noteworthy feature of formula (38.22) is that it remains valid even when the characteristic roots are not distinct, as the reader may easily ascertain (cf. Exercise 38.4). Observe that the functions E j are well defined (as funclions of (’): Indeed, they are symmetric functions of the characteristic roots lying in the upper-plane, and therefore they are functions of the coefficients of the polynomial in the variable T,
(38.24)
374
BOUNDARY VALUE PROBLEMS
[Chap. I n
Since no characteristic root ever crosses the real axis (as long as 5’ # 0) the coefficients of M + are continuous functions of In particular we see which hypothesis should replace (38.20) in the general case :
r’.
(38.25) J = mo and the functions P,((‘, t ) areJinite for all
r’ E R”-‘.
Of course, this is a condition bearing on the ratios B j / B (and not only on the denominator B: the latter may have zeros, but the factors which vanish must be canceled by factors in the numerator). There is an elegant restatement of (38.25), whose proof we leave to the student (Exercise 38.6): (38.26) the polynomials in T, bj(t‘, T) ( j = 1, . .., J =mo),are linearly independent modulo M ‘(t‘,T ) . It is obvious that there is an “invariant” formulation of the condition (38.3l), involving the normal (or, rather, the conormal) to the boundary dn of IR (without assuming that it has been flattened) and the boundary operators Bj(x, D,) (precisely, their principal parts), as well as the operator M+(x, D,),extracted from the principal part Pm(x, D,) of P(x, 0,) in the manner M +(D,, , 0,) was extracted from p(D,. , D,)-with the normal direction to the boundary playing the role of the t-axis. We leave this invariant formulation to the reader. The boundary operators which satisfy it are often said to cover P(x, D) or to associate with it the Lopatinski (or LopatinskiShapiro) boundary conditions. In order to complete the construction of the solution of (38.12)-(38.13) we have yet to construct the solution, denoted by 8, corresponding to the case of homogeneous boundary conditions, when J j = 0 for all j = 1, .. . , m , . We A
proceed in a manner analogous to that used to construct Gov in Sect. 27. We introduce the polynomial
n
Zmo
(38.27)
M - ( t ’ , 7) =
j=ol,+ 1
(7 -
Aj(t’>).
We introduce the solution B+(t’, t ) of the homogeneous equation
M + ( t ‘ , D,)E:+ =o, satisfying the initial conditions
(38.28)
(38.29) Similarly, we denote by E- the analogous solution of M - ( t ‘ , Of),!?- = 0. Of course, when the characteristic roots are distinct, 8, is of the form (38.16) with a suitable choice of the constants Cj(t‘).In fact,
(38.30)
E+(t‘, t ) = p(’
*
9
t)/v(J-,, Amo), *
>
Sect. 381
GENERAL LOPATINSKI CONDITIONS
375
where V(A,, . . . , Rmo) is the Vandermonde determinant det{(A:-l)l j , k 6 m a 1 whereas the numerator p(Rl, .. ., Rmo; t) is the same as the Vandermonde determinant, except that the last row, {AT-’, . . . ,A::-’], has beenreplaced by {exp(iA,t), . . . , exp(iA,,
t)}.
The expression (38.30) remains valid even when the roots A j “coalesce.” We have an analogous expression for E - , using the R j for j > m, . By the general argument for ODE described in Sect. 4, we see that +m
fl(t’,t ) = - Jr
8-(t’,t - t‘)f(t’, t’) dt‘
=-
satisfies the equation M - ( t ’ , Of)!, =f for t 0. Because of the signs of the imaginary parts of the Rj when j > m , and because of the integration from t to 03,fl is tempered with respect to t’. Let us then set
+
-s 1 8+(t’, f
(38.31)
D(t’,
t) =
+m
t - t‘)g-(t’, t’-
0
t”)f(t’, r”) d t ‘ d t “ ,
1’
and the same argument, applied now to 8 + ,shows that M’(t‘, D,)D =Ilfor t > 0 and that D is tempered with respect to g‘. Does 8 satisfy the homogeneous boundary conditions bj(t’,D,)D = 0 when t = 0 for all j = 1, . . . , m,? If it did, we would take D as the solution of (38.12)-(38.13) when the gj all vanish. It is clear that this is so when the orders of the bj(t’, Of) do not exceed m , - 1 (as in the Dirichlet problem); then the definition of g + implies that all the derivatives of D of order <m, vanish when t = 0. But if the orders of the b j exceed m, - 1 for sornej, this need not be the case. On the other hand, we need not worry: from the argument which precedes, we know how to construct the solution h of (38.15), satisfying bj((’, Dr)h = bj(t‘, D f ) 8 when t = 0 ( j = 1, . . . , m,), and it will suffice to take 0 = D - h. An explicit formula for 6 is easy to obtain, and so is one for the solution 0 = 6 8, * * * Om, of the general problem (38.12)(38.13). Let us retrace our steps. First of all, after an inverse Fourier transformation with respect to t’,we must go from the expression of u in terms off, gl,.. ., g,,,,, , to the estimate (38.8) (and to the analogous adjoint estimate). Then we must derive (38.7) from (38.8). We must also devise a concrete realization of the adjoint problem and submit it to a similar treatment. Finally we must show that these estimates, (38.7) and the analog for the adjoint problem imply what we want: existence and uniqueness modulo finite-dimensional kernel and cokernel, and regularity of the solutions up to the boundary. Of course, the latter part is quite technical and is not described here.
+ +
+
376
[Chap. 111
BOUNDARY VALUE PROBLEMS
Exercises 38.1.
(38.32)
Verify that the Dirichlet boundary conditions
-1
aj u
a d an
=gj
Hmo-1-
112
(an).
j = 0,..., rn,
- 1,
‘‘cover” every strongly elliptic operator P(x, D) of order 2m, (cf. Theorem 36.3). 38.2. Is it true that the radiation problem and the oblique-derivative problem considered in 537.5 are of Lopatinski’s type? 38.3. Consider the oblique-derivative problem (37.44)-(37.45). Prove that if the function LY E Cm(r)vanishes at some point of r, that problem is not of Lopatinski’s type. 38.4. Consider the boundary value problem (38.9)-(38. lo), assuming that m, = J = 2, and that
At’, t) = (7 - 4t’>)2(? with Im 1 > 0, Im p c 0 for any 5’ # 0 (t’E R”-’).Prove that, under these
(38.33)
hypotheses, the function defined in (38.30) is
(38.34)
E + ( t r ,t ) = teiL(C’)‘,
and the functions E j , defined in (38.22), are given by
Fj([‘, t ) = { ~ , b ~ (A(t‘))}-’teiL(C’)t, t‘, where D,= - i slat. (38.35)
38.5. Consider the boundary problem (38.9)-(38.10) taking p ( D ) = A’ (A is the Laplace operator in R“),b , ( D ) = A, b2(D)= AD,” (D,,= - i a/axj). In this case, compute g*(t’,t) and Fj(t’, t ) , j = 1, 2. (Cf. Exercises 37.8 and 37.9.) 38.6. Prove that conditions (38.25) and (38.26) are equivalent. 38.7. Let A be the Laplace operator in RZ,where we shall use polar coordinates r, 6. Let 0 denote the open unit disk r < 1, r the unit circumference r = 1. Describe all first-order operators B(D, , D,)such that
(38.36) (38.37)
Au = f Lz(R) ~ in R, B(D, , D,)u = g E H’/’(r)on r,
is a Lopatinski boundary value problem. 38.8. Let P(x, 0 ) be a strongly elliptic operator on 32 of order m = 2m, [thus P(x, D ) satisfies (38.14)]. For j = 1, . . ., J = mo set Bj = (d/av)’”~, where ml < m2 c . -.< mJ < 2m0 - 1. Prove that the boundary operators
Sect. 381
377
GENERAL LOPATINSKI CONDITIONS
{ B I ,.. ., BJ} cover P(x, D ) (see p. 374; a/dv is the differentiation along the normal to r, outwardly oriented). 38.9. Same notation as in Exercise 38.8. For j = J + 1, . . ., 2m,, set Bj = (a/av)"j with m J f l < . . . < m Z J< 2m, - 1 such that {m,, . . . , m2,} is a permutation of the interval (0, 1 , .. . , 2m, - 1) c Z, . Let P(x, D)* be the formal adjoint of P ( x , 0)(see Remark 37.2). Prove that there are 2J boundary operators C1,. . . , C, Cj+I , . . . , Cj,, uniquely determined, with the following properties:
(38.38) the order of C j is 2m, - 1 - m , + j and that of Cj" is 2m, - 1 - m j , (38.39) ,for all u, v
E
Cm(a),
cf J
jQP(x, D ) u t i d x - s u P ( x , D ) * v d x = n
j=1
r
BjuCj+judo
- IrB,+
-
u
C j v do.
The boundary value problem defined by ( P ( x , D)*,C , , . . . , C,) is sometimes called the adjoint of the problem defined by ( P ( x , D), B,, . . . , BJ). Formula (38.39) is called a generalized Green formula. 38.10. Use the same notation as in Exercise 38.9. We take P ( x , 0)to be a second-order strongly elliptic differential operator on J = 1, and B, = Z, the identity. Describe the adjoint boundary problem and compare with the problem in $37.3. 38.11. Use the same notation as in Exercise 38.9. Let P ( x , D ) be an arbitrary strongly elliptic operator of order m = 2m, on take J = m, and B j = (a/dv)j-' (a/& is the differentiation along the normal to r, outwardly oriented). Describe the adjoint problem.
a,
a;
This Page Intentionally Left Blank
CHAPTER IV
Mixed Problems and Evolution Equations
This Page Intentionally Left Blank
39 Functions and Distributions Valued in Banach Spaces As we shall see in the next section, mixed problems combine certain features of the Cauchy problem (Chapter 11) with features borrowed from the boundary value problems (Chapter 111). As in the Cauchy problem there are n + 1 independent variables: n " space" variables x = (A?, . . . , x"), and one " time" variable t . We shall deal with functions and distributions defined in a " cylinder" Q x 10, T[, where R is an open subset of R" (where x varies) and T a number greater than zero (thus t ranges in the interval 10, T[-or in any interval of the real line). It is highly convenient to regard these functions of ( x , t)as functions of t , valued in a function space in the variable x : Most of the time, the latter will be one of the Sobolev spaces, H'(Q), HA(Q), H-l(R), . . .. Such an approach leads to a considerable streamlining of the arguments, and also, as should be expected, to fairly general results, which can then be applied to a variety of cases-often beyond those used as a starting point. But it has an obvious drawback; that of requiring the reader to be familiar with functions and distributions valued in infinite-dimensional linear spaces. The purpose of this section is to provide the basic facts about such functions and distributions. In view of our needs in the remainder of the chapter, the basic facts are straightforward generalizations of the scalar case. They hardly require special mention, and to a large extent our intentions here are more psychological than mathematical : We hope to convince the reader that there is not much difference between distributions valued in a (complex) Hilbert space and scalar ones. Essentially the same rules apply to both kinds.
381
382
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
[Chap. IV
39.1 Continuity and Differentiability of Functions Valued in a Banach Space
-= -=
Let J denote an open interval To t TI in the real line. In most of what follows it could be replaced by an open subset of a &dimensional Euclidean space Rd (d 2 1) or by a smooth manifold. We shall deal first with functions defined in J and valued in a complex Banach space E. Since E is a topological space it is obvious what we mean by such a function f being continuous at some point t of J, or in some subset of J. To say that f is differentiable at t is equally obvious: It means that the difference quotient (39.1)
6,f(t)
= h-'(f(t
+ h) - f(t)}
converges in E as h # 0 converges to 0 (h real and small). Note that this definition fully exploits the fact that E is a topological linear space: In addition to convergence, we have also used vector subtraction and scalar multiplication of vectors. The limit of 6,f(t) is the derivative of f a t t , f ' ( t ) [or (df/dt)(t) or f(t)]. If it exists regardless of what the point t of J is, it defines a function f ' in J. If the latter function is continuous, we say that f is once continously differentiable, or C'. More generally, for any k, 1 5 k 5 +co, we say that f is Ck,or k-times continuously differentiable, iff ' is Ck'for every k' < k. We say that f is analytic in J if, whatever to E J , the Taylor series of f at t o , (39.2) converges in E uniformly in some compact neighborhood of to in J. This implies that f can be extended as a holomorphic function to the complex values of t in some open neighborhood U of J in the complex plane C. The fact that f is holomorphic in U means that, whatever t E U , the quotient (39.1) converges in E when h # 0 converges to 0 in the complexplane. It should be noted that one can equip E with a topology different (at least when dim E = + 00) from the one defined by the norm, still compatible with its linear structure, and yet relevant to many a situation, for instance, with the weak topology a(E, E*), or with the topology of uniform convergence on the compact subsets of the dual E* of E. This leads to different classes of continuous, continuously differentiable, analytic functions valued in E : Such a function f will be, for instance, weakly (or scalarly) continuous (resp. continuously differentiable, resp. analytic) if the same is true of every scalar function (39.3)
t~
(x*, f ( t ) ) ,
x* E E*
Sect. 391
B-VALUED FUNCTIONS AND DISTRIBUTIONS
383
(the bracket expresses the duality between E and its topological dual, E*). We shall not pursue here the question of the relations between the various classes of functions defined by using different linear (locally convex) topologies on E (see Exercises 39.2 and 39.3).
39.2
Integration of Functions Valued in a Banach Space
The definition of the Riemann integral of a continuous function f : f -+E is immediate : (39.4)
is the limit of the Riemann sums (39.5) The fact that the Riemann sums make up a convergent net (when f is continuous) follows directly from the fact that E is complete. The operation (39.4) extends, as Lebesgue integration, to functions f E L’(J; E). The Lebesgue spaces L p ( J ;E) (1 =< p < + a)are not difficult to define. It suffices to complete the linear space of E-valued continuous functions with compact support, C:(J; E),for the natural norm (39.6) The trouble with this definition is that the elements of L p ( J ;E) are “ideal objects.” To recognize them as classes of functions modulo the standard equivalence relation of being equal almost everywhere (a.e.) requires the introductions of the standard concepts of Lebesgue theory, and first of all of measurable functions. For the sake of brevity, we shall follow the Lusin approach : Dejinition 39.I . The function f : J -+ E is measurable given any compact set K c J and any number E > 0, there is a compact subset K‘ of K such that the measure of K\K’ is less than E and that the restriction off to K ’ is continuous.
We may then introduce the spaces Y P ( J ; E) consisting of the functions f : J + E having the following two properties: (i) f is measurable; (ii) JJf/j, belongs to the “scalar” space Lp(J)= Lp(J;C). This definition is also valid f o r p = +oo. Note that iff is measurable (with values in E), its norm llfll, is also measurable (with values in R+).Then let N p be the linear subspace of
MIXED PROBLEMS AND EVOLUTION EQUATIONS
384
[Chap. IV
P ' ( J ; E) consisting of the functions f which vanish almost everywhere or, which amounts to the same, such that
(39.7) The completion Lp(J;E) of C:(J; E) for the norm (39.6) can be identified with the quotient space P ( J ; E)/Np-also " normwise," if we equip the latter with the quotient norm derived from (39.6) [which can be regarded as a seminorm on Y p ( J ; E)]. The quotient Banach space P m ( J ; E)/N" is the space L" ( J ; E). The reader should beware that, although every L"(J; E) (1 2 p 5 + co) is a Banach space, in general, that is, unless E i t s e y i s a Hilbert space, L2(J;E) is not a Hilbert space, and unless E itself is reflexive, Lp(J;E) (1 < p < + co) is not reflexive. It is true, however, that if E* is the dual of E, equipped with the dual Banach space structure, the dual of Lp(J;E) for 1 2 p < 00 is canonically identifiable to L4(J;E*), q = p / ( p - 1). The duality is expressed by
+
If E is a Hilbert space, L2(J;E) is equipped with the canonical Hilbert space structure defined by the inner (Hermitian) product
(39.9)
(f, g)L2( J;E) =
JJ
(f (
t ) d~ t ) ) E dt-
In relation with the latter remark, we might also observe that, when E is a Hilbert space, the Fourier transformation (with respect to the variable t )
f(z) = S f m e - ' " f ( t ) dt
(39.10)
--m
defines an isomorphism (for the Hilbert space structure) between Lz(R: ; E) and L2(R,' ;E).This is no longer true, of course, when E ceases to be a Hilbert space. When E is merely a Banach space, and L2(R' ;E) carries the canonical Banach space structure, the Fourier transformation is an isomorphism for the locally convex structures underlying the Banach space structures, but not for the latter: More specifically, the Plancherel formula is no longer true (Exercise 39.6).
39.3
Distributions Valued in Banach Spaces
A scalar distribution T in J is a continuous linear map C:(J) + C. It is therefore natural to define an E-valued distribution as a continuous linear map C,"(J) + E. The space of such distributions will be denoted by 9 ' ( J ; E).
Sect. 391
B-VALUED FUNCTIONS A N D DISTRIBUTIONS
385
It is equipped with the topology of uniform convergence (of mappings valued in E) on the bounded subsets of Cp(J). Any locally L' function valued in E, f, defines an E-valued distribution by the formula
4
(39.1 1)
-1
4 E CCm(J>.
4(t)f(r)dt,
As in the scalar case (39.1 1) defines an injection of Li,,(J; E) into 9 ' ( J ; E) (the injection is continuous for the natural topologies). Any element of L/,,(J; E), viewed as a distribution, is called afunction. A distribution T in J with values in E is called a Radon measure in J , valued in E,if it can be extended as a continuous linear map C(zJ)+E. Of course, functions are particular cases of Radon measures: L/,,(J; E) c A ( J ; E). The standard operations on distributions are defined as in the scalar case, for instance, differentiation of distributions : (39.12)
d
drT($) = -T(dr4),
d
'=dt'
or multiplication by a scalar C" function a : (39.13) m 4 ) = TW). Binary operations such as tensor product and convolution are more delicate, since there is not necessarily a pairing between the values of distributions considered here. We return to this important question at the end of this section. Of course, there is no difficulty in defining the tensor product between distributions, all of which, except possibly one, are scalar. We leave it to the reader. The local structure theorem is valid for E-valued distributions: This is because E is normed. In general, when E is a locally convex space, e.g., a Frkchet space but not a Banach space, the theorem is not valid. Thus E being a Banach space, T an arbitrary E-valued distribution in J, and J' an arbitrary relatively compact open subset of J, we may write M
(39.14)
a:f,
T=
in J',
k=O
where the f k can be taken to be Radon measures, or continuous functions, or L2 functions, in J, with values in E. The proof is similar to the one in the scalar case. We shall now define the Fourier transform of an E-valued tempered distribution. First of all, let us introduce the space Y(E) of C" functions in R', valued in E, rapidly decaying at infinity, that is, such that (39.15)
sup ((1
t€R'
+ t2)k
M
j=O
Ild/u(l)llE] <
+
00,
k , M = 0, 1, . . ..
386
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
The topology on Y(E) is defined by the norms in (39.15). It is easily seen (as in the scalar case) that the Fourier transformation (39.10) defines an isomorphism of Y(E) onto itself. Let us then define Y’(E) as the space of continuous linear mappings Y = Y ( C ) -,E, identified with a subspace of Q’(R’ ; E), that of the tempered E-valued distributions on the real line. The Fourier transform 9 T of an element T of Y’(E) is defined by the formula (39.16)
(9”T)(4)= T(9”4),
4 E 9.
It is seen at once that 9, thus defined on Y’(E), extends the Fourier transformation (39.10) defined, say, on L2(R’; E). It is an isomorphism of Y’(E) onto itself. If E = F*, the dual of a Banach space F, 9 : Y‘(E)--* Y’(E) is the transpose of 9” :Y(F) -P Y(F). Of course, we could not have adopted this as a definition of the Fourier transform, since not every Banach space is the dual of another Banach space. The definition of the Sobolev spaces on the real line, with values in the Banach space E, follows the usual procedure: If s is any real number, H”(E) is the space of E-valued tempered distributions u whose Fourier transform G ( t ) belongs to the space L2(R’; E) with respect to the measure (1 2’)s dt. We have continuous injections, with norms 1, H”(E)4 H”’(E) if s’ < s. The space Y(E) is dense in each H”(E)and H-’(E*) can be canonically identified with the dual of H”(E).Thus H“(E)is reflexive if and only if E is reflexive. When E is a Hilbert space, one equips H”(E) with the natural Hilbert space structure derived from that of E. The local Sobolev spaces can be defined in the obvious way: H,”,,(J; E) is the space of distributions T defined in J , valued in E, such that aT E H‘(E) whatever a E C:(J). As for H,”(J;E), it is the subspace of H”(E)made up of the elements of H“(E) which have compact support contained in J .
+
39.4 Binary Operations on E-Valued Distributions There are many operations bearing on pairs (or N-tuples) of distributions and functions which we have not yet looked at-operations which are vital to the processes of analysis, such as multiplication of a distribution with a (smooth) function, convolution of two distributions, tensor product of two distributions (which, in the latter case, must “ depend” on independent variables). The difficulty lies in the fact that we must be given a “natural” way to pair the values, that is, to pair vectors, possibly belonging to different linear spaces. To handle all these situations in a unified manner, we introduce three Banach spaces E, F, G and a continuous bilinear map fi : E x F + G . Let us mention four important examples of such mappings:
Sect. 391
387
B-VALUED FUNCTIONS A N D DISTRIBUTIONS
Example 39.1. Take F = E*, the strong dual of E, and G field, /3 the duality bracket mapping (e, e * ) w (e*, e ) .
= C,
the complex
Example 39.2. Let E = F = G be a Banach algebra and /I the multiplication in it. Example 39.3. Let X, Y, Z be three Banach spaces and E = L ( X ; Y),the space of bounded linear operators X Y,equipped with the operator norm, F = L(Y; 2) and G = L(X; Z). Take j? to be the composition mapping (A, B ) W B A. --f
0
Example 39.4. The preceding three examples are particular cases of the following: Take F = L(E; G) and let 0 be the mapping (e, A) H Ae.
Let us refer to the continuous bilinear mapping p : E x F + G as a pairing.
PROPOSITION 39.1. If /3 is a pairing, u (resp. v) an element of C?(J; E) [resp. of Cm(J; F)] ,then tH/I(u(f), v(t)) belongs to Cfm(J; G ) . This proposition is evident. It follows at once from the closed graph theorem that the bilinear mapping (39.17)
(u, v)H/~(u,
v) : C p ( J ; E) x C m ( J ;F) -+ C,"(J; G)
is separately continuous. Consider now the mapping (39.18)
4 I--,J j?(u(t), v ( t > > Wdt
from C p ( J ) into G. It is immediately checked to be continuous and hence to define a distribution u * v on J , with values in G. For fixed v, and using the B density of Cp (J;E) in 93' ( J ; E), we may extend the mapping U H u*vto the B whole of W ( J ; E). We have thus defined the j?-multiplication of E-valued distributions in J with F-valued C" functions (also defined in J ) . One can check that the p-multiplication is a continuous bilinear map
W ( J ; E) x C"(J; F) -+ 9 ' ( J ; G). The reader may apply this definition in Examples 39.1 to 39.3. He will see, for instance, that if T is an E-valued distribution and A ( t ) a C" function of t with values in the Banach space of operators L(E; G), A(t)T is well defined as a G-valued distribution. Let us momentarily denote by x the variable in J , by y the one in another open interval J'. Let S (resp. T) be an E-valued (resp. F-valued) distribution in J (resp. in J'). In the same manner as in the scalar case we may prove that 4(x9 Y ) Ty(4(x,Y ) )
388
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
is a continuous linear map of C:(J x J') into C p ( J ; F). We may therefore form the pairing j(S,, Ty(4(x,y))) E 9 ' ( J ; G). If $ ( x ) is any scalar C" function with compact support in J , equal to one in a neighborhood of the x projection of the support of 4, we may set (39.19)
(S, 0 Ty)(#) = B(S, Ty(4(x7 Y)>)($). 7
B
It is checked at once that the definition (39.19) does not depend on the choice of the cutoff function J/.By applying it to decomposable functions #(x, y ) = 4j(x)4;Cy) and availing ourselves of Fubini's theorem for distributions, we see also that we could have interchanged the role of x and y in order to arrive to the definition [i.e., first compute S,(c$(x, y)) E Cp(J'; E) and form P(S,(+(x, y ) ) , T,)]. We have thus defined the 8-tensor product S, @ T,: It B
defines a separately continuous bilinear map 9 ' ( J ; E) x 9 ' ( J ' ; F) + 9 ' ( J x J ' ; G). From there we can easily define the j-convolution S * T of two distributions B on the real line, valued in E and in F, respectively. We use the formula (39.20)
(S * V(4) = (S, 0 T,)(4(x B
B
+Y)),
4 E C,m(R1).
As usual the formula (39.20) does not make sense for arbitrary distributions; the pair S, T must be " convolvable," i.e., meet certain requirements of support or of decay at infinity. It will make sense, for instance, when either S or T has compact support, or when both S and T belong to L' (with values in E and F, respectively). Much more can be said, but at the elementary level where we have placed ourselves most of it would be a mere repetition of the scalar theory. It is so for most properties of the Fourier transformation (e.g., the Paley-WienerSchwartz theorem) and of the convolution, and their mutual relations. Exercises
(In the following exercises, E,F, G stand for complex Banach spaces and J for an open interval in the real line, where the variable is t.) 39.1. Let m be a nonnegative integer, A(r) a C" function in J , valued in the Banach space of bounded linear operators of E into F, L(E; F). Show that its transpose, 'A(t), is a C" function in J , valued in L(F*; E*) (the asterisk indicates duals). 39.2. Let rn be an integer 2 1, u(t) a mapping of J into E. Prove that, if the complex-valued function (x*, u(t)) in J is C", whatever the continuous
Sect. 391
B-VALUED FUNCTIONS AND DISTRIBUTIONS
389
linear functional x* on E, then the function u(t) itself is P-'. [Hint: Prove first that if all the functions (x*, u ( t ) ) , x* E E*, are Lipschitz continuous in J , the same is true of u itself.] 39.3. Let H denote a Hilbert space, el, . .., e m , ..., a complete orthonormal system in H (assume dim H = + 00). Consider the function u(t) in the closed interval 0 5 t 5 1, defined as follows:
u(0) = 0, u(t) = (1 if
- O)e,+,
t = O(l/rn)
+ Oe,
1 + (1 - 8) m+l
Prove that, given any x* E H*, the scalar function (x*, u(t)) is continuous in [0, 11, but that u is not continuous at t = 0, as a function valued in the normed space H. 39.4. Let 0 be an open subset in the complex plane, where the variable is denoted by z. Let /? denote a continuous bilinear map of E x F into G , u (resp. v) a holomorphic mapping of 0 into E (resp. F). Prove that p(u, v) is a holomorphic mapping of 0 into G . 39.5. Let K be a compact interval in the real line, f(t) a continuous mapping of K into the Banach space E. Prove that the linear functional on the dual E* of E,
x*
(39.21)
HIK
(x*, f(t)> dr
defines a unique element of E, and that this element is equal to the Riemann integral defined in $39.2. Would this have been true if we had not assumed E to be complete, but only normed? 39.6. Let E be the two-dimensional space C2,equipped with the norm IlX((E=
sup( 1 x1 I , 1 x2I ),
x = ( 2 ,x2) E
c2.
Give an example of an L2 function f(t) in the real line, valued in E, with Fourier transform f(7) such that (39.22)
39.7. Describe the set of pairs (s, s') E R2 such that the Dirac measure on the diagonal of R" x R", 6(x - y ) , as a distribution of the variable x valued in the space of distributions in y , belongs to the space H"(R:; H"'(R:)). Derive from the result that the local structure theorem does not hold in .9'(R;; Cm(R,!)). 39.8. Let H be a Hilbert space, A a densely defined, possibly unbounded, self-adjoint linear operator on H. Prove the following assertion: In order
390
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
that every distribution u(t),defined in the real line and valued in H, which is a solution of the homogeneous equation du
- =Au,
(39.23)
dz
be a C' function of t , valued in H, it is necessary and sufficient that A be bounded. Then all solutions of (39.23) are analytic functions of t in the real line, valued in H (and in fact can be continued to the complex values of t as entire functions in C). 39.9. Let E be a Banach space, f a locally integrable function, defined in the real line and valued in E. Prove that there is a subset S of R', having Lebesgue measure zero, and a separable closed linear subspace E, of E such that f ( t ) E E, for all t E R'\S. 39.10. Let E, F be two Banach spaces, L(E;F) the Banach space of bounded linear operators of E into F, f + + A ( z )a mapping of R' into L(E; F) such that the following holds : (39.24) Given any u E E, any v* E F* (dual of F), tt+
Is it then true that, whenever u(t) is a measurable function in R', valued in E, A(t)u(t) is a measurable function in R' valued in F? 39.11. Let E, F, A ( t ) be as in Exercise 39.10 but suppose now that A ( z )E L"(0, T; L(E; F)). Prove that u H Au is a bounded linear map of L2(0, T; E) into L2(0,T; F). 39.12. Let z;Z be a bounded open subset of R",having a C" boundary and lying on one side of it. Let m be an integer 2 1 and let gj(*, t) (,j = 0, . . . , rn - 1) be m C'(0 5 k 5 00, k independent 0f.j) functions o f t in the closed interval [0, TI (T > 0) valued in the Sobolev spaces H r n - j - ' l 2(r),respectively. Show that there is a C" function of Z, 0 5 t 5 T, valued in H"(R) such that, for e a c h j = 0, . . . , m - 1,
+
39.13. Let E, F be two Banach spaces, L,(E; F) the space of bounded linear operators E + F equipped with the topology of pointwise convergence in E. Let X be a compact Hausdorff topological space, f a continuous mapping of X into E, A a continuous mapping of X into L,(E; F). Is it true that A f is a continuous mapping of X into F?
40 Mixed Problems. Weak Form Consider a region (i.e., an open set) R in space, bounded by a hypersurface r = aR. Suppose that f2 is filled with matter, for instance that R is a wall, or a metal bar, and that a certain temperature g is maintained on r, by means of some heating (or cooling) device. We may wish to study the temperatureat the various points of R. Of course this will evolve in time, as the heat percolates from the surface to the interior: It is a function u = u(x, t ) of space and time. It will depend on its initial values, that is, on the temperature distribution at the time (usually taken as the origin) when we started to heat the outer surface r. This distribution of temperature at time t = 0 will be denoted by u,,(x). Note that the temperature maintained on r, g , need not be homogeneous in space, it may vary from point to point; in other words, it may depend on x . In fact, it may also vary with time, and therefore g = g(x, t ) . A very simple argument based on the way heat is transferred, from atom to atom, inside the matter which fills R, shows that the variation of the temperature u in R in space and time obeys (up to a certain degree of approximation) the heat equation
au
(40.1)
-=k
dt
Au,
where k is the conductivity constant, which depends on the nature of the material which fills the region R. Equation (40.1) is valid, provided that this material is homogeneous (i.e., its characteristics, reflected in k , do not depend on x nor, for that matter, on t ) , isotropic (propagation of heat is the same in all directions), and that there is no creation or absorption of heat, for instance due to chemical reactions, inside R. In case the latter phenomenon takes place, Eq. (40.1) should be replaced by the inhomogeneous heat equation au
- = k AU -+f , at
wheref= f ( x , t ) is a function in R describing the creation or the absorption of heat. In dealing with more complicated processes, for instance when 391
3 92
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
[Chap. IV
homogeneity and isotropy are lacking, one may be forced to replace the operator - k A by a more general elliptic (see Definition 23.2) second-order differential operator
and indeed we shall be looking at the analog of (40.2) where A(x, t , 3/dx) has been substituted for -kA. But it is useful to keep in mind the meaning of the problems we shall be studying, in the case of Eq. (40.2) (where, however, possibly after a change of the units, we may assume k = 1). Suppose thus that u represents the temperature in the region R, determined by the initial temperature uo and by the surface temperature g. We shall have (40.4)
au _ -- Au + f ( x , t ) at
(40.5)
u = uo(x) in Q
(40.6)
u = g ( x , t)
on
r
in Q, when t =0, for every t > 0.
One recognizes in (40.6) the kind of boundary condition encountered in the Dirichlet problem. On the other hand, (40.5) is analogous to the initial condition in the Cauchy problem. In view of this double analogy, Problem (40.4)-(40.5)-(40.6)is often called a mixed problem. Let us point out that the boundary condition adjoined to (40.4)-(40.5) need not be of the Dirichlet type. It might very well be of the Neumann type (cf. Sect. 37): Instead of imposing the temperature on the boundary r, we may fix the heat flow (or f l u x ) through I-. This amounts to replacing (40.6) by the new condition (cf. $37.1): (40.7)
au av
- = h(x, t )
on I‘
for every t > 0,
where a/av denotes, as usual, the normal derivative at the point x of r. One may also impose a “hybrid” kind of boundary condition: Maintain the temperature on some part of the boundary while fixing the heat flux on the remaining portion of it. Let us write
r = rov rl,
ron rl = 0 ,
and require (cf. $37.2) (40.8) u = g(x, t ) on To,
au - = h(x, t ) on av
rl
for every f > 0.
Sect. 401
393
MIXED PROBLEMS. WEAK FORM
The heat operator dldt - A is the archetype of the parabolic operators. Mixed problems may, however, be posed for other than parabolic operators -e.g., for hyperbolic ones. If the hyperbolic operator under study happens to be second order, there will be two initial conditions, in accordance with the requirements in the Cauchy problem : We will have to preassign the values of u and of its first derivative with respect to the time t , au/at, at t = 0. Mixed problems for second-order hyperbolic equations (and systems of equations) occur naturally in the theory of sound, for instance to describe the evolution of the air pressure inside a room where noise is produced, or in electromagnetism, e.g., to describe the evolution of the electromagnetic field in some region of space (in the latter example, one would deal with a system of equations, specifically the Maxwell equations), and so on. The simplest examples of mixed problems for hyperbolic equations are of the kind (40.9) (40.10)
u = uo(x),
(40.11)
u =g(x, t )
au
- - u l ( x ) in R
at
at
on
r
t = 0,
for every t > 0.
In physical applications there would be a factor of the form V 2 (where V represents a velocity) in front of the Laplacian A, at the right in (40.9). We have assumed that the units of space and time have been chosen here so as to have V = 1. The Dirichlet boundary conditions (40.11) can be replaced by a different kind of boundary condition, e.g., by conditions of the Neumann kind or by a mixture of Dirichlet and Neumann conditions, exactly like in the parabolic case. We shall focus our attention mainly on mixed problems for parabolic equations, with Dirichlet boundary conditions, of which (40.41440.5)-(40.6) is the prototype. We shall, however, consider the case where - A is replaced by a more general elliptic operator, A , like that given in (40.3). On the other hand, it is convenient to restrict the variation of the time t to some interval [0, TI, T > 0, although in certain instances we will want to let it vary from - 00 or from 0 to + 00. Thus the problem we propose to study is the following one : (40.12)
at
u =f(x, t)
(40.13)
u(x, 0) = uo(x)
(40.14)
u(x, t ) = g(x, t)
in R x 10, T [ , in R, in
x 10, T [ .
394
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
We shall state our hypotheses on the data of this problem in more precise terms. In later sections, we also study some of its variants, e.g., when the boundary condition (40.14) is replaced by (40.7) or (40.8), as well as the hyperbolic mixed problems such as (40.9)-(40.10)-(40.11). We mainly follow the variational approach-i.e., deal with the weak problem, and therefore with the Sobolev spaces H"(Q), &(a),H-"(SZ), . . . (cf. Sects. 22 and 23). First we must state in a precise manner our hypotheses on the differential operator A. It is convenient to rewrite it in the variational form [cf. (23.6)J
We begin with the assumption on the nature of the coefficients: (40.16)
aiJ,bj, c E Lm(Qx 10,TD
for all i , j = 1, . . . , n.
At this stage let us mention the following result:
LEMMA 40.1. If (40.16) holds, A(x, t , alax) defines a boiurded linear operator L2(0,T ; H'(R)) -+L2(0,T ; H-'(G!)). Proof. It suffices to prove the assertion when A is replaced by any one of the terms (a/axi)aii(x,t)(a/axj),b'(x, t ) a/axi, C ( X , t). Since a/axj is a bounded linear operator L2(0,T; H"(i2)) .+L2(0,T ; H"-'(R)), m = ... , - 1,0, 1, ..., it is enough to show that multiplication by an arbitrary element of
Lm(R x 10,TD
defines a bounded linear operator of L2(0,T; Ho(R)) = LZ(R x 30, TD into Q.E.D. itself, which is well known. Now we come to the elripticity assumption about A . It will be stronger than the standard uniform ellipticity as described in (23.13). In particular, if it holds for A it will not hold for - A . Observe that the change from A to - A corresponds, roughly speaking, to a time inversion. But the parabolic problems are not invariant under time inversion-unlike the hyperbolic ones (generally speaking). This has something to do with the difficulty of sending back (to where it came from) heat which has been diffusing into some matter (or, at a higher level of sophistication, with the difficulty of putting HumptyDumpty back together again). At any rate, our condition will be (40.17) (40.18)
for some co > 0 and (almost) every ( x , t ) E R x 10,T [ : col(lZ
5 Re
2 i,j=l
aij(x, t)cirj
for all
(E
C"
Sect. 401
395
MIXED PROBLEMS. WEAK FORM
Let us mention a simple, but important, situation where (40.17) is satisfied. This is when the coefficients aij are real and symmetric, i.e., aij = aii, and the quadratic form a q x , t)Ci i.j= 1
sj
is positive definite, uniformly with respect to (x, t ) . Now we come to the interpretation of the boundary conditions. In accordance with the precedent set in the " weak " Dirichlet problem (Sects. 22 and 23) we suppose that g(x, 1 ) is a trace on the boundary r, depending on t , more precisely that it is a function of t , 0 < t < T , valued in the space of traces Tr'(Q) = H'(Q)/H;(Q). But, just as in the Dirichlet problem, it is convenient to introduce a representative g = g(x, t ) E H'(Q) of g. For each t , or for almost each t , 0 < t < T , the trace of @, i.e., its canonical image in the quotient space Tr'(Q), is equal to g. Condition (40.6) will then be restated as follows : (40.19)
u
- g E HA(Q)
for almost every
t,
0 < t < T.
But in the present discussion the matter of the regularity of the intervening functions, either data or unknown, cannot be waved aside. We must decide what kind of regularity in t to assign to @ or to require from u. From empirical considerations it follows that a " good " function space is the one we now define : Definition 40.1. We denote by
(40.20)
Q,
the space of functions w such that
w E L'(0, T ; H'(R)),
dw - E L2(0,T ; H-'(Q)), dt
equipped with its natural norm
It is seen a t once that Q, is a Hilbert space and that C"([O, TI; H'(Q)) is dense in it. The notation Q, might create some confusion if we are dealing with more than one interval [0, TI or more than an open set R. More generally we ought to introduce spaces @(a,b ; Q). But since the only purpose of making a formal definition of Q, is to simplify the statements and the proofs which follow, we shall leave it at that. Definition 40.2. We denote by Q,, the subspace of Q, consisting of the functions w such that w( ., t ) E HA(Q) for almost every t , 0 < t < T.
Since HA(Q) is closed in H'(R), so is a0in Q,. We shall always equip 9, with the Hilbert space structure induced by that of 9.
396
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
Condition (40.19) now reads (40.22)
u-g'EaO.
Observe that every function w E CD can be extended to the closed interval [0, T] as an absolutely continuous function valued in H-'(R) and, as such, it has a limit at + O in H - ' ( n ) . We could interpret the initial condition (40.13) in that light. But there are advantages in seeking a stronger kind of continuity -continuity not in H-'(ZZ) but in H o ( n )= L2(ZZ).As a matter of fact, this is automatically achieved if we substitute (Do for as we now show. Let us provide the space Co([O,TI ; L2(Q))with the " maximum norm " O
LEMMA 40.2. The natural injection (40.23)
Crn([O, TI ; H;(s1)) + C0([0, TI; L2(Q))
can be extended us a confinuousinjection a0+ Co([O,TI; L2(ZZ)). Proof: Let u E a0; we define a function on the symmetric interval 1- T, T [ by setting ii(t) = u(t) for t > 0, ii(t) = u ( - t ) for t < 0. It is clear that u ~ i i is a continuous injection of a0= aO(O, T )into a0(- T, T). Let now a E Crn(R), a(t) = 0 for t < - T,a(t) = 1 for t > 0. Let u E Cm([O, TI; HA(i2)). We have
SC(0r)S' { l l i i ( A I l ? + l l ~ t ( . J ) l l w ~ -T
5 2C(a)lll 111 (for the mapping u ~ i hasi norm 2). Since the restriction of aii to [O, T] is equal to u, we see at once that the natural injection (40.23) is continuous when the first space carries the norm 111 111 induced by a0[defined in (40.21)], hence has a unique continuous extension to a0[where the space Cm([O,TI ; HA(Q)) Q.E.D. is dense]. Thus the elements of a0can be regarded as continuous functions in the closed interval [0, TI, valued in L2(n).If the same were true of the elements of a, we would have a very convenient interpretation of the initial condition (40.13). Actually, this is the case when ZZ is bounded and has a C' boundary (and lies on one side of it; Exercise 40.4). But in order not to lose generality,
Sect. 401
397
MIXED PROBLEMS. WEAK FORM
we shall not restrict our choice of the open set R. Rather, we are going to limit (very mildly) the choice of the function g entering our boundary condition: We are going to require that g(-, t ) converge in LZ(R)as t -+ +O. Let us state the existence and uniqueness theorem : THEOREM 40.1. Suppose that the coeflcients of A(x, t , a/dx), in the variational form (40.15), aij, b', ci(l 5 i, j 5 n ) all belong to Lm(R x 10, T [ )and that A ( x , t, dlax) satisfies the ellipticity assumption (40.17). Then, to every f € L 2 ( 0 , T ; H-'(R)), every uo EL'(R), and every g E L2(0,T ; H'(R)) such that
3lj € L 2 ( 0 ,T ; H-'(R)), -
(40.25)
at
(40.26) g(., t ) converges in L2(R)to a function g(. , 0 ) as t > 0 goes to zero, there is a unique function u E L2(0, T ; H'(R)) haVing the following properties:
(40.27) (40.28) (40.29)
- + A(x, t , d/dx)u =f L3.4
at
u( -,t ) -
a(.,
t) E
HA(R)
in R x 10, T [ ,
for almost every t , 0 < t < T,
u( -,t ) converges in L2(R) to uo as t -+
+ 0.
Remark 40.1. If u E L'(0, T ; H'(R)) satisfies (40.27), u belongs to @ (Definition 40.1). If, moreover, (40.28) holds, then u - s" E @, and, therefore, by Lemma 40.2, u - s" is a continuous mapping of the closed interval [0, T ] into L2(R). Then, by (40.26), we see that u ( - , t ) must converge in L 2 ( n )as t -, +0, to a limit which we shall denote by u ( * , 0). Condition (40.29) simply means that
(40.30)
u(.,0)=uo.
Remark 40.2. We may apply Theorem 40.1 in the case where f we obtain the following result:
= 0. By virtue of Lemma 40.2
THEOREM 40.2. The mapping u H u( * , 0 )from
m0 into L'(R)
= 0,
is surjective.
Such results as Theorem 40.2 can be generalized to Hilbert spaces (or Banach spaces) other than HA(R), L2(R), H-'(R). They are very simple particular cases of the theory of interpolation (and of the trace method of interpolation; cf. Exercise 25.1). Theorem 40.2 implies that L'(R) = Ho(R) is an interpolation space between HA(R) and H-'(Q).
398
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. Iv
Remark 40.3. The validity of Theorem 40.2 explains why there is no compatibility condition, in Theorem 40.1, linking uo and lj. In view of (40.28)(40.29) one could have expected that uo - g(*, 0) should belong to HA(R). But notice that such a requirement would be meaningless, since uo belongs to L2(Q).More important, the value of l j at t = 0 cannot be relevant to the statement of Theorem 40.1 : Indeed Theorem 40.2 tells us there always is a function h in a0such that h ( . , 0) = l j ( . , 0). The requirement (40.28) on u would not be modified if we substituted l j - h for 8. Remark 40.4. Theorem 40.1 may of course be applied to R = R". Then, the boundary condition (40.28) becomes void, as H'(R") = HA(R"). In this case, Theorem 40.1 asserts that, given any f € L 2 ( 0 ,T; H-'(R")), there is a unique solution u E L2(0, T; H'(R")) of the equation u, + Au =f with arbitrarily preassigned initial value u ( * , 0) in L2(R").This is no longer a mixedproblem-it is a Cauchy problem (see Chapter 11). Remark 40.5. Usually, an existence and uniqueness statement like Theorem 40.1 is complemented by a statement of continuous dependence on the data. In the present situation, this would take the form that the mapping
(ft uo, u, from L2(0, T ; H-'(Q)) x L2(R) x CP into L2(0, T ; H ' ( 0 ) ) is continuous. As a matter of fact, assertions like this one usually follow at once from the existence and uniqueness of the solution, via the closed graph theorem. But in our case, we have the additional condition (40.26) which perturbs the situation somewhat. Thus 4 and u belong to a linear subspace of CP (Definition 40.1) which, at first sight, is not closed. Therefore we cannot apply the closed graph theorem. Nevertheless continuity holds. This follows from an estimate derived at the same time as the existence of the solution [Estimate (41.26)]. Exercises 40.1. Let V be a Hilbert space, V' its antidual, H a third Hilbert space such that we have continuous injections, with dense images, V ci H ci 8'. Furthermore we assume that the injection H 4 V' is the "adjoint" of the injection V 4 H. Let T be any number greater than zero and call @,(O ,T; V) the space of V-valued functions u(t) in [0, T ] such that du - E Lz(O,T ; V'), (40.3 I ) E L2(o,T ; V), dt equipped with the Hilbert norm
(40.32)
MIXED PROBLEMS. WEAK FORM
Sect. 401
399
Prove that Ca([O, TI; V) is dense in cP,(O, T ; V) and that the natural injection of Ca([O, TI; V) into Co([O, TI:H) can be extended as a continuous injection of @,(O, T ; V) into Co([O, TI: H). 40.2. For almost every t , 0 5 t 5 T,let A ( t ) be a bounded linear operator of V into (see Exercise 40.1), such that, whatever v, w E V ,
v'
(40.33)
the function t w (A(t)v,w)- belongs to Lm(O,T).
Prove that v ( t ) w A ( t ) v ( t )defines a bounded linear map of L'(0, T ; V ) into L'(0, T ; V'). (( , )- is the bracket of the antiduality between V and
v'.)
40.3.
(40.34)
Consider an operator A ( x , t , ajdx) = IPI.
(-1)Ip1 (&)pap,q(x, 141 6 m
r)(t)".
rn 2 1 ,
whose coefficients belong to La(R x [0, q).State precisely the hypotheses and the conclusion in the theorem analogous to Theorem 40.1 where A(x, t , d/ax) is chosen according to (40.34) (state with special care what the boundary conditions and the hypotheses on the boundary data should be). 40.4. Let T be a strictly positive number and u(x, t ) a function in R: x 10, T [ such that u E L2(0,T ; a l ( R \ ) ) ,
du - E L2(0,T ; H-'(R!+)). dt
Call ii(x, t) the function in R' x 10, T [ equal to u(x, t ) for x > 0 and to u ( - x , t ) for x < 0. Show that we have
ii E L'(0, T ; H'(R1)),
dii - E L2(0,T ; K 1 ( R 1 ) ) . dt
Indicate how we can derive from this the following result: LEMMA40.3. Let Q be a bounded open subset of R" whose boundary is a C' hypersurface (and which lies on one side of it). Let Qo be an open subset of R" containing Q. There is a continuous linear mapping E~ : H'(C2) -+ Hb(Ro) such that rREn = Identity in H'(R) (rQ is the mapping "restriction to Q"), and furthermore such that E~ extends as a continuous linear mapping of @(O, T ; Q) into @,(O, T ; 0,) (Definitions 40.1 and 40.2). 40.5. Let R be as in Lemma 40.3. Show that the space @ (Definition 40.1) can be naturally, and continuously, injected into Co([O, TI; L'(Q)).
400
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
40.6. Let Q be as in Lemma 40.3 but suppose now that its boundary is a C m + l hypersurface (m 2 1 arbitrary). Consider the space cDm(O, T ; a) of functions u(x, t ) in x 10, T [ such that u E L'(0, T ; H"+'(R)),
du dt
- E L2(0,T ; H"-'(Q)).
Prove that @"(O, T ; S2) can be naturally injected into Co([O,TI; Hm(R))and that the injection is continuous when we equip @"(O, T; Q) with the obvious norm. [Hint: Use Theorem 26.A.3 and Remark 26.A.1.)
41 Energy Inequalities. Proof of Theorem 40.1: Existence and Uniqueness of the Weak Solution
to the Parabolic Mixed Problem For
7
> 0 we define the sesquilinear form on H‘(R) a,(t; u , u ) = 7
x H’(Q):
au ao J2fid x + f J a’j(x, t ) ax^ - dx ax’ -.
i,j=l
0
It is the analog, in the present situation, of the functional (23.7). Our hypotheses (40.16) and (40.17) imply at once (41.1) There is T~ > 0 such that, for aN Otr
collull:
t
>T
5 2 Re a,(t;
~ all , u E H1(R)and
almost all t,
u, u ) .
We recall that 1 1 ~ 1 1 : is the norm of u in H ’ ( f 2 ) .The constant co is the same as in (40.17). In (41.2) we may take u dependent on t . As a matter of fact, we derive from (41.2), for arbitrary u E L2(0,T; H’(R)),
C, T
(41.3)
c,,
~lull:d t
5 2 Re ~ ~ ~ a u, ,( ut ); dt. 401
402
[Chap. IV
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
Now we define two sesquilinear forms on CPo x CP (Definition 40.2): (41.4) (41.5)
WU,
u) =
J
T
{ -
0
W(u, v ) =
/
T
dt,
+ a,(t; u, u)> dt.
( ( u , , 5)
0
+ a,(t; u, 0))
We observe that (for u E Qo) (41.6)
2 Re ~ ~ (U Ju dt, =
Jn
(I u(x, T )1' - I u(x, 0)1)'
dx
(cf. Lemma 40.2). If we combine (41.3) with (41.6), we obtain the following important " energy inequalities " :
LEMMA 41.1. If(40.16) and (40.17) hold, then we have, for all all u E Qo (Definition 40.2)
T
> T~ and
(41.7)
S 2 Re '21z(u,u ) +
n
I u(x, T)I' dx,
(41.8) I2
Re
a:(u, u) +
1I n
u(x, 0)1'
dx.
The proof of Theorem 40.1 is based on the exploitation of these inequalities: (41.7) will lead to the existence of the solution, (41.8) to its uniqueness. In some way the parameter t > 0 must be introduced. This is done by transforming the unknown u. We set U
= e-"(u - g).
It is clear that U E CP, and that U , + (t + A)U
(41.9)
=F
with (41.10)
F=e-rt
(
f--at
Ag)
E
L'(0, T ; H-'(R)).
The initial condition (40.28) now reads (41.1 1)
U ( . , 0)
=
u,
where (41.12)
u, = uo -g(*.
0).
Sect. 411
403
ENERGY INEQUALITIES
Existence of the Solution
I
We shall apply the following generalization (due to J. L. Lions) of the Lax-Milgram lemma (Lemma 23.1):
LEMMA 41.2. Let E be a Hilbert space, b a linear subspace of E , 2 l ( ~h) , a sesquilinear functional on E x f) having the following properties : (41.13) for each fixed h E f), w H ‘rl(w, h) is a continuous linear functional
on E ; (41.14)
there is cb > 0 such that, for every h E l ~ ,
CbIlhll; 2 I%U(h,h)l. Conclusion. There is a bounded linear map G of the antidual E’ of E into E , with norm scb-’, such that, for every continuous antilinear functional on E , (41.1 5)
2l(G1, h)
=
A(h)
for every h E b.
Proof. By (41.13) t o every h E b there is an element Rh of E such that %(w, h) = (w, Rh), for all w E E.By (41.14) we have IlhllE 5 Cb-’IIRhIl,,
VhE f). Thus R defines an injective linear map f) + E (in general, not continuous). Also (41.16) shows that the mapping R h w h, from R f ) equipped with the norm of E, into E, is continuous. It extends by continuity to the closure of Rb in E and, by setting it equal to zero on the orthogonal of Rf), to the whole of E. This extension, which we denote by GT, has a norm 5 cb-’. Its adjoint, which we denote by GI, is a bounded linear operator E + E with the same norm. We have, for all w E E and all h E $, (4 1.1 6)
21(Glw, h)
= (G,w,
Rh),
= (w,
GYRh),
= (w,
h)E.
Now let J denote the canonical isomorphism of E’ onto E. We set G I = G ,J 1 for all 1E E’. Q.E.D. We shall apply Lemma 41.2 with the following choice of E: It will be the space of pairs (0,uo), v E L2(0,T ; HA(R)), uo E L2(R). The norm o n E will be the natural one:
As for
it will be the subspace of E consisting of the pairs (u, uo) such that + +O in L2(R), and furthermore such that
u E m0, uo = limit of u( -,t ) as t
(41.17)
the limit in L2(Q) of u ( * , t ) as t < T tends to T is zero.
404
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
(We have tacitly applied Lemma 40.2.) In our situation, the form rU of Lemma 41.2 will be the form
a, given by (41.4).t
((w,wo>,(h, h o ) ) + + ' W ~A),,
The energy estimate (41.7) shows, in view of (41.17), that the hypothesis (41.14) of Lemma 41.2 is verified; as for (41.13) it is verified in view of the fact that t, c Q0. We may therefore take advantage of the conclusion. The continuous functional A on E will be
where F and Uo are the functions in (41.9) and (41.11), respectively, and ( , ) is the bracket of the duality between H-'(R) and Hh(Q). We reach the conclusion that there is ( V , V,) E E such that (41.19)
N,(K h ) = j ' ( F , h ) d t 0
+
1
Uo(x)h(x,0 ) d x ,
V h E 6.
R
T [ ;H h ( 0 ) ) . In this case, Eq. (41.19) reads
Let us first choose h in C:(]O, (41.20) which implies
v,+ (z + A)V = F
(41.21)
in the sense of distributions in 10, T[with values in H - ' ( 0 ) , or, also, in the sense of distributions in R x 10, T[. Equation (41.21) implies at once that V, E L2(0, T ; H-'(C!)), hence V E 'Do. In particular, V may be viewed as a continuous function in the closed interval [0, TI with values in LZ(Q).If then we return to (41.19), we see by integration by parts that
rUu,(v,h ) = ( V , f i )
I
t=o
+ s T < K + (z + A)K h ) dt 0
and therefore, by (41.21), that (41.22)
( V , h)
I
t=O
= JRV(x,O)h(x, 0) d x =
Uo(x)h(x,0 ) dx.
n
Clearly we may take h E lj such that h(x, 0) is any given element of HA(R). Thus (41.22) implies V(x,0) = Uojx) a.e. in 0. We see that V is a solution of (41.9)-(41.11).
t It is clear that, whereas PI:(w, h ) can be extended to 'Do x L2(0, T; HA@)), %,(w, h) can be extended as a sesquilinear functional on L'(0, T; HA@)) x m0.
Sect. 411
405
ENERGY INEQUALITIES
I1
Uniqueness of the Solution
By subtraction, it is a matter of proving that if W E(Do satisfies (41.23)
w,+ (z + A)W = 0,
(41.24)
W( * , 0) = 0,
we must have W E 0. By (41.5) and (41.23) we have
=/ ( W , + T
2I:(w,W )
0
(z
+ A ) W , W ) dt =o,
and hence our contention by (41.8) [in view of (41.24)].
I11 Estimation of the Solution Let us return briefly to the argument of part 1 (existence of the solution) of the present section, in particular, to our choice of the spaces E, b, of the form a,etc., when we have applied Lemma 41.2. We observe that the norm, in E', of the antilinear functional A, given by (41.18), is
If then we apply the conclusion of Lemma 41.2 and, in particular, the estimate on the norm of the operator G, we find that the norm, in E,of the solution U of (41.9)-(41.11), is S2c;'I\Al\ = CoIIAll. In other words,
s IIW., T
(41.25)
0
T
t)ll;l(n)
dtS
c;(J0 I F ( . , t)ll;-qn)
dt
+ llu*ll:2(*l}.
At this point we return to the original unknown u, and express U in terms of u and 8. We derive from (41.25) the following estimate:
1 Ilu(., t ) T
(41.26)
0
- Q(.,
t)llilcn) d f
5 K"lnIuo(x) - a x , O)IZ d x f
1 Ilf(., T
0
T
+ j0 {IlQ(. , f)ll;f(*)
f
Il8*( Oll& * 9
t)II;-I(n)
df
df).
This is the estimate announced in Remark 40.5. The constant K depends on 0,A ( x , r, a/dx), and also on the number T > 0.
406
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
Exercises 41.1. Let V, H be as in Exercise 40.1, and the operator-valued function A(t) as in Exercise 40.2. Let us assume now that there are two constants loE R, co > 0 such that, for almost every t E [0, TI,
(41.27)
Re(A(t)v, v>-
+ Aollvll~2 coIIvII~,
Vv E V.
Duplicate the argument in Sect. 41 so as to prove the following theorem:
THEOREM 41.1. Under the preceding hypotheses, given any function f ( t ) E L'(0, T; V') and any element uo E H, there is a unique function u E Lz(O,T ;V) such that (41.28)
du - -t A(t)u = f in 10, T [ ; dt
(41.29)
u(0) = uo.
[By the result in Exercise 40.2, we derive du/dt E L'(0, T ; V') from Eq. (41.28) and by the one in Exercise 40.1 we see that u can be regarded as a continuous function, in the closed interval [0, TI,valued in H, from which we obtain the meaning of (41.29).] 41.2. Write down explicitly and check the energy estimates (41.7) and (41.8) in the case where the operator A(x, t , a/ax) in Theorem 40.1 is minus the Laplace operator in the space variables x . 41.3. Let V and A ( t ) be as in Exercise 40.2. Let v E L2(0, T ; V) be a solution of the equation
(41.30)
dv dt
- + A(t)v = 0 in 10, TI.
Show that there is a constant C > 0 such that
(41.31) Apply the method in Exercise 20.2 to prove that if v(0) = 0 [v(t)is an absolutely continuous function in the closed interval 0 5 t I T valued in V'],then necessarily v(t) = 0 for all t < T. 41.4. Let R be a bounded open subset of R" whose boundary r is a C" hypersurface and which lies on one side of it. Let r = To u rl,To n rl = fa and let g(x, t ) be a C' function of t E R', vanishing for t > 0, valued in
407
ENERGY INEQUALITIES
Sect. 411
H2(R). Let u o ( x ) ~ L 2 ( RShow ). how to apply Theorem 41.1 to prove that the mixed problem : (41.32)
u, = Au
in R x 10, T[,
(41.33) u(x, 0) = uo(x) in R, (41.34)
onr,,
u-g=O
a -(u-g)=O av
onr,,
Vt>0,
has one and only one solution UEL*(O,T; V) for a suitable choice of V, Hi(R) c V c H'(R), and under suitable hypotheses on uo and g (state the latter hypotheses precisely). 41.5. Let R and r = aR be as in Exercise 41.4. Let P ( x , t, D) =
1
a,(x, t)DU
1.1 5 2m
be a uniformly strongly elliptic operator in R (see Sect. 36; m 2 1). Make the assumption that a11 partial derivatives with respect to x of order 5 M , of every coefficient a = ,belong to Lm(Q x [0, q).Let u, EL'(R), gj( , t) be a once continuously differentiabIe function of t, 0 t 5 T,with values in H " - j - 1/2 u-1 for e a c h j = 0, . . . , rn - 1. Show how to apply Theorem 41 .I to derive that, under suitable additional conditions linking uo and the g j , which the student must precisely state (cf. Exercise 40.3), the mixed problem
-
(41.35)
du dt
- + P(x, t , DJu
(41.36)
=0
in R x 10, T [ ,
u(x, 0) = uo(x) in R,
(41.37) has a unique solution u E@"(O,
T; R) (see Exercise 40.6).
42 Regularity of the Weak Solution with Respect to the Time Variable We have established the existence and uniqueness of the solution u to the mixed problem (40.27)-(40.28)-(40.29) and we should now examine its regularity, under appropriate regularity assumptions on the data. But if we compare the present situation t o that of boundary value problems, and regard a mixed problem as a boundary value problem, of a peculiar kind, relative to a truncated cylinder in a space R"", namely x 10, T [ , we cannot help but notice the fact that the region in question can never be smooth: It must always have corners at the boundary of the set R lying in the hyperplane t = 0 (unless R is the whole space R",and then there is no boundary at all-but then our problem is a global Cauchy problem!). Indeed, such corners introduce singularities, even in the simplest of situations (see Exercise 44.2). One may then be willing to limit the investigation of the regularity of the solution near the portions of the boundary which are sufficiently smooth, and indeed this can be carried out by methods not unlike those used in the elliptic boundary problems (Sect. 27), and which are, of course, rather complicated. Actually, this comment would also apply to the study of the regularity with respect to the space variables only. Under suitable assumptions, the latter can be derived from analogous results for elliptic equations, of the kind obtained for Iz - A in Sect. 27 (but, needless t o say, more general). When the coefficients of the operator under study are independent of t, one can go rather far in this direction and at not too great a cost, for instance by the continuous semigroups method (Sect. 45) or by the method of the Laplace transform (Sects. 43 and 44). Indications of what can be achieved in this way are given in Exercise 44.4. In this section, we merely discuss the regularity with respect to the time variable t . The reason we are able t o do so is that such a discussion could as well apply to abstract evolution equations (where of course the peculiarities of the domain or of the data are not there t o create complications). However, 408
Sect. 421
REGULARITY OF THE WEAK SOLUTION
409
a discussion of the regularity with respect to the space variables must perforce take these peculiairities into consideration, and would necessarily greatly increase the technicality of the text. At the relatively elementary level where this book would like to remain, the indications contained in Exercise 44.4 should suffice. Thus, we must assume that the data are suitably regular with respect to t : This concerns the coefficients of the differential operator A ( x , t, d/dx), the right-hand side f (x,t), and the function g(x, t ) which determines the boundary value of the solution. It does not concern the open set !2 (in particular, its. boundary = an)nor the initial datum uo-although, as we shall see, stronger regularity properties on u, A g with respect t o t will require more stringent regularity conditions with respect to x, on the part of uo [cf. (42.19)]. If we wish t o get a clear idea of the kind of results to expect, and of the methods t o use in order t o prove them, it is perhaps advisable to study thoroughly the case of3rst-order differentiability. This is what we are now going t o do. Let us make the following assumptions (in addition to those already made in Theorem 40.1): (42.1) The coefficients aij(x, t), b'(x, t ) , c(x, t ) of A(x, t , a/dx) havefirst-order t derivatives (in the distribution sense)-all of which belong to Lm(Q x 10, T[). Let us denote by A , = A,(x, t , a/dx) the differential operator obtained by differentiating with respect t o t all the coefficients of A :
A ,v
= (d/dt)(AV)- Av, .
Note that (42.1) is equivalent to the following: The coefficients aij, bj, c are uniformly Lipschitz continuous functions oft, in [0, TI, with values in L"(Q). Next we introduce the hypotheses that (42.2) f, E L2(0,T ; H - ' ( n ) ) , (42.3) 9, E L2(0,T ; H'(!2)), (42.4)
g,, E L2(0,T;H-'(Q))
(i.e., g, E m),
gl(-,t ) converges in L2(Q) fo afunction gt(., 0 ) as t + +O.
Needless t o say, we are assuming that EL'(O, T ; H'(R)); therefore, under the present hypotheses, g is an absolutely continuous function, in the closed interval [0, TI, with values in H1(d),and the initial value g(., 0) in (40.26) must belong t o H'(!2). We might ask whether, under hypotheses (42.1) to (42.4), we may conclude that the solution u of (40.27)-(40.28)-(40.29) has the following properties : (42.5) (42.6)
u, E a, u t ( * ,t ) converges in L 2 ( ~as) t
-, +O.
410
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
Condition (42.5) implies that u, like #, is absolutely continuous in [0, r ] with values in H'(Q). Therefore, its initial value uo must lie in H'(Q), and not merely in L2(R). Furthermore, condition (40.28) would imply that u - g E Co([O,r ] ; HA(R)), and consequently, we should have uo - i( -,0) E HA(Q). From now on we assume that this is so. Note that, -414, likef, is an absolutely continuous function in [0, TI valued in H -'(sZ). As a consequence of this, the initial value u , ( * , 0) is completely determined by Eq. (40.27): u,( * , 0) = ui
(42.7)
=f( * ,0) - A(x, 0,qax)uo.
By (42.6) we must have ul eL2(Q).Generally speaking, we see that our regularity requirements with respect to t have led to increased regularity with respect to x [and also, of course, with respect to ( x , t)]. If we differentiate (40.27) with respect to t, we obtain (42.8)
u,,
+ A(x, t, a/ax)u, =f,- A,(x, t , a/ax)u.
This suggests that we look at the following mixed problem:
+ A(x, I , d/dx)u =f,- A , u
in R x 10,T [ ,
(42.9)
u,
(42.10)
u ( - , t ) - #,(., t) E H i ( Q )
for almost every t, 0 < r < T,
(42.11)
u ( - , t) converges in L2(Q)t o
ui as t
+ +O.
By Theorem 40.1 we know that there is a unique solution u E CP to this problem. We are going to show that u = u, . This will prove (42.5) and (42.6). In order to prove this, we shall prove that w(*, t ) = 110
+
I
u(*, t') df'
is equal to u. Observe that w E C"([O, TI;H'(Q)) and that w,E a.We have
= Aw
+ f A , ( w - u) dr +J 0
in view of the expression of ul [in (42.7)] and of the fact that, by integration by parts, ,d.u
dt =
j k w , dt = Aw - A(x, 0,i?/ax)uo - Ik,w dt. 0
0
By subtraction from (40.27) we obtain (42.12)
(u - w),
+ A(u - w) =
Sect. 421
REGULARITY OF THE WEAK SOLUTION
41 1
On the other hand,
from which, by (42.lo), w( t ) - g( ., t ) - [uo - g( ., O)] E Hh(Q) a.e. in [O, TI. a ,
By the remark following conditions (42.5)-(42.6) we know that uo - d( * , 0 )E HA(Q). Thus w(.,t ) - J ( * ,t ) E NA(IZ) a.e. in [0, TI, and therefore, by (40.28), (42.13) u ( . ,t ) - w( t ) E Hh(R)
for almost every t, 0 < t < T.
9 ,
Finally, (42.14)
u ( * ,t ) - w(*,t ) converges to 0 in Lz(Q)as t -+ +O.
As a matter of fact, by (42.13), u - w E a0 hence u - w E Co([O, T];L2(Q)) (Lemma 40.2). We therefore have the right to rephrase condition (42.14) in the form : (42.15)
(u - w ) ( * , 0) = 0.
Consider now the following problem : (42.16) (42.17) (42.18)
hE#O,
h, + Ah
=fi E
Lz(O,T;H-'(Q))?
h ( . , 0) = 0.
By Theorem 40.1 we know that h is uniquely determined. Moreover, the estimate (41.26) simplifies considerably in the present situation. It reads, here, (42.19) Let us momentarily fix T and let T' be any number, 0 < T' 5 T. Let f '1 be any element of L2(0,T'; H-'(R)) and let hb denote the solution of (42.16)(42.17)-(42.18) but with T' substituted for T and f 'j for f,. We may extend f 9 to [0, T] by setting the extensionf, t o be zero in IT', TI, and then let h denote the solution of the same problem (but now with T andf,). By the uniqueness of the solution, we must have h = h' in [0, T'[.By (42.19) we obtain
412
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. 1V
This shows that the constant K 2 can be chosen independently of T in any finite interval [0, To]. We choose now h = u - v. Then (42.16)-(42.17)-(42.18) are satisfied with fl
[cf. (42.12)].
= I r A , h dt 0
By virtue of our hypothesis (42.1)on the coefficients of A(x, t , d/dx) and of Lemma 40.1,we have
(42.21)
s
Now let T' be the largest number, 0 T' 5 T,such that h = 0 in [0, TI; clearly, T' exists (a priori, it could be zero). From (42.21)we derive that f l = 0 for 0 5 t S T', hence, by (42.19)and (42.21),
Suppose we had T' < T. We would derive from the preceding inequalities
(42.22) 1 5 K ~ K : ( T- ~')~/2. But by our earlier remark on the fact that K can be taken to be independent of T 5 To (and that the same is evidently true of Kl) we may now make T converge to T' < T; (42.22)is absurd! We have proved that we must have T' = T, i.e., h vanishes identically, that is, u = w in [0, TI,'and therefore v =ur. In conclusion, we have proved the following result, when m = 1 : THEOREM 42.1. In addition to all the hypotheses in Theorem 40.1 we make the following ones :
(42.23) all the derivatives of order s m with respect to t , of the coeflcients aii, bj, c (1 5 i, j s n) of A(x, t, a/dx) belong to Lm(R x 10, T[), (42.24) f " ) E L2(0,T ; H-'(Q)), 0 sj 5 m, (42.25) S c j )E L2(0,T ; H'(R)), gem+') E L2(0,T ; Z--'(R)), 0 2 j 2 m, (42.26) ij('")(-,t ) conuerges in L 2 ( R )as t -P +O. Let then u denote the solution of problem (40.27)-(40.28)-(40.29), whose existence and uniqueness are asserted in Theorem 40.1. Assume that thefollowing is true:
(42.27) (42.28)
dj)(
*,
u(j)(.,0) eL2(R), 0 ) - ~ ( j ) ( 0) E H;(Q)), a,
0 s j 5 m,
o 5j 5 m
- 1,
Sect. 421
41 3
REGULARITY OF THE WEAK SOLUTION
Then the following is true:
(42.29)
d j )
E L2(0,T ; H'(C?)),
0 5 j 5 m,
drn+') E Lz(O, T ; H - ' ( Q ) ) ,
t ) E HA(Q) for almost every t, 0 < t < T, and for (42.30) ucn(.,t ) - $ j ) ( everyj=O, ..., m - 1 , a ,
(42.31)
ti('")(.,
t ) converges in L2(R) as t
+ +O.
The proof of Theorem 42.1 for general m > 0 is straightforward : by induction on m and by applying Theorem 42.1 when m = 1 (which we have proved) to d'"-').We leave the details t o the student. A remark is in order concerning the conditions (42.27) and (42.28): The reader should not get the impression that they bear on the derivatives of the solution u, rather than on the data [the conditions (42.23) to (42.26) obviously bear on the data]. As a matter of fact, (42.27)-(42.28) bear on the initial datum uo and on the derivatives f")(-,0) o f f at t = O . Indeed, from the differential equation (40.27) we derive that u , , and more generally all the u ( j ) , 0 s j 5 rn, are continuous functions in the closed interval [0, T] with values in H - ' ( Q ) [we are tacitly using the hypotheses (42.23) and (42.24)], therefore u"'(., 0) is well defined, for 0 5 j m, as an element of H-'(R). Furthermore, this element can be expressed as a linear function of uo and of thef(r)(-, 0) forj' < j-although expressing it can be cumbersome (especially i f j is very large). For instance,
s
0 ) = - A ( x , 0, a/ax)uo +f(*i O), urr(. 0) = - A ( x , 0, a/ax)u,(. 0 ) - Ar(x, 0 , a/ax)uo +ft(. , 0 ) = A(x, 0 , a/ax)%, - A(x, 0, a/ax)f(* ) 0 ) Ud.7 9
9
+sr(.,
- Ar(X, 0 , a/ax).o 01, and so on. A particular case where the statement simplifies considerably-insofar as the conditions (42.27) and (42.28) are automatically satisfied-is that where
(42.32) (42.33) (42.34)
f ( j ) ( . , ~=o, ) g(j)(.,
0) = 0, uo
O s j s m - I, 0
5j 5 m - I,
=o.
Indeed, in this case, all the d j ) ( ., 0), 0 5 j 5 m, vanish. Note that if we combine (42.24) with (42.32), we see that we can extend f t o ] - 03, T [ by setting f = 0 for t < 0, and that, thus extended,
(42.35) f c n , L 2 ( - co,T ; H - l ( Q ) ) , 0 5 j 5 m ;
f
=0
for t < 0,
similarly,
(42.36) gcn E L 2 ( - co,T ; H1(Q)), 0 z j 5 m ;
g?j
=0
for t < 0.
414
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
We obtain the following
COROLLARY 42.1. Suppose, in addition to the hypotheses in Theorem 40.1 and to the hypotheses (42.23) to (42.26), that (42.32)-(42.3+02.34) hold. Then we have u(j'(*,0) = 0 for every j = 0, . .. , m.
In other words, if we extend the solution u by zero to 1-co, 01, we see that (42.37) u("+')
u(j) e L 2 ( - co,T ; H'(Q)), 0 5j E L2(- 03, T ; H-'(C?)), u =0
m; for t < 0.
Exercises 42.1. Let V, H be as in Exercise 40. I and let A be a bounded linear operator V + V' such that
(42.38)
Re(Av, v>-
+ Aollvll~2 c o l l ~ l l ~ , Vv E V,
for suitable constants L o , co ref. (41.27)]. Let u(t) E L2(0, T ; V) satisfy (42.39)
U,
+ AU = 0,
0
~ ( 0=) u0 E H.
Prove that we have
drn+l) E Lye, T; V'),
(42.40) uCi)E L2(0,T ; V) (0 5 j 5 m);
if and only if A h o EH,
(42.41)
.., m.
j=O,.
42.2. Use the same hypotheses (and notation) as in Exercise 41.1. Let K be the canonical linear isometry of V onto V'. We may regard K as a positive operator, generally unbounded, in H. Let us assume that we have
T ;L(V; V')),
(42.42)
A")
(42.43)
f ( j ) E L2(0, T; V'),
E L"(0,
0
0 5j 5 m ; j
m.
Let u ( t ) € L 2 ( 0 ,T; V) be the solution of (41.28)-(41.29) (see Exercise 41.1 and Theorem 41.1). Prove that (42.42H42.43) implies that u is a C" function, in the closed interval [0, TI, valued in V'. Show that the following properties off and uo , (42.44)
K"-'-jf"'(O)
.
E H, j = 0, . . , m
- 1;
K"uo E H,
impIy that (42.45)
~"'(0)E V, j = 0, . ..,m - 1;
~("'(0)E H.
Sect. 421
415
REGULARITY OF THE WEAK SOLUTION
42.3. Using the result of the preceding exercise and adapting the reasoning in Sect. 42, prove the following:
THEOREM 42.2. In addition to the hypotheses in Theorem 41.1, we suppose that (42.42), (42.43), and (42.44) hold. Then the solution u ( t ) of (41.28)-(41.29)has the following property: (42.46) u(j) c L 2 ( 0 ,T;V), j
= 0,
.. ., m ;
u("'+')E L'(0,
T;V').
Derive from (42.46)that uis a Cm-' (resp. a Cm)function in the closed interval [0, TI,valued in V (resp. in H). 42.4. Apply Theorem 42.2 in the case where V = H'(R"), H = L2(R"), and - A ( t ) = A , the Laplace operator in the space variables (i.e., in R"). Prove that, in this case, we have (42.47)
~"'(0) E H2('"-j)(Rn),
Can anything be asserted about u('"+l)(O)?
j = 0,... ,rn.
43 The Laplace Transform Treatment of mixed problems such as the ones we have been studying in the preceding sections is easier when the coefficients of the differential operator A(x, t, a/dx) do not depend on the time variable t . In this case, two fairly general methods, in addition to the abstract method of Sects. 40 and 41, are available to us: the Laplace transform method and the theory of semigroups. These methods lead to rather precise results, and have a wide range of application, beyond the particular situation in which we are going to use them. We shall begin with the Laplace transform. We shall briefly recall its definition, its main properties-with a twist in the direction of functions, and distributions!, valued in a Banach space E (where the norm will be denoted by
II
Ile).
We deal with functions and distributions defined in the real line R. Traditionally, the Laplace transform of a function f (t) is defined (formally) by
+
where u iz is a complex variable (often denoted by p ) . That there must be some relation between the Fourier and the Laplace transform is evident. We know that the former is closely linked with the underlying additive group structure of R-therefore, the latter should partake of this link. But the positive half-line R, = 10, co[is not an additive group, and the first thing we do is to extend the integration to the whole real line-after extending f by zero to ] - co,O[. The formal definition (43.1) can now be rewritten
+
+W
(43.2)
e-i"(e-u'f(t))d t ,
9 j ( a f iz) = -03
where, we repeat,f(t) = 0 for t < 0. Formula (43.2) underlines the fact that 9'f (u + it) is the Fourier rrunsform of e-"ff(t), where the real number a plays the role of a parameter. Let us agree to go on writing r for the variable on R', t for the one in the dual of R1, that is, on the Fourier transform side. Then, 416
Sect. 431
THE LAPLACE TRANSFORM
still formally, we shall set for any distribution T on the real line, T 1- a,0[, i.e., T E9 ; , (43.3)
Y T ( a + iz)
41 7 =0
in
= %(e-"'T).
Of course, the right-hand side of (43.3) will make no sense-unless e-"T is Fourier-transformable,'' which is to say, tempered. Suppose for a moment that Tis a continuous function5 If for some go real, e-"Offis tempered, then, because of the fact thatf(t) = 0 for t < 0, e-'ff, 0 > oo, is not only tempered: In fact, it decays at infinity faster than some exponential eFEf.For such 0 the integral at the right in (43.2) makes sense and we have the right to differentiate with respect to r~ and to z under the integral sign. If we apply the differential operator 8/80 i to that right-hand side, we obtain zero, which means that Yf(a iz) is a holomorphic function of + iz in the " vertical" half-plane 0 > o,,.The same observation applies to distributions: "
+ a/& +
PROPOSITION 43.1. Suppose that T E 9; and that e-"O' T is tempered (co real). Then Y T ( a + iz) = %(e-"'T) is a holomorphic function of 0 + iz in the half-plane 0 > no. The preceding statement remains valid if the distribution T is valued in the Banach space E. The only thing which must be clarified is the meaning of "tempered." If we denote, as usual, by Y the space of C" functions on the real line whose derivatives of all order decay at infinity faster than any power of ItI-', the space Y ( E ) of tempered E-valued distributions is, by definition, the space of continuous linear mappings of 9 ' into E (these mappings are regarded as E-valued distributions by restricting them from 9' to the dense subspace C:). One can then prove that a disiribution E, deJined on the real line, and ualued in E, is tempered if and only if it can be written m
T = C j=O
(3;
where the fj are continuous functions R + E with the property that, for some integer k 2 0, the nonnegative functions
are bounded on R (both m and k depend on T ) . Thus the properties of tempered scalar distributions extend without modification to tempered distributions valued in Banach spaces. We shall say that T E 9 \ ( E ) is Laplace-transformable if e-""'T is tempered for some real number B,, .
418
[Chap. IV
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
The inversion formula for the Laplace transform follows at once from the inversion (or reciprocity) formula for the Fourier transform and from (43.3):
(43.4)
T = 9-'(eU'8T(o
+ it)).
The reader should keep in mind that %-' transforms distributions in the variable 5 into distributions in t ; CJ continues to play the role of a parameter. In a formal sense (which could be made rigorous), (43.4) can be rewritten as
that is,
(43.5) where y is a vertical line (p; R e p = CJ > oo}.Formula (43.5) is classical. When dealing with distributions, it should be suitably interpreted. The advantages of the Laplace transform are akin to those of the Fourier transform : They lead to a symbolic calculus which, more precisely, means that they transform convolution into multiplication (when these make sense). It is well known that W+is a commutative convolution algebra. Note that iff and g are locally integrable functions, both vanishing for t 0, their convolution can be defined, regardless of their rates of growth at infinity. Indeed,
-=
+m
(43.6)
(f* g)(t) = J- af(t - S M S )
= S'f@- s)g(s) ds. 0
The same circumstances makes the definition work for two distributions S, T vanishing for t < 0. We note that if cp E C:,
(F * c pw = (T, d s + 0 ) 9
is a C" function of t , vanishing identically for t > to [assuming that cp(t) = 0 for t > t o ] . Since S = 0 when t < 0, the intersection of the supports of S and F * cp is compact, and we may form
<s,F- * C P )
= ( S ,9
(r, d s + o > > , 3
which, by the Fubini theorem for distributions, is equal to
(T*, WS,cp(s +
o>>= s * cp). (7-9
By definition, S * T is the distribution cp H(S, T * cp). This extends (43.6) at once. The commutativity of the convolution in 9'+is evident; there is a unit element, 6. Furthermore, 9'+ is an integral domain (this follows at once from Theorem 43.2, the theorem of supports; for a proof, see Exercises 43.4 t o 43.6).
Sect. 431
419
THE LAPLACE TRANSFORM
PROPOSITION 43.2. Let S , T E 9: . Suppose that both e-uo'S and e-aotTare tempered. Then, in the halfplane 0 > oo, (43.7)
9 ( S * r ) = (YS)(IpT).
Proof. If 0 > no, not only are C - ~ ' S e, - T tempered: They are rapidly decaying at infinity. Their convolution is therefore well defined and rapidly decaying at infinity. We have
(43.8)
e-"'(S * T ) = (e-.'S)
* (eVufT),
hence 9 ( S * 7') is a holomorphic function of (T -t- iz in the half-plane 0 > go. If we apply the Fourier transformation to both sides of (43.8) and use the fact that, under the present circumstances, F ( U * V ) = F U F V , we immediately obtain (43.7). Q.E.D. When dealing with E-valued distributions, the convolution makes no sense, even when the distributions vanish for t < 0, unless E is a Banach algebra (cf., Example 39.2). We need some kind of pairing (or coupling) between the value spaces. On this subject we refer to 539.4. The pairing which, from our viewpoint, will be the most valuable is the one between elements of E and elements of L(E; F), that is, bounded linear operators of E into another Banach space F. As usual, L(E; F) will be equipped with the operator norm, which we denote simply by 1) // (see Example 39.4). When S E 9'+(L(E; F)) and T E 9'+(E), the convolution S * T is defined in the obvious manner: (43.9)
(S * T)(cp) = s
cp E c,m
For general spaces E and F it makes no sense to ask whether such a convolution is commutative: Operators act on vectors, but the latter do not "act" on the former. Nor does it make sense to seek a unit element. When E = F, the situation is a bit more standard. Then 9'+(L(E; E)) is a convolution algebra, never a commutative one unless dim E = 1. But it has a unit element, Z , 6, where 1, is the identity mapping of E and 6 the Dirac measure on the real line (at the origin). Then, for convolution, 9'+(E) is a 9>(L(E; E))-(left)-module. It suffices t o state that formula (43.7) remains valid under the hypotheses of Proposition 43.2, when S i s valued in L(E; F) and Tin E. The reader may wonder why the Laplace transform is now more important to us than the Fourier transform. The reason is that, if we content ourselves with dealing with distributions which vanish for t < 0, then we can apply the Laplace transform to far more distributions than we can the Fourier transform-the Laplace transform makes sense for distributions which grow exponentially at + 00, whereas only the tempered ones are Fourier-transformable. This is quite important if we wish t o apply these transformations to differential equations : For instance, the solutions of an ordinary differential
420
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
equation on the real line are linear combinations of exponential monomials of the form tkeZt,which, after multiplication by the Heaviside function H(t) = 0 for t < 0, H ( t ) = 1 for t > 0, are always Laplace-transformable, whereas they are not tempered, unless Re 5 S 0. The Paley-Wiener-Schwartz theorem, which characterizes the Fourier transforms of the distributions with compact support, has an analog in the theory of the Laplace transformation. The analog characterizes the Laplace transforms of the distributions with support in [0, + 031 : THEOREM 43.1. Let h(p) denote a holomorphic function in the half-plane Re p > g o , valued in the Banach space E. The two foZlowing conditions are equivalent:
(43.10) there is a distribution T E 9;(E) whose Laplace transform is equal to h@);
+
(43.11) there is a1 real, a. 6 a1 < 00, a constant C > 0, and an integer k 2 0 such that, for all complex numbers p , Re p > al, (43.12) Proof. We shall consider only scalar functions and distributions ; the proof is identical when they are valued in E. Let us first prove that (43.10) implies (43.11). In (43.10) it is tacitly understood that T = T is Laplace-transformable. Let therefore ob be a real number such that T, exp( - ab t ) is tempered, which means that
c (d/dt)'fi, k
e-"o"Ti =
j=O
where the f j are continuous functions on the real line, with the property that, for some integer M 2 0,
(43.13)
Ifj(t)I
forall ~ E R ,j = O ,..., k .
S M ( 1 + Itl)M
Let now al > sup(o,, ob) and take a
c exp[
= Re p
2 al.We have
k
e-urTi =
- (o - ob)t](d/df)yj
j=o
where g j = exp[ - (a - ab)t]&. Setting of (43.13), we see that \gj(t)I 5 M(1
E = a1 -
+ I tl)Me-er,
tlt E
a& and availing ourselves
R, 0 5 j 5 k.
Sect. 431
42 1
THE LAPLACE TRANSFORM
Consequently, the Fourier transforms of 7. We obtain that
oj of the gj are bounded Cmfunctions
from which (43.12) follows at once. Next we prove that (43.11) implies (43.10). We may assume that aI is greater than zero. We write, as usual, p = a + iz, and we consider, for fixed a > al,the function w(p) = ~ - ~ - ~ h ( Itp )is. an integrable function of T on the real line. Let us set 1 f(t)=211
1
ecu+i')rw(a
-m
1 + ir) d r = 2ni
eP'w(p)d p , yo
where y,, denotes the (oriented) vertical line Re p = a. In view of the decay of w(a + i ~ )as r + f 0 0 [which follows from (43.12)] we may apply the Cauchy integral theorem and conclude, as expected, that the integral definingf(2) is independent of CT > 0,.As a matter of fact, we have (43.14) -m
where C, is a positive constant, independent of a > al.Take t < 0 fixed and make a go to -t00 : The right-hand side goes to zero, hence the left-hand side, which is independent of a, must be zero. This shows thatf(t) = 0 for t < 0. The inequality (43.14) also proves that e-"'fis rapidly decaying at + 00 for any a > a'. We set
T = (d/dt)k "f.
(43.15)
+
It is clear that T E 9'+and that it is Laplace-transformable. By our definition off; the Laplace transform of T is identical to h.
Exercises 43.1. Let T be an arbitrary number greater than zero, u a distribution belonging to 9; . Prove that there is a continuous functionfin the real line R', vanishing for t < 0, and an integer rn 2 0, depending on u, such that (43.16) 43.2.
u = (d/dt)"'f in ] - co, T [ .
Prove that any distribution u E 9; can be written as a series tm
(43.17)
422
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
where the& are continuous functions in the whole real line, the mi form a nondecreasing sequence of integers 20, and for each j = 0, I , . . . ,f j ( t )= 0 unlessj - 1 S t < j + 1 . 43.3. Suppose that the distribution u E W+is Laplace-transformable, i.e., e-% is tempered for some real number 6.Prove that there is a continuous function f in R', f ( t ) = 0 for t < 0, also Laplace-transformable, and an integer m such that u = (d/dr)"fin the whole real line. 43.4. Derive from Theorem 43.1 that a distribution u E B'+, which is Laplace-transformable, vanishes in the open half-line t < to if and only if there are constants C , k > 0, and u1 E R such that (43.18) IYu(p)I 5 C(1
+ IPI k,
P E C, Re P > o l .
exp[-tdRep)l,
Derive from this that, given any distribution w E 23: , if w * w = 0 for t < T, we must necessarily have w = 0 for t < T/2. 43.5. Call a the largest real number having the following property: (43.19) (43.20)
Given any T > 0, any two distributions u, u E 9'+, u*u =O
t
for
Prove that 0 5 a 5 1. Show that, if u (43.2 1)
(tu) * u
* (tu) = 0
for
t < aT.
* u = 0 for t < T, we also have
+ u * (to) = 0,
for t < T.
Derive from (43.19) and (43.21) that then (43.22) (43.23)
(tu) * (tu) =
o
for t < a2T,
[u * ( t u ) ] * [u * ( t u ) ] = 0
for
r < (1 + a ' ) ~ .
[Hint: Convolve (43.20) with u * (tu).] By applying the conclusion of Exercise 43.4 in conjunction with (43.23), derive that a = 1. 43.6. From the conclusion in Exercise 43.5, conclude that if u, u E 9'+ are such that u * u = 0 if t < T, then necessarily
(43.24)
u * (ePru)= 0
for all t < T, p E C.
Show that this implies, when u and u are continuous functions, that (43.25)
u(t
- s)u(s)
=0
for all s E R, t < T,
and that if there is a sequence of numbers sj > 0, converging to 0, such that u(sj) # 0, then necessarily u(t) = 0 for t < T. From this, either by regularization or by the conclusion of Exercise 43.2, derive the theorem of supports in one variable:
Sect. 431
THE LAPLACE TRANSFORM
423
THEOREM 43.2, Let u, v be distributions belonging to 9 ; . Assume that the support of v contains the origin. Then, if the convolution u * u vanishes f o r t < T, u itself must vanish f o r t < T(T > 0). 43.7. I n what follows, H ( t ) denotes Heaviside's function, equal t o one for t > 0 and to zero for t < 0, and T(a) = jof e-'ta-' dt (Re a > 0) is the
Euler gamma function. Compute the Laplace transforms of the following distributions : (i) h c j ) , j = 0,1, . . . (S is the Dirac measure); (ii) H(t)t"/T(a+ l), Re a > - 1 ; (iii) H ( t ) cos I , H ( t ) sin t ; (iv) H ( t ) e r f t a / r ( + a l), 5 E C, Re a > - 1. 43.8. Let f ( p ) , g ( p ) be two polynomials in one variable, with complex coefficients, leading coefficient one, and no common factors. Prove that there is a unique distribution U E 9'+whose Laplace transform is equal to the rational function f ( p ) / g ( p )and compute it. 43.9. Show that if a is a real number 20, the function e-a'ip-l'z is the Laplace transform of [H(t)/4;] exp( -a2/4t).
44 Application of the Laplace Transform to the Solution of Parabolic Mixed Problems We return now to the differential operator A(x, t, d/ax) of Sect. 40 and following, but we assume throughout the present section that its coefficients are independent of t. We denote the operator by A(x, djdx) or simply b y A :
with the following assumptions:
(44.1)
ajk
(1 Z j , k 6 n), b'
(1 _Lj5 n), c belong to L"(R);
(44.2) there is a constant co > 0 such that, for almost all x in C? and for all
r
E
C",
n
Re
1
j,k= 1
ajk(x)CjCk2 c O I
(I2-
As in Sect. 41 we associate with A a sesquilinear form on H'(Q) x H1(R), except that we want now to have the parameter T, which was real, replaced by the complex parameter p : a,,(u, u) = p
l uij n
dx
+ f
j,k=l
au a6 ajk(x)-. - d x R axJ a x k
The analog of (41.1) is valid here: (44.3) There is oo real such that, for allp E C, Re p > oo,for all u E H1(R), (44.4)
CoIlull&lcn,
5 2 Re a,(u, 4. 424
Sect. 441
425
APPLICATION OF THE LAPLACE TRANSFORM
From the Lax-Milgram theorem (Lemma 23.1) we derive that p + A is an isomorphism of Hh(R) onto H-'(R) [we recall that if u, v E Hh(Q),
+ N x , aPx)>u, fi>. 1 inverse of ( p + A ) : HA(R) -,H - ' ( Q ) . Of course,
(44.5)
a,@,
0)
= ((p
Let G(p) denote the G(p) is an isomorphism of H-'(R) onto HA(R) [(?(p) is the Green operator of p - A for the weak Dirichlet problem in Q]. From (44.4) we derive (44.6) The operator norm of G(p) is 5 2 / c 0for all complex numbers p such that Re p > g o . We note that ( p + A ) is obviously a holomorphic function of p , Re p > g o , valued in L(HA(R);H - ' ( Q ) ) . We shall apply the following result:
LEMMA 44.1. Let 0 be an open subset of the complex plane, T(p) a holomorphic function of p in 0 valued in L(E; F ) (E, F are two Banach spaces) such that, for each p E 0,T ( p ) is an isomorphism of E onto F. Then T(p)-' is a holomorphic function of p in 0 , valued in L(F; E). Proof. We begin by showing that T - ' ( p ) is a continuous function of p in 0 with values in L(F; E). Let po denote an arbitrary point in 0.Since T(p,) is an isomorphism, there is a constant Co > 0 such that llellE iCoIIT(po)ellF
Therefore, for p
E
for all e E E.
0 sufficiently close to po ,
llellE
COllT(p)ellF
s cOllT(p)e//F
+ cOIIT(p) + 311ellE.
- T@O)lI
llellE
This shows that the operator norm of T(p)-' is locally bounded in 0. But T-'(P) - T-YPo)
= T-'(Po){T(Po) -
T(P)v-'(P),
which shows that, for p close enough to p o , IIT-YP) - T-'(Po)Jl
s 2C;IITtPo) - maI.
Thus T - ' ( p ) is a continuous function of p in 0.We have T(p)T-'(p) = IF, the identity mapping of F. We apply the Cauchy-Riemann operator a/dp to both sides, and obtain T(p)(d/ap)T-'(p)= 0, from which [after multiplication by T-'(p) on the left], (d/dp)T-'(p) = 0, which is what we wanted. Q.E.D.
We are now in a position to apply Theorem 43.1 : G(p) is a hoIomorphic function of p in the half-plane Re p > o,, , valued in the Banach space
426
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
[Chap. IV
L(H-'(R); HA(R)) whose operator norm is bounded independently of p. Hence it is the Laplace transform of a distribution G, E 9'+(L(H-'(Q);Hi(R)), which satisfies the following two equations: (44.7)
( d / a t + A(x, dldx))G, = 1 6, ,
(44.8)
G,(d/dr
+ A(x, d p x ) ) = 1 6 , ,
where 1 denotes the identity mapping of H - ' ( Q ) in (44.7) and the one of Hh(R) in (44.8) [6, denotes the Dirac measure in the r variable, and 1 6 , is the operator-valued distribution which to any test function cp assigns the value cp(O)l]. Equations (44.7) and (44.8) are the Laplace transforms of the inversion equations ( p + A)&) = I , G(p)(p + A ) = I . We consider the mixed problem au
- + A(x, d/dx)u =f in R x 10, 7".
(44.9)
at
u - g E HA(S1) almost everywhere in [0, TI,
(44.10)
4.0) = UO
(44.11)
3
under the assumptions of the previous sections:
f ~ L ' ( 0 T; , H-'(n)),
(44.12)
E L2(0,T ; H'(R)),
(44.13)
uo EL'(Q),
9 dt EL'(O, T ; H-'(Q)),
(44.14) ust > O g o e s t o O , g ( . , t ) conuergesinL'(l2)
[tog(.,O)]
Of course, we know by Theorem 40.1 that there is a unique solution u endowed with good properties. But the question here is how to express this solution in terms of the data, by means of the operator G, . In order to do this, we begin by substituting U = u - g for u and by setting (44.15)
F
=f -
(Z +
- A(x, d/dx)g]
[EL'(O,T ; H-'(Q))].
Our problem is now transformed into the following one:
au
(44.16)
at + A(x, a@)u = F in n x 10, T [ ,
(44.17)
U E L'(0, T ; HA(SZ)),
(44.18)
u(.,t ) converges to U , in ~'(n) as t -+ + O .
We know solution U exists, is unique, and belongs to C,([O, T],Lz(sZ)) (Lemma 40.2). We wish to express U in terms of F and Uoby means of GI.
Sect. 441
APPLICATION OF THE LAPLACE TRANSFORM
427
First of all, we extend F t o - co < t < + co by setting it equal to zero outside of [0, TI. We may now regard F as an element of g;(H-'(R)), with support contained in the closed interval [0, TI. Next we define V = U in [0, TI,V = 0 for t < 0 and t > T . We have, by the standard formula for distribution derivatives of continuous functions with jumps [here the functions are valued in H-'(R)],
where (dUjdt) denotes the element of L2(- co, + co;H - ' ( f t ) ) ,equal to dUjdt in [0, r ] and to zero outside of [0, r ] ; 6, (resp. 6,) is the Dirac measure at t = 0 (resp. at t = T). Finally we obtain (44.19)
dV dt
+A
V
=F +
UoSo - U ( . , T)6, in R x R.
This yields at once (44.20)
V
=G
* F + GUo - ( G U ( . , T)) * 6,
(in R x R),
where the convolutions are performed with respect to the variable t while G acts as an operator on the values of F, U , , U(*, T ) . These values belong to H-'(R), and the result of the action of G is to map them into HA(R). Observe that the support of (GLI(-,T ) )* 6, = G t - , U ( - , T ) is contained in [T,+a[. Consequently, if we restrict all functions and distributions to the open interval 10, T [ ,we get V = GU, + G * F, that is, (44.21)
U
= GU,
+G* F
in R x 10, T[.
This is the natural generalization of the standard formula for ordinary differential equations [see (11.21)]. In practice, in order to express, or to compute, the solution u of (44.9)-(44.10H44.1 l), it is best to use (44.21). But if one wants to have a general formula for u, in terms of uo , g, andf, it is possible to derive one from (44.21)-provided that one proceeds with care. Let us write go instead of g(. , 0). From (44.21) we draw
Let us apply once again the formula for the distribution derivative of a continuous function having a "finite" jump at the origin. We obtain
428
[Chap. IV
MIXED PROBLEMS AND EVOLUTION EQUATIONS
whence (44.21')
where
It would be a mistake to apply formula (44.8) and conclude from it that M * g vanishes identically. Indeed, the two sides in (44.8) operate on HA(R). But the values of g do not lie in the latter space, outside of the trivial case where we could have taken @ = 0 to begin with. What can be seen right away is that, if gl is another function, valued in H'(R), like g, and furthermore such that 0 - dl E HA(R) for almost every t > 0, then necessarily
M * J =M*J,. This means that M * acts on functions g of t (as a convolution with respect to t ) valued in the quotient space H1(R)/Ht(Q),that is, the space of traces of functions belonging to H'(R) on the boundary aR of R (Sect. 26). Under suitable regularity assumptions on Iz and on A(x, a/&), M * g is a convolution operator on t , acting as an operator from H'/'(dR) to ~ ' ( n ) : (44.21") ( M * g)(x, t ) =
1'1 M ( x , y ; o an
x
t - s)g(y, s) do,, ds,
E R,
t > 0,
where do,, is the area measure on an, and g(y, t ) is a suitably regular function of t > 0, valued in H1/'(8Q).Of course, M * g is then the solution of the mixed problem at
a2
+ A (x, -
u = 0 in R x 10, T [ ,
u(x,o)=o, ~ E R ; u ( y ~ ) = ~ ( y , t yeasz, ), t>o. We shall now discuss some examples. They concern the heat operator L = - a- A , at
A=
(a ) ' + ... + ( a)' ax
ax"
.
Example 44.1 The Fundamental Solution of the Heat Equation ( R = R") The probIem under study is the Cauchy problem (44.22) (44.23)
au
- - AU at
~
L'(0, T ; H-'(R")),
l = u0 ~ EL'(R"), = ~
Sect. 441
APPLICATION OF THE LAPLACE TRANSFORM
429
where u denotes the unique solution belonging to L2(0,T; H'(R")). In this case, since there are no boundary conditions, we have u = U , and therefore [cf. (44.21)] (44.24)
u = Gu,
+ G *f
in R" x 10, T [ .
We wish to compute the operator G explicitly. Its Laplace transform &) is the inverse of p - A : H'(R")+ H-'(R"). By Fourier transformation, we see at once that (44.25) G(p)rp(x) = ( 2 ~ ) /ei"'c(p ~" Here
+ I 5I2)-'i$(5) d5,
p E C,"(R").
4 denotes the Fourier transform of rp :
4(t) = ~e-'"''rp(x) dx. We may apply the inversion formula (43.5) for the Laplace transform
where yo is a vertical line R e p = CT > 0. At first sight, the exchange in the order of integrations might not seem legitimate, since the integrand in the integral with respect to p is not absolutely integrable (over yu). But in fact we are applying the Fubini theorem for distributions and the reader may imagine that there is tacitly a convergence factor of the kind X(Imp), with x E C,"(R1). converging appropriately to 1. Let us compute
The correct way to handle this is t o regard K(5, t)e-'' as the inverse Fourier transform of the function of T, (a -k
iT -k
l5Iz)-'
which is clearly square-integrable over y u . We know that K = 0 for t < 0. In fact (cf. Exercise 43.3, part iv), we know that
(44.26) Observe that
K(5, t ) = H(t) exp(- I 5 1.t).
430
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
in other words, K is the fundamental solution of the Fourier transform of the differential operator a/& - A with respect to x . Since G , cp
= (2n)-" J e i " ' t K ( ( ,t ) @ ( { ) dc,
which shows that K((, t ) is the Fourier transform of G with respect to x, we immediately obtain
M) 4t ,
(44.27)
which is the standard fundamental solution of the heat equation (see Sect. 6 , $6.1).
Temperature Distribution inside a Metal Bar
Example 44.2
This example concerns a mixed problem where the number of space variables is one and the open set R is a finite interval, say ]a, b [ .It is "concretely" visualized as the problem of determining the distribution and variation of the temperature inside a metal bar, or a homogeneous wall, whose thickness is finite (value: L = b - a) but whose other dimensions, height and width, are infinite, and whose two faces (corresponding to x = a and to x = b) are heated according to a given program (represented, for us, by the function a). The precise statement of the problem is
(44.28) (44.29)
u(b, t )
u(a, t ) = go(t)T
(44.30)
U(X,
= gb(t)l
0) = u0(x),
a <x
0 < t < T,
< b.
In agreement with the general case, we assume uo E L2(a, b) and that the derivatives of ga and gb (together with these functions themselves) belong to L2(0,T ) . The choice of the function a is particularly simple, in the present situation: a(x, t, = (b - a)-'{ga(f)(b - x, + gb(t)(X - a)>. We may perform the change of function U
all = -(b F = -at
- a)-"g:(W
=u -
- XI
If we set, in addition to the above,
uo = uo - a ( * , O h
8 and set
+ gL(t)(x - a ) ] .
Sect. 441
APPLICATION OF THE LAPLACE TRANSFORM
we transform the problem (44.28)-(44.29)-(44.30) (44.3 1) (44.32)
au
a2u
at
ax2
U = 0 when x
-F,
into the problem
O
a<x
and when x = b, whenever 0 < t < T,
=a
(44.33)
43 1
U
=
U,
at t = O .
We wish to compute the operator G in this case. Its Laplace transform c ( p ) is the Green function of the operator
in the interval ]a, b [ . We shall always assume R e p 2 0. We have
(44.34)
b
aP)cpW
C(x, x ' ; P)cp(X) dx,
= Ja
cp E C"(I4 bl),
with (see Exercise 29.12) (44.35)
G(x, x ' ; p )
= G,(x - x ' ; p )
+ h(x, x ' ; p ) ,
where (44.36)
G,(y;p)
1
= ---=
+
{ ~ ( y ) e - ' ~ iH(-y)eYJ'),
2JP
and (writing L = b - a) (44.37)
- exp[ - 2 , l ~ ( L - x$)]] The reader will easily verify that G, and h are bounded functions of p in any half-plane Re p 2 c,,> 0. Let us also recall that G,(y; p ) is a fundamental and that h(x, x'; p ) is the solution of the homogeneous solution o f p equation ( p - (d/dx)2}R= 0 such that h(a,x';p)
+ G,(a
- x';p) =h(b,x';p)
+ G,(b
- x';p) = O
432
for any x’, p. As a consequence of this, for all (44.38)
[Chap. IV
MIXED PROBLEMS AND EVOLUTION EQUATIONS
(p
XI,
3
- - G(x, x ’ ; p ) = 6(x
a < x’
< b, p, Rep > 0,
- x’),
G(a, x‘; p ) = G(b, x‘; p ) = 0.
(44.39)
6
Of course, throughout this argument, denotes the branch of the squareroot function which is greater than zero for p real and greater than zero. Observe that e(x, x ’ ; p) can be represented as an infinite series of terms of the form
It can be easily shown (or else found in tables of Laplace transforms) that p - l l Z e - & (a 2 0) is the Laplace transform of H(t)(nt)-”’ exp( - a2/4t) (see Exercise 43.9). If we take this fact into account, we derive from (44.36) that G, is the Laplace transform of the standard fundamental solution of the heat equation [cf. (44.27)] : (44.41)
The expression of the inverse Laplace transform of h(x, x’; p) is more complicated : (44.42)
h(x, x’; t )
=El 2Jnt
j= 1
f
x-x‘
2
]
(exp[ - ( j ~7)
+ exp[ - f (jL -
F)’]
- e x p [ - ;1( j L + T
- exp[ - -1 (j.
+
+
-
6)21
+ a - xs)2])
t
2
In order to derive the expression of the solution u to the problem (44.28)(44.29X44.30) it suffices to apply formula (44.21).
to
The preceding formulas remain valid when a tends to - co and/or b tends + co.For instance, let us consider the case a = 0, b = co. Then,
(44.43)
+
G(x, x’; t ) = G,(x - XI, t ) - G,(x
+ x’, t).
Sect. 441
433
APPLICATION OF THE LAPLACE TRANSFORM
Let us introduce the function (44.44)
~,(x,t ) = ( n t ) - 1 / 2
(x E R, t 2 01.
exp(-y2/4t) dy 0
Application of the representation formula (44.21) leads straightforwardly to the following expression: (44.45)
1 u(x, t ) = 2
I
-x
aE, -(y, t)uo(x - y) dy ay
aE, -1‘(x, at
t - s)g(s) ds
for the solution of the mixed problem: (44.46) (44.47)
au azu -=at
vx>o,
ax2’
u(x, 0) = uo(x), vx
Vt
>o;
u(0, r) =g(t),
> 0;
V t > 0.
Exercises 44.1. Verify that, in the case where uo and g are continuous functions in the function u(x, t) given by (44.45) is a conthe closed half-line [0, a[, tinuous function of (x, t ) for x > 0,t > 0, and that the first term on the right-hand side converges to uo(x) when t -+ +0, whereas the second term converges to g(t) when x -+ +O. 44.2. Consider the mixed problem (44.46)-(44.47) where one takes uo = 0, g = 1. Show that the solution u is not continuous at the “corner” (0,0) in the closed quadrant {(x, t) E R2; x 2 0, t 2 O}. 44.3. Consider the mixed problem (44.46)<44.47) where one takes uo = 0. What are the implications on the boundary function g ( t ) in (44.47), of the hypotheses (44.13) (where R = R:)? Verify that, if (44.13) holds, then, as
+
t+
+o,
44.4 Let m be an integer 2 1, R a bounded open subset of R” with C” boundary (and lying on one side of it). Consider the problem:
(44.48) (44.49)
(44.50)
au
--Au=f at
UI,=~ (u - g)(-,t ) E H;(R)
i n R x 10, +a[, = uo
in R,
for almost every t > 0.
434
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
The following hypotheses are made, concerning the data:
(44.51)
, f ~ L ' ( o , +0 0 ; H"'-'(Q)),
uo E H"-'(Q),
By using Laplace transformation with respect to t , and applying Lemma 40.3 (Exercise 40.6),prove that the solution u of (44.48)-(44.49)-(44.50) has the following regularity properties with respect to the variables x :
(44.53)
24 E L'(0,
+
CQ;
du - E LZ(0, + C Q ; dt
H"(Q)),
H"-2(Q)),
and consequently,
+
(44.54)
u E CO([O, a[; Hm-yQ)).
44.5. Let V c H be two Hilbert spaces. The injection is continuous and has dense image; its "antitranspose" is a continuous injection of H into the antidual V' of V. Consider a positiue isomorphism A of V onto V' (thus (Av, V}- 2 c o ~ l v ~for l ~ every v E V; co > 0). Show that one can define the positive square root A 'I2 of A and that it is an isomorphism of V onto H. Apply the Laplace transformation t o the abstract Cauchy problem
d2u -+Au=f,
(44.55)
dt2
(44.56) t=O
vt>0;
dt t = o
= U, E
H,
assuming also that f(t) is a continuous function o f t >= 0 valued in H. Derive from the formula for the Laplace transforms that in a sense to be made precise, we may write,
(44.57) u(r)
= cos(tA'/2)uo
+ sin(tA1I2)A-'l2u I + /Isin(s~1/2)~-1/2f(t- s) ds,
for t 2 0. Compare with (13.10). 44.6. Let Q be an open subset of R". Apply the results of Exercise 44.5to the case where V = HA(R), H = L2(Q),and A is defined by a strongly elliptic differential operator on 0,A(x, a/ax). We recall that the sesquilinear form on HA(Q) x HA(R) associated with A ( x , d/dx) must be Hermitian and coercive
Sect. 441
APPLICATION OF THE LAPLACE TRANSFORM
435
(Definition 23.1). Consider then the mixed problem, with Dirichlet-type boundary data : (44.58)
d2U
-+ A(x, a/dx)u =f -
(44.59) (44.60)
( T > 0),
in R x 10, T [
at2
= u1
u l r _ o- U o v
u ( . , t) - g(., t ) E &(R)
in R,
for a.e. t E 10, T [ .
Make the following assumptions about the data: (44.61)
f~Co([O, TI; Ho(Q)),
uo E Hh(d),
s" E cqo, TI; H * - j ( n ) ) ,
(44.62)
u1 E Ho(R),
j = 0,1, 2.
Prove that u
(44.63)
aG
= -UO
at
+ Gu, + G * f + M * g,
where G is the inverse Laplace transform of ( p z + A)-' [regarded, say, as an isomorphism of L2(Q) onto Hh(R) n HZ(R)under suitable assumptions on 01, and (44.64) (the convolutions must be understood with respect to the time t). 44.7. Take R to be a bounded interval ]a, b[ (with length L > 0) in the real line and A = -d2/dx2. Show that the distribution denoted by G in Exercise 44.6 is given, in the present case, by (44.65) G(x, x'; t )
= +{K(x- x', t )
- K(x
+ K(x' - X, t ) - K(x + x', t + 2a)
+ x', t - 2b)},
where (44.66) K(y, t )
= H(y)H(t - y )
+
+m
H(t - y - 2jL) j= 1
[H( .) is Heaviside's function].
45 Rudiments of Continuous Semigroup Theory We continue to deal with the case where the differential operator A = A(x, a/dx) is independent of the time variable t . Until now we have regarded A as a bounded linear operator Hh(S2) + K’(t-2); we have exploited the fact that, for T > 0 large enough, T A is an isomorphism of the former space onto the latter. But in availing ourselves of the continuity, or boundedness, of A, we have had to pay a price. This has been already alluded to in the preceding section: If A had been a bounded linear operator of a Hilbert space H into itself, we would have been allowed to use the exponential function e - r Ain the solution of the initial value problem:
+
(45.1)
u,+Au=f;
O
[cf. (44.21) and following remarks]. We are prevented from following this path by the fact that A is an operator from one Hilbert space into a different one. In this case, its exponential makes no sense. But we could regard A as an unbounded linear operator in one and the same space, H = Lz(12),and try to form the exponential e-‘”. It is not unreasonable to expect that the latter will turn out to be a bounded linear mapping of H into itself and even, in some instances, a linear automorphism of H. This approach would have several advantages. For one, we would operate within L2(Q) and make better use of our information, in the case where f~ Lz(O, T; Ho(S2))--and not merelyfeL’(0, T; H-’(Q)). More importantly, we might be able to handle initial value problems for differential operators A which do not fulfill the strong ellipticity condition (44.2). That such expectations are not excessive is shown in the following two examples (both are Cauchy problems). 436
Sect. 451
CONTINUOUS SEMIGROUP THEORY
Example 45.1
437
Cauchy Problem for the Heat Equation
This is the same as Example 44.1. We shall use the conclusions reached there. In this case, --A = A, the Laplace operator in R". There is only one possible choice, here, for the operator e-'", and this is the convolution, with respect to x, with the fundamental solution of the heat equation
For t > 0, we have
J
+m
~ ( x r)dx , = 1,
-m
and when t + +0, E(-,f) converges to the Dirac distribution 6(x). It follows from this that convolution with E (with respect to x) defines a bounded linear operator of L2(R") into itself; when t -+ +0, this operator converges to the identity.
Example 45.2
Cauchy Problem for the Schrodinger Equation
We look at problem (45.1)-(45.2) when A = -iA (see Sect. 6, Example 6.2). Note that our ellipticity hypothesis (44.2) is not satisfied here. However, it is easy to solve the problem, iff E L2(0,T; L2(R"))and u EL'(R"), simply by performing a Fourier transformation with respect to x . By the argument used in Sect. 6 (ioc. cit.) we find that e-ra makes sense: It is the convolution in the x variables with the fundamental solution of the Schrodinger operator E(x, t )
= exp
[
-i(n - 2)-
___
:1(2zy
exp
(- g).
In order to see that, for each t , convolution with E ( - , t) defines a bounded linear operator on L2(R"),it is convenient to return to its Fourier transform (in x): (45.3)
E(c, t ) = i ~ ( texp( ) - it I t [ 2,
(cf. Example 6.2) and show that multiplication with ?,*!( t ) defines a bounded linear operator on Lz(R,,). In fact, for t > 0, multiplication with I?(-, t) defines a unitary operator on L2(R,) and therefore, E * is a unitary operator on L ~R") ( . (X)
438
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
[Chap. IV
In both preceding examples, we have (45.4)
[ E ( . , t ) *] 0 [ E ( . ,t ’ ) * ]
(45.5)
E( ., 0) *
=E(.,t
+ t’),
0 < t, t’,
= Identity,
which are properties one should expect of an exponential e - f A .These properties are generalized in the definition of a continuous semigroup?: Definition 45.1. Let E be a Banach space. A mapping t of R, into L(E;E) is called a continuous semigroup fi it has the following threeproperties: (45.6)
YsYr= YS+( (s, t 2 O),
(45.7)
Fo= I, the identity of E,
(45.8)
the mapping t - Y f of R, into L ( E ; E ) is continuous when the latter space carries the topology of pointwise convergence in E.
The latter property, (45.8), means that, given any to E R , and any e E E, t -+ 1 , . The student might ask why we have not adopted a stronger kind of convergence, specifically the convergence in the sense of the operator norm. Had we adopted this kind of convergence, we would have excluded the very important examples 45.1 and 45.2. For instance, when r > 0 converges to zero, it is not true that E(5,t), given by (45.3) as a multiplication operator on L:, converges to the identity in norm: given t > 0, we may choose 15 I = (n/t)”* and thus check that the norm of g(5, t ) - E(5, 0) in LF is equal to 2. As a matter of fact, the only semigroups of operators, continuous for the operator norm, are those of the form exp( - tA) with A bounded (Exercise 45.1). In the applications of interest to us, we always start with a given differential operator A = A(x, d/Jx), which we regard as an unbounded operator in some Banach space E;most of the time E = L2(Q).We wish to express the solution u of the initial value problem (45.1)-(45.2) in terms of the data f, uo by means of the exponential function e - r A .The latter is not well defined and, under favorable circumstances, its role will be played by a continuous semigroup {F,}associated with A . If A were bounded, Ti would be equal to e - f A. Observe that, in such a case, we could recover A from Ff by the formula
Il(F,- Ff,,)ellE+ 0 as
A
=
I
lim - ( I - e - ‘ A ) , t
r++o
t We are relating a semigroup stto the “inverse exponential ” e - l A for the sake of consistency in the remainder of the book where, most of the time, A denotes a strongly elliptic differential operator, and hence defines a positive linear operator. In accordance with this the infinitesimal generator (see Definition 45.2) will be --A, and not A .
Sect. 451
CONTINUOUS SEMIGROUP THEORY
439
where the limit might be taken in the sense of the operator norm. There is an analogous formula when A is unbounded, but the limit must then be taken in the sense of pointwise convergence of operators in E: Definition 45.2. We shall denote by 9 ( A ) the linear subspace of E consisting of those vectors e E E such that h-'(e - T h e ) converges in E as h > 0 goes to zero. For every e E 9 ( A ) we set 1 Ae = lim - (e - F h e ) . h-+o h The linear operator - A : 9 ( A ) + E is called the infinitesimal generator of the semigroup {F,}.t
A priori 9 ( A ) could be thought of as small, even as consisting only of the origin in E. In fact, we are going to see soon that 9 ( A ) is dense in E. Let us observe right now that every T, commutes with A , since F , commutes with h-'(I - Y h ) , h > 0, and thus TI9 ( A ) c 9 ( A ) . We shall derive a certain number of properties of the infinitesimal generator A [among these, the fact that 9 ( A ) is dense] which will enable us to characterize all the linear operators (defined on dense linear subspaces of E ) which generate continuous semigroups. It helps to keep in touch with the case where A is bounded, i.e., where F,is the true exponential e - t A .Note that, in this case, we have (45.9)
d -dtT f + A T , = O ,
(45.10)
t>O,
Fo= I.
We have the right t o form the Laplace transform of F , , since e-IIAlltFIis bounded on R + . More correctly we introduce the Laplace transform R ( p ) of the operator-valued function H(t)F-,. By (45.9) and (45.10) we have (45.11) where 6 is the Dirac distribution. A Laplace transformation yields (45.12)
(P + A)R(P) = I,
and since R ( p ) obviously commutes with A , we see that R(p) = ( P I + A ) - : R(p) is the resolvent of - A [we know that R ( p ) exists, and is a bounded linear operator on E, for ] p ( > llAII]. See footnote to page 438.
440
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
When A is not bounded, although we cannot define e-rA directly, we may define the Laplace transform R(p) of H ( t ) Y , (we assume, until further notice, that -A is the infinitesimal generator of the semigroup TI).This is due to the following lemma. LEMMA 45.1. Let {Yr}be a continuous semigroup on E. There are constants M , B > 0 such that IIYtll 5 MeB'for all t 2 0. As usual, we have denoted the operator norm by
I( 11.
Proof. Let a be any strictly positive number and let Badenote the image of the closed interval [0, a] under the mapping f H TI.By (45.8) Ba is compact for the topology of pointwise convergence on L(E; E), hence [TVS, D&K, Theorem 33.13 it is bounded for the operator norm. Let us set Ma = sup, llFrll(note that Ma 2 1). Given an arbitrary positive number t, let m be the largest integer such that ma 4 t. By (45.6) we have
11YJ = ~~Yr-m,,Ym,,~~ = llY~-mYrll 6 M;+' 5 MaeBma MaeB', where B = (1/a)log M,, , Q.E.D. We now have the right to define, for Re p >B, the constant in Lemma 45.1,
We know that R(p) is a holomorphic function of p , Re p > B, valued in L(E; E). Since every Yr commutes with A so does R(p). As a matter of fact we have
PROPOSITION 45.1. If Re p > B (cf. Lemma 45.1), the range of R(p) is contained in 9 ( A ) and we have (45.13)
+
(PI A)R@) = R@)(pI
+ A ) = I.
Proof. Let e E E, h > 0 be arbitrary. We have
I, e-Pt(Tt+,,t m
h - ' ( Y h - z)R(p)e = 1 2 - 1 1 h
= - (eph- 1)
Y t ) e dt
+m
e-p'Y,e dt
- -1 Jho e - p t Y re dt.
When h + + O the first term converges to pR(p)e and the second one to -.Foe = -e. Q.E.D.
PROPOSITION 45.2. Suppose the complex variablep converges to 00 in a sector 1 Im p I c C Re p (C > 0). Then, for every e E E,pR(p)e converges to e in E.
Sect. 451
CONTINUOUS SEMIGROUP THEORY
Proof. We have (writing p
=6
441
+ i7 with 6,T real) +m
pR(p)e
- e = p joe-"(Y,e
- e) dt
from which +m
IlpR(p)e - e l l s 5 (1 + '2)s
e - r l l ~ , , u e- eIl,dt+o
0
when CT+ +co, by virtue of Lemma 45.1 and by Lebesgue's dominated convergence theorem. Q.E.D. COROLLARY 45.1. 9 ( A ) is dense in E. Proof. Combine Propositions 45.1 and 45.2.
We have shown that A has a dense domain and that R(p) = R@; - A ) = (pl+A)-'forRep>B. By inverse Laplace transformation applied to the equation ( P I + A)R(p) = Z we see that F,satisfies Eq. (45.9) in the distribution sense. Actually, it is not difficult to show that if e E 9 ( A ) , Yre is differentiable at each point t > 0 and therefore (d/dt)(F,e ) = -A Y , e for t > 0 in the classical sense. We are now in a position to characterize all the infinitesimal generators of the continuous semigroups:
THEOREM 45.1. (Hille-Yosida) Let A be a linear operator with domain 9 ( A ) dense in the Banach space E.Suppose that,for some A0 > 0, the resolvent R(A; - A ) = (1Z + A)- of - A exists and is a bounded linear operator on E for all integer values of 1 > 1, . Then thefollowing two conditions are equivalent : (a) - A is the infinitesimal generator of a continuous semigroup {Y,}; (b) there are constants M , B 0 such that, for all k = 1, 2, . . . , and all integers m > sup(A0, B), (45.14) Proof. (1) (a) =.(b). If (a) holds, we have seen that R(1; -A) = R(1) = OD e-"'9, dt for Re 1 > B, the constant in Lemma 45.1. We may differentiate R(1) with respect to 1 under the integral sign. We obtain
5:
+m
R ( k ) ( l )=
jo(- t)ke-Lt'91dt.
442
[Chap. IV
MIXED PROBLEMS AND EVOLUTION EQUATIONS
On the other hand, since R(A) = (U + A ) - ' , we can easily check that R'k'(A) = (- I)%! R(A)k+', whence by Lemma 45.1,
from which (45.14) is derived, noting that pR(p) = (I + p - ' A ) - ' . (2) (b) *(a). Let us set J , = (I + ( I / m ) A ) - ' , m an integer greater than A. . By (45.14) we know that the J , , m > sup(Ao, B), form a bounded set of linear operators on E. If e E 9 ( A ) , e - J , e = m - ' A J m e = m-'J,Ae, hence IIJme- ellE5 (C/m)IIAellE+ 0 as m -, co. Since 9 ( A ) is dense in E, this means that J , e + e, as m + + co, for every e E E. Let us now consider
+
t 2 0.
TI = exp( - tAJ,) = exp(mt(J, - I))= e-'"' exp(mtJ,),
m
Again by (45.14) we have
Observe that m(l - B/m)-' - m = (m/(m - B))B. We obtain
It is evident that all the operators J , , "Ttcommute with A and that they commute among themselves. Furthermore, "TI- "9, = {exp[- t A ( J , - J , ) ] - I) exp(- tAJ,) =
-exp(-tAJ,,)/'A(J.
-.I,>exp[-sA(J,
- J,)J ds
0
=
"TI- s ( J m - J,)A ds. -1: '"Ts
Let then e be an arbitrary element of 9 ( A ) . We have, by (45.19,
ll'"Fl e - "TI ell S M 2 ll(Jm- J,)Aell
I'
eZBsds
0
if m,n > 2B. We have seen that J m A e converges to Ae. It follows that the "TIe form a Cauchy sequence in E and converge to a limit, which we denote by TIe. The convergence is uniform with respect to t in any bounded interval [0, T],T < 00. Once more by (45.15), we see that, when 0 5 t 5 T, the form a bounded set of linear operators, hence (again by [TVS, D&K,
+
Sect. 451
443
CONTINUOUS SEMIGROUP THEORY
Prop. 32.51) we conclude that " ' F i e converges to F i e , for any e E E, uniformly with respect to t on bounded intervals. From this it follows at once that, given any eEE, t b F r e is a continuous function of t in R + ;that FsFr = F s +and i that F o= I, for these properties are true when is substituted for F.In passing, note that we have, by virtue of (45.19, llFtll 2 MeB'
(45.16)
for all t 2 0.
What remains to be shown is that - A is the infinitesimal generator of the semigroup {Ti}. Let -A' be the infinitesimal generator of {Y,}.In view of (45.16) we know that, for Re A sufficiently large, the resolvent R(A; - A ' ) is equal to
I0
I0
+m
+m
e - " F , dt
=
lim m++m
= R(A;
exp( - At) exp( - tAJ,) dt
=
lim
(AZ + AJ,,,)-'
m++m
-A),
where the limits are taken in the sense of the pointwise convergence in E. In other words, for Re I large enough, AI A and AI + A', which are bijections of 9 ( A ) and 9 ( A ' ) , respectively, onto E, have the same inverse, hence are equal, which is only possible if A = A'. The proof of Theorem 45.1 is now complete.
+
We have seen that every continuous semigroup {Yr}on E has an infinitesimal generator - A with dense domain 9 ( A ) , whose resolvent R(A; - A ) exists and is a bounded linear operator for Re A larger than some constant B 2 0, and furthermore has property (b) of Theorem 45.1. Conversely, to any such operator - A corresponds a continuous semigroup {F,}of which - A is the infinitesimal generator. In fact, the semigroup {Ti}is unique. This can be shown in several ways. One way is to recall that the Laplace transform is injective [on the Laplace-transformable distributions belonging to W+(L(E; E))],hence the resolvent R(A; - A ) determines F t (by inverse Laplace transformation). As a matter of fact, the proof of the implication (b) +(a) in Theorem 45.1 has yielded a representation formula from the semigroup Trstarting from the infinitesimal generator - A : it is the expression of F ias the pointwise limit of the '"Ti: (45.17) m++m
(Pointwise means that, given any e E E, F t e is the limit in E of the vectors " F i e ; we recall that the convergence is uniform with respect to t on every finite interval 0 5 t 6 T < + co.)
444
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
Another remark connected with Theorem 45.1 is that the injinitesimal generator of a continuous semigroup {F,},- A , is a closed operator. This means that the graph of A : 9 ( A ) + E,which is a subset of E x E,is closed there, or, equivalently, that if a sequence ej in 9 ( A ) converges in E to an element e and if Aej converges also in E, to an element f, we must have e E O ( A ) and Ae = f. Indeed, for M large enough, ( I + (l/m)A)-' is a continuous linear operator. If we apply it to (ej + (I/m)Aej),which we know converges to(e ( l / m ) f )= g , we see thatej must converge to(Z + ( l / m ) A ) - ' g and since it also converges to e this means that e belongs to the range of ( I + ( l / m ) A ) - ' , hence to 9 ( A ) by Proposition 45.1, and that we have
+
m
This implies at once f = Ae.
Application to Parabolic Mixed Problems We are now going to indicate briefly how one can apply the theory of continuous semigroups to the solution of certain parabolic mixed problems. In order to simplify the exposition, we shall put some restrictions on the problem under study. We shall deal with a bounded open subset R of Rn.We consider a second-order linear partial differential operator in R, of the kind considered in Sect. 44:
(although, under the present circumstances, the fact that A is written in the variational form is of no importance: our hypotheses will be such that we could commute a/axk and aJk(x)in the double sum above). We assume that A satisfiesthe uniform strong ell@ticity assumption (44.2) : (45.18)
Re
ajk(x)[,E 2 cl[lz,
for all x
E
R, 5 E Cn.
j,k= 1
In addition to these hypotheses we shall make all the necessary assumptions on the regularity of the boundary an of R and on the coefficients ajk(x), bj(x), c(x) to ensure the validity of the following property: (45.19)
Vu E H;(Q),
Au E L2(n)
*
u EHZ(n).
When -A is the Laplacian, sufficient conditions for (45.19) to hold are provided in Sect. 27 (see Theorem 27.2) and, as indicated there, the same result extends, by identical methods, to more general differential operators. We
Sect. 451
445
CONTINUOUS SEMIGROUP THEORY
know [cf. (44.3) and following remarks] that there is a real number I. such that, if A > A,, I + A is an isomorphism of H { ( Q ) onto H - ' ( Q ) , which, combined with (45.19), implies that (45.20)
if I > I o , I
+A
Ilt
is an isomorphism (in an obvious sense) of
H;(Q) n H 2 ( Q ) onto L2(Q). Let us denote by ( I + A ) - ' the inverse mapping, regarded, however, as a bounded linear operator ofL2(Q)into itself. Its range will be H{(R) n H2(Q), which we regard as the domain 9 ( A ) of A . Still for I > I , , let us consider an arbitrary function u E HA(Q) n H 2 ( Q ) and
Il(1 + A)ull:
= (1 - ~o)211ull;
+ 2(2 - 20) Re(u, (A0 + A M 0 + II(20 + A)ull:
where ( , ), and 11 (lo denote the inner product and the norm in L2(Q).Again by (44.3) we obtain
+ Il(n0 + A M : s II(L + 4 . 4 1 1 2 , . By (45.20) we have the right to substitute ( I + A)-'f for u, with f arbitrary
(45.21)
(A - 10)211ull;
in L2(Q).We derive from (45.21): (45.22)
(2 - Ao>ll(A
+ A)-'fIlo
5 llfllo
J.
3
> 10,
fEL2(Q>.
By iteration we derive
for all I > I ,
and all f € L 2 ( Q ) .
If we apply Theorem 45.1, we reach the conclusion that - A is the infinitesimal generator of a continuous semigroup {Yt}on L2(Q).This in turn means that we can now solve the mixed problem (45.24)
u,+Au=f,
=uo,
UI r=o
wherefand uo are valued in L5(Q),by the formula (45.25)
~ ( t= ) Y tu0
+ J: 9, -J (s ) ds.
Of course, we must specify our assumptions about the regularity offwith respect to f. These may depend on the situation. But observe that, without additional information, the first term on the right-hand side of (45.25) is merely a continuous function valued in L2(Q)-although, for each fixed t , Yruo belongs to the domain of A , i.e., HA(R) n H 2 ( Q ) . Thus, i f f is integrable with respect to t (say on [0, q),the second term will be absolutely continuous, in other words will be better (in general) than the first one, and the solution u will be continuous.
446
[Chap. IV
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
Exercises (In the following exercises, E will lways denote a 3mplex Banach space, with norm 11 ]Ie; the operator norm will be denoted by 11 11.)
45.1. Let {F,}be a semigroup of operators on E, continuous for the operator norm [this means that (45.6)445.7) hold and that t HY, is a continuous mapping of [0, m[ into the Banach space L(E; E)]. Let R ( p ) be the Laplace transform of H ( t ) I l ,i.e., R(p) = ( p l f A ) - ' , -A being the infinitesimal generator of the semigroup {F,}.Prove that, for p real positive, large enough, R(p) is an invertible map of E onto itself. Derive from this fact that A is bounded and I = , e-rA. 45.2. Let { U , } be a continuous semigroup (Definition 45.1) of unitary operators on E (assumed to be a Hilbert space). Show that onecan define, in a unique manner, U , for t < 0, in such a way that { U , } , E becomes a continuous group of unitary operators on E. Derive from this that the infinitesimal generator of the semigroup { U , } t 2 0 is an antiself-adjoint operator, densely defined, in E. 45.3. Can you give an example of a continuous group of operators {Il} on a Hilbert space E whose infinitesimal generator is not bounded (Exercise 45.1) nor antiself-adjoint (Exercise 45.2)? 45.4. Let H be a Hilbert space, A a densely defined self-adjoint operator in H. Assume that A is positive and has a bounded inverse and consider the abstract Cauchy problem :
+
,
(45.26)
d2u
-
dt2
+ A U =O,
t ER,
=u0€H,
u(
=u,EH.
ull r=O
1=0
Show that there is a continuous unitary group U , in H x H such that the solution u of (45.26) can be linked to the Cauchy data uo, u1 by the formula
(45.27)
and this relation leads to the formula (to which the student is asked to give a precise meaning) : (45.28)
+
u(t) = ~os(A'/~t)u,
Compare with (13.10) and Exercise 44.5.
sin(A1'2t)ul.
Sect. 451
CONTINUOUS SEMIGROUP THEORY
447
45.5. Let {Fr} be a continuous semigroup on E such that there is a constant C > 0 for which the following is true:
(45.29) Ve E E, the function of t 2 0, TIe, can be extended as a continuous function in the sector I Im t I S C Re t of the complex plane, holomorphic in the interior (and valued in E). Prove that there are positive constants M , B such that (45.30)
lIF,Il 6 Me B(Rc I),
Vz E C,
I Im z J 5 C Re z.
Derive from this that the resolvent R(1) of the infinitesimal generator of Y t is a holomorphic function, valued in the Banach spaceL(E; E), in the region R e A + C J I m I )> B .
AEC,
(45.31)
45.6. Use the notation of Exercise 45.5. Prove the converse of the result stated there, namely that if the resolvent R(1) of the infinitesimal generator exists and is a holomorphic function of A, with values in L(E; E), in the region (45.31), then (45.29) holds, possibly for a smaller value of the constant C > 0. 45.7. Let R be a bounded open subset of R",whose boundary is a smooth hypersurface and which lies on one side of it. Let P(x. D ) be a strongly elliptic differential operator in R, whose coefficients all belong to Ca(si>. Show that P(x, D) defines an unbounded linear operator in L*(R), with domain H2"(R) n Ht(R), A , and that the resolvent R(1) of - A is a holomorphic function, valued in L(L2(Q);L2(R)), in a region of the kind (45.31). [The student is permitted to assume the validity for the operator P(x, 0)of the analog of Theorem 27.2.1 45.8. Let A be a densely defined self-adjoint operator in a Hilbert space H; suppose that A is positive and, in fact, A 2 co Zfor some co > 0. Show that the continuous semigroup {Tr) generated by - A satisfies the inequality
(45.32)
IITrIIS const exp( -co t),
V t 2 0.
Show that the inverse of A , A - ' , which is a bounded linear operator on E, verifies +m
A-' =Jo
Frdt.
45.9. Let H and A be as in Exercise 45.8. Let uo be an element of H and f ( t ) a continuous mapping of [O, co[ into H, converging to f m in H as t -+ + co. Prove that the solution u(t) of the problem
+
(45.33)
du dt
+ Au = f ,
Vt > 0;
u(0) = uo,
448
MIXED PROBLEMS AND EVOLUTION EQUATIONS
converges, as t -+
+
CQ,
[Chap. IV
to the solution u, of the equation Au, = f , .
(45.34)
45.10. Let R be a bounded open subset of R”, whose boundary r is a C“ hypersurface and which lies on one side of it. Let g be a complex-valued function on r, belonging to H 3 ” ( r ) , and let u(x, t) be the solution of the mixed problem
au
(45.35) (45.36)
in Q,
-=Axu at
for t > O ;
= uo(x) E L2(Q),
u/ r=O
(45.37)
u =g
on
r.
Apply the result in Exercise 45.9 to show that, when t -+ verges to the solution u,(x) of the Dirichlet problem: (45.38)
Au,=O
inR,
u=g
+ CQ, u(x, t) con-
onr.
Could one have predicted the conclusion from physical considerations, interpreting u, uo , and g as temperatures?
46 Application of Eigenfunction Expansion to Parabolic and to Hyperbolic Mixed Problems As in Sects. 44 and 45, we continue to deal with a differential operator A = A(x, a/&) whose coefficients are L" functions in the open set Q c Rn, independent of the time variable t :
A
= -
c"
a j k
-a
a + c b j ( x ) - a + c(x). j=l ax' n
( X ) T
j , k = l ax1
ax
In this section, however, we shall make use of the eigenvalue expansions described in Sect. 34. We refer throughout to the concepts and notation used in that section. In particular, the operator A will be assumed to beformally se!jkdjoint and strongly elliptic [cf. (34.3) through (34.6)]. We shall also make the hypothesis (34.9), namely that the open set Q is bounded. We recall that we may find a number K such that K + A , viewed as a continuous linear operator HA(Q) - + H - ' ( Q ) [which is the dual of H t ( Q ) ] becomes positive definite and, as a matter of fact, an isomorphism [it suffices to take K > 1,; see (34.7)]. Its inverse G ( K ) ,when composed on the right with the natural injection of HA(Q) into H-'(R), becomes a compact operator on the former space; when composed on the left, it becomes a compact operator on the latter. At any rate, its spectrum is discrete, and consists of a sequence of strictly positive numbers converging to zero. We shall instead look at the eigenvalues of A , I , related to those of G ( K )by the relation (46.1)
1 = x-l
- K,
x E spectrum of G(K).
The eigenvalues of A form the sequence (34.1 1) of real numbers, converging to +a.Thus, except possibly for a finite number of them, they are all strictly positive. 449
MIXED PROBLEMS AND EVOLUTION EQUATIONS
450
[Chap. IV
We shall then make use of the eigenfunctions Ej ( j = 1 , 2 , . . .) of A defined in (34.19). Proposition 34.2 provides us with a new method of solving the mixed problem: au
- + Au =f in R x 10, T[,
(46.2)
at
(46.3)
u ( * , 0) = uo,
where we seek a solution u valued in HA(R) (i.e., assuming zero boundary values; we recall that the situation can always be transformed into this one, by a change of the unknown function). We shall take
f E Lz(O,T ; H-'(Q)),
(46.4)
uo E L2(R),
and seek
(46.5)
u E L2(0,T; HA(Q))
such that u, E Lz(O,T; H-'(Q)).
Let us apply Proposition 34.2 and write tm
f(x,t>=
(46.6)
+m
uO=
Cfj(t)Ej, j= 1
CuojEj, j= 1
+m
(46.7)
U(X, t ) =
1 uj(t)Ej.
j= 1
For the sake of simplicity let us assume in the remainder of this section that all summations with respect to j range from 1 to + 03, without repeating it each time. Since AEj = Aj Ej , Eqs. (46.2)-(46.3) transform into the following sequence of ordinary differential initial value problems :
(46.8),
us
+ LjUj =
fj,
uj(0) = ~0 j
(46.9)j
0
< t < T,
9
whose unique solution is given by
(46.10)
uj(r) = uoj exp( - A j t)
+
s'
0
exp[ - Aj(t - s)]fj(s) ds.
Let us denote by u j ( t ) the integral at the right and set
(46.1I)
v(x, t ) =
c u,(t)E,(x).
We know that
(46.12)
C l u o j I z = IIuOIILz(n) < +a.
Sect. 461
APPLICATION OF EIGENFUNCTION EXPANSION
45 I
On the other hand by virtue of Proposition 34.2
where the constant C depends on T and on the differential operator A . We remind the reader that the Hilbert space structure of HA@) and, by duality, that of H-'(O), is defined by the inner product on HA(O), (46.14)
Nu, u)) = 44 0)
+
&2(*)
where a(u, u) is the sequilinear form associated with A [see (34.16)]. The standard Holder's inequalities for convolution yield
whence, with a suitable constant C, > 0, depending on T and on A like C,
by applying Proposition 34.2 once more. If we combine (46.13) and (46.15), we obtain
Of course, these existence results, and the related estimates, are nothing new t o us: They are particular cases of the theorems, such as Theorem 40.1, obtained by the energy inequalities-or of those obtained by Laplace transformation (Sects. 43 and 44), or by the continuous semigroups theory (Sect. 45). But there are advantages to the eigenfunction expansion approach: In
452
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
many instances, it provides us with rather concrete, and computable, approximations of the solution. It is immediately apparent that it can be extended to equations of other types than parabolic, for instance to an important class of hyperbolic equations, such as the wave equation, and also to equations, like Schrodinger's, which are neither parabolic nor hyperbolic (see Exercises 46.5 and 46.6). We shall discuss here the case of hyperbolic equations, as a transition to the results of Sect. 47. The differential equations under study will now be of order 2 with respect to t , specifically [A having the same meaning as in (46.2)]:
a%
- + Au =f in R x 10, T [ .
(46.17)
at2
As for the initial conditions. they are now (46.18)
au -(*,O)=u,.
u(.,O)=u,,
at
The boundary conditions will once again be implicit in the fact that we seek a solution u with values in HA(R). We shall use the series representations of f, uo , and u in terms of the Ej , as in our treatment in this section of the parabolic problem (46.2)-(46.3). Here we also need a series representation for u1:
u1 = C U l j E j . We are going to prove the following theorem:
THEOREM 46.1. Suppose that the open set R is bounded and that the differential operator A(x, a/ax) in 52 is formally self-adjoint and strongly elliptic [i.e., satisfies (34.2) and (34.6)]. Then, to every set of data, (46.19)
uo E HA(Q),
u1 E Lz(Q),
f~ L'(0, T ;L2(R)),
there is a unique f i c t i o n u such that
which satisfies (46.17X46.18). Pro05 The coefficients uj in the series representation (46.7) must satisfy the second-order differential equation (46.21)j
U;
+
l j
uj
=A,
0 < t < T,
[Cf. (46.8)j1,
Sect. 461
453
APPLICATION OF EIGENFUNCTION EXPANSION
and the initial conditions [cf. (46.9)i] (46.22),
Uj(0) =UOj,
Uj(0) = u , j .
The unique solution of (46.21)j-(46.22)j is given by (46.23)
U j ( t ) = uoj C 0 S ( t J i j )
+ u,j
sin(tJj)
Jij
The uniqueness of the solution u(x, t ) follows at once from Proposition 34.2 and from the uniqueness of the solutions uj of (46.21)j-(46.22)j. The fact that (46.20) holds will now be derived from the expressions (46.23). Incidentally, let us point out that the reader should not be bothered by the fact that the square roots of the numbers i j (not all of which are necessarily positive) appear in (46.23): It is seen at once that the functions of those square roots are actually functions of (J,> - i. We continue to assume that HA(Q) is equipped with the inner product (46.14) and H-l(L2) is equipped with the dual Hilbert space structure; thus we are free to apply Proposition 34.2. We have
JT
We use the fact that, for a suitable constant C > 0 and for all j
=
I , 2, . . . ,
(46.25)
In view of (46.25), we derive from (46.24): (46.26)
114-, t)Il"d(fl) I IIUollHo~(n) + CIIUlIIL2(R)+ q l l l c
1
t)llLZ(*)
dt,
which proves that u E L"(0, T ; HA(Q)). Next we show that u is continuous in the closed interval [0, T] with values in HA(Q). In order to see this it suffices
454
[Chap. IV
MIXED PROBLEMS A N D EVOLUTION EQUATIONS
to derive from the expressions (46.23) an inequality slightly more precise than (46.24), namely
(46.27)
(C ( K + i j ) l u j ( t ) - uj(t’>I21112 5
(c
(K
+ I l j ) ( u o j I Z H ; ( tt,’ ) ) 1 / 2 +
(cIulj12K3(t,
t‘))1/2
where we have used the notation
We have, for a suitable C > 0 and all j = 1, 2, . . . , all
t , t‘ E
[0,T],
I K ~ ( ~ , t ‘ ) ] 5 2C.
I Hj(t, t‘)l 5 2,
If we fixj, Hj(t, t’) and Kj(t, t’) converge to zero as t’ converges to Z. Hence, by the dominated convergence theorem (applied to series), the first two terms on the right-hand side of (46.27) tend to zero as t’ tends to t. In view of the fact that the norm of the integral is not greater than the integral of the norm, we see that the third term is not greater than
,:.Ilf(.,
d I l L ~ ( r l ds, )
and therefore converges to zero when t’ -+ r. Finally, consider the fourth and last term on the right-hand side of (46.27). To every E > 0 there is an integer N , > 0 such that
On the other hand, there is q > 0 such that It - t’ I 5 tj implies
I Kj(t - s, t’ - $1 5 E for every j = 1, . . . , N , , every s, t , t’ E [0, TI.From these two facts it follows easily that the last term also tends t o zero when t’ -+ t . A similar argument applies to the t derivative of u(x, t), u,(x, t) =
C Uj(t)Ej(X).
Sect. 461
APPLICATION OF EIGENFUNCTION EXPANSION
455
It suffices to observe that -
(46.30)
UJ(O =
+ u1 c o s ( t a j )
- Jnj uoj sint(J7)
+ jIh(s)cos((t - S)J&} 0
ds.
We leave the details to the student: One must prove that
(C ] u p )- u J ( t ’ ) ) 2 } 1 ’ 2 converges to zero as
t’
tends to t.
Remark 46.1. If, in addition to the assumptions in Theorem 46.1, we make the hypothesis that f E Co([O, T]; H-‘(R)), we may derive from the conclusion in Theorem 46.1 and from the equation (46.17) that azu/8tz belongs to Co([O,TI;H-’(Q)). Remark 46.2. An essential difference between hyperbolic mixed problems and parabolic ones is that, in the former, time is revertible: By assuming that the right-hand side f in Eq. (46.17) was defined for -T S t 5 0, we could have solved the backward mixed problem, i.e., the same problem as (46.17)(46. i 8) except that (46.17) would have to be verified for (x, t ) E R x ] - T, O[. This is not so in the case of a parabolic mixed problem! Remark 46.3. The reader should compare formula (46.23) with formula (13.10). One may say that the latter is the “global analog” of the former: It is the analog of the former when R = R” (and T = co). Here again, as in Remark 34.1, we see that the series representations in terms of the eigenfunctions EJx) have a strong (and deep!) similarity with the Fourier inversion formula. The analogy would be even more obvious if we had used Fourier series instead of Fourier integrals (i.e., if we had dealt with functions on the torus T“ rather than on R”; cf. Exercise 13.2).
+
Exercises
46.1. Let R be a bounded interval a < x < b in the real line. Solve, by using the eigenfunction expansion described in Example 34.1, the mixed problem (46.31) (46.32) (46.33)
U(X,
0) = u&)
u(a, t ) = s,(t),
E
L’(u,b),
u(b, t> = g*W,
Compare with the results in Example 44.2.
t
> 0.
456
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
46.2. Here we ask. the same question as in Exercise 46.1 but with (46.31) replaced by
(46.34)
aZu
a2u
atz
ax2’
-=-
a<x
t>0,
and (46.32) replaced by
(46.35)
U(X, 0) = u&)
E
au
- (x, 0)
HA(R),
at
= u~(x) E Lz(!2).
Compare with Exercise 44.7. 463. Let !2 be the disk x2 y2 < R2 (R > 0) in the plane. By using the eigenfunction expansion described in Example 34.2, solve the mixed problem
+
(46.36)
au = Au, at
( x , ~ E) R,
(46.37)
t
>0
UI t = O = uo E L2(!2)
(46.38)
u(
- ,t ) E HA(R),
t > 0.
46.4. Here we ask the same question as in Exercise 46.3 but for the wave equation dZU -- Au,
(46.39)
(x, ~ ) E R ,
t
> 0,
at2
and with the initial conditions,
(46.40)
“
uI-o
= u, E Hd(R),
2 at
I
=
111
EL2(R).
t=O
46.5. Let A be the same operator as in Eq. (46.2) but consider now the Schrodinger-like” equation
(46.41)
+ iAu =f at aU
in R x
10, T [
(j =
fi),
with initial condition u ( * ,0) = u, ELZ(S2).
(46.42)
Assume thatfEL’(0, T; H-l(R)) and uo are given as in (46.6). Determine the coefficientsuj(t) in the expansion (46.7) of the solution u of (46.41)-(46.42). Show that the expansion (46.7) does converge in Lz(O, T; H;(Q)). What happens when T + cn?
+
Sect. 461
APPLICATION OF EIGENFUNCTION EXPANSION
457
46.6. This is the same question as in Exercise 46.1 but with the Schrodinger equation (46.43)
au
-=---,
at
a2u ax*
a<x
OstST
substituted for the heat equation (46.31) [also, the side conditions (46.33) are imposed only up to time TI.
47 An Abstract Existence and Uniqueness Theorem for a Class of Hyperbolic Mixed Problems. Energy Inequalities
In Section 46 we obtained a result (Theorem 46.1) on the existence and uniqueness of solutions to certain hyperbolic mixed problems. Actually that result also gave some interesting eigenfunction expansions for the solution. But the class of equations and problems to which it applies is by far too restricted. We must be able to handle problems other than the Dirichlet one: for instance the Neumann problem (§37.1), the “hybrid” problem of 437.2, radiation and oblique-derivative problems such as those considered in 437.5, Dirichlet and Neumann problems for higher order strongly elliptic operators (Sects. 36 and 38), and so on. Generalization to include these problems would be desirable also for the parabolic mixed problems; indications on how it could be carried out have been given in a series of exercises-40.1, 41.1, and 42.1 to 42.3 (for applications, see also Exercises 41.4, 41.5, and 42.4). It is for the sake of generality that in this section we state and prove an abstract theorem, susceptible to applications in a wide range of problems of the kind mentioned above. Nevertheless, i t might be convenient to think of concrete cases, such as mixed problems for a second-order differential operator of the kind A ( x , t , a/&), where
+
satisfies the usual strong ellipticity assumptions (40.17) [and also satisfies the boundedness assumption (40.1 a)]. In contrast to the operators considered 458
Sect. 471
459
HYPERBOLIC MIXED PROBLEMS
in Sect. 47, the ones we deal with here are allowed t o have coefficients which vary with t . The fact that A(x, f , a/ax) is written in the variational form enables us to introduce the usual sesquilinear form, here depending on t , a(t; u, u ) =
j
ajk(x,
j,k=l
fl
2 1b'(x,
au ac dx + axkax] j=1
f)--
fl
au
t ) -U d x ax'
+ Jnc(x, t)uU dx. In the manner described in some detail in Sect. 37 such a form defines a bounded linear operator (now depending on t , 0 2 t 2 T ) from a Hilbert space V into its antidual V'. We recall that the antidual V' of V is the Banach space of continuous antilinear functionals on V (we shall systematically denote by ( , )- the bracket of the antiduality between V and V'). In the case of the differential operator A(x, t , d/Jx) above, V might be Hd(R) if the boundary conditions in the problem under study are the Dirichlet ones, or else some other function space, most often an intermediary one between HA(R) and H'(R) (see Sect. 37). If, instead of A(x, t , a/ax), we were to study a strongly elliptic operator of order 2m (with coefficients depending on time), the space V could be H,"(Q)), or some intermediary space between H,"(R)and H"(R). At any rate, we have a continuous sesquilinear form on V x V, a(t; u, v), depending on t , and for every such t E [0, TI,it defines a bounded linear such that operator A ( t ) : V
+v',
(47.1)
a(t; u, v) = (A(t)u,
v>-,
u, v E v.
But the reader should be careful not to identify this "abstract" operator A ( f ) with the restriction to V of the differential operator A(x, t , ,?/ax):A ( t ) carries information which, in general, A(x, t , a/&) does not. Indeed, the choice itself of V implies certain boundary conditions on the solution, as explained in Sect. 37. In the case where V = Ht(R) we may, in a sense, identify A ( t ) with A(x, t , a/ax), viewed as a bounded linear operator Ht(R) + H-'(R&-but otherwise this is not so. We refer t o Sect. 37 for the more general interpretation. Thus, in the abstract setup, we shall start from the operator A(t) : V + or from the sesquilinear functional a(t; u, v). Some of the limitations encountered in the special situation of Sect. 46 carry over t o the present, more general situation : We shall assume that A(t) is formally self-adjoint, in the sense that
v',
(47.2)
(A(f)u, v)-
= (u,
A(t)v)-,
OStST,
or equivalently, for the same u, v, t , (47.2')
a(t; u, v)
= a(t; v,
u).
U,VEV,
460
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
We shall make the usual coerciveness hypothesis :
(47.3) there are constants co > 0, A. real, such that, for all 1 2 A,, all u E V and all t , 0 S t T,
((1 + A(t))u,u>- 2 ~ o l l ~ l l ' v . We must also specify our requirements on the regularity of A(t), or of a(t; u, v) with respect to t. We are going to assume that
(47.4) for each pair of elements u, v of V , t H a(t; u, v) is a C' function in the closed interval [0, TI. The initial value problem we wish to study is the following:
(47.5) (47.6)
u,,(t)
+ A(t)u(t) = f ( t ) , u(0) = uo
9
0 < t < T,
u m = u1,
where the t subscripts mean differentiation with respect to t. We are going to prove the following result : THEOREM 47.1. Suppose that A(t) is formally self-adjoint, i.e., that (47.2) holh, and that the coerciveness hypothesis (47.3) is satisfied. Suppose furthermore that a(t; u, v) E C'([O, TI) whatever u, v E V ,i.e., that (47.4) holds. Then, to every u0 E V , u1 E H , f E L'(0, T ; H ) , there is a unique function u E L"(0, T ; V )such that u, E Lm(O,T ; H ) and such that (47.5)-(47.6) holds. Furthermore, u is a continuous function in the closed interval [0,TI valued in V, , the space V equipped with its weak topology, and v, is a continuous function in [0,TI valued in H, .? There is a constant C > 0, independent o f f , u,, , ul, and o f t , 0 t _I T, such that, for these t's,
(47.7)
IIu(t>llv+ Ilur(t)llH 5 c(11u011v + IIUIIIH
+ J;IIVOII~dt'J*
Proof. The proof consists of three steps: First, we prove some identities and inequalities of the "energy" type; next we prove the existence of the solution, by the so-called Galerkin method (Example 35.l), using (but only in a finitedimensional context!) the energy estimates established in part I. Finally we prove the uniqueness of the solution, essentially by a modification of the energy estimates. There are many similarities with the proofs of Theorems 40.1 and 41.1 (which apply to the parabolic evolution equations) but there are also marked differences, in particular in the regularity requirements with respect to the variable t , and also in the fact that we must take f valued in H
t As usual, since we regard functions as distributions, this means that u and u, might have
to be modified on a set of measure zero.
Sect. 471
46 1
HYPERBOLIC MIXED PROBLEMS
v’.
and not in Technically, this is due to the fact that, in the derivation of the energy estimates, we are forced to take the inner product o f f ( t ) with u,(t)the latter belongs to H, not to V, and the inner product is therefore t o be computed in H.
I
Energy Inequalities
Let cp be an arbitrary element of C’([O, TI; V) such that \Ir L’(0, T ;H) [as a consequence, cp,, E L’(0, T;V’)].We have
= cptt - A(t)cp E
Let us write, for any v, w in V, a,(t; v, w) = (d/dt)a(t;v, w), and take twice the real part of both sides in (47.8). It yields (47.9) We have used the property (47.2’), i.e., the fact that a(t; v, w) is Hermitian. We integrate both sides in (47.9) from 0 to t . We obtain (47.10)
Ilcp‘(0Il; + 4 t ; cp(Q> cp(t)>
=
11% 1: + 4 0 ; ‘Po > cpo)
+ J‘ { ( W I( P m H + a&’ ;W’),cp(t” 0
dt’.
Let us now avail ourselves of the coercivity property (47.3). We see that there is a constant C > 0, independent of cp (hence of q0, cp,, JI) and o f t such that
Let us introduce the following norm: N ( t ; c p ) = SUP {Ilcp(t‘>ll: Ojf’jt
+ llcp‘~~’~ll:,~”z.
Possibly after increasing the constant C, we derive from the preceding inequality :
462
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. 1V
The classical Gronwall inequality (see Exercise 11.10) enables us to derive from (47.1 I):
in other words, and possibly after some increasing of C,
II Existence of the Solution We are going to apply a simplified version of Galerkin's method (Example 35.1). Theidea ofthis method is simple (see Sect. 35): In view of the separability of V, we may represent V as the closure of the union of a strictly increasing sequence of finite-dimensional linear subspaces VJ . By the density of V in H we see that H is also the closure (in H!) of the union of the V,. By using suitable projections of V and H onto the V, we transform our infinite-dimensional problem into a system of ordinary differential equations in V, (i.e., data and unknown are valued in VJ). The latter yield a unique solution (for suitably chosen initial conditions), u,, which are then shown to converge, in an appropriate manner, to the sought solution u of problem (47.5)-(47.6). In the present situation we shall use a sequence w l , . .., wj , . . . in V which forms a complete orthonormal system in H. Obviously such sequences do exist. The finite-dimensional linear subspace V, will be, in our case, the linear span of wl, ..., w,. We may select complex numbers uiJ such that the partial sums
converge to uo in V. We then define
as the solution of the initial value problem: (47.14) (47.15)
( u h t ( t )+ a ( t ; uJ(t),wj) = (f(t), wj),,, uj(0) = U ' O J ,
( U M O ) = (u1, Wj),,,
1 5-j5 J ,
s
1 5-j J.
This is a system of J ordinary second-order linear differential equations with C' coefficients in [0, 7") Indeed,
Sect. 471
463
HYPERBOLIC MIXED PROBLEMS
and its suffices to apply (47.4). From our choice o f f the right-hand sides of (47.14) are integrable functions on [0, TI; hence the (unique) solutions uj are C' functions and their second derivatives are L' functions in [0, TI.As a consequence, we have the right to apply the energy inequality (47.13) with cp = uJ. We see that there is a positive constant C,, depending only on the operator A ( t ) and on T, such that (47*16)
lluJ(t)l\~ + \l(uJ)t(t)lli
c l ( II'OII:
+
Il'lIIA
+ (/;\lf('')llH
2
dt')
)'
We integrate both sides of (47.16) from 0 to T ; we reach the conclusion that the uJ [resp. the (UJ),] form a bounded sequence in L2(0, T; V) [resp. in L2(0,T ; H)]. One can therefore extract a subsequence (u,J which converges weakly in L2(0, T ; V) to a function u, while the (uJy), converge weakly in L2(0, T ; H), to u,. It is easily seen that the limit u satisfies (47.5)-(47.6) and also, by virtue of (47.16), the energy inequality (47.7). Thus u E L"(0, T ; V), u, E L"(0, T ; H). Observe that it follows from this, from Eq. (47.9, and from our hypotheses on a(?;v, w) and on f, that u,, E L'(0, T; V'). We derive that (47.17)
u, E Co([O,TI;V').
u E Co([O,TI; H),
This presumes that we have possibly modified u and u, on a set of measure zero; in particular, we may assume that they are bounded functions in [0, TI with values in V and H, respectively. Let v' (resp. h) be an arbitrary element of V' (resp. H), to an arbitrary point, {ti} a sequence converging to to in the closed interval [0, TI. We know that u(tj) [resp. u,(tj)],.j = 0, 1, . . . , remains in a bounded subset of V (resp. H), and hence there is a subsequence {j,,} such that
rresP. ( u ( t j v ) ,
u(rjv))-
h)H1
converges. By virtue of (47.17) it can only be to (47.18)
(v',
u(t0)) -
b P . (u,(to>t h)H1.
By an elementary argument this means that ( ~ ' 3
u(tj)>-
[respa ( U d t j ) , h ) ~ l
converges to (47.18). This proves the continuity properties of u and u, when we regard them as valued in V, and H,, respectively.
Ill Uniqueness of the Solution We consider a solution u E L"(0, T; V) of the homogeneous equation (47.19)
u,,
+ A(t)u = 0,
0 < t < T,
464
MIXED PROBLEMS AND EVOLUTION EQUATIONS
[Chap. IV
with u, EL"(O, T;H) and u(0) = u,(O) = 0. Let us set
U(t) = fu(t') dt' 0
and integrate (47.19) from 0 to t'. We get "t'
"I'
We exploit now the fact that U,E L"(0, T;V), U,,= u, E L"(0, T;H) and A(t)U(t)E Co([O,TI;V).We may write (47.20) (U,,(t'), U,(f'))"
+ a(t' ;Uft'),U,(t')) -
r'
a,(t"; U(t"),U,(t')) dt" = 0. 0
We take twice the real part of the left-hand side of (47.20) and integrate from 0 to t . We obtain (47.21) IlU,(t>IIk + a(t; U(0, W)) = j:a,(t'; U(t'), U(t')) dt'
+ 2 Re f a t ( t ' ; U(t'), U(t) - U(t')) dt'. 0
We reason as in part I, using the coercivity hypothesis (47.3). We derive from (47.21), for a suitable constant C > 0,
N ( t ; U ) 2 5 c JLN(t';uy dt' 0
+ $J(t) 0
- u(f')l\;dt' 5 5 C t N ( t ; U ) 2 ,
which implies U(t) = 0 if 0 2 t 2 to = (5C)-'. As usual in such a situation, we may now repeat the argument in the interval [ t o , TI instead of the interval [O, r]. We easily reach the desired conclusion, that U(t), and therefore also u(t), must vanish identically for 0 5 t 5 7'. Q.E.D.
Remark 47.1. It can be shown that the solution u of (47.5)-(47.6), whose existence and uniqueness are stated in Theorem 47.1, enjoys more regularity with respect to t than what is stated in this theorem. It can be shown, in fact, that u (resp. u,) is a continuous function o f t , 0 S t 5 T, valued in V (resp. in H) and not only in V, (resp. in Ho). But the proof of this fact is considerably more involved than the proof of the weaker statement in Theorem 47.1 ; it can be found in [LM, Vol. 1, Chap, 3, $8.41, from which we have adapted the argument in this section.
BIBLIOGRAPHY (References and Further Reading)
On the Subject of Functional Analysis and Distribution Theory
Gelfand, I. M. and Silov, G., “Generalized Functions,” Academic Press, New York, 1964 (English translation; contains a wealth of information on and explicit computation of special functions and distributions, fundamental solutions, Fourier transforms, etc.). [TDI Schwartz, L., “Theorie des Distributions,” Hermann, Paris, 1966 (still the best and most comprehensive exposition of the theory). [TVS, D & K] Treves, F., “Topological Vector Spaces, Distributions and Kernels,” Academic Press, New York, 1967. [Yl Yosida, K., “ Functional Analysis,” Springer-Verlag, Berlin and New York, 1968 (2nd ed.; a basic reference on the subject of functional analysis, containing extensive material and describing most aspects of the subject).
On the Subject of Linear Partial Differential Equations (General)
[LPDO]
Bers, L., John, F. and Schechter, M., “Partial Differential Equations,” Wiley (Interscience), New York, 1964 (covers the basic material from a viewpoint somewhat different from the present book). Ehrenpreis, L., “ Fourier Analysis in Several Complex Variables,” Wiley (Interscience), New York, 1970 (the theory of overdetermined systems of linear PDEs with constant coefficients, and a rather unorthodox view of its ramifications, by one of the architects of the theory). Hormander, L., “ Linear Partial Differential Operators,” SpringerVerlag, Berlin and New York, 1963 (the general theory of linear PDEs as it was during the 1960s; difficult but very rewarding reading). Palamodov, V. P., “Linear Differential Operators with Constant Coefficients,” Springer-Verlag, Berlin and New York, 1970 (English translation; an exhaustive, and somewhat overwhelming, exposition of the theory of overdetermined systems of linear PDEs with constant coefficients-more rigorous but less diverse than Ehrenpreis’ book). Treves, F., “ Linear Partial Differential Equations with Constant Coefficients,” Gordon & Breach, New York, 1967.
On the Subject of Linear Partial Differential Equations of Special Type Agmon, S., “Lectures on Elliptic Boundary Value Problems,” Van Nostrand-Reinhold, Mathematical Studies, Princeton, New Jersey, 1965. 465
466
tF1 [L MI
BIBLIOGRAPHY
Friedman, A., “ Generalized Functions and Partial Differential Equations,” Prentice-Hall, Englewood Cliffs, New Jersey, 1963. Friedman, A., “Partial Differential Equations of Parabolic Type,” Prentice-Hall, Englewood Cliffs, New Jersey, 1964. Friedman, A., “Partial Differential Equations,” Holt, New York, 1969. Lions, J. L., “Equations differentielles opckationelles et problemes aux limites,” Springer-Verlag, Berlin and New York, 1961. Lions, J. L. and Magenes, E., “ Non-homogeneous Boundary Value Problems and Applications,” Springer-Verlag, Berlin and New York, 1972.
On Some of the Special Topics Touched upon in the Book [A1
[F SI [HI
[Wl
Aubin, J. P., “Approximation of Elliptic Boundary Value Problems,” Wiley (Interscience), New York, 1972. Dinkin, E. B. and Yushkevich, A. A., “Markov Processes,” Plenum, New York, 1969 (English translation). Fix, G. and Strang, G., “An Analysis of the Finite Element Method,” Prentice Hall, Englewood Cliffs, New Jersey, 1973. Helms, L. L., “Lntroduction to Potential Theory,” Wiley (Interscience), New York, 1969. Hobson, E. W., “The theory of Spherical Harmonics,” Cambridge Univ. Press, London and New York, 1931. Lavoine, J., “ Calcul symbolique des distributions et des pseudo-fonctions,” CNRS, Paris, 1959. Watson, G. N., “Theory of Bessel Functions,” 2nd ed., Macmillan, New York, 1944. Widder, D. V., “ Laplace Transform,” Princeton Univ. Press, Princeton, New Jersey, 1946.
Bibliographical References
[I1
PI
Seminaire Schwartz, ‘‘ Equations aux dbrivkes partielles,” Institut Henri Poincark, Paris, 1955. Stampacchia, G., “ Equations elliptiques du second ordre a coefficientsdiscontinus,” aminaire Eq. DCr. Part., College de France Paris, 1963164.
Index characteristic (surface), 163 coercive (form), 205,460 compatibility conditions, 9, 39 cone property, 220 conjugate harmonic (functions), 308 connected component of the identity, 2’+ in Lorentz group, 54,131 continuous group of operators, 446 continuous semigroup of operators, 438 curl, 9
A
adjoint of a differential operator, xiv analytic, 22, 24, 382 analytic-hypoelliptic, 22 analytic singular support, 118 antitranspose, 205 antidual, 205,459 averaging operator (or shift), 295
B D
barrier, 272 Bessel equation, 328 Bessel functions, 328, 330 bicharacteristic (curve), 165, 168 bicharacteristic (strip), 168 Brown, Robert, 301 Brownian motion, 301 C
capacitable sets, 292 capacity potential, 291 capacity distribution, 291 capacity, 291,292 Cauchy formula, 38 (inhomogeneous) Cauchy formula, 38 Cauchy-Kovalevska theorem, 146, 148, 165 Cauchy problem, 92, 98, 102, 120, 143 Cauchy-Riemann equation, 6, 34, 308 Cauchy-Riemann equations (in several variables), 10,40 anti-Cauchy-Riemann, 7 causality principle, 114 characteristic (cone), 162 characteristic (covectors), 162 characteristic (curve), 165 characteristic (equation), 168 characteristic (set), 168
determined (systems), 8, 127 difference scheme, 339 Dirac’s equations, 128 Dirichlet integral, 287, 355 Dirichlet problem, 189 weak or variational form, 196 discrete, 295 classical, 268 discrete convergence, 336 discrete Laplacian, 296 divergence, 9 domain (in the plane), 306 doma:? of influence, 112,121, 139 double layer, 80
E elliptic boundary value problem, 369 elliptic PDE, 6 elliptic (differential operator), 163 energy (total), 116 energy (density), 1 18 energy, 116, 117, 287 energy inequalities, 402,461 entire function, 36 error (cut-off), 336 (discrete), 336
467
468
INDEX
escape probability, 301 Euler Gamma function, 74,423 excessive (function), 295,303 exponential type (functions of), 125, 151 external approximation (of a Hilbert space), 333 convergent, 333 stable, 333
F finite difference method, 338 finite difference operator, 338 formal adjoint, xiv, 362 formally self-adjoint, 322 forward Cauchy problem, 123 Fourier inversion formula, xiv, 42 (in I variable), 34 Fourier transform, xiv, 34, 386 Frkhet derivative, 197 Fredholm operator, 369 Fresnel integral, 44 Friedrichs lemma, 236 functions valued in a locally convex space, 105 fundamental solution, 17 right-left, 28
G
Galerkin method (approximation), 333, 462 GArding’s inequality, 348 Glrding’s theorem (on hyperbolic eq), 123 Gaussian (or normal) probability distribution, 301 generalized Green’s formula, 237,377 generalized (solution of the classical Dirichlet problem), 268 Gevrey (classes), 25 gradient, 8 Green’s formula, 79, 362 Green’s function, 271 GronwaIl inequality, 95
H Haar measure, 70 Hamiltonian field, 168
Hamilton-Jacobi equation, 168 harmonic function, 4,295 harmonic measure, 269 harmonic minorant (greatest), 288 Harnack’s inequalities, 84 Hartog’s theorem, 40 heat equation, 5,41, 391 heat flow (lines of), 309, 354 Heaviside’s function, 26,420 Hermitian or self-adjoint form, 207,332,459 Hilbert basis, 318, 325 Hille-Yosida (theorem), 441 Holder continuous, 223 Holmgren’s theorem, 181, 183 holomorphism, 306 Huyghens principle, 59 hybrid problem, 354, 357 hyperbolic first-order systems, 123, 132 hyperbolic partial differential equations, 6, 160,458 hyperharmonic functions, 280 hypoelliptic linear PDEs, I8
I index (of an operator), 369 infinitesimalgenerator (of a semigroup), 439 interpolation (of Hilbert or B-spaces), 327, 397 isothermic (line), 308 isotropic media, 4
J Jacobi elliptic function sn, 313 Jordan curves, 36 K
Klein-Gordon equation, 58
L Laplace equation (operator), 4 Laplace transform, 416 Laplace-transformable (distributions), 417 Lebesgue spine, 276 Legendre equation, 316
469
INDEX
Legendre functions, 316 associated Legendre functions, 319 Legendre polynomials, 317 light-cone, 49,54, 56, 164 Lipschitz continuous function, 223,260 local structure (of a distribution), 385 Lopatinski boundary conditions, 374 Lorentz transformation, 54, 5 5 , 58, 131 lower semicontinuous function, 280
M manifolds, 231 mathematical expectation, 296 maximum principle, 82, 85,265 Mean Value theorem, 81 metaharmonic equation, 85 mixed problems, 392 multi-index, 14 mutual energy, 287
N neighboring points (in a lattice), 294 noncharacteristic (direction), 162 normal (to boundary), 37, 243
0 oblique derivative problem, 353 ordinary differentia1 equations, 10, 26, 89 order (of an operator), 14 orthogonal (group), 69 overdetermined (system), 8
P pairing (of distributions valued in a Banach space), 387 Paley-Wiener theorem, 127, 15 1 parabolic equations, 6, 393 Pauli (matrices), 128 pay-off, 295 Perron's method, 278 Pointwise (convergence), 438 (limit), 443 Poisson formula, 84, 308 Poisson integral, 86
Poisson kernel, 310, 312 (m-)polar set, 216 polar set, 292, 293 potential (Newton's, logarithmic), 281, 282 (global), 282 (Green's), 216 principal part, 135, 161 principal symbol, 162, 347 propagation of singularities, I14
R Radiation problem, 363 Radon measure, xvii, 385 random walk (discrete), 294 (continuous), 299 rapidly decaying at infinity (C" functions), xvi, 385 reciprocity formula (for potentials), 286 recurrent sets, 303 regular (points), 271, 302 Rellich's lemma, 235, 248 resolutive (functions), 269 resolvent (of an operator), 324, 439 Riemann function, 90 Riesz potentials, 67 Riesz representation (for superharmonic functions), 289 (discrete), 303
S scalarly continuous, resp. differentiable (etc.), function, 382 Schrodinger equation, 7,43 Schwarz reflection principle, 39 Seidenberg-Tarski theorem, 123 shift operator, 295 signum, 27 singular support, 49, 59 smallest eigenvalue, 329 Sobolev inequalities, 21 7 Sobolev spaces ( H ' (R")), 107, 215 ( H ' ( G ) ) , 193 (ff"',P(Q)), 210 (on a manifold), 232 spherical harmonics, 3 18 star-shaped, 247 strongly elliptic, 323, 347, 372
470
INDEX
strongly (or strictly) hyperbolic, 125, 132, 160 subharmonic (functions, distribution), 28 1 subsolutions, 263 superharmonic (function, distribution), 263, 28 1 supersolutions, 263 support, xiv. 59 supports (theorem of), 423 symbol, 6
U underdetermined (system), 8 uniformly elliptic, 207 unit sphere (area of the), 74 V
variational form (of boundary value problems), 196, 355 (of an operator), 204, 394
T W tempered distributions (scalar), xvi (valued in a B-space), 386,417 time of first exit, 301 trace (of a function on a boundary), 240, 395 transform of a distribution (under a linear transformation), 53 transition probability (discrete), 294 (continuous), 300 transpose of a differential operator, xiv
wave equation and operator (also d’Aiembertian), 5 well-posed boundary value problem, 358 well-posed Cauchy problem, 120 well-posed Dirichlet problem, 274 Weyl’s lemma, 24, 73
z Zero-one law, 304
Pure and Applied Mathematics A Series of Monographs and Textbooks Editors
Samuel Eilenberg and Hyman Baa93 Columbia University, New York
RECENT TITLES
CARLL. DEVITO.Functional Analysis Formal Groups and Applications MICHIELHAZEWINKEL. SIGURDUR HELCASON. Differential Geometry, Lie Groups, and Symmetric Spaces ROBERTB. BURCKEL. An Introduction to Classical Complex Analysis : VoIume 1 JOSEPH J. ROTMAN. An Introduction to Homological Algebra C. TRUESDELL A N D R. G. MUNCASTER. Fundamentals of Maxwell’s Kinetic Theory of a Simple Monatomic Gas : Treated as a Branch of Rational Mechanics SIMON. Functional Integration and Quantum Physics BARRY GRZEGORZ ROZENBERG A N D ARTOSALOMAA. The Mathematical Theory of L Systems DAVID KINDERLEHRER and GUIDOSTAMPACCHIA. An Introduction to Variational Inequalities and Their Applications H. SEIFERT AND W. THRELFAU. A Textbook of Topology; H. SEIFERT. Topology of 3-Dimensional Fibered Spaces LOUISHALLE ROWEN.Polynominal Identities in Ring Theory DONALD W. KAHN.Introduction to Global Analysis SACHS. Spectra of Graphs DRACOS M. CVETKOVIC, MICHAELDOOB,A N D HORST ROBERT M. YOUNG.An Introduction to Nonharmonic Fourier Series MICHAELC. IRWIN. Smooth Dynamical Systems JOHN B. GARNETT. Bounded Analytic Functions EDUARD PRUCOVEEKI. Quantum Mechanics in Hilbert Space, Second Edition M. SCOTT OSBORNE AND GARTHWARNER. The Theory of Eisenstein Systems K. A. ZHEVLAKOV, A. M. SLIN’KO,I. P. SHESTAKOV, A N D A. I. SHIRSHOV. Translated by HARRY SMITH. Rings That Are Nearly Associative JEAN DIEUWNN~. A Panorama of Pure Mathematics ;Translated by I. Macdonald JOSEPH G. ROSENSTEIN. Linear Orderings AVRAHAM FEINTUCH AND RCHARD SAEKS. System Theory :A Hilbert Space Approach ULF GRENANDER. Mathematical Experiments on the Computer HOWARD OSBORN. Vector Bundles : Volume 1, Foundations and Stiefel-Whitney Classes K. P. S. BHASKARA RAOA N D M. BHASKARA RAO.Theory of Charges RICHARD V. KADISON A N D JOHN R. RINGROSE. Fundamentals of the Theory of Operator Algebras, Volume I EDWARD B. MANOUKIAN. Renormalization BARRETT O’NEILL.Semi-Riemannian Geometry : With Applications to Relativity LARRY C. GROVE. Algebra IN PREPARATION
ROBERT B. BURCKEL. An Introduction to Classical Complex Analysis : Volume 2 RICHARDV. KADISON AND JOHN R. RINGROSE. Fundamentals of the Theory of Operator Algebras, Volume I1
E. J. MCSHANE.Unified Integration A. P. MORSE.A Theory of Sets, Second Edition STEVENROMAN.The Umbra1 Calculus JOHN W. MORGAN AND HYMAN BASS (Eds.). The Smith Conjecture