N.K. Nikol kij (Ed.)
Functional Analysis q’ Linear Functional Analysis
L
.
Springer-Verlag Berlin Heidelberg New York London Paris Tokyo Hong Kong Barcelona Budapest
Encyclopaedia of Mathematical Sciences Volume 19
Editor-in-Chief:
R.V. Gamkrelidze
List of Editors, Authors and Translators Editor-in-Chief R.V. Gamkrelidze, Academy of Sciences of the USSR, Steklov Mathematical Institute, ul. Vavilova 42, 117966 Moscow, Institute for Scientific Information (VINITI), ul. Usievicha 20a, 125219 Moscow, USSR Consulting Editor N.K. Nikol kij, Steklov Mathematical St. Peterburg D-l 11, USSR
Institute,
Fontanka
27,
Author Yu.1. Lyubich, Department of Mathematics, State University Stony Brook, Stony Brook, NY 11794-3651, USA
of New York at
Translator I. Tweddle, Department Tower, 26 Richmond
of Mathematics, University of Strathclyde, Livingstone Street, Glasgow Gl lXH, Scotland
Linear Functional Analysis Yu.1. Lyubich Translated from the Russian by I. Tweddle
Contents Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Chapter 1. Classical Concrete Problems
..........................
0 1. Elementary Analysis ........................................ ........................................ 1.1. Differentiation 1.2. Solution of Non-linear Equations ........................ .................................... 1.3. Extremal Problems 1.4. Linear Functionals and Operators ....................... ........................................... 1.5. Integration 1.6. Differential Equations .................................. $2. The Fourier Method and Related Questions .................... 2.1. The Vibrations of a String .............................. 2.2. Heat Conduction ...................................... 2.3. The Classical Theory of Fourier Series .................... 2.4. General Orthogonal Series .............................. ............................... 2.5. Orthogonal Polynomials ........................... 2.6. The Power Moment Problem 2.7. Jacobian Matrices ..................................... .................... 2.8. The Trigonometric Moment Problem 2.9. The Fourier Integral ................................... ................................ 2.10. The Laplace Transform ........................... 2.11. The Sturm-Liouville Problem 2.12. The Schrodinger Operator on the Semiaxis ................ ............................. 2.13. Almost-Periodic Functions ................................... 0 3. Theory of Approximation ............................. 3.1. Chebyshev Approximations 3.2. Chebyshev and Markov Systems ......................... 3.3. The Chebyshev-Markov Problem ........................
4 6 6 6 7 10 10 11 16 19 19 21 21 27 28 29 34 35 36 39 45 47 51 53 53 56 58
2
Contents
3.4. The L-problem of Moments ............................. 3.5. Interpolation and Quadrature Processes .................. 3.6. Approximation in the Complex Plane .................... 3.7. Quasianalytic Classes .................................. 9 4. Integral Equations ......................................... 4.1. Green Function ...................................... 4.2. Fredholm and Volterra Equations ....................... 4.3. FredholmTheory ..................................... 4.4. Hilbert-Schmidt Theory ................................ 4.5. Equations with Difference Kernels ....................... 4.6. The Riemann-Hilbert Problem .......................... Chapter 2. Foundations
and Methods
60 61 65 69 70 70 74 f) 76’ 77 78 82
............................
85 85 85 88 93 95 97 100 101 107 107 109 111 112 113 115 117 119 119 127 132 138 140 144 147 153 163 165 165 174 178 181
Q1. Infinite-Dimensional Linear Algebra .......................... .................................. 1.1. Bases and Dimension ................. 1.2. Homomorphisms and Linear Functionals 1.3. The Algebraic Theory of the Index ....................... 1.4. Systems of Linear Equations ............................ 1.5. Algebraic Operators ................................... 1.6. General Principles of Summation of Series ................ 1.7. Commutative Algebra ................................. 4 2. Convex Analysis ........................................... 2.1. Convex Sets .......................................... ................................... 2.2. Convex Functionals 2.3. Seminorms and Norms ................................. ............................ 2.4. The Hahn-Banach Theorem ................................ 2.5. Separating Hyperplanes ........................ 2.6. Non-negative Linear Functionals 2.7. Ordered Linear Spaces ................................. 9 3. Linear Topology ........................................... 3.1. Linear Topological Spaces .............................. 3.2. Continuous Linear Functionals .......................... 3.3. Complete Systems and Topological Bases ................. 3.4. Extreme Points of Compact Convex Sets .................. 3.5. Integration of Vector-Functions and Measures ............. ........................................ 3.6. w*-Topologies ..................................... 3.7. Theory of Duality ........................... 3.8. Continuous Homomorphisms 3.9. Linearisation of Mappings .............................. 0 4. Theory of Operators ........................................ 4.1. Compact Operators ................................... 4.2. The Fixed Point Principle .............................. ............... 4.3. Actions and Representations of Semigroups ......... 4.4. The Spectrum and Resolvent of a Linear Operator
.
/ Contents
4.5. One-Parameter Semigroups ............................. 4.6. Conjugation and Closure ............................... ........... 4.7. Spectra and Extensions of Symmetric Operators /... 4.8. Spectral Theory of Selfadjoint Operators ............... .................................... 4.9. Spectral Operators 4.10. Spectral Subspaces .................................... 4.11. Eigenvectors of Conservative and Dissipative Operators ..... 4.12. Spectral Sets and Numerical Ranges ..................... .......................... 4.13. Complete Compact Operators ............................. 4.14. Triangular Decompositions 4.15. Functional Models .................................... 4.16. Indefinite Metric ...................................... ...................................... 4.17. BanachAlgebras Q5. Function Spaces ........................................... ................................ 5.1. Introductory Examples ................................. 5.2. Generalised Functions 5.3. Families of Function Spaces ............................ 5.4. Operators on Function Spaces .......................... Commentary
on the Bibliography
................................
197 206 207 211 215 216 220 222 227 232 242 242 244 252 255 261
Bibliography
.................................................
261
Author Index
.................................................
273
Subject Index
.................................................
277
Up to a certain time the attention of mathematicians was concentrated on the study of individual objects, for example, specific elementary functions or curves defined by special equations. With the creation of the method of Fourier series, which allowed mathematicians to work with rbitrary’ functions, the individual approach was replaced by the lass’ approach, in which a particular function is considered only as an element of some unction space More or less simultaneously the development of geometry and algebra led to the general concept of a linear space, while in analysis the basic forms of convergence for series of functions were identified: uniform, mean square, pointwise and so on. It turns out, moreover, that a specific type of convergence is associated with each linear function space, for example, uniform convergence in the case of the space of continuous functions on a closed interval. It was only comparatively recently that in this connection the general idea of a linear topological space (LTS)’ was formed; here the algebraic structure is compatible with the topological structure in the sense that the basic operations (addition and multiplication by a scalar) are continuous. Included in this scheme are spaces which, historically, had appeared earlier, namely FrCchet spaces (metric with a complete translation invariant metric), Banach spaces (complete normable) and finally the class which is the most special of all but at the same time the most important for applications, Hilbert spaces, whose topology and geometry are defined in a manner which goes back essentially to Euclid - the assignment of a scalar product of vectors. Using contemporary formal language we can say that LTSs form a category in which continuous homomorphisms, or continuous linear operators (we usually apply the last term to homomorphisms of a space into itself), serve as morphisms. Specific classical examples of linear operators are differentiation and integration or, in a more general form, differential and integral operators. As a rule, integral operators are continuous but this cannot be said of differential operators. Thanks to the construction of a sufficiently general theory of linear operators it became possible to include the latter case. The story was repeated at the operator level. At first mathematicians studied individual operators but later on it turned out to be useful and necessary to pass to classes. First and foremost in this connection, multiplication of operators (as a rule non-commutative) went out and algebras of operators appeared. Moreover many natural function spaces are also algebras (commutative, of course: typical multiplication is the usual one, i.e. pointwise, or its Fourier equivalent - convolution on an Abehan group). With regard to topology all these situations are covered by the concept of a topological algebra but with a considerable excess of generality. A satisfactory approach is achieved in the narrower setting of 1 Translator note. I will use LTSs for the plural and write an LTS rather than the correct a LTS for the indefinite form since the former reads more smoothly. Other abbreviations of similar type will be treated in the same way.
< Preface
5
Banach algebras which have proved to be extremely fruitful in harmonic analysis, in representation theory, in approximation theory and so on. The development of functional analysis ran its course under the powerful inlhrence of theoretical and mathematical physics. Here we may mention, for example, spectral theory, which evolved from wave mechanics, the technique of generalised functions (or distributions), whose construction and widespread introduction was preceded by the systematic practice of using the b-function in quantum mechanics, ergodic theory, whose fundamental problems were posed by statistical physics, the investigation of operator algebras in connection with applications to quantum field theory and statistical physics and so on. At the present time the ideas, terminology and methods of functional analysis have penetrated deeply not only into natural science but also into such applied disciplines as numerical mathematics and mathematical economics. In the introductory volume presented below the classical sources of functional analysis are traced, its basic core is described (with a sufficient degree of generality but at the same time with a series of concrete examples and applications) and its principal branches are outlined. An expanded account of a series of specific sections will be given in the subsequent volumes, while certain questions closely connected with linear functional analysis have already been elucidated in previous volumes of the present series. We only touch fragmentarily upon non-linear aspects. The general plan of the volume was discussed with R.V. Gamkrelidze and N.K. Nikol kij and individual topics with A.M. Vershik, E.A. Gorin, M.Yu. Lyubich, A.S. Markus, L.A. Pastur and V.A. Tkachenko. Valuable information on certain questions which are elucidated in the volume was kindly provided to the author by V.M. Borok, Yu.A. Brudnyj, V.M. Kadets, M.I. Kadets, V.Eh. Katsnel on and Yu.1. Lyubarskij. The author offers profound thanks to all the named individuals. He is also most grateful to Dr. Ian Tweddle for his translation of the work into English.
Chapter 1 Classical Concrete Problems $1. Elementary Analysis The overwhelming majority of functional relations encountered in mathematics and its applications are non-linear nthe large’ but linear nthe small For example the linear theory of elasticity is founded on the assumption that stresses depend linearly on deformations when the latter are small. The method of mathematical analysis, which is firmly established in the works of Newton and Leibniz, consists of two basic procedures: local linearisation of the functions being studied (differentiation) and reconstruction of the global relation according to its local structure (integration . Functional analysis was conceived in the womb of classical mathematical analysis but it took shape as an independent discipline only at the end of the 19th and the beginning of the 20th centuries. 1.1. Differentiation. If we speak about classical analysis using modern language, we can say that one of its fundamental objects is a dlfirentiable real or complex function of n real variables, i.e. a mapping f: G --f @ (G a domain in Euclidean space W), having the property that at each point x E G the increment f(x + h) -f(x) as a function of the displacement vector h can be approximated by a linear function to within order o( lhl) as Ihl+ 0:
f(x + 4 -f(x) The uniquely determined
= Mx),~) + 4lW.
(1)
vector g(x) in this situation is the gradient Vf(x), whose
af (1 < k < n). The principal coordinates are the partial derivatives ax of (1) is by definition the dzfirential notation h - dx and
df of tke function
linear part
f at the point x. In classical
df = (VfW, dx) = $I $x,. Thus the function f is locally linearised:
f(x + d4 =fW + kil g dx,. The function f is said to be smooth if its partial derivatives exist everywhere in G and are continuous. Any smooth function is differentiable. If a function has continuous partial derivatives of all forms up to order r inclusive, then we say 1Not only direct integration but also indirect processes such as the solution ofdifferential equations.
.
5 1. Elementary Analysis
I
that it belongs to the class2 C’(G). The class C”(G) is defined as the intersection of all the P(G). The enumerated classes can also be considered on closed domains, for example on the interval [a, b] c R (and even on arbitrary subsets x of W). If the function f belongs to C’(G) (I c co) then at any point x E G we have TaylorS formula f(X + dx) =f(x)
+ mtl i
i,+..TinE,
dxilarnfaxin dxfl 1
xi
+ o(ldxl Y
n
i.e. to within order o(ldxI the function f is a polynomial of rth degree in dx. The inner sum in Taylor formula is the mth differential of the function f at the point x. For a function f E C G) we can formally write down its Taylor series by passing in Taylors’ formula to I = co (and of course rejecting the error term). If the Taylor series of a function f E Cm(G) converges to it in some neighbourhood of the point x E G, then f is said to be analytic at the point x. A function which is analytic at all points of the domain G is said to be analytic on G. A function which is analytic on the whole space is said to be an entire function. All that has been said above carries over immediately to complex functions of n complex variables. In the 18th century the latter situation had already begun to separate out little by little into a distinct branch of mathematical analysis (complex analysis) in view of the fact that complex analytic functions of one or several complex variables have very distinctive properties. In particular, from the definition of an analytic function (complex or real) one easily obtains the uniqueness theorem: an analytic function which is flat at some point (i.e. such that all its derivatives are equal to zero at this point) is identically zero. The standard example of an infinitely differentiable non-zero function which is flat (at x = 0) is f(x)
= {z ’
I: f ;;
In what follows, either it will be stated or it will be clear from the context whether the functions under consideration and their independent variables are real or complex. 1.2. Solution of Non-linear Equations. This is one of the first applications of linearisation. Let us consider the equation f(x) = 0, where f is a real function which is smooth on a certain interval [a, b] c R. The existence of a root is guaranteed if, for example, f(u) < 0 and f(b) > 0. The uniqueness of the root is guaranteed if the derivative f x) > 0 for all x. Assuming these conditions are satisfied, let us denote the root by X. If some approximation x,, to the root x is known then, using (l), we obtain f(x0)
+f xo)@
- x0) = 0,
*For convenience we also put Co(G) = C(G), where C(G) is the class of continuous functions on G.
a
Chapter 1. Classical Concrete Problems
from which XSC1~Xo-flo.
f(xo)
In exactly the same way the next approximation x2 is formed from x1 (if a < x1 < b) and so on. This iterative process is called Newton process or the method of tangents, since formula (2) corresponds to approximating the graph of the function f by the tangent to it at the point (x,,f(x,)). Theorem 1. Let f be a function
of class C2 [a, b] satisfying
f(a) < 0, f(b) > 0; f x) 2 /A > 0,
the conditions:
If x)1 < M.
lf I% --xl
2/J d--q,
5M
where q < 1, then for the initial approximation x,, Newton the interval [a, b] and converges to the root X with bound 2P b, - Xl < 5Mq2m
(m=O,1,2
process is defined on
,... ).
Thus the linearisation of the original equation generates an iterative process which converges very rapidly3 to the root. We note however that the initial approximation x0 must be chosen with sufficient accuracy according to (3). Condition (3) cannot be completely effective since the unknown root X appears in it. This deficiency can be easily removed but the local character of Newton process is essentially connected with its construction by means of linearisation. Example. Letf(x)=x3+ax+b(-co<x
x m+l =fhA
Cm= 0, 1, Z...).
If we only know that f is continuous then the behaviour of the iterations as m + cc can turn out to be extremely complicated. However under certain stronger assumptions about f global convergence of the process can take place. 3The rate of convergence. defined by the bound (4), i.e. O(q ), is said to be quadratic while with bound O(q it is linear; in such cases we may call the process rapidly convergent in contrast to slowly conuergent processes for which a typical bound is O(m- (a > 0).
5 1. Elementary Analysis
9
Theorem 2. Zf the function f satisfies the Lipschitz condition Mxd with 0 < q < 1, then the iterative approximation with bound
-f(x*)l
G 41x1 - x21
(5)
process defined by it converges for any initial
Ix, - Xl < 1x0 - Xlq”, where X is the root of the equation x = f(x). root is guaranteed by condition (5).
A sufficient condition
The existence and uniqueness of the
for (5) to hold is that f should be differentiable
and
If l s 4. Theorem 2 is widely used in applications. Example.
In celestial mechanics we encounter Kepler
equation
x-ssinx=[
(6)
with unknown x and parameters E, c (0 < E c 1; -co < c < co). Since the function f(x) = E sin x + [ satisfies the conditions of the previous theorem, equation (6) has a unique real root X, which can be found by means of the iterations x,+i = E sin x, + c (m = 0, 1,2, . . .) with any x0. Theorem 2 extends without change in statement (or proof) to mappings of the semiaxis or of any interval into itself4. This is important because many of its applications are of a local character. Example (Implicit function theorem). Let f (x1, x2) be a function of class C2 in a neighbourhood
df of the point (0,O) E lRz such that f (0,O) = 0. Then if ax
# 0, 2 (0.0)
there exists on some neighbourhood lx1 1 < q a unique function x2 = &xi) which satisfies the equation f (x1, x2) = 0 and the condition 4(O) = 0. In fact the equation under consideration can be written in the form x2
= 1x1 + P(X,,
x2),
(7)
where 1 is a constant, plto,oj = 0 and Vpl,,,,, = 0. We seek the root x2 by considering x1 as a parameter belonging to a neighbourhood of zero. Since the derivative with respect to x2 is in modulus less than any given q (0 -c q -c 1) on a sufficiently small neighbourhood of the point (0, 0), we may choose q > 0 and M > 0 such that for Ix1 I < the interval lx21 < Mq is mapped into itself by the function (7). By Theorem 2 equation (7) has a unique root x2 = 4(x1) for lx1 I < q. 41n general, if X is a complete metric space and the mapping f: X + X contracts uniformly, i.e. dist(fi,fy) d q dist(x, y) where q c 1, then the iterative process x,,,+~=fx, (m = 0, 1,2, . .) converges for any x,, to the unique root aofthe equation x =fx i.e. to the fixed point of the mapping/; moreover we have the bound dist(x,, F) Q q” dist(x,, %) (m = 0, 1, 2, . . .) (contraction mapping principle).
10
\
Chapter 1. Classical Concrete Problems
1.3. Extremal Problems. Let the function f: G + R (G a domain in I?) be differentiable at the point x E G. If it attains a local extremum at this point then df = 0 which is equivalent to the equation
Vfb) = 0,
(8)
i.e. x is a statihury (critical) point for $ This necessary condition for an extremum is one of the first basic results of differential calculus. Its discovery is due to Fermat (1643). Further development in this direction came about on the creation of variational calculus (Johann Bernoulli, Euler, Lagrange). In variational calculus we consider functions of variables which are of a more complicated nature than points in R”. For example, in the classical brachistochrone problem it is required to find the curve joining two given points in a vertical plane such that a heavy particle moving along it takes the least possible time. Here the time of motion is a function on the set of smooth curves joining the given points. Functions of such a type are given the name functionals. More abstractly a functional is defined as a real or complex function on any set X (i.e. as a mapping of X --) R or X + C). In variational calculus necessary conditions of the type (8) are established for a wide range of problems on the extremum of functionals. Condition (8) in coordinates represents a system of n equations with n unknowns xi, . . . , x,:
af 0 ig=
(1 < k < n).
The corresponding conditions in variational calculus are ordinary or partial differential equations. They can have deep independent meaning. For example, the dynamics of a wide class of mechanical systems are described by equations of this type; they arise in electrodynamics, in the theory of relativity and so on. In physical theory constructed in the standard way we encounter a certain functional called action and the fundamental equations of the theory can be written as the necessary conditions for an extremum of the action. This approach is called the variational principle of the given theory (or the principle of least action, although the attainment of even a local extremum on trajectories is not at all necessary and therefore it is better to speak of the principle of stationary action). 1.4. Linear Functionals and Operators. The abstract interpretation of functions as elements (points) of some set (usually a real or complex linear space) is fundamental to functional analysis. Such an approach, whose pioneers were Volterra and Hadamard, opened up exceptionally wide possibilities for applying geometric and algebraic methods in analysis. The following spaces may serve as initial examples of linear function spaces (real or complex): the spaces C’(G) (0 < r < co; G c W), the space B(X) of bounded functions on an arbitrary set X, the space C(X) of continuous functions on a topological space X, the space CB(X) = C(X) n B(X). They are all inlinite-
5 1. Elementary Analysis
11
dimensional as in general are the great majority of function spaces5 (with the exception of the case where functions on a finite set are being considered). The simplest class of functionals on a linear space is the class of linear functionals, i.e. additive, homogeneous functionals: Nfl
+fd
= t4fl)
+ bKc2),
for any scalar a.
Naff) = ati(f)
Using standard algebraic terminology, we can say that homomorphism of the linear space into its ground field. a linear space E into itself (or, in a more general form, subspace L c E into E) is said to be a linear operator
a linear functional is a A homomorphism A of a homomorphism of a on E (with domain of
definition D(A) = L).
Example 1. Let us consider the space C[a, b] of continuous functions on the interval [a, b]. Every point x0 E [a, b] defines a linear functional 6.J f ] = f (x0). This functional (with x0 = 0) is called the b-function in the terminology of Dirac who systematically worked with it in quantum mechanics6. Example 2. In the space C[a, b] multiplication of a function by any fixed continuous function g(x) (in particular, by x) is a linear operator defined on the whole space. Example 3. In the space C”(G) any vector field (a,(x), . . . , a,(x)) (x E G) with components u&c) E Cm(G) defines a linear operator
@f)(x) = g1 4(x)g. k
Example 4. In the space C[a, b] differentiation
f H g is a linear operator
with domain of definition C a, b]. If desired we can restrict the domain of definition by imposing a boundary condition, e.g. f(u) = 0 or f(u) = f(b). Clearly this procedure changes the operator although its effect on a function that remains in its domain of definition is not altered. This way of considering supplementary conditions is essential for the application of the abstract theory of operators. The theory of linear operators is one of the fundamental divisions of functional analysis. 1.5. Integration. The classical figures of mathematical analysis understood the definite integral as he sum of an infinitely large number of infinitely small sA function space can be formally defined as any subspace of the space G(X) of all (real or complex) functions on an arbitrary set X. In particular, if X = Z (the set of all integers) or X = N = {O, 1,2,. . . }, then B(X) becomes the space of sequences (two-sided or one-sided in the respective cases). irac defined the b-function by means of the formal requirements 6(x) = 0 (x # 0), 6(O) = co, &6(x) dx = 1, which give rise to the integral representation of the identity operator: f(x) = &6(x - y)f(y) dy; the d-function is the prototype of the modem concept of generalised function (or distribution).
12
Chapter 1. Classical Concrete Problems
elements”. Cauchy (1823) gave this idea a precise meaning when applied to continuous functions. Then Riemann (1854) introduced the class of all functions for which Cauchy definition of the integral is meaningful. For continuous functions Stieltjes (1894) introduced integration with respect to a eight function’ which allowed the connection to be made between ure’ integration and discrete summation. These achievements of the 19th century were superseded in the 20th century by the works of Lebesgue and his successors, as a result of which the general theory of measure and integration was created. This was accompanied by an important extension of the collection of function spaces: there appeared the space L X, ,u) of Lebesgue integrable functions on any set (space) X endowed with some measure p, the space L X, p) of measurable functions the square of whose modulus is in L1(X, p) and, following this, the entire family Lp(X, p) (1 < p < co) of spaces of measurable functions for which the pth power of the modulus belongs to L X, p). A measure p on X is a countably additive function on some a-algebra’ of subsets of the set X which takes values in [0, co]. If p(X) < co the measure is said to be finite. The sets appearing in the associated a-algebra are said to be measurable. A real function f on X is said to be measurable9 if for any half open interval [a, b) c [w its inverse image f-’ [a, b) is a measurable subset of X lo. Given a measurable function f 2 0, i.e. f(x) > 0 for all x E X, then its Lebesgue integral with respect to the measure p is defined as
s
f dP = sup kzo WW %
~,+A
X
where the supremum is taken over all sequences 0 = a, < a, < a, < . . . , a, + co. This integral is a non-negative number or + co. In the first case the function f is said to be Lebesgue integrable or summable. For a real measurable function whose sign is unrestricted we put f+ = max(f, 0), f- = - min(f, 0) and say that f is summable if both functions f+ , f- are summable, or equivalently IfI is summable. Further, by definition,
Finally, if f is complex, f =fl + ifi, then f is said to be summable if fi and f2 are summable and also by definition n modern times the classical concept has been revived with non-standard analysis. aA u-algebra is any family of subsets containing X and closed with respect to the operations of complementation and countable union. y definition measurability of a complex function reduces to measurability of its real and imaginary parts. The basic concepts of the theory of probability may be introduced on the basis of measure theory: a probability space is a set X with a normalised measure n (n(X) = I), called a probability distribution (or probability measure) on X; random events are the measurable sets A c X (p(A) is the probability of the event A); random variables are measurable functions and so on. This approach, which is now generally accepted, was proposed by A.N. Kohnogorov in the early 1930s.
0 1. Elementary Analysis
13
Along with each function g E L’(X, p) every measurable function f such that If1 < 191is summable. It therefore follows from the elementary inequality Ia + blP < 2P-‘(lalP + lblp) that all the spaces Lp(X, p) (p > 1) are linear. If fE L’(X,p) and
then f(x) # 0 only on a set of measure zero or, as it is usually expressed, f(x) = 0 Equality almost everywhere is an equivalence relation on the set of all measurable functions; moreover equivalent functions fi, fi are simultaneously summable or not summable and in the first case almost eoerywhere.
This allows us to identify them in what follows. From now on L’(X, 11)and along with it all the L!‘(X, FL)are in fact converted into linear spaces which consist of the classes under the stated equivalence relation. The space L’(X, p) occupies a special place in the family Lp(X, p) (p 2 1) because of the fact that on it there is a natural scalar product
(this definition is meaningful by virtue of the inequality labI < t(lal” + lb12)). All the properties required in linear algebra of a scalar product are fulfilled here. Therefore in L2(X, ,u) we have a Euclidean geometry (in general, it is inlinitedimensional). In particular we can speak of orthogonal functions fi, f2: (fi ,f2) = 0. A function f is said to be normalised if (f,f) = 1. A system of functions { fi}y is said to be orthogonal if (fi,fk) = 0 (i # k) and orthonormal if (fi,fk) = 8ik. It is particularly important that we can define in L’(X, ,u) the distance d(f, ,fi) between any two elements by introducing, just as in Euclidean geometry, the norm [IfI/ = ,/(f,f) and setting d(f,,f,) = [Ifi -f211. The triangle inequality for the distance follows from the triangle inequality for the norm
llfi +.&II G llfill + Ilfill, while the latter comes from the Cauchy-Bunyakovskij-Schwarz inequality. Thus L2(X, cl) is turned into a metric space (and hence a topological space) which, in particular, allows us to talk of convergence of a sequence (f.)p in L2(X, p). This is what is known as mean square convergence. Especially important among the results of Euclidean geometry which carry over to L2(X, p) is Pythagoras
Theorem.
Zf fi, fi are orthogonal
then
14
Chapter 1. Classical Concrete Problems
Ilfl +M2 = llfil12 + llf2112. In any Lp(X, ,u) (p 2 1) there is also a natural norm
llfll = (Ix IfI’&)? but for p f 2 it is not connected with any scalar product. The corresponding triangle inequality is called Minkowski inequality. As a result for any p 2 1 we may turn LP(X, p) into a metric space by putting d( fl,f2) = llfi -fill. Riesz-Fischer
Theorem.
All the metric spaces Lp(X, p) are complete.
In any real or complex linear space E a functional properties is called a norm:
)If 1) with the following
1) Ilf II > 0 (f z 0);
2) llaf II = ltll Ilf II for every scalar u; 3) llfi +f2ll
G llfill
+ Ilfill.
A norm generates a metric (just as in the examples given above) and, consequently, a topology. A linear space provided with a norm is said to be normed. A normed linear space which is complete with respect to the corresponding metric is called a Banuch space. Banach not only initiated the use of this important class of spaces but he also developed its theory deeply. l1 A Banach space in which the norm is generated by a scalar product (i.e. )If II = J( f, f)) is called a Hilbert spuce12 (without the completeness requirement it is called a pre-Hilbert space). The space L2(X, ZJ)is a Hilbert space and all the spaces Lp(X, p) (p 2 1) are Banach spaces. The most interesting and furthest developed aspects of functional analysis are connected with Hilbert spaces. Let us return to the question of integration of a function. Let Z be any interval on the real axis and d(x) (x E I) a non-decreasing function. For any bounded interval (a, /3) c Z we shall call the increment A(8 - 0) - n(a + 0) its I-measure; the jumps of the function L at the boundary points of the interval Z will be called the A-measures of these points. By means of this a certain measure Z.L~can be uniquely defined first of all on the a-algebra of Bore1 sets13 and then also on the a-algebra of sets of the form B u N where B is a Bore1 set and p:(N) = 0 (,u* is the outer measure defined by a measure p, i.e. p*(A) is the inlimum of the measures of all measurable sets which contain A). The latter is called the Lebesgue a-algebra for a given function I and the measure pLIis called the Lebesgue-Stieltjes measure defined by the function 1. The integral of a function f E L Z, pl) is usually written in the form ii Even finite-dimensional normed spaces (all of which are complete) are objects which are highly interesting and also useful for application (for example, in numerical analysis). r21n honour of Hilbert who introduced (in the setting of L*) methods of Euclidean geometry into analysis. i3 This is the smallest a-algebra containing the compact sets Q c 1. If a measure on a locally compact opological space X is defined on all the Bore1 sets and is finite on all the compact sets then it is called a Bore! measure.
4 1. Elementary Analysis
15
If f is continuous then this integral is just the usual Stieltjes integral. Consequently for any fE J!,’(I, PJ the integral (9) is called its Lebesgue-Stieltjes integral. The space Lp(Z, pl) is often denoted by Lf;(Z) or more concisely Lf; (provided the domain of,definition appears in the description of the function A). In the case J,(x) = x the mesure pI is called Lebesgue measure. For this measure we write briefly Lp(Z) or even simply Lp if the interval Z is already fixed. The theory of Lebesgue also provides a suitable context for the indefinite integral, characterising functions of the form xf(t) dt + const. (x E I, a E I) s (f is locally summable, i.e. s L’ on some neighbourhood of each point) by means of the property of absolute continuity: the image of any set of measure zero has measure zero. In this situation F x) =f(x) almost everywhere. Any nondecreasing function I is differentiable almost everywhere and 1’ is locally summable, which allows us to extract from I its absolutely continuous part, the integral of L We can introduce analogously the Lebesgue measure and integral on the circle (however this reduces to the case of the interval C-n, n]). For any two measures pl, pLzon spaces X,, X, we form naturally their direct product pL1 x pL, on Xl x X,. For this, iffE L X, x X,, pI x p2), we have F(x) =
(summability
fh, ~2) &, &z = 41 fh ~2) &z f Xl xx2 s Xl s X2 at each step of the repeated integration is guaranteed). This is the
theorem of Fubini.
In particular, the construction of Lebesgue measure on R” starting from Lebesgue measure on W is covered by what has been said. A basic property of the integral on any space X with measure ,u is that the functional +(f) = j- f-h X
on L X, ~4)is linear14. In general, integ ration is a powerful tool for the construction of functionals both linear and non-linear. For example, the simplest form of functionals arising in variational problems is
ICYI =
sa
(x,Y,Y dx
(Y E C a,bl),
(10)
14There is in addition the important property of positiuity, namely $(f) 2 0 if f 2 0. For a linear functional positivity is equivalent to monotonicity: if fI Cfi then $(f,) C $(fi). 15Also operators.
16
\
Chapter 1. Classical Concrete Problems
where f is a given, sufficiently smooth (not lower than C2) function of three variables. The functional ction’ for a conservative mechanical system with s degrees of freedom has form 12
ICd =
m, 4) dt,
(11)
s t1 where t .is time, q = q(t) (ti < t < t2) runs through the set of curves of class C2 in s-dimensional configuration space, q(t) = (ql (t), . . . , qS(t)) and L is the Lagrange function, i.e. L = T - V where T is kinetic energy and V is potential energy. Action has an analogous structure in other physical theories where it appears as the integral of a certain agrangian’. In many situations the integral representation of a functional is an inevitable consequence of its intrinsic properties. For example as F. Riesz showed (1911) any positive linear functional on C[a, b] has form tiff)
= S b) dw4, (1 where Iz is some non-decreasing function on [a, b]. It is essentially uniquely defined, i.e. to within an additive constant on the set of points of continuity. In other words, the measure p1 is uniquely defined. Example.
If n(x) = 0(x < x0), J.(x) = 1(x > x,,) then $(f) =f(x,)
= S,,[f].
1.6. Differential Equations. As we have already said above, extremal problems in function spaces are one of the sources of differential equations. Let us consider, for example, the simplest problem on the extremum of a functional of the form (10). The differential (variation) of this functional is equal to 61 =
where 6y is the increment
* (fySy +f,dy s II
(oariation)
dx,
(12)
of the element y E C a, b], 6~’ is the
of the derivative y f, = af and f,, E -af Since 6~’ = ay" ay @y) then, with the assumption that y E C2[a, b], we obtain on integrating by parts, corresponding
variation
(13)
For an extremum it is necessary that 61 E 0. Since 6y is any function of class C2, there follows from (13) the Euler-Lagrange equation:
This is a second order differential equation with respect to the extremal function y (under the condition of regularity f,,,, # 0).
$1. Elementary Analysis
17
Analogously from the variational principle and the expression (11) for action in mechanics we can deduce the equations of motion in generalised coordinates, which are called the Lagrange equations: d 8L --...--= aL aqk
o
(1 < k < s).
dt 84,
Here L = T - V where
and the quadratic form T is positive definite. Consequently tions can be written in the form
the Lagrange equa-
with certain coefficients ~~,~~(q). We can resolve this system with respect to dj (1 < j < s) since the matrix (pkj) is non-singular. The equilibrium positions of the conservative mechanical system under consideration coincide with the stationary points of the potential energy:
-= av 0
(1 < k < s).
aqk
If q = 0 is an equilibrium position and if the motions which begin close to zero with small initial velocities remain as such for all time16, then it is natural to linearise the equations (14) for q, 4: J$
Here Akj = -
a2v
aqkaqj
pkj(O)dj
Equations
+
i j=l
Akjqj
=
0.
(15) describe (approximately)
(15) small oscilla-
@’
tions close to stable equilibrium. For s = 1 we have the harmonic oscillator, ~4~ + Iq, = 0, oqcillating according to the rule q1 = A cos ot + B sin ot with A frequency 0 = -. In the general case, because of the positive definiteness of J P the quadratic form T, we can transform the system (15) by means of a nonsingular linear substitution
qj=$ CjlQt
(1 <jGs)
to the form 16The assumption about the stability of the equilibrium position q = 0 which is contained in this sentence is valid if the potential energy V(q) has a strict local minimum at the point q = 0 (LagrangeDirichlet theorem).
Chapter
18
1. Classical
(j, +
o:Ql
Concrete
Problems
(1 < 1 < s),
= 0
i.e. to system of s independent harmonic oscillators (normal modes) with frequency spectrum (cc&. The general solution of the system (15) is the superposition of these harmonic oscillations-this is a very simple example of harmonic analysis. If the system is not conservative then in the Lagrange equations the right hand sides appear as non-potential generulised forces. These forces can act in such a way that the total energy of the system will be diminished (dissipation). In this case damped oscillations appear in the linear approximation. We note that equations (15) become exact if we replace L by its second differential at the point 4 = 0. Up to now we have considered ordinary differential equations describing the dynamics of a system with finitely many degrees of freedom. Systems with infinitely many degrees of freedom are usually described by partial differential equations. As an example let us consider small vibrations of a uniform string (with density p). In equilibrium let it occupy the interval [0,1] of the x-axis and let it undergo a tension 1 and then begin to oscillate (without friction) in a horizontal plane x, u so that at time t the point (x, 0) is displaced to the point (x, u(x, t)) (transuerse vibrations). A satisfactory approximation to the action is provided by l J=; (puf - Au, dt dx. ss11 0 This is a quadratic (in the obvious sense) functional on the space of smooth functions u(x, t) (0 d x < 1, t1 < t < t2). With the assumption that u E C2 there follows from the variational principle 6J E 0 the one-dimensionul wave equation
a%
@=a
,a% ax'
(
u=
-.L J>P
(16)
Equation (16) arises systematically in the description of oscillations of onedimensional systems. Moreover in certain situations it can be exact (for example, in electrodynamics). For - co < x, t < co its general solution has the form u(x, t) =f(x
+ at) + g(x - at),
(17)
where f, g are arbitrary functions (of class C . Thus u(x, t) turns out to be the superposition of two truvelling wuoes moving with speed a along the x-axis in opposite directions. Given the initial conditions u(x, 0) = $w, Utb, 0) = w (i.e. the initial configuration
(18)
of the string and the initial assignment of velocities) 2
he
n-dimensional
Au = $I $.
wave equation
has form e
For n = 2 this is the equation
at2
= a2Au, where
of small transverse
A is the Laplace vibrations
operator
of a uniform
or Laplacian: membrane.
0 2. The Fourier Method and Related Questions
19
D lembertS formula follows from (17): u(x t) = 4(x + 4 + 4(x - 4 7 2
+ 1 2u
*+lzt _ W) x s at
&*
For a finite string (0 < x < 1) boundary conditions must be given in addition, for example u(0, t) = 0, u(l, t) = 0.
(19)
If moreover 4 E C2, $ E C1 and 4, #‘, II/ vanish at the ends of the interval [0,1] (this is all necessary in order that u E C2), then the solution is found as before by D lembert formula where 4, $, which are initially given for 0 < x < I, are then extended to the whole axis as odd periodic functions (with period 21). We give one further classical partial differential equation - the heat equation for a bar? au at=
a2u
(20)
(u = u(x, t) is the temperature at the point x at time t > 0). In order to extract a unique solution it is necessary to give the initial condition u(x, 0) = d(x) and, if the bar is finite, boundary conditions at each end such as (19).
0 2. The Fourier Method and Related Questions This method’ works effectively in a wide circle of problems of mathematical physics. Attempts to justify it in a sufficiently general form led to the creation of spectral theory, a most important branch of functional analysis. Eigenvalues (the spectrum in this or some more general sense) and eigenvectors of linear operators are the basic concepts of this theory. Finite-dimensional spectral theory is completely resolved by algebraic techniques but the infinite-dimensional situation requires analytical methods. 2.1. The Vibrations of a String. In this problem the physical significance of the Fourier method is the construction of the solution by means of the superposition of standing waves i.e. solutions of form u(x, t) = X(x) T(t). Separation of ouriables results from substitution in equation (16):
X”
-c-c X
T” u2T
-oJ,
2
where w = const. The boundary conditions (19) require that X(0) = 0, X(I) = 0. I*
The n-dimensional
heat equation
au
has form at = K Au.
1Daniel Bernoulli and Euler were in possession of this method long before Fourier.
20
Chapter 1. Classical Concrete Problems
The frequency spectrum is determined and correspondingly X,(x) = sin y x,
from these to be o, = y (n = 42, 3, . . .)
T,(t) = A, cos y
t + B, sin 7
t.
The series
formally satislies the equation, since it is linear, and clearly also the boundary conditions. The initial conditions (18) immediately reduce to the problem of expansion in trigonometric series: 4(x) = nzl A, sin yx,
I)(X) =
l B,?
sin y x.
(21)
At one time a bitter controversy arose over this problem between Euler and D lembert, which was only resolved in the 19th century after the work of Fourier and after N.I. Lobachevskij (1835) and Dirichlet (1837) gave a more general definition of the idea of a function and established the first tests for the convergence of Fourier series. In the present context the Fourier series (21) of the function 4 is defined by the Euler-Fourier2 formulae for the Fourier coefficients An=;
‘((x)sinyxdx, s0
which arise formally3 from the orthogonal interval [0, Q. Likewise
system of functions sin 5x
(22)
on the
’ I&X) sin F dx. s0 With this the procedure for solving the problem on the vibrations of a linite string is formally completed. B. = &
Theorem. Zf 4 E C O, 11, + E C’ [0, 1] and 4, +“, $ oanish at the points x = 0, 1, then the series
A.cosyt+B,sinyt with coeflicients defined by formulae (22), (23), converges uniformly on the strip O<x
8 2. The Fourier Method and Related Questions
2.2. Heat Conduction.
21
For a solution of the form u(x, t) = X(x) T(t) we obtain
from equation (20): X” -c-z X
T lCT
-m.
2
If 0 < x < 1 then under the boundary conditions (19) we obtain the same spectrum as in the previous problem and the same X,,(x), but now T,(t) = e-rwo2~, which gives u(x, t) = g
&?(nn~l)2t
sin Ex l
n=1
as the formal solution of the problem with initial condition being defined by the formulae (22).
u(x, 0) = d(x), the A,
Theorem. Zf 4 E C2 [0, lJ and 4, 4” vanish at the points x = 0, 1, then the series (24) converges uniformly on the half-strip 0 < x < 1, t > 0 to the solution of the heat equation with boundary conditions ~(0, t) = 0, ~(1, t) = 0 and initial condition 4% 0) = 4(x).
In the problems considered above, the solution u(x, t) is obtained in the form of a series in the eigenfunctions
sin yx
d2
of the operator dxz on [0, l] with the
conditions ~1,~~ = 0, u(,+ = 0. The essence of the Fourier method is the expansion in terms of the eigenfunctions of a linear differential operator (a sufficiently complete description of this class of operators will be given in the sequel). 2.3. The Classical Theory of Fourier Series. For simplicity let us assume in (21) that 1 = rc. If the series (21) for 4 (with I= rr) converges to it everywhere on [0, rr] then it converges on the whole axis to the function 4 obtained from 4 by extending it as an odd function to [-a, 0] and then as a periodic function (with period 271)to the whole axis. If f is any (not necessarily odd) periodic function with period 2n which belongs to4 L’( - x, rr), then its Fourier series is defined as $ +
l (a, cos nx + b. sin nx),
where a, = i
: f(x) cos nx dx, s II
b. = i j:
these are its Fourier coefficients. 4 We can say that f is an L function on the unit circle T.
x
f(x) sin nx dx;
22
Chapter 1. Classical Concrete Problems
For complex f the complex form of the Fourier series
is used more frequently; to transfer to this we have to take C.=i(a.-ib,sgnn)=l
2x
: f (x)eeinx dx s x
(c-, = ;E, for real f ). Moreover the partial sums of the series (25’) may be written in the form S,(x) = , &” c,einx. m.
The system (einx)” Ex is orthonormal on T if the measure of T is normalised to 1. A fundamental problem is to investigate the convergence of the Fourier series of a function f to itself or, in a more general form, to reconstruct the function f from its Fourier series. For example, in the proofs of the theorems in Sections 2.1, 2.2 use is made of the result that the Fourier series of a smooth function converges uniformly to it. The following is a more subtle test. Dirichlet-Jordan Theorem. Zf f is a continuous function then its Fourier series converges to it uniformly. The proof of this theorem relies on the representation the form w4
= &
Q,,(x - 5)f(5)
of bounded oariation
of the partial sums in
d&
(26)
where D,,,(x) is the Dirichlet kernel:’ D,(~)
=
C
einx
=
InlCm
sin(m
+
*lx.
(27)
sin X 2
It possesses the following basic properties: 1) &jFn D,,,(x) dx = 1; 2) if a function f o L’ vanishes on some neighbourhood
lim = D,,,(x)f(x) m-e0 s -z
of zero then
dx = 0.
sWe note that in formula (26) there appears the important convolution functions on the circle:
construction for two summable
(B *f)(x) = &
; dx - Of(t) de. s I For the convolution of functions the corresponding Fourier coefkients are multiplied (Conoolution
Theorem).
together
0 2. The Fourier Method and Related Questions
Property 2) follows from the Riemann-Lebesgue then
23
theorem: if h E L’ (- co, 00)
lim m h(x)e ” dx = 0. i4-Q-J I -00 In turn, the principle of localisation follows from property 2): the convergence and the sum of the Fourier series of a function f~ L’ at a point x depends only on the germ of the function at this point. The insufficiency of continuity alone even for the convergence of the Fourier series at each point was first shown in 1876 by Du Bois-Reymond in an example, after which Lebesgue (1905) constructed a continuous function whose Fourier series converges everywhere but not uniformly. This line of development, which compromised pointwise convergence as a tool for reconstructing a function from its Fourier series, was brought to a definite conclusion when A.N. Kolmogorov (1926) produced an example of a function in L’ whose Fourier series diverges everywhere. In another direction, N.N. Luzin raised the question of the convergence almost everywhere of the Fourier series of any function f~ L This question was resolved positively by Carleson (1966) and shortly afterwards Hunt extended Carleson result to all Lp (p > 1). A natural method of reconstructing a function from its Fourier series is provided by Cesriro summation Sk(X)
=
j&
j.
Mx)
=
4(x
&
-
Of63
dt,
(28)
where Fk(x) is the Fejtr kernel: . ,k+l sin -x F,(x) =
2 (k + 1) sin’ G
(29) 1 _ &!-)
=
einx;
the Sk(x) are called the Fejtr averages of the function 5 Lebesgue-Fejkr Theorem. to f (x) almost everywhere6. f uniformly.
For any function
f
E L’ the Fejtr averages converge
Zf f is continuous then its Fejtr averages converge to
The proof of this theorem is based on the representation basic properties of the Fejtr kernel: 6 At all Lebesgue
points,
i.e. points x such that lim1 E-O 2E
c
If(x + 0 -f(x)1 & = 0.
(28) and the following
24
Chapter 1. Classical Concrete Problems
1) &fin F,(x) dx = 1; 2) h-n,+, F&c) = 0 uniformly
on [ -
T, rr] outside of any neighbourhood
of zero;
3) Fk(X) 2 0 (- co < x < co).
As compared with the Dirichlet kernel, 3) is the most significant property and it leads to quite different behaviour of the FejCr averages from that of the partial sums of the Fourier series. Corollary
1. A function f E L’ is uniquely determined by its Fourier series.
This justifies the usual notation f(x) -
f c,einx II= -03 for the correspondence between functions and their Fourier analogous notation in the real form).
series (and the
Corollary 2 (Theorem of Weierstrass). Any continuous periodic function (with period 24 is the uniform limit of a certain sequence of trigonometric polynomials
T,(x) = 1
Inl<m
c,,,einx.
(30)
The theory of Fourier series for functions of class L2 turns out to be particularly satisfactory. The reason for this is the Euclidean geometry and completeness of the space L Moreover the L2-theory is most important for many applications, which is explained in particular by the physical significance of the integral of the square of the modulus. For example, the total energy of a string (in the approximation for small vibrations) is equal to E=f
pu;+Izu:)dx; I0 in quantum mechanics the integral of the square of the modulus of the wave function Ii/(x, t) over any volume Q is the probability of finding particles in this volume and so on. In the Lz-theory it is necessary to consider for f E L2 the quantity
&if)
= llf- &Al2 =&
The following expression for E,(f)
E.(f)=& from which follows immediately
and, consequently,
s
; If(x) - $,,(412dxs n
is easily obtained:
; I If(
dx - ,I); lnk12,
Bessel inequality
(31)
5 2. The Fourier
Further for any trigonometric
Method
polynomial
and Related
Questions
25
(30)
IV- T,,l12= IV- $,,112+ IlSm- Ll12 2 I%
(32)
(here Pythagoras theorem can be applied because of the orthogonality of the function j i S,,,to all einx (I nl < m) and therefore to all trigonometric polynomials of degree <m, in particular, S,,, - T,). The inequality (32) shows that the mth partial sum of the Fourier series of the function f (and only it) gives the best least squares approximation to f by means of trigonometric polynomials of degree <m (the error of this approximation is equal to ,/E,(f)). If f is continuous we can assume that T,(x) converges uniformly to f as m + co and moreover in L2. The last assertion is correct for any f~ L2, so that f can be approximated in L2 to any degree of accuracy by continuous functions. But then from (32) E,(f) + 0 as m + co; thus we have Theorem. For any f E L2 its Fourier series converges to f in L2, i.e. in mean
square. In this case Parseual
equality7 holds (see (31)):
$,
IC 2= ;I;
If(xN” dx. 5%
On the other hand the Riesz-Fischer theorem on the completeness of the space L2 implies the convergence in L2 of any trigonometric series f c,einx n= -m
(33)
which satisfies the condition
But then (33) will be the Fourier series of its sum. Thus we have a one-to-one correspondence 9 between the function space L2( -7c, rr) and the space of scalar (complex) sequences (c,)? which satisfy condition (34). The latter space, which is usually denoted by’ 12, is L2(Z, p) where p is the measure on Z such that9 p( (n}) = 1 for all integers n. Parseval equality says that 9 is an isometry. Moreover 9 is an isomorphism of the linear spaces L2( --a, rr) and 12. It is lthough Parseval wrote down this equality away back in 1799, it was only proved considerably later. *The notation 1’ is also used for the space of one-sided numerical (complex) sequences, the squares of the moduli of whose terms form convergent series (we also retain it when we restrict to real numbers). The spaces Ii’ (p 2 1) are defined similarly. 91f r({n)) = pL,, where (p(.) is a given (weight) sequence of positive numbers, then we use the notation 1,’ (similarly Ii (p 2 1)).
26
appropriate identity
Chapter
1. Classical
Concrete
to note here that from Parseval
Problems
equality follows the more general
where not only f but also g belong to L the d, being the Fourier coefficients of g. This means geometrically that (9f, 9g) = (f, g). The problem of absolute convergence occupies a significant place in the theory of trigonometric series. In the case of a series of the form (25 absolute convergence everywhere is clearly equivalent to c. lc,I < a. In the case of series (25) it is equivalent to
but this is no longer obvious (in one direction) and constitutes the Denjoy-Luzin theorem: if the series zl (a, cos nx + b. sin nx)
is absolutely convergent on a set of positive measure then (35) holds. In contrast to this we have the example of the series
which terminates for each x that is commensurable with x. If a trigonometric series converges everywhere then by (35) it converges uniformly, its sum f is continuous and it is the Fourier series of J Conversely if a continuous function f is given then the absolute convergence of its Fourier series depends essentially on the behaviour of its modulus of continuity
4@ = ,y=, Ifcd --f(Y)1
x . as 6 + 0. We say thatfbelongs to the Lipschitz C~USS with exponent a (0 < u< 1) and write f~ Lip a if o(6) = O(6 . S.N. Bernshtein proved that if f~ Lip a for some a > f then its Fourier series is absolutely convergent while counterexamples are to be found in Lip 3. The functions with absolutely convergent Fourier series form not only a linear space but also an algebra, for if
fib) = n= ?-53 and f = fr
c;l)einx,fz(x)
=
f2, then
lo Or Hiilder class. I1 A more general condition
is PO o(S)S-qz
f -al
da < co (E > 0).
C$!)einx
I
9000923 27
0 2. The Fourier Method and Related Questions
C” =
f
I= -co
c!$j2)
(n = 0, * 1, &- 2, . . .),
the convolution of the sequences (~a)), (ci2’), and
This is the Wiener algebra W, so-called because of its connection following theorem. Theorem of Wiener.
Zf f E W and
with the
f(x) is nowhere equal to zero, then 1 E W. f
The algebra W is a very interesting and significant example of a commutative Banach algebra. It is now customary to apply this term to a Banach space which has the structure of an associative, commutative algebra with identity whose norm has the following properties relating to this structure: llfifill
G llfill
* Ilf2ll;
e and
IMI = 1.
The theory of commutative Banach algebras (under the name commutatioe normed rings) was originated by I.M. Gel and but the theorem of Wiener served as its first touchstone. We note that the natural norm on the algebra W is IIf II = 1. Ic,I. In other words, the Fourier series expansion transforms W isometrically and isomorphically into the Banach algebra 1’ with convolution as multiplication. 2.4. General Orthogonal Series. The purely geometric character of the L2theory of Fourier series was tirst realised by Hilbert who considered in place of the system (einx) in L2( - rc, rr) an arbitrary orthogonal system (~Jx)) in L a, b); it is even possible to consider such a system in L X, /.A)for any space X with some measure p, since this in no way affects the argument. Defining the generalised Fourier coeficients as
we can introduce for any function f E L X, p) its generalisedt2 Fourier series c. c,~&c). The system (4 is said to be complete if it is possible to approximate any f E L2(X, 11)in mean square to any degree of precision by means of linear combinations of functions of the system I3 . However, as before, the partial sums he word eneralised’ will be dropped in the sequel. I3 In this definition orthogonality is not necessary. We can also discuss the completeness of systems in other topologies, in particular in P(X, cc)(p 3 1).
1111 llll llllll IllIIIIIIII11 Ill1 IIIlllll II11 lllll Illlllll III #f&j&+ MAT0003040368%
28
Chapter 1. Classical Concrete Problems
of the Fourier series are the best approximations of this type. The property of being complete for the system (4”) is therefore equivalent to the fact that for any fe L2(X, p) its Fourier series converges to f in mean square. Even if the system is not complete, the Fourier series of all functions in L’(X, p) converge in mean square because of Bessel inequality
and the Reisz-Fischer theorem. In the case of a complete system (and only in this case) the functions f~ L2(X, p) are uniquely determined by their Fourier coefficients. In order that the series converge in L X, p) to f, it is necessary and sufficient that Parseval equality should hold:
; IC 2114 =12 x IfI2 G. s The system (4 is therefore complete if and only if Parseval equality holds for all fe Z, X, p). In this case the mapping f~(c,) turns out to be an isometric isomorphism of the spaces L2(X, p) and If where v, = ~~~~~~ 25. Orthogonal Polynomials. Suppose there is given on some interval Z of the real axis a non-decreasing function C(X) such that all the powers xL (k = 0, 1,2, . . .) belong to Lz. Suppose that the set of points of increase of the function 0 is infinite. Applying the process of orthogonalisation (with normalisation ) to the system of powers we obtain the system of orthogonal polynomials (5(x))& (deg Pk = k), which is uniquely defined for the given function e (or measure pO). If e is absolutely continuous its derivative p = 0’ is called a weight and then we speak of polynomials which are orthogonal with respect to the given weight p. The classical examples, which arise in problems of mathematical physics, are: 1) p(x) = e+ (- co < x < 00) - Hetmite polynomials; 2) p(x) = eVX (0 < x < 00) - Laguerre polynomials; 3) p(x) = 1 (- 1 < x < 1) - Legendre polynomials.
All three systems are complete in the corresponding Lz. In other words, in each of these spaces the system of powers is complete. As a result of Weierstrass theorem on the uniform approximation of continuous functions by means of polynomials, the system of powers is complete on a finite interval for any Q. On an infinite interval whether it is complete depends on the weight. For example, if p(x) = e + xt then the system is not complete since m xke- lxl sin(2n In 1x1)dx = 0 (k=0,1,2,...) s -G3 (any function which has arbitrary close mean-square approximations by means of polynomials for the given weight is orthogonal to sin(2a In /xl)). This example of Stieltjes is covered by the following test (M.G. Krejn, 1945). 141t is possible to carry it out so that the highest coefficients are positive; this is assumed in the sequel.
$2. The Fourier Method and Related Questions
Theorem.
+)(--co
29
Zf a’(x) is the derivative of the absolutely continuous part of a function
<x < a)and
complete in Li.
* In o’(x) dx > - 00, then the system of powers is not s -,1
The converse assertion is true if C(X) is absolutely continuous p(x) satisfies the conditions: 1) SUP P(X) -= 00;
2) P(-4
and the weight
= p(x);
3) for x > 0 the function -ln(l + x’)p(x) is non-decreasing tive to In x. Thus if the conditions listed still hold and OD ln ~(4 -oo i-2 s
and convex rela-
dx = -co,
then the system of powers is complete in Li. The following general criterion for completeness was given by B.Ya. Levin. Theorem.
ODda(x) (x)‘ ’<1 2 sJ-T-2
Let mDz,be the set of all polynomials satisfying the condition
and put
In order that the system of powers be complete in Li it is necessary and sufficient that O” In M(x) dx = ~ s -m 1 This theorem (1953) and S.N. by polynomials a general theory and in L$
is related to the criteria of N.I. Akhiezer and Mergelyan (1954), relating to weighted uniform on the whole axis. In his lectures (1957) B.Ya. which covers weighted approximation both in
S.N. Bernshtein approximation Levin developed the uniform case
2.6. The Power Moment Problem. Let us denote by Z any one of the three intervals (- co, co), [0, co), [0, 11. Let a(x) (x E I) be a non-decreasing function and as in Section 2.5 suppose
(xl” da(x) < co sI (in the case of Z = [O, l] this condition follows from the boundedness of a). The numbers mk =
fI
xk da(x)
(k=0,1,2,...)
30
Chapter
1. Classical
Concrete
Problems
are called the power moments of the function G (measure p,). If we interpret the measure ,LQ,as a mass distributed on Z then m, is the mass of the whole od ml is the static moment, m2 is the moment of inertia relative to the point x = 0. The problem is to give an intrinsic characterisation of the class of moment sequences (mr)2. For any Z an obvious necessary condition is the positivity of the Hankel matrix A4 =
(t?lk+j)kqlj=o:
$ m,+j&<j
2 0 for all n = 0, 1,2,. . . and for all (ci, rz,. . . , 5.) E R
This condition is also sufficient in the case Z = (- 00, 00) (theorem of Hamburger). In the case Z = [0, co) necessary and sufficient conditions are provided by positivity of the matrices M and M, = (mk+j+l)&=e; in the case Z = [0, l] we have an analogous (but more cumbersome) criterion, although the following theorem is of greater interest. Theorem of Hausdorff. In order that (mk) be a moment sequence on [0, l] it is necessary and suflicient that the inequalities A ,z
t (-1,’ j=O
0
7
(n,k=0,1,2
> 0
mk+j
,...)
hold.
As before the necessity is trivial. The proof of the sufficiency can be carried out on the basis of the uniform approximation of a function f~ C[O, 1) by means of the Bernshtein polynomials
Bnkf)=kio00 ;f
g Xk(l - x) !
This tool was proposed at one time by S.N. Bernshtein for the proof of Weierstrass theorem. At the same time we obtain from Weierstrass theorem the definiteness of the moment problem on an interval, i.e. the uniqueness of the corresponding measure. As the example of Stieltjes shows, definiteness fails in general on the semiaxis or the whole axis. Carleman (1922) found the following suflicient condition for definiteness of the moment problem on [0, co): -
= co.
It is satisfied for example by mk = k! In this case the unique (up to an additive constant) solution of the moment problem is a(x) = 1 - emX. On the whole axis an analogous sufficient condition is as follows: -
=
co.
The sequence mZk = T(k + i) along with mzk+i = 0 (k = 0, 1, 2, . . .), which cor-
$2. The Fourier Method and Related Questions
31
responds to the weight p(x) = iemX2, provides an example where this condiJn tion is satisfied. The investigation of the moment problem is closely connected with the properties of orthogonal polynomials. Let us consider the moment problem on the whole axis for the sequence (mk);. We introduce on the linear space of real polynomials the linear functional 8(R) = 2 m,r,
R(x) = i r,xk . k=O ( 1 If the given moment problem is solvable and C(X) is any solution of it, then k=O
8(R)
=
m R(x) da(x). s -Co
(36)
Thus, all solutions generate one and only one linear functional and one and only one scalar product (P, Q) = @(PQ) (it is important for this construction that the polynomials form not only a linear space but also an algebra). Positivity of this scalar product follows from positivity of the functional 8, while for its nondegeneracy it is necessary and sufhcient that the set of points of increase of the function 0 be infinite, which in turn is equivalent to the rank of the matrix A4 being infinite - this will be assumed from now on. It follows from what has been said that the system of orthogonal polynomials (s(x)): is uniquely defined by the moments (mk)$ even in the indefinite case. We shall assume further that m, = 1 and then P,(x) E 1. The theory set out below was established in 1922 in the works of Hellinger, Nevanlinna and M. Riesz and was subsequently developed further by H. Weyl(1935). Emma. &f the series xk Ipk(z)12 converges even at one point z (im z # 0) then it converges uniformly on each bounded region.
In all cases the quantity I(‘) = ‘(1 2lim zl
k
ipktz)12)-’
is called the WeyZ radius at the point z. The lemma means that on the set im z # 0 the Weyl alternative holds: either r(z) = 0 for all z or r(z) > 0 for all z. Let us introduce the system of polynomials Q,(Z)=@x~‘+~(x)) associated with (Pk)$. If we
(k=0,1,2,...),
put
w,(z) =
then as a result of (36) we have
O3 da(x) __ s -,x-z
(im z # 01,
32
Chapter
Q&J
1. Classical
=
Concrete
-wAzP&)
Problems
+ cd4
where the ck(z) are the Fourier coefficients of the function (x - z)-‘. Bessel inequality gives
Consequently
the point w,(z) belongs to the set W(z) defined by the inequality
This is a disk of radius r(z) for r(z) > 0 and a point for r(z) = 0. Correspondingly W(z) is called the Weyl disk or point. Theorem 1. The moment problem is definite in the point case and indefinite in the disk case.
In fact, if in the point case there are two solutions a,(x), a2(x) then w,,(z) = w,,(z) for im 2 # 0, i.e. J~*$g$J~m$!g, from which it follows that r~i(x) essentially coincides with a2(x) (see Section 1.5). In the disk case for each point c E W(z) there exists a solution a(x) of the moment problem such that w,,(z) = {. A solution a(x) is called z-extremal if the point w,(z) lies on the boundary of the disk, i.e. equality is attained in (38) or in other words, Parseval equality holds for (x - z)- Therefore, if the system of powers is complete in L$ then a(x) is z-extremal for all z (im z # 0). The converse is true even in a strengthened form. Theorem 2. Zf the solution o(x) is z-extremal system of powers is complete in Lz.
for some z (im z # 0) then the
Since this solution is z-extremal for all z, we can call it simply extremal. The unique solution in the point case is referred to in the same way. Analogous to Theorem 2 we have the following theorem. Theorem 3. Zf the moment problem has a unique solution o(x) then the system of powers is complete in La.
The description of all solutions of the moment problem in the indefinite case is of considerable interest. Some further concepts are required for its presentation. We say that a function U(Z) which is analytic on the half-plane im z > 0 belongs to the class of Neualinna (in short - the class N) if im U(Z) 2 0. For example, any function w,,(z) is of this type. Theorem 4. In order that the function u(z) (im z > 0) be in the class N it is necessary and suflicient that it permits an integral representation
$2. The Fourier Method and Related Questions
u(z) = a2 + /I + where z(x) is a non-decreasing
Corollary.
I
OD1+x2
~ -Q, x-2
bounded function,
dW,
a 2 0 and /II is real.
w(z) (im z > 0) be of the form
In order that the function w(z) =
33
* dr(x) -a, X’ s
where T(X) is a non-decreasing bounded function, it is necessary and sufficient that w(z) belongs to the class N and satisfies the condition
lim
y+ +a0
lyw(iy)l
< 00.
In the indefinite case the series z IQ(z)12 converges uniformly region. Let us put
in each bounded
al 1(4 = z T Qk(0)Qk(zh
a12(4 = 1 + z T Pk(0)QkW~
a,,(z) = - 1 + z 7 QkWk(z)9
az2(4 = z T SWJ&9.
This defines the Nevalinna matrix N(z) =
all(z) ( azl(4
a12(4 az2b9 > ’
an entire matrix-function of the complex variable15 z, which is real on the real axis and identically unimodular, i.e. det N(z) E 1. Theorem 5. The formula
s
m do(x) -= .-,x-z
- al&M4 a2 1(zM4
- all(z) - az2(4
establishes a 1 - 1 correspondence between the set of solutions of the moment problem and the set of functions of class N augmented by the constant 00. Under this correspondence the set of extremal solutions is mapped onto the set of real constants (00 included).
Corollary. Zf a(x) is an extremal solution of the moment problem then w,(z) is a meromorphic function. Its poles are clearly real and coincide with the points of increase of the function a(x).
Is It is of zero exponential type: l&i In IM n
la&l
maxi.k
I4
= o .
34
Chapter 1. Classical Concrete Problems
2.7. Jacobian Matrices. If j G k - 2 the polynomial x$(x), being a linear combination of the polynomials P,(x), PI(x), . . . , Pkml(x), is orthogonal to P&c). But then xPk(x) is orthogonal to q(x). Consequently
xpk(x) = a,P,W
(k=0,1,2,...)
+ bkec+l(x) + BkPk-16)
(39)
Here P-,(x) z 0, pk
=
(xpk(x)>
ak = pk-l(x))
The recurrence formula the initial
conditions
(xpk(x)9
b/c =
pk(x))y =
(xpk-l(x)v
pk(x))
fxp,tx), =
bk-,
pk+l(x))
>
o,16
(b-, = 0).
(39) with k > 1 defines the sequence (Pk(x))g
P,(x) = 1, PI(x) = 7.
under
By means of (37) this formula
defines the associated sequence (Qk(x))z undOer the initial conditions
Q,,(x) = 0,
Ql(4 = ;. The ignite
matrix
is called the Jacobian matrix of the given moment sequence (mk); (or of the given a(x)). We remark that J uniquely determines the moments since from the expansion Xk
=
~
ckjpj(x)
j=O
there follows mk = $0 (k = 0, 1, 2, . . .). It can be shown that for any real matrix of the form (40) with positive off-diagonal elements the sequence of polynomials (Pk(x))g defined by it is orthogonal with respect to some a(x) (- co < x < co). We shall consider the matrix J as a linear operator on the (infinite-dimensional) linear space of sequences (vk)z of complex numbers. The recurrence formula (39) (with the replacement of x by an arbitrary z E C) shows that the sequence II(z) = (4(z)); is an eigenvector for J with corresponding eigenvalue z. Clearly, apart from multiples, it is unique in this property, i.e. all eigenspaces of the operator J are one-dimensional. The Weyl alternative implies that either II(z) $ 1* for all z (im z # 0) (the point case) or II(z) E 1* for all such z (the disk case)l’. The criterion for definiteness of the moment problem is thereby reformulated in terms of the spectral theory of Jacobian matrices. l6The latter follows from the positivity of the leading coefficients. utting this differently we can say that equation (39) (with x replaced by z (im z # 0)) either has a unique solution up to normalisation in I2 or all its solutions belong to I2 and moreover for all z. the same case results.
$2. The Fourier Method and Related Questions
35
2.8. The Trigonometric Moment Problem. Suppose there is given on the circumference of the unit circle a measure p which is generated by a non-decreasing function a(x) (-a < x < K). Then the numbers
..=d=,=l~nd~=~~~e
o(x)
(n=O,fl,f2
)...)
are called the power moments of the measure p or the trigonometric moments of B. Here again the problem is to give an intrinsic characterisation of the class of moment sequences (c,)?~. Analogous to the theorem of Hamburger we have the function
Theorem of F. Riesz and Herglotz In order that (c,)?~ be a moment sequence, it is necessary and sufficient that the Toeplitz matrix (c~-j)5j=o be positive, i.e. (41) for all n = 0, 1, 2, . . . and for all (tl, . . . , r.) E C The corresponding measure ,u is unique.
The necessity is trivial. Su.iciency: from (41) &
.
with tr, = eiLt (0 G k < n; t E R) we have
(n + 1 - Im()c,e ’ 2 0
(see (29)). Therefore the functions
are non-decreasing. Moreover they are uniformly bounded: a,(a) = 2ac,. By a well-known theorem of Helly there exists a non-decreasing function a(x) and a subsequence (a,,(x)) which converges weakly to G(X) in the sense that I lim f (4 k&) = x f (4 W4 k-m s --II s -,, for all f o C [ - n, n]. The function a(x) is the desired function. The uniqueness of the desired measure is a simple consequence of the uniqueness of the function with given Fourier coefficients. 0 The trigonometric moment problem has a natural continuous analogue. It is to give an intrinsic description of the class of functions c(n) (- cc < 1 < co) admitting a representation of the form 00 c(A) = eiAXdo(x), (42) s -03 where G(X) is a non-decreasing function. A necessary and sufficient condition for this, established by Bochner (1932) and independently by A.Ya. Khinchin (1937),
36
Chapter
1. Classical
Concrete
Problems
is the analogue of the Riesz-Herglotz theorem. It consists of continuity positiuity of the function c(A). The latter means that
and
for all (A,, . . . , A,) E R” and all (rl, . . . , <J E C” (n = 1,2, 3, . . .). In the terminology of ,probability theory we can say that the characteristic functions of the probability laws are exactly the continuous positive functions satisfying the normalisation condition c(0) = 1. 2.9. The Fourier Integral. This continuous analogue of Fourier series appeared (1822) in the solution of the problem of heat conduction in an infinite rod. In this case the boundary conditions drop out and only the initial condition U(X, 0) = 4(x) remains. The formal solution obtained by separation of variables has the form (for K = 1) E(A)e-azreiAx dl,
(43)
--m m qi(x)epiAx dx. s Here it is required that 4 E L’ although, after eliminating heuristic transformations from (43) and (44), the formula
(44
where E(A) =
-(x-Y)%40&)
E(A) by means of the
dy
is obtained, which applies to a much wider class of initial functions 4. Thus in this instance the Fourier transform E(A) played an important but temporary role. This particular circumstance cannot detract from the significance of the Fourier transform and its generalisations both as a tool for investigating numerous problems and as an independent object. In many ways the theory of the Fourier integral parallels that of Fourier series. The formal inversion of the Fourier transform has the form18
: I
(44*)
i j
E(l)e ” dA,
but in fact it is valid under very stringent restrictions, IsThis
is clear from
(43) for f = 0. Formally
combining
(44) and (44*) we obtain
relation m eiixPy I --m
which
is the typical
form
for the continuous
dl
= 2716(x
situation.
for example, if E(1) is
- y),
the orthogonality
1 1
4 2. The Fourier Method and Related Questions
summablelg.
31
For any c,dE L’ the inversion formula is such that ; q@) dEj = lili
&
” E(A)?
dl
s I N (from which 4 may be determined almost everywhere by differentiation). Consequently 4 is uniquely determined by its Fourier transform. The L*-theory of the Fourier integral is contained in the following theorem of Pluncherel. Let Q belong to L*( - co, co). Then the Fourier-Plancherel transform*’ E(I) = Jili
&
s
:N &)eeiax
dx
exists in mean square and the inversion formula b(x) = iii
&
-1 E(IZ)e ” dl f
holds (convergence in mean square) as does the corresponding
Parseual equality
* IE(l)1* dil = -1 1&x)12 dx. s --Co f
Hence it is clear that the Fourier L*-transform is an isometric isomorphism of the space L*( - co, co) of functions in x and in 1. These two spaces are identified formally, but such a point of view is not always acceptable (for example, in quantum mechanics the variable x may have the sense of a coordinate and then the variable A is associated with an impulse of the particle; their physical identification contradicts the principle of uncertainty). We can also refer to the formula (42) as a variant of the Fourier transform: this is the Fourier-Stieltjes transform of the function a(x) or the Fourier transform of the measure p.. We note that the Bochner-Khinchin theorem can be utilised in the construction of the L*-theory of the Fourier transform. Let us consider a function 4(x) which is zero for x < 0. If this 4 E L*(O, co) then its Fourier-Plancherel transform has the form oN qi(x)eeiAx dx. E(A) = lim .+J& s The specific character of this case lies in the fact that the values of E(A) are almost everywhere boundary values*’ for the function q5(x)emiAx dx
which is analytic on the half-plane im I < 0. Moreover I9 In the general case E(n) is continuous, tends to zero as 1rlj + co, but may not belong to L - co, co). 200r Fourier Lz-transform. 21Along non-tangential paths.
38
Chapter
1. Classical
Concrete
Problems
s
* I.&a - @)I2 da = 0m 1q5(x)12e-28 x < 0m@(x)l* dx, --oo s s
i.e. @ belongs to the linear space H* of functions f(2) which are analytic in the half-plane im 1 c 0 and satisfy the condition If(a - @)I* da -z co.
The analogous definition
for p 2 1 gives a whole family of spaces, the Hardy
classes HP.
The theorems stated below on functions of class H* are due to Wiener and Paley. Theorem 1. Any function f E HZ can be represented in the form f(A) = &
1:
&c)e-iAx
dx
(im 1 < 0),
(45)
where q5E L*(O, 00).
The function 4 in (45) is obtained as the inverse Fourier transform of the boundary function fa for f, taken for x > 0 (for x < 0 this procedure gives zero). The existence almost everywhere of the boundary values for f E HZ (and in general HP) and the fact that fa belongs to L* (respectively Lp) follow from the classical theory of Fatou. The boundary function fa for f E H* (f # 0) has the important property
s s
* IWtMI
-m
dl < o.
1 +I2
and, conversely, for any function + E L*( - co, co), + 2 0 such that
m Iln WI dl < co , --coTTF
there exists f E HZ for which lfal = $. A function 4(x) on the whole axis (or on the semiaxis) is said to be of compact support if it vanishes outside of some finite interval (a, b). If such a function 4 E L*(a, b) then its Fourier-Plancherel transform reduces to E(1) = -&
t I
I I
b d(x)emiAx dx. sa
By means of this identity E(I) is extended to the whole complex plane as an entire function of exponential type p < max(lal, lbl) and in addition, by the theorem of Plancherel, it belongs to L* on the real axis. Theorem 2. If f (A) is an entire function of exponential type p which belongs to L* on the real axis, then it can be represented in the form
I
!
9 2. The Fourier
Method
and Related
39
Questions
where t$ c L2( - p, p). The class of entire functions described in this theorem is denoted by Bj. We note further a continuous analogue of the theorem on functions of class W which is important for applications. Theorem of Wiener. Zf 4 E L’( - CO, CO) and its Fourier transform E(I) # 1 (- CO< 1 < 00) then the function (1 - E(L))-’ - 1 is the Fourier transform of a function t+bE L’( - c0, 00).
L’( - co, co) is a Banach algebra (without identity) under convolution m f(t - 4dd dr I --P) as multiplication. The Fourier transform transforms it into the Wiener algebra W( - co, co) of functions with the usual (pointwise) multiplication. (f * g)(t) =
2.10. The Laplace Transform. We shall say that the function f(t) (t > 0) belongs to class A if it is Lebesgue measurable and there exists k = k(f) such that f(t) = O(ekt) almost everywhere. The class A is a linear space. If f~ A then the analytic function f(A) =
03f(t)e-A’ dt s0 is defined in the half-plane re 1 > k(f). The class 2 of all such functions is also linear. The homomorphism f H f” is called the Lupluce transform and under it f is called the preimage for its image f The preimage is uniquely determined for a given image, and moreover the Riemunn-Mellin inversion formula holds:
(a > k(f)). This formula opens up a wide range of possibilities for calculations using the theory of residues. For example, let f(A) be continued to the whole plane as a meromorphic function with poles {&} T. To make matters precise let us assume k(f) > 0 and put A0 = 0. If f is bounded on some sequence of left semicircles with diameters [a - /?.i, bl + j&i] (j?. + co), then
s
[ 1
df (7) dt = lim c Res ‘?eA’ “-+cop~-rrlia, A=&
,
(46)
the limit being uniform on each finite interval. The residues in (46) have the form of degree < n, - 1 and nk is the order of the
Pk(t)eAk’, where Pk is a polynomial
pole & for f$.
Consequently,
under the stated conditions
the integral off (and
40
Chapter 1. Classical Concrete Problems
under certain additional requirements also f itself) turns out to be the sum with brackets of the series of quasipolynomials &Pk(t)e@ (here the & are indexed according to increasing 11, - u I); moreover convergence is uniform on each finite interval. Formula (46) is called the Expansion Theorem. We note further that if the function f~ n is continuous then f(t)
=
lim
LZj!(!!!+tfln)(F),
n-tm
.
the limit being uniform on each finite interval (a, b) (a > 0). The specific character of this formula of Widder, which goes back for its idea to Laplace ethod of larger numbers lies in the fact that use is made in it of the value of the Laplace transform only for large I > 0. A most important property of the Laplace transform is that it transforms the operation of differentiation in n under the condition f(0) = 0 into the operation of multiplication by 1 in ;?. 22 The more general relation
.m =m, -f(O) is valid for all absolutely continuous f such that f’ E n (in this case f~ /i automatically). If f, g are two functions of class n then the function s0
t - 4d4
dt
is called their Laplace convolution and is denoted as before by f * g. It is easy to see that it also belongs to the class n and fG =fi, i.e. the Laplace transform transforms convolution into multiplication.23 We can say that the Laplace transform (as also the Fourier transform) expresses algebraically basic operations of classical analysis. On this is founded the most commonly used modem version of operational calculus, which was formally established by Heaviside in the last century as a tool for the solution of differential equations and certain other forms of functional equations. We give several examples on this theme, first of all the traditional Example 1. Let us consider the linear differential
D[y] = y(m) + qy(-
equation of mth order
+ *** + u,y =f(t)
(47)
with constant coefficients and right hand side f~ A. Let y be the solution under he Fourier transform has a similar property but in the class 15 here is an analogous result for the Fourier transform on L - co, co) and this is not accidental. The Fourier convolution is converted into the Laplace convolution if!(t) = 0 and g(t) = 0 for t < 0. The Laplace convolution has a causal character in the sense that the value of (f t g) (t) depends only on the values of the functions f and g in the region z < t. From the physical point of view the fundamental difference (within a great formal similarity) of the Laplace transform from the Fourier transform lies in the causality of the corresponding convolution.
4 2. The Fourier Method and Related Questions
41
the initial conditions y (O) = 0 (0 < k < m - 1). It is easy to see that y E A. From (47) we have
where D(n) = 1” + a, Am-’ + . . . + a,,, is the characteristic polynomial. The ratio1 nal function oo is the Laplace transform of the quasipolynomial
Q(t) = CF.".
[ 1 $
3
where the sum is taken over ail roots of the polynomial convolution theorem
D(A). According to the
and because of the uniqueness of the preimage, Q *f = y, i.e.24 y(t) =
The quasipolynomial
’ Q(t s0
r)f(z) dT.
Q is called the Cauchy function of the operator D.
Example 2. Now let the right hand side in equation (47) be one-periodic: + 1) = f(t). Let us assume that the characteristic polynomial does not have roots of the form in (where n is an integer). Then there is a unique one-periodic solution, namely
f(t
y(t) =
f &einz la=-co
(48)
(the c, are the Fourier coefficients of the function f ). We shall obtain this solution by applying the Laplace transform: D(A)j(A)
+ A(n) =&).
(49) Here A(n) is a polynomial of degree < m - 1 whose coefficients are linear combinations of the initial values y(O), . . . , y - (O). From the periodicity of f it follows that f(A) = $$
(h(l)
= I (l
- t)e dt)
0
and y(n) is expressed in a similar way. Therefore from (49) Yl(4 e l
fl(4
A(4
= D(A)(e” - 1) - o(n)’
24This final result is valid for any locally summable j.
Chapter 1. Classical Concrete Problems
42
Since the left hand side is regular at the root A, of the polynomial
D(A),
Thus (49) leads to the solution y(t) =
in
’ Q(t - z)f(z) dz - c
s0
inite form in contrast to formula (48). Example 3. Let 0 < a < 1. The function f&(t)
= $
’ (t - ~)a-Y(T) dz s0
is called the integral of order a of the function f~ A (i.e. the convolution functions &
of the
ta-’ and f). If &, = g is given then (50) can be regarded as an
equation for f (Abel
integral equation). By the convolution
theorem
from which we obtain
f(t) = ; ( r(l
(51)
a)j; (t - 4-=&) dr).
The operator (51) is naturally called the derivative of order a. The Laplace transform can be applied not only to functions but also to measures: (re A > k). m e- do(t) s0 Here 0 is a function of bounded variation on each interval [0, T] (T > 0) subject to the requirement s(n) =
a(t)
= O(eTk .
(52)
In connection with such functions o we use the term Lupluce-Stieltjes This scheme covers in particular Dirichlet series l 3
be a> k)
transform.
(53)
with arbitrary coeffkients a, satisfying the condition
2 4 I = O(Nk). IPI=1 When a, E 1 the series (53) delines the Riemunn c-function
which plays an
Q2. The Fourier Method and Related Questions
43
extremely important role in analytic number theory, where it and certain other Dirichlet series find application.” If the function Q is non-decreasing and satisfies the condition (52) for every k > 0 and S(A) (A > 0) is its Laplace-Stieltjes transform then the function S( - A) is absolutely monotonic, i.e. it is infinitely differentiable and all its derivatives (including itself) are non-negative. Theorem of Bernshtein.
Any absolutely monotonic function f(x) (x < 0) may be
represented in the form f(x)
= SW ext da(t), 0
where ts is a non-decreasing function. This result can be regarded as a solution of the exponential moment problem. In contrast to the Fourier transform, the Laplace transform is applicable to increasing functions but it is nevertheless required that their growth is no greater than exponential. However it is possible to modify the Laplace transform in such a way that the restrictions connected with growth rate are completely removed and moreover it becomes possible to work with functions on finite intervals in the same way as on infinite intervals. In fact, let 0 < b < cc and suppose that the function f(t) (0 < t < b) is locally summable. The local Luplace transform (1.L.t.) of the function f is that function F(A), which is defined and locally bounded for sufficiently large Iz > 0, such that for all /? E (0, b) F(1) =
(t)e-” dt + &(A, /?), f0 where the remainder .$A, /3) satisfies the condition (54) i.e. for A+ + cc it has exponential type < - /?. Iffe A then its Laplace transform is its l.L.t. We note that if the l.L.t. exists for a given preimage f then it is defined up to a term of exponential type < -b as 1 + + co. The question of conditions for the existence of the l.L.t. seems to be non-trivial but in applications it usually has an automatic positive resolution. The uniqueness of the determining function for a given l.L.t. is an essential fact. This results from the following procedure for inverting the l.L.t. (Yu.1. Lyubich, 1966). Theorem. Let F(1) (d 2 p > 0) be the l.L.t. off(t) (0 < t < b). Then thefunction
Dirichlet introduced these series for the proof of his theorem (1837) that there are infinitely many primes in any arithmetic progression (CI+ nd), , where a, d are relatively prime natural numbers.
44
Chapter
can be continued analytically
1. Classical
Concrete
Problems
to the domain
pb = {l: re C < b, l4 CO,b)) and lii
{@(t + is) - @(t - is)) =
(7) dz
(0 < t < b).
s0
(55)
If f is smooth then the function
can be continued analytically
sm
F(l)e dl be C < 0) cc to the domain Pa, it has limiting values Y(t f i0) and
Y(t + i0) - Y(t - i0) =f(t)
(0 < t < b).
In the series of applications it is necessary to weaken the requirement (54) by allowing II to tend to infinity only through some set A4 of complete relative measure, i.e. the measure of the intersection M n [0, r] is asymptotically equal to r (I --, + co). This modification (which we shall denote by l.L.t ) is adequate if the l.L.t F(I) can be extended to a function which is analytic on some angular domain
-a < arg 1 < /3 0 c a, /I < z and increases no faster than exponen( tially. In this case the inversion formula (55) and, consequently, the uniqueness theorem for the preimage remain valid (Yu.1. Lyubich and V.A. Tkachenko, 1965). Hence we obtain the following analogue of the Wiener-Paley theorem: if the l.L.t F(A) is an entire function of exponential type rr then it is a finite Laplace transform: F(1) = F(il; a) =
df(t)e-
dt.
s0
In particular, if ~7= 0 then F = 0. Let us consider one of the applications of the local Laplace transform. According to Delsarte (1934) a continuous function f(t) (- co < t < co) is said to be periodic in mean (p.i.m.) if for some finite a, b (a < b) it satisfies an integral equation of the form b
f(t + s) do(s) = 0 (-m
@&I
f(A) --- &)
>
45
0 2. The Fourier Method and Related Questions
where b
A(n) =
eeAs do(s),
@&I) = Jab eAs do(s) J:f(t)e-@
dt.
sD The function f” is meromorphic and the roots A,, I,, A,, . . . of the characteristic equation A(n) = 0 are its poles. Formally applying the Expansion Theorem (in differentiated form) we arrive at the generalised Fourier series:
where P,(t) = e-
Res [&)e ] 1=1,
is a polynomial of degree < v, - 1, v, being the multiplicity of the root I,. It turns out that this series is uniformly summable to f on each finite interval by a certain special method (L. Schwartz, 1947). From this follows the uniqueness theorem: if the generalised Fourier series of the p.i.m. function f is zero then f = 0. However, it is possible to establish this fact independently by using the l.L.t if we note that f(J) is such. In the case of a zero series f s an entire function of exponential type CJ= 0 and consequently f” = 0. We emphasise that hidden in this argument is a non-trivial application of the theory of growth of entire functions. If the p.i.m. function f is equal to zero on [a, b] then it is equal to zero on the whole axis because cD~= 0 and hence{ = 0. Also it is easy to deduce this theorem on the unique determination of a p.i.m. function by its values on the eriod’ from Titchmursh convolution theorem: if the functions f(t), g(t) (0 < t c 6) are locally summable on (0, b) and if the Laplace convolution f * g is equal to zero while flCO,ej # 0 for arbitrarily small E > 0, then g = 0. However Titchmarsh theorem can also be established by means of the local Laplace transform. In fact the finite Laplace transforms F(1; j?), G(1; /3) (0 < L < jI) are connected by the relation F(I; fl)G(A; /?) - R(1; /?) = 0 where R(1; j?) = JI g(t)e-
But then the l.L.t /
j
dt /D:Js)e-
ds = O(e-
.
for g is equal to zero.
Problem. This designation relates to the problem the eigenvalues and eigenfunctions of the linear differential opera-
2.11. The Sturm-Liouville
of determining tor
(r(x), q(x) are continuous
with boundary conditions
functions, p(x) is smooth, r(x) > 0, p(x) > 0, q(x) is real)
46
Chapter 1. Classical Concrete Problems
YW
- htY(4
= 0,
Y’(b) + h,Y@) = 0
(h, , hz E Iwu {co}; if, for example, h, = co then the corresponding condition has the form y(a) = 0). In 1836 Sturm and Liouville independently constructed a systematic theory of this problem (with q(x) = 0), which generalises the theory of Fourier series of the form (21) at least for smooth functions. In the 20th century the Sturm-Liouville problem and its generalisations became an essential attribute of the mathematical apparatus of quantum mechanics. The point is that in quantum mechanics physical quantities are described by linear operators?, their possible values are the eigenvalues of the corresponding operators and they are realised in the states of the quantum mechanical system, these being defined by the corresponding eigenfunctions. In particular -y” + q(x)y is the operator of total energy” (Hamiltoni~n~~): (-y”) corresponds to kinetic energy, q(x)y to potential energy. The eigenvalues of an energy operator are the energy levels of the system. We shall consider the operator L on ~!,:(a, b), (a(x) = c r(c)&), assuming for simplicity that its domain of definition D(L) consists of those functions of class C2 [a, b] which satisfy the boundary conditions. A basic property of the operator L is contained in the identity bxY,l~
Y2) = (Ylv JxY21)
(Yl, Y2 E w-49
which allows us to call L symmetric by analogy with the finite-dimensional case. As a consequence of this, the eigenvalues of the operator are real and eigenfunctions corresponding to a pair of distinct eigenvalues are orthogonal. Consequently the set of eigenvalues is no more than countable. Each eigenspace is one-dimensional (it cannot be two-dimensional because of the presence of the boundary conditions). Theorem. The set of eigenvulues of the Sturm-Liouville problem can be represented us a sequence 1, < A2 c -** < 1, < *** tending to co. Any function f E D(L) can be expanded in a unljkmly convergent generulised Fourier series with respect to the orthonormulised system of eigenfunctions:
f(x) = zl
c,Y,(x)
(G = (5 Y.) = ~bfWd4~C4d~). a
For any f E Lz(u, b) the generulised Fourier which is equivalent to Pursevul equality
series converges to
f
in Lg(u, b),
n.t lcn12= ~'lfM12~(4dx (I being satisfied. 261nitially without linearisation! ” For the given boundary conditions. 28 By analogy with the Hamiltonian function in classical mechanics. We note that the operator is also called the SchrBdinger operator.
4 2. The Fourier Method and Related Questions
41
The proof of this theorem was outlined in essence in the works of Sturm and Liouville but it was not until 1896 that a rigorous version was given by V.A. Steklov. One of the methods of proof is based on the variational theory of eigenvalues developed by Fischer and Courant. In this theory the equation L[y] = Ay is regarded as an Euler-Lagrange equation for the problem of finding the minimum of the quadratic functional (L[y], y) under the constraint condition (y, y) = 1. It turns out that the desired minimum is the least eigenvalue ;1, and is attained at a corresponding eigenfunction y, . If we already have the eigenvalues il, < * * * < 1. and corresponding eigenfunctions y, , . . . , y, then 1n+l =
min{Wyl,
Y) : (Y, Y)
= 1,(Y, yk) = 0 (1 < k < 4>
and this minimum is attained at a corresponding eigenfunction Y,+~. Thus we obtain an infinite increasing sequence (2,): and it is easy to prove that 1, --, co. The inequality
(f~W4 follows from the construction it the expansion in Lz(a, b):
just described and letting n + cc we obtain from
(56)
The condition f E D(L) is not essential for convergence in Lz. If nevertheless fe D(L), we therefore have in Lz
L[f
1 =
kzl
tL[f
1, Ykbk
=
ktl
ckAkYk.
The uniform convergence of the series (56) follows from this as a result of the bound
IfWl G constIILCflll (requiring a separate proof). 2.12. The SchrSdinger Operator on the Semiaxis. Let us consider the operator L[y] = -y” + q(x)y (0 < y < co) with boundary condition y’(0) - hy(0) = 0 (h E 88u (co}). We assume the potential q(x) to be real and (for simplicity) continuous. On L’(0, co) the set of eigenvalues of such an operator can turn out to be finite or even empty. For example, with q(x) = 0, h = 0 we require non-
trivial solutions of the problem y” + ly = 0, y’(0) = 0, these being (arbitrary multiples of) I& A) = cos fix. None of these belongs to L’(O, co). We note however that Jl(x, A) is bounded for each 12 0 which suggests weakening the requirements as x + cc on the eigenfunctions, i.e. to let them lie outside of the class L2. From the point of view of quantum mechanics this is admissible since
48
Chapter 1. Classical Concrete Problems
the energy spectrum can be both discrete and continuous.2g Unfortunately, however, a requirement such as boundedness is not sufficient in general (V.P. Maslov, 1961). The problem arising here is resolved in a certain sense only in a sufficiently wide context of functional analysis (I.M. Gel and - A.G. Kostyuchenko, 1955). It is possible to by-pass the difficulty by analytical means if we introduce a particular collection of eneralised’ eigenfunctions for which we do .not formulate explicit boundedness of growth or decrease at infinity. This approach, which was opened up by the pioneering research of H. Wey13 was developed by Titchmarsh (1946) and then substantially simplified by B.M. Levitan (1956). If h = tan a -z < c1< 2 , the boundary condition at zero can be expressed (” in the form y 0) cos a - y(0) sin a = 0. Let us denote by I&, A) the solution of the equation L[y] = Ay which satisfies the boundary condition through the initial conditions $(O, A) = cos tl, + O,1) = sin Q. We take an arbitrary interval [0, b] and consider the Sturm-Liouville problem L[y] = ly with the same condition at zero and any desired condition y b) cos /3 + y(b) sin fi = 0. If {A,}? is the set of its eigenvalues then {I&, A,)} r is a set of eigenfunctions; in general, however, these are not normalised. Put a, = I[$(., A)llL2C0,bj.Then for any f~ L2(0, b) we will have in mean square
f(x)= CLtit.7u y. Let us introduce
the non-decreasing
function
Then
where
4,(4 = Parseval
bf(xM(x, 4 dx. f0
equality in similar notation
has the form
n general it is mixed. For example, in the spectrum of the hydrogen atom there is a countable system of discrete energy levels Em< 0 corresponding to the states of an electron which is connected with the nucleus and a continuous spectrum E > 0 corresponding to the possibility of unbounded escape from the nucleus. soThe first work of H. Weyl on the Schrodinger operator (so called much later) on the semiaxis appeared in 1909-1910 well in advance of the foundation of quantum mechanics. The results set out below are essentially contained in these works.
5 2. The Fourier Method and Related Questions
s
49
b
lf(412
dx
CQ iFbtn)12 s -Co
=
0
dPb(&
It turns out that the family of functions (P~(A))~,~ is uniformly bounded on each finite interval. This allows us to apply the theorem of Helly for b -+ cc (and arbitrary variation j3) and to establish the following basic result. Theorem 1. For the SchrBdinger operator L[y] = -y” + q(x)y with continuous real potential q(x) and boundary condition y’(0) - hy(0) = 0 (h E R u {CD}) there exists a non-decreasing spectral function p(l) (- co < 1 <: co) such that fir any f~ L2(0, co) the generalised Fourier transform F(I)
= , m
-(x)+(x,
A) dx
-+ s 0 exists in Lz( - co, co), the Weyl inversion formula31
(58)
N
f(x) = lim N+x
s
-N
FW(x,
holds (conuergence in L’(0, 00)) and Parseval
s
oalf(xN2 dx =
4 dA4
(5g*)
equality
O”If 412WI s --a,
is valid.
In the simplest case of q(x) z 0 and boundary condition y 0) = 0 we have $(x, A) = cos fix. Under the condition tj b, A) = 0 we obtain: (n=0,1,2,...); +l
)
([a] denotes the integral part) for 1 > 0 while &,(A) = 0 for I < 0. Letting b + 00 we obtain the unique limit function: p(l) = ;Jn After the substitution
(A > 0);
p(A) = 0 (A < 0).
,/A = ~1the eigenfunction f(x)
= t
expansion takes the form
; j=: CD(~) cos px dp,
where 31Although the inversion formula is obtained heuristically from (57) on letting b -) co, its rigorous derivation requires a deeper argument.
50
Chapter 1. Classical Concrete Problems
pxdx G(p) =plis:f(x)cos (both defined in L’(O, co)). Plancherel theorem has now been obtained for the case of an even function32. In general the spectral function p(A) is not unique. The situation here is analogous to the power moment problem. We shall consider $(x, [) for complex c (im c # 0). Lemma. Zf $(x, co) c L O, co) fir some co (im co # 0) then tj(x, c) E L2(0, co) for all ( (im [ # 0).
The quantity -’ > is called the Weyl radius at the point [. The lemma implies that on the set im c # 0 we have the Weyl alternatiue: either r(c) = 0 for all c or r(c) > 0 for all [. The Weyl disk or point is defined by the inequality a, IwJ/(x, CC-)+ 0(x, 01’ dx < - s, f 0
where 0(x, c) is the solution of the equation L[y] = cy for the initial conditions 0(0, [) = -sin ~1,0 0,[) = cos a. The non-emptiness of the set of values of w defined by the inequality (59) is established by passage to the limit from the finite interval. Now the inequality b
Iw$(x,
0 + 0(x, [)I2 dx < - s
(60)
s 0
is satisfied if the solution &(x, [) = w$(x, c) + 0(x, LJ is such that the Wronskian of the pair 4,,,, &, equals zero at the point b. Such solutions clearly exist and therefore (60) defines a certain disk C,. Obviously C,, c Cb2 if b, > b,. Thus C, = fib C, is non-empty and this is also the disk or point of (59). This argument provides the reason for calling C, the limit disk (or point). Its principal result is Theorem 2. Zf im c # 0 the equation L[y] L2(0, co).
= (y has a solution belonging to
In the disk case, all solutions of the equation L[y] = cy belong to L O, co) while in the point case only multiples of 4(x, [) belong to L2(0, co). Incidentally, it is clear from this that the realisation of one or other case of the Weyl alternative does not depend on the boundary condition at zero, i.e. it depends only on the potential. For example, if the potential is bounded from below or if it only admits
321t is appropriate to note that even Fourier deduced his integral formulae heuristically by passage to the limit from series.
0 2. The Fourier Method and Related Questions
51
a bound q(x) > - cx’ (c a positive constant) then the point case occurs (HartmanWintner, 1949). Theorem 3. The spectral function the disk case.
is unique in the point case but not unique in
The The of the y’(0) -
following fundamental definition is only meaningful in the point case. set of points of increase of the spectral function p(n) is called the spectrum Schrodinger operator L[y] = -y” + g(x)y with boundary condition hy(0) = 0. If p is a point in the spectrum, the solution J/(x, p) is called a generalised eigenfunction corresponding to the point p. In this sense (58*) gives an expansion with respect to the generalised eigenfunctions of the Schriidinger operator 33. We have a proper eigenfunction I& p) (i.e. $(*, p) E L2(0, co)) if and only if p is a jump point of the spectral function. The set of eigenvalues is called the discrete spectrum of the operator. This term is connected with the fact that any isolated point of the spectrum is an eigenvalue. The set of non-isolated points of the spectrum34 is called the continuous spectrum. Theorem 4. Zf q(x) -+ + w (x + co) the spectrum of the Schriidinger is purely discrete.
This test of H. Weyl was substantially
improved by A.M. Molchanov
operator
(1953).
Theorem 5. A necessary and sufficient condition for the discreteness of the spectrum of the Schriidinger operator with potential q(x) which is bounded below is that x+ll d0 d< + + 00 (x-+a) sx for each a > 0.
In contrast to this we have Theorem 6. Zf jz lq(x)l dx < co then the positive semiaxis Iz 2 0 is contained in the continuous spectrum and does not contain the discrete spectrum while in the negative semiaxis the spectrum is purely discrete. 2.13. Almost-Periodic
(a.p.fs) was introduced
Functions. The class AP35 of almost-periodic and studied in 1925 by BohrJ6.
functions
33As a result of (58) and (58*) the spectral function provides a measure on the spectrum with respect to which the system of generalised eigenfunctions is orthonormal in the sense that
34Among these may be eigenvalues. 35In a more detailed notation, AP(W). 36Following initial investigations of Bohl and Esclangon.
52
Chapter 1. Classical Concrete Problems
A complex continuous function f(t) (t E R) is said to be an a&. if for any E > 0 there exists a relatively dense set of e-almost periods, i.e. numbers r such that supr If(t + r) -f(t)1 < E. A set M c R is said to be relatively dense if there exists I> 0 such that each interval of length 1 contains a point of M. Bochner (1927) showed that the definition of an a.p.f. is equivalent to the following: from every sequence of ranslations’ (f(t + rJ) it is possible to extract a subsequence which converges uniformly on the whole axis. This definition opens up the possibility of a more transparent construction of the theory of a.p.fs. All periodic functions, in particular all exponentials eilt (A E W), are a.p.fs. The class AP is a linear space. Therefore all exponential polynomials ~~cJi*k’ are a .p .fs37. The uniform limit on the whole axis of a sequence of a.p.fs is also an a.p.f. Thus any function which allows an arbitrarily close uniform approximation by exponential polynomials is an a.p.f. Theorem of Bohr. Any a.p.jI is the ungorm limit on the whole axis of a sequence of exponential polynomials.
This analogue of Weierstrass theorem is closely connected with the theory of Fourier series of a.p.fs where an analogue of Lz-theory plays a key role. Lemma.
For any a.p.f.
f
the mean
f(t) dt exists.
Zf f 2 0 and m(f) = 0 then f = 0.
This allows us to introduce on the space AP the scalar product (f, g) = m(c) (AP is not only a linear space but also an algebra). Let us consider the continuous family of exponentials: cl(t) = ei” (A E R). It is orthonormal: (e,, e,) = &,. The following definition therefore arises naturally: CA(f) = (f, e,) is called a Fourier coeflicient of the a.p.f. f. Following on from this we introduce formally the Fourier series f(t) - C cl(f )e
In fact this series contains no more than countably many non-zero terms, since Bessel
inequality ktl I%,(f )I2 G (f9f)
holds for any collection (1,) . . . , A,}. The set of those I for which cl(f) called the Bohr spectrum of the function f.
# 0 is
n the terminology of physics this is the superposition of harmonic oscillations with arbitrary frequencies. In particular, the coordinate functions describing small oscillations of conservative mechanical systems are of this type.
8 3. Theory of Approximation
The theorem of Bohr reduces to Parseval T kl(fV
53
equality
= (f,f),
which means that the Fourier series of an a.p.f. f converges to f in the sense of and introduce the distance d(f,,f,) = + co, where S, is the nth partial sum of the 1% -fill then 4f, 4) -, 0 as n Fourier series of the function f for an arbitrarily chosen enumeration of the non-zero terms. It is also possible to sum the Fourier series of an a.p.f. f to f using a method of FejCr type (Bochner, 1927). Bohr mean-square: if we put llfll = ,/(f,j)
$3. Theory of Approximation 3.1. Chebyshev Approximations. According to the theorem of Weierstrass any function f which is continuous on the interval [a, b] allows an arbitrarily close uniform approximation by means of polynomials. The problem therefore arises naturally of determining amongst all polynomials of degree no higher than n that polynomial P for which the deviation’
4f, P) = max If(x) - W)I o<xSb
is as small as possible. The formulation of this problem (for a real function) is due to P.L. Chebyshev (1853)2 who started from considerations connected with the theory of mechanisms. Later on this approach developed into an independent and deep area of mathematical analysis. The existence of a polynomial of best approximation is obvious, since the function d(f, P) is continuous on the space of polynomials of degree G n and tends to + co at infinity. The problem of uniqueness is rather more subtle and is closely linked with the problem of effectively constructing the polynomial of best approximation. Theorem of polynomial of (a < x 6 b), it X1<X2< <X,+, R(x) = f(x) -
Chebyshev 3. In order that a polynomial P(x) (deg P < n) be the best approximation for a real-valued continuous function f(x) is necessary and sufficient that there exist a collection of points in [a, b] (Chebyshev alternant) at which the remainder P(x) satisfies the conditions:
1) IR(xdl = d(f,P) (1
= -sgn R(x,)
(1 < k 6 n + 1).
t On the space C(X) of continuous functions on any compact set X the natural metric (corresponding to uniform convergence and therefore called uniform) is defined as d(fr&) = jlfr -frl[, where llfll = maxlf(x)l. Thus the deviation d(f, P) is the distance from P to J .e.long before the appearance of Weierstrass’ theorem. 3The proof of this theorem was developed gradually and a complete version did not appear until the work of Bore1 (1905).
54
Chapter 1. Classical Concrete Problems
Corollary.
The polynomial
of best approximation
for given
f, n is unique.
Proof. The semi-sum S of two polynomials of best approximation P, Q is also a polynomial of best approximation. It is easy to show that the Chebyshev alternant for S acts in the same way for P and Q. It follows from this that P and Q coincide at the n + 2 points of the aiternant. 0
A similar result holds for best uniform approximations of continuous periodic functions by means of trigonometric polynomials, the only difference being that for a given degree n the Chebyshev alternant consists of 2n + 2 points in a period. We can deduce this from the previous result by substituting
x = tan i.
Let us consider some examples. Example 1. P.L. Chebyshev posed and solved the following problem: among all polynomials of degree n with leading coeffkient a, = 1 find the one whose maximum modulus on [ - 1, 1) is least (the polynomial deviating least from zero). It was shown by him to be the polynomial
T,(x) = 1 (n = 0),
T,(x) = &
cos(n arccos x)
(n 2 1);
these were subsequently called the Chebyshev polynomials. The verification of this answer using Chebyshev theorem causes no problem if we note that the desired polynomial has the form x” - Q.-i(x) where Q.-i(x) is the polynomial of best approximation (deg QnV1 < n - 1) for x” on [ - 1, 11. It is interesting to note that the Chebyshev polynomials are orthogonal on (- 1, 1) for the weight l/J(l - x2). Example 2. We consider Weierstrass
function
W(0) = 2 a’ cos(b ) k=O
where 0 < a < 1 and b is an odd integer > 1. This is a continuous 2n-periodic function which does not have a derivative at any point if ab > 1. It is given by its Fourier series which converges uniformly to it. It can be shown that the partial sums of this series are the best uniform approximations for W by means of trigonometric polynomials. This situation is encountered extremely rarely, in contrast to best approximations in mean square where it is always the case. For any complex 2aperiodic continuous function f we shall denote by E,(f) the deviation from f of trigonometric polynomials4 of degree
0=0<2n
If (4 - U4l
G @, + W,(f
),
(61)
4The same notation is also used in an algebraic situation and also in subsequent generalisations which cover both situations.
5 3. Theory
55
of Approximation
where L, =
= PnWl dx s --z is the Lebesgue constant (the L’-norm of the Dirichlet series). Weierstrass theorem shows that E,(f) + 0 as n + co. At the same time L, -+ co but grows slowly. In fact as Lebesgue showed (1910), $1,
n < L, < $ln
n + O(1).
Inequality (61) can therefore be useful as a bound for the rate of convergence of a Fourier series. This is the case for example with smooth functions since we have Theorem of Jackson. Let f be a function of class C’ on the circle and let q(6) be the modulus of continuity of its rth derivative (r 2 0) (o,(6) E o(6), the modulus of continuity off ). Then E,-,(f)
< BP,
0
i
n-
where the constant B, depends only on r. In particular, E,(f)
ly f (” E Lip a then
= O(ner- .
(64
A bound of type (63) was also obtained by Jackson in the algebraic situation. In the class of functions which are analytic on the interval [a, b] we have the bound of S.N. Bernshtein T 1n-n l&(f) < 1, (65) n-ccl and moreover if f is an entire function hm YE,(f) = 0. (66) +oD To S.N. Bernshtein is due a series of deep results, the main point of which lies in the fact that the rate of convergence of the deviation E,(f) to zero determines the class of smoothness of the function f. We formulate in particular those theorems which provide converses to the bounds mentioned above. Theorem 1. Zf (64) is satisfied for a function f E C[a, b] then f l C a, b] and f (” belongs to the Lipschitz class with exponent a on each interval [aI, b,] c (a, b). Theorem 2. Zf (65) is satisfied for a function f~ C[a, b] then f is analytic on [a, b] and in the case of (66) it is an entire function.
Similar theorems hold in the trigonometric situation. One of the proofs of Theorem 1 is based on the following remarkable of Bernshtein: if T(8) is a trigonometric polynomial of degree n then
inequality
56
Chapter
1. Classical
Concrete
Problems
further if f(z) is an entire function of exponential type o which is bounded on the real axis, then sup If’(x)1 G c -y"<, If( -mx)Ixcm (equality holds for the functions f(z) = e *iez). This result can be generahsed considerably (B.Ya. Levin, 1951). The inverse problem of approximation, which was also formulated and resolved by S.N. Bernshtein (1938), consists of constructing a continuous function f with a given sequence (E,),” of deviations. The obvious necessary condition that (E,),” tend to zero monotonically turns out also to be sufficient. 3.2. Chebyshev and Markov Systems. A system (u&j of real or complex continuous functions on a topological space X is called a Chebyshev system if each linear combination of these functions, which is not identically zero, has no more than n roots. This class of systems and the related class of Markov systems (see below) arise very naturally in many questions in the theory of approximation and its applications. The standard examples of Chebyshev systems are:
1, z, . . .) Z”
(z E @I;
1, cos x, sin x, . . . , cos mx, sin mx
(0 < x < 274.
A further example is the system of exponentials erlox, . . . , e X
(-co<x
with real, pairwise distinct exponents A,, . . . , L,. Using a logarithmic we can transform it into the Chebyshev system x20, . ..) XL,
substitution
(x > 0).
The inequality E det(u,(xi))&o
# 0
for all pairwise distinct x,,, . . . , x, provides a criterion under which (u& is a Chebyshev system. Clearly any Chebyshev system is linearly independent. Let X be any compact space. Given a system (u& c C(X) and f E C(X) there clearly exists a linear combination Calruk with least deviation from f in the uniform metric. Tonelli-Haar-Kolmogorov Theorem. Let (u& c C(X) be a linearly independent system. For any f E C(X) there is a unique linear combination P = 1 I,+, with least deviation from f in the uniform metric if and only if (ut)$ is a Chebyshev system.
Interestingly, it is far from true that Chebyshev systems (with n > 0) exist on each compact space (Mairhuber, 1956; Sieklucki, 1958). Theorem. If a real Chebyshev system (with n > 0) exists on the compact space X then X can be embedded homeomorphically in the circle.
51
5 3. Theory of Approximation
A similar result holds in the complex situation (Schoenberg-Yang,
1961).
Theorem. Zf X is a finite polyhedron on which there exists a complex Chebyshev system (with n > 0) then X can be embedded homeomorphically in the plane.
Suppose that X is a compact space, (ZJ& c C(X) is a complex Chebyshev system and f E C(X). The following criterion for best approximation is due to A.N. Kolmogorov (1948). Theorem. The linear combination P = I;=,, uniform metric, zf and only if the inequality
min re F(x){f(x)
- P(x)}
xeMp
holds for all linear combinations
&u, deviates least from f in the
F = I&,
> 0
alruk, where
Mp = (x: If(x) - P(x)1 = max}. Corollary. Zf the compact space X contains at least n + 2 points then the set Mr for the best approximation P also contains at least n + 2 points.
In the case X = [a, b] the best approximation by linear combinations of a real Chebyshev system can be characterised as before by the existence of a Chebyshev alternant (S.N. Bernshtein, 1937). The following theorem of Jackson shows that the Chebyshev property of a system is connected with the properties of best approximations not only in the uniform metric but also in the L’-metric: zf(ut)i is a real Chebyshev system on [a, b] then for any real function f E C[a, b] the linear combination P = CA,+, which deviates least from f in the L’-metric is unique. In the class L’(a, b) we do not have uniqueness of the best approximation no matter what the system (u& t Ll(a, b).
A certain strengthening of the requirements made on the system leads to the possibility of effectively calculating the least deviation in the L’-metric. A system (u& (v < co) of real or complex continuous functions on a topological space X is called a Markov system if each subsystem (u& (0 < n < v + 1) is a Chebyshev system. Let (u,& be a real Markov system on the interval [a, b]. We form the combinations vo
=
uo,
1)" =
u,
-
n-l c
ankuk
k=O
where for n 2 1 these have least deviation from zero in the L’-metric. It can be shown that the function v, has exactly n roots xnl < *** < x,, in (a, b). Let f E C[a, b]. We put P. = ~~=o I&u&, where the A&are defined by the system of equations Pntxn+l,i)
=f(-%+l.i)
(1
G
i G
n +
1).
Zf the remainder f(x) - P,(x) alternates in sign as x takes Theorem of Markov. on the successive values x,,+~, 1, . . . , x”+~,“+~, then for the L’-metric P,, has least
58
Chapter
1. Classical
deviation from f among all combinations deviation is equal to
Concrete
Problems
of the form c;+
a&&. Further
the least
b
Is a
f(x)
w
v,+Ax) dx .
A.N. Korkin and E.I. Zolotarev (1873) constructed a polynomial of degree n with leading coefficient 1 having least deviation from zero in L’( - 1, 1). It proved to be the Chebyshev polynomial of the second kind: m4 The corresponding
1 sin(n + 1) arccos x = ji J(1 - x2) -*
least deviation is equal to A.
from Markov theorem. We note that the Chebyshev polynomials on (- 1, 1) with weight J(l - x2).
These results can be deduced
of the second kind are orthogonal
For any system of real continuous functions (a& (v < co) on the interval [a, b] and any system of real numbers (ck)l; we can consider the moment problem which consists of finding a nondecreasing function Q(X) (a < x < b) satisfying the conditions 3.3. The Chebyshev-Markov
Problem.
b
sa
t.&(x) da(x) =
Ck
(0 < k < v + 1).
An obvious necessary condition for the existence of a solution system of real numbers (a& (n < v + 1) such that
&i. a&u&(X)2 0
(a < X < b),
is that for any
(67)
the inequality
should hold. It can be shown that if uo(x) > 0 (a < x < b) then this condition is sufficient for the solvability of the moment problem under consideration. Suppose that it is satisfied and let v = n < CO. The Chebyshev-Markov problem consists of finding for a given continuous function Q(x) 2 0 (a < x < b) and a given point 5 E (a, b) the greatest value of the integral
and the least value of the integral
0 3. Theory
59
of Approximation
on the set of all solutions of the moment problem just described’. Let us assume that (u& is a normalised Markov system in the sense that A
uo . . . u, > 0, x0.. . x, > (
if a G x0 < * * * < x, < b (0 < m < n). We assume further that the function Q associated with this system does not disturb its arkovicity’ even if in the following sense:
All these conditions are satisfied, in particular, if u&) = xk (0 G k < n), 51 E C(“+‘)[a, b] and SZ(“+l)(x) 3 0 (a < x < b). With regard to the given moments ck (0 < k < n) we assume that the inequality (68) is strict for all non-zero collections (at); which satisfy condition (67). Then we can show that among the solutions to the moment problem there exists a unique piecewise-constant function a< which has a jump at the point < and satisfies the condition 2p + q < n + 2 where p and q are respectively the number of jump points in the open interval (a, b) and on its boundary. The Chebyshev-Markov inequalities assert that c+o c+o Q(x) do(x) < fax) d+(x) sa sa
e-0 C-0 Q(x) d4x) > dq(4. sa Q(x) sLI Thus the function a,(x) is the desired solution of both extremal problems and moreover this solution does not depend on 8. The piecewise-constant solutions of the moment problem naturally lead to quadrature formulae of Gaussian type. In fact if x1 < . * * < x, form the set of jump points of this solution with pl, , . . , p, the corresponding jumps, then the approximate formula
s
ab.fCx) Wx)
x j$ Pjftxj)
is exact for all linear combinations f = CzCo &a, and solutions d of the given moment problem. The function d can be assigned a priori, then the moment sequence is calculated and the nodes Xi and weights pi (1 G j < r) are already 5The first work
of P.L. Chebyshev
(1874)
was devoted
to this case with Q(x)
= 1.
60
Chapter 1. Classical Concrete Problems
defined by it. In particular in the Gaussian case, i.e. U&X) = xk (0 < k < 2r - 1; a < x < b), we have the roots x1, . . . , x, of the orthogonal polynomial P,(x) as nodes, while the weights (the Christoff coeficients) are calculated from the formula6
Qr(xj) ~_ _ Res-QAx)
pi = Pi(Xj)
(1 Gj < I),
XCXj P,(x)
where Ql(x) is the associated polynomial. The corresponding piecewise-constant function is the solution of the Chebyshev-Markov problem if < = xi for some 1. If in addition we put D(x) E 1, then the Chebyshev-Markov inequalities take the form x,+0 x,-o I-1 do(x) G i Pj, do(x) a & Pj* j=l sa s (I 3.4. The L-problem of Moments. Another interesting type of moment problem goes back to the investigations of A.A. Markov: it is required to find a real measurable function p(x) which satisfies the constraint
Ip(x)l < L
(a < x G b)
(69)
(L is a given positive number) and has prescribed moments b
sa
uk(x)P(x)
dx
=
(0
ck
<
k <
n)
(70)
with respect to the given real System (n& c L’(a, b). We call this statement of the question the L-problem of moments. The criterion for its solvability is closely connected with least deviations from zero. It is immediately clear that if a solution to the problem (69)-(70) exists, then for any collection of numbers (at& satisfying the condition n c k=O
akCk
=
l,
we have the inequality L~~b[k~oa,uk(x)jdxZ1. Thus if
where (ak) satisfies (71), we obtain L > 1 as a necessary condition bility of the problem (69)-(70).
for the solva-
61t is appropriate to note here that in any L:(I) the roots of the orthogonal polynomials are simple and lie inside the interval I.
0 3. Theory of Approximation
61
N.I. Akhiezer and M.G. Krejn (1938) obtained the following result. Theorem. Suppose the system (u& has the property that for each non-zero linear combination 1 akuk its set of zeros has measure zero. The L-problem (69)-(70) has a solution if L 2 1. This solution is essentially’ unique if and only if L = 1,in which case it has the form ~(4
= L w
OM,
where Q^is a solution of the dual extremal problem’
0bIQ(x)1 dx = min I on the set of functions
of the form
The solution for the power and trigonometric cases of the L-problem moments can be described effectively in a certain sense.
of
3.5. Interpolation and Quadrature Processes. The exact solution of the best approximation problem lends itself relatively rarely to an effective description while an approximate solution can require complicated calculations. The more traditional method of approximating functions, namely interpolation, is effective and simple from the computational point of view, although the approximations provided by it can turn out to be very coarse. Interpolation of a function consists of constructing those linear combinations of the given system (u& (e.g. polynomials) which take the same values as f at the given points (interpolating points). A more refined procedure also requires equality of derivatives up to a given order (with corresponding assumptions about their existence) so that, for example, from this point of view Taylor formula is an interpolation formula. We restrict ourselves below to interpolation with simple interpolating points, i.e. without regard to derivatives. If (u& (v < co) is a Markov system on the space X then for any interpolating points Xi E X and for any numerical values yj (0 < j < n < v + 1) there is a unique linear combination P = ‘&, il,u, such that P(x,) = yj (0 <j < n). It can be expressed in the following form:
where C& is the linear combination of the functions uc, . . . , u, which satisfies the conditions @Jxj) = ~5,~(in this sense the systems of functions (CD& and interpolating points (Xi) are biorthogonal). ’i.e. up to values on a set of measure zero. he solution $ is not unique but the signs of any two solutions coincide almost everywhere.
62
Chapter 1. Classical Concrete Problems
In the case U&C) = xk (0 < k < n) on the interval [a, b] we have 44 @kfx)
= w’(xk)(x
-
xk)’
where
and (72) becomes the Lagrange interpolation
If this polynomial
formulag:
is expressed in terms of the basis
1, (x - x,), (x - x0)(x - Xl), . . . , (x - x0) (x - Xl). . .(x - X,-l) of the space of polynomials of degree
is defined. If N.i.s. converges even only pointwise on some set containing all the interpolating points, then its sum f is a solution of the interpolation problem (j=O,1,2 ,... ). = Yj The convergence of N.i.s. with some or other interpolating points in the complex plane (in particular, with the interpolating points xj = j lo) has been studied in considerable detail. It is interesting that this tool can be used for the proof of the famous theorem of Lindemann that the numbers CI# 0 and e” cannot be simultaneously algebraic (for a = 1 this yields the transcendental nature of the number e and for a = zi that of the number rr) (A.O. Gel ond, 1929). Let (uk)g be a Markov system and for each n = 0, 1, 2, . . . let interpolating points xnj (0 < j < n) be given. The sequence of linear combinations ftxj)
similar (but of course more complicated) formula resolving the problem of interpolation multiple interpolation points was constructed by Hermite. lo In this case N.i.s. converges or diverges simultaneously with the Dirichlet series
mt-l?% ck’
k=,,
and the same holds for absolute convergence.
with
$3. Theory of Approximation
pn
=
i k=O
63
nkUk
defined by the conditions P,(X,j) =f(xmj) (0 <j < n) is called an interpolation process for the given function f. In the case where xJ does not depend on n (n 3 1) we speak of an interpolation process of Newton type One of the fundamental questions relating to interpolation processes is that of convergence (in some sense or other) of the sequence (Pm)2 to the function f. A positive resolution of this question depends on the distribution of the interpolating points and the properties of the function f. For example, if the Markov system (uk); is complete in L a, b) and consists of functions which are continuous on [a, b] and if the Xnj (1 < j < n) are the roots of the combinations %=%-
1
(n = 1,2, . ..)
ankuk
k=O
which have least deviation from zero in the L metric, then, under the conditions of Markov theorem, the interpolation process for any f E C[a, b] converges to f in the L metric. In particular, the sequence of interpolating polynomials constructed on the roots of the Chebyshev polynomials of the second kind, i.e. on the interpolating points =
XJ
COS
(1 <j < n),
f$
converges. For uniform convergence of an interpolation process it is necessary to impose stringent restrictions on the function f. For example, if f is an entire function the sequence of interpolating polynomials converges uniformly to f on any interval [a, b] for any distribution of the interpolating points. This follows easily from the formula for the remainder term f(x)
(a < l, < b),
- P,(x) =f; n(;;)cO(x).
which holds for all f o C u, b]. Moreover, as Faber showed (1914), for any distribution of interpolating points in [a, b] there is a continuous function for which the sequence of interpolating polynomials is not uniformly bounded. For the function f(x) = 1x1 on [ - 1, l] the sequence of interpolating polynomials corresponding to the equally spaced interpolating points X
2j - n = n
(O<j
is unbounded at each point other than 0, + 1 (S.N. Bernshtein, 1916). The quadrature process for the system of nodes (x.j)oGjcn on the compact space X with measure p (p(X) < co) is defined by the sequence of approximate ” In contrast to this we speak of an interpolation
process of Lagrange
type
in the general case.
64
Chapter 1. Classical Concrete Problems
formulae s X
(n = 0, 1,2, . ..)
flP x j$o Pnjf(+)
(73)
for the integral which is regarded as a functional off E C(X). The coefficients pni are found, for example, by requiring that the formula is exact for f = uk (0 < k < ,n), where (uJ2 c C(X) is a Markov system. Under this condition formulae (73) can be written in the form
s X
fdcL=
P,&
(n = 0, 1, 2, . . .),
s X
where (PJF is the interpolation process for f corresponding to the given system of nodes. Therefore if (P,): converges to f in the L’-metric, the quadrature process for f also converges:
In fact considerably less is required for the convergence of a quadrature process. For example, it will converge for all f E C(X) if the system (z+)g is complete in C(X), u0 > 0 and all the pnj are positive. All these requirements are satisfied for a Gaussian quadrature process on C[a, b] since in this case, from the fact that the formula abf(x) da(x) z j$ Pjf(xj) s
is exact for all polynomials
of degree < 2r - 1, we obtain
S[ b
Pj
=
*
44
1 2
-
O’(Xj)(X
Xj)
do(x).
If we now use piecewise-polynomial interpolation with equally spaced interpolating points in C[u, b], negative coefficients appear in the corresponding Cotes quadrature formulae (for a(x) = x, 0 < x < 1). R.O. Kuz in (193 1) showed that for the Cotes quadrature process continuous functions exist on which it diverges. Convergence of interpolation and quadrature processes can be studied with the help of best approximations. Suppose that (uk)z is a complete Markov system in C(X), f E C(X) (X compact), xnj (0 < j < n) are interpolating points or nodes, @JxJ) = akj, S, = &, lZnkulris the nth best uniform approximation for f and E, = 1)f - $11 is the nth least deviation. From the interpolation formula (72) we obtain the bound
Ilf - call G (1 + ~ %, where hf. = max i xeX
k=O
I&(x)1.
$3. Theory of Approximation
The interpolation
65
process for f therefore converges if lim M,E,
n-ml
= 0.
(74)
The analogous condition for convergence of the quadrature process has the form lim p.E. = 0
“-Yo
The sufficient condition for convergence of a quadrature formulated above follows immediately from this.
process which was
3.6. Approximation in the Complex Plane. If the continuous function f on the compact set X c @ allows arbitrarily close uniform approximation by means of polynomials, then f is analytic on the interior int X of the set X and it can be extended analytically onto all the bounded connected components of the complement @\X. For a long time the sufficiency of these conditions remained an open problem. After early articles of Appel and Runge, the works of Walsh (1926), M.A. Lavrent v (1934) and M.V. Keldysh (1945) were fundamental landmarks in the path towards its resolution. The final positive solution of the problem of polynomial approximation is due to S.N. Mergelyan (1951) who proved the following theorem. Theorem. Zf the complement of the compact set X is connected then each function f which is continuous on X and analytic on int X is the uniform limit of a sequence of polynomials.
In particular, if X is nowhere dense and its complement is connected then every continuous function on X can be approximated by polynomials with arbitrary degree of accuracy (theorem of Laorent o). The simplest example of this situation is a Jordan arc (Walsh case), in particular a closed bounded interval ( Weierstrass case). For an arbitrary compact set X c @ rational functions with poles outside of X are the natural means of approximating to continuous functions which are analytic on the interior of X. The following criterion obtained by A.G. Vitushkin (1967) makes use of the idea of continuous analytic capacity of a set M c @: cl(M) = sup Res f(z) /
I ==a)
, I
(75)
where f runs through the class of all functions which are continuous on the entire plane, analytic outside of some compact set Q/ c M, bounded in modulus by unity and tend to zero at infinity.” he rather simpler notion of analytic capacity (denoted by y(M)) is defined by the same formula (75) but on a larger class of functions f, namely those which are defined and analytic outside of some compact set Q, c M, bounded in modulus by unity and tend to zero at infinity. Clearly a(M) d y(M) and it can be shown that if M is open then a(M) = y(M). T e analytic capacity of a disk is equal to its radius.
66
Chapter 1. Classical Concrete Problems
Theorem. In order that an arbitrary function which is continuous on the compact set X and analytic on int X be the untform limit of a sequence of rational functions with poles outside of X, it is necessary and sufficient that
c@\X)
= @\int
X)
(76)
for any open disk D.
This condition means that the complement of X is sufficiently massive close to the boundary ax. It is satisfied in particular if the complement of X is the union of a finite set of finitely connected regions (theorem of Mergelyan). It is also satisfied if X has zero Lebesgue measure so that in this case any function which is continuous on X is the uniform limit of a sequence of rational functions with poles outside of X (theorem of Hartogs and Rosenthal). An example of a compact set on which there is no rational approximation was constructed by E.P. Dolzhenko (1962). If X is nowhere dense the criterion (76) takes the form Y(D\X) = Y(D). This is a necessary and sufficient condition for each function which is continuous on X (int X = 0) to be the uniform limit of a sequence of rational functions with poles outside of X. Let X be a compact set whose complement Y is simply connected. Let us consider the conformal mapping w = 4(z) of the domain Y onto the exterior of the unit disk, normalised so that #(co) = 1. The polynomial part F,(z) of the Laurent series of the function [4(z)]” (n = 0, 1,2, . . .) at infinity is called the nth Faber polynomial for the compact set X. For the closed unit disk this is z” and for the interval [ - 1, l] it is the Chebyshev polynomial T,(z). Theorem of Faber. Any function f which is analytic on some neighbourhood U of the compact set X can be expanded in terms of Faber polynomials as a series which converges untformly on some neighbourhood V (X c V c U):
where (necessarily)
1 cn = 2% s ,jJ=l+e t,k being the inverse function
of 4 and
(n = 0, LZ E
> 0 suficiently
. ..X
(77)
small.
We can say that this is a conformally invariant form of the Taylor expansion. Formulae (77) result from the fact that the systems (F,(ll/(C))),” and (c-“-l); are biorthogonal on the circle IQ = 1 + E. Now suppose that we are given any system of analytic functions of the form P”(Z) = $i
+ *=$+1 F
(lzl > r > 0; n = 0, 1,2, . ..).
0 3. Theory
of Approximation
67
Then there exists a unique sequence of polynomials is biorthogonal with (p,(z)); in the sense that
(@Jz)); (deg @” = n) which
But then we can associate with each function f(z) which is analytic on the disk IzI < R (R > r) the series
2C
n=O
Z)
(78)
with coefficients 1 fKM0 dl. cn = 2ni s ,jcp The problem of convergence arises once again in this context, also the moment problem etc. In particular, if we are given a sequence of interpolating points {z.}; lying in the disk IzI < I, then on putting
II-1 P,(Z) = kgo (z - G-l we will have
n-1 cp,(d = kGo(z - Zk)
(n = 0, 1,2, . ..)
and (78) becomes Newton interpolation series. The corresponding problem in this case is equivalent to the interpolation problem AZ
= W”
(n = 0, 1,2, . . .),
moment (79)
where
We note that the problem (79) can be transformed of the simpler form
back into a moment problem
The uniqueness of the solution of problem (79) is clear since the set of interpolating points has a limit point in the disk IzI < r < R. The question of conditions for solvability of this problem is complicated even in the case where the set of limit points for the interpolating points is finite. In a more subtle version of the problem of type (79) we require that the function f belongs to the class of analytic functions on the disk lzl < I and all limit points of the set of interpolating points lie on the boundary IzI = r. Such a solution (if
68
Chapter 1. Classical Concrete Problems
it exists) need not be unique but if, for example, we require in addition that the function f is bounded (and for notational simplicity we take r = 1) then the solution is unique if and only if O(1 - kl)
= mJ
(80)
(theorem of Blaschke). The lack of uniqueness when (80) fails is shown by the straightforward construction of the Blaschke product:
Clearly IB(z)l < 1 for all z such that JzJ < 1. The interpolation problem of the form (79) on the disk lzl < 1 with the additional boundedness requirement If(z)1 < 1 is called the Neuanlinna-Pick problem l3 . Positivity of the matrix l-wjwk ( 1ZjS
m >
j,k=O
is a criterion for its solvability. It is natural to consider in the class E of all entire functions, or some subclass of it, the interpolation problem of the form (79) for a sequence z, -+ co. We do not get uniqueness in E since there always exists an entire function o # 0 with roots at the points14 z, (n = 0, 1,2,. . . ) (theorem of Weierstrass), although uniqueness can be restored with roper’ correlations between the asymptotic distribution of the points .z~ and the a priori restrictions of growth of the function $ Theorem. For any sequence of interpolating points z, --, COand any sequence of values (w.); there is an entire function f which satisfies conditions (79). The general form of such functions is f = f. + wg, where f. is one of them, w is an entire function with simple roots z, (n = 0, 1,2, . . .) and no other roots and g runs through the class E. For fixed o each sequence {w.}$ defines a Lagrange interpolation series (L.i.s.):
2 w.o z z)’ :)n=O which formally solves problem (79). The investigation of convergence of the L.i.s. requires powerful analytic tools although the problem is simplified substantially for a particular choice of interpolating points and class ofinterpolating functions.
r3A more general version of the Nevanlinna-Pick problem consists of interpolating in a given disk or half-plane by means of analytic functions with values from another disk or half-plane. r41t is possible to construct the function o in such a way that all the roots z, are simple and there are no other roots.
0 3. Theory
69
of Approximation
Example. Let f be an entire function of exponential L2 on the real axis. By the Wiener-Paley theorem
type 71which belongs to
f(z) =& s:n4(4e-iZx dx,
(81)
where 4 E L2( - rr, a) and consequently (82)
in mean square, where
Substituting
(82) in (81) we obtain (83)
which is an expansion off in the L.i.s. corresponding to integer interpolating points. It is easy to see that the series (83) converges uniformly on each compact X c @. This theorem of Kotel ikou has important applications in the theory of radio communication. It establishes an isometric isomorphism of the space of entire functions B,” and the sequence space 12. From the physical point of view it effects continuously a discretisation of the transmitted signal. 3.7. Quasianalytic Classes. As Bore1 showed (1895), for any sequence of numbers (a,); there exists an infinitely differentiable function f on [0, l] which satisfies the conditions f( (O) = a n
(n = 0, 1, 2, . . .).
The solution of this nterpolation’ problem is not unique since there exist non-zero functions which are flat (at zero). However if the derivatives of a function 4 E C” [0, l] are subject to bounds of the form max 14
x)l< Z&r”
(n=0,1,2
)... ),
(84)
OCXGl
where I > 0 and M. > 0 is a sequence which does not increase too rapidly, then 4 cannot be flat if it is non-zero. For example, with M, = n! it follows from (84) that 4 can be extended analytically to the disk IzI < r-l and therefore if 4 is flat then 4 = 0. Let (M,),” be any sequence of positive numbers. We say that the function 4 E C” [0, l] belongs to the class C(M,) if there exists I > 0 for which the bounds (84) are satisfied. The class C(n!) coincides with the class of analytic functions on [0, 11. The class C(M,,) is said to be quasianalytic if it does not contain flat functions 4 # 0, i.e. if the corresponding uniqueness theorem holds in C(M,). The problem of quasianalyticity, which was formulated by Hadamard in 1912, con-
70
Chapter 1. Classical Concrete Problems
sists of characterising those sequences (A&),” for which the class C(M,) is quasianalytic. Its complete solution was obtained by Carleman (1926) and in rather different terms by Ostrowski (1930). Carleman
Criterion.
Put
p.=infvM,
(n=1,2,3
,... ).
fan
In order that the class C(M,)
Ostrowski
Criterion.
be quasianalytic
it is necessary and suflicient that
T(r) = sup &(r Ilao I
> 0).
be quasianalytic
it is necessary and suflicient that
Put
In order that the class C(M,)
The equivalence of conditions (85) and (86) can be demonstrated directly without reference to the problem of quasianalyticity. We outline the proof of the sufficiency of condition (86). Without loss of generality we can assume that the function 0 is infinitely differentiable on the semiaxis x > 0, satisfies the bounds Ib%)l
GK
(x 2 0; n = 0, 1,2, . ..).
belongs to L2(0, co) and & (O) = 0 (n = 0, 1,2, . . .). Then its Fourier-Plancherel transform E(1) belongs to the class H2 on the lower half-plane and satisfies the bounds Ml (n= 1,2,...), IWI G Inl” from which we obtain that on the real axis 1&(A)1 < (T(A))-’ and, consequently,
s
4, ln IEaGUl dA = _ 00; -03 1 +A2
this contradicts a well-known property of functions of class H2. The necessity of condition (86) can be established by using the construction the function E(A) of class HZ for a given lEa( (- 00 < Iz < 00).
of
$4. Integral Equations 4.1. Green differential
Function. This object arises naturally
in the inversion of linear
operators. Let us consider the operator L on C[a, b] which is defined
by the expression
71
$4. Integral Equations
J[y-J = y(n) + p,(x)y -”
+ * -* + p,(x)y
for functions y E C a, b] satisfying conditions
FiCYl z ab 12 d4Y*Yx)) I(
of the form
drib = 0
(1 < i < n).
(87)
The coefficients pi(x), an(x) are assumed to be at least continuous and the pi are functions of bounded variation. If the measures 7i are concentrated at the ends of the interval, then conditions (87) become boundary conditions; in the general case they may be called distributed conditions. The general solution of the equation Icy] =f(f~ C[a, b]) may be written in the form y =
YO +
i j=l
AjUj,
where y, is a fixed particular solution, (uj); is a basis of the space of solutions of the homogeneous equation I[u] = 0 and (Aj) runs through the space @” (or R” in the real case). Substituting (88) in (87) we obtain the system of linear algebraic equations j$ F,[Uj]Aj
= -&[Yo]
(1 < i < n)-
(89)
Let us assume that under conditions (87) the homogeneous equation I[u] = 0 has only the trivial solution. This is equivalent to the homogeneous system i
Fi[Uj]Bj
= 0
(1 < i < n)
j=l
having only the trivial solution’. But then the system (89) has a unique solution for any y,, i.e. the equation L[y] =f has a unique solution for any right hand side fe C[a, b]. For the explicit construction of the inverse operator I,-’ we have to find a particular solution by the method of variation of parameters. As a result it turns out that the operator L-r is integral,
with a certain kernel G. This kernel is called the Green function of the operator L and has the following structure
G(x, 67 =
(91)
or this, in turn, it is necessary (but not sutlkient) that the linear functionals Fi (1 C i Q n) on CR- a, b] are linearly independent. A necessary and sufficient condition is the linear independence of the restrictions of the F, to the space of solutions of the equation l[u] = 0.
72
Chapter 1. Classical Concrete Problems
We can find the functions vi’*)(<) independently of the calculations mentioned above from the following conditions, which uniquely determine the Green function: for each fixed r (a -C < < b) the function G(x, <) belongs to the class Cne2[u, b], its (n - l)th derivative exists and is continuous for x # 5 while for x = r it has a unit jump and the nth derivative exists and is continuous for x # <; moreover the differential equation
4C% 01 = 0
(x + 5)
is satisfied along with the conditions
Vi),CW, 01 = 0
(1 < i 6 n).
(92)
The description which has just been given can be substantially reduced if we invoke the terminology of the d-function: for each fixed r the Green function G(x, 5) is the solution of the differential equation
LCG(x,01 = 6(x - 0 which satisfies the conditions (92). From this point of view formula (90) is obvious since formally b
1 Gb, 51f(O dt [S a
IS b
1 sa =
’ W - tlf(O
dt =fW
1s
G(xvOf(t) dt = b(F,LCG(x,5)l.f(<) & = 0
(1 < i < n). a In the case where the conditions (87) are boundary conditions and the coefhcients P&C) are sufficiently smooth (namely pk E C a, b]) L has an adjoint differential operator L* defined by the expression2 4
a
1*cz-J =(-l)
(-l) l(p,(x)z) - ~~*+p x)z
(93 and certain boundary conditions. These are connected with the resulting integration by parts of Lagrange formula
b (KYIz - YWI} dx = W@), Z(b)) - Wb.4, %4),
sa where Y(x) = (y(x), y x), . . . , y - (x)), Z(x) is defined similarly and B is a bilinear form of the vectors Y 2. It is easy to see that the form B is non-degenerate. Subjecting the function y to the boundary conditions (87), wz extract an ndimensional subspace R of the 2n-dimensional space zf vectors Y = (Y(a), Y(b)). Let us consider the s_ubspace R’ of those vectors 2 = (Z(a), Z(b)) which are orthogonal to every YE R in the sense that ere for simplicity the concept of the adjoint operator is interpreted in the real sense, which is the natural one if the functions pk, a&, q are real.
13
0 4. Integral Equations
w(b), Z(b))- mw, Z(a))= 0. It is n-dimensional
and can be described by the n boundary conditions = 0
~i[Z]
where the 4 are linearly independent
(1 GiGn),
(87*)
linear forms of 2. The differential operator (87*) is such that
L* given by the expression (93) and conditions b
b
L[y]z
dx =
s a s (I Moreover, if z E C a, b] and the identity
yL*[z]
dx.
(94)
b
b
L[y]z
dx =
yw dx
sa s a is satisfied for ally E D(L) and some w E C[a, b], then z E D(L*) and w = L* [z]. The differential operator L is said to be self adjoint if L* = L. Such an operator necessarily has even order n = 2m and the form l[y] = yt2m) + (nl(x)y Putting y = L-’ [f]
-”
+ *.* + n,(x)y.
in the identity (94) we obtain Iabjkdx=jabyL*[z]dx.
Therefore if L* [z] = 0 the function z is orthogonal to every continuous function and so must be identically zero. Thus we can invert L* as well as L and after the substitution z = (L*)- h], identity (94) takes the form b
b
b G*(x,
ss0 (I where G* is the Green
51fCW3
dx
dt
b
=
W, ss a
WWf(5)
dx
&,
LI
function of the operator L*. Consequently, G*b, 0 = W, 4,
and for a self-adjoint
operator L its Green
function is symmetric:
G(x, 5) = W, 4. It is now clear that the systems (vi )); and (vi ;in (91) are bases for the space of solutions of the equation 1* [u] = 0, the udjoint equation for I[u] = 0. Example. The Green function of the operator interval [a, b] with boundary conditions YW
- hlY(4
= 0,
l[y]
y b) + Jw@) = 0
has the form
W, 67=
= y” - q(x)y
UlW2(5)
(x
G
0,
~2w41(5)
(x
2
09
on the
14
Chapter
1. Classical
Concrete
Problems
where (ur , u2} is a basis for the space of solutions of the equation Z[u] = 0, chosen so that u1 satisfies the condition at the point a, u2 satisfies the condition at the point b and the Wronskian of this pair of solutions is equal to 1. We note that by inverting the operator L the problem of finding eigenvalues and eigenvectors, L[w] = Iw, (95) is reduced to the homogeneous
linear integral equation b
w(x)
=
1
G&v
h~(0
dt
s II with parameter 1 (generally complex). If A is not an eigenvalue then the inhomogeneous equation LEYI = LY + f can be solved for any J Moreover
it is equivalent
to the integral equation
b
Y(X) =
G(x, BY(~) dt + g(x), sa where g = L-’ [S]. Thus the types of linear integral equations which have been introduced are naturally associated with linear differential operators and can be used for the study of the latter (in particular, this applies to the Sturm-Liouville problem). The realisation of this approach requires an independent development of the theory of linear integral equations which was carried out in the works of Fredholm, Hilbert and Schmidt; these played a prominent role in the creation of functional analysis. A
4.2. Fredholm and Volterra Equations. The definition of these classes does not require linearity; indeed non-linear integral equations also form an interesting and important subject. Suppose we are given an interval Z c R, a function g(x) on Z and a function K(x, 5, y) on Z x Z x IR. An equation of the form W, 5, y(5)) d5 = g(x) b E 1) sI is called a Fredholm integral equation of the 1st kind. By a Fredholm integral equation of the 2nd kind we mean an equation of the form (x E I). W, t> ~(0) dt + ~$4 sI Now let us put Z, = I n (- co, x] (x E I). If in the above we change the interval of integration to Z, we obtain Volterra integral equations of the 1st and 2nd kinds in the respective cases. A general theory of integral equations of the 1st kind is lacking even in the linear case, although certain particular equations of this type have been successY(X) =
$4. Integral Equations
15
fully investigated by special methods. Abel equation is an example of such (see Section 2.10). For integral equations of the 2nd kind the existence and uniqueness of solutions under specific conditions has been established by means of the contraction mapping principle. In more complicated cases topological methods connected with fixed point principles can often be successfully applied. Theorem. Let I = [a, b], g E C[a, b] and suppose that the function K(x, 5, y) on I x I x R is continuous and satisfies a Lipschitz condition for the variable y with constant M < (b - a)- Then the Fredholm integral equation b
Y(X) =
K(x,
sD
t,
y(5))
&
+
(a
g(x)
<
x
<
b)
has a unique continuous solution.
To reduce the problem to the contraction mapping principle it is sufficient to consider the mapping Ton C[a, b] which is defined by the right hand side of the equation. Since IU IM~
- VY,)W
GM
(a < x < b),
b I(YI(~) - YAOI d5 s ll
we have in the uniform metric WY,,
TY,) G Mb - 44y,,
~2).
Therefore the solution not only exists and is unique but it is also the uniform limit on [a, b] of the iterative process Y,+,(X) =
b K(x, 5, v.(O) s0
& + g(x)
(n=O,
1,2,...)
for any y. E C[a, b]. Corollary. Let I = [a, b], g E C[a, b] and suppose that the function K(x, 4, y) on I x Z x R is continuous and satisfies a Lipschitz condition for the variable y with some constant M. Then the Volterra integral equation Y(X) =
x K(x, t, ~(0) Ia
has a unique continuous min(b - a, M-l).
solution
dt + g(x) on the
(a s x < b) interval
[a, a + 1) where
1=
We obtain from this, in particular, the theorem of Picard on the existence in some neighbourhood of zero of the solution of the Cauchy problem 2 =f(x,
Y(X)),
and its uniqueness, where f is a continuous
Y(O) = Yo
(96)
function which satisfies a Lipschitz
76
Chapter 1. Classical Concrete Problems
condition equation
for y. In fact, problem Y(X)
(96) is equivalent
=
xf(5, I
~(0)
&
to the Volterra
integral
+ ~0.
0
In what follows we will again consider only linear integral equations. 4.3. Fredholm Theory.
Let us consider the Fredholm
integral equation
Y(X) =
b W, Oy(t) dt +f(x) (97) Ia on the interval a d x G b, assuming that the kernel K is continuous on the square a < x, < < b andfe C[a, b]. We also look for a solution in C[a, b] (all functions considered can be complex). The equation (97) is the continuous analogue of the system of algebraic linear equations
Yi= iil
jYj+ fi
(1 < i < n).
This analogy is highly significant: the original construction of Fredholm theory was based on passage to the limit from the system (98) to the equation (97) as n-co.
The Fredholm alternative is the central result of the theory: either the equation (97) has a solution for any f~ C[a, b] or the homogeneous equation b u(x)
=
K(x,
MO
dt
(a < x < b)
WV)
drl
(a < x < b),
(99)
I II has a non-trivial solution. In addition to this we can assert that 1) the subspace of solutions of equation (99) is finite-dimensional, 2) a necessary and sufficient condition for the solvability of equation (97) with fixed f is that f be orthogonal to all solutions of the adjoint homogeneous equation b u(x)
=
Kh
w*)
I (I of the spaces of solutions of equations (99) and (99*) are
3) the dimensions equal.3 Thus the basic facts of the theory of finite systems of algebraic linear equations hold for Fredholm integral equations of the 2nd kind. When applied to the problem of eigenfunctions, b w(x)
=
I
sa
W.-G
t%W
d5,
3If they are equal to m then 2) reduces to m orthogonality conditions (J 0,) = 0 (1 Q k d m) where (ok)7 is a basis of the space of solutions of equation (99*).
77
$4. Integral Equations
this yields, in particular, the finite-dimensionality of the corresponding eigenspaces.4 Here the eigenvalues turn out to be the roots of a certain entire function (Fredholm determinant) from which it follows that, if the set of eigenvalues is infinite, then it can be expressed as a sequence tending to infinity. This last result was noted in Section 2.11 for the Strum-Liouville problem but it can now be regarded as having been established for the more general problem (95). 4.4. Hilhert-Schmidt Theory. The Fredholm theory remains valid for the space L2(Z) (even in the case of an infinite interval I) if it is assumed that K(x, 5) is a Hilbert-Schmidt kernel, i.e. if it is square integrable as a function of x and 5 and Hermitian: K(<, x) = K(x, 5) (these conditions are satisfied if Z is finite and K is the Green function of a self adjoint linear differential operator). All the eigenvalues then turn out to be real and eigenfunctions corresponding to distinct eigenvalues are orthogonal. We have here a complete analogue of the spectral theory of Hermitian matrices, the only difference being that the set of eigenvalues is infinite because the space L2(1) is infinite-dimensional. In fact we associate with the kernel K(x, 5) a (Hermitian) quadratic functional QCYI =
W, OY(X)Y(~) dx dt ssI I just as a (Hermitian) quadratic form is associated with any Hermitian matrix. The variational method applied to the functional Q under the condition s leads to the following fundamental Theorem.
, Iv(x)12 dx = 1 result of Hilbert-Schmidt
theory.
Suppose that the equation
WG 5)W & = 0 sI with a Hilbert-Schmidt kernel has only the trivial solution in L Z). Then the eigenvalues form an infinite sequence tending to infinity and a complete orthonormal system (4 ;” can be constructed from its eigenfunctions. A most important consequence of this result is the theorem on the complete system of eigenfunctions in the Strum-Liouville problem and, in general, for any linear self-adjoint differential operator on a finite interval. Let us denote by 12,the eigenvalue corresponding to the eigenfunction 4 Then we have the expansion of the kernel
4We can even prove the finite-dimensionality of the collection of root-spaces (r= 1,2,3,...) (K is the integral operator with kernel K(x, I), 1 is the identity operator). w,(A) = {w: (1 - nzcyw
= 0)
78
Chapter 1. Classical Concrete Problems
which converges in mean square on Z x I. Consequently Q[y]
=
2
(101)
!!!!!
n=1
4
'
where the c, are the Fourier coefficients of the function y with respect to the system (4Jp, i.e.
cn =sYC&W - dx. I
The identity (101) is just the theorem on the reduction of the Hermitian functional Q to its principal axes. We note that the relation
quadratic
follows from (100) on applying Parseval equality. If all the 1, > 0 (which is equivalent to positivity of the functional Q) and if further the kernel K is continuous and the interval Zis finite, then the expansion (100) converges uniformly on the square Z x Z (theorem of Mercer). In particular K(x,
x)
=
f n=l
y, ”
from which follows the trace formula: zl ; = j- K(x, 4 dx. ” I 4.5. Equations with Difference
Kernels.
We are concerned with integral equa-
tions whose kernels are of the form W, 0 = 4~ - 5) (x, r E 0 where the function a(t) (also called the kernel) is defined on the interval .Z = Z - I. First let us consider the equation m (--oo<x
5 4. Integral Equations
19
a,
A(I) = s
-CO
(-co<1<00)
a(t)e- ’ dt
be different from 1 for all 1.
In fact, using the language of Fourier transforms, we can express equation (102) algebraically: Y(I) = A@)Y@)
+ G(1).
If A&) = 1 and the equation is solvable we obtain G(1,) = 0, which cannot hold for all g E ~5’.Conversely, if A@) # 1 for all 1, then by Wiener theorem (1 - A(I))-’ where B is the Fourier transform equation (102) is satisfied by
= 1 + B(1),
of a certain function
bEL
Consequently
y=g+b*g
and clearly this is the unique solution. The theory of the equation (103) OD4~ - 0~63 d5 + g(x) (x ’ 0) s0 on the semiaxis and in the class6 L 0, co) with kernel a E L - co, co) is considerably more complicated. A technique for solving equation (103) in the case where g = 0 and the kernel decreases exponentially was indicated by Wiener and E. Hopf in 1931 and then generalised and developed by many authors. Moreover deep connections with the theory of analytic functions, functional analysis and even topology were found. The topological aspect is connected with the use of the index Y(X) =
v = ind(1 - A) z - & arg(1 - A(I))lZ ,, (for solvability of the equation for all g E L’ it is necessary as before that A(1) # 1 for all 1 and we will assume that this condition is satisfied in what follows). Because the curve [ = 1 - A(1) (- 00 < 1< co; A( f co) = 0) is closed the index is always an integer. Theorem 2. For soloability of equation (103) for all g E L’ it is necessary and sufficient that the index be non-negative. For uniqueness of the solution it is necessary and suflicient that the index be equal to zero. We shall describe a method of proof which goes back to Wiener and Hopf and was sufficiently generalised by M.G. Krejn (1958). It consists of fuctorisation of the function (1 - A(L))i.e. the decomposition 6The theory presented below extends to many other classes,in particular to Lp (p > 1).
80
Chapter 1. Classical Concrete Problems
(1 - A(n))-’ = [l + B+(L)][l where
B+(A) =sm
b, (t)e ’ dt,
+ B-(A)]
(--oo<~
B-(A) =
0
m b- (t)e- ’ dt s0
are functions which can be extended analytically to the upper and lower halfplanes respectively (b, E L’(0, co)), If v > 0 the factor 1 + B-(A) can always be chosen so that it has no roots in the lower half-plane. Moreover (1 + B-(A))-’ = 1 +
om/I?-(t)e- ’ dt (im 3, < 0), I where b- E L’(0, co). The factor 1 + B+(A) will have exactly v roots (counted according to multiplicity) in the upper half-plane (by the principle of the argument); their locations can be chosen arbitrarily, after which the factorisation will be uniquely determined (so that for v = 0 it is unique). More specifically 1 + B+(A) = R(1)(1 + c(A)), where R(A) is a rational function which has the prescribed roots in the half-plane im A> 0, a pole of order v at 1= - i and no other roots or poles, and (1 + c(A))-’ = 1 +
m j?+ (t)e ’dt s0
(im A 2 0)
where j?+ E L 0, 00). For v = 0 OD/I+ (t)e ’dt (im il 2 0). s0 For v = 0 the factorisation is brought about by the possibility of choosing a unique branch of ln(1 - A(I)) which tends to zero as A--* f co. It turns out that (1 + B, (A))-’ = 1 +
ln(1 - A(A)) =
m l(t)e- ’ dt
(-co<~
s -UZ
(a special case of a theorem of Wiener and Leuy), hence (1 - A(A))-’
= exp[ -I:a
l(t)e-
t]
exp[ -J:
l(t)eei dt]
and further the factors obtained can be written in the form m 1 + ODb- (t)e- ’ dt l+ b, (t)e- ’ dt, s0 s0 (also with the help of the same theorem of Wiener and Levy). For v > 0 the index is cancelled out by the introduction of the factor R(1). To solve equation (103) when v = 0 we extend the function g and the unknown function y by setting them equal to zero on the semiaxis x < 0, after which we
81
4 4. Integral Equations
can write the equation in the form Y(X) =
m 4x - OY(5) & + g(x) + w f -CC
(--oo<x
where W
= -
m 4x - 5)~(5) d5 (x < 0) s0 and h(x) = 0 (x 2 0). Converting to Fourier transforms and using the factorisation we obtain
w 1 + B-(n) The left hand side L’(0, co) extended Fourier transform and /I(x) = 0 if x function from L’( Consequently
= (1 + B+(Q)G(L)
+ (1 + B, (I))H(L).
of this equality is the Fourier transform of some function from by zero on the left. On the right hand side the first term is the of the function f = g + /3 * g, where p(x) = b, (- x) (x < 0) 2 0. The second term is the Fourier transform of a certain - co, 0) extended by zero on the right. Y(A) = (1 + B-(n))
O3f(x)e- x dx. s0
(105)
The arguments outlined’ can be reversed, i.e. (105) is in fact the solution (it is unique because of the previous part). For v > 0 each factorisation produces a certain solution from formula (105). Thus we obtain an infinite set of solutions parameterised by divisors of roots with total multiplicity v which lie in the upper half-plane. The space N of solutions of the homogeneous equation u(x) =
OD4x - 5)N) d< (x > 0) uw s0 is v-dimensional in the given case. All the functions appearing in N are bounded8. For v < 0 the adjoint’ homogeneous equation u(x) =
(106*) m a(< - xb(O 4 (x > 0) s0 has index (-v) > 0 and so the space N’ of its solutions is lvl-dimensional. If equation (103) is solvable in this case, S must be orthogonal to all solutions of equation (106*) (and, conversely, under this condition equation (103) is solvable). Thus a necessary condition for solvability of equation (103) for all g is that v 2 0. e draw attention to their similarity with separation of variables. sA suitable construction of a basis in N shows that each function u E N is absolutely continuous, u’ E I,’ and u(co) = 0. 9 More precisely, transposed, but we will not make use of this term, relying upon the context.
82
Chapter
1. Classical
Concrete
Problems
This situation is related to the Fredholm case but in general there is the significant difference that dim N’ # dim N. The difference dim N - dim N’ is equal to the index v in all cases, i.e. dim N =
v (v>O) 0 (v
dim N’ =
0 I -v
(v 2 0) (v
Such situation is typical of singular integral equations, i.e. equations whose kernels have sufficiently strong singularities. The first general results in the theory of singular integral equations were obtained by M. Noether (1921). This theory is closely connected with the so-called Riemann-Hilbert problem, a particular case of which is the relation (104). Its solution was given by F.D. Gakhov (1936). 4.6. The Riemann-Hilbert Problem. Let I be a simple closed smooth contour in the extended complex plane c = @ u {co} and let a([), b(c) be two continuous functions on I with a(c) # 0 for all < E I. The complement c\r consists of two domains Gi and G, (interior and exterior; co E G, u I . The problem is to find two functions f;:(z) (z E G,), f,(z) (z E G,) which are analytic on the respective domainslo, continuous up to the boundary and connected by the relation .fx) = Nx?(~)
+ w3
(r E n
(107)
The index of this problem is defined as
v = & Marg 43, where Ar is the increment for a circuit of the contour I with Gi on the left. In (104) I = R E R u {co}, Gi is the lower half-plane, G, is the upper half-plane, 40 = (1 - 4W b(l) = (1 - 45))- ,M) = Y(C) andf,(l) = W3. Usually a bounded contour I is considered and it is assumed that a, b E Lip 6 for some 6. Let us suppose further that these conditions are satislied and assume for convenience that 0 E Gi. If v = 0 there is defined on the contour I a single-valued function a(c) = In u(c), which allows us to take the logarithm in the corresponding homogeneous problem Sit0
=
(r
u(Oge(C)
E n
(108)
The additive problem hi(l)
-
~440
=
a(0
K
E r)
now arises, the solution of which (unique if #J co) = 0) is given by the integral of Cuuchy type
lo If co E G, then f. has to be regular
at infinity.
83
$4. Integral Equations
This produces for a(c) the factorisation
where gi, ge are the exponentials Of 4i, &. They are continuous on Gi, G’, analytic on Gi, G, respectively and have no roots. Such a factorisation is defined up to a common constant factor in gi and ge, If v > 0 we can nullify the index by dividing a(<) by [‘, after which we can apply the previous factorisation. This produces the desired factorisation of a([); moreover g,(z) = G((z(-‘) as lzl+ co which leaves the possibility of multiplying the solution obtained, (gi, ge), by an arbitrary polynomial of degree - 1. A criterion for solvability when v < - 1 is the orthogonality condition Wh(O 4 = 0, fr where hi is the interior part of the general solution of the associated homogeneous problem
(its index is equal to -(v + 2)). If for any index we denote by N the space of solutions of problem (108) and by N’ the space of solutions of problem (108*) (with the previous requirement of regularity at infinity), then dim N =
(v 3 0) ov + 1 (v < 0)’
dim N’ =
0 I -v-
(v> -1) 1 (v< -1)’
Therefore in all cases dimN-dimN’=v+
1.
(109)
For v = - 1 the situation becomes that of the Fredholm case. The Riemann-Hilbert problem reduces to a singular integral equation if we look for a solution in the form of an integral of Cauchy type
(in addition
the condition
f,(co) = 0 arises). By Sokhotskij
1* For Y = 0 it was one-dimensional.
formula (with the
84
assumption
Chapter 1. Classical Concrete Problems
that $ E Lip 6),
where the integral is to be interpreted as a principal value, i.e. as the limit of the integral along the arc T’,(c) = {r: r E r, Ir - cl > E} as E + 0. Substitution in (107) leads to a singular equation of the form
(110) where cc(c)= (1 + a([))/2, /3(c) = (1 - a([))/2, r(c) = b(c) and consequently 43 + P(C) = 1. Conversely, if an equation of the form (110) is given and cr([) + /l(c) # 0 everywhere on r, then (by inversion of the above calculations) it reduces to the Riemann-Hilbert problem with coefficients
which allows us to define the index of equation (110) and to obtain the same facts that are available for the Riemann-Hilbert problem. These constitute the theory of Noether:
1) finite-dimensionality of the space N of solutions of the homogeneous equation; 2) a criterion for solvability of the inhomogeneous equation in terms of orthogonality of the right hand side to the space N’ of solutions of the adjoint homogeneous equation; 3) the equality dim N - dim N’ = v (the difference from (109) is explained by the fact that f,(m) = 0). The theory of Noether remains valid for more general singular equations
where the kernel k([, r) has a
eak’ singularity:
K, 4 = our: - V),
O
The theories of Fredholm and Noether are to a considerable degree algebraic. This circumstance played a significant role in the algebraization of classical analysis with which the formation of functional analysis was associated.
Chapter 2 Foundations and Methods $1. Infinite-Dimensional
Linear Algebra
All the ituations discussed in this paragraph will be concerned with a prescribed basic linear space E # 0 over an arbitrary (within the limits of this section) field K (or simultaneously with several such spaces). The elements of the space E, i.e. oectors or points, will usually be denoted by lower case Roman letters, while lower case Greek letters will be used for scalars, i.e., elements of the ground field. The identity of the field will be denoted by the figure 1. The standard function spaces (C, Lp etc.) lie outside of the range of traditional linear algebra, since it is restricted to the finite-dimensional case. Consequently in the context of functional analysis this restriction is unacceptable, the reason being that the spaces considered below are generally infinite-dimensional. Infinite-dimensional linear algebra comes in at the foundation of functional analysis as a suitable lgebraic model’ for the theory of linear topological spaces, which occupies a central place in functional analysis. 1.1. Bases and Dimension. By an algebraic’ basis, or Hamel basis, of a space E we mean a maximal (with respect to inclusion) linearly independent subset. Using Zorn lemma (or, equivalently, transtinite induction) we can easily prove Theorem 1. Any linearly independent subset basis. Corollary
of a space
E can be extended to a
1. There exists a basis in E.
Corollary 2. A basis of any subspace L # 0 can be extended to a basis of the whole space E.
Moreover the extending vectors2 form a basis of a subspace M such that the whole space E is the direct sum of L and M (L i M = E, i.e. M is a complement for L). Let a basis (ei}is, of the space E be given (I is an index set). In the unique representation of a vector x E E with respect to the given basis the set of indices of coefficients which are different from zero is finite. It is called the support of the vector x and is denoted by supp x. We have x
=
ie&
ti%
his word can be omitted in the purely algebraic context of the present paragraph. Zlf they exist.
Chapter 2. Foundations and Methods
86
which can also be written formally as X = 1
(the & are called the coordinates of the vector x with respect to the given basis). Any other basis {Uj}j,J can be expressed in the form uj
=
1 isI
&jet,
where the transfer matrix a = (aij) is infinite (if E is infinite-dimensional) but each of its columns has finite support, i.e. it contains only a finite set of non-zero elements; in addition, the matrix a is invertible: if ek
and b =
(Bjk),
=
C je.l
bjk
(k
E 1)
then jTJ
Q&k
=
hk
(i,
k E 0
zI
fijiail
=
sjl
09
l E 4.
(1)
If x has coordinates (&)ier with respect to the initial basis and (~?i)~,~with respect to the new basis, then 5i
The formal distinction have
=
jTJ
QijVj-
between the index sets Z and .Z is not essential since we
Theorem 2. All buses of the space E are equipotent. Proof.
In the previous notation i H
we have the mapping SUPP
f?i
(i E I)
hl
of the set Z into the set of finite subsets A of the set .Z. The preimage of each A is finite. Therefore card Z G card J. Similary, card .Z G card I. 0 The power (cardinality) of any basis is called the algebraic dimension of the space E and is denoted by dim E. As a common index set for all bases we can take the least ordinal of power dim E. Let us denote this by n. We note that, after identifying .Z with I, we can write formulae (1) in the form ub = e, bu = e, where e = (~5,~)is the identity matrix. The invertible matrices whose columns have finite support form a full linear group GL(n, K). On any set S a function rp: S + K is said to have finite support if its support supp cp = {s: q(s) # 0} is hnite3. The functions of finite support form a subspace Q+,(S, K) of the space @(S, K) of all functions (it is a proper subspace if the set S is infinite). In @,-JS,K) we have the canonical basis {~~}sos: d,(t) = &(t E S). The rp on a topological space is said to have compact support if its support supp cp= {s: q(S) # 0) is compact. This coincides with the present notion of we give S the discrete topology. 3A function
0 1. Infinite-Dimensional
representation
Linear Algebra
a7
of any rp E @e(S, K) is cp =
c
(Pws = c (PM4.
SOSUPPCP
SES
Since, clearly, dim @e(S, K) = card S, any cardinal is the dimension of some linear space. The dimensions of isomorphic spaces (E, x E,) are obviously equal and conversely isoinorphism of the spaces follows from equality of their dimensions (a l-l correspondence between the bases can be extended by linearity to an isomorphism of the spaces). Example. If B is a basis in E, then E x Q&3, K). Hence it follows in particular that, if K is infinite or E is infinite-dimensional, then4
card E = max(dim
E, card K).
We can use this formula to calculate the cardinal v = dim @(S, K) for infinite S (if S is finite then v = card s). We have card @(S, K) = max(v, card K). But v 2 card K, for even if S = N E (0, 1,2, . . .} the functions cp,(s) = ~1’(a E K) are linearly independent. Thus dim @(S, K) = card @(S, K) = (card K)cardS. We note that dim @(S, K) > card S, i.e. dim @(S, K) > dim @e(S, K).
(2)
An interesting quantity associated with a linear space E (dim E > 1) is the least power for a set of proper subspaces covering E. It turns out to be countable if E is infinite-dimensional equal to card K if E is finite-dimensional
and K is infinite, and K is infinite,
equal to card K + 1 if K is finite. The intersection and sum are defined for any set Y = {L,} of subspaces. The latter consists of all possible finite sums x,, + *a* + x,,(x, E L,). The set Y is said to be independent if L, n Cr+ L, = 0 for all rr. The sum of an independent set of subspaces is called a direct sum5 and is denoted by z. Its basis can be obtained by combining the bases of the summands and therefore its dimension
f K is finite and E is finite-dimensional
then clearly
card E = (card K)d E. e can also define the external direct sum FE, of a set of spaces {E,} as the subspace of the cartesian product IIE, which consists of those elements x = (x,) with finite support. Further, if there is a scalar product (. , .), on each E,, we can introduce a scalar product in 2PE, by putting (x, y) = Z(x,, y,). In this case we refer to the orthogonal sum of the spaces E, and use the symbol @ in place of + .
88
Chapter 2. Foundations and Methods
is equal to the sum of their dimensions. In the general case the dimension of a sum of subspaces does not exceed the sum of their dimensions. The intersection of all subspaces which contain a given non-empty set X c E (i.e. the smallest such subspace) coincides with the linear hull Lin X, i.e. with the set of all linear combinations of vectors in X. The dimension dim(Lin X) is called the rank of the set X and is denoted by rk X. It is equal to the power of any maximal linearly independent subset Y c X (each such Y is a basis in Lin X). One of the fundamental constructions of linear algebra associates with each subspace L c E the factor space E/L, consisting of the classes of the following equivalence relation (congruence modulo L): x=y(modL)ox-yEL. The class of the vector x is denoted by [xl. The operations on the classes are defined naturally by
Cxl + CYI = cx + Yl9
dxl
= Caxl,
which are valid because the right hand sides of these identities are independent of the choice of representatives of the classes on the left hand sides. Any complement M of the subspace L is isomorphic to E/L (the natural mapping x H [x] serves as the isomorphism). All complements of a given subspace L are therefore isomorphic and consequently they all have the same dimension. It is called the codimension of the subspace L and is denoted by codim L. If codim L = 1 (and only in this case) L is a maximal (with respect to inclusion) subspace different from E and is called a hyperplane. A subset of the form A = x + L (x E E, L a subspace) is called an afme manifold. In addition we put dim A = dim L, codim A = codim L. If dim A = 1 it is called a line; if codim A = 1 it is a hyperplane. 1.2. Homomorphisms and Linear Functionals. For any linear spaces E,, E, the linear space of homomorphisms6 h: E, + E, is denoted by Hom(E,, E,). In particular Hom(E, K) is the space of linear functionals on E. It is called the algebraic conjugate of E and is denoted by E’. If a basis B = {e,}iel is given in E then to each f E E” there corresponds a function f(ei) on B and this correspondence is an isomorphism (so that E’ x @(B, K))’ since f(x)
=
iTI
f(ei)W),
where the &(x) are the coordinates of the vector x with respect to the basis B. We stress that the coordinate functionals ti do not form a basis in EX if E is infinite-dimensional, since the infinituples (f(ei))i,, can be chosen arbitrarily and 6According to the standard definition, (h, + h,)x = h, x + h,x, (c&)x = a(hx). Further, if h: E, -B E,, g: E, + E, are homomorphisms, then their product (superposition or composition) gh: E, + E, is also a homomorphism. n other words @,JS, K)* x @(S, K) for any S.
0 1. Infinite-Dimensional
Linear
Algebra
89
not just with finite support. However the set {gi)is, is linearly independent because of the biorthogonality relation: &(er) = 6,. As a result of (2), dim EX > dim E if (and only if) E is infinite-dimensional.* If L is a subspace of E, the restriction & of any f E EX is a linear functional on L. Conversely, any f0 E L# can be extended to an f E EX by choosing any complement M for the subspace L and setting f(x) = fe(u) where x = u + u (u E L, u E M). Hence it follows in particular that if e E E and e # 0 there exists f E E’ such that f(e) # 0 (it is possible to make f(e) = 1). The same result shows that the linear functionals on E separate points: if x, y E E and x # y there exists fc E” such that f(x) # f(y). When applied to L = Lin{e}, the extension method described above shows that, if a subspace V does not contain the vector e, there exists f E E” such that f(e) # 0 and flY = 0. Any subspace is therefore the set of solutions of a system of linear equations h(x) = 0 ({J) c E”). The subspace L’ = {f: f E E’, f(x) = 0 (x E L)) of linear functionals which are orthogonal to the subspace L is called the annihilator9 of L. In the language of projective geometry the mapping L I-+ L * is a correlation. It is an involution in the following sense. For any subspace N c E” we define its annihilator in E to be the subspace N, = {x: x E E, f(x) = 0 (f E N)). Then” (L 1)1 = L. We note the further obvious implications L1cLz*L+L:,
Nl = Nz *Wd.
= WA1
and the correlation relations
for any families of subspaces {L,} and {N,} in E and E” respectively. With each homomorphism h: E, -P E, are associated two subspaces: its kernel Ker h c E,, the set of solutions of the homogeneous equation hx = 0, and its image Im h, the set of those y for which the inhomogeneous equation hx = y is solvable. The image Im h is isomorphic to the factor space E,/Ker h (theorem of E. Noether); the canonical isomorphism is h x] = hx. Example. Let us consider the homomorphism r: E” + L# (L is a subspace of E) obtained by restricting functionals to L: rf = flL. Clearly Ker r = L’, *In this case we have the formula dim P
= (card K)d E.
his definition remains meaningful for any non-empty of E”. lo In general (Nl)’ # N although N c (NJ ‘.
subset X c E and as before X ’ is a subspace
90
Chapter
2. Foundations
and Methods
Im r = L”. Consequently L” x E#/L’. It is appropriate to point out here that is cp H cpj, where cpE (E/L)# and (E/L)’ x L’ (the canonical isomorphism j: E + E/L is the natural homomorphism). Returning to our consideration of kernels and images, we note that injectivity (monomorphicity) of a homomorphism h is equivalent to the identity Ker h = 0 and surjectivity (epimorphicity) is equivalent to the identity E,/Im h = 0. This factor space is called the cokernel and is denoted by Coker h (and E,/Ker h is the coimage, denoted by Coim h). The dimensions of the subspaces Ker h, Im h are called respectively the defect and rank and are denoted by def h, rk h; their codimensions are the codefect (codef h) and corank (cork h). Clearly, as a consequence of Noether theorem, we have codef h = rk h. Now let us consider the extremely important canonical homomorphism u: E + E#” which is defined by the formula
h)(f) =fM (xEE,fEE+). It is injectiue since, if f(x) = 0 for all f E E#, then x = 0. Identifying E we obtain the inclusion reflexive if E# ” = E.
Theorem. In order that E be algebraically that it be finite-dimensional. Proof
Im u with
E c E ##. The space E is said to be algebraically
If E is infinite-dimensional
reflexive it is necessary and sufficient
then dim E* # > dim E. q
We note that as well as its small annihilator NI c E any subspace N c E” also has a big annihilator: N’ c E# #. Moreover NI = N’ n E. Examplel.
IfN=OthenN,=EandN =E .
Example 2. If N = E” then N, = 0 and N* = 0. With each homomorphism h: E, + E, there is associated its algebraic conjugate hX: Ef -+ Ef which is defined by the formula
@#f)(x) = f&4
(xEEl,fEE;).
There is a certain symmetry ( uality between h and h” which is reflected, in particular, in the following system of formulae: Ker h# = (Im h) Ker h = (Im h l; (3) Im hX = (Ker h) Im h = (Ker hX)I. (4) The result that orthogonality of the vector y to all solutions of the adjoint homogeneous equation h#f = 0 is a criterion for solvability of the equation hx = y is contained in these. We see also that injectivity (surjectivity) of a homomorphism his equivalent to surjectivity (injectivity) of the conjugate homomorphism h#. In addition
to formulae (3) and (4) we note the relations:
Ker h” x (Coker h)#,
Coker hX x (Ker h)
Im h# z (Im h)#.
Q 1. Infinite-Dimensional
Substitution the identities
Linear
of h# for h in the left members Im h## = [(Im h)l]‘,
91
Algebra
of formulae
Ker h## = [(Ker
(3) and (4) leads to h) ‘1 I.
The homomorphism h# #: Ef ” + E2## is the natural extension of the homomorphism h from E, to Ef # . The linear spaces over a given field K are the objects of a category in which the homomorphisms serve as morphisms. The following contravariant adjoint jiinctor” acts on this category: E H E#, h H h# ((gh)# = h#g#, l# = 1). The more general contravariant functor E H Hom(E, E,) (E, is a fixed space) transforms the morphisms by means of the commutative diagram
The covariant functor E H Hom(E,, E) is constructed similarly. With each linear space E there is associated the set End E = Hom(E, E) = L(E) of endomorphisms or linear operators on E (defined on the whole of E). Under all the available algebraic structures on L(E) it is an associative (noncommutative if dim E > 1) algebra with identity 1 over the field K. The invertible elements of this algebra, i.e. the automorphisms of the space E, form a group Aut E z GL(n, K). The idempotents of the algebra L(E), i.e. operators P satisfying the condition P2 .= P, are closely connected with the geometry of subspaces; in fact in this case E=ImP-kKerP,
PI,,,=1
and conversely if E = L i M the representation arbitrary vector x E E defines a linear operator Im P = L and Ker P = M. This is the projection the projection onto M along L. If {ui} is a basis is the coordinate of the vector x with respect corresponding to the vector ui, then
x = u + u (u E L, u E M) of an P (Px = u) for which P2 = P, onto L along M and 1 - P is in L, (uj} is a basis in M and ti to the basis (q} U {vi} in E
Px = c &Ui. I Associated with any decomposition of a space E into the direct sum of subspaces L, is a family of projections P, which are defined by the conditions Im P, = L,,
Ker PO = 1. L,.
Clearly for any x E E there is only a finite set of its projections I1 In addition
the mapping
h H h” is linear: (h, + h,)*
= h: + h,X, (u.h)# = Ax.
P,x different from
92
Chapter
2. Foundations
and Methods
zero and x = 1 P,x. d
We note that the projections
under consideration
are pairwise annihilating,
i.e.
P,P, = 0 (r # 0) and the family P,, is total in the sense that
n Ker P, = 0. 0
Invariant subspaces of any linear operator A: E -+ E can be conveniently characterised in terms of projections. In order that a subspace L be inuariant’2 (i.e. AL c L) it is necessary that any projection P onto L should satisfy the relation PAP = AP and it is sufficient that even just one projection should satisfy this relation. The stronger commutativity condition PA = AP implies the invariance of not only the subspace L = Im P but also of its complement A4 = Ker P. In this case we say that the subspace L is completely invariant under (or reduces completely) the operator A, about which we say in turn that it decomposes into the direct sum of the restrictions Al, and Al,. This construction extends in an obvious way to the situation where the space E is decomposed into a direct sum of an arbitrary family of invariant subspaces. The famous problem of the existence of non-trivial (i.e. different from 0, E) invariant subspaces is trivial in the purely algebraic context. Theorem. If the field K is algebraically closed and dim E > 1 then any linear operator A: E -+ E has a non-trivial invariant subspace. Proof. We can assume that E is infinite-dimensional and that A does not have any finite-dimensional invariant subspaces other than the zero subspace. Then for any x E E, x # 0 the system (Akx}; is linearly independent. The desired subspace is Lin { Akx} 7. 0
In certain cases it is easy to describe all invariant
subspaces.
Example. Let us consider differentiation as an operator D on the space II of all polynomials in one variable. The invariant subspaces are precisely the subspaces II, of polynomials of degree at most n (n = 0, 1,2, . . .), the zero subspace 0 and II. The operator D is unicellular in the sense that its invariant subspaces form a chain under inclusion: 0 c II, c II, c ... c II. Consequently D has no non-trivial completely invariant subspaces. If the subspace L is invariant for the linear operator A, then it is invariant for all polynomials P(A). This obvious fact has a partial converse (Yu.1. Lyubich, 1955). Theorem. Let the subspace L c E be invariant for the operator Q(A) where Q is some polynomial of degree n. Let us suppose that there exists a set {I,, . . . , A,} c K such that I2 We also say that L reduces
the operator
A.
5 1. Infinite-Dimensional
Linear Algebra
93
1) the equations Au - I,u = ~(1 6 k < n) are solvable in L for all o E L, 2) the equation A(A)d = 0, where A(L) = flICkSn(l - A,), has only the trivial solution in L. Then L is invariant for A. In fact it is shown under the stated conditions that if x E L and Q(A)x E L then A E L (1 d m d n). It is even possible to relax the requirement that A is defined on the whole of E, but then it is necessary to have x E D(A . The following well-known result in analysis is a consequence of this general formulation. Lemma of Esclangon. Let x(t) (t > 0) be a bounded solution of an nth order linear differential equation with constant coefficients and bounded right hand side. Then all the derivatives xc (t) ) (1 < m < n) are also bounded.
A further development of the approach described leads to bounds for the intermediate derivatives which are similar to A.N. Kolmogorov inequalities. 1.3. The Algebraic Theory of the Index. A homomorphism to be Noetherian if Ker h and Coker h are finite-dimensional. homomorphism its index is defined as
ind h = def h - cork h = dim(Ker
h: E, + E, is said If h is a Noetherian
h) - dim(Coker
h).
A Noetherian homomorphism with zero index is called a Fredholm homomorphism. In this case surjectivity is equivalent to injectivity, i.e. the Fredholm alternative holds: either the inhomogeneous equation hx = y is solvable for all y or the homogeneous equation hx = 0 has a non-trivial solution. It follows from formulae (3) and (4) that the homomorphism h# is Noetherian if and only if the homomorphism h is Noetherian and in the Noetherian case cork h# = def h 2
def h# = cork h, so that ind h# = -ind
h. It is appropriate
to note here that
ind h = def h - def h#. The product gh of two Noetherian index has the logarithmic property:
homomorphisms
is Noetherian
and its
ind(gh) = ind g + ind h. Consequently the product of two Fredholm homomorphisms is also a Fredholm homomorphism. This property is preserved under mall’ perturbations. The following definition is required for a more precise formulation: a homomorphism is said to be of finite rank if its image is finite-dimensional. Theorem.
Zf A: E -+ E is an operator of finite rank then 1 - A is a Fredholm
operator. Proof. We take a complement of the subspace L = Im A in the whole space: L i M = E. If x = u,, + u (uO E L, u E M) then x = u + (1 - A)u, where u =
94
Chapter
2. Foundations
and Methods
u0 + AU E L. Consequently L + (1 - A)M = E and it is easy to see that this decomposition is direct. Let us consider the operator A”= Al, (clearly the subspace L is @variant under A). From what has gone b_efore, Coker(1 - A) z Coker(1 - A). At the same time Ker(1 - ,A) = Ker(1 - A). Consequently 1 - A is Noetherian and ind(1 - A) = ind(1 - A) = 0, since every operator on a linitedimensional space is a Fredholm operator. 0 The perators of finite rank on E form a two-sided ideal @ in the algebra L(E). Thus if U E Aut E and A E @ then U - A = U(1 - U-IA) is a Fredholm operator. It is useful to record the general form of an operator of finite rank. It is
where (u,J;l is an arbitrary system of vectors and (fk)T is an arbitrary system of linear functionals. For the (Q) we can always choose a basis in Im A and then the system (fk) is linearly independent. If these systems are biorthogonal (and only in this case) A is a projection. In a certain sense the investigation of arbitrary Noetherian operators reduces to the previous situation. Theorem.
If the operator
T: E ---* E is Noetherian R, (left and right regularisors) such that R,T=
1 -A,
where A, B have finite rank. Conversely, stated regularisations is Noetherian.
there exist operators
R, and
TR, = 1 - B, any operator
which allows both of the
Proof. If A is a projection onto Ker T and P is a projection onto Im T and if 7 = TIKerA (clearly 7: Ker A + Im T is an isomorphism), then R, = (1 - A)? , R, = ?lP (B = 1 - P). The converse statement follows from the previous theorem and the inclusions Ker T c Ker(1 - A), Im T 3 Im(1 - B). 0 Remark. It can be said that the Noetherian modulo @ (moreover R, = R, (mod @)).
It follows from the logarithmic
operators on E are invertible
property of the index that
ind T = -ind R, = -ind R,. Suppose that the operator I/: E + E is small in the sense that 1 + R,V and 1 + VR, are invertible. Then (T + V)R, = (1 - B(l + I/R,)- (l
+ 1/R,),
and R,(T + V) = (1 + R,V)(l
are Fredholm
- (1 + R,V)- )
operators, from which it follows that T + V is Noetherian ind(T + V) = -ind
R, = ind T.
and
Q1. Infinite-Dimensional Linear Algebra
95
From this emerges the important fact of the stability of the index under small perturbations which is entirely natural since the index takes integral values. The calculation of the index of some particular operators is usually an extremely interesting and often very difficult problem, as for example in the case of the famous Atiyah-Singer theorem on the index of an elliptic linear differential operator. 1.4. Systems of Linear Equations. Let F = (fi)ie, be an arbitrary system of linear functionals on the space E (the index set I can be finite or infinite). Let us consider the system of linear equations fitx)
=
i
(i E I),
where the yli are given scalars. Clearly a necessary condition of the system (5) is that the implication
(5)
for solvability
(consistency)
holds for any system of scalars (jli)iel with finite support. Conversely, if (6) is satisfied the system (5) is equivalent to a subsystem corresponding to a maximal linearly independent subset F, c F. If we introduce the homomorphism h,: E + @(I, K) by means of the formulae (hFx)(i) = fi(x) (i E I) we can write the system (5) in the form of an equation h,x = y, where y(i) = vi (i E I). Lemma. Zffi,...,fm are linearly independent linear functionals, the system of equations h(x) = vi (1 < i d m) is solvable for any right hand sides vI, . . . , q,,,. Proof. Because the set M = { 1, . . . , m} is finite we can identify @(M, K)# with @(M, K). The conjugate homomorphism h; maps any collection (Ai) to cE1 A&. By the condition hz is injective. Consequently h, is surjective. 0 Corollary. If the system (5) has finite rank (i.e. rk F is finite) then for its consistency it is necessary and sufficient that condition (6) be satisfied.
In the case of infinite rank condition
(6) is no longer sufficient.
Example. Let (ei}i E1 be a basis in E and { ri}i EI the set of coordinate functionals. Because of their linear independence the system of equations ti(x) = qi(i E I) satisfies condition (6) but it is solvable only for those (qi)ior of finite support. In the general case, (6) is necessary and sufficient for the consistency of each finite subsystem of system (5), i.e. for the reduction to the case of linearly independent equations. If the system (5) is consistent and x0 is some particular solution of it, then the general solution is described by the formula x = x0 + z, where z is the general solution of the corresponding homogeneous system: fi(z) = 0
(i E I).
(50)
96
Chapter
2. Foundations
and Methods
We denote the subspace of solutions of system (5,) by NF. If {,z~}~.~ is a basis of it, the general solution of the system (5) has the form + C 04iZj, j where (aj)j EJ runs through all possible scalar systems with finite support. For uniqueness of the solution of system (5) it is necessary and sufficient that NF = 0, .e. system (5,) should have only the trivial solution. In this case the system of functionals F is said to be total. Since NF = Ker h, we have X
=
X0
codim NF = rk h,. In the case where the system consists of one equation f(x) = 0 (f # 0) we have codim NF = 1, i.e. NF is a hyperplane (NF = Ker f). Conversely, for any hyperplane H c E there exists a linear functional f # 0 such that Ker f = H. In fact, since dim(E/H) = 1, we can identify E/H with the held K and then f: E + E/H is the natural homomorphism. Finally, if Ker fi = Ker f2 then fi and f2 are proportional. Thus the set of hyperlanes in the space E can be identified with the projective space PE” consisting of the classes under proportionality on E#\ (0). In contrast to the very concise representation of the system (5) in the form of the equation h,x = y we can represent it in scalar form jJ$J
ccijC;j
=
Vi
ti
E 03
(7)
where (c;i)jsJ are the coordinates of the vector x E E with respect to some basis Iej>jeJ and aij = J(ej). In addition only solutions of finite support are sought. However we can dispense with this restriction if the rows in (7) have finite support. From the geometric point of view a system of the form (i E I)
(5#) is obtained, where xi is the vector with coordinates (IX~~)~,,and f is an unknown linear functional. The condition ftxi)
=
Vi
T &xi = 0 =s T liqi = 0
(A E Go(Z, K))
(6
is sufficient for solvability of the system (5#) since, if the system (xJisl is already linearly independent, it can be extended to a basis, after which the functional f is defined by the values vi on the vectors xi and is extended arbitrarily on the other basis vectors. With this we have established Theorem of Toeplitz. For consistency of the system of equations (7) when in each equation the set of non-zero coefficients is finite13, it is necessary (triuial) and sufficient that all its finite subsystems be consistent.
I3 We shall sometimes
call such a system
a Toeplitz
system.
$1. Infinite-Dimensional
Linear
Algebra
97
It is interesting that, under specific conditions on the field K, this theorem generalises14 to systems of algebraic (non-linear) equations (see Section 1.7). 1.5. Algebraic Operators. A linear operator A: E + E is said to be algebraic if there exists an annihilating polynomial cp # 0 for A: q(A) = 0. In this case an annihilating polynomial of least degree is uniquely determined up to a scalar factor. We can therefore take it to be manic and then it is called the minimal polynomial of the operator A. All annihilating polynomials are divisible by the minimal polynomial. Example
polynomial
1. Any projection is 1’ - II.
P is algebraic: P2 = P. If P # 0, 1, its minimal
Example 2. If A is a nilpotent operator, i.e. A” = 0 for some m, then it is clearly algebraic and its minimal polynomial has the form ,?* where r < m. The number r is called the index of nilpotence of the operator A. If E is finite-dimensional every linear operator on E is algebraic. This property is fundamental to the classical theorem of Jordan, thus allowing the theorem to be generalised to algebraic operators on an infinite-dimensional space (whose minimal polynomials factorise into linear factors over the field K). The set of eigenvalues of an algebraic operator A is called its spectrum and is denoted by spec A. Associated with each 1 E spec A are the root subspaces W,(A) = (x: (A - Il)kx
of orders k = 1,2,3, . . , . They are all invariant the eigenspace and clearly
= 0}
under A. The subspace W,(L) is
0 $2 W,(l) c W*(k) c ... . The union w(n) = kul w,@) is called the maximal root subspace at the point 1 (correspondingly, the vectors x E W(n)\ (0) are called root vectors). If there exists r > 1 such that W(n) = W,(n) then the least such r is called the order(ord A) of the eigenvalue I; otherwise ord 1= co. It is easy to see that, if ord 1 = r < co, then in the chain of root subspaces the first r - 1 inclusions are strict and the rest are equalities. The operator Alwrll, has a Jordan structure, i.e. it decomposes into a direct sum (in general of an infinite set’ 5, of finite-dimensional invariant subspaces on each of which the matrix of the operator with respect to a suitable basis is a Jordan cell 14This range of questions is closely connected with the well-known GGdel-Mal sev theorem compactness in mathematical logic. IS For finiteness of this set it is necessary and sufficient that the eigenspace W,(I) be finite-dimensional (then all the W,(I) are finite-dimensional).
on
98
Chapter
2. Foundations
and Methods
.
1 0
...
0
0’
0
/z 1
...
0
0
0
0
/I
...
0
0
.
.
.
...
.
.
0
0
0
...
2
1
0
0
0
...
0
2,
of order
of this fact with the following
Let A be an algebraic
Cp(n)
=
~
(n
-
operator
and let its minimal
s , ,
j=l
2, are pairwise distinct. Then {S} ’1 is the set of all eigenvalues of the whereA,,..., operator A, their orders are equal to the corresponding rj and the space E decomposes into the direct sum of the maximal root subspaces: E = I+ &) i ... i Vi &).
(8)
In particular, if all the roots of the minimal polynomial are simple (and only in this case), the operator A is diagonalisable, i.e. E decomposes into the direct sum of the eigenspaces. A decomposition of the identity operator 1 into the sum of root projections PI, . ..) P, is defined by the decomposition (8). These projections have the following explicit representation: Pk = cp,(A), where (pk is the interpolating polynomial corresponding to the conditions: cp,(Aj) = djk; The root projections case
q# Aj) = 0
(1 Q i < rj - 1; 1 <j
commute with the operator
< s).
A and in the diagonalisable
(spectral resolution of the operator). A further consequence of the spectral theorem is the invertibility of the operator A - Al for all L $ spec A. The inverse operator (A - Al)-’ is called the resolvent of the operator A and is denoted by R,(A) (abbreviated, R,). Correspondingly K\spec A is called the resolvent set and its points are said to be regular for A; it is therefore denoted by reg A. We have a decomposition into partial fractions:
5 1. Infinite-Dimensional
where P/O) = Pj, the root projections,
99
Linear Algebra
and 4” = (A - Ajl,iPj. Consequently
R,(A) = y, where 8 is a polynomial in two variables (deg, 8 < deg cp- 1). The operator p(A) is naturally defined for any rational function p whose poles are different ,from A,, . . . , A,. Moreover the Lagrange-Sylvester formula holds:
The correspondence p H p(A) is called the functional This is a homomorphism of the algebra of rational on spec A into the algebra of operators. We note mapping theorem: spec p(A) = p(spec A). Now let A be any operator, (Aj>; some collection (1 <j < s) and q(A)
=
fi
(A
-
calculus for the operator A. functions which are regular in this context the spectral of its eigenvalues, rj = ord Aj
Aj),
j=l
where vi < rj + l(1 < j d s). Let us consider the linear equation cp(A)x = 0 with unknown x E E. The subspace of its solutions N = Ker q(A) is invariant under A and the operator Al, is algebraic. Its minimal polynomial coincides with cp. By the spectral theorem N = w (n,) i ... i w&) (these subspaces, which are root subspaces for A, are contained in N) and, in more detail, Al, has a Jordan structure. We will discuss two classical examples from this point of view. Example 1. The difirentiation operator D = d/dt on the complex function space Cm(R) has all I E C for its eigenvalues (of infinite order). The root subspace w,(A) consists of all functions of the form z(t)e , where rc is a polynomial of degree
cp(D)x = 0
p(l) = fi (A - ij) j=l
>
has the form x(t) = i
7rj(t)e
j=l
where the rrj are arbitrary polynomials
of degrees < vi - 1 (1 < j < s).
Example 2. Let us consider the shift operator
cw,
= 5,+1
(n = 0, 1,2, . ..)
loo
Chapter 2. Foundations and Methods
on the space s of all complex sequences x = (t;,)$. As in the previous example, its eigenvalues fill out the whole plane @ and have infinite order. For il # 0 the root subspace V&(n) consists of all sequences of the form (a(n)l ;, where rt is a polynomial of degree < k - 1. Consequently the general solution of the linear difference equation with constant coefficients cp(T)x = 0
q(A) = fi (A - ljp; j=l
cp(O)z 0 >
has the form (n=0,1,2 where the 7cjare arbitrary polynomials
)... ),
of degrees
1.6. General Principles of Summation of Series. In this Section we will indicate a further interesting application of the methods of linear algebra to problems of analysis. Let L be any subspace of the space s of all complex sequences x = (5.): which is invariant under the shift operator T. The following definition (or if preferred, axiom of summation) is due to A.N. Kolmogorov (1925). A linear functional 0 on L is called a general&d sum (or a method of summation) of the series to + l1 + t2 + * * * if it satisfies the equation
a(Tx) - O(X) = - &,
(x = (Lk E L) (9) (i.e. the equation (T IL)#o - 0 = - &,, where &-, is regarded as a linear functional on L). Further, we shall say that L admits summation if a generalised sum exists on it. We denote the class of such subspaces by z. On the subspace C of those x for which the series to +
n N = 0.
We call the subspace M Z-maximal if M E z and there is no extension i@ 3 M in the class z which preserves the generalised sum on M. Every L E Z can be extended to a z-maximal subspace in such a way that the generalised sum on L is preserved. Theorem 2. Let M be an invariant subspace. Then M is Z-maximal if it is a complement of the subspace N in the whole space s.
Naturally,
Zorn
lemma is used in the proofs of the results listed.
if and only
0 1. Infinite-Dimensional
Linear Algebra
101
If L E z uniqueness of the generalised sum on L is equivalent to the absence on L of linear functionals p # 0 which are invariant with respect to shift: p(Tx) = p(x) (x E L). This in turn is equivalent to the surjectivity of the operator (T - lh.. The generalised sum is not unique even on the subspace I’, i.e. even under the condition of absolute conoergence: l&,1 + ICI 1 + ... < 00. However it can be shown that. on I’ there is a unique non-negative generalised sum, i.e. a(x) 3 0 if x 2 0 (i.e. if r. 2 0 (n = 0, 1, 2, . . .)); 1 must therefore coincide with the ordinary sum. For any L we put L, = {x: x E L, x 2 O}. Theorem 3. A non-negative generalised sum exists on a subspace L E C if and only if L, c 1
Let us now consider the invariant subspace Q of all sequences of the form where all the lj are different from 1 and 0 and the 7rj are t, = Cpl polynomials. Since Q n N = 0, we have Q E L: The operator T - 1 on Q is surjective and so the generalised sum on Q is unique. It can be described in a straightforward manner. To this end we note that if x E Q the power series c. e <,,[” converges on some disk Ill < r,. Its ordinary sum f,(c) (the generating function of the sequence x) is rational and its poles are ;Z:’ (1 < j < m). ?Tj(n);ljn,
Theorem 4. The unique generalised sum on Q is defined by the formula o(x) = f,(l).
The non-negativity requirement is not satisfied here. For example, if x = (A ; where A > 0 (1 # l), then cr(x) = (1 - A)In conclusion we note that Q is contained in all z-maximal subspaces. It is interesting that the generalised sum on each C-maximal subspace is unique. 1.7. Commutative Algebra. The term lgebra’ in this section means a commutative associative algebra with identity element e # 0 over some field K. The algebra @(S, K) of scalar functions on an arbitrary set S, or any of its subalgebras, can serve as an example. The algebras @(S, K) and all their subalgebras are called function algebras.
We stress that according to the definition of a subalgebra the identity e must belong to it. In contrast to this, ideals do not contain the identity (with the exception of the ideal which coincides with the whole algebra). The presence of the identity allows us to consider the field K as being embedded in the algebra. In this way the identity of the field K is identified with e (and in general c1= ae). An ideal M of an algebra 2l is said to be maximal if M # 9I and there is no ideal fi 1 M which is different from M and 9I. With the help of Zorn lemma it can be shown that any ideal I # % is contained in some maximal ideal (theorem of Krull). Therefore in ‘$Ithere exists at least one maximal ideal. An algebra with a unique maximal ideal is said to be local. A very simple example is provided
102
Chapter 2. Foundations and Methods
by 2I = K i E with multiplication (LX+ x)(/I + y) = ap(a, /I E K, x, y E E)16. The subspace E c ‘8 is the unique maximal ideal. In any algebra ‘$Ieach element a generates the ideal (a) = { y: y = ax (x E 2l)}. The set of non-invertible elements therefore coincides with the union of all maximal ideals. Consequently an algebra 2I is a field if and only if its unique maximal ideal is the zero ideal. If I # ‘3 is an ideal in 2I then the factor algebra 2l/I is a field if and only if the ideal Z is maximal. The radical of an algebra Iu is the set Rad ‘2Iof those elements a E ‘$Isuch that e - ax is invertible for all x E ‘$I. Theorem. The radical of an algebra 9I coincides with the intersection maximal ideals.
of all its
Proof. If M is a maximal ideal and a E M then e - ux 4 M and if this holds for all M it follows that e - ax is invertible for all x E ‘9I,i.e. a E Rad ‘3. Conversely let a E Rad ‘3l and suppose that a $ M for some maximal ideal M. Then the image of the element a in the field ‘%Z/Mis invertible, i.e. there exists x E ‘2I such that ax E e (mod M). But then e - ax E M which implies the non-invertibility of this element. 0 Corollary
1. Rad 2I is an ideal.
An algebra 9I is said to be semisimple if Rad 2I = 0. Corollary
2. For any algebra 2l the factor algebra %Z/Rad 9I is semisimple.
If the element invertible:
a E ‘3 is nilpotent,
i.e. urn = 0 for some m, then e - a is m-l
(e
-
a)-’
=
1 k=O
uk.
But if a is nilpotent so also is ax for any x E 9I. Therefore all nilpotent elements are contained in the radical of the algebra. They form an ideal called the nil radical. The nil radical coincides with the intersection of all simple ideals (an ideal I # 9I is said to be simple if there are no divisors of zero in the factor algebra ‘%/I). If E is finite-dimensional its nil radical coincides with its radical. For each maximal ideal M in an algebra 2I the field ‘+3/M is an extension of the field K, for the latter is canonically embedded in 5X/M by means of the chain of natural homomorphisms1 7:
We call an algebra spectral if U/M = K for all M, i.e. the codimensions of all maximal ideals are equal to one. This condition is a fundamental axiom of the I6 In general if 21L,is an algebra without identity the construction = K i I, with multiplication (a + x)(B + y) = a/3 + (ay + fix + xy) is called the unitisation of 9llo. I7 We stress that a homomorphism of algebras with identities must by definition map identity to identity.
0 1. Infinite-Dimensional
Linear Algebra
103
general spectral theory set out below, the principal prototypes of which are to be found in the spectral theory of operators, in the theory of commutative Banach algebras and in algebraic geometry. In what follows the algebra 2I is assumed to be spectral. The spectrum of an element a E 2I is the set spec a of those 1 E K for which the element a - le is not invertible ‘* . The remaining I E K are said to be regular, the set of regular values is denoted by reg a and it is called the resolvent set for a since it is the domain of definition of the resolvent R,(u) = (a - le)-‘. Example. If a = ue then spec a = {a}, R, = (a - I)- . In particular {0}, spec e = (1). We note two fundamental relationships for the resolvent: uR, = e + AR,;
spec 0 =
R, - R, = (A - ,u)RAR,.
The second of these is called Hilbert identity and is easily deduced from the first. We denote by 93 the set (space) of maximal ideals of an algebra 2I and for each ME 9.R we denote by u(M) the image of the element a E in X/M. Since ‘%/M = K we obtain a scalar function ii on W. For invertibility of the element a it is necessary and sufficient that d(M) # 0 for all M, i.e. that a”be invertible in the function algebra @(%I&K). The mapping a H B is an algebra homomorphism: ‘$I+ @(9.X, K). It is called the canonical functional representation of the algebra 2I. Its kernel coincides with Rad ‘$I and its image % is a subalgebra of @( IR,K); moreover it is sufficiently rich in the sense that the functions in @ separate the points of the space W: if M, # M2 then M, $ M, and then for a E M, \M, we have ii = O,Z(M,) # 0. If 2I is semisimple (and only in this case), ‘$I is isomorphic to I. Thus any semisimple spectral algebra can be considered as an algebra of functions. Theorem. element a.
The set of values of the function
a”coincides with the spectrum of the
Proof. We have: spec a = {A: 3M E 2X s.t. a - i2e E M} = {A: 3M E !UI s.t. ii(M) = A}. 0 Corollary
1. The spectrum of any element is non-empty.
Non-invertibility to be quusinilpotent Corollary algebra.
of an element a implies that 0 E spec a. An element a is said if spec a = (0).
2. The set of quusinilpotents
Thus any nilpotent
is quasinilpotent.
coincides
with
the radical
of the
A more general result holds.
I8 For this and the definitions which follow immediately it is not necessary that the algebra be spectral or even commutative.
104
Chapter 2. Foundations and Methods
Corollary 3. Zf a is an algebraic element and cp is its minimal polynomiaP9, then cpfactorises over the field K into linearfactors and spec a coincides with the set of roots of the polynomial cp. Proof. It follows from p(a) = 0 that q@(M)) = 0 and so spec a is contained in the set of roots. Let rc/ be an irreducible (over K) factor of the polynomial cp. Because cp is minimal, the element $(a) is non-invertible. Consequently $(6(M)) = 0 for some M and therefore $ is linear and its root belongs to spec a. 0
In particular, if a is idempotent, the identity then spec a = (0, l}. Corollary 4 (Spectral Mapping spec e(a) = B(spec a) holds.
i.e. a2 = a, and a is different from zero and Theorem).
For any polynomial
0 the identity
This 0(a) is invertible if and only if e(n) # 0 (A E spec a). Consequently we have the functional calculus p H p(a) for all rational functions p which are regular on spec a and spec p(a) = p(spec a). For a maximal ideal M the natural homomorphism X~M:‘9 -P ‘%/M = K is a character, i.e. a linear functional on % which is multiplicative in the sense that XM(ab) = XM(a)XM(b)(XM # 0 and in fact X,,Je) = 1). The mapping M H X~ of the space of maximal ideals to the set of characters is bijective. Let us consider some specific examples of spectral algebras. Example 1. An algebra % over the field [F2 = (0, l> is said to be Boolean if all its elements are idempotent, i.e. the identity x2 = x is satisfied in 9I. Any Boolean algebra 2l is spectral since B/M is a Boolean field for any maximal ideal M and then M/M = IF, since the identity element is the unique invertible idempotent. We now observe that any Boolean algebra ‘%is semisimple since, if a E Rad 2l, we have spec a = (0) so that a - e is invertible and then from the identity a(a - e) = 0 it follows that a = 0. The image ‘$I of the algebra ‘3 under the canonical functional representation is isomorphic to 2L However the functions G E % take only the values 0 and 1 and can therefore be identified with the corresponding subsets of the space 93 of maximal ideals. Moreover a sum transforms into a symmetric difference while a product becomes an intersection. Thus we obtain the result that any Boolean algebra is isomorphic to a certain algebra of sets (theorem of Stone). Example 2. Let K = [F, be any finite field (card K = q = pm, where p is prime and m > 1). The identity x4 = x holds in K. If it holds in a field E 1 K then E = K since the equation x4 = x cannot have more than q roots in E. Therefore if this identity holds in an algebra 9I over K then ‘3 is spectral. Also it is semisimple 19These concepts are defined just as in the case of a linear operator. An algebraic operator on the space E is an algebraic element of the algebra L(E) and its spectrum in L(E) coincides with the spectrum in the previous sense if the minimal polynomial of the operator factorises into linear factors over the field K.
4 1. Infinite-Dimensional
Linear Algebra
for the same reason as in the case 4 = 2. Therefore 9I is isomorphic subalgebra @(‘9JI,IF4)(YJI is the space of maximal ideals). Example 3. If the field K is algebraically satisfies the condition
105
to some
closed, any algebra ‘$I over K which
dim ‘9I < card K
(10)
is spectral. In fact, let M be a maximal ideal, i.e. E = %/M is a field. If x E E\K then x - 1 is invertible for all ;1 E K. Since the set of elements of the form (x - A)‘-’ (1 E K) has power card K, it is linearly dependent. Hence it follows that x is algebraic over K and so x E K, which is a contradiction. Condition (10) is satisfied if the algebra 9I has an infinite family I of generators such that card I < card K; consequently such an 2I is spectral. This conclusion remains valid for finitely generated algebras but here a completely different argument is required; this uses the well-known Nullstellensatz of Hilbert. We can now generalise the last result in the following way. Let us consider the algebra K [X] of polynomials in any collection X = { xi}is I of indeterminates over the field K. It is clear that each polynomial f depends only on some finite collection of indeterminates. For any t E K’ the value f(t) E K is defined naturally and if f(t) = 0 then t is called a root of the polynomial J A common root of all polynomials belonging to a certain set is called a root of the set (without loss of generality we can take the set to be an ideal in K[X]). Theorem of Lang.
Zf the field K is algebraically closed and card X < card K
then any ideal J # K[X]
(11)
has a root.
satisfied and we come back Proof If X is finite, condition (11) is automatically to Hilbert Nullstellensatz. Suppose X is infinite. We can assume that J = M, a maximal ideal. The required root is t = (ti)isl where ti is the image of the polynomial Xi in the field K[X]/M = K. 0 Corollary. Suppose that the field K is algebraically closed and that (11) is satisfied. For consistency of a system of algebraic equations of the form
f,(t) = 0
(fv E KC-U, v E NJ
in the unknown t = (ti)iot (N is some index set), it is necessary (trivial) and suflicient that each of its finite subsystems be consistent.
In fact, the ideal J c K [X] which is generated by the system (f,), EN is different from K[X]. In the contrary case 1 E J, i.e. there exists a collection f,,, . . . , f,, such that Ill forsomeg,,..., g,,, in K[X]; m) is not consistent,
but then the system of equations fvj(t) = 0 (1 G j G
106
Chapter
Condition
2. Foundations
and Methods
(11) is necessary as the following example shows: L(L
-
a)
=
1
(a
E K,
X
=
{X,),,
K in
(%>h
where o is a fixed index which does not belong to K. Let us now consider the function algebra @(S, K) for arbitrary S, K. Associated with each point s E S there is a character cpH p(s) and correspondingly there is a maximal ideal A4, = {cp: q(s) = 01. Clearly @(S, K)/M, = K. The maximal ideals MS are said to be standard. If S is finite all maximal ideals are standard and the algebra @(S, K) is spectral. Suppose that S is infinite. Let us consider the ideal G+,(S, K) of functions with finite support. Every maximal ideal which contains it is non-standard and, conversely, any non-standard maximal ideal M contains @J&S,K) since M $ M, for any s E S, i.e. there exists II/, E M such that Ii/,(s) = 1, and then 6, = I,~~,,S, E M. Theorem. Zf S is infinite and card S < card K then for any non-standard maximal ideal M c @(S, K) the field E = @(S, K)/M is infinite-dimensional over K. Proof. We take an injective function E E @(S, K) and show that the images of its powers in E are linearly independent. Suppose in contradiction to this that x(s) E M for some polynomial rc # 0. The set of roots of the equation X(&(S)) = 0 is finite and so its characteristic function x has finite support. Consequently x E M and rc(s) + x E M. But R(E(s)) + x(s) # 0 for all s, i.e. B(E) + x is invertible. 0 Corollary
1.
Zf S and K are infinite then @(S, K) is not spectral.
A proof is required only in the case card S > card K. But then there exists R c S such that card R = card K and, since R is infinite, the algebra @(R, K) is not spectral. But @(R, K) x @(S, K)/J, where J is the ideal of functions which vanish on R. Moreover a factor algebra of any spectral algebra is also spectral. This follows from the general result: if h: ?I + 2 is a surjective algebra homomorphism then the preimages of the maximal ideals of the algebra f? are all maximal ideals of the algebra 9I which contain Ker h; further a/hx f!/M. We remark incidentally that if the field K is finite, K = IF,, then the algebra @(S, K) is spectral since the identity (pq = cpis satisfied in it. Corollary 2. Zf S = N = (0, 1,2, . . .} and K is any infinite field then for any non-standard maximal ideal M c @(S, K) the field E = @(S, K)/M is infinitedimensional over K.
The field E is said to be an ultrapower of the field K. The ultrapowers of the field R! are in fact those of its extensions which are introduced in non-standard analysis2’ by means of ultrafilters. This approach is equivalent to the previous one because of the l-l correspondence between the maximal ideals M c @(S, K) and the ultratilters U on S: M(U) = {q: 3A E U s.t. (~1” = O}. n this context it is interesting to note that we can define the standard field R as the factor algebra of the algebra of fundamental sequences over the field Q of rational numbers by the maximal ideal of sequences tending to zero.
8 2. Convex Analysis
107
Among the function algebras on a topological space S the algebra C(S) of continuous functions (real or complex) first attracts attention. If S is compact, all maximal ideals are standard (and consequently C(S) is spectral). In fact, if the maximal ideal M is such that for any point s E S there exists a function cp,E M with cp,(s)# 0, then cp,(t) # 0 in some neighbourbood V, of the point s. By choosing a finite covering Kl, . . . , Vs we , obtain the function
which has no zero, i.e. it is invertible in C(S); this is a contradiction. Thus if S is compact, the space of maximal ideals of the algebra C(S) is naturally identified with S. In the non-compact situation we usually impose some condition t infinity’ on the functions; for example, we require boundedness, i.e. in place of the too extensive algebra C(S) we consider the algebra CB(S). If S is completely regular (but in general not compact) the space of maximal ideals of the algebra CB(S) is naturally identified with the Stone-Tech compactilication j?(S) and CB(S) is identified with C@(S)) (I.M. Gel and - A.N. Kolmogorov, 1939). If S is compact, then C(S) is a Banach algebra under its usual norm. Algebras of continuous functions on compact spaces are important not only because they occur in analysis but also because of their special role in the general theory of commutative Banach algebras. We remark here that a commutative Banach algebra over the field of complex numbers is spectral (Ge&nd - Mazur theorem’ ’).
In the Wiener algebra W all maximal ideals are standard. Thus if a function f E W has no zero then it is invertible in W.
9 2. Convex Analysis From now on the ground field K will always be R or @ and where possible we will not distinguish these cases. As before the underlying space will be denoted by E. 2.1. Convex Sets. A set X c E is said to be conuex if for each pair of points x1, x2 E X the mterval
[Xl, x2] = {x: x = 7x1 + (1 - 7)x2 (0 < r < l)} is contained in X. Any subspace is convex. Any point of the form x = k 7,x,
1,7,20 ) ( g1 =, = is said to be a convex combination of the points x1, . . . , x,. k=l
(12)
The analogous assertion over R is false. For example, the Banach algebra C over R is not spectral. See the proof of the Gel and-Mazur theorem in Section 4.17.
108
Chapter
2. Foundations
and Methods
Theorem of CarathCodory. Zf E is finite-dimensional, dim E = r, then any convex combination of the form (12) is a convex combination of some r + 1 points from the collection (x&. In order that a set X c E be convex it is necessary and sufficient that for each finite collection (xk)l of points of X all its convex combinations belong to X. For any set X c E the set of all convex combinations of all possible finite collections (x& of points of X is convex. This is the smallest convex set containing X. It is called the convex hull of the set X and is denoted by Co X. If we note that the intersection of any family of convex sets is convex, we can define Co X equivalently as the interesection of all convex sets containing X. The convex hull of a finite collection (x& is called a conuex polytope. For n = 1 this is a point and for n = 2 it is an interval. We note further that the sum of any family of convex sets is convex. For example, all affine manifolds x + L (x is a point and L is a subspace) are convex. A point x is said to be an algebraic interior point of the convex set X if x E X and for each u E E there exists E > 0 such that [x, x + ED] c X. The set of algebraic interior points of a set X is called its algebraic interior and is denoted by ai X. It is easy to see that ai X is convex. If x E ai X and y E X then the half-open interval [x, y) = [x, y] \ ( y> is contained in ai X. For fixed x E ai X the union of the half-open intervals [x, y) (y E X) coincides with ai X. A convex set X is said to be algebraically open if all its points are algebraic interior points, i.e. X = ai X. For any convex set its algebraic interior is algebraically open. A convex set X is said to be algebraically solid if its algebraic interior is non-empty. A set X c E is said to be circled if it is invariant with respect to the action x H lx of the group [[I = 1. In the real case this means central symmetry, i.e. xeX*--XEX. A set X c E is said to be balanced if it is invariant with respect to the action x H px of the semigroup 1pi < 1. For a convex set X this property is equivalent to being circled. A balanced convex set is said to be alsolutely convex. Any subspace is absolutely convex. Any point of the form x
=
k k=l
PkXk
($/k~+
is called an absolutely conuex combination of the points x1, x2, . . . , x,. In order that a set X be absolutely convex it is necessary and sufficient that for each finite collection (xk)l of points of X all of its absolutely convex combinations belong to x. The intersection of any family of absolutely convex sets is absolutely convex. If X is absolutely convex then ai X is also absolutely convex. The absolutely convex hull of an arbitrary set X c E is defined (in two equivalent ways) similarly to the convex hull.
0 2. Convex Analysis
109
A set X c E is said to be conal if it is invariant with respect to the action x H ;Ix of the group 1> 0. A conal convex set is call a wedge. Any subspace is a wedge. In order that a conal set be a wedge, it is necessary and sufficient that it also be an additive semigroup (i.e. invariant with respect to addition). Any point of the form k=l
is called a non-negative combination of the points x1, . . . , x,. In order that a set X be a wedge it is necessary and sufficient that for each finite collection (x& of points of X all its non-negative combinations belong to X. The intersection of any family of wedges is a wedge. For any set X c E the set of all non-negative combinations of all possible collections (xk)l of points of X is the smallest wedge which contains X, i.e. it coincides with the intersection of all wedges containing X. If K is a wedge, the intersection &,=r CK, if non-empty, is the largest subspace contained in K. It is called the edge of the wedge. A wedge is said to be a cone if its edge is zero or empty. The closed ray {y: y = ;Ix (2 2 0)} and the open ray {y: y = Izx (A > 0)} defined by a non-zero vector x are cones. In the first case the edge is zero, in the second it is empty. For any wedge K # E its algebraic interior ai K is a cone (its edge is empty). A wedge K is said to be generating if ,& cK = E* Convex sets of one type or another arise naturally in connection with linear inequalities. Let F = (f;:)iE, be some system of real linear functionals on E. Then the set of solutions of the system of linear inequalities fitx) 2 Vi (i E 1)~
(13)
where the ri are arbitrary real numbers, is convex and the set of solutions of the corresponding homogeneous system fi(x) > 0 (i E 1)
(13,)
is a wedge. Its edge is the space of solutions of the homogeneous system of linear equations A(x) = 0 (i E I). The wedge of solutions of system (13) is therefore a cone if and only if the system F is total. For the strict analogue of system (13,), i(x) > 0 (i E I), the set of solutions is a cone’ (it may be empty). 2.2. Convex Functionals.
A real functional
q on2 E is said to be subadditive if
the triangle inequality 4(x + Y) G 4(x) + Cl(Y) 1It is suffkient that just one of the inequalities be strict. 2 Or on an additive subsemigroup Q t E.
(~9 Y E E)
110
Chapter 2. Foundations and Methods
holds for it. This property implies the other inequalities: 4(x - Y) 2 4(x) - 4(Y);
0 < q(0) < q(x) + 4(--x).
It follows from the latter that max(q(x), q( -x)) 2 0. The set of subadditive functionals contains all non-negative constants and all real additive functionals. A non-negative combination of subadditive functionals is subadditive, i.e. the subadditive functionals form a wedge. The least upper bound of any family of subadditive functionals which is pointwise bounded above is subadditive. A further useful construction is the composition o 0 q, where W: R, + R, is a non-decreasing subadditive function (for example, o(5) = r/(1 + c) ({ 2 0)) and q is a non-negative subadditive functional; clearly under these conditions the functional o o q is subadditive. A subadditive functional q on3 E is said to be sublinear if it is positively homogeneous, i.e. q(Ax) = Iq(x) for all I > 0, x E E. In this case q(0) = 0, q(crx) < aq(x) for all real a and @k 2 O)q k$ xk G k$ Lkdxk) ( > The set of sublinear functionals is a wedge. The least upper bound of any family of sublinear functionals which is pointwise bounded above is sublinear.
Example. Let {fi}ie~ be a family of real linear functionals which is pointwise bounded above. Then the functional q(x) = sup&(x) is sublinear. A real functional g on4 E is said to be convex if g
(it is sufficient for this to be satisfied for n = 2). All sublinear functionals are convex but the class of convex functionals is considerably more extensive. For example, if o: R + I&! is any non-decreasing convex functiot?, then for any convex functional q the composition o 0 q is convex. The set of convex functionals is a wedge. The least upper bound of any family of convex functionals which is pointwise bounded above is convex. Each convex functional q generates two families of convex sets: {x: q(x) < a},
{x: q(x) < a}
(CI E R).
A set X c E is said to be absorbent if for each x E E there exists 1> 0 such that I-lx E X. In other words E = UlzO 1X. Clearly any absorbent set contains zero. If X is convex a necessary and sufficient condition for absorbency is that OEaiX. r on some wedge K c E. 40r on a convex set X c E. 5For this it is necessary and suffkient that o be the integral of a non-negative, non-decreasing function.
111
Q2. Convex Analysis
Let X be a convex absorbent set. Since in this case I,X c I,X (0 < il, < A,), it is natural to introduce for X its gauge (or Minkowski functional): q&)
= inf{l:
A > 0, x E AX}
(x E E).
This is a non-negative6 sublinear functional which lmost’ defines the set X: q&) < 1 ox E ai X. Therefore, if X is algebraically open, we have X = (x: qx(x) c. l}. In general qx(x) G 1 for x E X. Conversely, for any non-negative sublinear functional q, the set D4 = {x: q(x) < l} is convex, absorbent and algebraically open and its gauge coincides with q. The set 0, is called the q-unit bull. We have a l-l correspondence q H D4 between the non-negative sublinear functionals and the convex absorbent algebraically open sets. If we remove the requirement of being algebraically open the collection of gauges is not increased since qx = qaix. 2.3. Seminorms and Norms. A non-negative sublinear functional p with the property that p([x) = p(x) if ICI = 1 is called a seminorm. In other words, the following are the defining properties of seminorms: 1) p(x) 2 0;
2) p(x + y) G p(x) + P(Y);
3) I
= HP(x).
The last property is called absolute homogeneity. Example
1. If g is a sublinear
functional
then p(x) = supI+, g(cx) is a
seminorm. Example 2. If f is a linear functional then IfI is a seminorm. The gauge of any absolutely convex absorbent set is a seminorm. The mapping p H Dp is a bijection of the set of seminorms onto the set of absolutely convex, absorbent, algebraically open sets. This property allows us to introduce seminorms in a purely geometric way. Associated with each seminorm p is the subspace Ker p = {x: p(x) = 0}, its kernel. Seminorms with zero kernel are norms. In other words, norms are those seminorms whose unit spheres contain no rays. It follows from the inequality IP(X) - P(Y)1 G p(x - Y)
that if x - y E Ker p then p(x) = p(y). Thus in a natural way p generates a seminorm on the factor space E/Ker p. This factor seminorm is a norm. The set of all seminorms (norms) is a cone. The least upper bound of any family of seminorms which is pointwise bounded above is a seminorm. Example. Let F = (fi}i Er be a pointwise bounded family of linear functionals. Then the functional p(x) = sup I.mI
6qx(x) = 0 (x # 0) if and only if the ray defined by the vector x is contained in the set X.
(14)
112
Chapter 2. Foundations and Methods
is a seminorm. We will see in what follows that this is a universal method for constructing seminorms. In order that the seminorm (14) be a norm, it is necessary and sufficient that the family F be total. In the general case Kerp = (I Kerfi. I 2.4. The Hahn-Banach Theorem. Let 4 be a sublinear functional on E and f a real linear functional on a subspace L c E which satisfies the condition f(x) < q(x) (x E L). Then f can be extended to the whole space E in such a way that the majorant 4 is preserved, i.e. there exists a real linear functional f on E such that f , = f and f(x) < q(x) (x E E). The proof is carried out with the help of Zorn lemma. The basic step consists of extending by one dimension, i.e. to a subsFace 1 = L + Lin, {e} where e 4 L. It is sufficient to choose f(e) so that f(x) + c$(e) < 4(x + ae) for all c1E R, x E L. But it is easy to see that
sup f(X) - 4(X - 4 ~ c1 xEL,aso
inf
4(Y + Be) - f(Y)
xsL,po
B
.
Any value of f(e) lying between these bounds satisfies the required condition. This theorem is one of the basic principles of linear functional analysis. Corollary 1. Let p be a seminorm on E and f any linear functional on a subspace L c E which satisfies the condition If(x)1 < p(x) (x E L). Then there exists a linear functional f on E such that f L = f and If(x)1 < p(x) (x E E).
ProoJ If E is a real space we can extend fin such a way that f(x) < p(x) (x E E). Substituting -x for x here, we obtain the required result. In the complex case it is sufficient to extend the real linear functional g = re f so that g x)< p(x) (x E E) and then to put f(x) = g(x) - @(ix). 0 A linear functional f on E is said to be subordinate to a seminorm p if there exists c 2 0 such that If(x)1 < cp(x) (x E E). The greatest lower bound of such c is called the p-norm of the functional f and is denoted by Ilf 11,.Thus
IfWl G IIf II,PW (x E 9 and it is impossible to substitute a smaller constant in place of the p-norm. Clearly ifp#O
The set of linear functionals subordinate to a fixed seminorm p is a linear space and moreover II. lip is a norm on this space. Corollary 2. Let p be a seminorm on E and let f be any linear functional on a subspace L c E which is subordinate to p. Then f can be extended to the whole of E without change of p-norm, i.e. there exists a linear functional f” on E which is subordinate to p and such that fl, = f and Ilf l, = IIf lip.
$2. Convex
This follows from Corollary
Analysis
113
1 on replacing p by Cp, where C = )If 11 p.
Corollary 3. Let p be a seminorm on E, let x,, E E and suppose that p(x,) # 0. Then there exists a linear functional f which is subordinate to p and such that fbd = I and Ilfll, = 1.
Any such functional is said to be p-supporting for the vector x0. It can be constructed by defining the linear functional f-Jax,,) = ap(x,) on the one-dimensional subspace Lin{x,} and then extending f. to E without change of p-norm. Let us now denote by Sz the set of linear functionals which are subordinate to p and such that Ilfll, = 1. Then, as a result of Corollary 3, p(x) = max If(x)1 = max re j(x).
JEq feS$ Thus any seminorm is the maximum of some family of linear functionals (necessarily subordinate to it). The following more general assertion is established similarly to Corollary 3: if q is a non-negative sublinear functional on E, then for each vector x0 there exists a q-supporting real linear functional f: f(xo) = 4(x0),
f(x) G 4(x)
(x E E).
2.5. Separating Hyperplanes. From here up to the end of Section 2 the basic space E is assumed to be real. Let us consider an arbitrary hyperplane H = {x: f(x) = a} (f is a non-zero linear functional, a E R). The complement E\H is divided into two algebraically open half-spaces’ H, = (x: f(x)
> a},
H- = {x: f(x) < a>.
We also put i?, = H, u H = E\H- and H- = H- u H = E\H+. All these sets are convex. For any two non-intersecting sets X, Y c E we say that a hyperplane H (or a linear functional f which defines it) separates X and Y if one of these sets lies in H- and the other in fi+ or one lies in H, and the other in fi-. Clearly this is equivalent to the assertion that H separates the convex hulls Co X and Co Y; it is therefore enough to study the separation of convex sets. The fundamental results in this direction are due to Ascoli (1932), Mazur (1933) and Eidelheit (1936). Theorem. For any non-empty convex algebraically open set X and any nonempty convex set Y disjoint from X there exists a separating hyperplane. Proof. Let x,, E X, y, E Y and put z. = y, algebraically open set D = X - Y + zo. Since bent. Since X n Y = 0 we have that z. # D a linear functional f which is q,-supporting f we replace interchanged.
f by (-f)
and a by (-a)
the hyperplane
- x0. Let us consider the convex 0 E ai D it follows that D is absorand so qn(zo) > 1. We construct at the point zo. Then if x E X, H is not
altered
but H,
and H-
are
114
Chapter
2. Foundations
and Methods
y E Y we have f(x - y + zo) < 1 and hence f(x) < f(y) and c1E ~up,~~f(x) < inf,,, rf(y). Since X is algebraically open, the upper bound c1is not attained. If we now take the hyperplane H = {v: f(v) = a} then X c H- and Y c H,. Cl We shall say that a hyperplane
H (or a linear functional
f which defines it)
almost separates the sets X, Y c E if one of them lies in H- and other in H,
(moreover X and Y can even intersect). Here we can also restrict attention convex sets. From the theorem on the separating hyperplane follows
to
Corollary 1. Let X and Y be convex sets with X algebraically solid and Y non-empty and disjoint from ai X. Then there exists a hyperplane which almost separates X and Y
In fact a hyperplane which separates ai X and Y almost separates X and Y Remark. For any two non-intersecting non-empty convex sets in a tinitedimensional space there exists a hyperplane which almost separates them. The fundamental result, which is generally known as the geometric form of the Hahn-Banach theorem, follows in turn from Corollary 1. Corollary 2. Let X be a convex algebraically solid set and A an afine manifold such that A n ai X = 0. Then there exists a hyperplane H 3 A such that
HnaiX=IZI(i.e.aiXcH-oraiXcH+). Proof. Let G = {v: f(v) = a} be a hyperplane which almost separates X and A: f(x) < CI(x E X), f(y) 2 a (y E A). If A = x0 + L, where L is a subspace, we have f(z) > u - f (xe) (z E L) and so f(z) = 0 (z E L). Consequently, the hypercontains the manifold A. Moreover f(x) < f(xo) plane H = {Y: f(y) = f(x,)) (x E X) and hence f(x) < f(x,,) (x E ai X), i.e. ai X c H- . 0 Corollary 3. Any convex algebraically open set X # E is the intersection of some family of algebraically open half-spaces.
In fact, for each point u $ X there exists a hyperplane H(v) such that v E H(v) and X c H-(v). Since moreover v $ H-(v) we have X = f-) H-(v). 0+x
In other words, X coincides with the set of solutions of a system of linear inequalities of the form f,(x) < CI, (v $ X). Now let X be a convex algebraically solid set, X # E. Then from the above x =
ix H-(v),
i.e. X is contained in the set of solutions of a system of linear inequalities form f,(x) < u, (v 4 ai X).
of the
115
$2. Convex Analysis
A hyperplane H = {u: f(u) = cz> is said to support* wxEXfb) = a. Corollary
4. Zf X is a convex algebraically
a convex set X if
solid set different from E, then
where S is the family of support hyperplanes of X. 2.6. Non-negative Linear Functionals. Associated with each wedge K is the wedge’ K # of linear functionals which are K-non-negative in the sense that f(x) 2 0 (x E K) (notation: f 2 0). All functionals f 2 0 (f # 0) are positive on the algebraic interior of the wedge K. In the case where K is a cone, a functional f is said to be K-positioe (f > 0) if f(x) > 0 for all x E K, x # 0. Theorem of M.G. Krejn. Suppose that the subspace L c E contains an algelinear braic interior point 2 of the wedge lo K . Then any (K A L)-non-negative functional f on L can be extended to a K-non-negative linear functional on E. Proof. If f # 0 then Ker f n ai K = 0. Therefore Ker f is contained in some hyperplane H = (x: g(x) = 0} such that H n ai X = 0. The functional g provides the required extension. 0 Corollary. For any algebraically negative linear functional g # 0.
solid wedge K # E there exists
a K-non-
Proof. If R E ai K then (-A) $ K since K # E. On the line L = Lin{R} there is the (K n L)-non-negative linear functional f # 0: f(az2) = a. It remains to extend it to a K-non-negative functional on E. 0 Example 1. Let S be a space with a finite measure p, let E = B(S, R), the space of bounded functions S + [w and let K = B, (S, R), the cone of non-negative functions. Its algebraic interior ai K consists of those functions with positive lower bound. Let us consider the subspace L = L’(S, p) of summable functions and the (K n L)-non-negative linear functional
Jb) =
cpdp fS on it. Since 1 E L n ai K, the functional J can be extended to a K-non-negative linear functional J”on B(S, R). Now for any set M c S we can put &kf) = J(xM), where xM is its characteristic function. With this we have constructed a finitely additive extension of the measure p to the algebra of all subsets of the set S. Correspondingly, *As above, this term carries over to functionals. We note that if q is a non-negative sublinear functional then a functional which is q-supporting at the point x,, (4(x,) # 0) is a support functional for the set {x: q(x) < 4(x,)}. 91t is a cone if the wedge K is generating. his wedge must therefore be algebraically solid.
116
Chapter 2. Foundations and Methods
where the integral can be interpreted as before, i.e. as the least upper bound of Lebesgue integral sums. Naturally, this integral is only finitely additive. From what has been said it is also clear that the integral with respect to a finite, finitely additive measure gives the general form of a non-negative linear functional on B(S, R). For the space C(S) of continuous functions on a compact space S we have the theorem of F. Riesz on the general form of a non-negative linear functional:
f(d=scp dp, X
where p is a regular Bore1 measure” which is uniquely determined functional J The first step of the proof is the extension off to B(S).
by the
Example 2. Many variants of the classical moment problem are to be found in the following abstract scheme. In the space E there are given a wedge K (0 E K) and some system U = (Ui)isl of vectors such that Lin U n ai K # 0. It is required to find a K-non-negative linear functional f which satisfies the system of equations s(Ui) =
Ci
(i E I),
where the ci are given scalars (the moments of the required functional f with respect to the system U). It turns out that for solvability of this abstract moment problem it is necessary and sufficient that the implication & aiui E K * gI aici 2 0 should hold for systems of scalars (cli)isl with finite support. We only need to establish the sufficiency. Without loss of generality we can assume that the system U is linearly independent. Then f is uniquely defined on L = Lin U by the given equations and from the condition we see that it is (K n L)-non-negative; finally we can extend it to a K-non-negative linear functional on the wholel’ of E.
i i A Bore1 measure is said to be regular if its value on each Bore1 set M coincides with the supremum of its values on the compact sets Q c M and with the inllmum of its values on the open sets G = M. In what follows, all measures on locally compact topological spaces are assumed to be regular Bore1 measures (or this property is to be proved). he problem of constructing a linear functional f which is subordinate to a given seminorm p and satisfies a system of equations f(uJ = ci (i E I) can be solved similarly with the help of the HahnBanach theorem. A criterion for solvability is that the inequality I;, aicil < Cp(z,
a,,,>
(C is a positive constant)
should hold for all (c&, , oftinite support. Moreover the solution will satisfy the condition 11f I&,,< C.
117
$2. Convex Analysis
In particular let us consider from this point of view the trigonometric problem II ein*da(x) = c, (n = 0, f 1, f2, . . .). s -C Here we are looking for a non-negative linear functional
f(v)=
moment
z rp(x)wd s -X
on C( -71, rr) which satisfies the equations f(e,) = c, (e,(x) = einx, n = 0, +_1, +2, . . .). A criterion for solvability is that, for each non-negative trigonometric polynomial T(x) = En a,einx, the combination Cna,c, must be non-negative. But, by the well-known theorem of Fejtr and F. Riesz, the condition T(x) 2 0 is equivalent to 7’(x) = ( V(x)12 where V is a trigonometric polynomial. The Riesz-Herglotz criterion (positivity of the matrix (c.-,)~,=,) follows easily from what has been said. 2.7. Ordered Linear Spaces. This class of spaces was introduced and first studied by L.V. Kantorovich (1935). A linear space E is said to be ordered if there is given on it a partial ordering which is compatible with its linear structure: 1) x
2) x
(a>O).
It follows from property 1) that (x < y) A (z < w) * x + z < y + w and moreoverxay
y > 0 o y E K, so that x G y o y - x > 0. Conversely, if the space zero and
E is already ordered, the set K = {y: y > 0} is a cone containing
xdy-sy-XEK. The elements of the cone K are said to be non-negative in accordance with their characterisation by the inequality y 2 0. Example. The linear space @(S, [w)of all real functions on any set S is ordered pointwise: cp < +-+cp(s) < e(s) (s E S). This ordering is defined by the cone of non-negative functions. Any subspace L of an ordered linear space E is automatically ordered, the corresponding cone being K n L where K is the cone in E. We now describe the standard method of ordering the space of homomorphisms Hom(E, El), where E, is also an ordered linear space and Q c E, is the corresponding cone. We assume that the cone K in E is generating. Then the homomorphisms h: E + E 1 which map K into Q form a cone in Hom(E, E 1) and the corresponding order is defined by this. In particular, h > 0 means that x 2 0 => hx 2 0 and, because of the linearity of the mapping h, this is equivalent to its monotonicity: x < y Z+ hx < hy.
Chapter 2. Foundations and Methods
118
Taking E, = Iw(Q = W,) we obtain the natural ordering on the space of linear functionals on E. It is defined by the cone K#. We note further here that the definitions of subadditivity, sublinearity and convexity used in Section 2.2 for real functionals carry over word for word to mappings of any linear space into any ordered linear space. Of special interest among ordered linear spaces are lattices, in which (by definition) for each pair of vectors x, y there exists a supremum sup& y), their least upper bound’ 3: sup(x, y) = min(z: z > x, z > y}. We note immediately
that14
sup(x + z, y + z) = sup(x, y) + z, sup(lx, ny) = 1 sup(x, y) Example.
(2 > 0).
The space @(S, [w)is a lattice: (sup@, $))(s) = max(cp(s), $6)).
If an ordered linear space is a lattice it is called a oector (or linear”) lattice or a Riesz space. A subspace L of a vector lattice is called a sublattice if sup(x, y) E L for all x, y E L. For example, @,,(S, Iw) is a sublattice of @(S, [w). A cone K which orders a space E is said to be minihedral if sup(x, y) exists for all x, y E K. Clearly the cone of a vector lattice is minihedral. Moreover it is generating since if x+ = sup(x, 0) and x- = sup( - x, 0) then x+ > 0, x- >, 0 and x=x+ -xx_. Remark 1. If x = u - u where u > 0 and v > 0, then u = x, + z and u = x- + z where z 2 0. In this sense, among the stated representations of a vector x as the difference of two non-negative vectors, there is a minimal one: x = X+ - x-. Remark 2. Any minihedral generating cone K c E turns E into a lattice. For the proof we put x - y = u - u (u, v E K) and then sup(x, y) = sup(u, u) + (Y - u)Now we introduce the modulus of an element x of a vector lattice E by putting Ix 1 = x+ + x- = sup(x, - x). The mapping x I+ 1x1is sublineaP and moreover 1x1 > 0 (x # 0), 10) = 0, i.e. the usual properties of modulus are preserved under this generalisation. Any element r 2 0 of a vector lattice E with cone K defines a cube C, = {x: 1x1d r} = {x: - r < x < r}. A linear functional f on E is said to be Iocally
i3The existence of an infmum inf(x, y), the greatest lower bound (also part of the definition of a lattice), is automatically guaranteed here: inf(x, y) = -sup(-x, -y). i41n particular the relation sup(x, y) + inf(x, y) = x + y follows from this. isIn fact these words are often omitted. l6 This follows from the subadditivity of the mappings x I+ x+, x H x-. We note also that lax1 = Ial (xl for all a E R, x E E.
6 3. Linear Topology
bounded if it is bounded on each cube. These functionals form a linear s ce E’ note that which is ordered by means of the cone of non-negative functionals if f > 0 then f E E’). (_ Theorem 1. In order that f be locally bounded it is necessary and suflicient that it be the difference of two non-negative functionals.
The sufliciency is trivial. For the proof of the necessity we put g(x) = ~UPo,,,J-(Y) (x > 0). This functional is additive and positive-homogeneous on the cone K. It is uniquely extended by linearity to K - K and then to the whole of E. By construction, g(x) 3 0 and g(x) > f(x) (x > 0). It remains to put h(x) = g(x) - f(x)- 0 The constructed functional g is sup(f, 0). At the same time we have also proved Theorem 2. The space E’ of locally bounded linear functionals E is a lattice.
on a vector lattice
A vector lattice E is said to be Dedekind complete if any non-empty subset X c E which is bounded above has a supremum sup X. If this is only required for countable sets, the lattice will be called Dedekind o-complete. The space @(S, R) is a Dedekind complete lattice but GO(S, R) is not even Dedekind a-complete. Making Theorem 2 more precise, we can also assert that E’ is a Dedekind complete lattice.
$3. Linear Topology 3.1. Linear Topological Spaces. If on a linear space E there is given a Hausdorff topology with respect to which the operations of addition and multiplication by a scalar are continuous i, then E is called a linear topological space (LTS). In particular, under addition an LTS is an abelian topological group. Therefore the system of open (closed) sets in an LTS is invariant with respect to translations x H x + y and homotheties x H ax (a # 0) (i.e. these are homeomorphisms). We note also that affine properties such as convexity, being circled or balanced and absolute convexity are invariant with respect to closure. A linear topology on E is completely determined by a base of neighbourhoods of zero (which clearly has to be assigned so that not only general topological requirements but also linear topological ones are satisfied). We note that in any LTS there exists a base of balanced neighbourhoods of zero. In fact if U is some neighbourhood of zero, then, by the continuity of multiplication by a scalar, there exist E > 0 and a neighbourhood V of zero such that aV c U if 101)< E. Then
.e.the mappings (x, y) H x + y and (a, x)wax product topologies into E are continuous.
of E x E and K x E (K = R or C) under their
120
Chapter 2. Foundations and Methods
w= u LYV IWe is a balanced neighbourhood of zero contained in U. In any LTS all open sets are algebraically open. We note further that any neighbourhood ofzero in an LTS is an absorbent set. We give several simple example of LTSs. Exainple
1. The usual topology
on the ground field K is linear.
Example 2. The function space @(S, K) is an LTS under the topology of pointwise convergence (i.e. the product topology since @(S, K) = KS). Since any subspace of an LTS is an LTS under the induced topology, we have in particular that @e(S, K) is an LTS. This provides a method of converting an arbitrary linear space E into an LTS since E x @,,(B, K) where B is a basis. Generally speaking the topology obtained depends on the choice of basis. However, the following result holds. Theorem. On any finite-dimensional unique linear topology.
space (and only on such a space) there is a
We note that a topological criterion for finite-dimensionality of an LTS is local compactness. A metric is one of the basic methods of defining a topology. If on a linear space E there is given a metric d which is invariant with respect to translations2 and satisfies the conditions d(ax, 0) < d(x, 0)
(1~4 < l),
lim d(ex, 0) = 0 E-0
(15)
then E, equipped with this metric, is an LTS. The function v(x) = d(x, 0) is a pseudonorm, i.e. 1) V(X) > 0 (x # 0), v(0) = 0,2) v(ax) < V(X)(Ial < l), 3) v(x + y) < v(x) + v(y), and it is affinely continuous at zero, i.e. continuous in each direction3: lim e.+OV(EX) = v(0) (=O). Conversely, any pseudonorm which is affinely continuous at zero determines an invariant metric d(x, y) = v(x - y) which satisfies the conditions (15), i.e. finally, it generates a linear topology. In this construction are included not only normed spaces but also certain others of detinite interest. Example 1. In the space (S, p) of measurable functions on a set S with finite measure ,u the pseudonorm
vb) =
s
Ids)l
dp
s 1 + I4al
is afftnely continuous at zero. The metric produced topology of convergence in measure.
by it defines the linear
.e.such that d(x + z, y + z) = d(x, y) (in what follows it is simply called invariant). 3 Continuity in each direction at a given point is clearly a concept which does not require a topology on E.
0 3. Linear Topology
121
Example 2. The space LP(S, p) (0 < p < 1)4 is defined by the condition
and it is topologised
by this pseudonorm.
Remark. Sometimes a linear topology is introduced by means of a quasinorm, i.e. a functional N which posesses all the properties of a norm except possibly the triangle inequality, which is replaced by iV(x + y) < C[N(x) + N(y)] where C is a positive constant. A base of neighbourhoods of zero is given by inequalities of the type N(x) < E(E > 0). If the linear topology on E is given explicitly by means of a metric then E is called a linear metric space; but if it is only a question of the existence of such a metric on an LTS, then it is said to be metrizable5. Clearly, satisfying the first axiom of countability, i.e. the existence of a countable base of neighbourhoods of zero, is a necessary condition for this.
Birkhoff-Kakutani Theorem. If an LTS satisfies the first axiom of countability then it is metrizable and moreover the metric which defines its topology can be chosen to be invariant and to satisfy conditions (15). Remark. The authors mentioned in fact resolved the problem of metrizability for topological groups. It was shown additionally that on any (Hausdoti) topological group the bounded uniformly continuous functions separate the points. Hence it follows that the underlying space of a topological group is completely regular. In particular any LTS is completely regular. The weaker property of regularity is obtained from simple considerations: if G is a topological group, X c G is a closed set, x # X and U is a symmetric neighbourhood of zero such that U + U is contained in (G\X) - x, then the neighbourhood x + U of the point x does not intersect the neighbourhood X + U of the set X. From the Birkhoff-Kakutani theorem follows
Corollary. In any linear metric space there exists a topologically invariant metric satisfying condition (15).
equivalent
LTSs E,, E, are said to be topologically isomorphic if there exists a linear homeomorphic mapping (topological6 isomorphism) of E, + E,. Isometric isomorphism of linear normed spaces is a special case of this (i.e. the existence of a linear isometry from E, onto E,). If E,, E, are finite-dimensional then topological isomorphism is equivalent to algebraic isomorphism, i.e. to the equality dim E, = dim E,. The problem of classifying infinite-dimensional LTSs up to topological isomorphism is extremely complicated. Certain advances in this direction are connected with the gener4As before, functions which are equal almost everywhere are identified. 5The space @(S, K) with uncountable S is not metrizable. his word may be omitted in a context where it is naturally implied.
122
Chapter 2. Foundations and Methods
alisation of the concept of dimension (Banach-Mazur, 1933; A.N. Kolmogorov, 1958). In his time Banach introduced the conjecture that all infinite-dimensional separable Banach spaces are homeomorphic (not linearly, in general). M.I. Kadets proved the validity of this conjecture in 1966. However, for example, the spaces Z’(O, 1) (p > 1) are mutually non-isomorphic (Banach-Mazur, 1933). The spaces of continuous functions on compact metric spaces with cardinality of the continuum are isomorphic to each other (A.A. Milyutin, 1966). A metrizable LTS E is said to be complete, or to be a Fr6chet space, if it is complete (in the usual sense) with respect to some (and consequently any) invariant metric which generates the topology on E. Remark 1. Completeness may be lost if the invariance of the metric is removed. For example, the usual topology on Iw is generated by any metric of the type d(x, y) = If?(x) - 13(y)l, where 0: Iw+ Iw is a local homeomorphism, although [w is not complete with respect to this metric if the function 19is bounded.
Remark 2. In the class of all LTSs we can define the property of completeness by requiring the convergence of every fundamental net (xi) (i.e. a net (xi) such that limi,k(x, - x,J = 0). For metrizable LTSs this definition reduces to the previous one. Remark 3. We can complete any LTS in the standard way. Moreover the completion of a metrizable LTS will be a Frechet space, the completion of a normed space will be a Banach space and the completion of a pre-Hilbert space will be a Hilbert space. Frtchet spaces have the important property of barrelledness. A barrelled space is an LTS in which each barrel contains a neighbourhood of zero, where barrel means an absorbent absolutely convex closed set. The barrelledness of a FrCchet space follows from Baire theorem that a complete metric space is of the second category, i.e. it cannot be the countable union of nowhere dense sets. In fact if an LTS E is of second category then it is barrelled. For the proof we consider an arbitrary barrel T c E. Since E=&T
PI=1
and all the nT are closed, for some m the set mT is not nowhere dense, i.e. it has an interior point; then T also has this property. But if x E int T, then -x E int T and consequently 0 E int T, i.e. a neighbourhood of zero is contained in T. We can characterise barrelledness in terms of seminorms. In any LTS continuity of a seminorm’ p is equivalent to its continuity at zero and for this in turn it is necessary and sufficient that the p-unit ball DP should contain a neighbourhood of zero*. For an LTS to be barrelled it is necessary and sufficient that each lower his also applies to any subadditive functional 4 with q(0) = 0. sThe last criterion is correct not only for seminorms but also for any non-negative sublinear functionals. This can also be expressed in the form of a boundedness requirement on some neighbourhood of zero.
123
§ 3. Linear Topology
semicontinuous seminorm on it be continuous. In fact, let E be barrelled and let p be a lower semicontinuous seminorm. Then the set (x: p(x) < l} is closed and consequently it is a barrel. If U is a neighbourhood of zero which is contained in it, then U c D, since all points of U are algebraic interior points. Conversely, if T is a barrel then its gauge qr is a lower semicontinuous seminorm and then, by the condition, it is continuous and consequently the q,-unit ball is a neighbourbood of zero. Banach-Steinhaus Theorem. If a family of continuous seminorms {pi)iE, in u barrelled space E is pointwise bounded, then p(x) = supipi(x) is a continuous seminorm.
In fact the seminorm p is lower semicontinuous. Under the conditions of the Banach-Steinhaus theorem the family of seminorms (pi}ipI turns out to be bounded on some neighbourhood of zero (in the case of a Banach space the family is bounded on each ball), and so the theorem is often called the principle of uniform boundedness (in the case of a Banach space we obtain p,(x) < C 11 x 1)(C = constant, i E I) or, what is equivalent, boundedness of norms for the family { pi}i EI). Uniform boundedness of a family of seminorms in any LTS is equivalent to its equicontinuity. From the point of view of classical analysis the Banach-Steinhaus theorem is the quintessence of the method of condensation of singularities. We give some examples of its application. Example 1. We show that there exists a continuous function f(t) with period 271, for which the partial sums S,(t; f) of its Fourier series are not uniformly bounded. Assuming the contrary, we would have in the Banach space of continuous functions on the circle a sequence of seminorms p,(f) = maxc,,, I&(t; f )I (n = 0, 1,2, . . .) which is pointwise bounded. By the Banach-Steinhaus theorem, p,(f) < C II f 11(C = constant). But then the Lebesgue constants L, turn out to be bounded which, as is well-known, is false. Example 2. Many methods for the summation of numerical series are covered by the following scheme proposed by Toeplitz (1911). Let a triangular numerical matrix ,4 = (&J,,Ci~k be given. We associate with any series having partial sums sir (k = 0, 1,2, . . .) the sequence Oi = k$o AikSk
(i = 0, 1, 2, . . .),
and if the latter converges to (r, then we shall say that the initial series is summable A. The question arises of the regularity of such methods of summation. It turns out that for regularity it is necessary and sufficient that the following conditions be satisfied: to 0 by means of the matrix
1) lim 1, = 0; i+cc
2) lim i+m
i k=O
il, = 1; i
k=O
We. shall prove only the necessity of 3). Let us consider the Banach space c of
124
Chapter 2. Foundations and Methods
convergent sequences s = (s,); with norm llsll = sup. Is,/. Under the condition, the sequence of continuous seminorms 1Oi(S)l (i = 0, 1,2, . . .) converges pointwise and consequently it is pointwise bounded in c. By the Banach-Steinhaus theorem (C = constant; i = 0, 1, 2, . . .). It just remains for us to put sk = sgn 1, (0 < k < i) for arbitrary
i.
Example 3. Let us consider on a compact space X with finite measure p the quadrature process
with nodes (x,,~}~~+
and coefficients (p,j)o~~” which satisfy the condition J&d
=
f
uk
(0
dp
<
k
<
4,
X
where (a,‘); is a complete Markov system. If the process converges for all f E C(X) then, by the Banach-Steinhaus theorem, (C = constant; n = 0, 1, 2, . . .).
1; Pnjf(%jIl G cllf II
Since for each n there exists in C(X) a function f satisfying the conditions f(xnj) = sgn pnj (0 < j < n), llfll < 1, it follows that
It is easy to show that this necessary condition for convergence is also sufficient. Returning to general problems, let us assume that a total family {pi}isr of seminorms is given on the linear space E: thus for any x # 0 there exists i such that pi(X) # 0. Then we can introduce on E a linear topology with base of neighbourhoods of zero Llit,i2,...,im;c
=
X: max
pi,(X)
< &
(E > 0)9.
lSk<m
All these neighbourhoods are convex (and even absolutely convex). A linear topology which has a base of convex neighbourhoods of zero is said to be locally convex and an LTS with such a topology is called a locally convex space (ZCS). The credit for separating out this class of LTSs, which has become firmly established in contemporary mathematics, belongs to von Neumann (1935). Specialising the previous example let us consider a total family { f;->i, I of linear function& on a linear space E. The base of neighbourhoods of zero his is the weakest topology under which all the seminorms pi are continuous.
1 P
0 3. Linear Topology
I/il,i2,...,imie
x: max Ifi,(x)l < E
= I
(E > 0)
1
defines a locally convex topology”
I
on E.
1
of pointwise convergence in the space @(S, K) is defined by the family of linear functionals f,(q) = cp(s)(sE S) and consequently it is locally convex. In this topology @(S, K) is complete. Example
1. The topology
Example 2. Let S be any Hausdorff topological space and let cpE C(S). For each compact set Q c S we put po((p) = maxo 1~1. The topology of uniform convergence on compact sets is defined by means of this family of seminorms on C(S). In this topology C(S) is complete whenever S is locally compact. We can define similarly on C(S) a topology of uniform convergence on any family of compact sets {Qi}ie, which covers all of S. The topology of pointwise convergence is included in this scheme. The topology of uniform convergence on any set S can be defined in the space B(S) by means of the norm I(cpII = sup, Icp1,with respect to which B(S) is a Banach space. Theorem. seminorms.
In any LCS the topology
We can form this family from the hoods of zero (such neighbourhoods If the topology on E is defined by E is metrizable. A suitable invariant
is defined by a certain
total family
of
gauges of the absolutely convex neighbourform a base at zero). a countable family of seminorms { p,}g then metric in this case has the form:
Thus, for example, if S is o-compact (i.e. it is the union of countably many compact sets) then C(S) is a locally convex Frechet space (but not a Banach space if S is not compact). Let us pass to the consideration of subspaces in an LTS E. They are all LTSs under the induced topology (LCSs if E is of this type). Particularly important are closed subspaces. For example, in the case of Frechet (or Banach) spaces only closed subspaces are Frichet (or Banach) spaces, since subspaces which are not closed are incomplete. Any closed subspace of a complete LTS is complete. If L is a closed subspace in any LTS E then the factor space E/L with its standard topology’ ’ is an LTS. If in addition E is an LCS then E/L is also an LCS and if E is barrelled then E/L is also barrelled. If E is metrizable and d is an This is the weakest amongst those for which all functionalsfi are continuous. Any linear functional on E which is continuous under this topology is a linear combination of the functionals A (i E I). r1 This is the strongest topology under which the natural homomorphism j: E + E/L is continuous, i.e. the open subsets of E/L are precisely those sets whose preimages are open in E. We note that in addition the homomorphism j is open. If the closedness of L is dropped then E/L fails to be Hausdorff.
126
invariant
Chapter 2. Foundations and Methods
metric on E then E/L is metrizable
by means of the invariant
Z(X, Y) = inf d(x, y),
metric (16)
X.Y
where x, y E E run through the corresponding classes X, Y E E/L. Further if E is complete then E/L is also complete in the metrizable case. Thus for Frechet spaces all factor spaces by a closed subspace are Frechet spaces. The situation is analogous for Banach spaces. In this case definition (16) reduces to the form
llxll = inf Ibll. XSX
A factor space of a Hilbert space by a closed subspace is a Hilbert space. In any LTS E the closure of any subspace is a closed subspace. It follows from this that any hyperplane in an LTS is either closed or dense. We note further that finite-dimensional subspaces in any LTS are closed. In any LTS a subset X is said to be bounded if it is absorbed by each neighbourhood U of zero (U can be assumed to be balanced) i.e. X c 1U for some I = n(X, LJ) > 0. For example, any finite subset and in general any compact subset are bounded. The class of bounded sets is invariant with respect to finite unions, linear combinations and closure. For LCSs we can add to this list the formation of absolutely convex hulls. Each subset of a bounded set in an LTS is clearly bounded. In any LTS boundedness of a set X is equivalent to the condition that if {x.>T c X, {a,,}? c R, and limnda, a, = 0 then lim,,, CI,X, = 0. The boundedness of a set is therefore equivalent to the boundedness of all of its countable subsets. It is clear that if the topology is weakened the collection of bounded sets is not diminished. In order that a set in an LTS be bounded, it is necessary, and in the case of an LCS also sufficient, that all continuous seminorms (or even just the basic seminorms) be bounded on it. For example, in @(S, K) boundedness is equivalent to pointwise boundedness. For boundedness of a set in a normed space it is necessary and sufficient that it be contained in some ball llxll < p. The converse of this obvious remark leads to a criterion fir normability of an LTS, i.e. for the existence of a norm which generates the given topology. Theorem of Kolmogorov. In order that an LTS be normable it is necessary and suflicient that there exist a bounded convex neighbourhood U of zero.
In fact U must then contain an absolutely convex neighbourhood K The gauge of the neighbourhood V is the desired norm. An LTS is said to be locally bounded if it has a bounded neighbourhood W of zero. In this case for any sequence of numbers Q, > 0 which converges to zero the sets u, W form a base of neighbourhoods of zero. Consequently any locally bounded LTS is metrizable and local boundedness of an LCS is equivalent to
Q3. Linear Topology
127
normability. Therefore, if a metrizable LCS is not normable then the balls in it are not bounded. Thus the concept of boundedness in the linear topological sense does not coincide with the usual metric boundedness in the class of metrizable LTSs (even LCSs). In a normed space these concepts are equivalent and in a linear metric space with invariant metric topological boundedness implies metric boundedness. 3.2. Continuous Linear Functionals. Continuity of a linear functional f # 0 is equivalent to the closedness of the hyperplane i?Z, = {x: f(x) = a} for any ~1.In fact closedness of H, for any c1is equivalent to closedness of Ho and for closed If,, the factor space E/H, is an LTS and f induces a bijection f: E/H, + K. Transferring the linear topology from E/H, to K with the aid off: we obtain from the commutative diagram E-K / j
3 \/
WHO in which j is the natural homomorphism, that f is continuous. It just remains to take account of the uniqueness of the linear topology on K. Continuity of a linear functional on an LTS is equivalent to its continuity at zero and this in turn is equivalent to the boundedness of the functional on some neighbourhood of zero. In a normed space we can choose the open unit of zero. We also consider the ball D = (x: llxll < l} as such a neighbourhood closed unit ball fi = {x: llxll < l} and the unit sphere S = o\D = (xi llxll = l}. Boundedness of a linear functional S on any one of these three sets D, D, S implies its boundedness on the other two and thereby is equivalent to its continuity. But if f is bounded on S (and only in this case) then it is subordinate to the basic norm II * II, and moreover its II * II-norm (subsequently called simply the norm of f) is equal to
llfll = ,w& x IfbN* This least upper bound is not altered on replacing the unit sphere by either the open or the closed unit ball. In view of what has been stated, linear functionals on a normed space, which are subordinate to the basic norm, are said to be bounded but in fact this property is equivalent to continuity. As a consequence of a corollary of the Hahn-Banach theorem the basic norm can itself be represented in the form
lbll = max IfWl. II/II=1 The essential difference of this formula from the previous one, its ual is that here the least upper bound is attained (at a functional which is supporting at x).
128
Example.
Chapter 2. Foundations and Methods
In the Banach space 1’of numerical sequences x = (<.)g with norm
any bounded sequence f = (a,): defines a continuous
f(x) =
linear functional:
O%5.; llfll = sup ” b .
If the sequence (la,l); does not attain its least upper bound then If(x)1 also does not attain its least upper bound on the unit sphere llxll = 1. In any LCS the continuity of a linear functional is equivalent to its subordination to some continuous seminorm and is also equivalent to its subordination to some finite collection of basic seminorms. For LCSs the Hahn-Banach theorem guarantees that it is possible to extend any continuous linear functional defined on a subspace to a continuous linear functional on the whole space. In the case of a normed space there is an extension which preserves the norm. The geometric form of the Hahn-Banach theorem in any real LTS guarantees that if an aftine manifold A does not intersect the interior of a convex set X with non-empty interior then it is contained in a closed hyperplane which does not intersect int X. In fact int X is a non-empty convex, open, and consequently algebraically open, set. We take a hyperplane H 3 A which does not intersect int X. It is closed since it is not everywhere dense. The topological variant of the theorem on the separating hyperplane is established similarly: for any non-empty open convex set X and any non-empty convex set Y which does not intersect it there is a closed separating hyperplane. A set X in an LTS is called a convex body if it is’ convex, closed and has non-empty interior. At each boundary point of a convex body X # E there exists a support hyperplane; all such hyperplanes are closed and X is the intersection of all the closed half-spaces of the form f(x) < c1which contain it, where the pairs f, a correspond to all possible support hyperplanes. In an LCS there is a further variant of the theorem on the separating hyperplane: for any non-empty closed convex set X and any non-empty compact convex set Q which is disjoint from X there exists a closed separating hyperplane. This situation is reduced to the previous one by constructing disjoint convex neighbourhoods for X and Q. The hyperplane which we want turns out to be even strictly separating:
In particular, if L is a closed subspace in an LCS E and x,, $ L, there exists a continuous linear functional f on E such that f IL = 0, f(xo) # 0 (we can take f (x0) = 1). This conclusion also remains valid in the complex case. Thus if L # E there exists a continuous linear functional f # 0 such that f IL.= 0. From this follows the very useful.
0 3. Linear
Topology
129
Lemma of F. Riesz. If L is a closed subspace of a normed linear space E and L # E then for any 6 (0 < 6 c 1) there exists in E a vector x ( llxll = 1) whose distance from L satisfies the inequality: d(x, L) = inf I/x - y 112 6. YEL
Proof. Let f be any linear functional such that II f I( = 1, f IL = 0. We choose x to satisfy the conditions [lx/l = 1, If(x)1 2 6. Then for all y E L we have IIX - Yll a If(x - YN = IfWl 2 6 0 In any LCS (real or complex) each closed subspace is the intersection of the closed hyperplanes which contain it, i.e. it is defined by a system of linear equations of the form J(x) = 0, where the fi are continuous linear functionals. If E is real then any closed convex set in E is the intersection of the closed halfspaces which contain it, i.e. it is defined by a system of linear inequalities of the form L(x) < cli, where the f;: are continuous linear functionals and the cli are real numbers. For compact convex sets we need consider only half-spaces corresponding to hyperplanes of support. If K is a wedge with non-empty interior in a real LTS E then all K-non-negative linear functionals are positive on the interior of K and are consequently continuous. Thus in the topological variant of the theorem on the extension of nonnegative functionals the resulting functionals are automatically continuous. For any wedge K c E the set of non-negative continuous linear functionals is denoted by K*. If K is a closed cone in a Banach space E then for each vector w E K there exists a functional f E K* such,that f(w) > 0 (theorem of M.G. Krejn). For any LTS E (real or complex) the continuous linear functionals form a linear space E*, called the conjugate12 of E. The case E* = 0 is possible13, although if E is an LCS then E* is total. We can use the latter property to introduce a locally convex topology on E defined by means of the set E*. Since the new topology is weaker than the initial topology (as a result of the continuity of all f E E*), it is called the weak topology (or w-topology) while the original topology is called the strong topology. We note that such a weakening of the topology does not change the set of continuous linear functionals, i.e. E* remains as before since, if a linear functional f is weakly continuous, there exists 6 > 0 such that from a certain system of inequalities of the form If;(x)1 < 6 (h E E*, 1 < i < n) follows the inequality )f(x)1 < 1. But then it follows from the equations J(x) = 0 (1 < i < n) that f(x) = 0. Thus f E Lin { fi , . . . , f.} c E*. It is also clear from what has been asserted that in general if the topology on E is given by a total space F of linear functionals then E* = F. The question of topologies on E* leads to a very general consideration which we will leave aside for the time being. However if E is a normed space then E* ore precisely, its topological conjugate. Clearly E* c EX but in general E* # EX; for example @(S, K)* = @+,(S, K). If E is finite-dimensional then E* = Es. 13As an example we can consider E = Lp(O, 1) for 0 < p < 1. This is a Frtchet space in which there are no open convex subsets except E, 0.
130
Chapter 2. Foundations and Methods
is also normed (as was mentioned above) and even a Banach space (as is easily verified). We can now consider the second conjugate E** and the canonical homomorphism v: E + E ** defined by the formula w
u-1 = f(x)
(XE E,feE*).
Since I(vx)(f)l G llfll llxll and equality is attained at a support functional f for x, then IIvx II = [[XII, i.e. v is an isometry. Thus we can always consider a normed space E as a subspace of the Banach space E** and identify the completion of the space E with its closure E c E ** (E is a Banach space if and only if it is closed in E**). A Banach space E is said to be refexive14 if E = E** (i.e. if the canonical isometry v is surjective). In contrast to the algebraic situation there exist inlinitedimensional reflexive Banach spaces (for example, all Hilbert spaces are of this type (see below)). If E is finite-dimensional then it is reflexive since in this case all linear functionals are continuous. Reflexivity of the space E* is equivalent to the reflexivity of the space E. James (1964) established the following remarkable criterion: for the reflexivity of a Banach space E it is necessary and sufficient that each continuous linear functional on E should attain its norm on the unit sphere Is. It is appropriate to note here that for any Banach space the set of continuous linear functionals which attain their norms is dense in E* (Bishop-Phelps, 1961). James (1951) constructed an example of a Banach space E for which dim(E**/E) = 1, and moreover E** is isometric to E (but of course not canonically). A Banach space is said to be uniformly convex if whenever 0 < E < 2 there exists 6 > 0 such that from the conditions ~/XII = 11yJ[ = 1, [Ix - yll > E there follows 11(x+ y)/2II < 1 - 6. For example, the spaces Lp(S, ,u) for 1 < p < COare of this type (Clarkson, 1936). Any Hilbert space is uniformly convex16. All uniformly convex spaces are reflexive (D.P. Mil an, 1938). This classical result follows easily from James criterion. A Banach space is said to be uniformly smooth if for any E > 0 there exists 6 > 0 such that from the conditions llxll = 1, llyll < 6 there follows [Ix + yll + [Ix - yll < 2 + EII yll. Uniform smoothness (convexity) of the space E is equivalent to uniform convexity (smoothness) of the space E* (V.L. Shmul an, 1940). Thus all uniformly smoooth Banach spaces are reflexive. All spaces Lp(S, p) (1 < p < cc) are uniformly smooth. Any Hilbert space is uniformly smooth. The unit sphere of a Banach space is said to be smooth at the point x if there is a unique support functionalf, at this point. A Banach space is said to be smooth I4 More precisely, topologically reflexive. 15The necessity is obvious: for any f~ E* there exists a support functional < E E** (r(f) = jlfll, llrll = 1) and, if E** = E, then r = x E E, T(f) = f(x), llrll = Ilxll. 16As a consequence of the parallelogram identity: I/x + yJIz + /Ix - yllz = 2( llxllz + Ijyllz). Inckhtally, this identity for a Banach space implies that it is a Hilbert space (Jordan-von Neumann, 1935).
5 3. Linear Topology
131
if its unit sphere is smooth at each point. In this case there is defined the support mapping x H f, from the unit sphere of the space E into the unit sphere of the space E*. Smoothness of the unit sphere at a point x is equivalent to differentiability the norm at this point in the sense of GAteaux, i.e. the existence of the limit
of
(x, y> = lim IIX + ryll - llxll z l-r0 for all y. The space E is uniformly smooth if and only if this limit exists uniformly for all x, Y (llxll = 1, llvll = 1). In any separable Banach space the set of smooth points is dense in the unit sphere (Mazur, 1933). A Banach space is said to be strictly convex if there are no line segments on its unit sphere or equivalently if the condition IIx + yll = [[xl] + llyll (x, y # 0) implies y = ax (tl > 0). Clearly this property is weaker than uniform convexity. Strict convexity of a reflexive space is equivalent to smoothness of the conjugate space. A more general result holds: if E* is smooth or strictly convex then E is correspondingly strictly convex or smooth. The converse assertion is false (Day 1955; Klee, 1959). Returning to the situation of an arbitrary LCS E, we note that each closed convex set X c E is weakly closed (and consequently for any convex X c E its weak and strong closures coincide). In fact, if x0 4 X, there exists a functional f~ E* such that SUP,,~ re f(x) < re f(x,), and then E\X contains the weak neighbourhood { y: re f(y - x0) < E} for sufficiently small E > 0. We remark further that in an LCS the weak topology coincides with the strong topology on compact sets since the weak topology is still Hausdorff. In specific function spaces we can usually obtain an explicit description (the general form) of all continuous linear functionals. In this series of results, many of which were established long ago by F. Riesz, the first place is taken by his theorem on the general form of a continuous linear functional on the space C(S) of continuous (real or complex) functions on a compact space S:
where o is a uniquely defined real or complex measure”. Its proof makes use of the extension of a functional to B(S) by means of the Hahn-Banach theorem, which allows the possibility of applying the functional to characteristic functions of sets’8. ere we have to extend the concept of measure to allow any real or complex expression: w = wr - o, + q - tw,, where the y (1 < k < 4) are measures in the restricted sense, i.e. they are non-negative. Summability with respect tow is defined as summability with respect to [WI = C& q. In the situation under consideration it is possible (but not obligatory) to use the term generalised measure (in the real case a convenient term is charge, which is associated with potential theory). ‘*A similar approach was noted in Section 2.6 where we were concerned with non-negative linear functionals on C(S). Here it is appropriate to point out that non-negativity implies continuity.
132
Chapter 2. Foundations and Methods
If we define the norm (total variation) of the measure o by putting
then it can be proved that IIf II = ~~o~~.The space M(S) of measures on S (with the properties described above) turns out to be a Banach space which is isometrically isomorphic to the space C(S)*. In what follows these two spaces are identified. If S is a locally compact space then continuous linear functionals on C(S) have a form similar to the previous one but with measures of compact support (a measure is said to have compact support if it is equal to zero on sets which are disjoint from some compact set). In the space Lp(S, p) (1 G p < co) continuous linear functionals are identified with the elements of the space Lq(S, p), where q is related to p by Young relation p-l + q-1 = 1: f(cp) =
ss
(rp E LP, l/bE Lq).
cpe 4
In addition llfll = ll+llLs which allows us to consider (Lp)* = Lg. However we must add that if p = 1 then q = co and Lm(S, CL)is the space of measurable functions which are bounded except for values taken on a set of measure zero. As usual, two functions which coincide almost everywhere are identified. The norm on L S, ,u) is defined as
and under it L S, p) becomes a Banach space which is conjugate to L S, p). From what has been said it is immediately clear that the spaces Lp are reflexive for 1 < p c co. The spaces B(S), C(S), M(S), L S, p), L (S, p) are not reflexive (except for finite S). The case p = 2 (q = 2) is distinguished by the fact that the conjugate space can be identified with the initial space. This property is possessed by all Hilbert spaces (it is a stronger requirement than reflexivity). Theorem
of F. Riesz. If E is a Hilbert
continuous linear functional
space then the general form is given by the formula
f(x) = (X>Y)
of a
(x E 4 Y E E);
further Ilf II = Ilvll. Remark. For a Hilbert space E the isometry of E* onto E defined by the theorem of F. Riesz is linear in the real case but semilinear in the complex case (i.e. additive and af H oly( f )).
3.3. Complete Systems and Topological Bases. We have already discussed on several occasions complete systems of functions in certain function spaces. In exactly the same way a system of vectors V = (z+)~~~in an LTS E is said to be
8 3. Linear Topology
133
complete if its linear hull is dense, i.e. its closed linear hull coincides with E. For example any Hamel basis is such a system but a complete system can be considerably thinner.
Example. The countable system of functions t” (0 < t < 1; n = 0, 1,2, . . .) is completelg in C[O, 11, whereas every Hamel basis has the power of the continuum. If there exists in an LTS E a countable complete system of vectors then E is separable. If E is an LCS a necessary and sufficient condition for a system I/ c E to be complete is that any continuous linear functional f which annihilates V be identically zero. Therefore in proofs that specific systems are complete, theorems on the general form of continuous linear functionals are usually applied. Example 1. Let us consider in I.‘( - ?r, a) the system of exponentials (eian& z with arbitrary (in general complex) exponents 1,. The continuous linear functionals which annihilate the given system are identified with those functions cpE L2( - X, 71)such that II cp(t)e n’dt = 0 (n E Z). f -It Putting
a4 =s* -*
cp(t)e ’ dt,
we arrive at the entire functions of class B,’ with roots at the points d,. Consequently, in order that the given system of exponentials be complete, it is necessary and sutlicient that {A,} be a uniqueness set for the interpolation problem Il/(&,) = 0 (n E Z) in the class B, The trivial situation of this type is where the set {A,> has a limit point. Therefore we may consider that I, + co. In this direction we obtain, for example, the following result (A.I. Khejlits, 1969): let 1, = n + 0(n), where 0(n) = O(1); then the system (e n is complete in L2( - R, rr) if and only if
,z, em@reA,(~))= mo, where A,(e) = 2, wn + ; n=--00
- fw)
(the prime signifies the omission of the term with n = 0; convergence of the series is assumed). Example 2. Let us consider a non-empty compact set S c C whose plane Lebesgue measure is zero. The Hartogs-Rosenthal theorem on the denseness in 19Further, any system of the form (tnk)~&, where 0 G n, < n2 < ... and ck,I n;’ = DC),is complete in C[O, l] (theorem of Mirntz).
134
Chapter 2. Foundations and Methods
C(S) of the set R(S) of rational functions with poles outside of S can be established in the following way. Suppose that R(S) is annihilated by the linear functional on C(S) defined by the complex measure p which is concentrated on S. Then
pj=J-$+o
(z$S).
We take any rectangle X for which p(aX) = 0 and 1,(8X n S) = 0 (A, is linear Lebesgue measure). Then
Hence it follows that p = 0. A system of vectors (ei)g in a separable LTS E is called a topological basis or a Schauder basis if for each x E E there is a unique expansion in a series:
X = 5 tiei. i=O
For example, in lp (1 < p < co) the system (6i)~ is a Schauder basis. Topological bases in C[O, l] and Lp(O, 1) (1 < p < co) were described by Schauder in the 1920s. Later on much effort was expended on the construction of Schauder bases in specific spaces2’ and also in the proof of the existence of such a basis in any separable Banach” space; however the last conjecture was disproved by Enflo (1973). Soon after this counterexamples were constructed in subspaces of the spaces Lp (Davy (p > 2), Shankovskij (p < 2)). Clearly, any Schauder basis is a complete linearly independent system22. The coefficients 5i in the expansion of a vector x are called its coordinates (just as in the algebraic situation). They are clearly linear functionals of x. The image E^of the homomorphism x H (&(x))go mapping E into the space of sequences is called the coordinate space of the given basis. Example. Let E be a separable Hilbert space and let (e&z be a complete orthonormal system (it can be obtained by the orthogonalisation process from any countable dense set X c E). This orthonormal basis is a Schauder basis in E. The unique expansion of a vector is its Fourier series X =
(X,
e&i,
i=O
the coordinate
space coincides with l2 and an isometry
between E and l2 is
r” For example, the question of the existence of a Schauder basis in the space of functions which are analytic on the unit disk and continuous up to the boundary remained open for a long time. A positive solution was obtained by S.V. Bochkarev (1974). *i It can be shown that in the class of Frechet spaces a necessary condition for the existence of a Schauder basis is that E* be total. Thus, for example, there is no Schauder basis in Y(0, 1) for O
Q 3. Linear
established by means of Parseval
Topology
135
equality
All this is proved in exactly the same way as for classical Fourier series in L2( - R, n). As a corollary we obtain the result that all separable Hilbert spaces are isometrically isomorphic to one another. In the example considered the coordinate functionals are continuous and the situation is exactly the same for any Schauder basis (eJ$ in a Banach space E (see Section 3.8). Moreover the systems (ei); c E and (C)g c E* are biorthogonal:
Zf (A,), EE is a sequence of real numbers which satisfies sup II, - nl < a, ”
then the corresponding system of exponentials logical basis in L2( - 71,n).
(e n ,,z is an unconditional
topo-
It is not possible to improve the constant $: if L, = n - $sgn n the system even fails to be complete (Levinson, 1936). On the subject of unconditional convergence of vector series C&, xi we note that in a Banach space this is equivalent to 1) convergence of all its arts’ CEO Xir32) convergence Of all series I& EiXi (Ei = f 1). In a complete LCS E a sufficient condition for unconditional convergence of a series I& xi is that all series ~~ op(xi) converge (p runs through a family of basic seminorms). It is natural to call such convergence absolute. In a Banach space absolute convergence of a series means that csO llxi II < co. In any infinite-dimensional Banach space there exists a series which converges unconditionally but not absolutely 23 It is even possible (Pelczy6ski, 1969).
to construct
a reflexive
separable
Banach
space having
no unconditional
basis
136
Chapter 2. Foundations and Methods
(Dvoretzki-Rogers,
1950) which cannot happen in the finite-dimensional case space the set of sums obtained from all possible rearrangements which preserve convergence is an alline manifold (Steinitz, 1916). In any infinite-dimensional Banach space there is a counterexample to this (V.M. Kadets, 1986). The concept of an unconditional topological basis can be generalised to the case of a non-separable LTS E by applying this term to a system of vectors (ei)isI with respect to which there exists for each x E E a unique unconditionally convergent series (theorem of Riemann). For a convergent series in a finite-dimensional
x = C &ei iel
(the set of coordinates 5i different from zero is no more than countable). Completeness and linear independence are clear in this case also. The coordinate space is now a subspace E^c @(I, K) which is invariant with respect to rearrangments (it is symmetric) and moreover each e E E^has no more than countable support. Example. Let E be any Hilbert space and let B = (ei)isI be a complete orthonormal system (its existence can be shown with the help of Zorn lemma). This orthonormal basis is an unconditional topological basis. The unique expansion of any vector x is its Fourier series X =
C iEI
(X9
e&i,
and the coordinate space coincides with lf = L Z, v) (v is the measure on I such that u( {i}) = 1 for all i E I); an isometry between E and lf is established by means of Parseval equality
In particular,
the system of exponentials (eiar)2c u on the whole axis -co < basis in the space B2, the completion of the pre-Hilbert space AP of almost-periodic functions. Thus B2 is isometrically isomorphic to 1;. In any LTS E the least power of all the complete systems is called its topological dimension and is denoted by dimt E. In the finite-dimensional case it is equal to its algebraic dimension, while in the infinite-dimensional case it is the least power of all the dense subsets. In particular, E has countable topological dimension if and only if it is separable. A system of vectors (ei)iol in a linear metric space E with metric d is said to be 6 - distal if t < co is an orthonormal
d(ei, er) > 6 > 0
(i # k).
A system is said to be distal if it is b-distal for some 6. The power of a distal system does not exceed dimt E if E is infinite-dimensional. In fact in each ball d(x, ei) < J/2 there is a point xi from the dense subset and xi # xlr (i # k). For an infinite-dimensional space E the power of any complete distal system is equal to dimt E.
37
5 3. Linear Topology
In a Hilbert space E any orthonormal system (ei)i=l is distal: d(e,, e = ,/2 dimt E. (i # k); therefore the power of any orthonormal basis in E is equal e isometriConsequently, all Hilbert spaces of the same topological dimension cally isomorphic. Y Example. The topological dimension of the space Bz is equal to the power of the continuum. Let E be an infinite-dimensional normed linear space. By Zorn lemma there exists for arbitrary 6 (0 c 6 < 1) a maximal b-distal normalised system (ei)i.,. It is complete since otherwise it would be possible by Riesz lemma to extend it in such a way that the &distal property is preserved. Therefore its power is equal to dimt E. This result remains valid if in place of (ei)ip, we construct by transfinite induction a system (uj)jpJ with the following properties: 1) the index set J is well-ordered, 2) llujll = 1 0 E J); 3) d(Uj, Lj) 2 6 where Lj is the closed linear hull of the subsystem (I&<~. If we now choose fi E E* such that llfi II = 1, Ifi > 6/2 and fil=, = 0, then the system (f,)j,, will be 6/2 - distal. Consequently, dimt E* 2 dimt E. If E* is separable then E is also separable. If E is reflexive then dimt E* = dimt E. The approach to the theory of topological dimension of normed spaces which has been described was proposed by M.G. Krejn, M.A. Krasnosel kij and D.P. Mil an (1948) as a basis for the investigation of those properties of subspaces which are stable under small perturbations. It is natural to measure the size of a perturbation by means of the gap 8(L, M) = max
sup d(x, M), sup d(y, L) ,
( XSS‘
>
YSSM
where S,, S, are the unit spheres of the subspaces L, M. Theorem 1. 1f 8(L, M) = 8 < 4 then dimt L = dimt M. Proof. 24 Suppose that (Ui)iel is a maximal &distal normalised system in L with 28 < 6 < 1. We take E < 612 - 8 (E > 0) and choose vi E M such that
d(ui, Vi) < 8 + E. Then the system (Vi)iel will be [S - 2(8 + s)]-distal. quently, dimt L < dimt M. 0 Theorem 2. Zf 8(L, M) = 0 -K 4 and the subspaces dimt(E/L) = dimt(E/M).
Conse-
L, M are closed then
Proof. Suppose that dimt(E/L) > dimt(E/M). We take in E/L a maximal b-distal normalised system ([Ui])ipr, where [Ui] is the class of the vector ui E E, IIUiII < 1 + E (E > 0). Let US consider the class (Ui) modulo M. The system is not distal. Therefore for any q > 0 we can find i,, i2 E Z such that (< ))iol d((u,,), (Ui,)) < q and then there exists z E M such that d(ui, - ui2) Z) < q. Consequently 24For the infinite-dimensional case. The finite-dimensional an application of topological methods.
case requires a separate treatment with
138
Chapter
2. Foundations
and Methods
d( - l uiz7 L, - d( -, ui29 z,, ’ llzll
6-v 2(1 + E) + r/’
Letting 6 -+ 1, E + 0, q + 0 we obtain 8(L, M) 2 $, contrary to the given condition. 0 Remark 1. In the case of a Hilbert require 8(L, M) < 1.
space it is sufficient in both theorems to
Remark 2. The modified gap 8(L, M) = max
sup d(x, S,), sup d(y, S,) ( XeSL YCSM >
is a complete metric on the set of subspaces of a Banach space (I. Ts. Gokhberg - A.S. Markus, 1959), whereas the original gap does not satisfy the triangle inequality. Moreover they are topologically equivalent because of the inequality 1 < 8(L, M)/B(L, M) < 2. 3.4. Extreme Points of Compact Convex Sets. For any convex set X in a linear space E a point x E X is called an extreme (or extremal) point if it does not belong to any interval [x,, xZ] c X whose end points are both different from x. This is equivalent to the assertion that if x = (u + u)/2 where u, u E X then u = u. The set of extreme points of a convex set X is denoted by extr X. Its role is contained in the following fundamental result. Krejn-Mil an Theorem. Let E be an LTS and suppose that E* is total. Then any compact covex set Q c E coincides with the closed convex hull of the set extr Q.
In this connection we note that the set of extreme points of a compact convex set need not be closed and hence it can fail to be compact. Example. Let us consider in R3 any circle C and any interval [x1, x2] not lying in the plane of the circle but having a common point c with it different from x1, x2. For the compact set Q = Co(C u [x1, xJ) the set of extreme points is (C\{C))” {x19 x2). Nevertheless, if E is finite-dimensional any compact convex set Q c E coincides with the convex hull of its set of extreme points. This assertion had already been proved by Minkowski (by induction on the dimension). In any LTS the closure of a convex set is convex, from which it is clear that the closure of the convex hull of any set X is the smallest closed convex set containing X. We call this the closed convex hulP of the set X. The convex hull of a set which is actually closed may fail to be closed. Example. The convex hull of the set of points (<, r2) (r > 0) in R2 consists of zero and the points (5, q) for which q > r2 and < > 0. Its closure contains in addition the semiaxis (0, q) (q > 0).
25The
closed absolutely
convex
hull is defined
similarly.
5 3. Linear Topology
139
In a finite-dimensional space the convex hull of any compact set is closed and consequently it is compact (its boundedness is obvious). In an infinite-dimensional space (even if it is a Hilbert space) the convex hull of a compact set may fail to be closed. In certain infinite-dimensional spaces there exist compact sets whose closed convex hulls are not compact. This is essentially the effect of incompleteness of such a space. In any complete LCS the closed convex hull of a compact set is compact. For a locally convex FrCchet space a similar assertion holds in the weak topology: the weak 26 closure of the convex hull of a weakly compact set is weakly compact (Krejn-Shmul an theorem). If the closed convex hull of a compact set Q in an LTS E with total E* is compact then all of its extreme points are contained in Q (D.P. Mil an, 1947). We now give a proof of the Krejn-Mil an theorem. First let us assume that E is an LCS (without loss of generality we can also assume that it is real). We put K = Co(extr Q). If K # Q then any point z E Q\K can be separated from K by a closed hyperplane {x: f(x) = a} such that f(x) < a (x E K), f(z) > a. We put B= maxXGaf(x). Clearly /I > a and so the hyperplane H = { y: f(y) = /I} does not intersect K or, consequently, extr Q. This situation leads to a contradiction since we are able to find an extreme point of the compact set Q in H. For this purpose let us call a closed alline manifold A extreme if A n Q # 0 and whenever an interior point of an interval [a, b] c Q belongs to A then the entire interval is contained in A. For example, the hyperplane H is extreme. Using Zorn lemma it is easy to show that there is a minimal extreme manifold M contained in H. It is a singleton set since otherwise there exists a continuous linear functional g which is not constant on M and then the smaller extreme manifold {y: y E M, g(y) = r}, where y = max xGMnp g(x), is contained in M. But any singleton extreme manifold must simply consist of an extreme point of the compact set Q. Now if we only know that E* is total, from what has been proved the theorem is true for the weak topology, i.e. Q = e where e is the weak closure of the set C = Co(extr Q). Denoting the strong closure by c we have C c c c e. But c is compact, consequently weakly compact and so also weakly closed. Therefore c = e, i.e. Q = c 0 It is not possible to dispense with the condition that E* be total: there exists an LTS in which a certain non-empty compact convex set has no extreme point (Roberts, 1977). The condition of compactness in the Krejn-Mil an theorem is also essential; for example, it is not possible to replace it by closedness and boundedness. Example. If S is a connected compact space the closed unit ball 1)cp11< 1 in the Banach space of real continuous functions on S has no extreme points other than the constants f 1. 26It is also the closure in the strong (original) topology since the space is locally convex.
Chapter 2. Foundations and Methods
In this connection we stress that the closed unit ball of an infinite-dimensional Banach space is not compact, since otherwise the space would be locally compact and consequently finite-dimensional. One of the many useful corollaries of the Krejn-Mil an theorem is that under its conditions the maximum of any continuous convex functional on a compact convex set is attained on its set of extreme points. 3.5. Integration of Vector-Functions and Measures. The initial development of this important analytic tool was carried out in the 1930s. We give the Gel andPettis definition of the integral of a vector-function (1938). Let S be a set with a measure p, E an LTS with total E* and x(s) (s E S) a uector-function, i.e. a function with values in E. Let x(s) be integrable in the sense that all scalar functions of the form f@(s)) (f E E*) are summable with respect to the measure p. Then the linear functional
is called the integral of x(s) with respect to the measure p. Thus, in general, J E (E*) #, however the most interesting case is when J(f) = f(F) where X E E. If such an x exists it is unique; it can be identified with J and we put correspondingly
y=ss44dp. Theorem. Suppose that ,u(S) < CO and x(s) is an integrable vector-function. If the closed convex hull X of the set of values x(S) is compact then the integral-of x(s) with respect to the measure ,u is a vector X E E (further, X E X if p(S) = 1). Proof. We may assume that p(S) = 1 and that E is a real space. For any collection {S,}; c E* we consider the continuous mapping Fx = (J(x)); from X into R Its image is a compact convex set. Moreover (.T(f;:)); E Im F since otherwise there exists a vector (ai); such that
i.e. for the functional
g = ~~=r aif the inequality
dp >I-;g(x) a g(x(s)) !J s&(s)) S
is satisfied, which is absurd. If we now consider the closed hyperplanes H, = {z: f(z) = J(f)} (f E E*)in E, we see that the family of compact sets H, n X has the finite intersection property and so its intersection X, is non-empty. The vector X E X, is what is required. 0 Integrability with respect to a generalised measure o is defined as integrability with respect to each of its components and, correspondingly, the preceding
141
Q3. Linear Topology
theorem can be formulated componentwise ” . In particular it is true if the generalised measure o is absolutely continuous with respect to some measure /A (infinite in general), o(A) =
(P E L’(S, PI, A = 9, ~(4 & sA while the function x(s) satisties the previous conditions. In the most developed situation we have to integrate a function x(s) with values in a Banach space E which is defined and weakly continuous on a locally compact topological space S provided with a generalised measure o. In this situation the norm of the function is measurable since 11x(s)11= suplf(x(s))l (f E E*, II f I[ = 1) and ail the f@(s)) are continuous. If
~(4 = then x(s) is integrable
s
IWII b-4 < ~0
(in this case it is natural
to say that it is absolutely
integrable2’) and the integral x(s) dw E E**
XC s S
is a vector in E; moreover llX\l < v(x). In fact for any E > 0 we can choose a compact set Q, c S such that
Il~(~)IIW-4 < E.
s s\Q. The set x(QE) is weakly compact. By the Krejn-Shmul an closed convex hull is also weakly compact. Consequently x, E
theorem its weakly
x(s) dw E E.
s Q. Further IIX - X,IlE*+ =
sup
fsE** 11/11=1IS S\Qc
f(x(s))
do < E.
Since E is closed in E** we deduce that X E E. Let S be any space with a measure p. Following Bochner (1933) we can define the Banach space L S, p; E) of absolutely integrable (summable) vector-functions with values in the Banach space E as the L completion of the space of simple 27Or with respect to (w(. 28If S is compact then sup, If(x(s)l < cc for all f E E*, hence by the Banach-Steinhaus theorem If(x(s))l < Cllfll and consequently Ilx(s)ll < C, i.e. any weakly continuous function on a compact set with values in a Banach space is bounded in norm. Moreover it is absolutely integrable with respect to any generalised measure with finite total variation.
142
Chapter 2. Foundations and Methods
functions, i.e. finite combinations of the form29 xkuk&), where the uk are arbitrary vectors in E and the xk are characteristic functions of measurable sets Sk such that ,u(S,) < co and Si n Sk = 0 (i # k). Moreover if x E L’(S, p; E) the function 11x(s)11is measurable and
llxll =
ss
IWII &
On L’(S, p; E) the Bochner integrul of x with respect to the measure ,u is defined as the strong limit of integrals of simple functions and
The Lebesgue theory of integration extends to this situation with no significant changes. All functions in L1(S, p; E) are Gelfand-Pettis integrable and the values of the integrals in both senses coincide in this case. If x E L’(S, 1~;E) the set function
on the a-algebra associated with the measure p is countably additive and in this sense it is a vector measure. For any vector measure m(A) its ouriution is the measure
Id (4 = sup $r II&%) II < 00, where (Ak}; runs through all finite measurable partitions original example
ImA
=
sA
of the set A. In the
IIx(s)II G.
The total variation (or norm) of a vector measure m is the quantity IIml1 = Iml (S). A vector measure m is said to be absolutely continuous if l(rnll < co and m(A) =
~(4 dImI,
sA where x E L’(S, Iml; E). We say that a real Banach space E has the RudonNikodjm property (RN) if every vector measure with finite norm is absolutely continuous3’. The RN property has a purely geometric characterisation which is closely connected with the Krejn-Mil an theorem. r9Any summable function is the limit almost everywhere of some sequence of simple functions. Therefore its set of values is separable (except possibly for values taken on a set of measure zero). 30The classical Radon-Nikodjm theorem asserts that if there are two measures p, v (on a common o-algebra) and p(A) = 0 + v(A) = 0 then v(A) = sA
where p is some function (the density or Radon-Nikodjm
P dp, deriuatiue).
$3. Linear Topology
143
Let M c E be a closed convex bounded set. A point z,, E M is said to be strongly exposed if there exists a functional f E E* such that f(z) < f(zo) for all z E M (z # zo) and the diameter of the ap’f(z) > f(zO) - E(z E M, E > 0) tends to zero as E + 0. Phelps-Rieffel Theorem. A necessary and suficient condition for (I Banach space to have property RN is that each of its closed bounded convex subsets be the closed contiex hull of its set of strongly exposed points. Since all strongly exposed points are clearly extreme3 ‘, in an RN-space E each closed bounded convex set is the closed convex hull of its set of extreme points32 (Lindenstrauss, 1966). The converse assertion holds if E is a conjugate space (Huff-Morris, 1976). Each separable conjugate Banach space has the property RN (Dunford-Pettis, 1940). We now use the tools of integration to give another formulation of the KrejnMil an theorem. Let E be an LTS with total E*, let Q c E be a compact convex set and put S = extr Q. Let us consider on the compact set S a normalised measure p and the function x(s) = s. The centre of muss xp =
s dp IS belongs to Q and the Krejn-Mil an theorem implies that any point X E Q is the centre of mass of some normalised measure p5 on S. A significant refinement of this result is due to Choquet (1956): if the compact set Q is metrizable then pz (X E Q) can be chosen to be concentrated33 on extr Q and not on its closure S (in this situation extr Q is a G, set). Such a pz is called a Choquet measure for the point X. It can be regarded as a solution of a singular abstract moment problem. This way of looking at it embraces in a single form all those classical moment problems in which the required measure is finite (the general case needs an additional limiting process). For example, in the power moment problem on the whole axis we have a convex set Q of positive (in the sense of the corresponding Hankel matrix) sequences (m& (m. = l), its extreme points are {t"}$ (t is a real parameter) and the integral representation al m, = tk da(t) (k=0,1,2,...) s0 corresponds to Choquet theorem. The converse is false. However, as we see, strongly exposed points play the same role as extreme points in a series of cases. We note here that any weakly compact convex set in a Banach space is the closed convex hull of its set of strongly exposed points (Amir-Lindenstrauss, 1968). In a uniformly convex Banach space every point of the unit sphere is a strongly exposed point of the closed unit ball. 32Thus, for example, if S is a connected compact space with card S > 1 then C(S) is not an RN-space. 33We say that a measure v on a set X is concentrated on a subset M c X if v(X\M) = 0. If X is a topological space the smallest closed subset on which the measure v is concentrated is called its support and is denoted by supp v.
144
Chapter
2. Foundations
and Methods
The integration of vector-functions also plays a significant role in the extension of complex analysis to the vector situation, which in turn is absolutely essential for constructing spectral theory. A vector-function x(c) defined on some domain G of the complex plane @ and taking values in an LTS E with total E* is said to be analytic on G if all scalar functions of the form f(x(c)) (f E E*) are analytic on G34. Theorem. If x(c) is an analytic function on a domain G with values in a Banach space E then at each point [,, E G it has a strong expansion as a power series
(K - lol < 6 = d(t-o,@\@I,
(17)
in which the vector coeflicients are uniquely determined. Proof. We have
f&(c)) = kzotk(f
)(c - cdk
(fEE*),
where
= &
f(W) s I<-[ol=p(C-
4 ro)k+l
(0 < p < 6).
Since x(c) is weakly continuous on the circle I[ - [,, 1 = p, we have tk( f) = f (x,), wherex,EE(k=0,1,2 ,... ). Moreover /lxkII < M(p)pek, where M(p) = sup16-501=P[lx(c) 11.Consequently, the power series (17) converges absolutely on the disk Ic - co 1 < p for any p < 6, i.e. finally, on the disk I[ - co1 < 6. Its sum s(c) - x(c) since f(s(c)) = f(x([)) for all f E E*. 0 The further development of the theory of analytic vector-functions with values in a Banach space presents no problem. Concerning this we note only the Cauchy-Hadamard formula for the radius of convergence of a power series with vector coefficients ck (k = 0, 1,2, . . .):
The sum of the power series is analytic inside the disk of radius r. 3.6. w*-Topologies. Let us consider an arbitrary linear space E and its algebraic conjugate space E’. Since E’ c @(E, K) the topology of pointwise convergence is defined on ES. It is called the w*-topology on E#. The LCS EX is w*-closed in @(E, K) and consequently it is w*-complete. If E is an LTS, its conjugate space E* is contained in E’ and so the w*topology is also defined on E*. This is the weakest topology in which all the linear 34A vector-function
which
is analytic
on the whole
plane is said to be entire.
145
$3. Linear Topology
functionals of the form R(f) = f(x) (X E E, f E E*) are continuous (these are the canonical images of the elements x E E in the space (I?*)#). Each w*-continuous linear functional E* has the stated form. A fundamental result of a general character concerning the w*-topology is the Banach-Alaoglu Theorem. Let E be an LTS and U c E a neighbourhood of zero. Then the set U” consisting of those linear functionals on E which satisfy the condition35 supXScr If(x)1 G 1 (and so are continuous) is w*-compact. Proof We put A = {I: 111< l> and consider the set of functions @(U, A) with its product topology. This is compact by Tikhonov theorem. The mapping Rf = f Iu of the set U” c E* provided with the w*-topology into @(U, A) is a homeomorphism between U” and RU But, as is easily verified, RU” is closed. Consequently it is compact. 0
From the Banach-Alaoglu Corollary
theorem and the Krejn-Mil an
1. U” coincides with the w*-closed
theorem follows
convex hull of the set extr U
Now let E be a normed space. Then we have on E* at least three natural locally convex topologies: the strong topology defined by the norm of the linear functionals (with respect to which E* is a Banach space); the weak topology defined by the family of strongly continuous, i.e. bounded (for the norm), linear functionals; and the w*-topology 36. Applying the Banach-Alaoglu theorem to the open unit ball D t E we obtain the closed unit ball of E* as Do. Thus we have established Corollary 2. Zf E is a normed space the closed unit ball in the Banach space E* is w *-compact. By the Krejn-Mil an theorem the ball Do is the w*-closed convex hull of its set of extreme points. Hence it is clear, incidentally, that a Banach space need not be the conjugate of some normed space; for example the spaces C[O, 11, L’ [0, l] are not conjugate spaces (even up to isomorphism). The w*-compactness of the closed unit ball in the space of measures C(S)* (S compact) is the content of a theorem of Helly. The spaces of continuous functions on compact sets have the special property of universality with respect to the class of all Banach spaces. This also follows from the Banach-Alaoglu theorem. Corollary 3. Any Banach space E is isometrically isomorphic to a closed subspace of the space of continuous functions on some compact set. 35For any non-empty X c E the set X0 = {f: f E E*, supxEX If(x)l < 1) is called the polar of the set X. The polars of a set and of its absolutely convex hull clearly coincide. It is also clear that X0 is absolutely convex and w*-closed. The term olar’ is also used in a relative sense, i.e. for the intersection of X0 with subspaces of EY (in particular, with E*). The notation X0 is retained in this situation. If X is a subspace then X0 is its annihilator: X0 = XI. 36This is also often called weak in view of the fact that it is weaker than the norm topology. In order to avoid confusion we will not do this (sometimes we call the w*-topology the weakened topology).
146
Chapter 2. Foundations and Methods
Proof. The canonical isometry u: E + E** associates with each x E E the function (ox)(f) = f(x) which is w*-continuous on the ball Do. The norm of this function in C(DO), i.e. maxfED If(x is equal to IIxII. 0
If E is separable the w*-topology on the unit ball Do c E* is metrizable. Now, as is well-known, any compact metric space is the image of the Cantor set 6 c [0, l] under some continuous mapping. Thus in the case under consideration C(D”) is isometrically isomorphic to a subspace of C(G) and this in turn is isometrically isomorphic to a subspace of C[O, 11. Thus we have the Banach-Mazur Theorem. Any separable Banach space is isometrically to a closed subspace of C[O, 11.
isometric
We give a further application of the metrizability of the w*-topology on Do in the separable case. Let E be a separable Banach space with a closed cone K. Then there exists a continuous linear functional f > 0 (M.G. Krejn, M.A. Rutman, 1948). For the proof we note that the compact set Do, being metrizable, is separable. Consequently there is also a countable dense subset {f.}? of the intersection Do n K*. It just remains to put, for example, f = c,, 2~ ,. Remark. Separability of the space E in the last assertion is essential. Any Banach space E coincides with the subspace of E** consisting of all w*-continuous linear functionals on E*. Thus if E is reflexive (and only in this case) each continuous linear functional on E* is w *-continuous. The weak topology on E is induced by the w*-topology of the space E**. With respect to this topology E is dense in E**, since for any < E E** and any finite collection (fi); c E* the system of linear equations J(x) = <(A) (1 < i < n) has a solution x E E. The unit ball D c E is also w*-dense in the unit ball D** c E** which contains it, since otherwise there exists an element 5 E D** which is separated from D by a w *-continuous linear functional; this is a functional f E E* such that r(f) # 0 and flD = 0 - a contradiction. Theorem. For reflexivity of a Banach space E it is necessary and sufficient that the ball llxll < 1 be weakly compact. Necessity.
This follows from Corollary
2 of the Banach-Alaoglu
theorem.
Sufficiency. The image of the closed ball in E** is w*-compact and consequently w*-closed. On the other hand it is w*-dense in the closed ball of the space E**. These balls therefore coincide and then we also have E = E**. 0 Corollary. A necessary and sufficient condition for each closed bounded convex set in a Banach space E to be weakly compact is that E be reflexive.
This result is extremely useful in the theory of extremal problems, in particular in the theory of approximation. Reverting to the latter, let us consider a nonempty closed subset Yin any metric space E. For any point x0 an element y, E Y is called a best approximation in Y if
8 3. Linear Topology
4x0,
147
yo)= inf 4x0,Y) (sf(x,,
Y)).3’
YEY
If E is a reflexive Banach space then for any non-empty closed convex set Y c E and any point x0 E E a best approximation y, E Yexists. In fact it is sufficient to look for y, in the intersection Yeof the set Y with the ball d(x,, y) < d(x,, Y) + 1. Now Ye is weakly compact and the functional d(x,, y) = [lx0 - yll is weakly lower semicontinuous. Therefore d(x,, y) attains its inlimum on Y,. The condition of reflexivity is necessary because of James criterion. We note in passing that the theorem on the uniqueness of the best approximation for all x0 E E and all convex Y c E holds if and only if E is strictly convex. Thus in a uniformly convex Banach space E any non-empty closed convex set Y c E (in particular, any closed subspace) is a Chebysheu set in the sense that a best approximation y, E Y exists and is unique for each point x0 E E. N.V. Efimov and S.B. Stechkin have investigated the general problem of the convexity of Chebyshev sets (1961). Example. Let us consider the problem of the best approximation in a Hilbert space E38. Let L c E be a closed subspace, let x0 E E and let y, be the best approximation in L for x,; put z0 = x0 - y,. Then for any y E L the function CPM = lb0 + ~~11~= llzol12 + 22 reh
Y) + ~zll~l12
(r E w
has a minimum at the point r = 0, hence re(z,, y) = 0. Consequently, (z,,, y) = 0 (in the real case this is immediate, while in the complex case it comes from substituting iy for y in the result). Thus y, E L and x0 - y, is orthogonal to the subspace L (i.e. to every vector in L). These properties characterise y, geometrically as the orthogonal projection of the vector x0 on the subspace L. The value de of the deviation of x0 from L is calculated by Pythagoras theorem: d, = Jcll%ll”
- llYol12).
3.7. Theory of Duality. Let E and F be two linear spaces and let (x, y) (x E E, y E F) be a bilinear functional such that Vx # 0 3y with (x, y) # 0 and Vy # 0 3x with (x, y) # 0. Then we say that there is a duality between E and F or that (E, F) is a dual pair of spaces. Example 1. Let E be any linear space, let F = E put (x, f) = f(x). Clearly this is a dual pair. Example 2. If E is an LTS (E, E*) is a dual pair with (x, If (E, F) is a dual pair there is total on E. Thus F defines
its algebraic conjugate, and
and E* is total (for example, if E is an LCS) then f) = f(x). is a natural embedding of F in E# under which F a locally convex topology on E which is denoted
(x,,, y,,) is called the deviation of the point x,, from the set Y “*i.e. we are concerned with the methodof least squares.
148
Chapter 2. Foundations and Methods
by o(E, F). The space E*, the conjugate of E in this topology, coincides with F. Now we can consider the w*-topology on F = E*. Clearly it coincides with the topology o(F, E) for the dual pair (F, E) and consequently F* = E. In this way a complete symmetry between the linear topology space?’ E and E* is obtained. This symmetry also appears in the following property of reciprocity of polars. Theorem. Let (E, F) be a dual pair, X c E (X # 0) and Y = X0, the polar of X in F. Then Y”, the polar of Y in E, coincides with the a(E, F)-closed absolutely convex hull 2 of the set X. In particular, if X is absolutely convex and o(E, F)closed then X = Y” (i.e. X0’ = X). Proof. Since 1(x, y) 1 < 1 (x E X, y E Y) we have X c Y” and hence 8 c Y”. Let x0 I$ 2. By the theorem on the separating hyperplane, there exists y E F such that 1(x, y)( < 1 (x E 2) and 1(x0, y)l > 1. But then y E Y and consequently x0 4 Y”. It follows that 2 = Y”. IJ Corollary 1. Let (E, F) be a dual pair, L a subspace of E and N = L’, the annihilator of L in F. Then the annihilator NL of N in E coincides with the a(E, F)-closure of L. In particular, if L is a(E, F)-closed then L = N1, i.e. (Ll), = L.
We note further that, clearly, for any family of sets {Xi} in E
Hence on replacing Xi by Xi” we obtain from the previous theorem Corollary 2. For any family of a(E, F)-closed absolutely convex sets Xi c E the polar of their intersection coincides with the a(F, E)-closed absolutely convex hull of the union of their polars X,!‘.
If (E, F) is a dual pair we say about a locally convex topology on E for which E* = F that it is consistent with the given duality. The weakest among such topologies is a(E, F). It turns out that the class of topologies which are consistent with the given duality can be described and, moreover, among these topologies there is a finest. Mackey-Arens Theorem. Let (E, F) be a dual pair. In order that a locally convex topology on E be consistent with the given duality it is necessary and sufficient that it be a topology of uniform convergence on some family of absolutely convex subsets of the space F which are compact in the topology a(F, E). Necessity. The polar U” of any neighbourhood of zero U c E is compact in the topology o(F, E) by the Banach-Alaoglu theorem, since by assumption F = E* and o(F, E) is the w*-topology. The initial topology T on E coincides
391n this connection, if E is an LTS to start with, its initial topology is to be replaced by the weak topology; moreover E* is preserved and it is provided with the w*-topology. f
;
0 3. Linear
Topology
149
with the topology of uniform convergence on the family {UO} where U runs through the family T of absolutely convex (open) r-neighbourhoods of zero in E. In fact, if U E T and XXEE, sup I(x,y)I < 1 , YEW 1 I then U c I$, since 1(x, y) I < 1 (x E U, y E U’), U” is compact and the function cp,(y) = 1(x, y)l on U” is continuous and, consequently, attains its maximum. On the other hand if WE T we can find U E T such that its a(E, E*)-closure 0 is contained in W. Thus 0 = Uoo I> VU and so VU c W. l’,=
Sufl~ciciency. The topology p being considered on E is stronger than a(E, F), hence F c E*. If f E E* then If(x)1 < 1 on a p-neighbourhood
u, = {x: x EE, I<x, Y> I < 1 (Y E Q>> (Q is an absolutely convex o(F, E)-compact for x E Q”, i.e. f E Qoo (the second polar is in the space E* 3 F and therefore closed and hence f E F. This shows that E* = F.
subset of F). Consequently If(x)1 < 1 taken in E*). Since Q is also compact in E* we have Qoo = Q. Thus f E Q 0
Corollary. If (E, F) is a dual pair the strongest topology on E which is consistent with the given duality is the topology of uniform convergence on all absolutely convex a(F, E)-compact subsets of F.
This topology is called the Mackey topology and is denoted by z(E, F). An LCS E whose topology coincides with 7(E, E*) is called a Mackey space. Any barrelled (and also any metrizable)
LCS is a Mackey space.
Remark. If (E, F) is a dual pair, the closure of any convex set X c E is the same for all topologies which are consistent with the given duality since, as a result of the theorem on the separating hyperplane, the operation of closure on the class of convex sets is determined by the set of continuous linear functionals. Taking this as his starting point, D.A. Rajkov (1962) constructed a purely algebraic theory of duality and obtained an algebraic variant of the KrejnMil an theorem. Many important questions in the theory of duality for LTSs are connected with the consideration of bounded sets. Theorem of Mackey. Zf (E, F) is a dual pair the same sets are bounded for all topologies on E which are consistent with the given duality. Proof. By the Mackey-Arens theorem it is sufficient to show that each o(E, F)bounded set X c E is z(E, F)-bounded. Moreover we need consider only the case where X is countable: X = {x.}?. If it is not r(E, F)-bounded there exists {a,,}? c R, such that lim,,, a, = 0 but {a,,x y does not tend to zero in the topology z(E, F). Then we can find E > 0 and an absolutely convex o(F, E)compact set Q c F such that maxy.o I( a,x,, y)l > Efor an infinite set of suffices Y (we can suppose that this holds for all n). There is therefore a sequence
150
Chapter 2. Foundations and Methods
~Y,>Y’ = Qsuchthat<x,,Y.> + cc. At the same time, sup” 1(x,, y) ( < co for each y E F because X is o(E, F)-bounded. This situation contradicts the BanachSteinhaus theorem. 0 Corollary.
In any LCS weak boundedness of a set is equivalent to its bounded-
ness.
We introduce for any LT.3 E the uniform topology /I(E*, E) on E* which is defined to be the topology of uniform convergence on all bounded subsets X c E. For a normed space E this topology coincides with the norm topology of E*, i.e. its strong topology. Clearly &E*, E) is always finer than the w*-topology4’ and the strengthening of the topology can lead to an enlargement of the conjugate space. The space of uniformly continuous linear functionals on E* is called the second conjugate of E and is denoted by E **. The standard canonical homomorphism u: E + E** is injective if E* is total. In this case we have an embedding E c E** in place of the equality under the weak topology. The space E is said to be (topologically) reflexive if E = E** and the initial topology on E coincides with /J(E**, E*). In other words, reflexivity of E means that u is an isomorphism of the LTSs E and E**. Moreover E is necessarily an LCS. If E is reflexive, it is clear that E* is also reflexive under its uniform topology. Theorem. In order that an LCS E be reflexive it is necessary and suflicient that it be barrelled and each weakly closed bounded set X c E be weakly compact. For the proof we note that, in terms of duality, barrels in an LCS E can be characterised as the polars of w*-bounded4’ subsets Y c E*. Therefore any barrel in E contains a neighbourhood of zero for the topology on E induced by /?(E**, E*). It is clear from this that if E is reflexive it must be barrelled. The second necessary condition follows from the Banach-Alaoglu theorem. Conversely, if the stated conditions are satisfied, we have t?(E*, E) = T(E*, E) and hence by the Mackey-Arens theorem, /?(E*, E) is consistent with the duality, i.e. E = E**. Moreover /?(E, E*) is stronger than the initial topology on E for the reasons set out above. On the other hand, any /?(E, E*)-neighbourhood of the form u,,, = {x: IfWl < E U-E Y,}, where Y c E* is a bounded set, contains a barrel, namely the polar of the bounded set 2/e Y and so if E is barrelled there is a neighbourhood of zero for the initial topology contained in Ur,,. q Thus weak compactness of the bounded weakly closed sets is a criterion for reflexivity of a barrelled LCS (in particular, a locally convex Frechet space). A barrelled LCS in which all the closed bounded sets are compact is called a Monte1 space. All finite-dimensional spaces are of this type according to the classi@However the topology j?(E*, I$) may not be consistent with the duality. 41 They can also be assumed to be absolutely convex.
Q 3. Linear
Topology
151
cal Borel-Lebesgue theorem. Among Banach spaces only the finite-dimensional ones are Monte1 spaces, but infinite-dimensional Monte1 space are found among Frechet spaces, for example42, C” [0, 11. The space ,4(G) of functions which are analytic on the domain G c C” provided with the topology of uniform convergence on compact sets is a Monte1 space ( Weierstrass-Mantel theorem). It follows from the general criterion for reflexivity that every Monte1 space is reflexive. Moreover the conjugate of a Monte1 space is a Monte1 space under its strong topology. The determination of weak compactness of subsets can be difficult, because the weak topology is not in general metrizable. The following result is useful in this situation. Eberlein-Shmul an Theorem. Zf E is a complete LCS and the subset X c E is such that each sequence (xn}y c X has a weak limit point, then its weak closure X is weakly compact.
In the proof of this theorem we use the following criterion for a@*, E)continuity of a linear functional on E* which goes back to Banach. Theorem. Let E be a complete LCS. Zf a linear functional on E* is w*-continuous on the polars of the neighbourhoods of zero in E, then it is w*-continuous (and consequently it is generated by an element x E E). Proof. Let us consider the linear space L of all linear functionals on E* which satisfy the condition of the theorem. Clearly L I> E and (L, E*) is a dual pair. We can introduce on L the topology p of uniform convergence on the polars of the neighbourhoods of zero in E since the functionals in L are bounded on these sets. This locally convex topology induces the initial topology on E and, since E is complete, it is therefore closed in L. By the Banach-Alaoglu theorem the polars of the neighbourhoods of zero in E are compact in the topology o(E*, E). On each such set P the topology a(E*, L) is on the one hand stronger than a(E*, E), since L 3 E, while on the other hand it is weaker, since it is, by construction, the weakest topology under which all the glP (g E L) are continuous. Consequently p is a topology of uniform convergence on a certain family of absolutely convex o(E*, L)-compact sets. By the Mackey-Arens theorem it is consistent with the duality of the pair (L, E*), i.e. L* = E*. But then if f E L* and f IE = 0 we have f = 0. Thus L = E. 0
Turning to the proof of the Eberlein-Shmul an theorem, we stress that the non-triviality of the assertion lies in the fact that E need not be weakly complete.43 However the space (E*)” is complete in the w*-topology and therefore Y, the closure of X in (E*)#, is compact. It remains to show that Y c E, i.e. to establish for any functional y E Y its w *-continuity on E* which, by the preceding theorem, reduces to the w*-continuity of the restriction ~1,~ on the polar of any 42 The topology
of C” [0, l] is defined lIdIn = max ObkSn
43 For example,
the Banach
space C(X),
by the countable max OSx+l where
Iv l
family
of norms
(n=0,1,2
X is an infinite
compact
,... ). set, is such a space.
152
Chapter 2. Foundations and Methods
neighbourhood of zero U c E. It suffices to establish the absence of this for some E > 0 we can construct and {f y c U” such that If,(z)1 > E for all m, lf,(z Ifm(xn)l < +E for m > n. This leads to a contradiction on taking a weak limit w*-limit point f of {f ?. 0
continuity at zero. In inductively ix.}? c X x,)1 c 3.s for m < n and point x of {x.}? and a
We note further the following Shmul an-Dieudonne criterion: in a complete LCS a necessary and sufficient condition for the weak compactness of a weakly closed set X is that the intersection of any sequence V, 13 V, 1 9.. of closed convex sets having non-empty intersection with X be non-empty. A very general consideration of the problem of weak compactness was carried out by Grothendieck (1952) and further results were obtained by Ptak (1954). To the latter is also due the credit for discovering the deep connection between the theory of duality and topological properties of homomorphisms, which are well-known as Banach theorems on the closed graph, the inverse homomorphism and the open mapping. We will discuss these fundamental results later on but for the time being we will be concerned with corresponding aspects of duality. First of all we formulate a criterion for completeness of an LCS in terms of the conjugate space. Theorem. In order that an LCS E be complete, it is necessary and sufficient that a hyperplane in E* be w*-closed whenever its intersections with the polars of the neighbourhoods of zero in E are we-closed.
For the proof we associate with an arbitrary LCS E the space E^ of linear functionals f on E* for which all the sets f- O) n U” are w*-closed (U runs through the neighbourhoods of zero in E) or, equivalently, whose restrictions f IuO are w*-continuous. We embed E in ,!? in the canonical way and introduce on i the topology of uniform convergence on the polars U The preceding theorem follows from the following assertion. Lemma. For any LCS E the space .@is its completion, i.e. 1) E is complete, 2) the topology of E is induced by the topology of i?, 3) E is dense in E^. Proof The completeness of E^follows from the w*-compactness of each polar U” and the completeness of the space of continuous functions C(U . The coincidence of the induced and the initial topologies on E is trivial. Finally the topology on E^ is consistent for the dual pair (fi, E*) by the Mackey-Arens theorem, i.e. @)* = E*. Moreover if E is not dense in E^there exists f E (I?)* such that f # 0, f IE = 0, which is a contradiction since f E E*. 0 Remark. The criterion for completeness of an LCS can be reformulated as follows: E is complete o each linear functional on E* which is w*-continuous on the polars of the neighbourhoods of zero is w*-continuous. An LCS E is said to be fully complete (or to be a Ptdk space) if each subspace of E*, whose intersections with the polars of the neighbourhoods of zero in E
Q3. Linear Topology
153
are w*-closed, is w*-closed. It follows from the previous result that a fully complete space is complete. The converse is false, however the following holds. Banach-Krejn-Shmul an
Theorem.
Each locally convex Frtchet
space is fully
complete.
3.8. Continuous Homomorphisms. Let E, and E, be LTSs. Continuity of a homomorphism h: E, + E, is equivalent to its continuity at zero, for which in turn it is sufficient (and for locally bounded E, also necessary) that h be bounded in the sense that the image of some neighbourhood of zero is bounded. Example. In the space Cm[O, l] differentiation d/dt is continuous but not bounded. For any continuous homomorphism h the images of the bounded sets are bounded, since if (x,)? is a bounded sequence and a, + 0, then tl,x, + 0 and hence a,(hx,) = h(a,x,) + 0. If E, is a normed space and the homomorphism h: E, --*E, maps bounded sets into bounded sets then h is bounded and so continuous. Example. Let E be an LCS and j? the same space but now provided with its weak topology, which we will asume to be different from the initial topology (as a rule this is generally the case). Then the identity homomorphism E”+ E is not continuous (and so not bounded) but it takes bounded sets into bounded sets. In the case of normed spaces E,, E, boundedness of a homomorphism h is equivalent to finiteness of its norm
llhll = sup IIWI. IIXII=1 In this case the set of bounded (i.e. continuous) homomorphisms of E, + E, is itself a normed space (even a Banach space if E, is a Banach space). For any LTSs E,, E, the set of continuous homomorphisms of E, + E2 is denoted by Homc(E,, E2). It is a linear space44 and can be turned into an LTS by the following standard procedure: a covering 52 of E, by bounded sets is chosen, after which a base of neighbourhoods of zero in Homc(E,, E,) is given by the sets NV,, = {h: hX c V}, where X E 0 and V is an arbitrary neighbourhood of zero in E,. This is the Q-topology or the topology of uni$orm convergence on the family a. It becomes stronger as the family Sz is extended but, at the same time, it is not changed if we adjoin to the sets of !2 all their subsets, finite unions and linear combinations; it is also unaffected if all the sets in Q are replaced by their closures and if E, and E, are LCSs, all the sets in Sz can be replaced by their closed absolutely convex hulls. If E, is an LCS and { pi)i. I is a family of basic seminorms the Q-topology on Homc(E,, E,) is defined by the family of seminorms 7Cx,i(h) = supx E x p i (hx) (X E Sz),i.e. Homc(E,, E,) turns out to be an LCS. 441t is even an algebra (associative, with unity) in the case E, = E,. We use the notation: Homc(E, E) = X(E).
154
Chapter 2. Foundations and Methods
Among the Gtopologies on Homc(E,, E,) the weakest is the topology of pointwise convergence and the strongest is the topology of uniform convergence on all the bounded subsets of E, . The latter is termed briefly the uniform topology. In the case Homc(E, K) = E* this is just the topology /?(E*, E). For normed spaces E,, E, the uniform topology on Homc(E,, E,) is generated by a norm4’. The topology of pointwise convergence on Homc(E,, E,) is sometimes called the strong topology46, since the weak topology defined in the case where Ez is total by the neighbourhoods ofzero {h: If( < E} (x E E,, f E Ez, E > 0)is also used (this is clearly locally convex). The topology of uniform convergence on compact sets is also contained in the class of Gtopologies. In this case we have the well-known criterion for precompactness 47 of a family of continuous (not necessarily linear) mappings of E, + E, (E, metrizable), namely equicontinuity of the restrictions to each compact set Q c E, and pointwise precompactness (Arzelri-Ascoli theorem). The following assertion follows in turn from the Banach-Steinhaus theorem (and also bears the same name4*). Theorem. Zf E, is a barrelled space and E, is any LCS then each pointwise bounded family F c Homc(E,, E,) is uniformly continuous and consequently uniformly bounded on some neighbourhood of zero.4g Proof. Let {pi}i.J be a family of basic seminorms. Let us consider the continuous seminorms p,(hx) (h E F) on E,. According to the condition xi(x) = sup,pi(hx) < co. Consequently the seminorms 7Ziare continuous and hence the uniform continuity of the family F follows immediately. 0 Remark. If an LCS E, is such that the Banach-Steinhaus theorem holds for E, = K (i.e. in E:) then E, is barrelled. In fact any barrel T c E, is the polar of a w*-bounded and therefore equicontinous set F c E:. But then on some neighbourhood of zero U c E, we will have 1f (x)1 < 1 for all f E F and hence
UCT
451f E is a normed space, the norm on U(E) satisfies the usual properties: I/h,h, 11< llhl 11Ilh,II, 11111 = 1. Thus the continuous linear operators on E form a Banach algebra f!(E) which is noncommutative if dim E > 1. We note here that in any Banach algebra the operation of multiplication is continuous. 461n the case of E* it coincides with the w*-topology. set X in an LTS E is said to be precompoct if it is relatively compact in the completion of the space E (a set is said to be relatively compact if its closure is compact). The property of precompactness is weaker than relative compactness (in a complete space they are clearly equivalent), but it is convenient to work with it in view of Hausdorff criterion: X is precompact o for any neighbourhood of zero U there exists a finite U-net, i.e. a covering of the set X by neighbourhoods of the form x, + U (1 < i < m). It is also called the principle of uniform boundedness. 4QIf E, is a normed space uniform boundedness on some neighbourhood of zero is equivalent to uniform continuity. If both spaces E,, E, are normed then uniform continuity of a family F is equivalent to boundedness in norm: sup,,, jlhll < co.
5 3. Linear
155
Topology
For any LTSs E,, E, and any h E Homc(E,, E,) the conjugate5’ homomorphism h*: Ef + ET is defined by the formula @*f)(x) = f&x) (x E E,,fe Ef). It is important to know for what topologies on the dual space h* will be continuous. Lemma. Let ET and E,* be provided respectively where hS2, c 52,. Then h* is continuous.
with Q,- and On,-topologies
Verification of this basic result presents no problem. In particular, h* is continuous if ET, E,* are provided with their w*-topologies or if both have their uniform topologies. In the latter situation if both spaces are normed then
Ilh*II = IIW We note in passing the duality relations of the homomorphism h E Homc(E,, E,) annihilators considered below are relative necessarily dual pairs). With no restrictions on the LTSs E, and
between the images and the kernels and of h* E Hom(Ez, ET). All the to the pairs (E,, E:), (E,, Ez) (not E, we have the formula
Ker h* = (Im h)’ and the inclusions Im h* c (Ker h)‘,
Ker h c (Im h*)l,
(the closure of Im h* is taken in the w*-topology Im h* = (Ker h)‘,
Ker h = (Im h*)l,
Im h c (Ker h*)l on51 ET). If Ez is total we have Im h = (Ker h*)l.
In all three cases equality is established by considering a continuous linear functional which annihilates the smaller space. In many applications we have to deal with discontinuous homomorphisms, however the weaker property of closedness of the graph (h E Hom(E,, Ed) I-‘,,= {k WL,el = El x E, usually holds; this is called closedness of the homomorphism itself. Example. Let E, = C O, 11, E, = C[O, 11, where the topology on C[O, l] is the usual one and the topology on C O, l] is that induced by C[O, 11. Differentiation d/dt: E, -+ E, is discontinuous but closed. Closed Graph Theorem (CGT). Let E,, E, be LCSs with E, barrelled and E, fully complete. Then if the homomorphism h: E, + E, is closed it is continuous.
ore precisely, the topological conjugate. Clearly h* = h#),; and moreover note the following formulae: 1) (hi + h,)* = h: + hz; 2) (ah)* = ah*; 3) (hg)* h E Homc(E,, E,) has a continuous inverse then h* is invertible and (h*)-’ = not be continuous. If E, is a reflexive Banach space the w*-topology on ET coincides with the normed space ET and the weak closure of any subspace of ET coincides with
Im h* c ET. We also = g*h*; 4) l* = 1. If (h-l)* but (h*)-’ may weak topology of the its strong closure.
156
Chapter
2. Foundations
and Methods
In the proof of the CGT we make use of the following closedness of a homomorphism.
general criterion for
Lemma 1. Let E,, E, be LCSs and let h E Hom(E,, E,). A necessary and sufficient condition for closedness of h is that the space A,, of those f E Ez for which h#f E E: be dense in Ef under the w*-topology5’. Proof. .For any LTSs E,, E, the conjugate space of E, x E, is naturally identified with E: x Ef in accordance with the formula cp(x, y) = g(x) + f(y) (g E E:, f E Ez). Moreover for any h E Hom(E,, E,) the annihilator of its graph I, is the subspace Ih’ of functionals of the form f(y - hx) where f E A,,. Now let Et, E, be LCSs and suppose that h is closed. If y E E,, y # 0 then, since (0, y) 4 I,, there is a functional f E A,, such that f(y) # 0. Conversely, if for each y E E,, y # 0 there exists a functional f, E A,, such that f,(y) # 0, then r,, = n, Ker f,, from which the closedness of the graph I, follows immediately. 0
We also require the following Lemma 2. Zf E,, E, are LCSs and E, is barrelled, then for any h E Hom(E,, E,) the intersections of the set A,, c Et with the polars of the neighbourhoods of zero V c E, are w*-closed. Proof. If the neighbourhood V c E, is absolutely convex, the closure of its inverse image is a barrel in E, and therefore it contains a neighbourhood U of zero in E,; the w*-closedness of the set A,, n V” follows from its representation by means of the system of inequalities 1f(hx)l < 1 (x E U), 1f(y)1 < 1 (y E V). 0 Proof of the CGT. By Lemma 1 the set A,, c Ef is w*-dense and by Lemma 2 it is w*-closed since the space E, is fully complete. Consequently A,, = Ez, i.e. h is weakly continuous. If V is an absolutely convex neighbourhood in E,, its closure v is weakly closed and therefore the inverse image h-’ v c E, is a barrel. But then there exists a neighbourhood of zero U in E, such that lJ c h-l c i.e. hUcE,:O
The following result is one of the most important
consequences of the CGT.
Inverse Homomorphism Theorem (IHT). Let E,, E, be LCSs with E, barrelled and E, fully complete. Zf the homomorphism g: E, -+ E, is bijective and continuous then it is a topological isomorphism, i.e. g-l is continuous.
In fact the inverse of a closed (and so of a continuous) homomorphism is closed. In a certain sense the next fundamental theorem is the dual assertion for the CGT. Open Mapping Theorem (OMT). Let E,, E, be LCSs with E, barrelled and E, fully complete and suppose that the homomorphism g: E, + E, is surjective and continuous. Then g is an open mapping.
or
any LCS E a subspace
N c E* is w*-dense
if and only if it is total.
5 3. Linear Topology
157
Proof. The space Ker g is closed 53, therefore we can pass to the factor space G = EJKer g and consider the bijective homomorphism y: G + E, associated with g by the relationship g = yj where j: E, --* G is the canonical homomorphism. Full completeness is inherited by factor spaces and y is continuous (because g is continuous and j is open). According to the IHT, y-’ is continuous, i.e. y is open, but then g is also open. 0 As Banach had already shown, all three theorems are true for Frechet spaces without the assumption of local convexity. Their applications are numerous and varied (the IHT is used particularly often). The first application which we note relates to complemented subspaces. A closed subspace L in an LTS E is said to be complemented if it has a closed complement s4 M. For examp 1e, a n y closed subspace of finite codimension, in particular any closed hyperplane, is of this type. In an LCS every tinite-dimensional subspace is complemented. If E is a Hilbert space, any closed subspace L c E is complemented, the orthogonal complement Ll = {y: (x, y) = 0 (x E L)}
being a closed complement for L. The decomposition L i L’ = E (in this case we also write55 L @L’ = E) is obtained by associating with each x E E its orthogonal projection x’ on L (moreover x - x’ is the orthogonal projection of the vector x on L*). Lindenstrauss and Tzafriri (1971) showed that if every closed subspace of a Banach space E is complemented then E is isomorphic to a Hilbert space. If L is a complemented subspace in an LTS E and M is its topological complement then the projection P onto L along M is closed. Thus if E is a barrelled fully complete LCS or a Frechet space, P is continuous. If P is a continuous projection on a Banach space E, P # 0, then llPl[ 2 1. If lIPI/ = 1 or P = 0 it is said to be an orthoprojection since in a Hilbert space this corresponds exactly to the orthogonal decomposition E = Im P @ Ker P. A bounded operator A on a Banach space is said to be a contraction56 if 11 AlI < 1 (a uniform contraction if IIA 11< 1). It is precisely in this case that distance is not increased: d(Ax, Ay) < d(x, y) (respectively d(Ax, Ay) < qd(x, y) where q < l), i.e. A is a contraction (uniform contraction) in the metric sense. We now apply the IHT to a problem connected with metrization of an LTS. Two invariant metrics d,, d, on a linear space E are said to be equivalent if they 53For any closed homomorphism 9. 541t is called a topological complement. In this connection we say that E decomposes into the topological direct sum of the subspace L and M. This concept extends naturally to any finite and then by means of closure also to any infinite collection of subspaces. A system of subspaces is said to be complete if the closure of its sum is the whole space E. e also use the term orthogonal sum. This term extends immediately to the case of an arbitrary family of pairwise orthogonal subspaces (if the family is infinite it means the closure of the algebraic orthogonal sum). 56If a contraction A is invertible and A-’ is also a contraction then A is an isometry. Isometries of a Hilbert space onto itself are called unitary operators.
158
Chapter 2. Foundations and Methods
define the same topology. We say further that d, is weaker than d2 (or subordinate to d,) if the corresponding topologies are in this relationship. If the inequality d,(x, y) < Cd&, y) holds for some constant C > 0 then d, is weaker than dl. Theorem. Zf the space E is complete under both metrics d, and d2 and also d, is weaker than d, then the given metrics are equivalent. Proof. The identity mapping acts continuously from the d,-topology d,-topology. According to the IHT we have a homeomorphism. 0
into the
Two norms II.II 1 and II * IIz on a linear space E are said to be equivalent if the corresponding metrics are equivalent. The relation of subordination is defined correspondingly and moreover a necessary and sufficient condition for this is that the inequality IIx II I < C IIx IIz be satisfied for some positive constant C. Thus for equivalence of the norms it is necessary and sufficient that a llxll z < I/XII 1 < b llxll z (x E E) where a, b are positive constants. Any two norms on a linitedimensional space are equivalent. From the preceding theorem we obtain Corollary. Zf a space E is a Banach space with respect to each of two norms, one of which is subordinate to the other, then these norms are equivalent.
Equivalent renorming is a powerful method for investigating topological properties of Banach spaces. We mention one example. The following easily established result holds in a Hilbert space: if x, + x weakly and IIx,(I + llxll then x, + x. It is very convenient to have this H-property in a Banach space. It does hold in a uniformly convex space (V.L. Shmul an, 1939) but in the general case it is lacking. M.I. Kadets (1957) showed that in any separable Banach space it is possible to introduce an equivalent norm possessing the H-property. He used this result in the solution of a problem of Banach on the homeomorphism of all infinite-dimensional separable Banach spaces. We note that in any separable Banach space it is possible to introduce an equivalent strictly convex norm (Clarkson, 1936) and also an equivalent smooth norm (Day, 1955). The separability requirement is essential. In 1941 Day showed that a reflexive Banach space need not have a uniformly convex equivalent norm. The possibility of introducing a uniformly convex norm is equivalent to the super-reflexivity of the space. This deep result was obtained by Enflo (1972). The concept of super-reflexivity, which is defined in the following way, is also deep. Let E, E, be Banach spaces. We say that E, is finitely representable in E if for each finite-dimensional subspace L c E, and each a > 1 there exists an embedding i: L + E which is a-close to an isometry:
a-l lbll G IIWI G allxll
(x E L).
The space E is said to be super-reflexive if each space which is finitely representable in it is reflexive. Super-reflexivity of E* is equivalent to super-reflexivity of E (James, 1972). As a corollary to Enflo theorem the complete duality between uniformly convex and uniformly smooth Banach spaces shows that the possibility
0 3. Linear
Topology
159
of introducing a uniformly smooth norm is also equivalent to super-reflexivity. Further in a super-reflexive space E there is an equivalent norm which is simultaneously uniformly convex and unformly smooth. Returning after this digression to other applications of the IHT, let ,us consider a separable Banach space E with a Schauder basis B = (ei):. Let E be the corresponding coordinate space. By introducing the norm IlPll = SUP i$ tiei (t = (5iE) n II II on E we turn it into a Banach space. The expansion in terms of B can be regarded as a homomorphism g: E + E which is clearly continuous ( 11911= 1) and bijective. According to the IHT, h = g-i is also continuouss7 and then
15.1< zllell < 2. ’ ’ Ileill
lIeill
IIXII.
Thus in a Banach space the coordinates of a vector with respect to a Schauder basis are continuous linear functionals (theorem of Banach). A Schauder basis (e,); in a separable Hilbert space E is called a Riesz basis if the set i? coincides with 12. In this case “:P
J
i& lti12 < C0
(t E G
and, by the Banach-Steinhaus theorem, II [ Ill2 < C II [ 11~.Since i is complete under both norms they are equivalent, i.e. 1: E^+ l2 is an isomorphism of the Banach spaces. Choosing an orthonormal basis (ai): in E we consider the isometry of l2 + E which is defined so that Si H Ui (i = 0, 1,2, . . .). Then the composite isomorphism E + E^+ l2 + E carries the Riesz basis (eJ$ onto the orthonormal basis (ui)$. Conversely, the image of an orthonormal basis under the action of any isomorphism of E + E is a Riesz basis, since under topological isomorphisms a Schauder basis carries over to a Schauder basis and the coordinate spaces are preserved. Thus we have Theorem of Bari. The Riesz bases in a separable Hilbert space E are the images of the orthonormal bases under all possible isomorphisms of E + E.
The Riesz bases therefore form an equivalence class with respect to the action of the group of automorphisms of the Hilbert space. Corollary. Any Riesz basis (ei)$ is unconditional sense that infi llei 11> 0, sup, (Iei II < CO.
and almost normalised
Toeplitz (1926) showed that any unconditional basis in a Hilbert space is a Riesz basis.
almost normalised
hus
9 is an isomorphism
of the Banach
spaces B, E.
in the
Schauder
160
Chapter 2. Foundations and Methods
Thus all unconditional almost normalised Schauder bases in a Hilbert space (which can be regarded as 1’) are equivalent. The situation is similar in I’, in only in the stated spaces (Lindenstrauss-Zippin, CO 58 and (up to isomorphism) 1969). Now we make a simple but useful remark relating to the theory of bases. Let (ei)g be a complete minimal system of vectors in a Banach space E. Then to each x E E there corresponds the formal expansion x N boo ci(x)ei, where (ri)~ c E* is the system which is biorthogonal to (ei)$. The partial sum S, = &o li(‘)ei (n = 0, 1,2, . . .) is a bounded projection onto Lin{ei};. If(ei)$ is a Schauder basis, the sequence (S,): converges strongly (to the identity operator) and consequently it is uniformly bounded (by the Banach-Steinhaus theorem): sup IIS I < co*
(18)
”
Conversely, it follows from (18) that (ei)g is a Schauder basis since if x E Lin(e,}$ there exists a number v = v(x) such that S,x = x for all n > v(x), i.e. (S,,x); converges to x on a dense subspace and therefore also on the whole of E (because of (18)). We now mention two examples which illustrate the effectiveness of the IHT in concrete analysis. The first of these is due to Banach (1931). Example 1. Let us consider the system of linear equations
(1% Any sequence x = (&.)y such that the series (19) converges with sum equal to vi for each i = 1,2, 3, . . . is called a solution of the system. It turns out that, if the system (19) has a unique solution for all (rli)~, then the rows of the matrix (4~) have finite support, i.e. (19) is a Toeplitz system. For the proof we consider the space R of all x = (&,)y for which all the series (19) converge and introduce on R a locally convex topology by means of the seminorms Pitx)
=
suP $1 ” I
(i = 1, 2, 3, . . .).
%ktk I
This is a Frtchet space. The system (19) defines a continuous homomorphism g: R + s, where s is the space of all sequences with the topology of pointwise convergence (s is also a locally convex FrCchet space). According to the condition g is bijective. Consequently, h = g-l is continuous. Since the topology on R is stronger than the pointwise topology, the linear functionals <,(hy) on s are continuous. But then (Y
=
(qj)?,
nk
<
ao).
s* cO is the space of sequences converging to zero with the sup norm. We note that co*z I
0 3. Linear Topology
161
The functionals <,(hy) (y E s) are linearly independent since this is true of &(x) (x E R, k = 1,2,3, . . .). Consequently, the rows of the matrix (/Jr) are linearly independent. By the theorem of Toeplitz the system of equations
has solution y = (qj)y for any (c&‘. But then cl, = &(hy) (k = 1,2, 3, . . .) and the series
kzl f converge. Because of the arbitrariness have finite support.
(i = 1,2, 3, . . .)
of (lk)y, the rows of the matrix (ai& must
Example 2. Let (A4.k be a sequence of positive numbers defining a quasianalytic class of functions on [0, 11. Let us consider the Banach space B(A4,) of infinitely differentiable functions cp(t) (0 < t < 1) for which 1 llcpll = sup max I# (t)l < co. n M. o
sup I(P n M,,
O)1 <*
and cpis uniquely determined by (cp O))~be cause of the quasianalyticity of the class. It is natural to ask if there exists for an arbitrary numerical sequence a = (a& satisfying the condition llall = sup H < cc n Mn a function cpE B(M,) such that cp O) = a, (n = 0, 1,2, . . .). The answer turns out to be negative except for the analytic case M, = O(n!). For the proof we consider the Banach space b(M,) of sequences satisfying condition (20) and the homomorphism g: B(M,) --, b(M,) defined by the formula (gcp), = cp O). Clearly g is continuous ( I(g II = l), injective and, in the case where there is a positive answer to the question posed, bijective. But then h = g-l is continuous. Let a, = o(M,) (n + co). Then a = 2 anan, n=O
where the series converges in b(M,). Consequently
(W(t) =n=O f an;, . where the series converges in B(M,)
and so in particular
at the point t = 1.
162
Chapter 2. Foundations and Methods
Moreover
Setting a, = &,M,
we obtain Mk < Ck!.
The last example of the application of the IHT which we present here is concerned ith the regularisation of Noetherian operators. Let E be any LTS in which the IHT holds and let T be a Noetherian operator on E whose image Im T is closed5’ (its kernel Ker T is also closed since it is finite-dimensional). The regularisors for T are defined by the formulae (see Section 1.3) R, = (1 - .4)FT- , $ = ? , where A is a projection onto Ker T, P is a projection onto Im T and T = TIKerA. The projections A, P can be chosen to be continuous and FT-’ is continuous in accordance with the IHT. Therefore both regularisors turn out to be continuous. A sufficient condition for the preservation of the Noetherian property, normal resolvability and the value of the index under the change from T to T + V, where I/ E 2(E), is that the operators 1 + R,V and 1 + VR, be invertible. An effective formulation is obtained from the following general result: in any Banach algebra the elements of the form e - x with I(xIJ < 1 are invertible.60 Thus in the Banach space E it is sufficient to impose on the perturbation V the condition of smallness of norm (Atkinson, I. Ts. Gokhberg, 1951), namely
IIVII < max(llRrII-
IIW-9.
Invertibility of the operator 1 - U where II UII -C 1 is a simple but very useful result. In terms of the equation x = Ux + y it implies unique solvability for all y (this also follows from the contraction mapping principle). Among its many applications there is, for example, the theorem on stability of buses: if (ei): is a Schauder basis in a Banach space E and (ci)g is a sequence of coordinate functionals then each system of vectors (ni); satisfying the condition
iz IICII * llei - 411< 1 is a Schauder basis in E (M.G. Krejn, D.P. Mil an, the identities
X = 2 liei,
UX
=
igo
rifei
M.A. Rutman, -
1940). In fact
4)
i=O
define a linear operator U with norm less than 1, therefore S = 1 - U is an automorphism; but Se, = ui (i = 0, 1, 2, . . .). n operator T with closed image is said to be normally resolvable. This term is connected with the equation TX = y. If E* is total, Im T = (Ker 7’*)I. In this case normal resolvability is equivalent to the identity Im T = (Ker T*),. 601n fact, the Neumann series ~~=oxk converges absolutely (since jlx’(I < Ilxll’) and its sum is (e - x)-i. We note that it follows from this result that the set of invertible elements of a Banach algebra is open.
163
5 3. Linear Topology
Here is a further example. Wiener-Paley theorem. Zf (ei)g is an orthonormal system (ui)$ which satisfies the inequality
Iliz %(eiI
%)I2
G
0 i$
Jail2
basis in a Hilbert space, any
(0<8<
1).
for all possible systems of numbers (ai)” is a Riesz basis. 3.9. Linearisation of Mappings. Let E,, E2 be Banach spaces and let G c E, be an open set. A mapping F: G + E, is said to be (Frtchet) dtfferentiable at the point x E G if there exists a bounded homomorphism g: E, + E2 such that
IIW + d - Fx - wll = 4II4I)
@E&, II4 -+O).
The homomorphism g is uniquely defined. It is called the (Frdchet) derivative of the mapping F at the point x and correspondingly it is denoted by F’(x) (further it can be written in the traditional way: dF(x) = F’(x) dx). From standard results of elementary analysis we note the formula of finite increments in the form 1 Fx, - Fx, =
F’(x, + z(x2 - x1))(x2
- x1) dz.
s0 The following are conditions which are sufficient for its validity: 1) [x1, x2] c G; 2) F is differentiable at all points of the interval [xi, x2]; 3) the derivative F’(x, + t(x2 - x1)) is a weakly continuous function of r (0 < z < 1). They are clearly satisfied in a convex domain G for a differentiable mapping whose derivative is weakly continuous. If moreover q = SUP,,~ IIF’(x)I) < co then the mapping satisfies the Lipschitz condition lb,
- Fx, II G 4 lb2 - XI II
(~1,
x2
E ‘3.
Consequently it is uniformly continuous and extends by continuity>0 a mapping F: G + E,. Let E, = E, and suppose that FG c G, q < 1. Then F is a uniform contraction and consequently there exists in G a unique fixed point Z; for any x0 E G the iterates FnxO converge to X with bound O(q”). For any differentiable mapping F: G + E, the mapping x H F’(x) is defined from the set G into the Banach space Homc(E,, E,). If this mapping is differentiable at some point x E G then by definition its derivative F”(x) (the second derivative of F) belongs to the space Homc(E,, Homc(E,, E,)). Subsequent derivatives are defined in a similar way. Linearisation of the mapping @: G + E, allows us to construct the Newton process x,+1
=
x,
-
[@‘(x,)]- x,
(n=0,1,2,...)
for solving the equation @x = 0. The convergence of this process and of certain of its modifications was investigated in detail by L.V. Kantorovich (1948). In all cases, if 1) the root x is simple in the sense that the operator Q’(X) has a bounded
164
Chapter 2. Foundations and Methods
inverse, 2) the second derivative W(Y) exists and is continuous in the uniform topology on some ball D(X, 6) = (x: 11x- Xl1 < S}, then for all x0 in a sufficiently small ball 0, the Newton process will be defined and will converge to the root X In fact, the formula Fx = x - [@(x)1- x defines a differentiable
mapping F: 0, + E, :
F x) dx = [F x)]- F x)
dx)[F x)]-lFx.
F x) is continuous in the uniform topology and F F) = 0. Therefore supxE,,. 11F x) 11< 1 for sufficiently small E. A further standard area for applications of linearisation is the local theory of differential equations. The language of functional analysis provides a single means for considering evolutionary systems, both finite-dimensional (systems of ordinary differential equations) and infinite-dimensional (partial differential equations, integro-differential equations). An bstract’ evolutionary system in a Banach space E has the form i = @(x, t), where x = x(t) is a vector-function with values in E defined on some interval Z of the real axis (we will assume further that it is on the semiaxis t > 0) and having continuous derivative i(t), and #: G x Z + E where G c E is, as before, some open set (for simplicity we will take G = E). If the mapping @ is continuous and in each ball D c E it satisfies a Lipschitz condition in x uniformly with respect to t on each finite interval, then the Cauchy problem which consists of finding a solution of the equation satisfying the initial condition x(O) = x0 has a unique solution on some interval 0 < t < E. The proof follows the classical scheme of Picard:
x(t) =XI-J +sf@(x(4,4
i.e. the required solution is a fixed point of the mapping defined by the right hand side on the space of continuous vector-functions on [0, E]. For sufficiently small Ethis mapping is a uniform contraction on a neighbourhood of the constant x0. The requirement of continuity of the mapping @ is not satisfied (in the Banach space setting) if, for example, @ is a differential operator. Therefore the scheme which has been described is not suitable for partial differential equations. Even for the investigation of the linear equation 1= Ax where the operator A is unbounded we require special methods which are connected with spectral theory (see Section 4.5).
$4. Theory
of Operators
165
0 4. Theory of Operators 4.1. Compact Operators. A homomorphism h: E, + E, of any LTSs is said to be compact or completely continuous if there exists in E, a neighbourhood of zero whose image is relatively cpmpact.’ Any compact homomorphism is clearly bounded and so continuous. The property of compactness is valuable because it is the basis of a satisfactory generalisation of Fredholm theory. Such a generalisation was effected for normed spaces by F. Riesz (191Q for LCSs by Leray (1950) and for arbitrary LTSs by Williamson (1954). The classical Fredholm theory is easily covered by these generalisations since linear integral operators are compact in the corresponding function spaces under the usual assumptions (for example, operators on C(S) (S compact) with continuous kernel or operators on L2(X, p) with Hilbert-Schmidt kernel are of this type). The compact homomorphisms form in Homc(E,, E,) a subspace Ci(E,, E,) which is invariant with respect to the natural action of !i!(E,) x !i?(E,): h H ahb (a E i?(E,), b E !i?(E,)). Thus (E-(E, E) = CC(E) is a twosided ideal in the algebra L?(E). If E, is complete c(E,, E,) is closed in the uniform topology. The subspace B(E,, E,) of continuous, finite rank homomorphisms and consequently its uniform closure g(E,, E,) are contained in 6(E,, E,). There is considerable interest in the question of when (I;(E,, E,) and g(E,, E,) coincide, i.e. of the possibility of uniform approximation of an arbitrary compact homomorphism by continuous, finite rank homomorphisms. In fact Fredholm constructed his theory of integral equations in this way by approximating their kernels by degenerate ones, i.e. by kernels of the form
Let E, be a barrelled LCS with a Schauder basis (ei)g and let h be a compact homomorphism of any LTS E, into E,. Let us consider the partial sums
S,X= k tiei
(x=2&i).
i=O
The family of linear operators S, (n = 0, 1,2, . . .) is equicontinuous (by the Banach-Steinhaus theorem). Therefore if U is a neighbourhood of zero in E, such that hU is relatively compact, (S,h),” converges to h uniformly on U and consequently this convergence is uniform on each bounded set X c E,. But all the S,h are clearly of finite rank and continuous. Thus in the case under consideration S(E,, E,) = CC(E,, E2). We say that an LTS E has the approximation property if the identity operator (and hence also any continuous operator) is a limit point of the set g(E, E) = g(E) 1 A continuous mapping X -P Y of any HausdorlT topological spaces is said to be compact a covering of the space X by open sets whose images in Y are relatively compact.
if there is
166
Chapter
2. Foundations
and Methods
of finite rank operators under the topology of uniform convergence on relatively compact sets. We saw above that any barrelled space with a Schauder basis has the approximation property. We can show as above that if E, has the approximation property then g(E,, E,) = 6(E,, E2) for any LTS E,. The converse of this assertion holds in the category of Banach spaces. In the example constructed by Enflo of a separable Banach space with no Schauder basis even the approximation property is lacking. Remark. Let El, E, be reflexive Banach spaces, at least one of which has the approximation property. If the Banach space Homc(E,, E2) is reflexive (and only in this case), every continuous homomorphism of E, + Ez is compact, i.e. Homc(E,, E,) = CC(E,, E,) (Holub, 1973). Passing directly to Fredholm theory in any LTS E, we denote by A an arbitrary compact linear operator on E and fix a neighbourhood of zero lJ for which the image AU is relatively compact. A theory will be constructed for the operator T = 1 - A, one of its basic results being that T is a Fredholm operator (i.e. it is Noetherian and ind T = 0) and it is normally resolvable (i.e. Im T is closed). We note at once that the subspace Ker T is finite-dimensional since AIKcrT = 1 and consequently Ker T is locally compact. Lemma 1. The operator T is normally resolvable. Proof. Using the compactness of A, it is easy to verify that if X c u and X is closed then TX is closed. Therefore the ylinder’ V = T-‘(Tu) = u + Ker T is closed while at the same time the cylinder V, = T-‘(TU) = U + Ker T is open. Consequently 2 = V\ V, is a closed set. Its nflation’ 2 = {x: x = az (a E 2, a 2 l)} together with the cylinder V covers the whole space E. But TV = To is closed. It remains to establish the closedness of the set Tz = fi. This follows from the fact that the set T(u n v,) is closed and does not contain zero. 0 Corollary.
The subspaces E, = Im T” (n = 1,2, 3, . . .) are closed.
The Fredholm Alternative. Either the operator T-’ is continuous) or Ker T # 0.
T is invertible (and in this case
Proof. Suppose Ker T = 0. Using the compactness of Tit is easy to verify that the homomorphism S which is inverse to E 5 Im T is continuous Let us suppose that Im T # E and take x 4 T. We consider the n-dimensional subspace L, = Lin{Tkx};-’ (n > 1, Lo = 0). Since AIL, is injective and maps L, n 0 into a compact set we have that L, n 0 is compact. Because of the closedness of Im T and the continuity of S we can construct a neighbourhood of zero U’ such that (L, + U n Im T c TU. Now we take a balanced neighbourhood of zero u” such that u” c U n u’ and choose in L, n 0” (n 2 1) any point x, & L,-, + U Clearly x, # x, (n # m). We have x, = TY,-~ + a&, where y,-, E L,-,. Since Ty,-, E L, + t?” then TY,-~ E TU and hence y,-, E U because T is injective. Consequently the set ’ Under
additional
assumptions
about
E this is guaranteed
by the inverse
homomorphism
theorem.
5 4. Theory of Operators
167
{Ay,} is relatively compact. But then it must be finite since Ay, - Ay, z -x,+,(mod
L,)(n > m)
and hence Ay, - Ay, $ U”. As a result the set (x contradiction. 0
turns out to be finite - a
Now let us consider the root subspaces W, = Ker T” (n = 1,2,3,. . .). Since the intersections W, n 0 are compact, if W, # W,-, (n = 2,3, . . .) we can choose for each n a point x, E (W, A u)\(W,-, + U). Then Ax, - Ax, $ U (n > m) in spite of the relative compactness of {Ax,}. Consequently there exists a number v such that W, = W,,, = . . * (W, # WV-,). The maximal root subspace W, is tinitedimensional. The operator T maps the invariant subspace E, = Im T’ into itself and moreover E,+1 is its image. But Ker T n E, = 0. By the Fredhohn alternative E,+1 = E, and then E, = E, (n > v), in particular E,, = E,. Therefore for any x E E there exists y E E such that T = T , i.e. x - T E W,. Thus we have established Lemma 2. The following
direct sum decomposition
holds:
E = Ker T’ i Im T Corollary.
The operator
T is a Fredholm operator.
Proof. The codimension of the subspace Im T r> Im T’ is clearly finite and equal to the codimension of Im( T lKerTY) (since T Ii,,, rV is bijective). As a result it is equal to dim(Ker T) = def T. The conjugate operator A* of a compact operator A on a Banach space is compact (theorem of Schauder), however this is false in general even in an LCS. Nevertheless, in any LTS def T* = def T since Ker T* = (Im T)* and Im T is a closed subspace with codimension equal to def T. Since T” = 1 - A,, where A, is a compact operator, and (T * = (T*) we have def(T*)” = def T” (n = 1, 2, 3, . . .). The property of compactness of an operator on an infinite-dimensional space is a special kind of smallness. This is evident in particular in the result that the index is stable under compact perturbations (Atkinson, I.Ts. Gokhberg, 1951). Theorem. Let S be a normally resolvable Noetherian operator on an LTS E for which the IHT holds and let V: E + E be a compact operator. Then S + V is Noetherian, normally resolvable and
ind(S + V) = ind S. Proof. The regularisors R,, R, for S are continuous.
In the usual notation
we
have R,(S+
V)=l+(R,V-A),
(S + V)R, = 1 + (VR, - B).
Since the parts added to the identity operator are compact we have that R,(S + V) and (S + V)R, are normally resolvable Fredholm operators, from which all the assertions follow. 0
168
Chapter
2. Foundations
and Methods
One of the most important areas of application of this theorem is the theory of singular integral equations 3. The kernel of a singular integral operator is frequently the sum of a standard singular kernel (for example, the Cauchy kernel ([ - z)-‘) and a regular4 kernel which defines a compact operator. In this case the calculation of the index is reduced to that for the standard kernel. Compactness of integral operators with regular kernels is brought about by the existence of finite rank approximations. Moreover the properties of an operator are better the more rapidly it can be approximated by finite rank operators. This observation leads us to define the approximation numbers (anumbers) of a compact homomorphism h: E, + E, of normed spaces as its deviations (n = 0, 1,2, . ..) a,+,@) = g*; Ilh - 911 n from the subspaces ?j, = &(E,, E,) = {g: g E 7j(E,, E,), rk g < n]. Clearly, llhll = a,(h) 2 a,(h) 2 . . . and a,(h) + 0 if and only if h E $j(E,, E,). Putting G,(E,, E,) = ‘&JX,, E,) we define further the class G,(E1, E2) (0 c p < co) by the requirement
The spaces GJE,, E,) (0 < p < co) are invariant with respect to the action of f?(E,) x f?(E,); consequently G,(E) = G,(E, E) is a two-sided ideal of the algebra WI. For any h E GJE,, E2) the quantity
N,(h) = (.fl b.(h)lp~ defines a quasinorm. Thus G,(E,, E,) is a complete LTS. If E, and E, are Hilbert spaces then N, is a norm. In this case the class G,(E,, E,), whose elements are called Hilbert-Schmidt homomorphisms, is of special interest. Suppose that E, and E, are separable and let (ej”);, (e~“)~ be orthonormal bases in E,, E, respectively. In order that h E Homc(E,, E,) be a Hilbert-Schmidt homomorphism it is necessary and sufficient that i$ Ilhei lZ =
I(he$
ej2 12< co,
i, j=O
since this quantity is also N2 (the Hilbert-Schmidt norm). Any integral operator on L X, p) with a Hilbert-Schmidt kernel is a Hilbert-Schmidt operator, for example, the operator on l2 defined by the matrix (aji)Ti=o such that
,I, 3 In this context S.G. Mikhlin in a Hilbert space. 4 Or eakly singular
(1947) discovered
lclji12< 00. the stability
ofthe
index under
compact
perturbations
44.
Theory
of
Operators
169
One of the numerous applications of the class $ in a Hilbert space E is concerned with the theory of bases. Let (I@ be an orthonormal basis in E and (ui): an o-linearly independent system which is quadratically close to the basis (ei); in the sense that
Then it is a Riesz basis and it is called a Bari basis (in honour of N.K. Bari who established the result in question in 1951). The proof is based on the fact that the operator defined on E by the equations Aei = e, - ui (i = 0, 1,2, . . .) is a HilbertSchmidt operator and so is compact. If T = 1 - A then Tei = Ui (i = 0, 1,2, . . .). But Ker T = 0 because of the o-linear independence of the system (Ui)~ . Therefore T is an automorphism of the Hilbert space E. Remark. M.G. Krejn (1957) characterised a Bari basis (Ui)gmby means of a certain collection of intrinsic properties, namely, 1) completeness, 2) o-linear independence, 3) quadratic closeness of the Gram matrix ((Ui, Uj))Tj=o to the identity, i.e.
i go I(%, uj) - 4j12 < C0* Let E,, E2 be Banach spaces. A homomorphism nuclear if it can be expanded in a series of the form
h: E, + E2 is said to be
(21)
where fk E ET, uk E E,, llfkll = 1, )IuJ = 1 and & (~1 < co; thus series (21) converges in the uniform operator topology so that all nuclear homomorphisms are compact. The class of nuclear homomorphisms %(E,, E,) in Homc(E,, E2) is an L?(E,) x g(E,)-invariant subspace (in f?(E) it is a two-sided ideal 3(E)). The nuclear norm in YI(E,, E,) is defined as v(h) = inf ck lcxkl over all representations of the form (21). Clearly llhll < v(h). With respect to the norm just introduced ‘%(E 1, E2) is a Banach space. Pietsch (1963) showed that all homomorphisms of the class 6, are nuclear and moreover for any p > 0 there exists a nuclear operator on 11 which does not belong to (Sp. In a Hilbert space the class of nuclear operators coincides with 6,. If A: E + E is a nuclear operator on a Banach space E we associate with each of its representations in the form (21) (f. E E*, u, E E) the number (22)
which we may call the truce5 of the operator A and denote by tr A whenever this 5 More precisely, not equivalent.
the tensor
truce.
There
are several
other
definitions
of trace which
in general
are
hapter 2. Foundations and Methods
170
number is independent of the representation under consideratio#. Grothendieck (1955) proved that the existence of the trace for all nuclear operators on E is equivalent to the approximation property; moreover in this case the trace is a continuous linear functional on %(E) (with respect to the nuclear norm v) and its norm is equal to one. This applies in particular to a Banach space with a Schauder basis. On Enflo space there exists a nuclear operator which does not have a trace, i.e. it is such that the quantity (22) depends on the representation of the operator in the form (21). In any Banach space all operators of class 6, have a trace (Pietsch, 1963). If a Banach space E has a Schauder basis (ei); then corresponding to each operator A E L?(E) there is a matrix a = (aji)~ii=, which is defined by the expansion Ae, = 2 ajiej
(i = 0, 1, 2, . . .).
j=O
We can consider its trace tr a = g ajj j=O
whenever this series converges. If A is a nuclear operator tr a exists and is equal to tr A and so the trace of the matrix of the operator does not depend on the choice of basis’ (G.M. Litvinov, 1979). Consequently, if the basis is unconditional,
(A.S. Markus, V.I. Matsaev, 1971). Let A be a nuclear operator on a Banach space E which has the approximation property and suppose that v(A) < 1. Then we can define the determinant det(1 + A) = exp{tr[ln(l
+ A)]},
wheres ln(1 + A)d; zl (- l)n- . The prior condition v(A) < 1 can be removed by the following means: whatever the nuclear operator A, the function D(n) = det(1 + LA) of the complex variable 6 Clearly tr A is a linear functional on its domain of definition. n this situation we can refer to the matrix trace of the operator A. n general, if cp(l) is an analytic function on the disk (II < r, for any operator A E Q(E) such that l[All < r we put m p 0) d-4 = 1 n!A *=0 This is one of the principal versions of jimctional caZculus in Q(E).
8 4. Theory
of Operators
171
1 is analytic o\n the disk 111< [v(A)]-‘. It can be shown that D can be continued analytically to the whole plane, i.e. it is an entire function. It remains to take as definition: det(1 + A) = D(1). Remark.
If
A = k=l2 gk(.bkT where rk E E, gk E E* and & llgkll ’llVk[l < Co, then det(1 + AA) = 1 + F c,A”, n=l
det(gjJujs)):,,,l (n = 1, 2, 3, . . .) (Koch-Grothendieck where cn=&,<...<jn mula - a generalisation of a well-known formula from linear algebra).
for-
Example. det(1 + g(-)u) = 1 + g(u). The theorem on multiplication of determinants extends to the situation just described. As a result of this it follows that det(1 + A) # 0 is a criterion for invertibility of an element 1 + A (A E s(E)) in the algebra %(E) with the identity adjoined. Continuous linear functionals on the Banach spaces 6,(E) where E is a Hilbert space can be described in terms of the trace, namely f(A) = tr(AF) where F E 6,(E)
- + - = 1 , and the correspondence thus established between : > 6,(E)* and 6,(E) is in fact an isometric isomorphism. For p = 2, i.e. in the space of Hilbert-Schmidt operators, we can introduce a scalar product c
(A, B) = tr(AB*), with respect to which &(E) (with the previous norm) is a Hilbert space. In this particular case the previous formula follows from the theorem of Riesz. The concept of a nuclear homomorphism carries over to the category of LCSs if we replace the conditions on fk, uk by the following: the family {fk} is equicontinuous, the family {&} is bounded and its closed absolutely convex null U provides a norm on 0 = Lin U with respect to which 6 is a Banach space. All nuclear homomorphisms are compact. The theory of the trace outlined above extends to nuclear operators on an LCS (G.M. Litvinov, 1979). Starting from the previous definitions Grothendieck (1955) introduced the important class of nuclear spaces as those9 spaces for which every continuous homomorphism into a Banach space is nuclear. The class of nuclear spaces is closed with respect to completions and passage to subspaces and factor spaces (and certain other
his
is one of several
equivalent
definitions.
172
Chapter
2. Foundations
and Methods
natural operations). Nuclear spaces are encountered frequently in analysis. In particular the usual spaces of infinitely differentiable and analytic functions and many natural spaces of sequences are nuclear. It is appropriate to note that all nuclear barrelled complete LCSs are Monte1 spaces (consequently they are reflexive). Thus if a space is nuclear and Banach, then it must be finite-dimensional. If E,, E, are LCSs a homomorphism h E Homc(E,, .I&) is said to be absolutely summing if each unconditionally convergent series c,,x” in E, is mapped to an absolutely convergent series 1. hx, in E,. If E,, E, are Hilbert spaces the absolutely summing homomorphisms are precisely the Hilbert-Schmidt homomorphisms (Grothendieck, 1956). The same author also established the following deep result. Theorem. Zf h: E, + E 2, g: E, + E, are absolutely of Banach spaces then gh: E, + E, is nuclear.
summing homomorphisms
From this we obtain immediately the Dvoretski-Rogers theorem on the existence in any infinite-dimensional Banach space E of a series of vectors which converges unconditionally but not absolutely. In fact, in the contrary case, the operator 1: E + E is absolutely summing, therefore nuclear (1 = 12) and consequently compact. Closely connected with the Dvoretski-Rogers theorem is Grothendieck criterion for nuclearity of a space: in order that a locally convex Frechet space be nuclear it is necessary and sufficient that all unconditionally convergent series of vectors in it converge absolutely. Associating with any LCS E the space I1 (E) of unconditionally convergent series and the space 1’{E} of absolutely convergent series, Pietsch (1963) extended this criterion to the general case. He provided these spaces with natural locally convex topologies such that the embedding i: l E) + l E) is continuous and for nuclearity of the space E it is necessary and sufficient that i be a topological isomorphism. The problem of the existence of a Schauder basis in a nuclear space has a rather different character than in a Banach space, since all nuclear locally convex Frechet spaces have the approximation property. However there also exists in this class a space with no Schauder basis” (N.M. Zobin, B.S. Mityagin, 1974). If there is a Schauder basis in a space of this class, then it is absolute (A.S. Dynin, B.S. Mityagin, 1960). A nuclear LCS has certain special structure. For its description we make use of (without discussion) the general topological concept of the projective limit of a directed family of topological spaces. A projective limit of LTSs (LCSs) is an LTS (LCS) which is complete if all the spaces in the given family are complete. Any complete LCS is isomorphic to a projective limit of Banach spaces. Any complete nuclear LCS E is isomorphic to a projective limit of a certain family of Hilbert spaces. If in addition E is metrizable, the family concerned can be chosen
lo We note that any nuclear
locally
convex
Frkchet
space is separable.
$4. Theory of Operators
173
to be a sequence (E,),” (with the natural ordering); in this sense E is countablyHilbert.
Let us pursue further the question of topologising the tensor product E, 60 E, of two LCSs E,, E,, which is closely connected with the preceding material. The algebraic object E, @ E, is obtained by factorising the space of formal linear combinations’ ’ c Ak(Uk x Vk) bk E El> vk E E2) k
by the linear hull of the set of all combinations
of the following types:
u x (VI + VJ - u x Vl - u x v,;
(Ul x u2) x v - u1 x v - u2 x v;
u x uv - a(u x v).
au x u - u(u x v);
The canonical image of the element u x v E E, x E, in E, @ E, is denoted by u @ v. If E is a further linear space then for any bilinear mapping 8: E 1 x E, + E there exists a unique linear mapping 4: E, @ E, + E such that the following diagram is commutative (j is the canonical mapping):
For any two seminorms p (I), pt2’ on E,, E, respectively we define their tensor product by
(p
@p
where the infimum
(w)= inf T p (uk)p
uk)
is taken over all decompositions
(w
E E,
C3 E,),
of the form
w=~u,@v,. k
This is a seminorm
on E, 6 E, which has the cross-property: (p
@ pq(u
@ 0) = p (u)p
v).
If E, , E, are LCSs their topologies are given by families of seminorms {p! , respectively. Then the family {p!‘) @ pr)} defines a locally convex topology on E, 8 E, which is the strongest under which the canonical mapping j is continuous’2. The completion of the space E, ($3E, with respect to this topology is denoted by E, @ E, and is called the topological tensor product of the spaces {pr)}
El,
E,.
Theorem of Grothendieck. If E,, E, are metrizable w E E 1 @ E, can be represented in the form
LCSs then each element
I1 i.e. the space of functions of finite support C&E, x E,, K). here are other natural topologies on E, @IE, but we will not discuss them here.
174
Chapter 2. Foundations and Methods
w = k=l2 ak(ukC3v,), where u& E E,, lim&+, u& = 0, u&E E,, lim&,,
II& = 0 and c;=i
lakl < 03.
In this sense all elements of E, 6 E, are nuclear. If E, , E, are normed spaces then E, & E2 is a Banach space with norm defined by the ,tensor product of the norms given in E, and E,. Particular interest attaches to the tensor product of the form E* @ E. In this case we have the natural homomorphism Tfk
z (
@
=
$fk( k
>
from E* @ E to S(E). In the Banach space case the topological tensor product E* @ E is also a Banach space. Since the inequality v(zw) G IIw 1)(v is the nuclear norm) holds for all w E E* @ E, then r can be extended by continuity to a homomorphism 2 E* &I E + %(E) which is clearly surjective. Consequently ‘%(E) z (E* @ E)/Ker z This topological
isomorphism
is an isometry.
4.2. The Fixed Point Principle. A famous theorem of Brauer asserts that if X is a compact convex set in a finite-dimensional linear space then any continuous mapping F: X + X has a fixed point13. It was extended to inlinitedimensional normed spaces by Schauder (1930) and then to LCSs by A.N. Tikhonov (1935). Theorem. Let E be an LCS, X c E a non-empty closed convex set and F: X + X a continuous mapping such that the set Y = FX is relatively compact. Then there exists a fixed point: x = Fx. Proof. If x # Fx for all x E X, there exists a neighbourhood of zero U such that x - Fx 4 U (x E X). Let V be an absolutely convex neighbourhood of zero such that V + V c U. We can construct a finite V-chain (x& + V}; on the set K The convex hull P c X of the set {xk}T is compact and lies in a subspace of dimension
$4. Theory
of Operators
175
this leads to the contradiction y - Fy = Py - Fy = I$1 Tj(PYi, - FYij) + 2 rjtFYij - Fy) ’ ’ + ’ c ’ j=l
for some
Zj
> 0
(Cj
Zj
= 1). 0
The fixed point principle has many important will go into the details of some of these here.
and interesting applications.
We
Example 1. Let us consider a Banach space E which is ordered by a closed cone K. Let T: E + E be a compact (in general non-linear) mapping which is monotone in the usual sense, i.e. x < y + TX < Ty, and positively homogeneous: T(ax) = aTx (a > 0, x E E). If under these conditions there exists a vector u 2 0 ( /lull = 1) such that TV > cu for some c > 0, then there exists an eigenvector e 2 0 (e # 0) corresponding to some positive eigenvalue15 112 c. This theorem of Rutman is a generalisation of the classical Perron-Frobenius theorem concerning a matrix with non-negative elements16. For the proof l7 we consider the closed cone K, = {x: x 2 E1IxI1u} (0 < E -K 4) in E and a functional f E K* such that f(u) > 0, Ilf II = 1. If x E K, then TX > Ellxll cu. We extract from K, a closed convex subset X, by imposing the additional conditions: IIx (I < 1, f(x) > ef (0). It is non-empty: if x E K, and llxll = 1 then x E X, (in particular, u E X,). If x E X, then llxll 3 &f(u) and consequently TX # 0 for x E X,. The formula TX + 2eIITxllu FEx = IITx + 2.~1)TxIlull
defines a continuous mapping of the set X, into a relatively compact subset of it (because of the compactness of T). By the fixed point principle there exists x, E X, such that FExE = x,, hence IIx,II = 1 and TX, = &x, - p,u, where 1, 3 c, & > 0. Using the compactness of T once again, we choose E, JO such that that sequence ( Tx,,),“=~ converges. Then (x,,)$i will also converge to some x > 0 ( (Ix II = 1) and (Q, will converge to some 1 3 c. We obtain in the limit TX = Ax. Example 2. Let us consider the problem of the existence of a non-trivial closed invariant subspace for a continuous linear operator A on an LTS E. Using the algebraic construction (see Section 1.2) with subsequent closure does not generally give the required result since the possibility that Lin{Akx}, , = E for all x E E is not excluded (however in this case E is separable). In 1984 Read constructed a counterexample in a certain non-reflexive Banach space. However, if A is a compact operator on an arbitrary Banach space, it has a non-trivial closed invariant subspace (Aronszajn-Smith, 1954).
IsThe concepts of eigenvalue and eigenvector for non-linear mappings are defined exactly as for linear mappings: Te = r2e. r6The corresponding generalisation for integral operators was obtained by Yentsch (1912). he fact that the Perron-Frobenius theorem can be obtained by the method of fixed points (in fact from Brauer theorem) was noted by P.S. Aleksandrov and Hopf (1935).
176
Chapter 2. Foundations and Methods
Does there exist such a subspace for a pair of commuting compact operators? Using the fixed point principle, V.I. Lomonosov (1973) obtained the following general result”. Theorem. Let A be a non-zero compact operator on an infinite-dimensional LCS E. Then the algebra (A)’ of all continuous operators which commute with A is (topologically) reducible, i.e. there is a non-trivial closed subspace which is invariant for all” TE (A)‘.
The following lemma, which also has an important constitutes a fundamental step of the proof.
independent
significance,
Lemma. Let 52 be an irreducible subalgebra of the algebra of continuous operators on an LCS E which contains a compact operator A # 0. Then there is a compact operator B E 9l which has a fixed point x # 0. Proof. For any y E E, y # 0, the subspace L, = {z: z = Ty (T E ‘%)}is different from the zero subspace, invariant for all S E I and, consequently, dense. We choose a point x0 such that Ax, # 0 and a convex neighbourhood of zero U such -. that the set Q = Ax, + AU 1s compact and does not contain zero (clearly, Q is convex). Then for each y E Q there is an operator TYE Isuch that T,y - x,, E U. By continuity Tyv - x0 E U for all v E Q close to y. There exists therefore a finite collection {K}; c such that for each point v E Q at least one of the inclusions TV - x0 E U holds, i.e. at least one of the inequalities q( TV - x,,) < 1 (1 < i < n) holds, where 4 is the gauge of the neighbourhood U. We put
1(r) = max(1 - z, 0) 4i(v) =
wiV
- x0))
,z 4(4W
The mapping
(1 < i < n).
- x0))
n @V
=
2 i=l
4i(V)KV
(0 E
Q)
is continuous and so @Q is compact; moreover @Q c x0 + U since if TV 4 x0 + U then $i(v) = 0. The mapping F = @A has a fixed point x in x0 + U. It is fixed for the compact operator B=
~ ~i(AX)T,AE~ i=l
and x # 0 since 0 $ x0 + U. 0 I8 For Banach spaces; however the extension to LCSs does not require significant changes. 19(A)’ is called the centralizer (in general algebraic terminology) or the cmnmutant (in operator theory terminology) of the operator A. The centralism (commutant) M’ of any set M c UT(E) is defined as the intersection of all (A)’ (A E M). If !!I c f?(E) is a subalgebra then QI n %’ is the centre of the algebra 9X. In any case it contains the scalar operators A = 11 (I. E K). 201t is said to be hyperinvariant for A.
$4. Theory of Operators
111
It is suhicient to prove the theorem of Lomonsov in the complex case. If (A)’ is irreducible, by the Lemma there is a compact B E (A)’ for which L = Ker(1 - B) # 0. Since L is finite dimensional and invariant for A then Al, has an eigenvalue ,4.The eigenspace Ker(A - Al) is invariant for TE (A)’ and consequently it coincides with E, i.e. A = Al, hence A = 0 since it is compact and E is infinite dimensional. 0 Remark. The infinite-dimensionality of E is not used until the last step. Therefore if we require that A is not scalar 21, the assertion of the theorem will also be true in the finite dimensional case but of course this case allows a purely algebraic treatment: a classical theorem of Burnside asserts that any irreducible subaglebra of the algebra End E with finite dimensional E coincides with End E. Therefore if (A)’ is irreducible, then A commutes with all operators in E. But then, by Schur lemma, A is scalar. Corollary 1. Zf T is a continuous operator on an LCS22 E and the algebra (T) contains a compact operator A # 0, then T has a non-trivial closed invariant subspace. Corollary 223. Let 2I be an irreducible algebra of operators on an LCS E which contains a compact operator A # 0. Then its centraliser ’ consists of the scalar operators. Example 3 (Markov-Kakutani theorem). If E is an LCS, X c E is a compact convex set and (Ai}ieI is any commuting family of continuous affine mappings of X + X, then there exists a common fixed point x E X for all Ai. In fact, for each Ai its set of fixed points Xi is non-empty, convex, compact and invariant for all Aj. By induction these properties carry over to finite intersections the intersection of all the Xi is non-empty. xil n . * * n Xin. Consequently One of the applications of the Markov-Kakutani theorem is concerned with invariant means. Let us define this important concept. Let S be any semigroup (to be specific we take it to be multiplicative). Any non-negative linear functional m on the space B(S) which is invariant with respect to left shijts (m[dJ = m[f, where bt(s) = &ts)) and normalised in the sense that m[l] = 1 is called a left-invariant mean on S. Clearly, the value of ml-41 is contained between inf 4 and sup 4 from which it follows that the functional m is continuous (llrnll = 1)24. If a left-invariant mean exists” on S then S is said to be left-amenable. If S is a group and m is a left-invariant mean, by putting 4*(s) = 4(s-l) and m*[b] = m[r5*] we obtain a right-invariant mean. Thus for groups (and clearly also for Abelian semigroups) we can speak simply of amenaIn the real case we have to require in addition that A is not a root of any quadratic equation with real coefficients and negative discriminant. 22dimE>10verCand>20verR. his is a generalisation of Schur lemma to the intinite-dimensional case. 24Conversely ifp E B(S)* and satisfies the conditions y(l) = 1,llp)l < 1, then p 2 0. In fact if cpE B(s), llrpll = 1 and cp > 0, then 111- cplj d 1, hence ~(1 - cp) < 1, i.e. p((p) > 0. *’ Moreover it need not be unique.
178
Chapter 2. Foundations and Methods
bility. Invariant means were brought into use by von Neumann (1929) who established the amenability of abelian groups and gave a simple example of a non-amenable group: the free group with two generators. Theorem.
Any Abelian semigroup is amenable.
Proof. In the space B(S)* the set A = {f:f(l)
= 1, U-11 < l> = {f:f(l)
= l,f>
O>
is w*-compact by the Banach-Alaoglu theorem, convex and invariant with respect to the mappings which are the conjugates of shifts: (A,f)q5 =f(&. Clearly all the A, are affine and w*-continuous on A and At1Af2 = A,,,l, i.e. the family {A,} commutes since the semigroup is abelian. By the Markov-Kakutani theorem there is a common fixed point m for all the A,. This is also an invariant mean. 0 An invariant mean on the additive semigroup of natural numbers N is called a Banach limit. This is a non-negative linear functional on the space of bounded sequences which is equal to 1 at (1, 1, . . . ) and invariant with respect to the shift (a,) + (a,,,). Its value at a convergent sequence is the same as the limit. A Banach limit provides a regular method for the summation of series with bounded partial sums. 4.3. Actions and Representations of Semigroups. Let S be a topological semigroupz6 and X a Hausdorff topological space. An action of S on X (or in X) is a family of continuous mappings T(s): X + X (s E S) such that T(st) = T(s)T(t) and the orbit T(s)x is continuous in s for each fixed x E X. If there is an identity e in S then T(e) is idempotent. If T(e) = 1, the action is said to be monoida12’. This condition is usually assumed to be satisfied if S is a group2* and consequently in this case all the T(s) are bijective. The actions of a semigroup S on a space X are also called (topological) dynamical systems on X; in this context X is called the phase space, the points x E X are the states of the system and the variable s E S is time. The following is the classical situation: S = R (pow), S = R, (semiflow2’), S = N (iterations of a mapping F: X + X). One of the central problems of topological dynamics is the existence of an invariant measure, i.e. a measure p on X such that
for all 4 E L1 (X, p).
sX
WW
& =
sX
4(x) dp
Z6i.e. a semigroup provided with a HausdorlI topology with respect to which multiplication is continuous. * semigroup with an identity is sometimes called a monoid. 281n a topological group not only multiplication but also inversion s I+ s-l is continuous (by definition). * or the sake of brevity we sometimes also call them flows.
6 4. Theory
of Operators
179
Bogolyubov-Krylov Theorem 30. If the phase space X of a dynamical system is compact and the time semigroup S is left-amenable (in particular, Abelian), there exists an invariant measure p on X with ,u(X) = 1. Proof. Let m be a left-invariant mean on S. We take any point x0 E X and consider the linear functional f (4) = m[q5(T(*)xo)] on C(X). Since it is nonnegative, by the theorem of Riesz it can be represented in the form jx 4 d,u. This measure p has the required properties. 0
Any semigroup S acts on itself (X = S) by left shifts: T(s)x = sx. Therefore if S is a compact abelian semigroup there is a measure on it which is invariant with respect to shifts. For groups a considerably more general result holds. Theorem of Haar.
On any locally compact group there is a left inoariant measure
(left Haar measure). In the same way there also exists a right Haar measure. Each of them is unique up to a positive multiple (von Neumann, 1936) but a left Haar measure need not be a right Haar measure. Groups for which left and right Haar measures are the same are called unimodular. In particular, all compact and all discrete groups are of this type. Haar measure is a very valuable tool for constructing the theory of representations of locally compact groups, in particular for the construction of general harmonic analysis. By a representation of a topological semigroup S in an LTS E we mean a linear action T on E, i.e. a strongly continuous homomorphism T: S -+ f?(E). Thus all the T(s) are continuous linear operators on E and T(s)x is a continuous function of s for each x E E; T(s1s2) = T(s,)T(s,). If T(s) depends continuously on s in the uniform topology then the representation T is said to be uniformly continuous.
A representation is said to be bounded if all its orbits are bounded. For a Banach space E this is equivalent to boundedness in norm: sups 11T(s)11< co. If lIT(s < 1 (s E S) the representation is said to be contracting and if all the T(s) are isometries it is called isometric. Any contracting representation of a group is isometric. Any bounded representation T of a semigroup S in a Banach space is contracting in the equivalent norm
IM = max IM sup s IITWII > . ( A representation of a group G in a Hilbert operators T(g) (g E G) are unitary.
space is said to be unitary if all the
Theorem. Let S be a right-amenable topological representation of it in a Hilbert space E. Zf 3o In a somewhat
generalised
form.
semigroup and T a bounded
180
Chapter 2. Foundations and Methods
infs IIW-4
2 ~llxll
(a = constant > 0),
then there exists on E an equivalent Hilbert norm with respect to which T is unitary. Proof. We consider on S the bounded function 4Xx,Y(s) = (T(s)x, T(s)y) (x, y E E) and put (x, y) = m[&J. Then (T(s)x, T(s)y) = (x, y) (s E S) and a2 llx112 < 6, x> G b2 11412(B = sups IIWI). In many situations this theorem allows us to restrict attention to unitary representations. The following are classical examples: 1) finite-dimensional representations of finite groups, whose theory was developed at the end of the 19th and the beginning of the 20th centuries (Frobenius, Molin, Bumside, Schur); 2) representations of compact groups in Hilbert space (the theory of unitary representations of compact groups was essentially constructed in the work of Peter and H. Weyl, 1927). If T is a finite-dimensional representation of a semigroup S, the function x(s) = tr T(s) (s E S) is called its character. This term reflects the fundamental classical result that, for example, unitary representations with identical characters are indistinguishable from the geometric point of view; more precisely, they are equivalent in the sense defined below. Let T,, T2 be representations of a topological semigroup S in LTSs E,, E, respectively. Any continuous homomorphism V: E, + E, which satisfies the relation T,(s)V = VT,(s) (s E S) is called an intertwining operator for the pair of representations T,, T2. For example, V = 0 is such an operator. If among the intertwining operators there is a topological isomorphism U (so that T,(s) = UT,(s)Uwe say that the representations T,, T, are equivalent. If the space is a Hilbert space and U is an isometry we use the term unitary equivalence. In a series of situations generalisations of the concept of trace allow us to consider characters of infinite-dimensional representations and to apply them for classification purposes. On the other hand, the characters of one-dimensional representations are very important and interesting (especially in the Abelian case). They are the continuous homomorphisms of a semigroup S into the multiplicative group of the field C and are called (one-dimensional) characters of the semigroup S. For example, the exponentials are the characters of the additive semigroup R, (the character which is identically zero is usually excluded from consideration). Now let G be a locally compact group and p a right Haar measure. The group G acts by right shifts on the Hilbert space L2(G, cl). This unitary representation R of the group G is said to be regular” and is the starting point of harmonic analysis on a group. If G = Iw or G = T (the unit circle) then p is Lebesgue measure and harmonic analysis on G is Fourier analysis. We now pass to a
31This term is retained for representations by means of shifts in any function spaces on a group which are invariant with respect to shifts.
$4. Theory
of Operators
181
consideration of the basic aspects of the spectral theory of operators with which harmonic analysis has a deep connection. 4.4. The Spectrum and Resolvent of a Linear Operator. Many applications require that spectral theory covers linear operators which are not necessarily defined on the whole of the underlying LTS E and are not necessarily continuous. The domain of definition D(A) of an operator A on E can be any subspace of E and in general it is neither closed nor dense, i.e. A is simply a homomorphism of D(A) + E. Such a generality has significant implications for the algebra of linear operators. For example D(A + B) = D(A) n D(B) and it can happen that D(A + B) = 0 for dense D(A), D(B). We note here that D(U) = B-ID(A), the B-preimage of the subspace D(A). In accordance with the usual formal rules, just one change in the domain of definition changes the operator. If D(A) c D(J) and A aA, = A then A scalled an extension of the operator A and we may write A c A (in this connection the operator A is said to be a restriction or part of the operator A . Let A, B be operators such that BA c AB and B is defined on the whole space E. Then we say that they commute even if BA # AB, since if BA = AB and D(AB) = E then D(A) = E, a restriction which is unacceptable for many applications. Example. Suppose that D(A) = D(B) = E and A, B commute (AB = BA). If A is injective, the operator A-’ (D (A-’ ) - Im A) commutes with B in the sense defined above. The theory of operators is concerned in the first place with their properties which are invariant with respect to automorhisms of the underlying space. From this point of view the operator UAW’ (U is a topological automorphism) is just as suitable as A; in this connection, just as for representations, we say that they are equivalent (the term similar operators is also used). In the finite-dimensional case the problem of classifying linear operators up to similarity reduces to the theory of elementary divisors. In the infinite-dimensional case the problem has on the whole a transcendental nature. However knowledge of certain, if not all, invariants already allows the possibility of penetrating deeply into the internal structure of an operator. The spectrum is the most important invariant of this sort. As the ground field in spectral theory it is expedient both from the internal point of view and in connection with applications to settle on the field C, which has a unique combination of properties: algebraic closure, local compactness, connectedness. A complex number I is said to be a regular point of a linear operator A on an LTS E if A - 11 maps D(A) bijectively onto E and the resoluent R, = R,(A) = (A - II)-’ is a continuous operator on E + D(A) (in this connection it is convenient to regard R,(A) as an operator on E + E by using the inclusion of D(A) in E; in any event Im R,(A) = D(A)). The set reg A of regular points of an operator A is called the resoluent set and its complement spec A is the spectrum.
182
Chapter 2. Foundations and Methods
The spectrum of an operator contains the set of all eigenvalues and sometimes reduces to this, for example, if A: E + E is a continuous algebraic32 operator. Example. Let P be a projection different from 0, 1. Then the set of its eigenvalues is (0, l} and if P is continuous spec P = (0, l} (otherwise, spec P = C). An eigenvalue 1 is said to be normal if corresponding to it there exists a linitedimensional root subspace with an invariant topological complement L and A # spec(A IL). For any linear operator A and all 3, E @we put A1 E A,(A) = Im(A - Al). The eigenvalues il are characterised by the fact that the homomorphism (A - 11): D(A) + A,(A) is not invertible. For all other A the inverse homomorphism is defined. As before we can call it the resoloent of the operator A, denote it by R, = R,(A) and regard it as an operator on E with domain of definition A,(A); Im R,(A) = D(A). The eigenvalues of an operator A are contained in the approximate spectrum spec,A which is defined in the following way: I E spec,A if there is a neighbourhood of zero V such that for any neighbourhood of zero U a vector x $ V can be found for which Ax - kx E U. The set spec,A = spec A\spec,A is called the residual spectrum. The union rzg A = reg A u spec,A is characterised by the fact that the resolvent R,: A1 + E is a continuous operator (but AI # E for I E spec,A in contrast to the situation for regular points). Points 1 E rzg A (i.e. L $ spec,A) are said to be quasiregular. If an operator A on a complete LTS E is closed33, the subspace A,(A) is closed for all 1 E rzg A. In this case the factor space E/A, is called the defect space and its topological dimension n, is called the defect number of the operator A at the point A. The approximate spectrum of an operator A on a normed space E is the set of those ;I E C for each of which there exists a quasieigensequence of vectors (x.)p :
l&n IIX ) > 0. lim (Ax, - kc,) = 0, n+co n-+4, We can assume moreover that llxnll = 1 (n = 1,2, 3, . . . ). The quasiregular points 1 can be characterised by an inequality of the form
IlAx - WI 2 c~II4l
(x E D(A), cI. > 0).
Thesetr~gAisthereforeopen(ifI~--l
and D.P.
Theorem. Let A be a closed operator on a Banach space E. Then the defect number n, is constant in a sufficiently small neighbourhood of any quasiregular point. 32The requirement of continuity is essential; the definition of the spectrum in an LT.5 takes topology into account. 33This is guaranteed if reg A # (21.
183
5 4. Theory of Operators
For the proof it is sufficient to note that if y = Ax - J.x = (Ax - px) + (A - ,u)&y and (IyJI = 1 then d(y, A,u) < I,u - Ilc;‘. Therefore we have the bound for the gap 1 1 > <8(A,, A,,) 6 1~ - L( max L 2 cl l - 1p - A1 if 1~ - Lj < *cl. But then np = dimt(E/A,)
= dimt(E/A,)
= n,. 0
Corollary 1. The defect number n, is constant on each connected component of the set rzg A.
In particular, if rzg A is connected, then either rzg A = reg A or rzg A = spec, A. Corollary
2. The resolvent set reg A and the residual spectrum spec,A are open.
In fact, these sets are separated out from rrg A by the respective conditions n, = 0 and n, # 0. Corollary
3. The spectrum of an operator A is closed.
However it can turn out to be empty. Example. For the differentiation operator on L O, 1) the natural (maximal) domain of definition consists of all absolutely continuous functions with derivative in L*(O, 1). Let us restrict the domain of definition by imposing the condition 4(O) = 0. We obtain a closed operator with no spectrum since its resolvent
’e ll/(t)dt s0 is a bounded operator on the whole of L*(O, 1) for ah I E @. We note further that any non-empty closed set S c @ is the spectrum of some closed linear operator. For this we consider the Hilbert space L*(S, ,u), where p is a measure on S which is positive on all open subsets X of S. Let us denote by A the operator obtained by multiplying by the independent variable [: (A&(c) = &(C). Its domain of definition consists of those q4E L*(S, p) for which c+(l) belongs to the same L*. The operator A is closed and if S is compact (and only in this case) then A is even bounded. It is easy to see that spec A = spec,A = S. The points with positive measure are the eigenvalues in this example. For any operator A the resolvent R, is an analytic function of 1 on reg A since the power series (R,+)(s) =
converges in the uniform is34 R,.
2 R;+ L - ,u)” n=O operator topology for II - ~1 < IIRpll-’ and its sum I’
34This argument shows once again that reg A is open. Analyticity of the resolvent is the basis for the application of the methods of complex analysis in spectral theory.
184
Chapter 2. Foundations and Methods
Suppose that the operator A is bounded and D(A) = E. The Laurent series (23) converges in the domain )A1> l/All under the uniform operator topology and its sum is R,. Thus spec A lies in the disk 111< 11All. We see that the spectrum of a bounded operator is compti.,On the other hand spec A # 0 since otherwise R, is an entire function which tends to zero as 111+ co and so, by Liouville theorem, Rt F 0 which is impossible for an inverse operator. The smallest p = p(A) > 0 for which spec A lies in the disk 111< p is called the spectral radius of the operator A. Clearly p(A) < /[All but we also have GelIfand formula3 5 p(A) = lim l IjA = inf ~IIA I. n+co n The existence of the limit and its equality to the inlimum is guaranteed by the theorem of Fekete that if a sequence (01,) of real numbers is subadditive, i.e. a ,,, < a, + a,,,, then 5 tends to the infimum of the sequence as n + co. Further, n the outer radius of convergence r of the series (23) is equal to the stated limit and r > p(A) since the domain of convergence of the series is contained in reg A. On the other hand, r < p(A) since the resolvent is analytic on the domain IA) > p(A). If p(A) = 0, i.e. spec A = (O}, the operator A is said to be quasinilpotent. In particular, any bounded nilpotent operator is of this type, however the class of quasinilpotent operators is more extensive than this. Example.
On Lp(O, 1) (1 < p < co) the integration W(s)
is not nilpotent
but it is quasinilpotent.
1 (W)(s) = (n _ l)!
s
=
operator
’ 4(t) dt s0 In fact
; (s - O?(t)
dt
(n = 1,2, 3, . ..).
hence
IlZ = 0l 0; . Consequently, p(Z) = 0 by Gel and formula. For any bounded operator A defined on the whole space the circle 111= p(A) contains points of the approximate spectrum. Let A be an isometric operator defined on a closed subspace D(A) c E. Its image Im A is clearly also closed. Since 35Hence it follows that the functional calculus for the operator A operates on the disk )I[ < p(A).
8 4. Theory
of Operators
185
all points L with 111 # 1 are quasiregular. The defect number n, is constant in the disk IA) c 1 and in its exterior 111> 1. Its value on the disk is equal to n, = dimt(E/Im A). We denote its value on the exterior by n,. It follows from the formula A,(A) = A,-,(A-‘) (111 # 1) that no0 = dimt(E/D(A)) so that n, = O-SD(A) = E (clearly n, = OoIm A = E). The pair (n,, n,) is called the deficiency index of the isometric operator. If there exist quasiregular points in the unit disk, then no0 = n,. In order that an isometric operator map the whole space onto itself (in a Hilbert space this means that it is unitary) it is necessary and suflicient that it have deficiency index (0,O). The approximate spectrum of any isometric operator is unitary, i.e. it lies on the unit circle. If the operator is defined on the whole space then its spectrum lies in the disk 111< 1, the circle 111= 1 contains the points of the approximate spectrum but it can also contain quasiregular points; then they are regular and n, = 0. If n, # 0, the spectrum is the disk 111 < 1 and the approximate spectrum is the circle In( = 1. The spectrum of an operator which maps the whole space isometrically onto itself lies on the unit circle. To conclude this section let us consider the structure of the spectrum of a compact operator. This is the next in complexity after the algebraic case. All linear operators on a finite-dimensional LT.3 are compact and algebraic; the spectrum of any of these coincides with its set of eigenvalues. Theorem. The spectrum of a compact operator A on an infinite-dimensional LTS E consists of zero and a no more than countable set of eigenvalues diflerent from zero. The unique limit point of this set, zf it is infinite, is zero.
Proof. Each point 1 E spec A\(O) is an eigenvalue by the Fredholm alternative. The point A = 0 belongs to the spectrum since otherwise the operator A-IA = 1 is compact, which is impossible for an infinite-dimensional E. Let V be a balanced neighbourhood of zero whose image is relatively compact. Let us suppose that there exists a set (A,} r of eigenvalues such that I&l 2 E > 0; let e, (n = 1,2, . . .) be corresponding eigenvectors. Now consider L, = Lin{e,}l; clearly dim L, = n. We can choose in L, n 0 a vector x, 4 L,-, + V. Since Ax, - &x. E L,-, and Ax, E L,-l (n > m) we have Ax, - Ax, 4 EV, which is a contradiction. 0 Let us enumerate the eigenvalues which are different from zero (if they exist): . We denote their orders by vl, vl,. . . , and their maximal root subspaces &,L... by WI, W,, . . . . They are all finite-dimensional and have invariant complements Vi = Im(A - A1l)yl, . . . . Moreover spec(Alw,) = {A,} and spec(Aj,J = spec A\{&}. Hence it follows that on a Banach space E the resolvent R,(A) has a pole of order v, at the point & and has no other singularities apart from the points 1, and zero, i.e. it is meromorphic on @\ (0).
Chapter 2. Foundations and Methods
186
If a compact operator has no eigenvalues different from zero (i.e. it is quasinilpotent in the Banach situation), it is said to be a volterra operator. The following example explains the origin of this term. Example.
In Lp(O, 1) (1 < p =Gco) Volterra
integral operator
(0 < s < 1) ’ R(s, W(t) dt s0 with continuous kernel R(s, t) is a Volterra operator. In particular the integration operator I is a Volterra operator. Let A be a non-negative compact operator on a Banach space E with closed cone K. Let us suppose that the cone K is total, i.e. Lin = E. If the spectral radius p = p(A) is positive, then p is an eigenvalue for A and A* and corresponding to it there are an eigenvector x E K and an eigenfunctional f~ K* with f(x) > 0 (M.A. Rutman, 1938). The condition p(A) > 0 is satisfied if there exists a vector u > 0 (u # 0) such that Au 2 cu for some c > 0 (then p(A) > c). (A&(s) =
Let us consider the integral operator 4(s) (O<s 0 and q(s,) > so then p(A) > 0. In fact A > 0 and for u(t) we can take the function which is equal to E - 1t - soI for 1t - so 1< E and equal to zero outside of this interval (where E > 0 is sufficiently small). Example.
4.5. One-Parameter Semigroups. The representations T(t) of the additive semigroup Iw, of non-negative numbers with T(0) = 1 are designated by the term in the title. The representations of the additive group [ware called one-parameter groups. Each bounded linear operator A on a Banach space E generates a oneparameter group
co A?” T(t) = eAt = 1 I n.
(t E R).
(24)
Moreover ;[T(t)x] i.e. the vector-function
= AT(t)x = T(t)Ax
(x E E, t E W),
u(t) = T(t)x is the solution of the Cauchy problem
G(t) = Au(t)
(--cO-ct
u(0) = x
(the solution can easily be shown to be unique). The operator A is infinitesimal for the representation T in the sense that A = T 0). Because the operator A is
9 4. Theory of Operators
187
bounded, the representation T is continuous not only in the strong topology but also in the uniform topology: lim II T(t + z) - T(t)11 = 0
(t E R).
r-10
Conversely, if a representation T(t) (t E R,) is uniformly continuous, then T(t) = eAr, where A is some bounded operator. This follows from the general theory which is presented below. Let T(t) be any one-parameter semigroup. Since T(t + s) = T(t)T(s), the function a(t) = In ((T(t)(( is subadditive, a(t + s) < a(t) + a(s), and, by the continuous variant of Fekete theorem, the limit
~ = lim ln IITWII -
T(n) =-sm
T(t)e-
dt
0
is therefore defined on the half-plane II, = {A: re 1 > g}. Since 1)T(t)(I G MeC’ for some c > e, M > 0, we have
The values of the operator function ?(A) commute pairwise and satisfy the Hilbert identity
F(a)- T;(p)= (a - p)T(a)qp).
I
The subspace L = Im F(A) therefore does not depend on 1. The operator f(A) maps E bijectively onto L (injectivity follows from the uniqueness theorem for the Laplace transform, since, if ?(A)x = 0, then f(,u)x = 0 for all p E II,). Let us put A(1) = 11 + [?$)IThis does not depend on 1 and defines a closed operator A (D(A) = L) with R,(A) = F(A). The subspace L is dense since [T(A)]*
= -SW T*(t)e-
dt
0
and, analogous to the previous situation, Ker@(A)]* = 0. We note that, since R,(A) commutes with all T(s), the operator A also has this property. Let us consider the Cauchy problem ti(t)
=
Au(t)
(t
> O),
u(0) = x E D(A).
IfpEII,andy=Ax-px,then x = R,y = -
m (T(s)y)es0
ds,
(25)
Chapter 2. Foundations and Methods
188
and hence m T(t)x = Consequently &(t)x]
i.e.
T(t)x
(T(t + s)y)e-
s0 the function
T(t)x
ds = -ee”
has a continuous
= ,uT(t)x + T(t)y = T(t)Ax
is a solution of the Cauchy problem
m (T(s)y)eC sf
ds.
derivative for = AT(t)x
t
2 0 and
(t 2 01,
(25). In particular,
putting
t
= 0
we obtain Ax = 1 [T(t)x]I1=o. The Cauchy problem (25) can be considered36 for any operator A. In the situation described above it does not have solutions other than T(t)x, i.e. the corresponding homogeneous problem C(t) = Au(t) (t > 0), u(0) = 0 has only the trivial solution. This is the case (Yu.1. Lyubich, 1960) even under the much weaker restriction of growth of the resolvent on the ray il > 1, > 0:
-lim lnll&II --coo a++m 1 (this cannot be relaxed further). In fact s R,u(O) = u(t)edt + e-hR,u(s) (s > 01, f0 from which it is obvious that R,u(O) is the local Laplace transform for [-u(t)]. If u(O) = 0 then u(t) = 0 by the uniqueness theorem for the l.L.t. The subspace I(A) c D(A) of those x for which the Cauchy problem has a solution (independently of uniqueness37) is called the initial subspace. It can fail to be dense (and can actually be the zero subspace) even in the case where D(A) is dense. But if a bound of the form IIRlll = O(lAl is satisfied in some halfplane II, contained in reg A and D(A) is dense, then I(A) is dense (Yu.1. Lyubich, 1964). If x E I(A) and u(t) is a corresponding solution, then clearly its values belong to I(A) for all t (in this sense I(A) is invariant). If the Cauchy problem is determinate, the evolutionary operator U(t) transforming u(O) into u(t) (t > 0) is defined on Z(A). Clearly U(0) = 1 and U(t + s) = U(t) U(s). If U(t) is bounded for each3* t > 0, we have a representation of the semigroup R, in the subspace I(A) (in general it is not closed). However if sup04tSE II U(t)11 c co for some E > 0,
361t is implicit that the solution is continuously differentiable and its values belong to D(A) for all t 2 0 (in a relaxed formulation - for t > 0). 37A Cauchy problem for which the solution is unique is said to be determinate (this property does not depend on x). 3sIn this case the Cauchy problem is said to be pointwise stable on I(A) (simply pointwise stable if I(A) is dense).
189
$4. Theory of Operators
then3’ the extension by continuity of U(t) onto Z(A) preserves continuity of the trajectories and we obtain a representation of the semigroup R, in the closed subspace Z(A). Its infinitesimal operator is the closure of the operator A (S.G. Krejn - P.E. Sobolevskij, 1958). The infinitesimal operator of a one-parameter semigroup is also said to be generating for this semigroup or to be a generator of it since, clearly, the semigroup with a given inhnitesmial operator is unique. The question arises of describing the class of generators of all possible one-parameter semigroups. Miyadera-Feller-Phillips Theorem. In order that a densely defined linear operutor A on a Bunuch space be the infinitesimal operator of a one-parameter semigroup, it is necessary and suflicient that reg A contain a half-plane ll, in which the inequality M llR:ll G (re k _ c)”
(n=0,1,2,...;
M a positive constant)
is satisfied. Necessity.
This follows from the bound II T(t)11 < Me” and the formula 1 d , (-1)” a3 RI! = (n _ I)! dl ’ = (n - l)! 0 T(t)t s
- dt.
Sufficiency. Under the given conditions the Cauchy problem (25) is determinate and the initial subspace Z(A) is dense. Since s u(t)e-
we have by Widder u(t) = lim n+m
dt = R,(u(t)e-
s0 inversion formula (0 < (- LRJ lu(0)
+ I
-
- u(O)), t
< s):
kzo ((n
shlik
.
Ri+ (s) A=ll/t
Hence it follows that Ilu(t < Me” llu(O)ll, i.e. for the evolutionary operator: II U(t)11 < Meet. Since A is closed under the conditions of the theorem, it is the infinitesimal operator of the representation U. 0 An immediate consequence of this result (taking account of the connection between the bounds for the representation and the resolvent) is the Hille-Yosida Theorem. In order that a densely defined linear operator A on a Banach space be the infinitesimal operator of a one-parameter semigroup of contractions, it is necessary and suflicient that the right half-plane II, be contained in reg A and that the inequality
391n this case the Cauchy problem is said to be stable on I(A) (simply stable if I(A) is dense).
190
Chapter 2. Foundations and Methods
be satisfied in II,.
The operator A in this case is said to be dissipative. Example. The semigroup R, acts by means of shifts on LP(R+): (T(t)&(s) = 4(s + t) (s, t 2 0). Clearly II T(t)11 < 1. The infinitesimal operator of this regular representation
d ds
is equal to -. It is defined on the absolutely continuous functions
which, together with the derivative, belong to Lp. This is a dissipative operator. The theory of one-parameter groups is reduced to the previous case by considering representations of the semigroups IR, . In particular, the infinitesimal operators of one-parameter groups of isometries can be characterised (I.M. Gel and, 1939) as those operators whose spectrum lies on the imaginary axis and for which the bound 1
IIWI G Ire
(re 1 # 0)
is satisfied. These operators are said to be conservative. Example. infinitesimal
A regular representation
of the group R! in Lp(R) is isometric. Its d operator - can be described just as in the case of R, but now it is ds
actually conservative. Conservative and dissipative operators have many remarkable properties, in particular the spectral radius of a bounded conservative operator A on a Banach space E is equal to its norm (V.Eh. Katsnel on, 1970). In fact, the function f(eA ) (x E E,fe E*) is an entire function of exponential type
= Idd,(f( eAtx N I<. ~(4
f If(eAW sup
G PCW~I
- Ilxll.
Putting t = 0 we obtain If(Ax)I < p(A)IlfII * (IxIJ and consequently llAl/ < p(A). Incidentally, Bernshtein inequality itself can be interpreted as the coincidence of the spectral radius with the norm of the differentiation operator on the space B, of entire functions of exponential type < 0 which are bounded on the real axis (it is remarkable that the differentiation operator on B, is bounded). A general mechanism underlying inequalities of Bernshtein type and results pertaining to it in spectral theory was brought to light by E.A. Gorin (1980). It was based on the functional calculus associated with integral representations of positive functions. From the point of view of the Cauchy problem there is great interest in the study of asymptotic (as t + co) properties of one-parameter semigroups. For example, boundedness (i.e. supt /IT(t)11 < co) is equivalent to Lyapunov stability
4 4. Theory of Operators
191
for the corresponding Cauchy problem. For bounded one-parameter semigroups on a reflexive Banach space we have the ergodic theorem: the strong limit T(t) dt
exists. It is a bounded projection (for semigroups of contractions it is an orthoprojection) on the subspace of lixed points {x: T(t)x = x (t 2 0)} = Ker A (A is the infinitesimal operator). This theorem owes its origin to the analysis of the foundations of statistical mechanics, which goes back to Koopman and von Neumann40. There is an important class of representations for which the ergodic theorem is true without restriction on the space - almost-periodic representations. A representation T is said to be almost-periodic (a.~.) if all of its orbits are relatively compact41. Example 1. In the space AP(R) of almost-periodic functions on R the representation of the group Iw by means of shifts (the regular representation) is a.p. The definition of an a.p.f. carried over without change to functions on the semiaxis R, . We have correspondingly a regular a.p. representation of the semigroup R, in AP(R+). The general form of an a.p.f. on the semiaxis is 4 = do + h, where boo(t) tends to zero as t + co, and 4r can be extended to the whole axis as an a.p.f. (Frechet, 1941). Example 2. Let A be a compact operator on a Banach space E which does not contain any subspaces isomorphic to the space c of convergent sequences (5,); (in particular, E may be reflexive). Then, if the representation e (t E R) is bounded, it is a.p. In fact, for the function u(t) = e% (x E E) the derivative G(t) = Au(t) is a.p. because of the compactness of A and the function itself is bounded. Consequently u(t) is a.p. (here we are using a generalisation due to M.I. Kadets (1969) of the Bohl-Bohr theorem on the integral of an a.p.f.). By restricting an a.p. representation of the group R to the semigroup R, we obtain, clearly, an a.p. representation but it is such that all the operators T(t) (t E W,) are invertible and their inverses are bounded (in this sense it is a group representation of the semigroup R,). There is a different kind of a.p. representation of the semigroup R, , all the orbits of which tend to zero as t + co. Further, if the space E can be decomposed into the topological direct sum of the spaces 4oThis author established (1932) the ergodic theorem in Hilbert space. The generalisation given here is due to Larch. Usually formulations of ergodic theorems are given for semigroups of powers (I )?. The ontinuous’ case considered in the text is easily reduced to the iscrete’ case (such a reduction occurs quite often). Further, as a rule, we restrict ourselves to only one of these cases. 41 In exactly the same way a.p. is defined for bounded representations of any topological semigroups in any LTS. The property of boundedness is not required a priori in a barrelled LCS since it follows from the relative compactness of the orbits by the Banach-Steinhaus theorem. In general we can define a.p.fs on any topological semigroup; they can be not only scalar-valued but also vector-valued.
192
Chapter
2. Foundations
and Methods
E,, E, in which a.p. representations T,, T, are given (for example, Tl a group representation and To having orbits tending to zero) then the natural representation Tl i To defined on E is a.p. We have the boundary spectrum splitting-off theorem (M.Yu. Lyubich - Yu.1. Lyubich, 1984): let T be an a.p. representation of the semigroup [w, in a Banach space E; then E can be decomposed into the topological direct sum of invariant subspaces E,, E, such that Tl = TIE1 is a group representation and the orbits of T, = TIE0 tend to zero. The subspaces E,, E, are called the boundary and interior subspaces respectively and the projection P associated with them (Im P = E,, Ker P = E,) is called the boundary projection. It arises in the following way. Corresponding to an a.p. representation T there is a Bohr compact set - the strong closure of the family (semigroup) of operators { T(t)},,o. This is an Abelian semigroup which is compact in the strong topology. In any compact semigroup there is a smallest two-sided ideal - the Sushkeuich kernel (A.K. Sushkevich, 192843; Numakura, 1952). For an Abelian semigroup (and in certain other cases) the Sushkevich kernel is a compact group. The boundary projection for an a.p. representation is the identity of the Sushkevich kernel of the Bohr compact set. Hence it is clear, incidentally, that [[PII < sup 1)T(t)11 and in particular, if T is a contraction then P is an orthoprojection. In the latter case the representation Tl is an isometry. The theory which has been described carries over to a.p. representations of any Abelian (and certain non-Abelian) topological semigroups, in particular, to semigroups of powers of an a.p. operator 44. A prototype of the general theory of a.p. representations, the boundary spectrum splitting-off theorem for bounded semigroups of operators on a reflexive Banach space, was constructed by Jacobs (1957), after which De Leeuw and Glicksberg (1961) obtained corresponding results in the weak topology for any Banach space. In the weak a.p. case the orbits for T, do not tend to zero in general, even weakly, and they only have zero as an adherent point. Furthermore To can turn out to be a group representation (for example, (T(t)f)(s) = eifsf(s) (s, t 3 0) in L’(R+)). 4.6. Conjugation and Closure. Let A be a linear operator on an LTS E and suppose that its domain of definition D(A) is dense. Then the conjugate operator A* on E* can be defined by means of the relation (A*f)(x) =f(Ax) for those f~ E* for which the right hand side of the identity is continuous on D(A) and, consequently, can be uniquely extended to a continuous linear functional on E. If E is a Hilbert space the definition of the conjugate operator is usually modified on the basis of Riesz theorem so that A* acts on the space E itself: (Ax, Y) = (x9 A*Y)
(x E W),
Y E WA*)).
Therefore (aA)* = ZA*. In this case we call A* the adjoint of A.
43The fundamental work of A.K. Sushkevich is concerned with finite semigroups. 441n particular, any operator of the form A = V + V, where V is compact, 11Y” 11< 1 for some m and sup,,, l[A” 11< co is of this type (Yosida - Kakutani, 1941).
5 4. Theory
of Operators
193
Theorem. Zf a densely defined operator A on an LTS E has continuous inverse A-‘: E --, D(A) then A* has inverse (A*)-‘: E* + D(A*) and (A*)-’ = (A-‘)*. Thus, for example, (A*)-’ is continuous under the topology of uniform convergence on bounded sets and also in the w*-topology. Corollary. Zf A is a densely defined operator on a Banach space, then reg A* = reg A and consequently spec A* = spec A. In the Hilbert space context reg A* = reg A and spec A* = spec A (the bar denotes complex conjugation). The defect space E/A, (A E spec,A) is isomorphic to the subspace Ai = N;i z Ker(A* - 11). The preceding theorem has a converse in a Banach space: if A is densely defined and closed and A* has a bounded inverse then A also has a bounded inverse (consequently, (A-‘)* = (A*)-‘). An operator A on a Hilbert space is said to be selfadjoint if A* = A. For example, any orthoprojection is of this type. If A c A* which, in the case of dense D(A), is equivalent to the identity (Ax, Y) = (x, AY)
(~3 Y E WA)), (27) then A is said to be a symmetric operator. Selfadjoint operators play a central role in quantum mechanics, where they are used to give a suitable description of physical quantities (coordinates, impulses, energy, etc.). The principal credit for constructing the spectral theory of selfadjoint operators is due to Hilbert and von Neumann. We say that an operator A on an LTS E admits closure if the closure of its graph is the graph of some operator A, the closure of the operator A. Clearly A c Aand Ais closed. In order that A admit closure, it is necessary and sufficient that a closed extension exist for it; moreover Ais the least closed extension for A. A continuous operator always admits closure; this is just its extension by continuity to D(A). The domain of definition of a closed continuous operator is therefore closed and if it is also dense then it coincides with the whole space. Conversely, any continuous operator defined on a closed subspace is closed. A conjugate operator A* is always closed (even w*-closed). In particular, in a Hilbert space E any selfadjoint operator is closed and any densely defined symmetric operator admits closure since A c A*. Moreover it is clear that x is symmetric. If a symmetric operator A is defined on the whole space E, then A is a bounded selfadjoint operator (Hellinger-Toeplitz theorem). In fact, since A c A* and D(A) = E we have A = A*. But A is then closed and consequently it is bounded by the closed graph theorem. Theorem. Zf a densely defined operator A on a Hilbert then A* is densely defined and A** = A.
space admits closure,
A closed densely defined operator on a Hilbert space which commutes with its adjoint is said to be normal. In particular all unitary operators (A*A = AA* = 1) are of this type.
194
Chapter 2. Foundations and Methods
4.7. Spectra and Extensions of Symmetric Operators. Let us consider a densely defined closed symmetric operator A on a Hilbert space E. It follows from the identity (27) that IlAx - 1x11 2 lim 4 llxll (x E D(A)), (28) from which it is obvious that all 1 with im I # 0 are quasiregular. The defect numbers n, , n- associated with the half-planes im il > 0, im il < 0 are in general unequal, however, if there is a real quasiregular point, then n, = n- . The pair (n+, n-) is called the deficiency index of the operator A. Theorem 1. In order that A be selfadjoint, it is necessary and sufficient that its deficiency index be (0, 0), i.e. that spec A c R.
Necessity. If A* = A then Nr = 0 whenever im A # 0, since all eigenvalues of a symmetric operator are real. Sufficiency. If n, = n- = 0 there exists for any 1 (im I # 0) and any x E D(A*) a y E D(A) such that Ay - ily = A*x - Ix and hence A*(x - y) = L(x - y). Consequently x - y = 0, i.e. x = y E D(A). 0 If A” is a closed symmetric extension of the operator A then A, < n, and ti- < n- . The operator A”is loser’ to selfadjointness than A since45 A c A”c A”* c A*. Is it possible to choose A o be selfadjoint, i.e. does A admit a selfadjoint extension? This important question was investigated by von Neumann (1929) with the help of the Cayley transform which is defined for an operator A and any A (im 1 # 0) by the formula V, = VA(A) = (A - ill)(A - 11)-l. The homomorphism V,: AX + An is surjective and isometric46 since if II = a + ifi we have IlAx - Ax112= IlAx - axlIz + /1211xl12 (x E D(A)) and hence 11 Ax - Ix II = IIAx - Ix 11.Moreover the subspace Im( V, - 1) = D(A) is dense. Conversely, any isometric operator V for which Im(V - 1) is dense is the Cayley transform of the densely defined closed symmetric operator A = (XV - Al) (V - 1)-l (it follows from the denseness of Im( I/ - 1) that unity is not an eigenvalue of the operator V and therefore (V - 1)-l: Im(V - 1) + D(V) is defined). By Theorem 1 the Cayley transform of a selfadjoint operator (and only of such an operator) is unitary. The question about selfadjoint extensions therefore reduces to the existence of unitary extensions of the isometric operator F ,which in turn is equivalent to the defect subspaces N,, Nr being isometric. Thus we have 45For any two linear operators A,, A, with dense domains of definition the inclusion A, c A, implies that A: 3 A:. 461mplicit in the construction of the Cayley transform is the idea of spectral mapping: the linearfractional transformation 5 I+ (I - A)(c - ;i,-’ maps the real axis onto the unit circle, therefore its operator analogue must transform symmetric operators into isometric operators.
8 4. Theory
of Operators
Theorem 2. In order that A admit a selfadjoint suflicient that its defect numbers be equal.
195
extension, it is necessary and
From what has been said it is also evident that in the case n, = n- the set of selfadjoint extensions can be parameterised by the isometries of NA onto N;i, in particular, for differential operators (I.M. Glazman, 1949). Example 1. Let us consider the operator D = f $ on L2(0, 1) which is defined
for all absolutely continuous
functions with derivative in L2. We will denote by
Do its restriction under the boundary conditions d(O) = 4(l) = 0. This is a closed symmetric operator and Dd = D. Each 1 E @ is an eigenvalue for D, the eigen-
space NA is one-dimensional and it is generated by the function cl(s) = eiL. The deficiency index of the operator D,, is therefore equal to (1, 1). The isometries of NA + Nr take the form V’,(O)e, = BepBer (/I = im 1, 8 a parameter, 181 = 1). The selfadjoint extension De of the operator D, corresponding to the isometry V,(O) is such that (De - ll)(D’
- Il)- ,
= 8eMBer.
Putting (De - zl)- , = u, we obtain D
- %I = e,,
Dev - Iv = tle-8ex,
and hence V=
e, - 8e-Bex n-X .
Set 1 = i. Then u(1) = or(O), where o = (1 - eQ(e - 0)-l is a parameter which, together with 0, runs through the unit circle. Thus the extension De of the inimal’ operator D, can be described as the restriction of the aximal’ operator D under the boundary conditions 4(l) = 04(o). Example 2. The operator D on L O, co) defined as in the previous example can be restricted to Do by imposing the condition d(O) = 0. As before, D$ = D but now the deficiency index is (0, 1) since e, E L2(0, co) only if im Iz > 0. Consequently D,, does not have selfadjoint extensions (or even symmetric extensions other than itself). Example 3. The operator D on L2( - 00, 00) defined as before is selfadjoint. The operator iD on L2( - co, co) is conservative and only operators of the form iA (A = A*) are conservative on a Hilbert space (see inequality (28)). Therefore if A is a bounded selfadjoint operator we have p(A) = l/All (which can also be
established by elementary means). From this it is easy to establish the nonemptiness of the spectrum of any selfadjoint operator B. In fact if 0 # spec B then A = B-’ is a bounded selfadjoint operator and A # 0. Therefore p(A) = IIAlI > 0. Now at least one of the numbers &p(A) is in spec A and then its reciprocal must be in spec B. We note incidentally that p(A) = 0 * A = 0 (i.e. a non-zero self-
196
Chapter
2. Foundations
and Methods
adjoint operator cannot be quasinilpotent). Even if the selfadjoint operator A is not presupposed to be bounded but spec A = (0) then A = 0. In fact, in this case R, (A E IX, L # 0) is a bounded seifadjoint operator and spec R, = ( -n-l}. Consequently 11 Rlll = IL/-’ which leads to the inequality IJAxll’ - 21(,4x, x) > 0 for ail x E D(A) and L E R\(O). But then (Ax, x) = 0 and hence A = 0. The last conclusion can be made on the basis of the polarisation formula tAxp Y) = i kio ik(A(x + iky), x +
Q ),
which is valid for any linear operator A on a Hilbert space E. Incidentally, it follows from this formula that if a Hermitiun quadratic functional (AZ, z) (z E D(A)) is bounded on the unit sphere then the operator A is also bounded and /[All < 2 SU~,,~,,=~I(Az, z)l (the converse is obvious). The functional (Ax, y) (x E D(A), y E E) is sesquilineur, i.e. linear in x and semilinear in y, i.e. it is additive and if y is replaced by ay then it is multiplied by 5. For any sesquilinear functional B(x, y) which is defined for x in some subspace L and for all y E E there exists by Riesz theorem a unique linear operator A such that /?(x, y) = (Ax, y) (x E D(A) = L, y E E). Symmetry of the operator A is equivalent to Hermitian symmetry of the functional, B(y, x) = /3(x, y), which, in turn, is equivalent to the quadratic functional /3(x, x) being real. The operator iD on L 0, 00) is dissipative and in general all operators on a Hilbert space of the form iA (A c A*, n, = 0) and their adjoints4’ are dissipative but these are by no means all the dissipative operators (for example, (-I), where I is the integration operator on L 0, 1)). The following criterion for dissipativity of a closed densely defined operator A on a Hilbert space is an easy consequence of the Hille-Yosida theorem: re(Ax, x) < 0 Remark.
Dissipative
by the requirement
(x E D(A)); operators
re(A*y, y) G 0 (y E WI*)).
on a Hilbert
space are often defined only
re(Ax, x) < 0 (which in addition
they
otate’ through
im(Ax, x) > 0). Under this the half-plane re 1 > 0 is quasiregular but we can have n, > 0. For a dissipative operator in our sense n, = 0. We note in this connection that symmetric operators with zero defect number and only such operators are maximal in the sense that they do not admit symmetric extensions (since their Cayley transforms do not admit isometric extensions). A symmetric operator A is said to be non-negative (A > 0) if (Ax, x) > 0, positive (A > 0) if(Ax, x) > 0 (x # 0) and positive definite (A >> 0) if (Ax, x) 2 c llxllz (c is a positive constant) (in all cases x E D(A)). For example, any orthoprojection is non-negative, multiplication by c > 0 is a positive delinite operator. If (e,); is an orthonormal basis and (A,); is a sequence of positive numbers which tends to zero, the corresponding diagonal operator 47Dissipativity
is preserved
under
conjugation
even in the Banach
space case.
Q4. Theory of Operators
197
Ax = f &,<,e, n=O is positive but it is not positive definite. For a non-negative operator A all points I < 0 are quasiregular and therefore n - n-. If A* = A > 0, the spectrum of the operator A is non-negative. The sieirum of a positive definite selfadjoint operator lies on the corresponding semiaxis I > c. In the real Banach space of bounded selfadjoint operators the non-negative operators form a closed solid cone (1 is an interior point) as a result of which we can work with inequalities in this space. This approach improves substantially the norm bounds. 4.8. Spectral Theory of Selfadjoint Operators. The simplest48 spectral theorem (after the algebraic case) concerns a compact selfadjoint operator A on a Hilbert space E and asserts that E coincides with the closure of the orthogonal sum of the eigenspaces E, corresponding to all possible eigenvalues4g (Il,}; (zero is included if it is an eigenvalue). This theorem is proved just as in the tinitedimensional case. With a view to generalisation it is convenient to express it in the form of a spectral resolution
where P,, is an orthoprojection
Ax = .f Iz,P,x, n=O onto E, and x = 2 P,x. n=O
A family of orthoprojections (I!,( - cc -C I < co) on a Hilbert space E is called an orthogonal resolution of the identity if 1) @n@p= Q% (2 < PL), 2) lim,, +m (EAx = x, lim,, --coC?,x = 0 (x E E), 3) the function CE,x (x E E) is continuous on the left with respect to 1 on the whole axis5’. Under these conditions for any interval A = [cr, /I] the operator (E, = (E, - C& is an orthoprojection. Any orthogonal resolution of the identity (El defines a family of measures m, on R which are generated by the non-decreasing functions (CEAx,x) (x E E). Suppose that the scalar function 4 is measurable with respect to all the m,. It is then said to be measurable with respect to EA. We define the integral 48But it does nevertheless include the Hilbert-Schmidt theorem on integral equations with Hermitian-symmetric kernels. 49To be precise we assume that the set of eigenvalues is infinite. A compact selfadjoint operator with finite spectrum is algebraic. soAn orthogonal resolution of the identity can be given on some interval, for example [0,2x], rather than on the whole axis. In this example the conditions analogous to 2) have the form E, = 0, (E,, = 1.
198
Chapter 2. Foundations and Methods
with respect to the operator-valued the subspace
measure J da, as the linear operator for which
D(@)={x:xEE,~~
--m IW12 4%
4 < ~0 1
(assumed further to be dense) serves as domain of definition (@x, Y) =
m 44 d&x, s -00
Y)
and the identity
(x7 Y E WY)
is satisfied. It can be shown that in addition @* =
m 44 6. s--CO Consequently it is necessary and suffkient for selfadjointness of the operator @ that the function 4 be real-valued almost everywhere (with respect to each measure m,). For boundedness of the operator @ it is necessary and sufficient that the function 4 be bounded in the sense that
ll4l =sy(eywIdl)<
00
(in which case 1)@11 = 11411). In general
(x E ww
IlW12 = -1 MU2 4%x, 4 s
The construction which has been described is called the functional calculus for the given orthogonal resolution of the identity a,. The correspondence 4 H @ is linear and & 42 H Qp,Qz if at least one of the functions 41, 42 is bounded. The functional calculus is an isometric isomorphism on the algebra of bounded functions into the algebra of bounded linear operators. Example. Let A4 c R be a Bore1 set and let xM be its characteristic function. Since & = xM and xM = xM, the operator
qf=
is an orthoprojection.
ODx&J s -m In particular c&=
The spectral resolution in the form
de, =
sM
6
m del = 1. s -U3
of a compact selfadjoint operator A can be expressed
$4. Theory of Operators
A=
199
co 1 da,, s -00
where (IFAX =
c P& A”<1
(x E E).
Theorem.. For any selfadjoint operator A there exists a (unique) orthogonal resolution of the identity (3, (- co < 1~ co) such that the spectral resolution A=
* IIdC& f --oo
holds. Moreover 4(A) zr
m W) s -a,
d%
for all admissible51 4.
In particular
we have the formula for the resolvent R,=
m daL ~ s -mA-P
(P E reg A).
It is easy to see from this that the spectrum of an operator A coincides with the set of points of increase of the function 52 (IF,. The eigenvalues are the points of discontinuity of the function (EA.The operator PA = %,+, - GA is an orthoprojection onto the corresponding eigenspace. The dimension of this subspace is called the multiplicity of the eigenvalue 1. The eigenvalues form the so-called discrete spectrum spec,A. The continuous spectrum spec,A is defined as the complement in spec A of the set of eigenvalues of finite multiplicity53. In terms of the resolution of the identity it can be characterised as the union of the set of non-isolated points of increase and the set of eigenvalues of infinite multiplicity. This characterisation combining these two sets is just the result that there exists a normalised quasieigensequence (x,)7 ( llxnll = 1, Ax, - Lx, --f 0) with no convergent subsequence. The intersection spec,A n spec,A can turn out to be non-empty. However the whole space E can be decomposed into the orthogonal sum of the closed linear hull E, of the eigenvectors and the closed invariant subspace E, of vectors with purely continuous spectrum, i.e. those x E E for which (@,x, x) is continuous. Then in turn E, = E, @ E,, where the summands are the spaces of singular and absolutely continuous vectors (these concepts are of course defined through the corresponding properties of the function (CE,x, x)). The extraction of the space E,, is particularly important for the theory of scattering (see the end of this section). If ” For rational rp with poles outside of spec A this definition is equivalent to the previous one. ‘*i.e. with the union of the sets of points of increase of all functions (EAx, x) (x E E). s3 SpeccA and spec,A are defined for a normal (in particular a unitary) operator in exactly the same way.
Chapter
200
2. Foundations
and Methods
E,, = E we say that A is an operator with absolutely continuous spectrum and in the general case Ale-= is called the absolutely continuous part of the operator A.
Similar terminology can be introduced for the remaining types of spectra. Of the other properties of the resolution of the identity corresponding to a selfadjoint operator A we note that the measure jd6i commutes with A, i.e. (EA commutes with A for any interval A = [a, /I]. In fact B A@, =
J. 6,
@,A = A&&w-
s a-o One of the approaches to the spectral theorem for selfadjoint through the Cayley transform V = (A - il)(A
operators is
+ il)-‘,
a unitary operator for which Im(V - 1) is dense. The point is that the spectral theorem also holds for unitary operators for which the proof can be effected by quite simple means. Theorem. For any unitary operator U there exists a (unique) orthogonal resolution of the identity (3, (0 < a < 274 with 2n
U= s0
eiO d@01.
Proof. Let us consider the group of powers (U”)$, . It is easy to verify that the Toeplitz matrix with elements (U”-“5, x) (x E E) is positive. By the Riesz-Herglotz theorem there exists a non-decreasing function a(~; x) (0 < c1< 24 which is a solution of the trigonometric moment problem 2n
eina da(a; x) (n=O, +l, +2 ,... ). (29 f0 Without loss of generality we can assume that a(a; x) is continuous on the left on (0,271) and that ~(0; x) = 0. Under these conditions a(a; x) is unique and (VX,
x)
=
2n
a(2n; x) =
da(a; x) = (x, x).
s0 Consequently a(a; x) < llxll’ for all a. Performing
polarisation
in (29), we obtain
2% (U%
Y) =
eina dz(a; x, y)
(n = 0, *l,
+2 ,... ),
s0
(30)
where z(a; x, y) = t j.
~(a; x + Py)
(x,
Y
E El.
Since there is only one possible function r in (30), it follows that $a; x, y) is a sesquilinear functional of x, y and hence r(a; x, y) = &x, y), where (5, is a
5 4. Theory
of Operators
201
bounded selfadjoint operator. The required properties verified without particular difficulty. 0
of the family (a,} are
In order to pass to the selfadjoint operator A it is necessary to express it using the Cayley transform: A = i(V + l)(V - l)-’ = i
Zn eia + 1 do = s oe --l=
co f --a,
1 d@,,
where @A= E2.Zarcco,l. There are several other proofs of the spectral theorem for a selfadjoint operator which are associated with different variants of the moment problem. For example, we can note that the resolvent R, satisfies the inequality im(R,x, x) > 0 (x # 0) in the half-plane im p > 0 and, as we already knows4, 1 IIRJI G ,im p,
(im P # 0).
We can therefore make use of the integral Nevanlinna class: (R,x, 4 =
and it turns out as before that R, =
where GA is an orthogonal
representation
of functions of the
s
m do@; x) -COTy’
m dC& -mI-q’ s
al s
resolution of the identity. Hence A = pl + R;’ =
-Co
I d(E,.
Now we have a further fruitful argument: the operator A is conservative, i.e. it generates a unitary group T(t) (t E [w);since all the functions (T(t)x, x) (x E E) are positive, Stone theorem that T(t) =
m eiAr de, s -cc
follows from Bochner theorem and hence by differentiation at t = 0 we obtain a spectral resolution of the operator A. Correspondingly T(t) = eiAr. If A is a densely defined closed symmetric operator with equal defect numbers, its resolutions of the identity are by definition generated by its selfadjoint extensions. For a non-selfadjoint operator A the resolution of the identity is not unique, therefore the question of its invariant (i.e. independent of the resolution) properties arises. 54All this follows from the formula the help of Hilbert identity.
im(R,x,
x) = (im p)
llR,,xjl z which can be easily established
with
202
Chapter 2. Foundations and Methods
Theorem. Zf the defect number n = n, = n- of the operator A is finite, then all of its selfadjoint extensions have exactly the same continuous spectrum.
The general property of invariance of the continuous spectrum under mall perturbations which in some sense or other are finite rank or even compact is evident in this result and also in the following. Theorem of Weyl. Zf A, B are selfadjoint spec,(A + B) = spec,A.
operators with B compact, then
In contrast to this the discrete spectrum is not invariant: for any selfadjoint operator A on a separable Hilbert space there exists a selfadjoint compact operator B with arbitrarily small norm such that the system of eigenvectors of the operator A + B is complete (von Neumann, 1935). Remark. Let A, B be closed operators on a Banach space. The operator B is said to be A-compact if D(B) 1 D(A) and BIDCA)is compact under the graph norm of the operator55 A (for this it is necessary and sufficient that the operator BRA(A) be compact56 for some I which is regular for A). For an A-compact operator B the set of points of the spectrum of the sum A + B that lie in any connected component G of the set reg A is at most countable (if it is countable then it condenses to 8G). All these points are normal eigenvalues. Let us consider certain classical situations from the point of view of the abstract spectral theorem. Example
1. The operator 5 1 on L2( - rr, a) with periodic boundary
condi-
tions +( - rt) = d(z) is selfadjoint. Its resolution of the identity is generated by the Fourier series expansion: CWM4
= “?A cneins.
The spectrum of the given operator consists of the eigenvalues 1, = in and the corresponding eigenfunctions are eins (n E Z). This is an example of a purely discrete spectrum (there is no continuous spectrum). Example 2. The resolution of the identity for the operator i $ on L2( - co, co)
has the form (@M(S) = &
:m &&@” dp, s where 6 is the Fourier-Plancherel transform. The spectrum of the given selfadjoint operator fills out the whole axis, however there are no eigenvalues in it since his is llxllA = llxll + llAx]l (x E D(A)). Completeness of the space D(A) under this norm is equivalent to closedngs of the operator A. 56 Hence it is clear that if B is compact then it is A-compact for any A.
5 4. Theory
of Operators
203
the functions eips do not belong to L2. This is an example of a purely continuous spectrum. Example 3. Let us consider the Schrodinger operator -y” + q(x)y on L’(O, co) with continuous (for simplicity) real potential under the boundary condition y’(0) - hy(0) = 0 (- co < h < co). First of all we define it only on the functions of finite support that are of class C2 and satisfy the boundary condition; in this role we denote it by Qo. The operator Q,, is symmetric, densely defined but not closed. The domain of definition of the adjoint operator Q$ consists of the functions z E L2 n C1 such that z’ is absolutely continuous, -z” + q(x)z, z E L2 and z’(0) - hz(0) = 0. Moreover Q$z = -z” + q(x)z. The domain of delinition of the closure Q0 is extracted from D(Q8) by imposing the additional condition lim [y’(x)z(x) - y(x)z’(x)] = 0 (z E NQ,*))x+cc This condition can be satisfied automatically (for example, this is the case if the potential is bounded from below) and then Q. is a selfadjoint operator. In the contrary case the deficiency index of the operator Q. is (1, 1). This is precisely the Weyl alternative, since in the oint case’ the equation Qtz = lz (im L # 0) has only the trivial solution in L2, while in the isk case’it has a one-dimensional space of solutions. In any event a selfadjoint extension Q of the operator Q. exists (it is unique in the first case but not in the second). All these extensions are in one to one correspondence with the spectral functions, namely, if p(l) is a spectral function then the resolution of the identity for the corresponding selfadjoint extension Q is generated by the inversion formula associated with p(l): P%m)
(in the notation
=
1. fw~(~~ A dP(PL) f -0D of Chapter 1, Section 2.12). From Parseval
identity
i.e. each measure j d((lF,f,f) is absolutely continuous with respect to the measure Jdp(l), its density being equal to the square of the modulus of the generalised Fourier transform of the function $ In this sense the resolution of the identity 6, for the Schrodinger operator is absolutely continuous with respect to the measure jdp(l). A similar result holds for the three-dimensional Schrbdinger operator -Au + q(x)u on the whole space (A.Ya. Povzner, 1953) and for more general operators (Mautner, 1953; Yu.M. Berezanskij, 1956). Example 4. Let us consider a Jacobian matrix J with positive off-diagonal elements. Given an orthonormal basis (e$’ of a Hilbert space E the matrix J defines a homomorphism J^: E + s (s is the space of all complex numerical sequences). Restricting 3 to vectors of finite support we obtain a densely defined symmetric operator J,-, on E. It is not closed, therefore we have to pass to its
204
Chapter 2. Foundations and Methods
closure IO. The adjoint operator J,* is the restriction of the operator J^to vectors which satisfy the condition .?u E E. In accordance with the Weyl alternative the equation J$u = lo either has only the trivial solution for all L (im 1 # 0) and then the operator .& is selfadjoint (point case) or the space of solutions of this equation is one-dimensional for all L (im 1 # 0) in which case the deficiency index of the operator Jo is (1, 1) (disk case). In any event a selfadjoint extension A of the operator Jo exists (it is unique in the first case but not in the second). All these extensions are in one to one correspondence with the extremal solutions a(n) of the power moment problem associated with the given Jacobian matrix. In fact, if CZi is the resolution of the identity for the operator A, then a(2) = (C?,e,, eo). Returning to the Schriidinger operator on the semiaxis, we note that progress in investigating its spectral properties was largely dependent on the discovery and development5’ of the technique of so-called transformation operators (Delsarte, 1938; A.Ya. Povzner, 1948; B.M. Levitan, 1949) which establishes the equivalence of the operator Q and the elementary operator -y” with boundary condition y’(0) = 0. In this connection it is important that the transformation operator has the form 1 + K, where K is a Volterra integral operator with continuous kernel, so that the inverse operator has a similar form. V.A. Marchenko (1950) applied transformation operators to a wide range of problems on the spectral analysis of the Schriidinger operator. In particular he obtained in this way the asymptotic expression for the spectral function5*
and established a theorem on the uniqueness of the solution to the inverse problem of reconstructing the operator from its spectral function. A linear procedure for solving this problem, which is also based on the use of transformation operators59, was given by I.M. Gel and and B.M. Levitan (1951). In the physically important inverse problem of scattering theory60 the input data (to begin with the so-called limiting phases) are linked with the asymptotic behaviour61 of the solutions of the equation Qy = ly as x + co. An effective n connection with generalised shift (g.s.). The simplest g.s. (V(t)cp)(s) = (cp(s + t) - cp(s- t))/2 arises from the wave equation &@t2 = a2u/8s2; the general theory of g.s.is in many respects parallel to the theory of group representations (B.M. Levitan, 1962). 58The right hand side of this relation is the spectral function of the simplest operator, q(x) E 0. t is a question of the construction of a certan linear integral equation for the kernel K(x, 0, after which the formulae q(x) = 2 dK(x, x)/dx, h = K(0, 0) are applied. All this goes through under certain conditions on ~(1) which are close to being necessary. Necessary and sufficient conditions for p(L) to be the spectral function of a SchrGdinger operator on the semiaxis with continuous potential were obtained in a different way by M.G. Krejn (1951). 60Recently the inverse problem of scattering theory has been at the centre of attention of many investigations thanks to its utilisation for integrating the Korteweg-de Ries equation and other non-linear equations of mathematical physics. 61 The description of which forms the direct problem. We note that both problems (direct and inverse) under consideration are concerned with the stationary theory of scattering, which is formulated in terms associated with the energy spectrum and not with evolution in time.
$4. Theory of Operators
205
method for its solution (V.A. Marchenko, 1955) therefore required the use of transformation operators ttached’ to infinity:
The existence of such operators for potentials which decrease sulliciently rapidly as x + cc was established by B.Ya. Levin (1955). The inverse problem of scattering theory on the whole axis was solved by L.D. Faddeev (1964), also on the basis of the technique of transformation operators. Remark. For differential operators of order n > 2 transformation operators of the form 1 + K, where K is a Volterra integral operator, do not in general exist (V.I. Matsaev, 1960), however in the case of analytic coefficients the transformation operator was constructed by L.A. Sakhnovich (1961). In non-stationary scattering theory the problem is to compare the asymptotic behaviour as t + + cc of a physical system which is evolving in time t with its behaviour as t + - co. Example. The Schriidinger operator Q (just as any other selfadjoint operator) generates a group of unitary operators eiQt. In quantum mechanics there corresponds to this a non-stationary Schriidinger equation for the wave.function +:
where H is a Hamiltonian
h
(energy operator), h = g and h is Plan&s
If a particle with mass m is in a force field with potential H$ = - &A$
constant.
q(x) (x E R3), then
+ q(x)+
(for example, this represents an a-particle in the field of a massive nucleus62 which was the situation in the historical experiments of Rutherford). The effect of scattering is created by the potential q, i.e. it is associated with the deviation of the actual evolution from the free (unperturbed) evolution which is determined by the Hamiltonian
This deviation is described by the wave operators W, = lim e-iHteiK3t, t++m
which are clearly isometric. The limits here are naturally treated as strong limits 62This is two-particle scattering. N-particle scattering with N > 2 is a much more complicated process. It was investigated for N = 3 by L.D. Faddeev (1963).
206
Chapter 2. Foundations and Methods
and the first question concerns their existence (if not on the whole of L2 then on subspaces which have to be described). Following that we can define the scattering operatoF3 s = (w+)- C,
but this construction is satisfactory only if Im W- = Im W+ since then S maps isometrically onto D(W+). It is even better if we also have D(W-) = D(W+), since then S is a unitary operator on a certain Hilbert space. The elucidation of these cases is the second important question of scattering theory. Clearly the stated questions can be posed in a general form, i.e. for a pair of selfadjoint operators A,, A on a Hilbert space. Such an bstract’ theory is capable of enveloping both the quantum and the classical scattering processes (acoustic, optical etc.). The non-trivial nature of the problem is underlined by the fact that, if A is a selfadjoint operator for which zero is not an eigenvalue, then eiArx (x # 0) cannot have a limit under the norm as t + + co (as also for t + - 00). This also prevents the existence of wave operators in the presence of a discrete spectrum for A,. 64 The rigorous theory of scattering is oncentrated on the absolutely continuous spectrum. D(W-)
Kate-Rosenblum Theorem. Zf A = A, + B where B is a nuclear operator, the wave operators W, are defined on the subspace E,,(A,) and map it onto E,,(A). Thus the scattering operator S is defined and is unitary on E,,(A,).
Each of the operators W, intertwines the representations eiAot L(Ao)’
eiA
E,,(A)*
Therefore the absolutely continuous parts of the operators A,, A are unitarily equivalent. In applications the conditions of the Kato-Rosenblum theorem (1957) are not usually satisfied, however Kuroda (1960) discovered more effective conditions for this purpose. They in turn are covered by a theorem of M.S. Birman and M.G. Krejn (1962), in which the resolvents of the operators A,, A are required to be nuclear. It is even sufficient that the difference of the resolvents of q5(A,) and &A) be nuclear, where the function 4 belongs to a certain sufficiently wide class (M.S. Birman, 1962). 4.9. Spectral Operators. Functions of a selfadjoint operator are normal operators. However the spectrum of a normal, for example unitary, operator is not in general real. If
@=
w 44 6, s --Q,
(31)
63The Heisenberg S-matrix is unitarily equivalent to this operator by means of the Fourier transform q9H $ (i.e., from the physical point of view, the transition from the coordinate space into the impulse space). 64The case of commuting A, A,, is not of interest in scattering theory.
8 4. Theory of Operators
201
where el. is an orthogonal resolution of the identity, then spec @ is the closure of the set of essential values65 of the function 4 (spectral mapping theorem). The spectral resolution (31) is written more naturally in the form
@= scPd&I,
(32)
where J d@, is the operator-valued measure in the complex plane C which is the image of the measure J de, under the mapping 4. The support of the measure in (32) coincides with the spectrum of the operator @. The spectral resolution of a normal operator in the form (32) provided the starting point of the theory of spectral operators on a Banach space which was established by Dunford and Bade in the 1950s. In this theory it is postulated that there exists a resolution of the identity G(M) which is a projection-valued measure on C (E(C) = l), commutes with a given closed densely defined operator A and is such that spec(Al ,,,,eo,J c M (in this case the operator A is said to be spectral). The resolution of the identity for any spectral operator is unique and if A is bounded66 then A=
where R is a quasinilpotent
sc
Aa
+ R,
operator which commutes with A. The term A=
sc
Mqdil)
(33)
is also a spectral operator with the same resolution of the identity, spec A = spec A and moreover the whole spectrum is approximate. We say that operators of the form (33) are of spectral type. Therefore n is called the scalar part of the operator A and R is its quusinilpotent part. The representation of a bounded spectral operator as the sum of an operator of scalar type and a quasinilpotent operator is unique. The class of spectral operators is rather narrow. For example, the shift operator on the two-sided lp space (1 < p < co; p # 2) is not spectral (Fixman, 1959). The Id i ds
operator - - on Lp( - co, co) or on Lp( - rc, 7~)with periodic boundary conditions also fails to be spectral if p # 2. We shall describe below a much wider class of operators (those with so-called separated spectrum). 4.10. Spectral Subspaces. Let A be a closed densely defined operator on a Banach space E. A closed subspace L c E (L # 0) is said to be spectral for A if
65A value LXis inessential if q(L) # a almost everywhere with respect to each measure m, = j d&x, x). 66For unbounded spectral operators the situation is considerably more complicated than that described below.
208
Chapter 2. Foundations and Methods
1) L is invariant6’ and D(A) n L is dense in L, 2) spec(A1,) c spec A, 3) if a closed invariant subspace M # 0 is such that D(A) n M is dense in M and spec(d(,) c spec(A 1L) then M c L. We shall call a compact set Q c spec A spectral if there exists a spectral subspace L = L(Q) for which spec(Al,) = Q (this subspace is clearly unique). As a trivial example we have: if A is bounded then the compact set spec A is spectral, the corresponding spectral subspace L being E. It can happen that no other spectral compact sets exist. Example. Let us consider the Banach space of scalar functions which are analytic on the disk IL1 -C 1 and continuous up to its boundary under the usual norm rnaxu,=r Id(n)]. We have on it the operator n of multiplication by 1. The spectrum of the operator n coincides with the disk D = (1: 111 < l}. Suppose that Q is a non-trivial spectral compact set with corresponding spectral subspace L. We put A = D\Q. If p E A, for any 4 E L the function (A - $‘#(A) is regular for A = p, but then b(p) = 0. Thus if 4 E L we have & = 0. It then follows from the uniqueness theorem that 4 = 0, i.e. L = 0 in spite of the definition of a spectral subspace. Any compact set Q c spec A which is open in spec A is spectral. The corresponding spectral subspace is obtained as the image of the projection
p = - &
s t-
R,dl,
where I is a simple closed contour separating Q from its complement Q’in spec A. Moreover 1 - P is the projection onto the spectral subspace corresponding to the compacP set Q This construction is contained in the functional calculus of F. Riesz, which we shall describe below under the assumption that A is bounded. Let us put
d(A) = - & j- 4(4R, dl, lwhere 4 is a function which is analytic on spec A, i.e. on some neighbourhood G 1 spec A, and I is a closed contour which does not cross itself, lies in G and contains69 spec A in its interior. Formula (34) defines a homomorphism of the algebra of functions which are analytic on spec A into the algebra of bounded operators, such that if #(A) = 1 then 4(A) = A.
67This term requires a more precise definition since A need not be defined everywhere on E. Inuariance of a subspace L means that if x E D(A) n L then Ax E L. The restriction Al, has to be interpreted as the restriction of A to D(A) n L. It is clearly closed along with A, however D(A) n L may fail to be dense in L when D(A) is dense. 68Thus the spectral subspace L = Im P is completely reducing. If Q consists of a single point n and L is finite-dimensional, then p is an ordinary eigenvalue and L is the corresponding maximal root subspace. 6qThe neighbourhood mentioned and the contour r can be disconnected. The integral does not depend on the choice of contour.
Q4. Theory of Operators
209
For the calculus just described7’ Dunford (1943) established the spectral mapping theorem, spec 4(A) = 4 (spec A), and the superposition theorem: ($0 Q)(A) = $@(A)) under the condition that Q is analytic on spec A and $ is analytic on hvec 4. If the function qj is analytic on the disk 1II < r, where r > p(A), then, expanding it in a Taylor series and using the expansion (23), we obtain from (34)
i.e. we arrive at the previous definition of the function from the operator. We shall say about an operator A on a Banach space E that its spectrum is separated if each compact set Q c spec A which is the closure of its set of interior points relative to spec A is spectral. For example, any spectral operator is of this type: the spectral subspace L(Q) is the image of the projection
CZQ =s @(dl).
Q
Separatedness of the spectrum of an operator is ensured by a certain bound on the growth of the resolvent close to the spectrum. In fact, suppose that spec A lies on a curve C c @ and that in a neighbourhood of each point p E spec A the inequality
IM
G &W,
C))
(A $ Cl,
(35)
is satisfied, where M,,(6) is a decreasing function of 6 > 0 (6 < do = do@)) such that the Levinson condition do
In In M,(6) da < co s0
(36)
is satisfied. Then A is an operator with separated spectrum (Yu.1. Lyubich V.I. Matsaev, 1960). In the proof of this theorem a special functional calculus is used, with the help of which quasiprojections71 onto the spectral subspaces are constructed. It also allows us to establish (under conditions (35), (36)) the completeness of the system of spectral subspaces corresponding to any covering of the spectrum by finite arcs of the curve C. Example. The Schrodinger operator on L2( - 00, co) with periodic complex potential has separated spectrum (V.A. Tkachenko, 1964). For a bounded operator A with real spectrum Levinson condition is equivalent to Ostrowski condition with respect to the weight a(t) = lleiArII (t E R): his can be developed further in the case of a spectral operator. ” These are operators Q, associated with a compact set Q and a neighbourhood U = Q such that @ILCpi= 1 and cD~,~) = 0 for compact sets K lying outside of U. certain heoryof duality’ also holds for spectral subspaces under conditions of type (36) (Bishop, 1959; Yu. I. Lyubich, V.I. Lomonosov, V.I. Matsaev, 1973).
210
Chapter 2. Foundations and Methods
(37,)
The latter signifies the non-quasianalyticity 73 of the Fourier image of the class L,‘(R) of functions which are summable with respect to the weight a. Starting from (37,) it is possible to construct the functional calculus &A) =
m qb(t)eeiA’ dt (4 E mw) f -02 independently of the previous approach; this is the operator Fourier transform, which allows us to obtain functions of compact support of the operator A, in particular quasiprojections onto spectral subspaces. Moreover L(Q) is constructed as the intersection of all the subspaces {x: J(A)x
= x}
(($1, = 1).
The requirement of boundedness of the operator A is not essential, the important thing being rather that it generates a group74 T(t) (t E W). The stated conditions of growth of the resolvent or exponential cannot be weakened: under any more rapid growth rate there arise operators for which the spectrum on any closed invariant subspace coincides with the spectrum on the whole space. For a bounded operator I/ with spectrum on the unit circle the requirement
cm ln IIv I
n=-m n 2 +l
(37,)
is the analogue of Ostrowski condition and it is equivalent to Levinson condition; thus separatedness of the spectrum of the operator75 V follows from this. In particular, any isometric operator on a Banach space has separated spectrum but it is certainly not the case that any such operator is spectral. Any closed subset F of the circle (or line in the case of (37,)) is the spectrum of some non-quasianalytic operator ” . If the system of eigenvectors of a non-quasianalytic operator is complete, its spectrum S coincides with the closure of its set ,4 of eigenvalues (N.K. Nikol kij, 1971), since in the contrary case there exists a compact set Q c S such that Q n /i = 125.But then there is a quasiprojection which annihilates all the eigenvectors and is the identity on L(Q). A bounded operator A on a Banach space E is said to be decomposable if for any open covering u;=, Gk 1 spec A there exists a system of spectral subspaces
73An operator A which satisfies condition (37,) is said to be non-quasianalytic. 741n this general case the condition (37,) is stronger than (36). ” Under condition (37,) the operator, as before, is said to be non-quasianalytic. ny non-empty closed subset F c C is the spectrum of some normal operator (selfadjoint if F c R, unitary if F c T (the unit circle)), namely the operator of multiplication by the independent variable on the space L2(F, p). where p is any measure with support F. As regards this we note that a bounded set A c C is the set of all eigenvalues of a bounded operator on a separable Hilbert space if and only if it is of type F, (L.N. Nikol kaya, 1970).
Q4. Theory of Operators
211
(,I,& such that spec(AILk) c Gk (1 < k s n) and c;=i L, = E. The theory of this class of operators has been developed in depth during the last 25 years in the works of Foias and his school. 4.11. Eigenvectors of Conservative and Dissipative Operators. Completeness of the system of eigenvectors (eigenspaces) of a conservative operator A on any Banach space E is equivalent to the almost-periodicity of the corresponding isometric representation” T(t) (t E R). The necessity of almost-periodicity is obvious: if (xk}y is a collection of eigenvectors corresponding to the eigenvalues {i&}y and 11~- En x,II < E, then
T(t)x - k xkeiAkf -c E (t E R). k=l II /I Hence it is clear that the orbit of the vector x is relativly compact. The sufficiency of almost-periodicity of the representation T for completeness of the system of eigenvectors of the operator A can be established with the help of the theory of representations of compact groups.‘* In particular, the spectral theorem for compact selfadjoint operators is generalised by this method: if A is a compact conservative operator on a reflexive Banach space E then its system of eigenvectors is complete (Yu.1. Lyubich, 1960). The reflexivity condition can be relaxed: it is sufficient that E should contain no subspace isomorphic to c. Moreover there is a counterexample in c (Yu.1. Lyubich, 1963). Example. Let (A,); be a sequence of distinct positive numbers which converges to zero. Let us consider the diagonal operator
4L) on c. It is compact and conservative: e
= (%t”) C) = (e n’&).
However its eigenvectors S,,,= (6,,,,), (m = 0, 1,2, . . .) only generate the subspace c0 of sequences which converge to zero. Remark. The ergodic theorem for an almost-periodic representation T(t) (t E R) follows easily from the completeness of the system of eigenvectors of the generator A: the existence of the limit 1 S GW.4 dt 5! s s -*
for each eigenvector x is clear, i.e. T(t)x = eiCrx for Ax = ipx. ” By the same token this is also valid for any bounded representations: sup IIT(t)ll = M < co. I We note that this approach also leads to the classical theorem of Bohr for almost-periodic functions - the case of a regular representation in AP(R).
212
Chapter 2. Foundations and Methods
According to the ergodic theorem the strong limit s Pn = lim T(t)eP’ dt s-%0s -,y exists for each J E R. This is a projection (llPJ < M, therefore if T is isometric, on W, = Ker(A - iA1) and also PAP,, = 0, i.e. PAlw, = 0 (1 # cl). Corresponding to each vector x E E we have a Fourier series in terms of eigenvectors 79 of the operator A:
Pn is an orthoprojection)
(in the case of a regular representation in AP(R) this is the classical Fourier series of an a.p.f.) The uniqueness theorem holds: Vk P2x = 0 * x = 0. The proof reduces to the scalar case if we consider the a.p.f. f(T(t)x) (f~ E*) since the Fourier coefficients of this function are c1 = f(P,x). Using the FejCr-Bochner method of summation for Fourier series of a.p.fs we can associate with each subgroup I c T a projection Pr (with bound llP,-ll < M) on the closure of the sum of the eigenspaces corresponding to eigenvalues 1 E I (N.K. Nikol kij, 1971). The replacement of subgroups by arbitrary subsets is not possible (Rudin, 1962). Thus once again we have evidence of a close connection between harmonic analysis and the spectral theory of operators. There is however another approach to the problem of completeness of the system of eigenvectors of a conservative compact operator, which is based on an analytic technique using a bound for the resolvent (even weaker than (26)). We say that an operator A on a Banach space E is quasiconservative if spec A is purely imaginary and M
IIJUI G m
(re A # 0, M a positive constant).
If this inequality is satisfied for re I > 0 (spec A lies in the half-plane re 1 G 0) we say that the operator is quasidissipative. It is clear that in a Hilbert space each operator of the form iA, where A is similar to a selfadjoint operator, is quasiconservative but the class of quasiconservative operators is not exhausted by these (A.S. Markus, 1966). Moreover even in a Banach space this class has an interesting spectral theory. Its simplest manifestation can be seen in the result that if an operator is conservative then the order of each eigenvalue is equal to one, i.e. there are no root vectors other than eigenvectors. It can be shown further that a quasinilpotent quasiconservative operator must be zero, because in a reflexive Banach space E the system of eigenvectors of a compact quasiconservative operator turns out to be complete and moreover (even without the requirement of compactness) there exists a system of uniformly bounded projections P2( lIPAIl < M, PAP,, = 0 if A # p) onto he set { ik PA # 0) coincides with the set of eigenvalues.
5 4. Theory
of Operators
213
the eigenspaces. Hence it follows that any normalised system {e,} of eigenvectors corresponding to pairwise distinct eigenvalues is distal:
Consequently the power of the set of eigenvalues is no more than dimt E. Generally speaking for operators related to selfadjoint operators we can expect a roperly organised’ system of eigenvectors to have properties close to those of an orthonormal system. Mukminov-Glazman Theorem. Let iA be a bounded dissipative operator on a Hilbert space and {&,}y a subset of its set of eigenvalues such that
c
m#n
(im A,)(im A.,) IA,-;1,12 < O
Then a system {u.}? of corresponding its closed linear hull E,.
(38)
normalised eigenvectors
is a Bari basis of
Proof. Without loss of generality we can assume that the sum of the series (38) is equal to q2 c b. Since im(A(au, + /?u,), au,,, + flu,) 2 0 for all a, B, we have
Ikl~ %)I2 G
4(im A,)(im A,) I4Pl - Xl”
(m # n).
Hence by (38)
In addition
from which it follows that the system (u.}? is o-linearly system is a Bar-i basis in E, by Krejn criterion. lJ
independent. Thus this
The connection between selfadjoint and unitary operators on a Hilbert space, which is realised by means of the Cayley transform, extends to dissipative operators and contraction operators correspondingly. If iA is a dissipative operator its Cayley transform I = (A - ,Il)(A
- ;11)+
(im 1 > 0)
is a contraction and also Im(V, - 1) = D(A) is dense. Conversely, any contraction V for which Im(V - 1) is dense (since Ker(V - 1) = 0) is the Cayley transform of the dissipative operator iA = (TV - Al)(V - 1)-l. This remark allows us to transform theorems on dissipative operators into analogous theorems on contractions and conversely. Thus, for example, we have the analogue of the Mukminov-G&man theorem for contractions with condition
214
Chapter 2. Foundations and Methods
- IUZ)
(39)
m#n
which results from replacing Iz,, 1, in (38) by their imagesso with respect to the fractional linear transformation I = -i([ + l)(c - 1)-l. Remark. Condition (39) is only meaningful for eigenvalues which lie in the disks1 111 < 1. This is not surprising if we take account of the theorem of Sziikefului-Nagy, Fobs and Lunger on the decomposition of an arbitrary contraction into an orthogonal sum of a unitary operator and a completely non-unitary (i.e. it has no unitary parts) operator.82 A similar remark can be made with regard to dissipative operators. Conditions (38) and (39) can be expressed in terms of Blaschke products: t38) *
jx” (s(
The weaker conditions
m
n
’ 0;
A -I, >o; I I
inf n m m mnZnI, - I,
(39) * 8” ( f-T,
; / > 0. ln n
44-Al >0 I n,i, I
inf n m rn#rl l-
arise in the problem investigated by Carleson (1958) of the interpolation cp(&) = ~1,(n = 1,2, 3, . . .) of bounded sequences (a,)? by functions cpof class H” which are analytic and bounded on the half-plane im 1 > 0 or, respectively, on the disk I;11c 1. In fact these conditions are necessary and sufficient for solvability of the stated problem for all (a,)? E I Starting from Carleson criterion, V. Eh. Katsnel on proved that if in the Mukminov-Glazman theorem condition (38) is weakened to the corresponding condition in (40), then the system {u ? will be a Riesz basis of its closed linear hull (and the situation is similar for contractions). The proof (for a contraction V) is based on the fact that for any finite set S of natural numbers a function cp E H” can be constructed on the disk 111 < 1 such that cp(l,) = 1 (n E S), cp(L,) = 0 (n 4 S) and 11cpII = suplql < C, where C does not depend on the choice of S. If only a finite set of interpolation conditions is retained (n < N), the function cp can be replaced by a rational (pN (in accordance with a theorem of Schur), after which a theorem of von Neumann can be applied to show that 11(pN(V) II i II (pNII. As N --, co the sequence of operators ((pN(V)) converges strongly to a projection Ps onto Lin{u,),,s along e. Since lIPsI\ < C for all S, we have that {un>y is an unconditional basis in its closed linear hull. * ith subsequent return to the previous notation. s1 Correspondingly (38) applies to eigenvalues which are not real. **For an isometric operator V this is the so-called Weld decomposition; moreover in this case the completely non-unitary part turns out to be a one-sided shift operator, i.e., it acts on a space of type L@ VL@ VZL@... , where L is some suitable subspace (and the unitary part acts on nzI V ).
0 4. Theory of Operators
215
4.12. Spectral Sets and Numerical Ranges. The theorem of von Neumann which has been mentioned is concerned with an aspect of spectral theory which was discovered by him; this is connected with subtle variants of functional calculus. We will discuss this topic briefly. Let A be an operator on a Banach space and S a closed subset of the complex plane C which contains spec A. The set S is said to be spectral for A if the bound II&t)II < sup,lql holds for all rational functions cpwhose poles lie outside of S. Von Neumann theorem asserts that the disk 111< 1 is a spectral set for all contractions on a Hilbert space.83 This is false on a Banach space 84, however, as V. Eh. Katsnel on and V.I. Matsaev (1967) showed, it becomes true for the disk I II < 3 and the radius p = 3 cannot be decreased for the class of all Banach spaces. For a normal operator on a Hilbert space its own spectrum is a spectral set (in consequence of the spectral theorem). Conversely, if any function which is continuous on (spec A) u {co} can be approximated uniformly on this set by rational functions, then the operator A is normal. Another useful substitute for the spectrum of an operator A is its so-called numerical range. In the case of a Hilbert space it is defined as the closure of the set of values of the quadratic functional (Ax, x) for x E D(A), llxll = 1. According to a theorem of Hausdorff the numerical range of any operator A is convex.s5 It contains the approximate spectrum and consequently the convex envelope of this set, but in general the numerical range is more extensive than the convex envelope of spec,A. For a normal operator these two sets coincide. The numerical range of an operator A on a Banach space is defined to be the closure of the set of values j&4x), where x E D(A), I/XII = 1 and f, is a support functional at the point x (the mapping x H f, is said to be supporting, as in the smooth case, but now, in general, it is not uniquely defined). The following example (Bonsall-Duncan, 1980) shows that the numerical range is not generally convex: for the space C(S) (S compact) with f, = 6,(,,, where s(x) is any point at which maxS Ix(s)! is attained, the numerical range of the operator defined by multiplication by a function n(s) coincides with the set of values of this function. The numerical range depends on the choice of support mapping but, if A is a bounded operator (D(A) = E), all its numerical ranges have the same convex envelope (Lumer-Phillips, 1961). This set coincides with the intersection of the half-planes re([e< r(e), where r(0) (0 < 8 < 24 is the minimum of those r for which IleAc(I G efKl (arg [ = 0), i.e. l&II G l/(lcl - r(0)) (arg 5 = 8, I[1 > z(0)). The convex envelope of the spectrum is contained in the numerical range. In all cases the numerical range of an operator A contains its approximate spectrum. * onversely, if the unit disk is a spectral set for an operator A (even on a Banach space) then A must be a contraction. This follows trivially from the definition with q(A) = A. * f the unit disk is a spectral set for all contractions on some Banach space then the space must be Hilbert (Foias, 1957). ssIt is enough to establish this result in 2-dimensional Euclidean space. But in this case the numerical range is an ellipse (which can degenerate into an interval) or a point (for a scalar operator).
216
Chapter 2. Foundations and Methods
For any choice of support mapping the numerical range of a dissipative operator lies in the half-plane re 1 < 0. Conversely, if the numerical range of a closed, densely defined operator lies in the half-plane re 1 < 0, the half-plane re 1 > 0 is quasiregular and if in addition n, = 0 the operator is dissipative. 4.13. Complete Compact Operators. A compact operator A on an LTS E is said to be complete if the system C,(A) of root vectors corresponding to the non-zero eigenvaluess6 is complete in Im A. In the case where Ker A = 0 this means that for the operator A-‘: Im A + E the system of root vectors is dense in its domain of definition (and it turns out to be dense in E if D(A-‘) = Im A is dense). In typical concrete situations, A-’ is the initial operator, for example, if A-’ = L, a differential operator, then A is the integral operator defined by the Green function of the operator L. Any selfadjoint (and even any normal) compact operator A on a Hilbert space is complete. It is natural to expect that completeness is preserved under small perturbations of these operators. This expectation is justified if the eigenvalues of the unperturbed operator tend to zero sufficiently rapidly. The first general result in this direction is due to M.V. Keldysh (1951). Theorem of Keldysh. Suppose that the operator A on a Hilbert space E has the form A = S(l + R), where S, R are compact operators, S is selfadjoint and the eigenvalues o, # 0 (n = 0, 1,2, . . .) of S, taken according to their multiplicities, satisfy the condition
O b <* for some p > 0.
(41)
Zf Ker A = 0 and Ker S = 0, the operator A is complete.
Condition (41) is equivalent to the requirement that S belongs to the class GP. We shall clarify this from a certain more general point of view. Let T by any compact operator on a Hilbert space E. Then T* T is a selfadjoint non-negative compact operator. Its positive eigenvalues can be enumerated according to multiplicity in decreasing order*‘: r1 2 rZ > .** . The numbers s,(T) = 4~1, SZ(T) = ,/rz, . . . are called the singular numbers (s-numbers) of the operator T (clearly s,(T) = II T II). It turns out that they coincide with the anumbers (D. Eh. Alakhverdiev, 1957). The class GP for a Hilbert space is therefore determined by the condition
For a selfadjoint operator this condition
is equivalent to (41).
Remark 1. It can be shown that if Ker T = 0 and Ker T* = 0, the operators T* T and TT* are similar (even unitarily similar). Therefore T* E GP o T E G,,. 86 We denote the system of root vectors corresponding to all eigenvalues by Z(A). s’ If T is of finite rank r then we put T” = 0 (n > r).
5 4. Theory of Operators
Remark 2. For any compact operator T on a Hilbert expansion
217
space we have a Schmidt
T=c”dT)(., GL, where (u,), (u,) are two orthonormal systems. The first consists of the eigenvectors of the operator A, = m while the second is made up of those of the operator A, = JTT*. This result - is closely - connected - with polar representations: T = U,A, = A,&, where U,: Im T* + Im T, U,.: Im T + Im T* are invertible isometries. Moving on to a concise account of the proof of Keldysh theorem, we recall that $, is a two-sided (not closed) ideal in the algebra 5?(E) of all bounded operators on the Hilbert space** E. We denote by P the orthoprojection onto the closed_ linear hull of the system Z,(A) = Z(A) and put Q = 1 - P. Then the operator A = QAQ = QA turns out to be a Volterra operator and therefore its Fredholm resoluent F(c) = (1 - CA -’ is an entire function of c. Moreover 2 E G,,. We can deduce from this that F(c) has growth of order no higher than p and of minimal type as l[l+ co: In 11 F(C)11 = o( Ill . On the other hand, using the given structure of the operator A, we can show that IIF(LJ/ is bounded on the pair of angles E < (arg (1 < x - E. Boundedness on the pair of angles larg 51 < E, n - E < larg cl < n follows from the Phragmen-Lindelof principle. Finally, F(c) is bounded on the whole plane. By Liouville theorem F(c) is constant. Consequently 1 = 0, i.e. QA = 0, which establishes the required inclusion Im A c Im P = Lin Z,(A). Corollary. If S, R satisfy B = (1 + R)S is complete.
the conditions
of the theorem, then the operator
In fact Ker (1 + R) = 0, therefore we have from Fredholm theory that 1 + R is an invertible operator; but then B is similar to A: B = (1 + R)A(l + R)- 0 Remark. V.I. Matsaev (1961) established that it is sufficient in Keldysh theorem to require that the operator S belongs to the two-sided ideal 6, which is defined by the condition: Z n- ,( T) < co. He developed a general method for estimating the growth of the resolvent using the distribution function of the s-numbers, from which he was able to obtain profound tests for completeness of a system and summability of the corresponding expansion. We note here that if A, B are selfadjoint operators with B E 6, and spec(A + Bi) c R, then A + Bi is an operator with separated spectrum and a complete system of spectral subspaces (corresponding to an arbitrarily line covering of the spectrum), since its resolvent satisfies Levinson condition (V.I. Matsaev, 1961). Example. Let us consider the differential operator L2(a, b) ((a, b) a finite interval) by the expression
A which is defined on
*sThe only non-trivial closed two-sided ideal in f?(E) is the ideal of compact operators (theorem of Calkin).
218
Chapter 2. Foundations and Methods
and boundary conditions” n-l @j[Y]
=
1 k=O
Under these conditions
{ajkytk'(a)
let the
+
bjky'k'(b))
=
'
(1 <j < 4.
rincipal part’ lo=
ff’ ( > define a selfadjoint operator H and let both operators A, H be injective (the latter does not restrict generality). We put S = HThis is a compact selfadjoint operator. Its eigenvalues have the form 1= io where o runs through the set 52 of roots of the characteristic determinant A(o) = det(@j[ei
]);,kzl
(the Ek are the nth roots of unity). The multiplicities of the eigenvalues do not exceed n - 1. Since A(o) is an entire function of exponential type we have
for any 6 > 0. The operator S therefore satisfies the conditions theorem for any” p > l/n. We now have
of Keldysh
But
is a Hilbert-Schmidt integral operator and is therefore compact. Consequently A-’ = S(l + R), where R is a compact operator. Applying Keldysh theorem we establish that the system of root functions of the operator ~4 is complete in L a, b).
Similarly we can obtain completeness theorems for systems of root functions of partial differential operators of elliptic type on bounded domains under boundary conditions, where the principal part of the operator is selfadjoint. Under conditions which are strictly stronger than those of Keldysh theorem it is possible to obtain a theorem on the expansion of each vector x E E in a series * e assume that the boundary conditions are linearly independent and the coefficients pJs) are measurable and essentially bounded. he weaker result that S E Gz follows from the fact that S is an integral operator with HilbertSchmidt kernel.
5 4. Theory of Operators
219
of root vectors of the operator A, which converges after a certain grouping (independent of x) of its terms (AS. Markus, 1962). In a generalisation of Keldysh theorem to operators on a Banach space (A.S. Markus, 1966) selfadjointness has a suitable replacement in the property of being quasiconservative. Theorem. Suppose that an operator on a Banach space has the form A = S(l + R), where S, R are compact operators and in addition S is a quasiconservative operator of class GP for some p > 0. If Ker A = 0 and Ker S = 0, the operator A is complete.
The eigenvalues of any compact operator can be enumerated in order of decreasing modulus, taking account of multiplicity: I,, A,, . . . (if the set of eigenvalues is finite we complete the infinite ail’ with zeros). If the series c. 1, converges unconditionally (and so also absolutely since it is a series of numbers) then its sum is called the spectral trace of the operator A. A natural and important problem is to determine conditions under which the spectral trace coincides with the tensor trace (clearly such conditions must ensure the existence of both traces). The first advance in this direction (not counting the classical theorem for the finite dimensional case) is due to V.B. Lidskij (1959), who showed that for any operator of class 6, on a separable Hilbert space E (in other words, the operator is nuclear since E is a Hilbert space) both the spectral trace and the matrix trace with respect to any orthonormal basis exist and they are equal.g1 By the same token the tensor trace is equal to the spectral trace in the given case. In further investigations (A.S. Markus - V.I. Matsaev (1971), K&rig (1986)) this result was extended to operators of class 6, on any Banach space. One of the important applications of Lidskij theorem is to provide a necessary condition for completeness of the system Z(A) of root vectors of a compact operator A on a separable Hilbert space, whose imaginary componentg2 im A is nuclear: l im Iz, = tr(im A). For the proof it is sufficient to orthogonalise the system L A). The diagonal elements of the matrix of the operator im A with respect to the resulting orthonormal basis are equal to im ;2, (n = 1,2, .. .). For a compact dissipative operator A on a Hilbert space, whose real component is nuclear, the race equation rewritten in the form $i re 1, = tr(re A), 91A fundamental result in this situation is that the matrix trace of a Volterra operator of class B, is equal to zero. If A is a bounded operator (D(A) = E) then re A fr $A
+ A*),
im A zf &A
- A*).
220
Chapter 2. Foundations and Methods
is not only necessary but also sufficient for completeness of the system J&4) (M.S. Livshits, 1954). In fact, for the invariant subspace E, = Lin C(A) the trace of the operator re(AIE1) is given by (42’). If E, # E then E, E Ef # 0 and then for the orthoprojection P on E,, we will have tr(P* re(Al,,)’ P) = 0; hence re(Al,,) = 0 since re A < 0. Consequently ( l/i)A(eO is selfadjoint (E, is invariant for A*). Since A IEOis compact it must have an eigenvector - a contradiction. It is clear from what has just been discussed that the inequality zt re J., 2 We -4 holds for any compact dissipative operator with nuclear real component. The system of root vectors of a nuclear dissipative operator is complete (V.B. Lidskij, 1959) since Inn, = tr A and In& = tr A*. In conclusion we note that completeness of a compact operator is not in general inherited by its restrictionsg3: any Volterra operator on a Banach space is the restriction of a complete compact operator acting on some larger Banach space (N.K. Nikol kij, 1969). 4.14. Triangular Decompositions. The theorem on the existence of a (nontrivial) closed invariant subspace for any compact operator A on a complex LCS E (dim E > l), with the help of Zorn lemma, leads to the existence of a chain of such subspaces which is maximal with respect to inclusion. Any such chain begins with zero and terminates with the space E. In the finite-dimensional case it has the form 0 c Lo c L1 c *.* c L, = E, where dim Lk = k (0 < k < n = dim E). Any system of vectors (~1; chosen so that eL E L,\Lt-I (1 < k < n) is a basis in E with respect to which the matrix of the operator is upper triangular. In the infinite-dimensional case the chain Z is not in general discrete (for the order topology) and therefore it cannot generate a basis for a triangular decomposition. The latter has to be connected directly with the chain Z. For each non-zero subspace M E Z we consider the closure M- of the union of all those L E Z which are contained in M but are distinct from it. Then M- E Z since the chain Z is maximal and clearly M- c M. If M- # M, the factor space M/M- is onedimensional. Those M for which this holds are called points of discontinuity of the chain Z. In the absence of points of discontinuity the chain Z is said to be continuous. If M is a point of discontinuity,
the factor operator A, can be identified with some scalar AM. Ringrose (1962) proved the following theorem (for a Banach space E). Theorem. The set of all non-zero values 1, corresponding to all possible points of discontinuity coincides with spec A\(O). For each I E spec A\(O) the number
931f an operator is complete along with all of its restrictions, then we say that it allows spectral synthesis.The problem of spectral synthesis goes back to a work of Wermer (1952) which is concerned with normal operators. The general situation was studied by A.S. Markus (1970).
8 4. Theory of Operators
221
of points of discontinuity for which 1, = A is equal to the dimension of the corresponding root subspace. Corollary. In order that the operator A be a Volterra operator it is necessary and sufficient that 1, = 0 for all points of discontinuity of the chain 2.
In particular, if the chain 2 is continuous, A must be a Volterra operator. Conversely, if A is a dissipative Volterra operator on a Hilbert space E and Ker A = 0, then all maximal chains of its closed invariant subspaces are continuous (M.S. Brodskij, 1969). A continuous maximal chain 2 for any Volterra operator A on a Hilbert space E results from inessential extension. In this construction a certain Hilbert space G (dimt G depends on A) is adjoined orthogonally to E and we take the operator A”on H = E @ G such that alE = A, A c= 0; .? has the required chain. A Volterra operator A is said to be unicellular if the maximal chain of closed invariant subspaces is unique 94. This term originates from the simplest finitedimensional example - a single Jordan cell. A non-trivial example95 is the integration operator Z on Lp(O, 1) (1 < p < co). In fact, it follows from the Titchmarsh convolution theorem and the completeness of the system of powers that any closed invariant subspace of the operator Z must consist of all functions which vanish almost everywhere on a fixed interval (0, a) (0 < a < 1). In exactly the same way we can establish the unicellularity of all integral operators of fractional order:
V,dW =&s’6-V-W) dt
(0 < c1< 1).
0
It is interesting that no two of these operators on L O, 1) are equivalent (I.Ts. Gokhberg - M.G. Krejn, 1967), a result which contrasts sharply with the equivalence of all unicellular operators on a finite-dimensional space. In a Hilbert space E instead of closed subspaces we can consider the corresponding orthoprojections. Corresponding to the subspace inclusion L, c L, we have the inequality P1 < Pz between the respective projections and therefore the chain of subspaces transforms into a chain of orthoprojections. Suppose we are given an arbitrary strongly closed chain W of orthoprojections and a function F: W + !i?(E). We can define the Brodskij integral A=
F(P) dP sW
as the uniform limit of the integral sums
94This is equivalent to pairwise comparability of the closed invariant subspaces. 95A further example: the weighted shif operator (&)g H (a,<,-,)~ (t+ = 0) on 1’ under specific conditions on the weight sequence (a,) (N.K. Nikol kij, 1967).
222
Chapter 2. Foundations and Methods
directed by the partitions PO < PI a* * < P., where all the Pk, Qk belong to the chain W and Pk-1 < Qk < Pk (1 < k < n). It is clear that not every function F is integrable in this sense. If A is a Volterra operator, 2 is a continuous maximal chain of closed invariant subspaces and W is the corresponding chain of orthoprojections, then we have the triangular decomposition (MS. Brodskij, 1961) P(im A) dP
A = 2i sW
(the existence of the integral is guaranteed). From this we extract a rich supply of information on relationships concerning properties of the real and imaginary components of a Volterra operator; for example there is the theorem of Sakhnooich that, if the imaginary component of a Volterra operator is a HilbertSchmidt operator, then so also is its real component; we also have the more general theorem of Matsaev: if A is a Volterra operator and im A E E$ then re A E $, (1 < p < co) and moreover N,(re A) < y,N,(im A), where the constant yp depends only on p. In the same direction for p = 1 (i.e. for im A nuclear) I. Ts. Gokhberg and M.G. Krejn (1967) established the deep spectral density theorem for the operator re A:
lim n+(R) 1s -=R a,
N,(dP*im
AodP),
R-W
where n+(R) is the number of eigenvalues (counted according to multiplicity) the operator re A in the semiaxis CR-‘, 00). 4.15. Functional Models. is a commutative diagram
of
A model of a linear operator A acting on an LTS E E-E
A
in which B is a linear operator on some LTS G and V is a topological isomorphism (an operator intertwining g6 A with B). Clearly the operators A and B are equivalent (similar) in this situation. Frequently we just call the operator B a model of the operator A. If E and G are Hilbert spaces and Y maps E isometrically onto G, we use the term unitary equivalence. On replacing an operator by its model all its linear-topological properties and its essential nature are preserved (for example, the spectrum, eigenvalues, approximate spectrum). Moreover a 96Here (as in certain other contexts) the term one space into another. As in the theory of ntertwining operator’ in a more general way, but not necessarily continuous invertibility. In
perator’ is used in spite of the fact that V acts from representations it is convenient to define the term namely to require only continuity of the operator V this situation we can speak of a quasimodel.
0 4. Theory of Operators
223
concrete model of an operator may prove simpler to study. For example, if it is a functional model (i.e. G is a function space), some analytic techniques can be applied to it. By way of illustration let us consider a Volterra operator A on a Hilbert space with one-dimensional imaginary component9’ and suppose that iA is completely non-selfudjoint (it has no selfadjoint parts). It follows from results of M.S. Livshits (1946) that A is unitarily equivalent to an operator f iZ;, where Z; is the integration operator98 on L’(O, 1) from s up to I (0 < I < co). The unicellularity of the operator Z; (which can be established by analytic means) therefore implies that of the operator A. However the last result can also be obtained directly from the triangular decomposition of the operator A (M.S. Brodskij, 1965) 99. In any event the very fact that the operator Z; is unicellular entitles us to fix it in the role of a standard triangular model which is not subject to further decomposition just as with a Jordan cell in the finite-dimensional case. The more general triangular model 1 (0 4 s d 2) WA(s) = +)cp(s) f i d0 dt ss generated by the real function a E L”(0, 1) is no longer a Volterra operator: the spectrum of the operator /i coincides with the set of essential values of the function u (the term a(s)cp(s)is analogous to the diagonal part of a matrix). Any completely non-selfadjoint bounded operator A with one-dimensional imaginary component and real spectrum is unitarily equivalent to an operator n defined by some non-decreasing function a and choice of sign in front of the integral. For the proof of this theorem MS. Livshits introduced a powerful unitary invariant, the characteristic function w,(A) = 1 - i(R,(A)e,
e)j,
where the channel vector e and the number j = + 1 are defined by the equation im A = &(. , e)e. It turned out that for the given class of operators coincidence of characteristic functions implies unitary equivalence. But for an operator n its characteristic function can be calculated immediately; in fact with j = 1 (to be snecific) (43)
Moreover for the operator A under consideration (with j = 1) the characteristic function maps the upper half-plane im 1 > 0 conformally onto the domain 1w 1> 1 and is regular at infinity. It can therefore be represented in the form (43) for some a(t). But then A turns out to be unitarily equivalent to the corresponding operator /i. ny operator with one-dimensional imaginary component is dissipative to within a factor + i. g8 The modification Z I+ Z’is occasioned by the desire to work with an upper- (rather than a lower-) triangular decomposition. In other respects it is immaterial. 99And Titchmarsh convolution theorem can be deduced from the fact that the operator Z is unicellular (Kalisch, 1962).
224
Chapter 2. Foundations and Methods
Where there is a non-real spectrum (necessarily no more than countable and consisting of normal eigenvalues), a iscrete’ triangular model is adjoined orthogonally to the previous ontinuous’ model; this is defined on 1’ by the formulae I ?k
=
Aktk
+
i .=g+,
&&i&
+
i
(Pb)jflk
dt
(k=1,2,3
,... ),
s 0
where (A,)? is the sequence of eigenvalues and /lk = im 1,. Correspondingly, Blaschke product
the
* &--& l-I-fz=l ;I - & appears in (43). Developing this approach, MS. Livshits (1954) constructed a triangular model for a completely non-selfadjoint bounded operator A with finite-dimensional imaginary component. In this case the characteristic function turns out to be a matrix of order r = rk(im A) and the role ofj is taken by a certain Hermitian matrix .Z such that Jz = 1. The characteristic matrix-function w,(n) (im A > 0) is J-expanding, i.e. W”(L)JW,(n) > I, and moreover on the real axis outside of the spectrum it is J-unitary: W”(n)JW,(L) = I. The corresponding analogue of formula (43) (with products of Blaschke type) was obtained by V.P. Potapov (1950). Passing to the limit as I + cc we can extend the triangular model further to the case where the imaginary component is nuclear. It is impossible to get rid of the condition of being completely non-selfadjoint, since the characteristic function does not change if an arbitrary bounded selfadjoint operator is adjoined orthogonally to A. However this clearly does not matter from the point of view of classification theory. We can say that for operators whose imaginary components are nuclear there is a valuable analogue of Jordan theorem. Incidentally, although the spectral resolution of a selfadjoint operator
reveals its structure sufficiently deeply, it is nevertheless useful to construct in addition a functional model for it. Its structure depends on the multiplicity of the spectrum which we will not define in the general case; we restrict ourselves to the case of a simple spectrum, i.e. where there exists a vector x such that the system (@lx)lo n is complete. lo1 Then, putting c#) = (a,~, x), we can show without difficulty that A is unitarily equivalent to the operator of multiplication by A on the space Li. Now let A be a contraction on a Hilbert space E. A natural measure of the non-unitariness of an operator A is the pair of non-negative selfadjoint operators To be precise it is assumed further that the non-real spectrum is infinite. lo1 A vector x with the stated property is said to be generating or cyclic.
5 4. Theory of Operators
1 - A*A and 1 - AA* (just as the imaginary measure of its non-selfadjointness). Let us put A, = ,/(l - A*A), The characteristic
225
component
of an operator is a
A, = ,/(l - AA*).
operator-function
W(A) = (-A
+ 1A,(l - AA*)-lb,)
I Im
is introduced in this context (Yu.L. Shmul an, 1953). The values of this function are contracting homomorphisms from I%& into Im A,. It is defined and analytic on the domain (reg A*)in particular on the disk 111 < 1, where it is usually considered. If the characteristic functions of completely non-unitary contractions coincide, then these contractions are unitarily equivalent (A.V. Shtraus, 1960). Szokefalvi-Nagy and Foias (1963) found a natural (in terms of characteristic functions) functional model for an arbitrary completely non-unitary contraction A. Of necessity it is rather complicated, since any contraction is the orthogonal sum of a completely non-unitary contraction and a unitary operator and any bounded operator becomes a contraction on normalisation. The construction of the model is significantly simplified if the sequence of powers (A ; converges strongly to zero (although even in this case getting information depends essentially on the measure of non-unitariness, more precisely on the rank of the operator A,). In order to describe the model operator in this situation we consider an arbitrary Hilbert space G and introduce the Hardy class H G) of analytic vector-functions
u(A)= jJ t&A”
n=O which satisfy the condition
(IAl < 1, u, E G (n = 0, 1,2, . ..)).
11412= nzo 11%112 < a* This is a Hilbert
space with natural scalar product
The shift operator lo3 (7%)(L) = Au(n) acts on it; it is clearly isometric (but not unitary: Im T = (u: u(O) = O}). The adjoint operator (T*u)(q
is a contraction
= @) ; No) = 2 u,+lA” n=o
(
u(A) = 2 U ” n=O >
(and the sequence of its powers converges strongly to zero). If
ro2This construction arose in another way in joint works of Szokefalvi-Nagy and Foias (1962) and for finite-dimensional A,, A, it is equivalent to the definition of M.S. Livshits (1950). A.V. Shtraus (1959) extended the concept of the characteristic function to unbounded operators. lo3 Strictly speaking, the shift is effected on the sequence of Taylor coefficients of the function u(A).
Chapter 2. Foundations and Methods
226
we now take G = Im A,, then T* turns out to be a quasimodel intertwining operator V: E + H2(G) is defined by the formula
for A. The
In fact 11Vxl12 = 2 (AJ , n=O
=
A,A ) = 2 ((1 - A*A)A , n=O
A”$
O{ IIA”412 - ll~“+w12) = llxllZ1
i.e. V maps E isometrically into H’(G). In addition it is clear that T*V = VA. Therefore the subspace L = Im V c H2(G) is invariant for T* and T*I, is unitarily equivalent to the operator A. If rk A, = 1, the model operator is the adjoint of a shift on the scalar Hardy class HZ. It is possible to describe all closed invariant subspaces for this operator (and hence also those for the initial operator A) (Beurling, 1949). Each such (non-zero) subspace has the form 8H2 = {cp: cp = e$ ($ E HZ)},
where 0 is a fixed function in H2 whose boundary values on the unit circle have modulus one almost everywhere (an interior function). The general approach to models of Szokefalvi-Nagy and Foias is based on the study of the so-called dilatations 2 of the operator. This term is applied to any operator A which acts on some Hilbert space E 2 E and which is associated with A by means of the relations A”=pAI” where P is an orthoprojection Theorem of Sziikefalvi-Nagy.
(n = 1,2, 3, . . .),
on E”with Im P = E. For any contraction A on a Hilbert
space there
exists a unitary dilatation. The original proof (1959) of this important theorem was connected with the trigonometric moment problem and the theory developed by M.A. Najmark (1940) of selfadjoint extensions of operators out of their basic space. lo4 There is, however, a more direct proof (Sziikefalvi-Nagy and Foias, 1967). First of all an isometric dilatation is constructed on ,& = E 0 E 0.. . in which E is embedded via the first summand. It has the form lo4 Figuring in this theory are non-orthogonal spectral resolutions of symmetric operators Pm J-m
where gA = I ,, 6, being an orthogonal resolution of the identity in a larger Hilbext space k and P an orthoprojection on E onto E.
$4. Theory of Operators
227
x0 0 Xl 0 x2 CD~~~~Ax~@A,x~~x~~x~~....
Now it is possible to extend an isometric dilatation to a unitary one. In fact, thanks to the Wold decomposition, we can suppose that the initial dilatation is a one-sided shift and then the corresponding two-sided shift will be its unitary extension. The theory of characteristic functions, functional models and dilatations found deep applications in mathematical physics.‘05 In particular, in quantummechanical scattering problems the characteristic matrix-function coincided with the so-called S-matrix of Heisenberg (MS. Livshits, 1956). This result is of a sufficiently general nature (Lax-Phillips, 1964; V.M. Adamyan - D.Z. Arov, 1965). It was used for the investigation of analytic properties of the S-matrix (B.S. Pavlov - L.D. Faddeev, 1972) in scattering theory for automorphic functions, which Lax and Phillips (1976) then developed in detail. The theory of dilatations proved to be an effective tool for investigating the so-called Redge singularity (B.S. Pavlov, 1970). From the physical point of view, non-selfadjointness or non-unitariness of the basic operators which describe the dynamics of physical systems reflects processes of dissipation (of energy or other ubstance’) caused by non-closedness (openness) of the systems, i.e. by their relations with the external world. Starting from these ideas M.S. Livshits (1963) associated with each open system a mathematical object called an operator node. This consists of a pair of Hilbert spaces H, E (H is the space of internal states of the system, E is the space of channels of communication of the system with the outside world) and a set of three continuous homomorphisms: a dynamical operator A: H + H, a channel operator J: E + E (J* = J, J2 = 1) and a communication operator I: E + H. It is assumed in addition that the relation im A = $TJT* is satisfied. Unitary equivalence is defined naturally in the category of operator nodes. The characteristic operator-function w(n)
= 1 - iJF’*R,(A)T
is a basic unitary invariant of an operator node. If an open system is decomposed into a chain of subsystems lo6 the characteristic function becomes factorised. Conversely, therefore, if we factorise the characteristic function into factors corresponding to elementary (in the physical sense) systems, we then obtain a model for the initial system in the form of a chain of elementary systems. Many of the concrete situations can be included in this scheme (M.S. Livshits, 1966). Metric. The structure introduced on a Hilbert space E by a operator J such that J2 = 1 (J # + 1) is called an indefinite metric,
4.16. Indefinite
selfadjoint
lo5 From nternal’ applications of this theory we note that of necessary and sufficient conditions for the existence of Riesz and Bari bases composed of root vectors of a contraction on a Hilbert space (N.K. Nikol kij - B.S. Pavlov, 1970). ro6Among the channels of communication are entrance and exit channels. Successive unions of systems are effected by connecting the exits of one system to the entrances of another. This procedure can be formalised in terms of what has been described.
228
Chapter 2. Foundations and Methods
or J-metric,
since it allows us to consider along with the usual distance the
pseudodistance
d&s Y) = t/CJCx- Y), x - Y) (with values in R u ilw) defined by the J-scalar product (u, o) = (Ju, u). The Lorentz metric in four-dimensional space, which is used in the special theory of relativity, is an example of this. Indefinite metrics on finite-dimensional spaces had already been studied by Frobenius (at the end of the 19th century). The first work dealing with operators on an infinite-dimensional space with an indefinite metric is due to L.S. Pontryagin (1944), further development being associated first of all with investigations of M.G. Krejn and I.S. Iokhvidov. At the present time the theory of linear operators on spaces with an indefinite metric represents by itself an extensive branch of functional analysis with outlets into numerous applications (theory of oscillations, quantum physics etc.). Clearly, we can represent the metric operator as the difference of two mutually orthogonal (non-zero) orthoprojections which form a decomposition of the identity: J=P+-P-;
P+ + P- = 1;
P+P- = P-P+ = 0.
Correspondingly, any x E E can be expressed as the sum of mutually vectors x, = P+ x, x- = P-x; moreover Jx=x+
-x-,
orthogonal
<x, x> = IIx+ II2 - lb- l12.
The index K = rk P- is the easure of indefiniteness of the space. It was assumed in the work of L.S. Pontryagin that ICis finite and many results were obtained initially under this condition. Remark. Suppose that on a complex linear space E there is given a Hermitian symmetric bilinear functional Oforallu~E+, u # 0 (i.e. E, is pre-Hilbert under the J-metric) and E = E- i E,. Starting with the decomposition x = u + u (u E E-, u E E,) we introduce a scalar product on E (x1, x2) = - (IQ, u2) + (ul, u2), thereby making E into a pre-Hilbert space. Completeness of the space E is equivalent to completeness of the space lo8 E, since u < co. If we have completeness, E turns out to be a space with indefinite metric: J = P+ - P-, where PT are the projections associated with the decomposition E = E- i E, .
l f E is finite-dimensional, K is the negative index of inertia of the form (x, x). lo8 It does not depend on the initial choice of the space E- .
$4. Theory of Operators
229
In what follows, E is a Hilbert space with indefinite metric; the notation introduced earlier remains fixed. The general form of a continuous linear functional on E, i.e. f(x) = (x, y), is easily reformulated in J-terminology: if z = Jy, then f(x) = (x, z). A subspace L c E is said to be positive (non-negative) if (x, x) > 0 for all x E L, x # 0 ((x, x) > 0 for all x E L). The following terms are introduced in a similar way: negative (non-positive) subspace. A subspace L is said to be neutral if (x, x) = 0 for all x E L. From the geometrical point of view (x, x) = 0 is the equation of a second order cone log; the positive vectors lie nside’ the cone, the negative ones utside’ it. The subspace E, = Im P+ is non-negative and maximal (with respect to inclusion) among the non-negative subspaces. Phillips (1959) and Yu. P. Ginzburg (1961) obtained the following important characterisation of maximal nonnegative subspaces. Theorem.
In order that L be a maximal non-negative subspace it is necessary that there exist a contracting linear operator ’ K: E, + E(E- = Im P-) such that L = Im(1 + K).
and suficient
Let A be a linear and (for simplicity) bounded operator on E. The operator A’ = JA*J is called the J-adjoint of A on the grounds that the identity (Ax, y) = (x, A ) is satisfied by it (and only by it). The formal properties of the adjoint are preserved in the J-adjoint: A” = A; (aA + /ID)’ = EA’ + FB (A@’ = B 1’ = 1. If A’ = A we say that A is J-selfadjoint. If A is invertible and A-’ = A’ then A is said to be J-unitary. The study of J-selfadjoint operators reduces to that of J-unitary operators with the help of the Cayley transform. For the sake of brevity therefore, unless otherwise stated, the assertions made in the sketch below relate only to a J-unitary operator A. First of all we note that it follows from the equality A-’ = J-lA*J that the spectrum of the operator A is symmetric with respect to the unit circle, i.e. if Iz E spec A then 1* E (1)-l E spec A. The spectrum cannot be unitary even in the finite-dimensional case. In general, a meaningful spectral theory for J-unitary operators requires that the index K is finite or at least the operator P- AP+ is compact. In this case A = U + I/ where U is unitary and I/ is compact. Consequently, the non-unitary spectrum of the operator A is no more than countable (if it is infinite, it condenses on the unit circle) and consists of normal eigenvalues L,, A:, A,, A,*, . . . (I 1,1 < 1 (k = 1,2, . . . )). The corresponding maximal root subspaces IV,, IV:, W,, W2*, . . . are neutral; each W, (Wk*) is lo9 Here the word one’ is used in the sense of elementary analytic (if preferred - algebraic) geometry. rr” Because of its geometric character we call this an angular operator. This is equivalent to the identity (Ax, Ay) = (x, y) in combination with the requirement of surjectivity; by itself the identity, which is known as J-isometry, is equivalent to the equality A*JA = J. The inequality A*JA d J (i.e (Ax, Ay) C <x, y)) defines the J-contractions. We can define J-dilatations similarly.
230
Chapter 2. Foundations and Methods
J-orthogonal to all R$ y* except W,*(W,); the subspaces W,, W,* (k = 1,2, . . .) are in duality with respect to the J-scalar product (consequently, dim Wk* = dim W,). Suppose that the index ICis finite. Then the dimension of the spectral subspace L, corresponding to the part of the spectrum which lies in the disk 111< 1 does not exceed IC and is equal to the dimension of the spectral subspace L, corresponding to the symmetric part of the spectrum. The direct sum L, = L, i L, is called the hyperbolic subspace of the operator A. The spectral subspace L, corresponding to the unitary spectrum is J-orthogonal to L, and L, + L, = E. By the same token (for finite IC) we can restrict attention to the case of unitary spectrum in the spectral analysis of J-unitary operators. In this connection substantial use is made of the Theorem of Pontryagin. If the index K is finite , any J-unitary’ l3 operator A has invariant maximal non-positive and non-negative subspaces.
The dimension of an invariant maximal non-positive subspace M is equal to K. It can be chosen in such a way that the spectrum of the operator Al, lies outside the disk 111 < 1. Let p(L) be the characteristic polynomial of the operator Al, (so that p(Al,) = 0). Then114 Im p(A) c M(l). But M(l) is a non-negative subspace. Consequently (p(A)y, p(A)y) 2 0 (y E E). Putting q(1) = p(n)p(n- , we obtain (q(A)y, y) > 0 and also q is a Hermitian-symmetric (real on the unit circle) polynomial in L and 1-l. This procedure for achieving definiteness (I.S. Iokhvidov, 1950) served as the key to the spectral theorem which is finally established for all’ ’5 specified J-unitary operators with unitary spectrum (Langer116, 1967). Although the procedure depends on the choice of subspace M, the final spectral resolution is unique. The main complication in the proof of the spectral theorem is associated with the roots of the polynomial q which lie on the circle 111= 1. Near to these ritical points’ the resolvent has power order of growth higher than the first. M.G. Krejn and Langer (1964) applied the indefinite metric to investigate the quadratic equation X2+BX+C=0
(44
in the algebra f!(H) of operators on a Hilbert space H. Here the coefficient B is assumed to be selfadjoint while C is compact, selfadjoint and non-negative. The operators And also under the condition that the operator P- AP+ is compact (M.G. Krejn, 1950). It is interesting to note that M.G. Krejn proof is based on the fixed point principle. 3A J-selfadjoint operator was considered in the work of L.S. Pontryagin. The transition to a J-unitary operator was effected by LS. Iokhvidov (1949) using the Cayley transform. I14 For any subspace L the subspace ~5 = {x: (x, y) = 0, (y E I,)} can be called its J-orthogonal quasicomplement. In general L n L ) # 0. ’I5 Without restriction of the type to compactness or finiteness of the index. 116J-selfadjoint operators were considered in the work of Langer (the finite index case is due to M.G. Krejn and Langer (1963)).
231
5 4. Theory of Operators
on the space E = H OH were considered. Since A is J-selfadjoint and P- AP+ = ,/C is compact, there exists a maximal non-negative subspace which is invariant for A. If K is the corresponding ngular’ operator, then X = KJC turns out to be a root of the equation (44). Also X*X < C since lllyll < 1. We note that equation (44) is naturally associated with the equation of linear oscillations Tii + Rzi + Vu = 0
(45)
Here ZJ= u(t) is a vector-function with values in a Hilbert space H and T, R, V are selfadjoint (generally unbounded) operators with T > 0, R 2 0 and V >> 0 (the forms generated by T and Vare the kinetic and potential energies respectively, R is a dumping operator). Under the substitution u = flu, equation (45) becomes Cfi+Bti+u=O
(46)
where C = V-“2TV”2, B = V-“2RV’12. The operators C and B are usually bounded in applications and after closure they satisfy the conditions formulated above. Exponential solutions u = uOeQf arise as eigenvectors of the quadratic pencil of operators S(p) = Cp2 + BP + 1, S(p)uO = 0. If X is an operator root of equation (44), 1 is any of its eigenvalues (2 # 0) and u0 is a corresponding eigenvector, then u,, is an eigenvector of the pencil S for the eigenvalue p = 1-l. A root X gives rise to another root Y = -X* - B, which is also a source of eigenvalues and eigenvectors of the pencil S. If we now put Z = -X - B, the pencil factorises: S(P) = (1 - Pm
- P-w.
The pencil S is said to be strongly damped if m
4 > 2J{ (x9 4 (Cx, xl>
(x # 0).
It is easy to see that in this case all the eigenvalues of the pencil are non-negative. Taking account of multiplicity, we shall denote by (-0;) those of them which are generated by the operator X and by (- 0:) those generated by the operator Y. It turns out that the pencil has no other eigenvaluesl” (and there are no other roots of equation (44)). Each of the corresponding normalised systems of eigenvectors {u:}, (0:) is a Riesz basis in H. In this sense the system of all eigenvectors of the pencil is doubly complete. The phenomenon of multiple completeness of the system of root vectors of a polynomial operator pencil had already been discovered by M.V. Keldysh (1951). Since then the theory of polynomial pencils has developed into an
And even no other points of the spectrum, i.e. the operator S(p) is continuously invertible for p different from (-CD;), ( - u$ .
232
Chapter
2. Foundations
and Methods
independent chapter of functional analysis and has found various applications in mathematical physics. Remark. The investigation of quadratic equations in the algebra 52(H) is, on the whole, a problem of transcendental difficulty. This is clear from the fact that for a continuously invertible operator A the existence of a root of the equation X2 - AX = 0 different from 0 and A is equivalent to the existence of a non-trivial closed invariant subspace. 4.17. Banach Algebras. This important topic is closely connected with the spectral theory of operators. In fact, let ‘3 be a complex commutative Banach algebra (in the sequel it will simply be called an lgebra’). Let us consider the isometric embedding T: ‘9I + !2(9I) which associates with each a E VI the operator T,x = ax (x E ‘3l).Invertibility of an element a is equivalent to invertibility of the operator T, in Q(a). Consequently, spec a = spec To and so information about the spectra of bounded operators carries over to ‘3. In particular, the spectrum of any element a E ‘$I is non-empty and compact, Gel and formula holds for the spectral radius p(a) etc. Example. Let A be a bounded operator on a Banach space E and let [A] be the smallest closed subalgebra of 2(E) which contains A, i.e. the uniform closure of the algebra of all polynomials in A. We denote the resolvent set and the spectrum of the element A in the algebra [A] by reg,A and spec,A respectively. Clearly, reg,A c reg A and hence spec A c spec,A; moreover p,,(A) = p(A) as a result of Gel and formula. It turns out that reg,A coincides with the unbounded connected component G, of the set reg A; correspondingly specJ = spec A u G, where G is the union of the bounded connected components of the set reg A. In fact, if 112)> p(A) we see by expanding in a Laurent series at infinity that R, E [A]. But then it follows from the Taylor expansion of the resolvent at the point 1 that R, E [A] for p in the maximal neighbourhood of the point I which does not intersect spec A. Connecting the point 1 to any point [ E G, by means of a polygonal line, we deduce after a finite number of steps that R, E [A]. Thus G, c reg,A. Now let H be a bounded connected component of the set reg A and let p E H n reg,A. Then for any E > 0 there is a polynomial such that II R, - p(A) II < E. Any point J, E 3H is contained in the approximate spectrum of the operator A. Let (x,,)? be a quasieigensequence: Ax, - Ix, + 0, 11x,11= 1. Applying the operator R, - p(A) to x,, we obtain in the limit: I(n - p)-’ p(n)1 < E. Thus the function (A - pL)-’ can be uniformly approximated by polynomials on aH, which contradicts the existence of a singularity at the point p E H. It is now clear that the equality spec A = spec,A holds if and only if spec A does not disconnect the plane. It is not difficult to determine that algebra defined by an operator A for which the spectrum of the element A coincides with that of the operator A. This is the smallest closed subalgebra [[A]] of f?(E) which contains A and the family of operators (&IA, n, where n is a set obtained by choosing one point from each bounded connected component of the set reg A (the algebra [[A]] does not depend on the choice of set A).
5 4. Theory of Operators
233
Returning to an arbitrary algebra ‘$I, we note that for each closed ideal J c 2I the factor algebra ‘2X/Jis a Banach algebra under its usual norm. But all maximal ideals are closed since the closure of an ideal J # !!I is an ideal .i # ‘$I, as a consequence of the fact that the set of invertible elements is open. The factor algebra E = %/M for a maximal ideal M is therefore a Banach field over @. If E # @ then for any < E E\@ and for all 1 E @ the non-zero elements r - I are invertible in,E. But then spec 5 = 0. With this we have established the Gel’&&Mazur theorem that a complex commutative Banach algebra is spectral. The characters x of an algebra ‘+Iare continuous because of the closedness of their kernels, these being maximal ideals. Since x(a) E spec a, we have Ix(a)1 < p(a) < Ilull. Moreover x(e) = 1 and so llxll = 1. Now let us consider the set ‘%R= 9R(‘%)of maximal ideals of the algebra 2I. It can be regarded as a subset of the unit sphere of the conjugate space ??I*, in which case it is closed since it is determined by the system of equations &,a,) = &,)&) (al, a, E 2I), x(e) = 1, which are defined by continuous functions of x. Consequently, %I is compact. Example 1. If ‘?I = C(S) (S compact) then !Ul@I) = S. Example 2. If 2I is an algebra with a single topological generator a, i.e. ‘8 = [a], the uniform closure of the algebra of all polynomials in a, then !IR(%) = spec a. We note that the problem of giving an explicit description of the maximal ideal space of a concrete algebra can be extremely difficult. An example of this is the famous problem which was solved by Carleson (1962); this is concerned with the algebra of functions which are analytic and bounded on the unit disk A = {A: 1 E @, IL1 < l}. The result is that A is dense in 9X. The canonical functional representation (Gel and transformation) aH Z (ii(x) = x(u)) is a morphism118 of 9I + C( I(2I)) which is clearly contracting. For it to be an isometry (i.e. for the identity p(u) = [lull to hold), it is necessary and sufficient that the identity llu2 II = IjaIl should hold in %. The Gel and image of the algebra Iin C(9R) is a subalgebra (generally not closed) which separates the points of the space 9.X It can fail to be dense in C(!U2). In the given context and also for many applications it is important to have an intrinsic characterisation of dense subalgebras of C(S) (S compact). The effective solution of this problem includes the Weierstrass approximation theorem (the subalgebra of polynomials is dense in C[O, 11). It was obtained by Stone (1937). Stone-Weierstrass Theorem. Suppose that a subalgebra 9I c C(S) is symmetric (i.e. along with each function cp it contains its complex conjugate @) and separates the points of S. Then !JI is dense in C(S). For the proof (and also independently of this goal) it is useful (following de Branges, 1959) to deal first of all with the real case. sIn the category of complex commutative Banach algebras over C (i.e. a continuous algebra homomorphism) YJI(‘%)is a contravariant functor into the category of compact spaces.
234
Chapter 2. Foundations and Methods
Real Variant of the Stone-Weierstrass Theorem. Let C,(S) be the Banach algebra of real continuous functions on a compact set S and let %,-, be a subalgebra. Zf the functions in 912,separate the points of S, then ‘%Ois dense in C,(S). Proof. Suppose that the subalgebra I&is not dense. Let us consider the set N # 0 of real measure on S which annihilate ll0and satisfy the condition 11v 11< 1. This is. a convex, centrally symmetric, compact subset of the space of measures on S under the w*-topology. By the Krejn-Mil an theorem N has an extreme point cr. Clearly, /loll = 1 and by construction Js cp do = 0 (cp E ll,,),in particular Jsda = 0. Hence it follows that the support of the measure (r contains at least two distinct points sl, s2. But there is a function $ in the subalgebra !I& which separates these points: I,&~) # $(sz). Without loss of generality we can assume that 0 < $(s) < 1 (s E S). Since cp$ E !lQ, for all cp E !l&, we have Is cp$ do = 0, i.e. the measure d? = + da annihilates 9$, and so the measure dp = (1 - +) da also has this property. Putting ? = r/llrll, 3 = p/IIpII we have Q = pz* + qfi (p, q > 0, p + q = 1). Consequently f = aa (a = constant), i.e. $lsuppa = constant. 0
For the proof of the complex Stone-Weierstrass theorem it is now sufficient to consider the set ‘%c= (0: 0 = re cp (cpE ll)}. If cpE % we also have (p E % according to the condition and so re cpE Consequently, !RO c and by the same token !RO is a subalgebra of the real algebra C,(S) and separates the points of the compact set S. According to the real Stone-Weierstrass theorem, ‘%eis dense in C,(S). But then % is dense in C(S). 0 A complex commutative Banach algebra canonical image is symmetric. Corollary.
If the algebra
is symmetric,
is said to be symmetric
if its
then its canonical image is dense in
wJw)). Remark. The Banach space C,(S) is also a vector lattice with respect to the usual ordering and moreover the two structures are compatible: if [lx/l < 1 and IyI < 1x1 (I.1 is the lattice modulus), then llyll < 1. We can say that C,(S) is a Banach lattice. The following variant of the Stone-Weierstrass theorem holds: any sublattice of the vector lattice C,(S) which contains 1 and separates the points of the compact set S is dense in C,(S). Closed subalgebras of the algebras C(S) (S compact) are called uniform algebras. An example of such an algebra on the unit circle T is the algebra A(T) offunctions which are the boundary values of analytic functions on the unit disk A. It is called the disk algebra and it is maximal among the non-trivial closed subalgebras of C(T) (Wermer, 1953). The maximal ideal space of the algebra A(T) is naturally identified with the closed disk A = A u T. In general, if a uniform algebra 2I on S separates points, then S is homeomorphically embedded in the maximal ideal space %Jl(%). For any algebra I and any element a E 2I we can construct, just as in the theory of operators, a functional calculus by putting
5 4. Theory of Operators
235
cpw =-&srp(l)(a r
le)-’ dA,
where cp is an analytic function on the spectrum, i.e. on some neighbourhood G r> spec a, and the contour r contains spec a in its interior. Moreover cp@Nw = cp@W)) (M E WW), f rom which we obtain immediately the spectral mapping theorem:
spec q(a) = cp(spec a). Example. Let W be the Wiener algebra, let f E Wand let cpbe a function which is analytic on the set of values of the function J Then the composition cp of belongs to W (theorem of Levy; the case q(c) = [-’ is a theorem of Wiener). An algebra ‘2I is said to be regular if for any compact set Q c 9X(%) and any maximal ideal MO # Q there exists an element a E % such that 51, = 0 and Z(M,) = 1. It now follows from this that for any two disjoint compact subsets Q1, Qz of 9JI there exists an element x E 2I such that ZIQ1 = 0 and )?Ipl = 1. If 2I is symmetric, x can be chosen so that 0 < Z(M) < 1 for all M. The theory of regular algebras was constructed by G.E. Shilov (1941). One of the fundamental results of this theory is the Local Theorem. Suppose that the algebra VI is regular and that the function cp belongs to 9l locally on ‘93,i.e. for each point M, E !JJIthere exists a neighbourhood U and an element a, E 2l such that ii,(M) = q(M) (M E U). Then cp E ‘&
The point is that, because of the regularity of the algebra, for any finite open covering of the compact set !JJI we can find a partition of unity as a sum of functions in a, each of which is concentrated on the corresponding element of the covering. Example. If on a neighbourhood of each point a periodic function cp can be expanded as an absolutely convergent trigonometric series (depending on the choice of point), then it has an expansion of this type on the whole period (Wiener, 1933). An important consequence of the local theorem is a further generalisation of the Wiener-Levy theorem: suppose that the algebra 9I is regular and that the function rp on !DI is such that for each point M,, E ‘2Xthere exists a neighbourhood U, an element a,, E 2X and a function . which is analytic on spec a0 and such that q(M) = cpo(iio(M)); then cp E ai. In certain cases this theorem allows us to apply multivalent analytic functions’ lg to elements of the algebra. Example. If a function satisfies the conditions
a E L R) is such that its Fourier
1 - A(1) # 0 (n E R),
transform
A(1)
ind(1 - A(I)) = 0,
1I9 An interesting variation on the topic of multivalent functions of an algebra element is the solution of algebraic equations with coefficients in the algebra. Non-trivial homomorphisms of the fundamental group nl(%R) into braid groups are global barriers to solvability (E.A. Gorin - V.Ya Lin, 1969).
236
Chapter
2. Foundations
and Methods
then the function ln( 1 - A(1)) is the Fourier transform of some function in L’(R). The role of this result in the theory of equations with a difference kernel is well-known (see Chapter 1, Section 4.5). By way of explanation we state that for the Banach algebra L’(R) (with identity adjoined) the Gel and transform and the Fourier transform coincide (1 = 1). The maximal ideal space of this algebra (or of the Wiener algebra which is isometrically, isomorphic to it) is naturally identified with the one-point compactihcation Iw = IR u (co}. In any algebra Cuan element a is said to be logarithmic if it can be represented in the form a = eX, where x E . All such elements are clearly invertible but in general not all invertible elements are logarithmic. Example 1. In the algebra C[O, l] all functions that are never zero are logarithmic, i.e. the functions which are invertible in the algebra. Example 2. In the algebra C(T) the logarithmic non-vanishing functions cp for which
functions are precisely those
ind cp = & A,(arg cp) = 0. We note that any invertible
II/ E C(T) can be expressed in the form
where )2 = ind $ and cp is a logarithmic function. The set exp VI of logarithmic elements of any algebra !!I is a subgroup of the group inv I of invertible elements. It turns out that the corresponding factor group depends only on the homotopy class of the maximal ideal space of the algebra. Arens-Royden Theorem. The factor group inv (U/exp I is isomorphic to the group H W, Z) of one-dimensional tech cohomologies of the maximal ideal space of the algebra 2I.
In the proof of the Arens-Royden theorem (as also in many other questions in the theory of Banach algebras) we make use of analytic functions of n algebra elements: a,, . . . , a,. In the corresponding functional calculus12’ the role of the spectrum is played by the combined spectrum spec a of the system a = (ai, . . . , a,), which is defined as the complement of the combined resolvent set. The latter consists of regular points (A,, . . . , A,) E C for each of which there exists a system b = (b,, . . . . b.) of algebra elements such that
k$l (ak- A&
= e.
It is easy to see that spec a coincides with the image of the mapping lzoIt
was developed
in the works
of G.E. Shilov
(1953,
1960) and of Arens
and Calderbn
ii: W + C” (1955).
4 4. Theory
of Operators
231
which is defined by the formula a”(M) = (5,(M), . . . , c,(M)). Since this mapping is continuous, spec a is compact. If it is polynomially convex, then, by the well-known Oka-Weil theorem, each function which is analytic on spec a admits arbitrarily close uniform approximation by polynomials. This allows us to extend by continuity the intrinsic construction of the element p(a) = rp(a,, a,, . . . , a,) from the algebra of polynomials in n variables to the algebra of functions which are analytic on the spectrum. A deeper analysis is required in the absence of polynomial convexity for the combined spectrum. Returning to regular algebras, we note their significance for spectral theory. Example.
Let a = (a,),, z be a real sequence such that a, >, 1, a,+, G anam,
In a, C 7 < *. n.zn +1
Then the Banach algebra 1; (with convolution) of scalar sequences that are summable for the weight a is regular (G.E. Shilov, 1947). Its Gel and image, the Wiener algebra W, with weight a, consists of those functions on the circle with Fourier coefficients in 1, The corresponding maximal ideal space is identified with the unit circle T. If V is a non-quasianalytic linear operator on a Banach space (spec V c ), we can put a, = 11V” 11.The functional calculus q(V)=
z
II=-al
a,V”
q(l) =
f
-al
a&
111= 1
> arises on the algebra W,. The spectrum of the operator turns out to be separated thanks to the fact that, for any compact set Q c T and any neighbourhood U 3 Q, there exists a function rp E W, such that lo = 1, cp(T\ U) = 0,O < cp < 1 (clearly q is not only non-analytic but it also does not belong to any quasianalytic class). We thus have convincing evidence that the methods of the theory of analytic functions are of paramount importance for the theory of Banach algebras (as indeed for spectral theory in general). Conversely, the theory of Banach algebras is a useful instrument for studying various classes of analytic functions, particularly in the setting of approximation theory. The concept of a boundary of the algebra plays an important role here. This term is applied to any subset X of the maximal ideal space W such that for each function u(M) (a E 2I) its maximum modulus is attained at some point A4, E X. The trivial boundary is X = W and, for example, in the case of the algebra C(S) (S compact) it is the only one. The boundaries for the algebra A(T) are precisely those subsets of the disk A which contain the circle T. Since in any algebra an extension of a boundary preserves the boundary property, it is natural to look for a smallest boundary. G.E. Shilov (1940) proved that any algebra has a smallest closed boundary (subsequently called the Shilou boundary). Without the requirement of closedness a smallest boundary may not exist, however such a boundary does exist if the maximal ideal space is metrizable and in this case it consists of the peak points, i.e. those MO E 9R (
238
Chapter 2. Foundations and Methods
for each of which there exists a, E ‘8 such that ii, = 1 and I&(M)] < 1 (M # M,). The results were obtained by Bishop (1959), who also gave the following application of them. Theorem. A sufficient condition for the set R,(S) of rational functions with poles outside of the compact set S c C to be dense in C(S) is that (with respect to plane Lebesgue measure) almost every point of S be a peak point for some function in R(S) = ,,(S).
A.A. Gonchar (1963) showed that the following condition is suflicient for a point s0 of a compact metric space to be a peak point of a uniform algebra 9I c C(S): there exist constants c < 1, N > 1 such that for any neighbourhood U of s0 there is a function cpE % for which cp(s,) = 1, Iv(s)] < c (s E U) and llqll < N. It follows from this that the set of peak points of the algebra ?I is of type G. Let % be a complex Banach algebra (in general non-commutative). A mapping a H a* of the algebra into itself is called an involution if 1) (a + b)* = a* + b*, 2) (au)* = Za*, 3) (ab)* = b*a*, 4) a** = a, 5) Ila*II = Ilull. The element a* is called the adjoint of a. If a* = a, then a is said to be selfadjoint. If a* commutes with a, we say that a is normal. In particular, all unitary elements (a*a = aa* = e) are normal. Example. The identity e is a selfadjoint unitary element. If the element a is invertible, then a* is also invertible and (a*)-’ = (a- *. Hence it follows that reg a* = i%gZ and spec a* = spec. 1 Any x E rU can be uniquely represented in the form x = a + bi, where a, b are selfadjoint :a = re x = (x + x*)/2, b = im x = (x - x*)/(2i). The selfadjoint elements of 2I form a closed real subspace re 2I. For any a E ‘$I the element a*a is selfadjoint. The closure of the non-negative linear hull of the set {x: x = a*a (a E ‘$I)} is a wedge %Xx,whose elements, as usual, are said to be non-negative (x > 0). Using ‘$I+ we define in the canonical way the collection 9Iz*, of non-negative linear functionals. Any f E !!I*, satisfies the inequality If(a)1 < f(e) /la/l and is consequently continuous ( 11f II = f(e)). Moreover, it is real on re ‘%or equivalently: f (a*) = f(a) (a E 2l). Such functionals are said to be symmetric. Any functional g E (re ‘$I)* can be uniquely extended to a symmetric functional g E %I*; also II 811= II 911. In the category of Banach algebras with involution the continuous homomorphisms h: VIZ,+ a2 which are compatible with the involutions, i.e. (ha)* = ha* (or, equivalently, h preserves selfadjointness of elements), are called morphisms ((*)-homomorphisms). A subalgebra of an algebra with involution is said to be symmetric if the involution does not leave it. The class of symmetric ideals (one- or two-sided) is defined similarly. Under factorisation by a symmetric two-sided (one-sided) ideal the involution carries over naturally to the factor algebra (factor space). Here the bar denotes complex conjugation.
5 4. Theory of Operators
A Banach algebra ‘?I with involution
239
is called a C*-algebra if
lb*4 = 11412 for all a E 2I. We mention
Example 1. The algebra C(S) of continuous
involution
(47)
two examples of C*-algebras. functions on a compact set S (the
is complex conjugation).
Example 2. The algebra f?(E) of bounded linear operators on a Hilbert E (the involution is the operator adjoint). In fact, if A E g(E) we have
IIA*AII = ,vl x
(A*&
4 = ,yl x
space
lbw2 = llAl12.
Any closed symmetric subalgebra of a C*-algebra clearly belongs to the same category. Any closed two-sided ideal of a C*-algebra is symmetric and the corresponding factor algebra is a C*-algebra. Gel and-Najmark Theorem. 1) Let be a commutative C*-algebra and !lJI its maximal ideal space. Then the Gel$znd representation is an isometric (a)isomorphism. 2) Any P-algebra is isometrically (*)-isomorphic to some operator algebra, i.e. a closed symmetric subalgebra of the algebra of operators !i!(E) for some Hilbert space E.
For the proof we note that for any C*-algebra the spectra of unitary elements lie on the unit circle, since if a- ’ = a* it follows from (47) that lluV1 II = Ilull = 1, hence p(a) < 1 and ~(a- < 1; the spectra of selfadjoint elements are rea1 2, since if a* = a we have that eia is unitary. Consequently, if 3is commutative and x is a character, then x is real on re 2I and hence it is symmetric. By the Stone-Weierstrass theorem is dense in C(%R). It remains to show that p(a) = Ilull for all a E W. If a = a* this is obtained by iterations of the identity lla211 = llal12, while in the general case we have llal12 = llu*al/ = &*a) < da*)&) = P2M. cl The non-commutative situation is significantly more complicated not enter into it. Let us consider two applications of the preceding theorem.
and we will
Example 1. Let S be a completely regular topological space. We will consider the commutative C*-algebra U(S) on it. The canonical mapping of S + .Xis a homeomorphism onto its image since CB(S) separates points from closed subsets. We can therefore regard S as a topological subspace of %R.But then S is dense in W since, if cpE C(roZ) and & = 0, we have cp = 0 because of the isometric property of the Gel and representation (which in this context becomes the I** We note in passing that the spectra of non-negative elements are non-negative.
240
Chapter 2. Foundations and Methods
extension by continuity). Consequently ‘9JIcan be regarded as a compactification of S. If coincides with the Stone-Tech compactification. All possible compactifications are in one-to-one correspondence with the closed symmetric subalgebras ‘3 c CB(S) which separate points. However the topology of the compactilication 9Jl(‘%)induces on S a topology which is coarser than its initial topology and may not coincide with it. For example, the Bohr compactification of the space R, which is defined by the algebra AP(R) of almost periodic functions, has this property. Example 2. Let A be a bounded normal operator on a Hilbert space. Then A and A* generate a commutative C*-algebra (this is the smallest closed subalgebra of i?(E) which contains A and A*). It can be shown that the spectrum of the operator A coincides with its spectrum as an element of this algebra ‘$I and can be identified with its maximal ideal space 93. The Gel and representation of --, C(YJI) is an isometric (*)-isomorphism. The inverse mapping can be regarded as a functional calculus. It can be extended (by continuity in the topology of pointwise convergence) to the algebra of bounded Baire functions cp(l) (A E spec A). The orthopojections forming an orthogonal resolution of the identity for the operator A correspond to the characteristic functions of the Baire subsets of the spectrum. Thus in this way we obtain the spectral theorem for a bounded normal operator. Banach algebras with involution play a significant role in the theory of representations and harmonic analysis. If G is a locally compact group and p is a right Haar measure, the convolution operation
(cp *bwd =scpb+knJMY) (9 EG) G
turns L G, ,u) into a Banach algebra (without identity if G is not discrete; we then have to adjoin an identity formally. This group algebra is commutative if and only if the group G is commutative and in this case its characters can be identified with the unitary characters of the group 3 by means of the Fourier transform
4(x) =sddx(d 44s) (x EG*) G
(if G is not discrete, the group algebra has another character - the coefftcient of the identity). The maximal ideal space of the group algebra coincides with G* if G* is compact (0 G is discrete), while in the general case it is the onepoint compactification of G*. The Gel and representation for the group algebra L G, ,u) coincides with the Fourier transform, i.e. for G = [wwe have the classical Fourier transform and for G = T it associates with each function its sequence of Fourier coefficients. We have correspondingly T* = Z and R* = R. The group
lz3These form the dual group G* (pointwise multiplication). compact-open topology, under which G* is locally compact.
The usual topology on G* is the
Q4. Theory
of Operators
241
algebra of a locally compact abelian group is semisimple, which is equivalent to the uniqueness theorem for the Fourier transform. An involution can be introduced on a group algebra by means of the formula cp*(g) = cp(g-‘). Although this is not a C*-algebra, the presence of an involution on it nevertheless turns out to be extremely useful, for example, in such a central question as the generalisation of Plancherel theorem 4 (A. Weil, 1940, M.G. Krejn, 1941). In this way D.A. Rajkov (1941) obtained a new proof of the Pontryagin duality principle, according to which for any locally compact abelian group the canonical mapping of G + G** is a topological isomorphism 5. The principle of duality contains, in particular, the assertion that the unitary characters of a locally compact abelian group separate its points (in this sense there are sufficiently many of them). This result extends to the non-commutative situation if we replace characters by irreducible unitary representations (GellfandRajkov theorem). Moreover, any unitary representation of a locally compact group can be expressed as the direct integral of irreducible unitary representations (M.A. Najmark, 1956). This theorem bears the same relationship to the previous one as Choquet theorem does to the Krejn-Mil an theorem and the technique of extreme points is actually used in the corresponding proofs. For a compact group the expression reduces to a discrete orthogonal sum (with a suitable modification this is also true for representations in a Banach space). For the regular representation of a compact group the decomposition into an orthogonal sum of irreducibles is a basic result from the Peter-Weyl theory. We note that ail irreducible representations of a compact group (even in a locally convex space) are linitedimensional and can therefore be taken to be unitary. Compact groups (even non-abelian) have their own particular principle of duality (Tannaka, 1939; M.G. Krejn, 1949); the object which is dual to the group is the so-called block-algebra. The study of any Banach algebra 2I with involution reduces to the case of a C*-algebra in the sense that there exists a C*-algebra @I, unique up to (*)isomorphism, which is associated with YI by means of a morphism p: + @ in such a way that, for any morphism h: 2I + 2 (B a C*-algebra), we have the commutative diagram
i2
suitably
ormal&l’
and correspondingly also a generalisation
Haar
measure
p* on G* appears
in the inuersion
formula
in the generalised Plancherel theorem. In this context it is appropriate of Bochner theorem on the representation of positive functions:
to note
CPM = x(g)WA where u is a finite measure (a > 0). The original proof (L.S. Pontryagin, 1934) was based on structure theory. The requirement contained in it of a countable base was subsequently removed (van Kampen, 1935).
242
Chapter 2. Foundations and Methods h 9X-i?
where ,Ais some morphism. The C*-algebra !8 is said to be universally enveloping for 9l. This is an extremely useful construction. For example, it works effectively in the proof of the Gel and-Rajkov theorem (where 9I is a group algebra). In the 195Os, Yu. M. Berezanskij and S.G. Krejn extended the scope of harmonic analysis, defining the generalised convolution on L S, m) for any locally compact space S with measure m:
where for fixed t C(s, r; t) is a Bore1 measure with respect to s and r separately and for fixed subsets in place of s and r it is a continuous function of t with compact support; it is required in addition that
s s
C(A, B; t) dm(t) < m(A)m(B).
We then have IIv+M < lldlll~l1 an d so a commutative structure
Banach algebra with
meusure126 C(A, B; t) arises on L S, m). It is assumed further that an
involutory homomorphism s H s* acts on S, preserving the measure m and acting on the structure measure so that an involution is defined on the Banach algebra under consideration by the formula q*(t) = @(t*). From these standpoints, for example, ordinary (group) and generalised (hypergroup) shifts can be given a unified treatment.
fj 5. Function
Spaces
5.1. Introductory Examples. The development of functional analysis is connected not only with the working out of its bstract’ intrinsic structures but also with the expansion of the stock of concrete function spaces. This has both an applied and a conceptual meaning. The revolutionary introduction into mathematical usage of the language and techniques of generalised functions can serve as a striking example (S.L. Sobolev, 1936; L. Schwartz, 1950-51). The general theory of interpolation of linear operators, which came into being right at the
lz6This is the continuous analogue of the structure constants of finite-dimensional algebra. It is assumed that relationships hold which guarantee associativity and commutativity. An identity may be lacking but it can be adjointed formally in the usual way.
0 5. Function Spaces
243
beginning of the 1960s (Gagliardo, Lions, Calderon, S.G. Krejn), developed on the one hand from concrete material relating to spaces of type Lp, while on the other hand it opened up a broad field of activity in new function spaces (for example, in symmetric spaces of measurable functions). Interpolation methods are intrinsically linked with embedding theorems of S.L. Sobolev, which are concerned with spaces of differentiable functions of several variables. In connection with multivariate approximation theory important advances were made in the theory of embeddings during the 1950s thanks to work of S.M. Nikol kij and O.V. Besov, in which new classes of differentiable functions were introduced, leading, in particular, to valuable converses of embedding theorems. At the present time we can say that function spaces form an independent topic, in the investigation of which abstract and concrete analytical methods are intertwined. The successful solution of many problems in the theory of partial differential equations depends on the right choice of function space (or family of spaces) appropriate for the given problem. Example 1. The Cauchy problem
au& t) -=
at
a2u(x, t)
ax2
(t
for the heat equation
2 0, -co < x < co),
4% 0) = 44x).
is initially set in the class of all sufficiently smooth functions but in this situation the solution is not unique. In fact in the non-quasianalytic class C(n ) (0 < E < 1) on the interval [0, l] there exists a function t+b(t)which together with all its derivatives vanishes at the end points. Let us put $(t) = 0 for t > 1. Then cm i) ‘(t) 4% t) = x0 (2n)IxZ”
(t 2 0, --co < x < co)
is a solution of the heat equation and U[~=~ = 0. The reason for this phenomenon lies in the possibility of rapid growth of u(x, t) as [xl+ co. Taklind (1937) showed that if h(x) is a non-decreasing odd function then the bound lu(x, t)l < A4exh@) defines a uniqueness class if and only if * dx -= s 1 h(x)
al.
(48)
This result is closely connected with a criterion for quasianalyticity. In particular, condition (48) is satisfied by the function h(x) = x, which defines the well-known uniqueness class lu(x, t)I < MeX2 identified by A.N. Tikhonov (1935). Example 2. In the Dirichlet problem for Laplace
Au = 0,
equation
4r = cp
in a domain G c R” with (sufficiently well-behaved) boundary I a key role is played by the following quadratic functional (known as the Dirichlet integral):
244
Chapter
2. Foundations
D[u] =
sG
(dx is Lebesgue measure). The Hilbert
and Methods
jVuj2 dx
space of functions W:(G)
lbll = JtDbl
+
with norm
lbll&(G))
(introduced by S.L. Sobolev) is intrinsically connected with this problem. This norm requires the existence of partial derivatives if only almost everywhere, however this is not enough. An adequate description of the space is obtained by the introduction of generalised derivatives (see below). Since Au = 0 is an Euler-Lagrange equation for the functional D[u], it is appropriate to change the formulation of the Dirichlet problem by requiring that D[u] = min, ulr = cp, u E W:(G). The question of second order smoothness is thereby separated from the questions of existence and uniqueness (after which it can be solved by methods specially adapted for it). However the variational formulation also contains a subtle aspect which concerns the boundary condition: an admissible function cp must be the limit of functions in W:(G) in a certain sense (requiring more precise definition) and it is desirable to have an intrinsic characterisation of the class of these functions. The answer to this question requires the introduction of the new function space W:12(IJ of functions which are smooth of order 3 in a certain integral sense. It turns out that the restriction homomorphism u H ulr acting from W,‘(G) into Wi’2(r) is continuous and surjective’ (Aronszajn, 1955; V.M. Babich - L.N. Slobodetskij, 1956). 5.2. General&d Functions. The simplest generalised function (although not a function in the usual sense) is the b-function which is in fact a continuous linear functional on the asic’ space of continuous functions C(W) (for the time being we restrict ourselves for simplicity to the one-dimensional case):
WI
= f(O),
and formally m G(x)f(x)
dx = f(0).
s --a, By formally integrating by parts we can extract from this equality an heuristic definition of the derivative 6’: m sl(x)f(x)
dx = -
m G(x)f’(x)
dx = -f’(O).
s -00 s -03 Thus we can define a linear functional s f] = -f’(O), however C’(W) will be the natural basic space for this (and 6’turns out to be continuous). For the simultaneous definition of all derivatives 8(“)[f] = (- l) ( (O)(n > 1) we have to take ’ The boundary sense.
function
cp = I& is defined
almost
everywhere
and is the limit
of u in a certain
integral
8 5. Function Spaces
245
Cm(R) as the basic space. On the other hand, however, the b-function is an elementary measure, namely the unit measure concentrated at zero.2 If we wish to regard all measures as generalised functions, then, because a measure can grow arbitrarily at infinity, we have to take for the basic space C,(R), the space of continuous functions with compact support. C~(R), the space of infinitely differentiable functions with compact support, is the universal receptacle of all the formulated requirements. It is clear from what has been said that a generalised function has to be defined not as an individual object but as an element of some class. Let G be an open set in R” and E some linear topological space of functions f: G + @which we will call the basic functions. The continuous linear functionals on E, i.e. the elements of the conjugate space E* (with its w*-topology), are called generalised functions over E. The first and most important example is E = C:(G) = 9(G) (to say that f: G + c has compact support means that there is a compact set Q/ c G outside of which f(x) = 0; the smallest such compact set, supp f, is the support of the function f). We introduce the locally convex (non-metrizable) DieudonndSchwartz topology on 9(G) by means of the family of seminorms ak,+-+k,
p,(f)
=
k ,,...,
2
k,=O
suP xsG
Pkl,...,k
n a.+.
f . . ax-
’
where p = (P~,,,.,,~,) is an arbitrary n-multisequence of continuous functions on G which is locally finite in the sense that for each compact set Q c G the set of multi-indices3 k = (k,, . . . , k,) for which pkla # 0 is finite. With 3(G) as the choice of basic space the generalised functions are called distributions. Thus 1 is a distribution if it is a linear functional on .9(G) which is subordinate to a finite family of seminorms pPcl), . . . , pPcl,. An effective criterion for continuity of a functional 1 E 9(G)’ is that for each compact set Q c G there exists an integer m = m(Q) (we shall assume it to be as small as possible) and C = C(Q) > 0 such that (49)
for all f E 9(G) with supp f c Q. If sup0 m(Q) = I < cc, we say that I is a distribution of finite order r. A distribution of order zero is a complex measure. We can also regard a locally summable function 0 as a distribution by identifying it with the measure J 8 dx. Such distributions are said to be regular and form a t is convenient to interpret the &function physically as a unit charge concentrated at zero. Then 6’ is a dipole concentrated at zero with unit momentum. However in quantum mechanics the concept of the S-function is essentially as a functional. he use of multi-indices in work with functions of n variables is very convenient since it leads at,+-+k. f aff It is also convenient to put Ikl = to abbreviations of the form: x& = x:1 . . . x2; ax)r =
ax:l... a+'
k, + ... + k,.
246
Chapter 2. Foundations and Methods
dense subspace L:,,(G) c g(G)*. In general it is far from being the case that we can speak of the values of a distribution 1 at points x E G (although the conventional notation A(x) is widely used because of its formal convenience4). If an open set H c G is given and A[f] = 0 for all f E g(H), we set5 &, = 0. The complement in G of the union of all such H is called the support of the distribution I and is denoted by supp A. The distributions with compact support form a space which is the conjugate of Cm(G). The topology on C”(G) is defined by the collection of seminorms appearing in (49), i.e. it is the topology of uniform convergence along with all derivatives on each compact set. Clearly, Cm(G) is a locally convex Frechet space. Remark 1. Without going into the general definitions we note that 5@(G)is the inductive limit of the family C*(X), where X runs through all closed balls
contained in G. The topology on g(G) is stronger than that induced by Cm(G). Remark 2. If 1, ,u E $3(G)*, H c G is an open set and (A - p)IH = 0, then we say that the distributions A, p coincide on H. If 8 E L,QH) and 21, = 8, then the values A(x) are defined at all Lebesgue points of the function 8. Selecting the cone of (pointwise) non-negative functions in g(G) we obtain the cone of non-negative distributions in g(G)*. It turns out that it coincides with the cone of measures. Multiplication by a function cpE Cm(G) is a continuous linear operator6 on g(G). Its conjugate operator on g(G)* is also called an operator of multiplication by ~1 WI U-1 = ACdl. F or example, (~6 = (p(O)& This operation does not allow extension to multiplication in g(G), which would convert g(G)* into a commutative algebra.’ Thus g(G)* is only a module over the algebra P(G). The differentiations Dj = a/ax, (1 < i < n) are also continuous linear operators on g(G) and consequently the conjugate operators D,+ act on $2(G)*, however, as already explained by the example of the b-function, it follows that we should put an/ax, = - DFA for any distribution A..
4For example, in this form it is convenient to make the change of variables x = Fy (F: G + G is a diffeomorphism): qx)f(F- ) WY)f(Y) dY = s0 sG Consequently, it is necessary to take as definition
dx det F F-lx)’
sThe natural embedding of g(H) into g(G) (extension by zero) is a continuous homomorphism whose conjugate is the homomorphism which restricts a functional to s(H). 6Continuous operators of multiplication on function spaces by a function (possibly not belonging to the given space) are called multiplicators. his circumstance can be regarded as a formal cause of the appearance of divergence in quantum field theory and the removal of divergence (i.e. normalisation of the theory) as the regularisation of multiplication of generahsed functions (N.N. Bogolyubov - O.S. Parasyuk, 1955).
247
0 5. Function Spaces
All linear differential .9(G)*:
operators with coefficients in Cm(G) are now defined on
They are all continuous under the w*-topology, which is taken as the standard topology on Q(G)*. As an example to illustrate the flexibility and convenience of the language of distributions, let us mention the general definition of one of the principal concepts of the theory of partial differential equations. For any linear differential operator L with coefficients in P(G) a distribution r,, which depends on the parameter y E G and satisfies the equations is called a fundamental solution. For example, the Laplace operator in R” has as a fundamental solution when n > 2 the function9 1 r,(x) = ux9 Y) = -@ _ 2)a”,x _ y,n-2v where O, is the area of the unit sphere of R” with the canonical Euclidean norm
IxI=J,$x;. A fundamental solution r, exists for any operator L with constant coefficients (Ehrenpreis, 1954; Malgrange, 1955) and clearly it has the formlo T,(x) = l-(x - y) (x, y E UP). An immediate application of this fundamental solution is the construction of solutions to the equation” L [A] = p by means of the formula Iz = r * p. Here we are using convolution of distributions. For its introduction it is necessary to consider first the convolution of the distribution p with an arbitrary function f E S+!“), i.e. the function
(cl *f)(x) = Am
- .)I-
This function belongs to Cm(W) but if the distribution
p has compact support
* 6, is the unit measure concentrated at the point y. In R” we can write 6,(x) = 6(x - y). This notation carries over to any domain G. 9If n = 2 we have l-(x, y) = k lnlxl. If n = 3 we can interpret l-(x, y) physically as the potential of a point charge. lo In a more complete formulation of the theorem it is asserted that I is a distribution of finite order. We note that in theorems of this type not only the formulations are important but also the constructions which allow us to establish further useful properties of the solutions. The generalised Fourier transform is usually applied in the construction of fundamental solutions of equations with constant coeffkients (see below). ii In the case where L is the Laplace operator this is Poisson equation describing the potential of a charge with ensity’ p.
248
Chapter 2. Foundations and Methods
then p *f E ~(I?‘) and, by definition, we can put (I * ,u) [f] = I * (p * f). Thus the equation L[A] = p is successfully solved (in the class of all distributions) for any distribution p with compact support. In particular, p can be a summable function with compact support but the solution 1 nevertheless turns out to be only a distribution in general (generalised solution). However, if the operator L is elliptic12 (and only in this case), all generalised solutions will be real analytic functions whenever the right hand side of the equation is a real analytic function (I.G. Petrovskij, 1937; Hormander, 1955). An analogous result relating to infinite differentiability holds for the so-called hypoelliptic operators13 (and only in this case) (Hormander, 1955). Generalised solutions are entirely natural from the physical point of view since they satisfy the equation in an integral sense: the equality L[A] = p means that 1[Laf] = ~[f] for all f E g(G), i.e. in the usual notation (L,* = L). W(Loff)(x) dx = PL(Mx) dx sG sG This form of the equation can have even greater physical meaning than the traditional one. The onjugate operator principle’ has already been used repeatedly to transfer the standard operations of analysis to generalised functions: if a continuous linear operator A is given on the basic space, then A* is its transfer to the space of generalised functions. In a more general situation we can be given a continuous homomorphism h: E + E, of the basic space into another function space. Since h * p E E* (,u E E:), it follows that h * p is a generalised function (on E). We can regard this as the transfer of the homomorphism h to the generalised functions over the basic space El. The most important example of this type is the generalised Fourier transform.
Suppose that the basic space E consists of functions on R” and E c L’(W). Then the ordinary Fourier transform14 f(c) =
f(x)e-i(c*x’ s o-4”
dx
is defined on E. It maps E bijectively to a certain function space ,? (contained in the space of continuous functions which tend to zero at infinity). Let us provide E with the topology carried over from E. As a result the Fourier transform can be transferred to the generalised functions on the basic space E. For example, the Fourier transform of any distribution on R” is a generalised function over the basic space of entire (after analytic continuation with respect to x) functions of lZAn operator L is said to be elliptic if its principal
part
at is defined by a homogeneous ax’
‘&,,, a,-
polynomial P(r,, . . . , rN) (symbol) with no real roots < # 0. i3 All elliptic operators are hypoelliptic. Infinite differentiability of solutions of elliptic equations with infinitely differentiable right hand sides was established by L. Schwartz (1951). r4(& x) is the canonical scalar product on R”.
0 5. Function
Spaces
249
exponential type that decrease more rapidly than any power of 1x1.In particular we can now speak of the Fourier transform of any locally summable function 0(x) without imposing any restrictions as 1x1+ co. It is clear that the generalised Fourier transform has the usual perational properties: differentiation Dj passes over to multiplication by irj, therefore any linear differential operator with constant coefficients passes over to the operator of multiplication by the corresponding polynomial; convolution becomes a product and so on; hence differential, convolution and some other equations are as usual transformed into algebraic equations. For example, the equation L[n] = p takes the form P(t)n = ,& and the problem reduces to determining the possibility of dividing a generalised function by a polynomial (division problem). The general approach to the Fourier transform was developed gradually (Bochner, 1932; L. Schwartz, 1951) and took its final form, which has been described above, in the work of I.M. Gel and and G.E. Shilov (1953), where they obtained with the help of the Fourier transform uniqueness classes for the Cauchy problem for systems of differential equations of the form
2 = ktl Ljk(t)CUkl
t1 Gj G m)9
(50)
the Ljk(t) being linear differential operators in R” with coefficients which depend continuously on t (but are constant with respect to x). If p is the least order of the operators Ljk, then the uniqueness class is defined by the bound max luj(x, t)I < Mealx14, j where15 q = p/(p - 1) and 0, M are positive constants. In this connection the consideration of generalised functions over suitable spaces of entire functions turned out to be essential, since it allowed the possibility of establishing uniqueness by using the Phragmen-Lindelof principle. The application of this sort of analytic technique requires an intrinsic characterisation of the space16 E* for those spaces E being used as basic spaces. We stress the lack of sensitivity of the method described to the type of system (for parabolic (according to Petrovskij) equations the uniqueness class was established by O.A. Ladyzhenskaya (1950) and for parabolic systems by SD. Ehjdel an (1953); in this situation G.H. Zolotarev (1958) obtained a criterion of Taklind type). The general result is well illustrated by the following example: for the equation
a~ a% z=a= ISIn fact we can take q = ~ PO
PII-1
where
PO c P is the so-called
(51)
reduced order. There is an effective
formula for its determination, from which we can see that p,, is a rational number tor c m; in particular, p,, = p if m = 1 (V.M. Borok, 1957). leThis is a certain (not very great) cost on account of the restrictions at infinity.
with
denomina-
250
Chapter 2. Foundations and Methods
with complex a the uniqueness class in the Cauchy problem bound Iu(x, t)l < MeuX2.
is defined by the (52)
For a > 0 this is the heat equation, for a < 0 it represents everse’heat flow and for a = i we have the simplest case of a non-stationary Schriidinger equation. We note that for a < 0 the Cauchy problem for equation (51) is incorrect in the sense that we do not have continuous dependence of the solution (in the uniform metric on an arbitrarily small time interval) on the initial function.” The technique of generalised Fourier transform allows us to identify correctness classes in the Cauchy problem for the system (50) (G.E. Shilov, 1955). Certainly they are, in general, no longer the uniqueness classes, although sometimes the two coincide. For example, the Cauchy problem for the heat equation in the class (52) is correct. Now let the basic space E be an extension of the space 9 = g(R with a weakened topology and also let 9 be dense in E for the weak topology. Then all continuous linear functionals on E remain such when restricted to 9 and are uniquely defined by their restrictions. Thus generalised functions on E can be regarded in the given case as distributions. The most remarkable example of this type is the Schwartz space S of functions of class Co3(R which decrease together with all their derivatives more rapidly than any power 1x1-’ (v = 0, 1,2, . . .). This is a locally convex Frechet space defined by the family of seminorms
All linear differential operators with polynomial coefficients act continuously on S. The Fourier transform is a topological automorphism of the space S. Generalised functions on S are called tempered distributions. All linear differential operators with polynomial coefficients act continuously on this class (i.e. on S*). The Fourier transform on S* is a topological automorphism. Any linear differential equation with constant coefficients has a tempered fundamental solution (Hbrmander, 1958; Lojasiewicz, 1959). This is a non-trivial refinement of the Ehrenpreis-Malgrange theorem which was formulated above; it is connected with lower bounds for polynomials in several variables, which allow the division problem to be solved successfully in the given situation. If the basic space E is a contraction of 9 with a stronger topology and also E is dense in ~3, then the generalised functions over E will not now be distributions in general (but all distributions will be generalised functions over E). Natural examples of this type of basic space are the non-quasianalytic classes of infinitely differentiable functions (generalised functions over such spaces are given the name ultradistributions). However quasianalytic and especially analytic classes are of no lesser interest in the scheme being discussed. We call continuous linear he physical significance of this fact is that the process of heat conduction is not reversible in time.
$5. Function
Spaces
251
functionals on such spaces analytic function& As a specific case let us consider the space A(@“) of entire functions of n complex variables under the topology of uniform convergence on compact sets, In order that a linear functional u on this space be continuous, it is necessary and sufficient that there exist a compact set Q c @” and a constant C such that
IdYll G C yEa; IA41
(f E 3).
We can say that the functional a is concentrated ml8 Q but this term is also used in the case where for any neighbourhood I/ 3 K there exists a constant C, such that
bCfll G G Example.
SUP
If(
Let (at) be any multi-indexed
(f E 4@ ).
(53)
scalar sequence satisfying
the
condition lim Ikl-
;l/lael= 0.
Then”
is an analytic functional concentrated at 0 and (54) gives the general form of such functionals. Let us denote by ,4*(Q) the set of analytic functionals which are concentrated on the compact set Q c @ Restricting ourselves now to real compact sets2’ (i.e. such that Q c 08”c Q=”where the last embedding is the canonical one), let us consider the linear space NV
= ,lJ A*({ x: x E R llxll < r}).
Martineau (1961) proved 21 that any a E A lR has a support, i.e. a smallest compact set supp c1on which c1is concentrated. This opened up a relatively simple path to the theory of so-called hyperfunctions on [w”which was created by Sato (1960). Hyperfunctions with compact supports are just the same as functionals from A R . If G is a bounded open set in Iw the space of hyperfunctions on G is by definition the factor space A*(G)/A*(aG). The passage to hyperfunctions on the whole of R” is effected by means of a covering by bounded neighbourhoods. Thus, roughly speaking, a hyperfunction is a ocally analytic’ functional. On the whole,
Thus all continuous linear functionals on A(a) have compact support. I9 Here we use the abbreviation k! = k,! . . . k,! 2o But for such compact sets it is still necessary in (53) to consider complex neighbourhoods. 21 In the proof Martineau uses cohomology theory. An lementary’ proof was given by HGrmander (1983).
252
Chapter 2. Foundations and Methods
this construction is such that hyperfunctions are not generalised functions in the sense used earlier. However in many respects we can work with them just as with generalised (and so also as with classical) functions; in particular we can consider differential equations in hyperfunctions. It is interesting that hyperfunctions were intended first of all to play the role of limiting values of analytic functions in situations where these do not exist in any usual sense. Because of this the setting of hyperfunctions allows us to obtain, for example, adequate generalisations of Bogolyubou theorem n the edge of the wedge’ (Martineau, 1968; Morimoto, 1979; Zharinov, 1980). We note that the original version of this theorem was obtained by N.N. Bogolyubov (1956) in connection with the construction of dispersal relations in quantum field theory. The theorem asserts the local analytic continuability of one into the other for a pair of functions f+, f- which are analytic on the domains G + iK (G is a domain and K is a cone in R under the condition that their limiting values on G coincide. The extended function turns out to be analytic on some complex neighbourhood V 2 G (we even have V 1 I, where I is the C-convex hull of the domain G (VS. Vladimirov, 1960)). 5.3. Families of Function Spaces. Although the list of families presented below contains many of the well-known function spaces it is certainly by no means complete and illustrates rather the variety of possible constructions. We note that the nature of a function space is determined to a considerable extent by the structure which is given on the domain of definition X of those functions from which the space is composed. For example, if X is a space with a measure, it is natural to consider measurable functions on it; on a topological space X we consider continuous functions (if we also look at discontinuous functions, they should at least be Borel” functions); a metric on X allows us to introduce the modulus of continuity and to define, for example, Lipschitz classes; on a smooth (analytic) manifold we can consider smooth (analytic) functions and so on. I. Let M(u) (U > 0) be a convex function such that M(u) > 0 (U # 0) and
lim u- (u)
= 0,
U-+0
Then we say that this is an N-function.
lim u- (u) “-+a3
= co.
Its Legendre transform
M*(u) = sup (uu - M(u))
(u 2 0)
II>0
is also an N-functionz3 and it is said to be dual to M (it is easy to see that iif** = M). Let X be a space with a o-finite measure p. We consider the set LM(X) of measurable functions u(x)@ E X) which satisfy the condition M*(u) =
sX
M*(l44l)
dp < 00.
.e. such that the inverse images of Bore1 sets are of the same type. 23The non-negativity of M*(u) is the classical Young inequality.
5 5. Function Spaces
253
We denote by L;$(X) the set of functions u(x) such that uu E L’(X, p) for all u E J+(X). This linear space has a norm Ilull =
SUP M*(u)<1
IS X
uu d,u I
(it can be shown that the supremum is finite) under which it turns out to be a Banach space; it is called the Orlicz space with norming function M. If M(u) = %( p > l), then M*(u) = 4 p + 4 = 1 and the Orlicz T l > space L;I, coincides with Lp (to within a multiple of the norm). Now let us assume that M(u) is of slow growth in the sense that Example.
M&4 fp M(u)
The general form of a continuous
< O
linear functional
on L$ is then
where v E L,,. By the same token L,,. turns out to be topologically isomorphic to the conjugate space of L$. II. We associate with each measurable function u(x)(x E X) its indicatrix24
m,(t) = I+: le4l > t>
0 2 0).
This is a non-increasing function which is continuous on the right. Let $(r) (r > 0) be a concave, non-decreasing function such that e(O) = 0, Il/(co) = co. The condition
II4 = oa+h(t)) s
dt < 00
defines the Banach space A,, which is called a Lorentz space. The conjugate of space M,, which consists of the measurable functions u(x) (x E X) for which A, is the Marcinkiewicz
1
ll4l = sup ~ II/(&a
I4 dp < ~0
sA (the general form of a continuous linear functional on ,4 ti is standard). In the space of measurable functions which are described above it is possible to construct a rich theory of non-linear integral equations, by choosing the space in accordance with the nature of the non-linearity (power, exponential, etc.). This 24 In the probabilistic situation this is a variant of the distribution function of a random variable. Just as we speak of identically distributed random variables in probability theory, here we can speak of equimeasurable functions (i.e. functions whose indicatrices coincide - this is also an important concept).
254
Chapter 2. Foundations and Methods
programme was realised in the 1950s and 1960s by M.A. Krasnosel kij in collaboration with several co-authors. III. Let X be a metric space with metric d and let U(X) (x E X) be a uniformly continuous function. The quantity
is called its modulus of continuity. Clearly, w,(6) is a non-decreasing, non-negative function with o(0,) = 0. Each non-decreasing positive function Q(6) (6 > 0, 52(0+) = 0) defines a space of Lipschitz type 25 8, consisting of the continuous functions u(x) (x E X) such that Ilull = lu(xJ
w@) <
+ sup -_ a>0 Q(6)
00
(x0 E X is a fixed point). This family of Banach spaces serves first and foremost the needs of approximation theory. IV. Among the spaces of smooth functions let us mention first of all the Soboleu spaces which are defined in their general form by the following procedure. Let G be an open set in R let 1 < p < cc and let m be an integer 2 1. The space WPm(G) consists of those functions whose generalised partial derivatives up to order m inclusive are functions of clasP U(G). Correspondingly we introduce the norm
The discrete series of spaces W, G) (m = 1,2,3, . . .) is included in the continuous family W, G) (s > 0) of Slobodetskij spaces. Ifs = m + r (0 < I < l), it is required of a function f E W, G) that f E W, G) and
Ilfll, = Ilfllm + J
m
(Ss G
G
> l/P
Ix’
_
Xyn+rP
The first differences of the partial derivatives g
dx’ dx”
< 00.
appearing here can be replaced
by differences of higher orders ” (this is advisable). The Nikol kij-Besoo spaces, which we now mention, arise in this way in combination with certain other useful modifications. The norms by means of which these spaces are introduced can be replaced by equivalent norms defined in terms of best polynomial approximations in the LP-metrics. This fact is the multivariate analogue of the JacksonBernshtein theorem. In the case G = R” the Fourier transform is an effective mechanism for constructing spaces of smooth functions; in terms of it smoothness is regulated by 25For Q(6) = 6” (0 < cz< 1) we have the class Lip a. 26i.e. regular distributions. 27Here it is necessary to assume that G is a convex domain.
9 5. Function Spaces
255
the rate of decrease at infinity. For a more refined account of smoothness it is appropriate to form a decomposition of the Fourier transform with the help of a partition of unity into a sum of functions of compact support ((P&Y in the class S which are concentrated on a sequence of spherical layers r&i < 1x1 < rk (P-0= 0, r, + co):
Finally, we can require that the sequence of norms (e.g. in Lp) of the inverse Fourier transforms of the terms in (55) decrease at a predetermined rate in accordance with the growth of the radii r,. An alternative to the last step is to first calculate a certain norm (taking the radii into account) for the sequence (qpkf)? and then to require that this function behaves itself sufficiently well at infinity (e.g. belongs to ~5~).These approaches, which were outlined in the mid1960s (S.M. Nikol kij, P.I. Lizorkin, Peetre), were systematically developed during the 1970s by Peetre, Triebel and Stijckert and led to the construction of very extensive families of spaces of smooth (of power order) functions which include the Nikol kij-Besov spaces, the Lizorkin spaces associated with derivatives of fractional order and so on. A corresponding theory for the general modulus of continuity has been constructed during the last ten years (G.A. Kalyabin, M.G. Gol man). Function spaces on manifolds are constructed naturally from those on charts (the same can be said about spaces of differential forms and, in general, about sections of vector bundles on manifolds). As has already been said above, in the theory of boundary value problems it is important to know to what space the boundary values of functions from the given space belong. Moreover it is important to have a collection of embedding theorems for individual spaces into others and also to know when the embedding mappings are compact. At the present time extensive information has been accumulated on this account. 5.4. Operators on Function Spaces. We have already touched upon this topic repeatedly when speaking of integral and differential operators. A formal notation of the type f’(x) = -Jm
6’6 - YMY) dy -a, allows us to regard linear differential operators as integral equations with generalised, highly singular kernels. The kernels of ordinary integral operators are regular so that these operators (or their powers) are compact on spaces of type’* LJ’. An intermediate position between the two cases mentioned is occupied by singular integral operators whose kernels are functions with discontinuities of
281n particular, kernels with bound K(x, y) = 0(1x - yl-“)( m c n) on a bounded domain G c R” are of this type.
Chapter 2. Foundations and Methods
256
such a type that, while compactness is lost,2g we still have boundedness on spaces of type Lp. Example.
The Hilbert operator
<x<00) Wd(x) =& s-1$f$ (-co
(56)
is defined and bounded on Lp( R) (1 < p < co) (theorem of M. Riesz). The integral (56) is obtained as a principal value. If cpE L’(R), it exists for almost all x (Zygmund, 1959), however the Hilbert operator acts neither on L’ nor on L”: if (~(5) = 1 (151 < l), rp(O = 0 (ItI > 11, then WV)(~) = ln z . It is useful for I I the investigation of the operator H to pass to the Fourier transforms 4, eq. Then it transforms into the operator of multiplication by $sgn 1. However the Fourier transform itself acts on Lp only for p = 2, so that for the time being only boundedness of the Hilbert operator on L2 is clear. From certain considerations of a special character we can obtain in addition its boundedness for p = 4, 6, . . . , after which the passage to arbitrary p 2 2 is effected with the help of the following. M. Riesz-Thorin Interpolation Theorem. Let 1 < pi < ri < co (i = 0, 1) with pi finite, put pa = [(l - a)p;’ + ap; -’ (0 < a G 1) and let r, be defined similarly. Zf an operator A is defined on the set of piecewise-constant functions (p(t) of compact support (--co < 5 < co) and maps them into functions in Lro n L in such a way that IIAqlI,, < Mi IIrp IIpi (i = 0, l), then (0 < a < l),
IIA~II,~ G WIMlp,
where h4. = M~- ;
(11. IISis the norm on L .
This theorem together with its immediate generalisations (Marcinkiewicz, 1939) became the starting point of the general theory of interpolation of linear operators which was referred to above. We will describe briefly the subject of this theory which at the present time is already far advanced and finds diverse applications. Let E,, E, be a pair of Banach spaces both of which are continuously embedded in the same LTS (a Banach pair). We can then construct the Banach spaces E, n E,, E, + E, by putting
lbll -% I= max(IlxllE,, II%J llxll E,+E,
=
inf x=x0+x,
(II%
IlEo
+
llxl
liEI).
A Banach space E which is contained in E, + E, and contains E, n E, is said to be intermediate for the Banach pair (E,, E,) if the embeddings E, n E, + f fact it is already appropriate in these cases to regard the kernel as a generalised function.
4 5. Function Spaces
251
E --* E, + E 1 are continuous. In particular, E, and E 1 are themselves of this type. We also note that, if E, c E, and the embedding is continuous, then E, n E, x E,andE,+E,xE,. An intermediate space for a Banach pair (E,, E,) is said to be an interpolation space if it is invariant for all linear operators A on E, + E, for which both E, and E, are invariant and the operators AlEo, AlE1 are bounded for the corresponding norms3’ (the linear space of such operators is denoted by Z(E,, E,)). The Interpolation Inequality. Zf E is an interpolation (E,, E,), then Al, is bounded and
MMI G ME max(ll&,II,
space fir
a Banach pair
IL%, II),
where ME is a constant independent of A (interpolation
(57)
constant)3’.
Proof. The operator Al, is closed, since it is closed (even bounded) under the weaker norm induced from E, + E,. It is therefore bounded. Further the space Z(E,, E,) is a Banach space with respect to both the norm which appears on the right hand side in (57) and the weaker norm [IA IIEo+s,. These norms are therefore equivalent. Finally, the restriction homomorphism of Z(E,, E,) + 2(E) is closed (which is easily verified with the norm IIAIIE,+E, on Z(E,, E,)). It is therefore bounded, i.e. (57) holds. 0 Remark. In the more general situation to which, in particular, the RieszThorin theorem relates we have to consider two Banach pairs (E,, E,), (F,, F,), spaces E, F which are intermediate for these pairs, and an interpolation in the sense that, for all homomorphisms A: E, + E, --, F. + F1 such that AE, c F,, AE, c F1 and AlEo, AlEI are bounded, we have AE c F. There is no problem in carrying over the interpolation inequality to this situation. As an example let us consider the Banach pair (L1 (W), Em(~)). The role of the embedding LTS is taken here by the space @ of measurable functions with the topology of convergence in measure on each finite interval. The space Q. = L’ + L” is topologically isomorphic to the space of functions rp E Q, for which the indicatrix m. f co. A Banach space E which is continuously embedded in Q. is said to be symmetric if 1) whenever q E E, + E Q. and 11&5)1< Iv(<)1 almost everywhere, it follows that $ E E and jl+llB < llqllE, 2) whenever cpE E, $ E Q. and cp, II/ are equimeasurable, it follows that I/ E E and ll$llE = IIqIIs. For example, all Lp (1 < p < co) are of this type, as are Orlicz, Lorentz and Marcinkiewicz spaces. E.M. Semenov (1964) proved that all symmetric spaces are intermediate for (L’, L”) and any interpolation space for a pair of symmetric spaces is topologically isomorphic to a symmetric space (it is symmetric if the interpolation constant is equal to 1). Calderon (1966) obtained a complete description of interpolation spaces for (L’, L”). In any event all separable symnder these conditions A is bounded both on E,, + E, and on E, n E 1 for the noms introduced on these spaces. 31 It can at once be assumed to be the least such constant. If we then have ME = 1, the interpolation space is said to be exact.
258
Chapter 2. Foundations and Methods
metric spaces are interpolation spaces for (L’, L”). The particular symmetric spaces mentioned above are interpolation spaces for (L’, Lm). Nevertheless there does exist a symmetric space which is not an interpolation space for the pair (L’, L”) (G.I. RUSSU, 1969). In situations where it does not seem to be possible to describe all interpolation spaces for some Banach pair, a sensible approach is to find an effective construction for producing a sufficiently extensive family of interpolation spaces. The interpolation methods3’ proposed for this purpose are functorial constructions and it is precisely in this that the guarantee of their success is contained. This point of view, which goes back to work of Aronszajn and Gagliardo (1965), has recently taken on a fundamental character (Yu.A. Brudnyj, N. Kruglyak). One further significant aspect of the theory of interpolation spaces is connected with one-parameter families of Banach spaces E, (0 < CI< 1) having the following properties: 1) for /I > LYthe space E, is continuously and densely embedded in E,; 2) for y > B > ci llXllEB
<
C(a,
p,
y)
IIxII~~-~)‘(y-=)IIxII~-=)‘(y-=)
(x
6 Ey),
where C(a, fi, y) is a positive constant. Such a family is said to be a scale (normal if IIxIIs, < llxllEb (B > a) and C(a, /I, y) = 1). Many scales (Ea),,4.Gl constructed under specific conditions turn out to consist of interpolation spaces for the pair (E,, E,) (Lions, 1958; S.G. Krejn, 1960; S.G. Krejn - Yu.1. Petunin, 1966)33. Because of this occurrence, scales prove to be a suitable tool for investigating boundary value problems for elliptic differential operators (Magenes, 1963). Returning to the consideration of specific operators on function spaces, we note first of all the multivariable analogue of the Hilbert operator:
Here q,, = y/lyl, the function 52(x, 0) is continuous
(for simplicity)
and
0(x, 0) d0 = 0.
s lel=l The integral (58) is obtained as a principal value. The singular integral operator Hc is defined and bounded on Lp( I?) (1 < p < co) if 52 belongs to Lq with respect to 13 - + - = 1 and its LP-norm is bounded with respect to x (the case n = 2, c : > p = 2 is due to S.G. Mikhlin, 1953; the general case is due to Calderon and Zygmund, 1956). 32 eal’ and omplex’ methods differ here. The latter are based on results from complex analysis such as Hadamard three-lines theorem. We note that with these methods it is possible to show, for example, that Nikol kij-Besov spaces are interpolation spaces for a pair of spaces from this same family or for a pair of Sobolev spaces etc. 33 In particular, the scales which can be constructed by the complex interpolation method.
6 5. Function Spaces
259
The Fourier transform with respect to y of the kernel of the operator Ho is called the symbol of this operator:
eY) 4x, 5) =ssz(xy e-i(C.y)
b-4”
dy.
IYI”
If 52 and consequently also c do not depend on x, then
In this case the composition theorem holds: the product of the symbols corresponds to the product of the operators (Calderon-Zygmund, 1952). This result has a natural connection with the convolution theorem but it is not so obvious since the integrals concerned are principal values.34 Since the correspondence between operators and symbols is clearly also linear, we have here a homomorphism of an algebra of operators into an algebra of functions, i.e. a certain variant of operator calculus. This is an effective route to the solution of multivariate singular integral equations (this was in fact projected in the late 1920s by Tricomi). In the general case it is useful to consider along with HB the operator
We naturally expect that the operator Go, whese investigation can turn out to be simpler than that of Ho, will at the same time imitate HO sufficiently well. The operator Gc is pseudodifirential. To explain this term let us consider an operator of the form U%)(x) = m
1 s
R”P(X, bN5)ei(~*x) d5,
where p, an arbitrary given function of power growth rate3’ with respect to <, is the symbol of the operator P. If p(x, 5) is a polynomial in r (with coefficients depending on x), then P is a difirential operator (with the same coefficients). If we eliminate the intermediate Fourier transform, we can represent the operator P as a double integral: u+m)
1 = pq
R” Rnp(x, &p(y)ei(e*x-Y) ss
d( dy.
(59)
34The concept of the symbol was introduced differently at an early stage in the development of the theory of singular integral equations and correspondingly the composition theorem was proved in another context (S.G. Mikhlin, 1936; Giraud, 1936; S.G. Mikhlin, 1955). Then the equivalence of the two definitions of the symbol was established (S.G. Mikhlin, 1956). Uniformly with respect to x on each compact set. The exponent of the power of growth (not necessarily integral) is called the order of the operator. Usually p is a function of class C” with respect to all its variables with suitable bounds on the derivatives.
260
Chapter
2. Foundations
and Methods
The final step leading to the concept of a pseudodifferential operator (PDO) in full generality consists of replacing p(x, <) in formula (59) by p(x, y, 5) (it now follows that we should call an operator of the form (59) a PDO in the restricted or proper sense). This class of operators is sufficient for studying many questions in the theory of elliptic differential equations (including the solution of the index problem (Atiyah-Singer, 1968), in which PDOs played an important role). We can regard the construction of a PDO as a quuntization which associates with each scalar function (symbol) an operator on a suitable function space. Thus, for example, problems such as quasiclassical asymptotics (passage to the limit from quantum mechanics to the classical situation by letting Plan&s constant /i tend to zero) are naturally studied in the setting of PDOs and their further generalisations (V.P. Maslov, 1965; V.P. Maslov, M.V. Fedoryuk, 1976; V.L. Rojtburd, 1976). The generalisations mentioned are Fourier integral operators (FZOs), whose theory was formulated during the mid 1960s and early 1970s thanks to works of V.P. Maslov, Hijrmander and other authors. PDOs were already in use at the beginning of this period, however many problems (including quasiclassical asymptotics and the Cauchy problem for hyperbolic equations) required the introduction of FIOs. The operators are constructed on Lagrange manifolds by gluing together local FIOs which have a similar form to PDOs but with exponential term exp(iS(x, y, t)), where in general S is some non-linear function. In conclusion we note that in the development of functional analysis functions of infinitely many variables played and continue to play a significant role. Such, for example, are the functionals which arise in variational calculus, states with an unrestricted number of particles in quantum field theory, and so on. Rigorous analysis has been successfully constructed on spaces of functions of infinitely many variables. A systematic account of it is contained in the monographs by Yu.M. Berezanskij (1978) and by Yu.L. Daletskij and S.V. Fomin (1983). It is most important here to have a satisfactory theory of measure and integration (in the field of applications we point out the Wiener measure in the theory of random processes, the Feynman integral in quantum mechanics and electrodynamics, secondary quantization). A fundamental difficulty is caused by the lack of a translation invariant measure on an infinite-dimensional LTS since it is not locally compact (the last condition is close to being necessary in the theorem of Haar). However for many purposes the so-called quasi-invariance is sufficient. In particular, aussian measures’, which include the Wiener measure, have this property. It is clear from what has been said that on an infinite-dimensional space we have to construct not measures n general’ but special classes of ood measures. A certain axiomatic scheme for measure theory in an inlinite-dimensional space was proposed by A.M. Vershik (1968) and he and V.N. Sudakov (1969) conducted an extensive investigation of probability measures. Generally speaking the connections between functional analysis and probability theory are deep and varied. A meaningful account of this aspect lies beyond the scope of the present volume.
Bibliography
261
Commentary on the Bibliography The principal events in the foundation of functional analysis have been elucidated in the articles of Bourbaki and in the book by Dieudonne (1983). This discipline attained a definite maturity in 1932, which is testified to by the simultaneous appearance in this year of three fundamental works: heorie des operations lintaires” by Banach, athematische Grundlagen der Quantenmechanik” by von Neumann and inear transformations in Hilbert space and their applications to analysis” by Stone. However many important topics (linear topological spaces, ordered linear spaces, Banach algebras etc.) were developed later on and surveys and monographs devoted to them appeared correspondingly later. At the present time there are many hundreds of titles of individual books dealing just with functional analysis and its applications. We hope that the selection presented below is representative of the subject (but it is by no means representative in terms of priority). As far as articles are concerned the list includes only certain surveys from spekhi Matematicheskikh Nauk” (in order to supplement the monographs) and works from previous volumes in the present series which are connected with our subject. Several books are also mentioned from the pravochnaya Matematicheskaya Biblioteka”, including the publication unctional Analysis” (1972) which was produced by 17 authors under the general editorship of S.G. Krejn. References in the text to articles (author, year) are not associated with the Bibliography. In the majority of cases the original source will be found without difficulty from the given information.
Bibliography* I. Basic
Texts
Kantorovich, L.V., Akilov, G.P. [1977]: Functional Analysis. Nauka: Moscow. English transl.: Pergamon Press: Oxford, 1982,Zbl.484.46003 Kolmogorov, A.N., Fomin, S.V. [1972]: Elements of the Theory of Functions and Functional Analysis, 3rd ed. Nauka: Moscow. French transl.: Mir.: Moscow, 1974, German transl.: VEB Deutscher Verlag der Wissenschaften: Berlin, 1975, Zbl.57,336; Zbl.90,87; Zbl.167,421; Zbl.299. 46001 Lyustemik, L.A., Sobolev, V.I. [1965]: Elements of Functional Analysis. Nauka: Moscow, Zbl.141,116; Zbl.44,325. English transl.: J. Wiley & Sons: New York, 1974 Riesz, F., Szokefalvi-Nagy, B. [1972]: Lecons d nalyse Fonctionelle. Akad. Kiado: Budapest, Zbl.64,354; Zbl.46,331 Rudin, W. [1973]: Functional Analysis. McGraw-Hill: New York, Zbl.253.46001
II. Additional
Akhiezer, transl.: Akhiezer, Transl.
Educational
Literature
N.I. [1965]: Lectures on Approximation Theory. Nauka: Moscow, Zbl.133,16. German Akademie-Verlag: Berlin, 1967 N.I. [1984]: Lectures on Integral Transforms. Vishcha Shkola: Khar ov. English transl.: Math. Monogr. 70: Providence, 1988,2b1.546.44001
*For the convenience of the reader, references to reviews in Zentralblatt fur Mathematik (Zbl.), compiled using the MATH database, have, as far as possible, been included in this bibliography.
262
Bibliography
Akhiezer, NJ., Glazman, I.M. [1977, 19783: Theory of Linear Operators in Hilbert Space I, II. Vishcha Shkola: Khar ov. English transl.: Pitman Adv. Publ: Boston, 1981, Zbl.467.47002; Zbl.143,365 Antonevich, A.B., Radyno, Ya.V. [1984]: Functional Analysis and Integral Equations. Minsk Univ. Press: Minsk (Russian), Zbl.539.46001 Atiyah, M.F., Macdonald, LG. [1969]: Introduction to Commutative Algebra. Addison-Wesley: Reading, Mass., Zbl.17536 Berezin, F.A., Shubin, M.A. [1972]: Lectures on Quantum Mechanics. Moscow Univ. Press: Moscow (Russian) Birman, M.Sh., Solomyak, M.Z. [1980]: Spectral Theory of Selfadjoint Operators in Hilbert Space. Leningrad Univ. Press: Leningrad (Russian) Brudnyj, Yu.A., Gorin, E.A. [1981]: Isometric Representations and Differential Inequalities. Yaroslavl’ Univ. Press: Yaroslavl’ (Russian), Zbl.559.34011 Courant, R., Hilbert, D. [1931]: Methoden der Mathematischen Physik, I. Springer-Verlag: Berlin, Zbl.15 Davis, M. [1977]: Applied Nonstandard Analysis. J. Wiley & Sons: New York, Zbl.359.02060 Edwards, R.E. [ 19651: Functional Analysis. Theory and Applications. Holt: New York, Zbl.182,161 Edwards, R.E. [1979, 1982-j. Fourier Series (a Modern Introduction) I, II. Springer-Verlag: Berlin, 2b1.424.42001,Zb1.599.42001 Frohlicher, A., Bucher, W. [1966]: Calculus in Vector Spaces without Norm. Lect. Notes Math. 30, Springer-Verlag: Berlin, Zbl.156,383 Gaier, D. [1980]. Vorlesungen iiber Approximationen im Komplexen. Birkhiiuser: Basel, Zbl.442. 30038 Gel and, I.M., Fomin, S.V. [1961]: Calculus of Variations. Phys.-Math. Section of Nauka: Moscow. English transl.: Prentice Hall: Englewood Cliffs, N.J., 1965 Zbl.127,54 Glazman, I.M., Lyubich, Yu.1. [1969]: Finite-Dimensional Linear Analysis. Nauka: Moscow, Zbl.188,74. English transl.: MIT Press: London, 1974 Halmos, P.R. [1950]: Measure Theory. Van Nostrand: New York, Zbl.40,168 Halmos, P.R. [1956]: Lectures on Ergodic Theory. Math. Sot. Japan: Tokyo, Zbl.73,93 Halmos, P.R. [1974]: A Hilbert Space Problem Book. Springer-Verlag: New York, Zbl.144,387 Helson, H. [1964]: Lectures on Invariant Subspaces. Academic Press: New York, Zbl.119,113 Kadets, V.M., Kadets, M.I. [ 19883: Rearrangements of Series in Banach Spaces. Tartu Univ. Press: Tartu (Russian), Zbl.685.46005. English transl.: Transl. Math. Monogr. 86: Providence, 1991 Kirillov, A.A. [1978]: Elements of the Theory of Representations. (2nd ed.) Nauka: Moscow. English transl.: Springer-Verlag: Berlin, 1976,Zbl.264.22001 Kirillov, A.A., Gvishiani, A.D. [1979]: Theorems and Problems from Functional Analysis. Nauka: Moscow. English transl.: Springer-Verlag Berlin, 1982,Zbl.481.46002 Koosis, P. [1980]: Introduction to HP Spaces. Cambridge Univ. Press: Cambridge, Zbl.435.30001 Krejn, S.G. [1971]: Linear Equations in Banach Space. Nauka: Moscow. English transl.: Birkhluser: Base], 1982,Zbl.233.47001 Kutateladze, S.S. [1983]: Foundations of Functional Analysis. Nauka: Novosibirsk (Russian), Zbl.581.46001 Loomis, L.H. [1953]: An Introduction to Abstract Harmonic Analysis. Van Nostrand: New York, Zbl.52117 Markus, A.S., Sementsul, A.A. [1985]: Series in Banach Spaces. Kishinev Univ. Press: Kishinev (Russian) Matsaev, V.I., Palant, A.A. [1975]: Spectral Theory of Operator Bundles. Donetsk Univ. Press: Donetsk (Russian) Maurin, K. [1959]: Methods of Hilbert Spaces. Warszawa. English transl.: Polish Scientific Publishers: Warsaw [1967], Zbl.86,309,Zbl.166,102 Natanson, I.P. [1949]: Constructive Theory of Functions. State Science Press: Moscow-Leningrad (Russian), Zbl.41,186 Nikol kij, N.K. [1980]: Lectures on the Shift Operator. Nauka: Moscow, Zbl.508.47001. English transl.: Springer-Verlag: Berlin, 1986,Zbl.587.47036
Bibliography
263
Phelps, R.R. [1966]: Lectures on Choquet Theorem. Van Nostrand: Princeton, Zbl.l35,362 Plesner, A.I. [1965]: Spectral Theory of Linear Operators. Nauka: Moscow. English transl.: F. Ungar Publ.: New York, 1969, Zbl.147,343 Reed, M., Simon, B. [1972-19791: Methods of Modem Mathematical Physics I-IV. Academic Press: New York, Zbl.242.46001; 2b1.308.47002,2b1.405.47007,2b1.401.47001 Rickart, C. [1960]: General Theory of Banach Algebras. Van Nostrand: Princeton, Zbl.95.97. Robertson, A.P., Robertson, W.J. [1964]: Topological Vector Spaces. Cambridge Univ. Press: Cambridge, Zbl.123,302, Zbl.251.46002 Rockafellar, R.T. [1970]: Convex Analysis. Princeton Univ. Press: Princeton, Zbl.l93,184 Sadovnichij, V.A. [1986]: Theory of Linear Operators. Moscow Univ. Press: Moscow (Russian) Saks, S. [ 19301: Theory of the Integral. Monografie Matematyczne: Warsaw-Lwbw. English transl.: Stechert & Co: New York, 1937,Zbl.17,300; Zbl.7,105 Schaefer, H.H. [1966]: Topological Vector Spaces. Macmillan: New York, Zbl.l41,305 Shilov, G.E. [1965]: Mathematical Analysis. Second Special Course. Nauka: Moscow. English transl.: Generalized Functions and Partial Differential Equations, Gordon and Breach: New York, 1968, Zbl.l77,363; Zbl.591.46030 Shilov, G.E., Fan, D.T. [1967]: Integral, Measure and Derivative in Linear Spaces. Phys.-Math. Section of Nauka: Moscow (Russian) Shilov, G.E., Gurevich, B.L. [1967]: Integral, Measure and Derivative. General Theory. Nauka: Moscow. English transl.: Prentice Hall: Englewood Cliffs, N.J., 1966, Zbl.l38,275; Zbl.l31,52 Smimov, V.I. [1959]: Course of Higher Mathematics, V. Phys.-Math. Section of Nauka: Moscow, Zbl.l14,30. English transl.: Pergamon Press: Oxford, 1964 Trenogin, V.A. [1980]: Functional Analysis. Nauka: Moscow (Russian), Zbl.571.46001 Vladimirov, V.S. [1979]: Generalised Functions in Mathematical Physics. Nauka: Moscow. English transl.: MIR: Moscow, Zbl.515.46033,Zbl.403.46036 Vulikh, B.Z. [1961]: Introduction to the Theory of Partially Ordered Spaces. Phys.-Math. Section of Nauka: Moscow, Zbl.l01,85. English transl.: Wolters-Noordhoff Sci. Publ.: Groningen, 1967 Wermer, Y. [1976]: Banach Algebras and Several Complex Variables. Springer-Verlag: New York, Zbl.281.46052 Yosida, K. [1965]: Functional Analysis. Springer-Verlag: Berlin, Zbl.126,115,Zbl.286.46002 III. Monographs Akhiezer, N.I. [1961]: The Classical Moment Problem. Phys.-Math. Section of Nauka: Moscow, Zbl.l24,62. English transl.:Edinburgh, London, 1965 Akhiezer, N.I., Krejn, M.G. [1938]: On Certain Questions in the Theory of Moments. Ukrainian State Science Press: Khar ov (Russian) Azizov, T.Ya., Iokhvidov, I.S. [1986]: Basic Theory of Linear Operators on Spaces with Indefinite Metric. Nauka: Moscow (Russian), Zbl.607.47031 Banach, S. [1932]: Theorie des Operations Lineaires. Polish Scientific Publishers: Warsaw, Zbl.5,209 Banach, S. [1987]: Theory of Linear Operations (with a survey by Pelczyhski, A. and Bessaga, C., Some Aspects of the Present Theory of Banach Spaces).North-Holland: Amsterdam, Zb1.613.46001; Zbl.5,209 Berezanskij, Yu.M. [1965]: Eigenfunction Expansion of Selfadjoint Operators. Naukova Dumka: Kiev, Zbl.142,372. English transl.: Providence, 1968 Berezanskij, Yu.M. [1978]: Selfadjoint Operators on Spaces of Functions with an Infinite Number of Variables. Naukova Dumka: Kiev, Zbl.449.47001. English transl.: Transl. Math. Monogr. 63: Providence, 1986 Besov, O.V., Bin, V.P., Nikol kij, S.M. [1975]: Integral Representations of Functions and Embedding Theorems. Nauka: Moscow, Zbl.352.46023. English transl.: J. Wiley: New York, Vol I, 1978, Vol II, 1979 Bessaga, S., Pelczynski, A. [1975]. Selected Topics in Infinite-Dimensional Topology. Polish Scientific Publishers: Warsaw, Zbl.304.57001
264
Bibliography
Bonsall, F.F., Duncan, J. [1971]: Numerical Ranges of Operators on Normed Spaces and of Elements of Normed Algebras. London Math. Sot. Lecture Note Series, No. 2,Zbl.207.448 Bourbaki, N. [1952, 1957, 19593: Integration. Hermann: Paris. Zbl.49,317,Zbl.80,268,Zbl.l15,49 Bourbaki, N. [1953,1955]: Espaces Vectoriels Topologiques. Hermann: Paris, Zbl.50,107,Zbl.67,83 Bourbaki, N. [1967]: Theories Spectrales. Hermann: Paris, Zbl.152,326 Branges, L.De [1968]: Hilbert Spaces of Entire Functions. Prentice Hall Inc.: Englewood Cliffs, New Jersey, Zbl.157,433 Brodskij, M.S. [1969]: Triangular and Jordan Representations of Linear Operators. Nauka: Moscow, Zbl.182,458. English transl.: Providence, 1971 Butzer, P.L., Berens, H. [1967]: Semigroups of Operators and Approximation. Springer-Verlag Berlin, Zbl.164,437 Collatz, L. [1964]: Funktionalanalysis und Numerische Mathematik. Grundlehren Math. Wiss. 120, Springer-Verlag: Berlin, Zbl.139,98 Colojoara, I., Foias, C. [1968]: The Theory of Generalised Spectral Operators. Gordon and Breach: New York, Zbl.189442 Cooke, R.G. [1950]: Infinite Matrices and Sequence Spaces. Macmillan: London, Zbl.40,25 Cornfel , I.P., Sinai, Ya.G., Fomin, S.V. [1980]: Ergodic Theory. Nauka: Moscow, Zbl.508.28008. English transl.: Springer-Verlag: Berlin, 1982, Zbl.493.28007 Daletskij, Yu.L., Fomin, S.V. [1983]: Measures and Differential Equations in Infinite Dimensional Spaces. Nauka: Moscow (Russian), Zbl.536.46031 Daletskij, Yu.L., Krejn, M.G. [1970]: Stability of Solutions of Differential Equations in a Banach Space. Nauka: Moscow. English transl.: Transl. Math. Monogr. Vol. 43, Providence, 1974,Zbl.233. 34001 Diestel, J. [1975]: Geometry of Banach Spaces. Springer-Verlag: Berlin, Zbl.307.46009 Diestel, J. [1984]: Sequences and Series in a Banach Space. Springer-Verlag: Berlin, Zbl.542.46007 Diestel, J., Uhl, J.J. [1977]: Vector Measures. Math. Surv. 15 Providence, RI., Am. Math. Sot., Zbl.369.46039 Dieudonne, J. [1981]: History of Functional Analysis. North-Holland: Amsterdam, Zbl.478.46001 Dixmier, J. [1969]: Les C*-algebres et leurs Representations. Gauthier-Villars: Paris, Zbl.174,186 Dragilev, M.M. [1983]. Bases in Kijthe Spaces. Rostov Univ. Press: Rostov (Russian), 2b1.556.46005 Dulst, D. van. [1978]: Reflexive and Superreflexive Banach Spaces. Math. Centre Tracts. Mathematisch Centrum: Amsterdam, Zbl.412.46006 Dunford, N., Schwartz, J.T. [1958]: Linear Operators I. General Theory. Intersci. Publ.: New York, Zbl.84,104 Dunford, N., Schwartz, J.T. [1963]: Linear Operators II. Spectral Theory. Self Adjoint Operators in Hilbert Space. Intersci. Pub].: New York, Zbl.128,348 Dunford, N., Schwartz, J.T. [1971]: Linear Operators III. Spectral Operators. I. Wiley and Sons: New York, Zbl.243.47001 Dzhrbashyan, M.M. [1966]: Integral Transforms and Representation of Functions on a Complex Domain. Phys.-Math. Section of Nauka: Moscow (Russian), Zbl.154,377 Ehfros, A.M., Danilevskij, A.M. [1937]: Operational Calculus and Contour Integrals. Ukrainian State Science Press: Khar ov (Russian) Faddeev, L.D. [1963]: Mathematical Questions of the Quantum Theory of Scattering for Systems of Three Particles. Tr. Mat. Inst. Steklova 69, 1-122. English transl.: Works Steklov Inst. Math., Vol. 69, 1965,Zbl.125,215 Floret, K. [1980]: Weakly Compact Sets. Springer-Verlag: Berlin, Zbl.437.46006 Friedrichs, K.O. [1965]: Perturbation of Spectra in Hilbert Space. Am. Math. Sot.: Providence, RI, Zbl.142,110 Gakhov, F.D. [1958]: Boundary Value Problems. Phys.-Math. Section of Nauka: Moscow. English transl.: Pergamon Press: Oxford, 1966,Zbl.141,80 Gamelin, T.W. [1969]: Uniform Algebras. Prentice Hall: New York, Zbl.213404 1 Gantmakher, F.P., Krejn, M.G. [1950]: Oscillation Matrices and Kernels and Small Oscillations of Mechanical Systems. State Science Press: Moscow-Leningrad (Russian), Zbl.41,355
Bibliography
265
Girding, L. [1954]: Applications of the Theory of Direct Integrals of Hilbert Spaces to some Integral and Differential Operators. Lect. Ser. Inst. Fluid Dynam. and Appl. Math., No. 11. Univ. Maryland, Zbl.55,342 Garnett, J.B. [1981]: Bounded Analytic Functions. Academic Press: New York, Zbl.469.30024 Ciel and, I.M., Graev, M.I., Pyatetski-Shapiro, 1.1.[1966]: General&d Functions Vol. 6. Theory of Representations and Automorphic Functions. Nauka: Moscow; Zbl.138,72. English transl.: Philadelphia, 1969 Gel and, I.M., Graev, M.I., Vilenkin, N.Ya. [1962]: Generalised Functions, Vol. 5. Integral Geometry and Questions Connected with it in the Theory of Representations. Phys.-Math. Section of Nauka: Moscow, Zbl.l15,167. English transl.: Academic Press: New York-London, 1966 Gel and, I.M., Rajkov, D.A., Shilov, G.E. [1960]: Commutative Normed Rings. Phys.-Math. Section of Nauka: Moscow, Zbi.134,321. French transl.: Gauthier: Paris, 1964 Gel and, I.M., Shilov, G.E. [1958a]: General&d Functions, Vol. 2. Spaces of ordinary and generalised functions. Phys.-Math. Section of Nauka: Moscow, Zb1.91,111, English transl.: Academic Press: New York-London, 1968 Gel and, I.M., Shilov, G.E. [1958b]: General&d Functions, Vol. 3. Some Questions in the Theory of Differential Equations. Phys.-Math. Section of Nauka: Moscow, Zb1.91,111. English transl.: Academic Press: New York-London, 1967 Gel and, I.M., Shilov, G.E. [1959]: Generalised Functions, Vol. 1. Generalised Functions and Operations on them. Phys.-Math. Section of Nauka: Moscow, Zb1.91,111. English transl.: Academic Press: New York-London, 1964. Gel and, I.M., Vilenkin, N.Ya. [1961]: Generalised Functions, Vol. 4. Some Applications of Harmonic Analysis. Rigged Hilbert Spaces. Phys.-Math. Section of Nauka: Moscow, Zb1.103,92. English transl.: Academic Press: New York-London, 1964 Glazman, I.M. [1963]: Direct Methods of Qualitative Spectral Analysis of Singular Differential Operators. Phys.-Math. Section of Nauka: Moscow, Zbl.143,365. English transl.: Oldboume Press: London, 1965 Gokhberg, I.Ts., Fel man, LA. [1971]. Convolution Equations and Projection Methods for their Solution. Nauka: Moscow (Russian), Zbl.214.385 Gokhberg, LTs., Krejn, M.G. [1965]: Introduction to the Theory of Non-selfadjoint Linear Operators. Nauka: Moscow, Zbl.138,78. English transl.: Providence, 1969 Gokhberg, LTs., Krejn, M.G. [1967]: Theory of Volterra Operators on Hilbert Space and their Applications. Nauka: Moscow, Zb1.168,120. English transl.: Providence, 1970 Gokhberg, I.Ts., Krupnik, N.Ya. [1973]: Introduction to the Theory of One-dimensional Singular Integral Operators. Shtiintsa: Kishinev, Zbl.271.47017. German transl.: Birkhauser: Base& 1979 Gould, S.H. [1966]: Variational Methods for Eigenvalue Problems. Univ. Press: Toronto, Zbl.156,124 Greenleaf, F.P. [1969]: Invariant Means on Topological Groups and their Applications. Van Nostrand: New York, Zb1.174,190 Gross, L. [1963]: Harmonic Analysis on Hilbert Space. Mem. Am. Math. Sot. 46: New York, Zbl.l18,322 Grothendieck, A. [1955]: Produits Tensoriels Topologiques et Espaces Nucleaires. Mem. Am. Math. Sot. 16,Zb1.55,97 Halmos, P.R., Sunder, V.S. [1978]: Bounded Integral Operators on L*-spaces. Springer-Verlag: Berlin, Zb1.389.47001 Hardy, G.H. [1949]: Divergent Series. Oxford, Clarendon Press: Oxford, Zbl.32,58 Hewitt, E., Ross, K.A. [1963, 19703: Abstract Harmonic Analysis I, II. Springer-Verlag: Berlin, Zb1.115,106,2b1.213,401 Hille, E., Phillips, R.S. [1957]: Functional Analysis and Semi-groups. Am. Math. Sot.: Providence, R.I,Zb1.78,100 Hirschmann, I.I., Widder, D.V. [1955]: The Convolution Transform. Princeton Univ. Press: Princeton, Zb1.65,93 Hiirmander, L. [1983, 19851: The Analysis of Linear Partial Differential Operators, I-IV. SpringerVerlag: Berlin, Zb1.521.35001,2b1.521.35002,Zb1.601.35001,Zb1.612.35001
266
Bibliography
Iokhvidov, IS. [1974]: Hankel and Toeplitz Matrices and Forms. Nauka: Moscow, Zb1.293.15023. English transl.: Birkhauser: Basel, 1982 Jarosz, K. [1985]: Perturbation of Banach Algebras. Lect. Notes Math. 1120, Springer-Verlag: Berlin, Zbl.557.46029 Johnson, W.B., Maurey, B, Schechtman, G., Tzafriri, L. [1979]: Symmetric Structures in Banach Spaces. Mem. Am. Math. Sot. 217: New York, Zb1.421.46023 Kaczmarz, S., Steinhaus, H. [1935]: Theorie der Orthogonalreihen. Fund. Kult. Nar.: Warsaw, Zb1.13,9 Kadison, R.V. [1951]: A Representation Theory for Commutative Topological Algebra. Am. Math. Sot.: New York, Zbl.42,348 Kahane, J-P. [1970]: Series de Fourier Absolument Covergentes. Springer-Verlag: Berlin, Zb1.195,76 Kantorovich, L.V., Vulikh, B.Z., Pinsker, A.G. [1950]: Functional Analysis in Partially Ordered Spaces. State Science Press: Moscow-Leningrad (Russian), Zbl.37,72 Kantorovitz, S. [1983]: Spectral Theory of Banach Space Operators. Lect. Notes Math. 1012, Springer-Verlag: Berlin, Zb1.527.47001 Karlin, S. [1968]: Total Positivity. Calif. Univ. Press: Stanford, Zbl.219.47030 Karlin, S., Shapley, L.S. [1953]: Geometry of Moment Spaces. Mem. Am. Math. Sot. f2,Zb1.52,185 Karlin, S., Studden, W.J. [1966]: Tchebyscheff Systems with Applications in Analysis and Statistics. Intersci. Publ.: New York, Zbl.l53,389 Kato, T. [1966]: Perturbation Theory for Linear Operators. Grundlehren 132, Springer-Verlag: Berlin, Zbl.148,126 Korovkin, P.P. [1959]: Linear Operators and the Theory of Approximations. Phys.-Math. Section of Nauka: Moscow (Russian), Zb1.94,102 Kiithe, G. [1969]: Topological Vector Spaces I. Springer-Verlag: Berlin, Zb1.179,170,Zb1.137,313 Krasnosel kij, M.A. [1962]: Positive Solutions of Operator Equations. Phys.-Math. Section of Nauka: Moscow, Zb1.121,106. English transl.: P. Noordhoff Ltd.: Groningen, 1964 Krasnosel kij, M.A., Lifshits, E.A., Sobolev, A.V. [1985]: Positive Linear Systems. Nauka: Moscow. English transl.: Heldermann Verl.: Berlin, 1989,Zb1.578.47038 Krasnosel kij, M.A., Rutitskij, Ya.B. [1958]: Convex Functions and Orlicz Spaces. Phys.-Math. Section of Nauka: Moscow, Zb1.84,101. English transl.: P. Nordhoff Ltd.: Groningen, 1961 Krasnosel kij, M.A., Vajnikko, G.M., Zabrejko, P.P., Rutitskij, Ya.B, Stetsenko, V.Ya. [1969]: Approximate Solution of Operator Equations. Nauka: Moscow, Zb1.194,179. English transl.: Wolters-Noordhoff Publ.: Groningen, 1972 Krasnosel kij, M.A., Zabrejko, P.P., Pustyl ik, E.I., Sobolevskij, P. [1966]: Integral Operators on Spaces of Summable Functions, Nauka: Moscow, Zbl.145,397. English transl.: Noordhoff Int. Publ.: Groningen, 1976,Zbl.312.47041 Krejn, M.G., Nudel an, A.A. [1973]: The Markov Moment Problem and Extremal Problems. Nauka: Moscow. English transl.: Transl. Math. Monogr. 50, Providence, 1977,Zb1.265.42006 Krejn, S.G. [1967]: Linear Differential Equations in Banach Space. Nauka: Moscow, Zb1.172,419. English transl.: Transl. Math. Monogr. 29: Providence, 1972 Krejn, S.G., Petunin, Yu.I., Semenov, E.M. [1978]: Interpolation of Linear Operators. Nauka: Moscow, 2131.499.46044. English transl.: Transl. Math. Monogr. 54: Providence, 1982, Zb1.493. 46058
Kuo, H-H. [1975]: Gaussian Measures in Banach Spaces. Lect. Notes Math. 463, Springer-Verlag: Berlin, Zbl.306.28010 Kutateladze, S.S.,Rubinov, A.M. [1976]: Minkowski Duality and its Applications. Nauka: Novosibirsk (Russian) Lacey, H.E. [1974]: The Isometric Theory of Classical Banach Spaces. Springer-Verlag: Berlin, Zbl.285.46024 Ladyzhenskaya, O.A. [1970]: Mathematical Questions of the Dynamics of Viscous Fluid. Nauka: Moscow (Russian) Landkof, N.S. [1966]: Foundation of Modem Potential Theory. Nauka: Moscow, Zb1.148,103. English transl.:Springer-Verlag: Berlin, 1972
Bibliography
261
Levin, B.Ya. [1956]: The Distribution of the Roots of Entire Functions. State Science Press: Moscow. German transl.: Berlin, 1962., Zbl.l11,73 Levitan, B.M. [1950]: Factorisation of Second Order Differential Equations by means of Eigenfunctions. State Science Press: Moscow-Leningrad (Russian) Levitan, B.M. [1953]: Almost-periodic Functions. State Science Press: Moscow (Russian) Levitan, B.M. [1962]: Generalised Shift Operators and Some of their Applications. Phys.-Math. Section of Nauka: Moscow. English transl.: Jerusalem, 1964,Zb1.192,489 Levitan, B.M,, Zhikov, V.V. [1978]: Almost-periodic Functions and Differential Equations. Moscow Univ. Press: Moscow. English transl.: Cambridge Univ. Press., 1982,Zb1.414.43008 Levy, P. [1951]: Les Problemes Concrets d nalyse Fonctionelle. Gauthier-Villars: Paris, Zbl.43,323 Light, W.A., Cheney, E.W. [1985]: Approximation Theory of Tensor Product Spaces. SpringerVerlag: Berlin, Zb1.575.41001 Lindenstrauss, J. [1969]: Geometric Theory of Banach Spaces. State Univ. Press: Florida Lindenstrauss, J., Tzafriri, L. [1977]: Classical Banach Spaces I. Sequence Spaces. Springer-Verlag: Berlin, Zbl.362.46013 Lindenstrauss, J., Tzafriri, L. [1979]: Classical Banach Spaces II. Function Spaces. Springer-Verlag: Berlin, Zbl.403.46022 Livshits, M.S. [1966]: Operators, Oscillations, Waves. Open Systems. Nauka: Moscow (Russian), Zb1.143,367 Luxemburg, W.A.J., Zaanen, A.C. [1971]: Riesz Spaces. North-Holland: Amsterdam, Zb1.231.46014 Lyubich, Yu.1. [1985]: Introduction to the Theory of Banach Representations of Groups. Vishcha Shkola: Khar ov, English transl.: Birkhluser: Basel, 1988,Zbl.623.22001 Maak, W. [1950]: Fastperiodische Funktionen. Springer-Verlag: Berlin, Zbl.37,57 Mackey, G.W. [1963]: The Mathematical Foundations of Quantum Mechanics. Benjamin: New York, Zbl. 114,440 Mackey, G.W. [1965]: Group Representations and Commutative Harmonic Analysis with Applications to Analysis, Number Theory and Physics. Univ. Press: Berkeley Mandelbrojt, S. [1935]: Series de Fourier et Classes Quasi-analytiques des Fonctions. GauthiersVillars: Paris, Zb1.13,llO Marchenko, V.A. [1977]: Sturm-Liouville Operators and their Applications. Naukova Dumka: Kiev, Zbl.399.34022. English transl.: Birkhiiuser: Base], 1986,Zbl.592.34011 Markus, AS. [1986]: Introduction to the Spectral Theory of Polynomial Operator Pencils. Shtiintsa: Kishinev. English transl.: Transl. Math. Monogr. 71: Providence, 1988,Zbl.678.47005 Maslov, V.P. [1965]: The Theory of Perturbations and Asymptotic Methods. Moscow Univ. Press: Moscow. French transl.: Gauthiers-Villars: Paris, 1972, Zbl.247,47010 Maslov, V.P. [1973]: Operator Methods. Nauka: Moscow, Zbl.288.47042. English transl.: MIR: Moscow, 1982 Massera, J.L., Schaffer, J.J. [1966]: Linear Differential Equations and Function Spaces. Academic Press: New York, Zb1.243.34107 Michael, E. [1952]: Locally Multiplicatively-convex Topological Algebras. Am. Math. Sot.: Providence, R.I,Zbl.47,355 Mikhlin, S.G. [1949]: Integral Equations and their Applications. State Science Press: Moscow, Zb1.39,113. English transl. Pergamon Press: London, 1957 Mikhlin, S.R. [1952]: The Minimum Problem for a Quadratic Functional. State Science Press: Moscow-Leningrad, Zb1.49,205. English transl.: Holden-Day: San Francisco, 1965 Mikhlin, S.R. [1962]: Multi-dimensional Singular Integrals and Integral Equations. Phys.-Math. Section of Nauka: Moscow, Zb1.105,303. English transl.: Pergamon Press: Oxford, 1965 Mikusinski, J. [1953]: Rachunek Operatorow (The Calculus of Operators). Polskie Towarzystow Matematyczne: Warsaw, Zb1.53,85. English transl. Pergamon Press: London, 1959,Zbl.88,330 Milman, V.D., Schechtman, G. [1986]: Asymptotic Theory of Finite Dimensional Normed Spaces. Springer-Verlag: Berlin, Zbl.606.46013 Morris, S.A. [1977]: Pontryagin Duality and the Structure of Locally Compact Abelian Groups. London Math. Sot. Lect. Notes Series. 29,Zb1.446.22006
268
Bibliography
Nagy, K.L. [1966]: State Vector Spaces with Indefinite Metric in Quantum Field Theory. Akad. Kiado: Budapest, Zb1.151441 Najmark, M.A. [1968]: Normed Rings. 2nd. ed. Nauka: Moscow. English transl.: Noordhoff: Groningen, 1964,Zb1.89,101 Najmark, M.A. [1969]: Linear Differential Operators. 2nd ed. Nauka: Moscow, Zb1.193,41. English transl.: F. Ungar Publ.: New York, Vol. I, 1967,Zbl.219,34001, Vol. II, 1968,Zb1.227.34020 Nakano, H. [1950]: Modulared Semi-ordered Linear Spaces. Mazuren: Tokyo Neumann, J. von [1932]: Mathematische Grundlagen der Quantemnechanik. Springer-Verlag: Berlin, Zb1.5,91 Nikol kij, N.K. [1974]: Selected Problems of Weighted Approximation and Spectral Analysis. Tr. Mat. Inst. Steklova 120. English transl.: Proc. Steklov Inst. Math. 120, 1976,Zb1.342.41027 Pekzynski, A. [1968]: Linear Extensions, Linear Averagings and their Applications to Linear Topological Classification of Spaces of Continuous Functions. Diss. Math.: Warzawa, Zb1.165146 Pelczynski, A. [1977]: Banach Spaces of Analytic Functions and Absolutely Summing Operators. Am. Math. Sot.: Providence, R.I., Zbl.384.46015 Petunin, Yu.I., Plichko, A.N. [1980]: The Theory of Characteristic Subspaces and its Applications. Vishcha Shkola: Kiev (Russian), Zb1.467.46016 Pietsch, A. [1965]: Nukleare Lokalkonvexe Rlume. Schriftenr. Inst. Math. Dtsch. Akad. Wiss., A, Berlin, Zbl.152,323 Pietsch, A. [1978]: Operator Ideals. VEB Deutscher Verlag der Wiss.: Berlin, Zb1.399.47039 Przeworska-Rolewicz, D, Rolewicz, S. [1968]: Equations in Linear Spaces. Polish Scientific Publishers: Warsaw, Zb1.181,405 Radjavi, H., and Rosenthal, P. [1973]: Invariant Subspaces. Springer-Verlag: Berlin, Zbl.269.47003 Rieffel, M.A. [1967]: Dentable Subsets in Banach Spaces with Application to a Radon-Nikodym Theorem in Functional Analysis. Thompson: Washington; appeared in Proc. Conf., Univ. California, Irvine 1966,71-77, 1967,Zb1.213,137 Rolewicz, S. [1972]: Metric Linear Spaces. Polish Scientific Publishers: Warsaw, Zb1.226.46001 Rudin, W. [1962]: Fourier Analysis on Groups. Interscience: New York, Zb1.107,96 Rudin, W. [1969]: Function Theory in Polydiscs. Benjamin: New York, Zbl.177,341 Rudin, W. [ 19801: Function Theory in the Unit Ball of C Springer-Verlag: Berlin, Zb1.495.32001 Samojlenko, Yu.S. [1984]: Spectral Theory of Collections of Selfadjoint Operators. Naukova Dun&a: Kiev (Russian), Zb1.543.47040 Schaefer, H.H. [1974]: Banach Lattices and Positive Operators. Springer-Verlag: Berlin, Zb1.296. 47023 Schapira, P. [1970]: Thtorie des Hyperfonctions. Lect. Notes Math. 226. Springer-Verlag: Berlin. Zb1.192,473 Schatten, R. [1950]: A Theory of Cross-spaces. Princeton Univ. Press: Princeton, Zbl.41,435 Schatten, R. [1960]: Norm Ideals of Completely Continuous Operators. Springer-Verlag: Berlin, Zb1.9494 Schmets, J. [1983]: Spaces of Vector-valued Continuous Functions. Lect. Notes Math. 1003. SpringerVerlag: Berlin, Zb1.511.46033 Schwartz, L. [1950, 19511: Theorie des Distributions I, II. Hermann: Paris, Zb1.37,73,Zb1.42,114 Schwartz, L. [1961]: Application of Distributions to the Study of Elementary Particles in Relativistic Quantum Mechanics. Univ. Press: California, Zb1.124,230 Segal, I.E. [1963]: Mathematical Problems of Relatavistic Physics. Am. Math. Sot.: Providence, R.I., Zb1.112,453 Semadeni, Z. [1982]: Schauder Bases in Banach Spaces of Continuous Functions. Lect. Notes Math. 918. Springer-Verlag: Berlin, Zb1.478.46014 Shohat, J.A., Tamarkin, J.D. [1943]: The Problem of Moments. Am. Math. Sot.: New York, Zb1.41,433 Shubin, M.A. [1978]: Pseudodifferential Operators and Spectral Theory. Nauka: Moscow, Zbl.451. 47064. English transl.: Springer-Verlag: Berlin, 1987,Zb1.616.47040 Sigal, J.M. [1983]: Scattering Theory for Many-bodies Quantum Mechanical Systems. SpringerVerlag: Berlin, Zb1.522.47006
Bibliography
269
Singer, I. [1970]: Bases in Banach Spaces I. Springer-Verlag: Berlin, Zbl.198,166 Stein, E.M. [1970]: Singular Integrals and Differentiability Properties of Functions. Princeton Univ. Press: Princeton, Zbl.207,135 Stein, E.M., Weiss, G. [1971]: Introduction to Fourier Analysis on Euclidean Spaces. Princeton Univ. Press: Princeton, Zbl.232.42007 Stone, M. [1932]: Linear Transformations in Hilbert Space and their Application to Analysis. Am. Math. Sot. Coil. Publ., IS: New York, Zb1.5,400 Suetin, P.K. C1984]: Series of Faber Polynomials. Nauka: Moscow (Russian) Szego, G. [1939]: Orthogonal Polynomials. Am. Math. Sot. Coll. Publ. 23: New York, Zb1.23,215 Sziikefalvi-Nagy B., Foias, C. [1967]: Analyse Harmonique des Operateurs de I space de Hilbert. Masson-Acad.: Budapest, Zbl.157,432 Taylor, J.L. [1973]: Measure Algebras. Am. Math. Sot.: Providence, RI., Zb1.269.28001 Triebel, H. [1983]: Theory of Function Spaces. BirhHuser: Basel, Zb1.546.46027 Voherra, V. [1930]: Theory of Functionals. Blackie: London Waelbroeck, L. [1962]: Theories des Algebres de Banach et des Algebres Localement Convexes. Univ. Press: Montreal, Zbl.145,167 Weil, A. [ 19401: L nttgration dans les Groupes Topologiques et ses Applications. Hermann: Paris Widder, D.V. [1941]: The Laplace Transform. Princeton Univ. Press: Princeton Wiener, N. [1933]: The Fourier Integral and Certain of its Applications. Cambridge Univ. Press., Zb1.6,54 Wiener, N., Paley, R.E.A.C. [1934]: Fourier Transforms in the Complex Domain. Am. Math. Sot.: New York, Zbl.11,16 Zelazko, W. [1984]: On Ideal Theory in Banach and Topological Algebras. Monogr. Inst. Mat. Universidad National Autbnoma de Mexico, No. 15: Mexico City Zygmund, A. [ 1959,1960]: Trigonometric Series I, II. Cambridge Univ. Press: Cambridge, Zb1.85,56 IV. Survey Articles from Usp. Mat. Nauk Antonovskij, M.Ya., Boltyanskij, V.G., Sarymsakov, T.A. [1966]: Outlines of the theory of topological semifields. 21, No. 4, 185-218. English transl.: Russ. Math. Surv. 21, No. 4, 163-192 (1966), Zb1.178,258 Arkin, V.L., Levin, V.L. [1972]: Convexity of the values of vector integrals. Measurable selection theorems and variational problems. 27, No. 3, 21-77. English transl.: Russ. Math. Surv. 27, No. 3,21-85 (1973), Zb1.254.49006 Berezanskij, Yu.M. [1984]: Projective spectral theorem. 39, No. 4,3-52. English transl.: Russ. Math. Surv. 39, No. 4,3-52 (1984), Zb1.577.47019 Birman, M.Sh., Solomyak, M.Z. [1977]: Bounds for the singular numbers of integral operators. 32, No. 1, 17-84. English transl.: Russ. Math. Surv. 32, No. 1, 15-89 (1977) Zb1.344.47021 Bochkarev, S.V. [1972]: Absolute convergence of Fourier series with respect to complete orthonormal systems.27, No. 2,53-76. English transl.: Russ. Math. Surv. 27, No. 2,55-81(1973), Zb1.236.42014 Bogolyubov, N.N. [1965]: Mathematical problems of quantum field theory. 20, No. 3, 31-40 (Russian) Bukhvalov, A.V., Veksler, A.I., Lozanovskij, G.Ya. [1979]: Banach lattices-certain Banach aspects of the theory. 34, No. 2, 137-183. English transl. Russ. Math. Surv. 34, No. 2, 159-212 (1979), Zb1.412.46015 Fet, A.I. [ 19561: A space of analytic functions and its application to the Cauchy-Kovalevski problem. II, No. 2,215-222 (Russian), Zb1.70,88 Gel and, I.M. [1956]: On certain problems of functional analysis. 1 I, No. 6,3-12,Zbl.73,94. English transl.: Am. Math. Sot. Transl., II. Ser. 16,315-324 (1960) Gel and, LM, Yaglom, A.M. [1956]: Integration in function spaces and its application to quantum physics. II, No. 1,77-114,Zb1.71,224. English transl.: J. Math. Phys. I, 48-69 (1960) Ismagilov, R.S. [1974]: The diameters of sets in normed linear spaces and the approximation of functions by trigonometric polynomials. 29, No. 3, 161-178. English transl.: Russ. Math. Surv. 29, No. 3, 169-186 (1974), Zbl.303.41039
270
Bibliography
Kadets, M.I., Mityagin, B.S. [1973]: Complemented subspaces in Banach spaces. 28, No. 6,77-94. English transl.: Russ. Math. Surv. 28, No. 6,77-95 (1973X Zbl.287.46029 Kantorovich, L.V. [1948]: Functional analysis and applied mathematics. 3, No. 6,89-185 (Russian), Zbl.34,212 Keldysh, M.V. [1971]: On completeness of the eigenfunctions of certain classes of non-selfadjoint linear operators. 26, No. 4, 15-41. English transl.: Russ. Math. Surv. 26, No. 4, 15-44 (1972), Zbl.225,47008 Khaplanov, M.G. [1956]: Infinite matrices in an analytic space. 11, No. 5,37-44,Zbl.73,334. English transl.: Am. Math. Sot., Transl., II. Ser. 13, 177-183 (1960) Krejn, M.G., Krasnosel kij, M.A. [1947]: Fundamental theorems on the expansion of linear operators and some of their applications to the theory of orthogonal polynomials and the moment problem. 2, No. 3,60-W (Russian) Krejn, M.G., Rutman, M.A. [1948]: Linear operators leaving invariant a cone in a Banach space. 3, No. 1,3-95 (Russian), Zbl.34129 Kuptsov, N.P. [1968]: Direct and converse theorems of the theory of approximations and semigroups of operators. 23, No. 4, 117-178. English transl.: Russ. Math. Surv. 23, No. 4, 115-177 (1968), Zbl. 159,434 Kutateladxe, S.S. [1975]: Choquet boundaries in K-spaces. 30, No. 4,107-146. English transl.: Russ. Math. Surv. 30, No. 4, 115-155 (1975), Zbl.331.46011 Lyubich, Yu.1. [1966]: Classical and local Laplace transforms in an abstract Cauchy problem. 21, No. 3, 3-51. English transl.: Russ. Math. Surv. 21, No. 3, l-52 (1966). Zbl.173,120 Madxhenes, Eh. (=Magenes, E.) [1966]. Interpolation spaces and partial differential equations. 21, No. 2, 169-218 (Russian), Zbl.173,158 Markus, AS. [1964]: Eigenvalues and singular values of the sum and composition of linear operators. 19, No. 4,93-123. English transl.: Russ. Math. Surv. 19, No. 4,91-120 (1964), Zbl.133,72 Mautner, F.I. [1955]: On eigenfunction expansions. 10, No. 4, 127-132. English transl.: Proc. Natl. Acad. Sci. USA 39,49-53 (1953), Zbl.50,119 Mil an, V.D. [1970]: Geometric theory of Banach spaces I. Theory of bases and minimal systems. 25, No. 3, 113-174. English transl.: Russ. Math. Surv. 25, No. 3, 111-170 (1970), Zbl.198,165 Mil an, V.D. [1971]: Geometric theory of Banach spaces II. Geometry of the unit sphere. 26, No. 6,73-149. English transl.: Russ. Math. Surv. 26, No. 6,79-163 (1972), Zbl.229.46017 Minlos, R.A. [1968]: Lectures on statistical physics. 23, No. 1,133-190. English transl.: Russ. Math. Surv. 23, No. 1, 137-196 (1968), Zbl.165,585 Mishchenko, A.S. [1976]: Hermitian K-theory. Theory of characteristic classes, methods of functional analysis. 31, No. 2, 69-134. English transl.: Russ. Math. Surv. 31, No. 2, 71-138 (1976), Zbl.427.55001 Mishchenko, A.S. [1979]: Banach algebras, pseudodifferential operators and their applications to K-theory. 34, No. 6,67-79. English transl.: Russ. Math. Surv. 34, No. 6,77-91(1979),Zbl.432.46050 Mityagin, B.S. [1961]: Approximate dimension and bases in nuclear spaces. 16, No. 4, 63-132. English transl.: Russ. Math. Surv. 16, No. 4, 59-128 (1961), Zbl.104.86 Mityagin, B.S. [1970]: Homotopy structure of the linear group of a Banach space. 25, No. 5,63-106. English transl.: Russ. Math. Surv. 25, No. 5, 59-103 (1970), Zbl.232.47046 Mityagin, B.S., Shvarts, A.S. [1964]: Functors on categories of Banach spaces. 19, No. 2, 65-130. English transl.: Russ. Math. Surv. 19, No. 2,65-127 (1964), Zbl.129.82 Nikishin, E.M. [1970]: Resonance theorems and superlinear operators. 25, No. 6,129-191. English transl.: Russ. Math. Surv. 25, No. 6, 125-187 (1970), Zbl.222.47024 Opojtsev, V.I. [1976]: A converse of the principle of contraction mappings. 31, No. 4, 169-198. English transl.: Russ. Math. Surv. 31, No. 4, 175-204 (1976), Zbl.335.54025 Pelcxynski, A. [1973]: On certain problems of Banach. 28, No. 6,67-75. English transl.: Russ. Math. Surv. 28, No. 6,67-76 (1973), Zbl.281.46014 Pelcxynski, A., Figel T. [1973]: On Entlo method of constructing Banach spaces without the approximation property. 28, No. 6,95-108. English transl.: Russ. Math. Surv. 28, No. 6,95-108 (1973), Zbl.305.46027 Peller, V.V., and Khrushchev, S.V. [1982]: Hankel operators, best approximations and stationary
Bibliography
271
Gaussian processes. 37, No. 1,53-124. English transl.: Russ. Math. Surv. 37, No. 1,61-144 (1982), Zbl.497.60033 Radzievskij, G.V. [1982]: The problem of completeness of root vectors in the spectral theory of operator-functions. 37, No. 2,81-145. English transl.: Russ. Math. Surv. 37, No. 2,91-164 (1982), Zbl.524.47009 Rubinov, A.M. [1977]: Sublinear operators and their applications. 32, No. 4,1133177. English transl.: Russ. Math. Surv. 32, No. 4, 115-175 (1977), Zbl.355.47002 Sadovskij, B.N. [1972]: Limit-compact and condensing operators. 27, No. 1,81-146. English transl.: Russ. Math. Surv. 27, No. 1,85-155 (1972), Zbl.232.47067 Sakhnovich, L.A. [1980]: Equations with a difference. kernel on a finite interval. 35, No. 4, 69-l 12. English transl.: Russ. Math. Surv. 35, No. 4,81-152 (1980), Zbl.444.45008 Tsekanovskij, Eh.R., Shmul an, Yu.L. [1977]: Theory of biextensions of operators on rigged Hilbert spaces. Unbounded operator colligations and characteristic functions. 32, No. 5,69-124. English transl.: Russ. Math. Surv. 32, No. 5,73-132 (1977), Zbl.447.47011 Ul anov, P.L. [1961]: Divergent Fourier series. 16, No. 3,61-142. English transl.: Russ. Math. Surv. 26, No. 3, l-75 (1961), Zbl.103,290 Ul anov, P.L. [1964]: Solved and unsolved problems in the theory of trigonometric and orthogonal series. 19, No. 1, 3-52. English transl.: Russ. Math. Surv. 19, No. 1, l-62 (1964), Zb1.125,312 Vershik, A.M., Gel and, I.M., Graev, MI. [1975]: Representations of the group of diffeomorphisms. 30, No. 6, 3-50. English transl.: Russ. Math. Surv. 30, No. 6, l-50 (1975), Zbl.317.58009 Vishik, MI. [1971]: Parametrix of elliptical operators with infinitely many independent variables. 26, No. 2, 155-174. English transl.: Russ. Math. Surv. 26, No. 2,91-112 (1972), Zbl.221.35070 Vishik, MI., Ehskin, G.I. [1967]: Elliptical convolution equations on bounded domains and their applications. 22, No. 1, 15-76. English transl.: Russ. Math. Surv. 22, No. 1, 13-75 (1967), Zbl.167, 448 Vishik, M.I., Ladyzhenskaya, O.A. [1956]: Boundary value problems for partial differential equations and certain classesof operator equations. If, No. 6,41-97. English transl.: Am. Math. Sot., Transl., II. Ser. 10,223-281 (1958), Zbl.73,99 Vlasov, L.P. [1973]: Approximative properties of sets in normed linear spaces. 28, No. 6, 3-66. English transl.: Russ. Math. Surv. 28, No. 6, l-66 (1973), Zbl.291.41028 Volevich, L.P., Gindikin, S.G. [1972]: The Cauchy problem and problems associated with it for convolution equations. 27, No. 4, 65-143. English transl.: Russ, Math. Surv. 27, No. 4, 71-160 (1973), Zbl.244.35015 Volevich, L.R., Paneyakh, B.P. [1965]: Certain spaces of general&d functions and embedding theorems. 20, No. 1, 3-74. English transl.: Russ. Math. Surv. 20, No. 1, l-73 (1965), Zbl.135,165 Zajdenberg, M.G., Krejn, S.G., Kuchment, P.A., Pankov, A.A. [1975]: Banach bundles and linear operators. 30, No. 5, 101-157. English transl.: Russ. Math. Surv. 30, No. 5, 115-175 (1975), Zbl.317.47018 Zharinov, V.V. [1979]. Compact families of locally convex topological vector spaces, FrechetSchwartz and dual Frechet-Schwartz spaces.34, No. 4,97-131. English transl.: Russ. Math. Surv. 34, No. 4,97-131 (1979), Zbl.412.46003 V. Articles from Previous Volumes of the Encyclopaedia of Math. Sciences Aleksandrov, A.B. [1985]: Theory of functions on a ball. In: Khenkin, Vitushkin (eds.): Several Complex Variables II. Vol. 8, 115-19O,Zbl.614.32005. English transl.: Enc. Math. Sc. 8, SpringerVerlag: Berlin (to appear) Arkhangel kij, A.V., Fedorchuk, V.V. [1988]: Basic concepts and constructions of general topology. In: Arkhangel kij. Pontryagin (eds.): General Topology I. Vol. 17. English transl.: Enc. Math. SC. 17, Springer-Verlag: Berlin, 1990, l-90 Cornfel , I.P., Sinai, Ya.G., Vershik, A.M. [1985]: General ergodic theory of groups of measure preserving transformations. In: Arnol (ed.): Dynamical Systems II. Vol. 2,5-l 12. English transl.: Enc. Math. SC.2, Springer-Verlag: Berlin, 1989, l-98
212
Bibliography
Dubrovin, B.A., Krichever, I.M., Novikov, S.P. [1985]: Integrable systems. In: Amol , Novikov (eds.): Dynamical Systems IV. Vol. 4, 179-248. English transl.: Enc. Math. SC.4 Springer-Verlag: Berlin, 1989, 173-28O,Zb1.591.58013 Dyn in, E.M. [1987]: Methods of the theory of singular integrals (Hilbert transform and CalderonZygmund theory). In: Khavin, Nikol kij (eds.): Commutative Harmonic Analysis I. Vol. 25, 197-202. English transl.: Enc. Math. SC.15, Springer-Verlag: Berlin, 199l,Zbl.661.42009 Evgrafov, M.A. [1986]: Series and integral representations. In: Gamkrelidze (ed.): Analysis I. Vol. 13,5-92. English transl.: Enc. Math. SC.13 Springer-Verlag Berlin, 1989, 1-82,2b1.655.40001 Fedoryuk, M.V. [1986]: Integral transforms. In: Gamkrelidze (ed.): Analysis I. Vol. 13, 211-253. English transl.: Springer-Verlag, 1989, 193-232,Zbl.658.44001 Khavin, V.P. [1987J: Methods and structure of commutative harmonic analysis. In: Khavin, Nikol kij (eds.): Commutative Harmonic Analysis I. Vol. 15, 6-133. English transl.: Enc. Math. SC. IS, Springer-Verlag: Berlin, 199l,Zb1.656.43001 Kirillov, A.A. [1985]: Geometric quantization. In: Amol , Novikov (eds.) Dynamical Systems IV. Vol. 4, 141-178. English transl.: Enc. Math. SC.4, Springer-Verlag: Berlin, 1989,137-172,Zb1.591. 58014 Kislyakov, S.V. [1987]: Classical problems of Fourier analysis. In: Khavin, Nikol kij (eds.): Commutative Harmonic Analysis I. Vol. 15, 135-195. English transl.: Enc. Math. SC.15, Springer-Verlag: Berlin, 199l,Zb1.655.42006 Sergeev, A.T., Vladimirov, V.S. [1985]: The future cone of complex analysis. In: Khenkin, Vitushkin (eds.): Several Complex Variables I. Vol. 8, 191-266. English transl.: Enc. Math. SC.8, SpringerVerlag: Heidelberg (to appear); Zb1.614.32001 Shafarevich, I.R. [1986]: Fundamental concepts of algebra. In: Kostrikin, Shafarevich (eds.): Algebra I. Vol. 11, 290 pp. English transl.: Enc. Math. SC. 12, Springer-Verlag: Heidelberg, 1989, l-258, Zb1.655.00002,2b1.674.12001 Tikhomirov, V.M. [1987]: Convex analysis. In: Gamkrelidze (ed.): Analysis II. Vol. 14,5-101. English transl.: Enc. Math. SC.14, Springer-Verlag: Heidelberg, 1990, l-92 Tikhomirov, V.M. [1987]: Approximation theory. In: Gamkrelidze (ed.): Analysis II. Vol. 14, 103260. English transl.: Enc. Math. SC.14, Springer-Verlag: Heidelberg, 1990,93-244,Zb1.655.41002 Vitushkin, A.G. [1985]: Remarkable facts of complex analysis. In: Vitushkin (ed.): Several Complex Variables I. Vol. 7,5-22. English transl.: Enc. Math. SC.7, Springer-Verlag: Heidelberg, 1989,1-18
VI. Reference
Rooks
from
the Series Spravacknayr
Matematichesknya
Bibiioteka
Babich, V.M. et al. [1964]: Linear Equations of Mathematical Physics. (Mikhlin, S.G. (ed.)) Nauka: Moscow, Zbl.122,330 Birman, MSh. et al. [1972]: Functional Analysis. (2nd ed.) (Krejn, S.G. (ed.)) Nauka: Moscow, Zb1.261.47001 Brychkov, Yu.A., Prudnikov, A.P. [1977]: Integral Transforms of General&d Functions. Nauka: Moscow, Zbl.464.46039 Ditkin, V.A., Prudnikov, A.P. [1971]: Integral Transforms and Operational Calculus. Nauka: Moscow, Zb1.99.89. English transl.: Pergamon Press: Oxford, 1965 Guter, R.S.,Kudryavtsev, L.D., Levitan, B.M. [1963]: Elements of the Theory of Functions. (Ul anov, P.L. (ed.)) Phys.-Math. Section of Nauka: Moscow, Zb1.115,47. English transl.: Pergamon Press: Oxford, 1966 Zabrejko, P.P. et al. [1968]: Integral Equations. Nauka: Moscow
Author Index Adamyan, V.M. 227 Akhiezer, NJ. 29,61 Alakhverdiev, D.Eh. 216 Alaoglu, L. 145 Aleksandrov, P.S. 175 Amir, D. 143 Appel, P.E. 65 Arens, R.F. 148,236 Aronszajn, N. 175,258 Arov, D.Z. 227 Arzela, C. 154 Ascoli, G. 113, 154 Atiyah, M.F. 95,260 Atkinson, F.V. 162, 167 Babenko, K.I. 135 Babich, V.M. 244 Bade, W.G. 207 Baire, R. 122 Banach, S. 14, 112,122, 123, 145,146, 151153, 157-160,261 Bari, N.K. 159,169 Berezanskij, Yu.M. 203,242,260 Bernoulli, D. 19 Bernoulli, Johann 10 Bemshtein, S.N. 26, 29, 30,43, 55, 57, 63 Besov, O.V. 243 Bessel, F.W. 24,28, 52 Beurling, A. 226 Birkhoff, G.A. 121 Birman, MS. 206 Bishop, E. 130,209,238 Blaschke, W. 68,224 Bochkarev, S.V. 134 Bochner, S. 35,52,53, 141,241,249 Bogolyubov, N.N. 179,246,252 Bohl, P. 191 Bohr, H. 51,52,191 Bonsall, F.F. 215 Borel, E. 53,69, 151 Borok, V.M. 5,249 Bourbaki, N. 261 Brauer, A. 174 Brodskij, MS. 221-223 Brudnyj, Yu.A. 5,258 Bunyakovskij, V.Ya. 13 Burnside, W. 177,180 Calderon, A.P. 236,243,257-259 Calkin, J.V. 217
Caratheodory, K. 108 Carleman, T. 30, 70 Carleson, L. 23,214,233 Cauchy, A. 12, 13, 144 Chebyshev, P.L. 53,54,58,59 Choquet, G. 143 Clairaut, AC. 20 Clarkson, J.A. 130, 158 Cotes, R. 64 Courant, R. 47 D lembert, J. 20 Daletskij, Yu.L. 260 Day, M.M. 131, 158 De Leeuw, K. 192 Delsarte, J. 44, 204 Denjoy, A. 26 Dieudonne, J. 152,261 Dirac, P.A.M. 11 Dirichlet, P.G.L. 17,22,42 Dolzhenko, E.P. 66 Du Bois-Reymond, P. 23 Duncan, J. 215 Dunford, N. 143,207,209 Dvoretzki, A. 136, 172 Dynin, A.S. 172 Eberlein, W.F. 151 Etimov, N.V. 147 Ehjdel an, SD. 249 Ehrenpreis, L. 247 Eidelheit, M. 113 Enflo, P. 134, 158, 166 Esclangon, E. 93 Euclid 4 Euler, L. 10, 19,20 Faber, G. 63,66 Faddeev, L.D. 205,227 Fedoryuk, M.V. 260 Fejer, L. 23,53, 117 Fekete, M. 184 Feller, W. 189 Fermat, P. 10 Fischer, E. 14,47 Fixman, V. 207 Foias, C. 211,214,215,225,226 Fomin, S.V. 260 Fourier, J. 19,20, 50 Frechet, M. 191
274
Author Index
Fredholm, I. 74, 165, 166 Frobeoius, G. 175180,228 Fubioi, G. 15 Gagliardo, E. 243,258 Gakhov, F.D. 82 Gamkrelidze, R.V. 5 Gel aod, I.M. 27,48,107,140,190,204,232, 233,239-241,249 Gel ood, A.O. 62 Giozburg, Yu.P. 229 Giraud, G. 259 Glazmao, LM. 213 Glicksberg, I. 192 Giidel, K. 97 Gokhberg, I.Ts. 138,162, 167,221 Gol mao, M.L. 255 Goochar, A.A. 238 Gorio, E.A. 5,190,235 Grotheodieck, A. 152,170-173 Haar, A. 56,179 Hadamard, J. 10,69, 144,258 Hahn, H. 112 Hamburger, H.L. 30 Hartmao, P. 51 Hartogs, F. 66 HausdortT, F. 34154,215 Heaviside, 0. 40 Hellioger, E. 31, 193 Helly, E. 35, 145 Herglotz, G. 35 Hermite, C. 62 Hilbert, D. 14, 27, 74, 103, 105, 187, 193, 201 Hille, E. 189 Holub, J.R. 166 Hopf, E. 79 Hopf, H. 175 Hormaoder, L. 248,250,260 Huff, R.E. 143 Hunt, R.A. 23 Iokhvidov, IS.
228,230
Jackson, D. 55,57 Jacobs, K. 192 James, R.C. 130,147,158 Jordan, G. 22,97,98,224 Jordan, P. 130 Kadets, M.I. 5, 122, 135, 158, 191 Kadets, V.M. 5, 136 Kakutaoi, S. 121, 177, 192 Kalisch, G.K. 223
Kalyabio, G.A. 255 Kaotorovich, L.V. 117,163 Karlio, S. 135 Kato, T. 206 Katsnel oo, V.Eh. 5,190,214,215 Keldysh, M.V. 65,217,231 Khejtits, A.I. 133 Khiochio, A.Ya. 35 Klee, V.L. 131 Koch, H. 171 Kolmogorov, A.N. 12,23,56,93, 100,107, 122,126 Kooig, H. 219 Koopmao, B.O. 191 Korkio, A.N. 58 Kostyucheoko, A.G. 48 Kotel ikov, V.A. 69 Krasoosel kij, M.A. 137,182,254 Krejn, M.G. 28,61,79, 115, 129, 137-139, 146,153,162,169,182,204,206,221,228, 230,241 Krejo, S.G. 189,242,243,258,261 Kruglyak, N. 258 Krull, W. 101 Krylov, N.M. 179 Kuroda, S.T. 206 Kuz io, R.O. 64 Ladyzheoskaya, O.A. 249 Lagrange, J.L. lo,17 Lang, S. 105 Laoger, H. 214,230 Lavreot v, M.A. 65 Lax, P.D. 227 Lebesgue, H. 12,15,23,55,151 Leiboiz, G.W. 6 Leray, J. 165 Levio, B.Ya. 29,56,205 Levinsoo, N. 135 Levitao, B.M. 48,204 Levy, P. 80,235 Lidskij, V.B. 219,220 Lio, V.Ya. 235 Liodemann, F. 62 Liodeostrauss, J. 143, 157 Lions, J.L. 243,258 Liouville, J. 46,47, 184 Litvioov, G.M. 170, 171 Livshits, M.S. 220,223-225,227 Lizorkio, P.I. 255 Lobachevskij, N.I. 20 Lojasiewin, S. 250 Lomooosov, V.I. 176,2 10 Larch, E.R. 191
Author Index Lumer, G. 215 Luzin, N.N. 23,26 Lyubarskij, Yu.1. 5 Lyubich, M.Yu. 5, 192 Lyubich, Yu.1. 43,44,92, 188, 192,209,211 Mackey, G.W. 148,149 Magenes, E. 258 Mairhuber, J.C. 56 Malgrange, B. 247 Mal sev, AI. 97 Marchenko, V.A. 204,205 Marcinkiewicz, J. 256 Markov, A.A. 57-60 Markov, A.A. (junior) 177 Markus, A.S. 5, 138, 170,212,219,220 Markushevich, A.I. 135 Martineau, A. 251,252 Maslov, V.P. 48,260 Matsaev, V.I. 170,205,209,210,215,217, 219,222 Mautner, F.I. 203 Mazur, S. 107,113,122,131,146,233 Mellin, H. 39 Mercer, T. 78 Mergelyan, S.N. 29,65,66 Mikhlin, S.G. 168,258,259 Mil an, D.P. 130,137-139,162, 182 Milyutin, A.A. 122 Minkowski, H. 14, 138 Mityagin, B.S. 172 Miyadera, I. 189 Molchanov, A.M. 51 Molin, F.Eh. 180 Montel, P. 151 Morimoto, M. 252 Morris, P. 143 Mukminov, B.R. 213 Miintz, C.H. 133 Najmark, M.A. 226,239,241 Nevanlinna, R. 31 Newton, I. 6 Nikodjrm, 0. 142 Nikol kaya, L.N. 210 Nikol kij, N.K. 5,210,212,220,221,227 Nikol kij, S.M. 243,255 Noether, E. 89 Noether, M. 82 Numakura, K. 192 Ostrowski, A.M.
70,210
Paley, R.E.A.C.
38, 163
Parasyuk, OS. 246 Parseval, M. 25, 28, 37,46,48,49, 53, 135 Pastur, L.A. 5 Pavlov, B.S. 227 Peetre. J. 255 Pelczyirski, A. 135 Perron, 0. 175 Peter, F. 180,241 Petrovskij, I.G. 248,249 Pettis, B.J. 140,143 Petunin, Yu.1. 258 Phelps, R.R. 130,143 Phillips, R.S. 189,215,227,229 Picard, E. 75,164 Pie&h, A. 169, 172 Plancherel, M. 37,50,241 Pontryagin, L.S. 228,230,241 Potapov, V.G. 224 Povzner, A.Ya. 203,204 Ptlk, V. 152 Pythagoras 13,25 Radon, J. 142 Rajkov, D.A. 149,241 Read, C.J. 175 Riemann, B. 12,23,39, 136 Riesz, F. 14, 16, 35, 116, 117, 129, 131, 132, 165,208 Riesz, M. 31,256 Ringrose, J.R. 220 Roberts, J.W. 139 Rogers, C.A. 136,172 Rojtburd, V.L. 260 Rosenblum, M. 206 Rosenthal, A. 66 Royden, H. 236 Rudin, W. 212 Runge, K. 65 Russu, G.I. 258 Rutman, M.A. 146,162,175,186 Sakhnovich, L.A. 204,222 Schauder, J. 167,174 Schmidt, E. 74 Schoenberg, I.Y. 57 Schrodinger, E. 46,47,203 Schur, I. 177,180,214 Schwartz, H.A. 13 Schwartz, L. 45,242,248,249 Semenov, E.M. 257 Shilov, G.E. 235-237,249,250 Shmul an, V.L. 130, 139, 151-153,158 Shmul an, Yu.L. 225 Shtraus, A.V. 225
215
276 Sieklucki, K. 56 Singer, I.M. 95,260 Slobodetskij, L.N. 244 Smith, K.T. 175 Sobolev, S.L. 242-244 Sobolevskij, P.E. 189 Sokhotskij, Yu.V. 83 Stechkin, SB. 147 Steinhaus, H. 123 Steinitx, E. 136 Steklov, V.A. 47 Stieltjes, K. 12, 28 Stiickert, B. 255 Stone, M.H. 104,201,233,234,261 sturm, c. 46,47 Sudakov, V.N. 260 Sushkevich, A.K. 192 Sylvester, J.J. 99 Szokefalvi-Nagy, B. 214,225,226 Taklind, S. 243 Tannaka, T. 241 Thorin, G.O. 256 Tikhonov, A.N. 145,174,243 Titchmarsh, E.C. 45,48,223 Tkachenko, V.A. 5,44,209 Toeplitz, 0. 96, 123, 159, 193 Tonelli, L. 56 Tricomi, F.G. 259 Triebel, H. 255 Tzafriri, L. 157
Author Index Van Kampen, E.R. 241 Vershik, A.M. 5,260 Vitushkin, A.G. 65 Vladimirov, V.S. 252 Volterra, V. 10 VonNeumann, J. 124,130,178,179,191, 193,194,202,214,215,261 Walsh, J.L. 65 Weierstrass, K. 24,28,65,68, 151,233, 234 Weil, A. 241 Wermer, J. 220,234 Weyl, H. 31,48,50, 51, 180,202 Widder, D.V. 40,189 Wiener, N. 27, 38, 39, 79, 80, 163, 235, 236 Williamson, J.H. 165 Wintner, A. 51 Wold, H. 214 Yang, C.T. 57 Yentsch, R. 175 Yosida, K. 189, 192 Young, L.C. 252 Zharinov, V.V. 252 Zobin, N.M. 172 Zolotarev, E.I. 58 Zolotarev, G.N. 249 Zygmund, A. 256,259
Subject Index Action 10, 178 Aftine manifold 88 Algebra, Banach 27,232 - - commutative 27 - Boolean 04 - function 101 - group 240 - local 101 - of sets 12 - operator 239 - - reducible 176 - semisimple 102 - spectral 102 - symmetric 234 - uniform 234 - universally enveloping 242 - Wiener 27 - - with weight 237 - with involution 238 Algebraic interior 108 Almost-period 52 Annihilator of a subspace 89 Ball, q-unit 111 Banach limit 178 Barrel 122 Bases, quadratically close 169 Basis, Bari 169 - Hamel 85 - Riesz 159 - Schauder 134 - unconditional 135 C*-algebra 239 Capacity, analytic 65 - - continuous 65 Central symmetry 108 Centralizer of a subalgebra 176 Centre of an algebra 176 -0fmass 143 Character of a representation 180 - of a semigroup 180 - of an algebra 104 Charge 131 Chebyshev alternant 53 Class, Hardy 38,225 - Holder 26 - Lipschitz 26 - Nevanlinna 32 - quasianalytic 69
Closure of a linear operator 193 Codefect of a homomorphism 90 Codimension of a subspace 88 Coefficients, Christoffel 60 - Fourier 21,52 - - generalised 27 Coimage of a homomorphism 90 Cokemel of a homomorphism 90 Combination, convex 107 - - absolutely 108 - non-negative 109 Commuting operators 181 Compact set, Bohr 192 -spectral 208 Complement of a subspace, algebraic - - - - topological 157 Condition, boundary 11,19,71 - distributed 71 - initial 18 -
Levinson
85
209
- Lipschitz 9, 163 - of regularity of a functional 16 Cone 109 - minihedral 118 -total 186 Constant, interpolation 257 - Lebesgue 55 Convergence, absolute 101,135 - mean square 13 - unconditional 135 Convolution 22,240 - of sequences 27 Coordinates of a vector 86, 134 Corank of a homomorphism 90 Cross-property of a seminorm 173 Decomposition, triangular 220,222 - Wold 214 Defect of a homomorphism 90 Deficiency index of a symmetric operator -of an isometric operator 185 Definiteness, procedure for 230 S-function 11 Derivative, Frechet 163 - generalised 244 - of fractional order 42 - Radon-Nikodym 142 Determinant, Fredholm 77 - of a linear operator 170 Deviation 53, 147
194
278 Differential of a function 6 - of a mapping 163 Dilatation 226 Dimension, algebraic 86 - topological 136 Direct sum of linear operators Disk algebra 234 Dissipation 18 Distribution 245 - of compact support 246 - of finite order 245 - probability 12 - regular 245 - tempered 250 Duality of linear spaces 147
Subject Index
92
Edge of a wedge 109 Equation, Abel integral 42 - characteristic 45 - Euler-Lagrange 16 - Fredholm integral 74 -heat 19 - Kepler 9 - Lagrange 17 - Laplace 243 - Poisson 247 - singular integral 82 - Volterra integral 74 -wave 18 Equimeasurability of functions 253 Equivalence of metrics 157 - of norms 158 - of operators 181 - of representations 180 - - unitary 180 Extension of an operator 181 Flow 178 Formula, Cauchy-Hadamard 144 - D lembert 19 - Euler-Fourier 20 - Gel and 184 - interpolation - - Hermite 62 - - Lagrange 62 - - Newton 62 - inversion, Fourier 37 - - Fourier-Plancherel 37 - - Riemann-Mellin 39 - - Weyl 49 - - Widder 40 - Koch-Grothendieck 171 - Lagrange 72 - Lagrange-Sylvester 99
- of finite increments 163 - quadrature, Cotes 64 - - Gaussian 59 - Taylor 7 Function, absolutely continuous 15 - absolutely monotonic 43 - almost-periodic 51 - analytic 7 - basic 245 - Cauchy 41 - characteristic 223 - differentiable 6, 163 - eigen- 51 - - generalised 5 1 -flat 7 - general&d 245 - Green 71 - integrable 12 - interior 226 - Lagrange 16 - measurable 12 - normalised 13 - of compact support 38,86 - periodic in mean 44 - positive 15, 36 - Riemann t;- 42 - smooth 6 - summable 12 - - locally 15 - Weierstrass 54 Functional, additive 11 - analytic 251 - bounded 127 --locally 118-119 - convex 110 - coordinate 88 -gauge 111 - Hermitian quadratic 77 - homogeneous 11 - - positively 110 - linear 11 - Minkowski 111 - monotone 15 - multiplicative 104 - non-negative 238 - positive 15 - sesquilinear 196 - subadditive 109 - sublinear 110 -support 113,115 - symmetric 238 - - Hermitian 196 Functional calculus 99, 104, 170, 198,208, 234
Subject Index Gap between subspaces 137 Generalised sum of a series 100 Group, amenable 177- 178 -dual 240 - one-parameter 186 - unimodular 179 Half-open interval 108 Half-space 113 Hamiltonian 46,205 Harmonic oscillator 17 Homomorphism, absolutely summit xg 172 - bounded 153 -closed 155 - compact 165 - conjugate -algebraic 90 - - topological 155 - finite rank 93 - Hilbert-Schmidt 168 - Noetherian 93 - nuclear 169 - *- 238 Hull, convex 108 - - absolutely 108, 138 - - closed 138 - linear 88 Hyperfunction 251 Hyperplane 88 - of support 115 - separating 113 -almost 114 - - strictly 128 Ideal, maximal 101 - - standard 106 - simple 102 - symmetric 238 Index of a homomorphism 93 - of an indefinite metric 228 - of nilpotence 97 Inequality, interpolation 257 - triangle 13, 109 Intimum 118 Integral, Bochner 142 - Brodskij 221 - Dirichlet 243 - Fourier 36 - Gel and-Pettis 140 - Lebesgue 12 - Lebesgue-Stieltjes 15 - of Cauchy type 82 - of fractional order 42 Invariant mean 177
279
Jordan cell 97- 98 - structure 97 Kernel, Cauchy 168 - difference 78 - Dirichlet 22 - Fejtr 23 - Hermitian symmetric 77 - Hilbert-Schmidt 77 - of a homomorphism 89 - of a seminorm 111 - of an integral operator 71, 76 - Sushkevich 192 L-problem of moments 60 I-measure 14 Lattice, Banach 234 - vector 118 - - Dedekind complete 119 - - Dedekind a-complete 119 Logarithmic property of the index Matrix, Hankel 30 - Heisenberg 206 - identity 86 - Jacobian 34 - Nevanlinna 33 - Toeplitz 35 Measure, Bore1 14 - Choquet 143 - finite 12 - general&d 13 1 - Haar 179 - invariant 178 - Lebesgue 15 - - Stieltjes 14 - of compact support 132 - operator-valued 198 -outer 14 - probability 12 - regular 116 - structure 242 - vector 142 -absolutely continuous 142 Method, Fourier 19 - of least squares 147 - of summation 100 -Cedro 23 -Fejer 23 - of tangents 8 Metric, indefinite 227 - invariant 120 Model of a linear operator 222 - functional 223
93
280
Subject Index
Modulus of continuity 26,254 Moment problem, abstract 116 - Chebyshev-Markov 58 - exponential 43 - Nevanlinna-Pick 68 - power 29 - trigonometric 35 Monoid 178 Multiplicators 246 Multiplicity of an eigenvalue 199 Nil radical 102 Nilpotent 102 Nodes of a quadrature formula 59 Norm, graph 202 - Hilbert-Schmidt 168 - nuclear 169 - of a function 13 - of a homomorphism 127,153 - of a measure 132 - of a vector 14, 111 - of a vector measure 142 Normal eigenvalue 182 Normal mode 18 Number, approximation 168 - defect 182 - singular 216 Numerical range of a linear operator R-topology 153 Operator, adjoint I92 - algebraic 97 - communication 227 - compact relative to another operator - complete 216 - completely non-selfadjoint 223 - completely non-unitary 214 - conservative 190 - contracting 157 - - uniformly 157 - damping 23 1 - decomposable 210 - differential 70,259 - - elliptic 248 - differentiation 11,99 - dissipative 190 - dynamical 227 - expanding 224 - Hilbert 256 - identity 11 - infinitesimal 186 - integral 71 - - Fourier 260 - isometric 226
- Laplace 18 - linear 11,91 - metric 228 - non-quasianalytic 209 - normal 193 - normally resolvable 162, 166 - pseudoditferential 259 - quasiconservative 212 - quasidissipative 212 - quasinilpotent 184 - scattering 206 - selfadjoint 73,193 - shift 99, 225 - - weighted 221 - symmetric 46,193 - - maximal 196 - - non-negative 196 - unicellular 92,221 - Volterra 186 -wave 205 Order of a pseudodifferential operator - of an eigenvalue 97
215
202
259
Parallelogram identity 130 Part of an operator 181 - quasinilpotent 207 - scalar 207 Pencil of operators, quadratic 231 - strongly damped 23 1 Point, algebraic interior 108 - extreme I38 - interpolating 61 - Lebesgue 23 - quasiregular 182 - regular 98, 181 - stationary 10 - strongly exposed 143 - Weyl 32,50 Polar 145 Polar representation of an operator 217 Polynomial, Bemshtein 30 - characteristic 41 - Chebyshev - - of first kind 54 - - of second kind 58 -Faber 66 - Hermite 28 - Laguerre 28 - Legendre 28 -minimal 97 - of best approximation 53 Principle, contraction mapping 9 - fixed point 175 - of duality 241
Subject Index - of uniform boundedness 123,154 - variational 10 Problem, Cauchy 164,186 - - pointwise stable 188 - - stable 189 - Dirichlet 243 - of scattering theory -direct 204 - - inverse 204 - Riemann-Hilbert 82 - - additive 82 - Sturm-Liouville 45 Process, interpolation - - of Lagrange type 63 - - of Newton type 63 - Newton 8,163 - quadrature -Cotes 64 - - Gaussian 64 Product, Blaschke 68 - of measures 15 - scalar 13 - tensor 173 Projection 91 - boundary 192 -root 98 Property, approximation 165 - of reciprocity of polars 148 - Radon-Nikodym 142 Pseudodistance 228 Pseudonorm 120 Quasimodel of a linear operator Quasinilpotent 103 Quasinorm 121 Quasipolynomial 40 Quasiprojection 209 Radical of an algebra 102 Radius, spectral 184 - Weyl 31,SO Random events 12 - variables 12 Rank of a homomorphism 90 - of a set of vectors 88 Ray 109 Regularisor 94 Representation 179 - almost-periodic 191 - bounded 179 - canonical functional 103 - contracting 179 - isometric 179,233 - regular 180
222
- uniformly continuous 179 - unitary 179 Resolution - of the identity 197 - spectral 98, 197, 199 Resolvent 103 - of a linear operator 98, 181 - - Fredholm 217 - of an element of an algebra 103 Restriction of an operator 181 Scale of Banach spaces 258 -normal 258 Schmidt expansion 217 !kmiflow 178 Semigroup, amenable 177-178 - one-parameter 186 Seminorm 111 - factor 111 Separation of variables 19 Sequence, locally finite 245 - quasieigen- 182 - subadditive 184 -weight 25 Series, Dirichlet 42 - Fourier 20,21,52,134,136 - - generalised 27,45 - interpolation - - Lagrange 68 - - Newton 62 - Neumann 162 -Taylor 7 Set, absorbent 110 - algebraically -open 108 - - solid 108 - balanced 108 - bounded 126 - Chebyshev 147 - circled 108 - conal 109 - convex 107 - - absolutely 108 - measurable 12 - precompact 154 - relatively compact 154 -dense 52 - resolvent 98, 103, 181 - spectral 215 Shift, generalised 204 - semigroup 179 Shilov boundary 237 u-algebra 12 Solution, fundamental 247
282
- general&d 248 Space, Banach 14 - barrelled 122 - configuration 16 - conjugate -algebraic 88 - - topological 129 - coordinate 134 - countably Hilbert 173 - defect 182 -factor 88 - Frechet 122 - fully complete 152 - function 11 - Hilbert 14 - intermediate 256 - interpolation 257 - linear topological 119 - - complete 122 - locally bounded 126 - - convex 124 - Lorentz 253 - Mackey 149 - Marcinkiewicz 253 - maximal ideal 103 - Monte1 150 - Nikol kij-Besov 254 - normed 14 -nuclear 171 - of Lipschitz type 254 - ordered 117 - Orlicz 253 - phase 178 - pre-Hilbert 14 - probability 12 - Ptak 152 - reflexive algebraically 90 - - topologically 130, 150 - Riesz 118 - Schwartz 250 - second conjugate 150 - Slobodetskij 254 - smooth 130 - Sobolev 254 - strictly convex 131 - super-reflexive 158 - symmetric 257 - uniformly convex 130 - - smooth 130 Spectrum 51,97, 103,181 - absolutely continuous 200 - approximate 182 -Bohr 52 - combined 236
Subject Index - continuous 51, 199 - discrete 51, 199 - residual 182 - simple 224 - singular 199-200 Subordination of metrics 158 - of norms 158 Sum of spaces, external direct 89 - - - orthogonal 87 Sum of subspaces, direct 87 ---topologicaf 157 - - - orthogonal 157 Support of a distribution 246 - of a function 86 - of a measure 143 - of a vector 85 - of an analytic functional 251 Supremum 118 Symbol of a differential operator 248,259 - of a pseudodifferential operator 259 System, biorthogonal 61 - Chebyshev 56 - complete 27, 132-133 - distal 136 i dynamical 178 - Markov 57 - - normalised 59 - minimal 135 - w-linearly independent 134 - orthogonal 13 - orthonormal 13,134 Theorem, boundary spectrum splitting-off 192 - closed graph 155 - convolution 22 - ergodic 191 - expansion 40 - implicit function 9 - inverse homomorphism 156 - local 235 - open mapping 156 - separating hyperplane 113,114,128 - spectral 98,197,200,230 - spectral mapping 99,104,207,209,235 - stability of the index 95 - superposition 209 Theory, Fredholm 76 - Hilbert-Schmidt 77 - Noether 84 - scattering, non-stationary 205 - - stationary 204 Topological isomorphism 121 Topology, consistent with duality 148
Subject Index - DieudonnC-Schwartz 245 - locally convex 124 - Mackey 149 -norm 150 - of convergence in measure 120 - of uniform convergence 125 - - on bounded sets 150 - - on compact sets 125 - strong 145 - tensor product 173 - uniform 150, 154 -weak 145 - weakened 145 Total family of linear functionals 124 - - of projections 92 - - of seminorms 124 Trace 170 -matrix 170 - spectral 219 - tensor 169 Transform, Cayley 194 - Fourier 36 - - generalised 49, 248 - - operator 210 - - -Plancherel 37 - - -Stieltjes 37 - Gel and 233 - Laplace 39 -finite 44 -local 43
283
- - -Stieltjes 42 - Legendre 252 Travelling wave 18 Ultradistribution 250 Ultrapower 106 Unit sphere 127 Variation of a functional 16 Vector, channel 223 - cyclic 224 -root 97 Vector-function, absolutely integrable - analytic 144 - Bochner integrable 142 -entire 144 - Gel and-Pettis integrable 140 -simple 141-142 w*-topology 144 Wedge 109 - generating 109 Weight 28 Weights of a quadrature formula Weyl disk 32, 50 - point 32, 50 - radius 31,50 Young
relation
132
59
141