Fourier analysis - PDF Free Download

UNIVERSIDAD COMPLUTENSE Fourier Analysis Javier Duoandikoetxea Translated and revised by David Cruz-Uribe, SFO Grad...

373 downloads 1755 Views 10MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

UNIVERSIDAD COMPLUTENSE

Fourier Analysis

Javier Duoandikoetxea

Translated and revised by

David Cruz-Uribe, SFO

Graduate Studies in Mathematics Volume 29

o

TR

Mil 45

American Mathematical Society Providence, Rhode Island

Editorial Board James Humphreys (Chair) David Saltman David Sattinger Ronald Stern ANALISIS DE FOURIER by Javier Duoandikoetxea Zuazo Published in Spanish by Addison-Wesley and Universidad Aut6noma de Madrid in 1995 Translated from the Spanish by David Cruz-Uribe, SFO 2000 Mathematics Subject Classification. Primary 42B15, 42B20, 42B25. ABSTRACT. The purpose of this book is to develop Fourier analysis using the real variable methods

introduced by A. P. Calder& and A. Zygmund. It begins by reviewing the theory of Fourier series and integrals, and introduces the Hardy-Littlewood maximal function. It then treats the Hilbert transform and its higher dimensional analogues, singular integrals. In subsequent chapters it discusses some more recent topics: H I and BMO, weighted norm inequalities, Littlewood-Paley theory, and the T1 theorem. At the end of each chapter are extensive references and notes on additional results.

Library of Congress Cataloging-in-Publication Data Duoandikoetxea, Zuazo, Javier. [Analisis de Fourier. English] Fourier analysis / Javier Duoandikoetxea ; translated and revised by David Cruz-Uribe. p. cm. — (Graduate studies in mathematics ; v. 29) Includes bibliographical references and index. ISBN 0-8218-2172-5 1. Fourier analysis. I. Title. II. Series. QA403.5 .D8313 2000 00-064301 515'.2433—dc21

Copying and reprinting. Individual readers of this publication, and nonprofit libraries acting for them, are permitted to make fair use of the material, such as to copy a chapter for use in teaching or research. Permission is granted to quote brief passages from this publication in reviews, provided the customary acknowledgment of the source is given. Republication, systematic copying, or multiple reproduction of any material in this publication is permitted only under license from the American Mathematical Society. Requests for such permission should be addressed to the Assistant to the Publisher, American Mathematical Society, P.O. Box 6248, Providence, Rhode Island 02940-6248. Requests can also be made by e-mail to reprint -permiss ion@ams . org. © 2001 by the American Mathematical Society. All rights reserved. The American Mathematical Society retains all rights except those granted to the United States Government. Printed in the United States of America. The paper used in this book is acid-free and falls within the guidelines established to ensure permanence and durability. Visit the AMS home page at URL: http: //www.ams org/ 10 9 8 7 6 5 4 3 2 1

06 05 04 03 02 01

Dedicated to the memory of Jose Luis Rubio de Francia, my teacher and friend, who would have written a much better book than I have

Contents

Preface

xiii

Preliminaries

xvii

Chapter 1. Fourier Series and Integrals

1

§1. Fourier coefficients and series

1

§2. Criteria for pointwise convergence

2

§3. Fourier series of continuous functions

6

§4. Convergence in norm

8

§5. Summability methods

9

§6. The Fourier transform of L' functions

11

§7. The Schwartz class and tempered distributions

12

§8. The Fourier transform on LP, 1 p < 2

15

§9. The convergence and summability of Fourier integrals

17

§10. Notes and further results Chapter 2. The Hardy-Littlewood Maximal Function §1. Approximations of the identity

19 25 25

§2. Weak-type inequalities and almost everywhere convergence 26 §3. The Marcinkiewicz interpolation theorem

28

§4. The Hardy-Littlewood maximal function

30 ix

x

Contents §5. The dyadic maximal function §6. The weak (1,1) inequality for the maximal function

Contents

xi

32

§3. An interpolation result

121

35

§7.

A weighted norm inequality

§4. The John-Nirenberg inequality

37

123

§8.

Notes and further results

§5.

Notes and further results

38

126

Weighted Inequalities

133

Chapter 3.

The Hilbert Transform

49

§1. The AP condition

133

§2. Strong-type inequalities with weights

137

§3. Al weights and an extrapolation theorem

140

§4. Weighted inequalities for singular integrals

143

§1.

The conjugate Poisson kernel

49

§2.

The principal value of 1/x

50

§3. The theorems of M. Riesz and Kolmogorov

51

§4. Truncated integrals and pointwise convergence

55

§5. Multipliers §6. Notes and further results Chapter 4.

Chapter 7.

58 61

Singular Integrals (I)

69

§1.

Definition and examples

69

§2.

The Fourier transform of the kernel

70

Notes and further results

§ 5 . Chapter 8.

Littlewood-Paley Theory and Multipliers

147 157

§1. Some vector-valued inequalities

157

§2. Littlewood-Paley theory

159

§3. The Hi5rmander multiplier theorem

163

§4. The Marcinkiewicz multiplier theorem

166

§5. Bochner-Riesz multipliers

168

§6. Return to singular integrals

172

§3. The method of rotations

73

§4. Singular integrals with even kernel

77

§5. An operator algebra

80

§6. Singular integrals with variable kernel

83

§7. The maximal function and the Hilbert transform along a parabola

Notes and further results

178

85

§8. Notes and further results

184

Singular Integrals (II)

91

§7. Chapter 5. § 1.

The CalderOn-Zygmund theorem

Chapter 9.

The Ti Theorem

195

91

§1. Cotlar's lemma

§2. Truncated integrals and the principal value

195

94

§2. Carleson measures

§3. Generalized CalderOn-Zygmund operators

197

98

§3. Statement and applications of the T1 theorem

201

§4. CalderOn-Zygmund singular integrals

101

§4. Proof of the Ti theorem

§5. A vector-valued extension

205

105

§5.

212

§6. Chapter 6.

Notes and further results H I- and BMO

107 115

§1. The space atomic 1/ 1

115

§2. The space BMO

117

Notes and further results

Bibliography

217

Index

219

1

Preface

Fourier Analysis is a large branch of mathematics whose point of departure is the study of Fourier series and integrals. However, it encompasses a variety of perspectives and techniques, and so many different introductions with that title are possible. The goal of this book is to study the real variable methods introduced into Fourier analysis by A. P. Calder& and A. Zygmund in the 1950's. We begin in Chapter 1 with a review of Fourier series and integrals, and then in Chapters 2 and 3 we introduce two operators which are basic to the field: the Hardy-Littlewood maximal function and the Hilbert transform. Even though they appeared before the techniques of Calder& and Zygmund, we treat these operators from their point of view. The goal of these techniques is to enable the study of analogs of the Hilbert transform in higher dimensions; these are of great interest in applications. Such operators are known as singular integrals and are discussed in Chapters 4 and 5 along with their modern generalizations. We next consider two of the many contributions to the field which appeared in the 1970's. In Chapter 6 we study the relationship between Hl BMO and singular integrals, and in Chapter 7 we present the elementary theory of weighted norm inequalities. In Chapter 8 we discuss Littlewood-Paley theory; its origins date back to the 1930's, but it has had extensive later development which includes a number of applications. Those presented in this chapter are useful in the study of Fourier multipliers, which also uses the theory of weighted inequalities. We end the book with an important result of the 80's, the so-called T1 theorem, which has been of crucial importance to the field. ,

At the end of each chapter there is a section in which we try to give some idea of further results which are not discussed in the text, and give

xiv

Preface

references for the interested reader. A number of books and all the articles cited appear only in these notes; the bibliography at the end of the text is reserved for books which treat in depth the ideas we have presented. The material in this book comes from a graduate course taught at the Universidad AutOnoma de Madrid during the academic year 1988-89. Part of it is based on notes I took as a student in a course taught by Jose Luis Rubio de Francia at the same university in the fall of 1985. It seemed to have been his intention to write up his course, but he was prevented from doing so by his untimely death. Therefore, I have taken the liberty of using his ideas, which I learned both in his class and in many pleasant conversations in the hallway and at the blackboard, to write this book. Although it is dedicated to his memory, I almost regard it as a joint work. Also, I would like to thank my friends at the Universidad AutOnoma de Madrid who encouraged me to teach this course and to write this book. The book was first published in Spanish in the ColecciOn de Estudios of the Universidad Aut6noma de Madrid (1991), and then was republished with only some minor typographical corrections in a joint edition of AddisonWesley/Universidad AutOnoma de Madrid (1995). From the very beginning some colleagues suggested that there would be interest in an English translation which I never did. But when Professor David Cruz-Uribe offered to translate the book I immediately accepted. I realized at once that the text could not remain the same because some of the many developments of the last decade had to be included in the informative sections closing each chapter together with a few topics omitted from the first edition. As a consequence, although only minor changes have been introduced to the core of the book, the sections named "Notes and further results" have been considerably expanded to incorporate new topics, results and references. The task of updating the book would have not been accomplished as it has been without the invaluable contribution of Professor Cruz-Uribe. Apart from reading the text, suggesting changes and clarifying obscure points, he did a great work on expanding the above mentioned notes, finding references and proposing new results to be included. The improvements of this book with respect to the original have certainly been the fruit of our joint work, and I am very grateful to him for sharing with me his knowledge of the subject much beyond the duties of a mere translator. Javier Duoandikoetxea Bilbao, June 2000

Preface

xv

Acknowledgment: The translator would like to thank the Ford Foundation and the Dean of Faculty at Trinity College for their generous support during the academic year 1998-99. It was during this year-long sabbatical that this project was conceived and the first draft of the translation produced.

Preliminaries

Here we review some notation and basic results, but we assume that they are mostly well known to the reader. For more information, see, for example, Rudin [14]. In general we will work in Rn. The Euclidean norm will be denoted by H If x E IP and r > 0, B(x,r) = {y E R n : - yl < r} is the ball with center x and radius r. Lebesgue measure in IV is denoted by dx and on the unit sphere Sn -1 in ItIn by do . If E is a subset of W1 , IEI denotes its Lebesgue measure and XE its characteristic function: XE(x) = 1 if x E E and 0 if x cl E. The expressions almost everywhere or for almost every x refer to properties which hold except on a set of measure 0; they are abbreviated by "a.e." and "a.e. x." -

If a = (al,

, a n ) E NT' is a multi-index and f :

Da f =

C, then

f 841 • • • 34"

where la' = al + • • + a n and x° = Let (X, p) be a measure space. LP(X,p,), 1 < p < oo, denotes the Banach space of functions from X to C whose firth powers are integrable; the norm of f E LP(X,p) is

IlfIlp=

(f

1/p

IflPdp)

• xvii

xviii

Preliminaries

L' (X, denotes the Banach space of essentially bounded functions from X to C; more precisely, functions f such that for some C > 0,

Chapter 1

,u({x EX:If (x)1 > C}) = 0. The norm of f, Ilf11,,, is the infimum of the constants with this property. In general X will be 1R (or a subset ofilln) and dm = dx; in this case we often do not give the measure or the space but instead simply write L. For general measure spaces we will frequently write LP(X) instead of LP(X, /.2); if is is absolutely continuous and du w dx we will write LP(w). The conjugate exponent of p is always denoted by p': 1 1 — — 1. P

Fourier Series and Integrals

The triangle inequality on LP has an integral version which we refer to as Minkowski's integral inequality and which we will use repeatedly. Given measure spaces (X, /2) and (Y, v) with d-finite measures, the inequality is 1/p

f (x, y) dv (y) dp,(x))

Lu

x' f (x, y)IP dit(x))

1/p

dv(y).

The convolution of two functions f and g defined on 1n is given by

f * g(x) = f f (y)9(x - Y) dY f f (x - 09(Y) dY 118. whenever this expression makes sense. The spaces of test functions are C e'(Rn), the space of infinitely differentiable functions of compact support, and S(ln), the so-called Schwartz functions. A Schwartz function is an infinitely differentiable function which decreases rapidly at infinity (more precisely, the function and all its derivatives decrease more rapidly than any polynomial increases). Given the appropriate topologies, their duals are the spaces of distributions and tempered distributions. It makes sense to define the convolution of a distribution and a test function as follows: if T E C7(R n ) / and f E C7(118n), then T * f (x) (T, Tx h where Ay) = f(—y) and Tx f (y) f (x y). Note that this definition coincides with the previous one if T is a locally integrable function. Similarly, we can take T E S(Rnr and f E S (8 n ). We denote the duality by either (T, h or T(f) without distinction. References in square brackets are to items in the bibliography at the end of the book. Finally, we remark that C will denote a positive constant which may be different even in a single chain of inequalities.

1. Fourier coefficients and series The problem of representing a function f, defined on (an interval of) IR, by a trigonometric series of the form

00

(1.1)

f (x) =

E ak cos(kx) + bk sin(kx)

k=0

arises naturally when using the method of separation of variables to solve partial differential equations. This is how J. Fourier arrived at the problem, and he devoted the better part of his Theorie Analytique de la Chaleur (1822, results first presented to the Institute de France in 1807) to it. Even earlier, in the middle of the 18th century, Daniel Bernoulli had stated it while trying to solve the problem of a vibrating string, and the formula for the coefficients appeared in an article by L. Euler in 1777. The right-hand side of (1.1) is a periodic function with period 27r, so f must also have this property. Therefore it will suffice to consider f on an interval of length 27r. Using Euler's identity, eikx = cos(kx) + i sin(kx), we can replace the functions sin(kx) and cos(kx) in (1.1) by fe ikx : k E Z1; we will do so from now on. Moreover, we will consider functions with period 1 instead of 27r, so we will modify the system of functions to { e 2riks k E Z}. Our problem is thus transformed into studying the representation of f by (1.2)

(x) = Y cke27rikx k=— co

1

2

1. Fourier Series and Integrals

If we assume, for example, that the series converges uniformly, then by multiplying by e-27rinix and integrating term-by-term on (0,1) we get

2. Criteria for pointwise convergence

In order to study S N (x) we need a more manageable expression. Dirichlet wrote the partial sums as follows:

fl

cm =

f (x)e-27inlx dx

s N 1( x ) =

because of the orthogonality relationship (1.3)

1

e 27rik x e -27rimnx dx = {0 if k

=

m

1 if k = m.

Denote the additive group of the reals modulo 1 (that is IR/Z) by T, the one-dimensional torus. This can also be identified with the unit circle, S 1 . Saying that a function is defined on T is equivalent to saying that it is defined on R and has period 1. To each function f E L 1 (T) we associate the sequence {f(k)} of Fourier coefficients of f, defined by (1.4)

f(k) =

f(x ) e -27rikx dx.

The trigonometric series with these coefficients, 00 f(k) ,27riki, (1.5)

E

E f(t)e-21,2kt dt e 1

k=—N 1

f

27rtkx

0

f (t)D N(x - t) dt

= f f (x - t)DN(t) dt, where DN is the Dirichlet kernel, DN(t) =

Ee

27rikt .

k=—N

If we sum this geometric series we get (1.6)

DN(t) =

sin(x(2N + 1)t) sin(7rt)

This satisfies

f

o lDN(t)dt = 1

k=—oo

is called the Fourier series of f.

3

and

1

IDN

S< < 1/2.

sin(xS)'

We will prove two criteria for pointwise convergence.

Our problem now consists in determining when and in what sense the series (1.5) represents the function f . 2. Criteria for pointwise convergence Denote the N-th symmetric partial sum of the series (1.5) by SN (x); that is, f (k) e 2niks

S'N f (X) = k=—N

Note that this is also the N-th partial sum of the series when it is written in the form of (1.1). Our first approach to the problem of representing f by its Fourier series is to determine whether lim SN1(x) exists for each x, and if so, whether it is equal to f (x). The first positive result is due to P. G. L. Dirichlet (1829), who proved the following convergence criterion: if f is bounded, piecewise continuous, and has a finite number of maxima and minima, then lim S Nf (x) exists and is equal to -Pf (x+) + f (x-)]. Jordan's criterion, which we prove below, includes this result as a special case.

Theorem 1.1 (Dini's Criterion). If for some x there exists 6 > 0 such that

f (x + t) - 1(x) t

11.t1< .5

dt < co,

then lim SN1(x)

N—>co

= 1(4

Theorem 1.2 (Jordan's Criterion). If f is a function of bounded variation in a neighborhood of x, then Jim SN (x)

= 2 [f (x+) + f (s

-

)]•

At first it may seem surprising that these results are local, since if we modify the function slightly, the Fourier coefficients of f change. Nevertheless, the convergence of a Fourier series is effectively a local property, and if the modifications are made outside of a neighborhood of x, then the behavior of the series at x does not change. This is made precise by the following result.

4

1. Fourier Series and Integrals

Theorem 1.3

hood of x, then

(Riemann Localization Principle). If f is zero in a neighbor-

5

2. Criteria for pointwise convergence

Proof of Theorem 1.3.

fi rn SN f (x) = 0.

Suppose that f (t) = 0 on (x - 5, x + 6). Then f(x t)

SN f (x) =

N—,oo

sin(7(2N

+ 1)t) dt

sin(irt)

J3iti<1/2 = (ge 71. )-(N)+ (9e'r(-N),

An equivalent formulation of this result is to say that if two functions agree in a neighborhood of x, then their Fourier series behave in the same way at x.

where

From the definition of Fourier coefficients (1.4) it follows immediately that

is integrable. By the Riemann-Lebesgue lemma we conclude that

If (k)i

2i sin(7t) X{6
g(t)

lim SNf (x) = 0.

IIf Iii,

N

but a sharper estimate is true which we will use to prove the preceding results. Lemma 1.4

f (x - t)

(Riemann-Lebesgue). If f E L 1 (T) then lim f (k) = 0.

1=1 Proof of Theorem 1.1. SN f

Since the integral of DN equals 1,

(x) - f (x) = f

1/2

-1/2

[f (x - t) - f (x)]

sin(7(2N

+ 1)t)

sin(irt)

dt

= fiti<6 1,1t,<1,2• Proof.

Since E 27rix has period 1, J(k)f =_ - =-

By the Riemann-Lebesgue lemma both of these integrals tend to 0. The second if we argue as in the previous proof, the first since by hypothesis the function

f ( x ) e -27rikx dx

J f (x - 1/2k)e

sin(7t) X{ I t l <8} (t) is integrable. (Recall that if

-21r a k x dx.

Hence,

f (k)

f (x - t) - f (x)

f ( x ) e -27rik(x+v2k) dx

fl

0 [f (x) - f(x - 1/2k)] e -271-th r dx.

If f is continuous, it follows immediately that lim f (k) = 0. ikj+00 For arbitrary f E L l (T), given E > 0, choose g

continuous such that II f - 9111 < €/2 and choose k sufficiently large that 1,g(k)1 < E/2. Then j(k)l < f(f - gr(k)I + Ig(k)I < II f - 9111 + 1.4(k)1 < €.

< 6, sin(7t) and in are equivalent.)

1=1

Since every function of bounded variation is the difference of two monotonic functions, we may assume that f is monotonic in a neighborhood of x. Since Proof of Theorem 1.2.

1/2 SN f (X) =

1-1/2

1/2

f (X — t)DN(t)

dt =-

/0

[f (x - t) + f (x + t)]DN(t) dt,

it will be enough to show that for g monotonic lim

/ 1/2 1 g(t)DN(t) dt = - g(0+).

N—,co 0

2

Further, we may assume that g(0+) = 0 and that g is increasing to the right of 0. Given € > 0, choose 6 > 0 such that g(t) < E if 0 < t < 6. Then

f

1/22 g(t)DN(t) dt f 6 1 + / 0 (5

6

1. Fourier Series and Integrals

Again by the Riemann-Lebesgue lemma, the second integral tends to 0. We iy the second mean value theorem for integrals' to the first integral. Then for some v, 0 < v < 5,

fo g(t)DN(t) dt = g(b 8

Furthermore,

f:DN(t) dt

fv

< C.

—)

fD

N (t)

1 sin(7r(2N + 1)t) (sin(7rt) 6 sin(7r(2N + 1)t) dt J v 7rt 1 1 dt + 2 sup irt 6 sin(7rt) m>o

dt.

dt

m sin(7rt) dt

a

IITaxIIY = 00 .

(Recall that the operator norm of Ta IITa = supfilTa xily : ,Chap 1 ^ 1}.) A proof of this result can be found, for example, in Rudin [141xlix ter 5]. Now let X = C(T) with the norm II • II„, and let Y = C. Define TN : X Y by

-

1/2

TN

f=

SN f (0)

= f

f (OD

N(t) dt.

—1/2

LN

by 1/2

LN 8

g(t)DN(t) dt < Cc.

3. Fourier series of continuous functions

If f satisfies a Lipschitz-type condition in a neighborhood of x, that is, (x — f (x)1 < CIO' for some a > 0, Iti < d , then Dini's criterion applies to it. However, continuous functions need not satisfy this condition or any other convergence criterion we have seen. This must be the case because of the following result due to P. du Bois-Reymond (1873). Theorem 1.5. There exists a continuous function whose Fourier series di-

verges at a point.

Du Bois-Reymond constructed a function with this property, but we will show that one exists by applying the uniform boundedness principle, also known as the Banach-Steinhaus theorem. Lemma 1.6

(Uniform Boundedness Principle). Let X be a Banach space, Y a normed vector space, and let {Ta } aE A be a family of bounded linear h monotonic on [a, bb then there exists c, a < c < b, such that

z

sup ga ll < oo

Define the Lebesgue numbers

1

7

operators from X to Y. Then either

511 P a

1 7rt

or there exists x E X such that

Hence,

1 If is continuous and

3. Fourier series of continuous functions

b

h0=h(b — ) f 0+ h(a+) f 0. a

IDN(t)Idt; —

1/2

it is immediate that ITN f Ii< LNII f III DN(t) has a finite number of zeros so sgn DN(t) has a finite number of jump discontinuities. Therefore, by modifying it on a small neighborhood of= each discontinuity, we can form a continuous function f such that 1 and ITN f I > LN — E. Hence, I I TNII = LN. Thus if we can prove that LN 00 as N ---* oo, then by the uniform boundedness principle there exists a continuous function f such that lim sup I SN f

= 00;

N—>co

that is, the Fourier series of f diverges at 0. 4 Lemma 1.7. LN = —2 log N + 0(1). Proof. 1/2

LN = 2

=2

Jo

sin(7r(2N + 1)t)

N+1/2 sin(7rt) dt + 0(1) 7rt

N-1 k+1

=2

E

k:-=0

2 N-1 k

dt + 0(1)

f

0

sin(7rt) dt + 0(1) 7rt

I

1 sin(711)1

t

dt + 0(1)

8

1. Fourier Series and Integrals

f 0

N-1 1

I sin(7rt)I k=1

t

k

dt + 0(1)

5. Summability methods

9

Theorem 1.9. The mapping f 1—> {j(k)} is an isometry from L 2 to £ 2 , that is,

E CC

= 4 log N + 0(1).

=

7r

k=-oo

If(k)I2.

Convergence in norm in L 2 follows from this immediately.

4. Convergence in norm The development of measure theory and LP spaces led to a new approach to the problem of convergence. We can now ask: (1) Does lim II St■Ti — fli p N-4co

0

for f E LP (T)?

(2) Doeslim SNf(x) = f (x) almost everywhere if f E LP(T)? N

We can restate the first question by means the following lemma.

Lemma 1.8.

SN f converges to f in LP norm, 1 < p < co, if and only if

there exists Cp independent of N such that (1.7) IISNf lip 5- Cpilf lip-

Proof. The necessity of (1.7) follows from the uniform boundedness principle.

5. Summability methods In order to recover a function f from its Fourier coefficients it would be convenient to find some other method than taking the limit of the partial sums of its Fourier series since, as we have seen, this approach does not always work well. One such method, Cesaro summability, consists in taking the limit of the arithmetic means of the partial sums. As is well known, if lim ak exists then a + • + ak lim 1 also exists and has the same value.

To see that it is sufficient, first note that if g is a trigonometric polynomial, then SNg = g for N > deg g. Therefore, since the trigonometric polynomials are dense in LP (see Corollary 1.11), if f E LP we can find a trigonometric polynomial g such that II f — g11 p < e, and so for N sufficiently large

IIsNf - flip 5_11sN(f - g)11p+11sNg - 911p+ Ilf

The second question is much more difficult. A. Kolmogorov (1926) gave an example of an integrable function whose Fourier series diverges at every point, so the answer is no if p = 1. If f E LP, 1 1). Until the result by Carleson, the answer was unknown even for f continuous.

-

Define UN f (X) =

k=0

= I f (t)

o

=

When p = 2, the functions e2iTikx} form an orthonormal system (by (1.3)) which is complete (i.e. an orthonormal basis) by the density of the trigonometric polynomials in L 2 . Therefore, we can apply the theory of Hilbert spaces to get the following.

Eskf(x)

1

slip c (Cp+ 1 )e.

If 1 < p < oo, then inequality (1.7) holds, as we will show in Chapter 3. When p = 1, the L 1 operator norm of SN is again LN, and so by Lemma 1.7 the answer to the first question is no.

N +1

J

1

1

ED k (z-t)dt

N +1 k=0

f(t)FN(X — t) dt,

where FN is the Fejer kernel, FN(t) = N 1+ 1

EDk(o _ N 1+ 1 (n(7r si si(Nr+t) 1)t) ) 2 . k=0

FN has the following properties:

FN(t) > 0,

I

10

1. Fourier Series and Integrals

(1.8)

j k I 27zkt Pr(t) = E r e

lim FN (t) dt = 0 if 6 > 0. N -*°° f5
k=-oo

Because FN is positive, its L 1 norm coincides with its integral and is 1. This is not the case for the Dirichlet kernel: its integral equals 1 because of cancellation between its positive and negative parts while its Ll norm tends to infinity with N. Theorem 1.10. If f E LP, 1 0, fol Pr (t) dt = 1,

(1.10) lim

<

f

iti<6

lim iiPr * f — f Ilp = O.

- t) - f (•)Il p FN(t) dt f (. - t) - f (•)11 7,FN(t) dt +

211fli p F f N(t)

dt.

Since for 1 0.

Theorem 1.12. If f E LP, 1 < p < oo, or if f is continuous and p = co, then

p 1/2

- 1/2

/01 <1/2

Therefore, we can prove a result analogous to Theorem 1.10.

Proof. Since f FN = 1, by Minkowski's inequality we have that f fllp =

1 - r 2 1 - 2r cos(27t) + r 2

is the Poisson kernel. The Poisson kernel has properties analogous to those of the Fejer kernel:

7. -4-

fllp = O-

Nlim ->co

Since the function u is harmonic on Izi < 1, it is the solution to the Dirichlet problem:

Du -= 0 if Izi < 1,

Ilf (. — t) — g.)11p = 0,

and the same limit holds if p = oo and f is continuous, the first term can be made as small as desired by choosing a suitable S. And for fixed 6, by (1.8) the second term tends to 0. ❑ Corollary 1.11.

u = f if 1z1= 1 , where the boundary condition is interpreted in terms of Theorem 1.12. In Chapter 2 we will study the almost everywhere convergence of o- N f (x) and Pr * f (x).

(1) The trigonometric polynomials are dense in LP, 1 < p < oo.

6. The Fourier transform of L l functions

(2) If f is integrable and f(k) = 0 for all k, then f is identically zero.

Given a function f E Ll(litn), define its Fourier transform by

A second summability method is gotten by treating a Fourier series as the formal limit on the unit circle (in the complex plane) of 0.

(1.9)

u(z) =

E f(ozk+ k=0

-1

f (k)Iki

,

z = re 2 ' Jo

f (0 = f f (x)e -27' . dx,

(1.11)

lir'

where x • e = x1 + x262 + of the Fourier transform:

k=-oo

•••+

x,,,

n.

The following is a list of properties

(1.12) (a f + Ogr = a f + 0."g (linearity); Since If (k)} is a bounded sequence, this function is well defined on Izi < 1. (1.13) and f is continuous; II/1100 5 Iii as It can be rewritten 00 lim f (0 = 0 (Riemann-Lebesgue); 12 u fr e zirie ) = porikle2iriko_f/2 lel —> CC' f (t)Pr (9 - t) dt, -1/2 (f * 9r= f9; k=-oo (1.15)

E

/

11

where

r(t) dt = 1,

liFN111 = f

6. The Fourier transform of L' functions

Ilf

12

(1.16)

1. Fourier Series and Integrals

(Thf )-() = f , where Thf (x) = f (x + h); ( f e 27rih•x (o = f - h); )

(1.17)

(1.18)

if p E O n (an orthogonal transformation), then (i(P . ))-(0 =

f (p);

if g(x) =

f (A -l x), then

Ca

(-27rix.i.fr(6) =

The space of bounded linear functionals on S, , is called the space of tempered distributions. A linear map T from S to C is in 5' if lim T(qk) = 0 whenever lim

k--■oo

c5k = 0 in S.

Theorem 1.13. The Fourier transform is a continuous map from S to S such that

(6).

rg = fli,

The continuity of f follows from the dominated convergence theorem; (1.14) can be proved like Lemma 1.4; the rest follow from a change of variables, Fubini's theorem and integration by parts. In (1.19) we assume that Of/Oxi E Ll and in (1.20) that xj f E Ll. Unlike on the torus, Ll(Rn) does not contain LP(111n), p > 1, so (1.11) does not define the Fourier transform of functions in those spaces. For the same reason, the formula which should allow us to recover f from f ,

13

With this topology S is a Frechet space (complete and metrizable) and is dense in LP(R), 1 < p < oo. In particular, S C L 1 and (1.11) defines the Fourier transform of a function in S.

Xe) = 1(A);

f ) (6) = 27 i6if Ox j x)

(1.20)

7. The Schwartz class and tempered distributions

f (x) = f (6)0" c16. -

Equality (1.22) is referred to as the inversion formula. To prove Theorem 1.13 we need to compute the Fourier transform of a particular function. Lemma 1.14. If (x) e - 11#12 then f (e)

may not make sense since (1.13) and (1.14) are all that we know about f, and they do not imply that f is integrable. (In fact, f is generally not integrable.)

Proof. We could prove this result directly by integrating in C. but we will give a different proof here. It is enough to prove this in one dimension, since in II8n f is the product of n identical integrals. The function f (x) = e -7x2 is the solution of the differential equation + 27rxu = 0,

7. The Schwartz class and tempered distributions A function f is in the Schwartz class, S(IRn), if it is infinitely differentiable and if all of its derivatives decrease rapidly at infinity; that is, if for all a, /3 E sup Ix a R3 f(x)i = Pa,n(f) < oo. Functions in C c" are in 5, but so are functions like e -ix12 which do not have compact support. The collection {p,„,01 is a countable family of seminorms on S, and we can use it to define a topology on S: a sequence {0k} converges to 0 if and only if for all a, /3 E kli p a 0(0k) = 0.

u(0) = 1. By (1.19) and (1.20) we see that u satisfies the same differential equation with the initial value 11(0) = f u(x) dx = I e -7X2 dx = 1. Therefore, by uniqueness,

f = f.

Proof of Theorem 1.13. By (1.19) and (1.20) we have

Do f =

C(Da x/3 f)-(),

SO

CrY3f()1 _< CIIDaxafIii

14

I. Fourier Series and Integrals

The Ll norm can be bounded by a finite linear combination of seminorms of f, which implies that the Fourier transform is a continuous map from S to itself. Equality (1.21) is an immediate consequence of Fubini's theorem since f (x)g(y) is integrable on Rn x From (1.18) and (1.21) we get

f f (x)"g(Ax) dx = ff (x)A

-

g (A -1 x) dx.

If we make the change of variables Ax = y in the first integral, this becomes

f

f (A -1 x)g(x) dx = f f (x)g(A -1 x) dx;

if we then take the limit as A

co, we get

f (0) ( f "g x) dx = g(0) f f (x) dx.

8. The Fourier transform on LP, 1 < p < 2

Theorem 1.17. The Fourier transform is a bounded linear bijection from 5' to S' whose inverse is also bounded. Proof. If Tn T in S', then for any f E S, Tn(f )

f(0) = f f(e)d, which is (1.22) for x = 0. If we replace f by Tx f, then by (1.16),

f (x) = (Tx f)(0) = f (Tx fr(0 d = f f

Furthermore, the Fourier transform has period 4, so its inverse is equivalent to applying it 3 times; therefore, its inverse is also continuous.

❑

If we define T by t(f)=T(f), then it follows from Corollary 1.15 that

(Tfl= T. And if T E L l then by the inversion formula we get that T(x) = in particular, T is a bounded, continuous function.

If f E LP, 1 < p < oo, then f can be identified with a tempered distribution: for cb E S define

T1(0) = f f Clearly this integral is finite. To see that Tf is continuous, suppose that oo. Then by Holder's inequality, 0 in S as k (bk

If we let f (x) = f (-x), we get the following corollary.

(Jr=

Corollary 1.15. For f E 5, f , and so the Fourier transform has period 4 (i.e. if we apply it four times, we get the identity operator). Definition 1.16. The Fourier transform of T E S' is the tempered distribution T given by f E S.

By Theorem 1.13, T is a tempered distribution, and in particular, if T is an integrable function, then T coincides with the Fourier transform defined by equation (1.11). Likewise, if is a finite Borel measure (i.e. a bounded linear functional on Co(P), the space of continuous functions which vanish at infinity), then is the bounded continuous function given by

f

= Tn(f) T ( f ) = (f)•

8. The Fourier transform on LP, 1 < p < 2

Let g(x) = e'lx 12 ; then by Lemma 1.14,

T(f) = T(f),

15

e -27rix• d it ( x ) .

llin

For .5, the Dirac measure at the origin, this gives us b = 1.

ITf(4k)I

Ilf IlplIsbk112Y-

norm of functions of the form xaq5k, Then II Ok II p' is dominated by the and so by a finite linear combination of seminorms of Ok; hence, the left-hand side tends to 0 as k -+ oo. Moreover, when 1 co lxi
1 2 is referred to as the Plancherel theorem.

16

1. Fourier Series and Integrals

Proof. Given f, h E S, let g = h, so that g = h. Then by (1.21) we have that (1.23)

fh= f

9. The convergence and summability of Fourier integrals

17

We digress to give another corollary of Riesz-Thorin interpolation which is not directly related to the Fourier transform but which will be useful in later chapters. Corollary 1.21 (Young's Inequality). If f E LP and g E L 9 , then f * g E

If we let h = f then we get IIf112 = 11/11 2 for f E S. Since S is dense in L 2 , the Fourier transform extends to all f in L 2 with equality of norms. Finally, the continuity of the Fourier transform implies the given formulas for f and f as limits in L 2 , since f 703(0,R) and fxB(o,R) converge to f and f in L 2 . ❑ If f E LP, 1 i) and f2 = f — Therefore, / = fl + f2 E + L 2 . However, by applying an interpolation theorem we can get a sharper result. Theorem 1.19 (Riesz-Thorin Interpolation). Let 1 < po, pi, q0, qi < oo,

and for 0 < 0 <1 define p and g by 1 1-0 0 1 1-0 0 + , + . P Po P1 qi q qo If T is a linear operator from LP° + LP' to Lq° + Lql such that

IIT f'Igo

mollfIlp.

for f E

and 11T f

f

for f E LP'

then MO-8 019 11f Ilp for f ELP.

L 7', where 1/r +1 = 1/p + 1/q, and

IIf*911,

IlflIplIglIg-

Proof. If we fix f E LP we immediately get the inequalities

IV* 91Ip C IlflIplIgIll and

IIf *911. C lIfIlplIglIp'• The desired result follows by Riesz-Thorin interpolation.

❑

9. The convergence and summability of Fourier integrals The problem of recovering a function from its Fourier transform is similar to the same problem for Fourier series. We need to determine if and when lim _a—.fBR

f

e 27ix. =

f (x),

where BR = {Rx : x E B}, B is an open convex neighborhood of the origin, and the limit is understood either as in LP or as pointwise almost everywhere. If we define the partial sum operator SR by

(SR.i)

= xBR f ,

then this problem is equivalent to determining if The proof of this result uses the so-called "three-lines" theorem for analytic functions; it can be found, for example, in Stein and Weiss [18, Chapter 5] or Katznelson [10, Chapter 4]. Corollary 1.20 (Hausdorff-Young Inequality). If f E LP, 1 < p < 2, then f E LP' and f IIp-

Proof. Apply Theorem 1.19 using inequality (1.13), Ilfjj ac Mk, and the Plancherel theorem, II f 112 =II f 112. ❑

lim SR/ = f. R-00 Analogous to Lemma 1.8, a necessary and sufficient condition for convergence in norm is that

IISRIli

cplIfIlp,

where Cp is independent of R. When n = 1 this is the case; we will prove this in Chapter 3. We will also prove several partial results when n > 1, but in general there is no convergence in norm when p 2. We will discuss this in Chapter 8. In the case n 1, if B = (-1, 1) then

SR f (x) = DR * f (x),

18

1. Fourier Series and Integrals

where DR is the Dirichlet kernel,

—

sin 271-Rx

R

)

7TX

This is clearly not integrable, but it is in Lq(R) for any q > 1, so DR * f is well defined if f E LP, 1 < p < oo. Almost everywhere convergence depends on the bound

II sup ISRf Hip C Cplif IIP R

.

This holds if 1 < p < oo (the Carleson-Hunt theorem) but we cannot prove it here. For the Fourier transform, the method of Cesar° summability consists in taking integral averages of the partial sum operators, f RS

o- Rf(x) R

t 0

f (x) dt,

19

lim w(x, t) = f (x)

(1.28) (

e 27rix

DR(X)

10. Notes and further results

t->o+

in LP or pointwise almost everywhere. One can show that u(x, t) is harmonic in the half-space IV++ 1 = Rn x (0, co). When n = 1 we have an equivalent formula analogous to (1.9): Do z= x it, (1.29) u(z) f f ()e 27riz + f f e 2lriz e 13

which immediately implies that u is harmonic. The limit (1.27) can be interpreted as the boundary condition of the Dirichlet problem, Au = 0 on RV, u(x, 0) = f (x), x E R n . It follows from (1.25) that u(x, t) = Pt * 1(x),

where /5,(0 = e-211-t0. One can prove by a simple calculation if n = 1, and a more difficult one when n > 1 (see Stein and Weiss [18, p. 6]), that u Rf (r) = FR * 1 (x), F (1) ( 1.30) = Pt(x) where FR is the Fejer kernel, (t2 ± 1 x 12)±-1 • 1 sin 2 Rx) This is called the Poisson kernel. (1.24) FR(x) = R D t (x) dt R 0 R(71- x) 2 In the case of Gauss-Weierstrass summability, one can show that the Unlike the Dirichlet kernel, the Fejer kernel is integrable. Since it has propfunction w(x, t) = w(x, Vzirrt) is the solution of the heat equation erties analogous to (1.8), one can prove that in LP, 1 < p < co, 0U- .) - Aw = 0 on IRV, Ot lira o- R f = f. w(x , 0) = f (x) x E The proof is similar to that of Theorem 1.10. In Chapter 2 we will prove two and (1.28) can be interpreted as the initial condition for the problem. We general results from which we can deduce convergence in LP and pointwise also have the formula almost everywhere for this and the following summability methods. w(x, t) = Wt * f (x), The method of Abel-Poisson summability consists in introducing the factor e -27rtil into the inversion formula. Then for any t > where Wt is the Gauss-Weierstrass kernel, 0 the integral converges, and we take the limit as t tends to 0. If we instead introduce the Wt(x) = t - ne - -Ix1 2 / 0 . (1.31) factor e' t2 IW, we get the method of Gauss-Weierstrass summability. More This formula can be proved using Lemma 1.14 and (1.18). precisely, we define the functions and determining if lim o- Rf (x) f (x). When n = 1 and B = (-1,1),

(1.25)

u(x, t) = f

(1.26) (x w

f

and then try to determine if (1.27) lim u(x, t) t-,0+

f(e)e27rix.c c,

10. Notes and further results

f(e)e2rix•

10.1. References. The classic reference on trigonometric series is the book by Zygmund [21], which will also be a useful reference for results in the next few chapters. However, this work can be difficult to consult at times. Another comprehensive reference on trigonometric series is the book by Bary [1].

f (x),

20

1. Fourier Series and Integrals

There are excellent discussions of Fourier series and integrals in Katznelson [10] and Dym and McKean [4]. The book by R. E. Edwards [5] is an exhaustive study of Fourier series from a more modern perspective. The article by Weiss [20] and the book by KOrner [12] are also recommended. An excellent historical account by J. P. Kahane on Fourier series and their influence on the development of mathematical concepts is found in the first half of [9]. The book Fourier Analysis and Boundary Value Problems, by E. Gonzalez-Velasco (Academic Press, New York, 1995), contains many applications of Fourier's method of separation of variables to partial differential equations and also contains historical information. (Also see by the same author, Connections in mathematical analysis: the case of Fourier series, Amer. Math. Monthly 99 (1992), 427-441.) The book by 0. G. Jorsboe and L. Melbro (The Carleson-Hunt Theorem on Fourier Series, Lecture Notes

in Math. 911, Springer-Verlag, Berlin, 1982) is devoted to the proof of this theorem. The original references for this are the articles by L. Carleson

(On convergence and growth of partial sums of Fourier series, Acta Math. 116 (1966), 135-157) and R. Hunt (On the convergence of Fourier series, Orthogonal Expansions and their Continuous Analogues (Proc. Conf., Edwardsville, Ill., 1967), pp. 235-255, Southern Illinois Univ. Press, Carbondale, 1968). Kolmogorov's example of an L 1 function whose Fourier series diverges everywhere appeared in Une serie de Fourier-Lebesgue divergente partout (C. R. Acad. Sci. Paris 183 (1926), 1327-1328). 10.2. Multiple Fourier series. Let T 71 be the n-dimensional torus (which we can identify with the quotient group IEV/Zn). A function defined on Tn is equivalent to a function on Rn which has period 1 in each variable. If f E L i (Rn) then we can define its Fourier coefficients by

(v) = f f ( x ) e -27ris•v

it-G, V

E Zn ,

vEZu

One can prove several results similar to those for Fourier series in one variable, but one needs increasingly restrictive regularity conditions as n increases. See Stein and Weiss [18, Chapter 7].10.3. The Poisson summation formula. Let f be a function such that for some 6 > 0,

If( )I

21

(In particular, f and f are both continuous.) Then

E

f(x + v) = f (v)e 2 '" vEz , vez This equality (or more precisely, the case when x 0) is known as the Poisson summation formula and is nothing more than the inversion formula. The left-hand side defines a function on T n whose Fourier coefficients are precisely f (v). 10.4. Gibbs phenomenon. Let f (x) = sgn(x) on (-1/2, 1/2). By Dirichlet's criterion, for example, we know that SN f (x) converges to f(x) for all x. To the right of 0 the partial sums oscillate around 1 but, contrary to what one might expect, the amount by which they overstep 1 does not tend to 0 as N increases. One can show that ( =2 0/ sin(Y) y dy Jim sup SNf N- 00 x

1.17898... .

,

This phenomenon occurs whenever a function has a jump discontinuity. It is named after J. Gibbs, who announced it in Nature 59 (1899), although it had already been discovered by H. Wilbraham in 1848. See Dym and McKean [4, Chapter 1] and the paper by E. Hewitt and R. E. Hewitt (The Gibbs-Wilbraham phenomenon: an episode in Fourier analysis, Arch. Hist. Exact Sci. 21 (1979/80), 129-160). Gibbs phenomenon is eliminated by replacing pointwise convergence by Cesar° summability. For if m < f (x) < M, then by the first two properties of Fejer kernels in (1.8), m < Q N f (x) < M. In fact, it can be shown that if M < f (x) < M on an interval (a, b), then for any e > 0, m - E < aN f (X) < M e on (a + e, b - e) for N sufficiently large.

Corollary 1.20 was gotten by an immediate application of Riesz-Thorin interpolation. But in fact a stronger inequality is true: if 1 < p < 2 then

f( v ) e 2n-zs.v .

If(x)I < A(1 + Ix1) -n-a and

10.5. The Hausdorff-Young inequality.

and construct the Fourier series of f with these coefficients,

E

10. Notes and further results

_< A(1 +

.

„1/p 11/11P'

( (;)1/p'

n/2

Ilf

This inequality is sharp since equality holds for f (x) = e'lx 12 . This result was proved by W. Beckner (Inequalities in Fourier analysis, Ann. of Math. 102 (1975), 159-182); the special case when p is even was proved earlier by K. I. Babenko (An inequality in the theory of Fourier integrals (Russian), Izv. Akad. Nauk SSSR Ser. Mat. 25 (1961), 531-542).

22

1. Fourier Series and Integrals

In the same article, Beckner also proved a sharp version of Young's inequality (Corollary 1.21). 10.6. Eigenfunctions for the Fourier transform in L 2 (R). Since the Fourier transform has period 4, if f is a function such that f Af , we must have that A 4 1. Hence, A ±1, ±i are the only possible eigenvalues of the Fourier transform. Lemma 1.14 shows that exp(-7rx 2 ) is an eigenfunction associated with the eigenvalue 1. The Hermite functions give the remaining eigenfunctions: for n > 0,

h

n (x)

=

() n

n

10. Notes and further results

23

Theorem 1.22. Let {Tz } be an admissible family of operators, and suppose that for 1 < po, pi, go, qi < co and y E R,

IlTiallgo 5 Mo(Y)11fIlpo

and

IlTi+i y f

MI(y)IIf il1,

where for some b < sup e -b lul log Mi(y) < oo yez

,

j = 1, 2.

Then for 0 < 6 < 1, Re z 0 and p, q defined as in Theorem 1.19, there exists a constant Mo such that

exp(7rx 2 ) — dn exp(-7rx 2 )

iiTz f Ilq c AfellfIlp.

dx n

satisfies h n = (—i)nk,. If we normalize these functions, en hn [(47)-n \72 1]. / 2 h n

Ilhnii2 we get an orthonormal basis of L 2 (18) such that

f

CO

eri)en•

n=0

Thus L 2 (R) decomposes into the direct sum Ho ED H 1 G I/2 ED H3, where on the subspace Hi, 0 < j < 3, the Fourier transform acts by multiplying functions by This approach to defining the Fourier transform in L 2 (R) is due to N. Wiener and can be found in his book ( The Fourier Integral and Certain of

its Applications, original edition, 1933; Cambridge Univ. Press, Cambridge, 1988). Also see Dym and McKean [4, Chapter 2].

In higher dimensions, the eigenfunctions of the Fourier transform are products of Hermite functions, one in each coordinate variable. Also see Chapter 4, Section 7.2. 10.7. Interpolation of analytic families of operators. The Riesz-Thorin interpolation theorem has a powerful generalization due to E. M. Stein. (See Stein and Weiss [18, Chapter 5].) Let S = {z E C : 0 < Re z < 1} and let {Tz } zE s be a family of operators. This family is said to be admissible if given two functions f, g E Ll (IV), the mapping

fRzn

Tz(f)g dx

is analytic on the interior of S and continuous on the boundary, and if there exists a constant a < it such that zi log is uniformly bounded for all z E S.

f

Tz(f)9 dx

10.8. Fourier transforms of finite measures. As we noted above, if bt is a finite Borel measure then f is a bounded, continuous function. The collection of all such functions obtained in this way is characterized by the following result. Theorem 1.23. If h is a bounded, continuous function, then the following

are equivalent: (1) h = Ti for some positive, finite Borel measure tt; (2) h is positive definite: given any f E L l (R n ),

frf0

h(x - y) f (x) f (y) dxdy O.

This theorem is due to S. Bochner (Lectures on Fourier Integrals, Princeton Univ. Press, Princeton, 1959; translated from Vorlesungen caber Fouriersche Integrale, Akad. Verlag, Leipzig, 1932). Also see Katznelson [10, Chapter 6].

Chapter 2

The Hardy-Littlewood Maximal Function

1. Approximations of the identity Let cb be an integrable function on Rn such that f cb = 1, and for t > 0 define q t (x) = t - nO(t -l x). As t 0, q t converges in S' to 6, the Dirac measure at the origin: if g E S then O t (g) = f t - Th 0(t 1 x)g (x) dx = f 0(x)g(tx) dx, and so by the dominated convergence theorem, 6(g) = 9(0) 6 (9)• fir t,c) Since S * g = g, for g E S we have the pointwise limit li m * g(x) = g(x)• >o Because of this we say that {O t : t > 0} is an approximation of the identity. The summability methods in the previous chapter can be thought of as approximations of the identity. For Cesar() summability, = F1 and FR = 01/R. (See (1.24).) For Abel-Poisson summability, = P1 (see (1.30)) and for Gauss-Weierstrass summability, = W1 (see (1.31)). We see from the following result that these summability methods converge in LP norm.

Theorem 2.1. Let 16 : t > be an approximation of the identity. Then lim 116 * f - f 11 7, = 0 t->o if f E LP, 1 < p < cc, and uniformly (i.e. when p = co) if f E Co(Rn). 25

1

26

2. The Hardy-Littlewood Maximal Function

l'-nof. Because 0 has integral 1, Ot *

The relationship between weak (p, q) inequalities and almost everywhere convergence is given by the following result. In it we assume that (X, t) = (Y, v).

Given E > 0, choose S > 0 such that if IN < 6,

f

6

2 110111 .

(Note that S depends on f.) For fixed 5, if t is sufficiently small then

II * 1 - f Ilp

f

Ivl<5 / t

< E.

T* f (x) = sup ITtf (X)I • If T* is weak (p, q) then the set E LP (X, u) : lim Tt f (x) = f (x) a.e.} t—a o

10(011 f (- + ty) -1(.)11p

is closed in LP (X , t).

+ 2 11 11p10(01 clY 1Y1 5 it

T* is called the maximal operator associated with the family {Tt }. Proof. Let {} be a sequence of functions which converges to f in LP (X , norm and such that Tt fn (x) converges to fn (x) almost everywhere. Then

As a consequence of this theorem, we know that there exists a sequence { tk}, depending on f, such that tk —> 0 and lim Ot k *

Theorem 2.2. Let {Tt} be a family of linear operators on LP (X , pc) and define

6

10(Y)1 dY < flY1 . 6 /t Therefore, by Minkowski's inequality

27

When (X, = (Y, v) and T is the identity, the weak (p, p) inequality is the classical Chebyshev inequality.

f (x) - f (x) ff0(0V (x - ty) - f (x)] dy• Ilf(- + h)

2. Weak-type inequalities and almost everywhere convergence

1(x) = f (x)

a.e.

Hence, if lim0 t * f (x) exists then it must equal f (x) almost everywhere. In Section 4 we will study the existence of this limit

2. Weak-type inequalities and almost everywhere convergence

+ pax E X : 1(f — fn)(x)I > A/2}) (2C(2 +

Ilf fdp) P

CllfA , 11p) q

(

00

Actx E X : lim sup 1Ttf(x) f (x)I > 1/k}) t-.t o

= 0.

If T is strong (p, q) then it is weak (p, q): if we let EA = {y E Y : ITf(y)1 > A}, then

< 1 71113 < ( clIfIlp) q Aq

pax E X : lim sup ITt f (x) — f (x)1 > 0)) t-40

k=1

and we say that it is weak (p, oo) if it is a bounded operator from LP(X, t) to L"(Y, v). We say that T is strong (p, q) if it is bounded from L P (X , 1.1) to Lq (Y, v).

T f (x) q dv v(EA) = f dv 5_ A EA fEA

(f - fn )(x)1 >

and the last term tends to 0 as n —> oo. Therefore,

Let (X, ti) and (Y, v) be measure spaces, and let T be an operator from LP(X, it) into the space of measurable functions from Y to C. We say that T is weak (p, q), q < oo, if v({y E Y : 1Tf (y)I > A1)

pax E X : lim sup 1Tt f (x) — f(x)1 > t—,t o 5 itax e X : lim sup ITt(f — f.)(x) t—>t o p{{x E X : T* (f — fri )(x) > A/2})

)

By the same technique we can also prove that the set If E LP (X, tt) : lim Tt f (x) exists a.e.} t--4 0 is closed in LP(X, ii). It suffices to show that pax E X : lim sup Tt f (x) — liminf Tt f (x) > t—>t o t—,to

= 0,

28

2. The Hardy-Littlewood Maximal Function

and this follows as in the above argument since lim sup Tt f (x) - lim inf Tt f (x) < 2T* f (x). t->t o t-40

(If Tt f (x) is complex, we apply this argument to its real and imaginary parts separately.) Since for f E S approximations of the identity converge pointwise to f, we can apply this theorem to show pointwise convergence almost everywhere for f E LP, 1 o IC6t * f (x)I is weakly bounded. 3. The Marcinkiewicz interpolation theorem

Let (X, p) be a measure space and let f : X -> C be a measurable function. We call the function af : (0, oo) [0, oo], given by af(A) = pax EX: If (x)I > Al), the distribution function of f (associated with p). Proposition 2.3. Let ch, : [0, oo) such that 0(0) 0. Then

Ix

[0, oo) be differentiable, increasing and

Theorem 2.4 (Marcinkiewicz Interpolation). Let (X, p) and (Y, v) be measure spaces, 1 < po < 131 < oo, and let T be a sublinear operator from LP° (X + LP1 (X, to the measurable functions on Y that is weak (po, po) and weak (pi, P1). Then T is strong (p, p) for po 0 decompose f as fo + fl, where fo = fx{x:if

(A)af (A) dA.

ITIo( x )1 +

ITf(x)I

aTf(A) aTfo (A/ 2 ) + aTf, ()/2).

We consider two cases. Case 1: 2)1 = oo. Choose c = 1/(2A1), where Al is such that IIT9I100 < A11191100. Then aT f i (A / 2) = 0. By the weak (po, po) inequality, PO

( 2 A o /011Po)

hence, 00

Ix fo

if(x),

A P-1—P° (2A0)P°

(A) dA

II f

P — Po

IT(io + 11)(x) I I Tfo (x) I + I Tfi (x) , IT(Af)I IAIITf I, A E C.

AP

(2A0)P° (14.1)P -P°

dA dµ

fi r'',

Case 2: p1 < oo. We now have the pair of inequalities PI

2A,

p f AP -l a f (A) dA.

Since weak inequalities measure the size of the distribution function, this representation of the LP norm is ideal for proving the following interpolation theorem, which will let us deduce LP boundedness from weak inequalities. It applies to a larger class of operators than linear ones (note that maximal operators are not linear): an operator T from a vector space of measurable functions to measurable functions is sublinear if

If (x)IP° d p,dA c:: _ po

( s (s:I.1)1 (/xc) o if

f (x)IP° ff

= p(2A0)P°

and then change the order of integration. If, in particular, ql()) AP then

LP I (it ).

SO

To prove this it is enough to observe that the left-hand side is equivalent

(2.1)

7

fl = f X{x:If(x)I
IlTfP

to

(X)I>CA}

the constant c will be fixed below. Then fo E LP° (A) and fi E Furthermore,

aTh(A/2)

f (x)l) dp =-- f

29

3. The Marcinkiewicz interpolation theorem

aTL (A/2) ( A MG) , i = 0, 1.

From these we get (arguing as above) that 11 1111 7; P

f AP f

-

1 - P°

00

ffx:If(x)1>cA}

d,a dA

( 2A 1 )Pi

Ar° P — Po cP — Po p2P0

(2,40)P 0 (x)IP°

)

If (x)IP1 dµ

I{ x:1 f (x)IcAl p2P 1 AP' - p cP1P1

f

30

2. The Hardy-Littlewood Maximal Function

We can write the strong (p, p) norm inequality in this theorem more precisely as (2.2)

11T.f ip

2/FP (

1 +

1

P PO Pi _

p)

Aol-° A7 f

where

0

P P1

1

-

0

Po

0 < <1.

When p1 = co this is the constant which appears in the proof; when p1 < oo it is enough to take c such that (2A0c)n 0 = (2A1c)P 1 and then simplify. 4. The Hardy-Littlewood maximal function Let Br = B(0, r) be the Euclidean ball of radius r centered at the origin. The Hardy-Littlewood maximal function of a locally integrable function f on Rn is defined by 1

M f (x) = sup ,f If (x - y)1 dy. r>0 'Br I B,

(This can equal +co.) If we let 0 = IBi I -1 xB i , then (2.3) coincides for nonnegative f with the maximal operator associated with the approximation of the identity {0,} as in Theorem 2.2. Sometimes we will define the maximal function with cubes in place of balls. If Q, is the cube [-r, ?in, define (2.4)

M' f (x) = sup

1

r>o (2r)n

(x - Y)I dy.

When n 1, M and M' coincide; if n > 1 then there exist constants cn and C„, depending only on n, such that (2.5)

cnM' f (x) M f (x) < C n M' f (x).

Because of inequality (2.5), the two operators M and M' are essentially interchangeable, and we will use whichever is more appropriate, depending on the circumstances. In fact, we can define a more general maximal function (2.6)

Theorem 2.5. The operator M is weak (1,1) and strong (p,p), 1 p < co. We remark that by inequality (2.5), the same result is true for M' (and also for M"). It is immediate from the definition that

1

(2.3)

31

4. The Hardy-Littlewood maximal function

1

Mil f (x) = sup f I f (y)i dy, Q3xIQI Q

where the supremum is taken over all cubes containing x. Again, M" is pointwise equivalent to M. One sometimes distinguishes between M' and M" by referring to the former as the centered and the latter as the noncentered maximal operator. Alternatively, we could define the non-centered maximal function with balls instead of cubes.

(2.7) so by the Marcinkiewicz interpolation theorem, to prove Theorem 2.5 it will be enough to prove that M is weak (1, 1). Here we will prove this when n = 1; we will prove the general case in Section 6. In the one-dimensional case we need the following covering lemma whose simple proof we leave to the reader. Lemma 2.6. Let ficj ac A be a collection of intervals in R and let K be a

compact set contained in their union. Then there exists a finite subcollection {lid such that K C I3 and xi; (x) < 2, x E R.

Proof of Theorem 2.5 for n = 1. Let EA = {x E : M f(x) > A}. If x E EA then there exists an interval Ix centered at x such that 1 f (2.8)

Ifl >

III

A.

Let K C EA be compact. Then K C U /x , so by Lemma 2.6 there exists a finite collection {I) } of intervals such that K C U 3 1-3 and E 3 X/3 < 2. Hence, 1 2

IKI

II

A

f Ifl < -f f A

< -11f111; A

the second inequality follows from (2.8). Since this inequality holds for every compact K C EA, the weak (1, 1) inequality for M follows immediately. ❑ Lemma 2.6 is not valid in dimensions greater than 1, and though one could replace it with similar results, this is not the approach we are going to take here. (For such a proof, see Section 8.6.) The importance of the maximal function in the study of approximations of the identity comes from the following result. Proposition 2.7. Let 0 be a function which is positive, radial, decreasing (as a function on (0, co)) and integrable. Then sup 10 t * f (x)I < 110111 mf (x)-

t>0

32

2. The Hardy-Littlewood Maximal Function

Proof. If we assume in addition to the given hypotheses that 0 is a simple

function, that is, it can be written as

0(x) =

IBr

since 11011 1 =

E

cubes, open on the right, whose vertices are adjacent points of the lattice (2 —k Z)n. The cubes in U k Qk are called dyadic cubes. From this construction we immediately get the following properties: (1) given x E IR" there is a unique cube in each family Qk which contains it;

3 X Br., (x)

with ai > 0, then * f (x) =

33

5. The dyadic maximal function

(2) any two dyadic cubes are either disjoint or one is wholly contained in the other; * f (x) 5- 110111.Mf (X)

3

An arbitrary function 0 satisfying the hypotheses can be approximated by a sequence of simple functions which increase to it monotonically. Any dilation cbt is another positive, radial, decreasing function with the same integral, and it will satisfy the same inequality. The desired conclusion follows at once.

(3) a dyadic cube in Qk is contained in a unique cube of each family Q3 , j < k, and contains 2n dyadic cubes of Qk+1. Given a function f E LL(Rn), define Ek f (x) =

E CV f 1) Xci(x);

QEQk

Corollary 2.8. If 10(x)I < 11)(x) almost everywhere, where 1,1, is positive,

Ek f is the conditional expectation of f with respect to the a-algebra generated by Qk. It satisfies the following fundamental identity: if 1 -2 is the union of cubes in Qk, then

radial, decreasing and integrable, then the maximal function sue t Icb t * f(x)I is weak (1, 1) and strong (p,p), 1 < p oo.

12Ek f = f f.

This is an immediate consequence of Proposition 2.7 and Theorem 2.5. If we combine this corollary with Theorem 2.2 we get the following result.

Corollary 2.9. Under the hypotheses of the previous corollary, if f E LP, 1 < p < oo, or if f E Co, then

o

Ek f is a discrete analog of an approximation of the identity. The following theorem makes this precise; first, define the dyadic maximal function by

(2.9)

Mdf (x) = sup I,Ek f (x)1. k

-40 * f (x) = ( f

1 (x)

a. e.

In particular, the summability methods discussed in Chapter 1, Section 9 (Cesar°, Abel-Poisson and Gauss-Weierstrass summability), each converge to f (x) almost everywhere if f is in one of the given spaces.

Proof. Since we have convergence for f E S, by Theorem 2.2 we have

convergence for f E S = LP (or f E Co if p = co). The Poisson kernel (1.30) and the Gauss-Weierstrass kernel (1.31) are decreasing; the Fejer kernel (1.24) is not but Fi (x) < min(1, (7rx) -2 ).

Theorem 2.10. (1) The dyadic maximal function is weak (1,1). (2) If f E

Ek f (x) = (x) a. e.

Proof. (1) Fix f E L l ; we may assume that f is non-negative: if f is real, it can be decomposed into its positive and negative parts, and if it is complex, into its real and imaginary parts. Now let fx E

5. The dyadic maximal function In R 1 we define the unit cube, open on the right, to be the set [0, 1)n, and we let Qo be the collection of cubes in Rn which are congruent to [0, 1)n and whose vertices lie on the lattice Zn. If we dilate this family of cubes by a factor of 2 — k we get the collection Qk, k E Z; that is, Qk is the family of

k—>co

: Mdf (x) > A} =

UC2 k3

where

S2 k = {x E illn: Ek f (x) >Aand Eif(x)< Aif j

34

2. The Hardy-Littlewood Maximal Function

The sets 52 k are clearly disjoint, and each one can be written as the union of cubes in Qk. Hence,

off over Qi is at most A. Therefore, 1

rfx ER' : Mdf(x) > Ali =

1 141 IQ3I k

1 f Ek f A f k vf f

A (2) This limit is clearly true if f is continuous, and so by Theorem 2.2 it holds for f E L 1 . To complete the proof, note that if f E LL then fxQ E L 1 for any Q E Q0. Hence, (2) holds for almost every x E Q, and so for almost every x E This proof uses a decomposition of I1 which has proved to be extremely useful. It is called the Calderon-Zygmund decomposition and we state it precisely as follows. Theorem 2.11. Given a function f which is integrable and non-negative,

and given a positive number A, there exists a sequence {gi} of disjoint dyadic cubes such that

(2) (3 )

f (x) < A for almost every x U ;

U Qi A<

1

We are now going to use Theorem 2.10 to prove Theorem 2.5. In fact, it is an immediate consequence of the following lemma and inequality (2.5). (Recall that M' is the maximal operator on cubes defined by (2.4).) Lemma 2.12. If f is a non-negative function, then

I {x ElWn :M'f(x)>4 n A}1 2n}{x E R n : Mdf (x) > A} . Given this lemma, by the weak (1,1) inequality for Md proved in Theorem 2.10, Ifx E lir :

f (x) > All < 2n {X E R n : Mdf(x) > 4 -n A} I <

(Since 1141 f =

Il f lll.

(VD, we may assume that f is non-negative.)

Proof of Lemma 2.12. As before, we form the decomposition {x EI[B n : Mdf(x)>

y1/411f111;

A} = UQ3•

Let 2Q 3 be the cube with the same center as Q3 and whose sides are twice as long. To complete the proof it will suffice to show that

f f 2"A.

iQj I {x

Iqj

1 f f n f< 2 A. fQ j 1C2 i11Qii el;

6. The weak (1, 1) inequality for the maximal function

— L—I k

(1)

35

6. The weak (1, 1) inequality for the maximal function

E

113 n :

Proof. As in the proof of Theorem 2.10, form the sets 52k and decompose

each into disjoint dyadic cubes contained in Qk; together, all of these cubes form the family {Q3}.

f (x) > 4nAl C U 2(2 3'

Fix x U 3 2Q 3 and let Q be any cube centered at x. Let 1(Q) denote the side length of Q, and choose k E Z such that 2 k-1 < 1(Q) < 2 k . Then Q intersects m < 2 n dyadic cubes in Qk; call them R1, R2, , R. None of these cubes is contained in any of the Q3 's, for otherwise we would have x E U 3 2Q 3 . Hence, the average of f on each R, is at most A, and so

Part (2) of the theorem is then just the weak (1,1) inequality of Theorem 2.10. If x 1..}i Q3 then for every k, Ek f (x) < A, and so by part (2) of Theorem 2.10, f (x) < A at almost every such point. iFinally, f 1 by the definition of the sets flk, the average of f over Q 33 is 1 .1c2 j IQI greater(21 than A; this is the first inequality in (3). Furthermore, if Q3 is the dyadic cube containing Q3 whose sides are twice as long, then the average

m 2kn 1 fQnR,

f < i=1

f

IQI IRil JR

f

< 2nmA < 4nA.

36

2. The Hardy-Littlewood Maximal Function

As a consequence of the weak (1,1) inequality and Theorem 2.2 we get a continuous analog of the second half of Theorem 2.10.

7. A weighted norm inequality Theorem 2.15. If B is a bounded subset of IV , then

Corollary 2.13 (Lebesgue Differentiation Theorem). If f E LL(Rn) then 1

urn —

IBri

f (x — y) dy f (x) a. e.

fB,

From this we see that If (x) I < M f (x) almost everywhere. The same is true if we replace M by M' or M".

fB M f 2IBI +C f

R.

Proof. M f < 2 f I Ix EB:Mf (x) > 2)41 dA

This follows immediately from the fact that iBr i fB, I f (x - y) - f (x)I dY M f (x) + If (x) I.

f

< 21B1+2 f

1

lim — f If (x — y) — f (x)I dy 0 a.e. r•—>o+ 1Br I

lim j --'co 1 13i I In;

Hence,

The proof is simple: since f is not identically 0, there exists R > 0 such that If' > c > O.

A k:If(x)1>A}

)ddxd)' A < f (x)I C _dx

A

(x(

A

7. A weighted norm inequality Theorem 2.16. If w is a non-negative, measurable function and 1 A} f( x:mw(x) A

1 f

c

(21xi)n h„ lf = 2-ixin • ,

Nevertheless, we do have the following.

c

: Mf(x) > 2A}I clA f —

R.

(2.11)

Now if 1x1 > R, BR C B(x, 21x1), so M f (x)

1{x E B

= C f 1f(x)Ilog+ 1f(x)1dx.

Proposition 2.14. If f E L' and is not identically 0, then M f

3R

dA.

{x EB:Mf (x) > 2A} C E B : M fi(x) > Al.

= f (x).

This follows immediately from the inclusion B i C B(x, 2r3 ), where r3 is the radius of Bd . A similar argument shows that the set of Lebesgue points of f does not change if we take cubes instead of balls. The weak (1,1) inequality for M is a substitute for the strong (1,1 inequality, which is false. In fact, it never holds, as the following resul t shows.

I

EB:Mf (x) >

Decompose f as fi + f2, where fl = fx{ x: if( r )> A } and f2 = f — fi . Then

The points in 1}In for which limit (2.10) equals 0 are called the Lebesgue points of f. If x is a Lebesgue point and if {Bi } is a sequence of balls such that B1 D B2 D • • • and n j B3 = {x} (note that the balls need not be centered at x), then 1

If1 log+ Ifl,

where log+ t = max(log t, 0).

We can make the conclusion of Corollary 2.13 sharper: (2.10)

37

Proof. It will suffice to show that IlmfIlL-(w) < and that the weak (1, 1) inequality holds; the strong (p, p) inequality then follows from the Marcinkiewicz interpolation theorem. If Mw(x) = 0 for any x, then

1

38

2. The Hardy-Littlewood Maximal Function

w(x) = 0 almost everywhere and there is nothing to prove. Therefore, we may assume that for every x, Mw(x) > 0. If a > II f Il L oo( mw) then {x:I f (x)1>a}

Mw(x) dx = 0,

and so I {x E R n : If(x)I > a}I = 0; that is, If (x)1 < a almost everywhere. From this it follows that Mf(x) < a a.e., so IlMf I Los (w) < a. Hence, 11M/IIL—(w) IlfIlLoo(mw)• To prove the weak (1, 1) inequality we may assume that f is non-negative and f E L l (Rn). (If f E L 1 (Mw) then h = fx B(0 ,3) is a sequence of integrable functions which increase pointwise to f .) If {Q 3 } is the CalderOnZygmund decomposition of f at height A > 0, then as we showed in the proof of Lemma 2.12, {x E Rn :

f (x) > 4'A} c U2(2 3 ;

hence, f{s:M' f (s)>4n

w(x)

dx <

f2c 2 w(x) dx

Al

2 Igil l2Q1 ji f2c2, w(x) dx <

2' A 2nC A

L

(y)

(

i2Q11 31 f2Q, w(x) dx) dy

f(Y)Arza(Y)

8. Notes and further results

is most easily grasped when stated in the language of cricket, or any other game in which a player compiles a series of scores of which an average is recorded." Their proof, which uses decreasing rearrangements (see 8.2 below) can be found in Zygmund [21]. Our proof follows the ideas of Calder& and Zygmund (On the existence of certain singular integrals, Acta Math. 88 (1952), 85-139). The decomposition which bears their name also first appeared in this paper. The method of rotations, discussed in Chapter 4, lets us deduce the strong (p, p) inequality for n > 1 from the one-dimensional result. For a discussion of questions related to Theorem 2.2, and in particular the necessity of the condition in that theorem, see de Guzman [7]. The Marcinkiewicz interpolation theorem was announced by J. Marcinkiewicz (Sur l'interpolation d'operations, C. R. Acad. Sci. Paris 208 (1939), 12721273). However, he died in World War II and a complete proof was finally given by A. Zygmund (On a theorem of Marcinkiewicz concerning interpolation of operations, J. Math. Pures Appl. 34 (1956), 223-248). Both Marcinkiewicz interpolation and Riesz-Thorin interpolation in Chapter 1 have been generalized considerably; see, for example the book by C. Bennett and R. Sharpley (Interpolation of Operators, Academic Press, New York, 1988). The weighted norm inequality for the maximal operator is due to C. Fefferman and E. M. Stein (Some maximal inequalities, Amer. J. Math. 93 (1971), 107-115). 8.2. The Hardy operator, one-sided maximal functions, and decreasing rearrangements of functions. Given a function g on R+ = (0, oo) the Hardy operator acting on g is

defined by

Since M n w(y) < C„Mw(y), we get the desired inequality. If w is such that Mw(x) < w(x) a.e., then these inequalities hold with the same weight w on both sides. Functions which satisfy this condition are called A l weights, and we will consider them in greater detail in Chapter 7. 8. Notes and further results 8.1. References.

The maximal function for n = 1 was introduced by G. H. Hardy and J. E. Littlewood (A maximal theorem with function-theoretic applications, Acta Math. 54 (1930), 81-116), and for n > 1 by N. Wiener (The ergodic theorem, Duke Math. J. 5 (1939), 1-18). In their article, Hardy and Littlewood first consider the discrete case, about which they say: "The problem

39

Tg(t) = -1 t- fot g(s) ds, t E R + .

If g E L 1 (R+) is non-negative, then, since Tg is continuous, one can show that (2.12)

E(A) = fE(A) g(t) dt,

where E(A) = {t E : Tg(t) > Al. From this and (2.1) we get liTgli p < 1 < p < oo. Other proofs of this result and generalizations of the operator can be found in the classical book by G. H. Hardy, J. E. Littlewood and G. POlya (Inequalities, Cambridge Univ. Press, Cambridge, 1987, first edition in 1932), or in Chapter 2 of the book by Bennett and Sharpley cited above.

40

2. The Hardy-Littlewood Maximal Function

For a function f on IR, the one-sided Hardy-Littlewood maximal functions are defined by 1 f t+h

M+ f (t) = sup - h>0 h

t

1

If (s)I ds and M - f (t) = sup -L f

t

h>0 u -h

I f (s)I ds.

The maximal function as defined by Hardy and Littlewood corresponds to A4 for functions on R± (with 0 < h < t in the definition). When If! is decreasing this maximal function coincides with the Hardy operator acting on IfI. If f is a measurable function on RV, we can define a decreasing function f* on (0, co), called the decreasing rearrangement of f , that has the same distribution function as f: --

41

8. Notes and further results

which is similar to (2.12) for the Hardy operator. The strong (p, p) inequality now follows as before with constant p' (which is sharp). We leave the details of the proof of this and of Lemma 2.17 to the reader. An inequality similar to (2.13) holds for the maximal operator acting on functions on Rn; in fact, we have the following pointwise inequality: there exist positive constants cm and Cn, such that cm (Mf)*(t) f**(t) Cn (Mf) (t), t E R ± .

The left-hand inequality is due to F. Riesz; the right-hand inequality is due to C. Herz (n = 1) and C. Bennet and R. Sharpley (n > 1). See Chapter 3 of the above-cited book by these authors for a proof and further references. 8.3. The Lorentz spaces LP q. ,

f* (t) = inf{A : af(A) < t}.

Because f and f* have the same distribution function, by (2.1) their LP norms are equal, as well as any other quantity which depends only on their distribution function. (Cf. Section 8.3 below.) The action of the Hardy operator on f* is usually denoted by f**. Hardy and Littlewood showed that for functions on R±, (2.13)

Ifx :M f (x) > AII < Ifx : f**(x) > All,

so the weak (1, 1) and strong (p,p) inequalities for A4 follow from the corresponding ones for T. --

A beautiful proof of the weak (1, 1) inequality for md was given by F. Riesz as an application of his "rising sun lemma" (Sur un theorême du maximum de MM. Hardy et Littlewood, J. London Math. Soc. 7 (1932), 10-13). Given a function F on IR, a point x is a shadow point of F if there exists y > x such that F(y) > F(x). The set of shadow points of F is denoted by S(F). -

Lemma 2.17. Let F be a continuous function such that lim F(x) = -co and lim F(x) = -Foo. x->+00 Then S(F) is open and can be written as the disjoint union U i (aj , bi ) of finite open intervals such that F(ai) F(bj).

Given f E OR) and A > 0, define F(x) = fox I f(t)I dt - Ax. Then E(A) = {x : M+ f (x) > A} equals S(F). Using Lemma 2.17 one can deduce that E(A

1 )

f I f (x)I dx Eps )

A

Let (X, it) be a measure space. LP 4 (X) denotes the space of measurable functions f which satisfy oc• = f [t i/Pf*(t"q cit t lig < 00 P

A

when 1 0

when 1 < p < co. When p q,

Ilf Ilp,P = Ilf*Ilp = IlflIP and we recover LP. In general, however, II • Il p , q is not a norm since the triangle inequality only holds when 1
ClIflIP. The Marcinkiewicz interpolation theorem can be generalized to these spaces; this lets us, for example, give a version of the Hausdorff-Young inequality which is stronger than Corollary 1.20: if f E LP (W'), 1 < p < 2, then f E 'P and there exists a constant B p such that Ilf IIp',p 5 BplifiiP• For more information on decreasing rearrangements and the LP , g spaces, see Stein and Weiss [18, Chapter 5] and the book by Bennett and Sharpley cited above. Also see this latter book for more on interpolation theorems, particularly the so-called real method of interpolation which is well suited -

42

2. The Hardy-Littlewood Maximal Function

43

8. Notes and further results

Lorentz spaces. These spaces were introduced by G. G. Lorentz in two papers (Some new functional spaces, Ann. of Math. 51 (1950), 37-55, and On the theory of spaces A, Pacific J. Math. 1 (1951), 411-429).

Groningen, 1961), and M. M. Rao and Z. D. Ren, ( Theory of Orlicz Spaces, Marcel Dekker, New York, 1991).

8.4. L log L.

8.5. The size of constants.

In the proof of Proposition 2.14, the integrability of M f failed at infinity, and did not exclude local integrability. However, the example f (x) =-x+ 1 (log x)+ 2 x( 0 , 1 / 2 } shows that even local integrability can fail. A partial

In the proofs in this chapter, the constants which appear in the weak (1,1) and strong (p,p) inequalities, 1 p < co, for M and M' are of exponential type with exponent n. For the strong (p,p) inequality, the method of rotations (see Corollary 4.7) gives a better constant of the form C p n for M. Furthermore, one can show that there exist constants Cp , independent of n, such that

converse of Theorem 2.15 is true and characterizes when M f is locally integrable.

Theorem 2.18. If f is an integrable function supported on a compact set B, then M f E L l (B) if and only if f log+ f E L 1 (B). This is due to E. M. Stein (Note on the class L log L, Studia Math. 32 (1969), 305-310); a proof can also be found in Garcia-Cuerva and Rubio de Francia [6, p. 146]. At the heart of the proof is a stronger version of the weak (1, 1) inequality and a "reverse" weak (1, 1) inequality for the maximal function: (2.14)

Ifx E

: M f (x) >

< A

(2.15)

I X:1 f (x)1>Al21

f I{x E Rn : M f (x) > All >f

If (x)I dx,

I f (x)I dx.

A

Inequality (2.14) follows from the weak (1, 1) inequality applied to fi f x{x:if (.)1>Al2}; inequality (2.15) follows from the Calderon-Zygmund decomposition if we replace M by M". In this and related problems it would be useful to consider functions f such that f log+ f E L l as members of a Banach space. Unfortunately, the expression f f log+ f does not define a norm. One way around this is to

introduce the Luxemburg norm: given a set C2 C EV, define (2.16) j

11.fliilogL(Q)

inf {A > 0 :

1 If (x) \I logAIf (x)I/ dx < 1} . f2

(

)

By using this we can strengthen the conclusion of Theorem 2.18 to the following inequality: (B)- < II fil L log L(B)• (See Zygmund [21, ChapC,,., ter 4].) By replacing the function t log+ t in (2.16) by any convex, increasing function 43, we get a class of Banach function spaces, L`b(S2), which generalize the LP spaces and are referred to as Orlicz spaces. These have a rich theory; for further information, consult the books by M. A. Krasnoserskii and Ya. B. Rutickii, (Convex functions and Orlicz spaces, P. Noordhoff,

Ilmfllp

cplIfIlp,

1

This result is due to E. M. Stein, and the proof appeared in an article by E. M. Stein and J.-O. Stromberg (Behavior of maximal functions in W for large n, Ark. Mat. 21 (1983), 259-269); also see E. M. Stein, Three variations on the theme of maximal functions (Recent Progress in Fourier Analysis, I. Feral and J. L. Rubio de Francia, eds., pp. 229-244, North-Holland, Amsterdam, 1985). A partial generalization of this result is possible: let B be a convex set which is symmetric about the origin and define m B f (x) = suP , r> 0 1

f If (x — v)1 dY• B

Then if p > 3/2 there exists a constant Cp independent of B and n such that

IIMBfIlp 5_ cplIfIlp. For the weak (1, 1) inequality, the best constant known for M is of order n. (See the article by Stein and Striimberg cited above.) For the non-centered maximal function defined by (2.6) (but with balls instead of cubes), the best constant in the strong (p, p) inequality must grow exponentially in n. See L. Grafakos and S. Montgomery-Smith (Best constants for uncentered maximal functions, Bull. London Math. Soc. 29 (1997), 60-64). We note that when n 1 the best constants are known for the one-sided and for the non-centered maximal operator M" but not for M. In the case of the weak (1,1) inequality for M", one can use Lemma 2.6 to get C = 2, and the function f (t) = X[o,1] shows this is the best possible. For the strong (p, p) inequality for M", see the article by Grafakos and Montgomery-Smith cited above. For lower and upper bounds on the weak (1,1) constants for M, see the article by J. M. Aldaz (Remarks on the Hardy-Littlewood maximal function, Proc. Roy. Soc. Edinburgh Sect. A 128 (1998), 1-9).

44

2. The Hardy-Littlewood Maximal Function

8.6. Covering lemmas.

A standard approach to proving that the maximal function in 118 11 is weak (1, 1) is to use covering lemmas. Here we give two; the first is a Vitali-type lemma due to N. Wiener (in the paper cited above). If B = B(x, r) and t > 0, then we let tB = B (x , tr) .

Theorem 2.19. Let {Bi } j0 j be a collection of balls in Rn. Then there

exists an at most countable subcollection of disjoint balls {Bk} such that U Bj C U5Bk.

The second is due independently to A. Besicovitch and A. P. Morse; for a proof and further references, see the book by M. de Guzman (Differentiation of Integrals in 1 Rn , Lecture Notes in Math. 481, Springer-Verlag, Berlin, 1985). Theorem 2.20. Let A be a bounded set in RV', and suppose that {B X } XEA

is a collection of balls such that B x = B(x, r i ), rx > 0. Then there exists an at most countable subcollection of balls {B 1 } and a constant Ci „ depending only on the dimension, such that A c U Bi and

Bi (X) < Cn .

8. Notes and further results

45

Wiener's lemma; when n = 1, Lemma 2.6 can be applied even to nondoubling measures. An example of p such that M;" is not weak (1, 1) is due to P. Sjogren (A remark on the maximal function for measures onlr, Amer. J. Math. 105 (1983), 1231-1233). For certain kinds of measures a weaker hypothesis than doubling implies that M'a' is weak (1, 1); see, for example, the paper by A. Vargas (On the maximal function for rotation invariant measures in R n , Studia Math. 110 (1994), 9-17). 8.7. The non-tangential Poisson maximal function.

Given f E LP(Rn), u(x, t) = Pt * f (x) defines the harmonic extension of f to the upper half-space R n++1 = R n x (0, 00), and lim t _,0 Pt * f (x) is the limit of this function on the boundary as we approach x "vertically", that is, along the line perpendicular to Rn. More generally, one can consider a non-tangential approach: fix a > 0 and let

t) : ly — xi < at} be the cone with vertex x and aperture a. We can then ask whether a.e. x E R n . lim u(y, t) = f (x) F a (x) = {(y,

(y,t)—, (x,0) (y,t)Era(x)

Since this limit holds for all x if f is continuous and has compact support, it suffices to consider the maximal function sup Iu(y,t)I, (y,t)er„.(x) so as to apply Theorem 2.2. But one can show that there exists a constant Ca , depending on a, such that u a*(x) =

The same result holds if balls are replaced by cubes; more generally, the point x need not be the center of the ball or cube but must be uniformly close to the center. Using the Besicovitch-Morse lemma, we can extend our results for the maximal function to LP spaces with respect to other measures. Given a non-negative Borel measure p, define the maximal function M f (x) = sup 7

1

r>0 f13

If (x — Y)Idti(Y),

(If y(14) = 0, define the ,a-average of f on Br to be zero.) Then one can show that MN, is weak (1, 1) with respect to /2; hence, by interpolation it is bounded on LP (p), 1 < p < co. Note that if we define Mi", as the maximal operator with respect to cubes, then Ma and MI', need not be pointwise equivalent unless tt satisfies an additional doubling condition: there exists C such that for any ball B, p(2B) < C y(B). Furthermore, if we define the non-centered maximal operator then this need not be weak (1, 1) if n > 1 unless p satisfies f (x) a doubling condition. In this case the weak (1, 1) inequality follows from

ua(x) Ca Mf (x), and so u a is weak (1,1) and strong (p,p), 1 < p < oo. These results can be generalized to "tangential" approach regions, provided the associated maximal function is weakly bounded. For results in this direction, see, for example, the articles by A. Nagel and E. M. Stein (On certain maximal functions and approach regions, Adv. in Math. 54 (1984), 83-106) and D. Cruz-Uribe, C. J. Neugebauer and V. Olesen (Norm inequalities for the minimal and maximal operator, and differentiation of the integral, Publ. Mat. 41 (1997), 577-604). 8.8. The strong maximal function.

Let R(hi, , ha) = [ — hi, hi] x • • x [—ha, h id. If f E LL, define the strong maximal function of f by =

sup

1 >0I n(nll•

If • •

, kJ)!

(x

—

(h1,... ' hp )

dY •

2. The Hardy-Littlewood Maximal Function

46

Then M8 is bounded on LP (Rn), p > 1; this is a consequence of the onedimensional result. However, Ms is not weak (1, 1): the best possible inequality is x E 118n : Ms f (x) > All C

l If (

:

)1 (1 + log+ I f dx.

This result is due to B. Jessen, J. Marcinkiewicz and A. Zygmund (Note on the differentiability of multiple integrals, Fund. Math. 25 (1935), 217-234). A geometric proof was given by A. COrdoba and R. Fefferman (A geometric proof of the strong maximal theorem, Ann. of Math. 102 (1975), 95-100); also see the article by R. J. Bagby (Maximal functions and rearrangements: some new proofs, Indiana Univ. Math. J. 32 (1983), 879-891). The analog of the Lebesgue differentiation theorem, 1 , max(hi)— 0 (rt(ii, • • • , hri)i

f (x - y) dy = f (x) a.e.

lim

,

is false for some f E Ll, but is true if f (1 + log+ ifl) n 1 is locally integrable, and in particular if f E LL for some p > 1. If in the definition of Ms we allow rectangles (that is, parallelepipeds) with arbitrary orientation (and not just with edges parallel to the coordinate axes), then the resulting operator is not bounded on any LP, p < CXD and the associated differentiation theorem does not hold even for f bounded. For these and other problems related to differentiation of the integral see de Guzman [7], the book by the same author cited above, and the monograph by A. M. Bruckner (Differentiation of integrals, Amer. Math. Monthly 78 (1971), Slaught Memorial Papers, 12). 8.9. The Kakeya maximal function. Given N > 1, let RN be the set of all rectangles in Rn with n - 1 sides of length h and one side of length Nh, h > 0. The Kakeya maximal function is defined by kNf(x) = sup

xERER.N

1

— 1

R

Ifl.

Since each rectangle in 'RN is contained in a ball of radius cn .N h, the Kakeya maximal function is bounded pointwise by the Hardy-Littlewood maximal function: (2.17)

K Nf (

x ) < cn Nn-1 m f ( x ) .

It follows immediately that KN is bounded on LP, 1 < p < An important problem is to determine the size of the constants as functions of N for the LP estimates for the Kakeya maximal function. Inequality

47

8. Notes and further results

(2.17) gives the trivial estimate Nn -1 for the constant in the weak (1,1) inequality, and it is clear that in KAT has norm 1. Interpolating between t Nf lip < criN(n_i)1, these values we geilic

IlfIlp, 1

when p = n, which, again by interpolation, would give the following bounds: < Cn (log NrP

IlfIlp; (2) if 1 n then IIKNf

This conjecture is closely related to two other problems: the Hausdorff dimension of the Kakeya set—a set with Lebesgue measure zero which contains a line segment in each direction (see Stein [17, p. 434]); and the boundedness properties of Bochner-Riesz multipliers (see Chapter 8, Sections 5 and 8.3). This conjecture has been proved completely only when n = 2: see the paper by A. COrdoba (The Kakeya maximal function and the spherical summation multipliers, Amer. J. Math. 99 (1977), 1-22). In this paper he also discussed the connection with Bochner-Riesz multipliers. COrdoba later proved the result in all dimensions when 1 < p < 2; see A note on Bochner-Riesz operators (Duke Math. J. 46 (1979), 505-511). M. Christ, J. Duoandikoetxea and J. L. Rubio de Francia (Maximal operators related to the Radon transform and the CalderOn-Zygmund method of rotations, Duke Math. J. 53 (1986), 189-209) extended the range to 1 < p < (n + 1)/2. A great deal of activity on this problem has been spurred by the work of J. Bourgain (Besicovitch type maximal operators and applications to Fourier analysis, Geom. Funct. Anal. 1 (1991), 147-187). In this paper he proved the conjecture for 1 < p < (n + 1)/2 + c n , where e n is given by an inductive formula (for instance, e3 = 1/3). T. Wolff (An improved bound for Kakeya type maximal functions, Rev. Mat. Iberoamericana 11 (1995), 651-674) improved this to 1 1/2 such that the conjecture is true for p < cn if n is large enough. For a discussion of recent results on this problem and its connection with the Kakeya set, see the survey article by T. Wolff (Recent work connected with the Kakeya problem, Prospects in Mathematics, H. Rossi, ed., pp. 129162, Amer. Math. Soc., Providence, 1999).

11■11■1

Chapter 3

The Hilbert Transform

1. The conjugate Poisson kernel

Given a function f in S(R), its harmonic extension to the upper half-plane is given by u(x, t) = Pt * f (x), where Pt is the Poisson kernel. We can also write (see (1.29)) u(z) =

f

00

f (0e 27r"•

+ f o f(0e 27r" .

where z = x + it. If we now define 0 iv (z) = f f (0e27". de — f f e 27" .z .

then v is also harmonic in R 2+ and both u and v are real if f is. Furthermore, u + iv is analytic, so v is the harmonic conjugate of u. Clearly ; v can also be written as sgn (o e - 2 7rt0

v(z) = f which is equivalent to (3.1) where (3.2)

v(x, t) = Qt f(x),

Qt()

_1 sgn()e-27rtll.

If we invert the Fourier transform we get 1 x (3.3) Qt (x) = 7r t2 + x2 49

50

3. The Hilbert Transform

the conjugate Poisson kernel. One can immediately verify that Q (x, t) Q t (x) is a harmonic function in the upper half-plane and the conjugate of the Poisson kernel Pt(x). More precisely, t2

1 t + ix Pt(x)x2 + iQt(x) +

3. The theorems of M. Riesz and Kolmogorov

Fix 0 E S; then (7rQt — 00(0) = ifn tx2 C15(x ±x 2 dx fixi>t (4' (xx) dx )

— 7rz

f

which is analytic in Im z > 0. In Chapter 2 we studied the limit as t —> 0 of u(x, t) using the fact that {Pt } is an approximation of the identity. We would like to do the same for v(x, t), but we immediately run into an obstacle: {Q t } is not an approximation of the identity and, in fact, Q t is not integrable for any t > 0. Formally, 11111 Qt(X) =

1

;

7rx this is not even locally integrable, so we cannot define its convolution with smooth functions.

0(x)

>e X

dx,

E S.

To see that this expression defines a tempered distribution, we rewrite it as

— 0(0) 0(x) p. v. 1 (0) = + dx

0(x)

this holds since the integral of 1/x on E < that

If we take the limit as t 0 and apply the dominated convergence theorem, we get two integrals of odd functions on symmetric domains. Hence, the limit equals 0. ❑ As a consequence of this proposition we get that

.

=- p. v. 1 .

Hf = lir Q t * f, 1

Hf=—

1 * f, p. v.-1 x sgn(Of (6)•

The third expression also lets us define the Hilbert transform of functions in L 2 (R); it satisfies (3.5) (3.6)

11f112,

11H f112 = H(Hf)= —f,

g-ff •Hg

3. The theorems of M. Riesz and Kolmogorov

Therefore, it will suffice to prove that in S' lim (Qt —

Given a function f E S, we define its Hilbert transform by any one of the following equivalent expressions:

(3.4)

S'

1

j )

(H f)(e)

C( 1 10'11 . + 11x011 )•

(6) sgn(.

v. x

< 1 is zero. It is now immediate

Proof. For each E > 0, the functions 0,(x) = x -1 X{Ixi>e} are bounded and define tempered distributions. It follows at once from the definition that in

t-o

f (x - Y) dy,

1

lim Q t * f (x) = — lim t—>o

dx;

1 Proposition 3.1. In S', lim Q t = —1 p. v. —. t--4) 7r x

lim

0(x) dx

(

and by the continuity of the Fourier transform on S' and by (3.2) we get

1

1.(o

f

t2 ± X2 x ) fix l< t = f x0(tx) dx f 0(tx) dx. fH, 1 1 + x 2 fixi>1 x(1+ x 2 )

We define a tempered distribution called the principal value of 1/x, abbre p . viated p. v. 1/x, by €—>0

(x ) d 2+ x

€—'() fl YI> , Y

2. The principal value of 1/x

p. v. — (0) = lim

51

0.

The next theorem shows that the Hilbert transform, now defined for functions in S or L 2 , can be extended to functions in LP, 1 All < -11f11 3 .

(2) (M. Riesz) H is strong (p,p), 1 0 and f non-negative. Form the CalderOn-Zygmund decomposition of f at height A (see Theorem 2.11); this yields a sequence of disjoint intervals {/3 } such that f (x) 5 A for a.e. x 9 = L.J

PI

—A lIfIli,

A JR

Let 2/3 be the interval with the same center as 1 -3 and twice the length, and let Cr = U i 2/3. Then < 2 IC/I and

I {x E R

1Hb(x)1 > A/211 < 1 9* I + I{x Cr 1Hb(x)1 > A/211 2

1Hb(x)1dx. A ± A— f Ickyl *

—

Now 1Hb(x)1 < E 3 1Hb3 (x)1 almost everywhere: this is immediate if the SUM is finite, and otherwise it follows from the fact that Ebi and EHb3 converge to b and Hb in L 2 . Hence, to complete the proof of the weak (1,1) inequality it will suffice to show that

1 f < 2A.

A<

j

I/31

Given this decomposition of R, we now decompose f as the sum of two functions, g and b, defined by if x if x E 1,

g(x) = { fi(x)

f

and b(x)

f R\2/

iHb (x)1 dx ;

Even though bi S, when x 213 the formula bi (y) d _ y dy

Hbi(x) = L x

is still valid. Denote the center of .1-3 by ci ; then since bi has zero integral, fR\24

IHb3(x)Idx = .k2ij

where IR.\2/3

f

bi(y) dy

5_ I lbi (y)1 (f

f(x)1 > Al 1

< I{x E R: 1Hg(x)1 > A/2}I + I{x E R :1Hb(x)1 > A/211. We estimate the first term using (3.4): I{x E R: IHg(x)I > A/2115_ ( 3,2- ) 2 f IHg(x)1 2 dx R\21, = j‘t ffiz g(x) 2 dx

<

8 f g(x) dx

A R

dx

bi(y) ( 1 x — y

Ibi(Y)1 (1.11 \21,

Then g(x) < 2A almost everywhere, and b3 is supported on 1 -3 and has zero integral. Since H f = Hg + Hb, Ifx E

53

3. The theorems of M. Riesz and Kolmogorov

R\2/,

1 ) — ci dy dx

ly

Ix — yllx — c.71

dx) dy

1-1-71 dx) dy. C31

The last inequality follows from the fact that ly — c 3 1 < 1117 1/ 2 and Ix — YI > lx — c3 1/ 2. The inner integral equals 2, so

E

f

1b3(y)1 dy < 411f

IHL5 (x)Idx <

31

Our proof of the weak (1,1) inequality is for non-negative f , but this is sufficient since an arbitrary real function can be decomposed into its positive and negative parts, and a complex function into its real and imaginary parts.

54

3. The Hilbert Transform

4. Truncated integrals and pointwise convergence

55

(2) Since H

is weak (1, 1) and strong (2, 2), by the Marcinkiewicz interpolation theorem we have the strong (p,p) inequality for 1 2 we apply (3.6) and the result for p < 2: 11H

f

li p = sup {If H f 91:

{if

sup

f • Hgi :

_ II f Ilp suP{Ill

_< 1} _<

1}

In the Schwartz class it is straightforward to characterize the functions whose Hilbert transforms are integrable: for 0 E S, Hq E Li- if and only if f cb = 0. We leave the proof of this as an exercise for the reader. For more information on the behavior of the Hilbert transform on Ll, see Section 6.5. In Chapter 6 we will examine the space of integrable functions whose Hilbert transforms are again integrable. and the space (larger than L°°) in which we can define H f when f is bounded. (Also see Section 6.6.)

Cp'ii flip.

The functions g and b in the proof of the first part of this theorem are traditionally referred to as the good and bad parts of f. It follows from these proofs that the constants in the strong (p', p') inequalities coincide and tend to infinity as (p, p) and p tends to 1 or infinity. More precisely, Cp =-- 0(p) as p cc, and Cp = 0 ((p - 1) -1 ) as p -3 1. By using the inequalities in Theorem 3.2 we can extend the Hilbert transform to functions in LP, 1 < p < oo. If f E L 1 of functions in S that converges to and ff, is a sequence f in Ll (i.e. lim lifr, - flu = 0), then by the weak (1, 1) inequality the sequence {Hf,} is a Cauchy sequence in measure: for any E > 0 , lim

I{X E R : j(H - Hfm )(x)i > E}1 = 0.

Therefore, it converges in measure to a measurable function which we define to be the Hilbert transform of f. If f E LP, 1 < p < oo, and {A} is a sequence of functions in S that converges to f in LP, by the strong (p,p) inequality, sequence in LP, {HA} is a Cauchy so it converges to a function in LP transform of f . which we call the Hilbert In either case, a subsequence of {H fn}, depending on f, converges pointwise almost everywhere to H f as defined. The strong (p,p) inequality is false if p 1 or p oo; seen if we let f x[0 , 1 1. Then this can easily be

4. Truncated integrals and pointwise convergence

For E > 0, the functions y-1X{IYI>E} belong to Lq (R) , 1 < q < oo , so the functions f(x) = f f (x Y) dy IYI›(

Y

are well defined if f E LP, p > 1. Moreover, HE satisfies weak (1, 1) and strong (p, p) estimates like those in Theorem 3.2 with constants that are uniformly bounded for all E. To see this, we first note that (y1

X{Iy1>€})

=

e -27riyC •A

= lim

< lyl
fc
= -2i sgn (0 lim

dy

. is n(27y) dy y

N—>00 f

2/rNK1 sin ( t )

t

dt.

This is uniformly bounded, so the strong (2, 2) inequality holds with constant independent of E. We can now prove the weak (1, 1) inequality exactly as in Theorem 3.2, and the strong (p,p) inequalities follow by interpolation and duality. If we fix f E LP, 1 1 and in measure if p = 1. To see this, fix a sequence { fm } converging to f in L. Then H f = lim H = lim lira HE fn = lim lim H E fn = lim HE f; 71 .- CX) E.-4)

H f (x) and H f

1 7

log

i x x -

is neither integrable nor bounded.

1, 1

the second and third equalities hold because of the corresponding (uniform) (p, p) inequality. We now want to show that the same equality holds pointwise almost everywhere.

56

3. The Hilbert Transform

Hence,

Theorem 3.3. Given f E LP, 1 < p < oo, then

1

H f (x) = !in(13 H,f(x) a.e. x E

(3.7)

—

Y

Since we know that (3.7) holds for some subsequence 111,,f1, we only need to show that lira H,f (x) exists for almost every x. By Theorem 2.2 (and the remarks following it) it will suffice to show that the maximal operator -

H* f (x) = sup1H, f (x)1 E>0

is weak (p,p). This however, follows from the next result. Theorem 3.4. H* is strong (p, p), 1 e}

Lemma 3.5. If f E S then H* f (x) M (H f )(x) + CM f (x). Proof. It will suffice to prove this inequality for each HE with a constant

independent of E. Fix a function 0 E S(R) which is non-negative, even, decreasing on (0, co), supported on {x E R : lx1 < 1/2} and has integral 1. Let 0,(x) = ( -1 0(x/E). Then 1 1 1 1 (Y)] - X{l u j>,} = ( OE * P. v. -x (Y) + [ X{ly>(} ( OE * P. v. x and the convolution of the first term on the right-hand side with f is dominated by M (H f)(x) (cf. Proposition 2.7). It will suffice to find the pointwise estimate for the second term when e 1 since it follows for any other € by dilation. If 1y1 > 1 th en

Y

1

0(x) X

i lxl<1/2 Y

dx =

0(x) (1 f ixl<1/2

]. /2 Lk C -2- ;

1 ) dx y-x

/(x)IxI dx

- lim

6-4) fi x l > j

x

< C.

—

transform are strong (p,p), 1 0. Now fix A > 0 and form the Calder6n-Zygmund decomposition of f at height A. Then we can write f as f=g+b=g+Eb 3 . The part of the argument involving g proceeds as in Theorem 3.2 using the fact that H* is strong (2,2). Therefore, the problem reduces to showing that I{x V 9* : H* b(x) > All 5

11b111.

Fix x S2*, e > 0 and bi with support Ii. Then one of the following holds: = .ri , (2) (x - €, x + n = 0,

(1)

(X —

E, X + E) n

(3) x — E E or x + E E In the first case, II,bj(x) = 0. In the second, H,bi(x) = Hbi(x); hence, if we let cj denote the center of Ii , since bi has zero average, 1 1 1H,bj ( x)1 _< Ibi(y)I dy <_ y x Ix -

3

In the third case, since x Cr, Ii C (x - 3e, x 3€), and for all y E Ix - y1 > €/3. Therefore, 3 f x+3€ 1bi(y)Idy. dy < 1H,bi(x)1 < I 6 x -3E lx YI If we sum over all j's we get (

O(Y x) dx

C 1\ OE * P. v. x ) (Y) 5 1 + y 2 '

Proof of Theorem 3.4. Since both the maximal function and the Hilbert

1H,b x 1 <

if 1y1 < 1 then

— (

and by Proposition 2.7 the convolution of the right-hand term with f is dominated by M f (x).

To prove this we need a lemma which is referred to as Cotlar's inequality.

1 -

57

4. Truncated integrals and pointwise convergence

)

11b•111

Ix

CjI 2

IIihI -

3 E

Ilbj111

fx+3c x-3(

CMb(x).

ib(y)Idy

58

3. The Hilbert Transform

It follows from this that I{x ft

: H*b(x) > A}1

x S*:

Ix

lijicip

> A/21

2

fiR,2,i ix_ci I

cplIfIlp-

A

For an application of this result, let a = -R, b = R. Then Sa ,b is the partial sum operator SR introduced in Chapter 1: SRf = DR * f , where DR is the Dirichlet kernel. Hence,

5. Multipliers E L"(Rn),

Proposition 3.6. There exists a constant Cp , 1 < p < oo, such that for all

' III C d x+—Ibili l

-T-0111.

Given a function m by

L. By making the obvious changes in the argument we see that this is still true if a = -oo or b = oo. Hence, we have the following result. a and b, -oo < a A/2Q < - Pill - A

59

5. Multipliers

we define a bounded operator Tm on L 2 (Rn)

IISR.fllp 5- GplIfIlp with a constant independent of R. This yields the following corollary. Corollary 3.7. If f E LP(R), 1 0 and let be a measurable subset of {x E JR' : Im(x)! > limbo - €} whose measure A is finite and positive, and let f be the L 2 function such that f =- xA. Then

11 Trnf 112 > (11 M Ilco — 6 )11f 112'

We say that m is the multiplier of the operator Tm , though occasionally we will refer to the operator itself as a multiplier. When Tm can be extended to a bounded operator on LP we say that m is a multiplier on L.

For example, the multiplier of the Hilbert transform, m(e) --= sgn(e), is a multiplier on L. More generally, given a, b E R, a < b, define ma,b(0 = X(a,b)(0 • Let Sa ,b be the operator associated with this multiplier: We have the equivalent expression i

Sa,b = - (Ma HM_ a - MbHM-b), 2

where Ma is the operator given by pointwise multiplication by e maf (x) e 2iriax f(x).

When p = 1 we do not have convergence in norm but only in measure: lim Ifx E R :ISRf (x) .f(x)1> Eli = 0. From this result and from Corollary 3.7 we see that there exists a sequence {SR, f (X)} that converges to f(x) for almost every x, but the sequence depends on f. Given a family of uniformly bounded operators on LP, any convex combination of them is also bounded. Hence, starting from Proposition 3.6 we can prove another corollary. Corollary 3.8. If m is a function of bounded variation on R, then m is a

multiplier on LP, 1 < p < oo. Proof. Since m is of bounded variation, the limit of m(t) as t -oo exists, so by adding a constant to m if necessary we may assume that this limit equals 0. Furthermore, we may assume m is normalized so that it is right continuous at each x E R. Let dm be the Lebesgue-Stieltjes measure associated with m; then

(Sa,bf ) - (0 X(a,b)(0/ (O•

(3.9)

lim IISRf - flip = 0.

2'

ax ,

To see this, note that by (1.16) the multiplier of iMa HM, is sgn(e - a). Ma is clearly bounded on LP, 1 < p < oo, with norm 1. Therefore, from (3.9) and the strong (p, p) inequality for H, we get that So is bounded on

m(e) = J dm(t) = -co

x(_,„,,o(t)dm(t) = I x( t ,,,)(e)dm(t). 1R

Therefore, (Tmf)-(0

=

f X(t,oc)(0/(0 dm(t),

60

3. The Hilbert Transform

and so Tm f (x) = f St,c. f (x) dm(t).

By Minkowski's inequality

Ithnl(t)

cplIfIlp f Idml(t);

the integral of Idmi is the total variation of m and by assumption this is finite. Given a multiplier, we can construct others from it by translation, dilation and rotation. Proposition 3.9. If m is a multiplier on LP(118 71 ), then the functions defined by m.(" + a), a E R n , m(A0, A > 0, and m(p0, p E 0(n) (orthogonal

transformations), are multipliers of bounded operators on LP with the same norm as Trn .

The proof of this result follows at once from properties (1.16), (1.17) and (1.18) of the Fourier transform. If in is a multiplier on LP(R), then the function on R n given by in() = m(1) is a multiplier on LP(Rn). In fact, if Tm is the one-dimensional operator associated with m, then for f defined on R n , Tin f (x)

(• • x2, • • • xv)(xi)•

Then by Fubini's theorem and the boundedness of Tin on LP(R), f (x)1P dx =

f

< C

(LITrn f (-, x 2 ,

. ,x n )(x i )IP dxi) dx 2 • • dx n

fR If (x i , . . . , x n )IP dxi • • dxn•

If we take m = x( 0 , 0„), then by Proposition 3.6 m is a multiplier on LP(R), 1 0} will be a multiplier on LP(R n ) Further, by Proposition 3.9 the same will be true for the characteristic function of any half-space (since it can be gotten from the one above by a rotation and translation). The characteristic function of a convex polyhedron with N faces can be written as the product of N characteristic functions of halfspaces, so it is also a multiplier of L P (R n ), 1 < p < oo. This fact has the , llowing consequence. Corollary 3.10. If P C Rn is a convex polyhedron that contains the origin,

then

lim IISAp f — f = 0 , 1 < p < CC,

6. Notes and further results

61

where Sap is the operator whose multiplier is the characteristic function of AP -= fAx : x E Pl.

In light of the above observation, it is perhaps surprising that when n > 1, the characteristic function of a ball centered at the origin is not a bounded multiplier on LP018' ), p 2. We will examine this and related results for multipliers on IV in Chapter 8. (Also see Section 6.8 below.) 6. Notes and further results 6.1. References.

The proof of the theorem of M. Riesz first appeared in Sur les fonctions conjuguees (Math. Zeit. 27 (1927), 218-244) but had been announced earlier in Les fonctions conjuguees et les series de Fourier (C. R. Acad. Sci. Paris

178 (1924), 1464-1467). The proof in the text became possible only after the Marcinkiewicz interpolation theorem appeared in 1939. See Section 6.3 below for the original proof by Riesz and another one due to M. Cotlar (A unified theory of Hilbert transforms and ergodic theorems, Rev. Mat. Cuyana 1 (1955), 105-167). Additional proofs due to A. P. CalderOn (On the theorems of M. Riesz and Zygmund, Proc. Amer. Math. Soc. 1 (1950), 533-535) and P. Stein (On a theorem of M. Riesz, J. London Math. Soc. 8 (1933), 242-247) are reproduced in Zygmund [22]. Kolmogorov's theorem is in Sur les fonctions harmoniques conjuguees et les series de Fourier (Fund. Math. 7 (1925), 23-28). Another proof due to L. H. Loomis (A note on Hilbert's transform, Bull. Amer. Math. Soc. 52 (1946), 1082-1086) can be found in Zygmund [22] and an elegant proof due to L. Carleson is given by Katznelson [10]. Our proof uses the CalderOn-Zygmund decomposition in the same way that it appears in Chapter 5 for more general singular integrals. It is derived from the proof of A. P. Calder& and A. Zygmund (On the existence of certain singular integrals, Acta Math. 88 (1952), 85139). Theorem 3.5 is in the paper by M. Cotlar cited above. 6.2. The conjugate function and Fourier series.

The results in this chapter can also be developed for functions defined on the unit circle. Historically, they evolved in close connection with the theory of Fourier series discussed in Chapter 1. Given a function f E L l (T) (equivalently, a function in LI([°, 1])), its harmonic extension to the unit disk is gotten by convolution with the Poisson kernel Pr

(t)

1 — r2

—

1— 2r cos(27rt) r2 '

0 < r < 1,

62

3. The. Hilbert Transform

, -1 the harmonic conjugate of Pr * f (t) is gotten by convolution with the conjugate Poisson kernel Q r (t) =

2r sin(27t) 0 < r < 1. 1 - 2r cos(27t) + r 2'

The analogue of the Hilbert transform is the conjugation operator which is defined by (3.10)

.7(x) = lira Q r * f (x) = lim r-1

6-40 Lt<1/2

circle, let u = Pr * f and v = Qr * f ; then F(z) = u + iv is analytic in the unit disc and hence so is F(z) 2k . By Cauchy's theorem 1 f 27i

The theorems of M. Riesz and Kolmogorov hold for the conjugation operator. Hence, by an argument similar to the one in Section 5 it can be shown that the partial sum operators SN f of the Fourier series of f are uniformly bounded on LP and so SNf converges to f in LP. (Cf. Chapter 1, Section 4.)

fo 2ir

S[f](x) =

-i sgn(k)f(k)e2Iriks;

-00 when the Fourier series of f is written as in (1.1), then the conjugate series is gotten by switching the coefficients of the sine and cosine terms and taking their difference instead of their sum: 00 = ak sin(2xkx) - bk cos(27kx). k=o

E

When f E L l then

coincides with the Fourier series of

I.

Even for continuous functions the second limit in (3.10) exists because of subtle cancellation and not because the numerator gets small for t close to zero. It can be shown using the Baire category theory that there exists a continuous function f such that for every x E [0, 1],

fo

1/2 I

— t) — f + dt = Do. tan(7t)

The same phenomenon occurs with the Hilbert transform. For these and further results, see the books by Bary [1], Katznelson [10], Koosis [11] and Zygmund [21]. 6.3. Other proofs of the M. Riesz theorem. The original proof of M. Riesz of the LP boundedness of the conjugate function is based on the analyticity of powers of an analytic function. The easiest case is when p = 2k, k an integer. Given a function f on the unit

z

iv(re a ) 12k dt < E

(2k) /

j=1 \2j

Iv(reit)12k-23

2-

dt.

0

By applying Holder's inequality (with the appropriate exponent) to each term on the right-hand side, we get /27r

i u ( re it)12k

dt < Ck

27r

lu(reit)12k dt,

(3

,

Given a function f E L 1 (T), its conjugate Fourier series is the trigonometric series

t er F(re")2k dt = F(0) 2k dz = — 27 0

F(z) 2 k 1

If F(0) = 0 then by taking the real part of F( e") we get that

f (x - t) - f (x + t) dt. tan(7t)

Both limits can be shown to exist for almost every x.

63

6. Notes and further results

where Ck is independent of the radius of the circle. Taking the limit as the radius tends to 1, u tends to f and v to f . (When F(0) = f(0) 0 replace

f by f—f(0).) From the inequality for p 2k, the whole theorem can be obtained using interpolation and duality, but since interpolation was not available in 1923, Riesz next considered the case when p > 1 is not an integer. In order to get that (u+iv)P is analytic, he assumed u > 0 (which is true if f is non-negative and not identically zero) and then used a clever estimate. For technical reasons, this argument does not work for p an odd integer, and the inequality for these values of p is gotten by duality. In a letter dated November 1923, Riesz explained to Hardy how he arrived at the proof and showed him the essential details (reproduced in M. L. Cartwright, Manuscripts of Hardy, Littlewood, M. Riesz and Titchmarsh, Bull. London Math. Soc. 14 (1982), 472-532). From the fact that (u + iv)2 = u2 v 2 2iuv is analytic we deduce H(f 2 - (H f) 2 ) = 2f • H f, which gives the following formula due to Cotlar: (3.11)

(H f) 2 = f 2 + 2H( f • H f).

An alternative way to prove this (proposed by Cotlar in his paper) is to show directly that Hf*Hf =

f * f — 2i sgn(0(f*Hf).

It follows from (3.11) that if Ilk/flip 5 .✓c?, +1)11f112p. In fact,

5

cplIfIlp, then II H f II2P < (CC +

+ 2 iill(f .11 f)iip

Ilfflp+2cpil f H f lip

64

3. The Hilbert Transform

IlfIl2p+ 2 cplIf112pIliff112p

65

6. Notes and further results

The best constant in the weak (1, 1) inequality has a more complex expression:

and the desired inequality follows by solving a quadratic equation. If we begin with (3.4) and repeatedly apply this inequality we get, for instance, that 11H f 112k < (2 k - 1)111. 11 2 k for integers k > 1. By the RieszThorin interpolation theorem (Theorem 1.19), we get

IlHflip

cplIfIlp,

p

> 2.

We get the Riesz theorem for p < 2 by using duality as in the proof of Theorem 3.2.

Cl

=

1 + 1

+

+•••

1

This was first proved by B. Davis (On the weak type (1, 1) inequality for conjugate functions, Proc. Amer. Math. Soc. 44 (1974), 307-311) using Brownian motion. A non-probabilistic proof was later given by A. Baernstein (Some sharp inequalities for conjugate functions, Indiana Univ. Math. J. 27 (1978), 833-852).

An elementary real-variable proof when p = 2 (without using the Fourier transform) is the following:

11 1-1,111 2 = f

- HEHEf)

s

11f11211H,H€1112,

and the kernel of HE HE , which is the convolution of y -1 x{i y 1,0 with itself, can be computed explicitly and shown to be in Ll. (By a dilation argument, it suffices to consider the case E = 1.) This proof is attributed to N. Lusin (1951) by E. M. Dyn'kin (Methods of the theory of singular integrals: Hilbert transform and CalderOn-Zygmund theory, Commutative Harmonic Analysis, I, V. Havin and N. Nikolskii, eds., pp. 167-259, Springer-Verlag, Berlin, 1991). A similar argument had been used earlier by Schur for a discrete version of the operator. Another real-variable proof when p = 2 using Cotlar's lemma is in Chapter 9, Section 1.

6.4. The size of constants. The best constants in the theorems of Kolmogorov and M. Riesz are known. For the strong (p, p) inequality, 1 < p < co, the best constants are Cp = tan(ir/2p), 1 < p < 2, and Cp cot(ir/2p), 2 <- p < co. This was first proved by S. K. Pichorides (On the best values of the constants in the theorems of M. Riesz, Zygmund and Kolmogorov, Studia Math. 44 (1972), 165-179); a much simpler proof was later given by L. Grafakos (Best bounds for the Hilbert transform on LP(111 1 ), Math. Res. Let. 4 (1997), 469-471). In the proof of the M. Riesz theorem due to Cotlar given above, we get the estimate C2p < Cp N/q, + 1. If we begin with C2 = 1, the resulting bounds for C2 k are sharp. This fact was first observed by I. Gohberg and N. Krupnik (Norm of the Hilbert transformation in the space LP, Funct. Anal. Appl. 2 (1968), 180-181).

6.5. The Hilbert transform on Ll. As we noted in Section 3, if f E Ll then H f need not be. However, unlike the case of the maximal function, there exist f E L i such that H f E L l . For example, if f = x( 04 ) - x ( _ 1 , 0 ) then H f (x) log(1x 2 - 11/x 2 ) is integrable. A simple necessary condition for H f to be integrable if f E L l is that f(0) = f f = 0. If f is such that the identity (H f) - (1;) = sgn(0M) holds, then it suffices to note that the Fourier transform of an integrable function is continuous, and f (0 and -i sgn(0 f (0 are both continuous only if f(0) = 0. This argument clearly works if f E L 1 fl L 2 since in this case the Fourier transform identity is valid. But one can show that this identity holds if f, H f E Ll. This would follow if cb * (H f) = H(0 * f) for every E S: take the Fourier transform of both sides and simplify. This equality can be proved as is done for more general singular integrals in the paper by A. P. Calder& and 0. Capri (On the convergence in Ll of singular integrals, Studia Math. 78 (1984), 321-327). Similarly, when f,Hf E L l we have the identity H(Hf) = -f. We originally gave this in (3.5) for L 2 functions, and it can be extended to LP, 1 < p < co, using the results in this chapter. For f and H f integrable, a proof using complex analysis is due to E. Hille and J. D. Tamarkin (On the absolute integrability of the Fourier transform, Fund. Math. 25 (1935), 329352). The interesting paper by J. F. Toland (A few remarks about the Hilbert transform, J. Funct. Anal. 145 (1997), 151-174) examines this question and provides additional historical information. In general, H f Ll , so we cannot even define the Hilbert transform of H f . Nevertheless, if we replace the Lebesgue integral with a more general one, then it is possible to define H(Hf) and to get the identity. (See Zygmund [21, Chapter 7].) The above-cited paper by Toland uses this extended notion of the integral.

66

3. The Hilbert Transform

13.6.

L log L estimates.

The following result characterizes the local integrability of the Hilbert transform; it is analogous to Theorem 2.18 for the maximal function. Theorem 3.11. Let B and C be two open balls such that (1) If f log+ If

I c L l (C) , then Hf

B C C.

E L 1 (B).

(2) Conversely, if f > 0 and Hf E L l (C), then f log+

Ifi E L 1 (B).

The first part of the theorem is due to A. P. CalderOn and A. Zygmund (see the paper cited above). It can be sharpened using Orlicz spaces (cf. Chapter 2, Section 8.4, and Zygmund [21, Chapter 4]). The second part is due to E. M. Stein (Note on the class L log L, Studio, Math. 32 (1969), 305310). The hypothesis f > 0 can be replaced by a weaker condition which measures (in some sense) the degree to which Pt * f differs from a positive harmonic function. See the article by J. Brossard and L. Chevalier (Classe L log L et densite de l'integrale d'aire dans R n++1 , Ann. of Math. 128 (1988), 603-618). 6.7. Translation invariant operators and multipliers.

Let Th be the translation operator: Th f (x) = f (x — h). An operator T is said to be translation invariant if it commutes with Th o T = T o T h , h E Rn. The Hilbert transform, for example, is translation invariant. Such operators are completely characterized by the following result. Theorem 3.12. Given a linear operator T which is translation invariant and bounded from LP (Rn) to Lg (Rn) for any pair (p, q), 1 < p, q < oo, then there exists a unique tempered distribution K such that Tf = K * f, f E S.

6. Notes and further results

67

multiplier of T, at least if k E following the definition in Section 5. But if T is bounded on LP, then by duality (since its adjoint is the convolution with kernel K(x) = K(—x)) it is bounded on LP'. Thus by interpolation it is bounded on L 2 , so the condition that K E L°° is necessary, and our assumption that a multiplier m. is in L" is justified. It would be interesting to characterize the distributions K that define bounded operators (equivalently, to characterize the multipliers on LP), but a complete answer is known only in two cases. Proposition 3.13. (1) T is bounded on L 2 (Rn) if and only if k E L°° and the norm of T as an operator on L 2 is the norm of K in L' . (2) T is bounded on L l (R n ) if and only if K is a finite Borel measure, and the operator norm of T equals the total variation of the measure.

Note that since p. v. 1/x is not a measure. Proposition 3.13 gives another proof that the Hilbert transform is not bounded on L l . For all of these results, see Stein and Weiss [18, Chapter 1]. 6.8. Multipliers of Fourier series.

The multipliers of trigonometric series are precisely bounded sequences, {)(k)}kEz. The associated operator is defined for f E L 2 (7) by TA f (x) =

(k) f (k)e27rikx kEz

(For multiple Fourier series the definition is the same except that we take k E Zn .) TA can be written as the convolution of f with a distribution whose

If such a T is bounded from LP to Lq and not zero, then p cannot be greater than q. An elegant and very simple proof of this fact is due to L. HOrmander (Estimates for translation invariant operators in LP-spaces, Acta Math. 104 (1960), 93-439). First note that limh_, 09 IIf + T1 fll p = 231 11f 1113. Let IITII Thq be the norm of T as a bounded operator from LP to Lg . Since T commutes with translations,

Ilf + Thflip;

iiTf +

1

if we let h tend to infinity we get IlTf

214— 11 q11 71 1p,q 11f I lp,

Fourier coefficients are {A (k)}, and there are results analogous to those in Proposition 3.13. If we know the boundedness properties of a multiplier on LP(111), then we can deduce results for multipliers of series by restricting the multiplier to the integers. Theorem 3.14. Given 1 < p < oo, suppose the operator T defined by Tf = K* f is bounded on LP(R). If K is continuous at each point of Z and A(k) = K(k), then the operator TA associated with this sequence is bounded on LP(T) and IIT11•

and this is possible only when p < q.

There is also an analogous n-dimensional result. For a proof see Stein and Weiss [18, Chapter 6].

Clearly, if T f = K * f then T is linear and translation invariant. By the properties of the Fourier transform, (T f) = K f , which implies that K is a

Starting from Theorem 3.14, one can deduce results for the partial sum operators SN of a Fourier series from the corresponding results for SR in

68

1

3. The Hilbert Transform

Chapter 4

Section 5, and in particular one gets the convergence in LP norm of the partial sums of a Fourier series. (Cf. Chapter 1, Section 4.)

1 1

6.9. The Hilbert transform of characteristic functions. If A is a subset of R with finite measure, one can show that 2IAI sinh(7rA)• This result was first proved by E. M. Stein and G. Weiss (An extension of a theorem of Marcinkiewicz and some of its applications, J. Math. Mech. 8 (1959), 263-284). A simple proof is given by Zygmund [22, p. 15]. Rx E R : 1 -11(XA)(x 1 > )

=

Singular Integrals (I)

1. Definition and examples The singular integrals we are interested in are operators of the form

CNY') f (x - y) dy, T f (x) = Limo y

(4.1)

where C2 is defined on the unit sphere in Rn, Sn -1 , is integrable with zero average and where y' = With these hypotheses, (4.1) is defined for Schwartz functions since it is the convolution of f with the tempered distribution p. v. C2(x')/lx In, defined by

(4.2) p. v.

fl(x')

(g5) = limo q(x) dx E---4) fi x >, lxin S2( ) [0(x) = fixi<1 ix in

0(0)] dx +

51 (x')

0(x) dx.

fixt>i

(The second equality follows since SI has zero average.) Since 0 E S, both integrals converge. Note that we could also assume, for instance, that 0 in (4.2) or f in (4.1) is a Lipschitz function of order a with compact support. Proposition 4.1. A necessary condition for the limit in (4.1) or (4.2) to exist is that 11 have zero average on Sn -1 . Proof. Let f E S(Rn ) be such that f (x) = 1 for any IxI < 2. Then for IxI < 1 ,

T f (x) =

Q(i) dy.

f (x - y) dy + lim c-4)

Lly1<1 69

1

70

4. Singular Integrals (I)

f

The first integral always converges but the second equals lim

s."-1

❑

In higher dimensions we consider two examples given by A. P. CalderOn and A. Zygmund. First, let f (xi, x2) be the density of a mass distribution in the plane. Then its Newtonian potential in the half-space R 3+ is f (Yi, Y2) dyidy2• L2 (X1 — y i ) 2 + (X2 + Y2) 2 + X3

u(x i , x 2 , x3) =

The strength of the gravitational field is gotten by taking partial derivatives of the potential. The component in the x 3 direction is equivalent to a multiple of the Poisson integral which we have already considered. The other two are similar to one another; for example, for the first we formally have that lim —, xi ( X2 , , X3 )

71

f (x)0A (x) dx A a f f (x)¢(x) dx,

When n 1 the unit sphere reduces to two points, 1 and -1, and fl must take opposite values on them. Thus on the -real line, any operator of the type in (4.1) is a multiple of the Hilbert trans form.

x3-4) OX1

Given any function 0, define OA (x) = A -71 0(A -l x); then

(V) du (y') • log(1/€),

and this is finite only if the integral of 52 on Sn -1 is zero.

au ,

2. The Fourier transform of the kernel

so we can define the homogeneity of a distribution as follows. Definition 4.2. A distribution T is homogeneous of degree a if for every test function 0- it satisfies

T(0A) = A a T ( 0)•

A simple computation shows that the distribution given by (4.2) is homogeneous of degree -n; details are left to the reader. Proposition 4.3. If T is a tempered distribution which is homogeneous of degree a, then its Fourier transform is homogeneous of degree -n - a. Proof. By Definition 1.16 and property (1.18), if 0 E S then

t(cbA) = T(:kA.)) = A - nT(ch-i) = A - n - aT() = A - n - aP¢).

f (yi, y2) (xi — y1)

= — iv [(x1 — yi) 2 + (x2 — y2)2.13/2 dyidy2.

This integral does not converge in general, but it exists as a principal value if f is smooth and is in fact the value of the limit of au/ax i . It corresponds to a singular integral of the form (4.1) in R 2 with C2(x') = -xi/jxj. (In polar coordinates this becomes cos(0).) The second example is gotten from the logarithmic potential associated with a mass distribution f (xi, x2) in the plane (i.e. the solution of the equation Au = f): 1 u(xi, x2) = I f (y i , y2) log ( y1)2 + (x2 _ y2)2]1/ 2 ) dyi 42. R2 {(x1 _ This integral converges absolutely if, for example, f has compact support, and partial derivatives can be taken under the integral sign. However, this is not possible for second derivatives: one can show that 0 2 u/Oxiax 2 is given by an operator of the form (4.1) with 52(x') = 2xix2/1x1 2 . For more information on this operator, see Section 5.

2. The Fourier transform of the kernel A function f is homogeneous of degree a if for any x E Rn and any A > 0, f (Ax) = A a f(x).

As a corollary to this proposition we can easily calculate the Fourier transform of f (x) = ixi -a if n/2 < a < n. For in this case f is the sum of an L 1 function (its restriction to { < 1}) and an L 2 function, so by Proposition 4.3, f is a homogeneous function of degree a - n. Hence, since f is rotationally invariant, f = ca nI a n. We calculate c a ,, using Lemma 1.14 and (1.21): ,

e'l x12 1xra dx = c a ,,„

EV

e'l x12 1xl a- n dx;

since _ 2

fo e " r" dr =

1

-Fb I2

1"

(1 + 2 )

we see that (4.3)

(IXI

a

)(0 = xa

(r) Isl a—n

F (1)

In fact, this formula holds for all a, 0 < a < n. For 0 < a < n/2 it follows from the inversion formula (1.22). Further, since Ixl - a tends to 141/2 as a n/2, and since the Fourier transform is continuous, (4.3) holds with a = n/2.

72

4. Singular Integrals (I)

Theorem 4.4. If 12 is an integrable function on Sn -1 with zero average, then the Fourier transform of p. v. f2(4/IxITh is a homogeneous function of degree 0 given by

(4.4)

m(e) = Isn-iC2(u) [

log

(iu .10) i sgn(u • 6')] da(u).

Proof. By Proposition 4.3 the Fourier transform is homogeneous of degree

lel = 1. Since 1-2 has zero average,

0; therefore, we may now assume that

9(0 e -27riy• dy

m(e. ) urn

1) dr + .1 116 e-27,-iru•e rue ( e -2 7r2 = inn f fl(u) [f_ -

0)

s. -1

da(u).

r

Thus m(e) = I 1 - i12, where 1

=lim f

s .-1

C2(u) f (cos(271- ru • e) - 1)

r

]

do- (u),

1/E dr] f2( ) [1 sin(27rru • 6) da(u).

By the dominated convergence theorem we can exchange the limit and outer integral. In each inner integral make the change of variables s 27rriu • el. We may assume that u • e 0, so in /2 we get 271u.C1/e

As e 0 this becomes sgn(u • e)

sin(s) sgn(u

co sin(s) •

ds =2 sgn(u e).

ds cos(s)—

1 271/./.1 -7

s

cos (cos(s — 1 (s) ds log l27 - log lu • el. s )

1-2€ (u) = 2 (52(u) +S2(-u)), 52 0 (u) = 2 (S)(u) - S2(-u)), Corollary 4.5. Given a function Cl with zero average on Si1-1 , suppose that 12 O E L 1 (S n-1 ) and St e E L 9(sn-1 ) for some q > 1. Then the Fourier transform of p. v. Q(4/1x1n is bounded.

I 12 e(u)1 log+ i12c(u), da(u) < oo. (Recall that log + t = max(0, log t).) The sufficiency of this condition follows from the inequality A> 1, B > O. AB < A log A + e B , For in the region D = {u E

: 1 51 (U)I ^ 1},

S2(u) log ( lu 1.0 da(u) )

S2 is bounded in the complement of D, and so the integral is finite.

as € ---4 0 this becomes )

In formula (4.4) the factor multiplied against S2 has two terms: the first is even and its contribution is zero if S2 is odd; it is not bounded but any power of it is integrable. The second is odd and its contribution is zero if 52 is even; further, it is bounded. Since any function 52 on STh -1 can be decomposed into its even and odd parts,

5 L2,Q(u)llog(211-2(u)l) do- (u) + ID Iu • '1 -112 do- (u).

ds

—.

After the change of variables in /1 we get f 1 d s 2'l u 'W' (cos(s) - 1)— + .127,-ki.ek s 1

, the constant ❑

From (4.4) one can easily find an integrable function f2 such that m is not bounded. Nevertheless, in Corollary 4.5 we can substitute the weaker hypothesis ft, E L log L(Sn -1 ), that is,

dr

+ I cos(27rru • e) (r /2 = lirn u f

If we integrate against f2, which has zero average on terms disappear and we get the desired formula for m.

we immediately get the following corollary.

' O Llyi
E

73

3. The method of rotations

3. The method of rotations

Corollary 4.5 together with the Plancherel theorem gives sufficient conditions for operators T as in (4.1) to be bounded on L 2 . In this and the following section we develop techniques due to Calder& and Zygmund which will let us prove they are bounded on LP, 1 < p oo. Let T be a one-dimensional operator which is bounded on LP (R) and let u E Sn-i . Starting from T we can define a bounded operator Tu on R n as follows: let L t, = {Au : A E R} and let L ul be its orthogonal complement in RTh. Then given any x E R", there exists a unique Xi E R and x E Liu- such

74

4. Singular Integrals (I)

that x = xiu + X. Now define Tu f (x) to be the value at x1 of the image under T of the one-dimensional function f (• u + If Cp is the norm of T in LP(R), then by Fubini's theorem, f (x)IP dx = f

u

f IT (.t. (- + ±)(xi)IP

f

f If (.0 1.4,- IR

dXidX

)(X1)17) dXidX

If(x)IP dx. Operators gotten in this way include the directional Hardy-Littlewood maximal function, 1 fh Mu f (x) = sup— 2 I f (x - tu)I dt, h> 0 h —h and the directional Hilbert transform,

1

00.

If we let Q(u) = 1 for all u, then M0 becomes the Hardy-Littlewood maximal function in Rn; thus the method of rotations shows it is bounded on LP(Rn), p > 1, starting from the one-dimensional case. Further, it is interesting to note that the LP constant we get is 0(n) since 1S n-1 1/1B ( 0 1 )1 = n. (See Chapter 2, Section 8.5.) On the other hand, the method of rotations does not give us the weak (1, 1) result. (See Section 7.6.) We can also apply Proposition 4.6 to operators of the form (4.1) when Q is odd. In fact, if we fix a Schwartz function f , then T f (x) = lim f f

Proposition 4.6. Given a one-dimensional operator T which is bounded on LP(R) with norm Cp , let Tu be the directional operators defined from T. Then for any C2 E V(5' 1 ), the operator 7h defined by

-

R) I fs(o,R)

moilf (x _ widy.

If we rewrite this integral in polar coordinates we get 114-Qf (x) = R su>p

= 1 lim f 2E

-1 drdo- (u) 1B(0,1)1Rn is,,--11C2(u)1 ff I (x - ru)Irm

dr f (x - ru) — do- (u) r

dr

firl>f

f - ru) — da(u);

.---,

0 Sn.-1

dr

C2(u) L iri
+ 1- f 11(u) i 2 sn - 1

f (x - ru) — dr do- (u).

r

Because f E S, the inner integral is i ri> unilformly bounded, so we can apply the dominated convergence theorem to get

=

By using this result we can pass from a one-dimensional result to one in higher dimensions. Its most common application is in the method of rotations, which uses integration in polar coordinates to get directional operators in its radial part. For example, given Q E L l (gn 1 ), define the "rough" maximal function

00

since I has zero average, we can argue as we did in (4.2) to get

fl(u)Tu f (x) do- (u)

is bounded on L P (R n ) with norm at most CAQIIi.

1 -11Alf (X) = su P R > o IBA

Q(u)

6--,0 2 f

7T c—, 0 i t i >c

(4.5)

Q(u)

f

1

f To f (x) f

f

1B(0, 1)119 -1 P(u)1Muf (x) d°-(u)• Corollary 4.7. If Il E Li(sn-i) then M0 is bounded on LP(R n ), 1 p <

dt Hu f (x) - lim f f (x - tu)— .

Since the operators Tu obtained from the operator T are uniformly bounded on LP(Rn), any convex combination of them is also a bounded operator. Hence, the next result is an immediate consequence of Minkowski's integral inequality.

75

3. The method of rotations

f

Sn -1

1-2(u)Hu f (x) do- (u).

Since the Hilbert transform is strong (p, p), 1 < p < co, we have proved the following. Corollary 4.8. If Q is an odd integrable function on Sn -1 , then the operator T defined by (4.1) is bounded on LP(Rn), 1 < p < c.c.

In Corollary 4.8 we implicitly defined T f for f E LP as a limit in LP norm. However, as with the Hilbert transform, we can show that (4.1) holds for almost every x. By an argument similar to the one above we can show that the maximal operator associated with the singular integral, (4.6)

t

11( ) f (x - y) dy T* f (x) = su p E> o f yi>e

76

4. Singular Integrals (I)

4. Singular integrals with even kernel

77

satisfies

n +1 2

r (v)

f (x) < -7r2- fs”, i l52(u)1H,:f (x) do - (u), where IL: is the directional operator defined from the maximal Hilbert transform. (See Chapter 3, Section 4.) Therefore, we can apply Proposition 4.6 and Theorem 2.2 to get the following. Corollary 4.9. With the same hypotheses as before, the operator T* defined by (4.6) is strong (p,p), 1 < p < oc. In particular, given f E LP, the limit (4.1) holds for almost every x E

4. Singular integrals with even kernel If the function Cl in the singular integral (4.1) is even, then the method of rotations does not apply since we cannot represent the singular integral in terms of the Hilbert transform. However, we ought to be able to argue as follows: by (4.9), n

Rj f (x) c r, P. v. fRr, Yi

f ( x Y) d y, 1 < j < n,

where =

n

fq(T f) = j=1

j=1

and the operator R3 T is odd since it is the composition of an odd and even operator. If we can show that Ft3 T has a representation of the form (4.1), then by Corollary 4.8, T is bounded on LP. In the rest of this section we will make this argument precise by showing that Ri T has the requisite representation.

-*

in + 1 cn

T f

An important family of operators with odd kernels consists of the Riesz transforms, (4.7)

Sj

7r

2

•

The constant is fixed so that for f E L 2 ,

Let C2 be an even function with zero average in L 9 (SP -1 ), for some q > 1, and for € > 0 let

(4.8)

(4.10)

(Ri f ) ()

=

-

this in turn implies that

E

(4.9)

1

(4.11)

=- -I,

where I is the identity operator in L 2 . Since the Schwartz functions are dense in LP, p > 1, this identity in fact holds in every LP spat e. To prove (4.8) one could use Theorem 4.4 directly; instead we will argue as follows. With equality in the sense of distributions, — ixl-

n+1

= (1 -

n) p. v.

xj

(p. v.

142+ 1 )

Ri (K, * f) = (Ri K,) * f.

Lemma 4.10. With the preceding hypotheses, there exists a function IC

which is odd, homogeneous of degree -n and such that lim RjK,(x) = ki(x) in the L°° norm on every compact set that does not contain the origin.

RiK,(x) - RiK,(x) = c m lim f = 1 1 n (41xl -n+1 ) = 127i ni axi -n+1 )(0

( -V-) 7

= C r,

ly i<1.

xi

Y3

XII

sl

IX Y xi - yi C2(0 — Y l n+l 'Y in d

[K (y) - K (v)] dv

y;

since St has zero average,

27rki (1) 1

X{ixi›,}.

Proof. Fix x 0 and let 0 < e < v < lx1/2. Then for almost every such x, by Corollary 4.9

lxin+ 1

so if we apply (4.3) we get x

C2(e)

Note that K, E LT, 1 < r < q. Thus if f E C''')(118n) then by taking the Fourier transform we see that

j=i

a

Kf(x) =

1

=

xi - yi Lly1
_

xi ) xin+1

^(y) d IYIn

78

4. Singular Integrals (I)

Therefore, if we apply the mean value theorem to the integrand we get (4.12) x

D

1Q(01 dy < Cigi1 f 1/33K, (x) — RiK,,(x)1 < C n+1 c
Furthermore, if Kj,,(x) = kj (x)x{i x i > ,}, then A, = RiK, — and 11A€111 Cq'11Q11q.

IXIn+1

f

sr, i

lk i(u)I do-(u)

K; (x) = lim Ri K,(x)•

To find the desired function k3 , fix A > 0; then again for almost every R3K,(Ax) =lim cn

6—>0 flAs_y>8

Xi — Yi n K,/ A (y) cn 6—,0 4-y1>S/A — Yl n+

= liiu

dy

Hence, for almost every x, K;(Ax) = A -71 K; (x). The set of measure zero where equality does not hold depends on A, but since K; is measurable, the set =

{(x,

A) C 1871 x (0, oo) : K; (Ax)

—n

0

if x 0 and px/Ix1 D nsp; otherwise.

This function is measurable, homogeneous of degree —n (by definition) and odd. Further, K; (x) = kj(x) almost everywhere. To see this, let x 0 be such that x o = px/IxI Dn Sp . (The set of x such that xo E D n S,, has measure zero since D n Sp has measure zero.) Then for almost every .\, ki (Ax0) =- A — nki (x 0 ) = A'KI(x0) K;()xo)• Lemma 4.11. The kernel kj defined in Lemma 4.10 satisfies

f

j (u)I

1dx iki(x) _ (x) — Ri Ki/2(x)1 dx

I Kj (x) — RiKi/2(x)1 <

fl 91

•

Therefore, the first integral on the right-hand side above is bounded by To bound the second integral, note that ClIC2 111 < IR1K1 / 2 (x)1dx < C Rj K / 2 1l p < Ok i / 2 11 p < C Pi g .

do- (u) <

To prove the second assertion it will suffice to show that li 0 1111 < since A, = E —n Ai (E -1 X). But then = LIRJ,(x) — K3 , 1 (x)1 dx

K; (x)}

has measure zero. Therefore, by Fubini's theorem there exists a sphere centered at the origin of radius p, Sp , such that D n Sp has measure zero. We now define kj(x) = (Ix' )n KJ * (Ix'

1 log 2 1 f — log 2 II

1.<1xl<2

R K,/ ),(x).

=

(gn)

-+ 1 f IR1K1/2(x)1 dx. log 2 i 1; then if we take the limit as € —> 0 we get (4.13)

Ax — y3 yin+i K,(y) dy

Proof. By the homogeneity of kJ,

Hence, for any a > 0, {R i K,} is a Cauchy sequence in the L°° norm on {1xl> a}. So for almost every x we can define K; by The function R i K, is odd, so (by modifying K; on a set of measure zero if necessary) .K; is also an odd function.

79

4. Singular integrals with even kernel

lA i (x)1dx. 1Ki(x)1dx + f 2 ixi<2

The first integral is bounded by 01/33Ki li p < CiiKliiq 5 Cigiq; above we showed that the second integral has the same bound. To see that the third integral is bounded, we argue as we did for (4.13) to get l0i(x)1 < n-1.

❑

Theorem 4.12. Let Q be a function on Sn -1 with zero average such that its odd part is in L l (SP -1 ) and its even part is in Lq(Sn -1 ) for some q > 1. Then the singular integral T in (4.1) is bounded on L 1'(111n), 1 < p < oo. Proof. By Corollary 4.8 we may assume 7 is even. Further, by arguing as we did for the Hilbert transform, it will suffice to establish the LP inequality for functions f E C,"(R n ). For such f , T f = lim,_,0 K, * f. From (4.9) and (4.11) we see that K, * f =

n

R3 j=1

((R K,) * f),

80

4. Singular Integrals (I)

and in the notation of Lemma 4.11, * f =

(RiK,)

* f.

* f

By Lemma 4.10, ki is an odd, homogeneous kernel of degree -n, so by Corollary 4.9,

s.-- lk (u)I da(u)Ilf Ilp

* flipf

,

clIf2 11q1lf Ilp.

By Lemma 4.11,

f Ilp 5- IloElllllflp 5- Cigiqiif If we combine these estimates and use the fact that Rj is bounded in LP we see that

!IKE flip

81

5. An operator algebra

The multiplier p(o/ien is homogeneous of degree zero and its restriction to Sn -1 coincides with P(e'), so it is a C°° function. (In fact, it can be shown that T is a linear combination of compositions of Riesz transforms.) The operator T is not a singular integral of the type given in (4.1) since P() may not have zero average on ST 1-1 . However, we can get such a singular integral by subtracting a constant, that is, a multiple of the identity operator. This particular operator T is a special case of the operators in the following result. Theorem 4.13. If m E C'(30 \ {0}) is a homogeneous function of degree

0, and T, is the operator defined by (Tni f) = mf , then there exist a

cl1Q11q1lf Ilp.

= of p. v.

Since the right-hand side is independent of e, by Fatou's lemma we get gni) 5- ClgiglifilP and this completes our proof. Another proof of Theorem 4.12 is given below in Chapter 8. (See Corollary 8.21.)

Let P(e) = E a ba V be a polynomial in n variables with constant coefficients and let P(D) be the associated differential polynomial, that is, the operator given by =

b a Da f. a

It follows from (1.19) that (P(D)f)(e) = P( 27rie) f(0-

Define the operator A to be the square root of the positive operator -A: (4.14) (Af ) (0 = 271 W () • If P is a homogeneous polynomial of degree m, then (4.15)

P(D)f = T(Am

where the operator T is defined by (Tf)

52(' )

* f.

Since any homogeneous function of degree 0 is the sum of a constant and a homogeneous function of degree 0 with zero average on Sn -1 , Theorem 4.13 is an immediate consequence of the following lemma. Lemma 4.14. Let m E C'(Rn \ {0}) be a homogeneous function of degree

0 with zero average on Sn -1 . Then there exists C2 E C"(Sn -1 ) with zero average such that iii() = p.v.S2(x 1 )/Ixin.

5. An operator algebra

P(D)f

EC

and S2 E C"(Sn -1 ) with zero average such that for any f E S,

=

P(0

f (").

Proof. Since m is a tempered distribution, in exists. Hence,

( n r.n ) - n C rn()

ax:

where C is a constant. The function anm/04 is homogeneous of degree -n, in C°°(Rn \ {0}) and has zero average on Sn -1 . Furthermore, an rn anrn p. v. + E CaDaS, ax; , ,

axn

lal
where S is the Dirac measure at the origin, since the difference between anm/aCI and p. v. Onm/aC is a distribution supported at the origin. If we take the Fourier transform of both sides of this equation we get an m + E cc,(27riw. ) CCM(0 = (13 - v. axri i The left-hand side and the first term on the right-hand side are homogeneous distributions of degree 0, so the polynomial on the right-hand side reduces to a constant. Thus the right-hand side is a homogeneous function

82

4. Singular Integrals (I)

of degree 0 which is in C'(Ili n \ {0}). Since this is valid for 1 < i < n, coincides on \ {0} with a homogeneous function of degree -n. We will denote its restriction to Sn -1 by a To see that S2 has zero average on Sn -1 , fix a radial function ti5 E S which is supported on the annulus 1 < Ixl < 2 and which is positive on the interior. Then

=

f

R.”

12(x')

0(x) dx = c f S"-1

83

6. Singular integrals with variable kernel Let P(x, D) be a homogeneous differential polynomial with variable coefficients: P(x, D) =

b a (x)D°. lal=m

Since for f E S,

C2(u) do- (u),

Daf( x ) = f

where c > 0; furthermore, since is radial and m is homogeneous, rti(0) = m(g;) c f

6. Singular integrals with variable kernel

(2 7T i)apo e 27rzx.0

we have that P(x, D)f (x) = LP(x, 27i)f(e)e 2if i x•

m(u) do- (u) = 0.

Finally, to see that in, is identical to p. v. Q(x')/Ixin, consider their difference, C2(xl) - p. v. lxin

Further, with A as defined by (4.14), this has a representation analogous to (4.15): (4.16)

P(x, D)f = T(Ani f),

where T is defined by

which is supported at the origin. If we take the inverse Fourier transform of this difference we get a polynomial which must be constant since both m and (p. v. 52(x')/Ixin) are bounded. Further, this constant must be zero since both m and 11 have zero average on Sn-1.

❑

Theorem 4.15. The set A of operators defined by Theorem 113 is a com-

(4.17)

T f (x) = f n a(x, o- (x,

=

f(e)e 2 1'e

ck,

P(x,i0

The function a is homogeneous of degree 0 in the variable e. If we substitute the definition of f(e) into (4.17) we get

mutative algebra. An element of A is invertible if and only if m is never zero on ST1-1 .

( , e) f

T f (x) I a x

f

(y)e

-2nly• tdy e27'zx.1

de.

Thus formally, Proof. To see that A is an algebra, it is enough to note that given m l and m2 with associated operators Trn , and Tm2 , Tm , 0 Tm 2 Tmim2. Since the identity has the function 1 as its multiplier, Tm is invertible if and only if Ti hn E A, and 1/m E C"°(Rn \ {0}) precisely when m(e) 0 for any e E Sne-1. ❑

The operator T in (4.15) associated with the polynomial P is thus invertible if and only if P(e) is never zero on Sn 1 , that is, if P(D) is an elliptic operator. If the coefficients of P are real, then m must be even and Am = (-0)m/ 2 . Thus the problem of solving P(D)u = f reduces to solving (-A)m/ 2 u = T -1 f. (For more on this operator, see Section 7.7.)

(4.18)

T f (x) = f K (x, x - y)f (y) dy, lir,

where (4.19)

K (x, z) =ff a(x, 0e 2 " i z .

In other words, for fixed x, K(x,.) is the inverse Fourier transform of a(x, •). By Theorem 4.13, for each x there exists a constant a(x) and a function S2(x,•) E C °° (Sn -1 ) with zero average on Sn -1 such that K (x, z) = a(x)b(z) p. v.

Q(x,z')

zln

.

84

4. Singular Integrals (I)

This example is the motivation for studying singular integrals with variable kernel: (4.20)

T f (x) = I i f

52

)

f (x - y) dy.

IYI>c

We can again apply the method of rotations. Theorem 4.16. Let St(x, y) be a function which is homogeneous of degree

0 in y and such that: (1) 51(x, -y)

When S2 is even there is a result analogous to Theorem 4.17 whose proof resembles that of Theorem 4.12. It can be found in the first article by A. P. CalderOn and A. Zygmund cited in Section 7.1. Operators of the form (4.17) are a special class of pseudo-differential operators. For more information, see Chapter 5, Section 6.9.

7. Notes and further results 7.1. References.

-12(x, y);

(2) Sr (u) = sup IC1(x, u)1 E L 1 (S n-1 ). x

Then the operator T given by (4.20) is bounded on LP(IIV), 1 < p < 00.

Proof. Because St is odd, we can argue as in the proof of Corollary 4.8

(where the kernel does not depend on x) to get (4.21)

85

7. Notes and further results

52(x, u)Hu f (x) do-(a), f E S.

T f (x) = 7r f

2

Therefore, (u) I

IT f (x)1 < 1

2 s n-i

f (x)1 du (u)

The method of rotations was introduced in an article by A. P. CalderOn and A. Zygmund (On singular integrals, Amer. J. Math. 78 (1956), 289309). The examples in Section 1 come from an earlier article by them (On the existence of certain singular integrals, Acta Math. 88 (1952), 85-139) which will be discussed in Chapter 5. The proof of Theorem 4.12 for even kernels is taken from course notes by A. P. CalderOn (Singular integrals and their applications to hyperbolic differential equations, University of Buenos Aires, 1960). It can also be found in the books by Zygmund [22] and Neri [13]. For a general overview of singular integrals and their applications, see the survey article by A. P. CalderOn (Singular integrals, Bull. Amer. Math. Soc. 72 (1966), 427-465). 7.2. Spherical harmonics.

for some q, 1 < q < co, then T is bounded on LP(V), q' < p < 0o.

A solid harmonic is a homogeneous harmonic polynomial, and its restriction to 512-1 is called a spherical harmonic. The spherical harmonics of different degrees are orthogonal with respect to the inner product on L 2 (Sn -1 ), and among those of a fixed degree there exists an orthogonal subcollection. Thus L 2 (Sn -1 ) has an orthogonal basis composed of spherical harmonics. Note that if n = 2, the basis of spherical harmonics for L 2 (S 1 ) is just fei k° k E Z}.

Proof. Apply Holder's inequality with exponents q and q' to get

The functions Yk(x)e - "Ix 12 , where Yk is a solid harmonic of degree k, are eigenfunctions of the Fourier transform:

and the desired result follows at once from Proposition 4.6. Theorem 4.17. If in the statement of Theorem 4.16 we replace (2) by

(4.22)

(4.23)

sup (i x

u) l 9 da- (u)) = Bg < oo

f (x)I 5_ 7r Bq (f I Hu f (x)lq' do- (II)) lig'

2

If we raise this to the q'-th power and integrate with respect to x we get that

5 If condition (4.22) holds for some value of q, then it holds for any smaller valne, so we obtain the desired inequality for all p, q' < p < 00. ❑

(yk x ) e k (

yk () e — W . 7

2

(This is referred to as the Bochner-Hecke formula.) The utility of spherical harmonics for studying singular integrals is suggested by the following formula: for k > 1, ( p. Yk(z i )) (0 =k n "

(n

2 r (n-I-k)

yk W) .

86

4. Singular Integrals (I)

For a singular integral with II E L 2 (Sn -1 ) and even, one can give a diferent proof of Theorem 4.12 using the decomposition of It into spherical harmonics. (See Stein and Weiss [18, Chapter 6].) The properties and uses of spherical harmonics can be found in the books by Stein [15], Stein and Weiss [18] and Neri [13]. Another discussion of spherical harmonics, as part of the general theory of harmonic functions, can be found in the book by S. Axler, P. Bourdon and W. Ramey (Harmonic Function Theory, Springer-Verlag, New York, 1992). 7.3. Operator algebras. Besides the operator algebra studied in Section 5, Calder& and Zygmund considered others of the same type. (See Algebras of certain singular integral operators, Amer. J. Math. 78 (1956), 310-320.) In particular, let Aq be the set of operators Q(Y )

T f (x) = a f (x)

IYI n ,

E-4)

fly1>E

7. Notes and further results

87

It is conjectured that if 1 < p < Do and p and q' satisfy this condition, then inequality (4.24) holds. It is also conjectured that (4.24) holds for this range of p and q' if Hu is replaced by the associated maximal operator, or by the directional maximal function, M u . The following results are known: when n = 2 the conjecture for H u was proved by A. P. Calder& and A. Zygmund (On singular integrals with variable kernel, Appl. Anal. 7 (1978), 221-238); when n > 3 they proved it for 1 < p < 2. M. Cowling and G. Mauceri (Inequalities for some maximal functions, I, Trans. Amer. Math. Soc. 287 (1985), 431-455) showed that the same range holds for M u . M. Christ, J. Duoandikoetxea and J. L. Rubio de Francia (Maximal operators related to the Radon transform and the CalderOn-Zygmund method of rotations, Duke Math. J. 53 (1986), 189-209) showed that the conjecture is true for all these operators and for p in the range 1 < p < max (2,

f (x - y) dy,

2

where a E C and Si E Lq(S n-1 ). If we define 7.5. L log L results. As we noted at the end of Section 2, if C/ E L log L(Sn -1 ) is even then the Fourier transform of p. v. C2(4/1x1n is bounded. Further, Theorem 4.12 can be extended to the case when the even part of St is in L log L(Sn -1 ). (See the first article by Calder& and Zygmund cited in Section 7.1.)

IlTllq = la! + igiLq(s ,=-1), then there exists a constant Bq which depends only on q such that

Pi o T2 1 q, < BaTiligliT211q• The set A q is a semi-simple commutative Banach algebra.

For more information on algebras of singular integrals, see the article by A. P. Calder& (Algebras of singular integral operators, Singular Integrals,

Proc. Sympos. Pure Math. X, pp. 18-55, Amer. Math. Soc., Providence, 1967). 7.4. More on the method of rotations. In the proof of Theorem 4.17, we could deduce that II Tf 11 7, from inequality (4.23) provided that (4.24)

f (x)1 1 ' do- (u))

p/q'

) 1 IP < dx

C I f il p

11p.

In the proof of Theorem 4.17 we used that inequality (4.24) is immediate if p > q'. However, there are also values p < q' for which (4.24) is true, and this allows us to improve the theorem. If f is the characteristic function of the ball, then one can show that (4.24) cannot hold unless n n -1 + 1. p 4/

These are the best possible results. In the same paper, Calder& and Zygmund noted that if is any function such that 0(t)/t log t 0 as t oo, then there exists an even function SI such that 0(C2) E L 1 (Sn -1 ) but the Fourier transform of p. v. C2(4/Ixin is unbounded. Later, M. Weiss and A. Zygmund (An example in the theory of singular integrals, Studia Math. 26 (1965), 101-111) showed that given any such (/) there exists an even 52, 0(52) E L1(sn-1 ) and a continuous function f E LP (R n ) for every p > 1, such that for almost every x, lim sup E-40

C2(Y') f (x - y) dy

✓ I>E Wi n

=Do.

7.6. Weak (1,1) inequalities. Proposition 4.6 does not extend to weak (1,1) inequalities because the Lorentz space L l, " is not a normed space. (See Chapter 2, Section 8.3.) Hence, we cannot use the method of rotations to prove weak (1, 1) inequalities for singular integrals of the type (4.1) or even for the rough maximal function M0 defined in (4.5). However, the following results are known:

88

4. Singular Integrals

(I)

89

7. Notes and further results

If 52 E L log L(Sn -1 ) then M0 is weak (1, 1). For n = 2 this was proved If we consider the behavior of /a on LP, then homogeneity considerations by M. Christ ( Weak type (1,1) bounds for rough operators, Ann. of Math. show that the norm inequality 128 (1988), 19-42); for n > 2 this is due to M. Christ and J. L. Rubio de (4.25) Francia ( Weak type (1, 1) bounds for rough operators, II, Invent. Math. 93 clIfilp I IIafllq (1988), 225-237). It is unknown whether Mil is weak (1,1) if f2 E L l (Sn-1 ). If f2 E L log L(Sn -1 ) then the operator T in (4.1) is weak (1, 1). This was proved by A. Seeger (Singular integral operators with rough convolution kernels, J. Amer. Math. Soc. 9 (1996), 95-105). Partial results were obtained earlier by M. Christ and J. L. Rubio de Francia (in the above papers) and by S. Hoffman (Weak (1,1) boundedness of singular integrals

with nonsmooth kernels, Proc. Amer. Math. Soc. 103 (1988), 260-264). By the counter-examples given above this is the best possible result for

can hold only if

1 1 a q p n

(4.26)

In fact, this condition is also sufficient for 1 < p < n/a. Theorem 4.18. Let 0 < a < n, 1 < p < n/a, and define q by (4.26). Then - a)) inequality (4.25) holds. If p = 1 then I a satisfies the weak (1, n

T for arbitrary Q. However, it is unknown whether is weak (1, 1) when QE Li(sn-1) and is odd. P. SjOgren and F. Soria (Rough maximal operators and rough singular integral operators applied to integrable radial functions, Rev. Mat. Iberoamericana 13 (1997), 1-18) have shown that for arbitrary f2 E L l (Sn -1 ), T is weak (1, 1) when restricted to radial functions. 7.7. Fractional integrals and Sobolev spaces. If 0 E S then (-A0)(0 =

472 16 2 0W;

in Section 5 we defined the square root of the operator -A by (A0)(0 =

27r1e10()•

More generally we can define any fractional power of the Laplacian by (( - A) a/2 0)

= ( 27 1W a (b(0.

Comparing this to the Fourier transform of Ix1', 0 < a < n (see (4.3)), we are led to define the so-called fractional integral operator /a (also referred to as the Riesz potential) by

/a cb(x)

1 f 0 (Y) dy, Ya JIRn IX Yi n—a

where

ia r k2

Then with equality in the sense of distributions,

(Lo) (o = 1 -

-

1 a ck o. -

(II. p) n/ (n-a)

For a proof, see Stein [15, Chapter 5]. Another proof uses the following inequality due to L. Hedberg (On certain convolution inequalities, Proc. Amer. Math. Soc. 36 (1972), 505-510): /a f(x) callf1T,Pinmf(x)l-aPln,

(4.27)

1 Al; apply Proposition 2.7 to the first part and Holder's inequality to the second. Now choose A so that both terms are the same size. Theorem 4.18 then follows from (4.27) and from the boundedness of M. Closely related to the fractional integral operator is the fractional maximal function,

Ma f (x) = sup

„0 IBrI l —ain

fBrif(x_

dy, 0 < a < n.

Ma satisfies the same norm inequalities as /a . The weak (1, n/(n - a)) inequality can be proved using covering lemma arguments (see Chapter 2, Section 8.6), and Ma is strong (n/a, oo): by Wilder's inequality a/n 1

r(1) (n—a.) •

<

/a f ( x )I >

Ifx E

IBr i1 —a/n

L if(x_014 (f r

(x

winla dy)

IlflIn/a.

The strong (p, q) inequalities then follow by interpolation. For any positive f it is easy to see that Ma f(x) < C Ia f (x). The reverse inequality does not hold in general, but the two quantities are comparable in norm.

1

90 4.

Singular Integrals (I)

Chapter 5

Theorem 4.19. For 1 < p < oo and 0 < a < n, there exists a constant Ca,p such that

IIIaf IIP

1

Ca,piiMaf

This result is due to B. Muckenhoupt and R. Wheeden ( Weighted norm inequalities for fractional integrals, Trans. Amer. Math. Soc. 192 (1974), 261-274). Also see D. R. Adams, A note on Riesz potentials (Duke Math. J. 42 (1975), 765-778) and D. R. Adams and L. Hedberg (Function Spaces and Potential Theory, Springer-Verlag, Berlin, 1996). An important application of Theorem 4.18 is in proving the Sobolev embedding theorem. For 1 1 an integer, define the Sobolev space L Pk (R") to be the space of all functions f such that f and all its derivatives (defined as distributions) up to orddr k are in LP. More precisely, let Lk be the closure of C c" under the norm

Ilf114 =

IlDafIlp• lalGk

Theorem 4.20. Fix 1 1 an integer. Then (1) if p < n/k, then L Pk (R n ) C L r (R) for all r, p < r < q, where 1/q = 1/p — k/n; (2) if p = n/k, then L Pk (Rn) C L r (R n ) for all r, p < r < cx); (3) if p > n/k and f E L Pk(R n ), then f differs from a continuous function on a set of measure 0.

Singular Integrals (II)

1. The Calderon-Zygmund theorem In the previous chapter we used the Hilbert transform to study singular integrals. In this chapter we are going to consider singular integrals whose kernels have the same essential properties as the kernel of the Hilbert transform. This will let us generalize Theorem 3.2 to get the following result. Theorem 5.1 (Calderon-Zygmund). Let K be a tempered distribution in R' which coincides with a locally integrable function on RP \ {0} and is such that

This result is due to S. L. Sobolev (On a theorem in functional analysis, Mat. Sb. 46 (1938), 471-497; English translation in Amer. Math. Soc. Transl. ser. 2, 34 (1963), 39-68).

(5.1)

For all the results in this section, see Stein [15, Chapter 5], the book by Adams and Hedberg cited above, and the books by R. Adams (Sobolev Spaces, Academic Press, New York, 1975) and W. Ziemer (Weakly Differentiable Functions, Springer-Verlag, New York, 1989).

Then for 1 21v1(x — y) — K (x)1dx < B, l

111(*fllp and

I{x E R n * f(x)I > A}I

We will show that these inequalities are true for f E S, but they can be extended to arbitrary f E LP as we did for the Hilbert transform. Condition (5.2) is usually referred to as the Hiirmander condition; in practice it is often deduced from another, stronger, condition called the gradient condition. 91

92

5. Singular Integrals (II)

croposition 5.2. The Hi5rmander condition (5.2) holds if for every x 0

(5.3)

I 77 K(s) 1 5 lxin+

1

93

I. The Calderon-Zygmund theorem

E C l (S n-1 ) is sufficient by Proposition 5.2. But much weaker conditions are sufficient. For example, define

•

woc(t) = sup{I12(ui) - 52(u2)I lui - u21

t, ul,u2 E S n-1 }.

Then we have the following result. This follows from the mean value theorem; details are left to the reader. Proof of Theorem 5.1. Since this proof is (essentially) a repetition of the

proof of Theorem 3.2, we will omit the details. Let f E S and let Tf = K*f. From (5.1) it follows that IlTfIl ,12 < AII f 112• It will suffice to prove that T is weak (1, 1) since the strong (p, p) inequality, 1 2 it follows by duality since the adjoint operator T* has kernel K* (x) = K (-x) which also satisfies (5.1) and (5.2). To show that f is weak (1, 1), fix A > 0 and form the CalderOn-Zygmund decomposition of f at height A. Then, as in Theorem 3.2, we can write f = g b, where g E L 2 and is bounded by VA and b is the sum of functions which are supported on disjoint cubes Qi and have zero average. The argument now proceeds as before, and the proof reduces to showing that (5.4)

I Tbj (x)1 dx < C f lb3 (x)1dx,

Proposition 5.3. If /2 satisfies

L

(5.5)

1 woo

t

)

t

dt Goo,

then the kernel /2(4/1xin satisfies (5.2).

Condition (5.5) is referred to as a Dini-type condition. Proof. IK(x - y) - K(x)I =

SZ((x - y)')

C2(x')

Ix - yi n 142 I 12 ((x y) ' ) Q(xl)l

+ IQ(x)1 Ix

IX — yi n

1

1

Yl n Condition (5.5) implies that /2 is bounded, so on the set {lad > 21y1} the second term is bounded by Clyl/lxin+ 1 . Thus its integral on this set is finit e. On this set we also have that lyl

where Q* 3 is the cube with the same same center as Q 3 and whose sides are 2.\/71, times longer. Denote their common center by ci. Inequality (5.4) follows from the HOrmander condition (5.2): since each bi has zero integral, if x Q*3

Ks — y)' — x't < 4— SO

P((x - y)') - Q(41 dx < f fixl>21y1

Ix - YI n flxl>21y1

Tbi(x) = fQ K(x - y)bi (y) dy = i [K (x - y) - K(x - ci)]bi (y) dy;

=

2n15--11

(IxI / 2 ) n

f 2 CO ( t )

A

dt

< C.

hence, fRn

cocc(41Y1/14) dx

1Tb .(x)I dx <

\Q; 3

lb .(y)1 Q.;

3

IK (x - y) - K (x - ci)1 dx) dy.

For similar results, see Section 6.2. From Proposition 5.3 and Corollary 4.5 we immediately get the following corollary to Theorem 5.1.

\Q;

Since Rn Q:; C {x E IR n : Ix - ci l > 21y - cil},

the term in parentheses is bounded by B, the constant in (5.2).

❑

We now consider singular integrals with a homogeneous kernel of degree -n, such as we studied in the previous chapter: K(x) 52(x')/Ixr. What must we assume about /2 for the HOrmander condition (5.2) to hold? Clearly,

Corollary 5.4. If 12 is a function defined on Sn -1 with zero integral and satisfying (5.5), then the operator T f (x) =

p. v.

f IY Rn

is strong (p,p), 1 < p < co, and weak (1, 1).

f (x - y) dy

94

5. Singula Integrals (II)

Since (5.5) implies that C2 is bounded, we already had the strong (p, p) inequality via the method of rotations. Nevertheless, we now also have the weak (1, 1) inequality.

(If 11 -1 < c or if Z -1 > R, then it suffices to consider just one of the two integrals.) We treat /1 first: K(x) dx +

Il =

In Theorem 5.1 we assumed that the operator was bounded on L 2 (via the hypothesis that K E In this section we give conditions on K which imply this property and which make the associated operator bounded on LP, 1 21Y1

Then for all

1

1) dx,

(x)1dx C (A + B).

K(x) dx -

1

so that exp(2iriz • 0 = -1; then if we To evaluate /2, let z = make the change of variables x x - z in /2 we get

-

Hence,

J K (x) dx < A, 0 < a < b < oo; a
(5.7)

Llx1
SO

2. Truncated integrals and the principal value

(5.6)

95

2. Truncated integrals and the principal value

212

4

1 -1 <1x-zl
K(x - z)e -21r i x . dx.

: (x)e-271-2,X•e dx -

1K(x)Idx _< B, a > 0;

this in turn el-im i
11C(X — y) - K (x)1dx < C , y E Rn

21121

.k

K(x - z)e

ICI

dx;

lif(x) - K(x - z)1 dx

1ICE,R(01 C, where C is independent of e and R.

(x)1dx

Condition (5.7) is equivalent to

.1:

(5.9)

x1
-q.K L-K 1 - '
1x11K(x)1dx B i el. a > 0.

The first integral is bounded by C since 11 --1 = 21z1. The other two integrals are bounded by 2B: since 11 -1 < R, R+1M -1 < 3 (R -111-1). ❑

To see this, note that 00 fix1
IxIIK(x) dx < k=o f2

-

-I a
a

(x)1dx <

1K(x)Idx.

2 -k dB,

then Corollary 5.6. If K satisfies the hypotheses of Proposition 5.5, I

k=0

and

11-Kc,R * f flip

cplIfIlp,

1

<

and K (x)1dx 5 f 11K (x)I dx < 2B'. L< IIxl<2a lacl<2a a

Proof. If we fix then whenever e <11 -1 < R,

KE,R(0 = =

K(x)e-27r2X• dx

1{X E R n

* f(x)1 > A}1

A1 11.f111, —

with constants independent of e and R. When p = 2 this result follows immediately from Proposition 5.5; for the rest note that (5.8) implies the Hi5rmander condition,

Lx
K(x)e-27ix. dx K(x)e-7`ix. 2 dx + fc<111<11 -1 fl1-1
VCR

flxi>21y1

IK,

'

R(X — y)

-

K,,R(x)1dx < C,

with C independent of e and R. The details are left as an exercise for the reader.

96

5. Singular Integrals (II)

When the kernel is homogeneous, K(x) = C2(x')/1x1n, conditions (5.6) and (5.7) are equivalent to CZ E L 1 (S 12-1 ) and

97

2. Truncated integrals and the principal value

Corollary 5.8. If we add to the hypotheses of Proposition 5.5 the assump-

tion that

S2(u) dcr(u) = O.

f

lim

K(x) dx

c
Since JCR E L 1 , KE,R * f is well defined for f E LP. Ideally we could define the singular integral T by the pointwise limit

exists, then T f (x) = lim f K (y) f (x — y) dy IYI>f

T f (x) = lim KE,R * (x), Tr:0 o )

but this limit need not exist even if f E S. In this case we have convergence as R oo, so the problem reduces to determining when the principal value distribution of K, p. v. K(0) = lim fTI>c K (x)0(x) dx, 6-4) I

E

S,

can be extended to an operator which is bounded on LP, 1 < p < no, and is weak (1,1). Example 5.9 (An application to K(x) = Ixl — n —it ). The function

is locally integrable on Rn \•031 and satisfies the hypotheses of Proposition 5.5. In fact,

exists.

dx

Proposition 5.7. Given a function K which satisfies condition (5.7), the

tempered distribution p. v. K exists if and only if lim

fE
dx

n-1 1 log 2, Llxl<2a IxI n — IS In + it1 IVK(x)I < IxI n+1 •

K (x) dx

exists. Proof. Suppose that the tempered distribution exists. If we fix v5 E S which

is identically 1 on B(0,1), then p. v. K(0) = lim

fE
LI

K(x) dx + f K (x)0(x) dx.

Hence, 11KE,R * f Ilp < CPI f IP with CC independent of c and R. However, the limit of the truncated integrals does not exist almost everywhere since

L

CO

>1

K (x)0(x) dx < 111x1011

2— k=0

.12k
Therefore, the limit of the first integral must also exist. Conversely, suppose the limit exists; denote it by L. Then p. v. K(0) = cb(0)L + f

K (x)(0(x) — (0)) dx + f

ixiiK(x)Idx < 2 BIT 01100.

dx

= sn-11 1 — E -it

—it '

i s i < 1

0 does not exist. The limit does exist if we and the limit of E -it as e choose an appropriate subsequence Ickl, for example, ck e-27k/t. If we take this sequence then lim K„,R * f (x) =

K (x)0(x) dx.

Again, the second integrixlal
< - 1S n-1 1, Sn-1° a—" t —it

✓ a
f

J

f — Y) — f (x) dy + f

f — Y) dy,

IY> 1

and this defines an operator which is strong (p,p), 1 < p < co, and weak (1,1). This operator is the convolution of f with the tempered distribution which we will refer to as p. v. 1xl — n —it , even though it depends on our choice of the sequence {Ek}: P. v.

0(x) — 0(0)

1 i xi<1

ixin+it

dx + (

x xl>i 19:1nL

dx.

98

5. Singular Integrals

(II) 3. Generalized CalderOn-Zygmund operators

99

Despite its appearance this distribution is not homogeneous of degree — it (see Definition 4.2); that is, it does not satisfy 1 —n—it 13 - v. 1 (0), 1)- v. it (OA) = A

the remaining LP spaces only depends on the HOrmander condition, which can be adapted to operators which are not convolution operators. Let A be the diagonal of Rn x A = {(x, x) : x E Rfl}. By a proof nearly identical to that of Theorem 5.1 we can show the following.

where ¢),(x) = A — ThO(A — lx). This can be shown by a straightforward computation, but here we are going to give an indirect argument which also yields its Fourier transform. Given z such that Re z < n, the function ixi' is locally integrable and homogeneous of degree —z. It defines a tempered distribution which is also homogeneous of degree —z:

Theorem 5.10. Let T be a bounded operator on L 2 (R n ), and let K be a function on Rn x Rn \ A such that if f E L (111Th) has compact support then

lx1"±

lx1n±it

firi<1 (5 1:1! (°)

°(x)

dx + ¢(0)

fi x i<1

+ fix>i (15x1.9

dx

—i (b(°) dx + fixi>1 Ix z dx + 1Sn 1z 1 co) ;

°( )

this expression makes sense for Re z < n+1 except if z = n. This distribution is homogeneous of degree —z, and if we let z = n + it, we see that it differs from the one we call p. by a multiple of the Dirac delta. (This is homogeneous of degree —n, which implies that p. cannot be homogeneous.) This new distribution is thus the unique distribution that is homogeneous of degree —n — it and which coincides away from the origin with the locally integrable function ix tn — i t . Given the Fourier transform of 'xi' (see (4.3)), (0 = r ( n 2 z ) 1 1 (1x iz F (I) if we let z = n + it we get the Fourier transform of the homogeneous distribution, which in turn gives us the Fourier transform of p. v.

(P. v.

ixi n+it) (0 =

_it F(( r (k72 +2 4 + it k2

3. Generalized Caldertin-Zygmund operators

Up to this point the operators we have been studying could be written as convolutions with tempered distributions. One practical advantage of this hypothesis is that we could use the Fourier transform to deduce the L 2 boundedness of the operator. If we assume this then the boundedness on

x ■Z supp(f)•

Further, suppose that K also satisfies

(5.11)

We can rewrite this as

= fi<1 xi

T f (x) = I K (x , y) f (y) dy,

(5.10)

l Ixl z(o) = f ix( ) dx.

(Cb)

2

1K (x, y) — K (x, z)1 dx < C, flx—yl>21u—zI flx-yl>21x-wi

IK(x,y)— K(w,y)Idy < C.

Then T is weak (1,1) and strong (p,p), 1 < p < co.

The given representation holds only for functions in L 2 of compact support, but this suffices for the proof since it is only applied to the functions b3 gotten from the CalderOn-Zygmund decomposition. condition (5.10) is used to prove the weak (1, 1) inequality for T, and condition (5.11) is used to prove the same inequality for its adjoint. Following the terminology of Coifman and Meyer [2], we say that K : R x \ A —0C is a standard kernel if there exists S > 0 such that IK(x,y)I <

(5.12) (5.13)

(x, y) — K(x, z)I < C

lx

C

Z1 6

— vI n+' IX — WI 6

if ix —y > 2 1Y —

if x — > 2jx — wi. IX Yl n+6 Standard kernels clearly satisfy the HOrmander conditions (5.10) and (5.11). The following are examples of operators which have this type of kernel. (1) The Cauchy integral along a Lipschitz curve. Let A be a Lipschitz function on R (i.e. A' = a E L") and let F = (t, A(t)) be a plane curve. With this parameterization we can regard any function f defined on F as a function of t and conversely. Given f E S(R), the Cauchy integral (5.14)

1 K (x, — K(W,

Cr f (z) =

y)
1

f"

(t)(1+ ia(t))

2xi t + iA(t) — z defines an analytic function in the open set

dt

52 + ={z= x +iy EC: y>A(x)}.

100

5. Singular Integrals (II)

Its boundary values on I',

-

-4)

are given by [1(x) +

7r

T f (x) = lim

E-'0 fl x _ y l>,

f(Y) dy , y + i(A(x) - A(y))

x

whose kernel, K(x, y) =-

1 x - y + i(A(x) - A(y))'

satisfies (5.13) and (5.14) with 6 = 1. (2) CalderOn commutators. If 11,, then we expand the kernel (5.15) as a geometric series: K(x, y) =

1

co

x k=0

(A(x) - A(y)) k —y

It is, therefore, natural to consider the following operators: given a Lipschitz function A on R and an integer k > 0, define Tk

The examples in the previous section suggest that for any CalderOn-Zygmund operator,

f(t)(1 ia(t)) x - t + i(A(x) - A(t)) dt •

lim

This leads us to consider the operator

(5.15)

101

4. Calderon-Zygmund singular integrals

4. CalderOn Zygmund singular integrals

lim Cr f (x + i(A(x) + €)),

2

f (x) = lim f ( E-4.1 ir_yi>,

A(x) - A(y)) f (y) dy. x - y j x - y

The associated kernels, A(x) - A(y)) k 1 K k(x , y) = ( x - y x - y'

are also standard kernels with 6 = 1. Definition 5.11. An operator T is a (generalized) Calderon-Zygmund op-

erator if

(1) T is bounded on L 2 (Rn); (2) there exists a standard kernel K such that for f E L 2 with compact support, T f (x) = f K (x , y)f (y) dy, x supp(f).

Theorem 5.10 implies that a Caldercin-Zygmund operator is bounded on LP, 1 < p Goo, and is weak (1, 1). Hence, given an operator with a standard kernel, the problem reduces to showing that it is bounded on L 2 . We will return to this question in Chapter 9.

T f (x) = urn f

K(x, y)f (y) dy,

at least for f E S. However, this is not necessarily the case. From (5.12) we can deduce that T, f (x) = f K(x, y)f (y) dy Ix-y1>€

makes sense for f E S(Rn). Nevertheless, the limit as E 0 need not exist or may exist and be different from T f (x). An example of the first is the whose operator we defined in Section 2 as convolution with p. v. kernel, K (x, y) = Ix - , is standard. (Recall that this operator is strong (2, 2) and so a CalderOn-Zygmund operator.) By an argument identical to that in Proposition 5.7 we can prove the following result. Proposition 5.12. The limit

lim TE f (x) E-,c) exists almost everywhere for f E CI7 if and only if (5.16)

lim

f

K(x, y) dy

E-'13
exists almost everywhere.

The existence of the limit (5.16) does not imply that it is equal to T f (x): for example, consider the identity operator I. This is a Calderon-Zygmund operator associated with the kernel K (x,y) = 0—clearly, /f(x) = 0 if x supp( f)—but (5.16) equals 0. Furthermore, this example shows us that an operator is not characterized by its kernel since the zero operator also has the zero kernel. In general, so does any pointwise multiplication operator T f (x) = a(x) f (x), a E L°°. However, these are the only ones; this is shown by the following result whose proof is left to the reader. Proposition 5.13. If two CalderOn-Zygmund operators are associated with the same kernel, then their difference is a pointwise multiplication operator.

It is important to assume that a Calderon-Zygmund operator is bounded on L 2 since without this or a similar hypothesis, Proposition 5.13 would be

f

-

102

5. Singular Integrals (II)

false. For example, the derivative is an operator with kernel 0 (f'(x) = 0 if x 1Z supp(f)) but it is not a pointwise multiplication operator. A CalderOn-Zygmund singular integral is a CalderOn-Zygmund operator that satisfies (5.17)

T f (x) =liT,f (x).

To determine when this equality holds for f E LP(W'), we will proceed as we did for the Hilbert transform and examine the maximal operator T* f (x) = sup IT, f (x)1. e>0

By Theorem 2.2, if T* is weak (p,p) then the set { f E LP : li

TE / (x) exists a.e.}

is closed in LP. Hence, if (5.17) holds for a dense subset, say f it holds for any f E LP.

E

then

4. CalderOn-Zygmund singular integrals

Proof of Lemma 5.15. We will prove that f (0) < C (M (IT f ) ( 0 ) 1 / + M (0))

with C independent of c. Our argument will be translation invariant and so actually yields (5.18). Fix c > 0; let Q = B(0, c/2) and 2Q = B(0, c), and define fi = f X2Q and f2 = f — h. Then T f2(0) =K(0, Y) f (Y) dY = f (0). 1v1>€ If z E Q then, since K is a standard kernel,

ITf2 z) — 712(0)1

(K (z, y) - K(0, y)) f (y) dry

(

(p, p), 1 E IYI n+6

(Y)I

< C k=0 f2ke
Lemma 5.15. If T is a Calderdn-Zygmund operator then for any v, 0 < 5 1, and for any f E cgo ,

(5.18)

103

The desired inequality follows from this immediately.

Theorem 5.14. If T is a Calde-On-Zygmund operator then T* is strong

11

< C k=0

T* f (x) C, (M (IT il v )(x) 11v + M f (x)) .

lyln+b

2 -k5 1 f If(Y)14 ( 2k O n 1Y1< 2 k+IE

< C6M f(0).

It follows from this inequality that

To prove this we need the following result due to Kolmogorov. Lemma 5.16. Given an operator S which is weak (1, 1), v, 0 < v < 1, and

(5.19)

a set E of finite measure, there exists a constant C depending only on v such that

If 7', f(0) = 0 then there is nothing to prove. If not, fix A such that 0 < A < 1TE f (0)1 and let

fE 1S f (x)l v dx

v ilf

-

A' min (1E1, 0 IcIlf111/1E1 v Av-11E1dA + v

(z) I.

and

(x)l v dx = v f A v-1 1{X E E : IS f (x)1 >

vf

C m f ( 0 ) + IT .f (z)I +

Qi = {z E Q : IT f (z)I > A/3}, E Q IT h(z)I > A/3}, Q2 =

Proof. By (2.1) and the weak (1,1) inequality,

<

ITE.f ( 0 )1

dA Q3 =

dA

f.

if CM f (0) < A/3, Q if CMf (0) > A/3.

{0

Then Q = QiU Q2 U Q3, so IQI 1(211 + IQ21+1Q31. However, CAv-211fIlidA.

1Q11

-A-

LITf (z)I dz

3

1(21A1 (7' f)(0),

104

5. Singular Integrals (II)

by the proof of Theorem 2.10 it suffices to take the integral over E instead of all of Rn. Therefore, by Lemma 5.16

and by the weak (1, 1) inequality for T, 7 1(221 I{z E Q IT fi (z)l > A/311 5_3C

Llf (z)I dz < — AC 1(21m f ( 0 ).

3C

-

Let B be a separable Banach space. A function F from IV to B is (strongly) measurable if for each b' E B* (the dual of B) the map x (F(x), b') is measurable. If F is measurable then it follows that the scalar function x H IIF(x)IIB is also measurable. Therefore, we can define LP(B) as the space of (equivalence classes of) measurable functions from 1[8n to B such that

A < C (M(Tf)(0) + M f (0)) . This is true for any A < IT, f (0)1 and so (5.18) holds when v = 1. If 0 < v < 1, then it follows from (5.19) that ITE.i( 0 )i v CM.f( 0 ) v + IT.f(z)r) + If we integrate in z over Q, divide by IQI and raise to the power 1/v, we get 1/v 1 f IT fi(z)j v dz) IQI

.

By Lemma 5.16,

,c1(21-111filli

CM f (0),

and this completes the proof.

❑

Proof of Theorem 5.14. Inequality (5.18) with v = 1 immediately implies that T* is strong (p, p) since both T and M are. To show that T* is weak (1,1) we could argue as we did in the proof of Theorem 3.4; instead we will give a different proof which uses (5.18) with v < 1. From this inequality we have that

IIFIILP(B) = (

+ Ifx E R n : M(IT fl y )(x) 11 ' > A/2C}I,

and the first term on the right-hand side satisfies the desired estimate. As for the second term, by Lemma 2.12 2nIfx E

: Md(IT fr)(x) > 4 -n A v 11,

where Md is the dyadic maximal operator (2.9). Let E = {x E 118n : Md(ITf r)(x) > Avl; then E has finite measure if f E C'°. But then iEl 5- — 1 f IT f (Or dY; /x v E

I

1/p

IlF(X)11PBdx)

1 < p < oo,

The space LP(B), 1 < p < CO, is a and I f II„„ = sup{IIF(x)IIB : x E Banach space. If f is a scalar function in Li' and b E B, define the function f • b from Rn to B by (f • b)(x) = f (x)b. This function is in LP(B) and its norm is The subspace of LP(B) consisting of finite linear combinations of functions of this type, denoted by LP B, is dense if 1 p < oo. B, define its integral to be the element of Given F = 3 f3 • 1) 3 E B given by

.0

E

IR n

F(x) dx =

E I

fi(x) dx) bi.

f F(x) dx extends to L l (B) by continuity. For a function F E L l (B), this integral is the unique element of B that satisfies

The map F

(fie, F(x) dx, b') =

I{x E Rn : T* f (x) > All <1{x E lltri : M f (x) > A/ 2C}1

: M(ITf l u )(x) 1 /' > All

❑

5. A vector valued extension

Hence, in every case we have that

Ifx E

v lif

The desired inequality now follows if we replace A' by 4 - n Av.

IQRM(Ti)(0) + Mf( 0 ))•

f (0)Iv
C

jEj

If Q3 = Q then A < 3CM f (0); if Q3 = 0 then + 1Q21

105

5. A vector-valued extension

(F(x), bi ) dx

for all b' E B* .

If F E LP(B) and G E LP' (B*), then (F,G)(x) = (F(x),G(x)) is inte-

grable; furthermore,

= sup

f

..

(F(x),G(x)) dx

IIFIlLy(s) 5 1 } •

From this we see that LP'(B*) C (LP(B))*. Equality is not true in general, but is, for example, if 1 < p < oo and B is reflexive. Let A and B be Banach spaces and let ,C(A, B) be the space of bounded linear operators from A to B. Suppose that K is a function defined on

106

5. Singular Integrals (II)

Rn x \ A which takes values in £(A, B) and T is an operator which has K as its associated kernel: if f E L'(A) and has compact support, then

6. Notes and further results

107

We leave it to the reader to fill in the details of the outlined proof of Theorem 5.17 by following the scalar model.

T f (x) = f K(x, y) • f (y) dy, x supp(f).

For such operators there is a vectorial analogue of Theorem 5.10. Theorem 5.17. Let T be a bounded operator from LT (A) to LT (B) for some r, 1 < r < co, with associated kernel K. If K satisfies

(5.20) (5.21)

f f

jx-yl>21y-z1

ix-yl>21x-wl

IIK(x y) K(x,z)lIc (A,B)d x C, 11-1((x

— K , OHL(A,B) dy < C

then T is bounded from LP(A) to LP(B), 1 

The proof of Theorem 5.17 initially follows the outline of the scalar result: inequality (5.20) together with the boundedness from L'(A) to LT (B) yields the weak (1, 1) inequality; then the Marcinkiewicz interpolation theorem (which can easily be shown to be true in these spaces) shows that T is bounded for 1 < p < r. To prove that T is bounded for r < p < oo we must pass to the adjoint operator. However, this presents a problem since, as we noted above, LP' (A*) need not be equal to LP(A)*. When A is reflexive (and it will always be so in our applications in Chapter 8) they are equal, and in that case it is enough to note that the kernel associated with the adjoint operator T* is K(x, y) = K* (y, x) E L(B* , A*). Inequality (5.21) for K is equivalent to (5.20) for K, so by repeating the above argument we get that T* is bounded for 1 < p < r'. When LP'(A*) LP(A)* we must first consider the finite dimensional subspaces of A. Given such a subspace Ao, let To : LT (Ao) 12(B) be the restriction of T to functions with values in Ao. The kernel associated with To is Ko E L(A0, B), the restriction of K to A0. Since 111(011 < inequalities (5.20) and (5.21) hold for K 0 with constants independent of the subspace A 0 . Therefore, arguing as before, To : L 9' (B*) —>

(A'6), 1 < q < r',

with a constant independent of A0. So by duality, To is bounded from LP(Ao) to LP(B), r < p < oo, with a constant independent of A0. That it is bounded on all of LP(A) now follows since LP ® A is dense in LP(A).

6. Notes and further results 6.1. References. The Calderon-Zygmund decomposition and its application to proving the weak (1,1) inequalities for singular integrals appeared in the classic article by both authors (On the existence of certain singular integrals, Acta Math. 88 (1952), 85-139). In this article they used the gradient condition (5.3); L. HOrmander first observed that (5.2) is sufficient (Estimates for translation invariant operators in LP-spaces, Acta Math. 104 (1960), 93-139). Also see Stein [15]. Proposition 5.5 is due to A. Benedek, A. P. CalderOn and R. Panzone (Convolution operators with Banach-space valued functions, Proc. Nat. Acad. Sci. U.S.A. 48 (1962), 356-365). The tempered distribution p. and its generalizations were first considered by B. Muckenhoupt (On certain singular integrals, Pacific J. Math. 10 (1960), 239-261). For more on generalized CalderOn-Zygmund operators, consult Coifman and Meyer [2], Journe [8] and Stein [17]. The Cauchy integral has a long history: see, for example, the book by N. T. Mushkelishvili (Singular Integral Equations, P. Noordhoff, Groningen, 1953). The associated commutators were first studied by A. P. CalderOn (Commutators of singular integral operators, Proc. Nat. Acad. Sci. U.S.A. 53 (1965), 1092-1099). Commutators are discussed again in Chapter 9. For more on vector-valued singular integrals see Garcia-Cuerva and Rubio de Francia [6, Chapter 5], the article by J. L. Rubio de Francia, F. J. Ruiz and J. L. Torrea (Calderon-Zygmund theory for operator-valued kernels, Adv. in Math. 62 (1986), 7-48) or the monograph by J. L. Torrea (Integrales Singulares Vectoriales, Notas de Algebra y Analisis 12, Univ. Nacional del Sur, Bahia Blanca, 1984). The survey article by C. Fefferman (Recent progress in classical Fourier analysis, Proceedings of the I.C.M. (Vancouver, B.C., 1974), Vol. 1, pp. 95118, Canad. Math. Congress, Montreal, 1975) contains a succinct discussion of singular integrals and their connection to many other problems in analysis.

6.2. A more general Dini-type condition. The HOrmander condition (5.2) holds for kernels of the form K(x) = Q(x')11x1n under weaker hypotheses than those in Proposition 5.3. Given p E 0(n), let

IIPII = suPflu — pul : u E Sn-11.

108

5. Singular Integrals (II)

If we define (t) = sup f IIPII
(pu) - Q(u) I da- (u),

then the Dini-type condition

f

o 1 4'1 (t) dt < co t

implies that (5.2) holds for K. This condition also implies that Q E L lo g + L. This result is due to A. P. CalderOn, M. Weiss and A. Zygmund (On the existence of singular integrals, Singular Integrals, Proc. Sympos. Pure Math. X, pp. 56-73, Amer. Math. Soc., Providence, 1967). Later, Calder& and Zygmund proved that this condition is also necessary for the HOrmander condition to hold (A note on singular integrals, Studia Math. 65 (1979), 77-87). 6.3. Nonisotropic dilations. The Euclidean norm I • I on R is associated with a family of dilations 6t(x) = (tx1,... ,txn) in the sense that IS t (x)I = tlxl, t > 0, x E Given al, , a n, > 0, we can define a family of dilations that act on each variable differently: 6t(x) =(t al xi,

11 5 t(x)ii =

tiIxii

and if z E Sn -1 then IlzII = 1. Define • as follows: given x E there exists a unique x' E Sn -1 such that Ot (x') = x for some t > 0. Let Ilx11 = t. This is not an actual norm since the triangle inequality is only satisfied up to a constant L > 1: Iix + Yii 5-

27 (1966), 73-86) and M. de Guzman (Singular integral operators with generalized homogeneity, Rev. Acad. Ciencias Madrid 64 (1970), 77-137). In Chapter 8, Section 7, we will use the method of rotations to study the kernels associated with this structure. 6.4. Spaces of homogeneous type. Singular integrals can be studied in very general spaces. R. Coifman and G. Weiss considered this question in their book Analyse harmonique nonconmutative sur certains espaces homogenes (Lecture Notes in Math. 242, Springer-Verlag, Berlin, 1971). For a discussion in a slightly less general setting, see Stein [17, Chapter 11. Let X be a space with pseudo-metric p, that is, a function p : X x X —> [0, co) such that (1) p(x, y) = 0 if and only if x = y; = P(Y,x); (2) P(x, (3) there exists L > 0 such that p(x, z) < L(p(x, y) + p(y, , z)). A space of homogeneous type is a topological space X with a pseudo-metric p such that the balls B(x, r) form a basis of open neighborhoods of X, and there exists a positive integer N such that for any x E X and r > 0, the ball B(x, r) contains at most N points xi such that p(x x 3 ) > r/2. The second property holds if there exists a Borel measure p, on X such that

,t a "xn).

Then for each S t there exists a quasi-norm '11 such that

+

Given Re with this quasi-norm we can get results analogous to Theorem 5.10 by considering conditions like

f

211Y-z1IIK(x, y) - K(x, z)I dx < C llx-v1^ and the symmetric condition gotten by changing the order of the variables. This kind of singular integral appears primarily in connection with the parabolic operators introduced by E. Fabes and N. Riviere (Singular integrals with mixed homogeneity, Studia Math. 27 (1966), 19-38). Also see C. Sadosky (On some properties of a class of singular integrals, Studia Math.

109

6. Notes and further results

0 < 1.1(B (x , r)) < A p(B (x , r /2)) < oo. (Such a measure kt, is called a doubling measure. We first discussed doubling measures in Chapter 2, Section 8.6.) If X is a space of homogeneous type, then given a kernel K(x, y) E L 2 (X x X,dp,® diz) we can define the operator T f (x) =

K (x , y) f (y)

If IITflI q 1, and if for any y, z E X, (5.22)

(x ,, y) ^ 2Lp(yz) 11((x' y)

- K (x , z)I dp,(x) < C,

then T is strong (p,p), 1 < p < q, and weak (1, 1). Very recently, in connection with the study of the Cauchy integral, the question has arisen of extending the theory of singular integrals to nonhomogeneous spaces: topological spaces X with a pseudo-metric p and a measure p which is not doubling. F. Nazarov, S. Treil and A. Volberg (Weak type estimates and Cotlar inequalities for CalderOn-Zygmund operators on nonhomogeneous spaces, Int. Math. Res. Let. 9 (1998), 463-487) have shown

5. Singular Integrals (II)

110

that the above result remains true if: p is a metric; there exists a constant v > 0 such that for all x E X and r > 0, (B (x, r)) < ru; and we replace (5.22) with the assumption that there exists b > 0 such that

(x, y) - K (x, z)1 < c

if p(x, y)

2p(y, z).

P(x Y) u±s

6.5. The size of constants. In the proof of Theorem 5.1, if we form the Calderon-Zygmund decomposition at height A -1 A, then the weak (1,1) constant becomes Cr,(A + B), and by interpolation the strong (p, p) constant is dominated by

1 < p < co. ,

In general, this is the best possible constant asymptotically as p -+ 1 or

p oo. To see this, suppose the kernel K satisfies the gradient condition with constant C (5.3) and also satisfies {x

> Ixln

,

Ixi > 0.

Let B = B(0,1) and let B* = B(0, R), where R is such that C/R < c/2. Fix f =1-131-1/PXB; then II f Ilp = 1. Further, for x B*,

-

I x In

-

cIB1 1-1 /P 2141

The vector-valued extension of the theory of singular integral operators in Section 5 has wide application, since many of the operators in harmonic analysis can be viewed as vector-valued singular integrals. This approach is due to Rubio de Francia, Ruiz and Torrea (see the reference given in Section 6.1). Here we will show that the Hardy-Littlewood maximal function can be treated as a vector-valued singular integral. We first extend Theorem 5.17 to include the case when T is a bounded operator from L°°(A) to L°°(B). The proof is essentially the same. Fix f, form the Calderon-Zygmund decomposition of< f as g +b, 2nd where IIg(x)IA f 11A at height A, and decompose a.e. Then, since T is bounded from L°°(A) to L"(B), for some constant /13 > 0,

E R n : IIT f (x)IIB > OA} C {x E lir : IITb(x)IIB >

Now fix a non-negative function 0 E S such that 0 > 1 on B(0, 1). Define the maximal operator

Mmf (x) =suplc5t* f(x)I, t>0

01311-1-1/n-1/p

Ix I n +1

If we apply the It follows from this that for p small, IlTf 11 7, = 0((p - same argument to the adjoint operator, then by duality we get that for p large, IlTf IIp = 0 (p). This proof can easily be adapted to the case when K is the kernel of one of the Riesz transforms, and a similar argument holds for Calderon-Zygmund operators. (See Stein [17, p. 42].)

6.6. More on truncated integrals. Given a CalderOn-Zygmund singular integral T, that is, a CalderOnZygmund operator that satisfies (5.17), it is natural to ask when 7', f converges to T f in LP. If 1 < p < oo, this is immediate: if f E LP then by Theorem 5.14, T* f E LP, so convergence follows from the dominated convergence theorem. This argument fails when p = 1, but nevertheless it can be shown that if f,Tf E L l , then Tc f ->Tf in L l . This result is due to

.

The proof then continues as before.

f IK (x - y) - K (x)I dy

f (x)I >1131 1- VP > &RI-1/p

A. Calder& and 0. Capri (On the convergence in Ll of singular integrals, Studia Math. 78 (1984), 321-327).

6.7. Vector-valued singular integrals and maximal functions.

P(Y z) s

C„' (A + B)p 2 p -1

111

6. Notes and further results

where 0 t (x) = t - n0(x/t). Since 0 E S, it suffices to take the supremum over rational t. By our choice of 0, M f (x) Mcb(If1)(x)• (The reverse inequality was proved in Proposition 2.7.) Thus, to prove norm inequalities for M it will suffice to prove them for M. Let A = 111 and B = f"(Q ± ), and define the vector-valued function K (x) = {O t (x)} tE Q + . Then K takes values in r(A, B) and we can define an operator T to be convolution with K. Since

11 11 (x)11B = m f (x), it follows that T : L°°(A)

suP I t>o

L°°(B) is bounded. Further, since 0

E 5,

IYI

- y) - Ot(x)I < Ix n+1 , Ixl > 2IyI > 0,

which implies the HOrmander condition (5.20) for K. Hence, by Theorem 5.17 (as extended above), T : LP(A) -> LP(B); equivalently, Mm is bounded on LP. The boundedness of Mm was originally proved by F. Zo (A note on approximation of the identity, Studia Math. 55 (1976), 111-122).

112

5. Singular Integrals (II)

o . 8 . Strongly singular integrals. In Theorem 5.1, given an arbitrary tempered distribution with bounded Fourier transform, the HOrmander condition (5.2) seems to be the weakest hypothesis possible to get LP boundedness. It is of interest, however, to see how this hypothesis can be weakened if stronger conditions are assumed on the Fourier transform. The prototypical operator motivating this is the convolution operator Ta ,b = Ka,b * f , 0 < a < 1 and b > 0, where ka,b(0 = 0()

II b

141+6 ' where S = (na/2- b)/(1- a). Hence, this kernel does not satisfy the gradient condition (5.3). (It also fails to satisfy the HOrmander condition; this fact follows from the next result.) However, these two growth conditions are sufficient to determine the behavior of Ta, b on LP. Theorem 5.18. The operator Ta b is bounded on LP if

p <

(b I n) (n/2 + .5) b + 5

,

-

A

ik(01

(1 + 10—n0/2'

Rn,

f

1

Then the operator Tf = K * f is bounded on LP, 1 < p < oo, and satisfies a weak (1, 1) inequality.

This result was later generalized by P. SjOlin (LP estimates for strongly singular convolution operators in Rn, Ark. Mat. 14 (1976), 59-64). 6.9. Pseudo-differential operators.

IK„,b(x)I

1

Theorem 5.19. Let K be a tempered distribution on R n with compact support which coincides with a locally integrable function away from the origin. Suppose there exists 0, 0 < 0 < 1, such that

lxl>21Y1'9K (x - y) - K (x)I dx B, 1Y

ika,b(01 < - (1 + and it can be shown that for x close to the origin,

1 2

further illustrates the delicate relationship between the conditions on K and K.

'

and 0 is a C°° function which vanishes near the origin and is identically equal to one outside of some compact set. This integral arises in the study of the LP convergence of multiple Fourier series (cf. Chapter 1, Section 10.2). Clearly,

(5.23)

113

6. Notes and further results

na/2 - b 1-a'

and is unbounded if the reverse inequality holds. If equality holds in (5.23), then Ta, b maps LP into the Lorentz space LP , P'

The LP boundedness of To was first shown by I. Hirschmann (On multiplier transformations, Duke Math. J. 25 (1959), 221-242) when n = 1 and by S. Wainger (Special trigonometric series in k dimensions, Mem. Amer. Math. Soc. 59 (1965)) in higher dimensions. Also see the survey article by E. M. Stein (Singular integrals, harmonic functions and differentiability properties of functions of several variables, Singular Integrals, Proc. Sympos.

Pure Math. X, pp. 316-335, Amer. Math. Soc., Providence, 1967). The endpoint inequalities were proved by C. Fefferman (Inequalities for strongly singular convolution operators, Acta Math. 124 (1970), 9-36). He showed that they are consequences of the following general result, which

Pseudo-differential operators are generalizations of differential operators and singular integrals. They are formally defined as in equation (4.17), that is, T f (x) = f a(x, ).1()e 27rix .

where a, the symbol of T, is a complex-valued function defined on W' x Conditions must be imposed on a to ensure that T f is well defined for f E Cc"(111n). If a(x, = m(0 is independent of x, then T is the multiplier associated with m; if a(x, = b(x) is independent of T corresponds to pointwise multiplication by b. Although in these cases T is bounded on L 2 if and only if m or b are bounded functions, this is not true in general. For example, = b(x)m(6) with m,b E L 2 can be unbounded, but by applying Holder's inequality it is immediate that T is always bounded on L 2 . Symbols are classified according to their size and the size of their derivatives. The simplest classification is motivated by the study of elliptic differential operators: given m E Z, we say that a E S m if C. ,0(1+

ILY1140 (x, -

Pseudo-differential operators can be rewritten as (5.24)

T f (x) = f

n

K (x, x - y) f (y) dy,,

where K is given by (4.19). When a E S ° then one can show that K satisfies a HOrmander condition (cf. (5.10) and (5.11)) and that T is bounded on L2.

114

5. Singular Integrals (II)

Chapter 6

Hence the theory developed in this chapter applies and T is bounded on LP, < p < oo. The composition of pseudo-differential operators yields a corresponding symbolic calculus. In the simplest cases (pointwise multiplication and multipliers) the symbol of the composition of two operators is the product of the symbols. (Cf. Theorem 4.15.) In general, given pseudo-differential operators T1 and T2 with symbols in Sml and S rn2 their composition is a pseudo-differential operator with symbol in S M1+"12 . Further, the symbol of T1 o T2 has an asymptotic expansion whose dominant term is the product of the symbols of T1 and T2 and the remaining terms are symbols of pseudo-differential operators in Smi±m 2- i, j > 0. If v E S tm is an elliptic operator (that is, it has a lower bound Icr(x,e)I > C(1 + p m ) , then there exists a symbol in S - rn such that the composition of the corresponding operators is the identity (which is in S°) plus an error term with symbol in S. Thus, the theory of pseudo-differential operators with symbols in Sm is readily applicable to the problem of the inversion of elliptic operators. When considering the corresponding problem for other differential operators, a more general class of symbols arises. We say that a E Sprn5 if

IDIDZu(x,

< A c0(1 ii)n-Plc4+61/3 1

.

This class includes the previous one: with this notation Sr' becomes Sino . If T is a pseudo-differential operator with symbol in S76., then it can again be written in the form (5.24). One can show that the kernel K satisfies the standard estimates (5.12), (5.13) and (5.14) if and only if m = 0 and p = 6 < 1. A remarkable result due to A. P. CalderOn and R. Vaillancourt (A class of bounded pseudo-differential operators, Proc. Nat. Acad. Sci. U.S.A. 69 (1972), 1185-1187) gives us that a pseudo-differential operator with symbol in SP P for some 0 < p <1 is bounded on L 2 , and so we can again apply the techniques of this chapter to show that it is bounded on LP. The proof of Calder& and Vaillancourt uses Cotlar's lemma (see Chapter 9, Section 1). Their result is false when p = 1: there exist symbols in S'2 ,1 (and so in L°°) whose associated pseudo-differential operators are unbounded on

L2 .

When m 0, the properties of pseudo-differential operators whose symbols are in SP are studied using a scale of spaces which measure the reguP, 6 larity of functions; for example, the Sobolev spaces (see Chapter 4, Section 7.7) or the Lipschitz spaces (see Stein [15, Chapter 5]). For further results on pseudo-differential operators and additional references see Stein [17], Journe [8] and Coifman and Meyer [2].

H' and BMO

1. The space atomic 1/ 1 In Chapter 3, Section 3, we saw that the Hilbert transform of an integrable function is not, in general, in L I. , and that a necessary condition for this is that the function have zero integral. Here we are going to define a subspace of L 1 whose image under any singular integral is in Ll. We begin by defining its basic elements—atoms. Definition 6.1. An atom is a complex-valued function a defined on IV which is supported on a cube Q and is such that

f

a(x) dx = 0 and

IlaII

Q

^

1(21

.

Proposition 6.2. Let T be an operator as in the hypotheses of Theorem 5.10 whose kernel satisfies condition (5.10). Then there exists a constant C

such that, given any atom a, C.

Proof. Since a E L 2 , Ta is well defined. Let Qt be the cube with the same center as Q, cQ, and side length 2 \Fi times larger. Then, since T is bounded on L 2 , )1/2

ITa(x)I dx 5_ IQ * I 112 iTa(x)I2 dx Q*

115

6. H 1 and BMO

116

Theorem 6.4. H 1 (Rn) = Halt (IV) and their norms are equivalent.

< C.

We will not prove Theorem 6.4; a proof can be found in either GarciaCuerva and Rubio de Francia [6, Chapter 3] or Stein [17, Chapter 3].

L

L Q. ITa(x)Idx "Rn\Q*

2. The space BMO

K (x,y)a(y) dy dx

f [K(x,y) — K (x, cc2)1a(y) dy

Li •f

Given a function f E LL(Rn) and a cube Q, let fc2 denote the average of f on Q:

dx

Q

n \Q *

111# f(x) = sup

(6.1) O

We now define the space atomic H 1 , denoted by Hlt , by : a3 atoms, Ai E C,

E 1A3 1 <

.

Q3x

BMO = {f E

Clearly, Halt is contained in L'. Define a norm on Halt by

IlfIlli = inf{ E 'Ail : f = t

i

1

f Q

If —

where the supremum is taken over all cubes Q containing x. Each of these integrals measures the mean oscillation of f on the cube Q; we say that f has bounded mean oscillation if the function MO f is bounded. The space of functions with this property is denoted by BMO:

3

3

.

Define the sharp maximal function by

< C.

f IQ'

fc2 =

QfRn\Q*IK(X,Y) IC(X)cc2)1dx la(Y)1 dy

=

117

< cl Q,11/2 (f ia(x) 12 dx ) 1/2

Further, since a has zero average, by condition (5.10)

1-1 ,t (R n)

2. The space BMO

(:)c : M# f E

We define a norm on BMO by

Ilf II. = Ilm # f11.•

A 3 aj }

With this norm H alt is a Banach space. (Details are left to the reader.) Further, it follows immediately from Proposition 6.2 that singular integrals are bounded from Halt to L l .

Corollary 6.3. Let T be an operator as in Proposition 6.2, and let f E Halt . Then

This is not properly a norm since any function which is constant almost everywhere has zero oscillation. However, these are the only functions with this property, and it is customary to think of BMO as the quotient of the above space by the space of constant functions. In other words, two functions which differ by a constant coincide as functions in BMO. On this space is a norm and the space is a Banach space. It is easy to see that we have the pointwise inequality

IlTfIii 5_ ClIfilHL•

M * .i(x) CnM f (x)

The space Halt is the largest subspace of Ll(Rn) for which Corollary 6.3 holds in the following sense: let R1, , R n be the Riesz transforms in Rn (the Hilbert transform if n = 1), and define the space

with the Hardy-Littlewood maximal function. The constant depends on the dimension n, but if we replace M by M", the variant of the maximal operator where the supremum is taken over all cubes containing x (see (2.6)), the constant is 2.

H 1 ( 7') =

If E L i (11r) : R3 f E L i (Rn ), 1 5_ j n}

with the norm

(6.2)

=

Then the following is true.

Proposition 6.5. 1

+E

(6.3)

Ilf II. 5_ sue ainofc

1

fc2 If(x)— al dx <

M#(1fI)(x) 2M#f(x).

Ilf II.;

118

6. H 1 and BMO

Proof. The second inequality in (6.2) is immediate. To prove the first, note that for all a,

If(x)

dx f

If (x) — al dx + f — fQIdx < 2 f

If(x)— dx.

Now divide both sides by IQ', take the infimum over all a and then the supremum over Q. Inequality (6.3) follows from (6.2) (with a = I fQ1) since 1

1(21

-

IQ'

Let a = T f2(cc2). Then, since (5.11) holds and since T is bounded on

L2 , 1

IQI

I

Q

IT f (x)— al dx

(x)I dx

IQI 1

Q

Inequality (6.2) defines a norm equivalent to II II., one which provides a way to show that f E BMO without using its average on Q: it suffices to find a constant a (that can depend on Q) such that

If(r)-

0

ixi

) I xI <

IQI

C (.1

L

f

.

if(x)12dx)1/2

[K (x, y) — K (cc 2 , y)] f (y) dy dx

Q bt f .,, \Q. IK (x, y) — K (c(2

It follows from (6.3) that if f E BMO then I fl is also in BMO. However, the converse is not true. This confirms what the definition suggests: being in BMO is not just a question of size. Clearly L' C BMO, but there are also unbounded BMO functions. The typical example on RI is {log (

+

f

al dx C

with C independent of Q.

f (x) =

f

1 /2 ITfi(x),2dx) 0

wi

119

+ f(71 417-f2(x)_ T f2(cQ)Idx

1 dx < — f if (x) — fQ l dx.

f (x)I — I f

2. The space BMO

1,

- 1 .

But it is easy to see that the function sgn(x)f (x) BMO even though f is its absolute value. The next result shows the connection between BMO and singular integrals. Theorem 6.6. Let T be an operator as in the hypotheses of Theorem 5.10 whose kernel satisfies condition (5.11). Then if f is a bounded function of compact support, T f E BMO and

IIT flI* 5_ CUL°.

I dY dx • Ilf 1100

0 Theorem 6.6 is a first step towards our goal of finding a space which contains the images of bounded functions under singular integrals. However, the set of bounded functions with compact support is not dense in L", so we cannot extend T to the whole space by continuity. Instead, we will extend the definition of T to one which holds on all of Let f be a bounded function and let Q be a cube in W1 centered at the origin. Define Q* as above and again decompose f as fi = f XQ* • Since fi is bounded and has compact support, T fi is well defined as an L 2 function; hence T fi(x) exists for almost every x. For x E Q define (6.4) T f (x) = T fi(x) + f [K (x, y) — K( 0 , Y)Lf2(y) dy

•

This integral converges since it is bounded by

II/II. f f \Q. IK(x, y )— K(0, y)ldy, provided K satisfies (5.11).

Proof. Fix a cube Q in lir with center cQ, and let Q* be the cube centered at cQ whose side length is 2 \Ft times that of Q. Decompose f as f = fi+ f2, where fi = fxcp.

Now let Q be some other cube centered at the origin which contains Q. Then we have two definitions of T f (x) for x E Q. Let fl = f xce and f2 = f XR. \Q*; then the difference between the two definitions is

120

6. H I- and BMO

ILI”\Q*

Ilit".\Q*

3. An interpolation result

(x, y) - K(0, y)] f (y) dy

T( - fi)(x) + f

[K (x y) K( 0 , Yaf (Y) = - I ce\Q*

K(0, y)f(y) dy,

and this is independent of x. Hence, the two definitions coincide as functions in BMO since in BMO we identify functions which differ by a constant. We can, therefore, take (6.4) as the definition of the image of f under T. Further, by arguing as we did in Theorem 6.6, we can show that if f E L" then T f E BMO. Finally, note that we do not have to take cubes centered at the origin but can take any cube. If f is a bounded function of compact support, then (6.4) coincides with our original definition since for Q large enough, f = f1. More generally, if f E then the two definitions agree as BMO functions: since

S

Jinn \Q *

Example 6.7. Let f (x) sgn(x); we will find the image of f under the < a/2; Hilbert transform, whose kernel is K (x, y) = [ir(x - y)] -1 . Let then

= 2 log

a

x - y

dy + lim

N—>oo

- 2 log(a).

In applying interpolation theorems (such as the Marcinkiewicz interpolation theorem) we often use the fact that an operator is bounded from L' to L°°. Here we show that we can replace this with the weaker condition of boundedness from L°° to BMO. Theorem 6.8. Let T be a linear operator which is bounded on LP° for some Po, 1 < po < co, and is bounded from L' to BMO. Then for all p, Po < p < co, T is bounded on LP.

The proof of Theorem 6.8 requires two lemmas. Lemma 6.9. If 1 < Po < p oo and f E LP° then

(6.5)

f

Md f (x)P dx < C f

RP'

f (x)P dx,

where Md is the dyadic maximal operator (2.9).

K(0, y)f (y) dy

converges absolutely to a value independent of x, the two definitions differ by a constant.

7rH f (x) = p. v.I

121

3. An interpolation result

—a I LN a

N (

1 1

x-Y

sgn(y) dy

By Lemma 2.12 we can replace the dyadic maximal function in (6.5) with the Hardy-Littlewood maximal function. Thus, while the reverse of the pointwise inequality MO f (x) < CM f (x) is not true in general, Lemma 6.9 provides a substitute norm inequality. The proof of Lemma 6.9 uses a technique known as a "good-A inequality"; more precisely, it depends on the following result. Lemma 6.10. If f E LP° for some po, 1 < po < oo, then for all -y > 0 and

A > 0, 1-(x E R a : Md f (x) > 2A, M#f (x) 7A)- I 2n-ylfx E Il8 n : Md f (x) >

Hence, ignoring the constant, 2

H(sgn(x)) - log 7r This is an indirect proof that log Ix' is a function in BMO. This result also agrees with our original motivation for the Hilbert transform since there exists an analytic function in the upper half-plane whose real part tends to sgn(x) and whose imaginary part tends to log Ix' as we approach the real axis. We can give such a function explicitly: i log(iz) = i log(-y + ix) = i log(x2 + y 2)1/2 + arctan ( x / y ) . As y -4 0 the limit of this function is 7r - sgn(x) i log 2

Proof. Without loss of generality we may assume that f is non-negative. Fix A, y > 0. If we form the CalderOn-Zygmund decomposition of f at height A, then the set {x E 1[8 n : Md f (x) > A} can be written as the union of disjoint, maximal dyadic cubes. (See Theorem 2.11; we can substitute f E LP° for the hypothesis f E L l in the proof.) If Q is one of these cubes, then it will suffice to show that (6.6)

Ifx E Q : Md f (x) > 2A, /10 f (x) 7All 5_ 2 n71(21•

Let Q be the dyadic cube containing Q whose sides are twice as long. Then, since Q is maximal, To < A. Furthermore, if x E Q and Md f (x) > 2A, then Md(fxQ)(x) > 2A, so for those x's, Md((f - fcdX(2)(x) Md(fxQ)(x) - fQ

> A.

122

6. H I' and BMO By the weak (1,1) inequality for Md (Theorem 2.10),

Ifx E Q

If (x) -

Md((f fedX(2)(x) > Ali ^

dx

Now suppose that f E LP has compact support. Then f E LP° and so T f E LP°. Therefore, we can apply Lemma 6.9 to T f . In particular, since 1Tf(x)1 Md(Tf)(x) a.e.,

f

< 2nIQI 1 f If (x) - f dx A IQ! C?

2nIQI

IT f (x)IP dx f Md(T f)(x)P dx < C f M# (T f)(x)P dx

A xiEliQ f M#f (x). If the set {x E Q : Mdf (x) > 2A, 111# f (x) < -yAl is empty there is nothing to prove. Otherwise, for some x E Q, f (x) < -yA and so inequality (6.6) holds.

C f If (x)IP dx. 0

Proof of Lemma 6.9. For N > 0 let

= I pA 19-1 1{X E

IN

: Mdf (x) > A}1 dA.

IN is finite since IN .< —NP - P° f PoA P°-1 1{x E R n :

Po

and Mdf IN

E LPO

= 2P < 2P

> A} I dA,

:

2P f" < 2P+n7IN ryP pAP-11{x E

o Fix -y such that 2P+n7 = 1/2; then

<

4.

The John Nirenberg inequality -

In this section we examine the rate of growth of functions in BMO. We first consider log( 1 /Ix I ) which we gave as an example of an unbounded function in BMO. On the interval (-a, a), its average is (1- log a); given A > 1, the set where

since f E LPG. Furthermore,

N/2 f pAP -1 1{x E R' : Mdf(x) > 2A}1 dA 4) N/2 f pAP -1 (1{x E : Mdf(x) > 2A, f (x) -yA}1 Ifx E

(6.7)

Mdf (X)

2P+ 1 fr N/ 2 yP

f (x) > -yA I) dA : M#f (x) > All clA.

pAP-11{x E W • M # f(x) > All dA.

If f(M#f)P = oo then there is nothing to prove. If it is finite then we can take the limit as N oo in inequality (6.7) to get (6.5). Proof of Theorem 6.8. The composition M# o T is a sublinear operator. It is bounded on LP° since both these operators are, and it is bounded on L" since

Ilm# ( 77)11. = cll f Ilco. Hence, by the Marcinkiewicz interpolation theorem M# o T is bounded on LP , Po A has measure 2ae -A-1 . The next result shows that in some sense logarithmic growth is the maximum possible for BMO functions. Theorem 6.11 (John-Nirenberg Inequality). Let f E BMO. Then there exist constants C1 and C2, depending only on the dimension, such that given any cube Q in R I' and any A > 0, < I{x E Q : 11(x) - fQI > (6.8)

Proof. Since (6.8) is homogeneous, we may assume without loss of generality that Ilf II. = 1. Then Tcji L,f(x)_ fQI dx <1.

(6.9)

Now form the CalderOn-Zygmund decomposition of f - fQ on Q at height 2. (This is done as in Theorem 2.11 except that we begin by bisecting the sides of Q to form 2n equal cubes and repeating this for all cubes formed. Inequality (6.9) replaces the hypothesis that f E L l .) This gives us a family of cubes {Q1, 3 } such that 2 <

1

Ich,J1

(x)

fc21dx 2n+1

124

6. Ill and BMO

21. Hence,_ c2 A Q , : 1log /2 C2A where C2 log 2/2n+ 2 . If A < 2n+ 1 th en e1g

and If (x) - fQ I < 2 if x Ui Q i ,j . In particular,

E IQI,jI 5_2 fQ I f (x) - fQI dx -21 1QI, 3

and

IfQ

ij

fQ

I=

1 1Q 1,

0

so we can take C, = N/1

j1 Li.3( f (x) - fci) dx < 2n+1 .

IfQ1,j.k

1/p

= sup ( —a, f If (x) - fc2IP dx) 1W1 Q Q

is a norm on BMO equivalent to II • ii*• equality is immediate. But by Theorem 6.11 if(x)

fQi,J1 5- 2 if x E

E

-

fcli P dx

PAP-11{X EQ:If (x) — fc2I > dA

0

Q1,j,k7 Cl

k

Gather the cubes Q1, 3 ,k corresponding to all the (21,3's and collectively rename them {Q2, 3 }. Then

since the reverse in

Proof. It will suffice to prove that II.f II*,p

f(21,; I < 2n+1 7

E IQ1,j,kl 5 2 1Q1,31.

IQ'

f j0

00

PAP-1

e-c2A/11111. dA.

Make the change of variables s C2 A/ Ilfll *; then we get 1

IQI

Lif(x)-/QIP dx

7,1(21,

CiP

lif II.)P f (x) ds

\ C2

0

= CipCYPr(P)Ilfg, which yields the desired inequality.

and if x Uj Q2, j , If (x) — fQI

Ifx EQ:If (x) — f(21 > All 15 IQ

Corollary 6.12. For all p, 1 < p < co,

On each cube Qi, 3 form the CalderOn-Zygmund decomposition of f fch,, at height 2. (Again by assumption the average of this function on Qi o is at most 1.) We then obtain a family of cubes {Q 1 , j , k } which satisfy the following:

(x)

125

4. The John-Nirenberg inequality

If (x) — fc2i,, I +

—

/Q1 <

2 ± 2n+1 < 2 ' 2n+1 .

Repeat this process indefinitely. Then for each N we get a family of disjoint cubes {QN, j } such that

I f (x) - fQl N • 2n+ 1

As a consequence of the proof of Corollary 6.12 we get two additional results. Given the size of the constant Cp , if we expand the exponential function as a power series we immediately get the following.

Corollary 6.13. Given f E BMO, there exists A > 0 such that for any cube Q, 1

if x Ui QN, j and

E

IQI

eAlf(x)-fc21 dx < 00. Q

5-

Fix A > 2n+ 1 and let N be such that N2 72 + 1 < A < (N + 1)2n+ 1 . Then ifx EQ :If (x) — fQI > All

E 1

e -Nlog 2 1(21 e -c2 A

Further, the proof of Corollary 6.12 (with p 1) can be readily adapted to show that the converse of the John-Nirenberg inequality (Theorem 6.11) holds.

Corollary 6.14. Given a function f, suppose there exist constants C,, C2 and K such that for any cube Q and A > 0,

Ilx E Then f E BMO.

11(x) — fc2I > All Cie-c2A/K IQI•

6. 1/ 1 and BMO

126 5.

Notes and further results

5.1. References.

For the development of Hardy spaces and in particular the space 1/ 1 , see Section 5.2 below. E. M. Stein (Classes HP, multiplicateurs et fonctions de Littlewood-Paley, C. R. Acad. Sci. Paris 263 (1966), 716-719, 780-781) first proved that singular integrals and multipliers are bounded from 1/ 1 to L'. The space BMO was introduced by F. John and L. Nirenberg while studying PDE's (On functions of bounded mean oscillation, Comm. Pure Appl. Math. 14 (1961), 415-426). There they proved Theorem 6.11. That singular integrals take Lc° into BMO was first observed by S. Spanne (Sur l'interpolation entre les espaces , Ann. Scuola Norm. Sup. Pisa 20 (1966), 625-648), J. Peetre (On convolution operators leaving LP'A spaces invariant, Ann. Mat. Pura Appl. 72 (1966), 295-304), and E. M. Stein (Singular integrals, harmonic functions, and differentiability properties of functions of several variables, Singular Integrals, Proc. Sympos. Pure Math. X, pp. 316-335, Amer. Math. Soc., Providence, 1967). The sharp maximal function was introduced by C. Fefferman and E. M. Stein (HP spaces of several variables, Acta Math. 129 (1972), 137-193); in the same paper they proved Theorem 6.8. 5.2. The HP spaces.

The Hardy spaces were first studied as part of complex analysis by G. H. Hardy ( The mean value of the modulus of an analytic function, Proc. London Math. Soc. 14 (1914), 269-277). An analytic function F in the unit disk D = {Z E C : IZI < 1} is in HP(D), 0 0} is in HP(R 2+ ) if

= {z = x + iy :

sup f I F(x + iy)IP dx < 00. y>0

When p > 1 it can be shown that HP coincides with the class of analytic functions whose real parts are the Poisson integrals of functions f in LP(T) or LP(R), respectively. We can therefore identify HP(D) with LP(T) and HP(R 2+ ) with LP(R). This identification does not hold, however, for p < 1. Functions in HP(D) and HP(R 2+ ) have a rich structural theory based on "factorization" theorems which allow them to be decomposed into well understood components. For a thorough treatment from this perspective, see Koosis [11] or Rudin [14]. Unfortunately, these results cannot be extended to higher dimensions using the theory of functions of several complex variables. E. M. Stein

5. Notes and further results

127

and G. Weiss (On the theory of harmonic functions of several variables, I: The theory of HP spaces, Acta Math. 103 (1960), 25-62, also see Stein [15, Chapter 7]) defined HP(Rn++ 1 ) by considering vector-valued functions which satisfy generalized Cauchy-Riemann equations. This extension is only valid if p > 1 — 1/n, which does include H I . This space H I- is isomorphic (by passing to boundary values) to the space of functions in Ll(R") whose Riesz transforms are also in L 1 (R"). (This is the space we called H l (R") in Section 1.) D. Burkholder, R. Gundy and M. Silverstein (A maximal characterization of the class HP, Trans. Amer. Math. Soc. 157 (1971), 137-153) proved a crucial result for establishing a strictly real-variable theory of HP spaces. Given F = u + iv, analytic in the upper half-plane, they showed that F is in HP if and only if the non-tangential maximal function of its real part, u a (x0) sup{lu(x, y)I : (x, y) E r a (X0)}, where Fa(x0) = {(x, y) E R 2+ : Ix — xol < ay} , is in LP(R). (See Chapter 2, Section 8.7.) Since u(x, y) = Py * f (x), the Poisson integral of its boundary value f , this characterizes the restriction to R of the real parts of functions in HP(R 2+ ). This space of functions on R, which is customarily denoted by HP(R) (or sometimes by Re HP(R)), is equivalent when p =1 to the space H 1 (R) defined in Section 1. The original proof of this result used Brownian motion; a complex analytic proof was given by P. Koosis (Sommabilite de la fonction maximale et appartenance a 1/1, C. R. Acad. Sci. Paris 28 (1978), 1041-1043, also see [11, Chapter 8]). The key feature of this result is that the fact that u is a harmonic function is irrelevant, in the sense that the same characterization of the boundary values holds if we replace the Poisson kernel by any approximation of the identity defined starting from 0 E S with non-zero integral. This leads to an elegant definition of HP in higher dimensions: given 0 E S(R'2 ) with non-zero integral, we say a function f E HP(Rn) if M f (x0) = sup-C[0 y * f (x)I : (x, y) E r a (X0)} is in LP(W'). The resulting space is independent of 0: in fact, one can replace MM by the so-called grand maximal function .A4 f (x) = sup Mo f (x), where the supremum is taken over all such 0. For p > 1 — 1/n this space is isomorphic (again by taking boundary values) to the HP spaces of Stein and Weiss above. This approach is developed in the excellent article by Fefferman and Stein cited above. Also see Stein [17, Chapter 3].

128

6. 11 1 and BMO

The atomic decomposition of functions in HP(R) is due to R. Coifman (A real variable characterization of HP, Studia Math. 51 (1974), 269-274); it was extended to higher dimensions by R. Latter (A decomposition of HP(Rn) in terms of atoms, Studia Math. 62 (1978), 93-101; also see R. Latter and A. Uchiyama, The atomic decomposition for parabolic HP spaces, Trans. Amer. Math. Soc. 253 (1979), 391-398). For simplicity we used an atomic decomposition as the basis of our definition of H 1 (Rn)—the space we called Halt (Rn). Using these methods, R. Coffman and G. Weiss (Extensions of Hardy spaces and their use in Analysis, Bull. Amer. Math. Soc. 83 (1977), 569-645) defined HP spaces on spaces of homogeneous type (see Chapter 5, Section 6.4). Their definition only holds for po < p < 1, where po is a constant which depends on the space being considered. A monograph on the real theory of HP spaces with applications to Fourier analysis and approximation theory is due to Shanzhen Lu (Four Lectures on Real HP Spaces, World Scientific, Singapore, 1995). Finally, we mention another characterization of HP spaces in terms of the Lusin area integral. If u is a harmonic function on Rn++ 1 , let

dy dx

S(u)(xo) = (f

112

1-11(xo)

where r ah (xo) = {(x, y) E RI.+1 — xol < ay, 0 < y < h} and

5. Notes and further results

129

5.3. Duality. The results on boundedness from H 1 to L 1 and from L°° to BMO are dual to one another. This is a consequence of a deep result due to C. Fefferman (announced in Characterizations of bounded mean oscillation, Bull. Amer. Math. Soc. 77 (1971), 587-588). Theorem 6.15. The dual of Hl(Rn) is BMO. The article by Fefferman and Stein cited above gives two proofs of this result. Three appear in Garcia-Cuerva and Rubio de Francia [6]. The duals of the HP spaces, p < 1, are Lipschitz spaces. This is due to P. Duren, B. Romberg and A. Shields (Linear functionals on HP spaces with 0 < p < 1, J. Reine Angew. Math. 238 (1969), 32-60) on the unit circle and to T. Walsh (The dual of HP(1117+ 1 ) for p < 1, Can. J. Math. 25 (1973), 567-577) in Rn. Also see [6, Chapter 3]. 5.4. The Hardy-Littlewood maximal function on BMO. The maximal function is bounded on BMO: if f E BMO then M f E BMO. This was first proved by C. Bennett, R. A. DeVore and R. Sharpley ( Weak Lc° and BMO, Ann. of Math. 113 (1981), 601-611); simpler proofs were given by F. Chiarenza and M. Frasca (Morrey spaces and HardyLittlewood maximal function, Rend. Mat. Appl., Series 7, 7 (1981), 273-279) and D. Cruz-Uribe, and C. J. Neugebauer (The structure of the reverse HOlder classes, Trans. Amer. Math. Soc. 347 (1995), 2941-2960). See also a proof in Torchinsky [19, p. 204]. 5.5. Another interpolation result.

Ivu l 2 =

au ay

2

n

v (i ax3 j=1

2

(This is called the area integral since if n = 1, S(u) 2 is the area of the image of F ah(x0), counting multiplicities, under the analytic function F whose real part is u.) Then u E HP(R7+1 ) if and only if S(u) E LP (R) and u(x, y) 0 as y co. An alternative characterization is gotten by replacing the area integral by a Littlewood-Paley type function which is its radial analogue:

1/2 00 g(u)(x) = IV u(x , y)1 2 y dy) This characterization is due to A. P. Calder& (Commutators of singular integral operators, Proc. Nat. Acad. Sci. U.S.A. 53 (1965), 1092-1099). See also Garcia-Cuerva and Rubio de Francia [6], Stein [15], Zygmund [21] and the article by Fefferman and Stein cited above.

In applying the Marcinkiewicz interpolation theorem one often uses that the operator in question is bounded on L°° or satisfies a weak (1, 1) inequality. In Theorem 6.8 we showed that we can replace the first hypothesis by the assumption that the operator maps L' into BMO. One can also use H 1 to replace the weak (1, 1) inequality. Theorem 6.16. Given a sublinear operator T, suppose that T maps 11 1

into L l and for some pl, 1 < pl < oo, T is weak (pi,p1). Then for all p, 1 < p < pl, T is bounded on LP. This result can be generalized to the other HP spaces. For details and references, see Garcia-Cuerva and Rubio de Francia [6, Chapter 3]. 5.6. Fractional integrals and Sobolev spaces. For fractional integral operators I a (see Chapter 4, Section 7.7), the critical index is p = n/a: we would expect I a to map Vil a to but it does not. Instead, we have a result analogous to Theorem 6.6.

130

6. H l and BMO

Theorem 6.17. If f E Ln/a then Ia f E BMO.

Theorem 6.18. There exist positive constants

and C2 such that for all

locally integrable f , C 11/Ia f (x) <

131

5.7. Vanishing mean oscillation.

The proof of this theorem follows at once from an inequality relating the fractional integral, the sharp maximal function and the fractional maximal function.

5. Notes and further results

(I a f)(x) C2Mal (x).

Given a function f E BMO, we say that f has vanishing mean oscillation, and write f E VMO, if

.

Q

tends to zero as I.(21 tends to either zero or infinity. As in BMO, there are unbounded functions in VMO. For example, f (x) = {loglog(111x1),

0, Both of these results are due to D. R. Adams (A note on Riesz potentials, Duke Math. J. 42 (1975), 765-778). When p = 1, besides the weak (1, n/(n, — a)) inequality in Theorem 4.18, we also have an analogue of Corollary 6.3: < This result is due to Stein and Weiss see the article cited above. Similar inequalities hold for all the HP spaces: see the article by S. Krantz (Fractional integration on Hardy Spaces, Studia Math. 73 (1982), 87-94). For Sobolev spaces, the corresponding limiting theorem would be that /k if f E L k then f E BMO. However, this is only known when k = 1: see the paper by A. Cianchi and L. Pick (Sobolev embcddings into B1110, VMO and L o , Ark. Mat. 36 (1998), 317-340). In the general case we have as a substitute an exponential integrability result (cf. Corollary 6.13). Theorem 6.19. Given 0 < a < n and any cube Q, then there exist constants Cr and C2 such that

if f E L n/ a (Q),

Cillfilp]/"(Q)

P) < C2.

(x) — f Qi dx

< 1/e, Ixl> 1 /e,

is in VMO, and in fact it can be shown that in some sense it has the maximum rate of growth possible for functions in VMO. Clearly, if f E Co (i.e. the space of continuous functions which vanish at infinity) then f E VMO; further, VMO is the closure in BMO of Co . VMO lets us sharpen Theorem 6.6 as follows: given a singular integral operator T, if f E Co then T f E VMO. The space VMO was introduced by D. Sarason (Functions of vanishing mean oscillation, Trans. Amer. Math. Soc. 207 (1975), 391-405). Also see the survey article by R. Coifman and G. Weiss (Extensions of Hardy spaces and their uses in analysis, Bull. Amer. Math. Soc. 83 (1977), 569-645). 5.8. Commutators and BMO.

We can give another characterization of BMO in terms of singular integrals. Let T be a singular integral, and for a locally integrable function b, let Mb be the operator given by pointwise multiplication by b: Mb f (x) = b(x)f (x). Define the commutator [b, T] by MbT — TMb. If Tf = IC* f then for f E Cea° [b, T] f (x) = f (b(x) — b(y))K (x — y) f (y) dy, x supp( f). art

Theorem 6.19 was proved by N. S. Trudinger (On imbeddings into Orlicz spaces and some applications, J. Math. Mech. 17 (1967), 473-483) when a = 1 and in general by R. S. Strichartz (A note on Trudinger's extension of Sobolev's inequalities, Indiana Univ. Math. J. 21 (1972), 841842). Sharp constants have been found by J. Moser (A sharp form of an inequality by N. Trudinger, Indiana Univ. Math. J. 20 (1971), 10771092) and D. R. Adams (A sharp inequality of J. Moser for higher order derivatives, Ann. of Math. 128 (1988), 385-398). Also see the books by W. Ziemer (Weakly Differentiable Functions, Springer-Verlag, New York, 1989) and D. R. Adams and L. Hedberg (Function Spaces and Potential

Clearly, if b E L' then [b,T] is bounded on LP, 1 < p < oo. Since both M b T and TMb are bounded exactly when b is a bounded function, it would be reasonable to conjecture that this is also the case for [b, T]. However, due to cancellation between the two terms, a weaker condition is sufficient.

Theory, Springer-Verlag, Berlin, 1996).

103 (1976), 611-635) who used them to extend the classical theory of HP

Theorem 6.20. Given a singular integral T whose associated kernel is standard, then the commutator [b, T] is bounded on LP, 1 < p < oo, if and only if b BMO.

Commutators were introduced by R. Coifman, R. Rochberg and G. Weiss (Factorization theorems for Hardy spaces in several variables, Ann. of Math.

132

6. H I- and BMO

spaces to higher dimensions. They proved that b E BMO is sufficient for [b, T] to be bounded and proved a partial converse. The full converse is due to S. Janson (Mean oscillation and commutators of singular integral operators, Ark. Mat. 16 (1978), 263-270). This paper also contains a simpler proof of sufficiency due to J. 0. StrOmberg which is reproduced in Torchinsky [19, p. 418]. The commutator [b, T] is more singular than the associated singular integral; in particular, it does not satisfy the corresponding weak (1, 1) estimate. However, the following weaker result is true.

Chapter 7

Weighted Inequalities

Theorem 6.21. Given a singular integral T and b E BMO, then : l[b,T] f (x)I > All < Clibll. If(( (1)I+, log+ I E

(If(y)I)) dy.

Theorem 6.21 is due to C. Perez (Endpoint estimates for commutators of singular integral operators, J. Funct. Anal. 128 (1995), 163-185). In this paper he also discusses the boundedness of [b, T] on H 1 . A. Uchiyama (On the compactness of operators of Hankel type, TOhoku Math. J. 30 (1978), 163-171) proved that the commutator [b, T] is a compact operator if and only if b E VMO.

1. The A P condition

In this chapter we are going to find the non-negative, locally integrable functions w such that the operators we have been studying are bounded on the spaces LP (w). (These are the LP spaces with Lebesgue measure replaced by the measure w dx.) We will refer to such functions as weights, and for a measurable set E we define w(E) = fE Our first step is to assume that the Hardy-Littlewood maximal function satisfies a weighted, weak-type inequality, and deduce from this a necessary condition on the weight w. To simplify our analysis, throughout this chapter we will replace our earlier definition of the Hardy-Littlewood maximal function with 1

n, M f (x) = sup i— QDx

Q

If(y)idY,

where the supremum is taken over all cubes Q containing x. This is the non-centered maximal operator we originally denoted by M" (see (2.6)), but recall that it is pointwise equivalent to our original definition (2.3). The weighted, weak (p, p) inequality for M with respect to w is (7.1)

-7 fR,„ If (x)iPw(x)dx. wax E R : M f (x) > Al) <-AC

Let f be a non-negative function and let Q be a cube such that f(Q) = fQ f > 0. Fix A such that 0 < A < f(Q)/IQI. Then Q C {x E ll : M(fx(2)(x) > A}, so from (7.1) we get that w(Q) <

Li

f (x)IP w(x) dx

.

133

7. Weighted Inequalities

134

Since this holds for all such A, it follows that (7.2)

w(Q) fi (( Q21 ) ) P C

f IflPw.

1. The A p condition

zero. Taking the union over all such cubes we see that Mw(x) > Cw(x) holds only on a set of measure 0 in Ill'. Case 2: 1 < p < oo. In (7.2) let f = wl - P'xc2; then

Given a measurable set S C Q, let f = Xs. Then (7.2) becomes (7.3)

S11 ) w(Q) IQ

P

_< Cw(S).

From (7.3) we can immediately deduce the following: (1) The weight w is either identically 0 or w > 0 a.e. For if w = 0 on a set of positive measure S (which we may assume is bounded), then (7.3) implies that for every cube Q that contains S, w(Q) = 0. (2) The weight w is either locally integrable or w = oo a.e. For if w(Q) co for some cube Q, then the same is true for any larger cube, and so by (7.3), w(S) oo for any set S of positive measure. Note that while we had assumed a priori that w was locally integrable, this is actually a consequence of the weighted norm inequality. To deduce the desired necessary conditions, we will consider two cases. Case 1: p = 1. In this case inequality (7.3) becomes w(Q) < C w( 8)

1Q1 -

151

Let a = inf{w(x) : x E Q}, where inf is the essential infimum, that is, excluding sets of measure zero. Then for each E > 0 there exists S, C Q such that I SE I > 0 and w(x) < a + e for any x E SE . Hence, for all E > 0,

w(Q) < C(a + E),

1Q1

and so w(Q ) < C inf w(x). IQ' inf

135

w(Q) (1 f

1Q1Q w

Equivalently, (7.6)

l- P) < C f

GcTil L w )( m 1

where C is independent of Q. (Note that to derive (7.6) we have assumed that w 1- p' is locally integrable. To avoid this assumption we can replace it by min(w 1-71 , n) and take the limit as n tends to infinity. However, since w > 0 a.e., (7.6) implies that w 1- p' is locally integrable.) Condition (7.6) is called the Ap condition, and the weights which satisfy it are called A p weights. Theorem 7.1. For 1 A}) — ACp

L,f(x)rw(x)

dx

holds if and only if w E Ap . Proof. We proved the necessity of the A p condition above. To prove sufficiency, we first consider the case p = 1. This case is a corollary to Theorem 2.16. The right-hand side of the weak (1,1) inequality (2.11) contains Mw, and since w E A1, by (7.5) we can replace Mw by Cw. Now suppose that p > 1 and w E Ap . Given a function f, we first show that inequality (7.2) holds, and so inequality (7.3) also holds. By Holder's inequality

fQ Ifl) P = (FQ1-1 L If Iw i /Pw -1 /P) P 1

for any cube Q, (7.4)

w(Q) < Cw(x) a.e. x E Q.

IQ'

This is called the Al condition, and we refer to the weights which satisfy it as Al weights. Condition (7.4) is equivalent to (7.5)

Mw(x) < Cw(x) a.e. x

Clearly, (7.5) implies (7.4). Conversely, suppose that (7.4) holds and let x be such that Mw(x) > Cw(x). Then there exists a cube Q with rational vertices such that w(Q)/IQl > Cw(x), so x lies in a subset of Q of measure

1/1 1'w PI Q

1 w i II P1(21 Q (11 . c (1.91 fQ IflPw) (Q)) (

1

)

Now fix f E LP(w); we may assume without loss of generality that f is nonnegative. We may also assume that f E L l since otherwise we can replace f by f XB(o,k) and the following argument will yield constants independent of k. (Note that the previous argument shows that f is locally integrable.) Form the CalderOn-Zygmund decomposition of f at height 4 - nA to get a collection

7. Weighted Inequalities

136

of disjoint cubes {Q 3 } such that f (Q > 4 - n A 1Q .31. (See Theorem 2.11.) Then by the same argument as in the proof of Lemma 2.12, we can show that Ix E

: Mf(x) > A} C U3Qj.

Therefore,

1

(7.7)

(— f wow s

IQ' Q

1

Tr-1

Q0

< C.

(— f

w (x) -1 < sup 2

xEQ

w

z

—1

(x) -1 = inf wi

(x) )

wi(C2)

IQI )

If we substitute this into the left-hand side of (7.7) for the negative exponents we get the desired inequality. ❑

w(3c2 ; ) 2. Strong-type inequalities with weights

< crP w(Qi ) ▪ c3" .7)

E

Using the Marcinkiewicz interpolation theorem, we can immediately derive a strong (p,p) inequality from Theorem 7.1. Let w E A q , 1 < q < p. Then L°°(w) = L°° with equality of norms since by (7.3), w(E) = 0 if and only if 1E1 = 0. Therefore, we have the inequality

J,Q,3!,) P f Ifl'w kW

CI;

▪ C 3nP ( -4An Y

IIMIIILOO(w)

IfIPW7

IlfIlL-(w)•

If we interpolate between this and the weak (q, q) inequality we get

where the second inequality follows from (7.3), the third from (7.2) and the ❑ fourth from the properties of the Q3's. The following properties of

(3) We need to prove that

By the Al condition, for x E Q and i = 0, 1,

(Here we dilate the cubes by a factor of 3 instead of 2 because M is the xeC2 non-centered maximal operator.)

wax E l : M f (x) > Al)

137

2. Strong-type inequalities with weights

AP weights are consequences of the definition.

Proposition 7.2.

(7.8)

f

1M.f I P w Cp

f n 1/1Pw

for all p > q. A much deeper result is that this inequality remains true when p = q. Theorem 7.3. If 1 < p < oo then M is bounded on LP (w) if and only if

(1) Ap c A q , 1 5_ p < q.

w E Ap .

(2) w E AP if and only if w l- P' E AP ,.

Since the strong (p,p) inequality implies the weak (p,p) inequality, necessity follows from Theorem 7.1. Sufficiency would follow from the above interpolation argument if we could show that given w E AP , there exists q, 1 < q < p, such that w E A q . The existence of such a q comes from the following key result.

(3) If wo, w1 E Al then wowl —P E Ap . Proof. (1) If p = 1 then 1

( IQI fc2 —

wi–g,

q-1 sup

<

1

(w(Q ))-1

p w(x) -1 = if w(x)) < C x EQ

IQ'

•

If p > 1 then this follows immediately from Holder's inequality.

Theorem 7.4 (Reverse Wilder Inequality). Let w E AP , 1 0, depending only on p and the AP constant

of w, such that for any cube Q, (7.9)

(2) The A P, condition for w 1-71 is

( 1 f

1/(1+0

Fc2f

1(21./ Q C IQI 1 1.1,wi – P)( 1- fQ w(i – Pixi – P)) 11-1 (w

< c,

and since (p' - 1)(p - 1) = 1, the left-hand side is the A p condition raised to the power - 1.

w.

The name of Theorem 7.4 comes from the fact that the reverse of inequality (7.9) is an immediate consequence of Wilder's inequality. To prove this result we first need to prove a lemma.

138

7. Weighted Inequalities

Lemma 7.5. Let w E A p , 1 < p < oo. Then for every a, 0 < a < 1, there exists 0 < < 1, such that given a cube Q and S C Q with 151 < (API, w(S) 5 Ow(Q)•

Now sum over all the cubes in the decomposition at height Ak; we then get w(C2k +1 ) < i3w(f2k). If we iterate this inequality we get w(Ik) < O k w(C20). Similarly, 1541 < a k 1S11 0 1; hence,

Proof. If we replace S by Q \S in inequality (7.3) we get

w(Q) (1 — If 1SI

w(S)

C

1

wi+E

,„1-Ef =

IQ'

w(Q),

which gives us the desired result with = 1 — C -1 (1 — a)P.

,00

IS2k1= 0.

Therefore,

alQI then C — (1 — a)P

= li

nQk

C(w(Q) w(S))•

IQ'

139

2. Strong-type inequalities with weights

❑

Q\c2p

E

j

,„

w1+E

k=0 k\ i4 + 1

< W(Q) 4-1-1W(C2k) 1Q1 1(21 —

k=0 Proof of Theorem 7.4. Fix a cube Q and form Calderon-Zygmund dec° n a —i ) ( k+i), Ap k w(lo). + 1 Ep w compositions of w (Q) with respect to Q at heights given by the increasing = sequence w(Q)/IQI A o < Al < • • < Ak < ; we will fix the Ak's below. 1(21 k=0 1(21 (The decomposition is formed by repeatedly bisecting the sides of Q; cf. the Fix e > 0 such that (2na -1 )'/3 < 1; then the series converges and the last proof of Theorem 6.11.) term is bounded by CA,Sw(Q)/1Q1. Since A o = w(Q)/1Q I we have shown the For each Ak we get a family of disjoint cubes {Qk such that desired inequality. w(x) Ak if xf{S2k =U Qk 7 As a corollary to the reverse Holder inequality we get the property of weights needed to complete the proof of Theorem 7.3. Ap 1

❑

Ak

W < 2 n Ak. IQk,31 .1(21, 0

<

Corollary 7.6.

It is clear from their construction that C2k+1 C C2k. If we fix Qk oo from the Calderon-Zygmund decomposition at height Ak, then Qk j 0 n S/k 4. 1 is the union of cubes Qk+1,, from the decomposition at height Ak+1. Therefore, IQk, 30

n 12k+1 I = E .

1 \---■ f Ak+1

kk+, 1

w

Ak+1 fQ,,,. 2nAk

Jo,. Ak+i Fix a < 1 and choose the Ak's so that 2 n Ak/Ak+i = a; that is, Ak (2na l)kw(Q)/1Q1. Then ,

(1) AP

=

Fourier analysis

Read more

Fourier analysis

Read more

Fourier analysis

Read more

Fourier Analysis

Read more

Fourier Analysis

Read more

Fourier Analysis

Read more

Principles of Fourier Analysis

Read more

Fourier And Wavelet Analysis

Read more

Fourier and Wavelet Analysis

Read more

Modern Fourier analysis

Read more

Exercises in Fourier Analysis

Read more

Wavelets with Fourier Analysis

Read more

Fourier analysis and convexity

Read more

Fourier Analysis, Self-Adjointness

Read more

Fourier analysis: an introduction

Read more

Discrete Fourier Analysis

Read more

Fourier Analysis and Convexity

Read more

Higher order Fourier analysis

Read more

Modern Fourier analysis

Read more

Fourier analysis: an introduction

Read more

Fourier and wavelet analysis

Read more

Principles of Fourier Analysis

Read more

Classical Fourier Analysis

Read more

Discrete Fourier analysis

Read more

Fourier and wavelet analysis

Read more

Fourier Analysis, Self-Adjointness

Read more

Fourier analysis and approximation

Read more

Fourier analysis : an introduction

Read more

Fourier Analysis, Self-Adjointness

Read more

Fourier Analysis - An Introduction

Read more

Recommend Documents

Fourier analysis

Ibookroot October 20, 2007 FOURIER ANALYSIS Ibookroot October 20, 2007 Princeton Lectures in Analysis I Fourier A...

Fourier analysis

Fourier analysis

Fourier Analysis Translation by Olof Staffans of the lecture notes Fourier analyysi by Gustaf Gripenberg January 5, 2...

Fourier Analysis

Fourier Analysis

Fourier Analysis

Principles of Fourier Analysis

Principles of Fourier Analysis © 2001 by Chapman & Hall/CRC © 2001 by Chapman & Hall/CRC Studies in Advanced Mathema...

Fourier And Wavelet Analysis

Fourier and Wavelet Analysis

George Bachman Lawrence Narici Edward Beckenstein FOURIER AND WAVELET ANALYSIS Springer George Bachman Lawrence Nari...

Modern Fourier analysis

Graduate Texts in Mathematics 250 Editorial Board S. Axler K.A. Ribet Graduate Texts in Mathematics 1 2 3 4 5 6 7 8...