dist(c, H) = (1/‖Φ‖) |Φ(c) − sup Φ(C)| = (1/‖Φ‖) {sup Φ(C) − Φ(c)}   (c ∈ C),

whence

dist(C, H) = inf_{c∈C} dist(c, H) = (1/‖Φ‖) inf_{c∈C} {sup Φ(C) − Φ(c)} = 0.

2° ⇒ 1°. Assume 2° and (1.42) (in the case of (1.43), replacing Φ by −Φ, we arrive at the case (1.42)), and let ε > 0. Then, by (1.58), there exist c_ε ∈ C and h_ε ∈ H such that ‖c_ε − h_ε‖ < ε/‖Φ‖. Hence

Φ(h_ε) − Φ(c_ε) = Φ(h_ε − c_ε) ≤ ‖Φ‖ ‖h_ε − c_ε‖ < ε,

and thus Φ(c_ε) > Φ(h_ε) − ε = d − ε, which proves (1.44). □
1.1 Some preliminaries from convex analysis
Corollary 1.2. Let X be a normed linear space. For a ball B = B(x, r) ⊆ X and a hyperplane H of (1.41), the following statements are equivalent:
1°. H quasi-supports the ball B.
2°. We have (1.58) with C = B, and

H ∩ int B = ∅.  (1.59)

3°. We have

dist(x, H) = r.  (1.60)

Proof. By the above, in order to prove the equivalence 1° ⇔ 2°, it will be enough to show that (1.42) (with C = B) ⇔ (1.59).
Suppose first that (1.60) holds, and let ε > 0. Then there exists y ∈ H such that ‖x − y‖ < r + ε, and for

z := (ε/(r + ε)) x + (r/(r + ε)) y  (1.61)

we have ‖x − z‖ = (r/(r + ε)) ‖x − y‖ < r, so z ∈ B, and ‖z − y‖ = (ε/(r + ε)) ‖x − y‖ < ε, whence, since ε > 0 was arbitrary, it follows that we have dist(H, B) = 0, i.e., (1.58) of 2°. On the other hand, if there existed an element y ∈ H ∩ int B, then we would have dist(x, H) ≤ ‖x − y‖ < r, violating the assumption (1.60); hence, we also have (1.59) of 2°. □

Lemma 1.6. Let X be a normed linear space, x ∈ X, and r > 0. For any Φ ∈ X* with ‖Φ‖ = 1, the hyperplane H ⊆ X defined by
H = {y ∈ X | Φ(y − x) = r} = {y ∈ X | Φ(y) = Φ(x) + r}  (1.62)
quasi-supports the ball B = B(x, r), and conversely, for each quasi-support hyperplane H of the ball B there exists a unique Φ ∈ X* with ‖Φ‖ = 1 such that we have (1.62).

Proof. Let H be a hyperplane of the form (1.62), with ‖Φ‖ = 1. Then, by Lemma 1.5, dist(x, H) = |Φ(x) − [Φ(x) + r]| = r, and hence, by Corollary 1.2, H quasi-supports the ball B. Conversely, let H = {y ∈ X | Φ₁(y) = d₁} be a quasi-support hyperplane of the ball B. Then, for Φ₂ := Φ₁/‖Φ₁‖, d₂ := d₁/‖Φ₁‖, we have ‖Φ₂‖ = 1 and H = {y ∈ X | Φ₂(y) = d₂}. Since H quasi-supports the ball B, we obtain, by Lemma 1.5 and Corollary 1.2, |Φ₂(x) − d₂| = dist(x, H) = r. Hence, for Φ := sign{d₂ − Φ₂(x)} Φ₂ we have ‖Φ‖ = 1 and Φ(y − x) = |d₂ − Φ₂(x)| = r (y ∈ H), so H ⊆ {y ∈ X | Φ(y − x) = r}. Since both sets in this inclusion are hyperplanes, they must coincide, so H is of the form (1.62), with ‖Φ‖ = 1. To prove uniqueness, assume that we also have H = {y ∈ X | Φ′(y − x) = r}, where Φ′ ∈ X*, ‖Φ′‖ = 1, Φ′ ≠ Φ. Then

H ⊆ {y ∈ X | (Φ − Φ′)(y) = (Φ − Φ′)(x)}.

Since Φ′ ≠ Φ, both sets in this inclusion are hyperplanes, so they must coincide. Hence, since x belongs to the second set, it follows that x ∈ H, and therefore, by (1.62), r = 0, in contradiction to the assumption that r > 0. Consequently, we have Φ′ = Φ. □

Remark 1.4. If for a functional Φ ∈ X* the hyperplane H defined by (1.62) quasi-supports the ball B(x, r), then, necessarily, ‖Φ‖ = 1, since by (1.60) and Lemma 1.5 we have
r = dist(x, H) = |Φ(x) − [Φ(x) + r]| / ‖Φ‖ = r / ‖Φ‖.

Definition 1.2. We shall say that a closed half-space V (respectively, an open half-space U) quasi-supports a subset C of a locally convex space X if the hyperplane bd V (respectively, bd U) quasi-supports the set C.

Lemma 1.7. A closed half-space V quasi-supports a set C if and only if it has one (and only one) of the forms

V₁ = {y ∈ X | Φ(y) ≥ sup Φ(C)}  (1.63)

or

V₂ = {y ∈ X | Φ(y) ≤ sup Φ(C)},  (1.64)
and an open half-space U quasi-supports a set C if and only if it has one (and only one) of the forms

U₁ = {y ∈ X | Φ(y) > sup Φ(C)}  (1.65)

or

U₂ = {y ∈ X | Φ(y) < sup Φ(C)},  (1.66)
where Φ ∈ X*\{0}, sup Φ(C) ∈ R. Also, conversely, every closed half-space V of the form (1.63) or (1.64), and every open half-space U of the form (1.65) or (1.66), where Φ ∈ X*\{0}, sup Φ(C) ∈ R, quasi-supports the set C.

Proof. This follows from Definition 1.2 and Corollary 1.1, since for V = V₁ or V = V₂ of (1.63) or (1.64), respectively, and U = U₁ or U = U₂ of (1.65) or (1.66), respectively, we have

bd V = bd U = {y ∈ X | Φ(y) = sup Φ(C)}. □

Corollary 1.3. Every closed (respectively, open) half-space V (respectively, U) quasi-supporting C and not containing C (respectively, int C) can be written in the form (1.63) (respectively, (1.65)), where Φ ∈ X*\{0}, sup Φ(C) ∈ R; conversely, every closed (respectively, open) half-space V (respectively, U) of the form (1.63) (respectively, (1.65)), where Φ ∈ X*\{0}, sup Φ(C) ∈ R, quasi-supports the set C and does not contain C (respectively, int C).

Proof. This follows from Lemma 1.7, since for all Φ ∈ X*\{0} we have C ⊆ V₂ and int C ⊆ U₂ (by Lemma 1.8 below). □

Lemma 1.5 implies the following useful formula for the distance to a closed half-space, respectively an open half-space:

Corollary 1.4. Let Φ ∈ X*\{0}, d ∈ R, and

V_{Φ,d} := {y ∈ X | Φ(y) ≥ d},  U_{Φ,d} := {y ∈ X | Φ(y) > d}.  (1.67)
Then, for any x₀ ∉ V_{Φ,d}, we have

dist(x₀, V_{Φ,d}) = dist(x₀, U_{Φ,d}) = (d − Φ(x₀)) / ‖Φ‖.  (1.68)

Proof. By H_{Φ,d} ⊆ V_{Φ,d} and Lemma 1.5, we have

dist(x₀, V_{Φ,d}) = dist(x₀, H_{Φ,d}) = |Φ(x₀) − d| / ‖Φ‖.  (1.69)

But since x₀ ∉ V_{Φ,d}, we have Φ(x₀) < d, whence by (1.69), we obtain (1.68). □
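Formula (1.68) is easy to check numerically. The sketch below (with illustrative numbers of my own, not from the text) compares the distance predicted by (1.68) in R² with the distance to the orthogonal projection of x₀ onto the bounding hyperplane {y | Φ(y) = d}:

```python
import numpy as np

# Illustrative data (my own choices): Phi(y) = 3*y1 + 4*y2, so ||Phi|| = 5.
phi = np.array([3.0, 4.0])
d = 10.0
x0 = np.array([0.0, 0.0])         # Phi(x0) = 0 < d, so x0 lies outside V_{Phi,d}

# Distance predicted by (1.68): (d - Phi(x0)) / ||Phi||.
dist_formula = (d - phi @ x0) / np.linalg.norm(phi)

# Independent check: the nearest point of V_{Phi,d} to x0 is the orthogonal
# projection of x0 onto the bounding hyperplane {y : Phi(y) = d}.
proj = x0 + (d - phi @ x0) / (phi @ phi) * phi
dist_proj = np.linalg.norm(proj - x0)
```

Here the projection lands exactly on the hyperplane, and both computed distances agree with (1.68).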
Remark 1.5. If d = +∞, then for the sets defined by (1.29) and (1.67) we have H_{Φ,d} = V_{Φ,d} = U_{Φ,d} = ∅. Hence, formulas (1.54) and (1.68) remain valid also for d = +∞, by the convention (1.51).

The following result was used in the proof of Corollary 1.3:

Lemma 1.8. Let C be a nonempty open subset of a locally convex space X and Φ ∈ X*\{0}. Then

inf Φ(C) < Φ(c) < sup Φ(C)   (c ∈ C).  (1.70)
Proof. If sup Φ(C) = +∞, then, since Φ(c) ∈ R (c ∈ C), the second inequality of (1.70) is obvious. Assume now that sup Φ(C) < +∞, and suppose that there exists x₀ ∈ C with Φ(x₀) = sup Φ(C). Choose u ∈ X with Φ(u) > 0. Since C is open, we have x_ε := x₀ + εu ∈ C for some ε > 0, whence

Φ(x_ε) = Φ(x₀) + εΦ(u) > sup Φ(C),  (1.71)

in contradiction to x_ε ∈ C. This proves the second inequality of (1.70). The proof of the first inequality is similar. □

Remark 1.6. (a) Lemma 1.8 admits (by Lemma 1.4) the following geometric interpretation: if C is a nonempty open subset of a locally convex space X, then C has no support hyperplane.
(b) Actually, as shown by the above proof, one can replace in Lemma 1.8 "open" by "linearly open," i.e., such that C = core C, where

core C := {c ∈ C | ∀x ∈ X, ∃ε > 0, ∀η ∈ [−ε, +ε], ηx + (1 − η)c ∈ C}.  (1.72)

We shall also use the following lemma.

Lemma 1.9. Let X be a locally convex space, f : X → R̄ a convex function (see (1.93) below), and G a subset of X satisfying the "Slater condition"

inf f(X) < sup f(G) < +∞.  (1.73)
Furthermore, let us consider the sets

A := A_{sup f(G)}(f) = {y ∈ X | f(y) < sup f(G)},  (1.74)
S := S_{sup f(G)}(f) = {y ∈ X | f(y) ≤ sup f(G)}.  (1.75)

Then A ≠ ∅ and

S ⊆ cl A.  (1.76)
Proof. Note first that by (1.73), we have A ≠ ∅. Let x ∈ S be arbitrary. Taking any x₀ ∈ A (by A ≠ ∅), let

x_n := (1/n) x₀ + (1 − 1/n) x   (n = 1, 2, ...).  (1.77)

Then, since f is convex and x₀ ∈ A, x ∈ S, we have

f(x_n) ≤ (1/n) f(x₀) + (1 − 1/n) f(x) < sup f(G),

so x_n ∈ A (n = 1, 2, ...). Also, clearly, x_n → x, which, since x ∈ S was arbitrary, proves (1.76). □
Remark 1.7. (a) For any lower semicontinuous function f we have

cl A ⊆ S,  (1.78)

and hence, by Lemma 1.9, if f is convex and lower semicontinuous and satisfies (1.73), then

S = cl A.  (1.79)

Indeed, if f is lower semicontinuous, then S is closed and A ⊆ S, whence cl A ⊆ S.
(b) If f is convex and upper semicontinuous and satisfies (1.73), then

int S = A.  (1.80)
Indeed, since f is upper semicontinuous, A is open, so we have the inclusion ⊇ in (1.80). In the opposite direction, let x ∈ int S and, by (1.73), let x₀ ∈ A, so f(x₀) < sup f(G). We may assume that x ≠ x₀ (since otherwise x ∈ A and we are done). Let

y_n := −(1/n) x₀ + (1 + 1/n) x   (n = 1, 2, ...).  (1.81)

Then, since x ∈ int S and int S is open, for sufficiently large n we have y_n ∈ int S ⊆ S, so f(y_n) ≤ sup f(G). But by (1.81), we have x = (n/(n+1)) y_n + (1/(n+1)) x₀, whence, since f is convex, we obtain

f(x) ≤ (n/(n+1)) f(y_n) + (1/(n+1)) f(x₀) < sup f(G),

so x ∈ A, which proves the inclusion ⊆ in (1.80), and hence the equality (1.80).

The polar set of a set C ⊆ X is the subset of X* defined by

C° := {Φ ∈ X* | Φ(c) ≤ 1 (c ∈ C)},  (1.82)
and the bipolar of C is C°° := (C°)°. The classical "bipolar theorem" states that, for any subset C of a locally convex space X, we have C°° = cl co (C ∪ {0}); hence, a set C containing 0 is closed and convex if and only if C°° = C.

Since in optimization theory it is useful to work with functions having values in the extended real line R̄ = [−∞, +∞], or in the extended positive half-line R̄₊ = [0, +∞], it is necessary to give a precise meaning to expressions like ∞ − ∞ and 0 × ∞. We recall (see, e.g., Moreau [164]) that the usual addition + on R = (−∞, +∞) admits two natural extensions to R̄, the upper addition ∔ and the lower addition ⨥, defined by

a ∔ b = a ⨥ b = a + b  if either R ∩ {a, b} ≠ ∅ or a = b = ±∞,  (1.83)
a ∔ b = +∞, a ⨥ b = −∞  if a = −b = ±∞.  (1.84)

We shall use the notation

∑_{i=1}^{m} a_i = a₁ ∔ ⋯ ∔ a_m,  (1.85)

and, as usual, if R ∩ {a, b} ≠ ∅, we shall denote ∔ and ⨥ simply by +. According to the above, subtraction − on R̄ will mean either ∔(−·) or ⨥(−·). We shall use the well-known calculus rules for ∔ and ⨥ on R̄ developed by Moreau; the proofs can be found, e.g., in [164]. For example (see, e.g., [164, p. 115, Proposition], or [254, Lemma 8.3]),

−(a ∔ b) = (−a) ⨥ (−b)   (a, b ∈ R̄).  (1.86)

It is also well known (see, e.g., [164, p. 119, Proposition], or [254, Lemma 8.2]) that

a ∔ b ≥ c ⇔ a ≥ c ⨥ (−b)   (a, b, c ∈ R̄);  (1.87)

hence, in particular (see, e.g., Moreau [164, p. 120, Corollary]),

a ∔ b ≥ 0 ⇔ a ≥ −b   (a, b ∈ R̄).  (1.88)

Let us also mention the following relation between the upper and lower addition and the order on R̄ (see, e.g., [164, p. 119, Lemma]), which we shall use in later chapters:

a ∔ (b ⨥ c) ≥ (a ∔ b) ⨥ c   (a, b, c ∈ R̄).  (1.89)

It is well known, too (see, e.g., Moreau [164, p. 122, Corollary] or [254, Lemma 8.3]), that for any set X, any function f : X → R̄, and any a ∈ R̄ we have

inf_{x∈X} (f(x) ∔ a) = inf f(X) ∔ a,  (1.90)
sup_{x∈X} (f(x) ⨥ a) = sup f(X) ⨥ a.  (1.91)
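Moreau's upper and lower additions can be made concrete in a few lines of code. The sketch below is my own illustration (not from the text), using IEEE floating-point infinities for ±∞; the two operations differ only in how they resolve the conflicting case (+∞) + (−∞):

```python
import math

INF = math.inf

def upper_add(a, b):
    """Moreau's upper addition: resolves a = -b = +-inf to +inf, cf. (1.83)-(1.84)."""
    if a == -b and math.isinf(a):
        return INF
    return a + b

def lower_add(a, b):
    """Moreau's lower addition: resolves a = -b = +-inf to -inf."""
    if a == -b and math.isinf(a):
        return -INF
    return a + b
```

With these definitions one can verify the calculus rules mechanically, e.g. that −(a ∔ b) = (−a) ⨥ (−b) and that a ∔ b ≥ 0 exactly when a ≥ −b, over all sign patterns of infinities.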
We shall also use the extension to the cases a = 0 and b ∈ {−∞, +∞}, respectively a ∈ {−∞, +∞} and b = 0, of the usual multiplication ×, defined by the conventions

0 × (+∞) = (+∞) × 0 = +∞,  0 × (−∞) = (−∞) × 0 = 0.  (1.92)
We recall that if X is a linear space, a function f : X → R̄ is said to be
(a) convex if

f(λx₁ + (1 − λ)x₂) ≤ λf(x₁) + (1 − λ)f(x₂)   (x₁, x₂ ∈ X, 0 < λ < 1);  (1.93)

(b) concave if the function −f is convex;
(c) proper if f is not identically +∞, and f(x) > −∞ for all x ∈ X;
(d) sublinear if it is convex and positively homogeneous, i.e.,

f(ax) = af(x)   (x ∈ X, a ∈ R, a > 0).  (1.94)
It is well known and easy to show that a function f : X → R̄ is convex if and only if the set epi f ⊆ X × R (defined by (1.21)) is convex.

Conjugate functions will later be a basic tool for defining dual optimization problems. Given a locally convex space X, the (Fenchel) conjugate function of a function f : X → R̄ is the function f* : X* → R̄ defined by

f*(Φ) = sup_{x∈X} {Φ(x) − f(x)}   (Φ ∈ X*).  (1.95)
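As a numerical illustration (my own, not from the text), in X = R the conjugate (1.95) can be approximated by maximizing over a fine grid; for f(x) = x²/2 one recovers the classical self-conjugacy f*(Φ) = Φ²/2:

```python
import numpy as np

# Grid approximation of f*(phi) = sup_x {phi*x - f(x)} for f(x) = x^2/2.
xs = np.linspace(-10.0, 10.0, 200001)
f = 0.5 * xs**2

def conjugate(phi):
    # Maximize the affine-minus-f expression over the grid.
    return np.max(phi * xs - f)

approx = conjugate(3.0)   # exact value would be 3.0**2 / 2 = 4.5
```

The grid maximum sits at x = Φ, so the approximation is accurate as long as Φ keeps the maximizer inside the grid; the Fenchel inequality Φ(x) ≤ f(x) ∔ f*(Φ) can also be spot-checked this way.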
The function f* is also called the convex conjugate of f (since f* is a convex function, for any f : X → R̄), while the concave conjugate f^⊙ : X* → R̄ of f is defined by

f^⊙(Φ) = inf_{x∈X} {Φ(x) − f(x)}   (Φ ∈ X*);  (1.96)

however, in the sequel we shall consider mainly the convex conjugate f*, and therefore we shall omit the adjective "convex." The biconjugate of any function f : X → R̄ is the function f** : X → R̄ defined by

f**(x) = sup_{Φ∈X*} {Φ(x) − f*(Φ)} = sup_{Φ∈X*} {Φ(x) − sup_{y∈X} {Φ(y) − f(y)}}
       = sup_{Φ∈X*} inf_{y∈X} {f(y) − Φ(y) + Φ(x)}   (x ∈ X).  (1.97)
When X* is endowed with the weak* topology σ(X*, X), its conjugate space coincides with X, and hence f** = (f*)*. By (1.97), we have

f** ≤ f   (f ∈ R̄^X);  (1.98)
also, f = f** if and only if

f(z₀) = sup_{Φ∈X*} inf_{y∈X} {f(y) − Φ(y) + Φ(z₀)}   (z₀ ∈ X).  (1.99)
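A grid-based sketch (my own illustration) shows the biconjugate at work: for a nonconvex double-well function, f** stays below f, as in (1.98), and approximates the lower semicontinuous convex hull of f given by the Fenchel-Moreau theorem recalled below:

```python
import numpy as np

# Double well f(x) = (x^2 - 1)^2 on a grid; its lsc convex hull vanishes on [-1, 1].
xs = np.linspace(-2.0, 2.0, 401)
f = (xs**2 - 1.0)**2

# Conjugate (1.95) and biconjugate (1.97), both restricted to grids.
phis = np.linspace(-40.0, 40.0, 801)
fstar = np.array([np.max(p * xs - f) for p in phis])
fstarstar = np.array([np.max(phis * x - fstar) for x in xs])
```

On the grid points one gets f** ≤ f exactly (each grid conjugate already dominates the corresponding affine minorants), and f**(0) = 0 since the chord between the two wells at x = ±1 is flat.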
For any function f : X → R̄ on a locally convex space X we shall denote by f̄_co the lower semicontinuous convex hull of f, i.e., the greatest lower semicontinuous convex minorant of f. We have the following classical theorem of Fenchel–Moreau (see, e.g., Ekeland and Temam [54, Ch. 1, Section 4, Proposition 4.1], Ioffe and Tikhomirov [111, Ch. 3, Section 3.3, Theorem 1 and Corollary 1], or Barbu and Precupanu [14, Ch. 2, Corollaries 1.5 and 1.6]):

Theorem 1.4. Let X be a locally convex space. We have

f** = f̄_co   (f ∈ R̄^X).  (1.100)

Hence, for a function f : X → R̄, we have f = f** (or, equivalently, f = f̄_co) if and only if f is the supremum of a set of continuous affine functions.

For any function f : X → R̄, where X is a linear space, the ("effective") domain of f is the set

dom f := {x ∈ X | f(x) < +∞}.  (1.101)
We recall that if X is a locally convex space, then the support set supp f of any function f : X → R̄ is the subset of X* defined by

supp f := {Φ ∈ X* | Φ ≤ f}.  (1.102)

Also, the (X*, R)-support set Supp f of any function f : X → R̄ is the subset of X* × R defined by

Supp f := {(Φ, d) ∈ X* × R | Φ − d ≤ f}.  (1.103)

Clearly, we have the following relations between Supp f and supp f:

(Φ, d) ∈ Supp f ⇔ Φ ≤ f + d ⇔ Φ ∈ supp(f + d).  (1.104)
For any f, h : X → R̄ we have the equivalence

f ≤ h ⇔ epi h ⊆ epi f;  (1.105)

indeed, we have f ≤ h if and only if there exist no x ∈ X and d ∈ R such that f(x) > d ≥ h(x). Furthermore,

epi f* = Supp f;  (1.106)
indeed, for any Φ ∈ X* and d ∈ R we have the equivalences

f*(Φ) ≤ d ⇔ sup_{y∈X} {Φ(y) − f(y)} ≤ d ⇔ Φ − f ≤ d ⇔ Φ − d ≤ f.  (1.107)

Consequently, for any functions f, h : X → R̄ satisfying f = f**, h = h**, we have the equivalence

f ≤ h ⇔ Supp f ⊆ Supp h;  (1.108)

indeed, by f = f**, h = h**, (1.105), and (1.106),

f ≤ h ⇔ f* ≥ h* ⇔ epi f* ⊆ epi h* ⇔ Supp f ⊆ Supp h.

By (1.95), we have the so-called "Fenchel inequality"

Φ(x) ≤ f(x) ∔ f*(Φ)   (x ∈ X, Φ ∈ X*).  (1.109)
If X is a locally convex space, the subdifferential of a function f : X → R̄ at a point z₀ ∈ X is the subset ∂f(z₀) of X* defined by

∂f(z₀) := {Φ ∈ X* | Φ(x) − Φ(z₀) + f(z₀) ≤ f(x)  (x ∈ X)}.  (1.110)

We have (see, e.g., Ekeland and Temam [54, Ch. 1, formula (5.3)]) the implication

∂f(z₀) ≠ ∅ ⇒ f(z₀) = f**(z₀).  (1.111)
If f(z₀) ∈ R, then by (1.110) and (1.95), we have Φ₀ ∈ ∂f(z₀) if and only if the equality

Φ₀(z₀) = f(z₀) + f*(Φ₀)  (1.112)

holds. Using (1.112), one deduces easily (see, e.g., Ekeland and Temam [54, Ch. 1, Corollary 5.2]) that for any function f : X → R̄ we have the implication

Φ₀ ∈ ∂f(z₀) ⇒ z₀ ∈ ∂f*(Φ₀),  (1.113)

and if f(z₀) = f**(z₀) (in particular, if ∂f(z₀) ≠ ∅), then

Φ₀ ∈ ∂f(z₀) ⇔ z₀ ∈ ∂f*(Φ₀).  (1.114)
The subdifferential at a point z₀ ∈ X with f(z₀) ∈ R can also be expressed as

∂f(z₀) = {Φ ∈ X* | f(z₀) − Φ(z₀) = min_{x∈X} {f(x) − Φ(x)}};  (1.115)

indeed,

Φ(x) − Φ(z₀) ≤ f(x) − f(z₀) (x ∈ X) ⇔ f(z₀) − Φ(z₀) ≤ f(x) − Φ(x) (x ∈ X).
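The characterization (1.115) gives a direct numerical membership test for subgradients. The sketch below (my own illustration, X = R, grid-based) tests Φ ∈ ∂f(0) for f = |·|, whose subdifferential at 0 is the interval [−1, 1]:

```python
import numpy as np

# Test membership in the subdifferential of f = |.| at z0 = 0, using (1.115):
# Phi is a subgradient at 0 iff f(0) - Phi(0) = min_x {f(x) - Phi(x)} = 0.
xs = np.linspace(-5.0, 5.0, 1001)
f = np.abs(xs)

def is_subgradient_at_0(phi, tol=1e-12):
    return abs(np.min(f - phi * xs)) <= tol
```

For |Φ| ≤ 1 the minimum of |x| − Φx is 0 (attained at x = 0), while for |Φ| > 1 the difference runs off to negative values along one of the rays, so the test fails.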
A function f : X → R̄ is said to be subdifferentiable on a subset A of X if f(A) ⊆ R and ∂f(z₀) ≠ ∅ for each z₀ ∈ A. As an example of subdifferentials, let us note that if f : X → R̄ is any function satisfying f(0) = 0, then, by (1.110), we have

∂f(0) = {Φ ∈ X* | Φ ≤ f} = supp f.  (1.116)

In particular, if X is a normed linear space and

f(x) = ‖x‖   (x ∈ X),  (1.117)

then

∂f(0) = B_{X*},  (1.118)
where B_{X*} := {Φ ∈ X* | ‖Φ‖ ≤ 1} is the unit ball of X*.

Remark 1.8. Obviously, the supremum in (1.99) is attained for some Φ₀ ∈ X* if and only if Φ₀ ∈ ∂f(z₀). Hence, in general, the supremum in (1.99) need not be attained (e.g., take any proper lower semicontinuous convex function f such that ∂f(z₀) = ∅ for some z₀ ∈ X), but if f is convex and continuous, then it is attained for some Φ₀ ∈ X* (e.g., by Theorem 1.13 below, applied to a singleton G = {z₀}).

We have the following classical theorem of Moreau–Rockafellar (see, e.g., Holmes [106, p. 25]):

Theorem 1.5. If X is a locally convex space and f, h : X → R̄ are convex functions such that one of them is continuous at some point of dom f ∩ dom h, then

∂(f + h)(x₀) = ∂f(x₀) + ∂h(x₀)   (x₀ ∈ X).  (1.119)
Remark 1.9. It is easy to see that here the + signs are just the usual sums.

We recall that if X is a linear space and f : X → R̄ is a convex function, then for any x₀ ∈ X with f(x₀) ∈ R and any x ∈ X, the limit

f′(x₀; x) := lim_{t↓0} (f(x₀ + tx) − f(x₀)) / t  (1.120)

exists in R̄, and it is called the directional derivative of f at x₀ in the direction x. By a theorem of Moreau and Pshenichnyi (see, e.g., Holmes [106, p. 27] or Laurent [129, Theorem 6.4.8]), if X is a locally convex space and f : X → R̄ is a convex function that is finite and continuous at x₀, then

f′(x₀; x) = max_{Φ∈∂f(x₀)} Φ(x)   (x ∈ X).  (1.121)
If C is any subset of a set X, the indicator function χ_C of C is defined by

χ_C(x) := 0 (x ∈ C),  χ_C(x) := +∞ (x ∈ X\C).  (1.122)

The normal cone to a subset C of a locally convex space X, at a point c₀ ∈ C, is the subset of the conjugate space X* defined by

N(C; c₀) = {Φ ∈ X* | Φ(c₀) = max Φ(C)}.  (1.123)
Note that always 0 ∈ N(C; c₀), so N(C; c₀) ≠ ∅. As another example of subdifferentials, let us mention that for any convex subset C of a locally convex space X we have

∂χ_C(c₀) = N(C; c₀)   (c₀ ∈ C).  (1.124)
If c₀ ∉ C, then ∂χ_C(c₀) = ∅; more generally, for any proper function f : X → R̄, if f(z₀) = +∞, then by (1.110), ∂f(z₀) = ∅. The extended normal cone to a set C at a point x₀ ∈ X is the subset of X* defined by

N̄(C; x₀) = {Φ ∈ X* | Φ(x₀) = max Φ(C)}.  (1.125)

In particular, for x₀ ∈ C we have, clearly, N̄(C; x₀) = N(C; x₀). Let us recall how the normal cones to the level sets S_{f(x₀)}(f) of (1.22) can be expressed with the aid of subdifferentials.

Theorem 1.6 (see, e.g., Ioffe and Tikhomirov [111, p. 217, Proposition 2]). Let X be a locally convex space, and f : X → R̄ a proper convex function, continuous at a point x₀ ∈ X, such that inf f(X) < f(x₀) < +∞. Then f is subdifferentiable at x₀ and

N(S_{f(x₀)}(f); x₀) = ∪_{λ≥0} λ ∂f(x₀).  (1.126)
For any ε > 0, the set of all ε-normal directions, or briefly, the ε-normal set, to a set C at a point c₀ ∈ C is the subset of X* defined by

N_ε(C; c₀) = {Φ ∈ X* | Φ(c₀) ≥ sup Φ(C) − ε}.  (1.127)

Obviously,

∩_{ε>0} N_ε(C; c₀) = N(C; c₀).  (1.128)
For any ε > 0, the set of all extended ε-normal directions, or briefly, the extended ε-normal set, to a set C at a point x₀ ∈ X is defined by

N̄_ε(C; x₀) = {Φ ∈ X* | Φ(x₀) ≥ sup Φ(C) − ε}.  (1.129)

In particular, for x₀ ∈ C we have, clearly, N̄_ε(C; x₀) = N_ε(C; x₀). For any ε > 0, the ε-subdifferential of a function f : X → R̄ at a point z₀ ∈ X with f(z₀) ∈ R is the subset ∂_ε f(z₀) of X* defined by

∂_ε f(z₀) := {Φ ∈ X* | Φ(x) − Φ(z₀) ≤ f(x) − f(z₀) + ε  (x ∈ X)},  (1.130)
which is always nonempty. Clearly,

∩_{ε>0} ∂_ε f(z₀) = ∂_0 f(z₀) = ∂f(z₀).  (1.131)
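Definition (1.130) is also easy to test numerically. In the sketch below (an illustration of my own), f(x) = x² and z₀ = 0, where a direct computation gives ∂_ε f(0) = [−2√ε, 2√ε]:

```python
import numpy as np

# eps-subdifferential test at z0 = 0 for f(x) = x^2, using (1.130):
# Phi belongs to it iff Phi*x <= x^2 + eps for all x, i.e. |Phi| <= 2*sqrt(eps).
xs = np.linspace(-50.0, 50.0, 100001)
f = xs**2
eps = 0.25   # then the eps-subdifferential at 0 should be [-1, 1]

def in_eps_subdiff_at_0(phi, tol=1e-9):
    return bool(np.all(phi * xs <= f + eps + tol))
```

Unlike ∂f(0) = {0}, the ε-subdifferential is a whole interval even at a smooth minimum, which is exactly what makes it useful in the ε-normal set formula of Theorem 1.7 below.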
For the ε-normal sets to S_{f(x₀)}(f), we have the following theorem:

Theorem 1.7 (see Hiriart-Urruty and Lemaréchal [104, Ch. XI, Corollary 3.6.2]). Let X = Rⁿ, f : X → R a finite convex function, x₀ ∈ X such that inf f(X) < f(x₀), and ε > 0. Then

N_ε(S_{f(x₀)}(f); x₀) = ∪_{λ>0} λ ∂_{ε/λ} f(x₀).  (1.132)
We recall that if X is a linear space, a function f : X → R̄ is said to be quasi-convex if

f(λx₁ + (1 − λ)x₂) ≤ max{f(x₁), f(x₂)}   (x₁, x₂ ∈ X, 0 < λ < 1);  (1.133)
it is well known and easy to see that this happens if and only if all level sets S_d(f) (d ∈ R) of (1.22) are convex or, equivalently, all level sets A_d(f) of (1.23) are convex. Clearly, every convex function is quasi-convex, but the converse is not true. A function f : X → R̄ is said to be quasi-concave if the function −f is quasi-convex.

For any function f : X → R̄ on a linear space X we shall denote by f_q the quasi-convex hull of f, that is, the greatest quasi-convex minorant of f (i.e., the greatest quasi-convex function majorized by f). When X is a locally convex space, a function f : X → R̄ is quasi-convex and lower semicontinuous if and only if all level sets S_d(f) (d ∈ R) are closed and convex. For any function f : X → R̄ on a locally convex space X we shall denote by f̄_q the lower semicontinuous quasi-convex hull of f, i.e., the greatest lower semicontinuous quasi-convex minorant of f. We recall that for any function f : X → R̄ we have (e.g., by (1.153) below, applied to the polarity of (1.189) below)

f̄_q(x) = inf_{d∈R, x ∈ cl co S_d(f)} d = inf_{d∈R, x ∈ cl co A_d(f)} d
       = sup_{Φ∈X*} sup_{d∈R, Φ(x)>d} inf_{y∈X, Φ(y)>d} f(y)
       = sup_{Φ∈X*} inf_{y∈X, Φ(y)>Φ(x)−1} f(y)   (x ∈ X).  (1.134)
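In X = R the hull formula becomes completely concrete: up to positive scaling the only functionals are Φ = ±1, so the sup-inf expression reduces to running minima from the left and from the right. The sketch below (my own grid illustration, not from the text) computes the quasi-convex hull of a double-well function this way:

```python
import numpy as np

# 1-D quasi-convex hull on a grid:
#   f_q(x) = max( inf_{y <= x} f(y), inf_{y >= x} f(y) ),
# i.e. the max of a left running minimum and a right running minimum.
xs = np.linspace(-2.0, 2.0, 401)
f = (xs**2 - 1.0)**2                               # double well, not quasi-convex

left_min = np.minimum.accumulate(f)                # inf over y <= x
right_min = np.minimum.accumulate(f[::-1])[::-1]   # inf over y >= x
fq = np.maximum(left_min, right_min)
```

The result is the largest "decrease-then-increase" minorant of f: it lies below f, vanishes on the whole plateau between the two wells, and its level sets are intervals, as quasi-convexity requires.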
When X is a locally convex space, a function f : X → R̄ is said to be evenly quasi-convex if all level sets S_d(f) (d ∈ R) of (1.22) are evenly convex. For any function f : X → R̄ we shall denote by f_eq the evenly quasi-convex hull of f, i.e., the greatest evenly quasi-convex minorant of f. We recall that (e.g., by (1.153) below, applied to the polarity of (1.191) below) for any function f : X → R̄ we have

f_eq(x) = inf_{d∈R, x ∈ eco S_d(f)} d = inf_{d∈R, x ∈ eco A_d(f)} d
        = sup_{Φ∈X*} inf_{y∈X, Φ(y)≥Φ(x)} f(y)   (x ∈ X).  (1.135)
A function f : X → R̄ on a locally convex space X is said to be evenly quasi-coaffine if all level sets S_d(f) (d ∈ R) of (1.22) are evenly coaffine. For any function f : X → R̄ we shall denote by f_qca the evenly quasi-coaffine hull of f, i.e., the greatest evenly quasi-coaffine minorant of f. We recall that (e.g., by (1.153) below, applied to the polarity of (1.193) below) for any function f : X → R̄ we have

f_qca(x) = sup_{Φ∈X*} sup_{d∈R, Φ(x)=d} inf_{y∈X, Φ(y)=d} f(y) = sup_{Φ∈X*} inf_{y∈X, Φ(y)=Φ(x)} f(y)   (x ∈ X).  (1.136)
Finally, let us recall the following so-called minimax theorem (actually, an inf-sup theorem) of Sion–Kneser–Fan (Sion [261, Theorem 4.2']):

Theorem 1.8. Let M be a set, N a compact topological space, and f : M × N → R a finite-valued function that is "concavelike" on M (i.e., for every x₁, x₂ ∈ M and 0 < η < 1 there exists x ∈ M such that f(x, y) ≥ ηf(x₁, y) + (1 − η)f(x₂, y) for all y ∈ N), "convexlike" on N (i.e., for every y₁, y₂ ∈ N and 0 < η < 1 there exists y ∈ N such that f(x, y) ≤ ηf(x, y₁) + (1 − η)f(x, y₂) for all x ∈ M), and such that f(x, ·) is lower semicontinuous on N for each x ∈ M. Then

sup_{x∈M} inf_{y∈N} f(x, y) = inf_{y∈N} sup_{x∈M} f(x, y).  (1.137)
We shall also use the following "inf-sup theorem" of Moreau (see [162, Corollary]):

Theorem 1.9. Let C be a convex subset of a linear space E, and D a weakly compact convex subset of a locally convex space F. Furthermore, let φ : C × D → (−∞, +∞] be a mapping such that for each y ∈ D the function φ(·, y) is concave on C, and for each x ∈ C the function φ(x, ·) is convex and lower semicontinuous on D. Then

sup_{x∈C} min_{y∈D} φ(x, y) = min_{y∈D} sup_{x∈C} φ(x, y).  (1.138)
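Both inf-sup theorems can be sanity-checked on grids when a saddle point lies on the grid. The sketch below is my own toy example (not from the text): φ(x, y) = xy + y² is linear (hence concave) in x on C = [0, 1] and convex in y on D = [−1, 1], with a saddle at (0, 0):

```python
import numpy as np

# Grid evaluation of phi(x, y) = x*y + y^2, concave in x, convex in y.
X = np.linspace(0.0, 1.0, 11)
Y = np.linspace(-1.0, 1.0, 21)
P = X[:, None] * Y[None, :] + Y[None, :]**2   # P[i, j] = phi(X[i], Y[j])

sup_min = np.max(np.min(P, axis=1))   # sup over x of min over y
min_sup = np.min(np.max(P, axis=0))   # min over y of sup over x
```

Because the saddle point (0, 0) belongs to both grids, the two iterated optima coincide (both equal 0); for an arbitrary function with no saddle point on the grid, only the inequality sup min ≤ min sup would be guaranteed.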
1.2 Some preliminaries from abstract convex analysis

Let us present now some elements of abstract convex analysis, which will be used in the sequel; some additional elements will be given in Chapter 9. Proofs can be found in [254]. There are two very useful tools for defining the dual problem to a primal optimization problem. The first one is the concept of a polarity between families of subsets, which gives a connection between subsets of a set X and subsets of another set W. Namely, if X and W are two arbitrary sets (which we shall assume nonempty, without any special mention), a mapping Δ : 2^X → 2^W (where 2^X denotes the family of all subsets of X) is called a polarity if for any index set I we have
Δ(∪_{i∈I} C_i) = ∩_{i∈I} Δ(C_i)   ({C_i}_{i∈I} ⊆ 2^X),  (1.139)

or, equivalently,

Δ(C) = ∩_{c∈C} Δ({c})   (C ⊆ X),  (1.140)

with the usual conventions

∪_{i∈∅} C_i = ∅,  ∩_{i∈∅} Δ(C_i) = W.  (1.141)
Remark 1.10. (a) In the above, for a function f : X → Y and a set C ⊆ X we have set, as usual, f(C) := {f(c) | c ∈ C} (thus, for example, in Lemma 1.8, inf Φ(C) = inf_{c∈C} Φ(c) and sup Φ(C) = sup_{c∈C} Φ(c)); this should lead to no confusion with the fact that for a polarity Δ : 2^X → 2^W and a set C ⊆ X, Δ(C) is the set ∩_{c∈C} Δ({c}) (by (1.140)), since Δ is defined only at subsets of X, not at elements c ∈ X.
(b) In our previous papers, as well as in [254], we have used (following Evers and van Maaren [65]) the term "duality" instead of polarity. However, here we shall adopt the term "polarity" (which is also used by several authors; see, e.g., Pickert [179]), in order to avoid overlapping with subsequent terms like "theorem of weak duality," "theorem of strong duality," etc.

There are many natural examples of polarities. For example, if X is a locally convex space and W = X*, the conjugate space of X, then the mapping Δ : 2^X → 2^{X*} defined by

Δ(C) = C°   (C ⊆ X),  (1.142)
with C° of (1.82), is a polarity. Clearly, every polarity Δ is antitone (i.e., C₁ ⊆ C₂ implies Δ(C₂) ⊆ Δ(C₁)). The dual of a polarity Δ, i.e., the mapping Δ′ : 2^W → 2^X defined by

Δ′(S) := {x ∈ X | S ⊆ Δ({x})}   (S ⊆ W),  (1.143)

is again a polarity, and we have the equivalence

S ⊆ Δ(C) ⇔ C ⊆ Δ′(S)   (C ⊆ X, S ⊆ W),  (1.144)

whence Δ″ := (Δ′)′ = Δ. For any C ⊆ X, the set

Δ′Δ(C) := Δ′(Δ(C)) ⊆ X  (1.145)

is called the Δ′Δ-convex hull of C. The mapping Δ′Δ : 2^X → 2^X is a "hull operator," i.e., for any set C ⊆ X we have

C ⊆ Δ′Δ(C),  (1.146)
Δ′ΔΔ′Δ(C) = Δ′Δ(C),  (1.147)
C ⊆ C′ ⇒ Δ′Δ(C) ⊆ Δ′Δ(C′).  (1.148)
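Properties (1.144) and (1.146)-(1.148) hold for any polarity induced by a relation between two finite sets, which makes them easy to verify by brute force. The toy model below is my own (the sets X, W and the relation R are arbitrary choices, not from the text):

```python
from itertools import chain, combinations

# Any relation R between X and W induces a polarity
#   Delta(C)  = {w in W : (c, w) in R for all c in C},
# whose dual is
#   Delta'(S) = {x in X : (x, w) in R for all w in S}.
X = {0, 1, 2, 3}
W = {'a', 'b', 'c'}
R = {(0, 'a'), (0, 'b'), (1, 'a'), (2, 'b'), (2, 'c'), (3, 'c')}

def delta(C):
    return frozenset(w for w in W if all((c, w) in R for c in C))

def delta_dual(S):
    return frozenset(x for x in X if all((x, w) in R for w in S))

def hull(C):
    # The Delta'Delta-convex hull of C, cf. (1.145).
    return delta_dual(delta(C))

def subsets(s):
    s = list(s)
    return [frozenset(t)
            for t in chain.from_iterable(combinations(s, r) for r in range(len(s) + 1))]
```

Enumerating all subsets confirms the Galois-connection equivalence (1.144) and the three hull-operator laws; in this finite setting "Δ′Δ-convex" simply means being an intersection of the columns Δ′({w}).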
A set C ⊆ X is called Δ′Δ-convex if C = Δ′Δ(C). This happens if and only if for each x ∈ X\C there exists w = w_x ∈ W such that

C ⊆ Δ′({w}),  x ∈ X\Δ′({w}),  (1.149)

that is, C and each outside point x ∈ X\C can be "separated" by a set of the form Δ′({w}), where w ∈ W, or equivalently, C is an intersection of a family of subsets of X of the form Δ′({w}), where w ∈ W. By (1.143) applied to S = {w}, we have

Δ′({w}) = {x ∈ X | w ∈ Δ({x})}   (w ∈ W),  (1.150)

so C ⊆ X is Δ′Δ-convex if and only if for each x ∈ X\C there exists w = w_x ∈ W such that

w ∈ ∩_{c∈C} Δ({c}) = Δ(C),  w ∉ Δ({x}).  (1.151)
Remark 1.11. Instead of the term "Δ′Δ-convex" of the language of "abstract convex analysis," sometimes the language of general topology has also been used in the literature. Namely, (1.146)–(1.148) mean that Δ′Δ is a "Moore–Smith closure operator," and thus the Δ′Δ-convexity of C can also be expressed by saying that C is "Moore–Smith closed" for the operator Δ′Δ. However, in the sequel we shall use only the language of abstract convex analysis, since it will be convenient for applications, e.g., in dealing with "Δ′Δ-quasi-convex" functions.

A function f : X → R̄ is called Δ′Δ-quasi-convex if all level sets S_d(f) (d ∈ R) of (1.22) are Δ′Δ-convex, that is, if for each d ∈ R and x ∈ X\S_d(f) there exists w = w_{d,x} ∈ W such that

S_d(f) ⊆ Δ′({w}),  x ∈ X\Δ′({w}).  (1.152)

For any function f : X → R̄ we shall denote by f_{q(Δ′Δ)} the Δ′Δ-quasi-convex hull (i.e., the greatest Δ′Δ-quasi-convex minorant) of f. We have (see, e.g., [254, p. 301, formulas (8.265) and (8.262)])

f_{q(Δ′Δ)}(x) = inf_{d∈R, x ∈ Δ′Δ(S_d(f))} d = sup_{w∈W, x ∈ X\Δ′({w})} inf_{y ∈ X\Δ′({w})} f(y)   (x ∈ X).  (1.153)
In the sequel we shall be interested in polarities for the case in which X is a locally convex space and W = X*\{0} or W = (X*\{0}) × R. For the first case, let G be a subset of X. We mention now some special polarities Δ^i = Δ_G^i : 2^X → 2^{X*\{0}} (i = 1, 2, 3, 4), depending on G.
(1) Let us first consider the polarity Δ = Δ_G^1 : 2^X → 2^{X*\{0}} defined by

Δ_G^1(C) := {Φ ∈ X*\{0} | Φ(c) < sup Φ(G) (c ∈ C)}   (C ⊆ X).  (1.154)

For this polarity we have, by (1.150),

(Δ_G^1)′({Φ}) = {x ∈ X | Φ(x) < sup Φ(G)}   (Φ ∈ X*\{0}).  (1.155)
Lemma 1.10. (a) For any set G the polarity Δ = Δ_G^1 satisfies

Δ^1_{{g}}({g}) = ∅   (g ∈ G),  (1.156)
(X*\{0})\Δ_G^1(G) = {Φ ∈ X*\{0} | ∃g ∈ G, Φ(g) = sup Φ(G)}.  (1.157)

(b) The set G is (Δ_G^1)′Δ_G^1-convex if and only if for each x ∈ X\G there exists Φ = Φ_x ∈ X*\{0} such that

Φ(g) < sup Φ(G) ≤ Φ(x)   (g ∈ G).  (1.158)

Hence, if G is (Δ_G^1)′Δ_G^1-convex, then it is evenly convex.
(c) A function f : X → R̄ is (Δ_G^1)′Δ_G^1-quasi-convex if and only if for each d ∈ R and x ∈ X\S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} such that

Φ(y) < sup Φ(G) ≤ Φ(x)   (y ∈ S_d(f)).  (1.159)

Consequently, if f is (Δ_G^1)′Δ_G^1-quasi-convex, then it is evenly quasi-convex.
Consequently, if f is (A^jYA^-quasi-convex, then it is evenly quasi-convex. Proof (a) By (1.154), A|^j({g}) = {O e X*\{0}| <^(g) < <^(g)] = 0 (g e G). Also, by (1.154) we have C A ^ ( G ) = {O G X * \ { 0 } | 3g e G, 4)(g) > supO(G)}, whence (1.157). (b) G is (A^)^AJ^-convex if and only if for A = A^, (1.149) holds with C = G, w =
(C c X).
(1.160)
(O e X*\{0}).
(1.161)
For this polarity we have, by (1.150), (AlY(W)
= {xeX\ 0(x) < sup cI>(G)}
Lemma 1.11. (a) For any set G the polarity Δ = Δ_G^2 satisfies

Δ^2_{{g}}({g}) = X*\{0}   (g ∈ G),  (1.162)
Δ_G^2(G) = X*\{0},  (1.163)
(Δ_G^2)′Δ_G^2(G) = cl co G.  (1.164)

Consequently, G is (Δ_G^2)′Δ_G^2-convex if and only if it is closed and convex.
(b) A function f : X → R̄ is (Δ_G^2)′Δ_G^2-quasi-convex if and only if for each d ∈ R and x ∈ X\S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} such that

sup Φ(S_d(f)) ≤ sup Φ(G) < Φ(x).  (1.165)

Consequently, if f is (Δ_G^2)′Δ_G^2-quasi-convex, then it is lower semicontinuous and quasi-convex.
Proof. (a) By (1.160), we have Δ^2_{{g}}({g}) = {Φ ∈ X*\{0} | Φ(g) ≤ Φ(g)} = X*\{0} (g ∈ G) and Δ_G^2(G) = {Φ ∈ X*\{0} | sup Φ(G) ≤ sup Φ(G)} = X*\{0}. By (1.163) and the expression of cl co G given in [254], formula (2.131), we obtain

(Δ_G^2)′Δ_G^2(G) = (Δ_G^2)′(X*\{0}) = {x ∈ X | Φ(x) ≤ sup Φ(G) (Φ ∈ X*\{0})} = cl co G,

which proves (1.164). Consequently, G is (Δ_G^2)′Δ_G^2-convex if and only if G = (Δ_G^2)′Δ_G^2(G) = cl co G.
(b) A function f : X → R̄ is (Δ_G^2)′Δ_G^2-quasi-convex if and only if for each d ∈ R and x ∈ X\S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} satisfying (1.152) for Δ = Δ_G^2, that is, (1.165). Hence, if this condition is satisfied, then each level set S_d(f) (d ∈ R) is closed and convex, that is, f is lower semicontinuous and quasi-convex. □

(3) Let us consider now the polarity Δ = Δ_G^3 : 2^X → 2^{X*\{0}} defined by

Δ_G^3(C) := {Φ ∈ X*\{0} | sup Φ(G) ∉ Φ(C)}   (C ⊆ X).  (1.166)
For the polarity Δ = Δ_G^3 we have

(Δ_G^3)′({Φ}) = {x ∈ X | Φ(x) ≠ sup Φ(G)}   (Φ ∈ X*\{0}).  (1.167)
Lemma 1.12. (a) For any set G the polarity Δ = Δ_G^3 satisfies

Δ^3_{{g}}({g}) = ∅   (g ∈ G),  (1.168)
Δ_G^1(C) ⊆ Δ_G^3(C)   (C ⊆ X),  (1.169)
Δ_G^1(G) = Δ_G^3(G),  (1.170)
(Δ_G^1)′({Φ}) ⊆ (Δ_G^3)′({Φ})   (Φ ∈ X*\{0}).  (1.171)

(b) G is (Δ_G^3)′Δ_G^3-convex if and only if for each x ∈ X\G there exists Φ = Φ_x ∈ X*\{0} such that

Φ(g) < sup Φ(G) = Φ(x)   (g ∈ G).  (1.172)

Consequently, if G is (Δ_G^3)′Δ_G^3-convex, then it is (Δ_G^1)′Δ_G^1-convex.
(c) A function f : X → R̄ is (Δ_G^3)′Δ_G^3-quasi-convex if and only if for each d ∈ R and x ∈ X\S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} such that

Φ(x) = sup Φ(G) ∉ Φ(S_d(f)).  (1.173)

Consequently, if f is (Δ_G^3)′Δ_G^3-quasi-convex, then it is evenly quasi-coaffine.

Proof. (a) By (1.166), we have Δ^3_{{g}}({g}) = {Φ ∈ X*\{0} | Φ(g) ∉ {Φ(g)}} = ∅ (g ∈ G); the relations (1.169)–(1.171) follow directly from (1.154), (1.166), (1.155), and (1.167).
(b) By (1.167), condition (1.172) means that we have

G ⊆ (Δ_G^3)′({Φ}),  x ∈ X\(Δ_G^3)′({Φ}),

that is, for Δ = Δ_G^3 we have (1.149) with C = G, w = Φ, which yields the first assertion. Furthermore, by (1.171) and (1.170), we have

G ⊆ (Δ_G^1)′Δ_G^1(G) = (Δ_G^1)′Δ_G^3(G) ⊆ (Δ_G^3)′Δ_G^3(G),  (1.174)

whence the second assertion follows.
(c) A function f : X → R̄ is (Δ_G^3)′Δ_G^3-quasi-convex if and only if for each d ∈ R and x ∈ X\S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} satisfying (1.152) for Δ = Δ_G^3, w = Φ, that is, (1.173). Hence, by the definition of evenly quasi-coaffine functions, we obtain the second statement. □

Corollary 1.5. If a set G ⊆ X is (Δ_G^3)′Δ_G^3-convex and

Δ_G^3(G) = X*\{0},  (1.175)

then we have

G = {x ∈ X | Φ(x) < sup Φ(G) (Φ ∈ X*\{0})}.  (1.176)

Proof. By (1.166), we have (1.175) if and only if for each Φ ∈ X*\{0} we have Φ(g) < sup Φ(G) (g ∈ G), that is, if and only if

G ⊆ {x ∈ X | Φ(x) < sup Φ(G) (Φ ∈ X*\{0})}.  (1.177)

If also G is (Δ_G^3)′Δ_G^3-convex, then by Lemma 1.12(b), we have the opposite inclusion as well, and hence the equality (1.176). □

Remark 1.12. By (1.176), the set G is an intersection of open half-spaces; i.e., it is evenly convex.

Corollary 1.6. For a set G ⊆ X, let us consider the following statements:
1°. G is (Δ_G^3)′Δ_G^3-convex and

Δ_G^1(G) = X*\{0}.  (1.178)

2°. We have

(Δ_G^1)′(X*\{0}) = G.  (1.179)

3°. We have (1.176).
4°. G is (Δ_G^1)′Δ_G^1-convex, and we have (1.178).
Then

1° ⇒ 2° ⇔ 3° ⇔ 4°.  (1.180)
1.2 Some preliminaries from abstract convex analysis
Proof. 1° ⇒ 3°. By Δ^5_G(G) = Δ^4_G(G) (see (1.170)), formula (1.178) implies (1.175), which, by Corollary 1.5, implies (1.176).

2° ⇔ 3°. By (1.143) and (1.154), we have

(Δ^5_G)'(X*\{0}) = {x ∈ X | Φ(x) < sup Φ(G) (Φ ∈ X*\{0})},   (1.181)

whence the equivalence 2° ⇔ 3° follows.

3° ⇒ 4°. If (1.176) holds, then we have (1.177), and for each x ∈ ∁G there exists Φ = Φ_x ∈ X*\{0} satisfying (1.172); hence, by Lemma 1.12(b), G is (Δ^5_G)'Δ^5_G-convex, and by (1.177) and (1.170) we also obtain (1.178). □

(3) Let us consider now the polarity Δ = Δ^6_G : 2^X → 2^{X*\{0}} defined by

Δ^6_G(C) := {Φ ∈ X*\{0} | Φ(C) ⊆ Φ(G)}   (C ⊆ X).   (1.182)

For this polarity we have

(Δ^6_G)'({Φ}) = {x ∈ X | Φ(x) ∈ Φ(G)}   (Φ ∈ X*\{0}).   (1.183)
Lemma 1.13. (a) For any set G ⊆ X the polarity Δ = Δ^6_G satisfies

Δ^6_G({g}) = X*\{0}   (g ∈ G),   (1.184)

Δ^6_G(C) ⊆ Δ^5_G(C)   (C ⊆ X),   (1.185)

Δ^6_G(G) = X*\{0},   (1.186)

(Δ^6_G)'Δ^6_G(G) = {x ∈ X | Φ(x) ∈ Φ(G) (Φ ∈ X*\{0})}.   (1.187)

Consequently, the set G is (Δ^6_G)'Δ^6_G-convex if and only if it is evenly coaffine.

(b) A function f : X → R̄ is (Δ^6_G)'Δ^6_G-quasi-convex if and only if for each d ∈ R and x ∈ ∁S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} such that

Φ(S_d(f)) ⊆ Φ(G),   Φ(x) ∉ Φ(G).   (1.188)

Consequently, if f is (Δ^6_G)'Δ^6_G-quasi-convex, then it is evenly quasi-coaffine.

Proof. (a) By (1.182), we have Δ^6_G({g}) = {Φ ∈ X*\{0} | Φ(g) ∈ Φ(G)} = X*\{0} (g ∈ G) and Δ^6_G(G) = {Φ ∈ X*\{0} | Φ(G) ⊆ Φ(G)} = X*\{0}. Also, formula (1.185) is obvious from (1.182) and (1.166). Next, by (1.186) and (1.183), we obtain (1.187). Furthermore, by (1.187), G is (Δ^6_G)'Δ^6_G-convex if and only if for each x ∈ ∁G there exists Φ = Φ_x ∈ X*\{0} such that Φ(x) ∉ Φ(G), i.e., if and only if G is evenly coaffine.

(b) A function f : X → R̄ is (Δ^6_G)'Δ^6_G-quasi-convex if and only if for each d ∈ R and x ∈ ∁S_d(f) there exists Φ = Φ_{d,x} ∈ X*\{0} satisfying (1.152) for Δ = Δ^6_G, w = Φ, that is, (1.188). Hence, by the definition of evenly quasi-coaffine functions, we obtain the last statement. □
Finally, let us mention now some special polarities Δ : 2^X → 2^{(X*\{0})×R} that do not depend on a subset G of X.

(1) For the polarity Δ^{11} : 2^X → 2^{(X*\{0})×R} defined by

Δ^{11}(C) := {(Φ, d) ∈ (X*\{0}) × R | sup Φ(C) ≤ d}   (C ⊆ X),   (1.189)

we have

(Δ^{11})'Δ^{11}(C) = co C   (C ∈ 2^X),   f_{q((Δ^{11})'Δ^{11})} = f_q   (f ∈ R̄^X).   (1.190)

(2) For the polarity Δ^{12} : 2^X → 2^{(X*\{0})×R} defined by

Δ^{12}(C) := {(Φ, d) ∈ (X*\{0}) × R | Φ(c) < d (c ∈ C)}   (C ⊆ X),   (1.191)

we have

(Δ^{12})'Δ^{12}(C) = eco C   (C ∈ 2^X),   f_{q((Δ^{12})'Δ^{12})} = f_{eq}   (f ∈ R̄^X).   (1.192)

(3) For the polarity Δ^{13} : 2^X → 2^{(X*\{0})×R} defined by

Δ^{13}(C) := {(Φ, d) ∈ (X*\{0}) × R | Φ(c) ≠ d (c ∈ C)}   (C ⊆ X),   (1.193)

we have

(Δ^{13})'Δ^{13}(C) = eca C   (C ∈ 2^X),   f_{q((Δ^{13})'Δ^{13})} = f_{qca}   (f ∈ R̄^X).   (1.194)

The above expressions of co C, eco C, eca C and f_q, f_{eq}, f_{qca} with the aid of the polarities Δ^{11}, Δ^{12}, and Δ^{13} depend on two parameters, Φ ∈ X*\{0} and d ∈ R. However, the sets C ⊆ X with 0 ∈ C and the functions f : X → R̄ satisfying

f(0) = inf f(X\{0})   (1.195)

admit expressions of co C, eco C, and f_q, f_{eq} with the aid of simpler polarities, depending only on one parameter Φ ∈ X*\{0}.

(4) For the polarity Δ^{01} : 2^X → 2^{X*\{0}} defined by

Δ^{01}(C) := {Φ ∈ X*\{0} | sup Φ(C) ≤ 1}   (C ⊆ X),   (1.196)

we have

co C = (Δ^{01})'Δ^{01}(C)   (C ⊆ X, 0 ∈ C),   (1.197)

f_q(x) = f_{q((Δ^{01})'Δ^{01})}(x)   (f ∈ R̄^X, f(0) = inf f(X\{0}), x ∈ X\{0}).   (1.198)

Note that for any C ⊆ X the set Δ^{01}(C) ∪ {0} ⊆ X* (with Δ^{01} of (1.196)) is the usual polar C° of C (see (1.82)).
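The identity Δ^{01}(C) ∪ {0} = C° can be checked numerically; the following is a minimal sketch (not from the text) in which C is the square [−1, 1]² in R², functionals are represented by vectors, and sup Φ(C) is evaluated over the vertices of C, which suffices for linear functionals.

```python
import itertools

# Treat a linear functional Phi on R^2 as a vector p: Phi(x) = p . x.
def phi(p, x):
    return p[0] * x[0] + p[1] * x[1]

# Vertices of C = [-1, 1]^2; sup Phi(C) is attained at a vertex.
C_vertices = list(itertools.product([-1.0, 1.0], repeat=2))

def sup_over_C(p):
    return max(phi(p, v) for v in C_vertices)

# Delta^01(C) = {Phi != 0 | sup Phi(C) <= 1}; adding 0 gives the polar C°.
def in_polar(p):
    return sup_over_C(p) <= 1.0

# For the unit square, C° is the diamond {p : |p1| + |p2| <= 1}.
samples = [(0.5, 0.4), (0.5, 0.6), (1.0, 0.0), (0.7, -0.2), (0.8, 0.3)]
results = {p: in_polar(p) for p in samples}
```

For the square, sup Φ(C) = |p₁| + |p₂|, so the membership tests above recover the expected diamond shape of the polar.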
(5) For the polarity Δ^{02} : 2^X → 2^{X*\{0}} defined by

Δ^{02}(C) := {Φ ∈ X*\{0} | Φ(c) < 1 (c ∈ C)}   (C ⊆ X),   (1.199)

we have

eco C = (Δ^{02})'Δ^{02}(C)   (C ⊆ X, 0 ∈ C),   (1.200)

f_{eq}(x) = f_{q((Δ^{02})'Δ^{02})}(x)   (f ∈ R̄^X, f(0) = inf f(X\{0}), x ∈ X\{0}).   (1.201)
The second important tool for defining dual problems to a primal optimization problem, which gives a connection between functions on a set X and functions on another set W, is the following generalization of the conjugate (1.95): given two sets X and W and a ("coupling") function φ : X × W → R̄, the Fenchel–Moreau conjugate function of a function f : X → R̄ (with respect to φ) is the function f^{c(φ)} : W → R̄ defined by

f^{c(φ)}(w) := sup_{y∈X} {φ(y, w) ∔ −f(y)}   (w ∈ W),   (1.202)

where ∔ denotes the lower addition on R̄ (see (1.83), (1.84)).

Theorem 1.10. For a mapping c : f ∈ R̄^X → f^c ∈ R̄^W, where R̄^X denotes the set of all functions f : X → R̄, there exists a coupling function φ : X × W → R̄ such that f^c = f^{c(φ)} (of (1.202)) for all f ∈ R̄^X if and only if c satisfies the following two conditions: for any index set I (including the empty set ∅, with the usual conventions inf ∅ = +∞ and sup ∅ = −∞),

(inf_{i∈I} f_i)^c = sup_{i∈I} f_i^c   ({f_i}_{i∈I} ⊆ R̄^X),   (1.203)

(a ∔ f)^c = −a ∔ f^c   (f ∈ R̄^X, a ∈ R̄);   (1.204)

moreover, φ is uniquely determined by c.

Proof. See [237] or [254], Chapter 8. □
Any mapping c : f ∈ R̄^X → f^c ∈ R̄^W satisfying (1.203) and (1.204) is called ([237], [254]) a conjugation. In the sequel we shall use only the case W ⊆ R̄^X, i.e., where W is a set of functions w : X → R̄, and φ : X × W → R̄ is the "natural coupling function" associated with W, defined by

φ(x, w) := w(x)   (x ∈ X, w ∈ W),   (1.205)

which is apparently a particular case, but in fact, it turns out to be equivalent (from the point of view of conjugations): given two sets X and W ⊆ R̄^X, the Fenchel–Moreau conjugate function of a function f : X → R̄ (with respect to W) is the function f* : W → R̄ defined by
f*(w) := sup_{y∈X} {w(y) ∔ −f(y)}   (w ∈ W).   (1.206)
The Fenchel–Moreau biconjugate of a function f : X → R̄ (with respect to W) is the function f** : X → R̄ defined by

f**(x) := sup_{w∈W} {w(x) ∔ −f*(w)}   (x ∈ X).   (1.207)

By (1.207), (1.86), and (1.90), for any function f : X → R̄ we have

f**(x) = sup_{w∈W} {w(x) ∔ −f*(w)} = sup_{w∈W} {w(x) ∔ −sup_{y∈X} [w(y) ∔ −f(y)]}
= sup_{w∈W} {w(x) ∔ inf_{y∈X} −[w(y) ∔ −f(y)]}
= sup_{w∈W} inf_{y∈X} {[f(y) ∔ −w(y)] ∔ w(x)}   (x ∈ X),   (1.208)

whence

f** ≤ f   (f ∈ R̄^X).   (1.209)
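On finite samples, the conjugate (1.206) and the biconjugate (1.207) can be computed directly. The sketch below is illustrative only: the grids are arbitrary, W is taken to be finitely many linear functions w(x) = ax, and all values are finite, so the lower addition ∔ reduces to ordinary addition; it exhibits the inequality f** ≤ f of (1.209).

```python
# Fenchel-Moreau conjugation with respect to a finite set W of
# elementary functions w: X -> R, on a finite sample of X (a sketch;
# the grids and the choice W = linear functions are illustrative only).
X = [i / 10.0 for i in range(-30, 31)]          # sample of the real line
W = [lambda x, a=a: a * x for a in
     [i / 10.0 for i in range(-30, 31)]]        # w(x) = a*x, slopes a

f = lambda x: x * x                              # a convex objective

def conj(f):                                     # f*(w) = sup_y {w(y) - f(y)}
    return {i: max(w(y) - f(y) for y in X) for i, w in enumerate(W)}

def biconj(f):                                   # f**(x) = sup_w {w(x) - f*(w)}
    fstar = conj(f)
    return {x: max(W[i](x) - fstar[i] for i in range(len(W))) for x in X}

fss = biconj(f)
# The general inequality f** <= f of (1.209) holds at every sample point.
gap = max(fss[x] - f(x) for x in X)
```

Since f(x) = x² is already convex and the slope grid contains the supporting slopes, f** coincides with f at many grid points, while in general only f** ≤ f is guaranteed.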
Remark 1.13. (a) Actually, a more precise notation would have to specify also that f* and f** are understood with respect to the given W ⊆ R̄^X; however, the above notation will lead to no confusion, since W will always be clear from the context.

(b) Let us show the equivalence of the above two concepts. Clearly, in the particular case of W ⊆ R̄^X and φ of (1.205), formula (1.202) reduces to (1.206). In the converse direction, given any pair of sets (X, W) and any coupling function φ : X × W → R̄, for each w ∈ W one can define a function w̃ = w̃_φ : X → R̄ by

w̃(x) := φ(x, w)   (x ∈ X),   (1.210)

and hence a set W̃ = W̃_φ ⊆ R̄^X by

W̃ := {w̃ | w ∈ W} = {φ(·, w) | w ∈ W}.   (1.211)

The mapping w ∈ W → w̃ ∈ W̃ defined by (1.210) need not be one-to-one. Indeed, for example, if X is a locally convex space, W = X* × R, and φ : X × W → R̄ is the coupling function defined by

φ(x, (Φ, d)) := −χ_{{y∈X | Φ(y)≥d}}(x)   (x ∈ X, Φ ∈ X*, d ∈ R),   (1.212)

then, by (1.202),

f^{c(φ)}(Φ, d) = sup_{x∈X} {−χ_{{y∈X | Φ(y)≥d}}(x) ∔ −f(x)} = sup_{y∈X, Φ(y)≥d} (−f(y)) = −inf_{y∈X, Φ(y)≥d} f(y)   (Φ ∈ X*, d ∈ R),   (1.213)
which is (modulo an inessential additive term +d) the so-called quasi-conjugate of f, in the sense of Greenberg and Pierskalla [95], which plays an important role in duality for quasi-convex optimization; then, since

−χ_{{y∈X | μΦ(y)≥μd}} = −χ_{{y∈X | Φ(y)≥d}}   (μ > 0),

we have (μw)~ = w̃ for all w = (Φ, d) ∈ W = X* × R, μ > 0, so the mapping w → w̃ is not one-to-one. Nevertheless, for any coupling function φ : X × W → R̄ we have the implication

w₁, w₂ ∈ W, w̃₁ = w̃₂ ⇒ f^{c(φ)}(w₁) = f^{c(φ)}(w₂),   (1.214)

since

sup_{x∈X} {φ(x, w₁) ∔ −f(x)} = sup_{x∈X} {w̃₁(x) ∔ −f(x)} = sup_{x∈X} {w̃₂(x) ∔ −f(x)} = sup_{x∈X} {φ(x, w₂) ∔ −f(x)}.

Hence, one can uniquely define a conjugation f ∈ R̄^X → f* ∈ R̄^W̃ by

f*(w̃) := f^{c(φ)}(w)   (w ∈ W).   (1.215)

(c) For W = X* × R, it is convenient to denote the quasi-conjugate (1.213) of f, in the sense of Greenberg and Pierskalla, mentioned above, by f^q_d. The second quasi-conjugate of f is the function (f^q_d)'_d : X → R̄ defined [95] by

(f^q_d)'_d(x) := −inf_{Φ∈X*, Φ(x)≥d} f^q_d(Φ)   (x ∈ X),   (1.216)

and the normalized second quasi-conjugate of f is the function f^{qq} : X → R̄ defined [95] by

f^{qq} := sup_{d∈R} (f^q_d)'_d.   (1.217)

It is well known and easy to see that for any function f : X → R̄ we have

f*(Φ) = sup_{d∈R} {d + f^q_d(Φ)}   (Φ ∈ X*),   f ≥ f^{qq} ≥ f**,   (1.218)

where f*, f** are the Fenchel conjugates (1.95), (1.97). Corresponding to (1.100), we have

f^{qq} = f_{eq}   (f ∈ R̄^X),   (1.219)
with f_{eq} being the evenly quasi-convex hull (1.135) of f.

(d) There are also other "conjugates" of a similar form, useful for duality in convex and quasi-convex optimization, that are particular cases of the Fenchel–Moreau conjugates f^{c(φ)} (for suitable coupling functions φ), for example, the "pseudoconjugates" defined by

f^p_d(Φ) := −inf_{x∈X, Φ(x)=d} f(x)   (Φ ∈ X*, d ∈ R),   (1.220)

and the "semiconjugates" defined by

f^s_d(Φ) := −inf_{x∈X, Φ(x)>d−1} f(x)   (Φ ∈ X*, d ∈ R),   (1.221)

for which one introduces the second conjugates (f^p_d)'_d, (f^s_d)'_d and the normalized second conjugates f^{pp}, f^{ss}, similarly to (1.216) and (1.217), respectively (mutatis mutandis). We have

f^{ss} = f_q   (f ∈ R̄^X),   (1.222)
with f_q being the lower semicontinuous quasi-convex hull (1.134) of f.

Let us return now to the more general case in which X and W are two arbitrary sets. For any polarity Δ : 2^X → 2^W the conjugation of type Lau associated with Δ is the mapping L(Δ) : R̄^X → R̄^W defined by

f^{L(Δ)}(w) := −inf_{x∈∁Δ'({w})} f(x)   (f ∈ R̄^X, w ∈ W).   (1.223)

One can show (see [254, p. 279, Theorem 8.14]) that the mapping c = L(Δ) : R̄^X → R̄^W satisfies (1.203), (1.204) (i.e., it is a conjugation), and that the mapping c(φ) : R̄^X → R̄^W defined by (1.202) is a conjugation of type Lau if and only if φ takes only the values 0 or −∞, i.e., if and only if φ = −χ_C for some subset C of X × W. If X and W are two sets, C is a subset of X, and Δ : 2^X → 2^W is a polarity, then for the "representation function" p_C : X → {−∞, +∞} defined by

p_C(x) := −∞ if x ∈ C,   +∞ if x ∈ ∁C,   (1.224)

we have

(p_C)^{L(Δ)} = p_{Δ(C)}.   (1.225)

For any polarity Δ : 2^X → 2^W, the dual of L(Δ) : R̄^X → R̄^W is the mapping L(Δ)' : R̄^W → R̄^X defined by

g^{L(Δ)'}(x) := −inf_{w∈W, x∈∁Δ'({w})} g(w)   (g ∈ R̄^W, x ∈ X).   (1.226)
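The conjugation of type Lau (1.223) and its dual (1.226) can be illustrated on finite sets X and W, with the polarity generated by a relation ρ ⊆ X × W so that Δ'({w}) = {x | (x, w) ∈ ρ} and w ∈ Δ({x}) iff x ∈ Δ'({w}). All data below are hypothetical, chosen only to exhibit the hull inequality f^{L(Δ)L(Δ)'} ≤ f.

```python
# Conjugation of type Lau (1.223) and its dual (1.226) on finite sets,
# with a polarity generated by a relation rho on X x W (illustrative
# sketch: X, W, rho, and f below are arbitrary small examples).
X = ['x1', 'x2', 'x3']
W = ['w1', 'w2']
rho = {('x1', 'w1'), ('x2', 'w1'), ('x2', 'w2')}  # x in Delta'({w}) iff (x, w) in rho

INF = float('inf')
f = {'x1': 0.0, 'x2': 2.0, 'x3': -1.0}

def lau_conj(f):
    # f^{L(Delta)}(w) = -inf{ f(x) : x in the complement of Delta'({w}) };
    # an empty infimum is +inf, so the conjugate value is -inf there.
    return {w: -min([f[x] for x in X if (x, w) not in rho], default=INF)
            for w in W}

def lau_conj_dual(g):
    # g^{L(Delta)'}(x) = -inf{ g(w) : w with x not in Delta'({w}) }.
    return {x: -min([g[w] for w in W if (x, w) not in rho], default=INF)
            for x in X}

second = lau_conj_dual(lau_conj(f))     # f^{L(Delta)L(Delta)'}
hull_leq_f = all(second[x] <= f[x] for x in X)
```

The second conjugate acts as a hull operation: it never exceeds f, and it agrees with f at points that the polarity "separates" well, here at x3.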
The dual L(Δ)' of L(Δ) is again a conjugation of type Lau (namely, L(Δ)' = L(Δ'), with Δ' of (1.143)), and we have (L(Δ)')' = L(Δ). For any f : X → R̄, the function (f^{L(Δ)})^{L(Δ)'} : X → R̄ is denoted by f^{L(Δ)L(Δ)'}. By (1.226) and (1.223), we have

f^{L(Δ)L(Δ)'} = f_{q(Δ'Δ)}   (f ∈ R̄^X),   (1.227)

with f_{q(Δ'Δ)} of (1.153). In particular, for f = p_C of (1.224) (where C ⊆ X is any set), we have

(p_C)^{L(Δ)L(Δ)'} = p_{Δ'Δ(C)}.   (1.228)

For the polarities Δ = Δ^{11} of (1.189), Δ = Δ^{12} of (1.191), and Δ = Δ^{13} of (1.193) we have, by (1.223),

f^{L(Δ^{11})}(Φ, d) = −inf_{x∈X, Φ(x)>d} f(x)   (f ∈ R̄^X, (Φ, d) ∈ (X*\{0}) × R),   (1.229)

f^{L(Δ^{12})}(Φ, d) = −inf_{x∈X, Φ(x)≥d} f(x)   (f ∈ R̄^X, (Φ, d) ∈ (X*\{0}) × R),   (1.230)

f^{L(Δ^{13})}(Φ, d) = −inf_{x∈X, Φ(x)=d} f(x)   (f ∈ R̄^X, (Φ, d) ∈ (X*\{0}) × R).   (1.231)

If Δ : 2^X → 2^W is a polarity, f ∈ R̄^X, and z₀ ∈ X, f(z₀) > −∞, the subdifferential of f at z₀ with respect to the conjugation of type Lau L(Δ) is the subset ∂^{L(Δ)}f(z₀) of W defined by

∂^{L(Δ)}f(z₀) := {w₀ ∈ W | z₀ ∈ ∁Δ'({w₀}), f(z₀) = −f^{L(Δ)}(w₀)}
= {w₀ ∈ W | z₀ ∈ ∁Δ'({w₀}), f(z₀) = min_{x∈∁Δ'({w₀})} f(x)};   (1.232)

here the assumption f(z₀) > −∞ is essential.
1.3 Duality for best approximation by elements of convex sets

In this section we shall present some duality results and some methods of obtaining them, for best approximation in normed linear spaces by elements of convex sets. These will serve as a basis of comparison with the nonconvex duality results of Chapters 2 and 5 and with the methods of obtaining them. We recall that if G is a subset of a normed linear space X, any g₀ ∈ G for which the inf in (1.50) (with C = G) is attained, i.e., such that

‖x₀ − g₀‖ = inf_{g∈G} ‖x₀ − g‖,   (1.233)
or, equivalently, such that

‖x₀ − g₀‖ ≤ ‖x₀ − g‖   (g ∈ G),   (1.234)

is called an element of best approximation of (or a nearest point to) x₀ in G (see Figure 1.3).

Figure 1.3.

We shall denote by P_G(x₀) the set of all nearest points to x₀ in G, that is,

P_G(x₀) := {g₀ ∈ G | ‖x₀ − g₀‖ = inf_{g∈G} ‖x₀ − g‖}.   (1.235)

We shall denote by max (respectively, min) any sup (respectively, inf) that is attained. Thus, in (1.233) and (1.235) one can replace inf by min. Clearly, P_G(g₀) = {g₀} for all g₀ ∈ G. In finite-dimensional spaces X, if G ⊆ X is closed, then P_G(x₀) ≠ ∅ for all x₀ ∈ X (see, e.g., [210]), but as we shall see below, in infinite-dimensional normed linear spaces X we may have P_G(x₀) = ∅, that is, elements of best approximation of x₀ need not exist, even for closed sets G with "very good" geometric properties. One can also express P_G(x₀) with the aid of the (closed) ball

B(x₀, d) := {y ∈ X | ‖x₀ − y‖ ≤ d},   (1.236)

with center x₀ and radius d = dist(x₀, G), namely,

P_G(x₀) = G ∩ B(x₀, dist(x₀, G)).   (1.237)
We shall be concerned with the following two main problems:

(1) Find convenient formulae for dist(x₀, G).

(2) Give characterizations of elements of best approximation, i.e., necessary and sufficient conditions in order that an element g₀ ∈ G satisfy (1.233) (that is, in order that g₀ ∈ P_G(x₀)).

For these problems, "duality" means simply their study with the aid of the elements of the conjugate space X*. Indeed, this is quite natural in the light of the results of the next section, since best approximation by convex sets is a particular case of convex optimization, namely, it is the infimization of the convex function (1.264) on convex sets, and since the function (1.264) has very good properties (it is finite and continuous). We have the following basic formula for the distance to a convex set.
Theorem 1.11. Let X be a normed linear space, G a convex subset of X, and x₀ ∈ ∁Ḡ. Then

dist(x₀, G) = max_{Φ∈X*, ‖Φ‖=1} {Φ(x₀) − sup Φ(G)}.   (1.238)

In other words, we have

dist(x₀, G) ≥ Φ(x₀) − sup Φ(G)   (Φ ∈ X*, ‖Φ‖ = 1),   (1.239)

and there exists Φ₀ ∈ X* such that

‖Φ₀‖ = 1,   (1.240)

dist(x₀, G) = Φ₀(x₀) − sup Φ₀(G).   (1.241)

Proof. We have (1.239), since

‖x₀ − g‖ ≥ Φ(x₀ − g) ≥ Φ(x₀) − sup Φ(G)   (g ∈ G, Φ ∈ X*, ‖Φ‖ = 1).

Furthermore, since x₀ ∈ ∁Ḡ, we have dist(x₀, G) > 0. Let

A := {y ∈ X | ‖x₀ − y‖ < dist(x₀, G)} = int B(x₀, dist(x₀, G)).   (1.242)

Then A is a nonempty open convex set, and G ∩ A = ∅. Hence, by the separation theorem, there exists Φ₀ ∈ X*\{0} such that

sup Φ₀(G) ≤ inf Φ₀(A);   (1.243)

we may assume without loss of generality (dividing by ‖Φ₀‖, if necessary) that ‖Φ₀‖ = 1. We have

Φ₀(x₀) − sup Φ₀(G) > 0;   (1.244)

indeed, otherwise, from (1.243) we would obtain Φ₀(x₀) ≤ inf Φ₀(A), in contradiction to x₀ ∈ A. Let us consider the hyperplane

H₀ := {y ∈ X | Φ₀(y) = sup Φ₀(G)}.   (1.245)

By Lemma 1.5, (1.244), and ‖Φ₀‖ = 1, we have

dist(x₀, H₀) = Φ₀(x₀) − sup Φ₀(G) = inf_{g∈G} {Φ₀(x₀) − Φ₀(g)} ≤ inf_{g∈G} ‖x₀ − g‖ = dist(x₀, G).

If dist(x₀, H₀) < dist(x₀, G), then there exists h₀ ∈ H₀ such that ‖x₀ − h₀‖ < dist(x₀, G), so h₀ ∈ A. Hence, using also that h₀ ∈ H₀ and formula (1.243), we obtain

Φ₀(h₀) = sup Φ₀(G) ≤ inf Φ₀(A) ≤ Φ₀(h₀),

so Φ₀(h₀) = inf Φ₀(A), which contradicts Lemma 1.8. Therefore, we must have

Φ₀(x₀) − sup Φ₀(G) = dist(x₀, H₀) = dist(x₀, G).   (1.246) □
Remark 1.14. (a) For various particular classes of convex sets G, e.g., for convex cones G, linear subspaces G, or finite-dimensional convex sets G, formula (1.238) for dist(x₀, G) takes simpler forms (see, e.g., [210, 211] and the references therein).

(b) From (1.238) it follows that for x₀ ∈ ∁Ḡ we have

dist(x₀, G) = max_{Φ∈X*, ‖Φ‖=1, sup Φ(G)<Φ(x₀)} {Φ(x₀) − sup Φ(G)} = max_{Φ∈X*, ‖Φ‖=1, sup Φ(G)≤Φ(x₀)} {Φ(x₀) − sup Φ(G)};   (1.247)

indeed, for any Φ ∈ X* with ‖Φ‖ = 1, sup Φ(G) ≥ Φ(x₀), we have Φ(x₀) − sup Φ(G) ≤ 0 < dist(x₀, G).
Now we shall deduce from Theorem 1.11 some other duality formulas for the distance to a convex set, and we shall give for them some geometric interpretations.

Corollary 1.7. Let X be a normed linear space, G a convex subset of X, and x₀ ∈ ∁Ḡ. Then

dist(x₀, G) = max_{Φ∈X*, ‖Φ‖=1, sup Φ(G)<Φ(x₀)} |Φ(x₀) − sup Φ(G)| = max_{Φ∈X*, ‖Φ‖=1, sup Φ(G)≤Φ(x₀)} |Φ(x₀) − sup Φ(G)|.   (1.248)

Proof. By (1.247), we have the inequalities ≤ in (1.248). On the other hand, for any Φ ∈ X* with ‖Φ‖ = 1, sup Φ(G) ≤ Φ(x₀), we have |Φ(x₀) − sup Φ(G)| = Φ(x₀) − sup Φ(G), whence by (1.239),

dist(x₀, G) ≥ sup_{Φ∈X*, ‖Φ‖=1, sup Φ(G)≤Φ(x₀)} |Φ(x₀) − sup Φ(G)|,

and hence, finally, we obtain (1.248). □
Remark 1.15. (a) One cannot omit in (1.248) the conditions sup Φ(G) < Φ(x₀) and sup Φ(G) ≤ Φ(x₀); for instance, with Φ₀(y) = y₁ (y = (y₁, y₂) ∈ X) one can have dist(x₀, G) = 1 and |Φ₀(x₀) − sup Φ₀(G)| = |0 − d| = d > 1, so (1.247), with sup Φ(G) ≤ Φ(x₀) or sup Φ(G) < Φ(x₀) omitted, fails (in this example, sup Φ₀(G) = d > 0 = Φ₀(x₀)).

(b) Geometrically, the first equality in (1.248) means that

dist(x₀, G) = max_{H∈𝓗_{G,x₀}} dist(x₀, H) = max_{Φ∈X*\{0}, sup Φ(G)<Φ(x₀)} inf_{y∈X, Φ(y)=sup Φ(G)} ‖x₀ − y‖,   (1.249)

where 𝓗_{G,x₀} denotes the set of all hyperplanes that quasi-support the set G and that strictly separate G and x₀. Indeed, it is enough to consider the hyperplanes

H = {y ∈ X | Φ(y) = sup Φ(G)}   (Φ ∈ X*, ‖Φ‖ = 1, sup Φ(G) < Φ(x₀));   (1.250)

the second equality in (1.248) has a similar interpretation, with "strictly separate" replaced by "separate" and sup Φ(G) < Φ(x₀) replaced by sup Φ(G) ≤ Φ(x₀).

(c) Under the assumptions of Theorem 1.11 we also have

dist(x₀, G) = max_{Φ∈X*, ‖Φ‖=1, sup Φ(G)≤inf Φ(A)} |Φ(x₀) − sup Φ(G)|,   (1.251)
with A of (1.242). Indeed, in the proof of Theorem 1.11 we have shown that there exists Φ₀ ∈ X* satisfying (1.240) and (1.243), and that for any such Φ₀ we have (1.246), which yields (1.251).

Corollary 1.8. Let X be a normed linear space, G a convex subset of X, and x₀ ∈ ∁Ḡ. Then

dist(x₀, G) = max_{Φ∈X*, ‖Φ‖=1, d∈R, sup Φ(G)≤d<Φ(x₀)} |Φ(x₀) − d| = max_{Φ∈X*, ‖Φ‖=1, d∈R, sup Φ(G)≤d≤Φ(x₀)} |Φ(x₀) − d|.   (1.252)

Proof. For any Φ ∈ X* with ‖Φ‖ = 1, sup Φ(G) ≤ Φ(x₀), we have

max_{d∈R, sup Φ(G)≤d<Φ(x₀)} |Φ(x₀) − d| = max_{d∈R, sup Φ(G)≤d≤Φ(x₀)} |Φ(x₀) − d| = Φ(x₀) − sup Φ(G),

whence by (1.248) and (1.239) we obtain (1.252). □
max dist(xo, H) max
inf
^£X*,deR VGX supO(G)<J
lUo —jIL
(1.253)
44
1. Preliminaries
XQ^
(a)
(b) Figure 1.4.
where HQ ^^ denotes the set of all hyperplanes that strictly separate G and XQ. Indeed, it is enough to consider the hyperplanes H =
{y€X\^(y)=d} (O G Z * , | | 0 | | = \,d e /?,sup(D(G)
the second equality in (1.252) has a similar interpretation, with "strictly separate" replaced by "separate" and sup 0(G) < d < 0(xo) replaced by sup 0(G) < d < O(jco) (see Figures 1.4a and 1.4b). (b) The reduction principle: The usefulness of formulas (1.249) and (1.253) consists in the fact that they reduce the computation of the distance from a convex set to the computation of the distance from a hyperplane, and that there exists a simple formula for the computation of the distance to a hyperplane, namely, Lemma 1.5 (which is very convenient for applications in various concrete spaces, since for these spaces the general form of continuous linear functions O e Z* is well known and simple). This basic idea, which we shall call the reduction principle, will be applied later also to nonconvex approximation and will be extended to convex and nonconvex optimization. (c) One can give some geometric consequences of the above results, using halfspaces instead of hyperplanes. The usefulness of those distance formulas consists again in the "reduction principle": they reduce the computation of the distance from a convex set to the computation of the distance from a half-space, and there exists a simple formula for the computation of the distance from a half-space, namely. Corollary 1.4. Duality results for the distance, such as Theorem 1.11, can be used to derive characterizations of nearest points, e.g., the following. Theorem 1.12. Let X be a normed linear space, G a convex subset ofX, and XQ e CG. For an element go e G, the following statements are equivalent: WgoeVcixo)2°. There exists OQ e X* satisfying (1.240) and Oo(xo) - sup Oo(G) = llxo - goll.
(1.255)
1.3 Duality for best approximation by elements of convex sets
45
3°. There exists <J>o e X* satisfying (1.240) and «I>o(xo -g)>
llxo - ^oll
{g e G).
(1.256)
4°. There exists o € X* satisfying (1.240) and cI>o(go) = max cDo(G),
(1.257)
o(^o-^o) = 11x0-foil.
(1-258)
Moreover, one can take the same OQ in statements 2°, 3°, and 4°. Proof. V => 2°. If 1° holds, then by ||jco - ^oll = dist(xo, G) and (1.238) we have 2°. T ^ 3 M f 2 ° holds, then ^oUo - ^) > ^o(-^o) - sup cDo(G) = \\XQ - goII
(g e G).
3° = ^ 4 M f 3° holds, then ^o(go) - ^o(g) = ^oUo - g) - ^o(^o - go) > 11^0 - goII - ^oUo - go) > 0 Iko - goII < ^o(^o - go) < 11-^0 - goII •
(g € G),
4° => l M f 4 ° holds, then lUo - goll = ^oUo - go) < ^oUo - g) < 11-^0 - g\\ that is, go eVcixo).
{g ^ G), •
Remark 1.17. (a) Any function ^o ^ ^* satisfying (1.240) and (1.258) is called a "maximal function" of the element jco — go- The usefulness of Theorem 1.12 for applications in various concrete normed linear spaces is due to the fact that for these spaces the general form of maximal functions of the elements of the space is well known and simple (see, e.g., [210] and the references therein). (b) For various particular classes of convex sets G, e.g., for convex cones G, hnear subspaces G, or finite-dimensional convex sets G, Theorem 1.12 takes simpler forms, which yield some results on characterizations of best approximations, for example the classical theorem of S. Bernstein on the characterization of polynomials of best approximation of degree < AZ of continuous functions on a closed interval [a, Z?], in terms of "points of alternation" of XQ — go, i.e., points at which •^0 — go takes the value |JCO — goll with alternating signs (see, e.g., [210, 211] and the references therein). (c) Theorem 1.12 admits some geometric consequences, such as the following characterization of nearest points: For XQ 6 CG we have go G PG(-^O) if and only if there exists a hyperplane HQ that supports G at go, separates G and XQ, and satisfies llxo-goll =dist(JCO,//o).
(1.259)
46
1. Preliminaries
Indeed, if ^o ^ ^G(-^O), then the hyperplane //Q := {y e X\ ^o(y) = supOo(G)}, with
max
{cD(jco) - supcD(G)}.
(1.260)
Proof. The inequality > in (1.260) is obvious from (1.239). On the other hand, if Oo G X* is as in 4° of Theorem 1.12, then ||Oo|| = 1, OQ e A^(G; go) and we have dist(xo, G) = \\xo - goII = ^o(-^o - go) = ^o(-^o) - sup Oo(G), whence (1.260), with the max attained at 4>o.
•
1.4 Duality for convex and quasi-convex infimization In this section we shall present some duality results and some methods of obtaining them, for infimization of convex and quasi-convex functions on convex sets in locally convex spaces (we consider also the latter, since for the validity of some results only the quasi-convexity of functions is needed, i.e., the convexity of their level sets, rather than their convexity, i.e., the convexity of their epigraphs). These will serve as a basis of comparison with the nonconvex duality results of Chapters 3, 4, 6, and 7 and with the methods of obtaining them. Although the infimization of quasi-convex functions is, actually, nonconvex infimization, we shall include it in this section (rather than devoting to it a separate chapter), since quasi-convexity belongs to the field of "generalized convexity." In the first part we shall present some elements of the theory of "unperturbational dual problems," i.e., of dual problems defined without using perturbations. In the second part we shall present some elements of the theory of "perturbational dual problems," i.e., of dual problems defined with the aid of perturbations of the primal problem, and we shall show that various unperturbational dual problems can be deduced from the perturbational theory, in a unified way, for suitable choices of particular perturbations.
1.4 Duality for convex and quasi-convex infimization

In this section we shall present some duality results and some methods of obtaining them, for infimization of convex and quasi-convex functions on convex sets in locally convex spaces (we consider also the latter, since for the validity of some results only the quasi-convexity of functions is needed, i.e., the convexity of their level sets, rather than their convexity, i.e., the convexity of their epigraphs). These will serve as a basis of comparison with the nonconvex duality results of Chapters 3, 4, 6, and 7 and with the methods of obtaining them. Although the infimization of quasi-convex functions is, actually, nonconvex infimization, we shall include it in this section (rather than devoting to it a separate chapter), since quasi-convexity belongs to the field of "generalized convexity." In the first part we shall present some elements of the theory of "unperturbational dual problems," i.e., of dual problems defined without using perturbations. In the second part we shall present some elements of the theory of "perturbational dual problems," i.e., of dual problems defined with the aid of perturbations of the primal problem, and we shall show that various unperturbational dual problems can be deduced from the perturbational theory, in a unified way, for suitable choices of particular perturbations.
a = inf/(G),
(1.261)
where the "constraint set" G is a subset of a locally convex space X and f: X ^^ R is a function, called "the objective function." When G = X, problem (P) is called "unconstrained." Any go e G for which the inf in (1.261) is attained, i.e., such that /(go) = inf/(G),
(1.262)
is called a (global) "optimal solution" of problem (P). The set of all optimal solutions will be denoted by Scif), that is, Scif)
'= {go e G\ f(go) = inf/(G)};
(1.263)
naturally, one can also write min instead of inf in (1.262) and (1.263). If G is a convex set and / is a convex (respectively a quasi-convex) function, then (P) of (1.261) is called a problem of convex (respectively, quasi-convex) infimization. As has been observed above, best approximation may be regarded as a particular case of infimization, by taking Z to be a normed linear space, xo ^ X, and / : X -> R the convex function fiy):=\\xo-y\\
(yeX);
(1.264)
indeed, then inf/(G) = dist(jco, G),
(1.265)
and the optimal solutions go ^ ^ of problem (P), for this case, are the elements of best approximation of XQ by G. Therefore, it is natural that many results on infimization can be applied to the particular case of best approximation. Moreover, in the converse direction, although the extension from the particular function / of (1.264) to a function / : X ^^ P on a locally convex space Z is a rather big step, it turns out that many results and methods of the theory of best approximation can be extended to results on the infimization of functions. For example, since for the function (1.264) we have Sdif) = [yeX\
\\xo - y\\
(d e P),
(1.266)
the balls Bixo, d) of (1.236) will be replaced by the level sets 5 j ( / ) of (1.22). Also, if X is a normed linear space, JCQ G X, and / is the function (1.264), then for any go e Xv^Q have (see, e.g., [212, Lemma 4.1]) a/(^o) = [^e
X*| cD(xo - go) = \\xo - goW, ll^ll < 1};
(1.267)
48
1. Preliminaries
clearly, when go 7^ XQ, one can take ||0|| = 1 in (1.267). Therefore, the "maximal functions" of Remark 1.17(a) will be replaced by the elements of 9/(go)In the present section and the next one, we shall be concerned with the following two main problems for convex (respectively, quasi-convex) infimization: (1) Find convenient formulae for inf/(G). (2) Give characterizations of optimal solutions, i.e., necessary and sufficient conditions in order that an element go e G satisfy (1.262) (that is, in order that In the duality results for best approximation, the distance dist(;co, G) is expressed by equalities involving the continuous linear functions 4> G X*, such as formula (1.238) in Theorem 1.11. In the theory of duality for the more general case of infimization of convex functions, instead of such equalities there appears a "dual problem" of supremization of a "dual objective function" on a "dual constraint set," and the infimum occurring in the primal problem (P) of (1.261) is not necessarily equal to the supremum in the dual problem; they are equal only under some conditions on the primal variables G and / , but those are satisfied by the particular function (1.264) on a normed linear space X (for example, / of (1.264) is convex and continuous). This explains why in Section 1.3 on best approximation by convex sets no "dual problem" has appeared explicitly. There is also another essential difference between the duality theories for best approximation by convex sets and for convex infimization. 
Namely, while the results of Section 1.3 have been proved by arguments that work directly in X and X* (e.g., separation theorems for convex subsets of X), problem {P) of (1.261) requires one to find the "lowest point" of the graph (or, equivalently, of the epigraph) of the restriction / | G of the convex function / to the convex set G c X (see Figure 1.5, in which X = R^, endowed with its natural topology); therefore, duality results for (1.261) are often obtained by applying separation theorems for the epigraph (1.21) of / , which is a subset of X x /?, by functions in (X x /?)* = X* x R (thus, instead of working with functions on X, one works with functions "one floor higher"); this is made possible by some useful connections between / and epi / (for example, it is well known that / is a convex function if and only if epi / is a convex set). There are two main types of dual problems to any (convex or nonconvex) constrained primal optimization (infimization or supremization) problem: "Lagrangian dual problems" and "surrogate dual problems." Roughly speaking, a Lagrangian dual problem, say, to (1.261), is an optimization problem whose objective function is defined by replacing the primal constraint set G by the whole space X, at the price of adding a "penalty term" to the objective function / (in order to compensate the "violation of the constraints"), and a surrogate dual problem to (1.261) is an optimization problem whose objective function is defined with the aid of the same objective function / , but replacing the constraint set G by a family of "surrogate constraint sets" (usually related in some way to G). We have the following basic theorem of Lagrangian duality. Theorem 1.13. Let X be a locally convex space, G a convex subset of X, and f: X ^ R a proper convex function that is continuous at some point ofGH dom /
1.4 Duality for convex and quasi-convex infimization
49
//
Figure 1.5. {i.e., finite and continuous at some point ofG). Then inf/(G) = max inf [f{y) - 0 ( j ) + inf cD(G)}.
(1.268)
4>GX* yeX
Proofi Clearly, for any G and / we have inf/(G) > inf {/(g) - (D(g) + inf cD(G)} geG
> i n f { / ( j ) - < I . ( y ) + inf
(^ e X*),
yeX
whence the so-called "duality inequality" inf/(G) > sup inf [f{y) - <^{y) -f- inf 0(G)}.
(1.269)
Let us prove now the opposite inequality and the attainment of the sup. Observe first that if inf/(G) = - o o , then, by (1.269), we have the equality (1.268), with the max being —oo, attained at all O e X*. Hence, we may assume that inf/(G) > —oo. Let M := {(g, ri)eXxR\geG,r]<
inf/(G)}.
(1.270)
Then M and e p i / are nonempty convex sets, with i n t e p i / = {(x,d) e X x ^\ fM < d) ^ & (since / has a point of continuity). Also, M H intepi / = 0; indeed, otherwise, if (g, r]) e M Hint epi / , then r] < inf/(G) < f{g) < rj, which is impossible. Hence, there exists (by Theorem 1.1) (^, fi) e (X x R)* = X* x /? separating epi / from M, i.e., such that sup ( ^ , /X)(X, d) < inf (Vl/, ^M)(g, T]), (x,d)eepif i8,r])eM
which implies that
(1.271)
50
1. Preliminaries ^(x) + MJ < ^(g) + M inf/(G)
(U, d) e epi / , g G G ) .
(1.272)
Clearly, ^ ^ 0. Also, /L6 / 0, since otherwise ^ ( x ) < vl/(g) for all x e dom / and g e G, that is, ^ separates d o m / from all points g e G, which is impossible by the "only if" part of Theorem 1.1 (indeed, since by our assumption / is a proper convex function that is continuous at some point go e G Ci dom / , we have go G G n intdom/ 7^ 0). Moreover, /x < 0 (since otherwise, taking (x,d) e e p i / with d -^ +00 in (1.272), we would arrive at a contradiction). Hence, dividing by ~fi (> 0) and taking d = f(x) in (1.272), and O o : = - i v i / (6X*\{0}),
(1.273)
we obtain Oo(x) - fix) < cDo(g) - inf/(G)
(X e d o m / g e G),
(1.274)
(x G d o m / g G G).
(1.275)
whence inf/(G) < fix)-
cDo(x) + cDo(g)
Consequently, inf/(G) < fix) - (Do(x) + infcDo(G)
(x e X),
which together with (1.269), proves (1.268) (with the max attained for O = OQ)- D Remark 1.18. (a) This proof illustrates the "epigraphic methods" of proofs in convex analysis, mentioned above. Later we shall also give another proof of Theorem 1.13, deducing it as a particular case of more general results. (b) Any condition involving the primal constraint set that ensures that (weak or strong) duality holds, e.g., the condition of Theorem 1.13 that / should be continuous at some point of G fl dom / , is called a constraint qualification. There are also other constraint qualifications that ensure the validity of the strong duality formula (1.268), e.g., the following condition discovered by Attouch and Brezis ([7, Corollary 23]): X is a Banach space, G is a closed convex subset ofX,f\X^^Risa lower semicontinuous proper convex function, fl«JU^>o/x(G —dom/) = X (in particular, the latter equality is satisfied when dom f = X). For some other constraint quaUfications see, e.g., Hiriart-Urruty and Lemarechal [104]. (c) In particular, when X is a normed linear space and / is the function (1.264), formula (1.268) means that dist(jco, G) = max inf {||xo-JC|| - CD(JC) + inf 0 ( G ) } . OGX*
(1.276)
xeX
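For intuition, the distance formula (1.276) can be checked numerically in a one-dimensional toy case; the set G = [1, 2], the point x₀ = 0, and the grids below are invented for illustration (they are not data from the text), and the grid search is only a sketch of the max-inf structure:

```python
import numpy as np

# Illustrative check of dist(x0, G) = max_phi inf_x { |x0 - x| - phi*x + inf phi(G) }
# in X = R with G = [1, 2] and x0 = 0 (so dist(x0, G) = 1).
x0 = 0.0
G = np.linspace(1.0, 2.0, 201)        # the constraint set G = [1, 2]
xs = np.linspace(-5.0, 5.0, 2001)     # grid standing in for "inf over x in X"
phis = np.linspace(-1.0, 1.0, 401)    # |phi| <= 1; for |phi| > 1 the inner inf is -infinity

def dual_value(phi):
    # inf_x { |x0 - x| - phi*x } + inf_{g in G} phi*g
    return np.min(np.abs(x0 - xs) - phi * xs) + np.min(phi * G)

beta = max(dual_value(phi) for phi in phis)
alpha = np.min(np.abs(x0 - G))        # primal value dist(x0, G)
print(alpha, beta)                    # both close to 1.0
```

The maximizing functional here is Φ = 1, the unit functional separating x₀ from G, in line with the statement that the max in (1.268) is attained.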
1.4 Duality for convex and quasi-convex infimization

Theorem 1.13 suggests that the left-hand and right-hand sides of the duality formula (1.268) be split into two optimization "problems," namely, the initial primal problem (P) of (1.261) and the "dual problem"

(D)  \[ \beta = \sup \lambda(X^*) = \sup_{\Phi\in X^*} \lambda(\Phi), \tag{1.277} \]
where

\[ \lambda(\Phi) := \inf_{y\in X}\,\{\, f(y) - \Phi(y) + \inf \Phi(G) \,\} \quad (\Phi \in X^*); \tag{1.278} \]

this is a Lagrangian dual problem in the sense mentioned above, with the penalty terms

\[ \pi_\Phi(y) := -\Phi(y) + \inf \Phi(G) \quad (y \in X); \tag{1.279} \]

in other words, the Lagrangian dual problem (1.277), (1.278) "penalizes," via the term (1.279) added to the primal objective function f (with lower addition), the fact that the initial constraint set G is replaced by the whole space X. The "dual constraint set" is the whole conjugate space X* (so (D) is an unconstrained supremization problem), and the "dual objective function" is λ: X* → R̄ of (1.278). The numbers α and β are called the (optimal) values of problems (P) and (D), respectively. With the above notation for the Lagrangian dual problem (1.277), (1.278), the "duality inequality" (1.269) can be written as

\[ \alpha \ge \beta. \tag{1.280} \]
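To see the primal and dual values α and β side by side, here is a hypothetical one-dimensional instance (f(x) = x², G = [1, 2], both made up for illustration); the grids stand in for the infima and suprema, and both values come out equal to 1, so there is no duality gap:

```python
import numpy as np

# Toy instance of the dual problem (1.277)-(1.278): f(x) = x**2, G = [1, 2] in X = R,
# so the primal value is alpha = 1 (attained at x = 1).
xs = np.linspace(-4.0, 4.0, 4001)       # grid standing in for "inf over y in X"
G = np.linspace(1.0, 2.0, 201)

def lam(phi):
    # lambda(phi) = inf_y { f(y) - phi*y + inf phi(G) }
    return np.min(xs**2 - phi * xs) + np.min(phi * G)

phis = np.linspace(-4.0, 4.0, 801)
beta = max(lam(phi) for phi in phis)    # dual value, attained at phi = 2
alpha = np.min(G**2)                    # primal value
print(alpha, beta)                      # both equal 1 (up to grid error)
```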
For any primal-dual pair {(P), (D)} of optimization problems, when α = β (that is, when the values of (P) and (D) coincide), one says that weak duality holds, or that there is no duality gap; when α > β, one says that there is a duality gap. If we have α = β and the dual problem (D) has an optimal solution, that is, if the value of (D) is attained for some Φ₀ ∈ X*, then one says that strong duality holds (see, e.g., Theorems 1.11 and 1.13). Besides the use of constraint qualifications, another method of getting rid of a possible duality gap of a primal-dual pair {(P), (D)} of optimization problems is to replace the dual problem (D) by a new dual problem

(D′)  \[ \beta' = \sup \lambda'(X^*), \tag{1.281} \]

for which α = β′ (possibly without assuming any constraint qualification). For the case of Lagrangian dual problems, one way of doing this is that of replacing the Lagrangian (1.287) by an "augmented Lagrangian" L′: X × X* → R̄. To this end, a useful tool is provided by abstract convex analysis (for some details, see, e.g., [254, Section 0.8a]).

Let us return now to the Lagrangian duality result (1.268). Applying formula (1.268) to a hyperplane G = H = {y ∈ X | Φ₀(y) = d₀}, where Φ₀ ∈ X*\{0}, d₀ ∈ R, and observing that for any Φ ∈ X* we have

\[ \inf \Phi(H) = \begin{cases} \eta d_0 & \text{if } \Phi = \eta\Phi_0,\ \eta \in R, \\ -\infty & \text{if } \Phi \notin R\Phi_0, \end{cases} \tag{1.282} \]

we obtain the following result, which gives a formula for the infimum of a function on a hyperplane:
1. Preliminaries
Corollary 1.10. Let X be a locally convex space, H = {x ∈ X | Φ₀(x) = d₀}, where Φ₀ ∈ X*\{0}, d₀ ∈ R, and f: X → R̄ a proper convex function that is continuous at some point of H ∩ dom f. Then

\[ \inf_{\substack{x\in X\\ \Phi_0(x)=d_0}} f(x) = \max_{\eta\in R}\, \inf_{y\in X}\,\{\, f(y) - \eta\Phi_0(y) + \eta d_0 \,\}. \tag{1.283} \]
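As a sanity check of (1.283), consider a made-up example in X = R²: f(y) = y₁² + y₂², Φ₀(y) = y₁ + y₂, d₀ = 2 (none of this data is from the text). The constrained infimum over the hyperplane and the max-inf on the right-hand side can both be approximated on a grid, and both equal 2:

```python
import numpy as np

# Formula (1.283) for f(y) = y1^2 + y2^2, Phi0(y) = y1 + y2, d0 = 2 in R^2.
# The constrained minimum is 2, attained at y = (1, 1).
d0 = 2.0
g = np.linspace(-3.0, 3.0, 601)
Y1, Y2 = np.meshgrid(g, g)

def inner(eta):
    # inf_y { f(y) - eta*Phi0(y) + eta*d0 }  (closed form: -eta**2/2 + eta*d0)
    return np.min(Y1**2 + Y2**2 - eta * (Y1 + Y2) + eta * d0)

etas = np.linspace(-4.0, 4.0, 801)
dual = max(inner(eta) for eta in etas)          # max attained near eta = 2

# primal: minimize f on the hyperplane y1 + y2 = 2 (parametrized as y = (t, 2 - t))
t = np.linspace(-3.0, 3.0, 6001)
primal = np.min(t**2 + (2.0 - t)**2)
print(primal, dual)                             # both equal 2 (up to grid error)
```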
Remark 1.19. One can obtain max_{η≥0} instead of max_{η∈R} in the right-hand side of (1.283) by taking, instead of the hyperplane H = {x ∈ X | Φ₀(x) = d₀}, the closed half-space

\[ D := \{x \in X \mid \Phi_0(x) \ge d_0\}, \tag{1.284} \]

where Φ₀ and d₀ are as above (so H = bd D, the boundary of D), and assuming that f is a proper convex function that is continuous at a point of D ∩ dom f. Indeed, we have, for any Φ ∈ X*,

\[ \inf \Phi(D) = \begin{cases} \eta d_0 & \text{if } \eta \ge 0,\ \Phi = \eta\Phi_0, \\ -\infty & \text{otherwise}, \end{cases} \tag{1.285} \]

whence by (1.268) (with G = D), we obtain

\[ \inf_{\substack{x\in X\\ \Phi_0(x)\ge d_0}} f(x) = \max_{\eta\ge 0}\, \inf_{y\in X}\,\{\, f(y) - \eta\Phi_0(y) + \eta d_0 \,\}. \tag{1.286} \]
The following is a useful tool for the study of the Lagrangian dual problem (1.277) to (P) of (1.261): the function L: X × X* → R̄ defined by

\[ L(x, \Phi) := f(x) - \Phi(x) + \inf \Phi(G) \quad (x \in X,\ \Phi \in X^*) \tag{1.287} \]

is called the Lagrangian function, or simply the Lagrangian, associated with the primal-dual pair {(P), (D)}, or with the dual problem (D). Thus, by (1.278), (1.287), and (1.277),

\[ \lambda(\Phi) = \inf_{y\in X} L(y, \Phi) \quad (\Phi \in X^*), \tag{1.288} \]

\[ \beta = \sup_{\Phi\in X^*}\, \inf_{y\in X} L(y, \Phi); \tag{1.289} \]

therefore, conversely, (D) of (1.277) may be called the dual problem associated with the Lagrangian function (1.287). Duality results for inf f(G), such as Theorem 1.13, can be used to derive characterizations of optimal solutions of convex optimization problems, e.g., the following one, due to Pshenichnyi and Rockafellar (see, e.g., [106, p. 30]):

Theorem 1.14. Let X be a locally convex space, G a convex subset of X, and f: X → R̄ a convex function that is continuous at some point of G. Then for an element g₀ ∈ G the following statements are equivalent:

1°. g₀ ∈ S_G(f) (i.e., f(g₀) = min f(G)).
2°. There exists Φ₀ ∈ X* such that

\[ \Phi_0 \in \partial f(g_0), \tag{1.290} \]

\[ \Phi_0(g_0) = \min \Phi_0(G). \tag{1.291} \]

Proof. If f(g₀) = min f(G), then by Theorem 1.13 there exists Φ₀ ∈ X* such that

\[ f(g_0) = \inf_{x\in X}\,\{\, f(x) - \Phi_0(x) + \inf \Phi_0(G) \,\}, \tag{1.292} \]

whence

\[ f(g_0) \le f(x) - \Phi_0(x) + \inf \Phi_0(G) \quad (x \in X). \tag{1.293} \]

Taking x = g₀ in (1.293), we obtain 0 ≤ −Φ₀(g₀) + inf Φ₀(G), which, since g₀ ∈ G, yields (1.291). Furthermore, by (1.293), we have

\[ f(g_0) - \Phi_0(g_0) \le f(g_0) + \sup\,(-\Phi_0)(G) = f(g_0) - \inf \Phi_0(G) \le f(x) - \Phi_0(x) \quad (x \in X), \]

so (1.290) holds. Conversely, assume 2°. Then by (1.290), we have

\[ f(g_0) - \Phi_0(g_0) = \inf_{x\in X}\,\{\, f(x) - \Phi_0(x) \,\}, \]

which, together with (1.291), yields (1.292). Hence, by Theorem 1.13 and g₀ ∈ G, we obtain f(g₀) = min f(G). □

Remark 1.20. (a) Let us also mention a more classical proof of Theorem 1.14, based on the fact that 2° is equivalent to

\[ -N(G; g_0) \cap \partial f(g_0) \ne \emptyset. \tag{1.294} \]

By the definition (1.122) of χ_G, 1° can also be written in the form (f + χ_G)(g₀) = min (f + χ_G)(G), and, by the definition (1.110) of the subdifferential, this equality holds if and only if 0 ∈ ∂(f + χ_G)(g₀). But since dom χ_G = G, by Theorem 1.5 and formula (1.124) we have

\[ \partial(f + \chi_G)(g_0) = \partial f(g_0) + \partial \chi_G(g_0) = \partial f(g_0) + N(G; g_0), \]

so 1° is equivalent to 0 ∈ ∂f(g₀) + N(G; g₀), that is, to (1.294).

(b) In the particular case that X is a normed linear space and f is the function (1.264), from Theorem 1.14 one obtains again Theorem 1.12 on the characterization of the elements of best approximation, by using the subdifferential formula (1.267). In the case that optimal solutions exist, Theorem 1.14 permits the following sharpening of the basic Lagrangian duality formula (1.268):
Corollary 1.11. Let X be a locally convex space, G a convex subset of X, and f: X → R̄ a proper convex function that is continuous at some point of G ∩ dom f. If problem (P) has a solution, say g₀, then

\[ \min f(G) = f(g_0) = \max_{\Phi\in N(G;\,g_0)}\, \inf_{x\in X}\,\{\, f(x) + \Phi(x) - \Phi(g_0) \,\}. \tag{1.295} \]

Proof. The inequality ≥ in (1.295), with max replaced by sup, is obvious (for Φ ∈ N(G; g₀) and x ∈ G we have Φ(x) − Φ(g₀) ≤ 0). On the other hand, if Φ₀ ∈ X* is as in Theorem 1.14, then −Φ₀ ∈ N(G; g₀), and we have

\[ \inf f(G) = f(g_0) = \inf_{x\in X}\,\{\, f(x) - \Phi_0(x) + \Phi_0(g_0) \,\}, \]

whence (1.295), with the max attained at Φ = −Φ₀. □
We have the following simultaneous characterization of primal and dual solutions, and of strong Lagrangian duality:

Proposition 1.2. Let X be a locally convex space, G a convex subset of X, g₀ ∈ G, f: X → R̄ a function, and Φ₀ ∈ X*. The following statements are equivalent:

1°. We have (1.292).

2°. g₀ is a solution of problem (P) (of (1.261)), Φ₀ is a solution of the dual problem (D) (of (1.277)), and we have strong duality (i.e., α = β, with β being attained).

Proof. If 1° holds, then, by the duality inequality (1.280), we have

\[ \alpha = \inf f(G) \le f(g_0) = \inf_{x\in X}\,\{\, f(x) - \Phi_0(x) + \inf \Phi_0(G) \,\} = \lambda(\Phi_0) \le \beta \le \alpha, \]

which yields 2°. Conversely, if 2° holds, then

\[ f(g_0) = \min f(G) = \alpha = \beta = \inf_{x\in X}\,\{\, f(x) - \Phi_0(x) + \inf \Phi_0(G) \,\}. \] □
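Theorem 1.14 has a familiar geometric reading for projections. Taking the smooth variant f(x) = ½‖x − x₀‖² of the distance function in X = R² (so ∂f(g₀) is the single gradient g₀ − x₀) and a made-up box G (both choices are illustrative assumptions, not from the text), conditions (1.290)-(1.291) reduce to the classical projection characterization ⟨x₀ − g₀, g − g₀⟩ ≤ 0 for all g ∈ G:

```python
import numpy as np

# Illustration of Theorem 1.14 for f(x) = 0.5*||x - x0||**2 in R^2, where the
# subdifferential at g0 is the single gradient {g0 - x0}.  G is a made-up box
# [0,1] x [0,1]; projecting x0 onto it gives g0, and Phi0 := g0 - x0 must then
# satisfy Phi0(g0) = min Phi0(G), i.e. condition (1.291).
x0 = np.array([2.0, 0.5])
g0 = np.clip(x0, 0.0, 1.0)              # projection onto the box: (1.0, 0.5)
phi0 = g0 - x0                          # the element of the subdifferential (1.290)

# check (1.291) on a grid over G
g = np.linspace(0.0, 1.0, 101)
G1, G2 = np.meshgrid(g, g)
values = phi0[0] * G1 + phi0[1] * G2    # Phi0 evaluated over G
print(np.isclose(phi0 @ g0, values.min()))   # True: Phi0(g0) = min Phi0(G)
```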
In the preceding, the primal constraint set G has been an arbitrary convex subset of a locally convex space X. One can consider some more structured ways of expressing the primal constraint sets G ⊆ X. For example, one can take

\[ G = \{x \in X \mid u(x) \in T\}, \tag{1.296} \]

where Z is a set, T is a subset of Z, u: X → Z is a mapping, and f: X → R̄ is a function. It is convenient to introduce the following terminology:

Definition 1.3. (a) Any triple (X, Z, u), consisting of two sets X, Z and a mapping u: X → Z, is called a system. Given a system (X, Z, u), any subset T of Z is called a target set.

(b) A linear system is a triple (X, Z, u) consisting of two locally convex spaces X, Z and a continuous linear mapping u: X → Z.

(c) A convex system is a triple (X, Z, u) consisting of a locally convex space X, a partially ordered locally convex space Z = (Z, ≤), and a "convex mapping" u: X → Z, i.e., such that

\[ u(cx_1 + (1-c)x_2) \le c\,u(x_1) + (1-c)\,u(x_2) \quad (x_1, x_2 \in X,\ 0 \le c \le 1). \tag{1.297} \]
Remark 1.21. One can also consider (see, e.g., Combari, Laghdir, and Thibault [33]) convex systems (X, Z ∪ {+∞}, u) in which u: X → Z ∪ {+∞} is a proper convex function (in the sense (1.297)), but in the sequel we shall limit ourselves to the classical case u: X → Z.

Definition 1.4. For a linear (respectively, a convex) system (X, Z, u), a target set T ⊆ Z, and a continuous linear (respectively, a convex) function f: X → R, the primal infimization problem

(P)  \[ \alpha = \inf_{\substack{x\in X\\ u(x)\in T}} f(x) \tag{1.298} \]

is called a linear (respectively, a convex) programming problem.

Remark 1.22. (a) In (1.283), the primal constraint set G = H = {x ∈ X | Φ₀(x) = d₀} may be regarded as being "structured" in the sense of (1.296), namely, by taking Z = R, u = Φ₀, and T = {d₀} (a singleton). A similar remark is valid also for the constraint set (1.284) occurring in (1.286), by taking Z = R, u = Φ₀, and T = {d ∈ R | d ≥ d₀}.

(b) Problem (1.298) is equivalent to problem (P) of (1.261). Indeed, given X, Z, u, T, and f as above, the programming problem (1.298) is nothing but the optimization problem (1.261) with G = {x ∈ X | u(x) ∈ T} := u⁻¹(T). Conversely, every optimization problem (1.261) can be written in the form of a programming problem (1.298), for example by taking Z = X, u = I_X, the identity operator in X (i.e., u(x) = x for all x ∈ X), and T = G. Naturally, a constraint set G of an optimization problem (1.261) can be written in different ways in the form (1.296), which give rise to different dual problems. Note that the advantage of (1.296), (1.298) is that one can also use the properties of T and u.

(c) One can combine problems (1.261) and (1.298), considering the infimization problem

(P)  \[ \alpha = \inf_{\substack{x\in G\\ u(x)\in T}} f(x), \tag{1.299} \]

where X, Z, u, T, and f are as above, and G is a subset of X, called an "abstract constraint set." However, we shall not pursue this direction.

Let us give now some results of unperturbational Lagrangian duality for problems with structured primal constraint sets in the sense of (1.296), which will be used later. Some other unperturbational Lagrangian duality theorems will be deduced in Section 1.4.2 from more general results on perturbational duality. The following result holds for arbitrary extended-real-valued functions.

Proposition 1.3. Let X be a linear space, and let f, l₁, . . . ,
lₘ: X → R̄ be m + 1 functions. Then

\[ \inf_{\substack{y\in X\\ l_i(y)<0\ (i=1,\dots,m)}} f(y) \;\ge\; \inf_{\substack{y\in X\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y) \;\ge\; \sup_{\eta\in R_+^m}\, \inf_{y\in X}\Big\{ f(y) \dotplus \sum_{i=1}^m \eta_i l_i(y) \Big\}, \tag{1.300} \]

with the upper addition ∔ of (1.84) and the upper multiplication × of (1.92), which we have denoted simply by ×, that is,

\[ \sum_{i=1}^m \eta_i l_i(y) := \sum_{i=1}^m \eta_i \times l_i(y) \quad (y \in X,\ \eta \in R_+^m); \tag{1.301} \]

here, as well as throughout the sequel, Σ means upper addition and R₊ = [0, +∞).

Proof. By (1.301), for any y ∈ X we have

\[ f(y) \dotplus \chi_{\{x\in X \mid l_i(x)\le 0\ (i=1,\dots,m)\}}(y) = \sup_{\eta\in R_+^m}\Big\{ f(y) \dotplus \sum_{i=1}^m \eta_i l_i(y) \Big\} \tag{1.302} \]

(both sides equal +∞ if max_{1≤i≤m} l_i(y) > 0). Consequently, by (1.302) and the well-known inequality inf sup ≥ sup inf,

\[ \inf_{\substack{y\in X\\ l_i(y)<0\ (i=1,\dots,m)}} f(y) \ge \inf_{\substack{y\in X\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y) = \inf_{y\in X}\Big\{ f(y) \dotplus \chi_{\{x\in X\mid l_i(x)\le 0\ (i=1,\dots,m)\}}(y) \Big\} = \inf_{y\in X}\, \sup_{\eta\in R_+^m}\Big\{ f(y) \dotplus \sum_{i=1}^m \eta_i l_i(y) \Big\} \ge \sup_{\eta\in R_+^m}\, \inf_{y\in X}\Big\{ f(y) \dotplus \sum_{i=1}^m \eta_i l_i(y) \Big\}. \] □
Remark 1.23. The first two terms of (1.300), that is,

(P_<)  \[ \alpha = \inf_{\substack{y\in X\\ l_i(y)<0\ (i=1,\dots,m)}} f(y), \tag{1.303} \]

respectively

(P_≤)  \[ \alpha = \inf_{\substack{y\in X\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y), \tag{1.304} \]

are "structured" primal convex programming problems in the sense (1.298), with Z = R^m, u: X → R^m defined by

\[ u(y) = (l_1(y), \dots, l_m(y)) \quad (y \in X), \tag{1.305} \]

and T = {z ∈ R^m | z < 0}, respectively T = {z ∈ R^m | z ≤ 0}. By the right-hand sides of (1.310) and (1.326), one can define the Lagrangian dual problem to each of these primal problems by

(D)  \[ \beta = \sup \lambda(R_+^m), \tag{1.306} \]

where
\[ \lambda(\eta) := \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\} \quad (\eta = (\eta_1, \dots, \eta_m) \in R_+^m), \tag{1.307} \]

that is, β of (1.315), and it is natural to associate to each of the pairs {(P_<), (D)} and {(P_≤), (D)} the Lagrangian (function)

\[ L(y, \eta) = f(y) + \sum_{i=1}^m \eta_i l_i(y) \quad (y \in X,\ \eta = (\eta_1, \dots, \eta_m) \in R_+^m). \tag{1.308} \]
In the next result of strong duality, the assumptions on the functions f and l₁, . . . , lₘ are very general.

Theorem 1.15. Let X be a linear space, and let f, l₁, . . . , lₘ: X → R̄ be m + 1 convex functions. If the "Slater constraint qualification"

\[ (\operatorname{dom} f) \cap \{y \in X \mid l_i(y) < 0\ (i = 1, \dots, m)\} \ne \emptyset \tag{1.309} \]

is satisfied, then

\[ \inf_{\substack{y\in X\\ l_i(y)<0\ (i=1,\dots,m)}} f(y) = \inf_{\substack{y\in X\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y) = \max_{\eta\in R_+^m}\, \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}. \tag{1.310} \]
Proof. For simplicity, let us set

\[ I_m := \{1, \dots, m\}. \tag{1.311} \]

For the first equality of (1.310), it is enough to prove the inequality ≤. By (1.309), there exists y₀ ∈ dom f such that

\[ l_i(y_0) < 0 \quad (i \in I_m). \tag{1.312} \]

Let y ∈ X be such that l_i(y) ≤ 0 (i ∈ I_m). Let us denote the left-hand side of (1.310) by α, and put

\[ y_n = y + \tfrac{1}{n}(y_0 - y) \quad (n = 1, 2, \dots). \tag{1.313} \]

Then, by (1.313) and the convexity of the l_i, we have

\[ l_i(y_n) \le \big(1 - \tfrac{1}{n}\big)\, l_i(y) + \tfrac{1}{n}\, l_i(y_0) < 0 \quad (i \in I_m;\ n = 1, 2, \dots), \]

whence, by (1.313) and the convexity of f,

\[ \alpha = \inf_{\substack{y\in X\\ l_i(y)<0\ (i\in I_m)}} f(y) \le f(y_n) \le \big(1 - \tfrac{1}{n}\big) f(y) + \tfrac{1}{n}\, f(y_0) \quad (n = 1, 2, \dots). \tag{1.314} \]

We shall show that α ≤ f(y), which, since y ∈ X with l_i(y) ≤ 0 (i = 1, . . . , m) was arbitrary, will prove the first equality of (1.310). If f(y) = +∞, we are done. If f(y) = −∞, then by (1.314) and y₀ ∈ dom f, we have α = −∞ = f(y). Assume now that f(y) ∈ R. If f(y₀) = −∞, then by (1.314), α = −∞ ≤ f(y). Finally, if f(y₀) ∈ R, then by (1.314),

\[ \alpha - \tfrac{1}{n}\, f(y_0) \le \big(1 - \tfrac{1}{n}\big) f(y) \quad (n = 1, 2, \dots), \]

whence, passing to the limit as n → +∞, we obtain α ≤ f(y).

Let us prove now the second equality of (1.310). Put

\[ \beta := \sup_{\eta\in R_+^m}\, \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}. \tag{1.315} \]

By Proposition 1.3, we have α ≥ β. Hence, if α = −∞, then β = −∞, and therefore, for any η ∈ R₊^m,

\[ \alpha = -\infty = \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}, \]

and we are done. On the other hand, by the Slater condition (1.312), we have α ≤ f(y₀) < +∞, so it remains to consider the case α ∈ R. Define v: R^m → R̄ by

\[ v(z) := \inf_{\substack{y\in X\\ l_i(y)\le z_i\ (i\in I_m)}} f(y) \quad (z = (z_1, \dots, z_m) \in R^m). \tag{1.316} \]

Then v is convex (see, e.g., Ekeland and Temam [54, Ch. 3, Lemma 5.2]) and decreasing (i.e., v(z′) ≥ v(z″) for z′, z″ ∈ R^m, z′ ≤ z″) for the natural (product) order on R^m. Moreover, by (1.312), there exists ε > 0 such that l_i(y₀) ≤ −ε for all i ∈ I_m. Let z₀ := −(ε, . . . , ε) ∈ R^m. Then for any z ∈ R^m with z₀ ≤ z ≤ 0, we have

\[ -\infty < \alpha = v(0) \le v(z) \le v(z_0) = \inf_{\substack{y\in X\\ l_i(y)\le -\varepsilon\ (i\in I_m)}} f(y) \le f(y_0) < +\infty, \]

so v is finite valued on [−ε, 0]^m. Hence, since v is convex, it cannot take the value −∞, and thus it is proper (recall that v(0) = α ∈ R, so v ≢ +∞). Furthermore, since v is decreasing, for any z ∈ R^m with z₀ ≤ z, that is, any z ∈ [−ε, +∞)^m, we have v(z) ≤ v(z₀) < +∞, and hence 0 ∈ int(dom v). Consequently, v is subdifferentiable at 0 (see, e.g., Ekeland and Temam [54, Ch. 1, Proposition 5.2]), so there exists η⁰ ∈ R^m such that

\[ v(z) - v(0) \ge -\sum_{i=1}^m \eta_i^0 z_i \quad (z \in R^m), \tag{1.317} \]

or equivalently,

\[ \inf_{\substack{y\in X\\ l_i(y)\le z_i\ (i\in I_m)}} f(y) + \sum_{i=1}^m \eta_i^0 z_i \ge \alpha \quad (z \in R^m). \tag{1.318} \]

We claim that η⁰ ∈ R₊^m. Indeed, let j ∈ I_m and define a sequence {zⁿ} ⊆ R^m by

\[ z_i^n := \begin{cases} 0 & \text{if } i \ne j, \\ n & \text{if } i = j \end{cases} \quad (i \in I_m;\ n = 1, 2, \dots); \tag{1.319} \]

then, from (1.312) and (1.319) we have l_i(y₀) ≤ z_i^n (i ∈ I_m), whence, by (1.312), (1.316), (1.317), and (1.319),

\[ +\infty > f(y_0) - v(0) \ge \inf_{\substack{y\in X\\ l_i(y)\le z_i^n\ (i\in I_m)}} f(y) - v(0) = v(z^n) - v(0) \ge -\eta_j^0\, n \quad (n = 1, 2, \dots), \]

and therefore η_j⁰ ≥ 0, proving the claim. Let us prove now that

\[ \alpha \le \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i^0 l_i(y) \Big\}. \tag{1.320} \]

Let y ∈ X. If there exists j ∈ I_m such that l_j(y) = +∞, or if f(y) = +∞, then, by the rules (1.84) and (1.92) for ∔ and ×, we have f(y) + Σ_{i=1}^m η_i⁰ l_i(y) = +∞ ≥ α. Assume now that l_i(y) < +∞ (i ∈ I_m) and f(y) < +∞, and let

\[ J := \{\, i \in I_m \mid l_i(y) = -\infty \,\}. \tag{1.321} \]

Define a sequence {yⁿ} ⊆ R^m by

\[ y_i^n := \begin{cases} l_i(y) & \text{if } i \in I_m \setminus J, \\ -n & \text{if } i \in J \end{cases} \quad (n = 1, 2, \dots). \tag{1.322} \]

Then l_i(y) ≤ y_i^n (i ∈ I_m; n = 1, 2, . . .), whence, by (1.318) and (1.322),

\[ \alpha \le f(y) + \sum_{i=1}^m \eta_i^0 y_i^n = f(y) + \sum_{i\in I_m\setminus J} \eta_i^0 l_i(y) - n \sum_{i\in J} \eta_i^0. \tag{1.323} \]

If there existed j ∈ J such that η_j⁰ > 0, then, passing to the limit in (1.323) for n → +∞, we would obtain a contradiction with α > −∞. Therefore, η_i⁰ = 0 for all i ∈ J, and hence, by (1.92) and (1.323), we have (1.320). Consequently,

\[ \beta \ge \inf_{y\in X}\Big\{ f(y) + \sum_{i=1}^m \eta_i^0 l_i(y) \Big\} \ge \alpha \ge \beta, \]

which yields the second equality in (1.310), with the max attained at η = η⁰. □
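For a concrete instance of Theorem 1.15 (invented for illustration, not from the text): take f(y) = e^y and the single constraint l₁(y) = 1 − y in X = R, so the Slater condition (1.309) holds (l₁(2) = −1 < 0). Both sides of (1.310) equal e, with the max attained at η = e:

```python
import numpy as np

# Toy check of (1.310): f(y) = exp(y), one constraint l1(y) = 1 - y <= 0.
# Slater holds (l1(2) = -1 < 0); the primal value is e (at y = 1), and the dual
# function eta -> inf_y {exp(y) + eta*(1 - y)} equals 2*eta - eta*log(eta),
# maximized at eta = e with value e.
ys = np.linspace(-10.0, 10.0, 20001)          # grid standing in for X = R
f = np.exp(ys)
l1 = 1.0 - ys

primal = np.min(f[l1 <= 1e-9])                # inf over the feasible set (tiny grid tolerance)

def lam(eta):
    return np.min(f + eta * l1)               # inf_y { f(y) + eta*l1(y) }

etas = np.linspace(0.0, 10.0, 2001)
dual = max(lam(eta) for eta in etas)
print(primal, dual)                           # both close to e = 2.718...
```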
Remark 1.24. (a) For a locally convex space X and finite continuous convex functions l₁, . . . , lₘ: X → R, Theorem 1.15 is the particular case Z = R^m of Corollary 1.12(b) below.

(b) Applying Theorem 1.15 to a locally convex space X, m = 1, and the affine function l₁ = −Φ₀ + d₀, where Φ₀ ∈ X*\{0} and d₀ ∈ R, we obtain that if f: X → R̄ is a convex function such that

\[ (\operatorname{dom} f) \cap \{y \in X \mid \Phi_0(y) > d_0\} \ne \emptyset, \tag{1.324} \]

then

\[ \inf_{\substack{y\in X\\ \Phi_0(y)>d_0}} f(y) = \inf_{\substack{y\in X\\ \Phi_0(y)\ge d_0}} f(y) = \max_{\eta\ge 0}\, \inf_{y\in X}\,\{\, f(y) - \eta\Phi_0(y) + \eta d_0 \,\}. \tag{1.325} \]

The second equality of (1.325) is nothing other than (1.286), but now it has been proved under different assumptions.

Under some additional restrictions, but without assuming any constraint qualification, one can obtain a formula of the above type, with max replaced by sup. Indeed, we have the following result:

Theorem 1.16. Let K be a compact convex set in a topological linear space X, and let f, l₁, . . . , lₘ: K → R be m + 1 finite-valued lower semicontinuous convex functions. Then

\[ \inf_{\substack{y\in K\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y) = \sup_{\eta\in R_+^m}\, \inf_{y\in K}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}. \tag{1.326} \]

Proof. For each y ∈ K with l_i(y) ≤ 0 (i = 1, . . . , m) we have

\[ f(y) = \sup_{\eta\in R_+^m}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}, \]

whence

\[ \inf_{\substack{y\in K\\ l_i(y)\le 0\ (i=1,\dots,m)}} f(y) = \inf_{y\in K}\, \sup_{\eta\in R_+^m}\Big\{ f(y) + \sum_{i=1}^m \eta_i l_i(y) \Big\}. \tag{1.327} \]

Hence, by (1.327) and the minimax Theorem 1.8, we obtain (1.326). □
Let us pass now to (unperturbational) surrogate duality for quasi-convex infimization. A surrogate dual problem to a primal infimization problem (P) of (1.261) is a supremization problem whose objective function is defined with the aid of the same objective function f, but replacing the constraint set G by a family of "surrogate constraint sets" (usually related in some way to G). We have the following result of (strong) surrogate duality for quasi-convex problems (P) of (1.261):

Theorem 1.17. Let X be a locally convex space, G a convex subset of X, f: X → R̄ an upper semicontinuous quasi-convex function for which the constraint set G is "essential," that is,

\[ \inf f(X) < \inf f(G) < +\infty, \tag{1.328} \]

and x₀ an element of X such that

\[ f(x_0) < \inf f(G). \tag{1.329} \]

Then

\[ \inf f(G) = \max_{\substack{\Phi\in X^*\setminus\{0\}\\ \sup\Phi(G)<\Phi(x_0)}}\; \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \tag{1.330} \]
Proof. Since f is upper semicontinuous, by (1.329) and Lemma 1.1 we have f(x₀) < inf f(G) = inf f(cl G), whence x₀ ∉ cl G. Hence, since cl G is convex, by the strict separation theorem there exists Φ ∈ X*\{0} such that

\[ \sup \Phi(G) = \sup \Phi(\operatorname{cl} G) < \Phi(x_0). \tag{1.331} \]

Consider any Φ ∈ X*\{0} satisfying (1.331), and any ε > 0. Take g = g_ε ∈ G such that

\[ f(g) \le \inf f(G) + \varepsilon, \tag{1.332} \]

and define a function φ: [0, 1] → R by

\[ \varphi(\vartheta) := \Phi(\vartheta x_0 + (1-\vartheta)g) = \vartheta\,\Phi(x_0) + (1-\vartheta)\,\Phi(g) \quad (0 \le \vartheta \le 1). \tag{1.333} \]

Then φ is continuous on [0, 1], and by (1.331) we have φ(0) = Φ(g) ≤ sup Φ(G) < Φ(x₀) = φ(1), so there exists ϑ₀ ∈ [0, 1) such that

\[ \varphi(\vartheta_0) = \sup \Phi(G). \tag{1.334} \]

Put

\[ y_0 := \vartheta_0 x_0 + (1 - \vartheta_0)\, g. \tag{1.335} \]

Then, by (1.333) and (1.334),

\[ \Phi(y_0) = \Phi(\vartheta_0 x_0 + (1-\vartheta_0)g) = \varphi(\vartheta_0) = \sup \Phi(G); \tag{1.336} \]

furthermore, by (1.335), the quasi-convexity of f, and (1.329), (1.332), we get

\[ f(y_0) \le \max\{f(x_0), f(g)\} \le \max\{\inf f(G),\, \inf f(G) + \varepsilon\} = \inf f(G) + \varepsilon. \tag{1.337} \]

From (1.336) and (1.337) it follows that

\[ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y) \le \inf f(G) + \varepsilon, \]

whence, since Φ ∈ X*\{0} satisfying (1.331) and ε > 0 have been arbitrary, it follows that

\[ \inf f(G) \ge \sup_{\substack{\Phi\in X^*\setminus\{0\}\\ \sup\Phi(G)<\Phi(x_0)}}\; \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \tag{1.338} \]

On the other hand, by (1.328) and since f is upper semicontinuous and quasi-convex, the set

\[ A := \{y \in X \mid f(y) < \inf f(G)\} \tag{1.339} \]

is nonempty, open, and convex; furthermore, clearly, A ∩ G = ∅. Hence, by the separation theorem, there exists Φ₀ ∈ X*\{0} such that

\[ \sup \Phi_0(G) \le \inf \Phi_0(A). \tag{1.340} \]

Then inf Φ₀(A) < Φ₀(x₀) (since by (1.329) we have x₀ ∈ A, and so Lemma 1.8 applies), whence by (1.340), sup Φ₀(G) < Φ₀(x₀). Hence, by (1.338), we obtain

\[ \inf f(G) \ge \inf_{\substack{y\in X\\ \Phi_0(y)=\sup\Phi_0(G)}} f(y). \tag{1.341} \]

Let us show that in (1.341) equality holds, which will complete the proof. If not, then there exists y₀ ∈ X with Φ₀(y₀) = sup Φ₀(G) such that inf f(G) > f(y₀) (so y₀ ∈ A). Thus, the hyperplane

\[ H := \{y \in X \mid \Phi_0(y) = \sup \Phi_0(G)\} \tag{1.342} \]

contains y₀, and hence in the open neighborhood A of y₀ there exists y₁ ∈ A such that Φ₀(y₁) < sup Φ₀(G); indeed, one can take

\[ y_1 := \tfrac{1}{1-\mu}\, y_0 - \tfrac{\mu}{1-\mu}\, x_0, \]

with μ > 0 sufficiently small, since then y₁ is sufficiently near to y₀ (so y₁ ∈ A), and

\[ \Phi_0(y_1) = \tfrac{1}{1-\mu}\,\Phi_0(y_0) - \tfrac{\mu}{1-\mu}\,\Phi_0(x_0) = \tfrac{1}{1-\mu}\sup\Phi_0(G) - \tfrac{\mu}{1-\mu}\,\Phi_0(x_0) < \sup\Phi_0(G). \]

But this contradicts (1.340). □
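A finite-dimensional sketch of Theorem 1.17 (the data are invented for illustration): in X = R² take f(y) = ‖y‖, G = {y : y₁ ≥ 1}, and x₀ = 0, so (1.328)-(1.329) hold and inf f(G) = 1. The functional Φ(y) = −y₁ satisfies sup Φ(G) = −1 < 0 = Φ(x₀), and the infimum of f over the surrogate hyperplane {y : Φ(y) = sup Φ(G)} = {y : y₁ = 1} already attains inf f(G):

```python
import numpy as np

# Surrogate duality (1.330) in a made-up example: X = R^2, f(y) = ||y||,
# G = {y : y1 >= 1}, x0 = 0, so f(x0) = 0 < 1 = inf f(G).
g = np.arange(-4.0, 4.25, 0.25)           # exact binary grid containing 0.0 and 1.0
Y1, Y2 = np.meshgrid(g, g)
f = np.hypot(Y1, Y2)                      # f(y) = ||y||

primal = f[Y1 >= 1.0].min()               # inf f(G) = 1, attained at (1, 0)

# Phi(y) := -y1 has sup Phi(G) = -1 < 0 = Phi(x0); the surrogate problem
# minimizes f over the hyperplane {y : Phi(y) = sup Phi(G)} = {y : y1 = 1}
surrogate = f[Y1 == 1.0].min()
print(primal, surrogate)                  # both equal 1.0
```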
Remark 1.25. (a) Geometrically, Theorem 1.17 means that under the assumptions (1.328) and (1.329) we have

\[ \inf f(G) = \max_{H\in\mathcal{H}_{G,x_0}} \inf f(H), \tag{1.343} \]

where 𝓗_{G,x₀} denotes the set of all hyperplanes that quasi-support the set G and that strictly separate G and x₀ (see Lemma 1.4); thus, (1.343) reduces the computation of inf f(G) to that of inf f(H), for H ∈ 𝓗_{G,x₀}, so it may be called a "hyperplane theorem" of surrogate duality; note also that (1.343) generalizes the distance formula (1.249). In other words, Theorem 1.17 gives the following extension to quasi-convex optimization of the "reduction principle" of Remark 1.16(b): it permits one to apply any formula known for inf f(H) to the computation of inf f(G).

(b) The above proof of Theorem 1.17 shows that in (1.330) it is enough to take the max over the set

\[ \{\Phi \in X^*\setminus\{0\} \mid \sup \Phi(G) \le \inf \Phi(A)\} \tag{1.344} \]

(where A is defined by (1.339)), which is contained in the set

\[ \{\Phi \in X^*\setminus\{0\} \mid \sup \Phi(G) < \Phi(x_0)\} \tag{1.345} \]

occurring in (1.330). On the other hand, in (1.330) one can take the max over the larger set

\[ \{\Phi \in X^*\setminus\{0\} \mid \sup \Phi(G) \le \Phi(x_0)\}, \tag{1.346} \]

as follows by slightly modifying the above proof (namely, replacing the sign < by ≤ in (1.331), and ϑ₀ ∈ [0, 1) by ϑ₀ ∈ [0, 1] in (1.334)-(1.336)). Therefore, it is natural to ask whether one can further enlarge the set (1.346), e.g., to the "barrier cone" of G, defined by

\[ G^b := \{\Phi \in X^*\setminus\{0\} \mid \sup \Phi(G) < +\infty\}, \tag{1.347} \]

i.e., whether

\[ \inf f(G) = \max_{\Phi\in G^b}\; \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \tag{1.348} \]
However, the answer is negative, even when G is a closed convex set and f is a finite continuous convex function on a finite-dimensional space X, as shown by the following example: In X = R, let

\[ G = \{x \in X \mid -2 \le x \le -1\}, \qquad x_0 = -3, \tag{1.349} \]

\[ f(x) = x \quad (x \in X), \tag{1.350} \]

\[ \Phi_0(x) = x \quad (x \in X). \tag{1.351} \]

Then sup Φ₀(G) = −1 < +∞ (so Φ₀ ∈ G^b) and f(x₀) = −3 < −2 = inf f(G), but

\[ \{x \in X \mid \Phi_0(x) = \sup \Phi_0(G)\} = \{x \in X \mid x = -1\} = \{-1\}, \]

whence

\[ \inf f(G) = -2 < \inf_{\substack{x\in X\\ \Phi_0(x)=\sup\Phi_0(G)}} f(x) = f(-1) = -1, \]
and hence (1.348) does not hold. Thus, it is necessary to obtain a smaller set than G^b in the right-hand side of the duality formulas involving the hyperplanes {x ∈ X | Φ(x) = sup Φ(G)}, and this was accomplished above by the assumptions (1.328) (i.e., that the constraint set G is essential) and (1.329). Actually, in Chapter 3 we shall see that under certain assumptions the right-hand side of (1.348) gives an expression for sup f(G) (so it cannot be an expression for inf f(G) when f is not constant on G).

(c) A fixed element x₀ ∈ X satisfying (1.329) arises quite naturally in some optimization problems. For example, as has been observed above, best approximation may be regarded as a particular case of infimization, by taking X to be a normed linear space, x₀ ∈ X, and f: X → R̄ the convex function (1.264). Indeed, then, by (1.265), we have 0 < dist(x₀, G) (or equivalently, x₀ ∉ cl G) if and only if (1.329) holds. Thus, Theorem 1.17, in the geometric form (1.343), is an extension of Remark 1.15(b).

Lagrangian and surrogate duality theorems are closely related, as shown by the following observations:

Remark 1.26. (a) The substitution method: Let us recall the following method of deducing Lagrangian duality results from surrogate duality results, introduced in [214], which we shall call the substitution method: Given a set G ⊆ X and a function f: X → R̄, if we have a surrogate duality result for inf f(G), expressed using, e.g., inf f(H), for H belonging to a family of hyperplanes or closed half-spaces or open half-spaces (see the above reduction principle), and if we know a Lagrangian duality formula for inf f(H), then by substituting it in the surrogate duality result for inf f(G), we can obtain a Lagrangian duality result for inf f(G). Usually, it is convenient to assume that f is convex, since then the known Lagrangian duality theorems may be applicable. For example, following [214, pp. 247-248], let us show that using this substitution method, the surrogate duality result of Theorem 1.17, together with the Lagrangian duality formula for inf f(H) given in Corollary 1.10 (with Φ₀ = Φ, d₀ = sup Φ(G)), imply Theorem 1.13. Indeed, assume first that the constraint set G is not essential, that is, inf f(X) = inf f(G). Then

\[ \inf f(X) = \inf f(G) \ge \sup_{\Phi\in X^*}\, \inf_{y\in X}\,\{\, f(y) - \Phi(y) + \inf \Phi(G) \,\} \ge \inf f(X) \]

(the last inequality follows by taking Φ = 0), which implies (1.268). On the other hand, assume now that the constraint set G is "essential," i.e., that we have (1.328), so there exists x₀ ∈ X satisfying (1.329). In order to handle this case, note that by Remark 1.25(b), and since −inf(−Φ(G)) = sup Φ(G) < Φ(x₀) is equivalent to inf(−Φ)(G) > (−Φ)(x₀), we can write (1.330) in the equivalent form
\[ \inf f(G) = \max_{\substack{\Phi\in X^*\setminus\{0\}\\ \Phi(x_0)<\inf\Phi(G)}}\; \inf_{\substack{y\in X\\ \Phi(y)=\inf\Phi(G)}} f(y). \tag{1.352} \]
Now formula (1.352) and Corollary 1.10 (with Φ₀ = Φ, d₀ = inf Φ(G)) imply

\[ \inf f(G) = \max_{\substack{\Phi\in X^*\setminus\{0\}\\ \Phi(x_0)<\inf\Phi(G)}}\; \inf_{\substack{y\in X\\ \Phi(y)=\inf\Phi(G)}} f(y) = \max_{\substack{\Phi\in X^*\setminus\{0\}\\ \Phi(x_0)<\inf\Phi(G)}}\; \max_{\eta\in R}\, \inf_{y\in X}\,\{\, f(y) - \eta\Phi(y) + \eta \inf \Phi(G) \,\}. \tag{1.353} \]

Hence, there exist Φ₀ ∈ X*\{0} with Φ₀(x₀) < inf Φ₀(G) and η₀ ∈ R such that

\[ \inf f(G) = \inf_{y\in X}\,\{\, f(y) - \eta_0\Phi_0(y) + \eta_0 \inf \Phi_0(G) \,\}. \tag{1.354} \]

We claim that η₀ > 0. Indeed, by (1.354) and (1.329), we have

\[ \inf f(G) \le f(x_0) - \eta_0\Phi_0(x_0) + \eta_0 \inf \Phi_0(G) < \inf f(G) - \eta_0\Phi_0(x_0) + \eta_0 \inf \Phi_0(G), \]

whence

\[ \eta_0\{ -\Phi_0(x_0) + \inf \Phi_0(G) \} = -\eta_0\Phi_0(x_0) + \eta_0 \inf \Phi_0(G) > 0. \]

This inequality, together with Φ₀(x₀) < inf Φ₀(G), implies that η₀ > 0, which proves the claim. Now, by η₀ > 0, we have η₀ inf Φ₀(G) = inf η₀Φ₀(G), and hence by (1.354),

\[ \inf f(G) = \inf_{y\in X}\,\{\, f(y) - \eta_0\Phi_0(y) + \inf \eta_0\Phi_0(G) \,\}, \]

which together with (1.269), implies (1.268) (with the max attained for Φ = η₀Φ₀), completing the proof of Theorem 1.13. Note that although in the case of Lagrangian duality for convex optimization the direct method of proof is simpler than the above substitution method, it will turn out that in some other cases, like that of reverse convex optimization, the situation is different (see Chapter 7).

(b) In the converse direction, one can show (see [214, p. 247]) that Theorem 1.13 of Lagrangian duality and the "duality inequality" ≥ in Theorem 1.17 of surrogate duality (which is the simpler part of Theorem 1.17) imply Theorem 1.17.

Now we shall show that, replacing hyperplanes by closed half-spaces, one can also give corresponding "half-space theorems" of surrogate duality, involving the whole barrier cone G^b (defined by (1.347)) in the right-hand side. Indeed, we have the following half-space theorem of weak duality.

Theorem 1.18. Let X be a locally convex space, G a subset of X with G^b ≠ ∅, and f: X → R̄ a function. The following statements are equivalent:
1°. We have
\[ \inf f(G) = \sup_{\Phi\in X^*\setminus\{0\}}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y) = \sup_{\Phi\in G^b}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y). \tag{1.355} \]

2°. For each d ∈ R, d < inf f(G), there exists Φ_d ∈ X*\{0} such that

\[ \sup \Phi_d(G) < \Phi_d(x) \quad (x \in A_d(f)). \tag{1.356} \]

3°. For each d ∈ R, d < inf f(G), there exists Φ_d ∈ X*\{0} such that

\[ \sup \Phi_d(G) < \Phi_d(x) \quad (x \in S_d(f)). \tag{1.357} \]
Proof. The second equality in (1.355) is obvious, since G^b ≠ ∅ and since for Φ ∈ X*\G^b we have {y ∈ X | Φ(y) ≤ sup Φ(G)} = X. Rather than giving a direct proof of Theorem 1.18, as in [231], we shall show how the result can be deduced from a more general duality theorem of Chapter 3. Define a polarity Δ = Δ_G: 2^X → 2^{X*\{0}} by

\[ \Delta_G(C) := \{\, \Phi \in X^*\setminus\{0\} \mid \Phi(c) > \sup \Phi(G)\ (c \in C) \,\} \quad (C \subseteq X). \tag{1.358} \]

For this polarity we have, by (1.150),

\[ (\Delta_G)'(\{\Phi\}) = \{\, x \in X \mid \Phi(x) > \sup \Phi(G) \,\} \quad (\Phi \in X^*\setminus\{0\}). \tag{1.359} \]

Then 1° is nothing but (3.56) below, and we shall show that conditions 2° and 3° above are equivalent to conditions 2° and 3° of Chapter 3, Theorem 3.3, respectively, for W = X*\{0}, Δ = Δ_G, and α = inf f(G). Indeed, first, for each d ∈ R, d > inf f(G), each g ∈ G such that d > f(g) ≥ inf f(G), and each Φ ∈ X*\{0}, we have g ∈ A_d(f) ∩ ∁(Δ_G)′({Φ}) (note: equivalently, one can observe directly that we always have

\[ \inf f(G) \ge \beta_\Delta := \sup_{\Phi\in X^*\setminus\{0\}}\, \inf f\big(\complement(\Delta_G)'(\{\Phi\})\big) = \sup_{\Phi\in X^*\setminus\{0\}}\; \inf_{\substack{x\in X\\ \Phi(x)\le\sup\Phi(G)}} f(x), \tag{1.360} \]

because G ⊆ {x ∈ X | Φ(x) ≤ sup Φ(G)} for each Φ ∈ X*\{0}). Furthermore, for each d ∈ R and Φ_d ∈ X*\{0}, (1.356) holds if and only if

\[ A_d(f) \cap \complement(\Delta_G)'(\{\Phi_d\}) = A_d(f) \cap \{\, x \in X \mid \Phi_d(x) \le \sup \Phi_d(G) \,\} = \emptyset. \]

Hence, the assertion on condition 2° follows. The proof for condition 3° is similar. □

Remark 1.27. (a) The assumption G^b ≠ ∅ implies that G ≠ X (since otherwise, for each Φ ∈ X*\{0} we would have sup Φ(G) = sup Φ(X) = +∞,
so G^b = ∅), and if G is convex, then the converse is also true (take any x ∉ cl G, and apply the strict separation theorem).

(b) Geometrically, formula (1.355) means that

\[ \inf f(G) = \sup_{\Phi\in G^b}\, \inf f\big(V_\Phi^{\sup\Phi(G)}\big), \tag{1.362} \]

where V_Φ^d is as in (1.30), with d = sup Φ(G), i.e., the smallest closed half-space determined by Φ that contains G; note that if Φ ∈ G^b, then V_Φ^{sup Φ(G)} ≠ X.

(c) If G^b ≠ ∅ and (1.355) holds, then we also have

\[ \inf f(G) = \sup_{\substack{\Phi\in G^b,\ d\in R\\ \sup\Phi(G)\le d}}\; \inf_{\substack{x\in X\\ \Phi(x)\le d}} f(x), \tag{1.363} \]

or, geometrically,

\[ \inf f(G) = \sup_{\substack{V\in\mathcal{V}\\ G\subseteq V}}\, \inf f(V), \tag{1.364} \]

where 𝒱 denotes the set of all closed half-spaces in X. Indeed, by (1.362) we have the inequality ≤ in (1.364), and on the other hand, for any set V with G ⊆ V we have inf f(G) ≥ inf f(V).

We have the following result of strong duality.

Theorem 1.19. Let X be a locally convex space, G a subset of X with G^b ≠ ∅, and f: X → R̄ a function. The following statements are equivalent:

1°. We have

\[ \inf f(G) = \max_{\Phi\in G^b}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y). \tag{1.365} \]
2°. There exists Φ_α ∈ X*\{0} such that

\[ \sup \Phi_\alpha(G) \le \Phi_\alpha(x) \quad (x \in A_\alpha(f)), \tag{1.366} \]

where α = inf f(G).

Proof. Assume 1°. If α = −∞ (so (1.365) holds by (1.360)), then (1.366) is vacuously satisfied for any Φ_α ∈ X*\{0}. If α > −∞, then, by (1.365), there exists Φ_α ∈ G^b such that

\[ \alpha = \inf f(G) = \inf_{\substack{y\in X\\ \Phi_\alpha(y)\le\sup\Phi_\alpha(G)}} f(y), \tag{1.367} \]

whence, by Chapter 3, Lemma 3.4(a) (applied to the half-space {y ∈ X | Φ_α(y) ≤ sup Φ_α(G)}), we obtain (1.366).

Conversely, assume now 2°. If α = −∞ (so (1.366) is vacuously satisfied for any Φ_α ∈ X*\{0}), then by (1.360) we have (1.365). If α > −∞, then by 2° and Chapter 3, Lemma 3.4(a), we have

\[ \inf_{\substack{y\in X\\ \Phi_\alpha(y)\le\sup\Phi_\alpha(G)}} f(y) \ge \alpha = \inf f(G), \]

whence by (1.360), we obtain (1.365). □
As an application of Theorem 1.19, let us give now a sufficient condition for strong duality.

Theorem 1.20. If X is a locally convex space, G a convex subset of X with G^b ≠ ∅, and f: X → R̄ a function such that A_α(f) is nonempty, convex, and open, where α = inf f(G) (in particular, if f is upper semicontinuous and quasi-convex), then we have (1.365).

Proof. Since G and A_α(f) are nonempty convex subsets of X with A_α(f) open, and since G ∩ A_α(f) = ∅, by the separation theorem there exists Φ ∈ X*\{0} such that

\[ \sup \Phi(G) \le \inf \Phi(A_\alpha(f)) \le \Phi(x) \quad (x \in A_\alpha(f)), \]

where the last inequality holds by Lemma 1.8. Hence, by Theorem 1.19, we have (1.365). □

We also have the following result on surrogate duality formulas having inf_{y∈X, Φ(y)≤sup Φ(G)} f(y) in the right-hand side (half-space theorems):

Proposition 1.4. Let X be a locally convex space, G a subset of X, f: X → R̄ a function satisfying (1.328), and x₀ an element of X satisfying (1.329). The following statements are equivalent:

1°. We have (1.355).

2°. We have

\[ \inf f(G) = \sup_{\substack{\Phi\in G^b\\ \sup\Phi(G)<\Phi(x_0)}}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y). \tag{1.368} \]
Also, a similar equivalence holds for the corresponding strong duality equalities (i.e., for (1.365) and sup replaced by max in (1.368)).

Proof. By (1.329), we have

\[ \sup_{\substack{\Phi\in G^b\\ \Phi(x_0)\le\sup\Phi(G)}}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y) \le f(x_0) < \inf f(G), \]

and hence if 1° holds, then we have 2°. On the other hand, by G ⊆ {y ∈ X | Φ(y) ≤ sup Φ(G)} (Φ ∈ X*), we have

\[ \inf f(G) \ge \sup_{\Phi\in G^b}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y) \ge \sup_{\substack{\Phi\in G^b\\ \sup\Phi(G)<\Phi(x_0)}}\; \inf_{\substack{y\in X\\ \Phi(y)\le\sup\Phi(G)}} f(y), \]

and hence if 2° holds, then we have 1°. The proof for the corresponding strong duality equalities is similar. □
Remark 1.28. Besides the close connection between Lagrangian and surrogate duality theorems, mentioned in Remark 1.26(a), for any function $f\colon X\to\overline R$ we have the following obvious relation between the Lagrangian dual value $\beta_{\mathrm{Lagr}}$ (see, e.g., (1.268)) and the surrogate dual value $\beta_{\mathrm{surr}}$ of type (1.355):
$$\inf f(G)\ge\beta_{\mathrm{surr}}:=\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)<\sup\Phi(G)}}f(y)\ge\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{g\in G}\{f(g)+\Phi(g)\dotplus-\sup\Phi(G)\}\ge\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{y\in X}\{f(y)+\Phi(y)\dotplus-\sup\Phi(G)\}=\beta_{\mathrm{Lagr}};\tag{1.369}$$
thus, in particular, the equality $\alpha=\beta_{\mathrm{surr}}$ holds for a larger class of problems (1.261) than the equality $\alpha=\beta_{\mathrm{Lagr}}$. However, surrogate duality $\alpha=\beta_{\mathrm{surr}}$ is useful even for some convex problems for which we have $\alpha=\beta_{\mathrm{Lagr}}$ (whence also $\alpha=\beta_{\mathrm{surr}}$), as shown, for example, by the problem of best approximation (see Remark 1.25(c)). Furthermore, surrogate dual problems are also convenient for computations.

Replacing the closed half-spaces $\{x\in X\mid \Phi(x)\le\sup\Phi(G)\}$ by the sets $\{x\in X\mid \Phi(x)\in\Phi(G)\}$, one obtains the following result:

Theorem 1.21. Let $X$ be a locally convex space, $G$ a subset of $X$, and $f\colon X\to\overline R$ a function. The following statements are equivalent:
1°. We have
$$\inf f(G)=\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)\in\Phi(G)}}f(y).\tag{1.370}$$
2°. For each $d\in R$, $d<\inf f(G)$, there exists $\Phi_d\in X^*\setminus\{0\}$ such that
$$A_d(f)\cap\{x\in X\mid \Phi_d(x)\in\Phi_d(G)\}=\emptyset.\tag{1.371}$$
3°. For each $d\in R$, $d<\inf f(G)$, there exists $\Phi_d\in X^*\setminus\{0\}$ such that
$$S_d(f)\cap\{x\in X\mid \Phi_d(x)\in\Phi_d(G)\}=\emptyset.\tag{1.372}$$
Proof. One can proceed similarly to the above proof of Theorem 1.18, defining a polarity $\Delta=\Delta^G\colon 2^X\to 2^{X^*\setminus\{0\}}$ by
$$\Delta^G(C):=\{\Phi\in X^*\setminus\{0\}\mid \Phi(C)\cap\Phi(G)=\emptyset\}\qquad(C\subset X),\tag{1.373}$$
and observing that for this polarity we have, by (1.150),
$$(\Delta^G)'(\{\Phi\})=\{x\in X\mid \Phi(x)\notin\Phi(G)\}\qquad(\Phi\in X^*\setminus\{0\}),\tag{1.374}$$
$$\inf f(G)\ge\beta^G:=\sup_{\Phi\in X^*\setminus\{0\}}\inf f\bigl(\complement(\Delta^G)'(\{\Phi\})\bigr)=\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{x\in X\\ \Phi(x)\in\Phi(G)}}f(x).\tag{1.375}$$ □
Remark 1.29. Formula (1.370) suggests splitting the left-hand and right-hand sides of this duality formula into two optimization "problems," namely, the initial primal problem (P) of (1.261) and the "dual problem"
$$(D)\qquad \beta=\sup_{\Phi\in X^*\setminus\{0\}}\lambda(\Phi),\tag{1.376}$$
where
$$\lambda(\Phi)=\inf_{\substack{y\in X\\ \Phi(y)\in\Phi(G)}}f(y)\qquad(\Phi\in X^*\setminus\{0\});\tag{1.377}$$
this is a surrogate dual problem in the sense mentioned above, with the "surrogate constraint sets"
$$\Omega_{G,\Phi}:=\{y\in X\mid \Phi(y)\in\Phi(G)\}\qquad(\Phi\in X^*\setminus\{0\});\tag{1.378}$$
in other words,
$$\lambda(\Phi)=\inf f(\Omega_{G,\Phi})\qquad(\Phi\in X^*\setminus\{0\}).\tag{1.379}$$
Naturally, similar remarks can be made also for the preceding surrogate duality formulas. The next result of strong duality corresponds to Theorem 1.19.

Theorem 1.22. Let $X$ be a locally convex space, $G$ a subset of $X$, and $f\colon X\to\overline R$ a function. The following statements are equivalent:
1°. We have
$$\inf f(G)=\max_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)\in\Phi(G)}}f(y).\tag{1.380}$$
2°. There exists $\Phi_\alpha\in X^*\setminus\{0\}$ such that
$$A_\alpha(f)\cap\{x\in X\mid \Phi_\alpha(x)\in\Phi_\alpha(G)\}=\emptyset.\tag{1.381}$$
Proof. The proof is similar to the above proof of Theorem 1.19, using now the set $\Omega=\{y\in X\mid \Phi_\alpha(y)\in\Phi_\alpha(G)\}$. □
1.4 Duality for convex and quasi-convex infimization
We also have the following result corresponding to Proposition 1.4.

Proposition 1.5. Let $X$ be a locally convex space, $G$ a subset of $X$, $f\colon X\to\overline R$ a function satisfying (1.328), and $x_0$ an element of $X$ satisfying (1.329). The following statements are equivalent:
1°. We have (1.370).
2°. We have
$$\inf f(G)=\sup_{\substack{\Phi\in X^*\setminus\{0\}\\ \Phi(x_0)\in\Phi(G)}}\ \inf_{\substack{y\in X\\ \Phi(y)\in\Phi(G)}}f(y).\tag{1.382}$$
Also, a similar equivalence holds for the corresponding strong duality equalities (i.e., for (1.380) and sup replaced by max in (1.382)).

Proof. The proof is similar to that of Proposition 1.4, observing that we have
$$\sup_{\substack{\Phi\in X^*\setminus\{0\}\\ \Phi(x_0)\in\Phi(G)}}\ \inf_{\substack{y\in X\\ \Phi(y)\in\Phi(G)}}f(y)\le f(x_0)\le\inf f(G),$$
and using the obvious inclusion $G\subset\{y\in X\mid \Phi(y)\in\Phi(G)\}$. □
1.4.2 Perturbational theory

The theory of perturbational dual problems, of which we shall present some elements in this section, is a convenient tool to handle duality both for problems (1.261) with general constraint sets $G$ and for problems (1.298) with the structured constraint sets (1.296). Let us consider the primal infimization problem (P) of (1.261). Clearly,
$$\inf f(G)=\inf\bar f(X),\tag{1.383}$$
where $\bar f\colon X\to\overline R$ is the function defined by
$$\bar f(x):=f(x)+\chi_G(x)=\begin{cases}f(x)&\text{if }x\in G,\\ +\infty&\text{if }x\notin G.\end{cases}\tag{1.384}$$
Thus, problem (1.261) and the primal problem
$$(\bar P)\qquad \bar\alpha=\inf\bar f(X)\tag{1.385}$$
have the same value. Moreover, if $f|_G\not\equiv+\infty$, which we shall assume in the sequel without any special mention, then problems (P) and $(\bar P)$ have the same optimal solutions; indeed, if $g_0\in G$, $f(g_0)=\inf f(G)$, then $\bar f(g_0)=f(g_0)+\chi_G(g_0)=\inf f(G)=\inf\bar f(X)$, and conversely, if $x_0\in X$ and $\bar f(x_0)=\inf\bar f(X)$, then $f(x_0)+\chi_G(x_0)=\bar f(x_0)=\inf\bar f(X)=\inf f(G)<+\infty$, whence $x_0\in G$ and $f(x_0)=\inf f(G)$. Furthermore, if $G$ is convex and $f\colon X\to\overline R$ is a function such that $f|_G$ is convex (respectively, quasi-convex), then $\bar f$ is convex (respectively, quasi-convex)
on the whole space $X$. Therefore, we shall assume from the beginning that we are given an unconstrained primal infimization problem
$$(P)\qquad \alpha=\inf\phi(X),\tag{1.386}$$
where $\phi\colon X\to\overline R$ is a function, and then, taking in particular $\phi=f+\chi_G$ and a suitable perturbation $p$, the duality theory for (P) of (1.386) will yield a duality theory for (P) of (1.261). A classical way of defining a dual problem to the primal infimization problem (P) of (1.386) is to embed it into a family of "perturbed" infimization problems, as follows. Let $Z$ be a locally convex space (called the set of "perturbations" or of "parameters"), and $p\colon X\times Z\to\overline R$ a function (called a "perturbation function," or "parameterization"), such that
$$p(x,0)=\phi(x)\qquad(x\in X),\tag{1.387}$$
so (P) of (1.386) is nothing other than
$$(P)\qquad \alpha=\inf_{x\in X}p(x,0).\tag{1.388}$$
With the aid of this perturbation function, (P) is embedded into the family of "perturbed" (or "parameterized") infimization problems
$$(P_z)\qquad v(z):=\inf_{x\in X}p(x,z)\qquad(z\in Z);\tag{1.389}$$
indeed, then $(P)=(P_0)$ and
$$\alpha=v(0).\tag{1.390}$$
One defines the Lagrangian dual problem associated with the perturbation function $p$ (or relative to the parameterization $(Z,p)$) as the unconstrained supremization problem
$$(D)\qquad \beta:=\sup\lambda(Z^*),\tag{1.391}$$
where $\lambda\colon Z^*\to\overline R$ is the dual objective function defined by
$$\lambda(\Psi):=\inf_{x\in X}\Bigl\{\inf_{z\in Z}\{p(x,z)-\Psi(z)\}\Bigr\}\qquad(\Psi\in Z^*).\tag{1.392}$$
By the canonical identification of $X^*\times Z^*$ with $(X\times Z)^*$, given by (1.28), we can write
$$\lambda(\Psi)=-\sup_{x\in X}\Bigl\{-\inf_{z\in Z}\{p(x,z)-\Psi(z)\}\Bigr\}=-\sup_{x\in X}\sup_{z\in Z}\{\Psi(z)-p(x,z)\}=-\sup_{(x,z)\in X\times Z}\{(0,\Psi)(x,z)-p(x,z)\}=-p^*(0,\Psi)\qquad(\Psi\in Z^*),\tag{1.393}$$
and thus (D) is nothing other than the problem of supremization of the concave upper semicontinuous function $\lambda$:
$$(D)\qquad \beta=\sup_{\Psi\in Z^*}\lambda(\Psi)=\sup_{\Psi\in Z^*}\{-p^*(0,\Psi)\}.\tag{1.394}$$
One says that weak duality holds if $\alpha=\beta$, and strong duality holds if $\alpha=\beta$ and the sup in the dual problem (D) of (1.391) is a max, i.e., it is attained for some $\Psi_0\in Z^*$ (in other words, if (D) has an optimal solution $\Psi_0$). There are known various sufficient conditions for achieving strong duality, of which we shall mention only the following one:

Proposition 1.6 (see, e.g., Ekeland and Temam [54, Ch. III, Propositions 2.3 and 2.2]). With the above notation, assume that $\alpha$ is finite, $p$ is convex, and there exists an element $x_0\in X$ such that the function $z\mapsto p(x_0,z)$ is finite and continuous at $z=0$. Then
$$\inf\phi(X)=\max\lambda(Z^*);\tag{1.395}$$
i.e., we have $\alpha=\beta$ and the sup in (1.391) is attained for some $\Psi_0\in Z^*$.

The following is a useful tool for the study of the Lagrangian dual problem (1.391), (1.392) associated with $p$: the function $L\colon X\times Z^*\to\overline R$ defined by
$$L(x,\Psi):=\inf_{z\in Z}\{p(x,z)-\Psi(z)\}\qquad(x\in X,\ \Psi\in Z^*)\tag{1.396}$$
is called the Lagrangian function, or simply the Lagrangian, associated with $p$. Thus, considering the partial functions
$$p_x(z):=p(x,z)\qquad(x\in X,\ z\in Z),\tag{1.397}$$
we have
$$L(x,\Psi)=\inf_{z\in Z}\{p_x(z)-\Psi(z)\}=-p_x^*(\Psi)\qquad(x\in X,\ \Psi\in Z^*).\tag{1.398}$$
By (1.392) and (1.396),
$$\lambda(\Psi)=\inf_{x\in X}L(x,\Psi)\qquad(\Psi\in Z^*),\tag{1.399}$$
and hence by (1.391),
$$\beta=\sup_{\Psi\in Z^*}\inf_{x\in X}L(x,\Psi).\tag{1.400}$$
On the other hand, by (1.387), (1.397), and (1.398),
$$\phi(x)=p_x(0)\ge p_x^{**}(0)=\sup_{\Psi\in Z^*}\{\Psi(0)+L(x,\Psi)\}=\sup_{\Psi\in Z^*}L(x,\Psi)\qquad(x\in X),\tag{1.401}$$
and hence by the inequality $\inf\sup\ge\sup\inf$ and (1.400), we obtain the "duality inequality"
$$\alpha=\inf\phi(X)\ge\inf_{x\in X}\sup_{\Psi\in Z^*}L(x,\Psi)\ge\sup_{\Psi\in Z^*}\inf_{x\in X}L(x,\Psi)=\beta.\tag{1.402}$$
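To make the scheme (1.387)–(1.402) concrete, here is a minimal numeric sketch (our own toy instance, not from the text): $X=Z=R$, $\phi(x)=x^2+\chi_{[1,+\infty)}(x)$, with the perturbation $p(x,z)=x^2$ for $x\ge 1-z$ and $+\infty$ otherwise, so that $p(x,0)=\phi(x)$. For this instance the Lagrangian (1.396) and dual objective (1.399) can be written in closed form.

```python
# Toy instance of the perturbational scheme (1.386)-(1.402); all data chosen
# by us for illustration: X = Z = R, phi(x) = x^2 + indicator(x >= 1),
# p(x, z) = x^2 if x >= 1 - z, +infinity otherwise.
ALPHA = 1.0  # primal value: inf phi = 1, attained at x = 1

NEG_INF = float("-inf")

def lagrangian(x, psi):
    # L(x, psi) = inf_z {p(x, z) - psi*z}.  The feasible z form [1 - x, +inf),
    # so the inf is finite only for psi <= 0, attained at z = 1 - x:
    return x**2 - psi * (1 - x) if psi <= 0 else NEG_INF

def dual(psi):
    # lambda(psi) = inf_x L(x, psi); for psi <= 0 the quadratic
    # x^2 + psi*x - psi is minimized at x = -psi/2:
    return -psi**2 / 4 - psi if psi <= 0 else NEG_INF

psis = [-6 + k / 10000 for k in range(60001)]   # grid on (-6, 0]
beta = max(dual(psi) for psi in psis)           # sup_psi inf_x L(x, psi)
print(ALPHA, beta)
```

Here the grid maximum is reached at $\Psi=-2$ with $\lambda(-2)=1=\alpha$, so the duality inequality (1.402) holds with equality (and the sup in (1.400) is attained) for this instance.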
Actually, one is interested in obtaining conditions for "weak duality," i.e., the equality
$$\alpha=\inf_{x\in X}\sup_{\Psi\in Z^*}L(x,\Psi)=\sup_{\Psi\in Z^*}\inf_{x\in X}L(x,\Psi)=\beta,\tag{1.403}$$
or strong duality, i.e., (1.403) with the second sup of (1.403) attained for some $\Psi_0\in Z^*$; in general, it is convenient to use, to this end, some minimax theorems, such as Theorems 1.8, 1.9. If in addition $p_x(0)=p_x^{**}(0)$ ($x\in X$) (e.g., if for each $x\in X$ the partial function $p_x$ of (1.397) is proper, convex, and lower semicontinuous), then, similarly to (1.401), there follows
$$\phi(x)=p_x(0)=p_x^{**}(0)=\sup_{\Psi\in Z^*}\{\Psi(0)+L(x,\Psi)\}=\sup_{\Psi\in Z^*}L(x,\Psi)\qquad(x\in X),\tag{1.404}$$
and thus in this case,
$$\alpha=\inf_{x\in X}\sup_{\Psi\in Z^*}L(x,\Psi).\tag{1.405}$$
Remark 1.30. (a) If $p_x=p_x^{**}$ ($x\in X$) (i.e., if for each $x\in X$ the partial function $p_x$ of (1.397) is proper, convex, and lower semicontinuous), then for all $x\in X$ and $z\in Z$ we have
$$p(x,z)=p_x(z)=p_x^{**}(z)=\sup_{\Psi\in Z^*}\{\Psi(z)-p_x^*(\Psi)\}=\sup_{\Psi\in Z^*}\{\Psi(z)+L(x,\Psi)\},\tag{1.406}$$
which expresses $p$ with the aid of $L$.
(b) It is well known and easy to show (see, e.g., Ekeland and Temam [54, Ch. III, Lemma 2.1 and Remark 2.1]) that if $X$ and $Z$ are linear spaces and $p\colon X\times Z\to\overline R$ is convex, then so is the "(optimal) value function" (also called "marginal function") $v\colon Z\to\overline R$ (where $v$ stands for "value") defined by (1.389); also, by (1.387) and (1.389), we have $\alpha=v(0)$. There are many duality results involving the value function $v$. For example, note that by (1.392) and (1.389),
$$\lambda(\Psi)=\inf_{x\in X}\inf_{z\in Z}\{p(x,z)-\Psi(z)\}=\inf_{z\in Z}\Bigl\{\inf_{x\in X}p(x,z)-\Psi(z)\Bigr\}=\inf_{z\in Z}\{v(z)-\Psi(z)\}=-v^*(\Psi)\qquad(\Psi\in Z^*).\tag{1.407}$$
Also, by (1.394) and (1.407) we have
$$\beta=\sup_{\Psi\in Z^*}\lambda(\Psi)=\sup_{\Psi\in Z^*}\{\Psi(0)-v^*(\Psi)\}=v^{**}(0);\tag{1.408}$$
hence weak duality $\alpha=\beta$ holds if and only if $v(0)=v^{**}(0)$.
By (1.401) and (1.399), we have
$$\phi(x)\ge L(x,\Psi)\ge\lambda(\Psi)\qquad(x\in X,\ \Psi\in Z^*).\tag{1.409}$$
A pair $(x_0,\Psi_0)\in X\times Z^*$ is called a saddle point of $L$ if
$$L(x,\Psi_0)\ge L(x_0,\Psi_0)\ge L(x_0,\Psi)\qquad(x\in X,\ \Psi\in Z^*).\tag{1.410}$$
When $p_x(0)=p_x^{**}(0)$ ($x\in X$), by (1.404) and (1.399) condition (1.410) is equivalent to
$$\phi(x_0)=L(x_0,\Psi_0)=\lambda(\Psi_0).\tag{1.411}$$
Theorem 1.23. If (1.405) holds, then for a pair $(x_0,\Psi_0)\in X\times Z^*$ the following statements are equivalent:
1°. $x_0\in X$ is a solution of the primal problem (P) of (1.386), $\Psi_0\in Z^*$ is a solution of the dual problem (D) of (1.391), and we have
$$\min\phi(X)=\max\lambda(Z^*).\tag{1.412}$$
2°. $(x_0,\Psi_0)$ is a saddle point of the Lagrangian $L$.

Proof. See, e.g., [185, Theorem 2] or [54, Ch. III, Proposition 3.1]. □
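As a numeric illustration of the saddle-point condition (1.410)–(1.411) (a sketch on a toy Lagrangian of our own choosing: $L(x,\Psi)=x^2-\Psi(1-x)$ on $X=R$ with $\Psi\le0$, arising from the constraint $x\ge1$):

```python
# Check that (x0, psi0) = (1, -2) is a saddle point of the toy Lagrangian
# L(x, psi) = x^2 - psi*(1 - x) in the sense of (1.410):
#   L(x, psi0) >= L(x0, psi0) >= L(x0, psi)  for all x and all psi <= 0.
def L(x, psi):
    return x**2 - psi * (1 - x)

x0, psi0 = 1.0, -2.0
xs = [-5 + k / 1000 for k in range(10001)]
psis = [-10 + k / 1000 for k in range(10001)]   # grid on [-10, 0]

assert all(L(x, psi0) >= L(x0, psi0) - 1e-12 for x in xs)      # x0 minimizes L(., psi0)
assert all(L(x0, psi) <= L(x0, psi0) + 1e-12 for psi in psis)  # psi0 maximizes L(x0, .)
print(L(x0, psi0))  # common value, cf. (1.411)
```

The common value $L(x_0,\Psi_0)=1$ equals both the primal and the dual optimal value for this instance, as (1.411) predicts.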
One can show (see, e.g., [185]) that this duality theory is symmetric; i.e., one can embed the dual problem (D) of (1.391) into a family of perturbed problems that generates, as the dual problem to (D), the initial problem (P). The above scheme encompasses as particular cases many known unperturbational dual problems to convex infimization problems. For example, given a linear system $(X,Z,u)$ (see Definition 1.3(b)), let $f\colon X\to\overline R$ and $h\colon Z\to\overline R$ be two convex functions, and let us consider the primal infimization problem
$$(P)\qquad \alpha=\inf_{x\in X}\{f(x)+h(u(x))\}.\tag{1.413}$$
In what follows, for simplicity, when dealing with the composition of two functions, we shall omit the symbol of composition $\circ$ between them; thus, instead of $(h\circ u)(x)$ and $(\Psi\circ u)(x)$ we shall write $hu(x)$ and $\Psi u(x)$, respectively. For problem (P) of (1.413), let
$$\phi=f+hu\ (=f+(h\circ u))\tag{1.414}$$
and let us define the perturbation function $p\colon X\times Z\to\overline R$ by
$$p(x,z):=f(x)+h(u(x)-z)\qquad(x\in X,\ z\in Z),\tag{1.415}$$
which satisfies (1.387). Then, by (1.398), (1.397), (1.415), (1.86), and (1.91), we have
$$-L(x,\Psi)=p_x^*(\Psi)=\sup_{z\in Z}\{\Psi(z)-p(x,z)\}=\sup_{z\in Z}\{-\Psi(u(x)-z)+\Psi(u(x))-f(x)\dotplus-h(u(x)-z)\}$$
$$=-f(x)+\Psi(u(x))\dotplus\sup_{z\in Z}\{-\Psi(u(x)-z)\dotplus-h(u(x)-z)\}=-f(x)+\Psi(u(x))\dotplus h^*(-\Psi)\qquad(x\in X,\ \Psi\in Z^*),\tag{1.416}$$
and hence by (1.399) and (1.391), the dual objective function $\lambda$ and the dual problem (D) are
$$\lambda(\Psi)=\inf_{x\in X}L(x,\Psi)=\inf_{x\in X}\{f(x)-\Psi u(x)\dotplus-h^*(-\Psi)\}=-f^*(\Psi u)\dotplus-h^*(-\Psi)\qquad(\Psi\in Z^*),\tag{1.417}$$
$$(D)\qquad \beta=\sup_{\Psi\in Z^*}\{-f^*(\Psi u)\dotplus-h^*(-\Psi)\}.\tag{1.418}$$
Given a convex system $(X,Z,u)$ (see Definition 1.3(c)) and $f$, $h$, (P), $\phi$, and $p$ as above, the dual objective function $\lambda$ and the dual problem (D) are still (1.417) and (1.418), respectively, but with $\Psi u\in\overline R^X$ instead of $\Psi u\in X^*$ (because we apply the Fenchel–Moreau conjugation (1.206) with $W=\overline R^X$ instead of $W=X^*$).

Remark 1.31. For any (not necessarily linear) mapping $u\colon X\to Z$ and any $\Psi\in Z^*$, $\Psi\circ u=\Psi u$ is denoted by $u^*(\Psi)$, where $u^*\colon Z^*\to X^*$ is the "adjoint" of $u$; so $u^*$ is defined by
$$u^*(\Psi)(x):=\Psi(u(x))\qquad(x\in X,\ \Psi\in Z^*);\tag{1.419}$$
however, in the present chapter we shall not use this notation, in order to avoid confusion with the Fenchel and Fenchel–Moreau conjugate functions.

From the above we obtain the following result, whose part (a) is a classical theorem of Fenchel–Rockafellar:
(1.420)
vi/ez*
(b) Let iX, Z,u) be a convex system, where Z = iZ, <) is a partially ordered locally convex space, and let f: X -^ R, h: Z ^y R be two convex functions for which there exists an element XQ G dom / such that h is finite and continuous at w(xo) and h is increasing ii.e., Z\, Zi G Z, Z\ < Z2 ^ hiz\) < hizi))- Then we x X have (1.420), with ^u e R and Fenchel-Moreau conjugation f*:R -> R. Proof (a) Let (X, Z, w) be a linear system. By /(JCQ) < +00, /z(w(xo)) < +00, we have inf;cGX {fix) 4- hiuix))} < +00. Furthermore, since u is linear and / , h are
1.4 Duality for convex and quasi-convex infimization
77
convex, 0 of (1.414) and p of (1.415) (which satisfies (1.387)) are convex as well. Finally, since h is continuous at w(xo), the function Pxo' z^
p(xo, z) = f(xo) 4- hiu(xo) - z)
(1.421)
is finite and continuous at z = 0. Hence, by Proposition 1.6 and (1.417), we obtain (1.420). (b) Let (X, Z, u) be a convex system. Then infj^ex {fM + h(u(x))} < -\~oo (as in part (a)). Let us observe now that for any increasing convex function h: X ^^ R, the function hu is convex; indeed, since u is convex and h is increasing, for any x\,X2 ^ X and 0 < c < 1 we have h(u(cxi + (1 — c)x2)) < h(cu(xi) -f- (1 — c)u(x2)), which, since h is convex, is < ch(u(x\)) + (1 - c)h(u(x2)). Also, 0 of (1.414) is convex (since so are its summands), whence so is p of (1.415), and as in part (a), the function (1.421) is finite and continuous at z = 0. Hence, by Proposition 1.6 and (1.417), we obtain (1.420). D Remark 1.32. In the particular case that Z = X is a locally convex space and u = Ix, the identity operator in X (i.e., u(x) — x for all x e X), Theorem 1.24 (a) yields the following classical result (see, e.g., [183, 185]) on the problem of the infimization of the (upper) sum / -j- /z of two convex functions, that is, the problem (P)
a= inf {fix)+
hix)}:
(1.422)
xeX
If X is a locally convex space and f,h\ X -> R are two convex functions for which there exists an element XQ e dom / such that h isfiniteand continuous at XQ, then inf {fix) + hix)} = max {-/*(^) + -h\-^)}. jceX
(1.423)
^eX*
However, this result does not imply directly Theorem 1.24 (a) above when applied to the convex functions f: X -^ R and hu\ X ^> R, since its assumption is that hu is continuous at XQ, while in Theorem 1.24 (a) it is assumed only that h is continuous at w(xo). Note that formula (1.423) is symmetric in / and /z, since max {-/z*(vl/) + -/*(-vI/)} = max {-/*(^) -h
-h\-^)}.
Hence instead of assuming in the above result that h isfiniteand continuous at some xo € dom / , we may assume that f is finite and continuous at some XQ G dom/i. Let us give now an application to the primal "programming" problem (1.298), in the particular case that Z = (Z, <) is a partially ordered locally convex space and r c Z is the negative cone
78
1. Preliminaries T:={zeZ\z<0}.
(1.424)
i.e., to the problem (P)
a=
inf fix).
xeX u(x)<0
(1.425)
Corollary 1.12. (a) Let (X, Z, u) be a linear system, where Z — (Z, <) is a partially ordered locally convex space, and let f: X -> R be a convex function for which there exists an element XQ G dom / satisfying M(JCO) G i n t r ,
(1.426)
with T of (1.424) (this is called the ''Slater condition'' or the ''Slater constraint qualification "). Then we have inf f{x) = max inf {/(jc) + ^(w(jc))}, xeX u(x)<0
(1.427)
^eZlxeX ^
where Z ; :={vl/ G Z*|vl/ > 0 } ,
(1.428)
with ^ >0 meaning that ^(z) > Ofor all z > 0. (b) Let (X, Z,u) be a convex system, where Z = (Z, <) is a partially ordered locally convex space. Then for f and T as in (a), we have (1.427), with Z^ of (1.428). Proof (a) Since u(xo) e intT, we have u(X) n T 7^ 0 andxr(«(-^o)) < +C)0, and there exists an open neighborhood V of w(jco) such that u(V) C T, whence XT is continuous at u(xo). Hence, by Theorem 1.24 (a) above, applied to the convex functions / and h:=XT,
(1.429)
we have inf [fix) + XTiuix))} = max [-f*i^u) xeX
+ -xH-"^)}-
(1-430)
VJ/GZ*
But for any ^ G Z* we have -f*i^u)
= -sup {i^u)ix)
- fix)} = inf [fix) - ^iuix))},
xeX
(1.431)
^e^
- X ? ( - ^ ) = - sup {-vl/(z) - XTiz)} = inf vl/(r),
(1.432)
zeZ
which, together with (1.430), yield inf fix) = max inf {/(JC) - ^iuix))
xeX u(x)eT
^eZ*xeX
= max inf ^feZ*xeX
{/(JC)
+ ^iuix))
+ inf vl/(r)} + - sup ^ ( 7 ) } .
(1.433)
1.4 Duality for convex and quasi-convex infimization
79
(the last equality of (1.433) follows from the first one, replacing ^ by — ^ and using thatinf(-vl/(r)) = - s u p ^ ( r ) ) . Now^, by (1.424), we have
^(T)
=
(-00,0] [0,+00) 0 R
ifO
(1.434)
whence
{
0
if vi/ F 7*
which, together with (1.424) and (1.433), yields (1.427). (b) We can write formula (1.298) in the form a =
inf
xeX u{x)eT
f(x)
= inf {f(x)
xeX
+ XT(U(X))},
(1.436)
where XT denotes the indicator function of the set T. As in the proof of part (a), XT is finite and continuous at u(xo). Also, since T is convex, so is its indicator function XT- Furthermore, by (1.424), XT is increasing; indeed, if z\ G T, then XT(ZI) =0 < xr(^2)forallz2 ^ Z, while if zi ^ T a n d z i < Z2,thenz2 ^ T (since otherwise Z\ < Z2 < 0, contradicting z\ ^ T), whence XT(Z\) = Xrizi) = + o o . Consequently, by Theorem 1.24 (b) above, (1.420) holds with h : = x r . whence by (1.431), (1.432), and (1.435), we obtain (1.427). D Remark 1.33. (a) As shown by the above proof, we have the following extension of Corollary 1.12 (a): Let (X, Z, u) be a linear system, T a convex subset ofZ such that u(X) C\T ^ &, and f: X -^ R a convex function, for which there exists an element XQ E dom / satisfying the Slater condition (1.426). Then we have (1.433). In the particular case that Z = X is a locally convex space and u = Ix, the identity operator in X, this yields the following result: if X is a locally convex space, G a convex subset of X, and f:X -^ R a convex function for which there exists an element XQ e dom / such that jcoGintG,
(1.437)
then i n f / ( G ) = max inf {/(jc) - ^(x) + inf ^ ( G ) } ^eX*xeX = max inf {/(x) + vI/(jc) + - s u p ^ ( G ) } . ^eX*xeX
(1.438)
Note that (1.438) is nothing other than formula (1.268), which has been stated in Theorem 1.13 under a different assumption, namely, that f: X -^ R is SL proper convex function that is continuous at some point XQ e G H dom / . However, this
80
1. Preliminaries
fact is also a consequence of Remark 1.32 above and formulas (1.431), (1.432) for u = Ix, since for h = XG "^^ have domh — G. (b) Let us observe that for any convex subset T of Z, [+00
ifu{x)-z^T
= X.-'(r+z)W
(X6Z,ZGZ).
(1.439)
Hence, the perturbation function (1.415) is now p{x, z) := f{x)
+ XT{U{X) -
/•/ \ I
z)
r \
\ fi^)
i f w ( ^ ) e r + Z,
r^ AAC\\
(c) When (X, Z, u) is a convex system, where Z = (Z, <) is a partially ordered locally convex space, by the proof of Theorem 1.24 (b) the function ^u\ X -^ R occurring in (1.427) is convex for each vj/ G Z;j.; that is, {^u){Zl)
c Conv(X),
(1.441)
where Conv(X) denotes the set of all convex functions w. X ^^ R. Also, for T of (1.424), formula (1.440) becomes
pix,z)=U_}'^
^!";"!^^'
(1.442)
and thus the family ( P J of infimization problems of (1.389) becomes (P,)
v{z) = inf fix) xeX
iz e Z).
(1.443)
U{X)
The perturbations/?: XxZ -^ ^ o f problem (1.425), given by (1.387), (1.389), where the objective function is perturbed, are called horizontal perturbations, while the perturbation p defined by (1.442), (1.443), in which only the constraint set is perturbed, but the objective function is left unchanged, is called a vertical perturbation (see, e.g., Laurent [129]). Let us pass now to (perturbational) surrogate duality. We shall consider again the primal infimization problem (P) of (1.386), embedded into a family of perturbed optimization problems (1.389), with the aid of a perturbation p: XxZ ^^ R satisfying (1.387). Following Crouzeix [34], one defines the quasi-convex dual problem associated with the perturbation function p (or relative to the parameterization (Z, p)) as the unconstrained supremization problem (Aurr)
AuiT '= SUpAsurr(Z*),
where Asurr: Z* -> /? is the dual objective function defined by
(1.444)
1.4 Duality for convex and quasi-convex infimization Asurr(^) :=
inf
p U , z)
(^ € Z*).
81
(1.445)
{x,z)eX^Z vl/(z)>0
The main difference between the functions X of (1.392) and Xsurr of (1.445) is that in (1.392) the "penalty term" —^(z) is added to the objective function p(., z) of {Pz) of (1.389), for each z e Z, while in (1.445), ^(z) is used to form the new surrogate constraint sets {(;c, z) G X X Z| ^(z) > 0}
(^ G Z*);
(1.446)
thus, the quasi-convex dual problem is a surrogate dual problem. There exists a theory of quasi-convex dual problems analogous to the theory of Lagrangian dual problems. The role of Fenchel conjugates /*, /** for Lagrangian duality is played by the Greenberg-Pierskalla quasi-conjugates / J , /^^ (defined by (1.213), (1.217)) for the above surrogate dual objective function: Corresponding to (1.393), (1.407), and (1.408), we have now Xsurr(^) = -PI„0)(^, fturr =
^ ) = -v'o(^)
sup Asurr(^) -
sup
(^ iuf
G Z*),
(1-447)
i;(z) = SUp ( - ^ ^ ^ ( Z * ) = ^>^>^ (0),
(1.448)
where JCQ G X is arbitrary, and v is the (optimal) value function (1.389). Hence ^surr of (1.447) is always quasi-concave and weak* upper semicontinuous, and the inequalities (1.369) remain valid in this general case (by (1.218)). Furthermore, by (1.390) and (1.448), weak surrogate duality a = ^surr holds if and only if v(0) = vyy{0), or equivalently (by (1.219)), v{0) = i;eq(0). Corresponding to (1.396), the following is a useful tool for the study of the quasi-convex dual problem associated with p: the function Lsurr- X x Z* ^^ R defined by ^surr(^,^):=
inf
zeZ *(z)>0
p(jc,z)
(jc G X, vl/G Z*)
(1.449)
is called the quasi-convex Lagrangian associated with p. By (1.445) and (1.449) we have Asurr(^) = inf Lsurr(-^, ^ ) xeX
( ^ ^ Z*),
(1.450)
and hence by (1.444), the surrogate dual value is Psun = sup inf Lsurr(-^, ^ ) .
(1-451)
One can compute [240] that for the structured primal infimization problem (P) of (1.436), with any system (X, Z, u) and any target set 7, we have T
/ ^ vl/>| -
""^ ' ^
1 -f^^^ + X{x'eX\^(u{x'))>mf^{T)}(x)
if mf^(T)
G
^{T),
1 fix) + X{x'exinu(x'))>infnT)}(x) if inf^(r) ^ vi,(r), ^''^^^^
82
1. Preliminaries
and hence the surrogate dual value is Aurr = sup mini vT/c7*
I
inf
f{x),
inf
xeX ^(u(x))e^iT)
f(x)\.
xeX ^(u(x))>\nf^{T)
(1.453)
J
As an application to a particular case, let (X, Z, u) be a convex system, where Z = ( Z , < ) is a partially ordered locally convex space, and let us consider the structured primal programming problem (1.298), where T is the negative cone (1.424) in Z, i.e., the problem (1.425). We shall show now that in this case, for the perturbation (1.440), that is, for p(x, z) = fix) + Xu-HT+z)M = fM4-X{x'ex\uix')
(xeX^zeZ),
(1.454)
the surrogate Lagrangian (1.449), that is, ^surr(-^, - ^ ) =
inf {fix) zeZ ^(z)<0
= fix)+
+
X{x'eX\u(x')
inf X{x'ex\u(x')
(1.455)
^\z)<0
becomes /
(r
yjj^- lf(^^-^X{x'ex\^(u(x'))
ifx € X , 0 <
VI/GZ*,
and hence the surrogate dual value is Aurr-
sup
inf
fix).
(1.457)
One can deduce this from (1.452), but here is a direct proof: If ^ > 0, then ^zez,nz)
< 0}.
(1.458)
Indeed, the inclusion c is obvious; conversely, if x^ e X, ^^iuix')) < 0, so there exists z 6 Z, z > 0, such that VI/(M(JC')) = ^ ( - z ) , then for z' = w(xO + z we have w(jcO < z\ ^(zO = 0. Hence by (1.458), i^C X{x'eX\u(x')
= X{x'eX\^(u(x'))<0}ix)
ix G X),
which proves the part 0 < ^ G Z* of (1.456). On the other hand, if vl/ 2^ 0, then there exists z! ^ Z with z! > 0, vl/(z0 < 0. Then for any jc G X and for /JL > 0 sufficiently large, the element z = M(X) + /xz^ satisfies uix) < z and ^(z) = ^iuix)) + /x^(zO < 0, whence the second inf in (1.455) is 0, which proves the part 0 ^ vi/ G Z* of (1.456). Finally, by (1.450) and (1.456), we obtain (1.457). By (1.457) we have now, similarly to (1.369),
1.4 Duality for convex and quasi-convex infimization
83
a = inf fix) > fi',,„ xeX u(x)<0
:= max
inf
f{x) > max inf {/(jc) + ^(u(x))}
VI/(M(JC))<0
:=
fii^^,.
(1.459)
^
Hence if there holds strong Lagrangian duality (1.427), i.e., a = Pl^^^, then we also have strong surrogate duality a = yS^^j^; clearly, a similar statement holds also for the corresponding weak Lagrangian and weak surrogate dualities. Consequently, from Corollary 1.12(b) we obtain the following: Corollary 1.13. Let {X, Z, u) be a convex system, where Z = (Z, <) is a partially ordered locally convex space, and let f \ X -^ R be a convex function for which there exists an element XQ G dom / satisfying the ''Slater condition'' (1.426), with T of {\.424). Then we have inf /(jc) = max
inf
xeX
JceX
vi/ezi
M(JC)<0
^
f{x).
(1.460)
^{u{x))
Corollary 1.13 can be sharpened, as shown by the following classical theorem of surrogate duality for a quasi-convex programming problem (1.425), due to Luenberger ([136, Theorem 3]): Theorem 1.25. Let u: R^ ^ R^ be a convex mapping such that there exists XQ e R^ satisfying u{xo) <^ 0 {i.e., with all components of u(xo) being < 0) and let f: R" -> R be a quasi-convex function that is upper semicontinuous along lines {i.e., for every X\,X2 G /?", ^/{r]) := r]X\ + (1 — r])x2 is an upper semicontinuous function of r] for rj e [0, 1]). If inf xeR" f{x) is finite, then we have (1.460) with u{x)<0
X = R^ Z = R"^. Besides the above quasi-convex dual (1.444), (1.445) to the primal infimization problem (1.386), other surrogate dual problems are also used. For example, one defines [234] the 0-dual problem associated with the perturbation function p as the unconstrained supremization problem {Do)
^, :=supA,(Z*),
(1.461)
where XQ: Z* ^^ R is the dual objective function defined by Xe{^):=
inf
p{x,z)
(^ G Z*);
(1.462)
(x,z)eXxZ vl>(z)>-l
the only difference between the functions Asurr of (1.445) and XQ of (1.462) is that ^(z) > 0 is now replaced by ^(z) > —1. In the corresponding duality theory, the role of / * and / J is played by the semiconjugate / ^ defined by (1.221). It turns out that weak surrogate duality a = Po holds if and only if v{0) = v^^{0), or equivalently (by (1.222)), i;(0) = Uq(0).
84
1. Preliminaries
As another example, let us mention that one defines [234] the n-dual problem associated with the perturbation function p as the unconstrained supremization problem (D,)
yS, :=supX,(Z*),
(1.463)
where Xj^: Z"" ^^ R'l^ the dual objective function defined by A^(vl/) :=
inf
/7(x, z)
(^ e Z*);
(1.464)
{x,z)eXxZ vy(z)=0
the "surrogate Lagrangian" associated with
[{P),{DT^)}IS
L^{x, ^) := inf p(jc, z)
defined by
(x e X, ^ e Z*).
(1.465)
zeZ
The only difference between the functions Asurr of (1.445) and A,;,- of (1 -464), respectively the functions Lsurr and L^^, is that ^(z) > 0 is now replaced by ^(z) = 0. The role of / * and fj is played now by / J of (1.220). One can compute [240] that for the structured primal infimization problem (P) of (1.436), with any system (Z, Z, u) and any target set 7, we have Lj,(x,
^ ) = fix)
+ X{x'eX\^(uix'))e^iT)}(x),
(1.466)
and hence the corresponding surrogate dual value is Pn = sup
inf
vi/(=7*
^^^
fix).
(1.467)
xeX
^(uix))e^(T)
In the particular case that Z — X and u = Ix, formula (1.467) (with T = G c X) reduces to the right-hand side of (1.370) (where, clearly, sup(j>g;^*\{0} = sup4>ex*)One can deduce from (1.466) (see [240, p. 60]) that for the particular case of problem (1.425) and the perturbation (1.454) we have
LAx,-^)
=
fM + X{x'ex\^iuix'))
(1.468)
whence Pn=fisurr=
SUp 0<^eZ*
iuf xeX ^(w(;c))<0
/(x).
(1.469)
2 Worst Approximation
We recall that the deviation (or excess) of a set G (assumed nonempty in this chapter, without any special mention) from an element XQ in a normed linear space X is the number 8(G, XQ) > 0 defined by 5(G,xo):=sup||g-xo||,
(2.1)
geG
and any go e G for which this sup is attained, i.e., such that llgo--^oll = s u p llg-xoll,
(2.2)
geG
or equivalently, such that llgo-^oll> \\g-xo\\
(geG),
(2.3)
is called an element of worst approximation of (or SL farthest point to) XQ in G (see Figure 2.1). For any (nonempty) subset G of a normed linear space X, and XQ e X, v/e shall denote by ^G(-^O) the set all farthest points to JCQ in G, that is, ^G(XO) := {go e G\ \\go - xo\\ = sup \\g - xo\\}.
(2.4)
geC
Similarly to the case of best approximation, we may have JG(-^O) = 0, even for bounded subsets G of Z with "very good" geometric properties. Note that we could assume that G is closed and convex, since it is well known and easy to see that for any XQ e X and any set C c X we have
86
2. Worst Approximation coG
-^ Xo
Figure 2.1. (2.5)
sup ||c-xol| = sup \y - Joll ceC
JGCOC
We shall be concerned with the following tv^o main problems', (1) Find convenient formulas for 5(G, XQ). (2) Give characterizations of elements of worst approximation (i.e., necessary and sufficient conditions in order that an element go ^ G satisfy (2.2), that is, in order that ^0 ^ ^G(-^O)). We shall obtain duality results, using the elements O of the conjugate space X*.
2.1 The deviation of a set from an element We shall need the following lemma: Lemma 2.1. Let X be a normed linear space, G a subset ofX, and XQ e X. Then sup suplOC^-xo)| = sup | s u p O ( g - x o ) | . OGX*
11011 = 1
(2.6)
geG
l|0||=l
Proof. Let us first prove the lemma for JCQ = 0, i.e., that we have (2.7)
sup sup|cD(g)|= sup |supcl)(^)|. OeX* geC
OGX*
\\n=\
iioii=i
g^G
To this end, it will be sufficient to show that for any O e Z*\{0}, sup|cD(g)|=max{|supcD(g)|,|sup(-cD)(g)|}. geG
geG
(2.8)
geG
Using that \a\ = max {a, —a), the left-hand side of (2.8) is sup 10(g)I = supmax{0(g),-)(g)}. geG
geG
On the other hand.
geG
geG
(2.9)
2.1 The deviation of a set from an element
87
I supO(g)| = max{supO(g), — supcl)(g)} = max{supO(g), inf(—0)(g)}, geG
geG
geG
geC
8^G
|sup(-cD)(g)| = max{sup(-cD)(g), inf
geG
S^G
SO the right-hand side of (2.8) is max{|supO(g)|, | sup(-0)(g)|} geG
geG
= max{sup(D(g), mf(-^)(g), geG
sup(-0)(g), inf <^(g)]
S^G
g^Q
geG
= max {sup ^(g), sup(-cD)(g)}, geG
geG
which together with (2.9) proves (2.8), and hence (2.7). Now let jco G X be arbitrary. Then, by (2.7) applied to the set G - xo, we obtain (2.6). D Theorem 2.1. Let X be a normed linear space, G a subset ofX, and XQ G X. Then s u p | | g - x o i l = sup |supO(G)-cD(xo)|. geG
(2.10)
\\n=\ Proof. By Lemma 2.1 we have supllg-xoll = sup sup \(^(g-xo)\= geG
geG
sup s u p | 0 ( g - x o ) |
^eX*
OGX*
\\n=\
geG
\\n=\
= sup | s u p O ( g - x o ) | ,
geG
i.e., (2.10).
D
Remark 2.1. (a) When $G$ is unbounded and $x_0\in X$, formula (2.10) reduces to $+\infty = +\infty$. Indeed, if the right-hand side of (2.10) is $<+\infty$, then by the uniform boundedness principle, $G-x_0$ is bounded, and hence so is $G$, which proves our assertion. Thus, only the case of $G$ bounded is of interest.

(b) By Lemma 1.5, the main (i.e., the bounded) case of Theorem 2.1 admits the following geometric interpretation: If $G$ is bounded and $x_0\in X$, then
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \operatorname{dist}\bigl(x_0, H_{\Phi,\sup\Phi(G)}\bigr) = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} \|y-x_0\|, \qquad (2.11)$$
where $H_{\Phi,\sup\Phi(G)}$ is the hyperplane
$$H_{\Phi,\sup\Phi(G)} = \{y\in X\mid \Phi(y)=\sup\Phi(G)\}. \qquad (2.12)$$
Equivalently, by (1.49),
$$\sup_{g\in G}\|g-x_0\| = \sup_{H\in\mathcal{H}_G} \operatorname{dist}(x_0, H), \qquad (2.13)$$
where $\mathcal{H}_G$ denotes the collection of all hyperplanes in $X$ that quasi-support the set $G$ (see Figure 2.2). Thus, the reduction principle of Remark 1.16(b) now takes the following form: formula (2.13) reduces the computation of the deviation of a bounded set $G$ from $x_0$ to the computation of the distances to the hyperplanes $H\in\mathcal{H}_G$.

(c) For unbounded sets, formula (2.11) still holds (it reduces to $+\infty=+\infty$, as noted in (a) above), but (2.13) need not hold, as shown, e.g., by any closed affine subset $G$ of $X$; indeed, for such a set $G$, the left-hand side of (2.13) is $+\infty$, but since a hyperplane $H$ quasi-supports a closed affine set $G$ if and only if $H\supseteq G$ (and hence then $H$ supports $G$), the last term of (2.13) is $\le \operatorname{dist}(x_0,G) < +\infty$. The reason for this discrepancy is that for unbounded $G$ there exists $\Phi\in X^*$ such that $\sup\Phi(G)=+\infty$, whence the set $H_{\Phi,\sup\Phi(G)}$ of (2.12) is empty, and so is not a hyperplane.
Figure 2.2.
Corollary 2.1. Let $X$ be a normed linear space, $G$ a subset of $X$, and $x_0\in X$. Then
$$\sup_{g\in G}\|g-x_0\| = \sup_{\Phi\in X^*\setminus\{0\}} \frac{\sup\Phi(G)-\Phi(x_0)}{\|\Phi\|}. \qquad (2.14)$$
Proof. By Theorem 2.1 and its proof, it is enough to show that
$$\sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)\bigr| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \sup\Phi(G). \qquad (2.15)$$
The inequality $\ge$ in (2.15) is obvious. In order to prove the opposite inequality, let $\|\Phi\|=1$. If $\sup\Phi(G)\ge 0$, then $|\sup\Phi(G)| = \sup\Phi(G)$. On the other hand, if $\sup\Phi(G)<0$, then for $\Phi_0 := -\Phi$ we have $\|\Phi_0\|=1$ and $|\sup\Phi(G)| = -\sup\Phi(G) = \inf\Phi_0(G) \le \sup\Phi_0(G)$, which proves the inequality $\le$ in (2.15). $\square$
Remark 2.2. Conversely, Corollary 2.1 implies Theorem 2.1. Indeed, by (2.14) we have
$$\sup_{g\in G}\|g-x_0\| \le \sup_{\Phi\in X^*\setminus\{0\}} \frac{\bigl|\sup\Phi(G)-\Phi(x_0)\bigr|}{\|\Phi\|}. \qquad (2.16)$$
On the other hand, since
$$\sup_{g\in G}\Phi(g-x_0) \le \sup_{g\in G}|\Phi(g-x_0)| \le \|\Phi\| \sup_{g\in G}\|g-x_0\| \qquad (\Phi\in X^*),$$
we obtain (applying this to both $\Phi$ and $-\Phi$)
$$\sup_{\Phi\in X^*\setminus\{0\}} \frac{\bigl|\sup\Phi(G)-\Phi(x_0)\bigr|}{\|\Phi\|} \le \sup_{g\in G}\|g-x_0\|,$$
which, together with (2.16), yields (2.10).

Corollary 2.2. Let $X$ be a normed linear space, $G$ a subset of $X$, and $x_0\in X$. Then
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \sup\Phi(G)>d}} \frac{d-\Phi(x_0)}{\|\Phi\|} = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)>d}} \frac{d-\Phi(x_0)}{\|\Phi\|}. \qquad (2.17)$$
Proof. Clearly,
$$\sup\Phi(G) = \sup\{d\in R\mid \sup\Phi(G)>d\} = \sup\{d\in R\mid \exists g\in G,\ \Phi(g)>d\} \qquad (\Phi\in X^*). \qquad (2.18)$$
Hence, by Corollary 2.1 and (2.18), we obtain
$$\sup_{g\in G}\|g-x_0\| = \sup_{\Phi\in X^*\setminus\{0\}} \frac{\sup\Phi(G)-\Phi(x_0)}{\|\Phi\|} = \sup_{\Phi\in X^*\setminus\{0\}}\ \sup_{\substack{d\in R\\ \sup\Phi(G)>d}} \frac{d-\Phi(x_0)}{\|\Phi\|}, \qquad (2.19)$$
which proves the first equality in (2.17). Finally, the proof of the second equality in (2.17) is similar, since $\sup\Phi(G)>d$ if and only if there exists $g\in G$ such that $\Phi(g)>d$. $\square$

Remark 2.3. (a) In the converse direction, Corollary 2.2 implies Corollary 2.1, which in turn implies Theorem 2.1. Indeed, this follows by starting with the first equality of (2.17) and writing formula (2.19) in the reverse order.

(b) Corollary 2.2 admits the following geometric interpretation: We have
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \sup\Phi(G)>d}} \operatorname{dist}(x_0, U_{\Phi,d}) = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \sup\Phi(G)>d}}\ \inf_{\substack{y\in X\\ \Phi(y)>d}} \|y-x_0\|$$
$$= \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)\ge d}} \operatorname{dist}(x_0, V_{\Phi,d}) = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)\ge d}}\ \inf_{\substack{y\in X\\ \Phi(y)\ge d}} \|y-x_0\|, \qquad (2.20)$$
or equivalently,
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{U\in\mathcal{U}\\ U\cap G\ne\emptyset}} \operatorname{dist}(x_0, U) = \sup_{\substack{V\in\mathcal{V}\\ V\cap G\ne\emptyset}} \operatorname{dist}(x_0, V), \qquad (2.21)$$
where $\mathcal{U}$ and $\mathcal{V}$ denote, respectively, the collection of all open half-spaces in $X$ and the collection of all closed half-spaces in $X$. Indeed, for the open half-space $U_{\Phi,d}$ and the closed half-space $V_{\Phi,d}$ of (1.67), we have
$$U_{\Phi,d}\cap G\ne\emptyset \iff \sup\Phi(G)>d, \qquad (2.22)$$
and, respectively,
$$V_{\Phi,d}\cap G\ne\emptyset \iff \exists g\in G,\ \Phi(g)\ge d. \qquad (2.23)$$
Hence, by (2.17) and Corollary 1.4, we obtain (2.21) (see Figures 2.3 (a) and (b)).

Figure 2.3.
Remark 2.4. Note also that for the hyperplane $H_{\Phi,d} = \{y\in X\mid \Phi(y)=d\}$ we have
$$H_{\Phi,d}\cap G\ne\emptyset \iff \exists g\in G,\ \Phi(g)=d, \qquad (2.24)$$
and
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)=d}} \operatorname{dist}(x_0, H_{\Phi,d}) = \sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)=d}}\ \inf_{\substack{y\in X\\ \Phi(y)=d}} \|y-x_0\|, \qquad (2.25)$$
or equivalently,
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{H\in\mathcal{H}\\ H\cap G\ne\emptyset}} \operatorname{dist}(x_0, H), \qquad (2.26)$$
where $\mathcal{H}$ denotes the collection of all hyperplanes in $X$ (see Figure 2.4).
Figure 2.4.

One can also express the right-hand side of (2.10) as follows:

Proposition 2.1. If $G$ is a subset of $X$ and $x_0\in X$, then
$$\sup_{\substack{\Phi\in X^*\\ \|\Phi\|\le 1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr|. \qquad (2.27)$$
Proof. By the above proof of Theorem 2.1, it is enough to consider the case $x_0=0$, i.e., to show that
$$\sup_{\substack{\Phi\in X^*\\ \|\Phi\|\le 1}} \bigl|\sup\Phi(G)\bigr| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)\bigr|. \qquad (2.28)$$
But, for any $\Phi\in X^*$ and $0\le a\le 1$ we have $|\sup(a\Phi)(G)| = a\,|\sup\Phi(G)| \le |\sup\Phi(G)|$, whence, since each $\Phi\in X^*$ with $\|\Phi\|\le 1$ can be written as $\Phi = a\Phi_0$, with $\|\Phi_0\|=1$ and $0\le a\le 1$, we obtain (2.28). $\square$

In (2.13) one can replace hyperplanes by other sets, such as quasi-supporting closed or open half-spaces (see Figures 2.5a and 2.5b). Indeed, we have the following theorem:

Theorem 2.2. Let $X$ be a normed linear space, $G$ a subset of $X$, and $x_0\in X$. Then
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}}\ \inf_{\substack{y\in X\\ \Phi(y)>\sup\Phi(G)}} \|y-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}}\ \inf_{\substack{y\in X\\ \Phi(y)\ge\sup\Phi(G)}} \|y-x_0\|, \qquad (2.29)$$
or equivalently,
$$\sup_{g\in G}\|g-x_0\| = \sup_{V\in\mathcal{V}_G} \operatorname{dist}(x_0, V) = \sup_{U\in\mathcal{U}_G} \operatorname{dist}(x_0, U), \qquad (2.30)$$
where $\mathcal{V}_G$ (respectively, $\mathcal{U}_G$) denotes the collection of all closed (respectively, open) half-spaces that quasi-support $G$ and do not contain $G$ (respectively, $\operatorname{int} G$).
Figure 2.5.

Proof. We claim that for the set
$$C := \overline{\operatorname{co}}(x_0, G), \qquad (2.31)$$
we have
$$\sup_{c\in C}\|x_0-c\| = \sup_{g\in G}\|x_0-g\|. \qquad (2.32)$$
Indeed, since $G\subseteq C$, we have the inequality $\ge$ in (2.32). On the other hand, for any $\eta_0 x_0 + \sum_{i=1}^m \eta_i g_i \in \operatorname{co}(x_0, G)$, where $\eta_0, \eta_1, \ldots, \eta_m \ge 0$ and $\eta_0 + \sum_{i=1}^m \eta_i = 1$,
$$\Bigl\|x_0 - \Bigl(\eta_0 x_0 + \sum_{i=1}^m \eta_i g_i\Bigr)\Bigr\| \le \sum_{i=1}^m \eta_i \|x_0-g_i\| \le \sup_{g\in G}\|x_0-g\|,$$
whence, passing to the closed convex hull, we obtain the inequality $\le$ in (2.32), which proves the claim (2.32). Consequently, we may assume that $x_0\in G$ in (2.29). Then $\Phi(x_0) \le \sup\Phi(G)$ for all $\Phi\in X^*$, and $x_0\notin U := \{y\in X\mid \Phi(y)>\sup\Phi(G)\}$. Hence by Theorem 2.1 and Corollary 1.4, we obtain
$$\sup_{g\in G}\|g-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl\{\sup\Phi(G)-\Phi(x_0)\bigr\} = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \operatorname{dist}(x_0, U),$$
which, by passage to $\overline{U} = \{y\in X\mid \Phi(y)\ge\sup\Phi(G)\}$, yields also (2.29). $\square$

Let us give now another formula for the deviation.
Theorem 2.3. Let $X$ be a normed linear space, $G$ a subset of $X$, and $x_0\in X$. Then
$$\sup_{g\in G}\|g-x_0\| = \begin{cases} \displaystyle\sup_{\substack{\Phi\in X^*\\ \exists g\in G,\ \Phi(g)>\Phi(x_0)+1}} \frac{1}{\|\Phi\|} & \text{if } G\ne\{x_0\}, \\[2ex] 0 & \text{if } G=\{x_0\}. \end{cases} \qquad (2.33)$$
Proof. We may assume that $x_0=0$ and $G\ne\{0\}$. By Corollary 2.2, it is enough to show that in this case,
$$\sup_{\substack{(\Phi,d)\in(X^*\setminus\{0\})\times R\\ \exists g\in G,\ \Phi(g)>d}} \frac{d}{\|\Phi\|} = \sup_{\substack{\Phi\in X^*\\ \exists g\in G,\ \Phi(g)>1}} \frac{1}{\|\Phi\|}. \qquad (2.34)$$
The inequality $\ge$ in (2.34) is obvious. Conversely, for any $(\Phi,d)\in(X^*\setminus\{0\})\times R$ with $d>0$ for which there exists $g\in G$ such that $\Phi(g)>d$, let
$$\Phi_0 := \tfrac{1}{d}\Phi. \qquad (2.35)$$
Then $\Phi_0(g) = \tfrac{1}{d}\Phi(g) > 1$ and $\tfrac{d}{\|\Phi\|} = \tfrac{1}{\|\Phi_0\|}$; since the pairs $(\Phi,d)$ with $d\le 0$ contribute no more than $0$ to the left-hand side of (2.34), while its right-hand side is $>0$ (because $G\ne\{0\}$), this proves the inequality $\le$ in (2.34), and hence the equality. $\square$

Remark 2.5. It is necessary to consider the two cases in (2.33) separately, since for $G=\{x_0\}$ the left-hand side of (2.33) is $0$ and the supremum on the right-hand side is $-\infty$. Of course, only the case $G\ne\{x_0\}$ is of interest.
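Formula (2.33) can also be illustrated numerically. The following sketch is not from the text; the finite set $G$ and the sampling grid are our own choices. For $\Phi = r\,(\cos t, \sin t)$ in the Euclidean plane with $x_0 = 0$, the constraint $\exists g\in G$, $\Phi(g)>1$ forces $r > 1/\max_g \langle u, g\rangle$ whenever this maximum is positive, so the achievable values of $1/\|\Phi\|$ in direction $u$ fill the interval $(0, \max_g\langle u,g\rangle)$; the supremum over directions then recovers $\sup_g\|g\|$.

```python
import math

# Illustrative check of Theorem 2.3 (formula (2.33)) for x0 = 0 in the
# Euclidean plane.  G is an arbitrary finite set of our own choosing.

G = [(3.0, 1.0), (-1.0, 2.0), (0.5, -2.5)]

deviation = max(math.hypot(*g) for g in G)   # left-hand side of (2.33)

best = 0.0
n = 3600
for k in range(n):
    t = 2 * math.pi * k / n
    u = (math.cos(t), math.sin(t))
    h = max(u[0] * g[0] + u[1] * g[1] for g in G)  # support value of G at u
    if h > 0:
        best = max(best, h)   # sup of attainable values 1/||Phi|| at this u

print(round(deviation, 4), round(best, 4))
```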
2.2 Characterizations and existence of farthest points

We shall first give some characterizations of farthest points to $x_0$ in $G$, where $G$ is a subset of $X$ and $x_0\in X$, i.e., some necessary and sufficient conditions in order that $g_0\in\mathcal{F}_G(x_0)$ (that is, $\|g_0-x_0\| = \sup_{g\in G}\|g-x_0\|$).

Theorem 2.4. Let $X$ be a normed linear space, $x_0\in X$, and $G$ a subset of $X$. For an element $g_0\in G$, the following statements are equivalent:

1°. $g_0\in\mathcal{F}_G(x_0)$.

2°. There exists $\Phi_0\in X^*$ such that
$$\|\Phi_0\| = 1, \qquad (2.36)$$
$$\Phi_0(g_0-x_0) = \sup_{g\in G}\|g-x_0\|. \qquad (2.37)$$

3°. There exists $\Phi_0\in X^*$ satisfying (2.36) and
$$|\Phi_0(g_0-x_0)| = \sup_{g\in G}\|g-x_0\|. \qquad (2.38)$$
Proof. Assume 1°. By a corollary of the Hahn-Banach theorem, we can choose $\Phi_0\in X^*$ satisfying (2.36) and
$$\Phi_0(g_0-x_0) = \|g_0-x_0\|. \qquad (2.39)$$
Then, by (2.39) and 1°, we obtain
$$\Phi_0(g_0-x_0) = \|g_0-x_0\| = \sup_{g\in G}\|g-x_0\|,$$
i.e., (2.37). Thus, 1° $\Rightarrow$ 2°. The implication 2° $\Rightarrow$ 3° is obvious. Finally, assume 3°. Then
$$\|g_0-x_0\| \ge |\Phi_0(g_0-x_0)| = \sup_{g\in G}\|g-x_0\|,$$
whence, since $g_0\in G$, we obtain 1°. $\square$
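In the Euclidean plane the functional of Theorem 2.4 can be written down explicitly. The sketch below is illustrative (the set $G$ and point $x_0$ are our own choices): for a farthest point $g_0$, the Hahn-Banach functional of the proof can be taken as $\Phi_0 = (g_0-x_0)/\|g_0-x_0\|$, and then $\Phi_0(g_0-x_0)$ equals the deviation, as in (2.36)-(2.37).

```python
import math

# Illustration of Theorem 2.4 in the Euclidean plane.

G = [(2.0, 1.0), (-1.0, 3.0), (0.0, -2.0)]
x0 = (0.5, 0.0)

g0 = max(G, key=lambda g: math.hypot(g[0] - x0[0], g[1] - x0[1]))
dev = max(math.hypot(g[0] - x0[0], g[1] - x0[1]) for g in G)

norm_g0 = math.hypot(g0[0] - x0[0], g0[1] - x0[1])
phi0 = ((g0[0] - x0[0]) / norm_g0, (g0[1] - x0[1]) / norm_g0)  # ||Phi0|| = 1

value = phi0[0] * (g0[0] - x0[0]) + phi0[1] * (g0[1] - x0[1])  # Phi0(g0 - x0)
print(g0, round(value, 4), round(dev, 4))
```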
Remark 2.6. (a) The equivalence 1° $\Leftrightarrow$ 2° of Theorem 2.4 admits the following geometric interpretation: For an element $g_0\in G$ we have $g_0\in\mathcal{F}_G(x_0)$ if and only if there exists a hyperplane $H_0$ that supports the ball $B = B(x_0, \sup_{g\in G}\|g-x_0\|)$ at $g_0$.

Indeed, let $r := \sup_{g\in G}\|g-x_0\|$. If $g_0\in\mathcal{F}_G(x_0)$ and $\Phi_0\in X^*$ is as in 2° of Theorem 2.4, then by Lemma 1.6, the hyperplane
$$H_0 := \{y\in X\mid \Phi_0(y) = \Phi_0(x_0)+r\} \qquad (2.40)$$
quasi-supports the ball $B$; also, by (2.37) and $g_0\in\mathcal{F}_G(x_0)$, we have $\|g_0-x_0\| = r$ and $\Phi_0(g_0-x_0) = \sup_{g\in G}\|g-x_0\| = r$, so $g_0\in B\cap H_0$. Conversely, if there exists a hyperplane $H_0$ that supports the ball $B = B(x_0, \sup_{g\in G}\|g-x_0\|)$ at $g_0$, then by Lemma 1.6, there exists a (unique) function $\Phi_0\in X^*$ with $\|\Phi_0\|=1$ such that we have (2.40). Then, since $g_0\in H_0$, we obtain (2.37), and hence, by Theorem 2.4, $g_0\in\mathcal{F}_G(x_0)$.

(b) For any $g_0\in\mathcal{F}_G(x_0)$ and any $\Phi_0\in X^*$ as in 2° of Theorem 2.4, we have (2.39),
$$\Phi_0(g-x_0) \le \|g_0-x_0\| \qquad (g\in G), \qquad (2.41)$$
and
$$\Phi_0(g_0) = \sup\Phi_0(G) \qquad (2.42)$$
(i.e., $\Phi_0\in N(G;g_0)$).

Indeed, by the implication 2° $\Rightarrow$ 1° above and (2.37), we have
$$\|g_0-x_0\| = \sup_{g\in G}\|g-x_0\| = \Phi_0(g_0-x_0),$$
so (2.39) holds. Also, by (2.36) and 1°, we have
$$\Phi_0(g-x_0) \le \|g-x_0\| \le \|g_0-x_0\| \qquad (g\in G),$$
i.e., (2.41). Furthermore, by (2.39) and (2.41),
$$\Phi_0(g_0) = \Phi_0(x_0) + \|g_0-x_0\| \ge \Phi_0(g) \qquad (g\in G),$$
whence, since $g_0\in G$, we obtain (2.42).

(c) For any $g_0\in\mathcal{F}_G(x_0)$ and any hyperplane $H_0$ as in (a) above, the sets $G$ and $B = B(x_0, \sup_{g\in G}\|g-x_0\|)$ lie in the same half-space $D_0 := \{y\in X\mid \Phi_0(y) \le \Phi_0(x_0) + \sup_{g\in G}\|g-x_0\|\}$, and $H_0$ supports the set $G$ at $g_0$; also, we have
$$g_0 \in P_{H_0}(x_0), \qquad (2.43)$$
i.e., $g_0$ is a nearest point to $x_0$ in $H_0$ (see Figure 2.6). Indeed, by (2.41) and $g_0\in\mathcal{F}_G(x_0)$, we have $G\subseteq D_0$; also, if $\|y-x_0\| \le \sup_{g\in G}\|g-x_0\|$, then by $\|\Phi_0\|=1$, we have $\Phi_0(y-x_0) \le \|y-x_0\| \le \sup_{g\in G}\|g-x_0\|$, so $B\subseteq D_0$. Furthermore, by (2.37) and (2.42), we have $\Phi_0(x_0) + \sup_{g\in G}\|g-x_0\| = \Phi_0(g_0) = \sup\Phi_0(G)$, which proves the second statement. Finally, $g_0\in H_0$ and, by (2.40) and (2.36),
$$\|g_0-x_0\| = \sup_{g\in G}\|g-x_0\| = \Phi_0(y-x_0) \le \|y-x_0\| \qquad (y\in H_0).$$
Theorem 2.5. Let $G$ be a subset of $X$, and $x_0\in X$. For an element $g_0\in G$ the following statements are equivalent:

1°. $g_0\in\mathcal{F}_G(x_0)$.

2°. There exists $\Phi_0\in X^*$ satisfying (2.36), (2.42), and
$$\bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = \max_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr|. \qquad (2.44)$$
Proof. If $g_0\in\mathcal{F}_G(x_0)$, then by Theorem 2.4 and Remark 2.6, there exists $\Phi_0\in X^*$ satisfying (2.36), (2.38), and (2.42). Also, by (2.42), (2.38), and Theorem 2.1, we have
$$\bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = |\Phi_0(g_0)-\Phi_0(x_0)| = \sup_{g\in G}\|g-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr|,$$
whence (2.44).
Conversely, assume that there exists $\Phi_0\in X^*$ satisfying (2.36), (2.42), and (2.44). Then by (2.42), (2.44), and Theorem 2.1,
$$|\Phi_0(g_0)-\Phi_0(x_0)| = \bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr| = \sup_{g\in G}\|g-x_0\|,$$
so $\Phi_0$ satisfies (2.36) and (2.38). Hence, by Theorem 2.4, $g_0\in\mathcal{F}_G(x_0)$. $\square$
Remark 2.7. By Corollary 1.1 and Lemma 1.5, Theorem 2.5 admits the following geometric interpretation: for an element $g_0\in G$ we have $g_0\in\mathcal{F}_G(x_0)$ if and only if there exists $\Phi_0\in X^*$ with $\|\Phi_0\|=1$, such that the hyperplane
$$\{y\in X\mid \Phi_0(y)=\sup\Phi_0(G)\} \qquad (2.45)$$
supports $G$ at $g_0$ and
$$\operatorname{dist}\bigl(x_0, H_{\Phi_0,\sup\Phi_0(G)}\bigr) = \max_{H\in\mathcal{H}_G} \operatorname{dist}(x_0, H), \qquad (2.46)$$
where $\mathcal{H}_G$ is the collection of hyperplanes defined in Remark 2.1(b); or equivalently, there exists a hyperplane $H_0$ that supports $G$ at $g_0$ and such that
$$\operatorname{dist}(x_0, H_0) = \max_{H\in\mathcal{H}_G} \operatorname{dist}(x_0, H) \qquad (2.47)$$
(see Figure 2.7).
Figure 2.7.
None of the conditions (2.42), (2.44) can be omitted in Theorem 2.5, as shown by the following examples:

Example 2.1. Let $X = l^1$, $G = \{\frac{n-1}{n}e_n\mid n=1,2,\ldots\}$, where $\{e_n\}$ denotes the sequence of unit vectors in $l^1$, and let $x_0=0$. Then the function $\Phi_0\in X^*$ defined by
$$\Phi_0(y) = \sum_{n=1}^{\infty}\eta_n \qquad (y=\{\eta_n\}\in l^1) \qquad (2.48)$$
satisfies (2.36). Furthermore,
$$\bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = \sup_n \Phi_0\Bigl(\frac{n-1}{n}e_n\Bigr) = \sup_n \frac{n-1}{n} = 1,$$
and by Theorem 2.1,
$$\sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr| = \sup_{g\in G}\|g-x_0\| = \sup_n \frac{n-1}{n} = 1,$$
so $\Phi_0$ satisfies (2.44). However, writing $g_n = \frac{n-1}{n}e_n$ $(n=1,2,\ldots)$, we have $G = \{g_n\mid n=1,2,\ldots\}$ and
$$\|g_n-x_0\| = \Bigl\|\frac{n-1}{n}e_n\Bigr\| = \frac{n-1}{n} < 1 \qquad (n=1,2,\ldots),$$
whence $\mathcal{F}_G(x_0)=\emptyset$. Also, $\Phi_0(g_n) = \frac{n-1}{n} < 1 = \sup\Phi_0(G)$ $(n=1,2,\ldots)$, so (2.42) does not hold, for any $g_0\in G$.
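The non-attainment in Example 2.1 is easy to see numerically. The truncation level below is our own choice and stands in for the infinite sequence.

```python
# Finite-dimensional illustration of Example 2.1: in l^1, with
# g_n = ((n-1)/n) e_n, every norm ||g_n|| = (n-1)/n stays strictly below the
# deviation sup_n ||g_n|| = 1, so no farthest point from x0 = 0 exists, even
# though Phi0 (summing the coordinates) has norm one and |sup Phi0(G)| = 1.

N = 10_000
norms = [(n - 1) / n for n in range(1, N + 1)]   # ||g_n|| = Phi0(g_n)

supremum = 1.0                                    # sup_n (n-1)/n
print(max(norms) < supremum)                      # the sup is not attained
print(supremum - max(norms) < 1e-3)               # ... yet it is approached
```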
Example 2.2. Let $X = R^2$, with the Euclidean norm, $G = \{(1,0), (1,1)\}$, and $x_0=0$. Then the function $\Phi_0\in X^*$ defined by
$$\Phi_0(y) = y_1 \qquad (y=(y_1,y_2)\in X) \qquad (2.49)$$
satisfies (2.36). Furthermore, for $g_0=(1,0)\in G$ we have $\Phi_0(g_0) = 1 = \sup\Phi_0(G)$, so $g_0$ satisfies (2.42). However,
$$\sup_{g\in G}\|g-x_0\| = \max(1,\sqrt{2}) = \sqrt{2} > 1 = \|g_0-x_0\|,$$
so $g_0\notin\mathcal{F}_G(x_0)$. Also, for the function $\Phi_1\in X^*$ defined by
$$\Phi_1(y) = \frac{\sqrt{2}}{2}(y_1+y_2) \qquad (y=(y_1,y_2)\in X), \qquad (2.50)$$
we have $\|\Phi_1\|=1$ and $\bigl|\sup\Phi_1(G)-\Phi_1(x_0)\bigr| = \sqrt{2} > 1 = \bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr|$, so (2.44) is not satisfied.
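The numbers in Example 2.2 can be verified directly:

```python
import math

# Verifying Example 2.2: G = {(1,0), (1,1)}, x0 = 0, Euclidean norm on R^2.
G = [(1.0, 0.0), (1.0, 1.0)]

dev = max(math.hypot(*g) for g in G)                 # sup ||g - 0|| = sqrt(2)

phi0 = lambda y: y[0]                                 # (2.49), norm one
phi1 = lambda y: (y[0] + y[1]) * math.sqrt(2) / 2     # (2.50), norm one

val0 = max(phi0(g) for g in G)                        # sup Phi0(G) = 1
val1 = max(phi1(g) for g in G)                        # sup Phi1(G) = sqrt(2)

# Phi0 attains its sup on G at g0 = (1,0), yet g0 is not farthest (1 < sqrt 2),
# and Phi0 violates the maximality condition (2.44), since val0 < val1 = dev.
print(round(dev, 4), round(val0, 4), round(val1, 4))
```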
Theorem 2.6. Let $X$ be a normed linear space, $x_0\in X$, and $G$ a subset of $X$ such that $G\ne\{x_0\}$. For an element $g_0\in G$, the following statements are equivalent:

1°. $g_0\in\mathcal{F}_G(x_0)$.

2°. There exists $\Phi_0'\in X^*$ such that
$$\Phi_0'(g_0-x_0) = 1, \qquad (2.51)$$
$$\|g_0-x_0\| = \frac{1}{\|\Phi_0'\|}, \qquad (2.52)$$
$$\frac{1}{\|\Phi_0'\|} = \max_{\substack{\Phi\in X^*\\ \exists g\in G,\ \Phi(g)\ge\Phi(x_0)+1}} \frac{1}{\|\Phi\|}. \qquad (2.53)$$
If $G$ is weakly compact, these statements are equivalent to:

3°. There exists $\Phi_0'\in X^*$ satisfying (2.51), (2.52), and
$$\frac{1}{\|\Phi_0'\|} = \max_{\substack{\Phi\in X^*\\ \sup\Phi(G)\ge\Phi(x_0)+1}} \frac{1}{\|\Phi\|}. \qquad (2.54)$$
Proof. 2° $\Rightarrow$ 1°. We shall show that if $\Phi_0'\in X^*$ satisfies (2.51)-(2.53), then $\Phi_0 := \frac{1}{\|\Phi_0'\|}\Phi_0'$ satisfies (2.36), (2.42), and (2.44), whence $g_0\in\mathcal{F}_G(x_0)$ by Theorem 2.5. Indeed, (2.36) is obvious. Also, by (2.53), (2.33), (2.10), $\Phi_0'\ne 0$, and (2.51), we have
$$\frac{1}{\|\Phi_0'\|} = \sup_{g\in G}\|g-x_0\| = \sup_{\substack{\Phi\in X^*\\ \|\Phi\|=1}} \bigl|\sup\Phi(G)-\Phi(x_0)\bigr| \ge \frac{\bigl|\sup\Phi_0'(G)-\Phi_0'(x_0)\bigr|}{\|\Phi_0'\|} \ge \frac{\Phi_0'(g_0-x_0)}{\|\Phi_0'\|} = \frac{1}{\|\Phi_0'\|}, \qquad (2.55)$$
so equality holds throughout in (2.55), which yields (2.44). Finally, by (2.55) and $g_0\in G$, we obtain
$$\frac{\Phi_0'(g_0-x_0)}{\|\Phi_0'\|} = \frac{\bigl|\sup\Phi_0'(G)-\Phi_0'(x_0)\bigr|}{\|\Phi_0'\|} \ge \sup\Phi_0(G)-\Phi_0(x_0) \ge \Phi_0(g_0-x_0) = \frac{\Phi_0'(g_0-x_0)}{\|\Phi_0'\|},$$
whence equality holds throughout and we obtain (2.42).

1° $\Rightarrow$ 2°. We shall show that if $G\ne\{x_0\}$ and $\Phi_0\in X^*$ satisfies (2.36), (2.42), and (2.44), then $\sup\Phi_0(G)\ne\Phi_0(x_0)$ and $\Phi_0' := \frac{1}{\sup\Phi_0(G)-\Phi_0(x_0)}\Phi_0$ satisfies (2.51)-(2.53); by Theorem 2.5, such a $\Phi_0$ exists whenever $g_0\in\mathcal{F}_G(x_0)$. Indeed, by (2.44), (2.10), and $G\ne\{x_0\}$ we have $\bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = \sup_{g\in G}\|g-x_0\| > 0$. Also, by (2.42), we have (2.51). Furthermore, by (2.36), the definition of $\Phi_0'$, and (2.42), we have
$$\frac{1}{\|\Phi_0'\|} = \bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = |\Phi_0(g_0-x_0)| \le \|g_0-x_0\|,$$
and on the other hand, by (2.44), (2.10), and since $g_0\in G$,
$$\frac{1}{\|\Phi_0'\|} = \bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr| = \sup_{g\in G}\|g-x_0\| \ge \|g_0-x_0\|, \qquad (2.56)$$
whence we obtain (2.52). Finally, by (2.56) and (2.33), we get
$$\frac{1}{\|\Phi_0'\|} = \sup_{g\in G}\|g-x_0\| = \max_{\substack{\Phi\in X^*\\ \exists g\in G,\ \Phi(g)\ge\Phi(x_0)+1}} \frac{1}{\|\Phi\|},$$
that is, (2.53).

2° $\Leftrightarrow$ 3°. This follows from the fact that if $G$ is weakly compact and $\Phi\in X^*$, then $\sup\Phi(G)$ is attained (see Lemma 1.3). $\square$

Now we shall study the existence of elements $g_0\in G$ for which the sup in the left-hand side of (2.10) is attained (i.e., of farthest points $g_0\in\mathcal{F}_G(x_0)$).

Definition 2.1. We shall call an optimal dual solution, or, briefly, optimal function (with respect to the pair $(G,x_0)$) any function $\Phi_0\in X^*$ with $\|\Phi_0\|=1$ at which the sup in the right-hand side of (2.10) is attained, i.e., satisfying (2.44).

Theorem 2.7. Let $G$ be a subset of a normed linear space $X$, and $x_0\in X$. The following statements are equivalent:

1°. $\mathcal{F}_G(x_0)\ne\emptyset$.

2°. There exists an optimal dual solution $\Phi_0\in X^*$ such that
$$\Phi_0 \text{ attains its supremum on } G. \qquad (2.57)$$
Proof. Condition (2.57) means that there exists $g_0\in G$ satisfying (2.42), so the result follows from Theorem 2.5. $\square$

Remark 2.8. By Lemma 1.5, for the hyperplane
$$H_0 := \{y\in X\mid \Phi_0(y)=\sup\Phi_0(G)\}, \qquad (2.58)$$
where $\Phi_0\in X^*$, $\|\Phi_0\|=1$, we have $\operatorname{dist}(x_0,H_0) = \bigl|\sup\Phi_0(G)-\Phi_0(x_0)\bigr|$. Hence, a function $\Phi_0\in X^*$ with $\|\Phi_0\|=1$ is optimal if and only if the hyperplane (2.58) satisfies (2.47); we shall call any such hyperplane an optimal hyperplane. Then Theorem 2.7 admits the following geometric interpretation: We have $\mathcal{F}_G(x_0)\ne\emptyset$ if and only if there exists an optimal hyperplane $H_0\in\mathcal{H}_G$ such that $H_0\cap G\ne\emptyset$.

In the particular case that $G$ is weakly compact, condition (2.57) can be omitted, as shown by the following:

Corollary 2.3. Let $G$ be a weakly compact subset of a normed linear space $X$, and $x_0\in X$. The following statements are equivalent:

1°. $\mathcal{F}_G(x_0)\ne\emptyset$.

2°. There exists an optimal dual solution $\Phi_0\in X^*$.
Proof. Since $G$ is weakly compact, every $\Phi_0\in X^*$ satisfies (2.57) (see Lemma 1.3), so the result follows from Theorem 2.7. $\square$

Corollary 2.3 is no longer true without the assumption of weak compactness, as shown by Example 2.1. Moreover, one can modify Example 2.1 to show that in Corollary 2.3 the assumption of weak compactness of $G$ cannot be replaced by the assumption of weak* compactness of $G$ when $X$ is a conjugate space:

Example 2.3. Let $X = l^1 = c_0^*$, $G = \{\frac{n-1}{n}e_n\mid n=1,2,\ldots\}\cup\{0\}$, and let $x_0=0$. Then $\frac{n-1}{n}e_n \to 0$ in the weak* topology, so $G$ is weak* compact. Furthermore, by the argument of Example 2.1, the function $\Phi_0\in X^*$ defined by (2.48) is optimal, but $\mathcal{F}_G(x_0)=\emptyset$.

Let us summarize the connections between existence of farthest points and existence of optimal dual solutions. To this end, we shall denote by $\mathcal{O}_G(x_0)$ the set of all optimal dual solutions with respect to the pair $(G,x_0)$.

Theorem 2.8. Let $G$ be a subset of a normed linear space $X$, and let $x_0\in X$. Then

(a) $\mathcal{F}_G(x_0)\ne\emptyset \Rightarrow \mathcal{O}_G(x_0)\ne\emptyset$;

(b) $\mathcal{O}_G(x_0)\ne\emptyset \not\Rightarrow \mathcal{F}_G(x_0)\ne\emptyset$;

(c) if $G$ is weakly compact, then $\mathcal{F}_G(x_0)\ne\emptyset \Leftrightarrow \mathcal{O}_G(x_0)\ne\emptyset$.

Proof. (a) is an obvious consequence of Theorem 2.5 (or of Theorem 2.7). (b) is shown by Examples 2.1 and 2.3. (c) is nothing else than Corollary 2.3. $\square$
3. Duality for Quasi-convex Supremization
Given a locally convex space $X$ with conjugate space $X^*$, a subset $G$ of $X$ and a quasi-convex function $f: X\to\overline{R}$, in this chapter we shall give duality results for the primal supremization problem
$$(P'_{G,f}) \qquad \alpha' = \alpha'_{G,f} = \sup f(G). \qquad (3.1)$$
Any $g_0\in G$ for which the sup in (3.1) is attained, i.e., such that
$$f(g_0) = \sup f(G), \qquad (3.2)$$
is called an optimal solution of problem $(P'_{G,f})$; these will be studied in Chapter 4. The set of all optimal solutions will be denoted by $\mathcal{M}_G(f)$, that is,
$$\mathcal{M}_G(f) := \{g_0\in G\mid f(g_0) = \sup f(G)\}; \qquad (3.3)$$
naturally, one can also write max instead of sup in (3.2) and (3.3). If $f$ is a quasi-convex function, then $(P'_{G,f})$ of (3.1) is called a problem of quasi-convex supremization. Taking $f' := -f$, which is a quasi-concave function, one can also write (3.1) as the infimization problem
$$-\alpha' = -\sup f(G) = \inf f'(G); \qquad (3.4)$$
thus, quasi-convex supremization is equivalent to quasi-concave infimization. However, here we shall consider only quasi-convex supremization. In contrast to the cases of convex and quasi-convex infimization (see Chapter 1, Section 1.4), it will turn out that for quasi-convex supremization the theory of surrogate duality is more developed (see Sections 3.1-3.3) than the theory of Lagrangian duality (see Section 3.4).
Our starting point for the study of surrogate duality will be the observation that worst approximation may be regarded as a particular case of supremization, by taking $X$ to be a normed linear space, $x_0\in X$, and $f: X\to\overline{R}$ the convex function (1.264); indeed, then
$$\sup f(G) = \delta(G, x_0), \qquad (3.5)$$
the deviation (2.1) of $G$ from $x_0$, and, for this case, the optimal solutions $g_0\in G$ of problem $(P'_{G,f})$ are the elements of worst approximation of $x_0$ by $G$. Although the extension from the particular function $f$ of (1.264) to a function $f: X\to\overline{R}$ on a locally convex space $X$ is a rather big step, it turns out that, similarly to the case of passing from best approximation by convex sets to convex infimization, many results and methods of the theory of worst approximation can be extended to results on the supremization of functions. Similarly to the fact that formula (1.249) on the distance to a convex set extends to the surrogate duality formula (1.330) on quasi-convex infimization, it is natural to expect that formula (2.11) on the deviation will extend, under certain assumptions on $G$ and $f$, to a formula like
$$\sup f(G) = \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y), \qquad (3.6)$$
obtained formally by replacing in (2.11) the function $f$ of (1.264) by a function $f$ on a locally convex space $X$; this will be achieved in Section 3.1. Next, corresponding to formula (1.355) on infimization, one would like to replace the hyperplanes $\{y\in X\mid \Phi(y)=\sup\Phi(G)\}$ of (3.6) by other sets, e.g., closed half-spaces. Therefore, in Section 3.2, we shall consider "unconstrained surrogate dual problems" to problem $(P'_{G,f})$ of (3.1), defined as supremization problems of the form
$$\beta' = \sup\lambda'(X^*\setminus\{0\}), \qquad (3.7)$$
where $X^*\setminus\{0\}$ is the dual set (unconstrained), and $\lambda' = \lambda'_{G,f}: X^*\setminus\{0\}\to\overline{R}$ is a function (the dual objective function, depending on $G$ and $f$) of the form
$$\lambda'(\Phi) = \inf f(\Omega_{G,\Phi}) \qquad (\Phi\in X^*\setminus\{0\}), \qquad (3.8)$$
with $\{\Omega_{G,\Phi}\}_{\Phi\in X^*\setminus\{0\}}$ being a family of subsets of $X$ related in some way to $G$. The right-hand side of (3.6) is indeed of the form (3.7), with $\lambda'$ of the form (3.8), where the surrogate constraint sets $\Omega_{G,\Phi}$ are the hyperplanes
$$\Omega_{G,\Phi} = \{y\in X\mid \Phi(y)=\sup\Phi(G)\} \qquad (\Phi\in X^*\setminus\{0\}). \qquad (3.9)$$
Problem (3.7), with $\lambda'$ of (3.8), is an unperturbational dual problem to $(P'_{G,f})$, since it is defined directly, without using the method of embedding first $(P'_{G,f})$ into a family of perturbed primal problems, and it is a surrogate dual problem to $(P'_{G,f})$, since it replaces the primal constraint set $G$ of (3.1) by a family of "surrogate constraint sets" $\Omega_{G,\Phi}\subseteq X$ $(\Phi\in X^*\setminus\{0\})$ (while it keeps the primal objective function $f$ unchanged). Next, more generally, in view of further applications, given an arbitrary set $X$, a subset $G$ of $X$ and a function $f: X\to\overline{R}$, for the supremization problem $(P'_{G,f})$ of (3.1) we shall consider in Section 3.3 a "surrogate dual problem" of the form
$$\beta = \beta_{G,f} = \sup\lambda(W), \qquad (3.10)$$
where $W = W_{G,f}$ is a set (the dual constraint set) and $\lambda = \lambda_{G,f}: W\to\overline{R}$ is the function (the dual objective function) defined by
$$\lambda_{G,f}(w) = \inf f(\Omega_{G,w}) \qquad (w\in W), \qquad (3.11)$$
with $\{\Omega_{G,w}\}_{w\in W}$ being a family of subsets of $X$ related in some way to $G$. Then, taking $X$ to be a locally convex space, $W = X^*\setminus\{0\}$, and $\lambda = \lambda'$ of (3.11), problem (3.10) reduces to problem (3.7), (3.8). Furthermore, taking $X$ to be a locally convex space, $W\subseteq X^*\setminus\{0\}$ or $W\subseteq(X^*\setminus\{0\})\times R$, and $\lambda = \lambda'$ of (3.11), we shall obtain some useful unconstrained and "constrained" surrogate dual problems to problem $(P'_{G,f})$ of (3.1). Actually, instead of $\{\Omega_{G,w}\}_{w\in W}$, we shall find it more convenient to use the equivalent language of polarities $\Delta: 2^X\to 2^W$ (this will be explained in Section 3.2). In Section 3.4 we shall deal with Lagrangian dual problems to problem $(P'_{G,f})$ of (3.1). Finally, the general dual problem (3.10) will permit us to study (unconstrained and constrained) surrogate duality for more structured primal supremization problems (i.e., in which the primal constraint set $G$ is expressed in more structured ways), by considering suitable dual constraint sets $W$ and dual objective functions $\lambda = \lambda_{G,f}: W\to\overline{R}$ as in (3.11) (see Section 3.5).
3.1 Some hyperplane theorems of surrogate duality

In this section we shall give some hyperplane theorems of surrogate duality, generalizing the (equivalent) geometric forms (2.11), (2.13) of Chapter 2, Theorem 2.1. Let us first give a lemma, in a somewhat more general form than needed in the sequel. For a linear space $X$, we shall denote by $X^*$ the set of all linear (not necessarily continuous) functions $\Phi: X\to R$.

Lemma 3.1. Let $X$ be a linear space, $\Phi\in X^*\setminus\{0\}$, and $f: X\to\overline{R}$ a convex function, and let
$$\omega(d) := \inf_{\substack{y\in X\\ \Phi(y)=d}} f(y) \qquad (d\in R). \qquad (3.12)$$
If
$$\omega(d) > -\infty \qquad (d\in R), \qquad (3.13)$$
then $\omega$ is finite and convex, and hence continuous on $R$.
Proof. Since $\Phi\ne 0$, we have $\{y\in X\mid \Phi(y)=d\}\ne\emptyset$, so $\omega(d)<+\infty$ $(d\in R)$, whence by (3.13), $\omega(R)\subseteq R$. Let $d_1, d_2\in R$, $0\le\mu\le 1$, and $\varepsilon>0$. Then by (3.12) and $\omega(R)\subseteq R$, there exist $y_1', y_2'\in X$ with $\Phi(y_1')=d_1$, $\Phi(y_2')=d_2$, such that $f(y_i')\le\omega(d_i)+\varepsilon$ $(i=1,2)$. But then, since $\Phi$ is linear and $f$ is convex, we obtain
$$\omega(\mu d_1+(1-\mu)d_2) = \inf_{\substack{y\in X\\ \Phi(y)=\mu d_1+(1-\mu)d_2}} f(y) \le f(\mu y_1'+(1-\mu)y_2') \le \mu f(y_1')+(1-\mu)f(y_2') \le \mu\omega(d_1)+(1-\mu)\omega(d_2)+\varepsilon,$$
which, since $\varepsilon>0$ was arbitrary, proves that $\omega$ is convex; being finite and convex on $R$, $\omega$ is continuous on $R$. $\square$

Theorem 3.1. Let $X$ be a locally convex space and $G$ a nonempty subset of $X$.

(a) If $f: X\to\overline{R}$ is a lower semicontinuous quasi-convex function, then
$$\sup f(G) \le \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.14)$$
(b) If either $G$ is bounded and $f: X\to\overline{R}$ is a convex function satisfying
$$\inf_{\substack{y\in X\\ \Phi(y)=d}} f(y) > -\infty \qquad (\Phi\in X^*\setminus\{0\},\ d\in R), \qquad (3.15)$$
or $G$ is weakly compact and $f: X\to\overline{R}$ is an arbitrary function, then
$$\sup f(G) \ge \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.16)$$
(c) Consequently, if either $G$ is bounded and $f: X\to\overline{R}$ is a lower semicontinuous convex function satisfying (3.15), or $G$ is weakly compact and $f: X\to\overline{R}$ is a lower semicontinuous quasi-convex function, then we have the equality (3.6).

Proof. (a) Let $f: X\to\overline{R}$ be a lower semicontinuous quasi-convex function, and assume, a contrario, that
$$\sup f(G) > \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.17)$$
f(y).
(3.18)
Then there exist go ^ G and s > 0 such that /(go)-^>
sup oexnio}
inf y^^
Hence,
$$f(g_0)-\varepsilon > \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y) \qquad (\Phi\in X^*\setminus\{0\}), \qquad (3.19)$$
and thus for any $\Phi\in X^*\setminus\{0\}$ there exists $y = y_\Phi\in X$ with
$$\Phi(y)=\sup\Phi(G), \qquad f(g_0)-\varepsilon > f(y). \qquad (3.20)$$
Case 1°: $f(g_0)<+\infty$. Let
$$S_{f(g_0)-\varepsilon}(f) := \{y\in X\mid f(y)\le f(g_0)-\varepsilon\}. \qquad (3.21)$$
Then, by (3.20), $S_{f(g_0)-\varepsilon}(f)\ne\emptyset$. Furthermore, since $f$ is a lower semicontinuous quasi-convex function, $S_{f(g_0)-\varepsilon}(f)$ is a closed convex set; also, since $f(g_0)<+\infty$, we have $g_0\notin S_{f(g_0)-\varepsilon}(f)$. Hence, by the strict separation theorem, there exists $\Phi_0\in X^*\setminus\{0\}$ such that
$$\Phi_0(g_0) > \sup\Phi_0\bigl(S_{f(g_0)-\varepsilon}(f)\bigr). \qquad (3.22)$$
We claim that
$$\inf_{\substack{y\in X\\ \Phi_0(y)=\sup\Phi_0(G)}} f(y) > f(g_0)-\varepsilon; \qquad (3.23)$$
indeed, otherwise, there would exist $y_0\in X$ with $\Phi_0(y_0)=\sup\Phi_0(G)$ such that $f(y_0)\le f(g_0)-\varepsilon$ (so $y_0\in S_{f(g_0)-\varepsilon}(f)$), whence by (3.22),
$$\Phi_0(y_0) = \sup\Phi_0(G) \ge \Phi_0(g_0) > \sup\Phi_0\bigl(S_{f(g_0)-\varepsilon}(f)\bigr) \ge \Phi_0(y_0),$$
a contradiction. But (3.23) contradicts (3.19), which proves (3.14) for case 1°.

Case 2°: $f(g_0)=+\infty$. Then by (3.18) there exists $d\in R$ such that
$$d > \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.24)$$
Then by (3.24), $S_d(f)\ne\emptyset$, and by $f(g_0)=+\infty$, we have $g_0\notin S_d(f)$, so the above argument of case 1° yields (3.23) with $f(g_0)-\varepsilon$ replaced by $d$, which contradicts (3.24). This proves (3.14) for case 2°.

(b) Let $G\subseteq X$ be a (nonempty) bounded set (hence $\sup\Phi(G)\in R$ for all $\Phi\in X^*$), and $f: X\to\overline{R}$ a convex function satisfying (3.15). Then by Lemma 3.1, the function $\omega$ of (3.12) is continuous on $R$, for all $\Phi\in X^*\setminus\{0\}$. Assume now, a contrario, that
$$\sup f(G) < \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.25)$$
Then there exists $\Phi_0\in X^*\setminus\{0\}$ such that
$$\sup f(G) < \inf_{\substack{y\in X\\ \Phi_0(y)=\sup\Phi_0(G)}} f(y) = \omega\bigl(\sup\Phi_0(G)\bigr). \qquad (3.26)$$
Choose a sequence $\{g_n\}\subseteq G$ such that $\Phi_0(g_n)\to\sup\Phi_0(G)$. Then
$$\omega\bigl(\Phi_0(g_n)\bigr) = \inf_{\substack{y\in X\\ \Phi_0(y)=\Phi_0(g_n)}} f(y) \le f(g_n) \le \sup f(G) \qquad (n=1,2,\ldots), \qquad (3.27)$$
whence
$$\omega\bigl(\sup\Phi_0(G)\bigr) = \lim_{n\to\infty}\omega\bigl(\Phi_0(g_n)\bigr) \le \sup f(G), \qquad (3.28)$$
in contradiction to (3.26). Thus, (3.25) cannot hold, which proves (3.16).

Assume now that $G$ is weakly compact and $f: X\to\overline{R}$ is an arbitrary function satisfying (3.25), and hence (3.26), for some $\Phi_0\in X^*\setminus\{0\}$. Then, since $G$ is weakly compact, there exists $g_0\in G$ such that $\Phi_0(g_0)=\sup\Phi_0(G)$ (see Lemma 1.3), whence
$$\omega\bigl(\sup\Phi_0(G)\bigr) = \omega\bigl(\Phi_0(g_0)\bigr) = \inf_{\substack{y\in X\\ \Phi_0(y)=\Phi_0(g_0)}} f(y) \le f(g_0) \le \sup f(G),$$
in contradiction to (3.26). Thus, (3.25) cannot hold, which proves (3.16).

Finally, part (c) is an obvious consequence of parts (a) and (b). $\square$
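Formula (3.6) can be tested numerically in a finite-dimensional instance of Theorem 3.1(c). The sketch below is not from the text; the set $G$, the convex quadratic $f$, and the sampling grid are our own choices. For $\Phi = u$ with $\|u\|_2=1$ and $c = \sup\Phi(G)$, Lagrange multipliers give the closed form $\inf\{y^{\mathsf T}Qy : \langle u,y\rangle=c\} = c^2/(u^{\mathsf T}Q^{-1}u)$ used in the loop.

```python
import math

# Sanity check of formula (3.6): for the compact set G in R^2 and the lower
# semicontinuous convex function f(y) = y1^2 + 2*y2^2 (Q = diag(1, 2)), the
# primal value sup f(G) should equal the surrogate dual value
#   sup_{Phi != 0} inf {f(y) : Phi(y) = sup Phi(G)}.

G = [(1.0, 0.0), (0.0, 1.0), (-0.5, -0.5)]

f = lambda y: y[0] ** 2 + 2 * y[1] ** 2
alpha = max(f(g) for g in G)                 # primal value sup f(G)

beta = -math.inf
n = 3600
for k in range(n):
    t = 2 * math.pi * k / n
    u = (math.cos(t), math.sin(t))
    c = max(u[0] * g[0] + u[1] * g[1] for g in G)   # sup Phi(G)
    quad = u[0] ** 2 * 1.0 + u[1] ** 2 * 0.5        # u' Q^{-1} u
    beta = max(beta, c ** 2 / quad)          # inf of f over the hyperplane

print(round(alpha, 4), round(beta, 4))
```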
Remark 3.1. (a) Formula (3.6) admits the following geometric interpretation:
$$\sup f(G) = \sup_{H\in\mathcal{H}_G} \inf f(H), \qquad (3.29)$$
where $\mathcal{H}_G$ denotes the family of all (closed) hyperplanes that quasi-support the set $G$. Thus, the reduction principle of Remark 2.1(b) extends now to the following form: formula (3.29) reduces the computation of $\sup f(G)$ to the computation of $\inf f(H)$ for the hyperplanes $H\in\mathcal{H}_G$.

(b) Theorem 3.1 remains valid if we also permit $\Phi=0$ in (3.6), since
$$\sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y) = \sup_{\Phi\in X^*}\ \inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y). \qquad (3.30)$$
Indeed, the inequality $\le$ in (3.30) is obvious. On the other hand, for each $\Phi\in X^*\setminus\{0\}$ and for $\Phi_0=0$ we have
$$\inf_{\substack{y\in X\\ \Phi(y)=\sup\Phi(G)}} f(y) \ge \inf f(X) = \inf_{\substack{y\in X\\ \Phi_0(y)=\sup\Phi_0(G)}} f(y),$$
whence we obtain the inequality $\ge$ in (3.30). However, formula (3.6) has the advantage that it is a "hyperplane theorem" (by (3.29)), while for $\Phi_0=0$ we have $\{y\in X\mid \Phi_0(y)=\sup\Phi_0(G)\}=X$, which is not a hyperplane.

(c) The assumption of boundedness of $G$ cannot be omitted in parts (b) and (c) of Theorem 3.1. Indeed, for example, if $X=G=R$ and $f(y)=1$ for all $y\in X$, then the right-hand sides of (3.16) and (3.6) are $+\infty$ (since $\sup\Phi(G)=+\infty$, so $\{y\in X\mid \Phi(y)=\sup\Phi(G)\}=\emptyset$, for all $\Phi\in X^*\setminus\{0\}$), but $\sup f(G)=1$.
In the particular case $G=\{x_0\}$ (a singleton, hence weakly compact), from Theorem 3.1 we obtain the following corollary:

Corollary 3.1. Let $X$ be a locally convex space, $x_0\in X$, and $f: X\to\overline{R}$ a lower semicontinuous quasi-convex function. Then
$$f(x_0) = \sup_{\Phi\in X^*\setminus\{0\}}\ \inf_{\substack{y\in X\\ \Phi(y)=\Phi(x_0)}} f(y). \qquad (3.31)$$
Remark 3.2. Formula (3.31) admits the geometric interpretation
$$f(x_0) = \sup_{\substack{H\in\mathcal{H}\\ x_0\in H}} \inf f(H), \qquad (3.32)$$
where $\mathcal{H}$ denotes the family of all hyperplanes in $X$.

The sup in (3.31) need not be attained, even when $f$ is finite, as shown by the following example:

Example 3.1. Let $B$ be a nonreflexive Banach space, let $X=B^*$, endowed with the weak* topology $\sigma(B^*,B)$, and let
$$f(y) = \|y\| \qquad (y\in X). \qquad (3.33)$$
Then $X$ is a locally convex space and $f$ is a finite lower semicontinuous convex function on $X$ (but it is not continuous at any $x_0\in X$). Hence, by Corollary 3.1, we have (3.31), which, since $X^* = (B^*, \sigma(B^*,B))^*$ can be identified with $B$, means that
$$\|x_0\| = \sup_{b\in B\setminus\{0\}}\ \inf_{\substack{x\in B^*\\ x(b)=x_0(b)}} \|x\| = \sup_{b\in B\setminus\{0\}} \operatorname{dist}(0, H_{b,x_0}) \qquad (x_0\in B^*), \qquad (3.34)$$
where
$$H_{b,x_0} = \{x\in B^*\mid x(b)=x_0(b)\}. \qquad (3.35)$$
Thus, for each $b\in B\setminus\{0\}$, $H_{b,x_0}$ is a hyperplane in $B^*$, and therefore, by Lemma 1.5, $\operatorname{dist}(0, H_{b,x_0}) = |x_0(b)|/\|b\|$. Consequently, (3.34) becomes the well-known formula
$$\|x_0\| = \sup_{b\in B\setminus\{0\}} \frac{|x_0(b)|}{\|b\|} \qquad (x_0\in B^*). \qquad (3.36)$$
But since B is nonreflexive, by a well-known theorem of R.C. James (see, e.g., [40], p. 63, part (7)), there exists XQ e B* for which the sup in (3.36), and hence in (3.34), is not attained. For the next hyperplane theorem we need some preparation.
Lemma 3.2. Let $X$ be a locally convex space, $\Phi\in X^*\setminus\{0\}$, and $f: X\to\overline{R}$ a function such that the (possibly empty) sets
$$\Omega_r = \{\Phi(y)\mid y\in X,\ f(y)\le r\} \qquad (r\in R) \qquad (3.37)$$
are closed in $R$. Then the function $\omega: R\to\overline{R}$ defined by (3.12) is lower semicontinuous.

Proof. Let $r\in R$, $\{z_n\}\subseteq S_r(\omega) = \{z\in R\mid \omega(z)\le r\}$, $z_n\to z_0$. Then, given $\varepsilon>0$, by $z_n\in S_r(\omega)$ and (3.12) there exist $y_n\in X$ with $\Phi(y_n)=z_n$ such that
$$f(y_n) \le \omega(z_n)+\varepsilon \le r+\varepsilon, \qquad (3.38)$$
so $z_n = \Phi(y_n)\in\Omega_{r+\varepsilon}$ $(n=1,2,\ldots)$. Since $\Omega_{r+\varepsilon}$ is closed, it follows that $z_0\in\Omega_{r+\varepsilon}$, i.e., there exists $y_0\in X$ with $\Phi(y_0)=z_0$ and $f(y_0)\le r+\varepsilon$, whence $\omega(z_0)\le r+\varepsilon$. Since $\varepsilon>0$ was arbitrary, we obtain $\omega(z_0)\le r$, that is, $z_0\in S_r(\omega)$. Thus each level set $S_r(\omega)$ is closed, and hence $\omega$ is lower semicontinuous. $\square$
3.2 Unconstrained surrogate dual problems for quasi-convex supremization

While in the preceding section we have been concerned with "hyperplane theorems" of surrogate duality, now we want to consider as well other types of surrogate duality results for supremization, e.g., "half-space theorems." To this end, as mentioned at the beginning of this chapter, we shall consider for the supremization problem $(P'_{G,f})$ of (3.1) a "surrogate dual problem" of the form (3.10), where $W = W_{G,f}$ is a set (the dual constraint set) and $\lambda = \lambda_{G,f}: W\to\overline{R}$ is the function (the dual objective
function) defined by (3.11), with $\{\Omega_{G,w}\}_{w\in W}$ being a family of subsets of $X$ related in some way to $G$. Our main tool will be the unifying framework of polarities $\Delta: 2^X\to 2^W$. We shall first give some results using arbitrary polarities, and then we shall apply them to various concrete polarities. Let us fix, for simplicity of notation, the set $G\subseteq X$, so that we can write $\Omega_w$ instead of $\Omega_{G,w}$. We then have the following basic lemma.

Lemma 3.3. Given two sets $X$, $W$ and a family $\{\Omega_w\}_{w\in W}$ of subsets of $X$, there exists a unique polarity $\Delta: 2^X\to 2^W$ satisfying
$$\Omega_w = \complement\Delta'(\{w\}) \qquad (w\in W), \qquad (3.39)$$
namely, the mapping defined by
$$\Delta(C) := \{w\in W\mid C\subseteq\complement\Omega_w\} = \{w\in W\mid C\cap\Omega_w=\emptyset\}. \qquad (3.40)$$
Proof. By (1.150) and (3.40), for any $w\in W$ we have
$$\complement\Delta'(\{w\}) = \{x\in X\mid w\notin\Delta(\{x\})\} = \{x\in X\mid x\in\Omega_w\} = \Omega_w,$$
which proves (3.39). Furthermore, if $\Delta$ is a polarity satisfying (3.39), then, by (1.144) and (3.39), we obtain
$$\Delta(C) = \{w\in W\mid C\subseteq\complement\Omega_w\},$$
that is, (3.40), which proves the uniqueness of $\Delta$. $\square$
Remark 3.3. (a) Using (3.39) and (1.144), the dual objective function $\lambda$ of (3.11) becomes
$$\lambda^\Delta(w) = \inf f\bigl(\complement\Delta'(\{w\})\bigr) = \inf_{\substack{x\in X\\ w\in\complement\Delta(\{x\})}} f(x) \qquad (w\in W), \qquad (3.41)$$
where $\Delta = \Delta_G: 2^X\to 2^W$ is a polarity (depending on $G$, but not on $f$). Then, by (3.10) and (3.41), the dual value (i.e., the value of the dual problem) becomes
$$\beta^\Delta = \sup_{w\in W} \inf f\bigl(\complement\Delta'(\{w\})\bigr) = \sup_{w\in W}\ \inf_{\substack{x\in X\\ w\in\complement\Delta(\{x\})}} f(x). \qquad (3.42)$$
Formulas (3.40) and (3.39) yield a one-to-one correspondence between families of subsets $\{\Omega_w\}_{w\in W}$ of $X$ and polarities $\Delta: 2^X\to 2^W$, so the two languages (3.11), (3.10) and (3.41), (3.42) are equivalent ways of expressing the dual objective function $\lambda^\Delta$ and the dual value $\beta^\Delta$. In the sequel we shall choose the language (3.41), (3.42), since this will allow us, by using (1.140), to express the results, e.g., on the relations between the primal and dual problems, in a more concise way.

(b) If there exists $w_0\in W$ such that $\complement\Delta'(\{w_0\})=\emptyset$, or, equivalently, $\Delta'(\{w_0\})=X$, then by (3.41), we have $\lambda^\Delta(w_0)=\inf\emptyset=+\infty$, and hence by (3.42), $\beta^\Delta=+\infty$. Thus,
$$\beta^\Delta<+\infty \;\Rightarrow\; \complement\Delta'(\{w\})\ne\emptyset,\ \text{i.e.,}\ \Delta'(\{w\})\ne X \qquad (w\in W). \qquad (3.43)$$
(c) In the particular case of Theorems 3.1 and 3.2, we have W = X*\{0}, and by (3.9) and (3.39), the surrogate constraint sets are CA\[^})
= {y eX\ ^(y) = sup 0(G)}
(O e X*\{0}),
(3.44)
where A = AG : 2^ ^ 2^*\<^^ is the polarity A^ of (1.166); also, the dual objective function (3.41) is A1(0)=
inf
f(y)
(cD6X*\{0}).
(3.45)
yeX )=supO(G)
We shall first give some necessary and sufficient conditions on G, / , and A, in order that a > )S^ or or < )6^ or a = )6^, where a e R is arbitrary, in terms of the level sets Sd(f) and Aj(/) of (1.22) and (1.23). Lemma 3.4. Let X be a set, Q <^ X, f: X ^ ^, and d e R. (a) We have inf/(^) > d
(3.46)
if and only if Ad(f)nQ
= &.
(3.47)
(h)If i n f / ( ^ ) > d,
(3.48)
SAf)nQ
(3.49)
then = id.
Proof (a) If yo e Ad{f) H ^ , then i n f / ( ^ ) < fiyo) < d. Conversely, if i n f / ( ^ ) < d, then ^ 7^ 0 and there exists >'o ^ ^ such that f(yo) < d, so
yoeAAf)nQ. (b) If there exists yo e S^if) n ^ , then i n f / ( ^ ) < f(yo) < d, so (3.48) cannot hold. n Proposition 3.1. Let X, W be two sets, f: X -^ R a function, A: 2^ -> 2^ a polarity, and a e R. The following statements are equivalent: 1°. We have a>^l
= sup inf /(CA^({it;})).
(3.50)
2°. We have Ad{f) n ZA\{W})
7^ 0
{w eW,deR,d>a).
(3.51)
7^ 0
{w eW,deR,d>oi).
(3.52)
3°. We have Sdif) n ZA\[W])
3.2 Unconstrained surrogate dual problems for quasi-convex supremization
111
Proof, r =4^ 2°. If r holds, then for each w; G W and J G /?, J > a, we have d > inf f(CA\{w})), whence by Lemma 3.4(a), we obtain (3.51). The implication 2° =^ 3° is obvious. 3° =» 1°. If 3° holds, then by Lemma 3.4 (b), we have X'^(w) = inf f(ZA\{w}))
(w e W,d e R,d > a),
whence ^^ = sup A,^(W) < infj>(^ d — ot.
•
Proposition 3.2. Let X, W be two sets, f: X ^ ^ a function, A: 2^ -> 2^ a polarity, and a G R. The following statements are equivalent: r. We have a
sup inf f(CA\{w})).
(3.53)
weW
2°. For each d e R, d < a, there exists w^ e W such that AAf)nCA'(lwd})^&.
(3.54)
3°. For each d e R, d < a, there exists Wd £ W such that SAf)nCA\{wd})
= &-
(3.55)
Proof r =^ 3°. If 1° holds and J G /?, J < a, then d
supinf/(CA'({K;})), weW
and hence there exists w^ e W such that d < inf / ( C A ' ( { M ; J } ) ) . Then by Lemma 3.4(b), we have (3.55). The implication 3° => 2° is obvious. 2° => 1°. Ifd and wj are as in 2°, then by Lemma 3.4(a), we have X^iWd) = mf
f(CA\{w,}))>d.
whence ^^ = sup A,^(W) > sup^^^ d = a.
•
Combining Propositions 3.1 and 3.2, we obtain the following result: Theorem 3.3. Let X, W be two sets, f: X ^^ R a function, A: 2^ -> 2 ^ a polarity, and a G R. The following statements are equivalent: 1°. We have a = Pl=
sup inf f(CA\[w}))\
(3.56)
weW
2°. We have (3.51) and for each d e R, d < a, there exists Wd e W satisfying (3.54). 3°. We have (3.52) and for each d e R, d < a, there exists Wd ^ W satisfying (3.55).
112
3. Duality for Quasi-convex Supremization
Now we shall give, for the case when a = a^ of (3.1), some convenient sufficient conditions in order that a' > yS^ or a' < ^^ or a' = yS^- ^^ ^^is end, let us first prove a lemma: Lemma 3.5. Let X and W be two sets, A: 2^ ^ following statements are equivalent: 1°. We have
2^ a polarity and XQ e X. The
H[xo]) = 0.
(3.57)
2°. We have xo ^ U^ew^\M).
(3.58)
Proof. If (3.58) does not hold, i.e., if there exists WQ ^ W such that XQ e A'({ifo}), then A({jco}) ^ AA'({u;o}) ^ w^o, so (3.57) does not hold. Conversely, if we do not have (3.57), i.e., if there exists WQ e A({jco}), then A\{wo}) 5 A'A({jco}) 3 JCo, so (3.58) does not hold. D Theorem 3.4. Let X and W be two sets, f: X ^ R a function, and AG : 2^ -)• 2 ^ (G C X) a family of polarities such that for any G C. X we have A{,}(lg}) = id
(geG).
(3.59)
inf/(CA^c({u;})) < supinf/(C A;^J({U;}))
(W
e W).
(3.60)
geG
Then, given G C. X, sup/(G)>)Sl^.
(3.61)
Moreover, if we have (3.59), (3.60) and f is A^^ Ac-quasi-convex, then sup/(G) = ^ l ^ .
(3.62)
Proof By (3.59) and Lemma 3.5, we have g e C Aj^}({w;}) (g e G,w e W), whence inf/(CA;^J({W;})) < f{g) (g eG,w e W). Therefore, by (3.60), inf/(CA^^({u;})) < supinf/(CA;^j({u;})) < sup/(G)
(w e W),
geG
and hence by (3.42), we obtain (3.61). Furthermore, if also / is Aj^Ac-quasiconvex, then by (3.111) below (applied to A = AG), we have sup / ( G ) =
sup
inf f(CA'a({w})) < sup inf f{CA'^({w})) = ^ 1 ^ ,
weCAciG)
whence by (3.61), we obtain (3.62).
ujeW
D
3.2 Unconstrained surrogate dual problems for quasi-convex supremization
113
Theorem 3.5. Let X and W be two sets, G a subset of X, f: X -> R a function, and A: 2^ -> 2 ^ a polarity. The following statements are equivalent, where a, = a'= sup f(G): r. We have
2°. We have
3°. We have
4°. We have
{a=)supfiG)
(3.63)
A(5,(/)) # 0
(d< a).
(3.64)
HAAf))
(d< a).
(3.65)
/ 0
Aa(f) c U^^wA\{w]).
(3.66)
Proof 4° ^ 1 ° . We have 4° if and only if for each d < a there exists Wd e W such that Ad(f) c A\{wd}), i.e., such that A j ( / ) n CA'({W;J}) = 0, which, by Proposition 3.2, is equivalent to sup / ( G ) < ^^. 2° =:^ 3°. If 2° holds, then since AAf) £ SAf), we have A(Aj(/)) 3 A(5^(/)) 7^ 0 (J < a). 3° => 4°. If w;j 6 A(Aj(/)) (J < a), then (d < a), Ad(f) C A'A(A^(/)) C A'({u;^}) C U^^wA\{w}) whence A«(/) = U^<«A^(/) c U^jew^'iM). 4° =^2°. If 4° holds, then for each d < a there exists Wd e W such that 5 j ( / ) c A J / ) c A'({it;j}), whence A{Sd(f)) 2 AA^({u;^}) 3 u;^.
D
Corollary 3.2. L^r Z, W be two sets, G a subset ofX,f:X^^R a function, and A: 2^ ^^ 2^ a polarity. Iffor each d < sup / ( G ) , the level set Sd(f) is A Q Acconvex (in particular, if f is A'Q AQ-quasi-convex), then (3.63) holds. Proof. The assumption that Sd{f) is AJ^Ac-convex {d < sup/(G)) means that for each d < sup / ( G ) and x e CSd(f) there exists w = Wd,x ^ W such that Sdif) c A'cd^dJ),
X e CA'a({^d,x})-
(3.67)
Hence, for each ^f < sup/(G) we have A j ( / ) c Sd(f) ^ Uu;evrA^(^({w;}), whence (3.66), and thus, by Theorem 3.5, implication 4° =^ 1°, there follows (3.63). D Combining theorems 3.4 and 3.5, we obtain the following corollary: Corollary 3.3. If we have (3.59), (3.60), and (3.66), then
sup f(G) = PI
(3.68)
114
3. Duality for Quasi-convex Supremization
Concerning simultaneous characterizations of optimal solutions of (P^) and of weak duality a^ = i^A, let us prove the following theorem: Theorem 3.6. Let X and W be two sets, G a subset ofX.f'.X^^R a function, and A: 2^ -> 2 ^ « polarity. For an element go e G and for a = ot^ = sup / ( G ) , the following statements are equivalent: r . We have go G Mdf) {i.e., /(go) = max / ( G ) ) and a = p^. T. We have AAf)
n CA\{W})
^0
(weW^deR,d>
/(go)),
(3.69)
and for each d e R, d < a, there exists w^ ^ W satisfying (3.54). 3°. We have SAf) n CA\{W})
#0
(weW,deR,d>
f(go)),
(3.70)
and for each d e R, d < a, there exists Wd ^ W satisfying (3.55). Proof r => 2°. If r holds, then f(go) = sup / ( G ) = a, and hence by Theorem 3.3, we have 2°. 2° ^ r. Assume 2°. Then by (3.69) and Proposition 3.1 (with a = /(go)), we have /(go) ^ P^- Furthermore, by the second condition of 2° and by Proposition 3.2, we have (3.53) with a = sup / ( G ) . Hence by go e G, we obtain Pl>
a = sup f(G) > f (go) > PI
Finally, the proof of the equivalence 1° <^ 3° is similar.
D
In the remainder of this section we shall assume, often without any special mention, that X is a locally convex space, with conjugate space X*, and G C. X, and we shall apply the preceding results to the special polarities A^ A^ of Chapter 1, Section 1.2. (1) By (1.155), for the polarity A = A^ : 2^ -> 2^*^^^^ of (1.154) the dual value (3.42) becomes fil^ =
sup
inf/(C(A^)'({0})) =
"
^^^*\^«}
sup
inf
f(x).
(3.71)
^^^*\{OUu)>s'upcI>(G)
Note also that for any 4> G X* we have supO(G) > —(X), since G 7^ 0. On the other hand, if there exists 4>o G X* such that sup Oo(G) = +00, then by (1.155) and a>o(x) G R, we have C(A]jy{{^o}) = {x e X| cDo(x) > supOo(G)} = 0. Hence by (3.71), ^^, < +00 =^ sup 0(G) e R(^
e X*).
(3.72)
3.2 Unconstrained surrogate dual problems for quasi-convex supremization
115
Theorem 3.7. Let X be a locally convex space, f: X ^^ R a function, and G a subset of X.
i^)If inf
fix)
< sup
inf
^
OU)>0(g)
f(x)
(O e X*\{0}),
(3.73)
then sup/(G) >
sup
inf
/(jc).
(3.74)
f(x)
(3.75)
^^^*\
(b) There holds sup/(G) <
sup
inf
if and only if for each d < sup / ( G ) there exists O = Oj e X*\{0} such that 0(y)<sup(D(G)
(yeSAf))
(3.76)
(by Lemma 1.10(c), this condition is satisfied, e.g., when f is (AQYA^j-quasiconvex). (c) If we have (3.73), then for each d < sup / ( G ) there exists O = Oj e Z*\{0} satisfying (3.76) if and only if sup/(G) =
sup
inf
fix).
(3.77)
^^^*\{OU(.)>s'upc|>(G)
Proof (a) This follows from Theorem 3.4 for A = A^ of (1.154), using (1.156), (1.155) and <^ix) >
C(A|^P^({0}) = {xeX\
X*\{0}).
(3.78)
Alternatively, one can also give the following direct proof: We have sup / ( G ) > f(g) >
inf
fix)
(^ E G, CD e X*\{0}),
xeX 0(^)>0(g)
whence by (3.73), sup/(G) >
sup
sup
inf
OeX*\{0} seG ^ ( / ) | i ( ^ )
fix)>
sup
inf
fix)-
eX*\{0} ^(,)>-,t.p 0(G)
(b) This follows from Theorem 3.5, equivalence 2° <^ 1°, applied to AJ. , since formulas (3.75) and (3.76) mean, respectively, (3.63) for A = AQ and O G Al^iSdf))(c) This follows by combining parts (a) and (b). •
116
3. Duality for Quasi-convex Supremization
Remark 3.4. (a) Theorem 3.7 is a "half-space theorem of surrogate duality," since for each O e Z*\{0} the surrogate constraint set Q<^ = ^G,
(3.79)
VeVc
where VG denotes the family of all closed half-spaces that quasi-support the set G and do not contain G (see Corollary 1.3). (b) Theorem 3.7 remains valid, with the same proof, if we replace in it, and in the definition of A^^; Z*\{0} by any subset W o/X*\{0}. (c) In formula (3.73) the inequality sign can be replaced by equality, since the opposite inequality always holds. A similar remark holds also for formulas (3.171) and (3.185) below. Corollary 3.4. Let X be a locally convex space, G C X, and f: X ^^ R a function such that for each d < sup / ( G ) the level set Sdif) is evenly convex {e.g., let f be evenly quasi-convex). Then we have (3.75). Proof For each d < a = sup / ( G ) there exists gd ^ G such that f(gd) > d, that is, gd G ZSdif). Hence, since Sdif) is evenly convex, there exists Oj € Z*\{0} such that <^d{y) < ^d(gd)
(y e Sdif)).
(3.80)
Then, by (3.80) and g^ G G, we have <^diy) < ^digd) < sup
iy e
Sdif)).
Consequently, by Theorem 3.7(b), we obtain (3.75).
D
Definition 3.1. For a locally convex space X and O e X*\{0}, a function / : X ^ R is called regular with respect to O if inf fix) = sup xeX (x)>d
inf
fix)
id e R).
(3.81)
j/^r, xeX j'<5^(^)>^'
For example, it is known (see [244], Remark 3.2) that if f: R" ^^ R and / is convex, then / is regular with respect to all O e iR"^)*. Using Theorem 3.7(a), let us prove the following: Corollary 3.5. Let X be a locally convex space, G a subset ofX such that supcD(G)G/?
(OGX*),
(3.82)
and f: X ^^ R a function that is regular with respect to all ^ G X*\{0}. Then we have (3.74).
3.2 Unconstrained surrogate dual problems for quasi-convex supremization
117
Proof. Let ^ e X*\{0}. Then by our assumption of regularity, we have (3.81) for d = sup 0(G) 6 R, i.e., inf
fix) =
sup
inf f(x).
(3.83)
But for each d' e R with d' < supO(G) there exists g = g^> e G such that d' < ^{g), whence [x e X\ ^(x) > d'] ^ [x e X\ 0(x) > 0(g)}. Consequently, sup
inf f{x)<
sup
inf
rl'czl?
X^X
oad
X^X
/(jc),
(3.84)
which, together with (3.83), yields (3.73). Hence, by Theorem 3.7(a), we get (3.74). D Combining Corollaries 3.4 and 3.5, we obtain the following: Corollary 3.6. Let X be a locally convex space, G a subset of X satisfying (3.82), and f:X—>R a function that is regular with respect to all O e Z*\{0} and such that for each d < sup f(G) the level set Sdif) is evenly convex (the latter condition is satisfied, e.g., when f is evenly quasi-convex). Then we have (3.77). (2) By (1.161), for the polarity A = A^ : 2^ -^ 2^*^^^^ of (1.160) the dual value (3.42) becomes Pl2=
sup inf/(C(A2.)^({CI>})) =
sup
inf
f(x).
(3.85)
Note that if there exists 4>o e X*\{0} such that sup Oo(G) = +oo, then by (1.161) and cDo(x) e R, we have C{Aly({^o]) = {x e X| CDO(JC) > supOo(G)} = 0. Hence by (3.85), P'
< +00 =^ sup 0(G) G /? (O G X*).
(3.86)
Lemma 3.6. (a) For any set G, i^l. < PI2. ^G
(3.87)
^G
(b) If X is a locally convex space, G c X, and f: X ^^ R is upper semicontinuous, then Pl^ =^'^2. ^G
(3.88)
^G
Proof (a) By the definitions, we have fi'^, =
sup
inf (jc)>sup4)(G)
fix) <
sup
inf
^^ ' 0(jc)>sup(G)
f(x) = ^^2 •
118
3. Duality for Quasi-convex Supremization (b) We have
{x eX\^(x)
>supO(G)} = {jc eX\
(O G X*\{0}),
and hence if f: X ^^ R is upper semicontinuous, then by Lemma 1.1,
inf
xeX
fix) =
inf
fix)
(O e X*\{0}),
xeX
which yields (3.88).
D
Let us observe now that by (3.87), any condition ensuring (3.75) ensures also sup/(G) <
sup
inf
fix).
(3.89)
Hence, for example, from Corollary 3.4 we have the following result: Corollary 3.7. Let X be a locally convex space, G c. X, and f: X -^ R a function such that for each d < sup / ( G ) the level set Sdif) is evenly convex ie.g., let f be evenly quasi-convex). Then we have (3.89). Theorem 3.8. Inequality (3.89) holds if and only if for each d < sup / ( G ) there exists O = O^ G Z*\{0} such that sup ^iSdif))
< sup0(G).
(3.90)
Proof. This follows from Theorem 3.5, equivalence 4° ^ 1°, applied to A^, since formulas (3.89) and (3.90) mean, respectively, (3.63) for A = A^ and CD € AliSdif)). D Remark 3.5. (a) Condition (3.90) is satisfied, in particular, if SAf)^G
(J < sup/(G)).
(3.91)
(b) If / is (A^)^A^-quasi-convex, then by Lemma 1.11 (b), the above condition involving (3.90) is satisfied, and hence by Theorem 3.8, we have (3.89). However, this follows also from Corollary 3.2 applied to A^, or from Corollary 3.7 above, since by Lemma 1.11 (b) every iA^YA^-quasi-convex function is lower semicontinuous and quasi-convex, and hence evenly quasi-convex. Corollary 3.8. Let X be a locally convex space, G a subset of X satisfying (3.82), and f: X ^ R an upper semicontinuous function that is regular with respect to all 0 G X*\{0}, and such that for each d < sup / ( G ) the level set Sdif) i^ evenly convex ithe latter condition is satisfied, e.g., when f is evenly quasi-convex). Then sup/(G) =
sup OGX*\(0I
inf ^^^
^fcA \l^)<|>(^)>sup(G)
fix).
(3.92)
3.2 Unconstrained surrogate dual problems for quasi-convex supremization
119
Proof. By Corollary 3.5, we have (3.74), whence by Lemma 3.6 (b), we obtain the inequality > in (3.92). On the other hand, by Corollary 3.7 we have the inequality < in (3.92), and hence equality. D Remark 3.6. Corollary 3.8 is a "half-space theorem of surrogate duality," since the surrogate constraint sets ^
^k = ',^p,„. ^^^ ndAinm)) ^
= sup
OeX*\{0}
M
ci>ex*\{0}^. ^^^ l>{x)=sup
/(X).
(3.93)
Note that if there exists OQ G X * \ { 0 } such that sup Oo(G) = +00, then by (1.167) and Oo(jc) e R, we have C(A^)'({Oo}) = [x e X| OO(JC) = sup(Do(G)} = 0. Hence by (3.93), ^'
< +00 => sup4>(G) eRi^e
X*).
(3.94)
Let us also note that by the definitions, we have P'., =
sup
inf
f(x) <
sup
inf
f(x) = p'.,.
(3.95)
We have the following theorem of surrogate duality, which should be compared with Theorem 3.7(a).
Theorem 3.9. IfG^Xandf.X^J inf
fix) < sup
xeX 0(jc)=sup(I)(G)
satisfy inf
f(x)
(O e X*\{0}),
(3.96)
g/xeX ^""^ it>{x)=^(g)
then sup/(G) >
sup
inf
fix).
(3.97)
Proof Formula (3.97) follows from Theorem 3.4 applied to A = A^ of (1.166), using (1.168), (1.167) and C(A^^p\{cD}) = {xeX\
(g G G, CD e Z*\{0}).
(3.98) D
Remark 3.7. A sufficient condition for the inequality (3.96) to hold for a given O G X*\{0} is the existence of an element g e G such that 0(g) = sup 0(G); hence in particular, (3.96) holds if G is weakly compact.
120
3. Duality for Quasi-convex Supremization
(4) By (1.183), for the polarity A = A^: 2^ ^ 2^*^'"! of (1.182) the dual value (3.42) becomes ^^. =
sup inf /(C(A^)'({cD})) =
sup
inf
f{x).
(3.99)
Let us also note that by the definitions, we have PI, =
sup
inf
fix) <
sup
f(x) = ^^2 •
inf
(3.100)
Theorem 3.10. Let X be a locally convex space, G a subset ofX, and f: X -^ R a function. We have sup/(G) <
sup
inf
fix)
(3.101)
if and only if for each d < sup / ( G ) there exists = j e X*\{0} such that <^(Sdif)) C cD(G).
(3.102)
Proof This follows from Theorem 3.5, equivalence 4° ^ 1°, appHed to A^ , since formulas (3.101) and (3.102) mean, respectively, (3.63) for A = A^ and ^ € AS(5^(/)). • Note that we always have inf
/(x)>sup
inf
fix)
(
(3.103)
but in order to obtain conditions for the inequality sup/(G) >
sup
inf
fix).
(3.104)
^^^^^^'UixfAo and the equality sup/(G)=
sup
inf
fix),
(3.105)
we cannot apply Theorem 3.4 to A = A^, because of (1.184). Similarly, we always have inf xeX
/(x)>sup Q^Q
inf
fix)
(O G X*\{0}).
(3.106)
xeX
but we cannot obtain conditions for the opposite inequality and the equality in Theorem 3.10 by applying Theorem 3.4 to A = A^, because of (1.162).
3.3 Constrained surrogate dual problems for quasi-convex supremization
121
3.3 Constrained surrogate dual problems for quasi-convex supremization In this section we shall consider "constrained surrogate dual problems" to problem (P^) of (3.1), defined as supremization problems of the form ^^ = sup X^(Wlj), where the dual constraint set WQ is a proper subset either of an arbitrary set W, or of (X*\{0}) X R, or of X*\{0}, depending on G, and the dual objective function is (3.11). For the families {^G,(
U/e/ A/ —> R, we
inf inf/(Ay) = inf/(U/,/A,),
(3.107)
iel
sup sup/(A/) = sup/(U/e/A/).
(3.108)
iel
Proof The inequality > in (3.107) is obvious. Conversely, for each /x > inf/(U/e/A/) there exists a^ e U/^/A/, whence a^ e A/^ for some /^ ^ L such that M > /(«/x) > inf/(A/^) > infinf/(A/), whence, since /x > inf/(U/^/A/) was arbitrary, we obtain (3.107). This formula implies (3.108), since -sup/(U,e/A,) = i n f ( - / ) ( U , e / A , ) = infinf(-/)(A,) iel
= inf ( - s u p / ( A / ) ) = - s u p sup/(A/). i^^
D
iel
The following general duality theorem will be applied to various special polarities A: 2^ ^ 2^x*\m^R and A: 2^ ^ 2^*\{0}^ Theorem 3.11. Let X be a set, W ^ ^^, A: 2^ -> 2^ a polarity, f \ X -^ ~R a A' A-quasi-convex function, and G C. X. Then sup / ( G ) = - i n f f^^^\CA(G)).
(3.109)
Proof Let us first observe that by (1.139) we have U,,C;(CA({^}))
= C(n,,aA({g})) =
CA(G).
(3.110)
Hence, since f: X ^^ Ris A^ A-quasi-convex, by (1.153), (1.144), Lemma 3.7, (3.110), and (1.223), we obtain
122
3. Duality for Quasi-convex Supremization sup / ( G ) = sup /q(A'A)(g) = sup g^G
geG
sup
sup
iuf
f(CA\{w}))
we{lA({g})
inf f(CA'{{w}))
u;eU,,G(CA({g}))
sup
i-f^^^\w))
= - i n f f^^^\CA(G)).
n
u;eCA(G)
Remark 3.9. (a) By (1.223) and (1.144), one can also write (3.109) in the form sup / ( G ) =
sup
inf/(CA'({W;})) =
.eCA(G)
sup -'^^^^^^
inf
/(JC),
(3.111)
u^etlix})
which expresses sup / ( G ) as a *'sup inf," similarly to the preceding duality formulas. (b) Theorem 3.11 gives explicidy the reladon between the constraint sets, and the reladon between the objective funcdons, of the primal problem (P^) and the dual problem. Indeed, by Theorem 3.11, if X is a set, V^ c ;^^, A: 2^ ^ 2 ^ is a polarity, f e R , and G c X, then the supremizadon problem (Z)A)
y^A = sup AA(CA(G)),
(3.112)
where AA(M;)
= -f^^^\w)
= inf f(CA\{w}))
(w e C A ( G ) ) ,
(3.113)
might be called the "(A-)dual problem" to (P') (of (3.1)), while the set C A ( G ) and the function X^ of (3.113) might be called the "(A-)dual constraint set" and the "(A-)dual objective function," respectively. However, it will be more convenient to consider, instead of (DA), the infimizadon problem (DA)
h
= inf ( - A A ( C A ( G ) ) ) = inf / ^ ^ ^ ^ ( C A ( G ) ) = -y^A,
(3.114)
with AA of (3.113), as the (A-)dual problem to (P^) (of (3.1)), since then we will obtain a symmetric duality between abstract quasi-convex supremization problems and infimization problems with an abstract reverse convex constraint set (see Chapter 6, Remark 6.15 (b)). (c) Formulas (3.112)-(3.114) are surrogate dual problems, with "surrogate constraint sets" CA'({U;}) (W G C A ( G ) ) , instead of the inidal constraint set G of (3.1). Note that each A\{w}) (w e W) is A'A-convex (since A'AA'({M;}) = A\{w})), so each CA^({W;}) in (3.113) is a reverse A'A-convex constraint set. Let us first apply Theorem 3.11 to the special polarities A^^A^^,A^^:2^-> 2(x*\{0})xR ^^^ ^01^ ^02. 2X ^ 2^*\{0} of Section 1.2. (1) For the polarity A = A^^ of (1.189), we obtain the following corollary of Theorem 3.11: Corollary 3.9. Let Xbea locally convex space, / : X ^^ R a lower semicontinuous quasi-convex function, and G C X. Then
3.3 Constrained surrogate dual problems for quasi-convex supremization sup / ( G ) -
sup
inf
f(y).
123 (3.115)
(j)e(x*\mxR y^^^^ sup ^{G)>d
^(v)>^
Proof. For the polarity A = A^^ of (1.189) we have (1.190), so / is (A^^YA^^quasi-convex if and only if it is lower semicontinuous and quasi-convex. Hence, applyingfomiula(3.111) to A — A'^ we obtain (3.115). D Corollary 3.10. Let X be a locally convex space, f: X ^^ R a lower semicontinuous quasi-convex function, and G c. X. Then sup / ( G ) =
sup inf f(U),
(3.116)
where U denotes the family of all open half-spaces in X. Proof The open half-spaces in X are the sets of the form U^^d = [y^X\^{y)>d]^
(3.117)
where (cD, d) e (X*\{0}) x R, and sup <^(G) > d if and only if G n ^o,^ / 0Hence, (3.115) is equivalent to (3.116). D Remark 3.10. Formula (3.116) is another instance of the reduction principle: it reduces the computation of sup / ( G ) to the computation of inf f(U), for all U e U
with una
^0,
(2) For the polarity A = A^^ of (1.191), we obtain the following corollary of Theorem 3.11, which should be compared with Corollary 3.6: Corollary 3.11. Let X be a locally convex space, f: X -> R an evenly quasiconvex function, and G ^ X. Then sup / ( G ) =
sup
inf
fiy)=
(O,J)6(X*\{0})x/? y\^ 3geG,
sup
inf
(ct),g)eX*xG
-v^^ ^(>')>^(g)
f{y),
(3.118)
and if G is weakly compact, then sup / ( G ) =
sup
inf
f{y)=
sup cD(G)>^
^(>)>^
sup
inf
f(y).
(3.119)
cD(>')>supcD(G)
Proof For the polarity A = A^^ ^f (1.191) we have (1.192), so / is (A^2y^i2_ quasi-convex if and only if it is evenly quasi-convex. Hence, applying formula (3.111) to A = A^^, we obtain the first equality of (3.118). The second equality of (3.118) always holds, since sup
inf f(y) — sup
(cI>,J)e(X*\{0})x/? y^^ 3geG,cI>(g)>^ ^(>')>^
sup
OeX"^ {g4)^GxR ^{g)>d
= sup sup sup
inf
f{y)
J^^ ^ '^iy)>d
inf f{y) =
eX* geG deR >'^^
sup
inf
f(y).
(^>,?)eX*xG ^^ >;^^^ ^ ^ ^(y)>^(8)
124
3. Duality for Quasi-convex Supremization
When G is weakly compact, the first equality of (3.119) follows from the first equality of (3.118), since sup (G) is attained for each O G X* (see Lemma 1.3). The second equality of (3.119) always holds, since sup
inf f(y) = sup
sup
sup
inf f(y) = sup
supO(G)>JO(j)>J
Corollary 3.12. Let X be a locally convex space, f:X convex function, and G c. X. Then sup / ( G ) =
sup
inf
fiy)-
•
0(>')>sup
-> R an evenly quasi-
inf / ( V ) ,
(3.120)
VeV
where V denotes the family of all closed half-spaces in X. Proof The proof is similar to that of Corollary 3.10, using the fact that the closed half-spaces in X are the sets of the form V^^d = [x
> J},
GX|0(JC)
(3.121)
where (
D
Remark 3.11. (a) In the particular case that / is a continuous quasi-convex function. Corollary 3.^2 follows from Corollary 3.10. Indeed, if ^ G ZY and ^ n G # 0, then U eV and L^ H G 7^ 0 (where U denotes the closure of U). Also, since / is upper semicontinuous, by Lemma 1.1 we have inf / ( ^ ) = inf fijj).
(3.122)
Hence, since / is also lower semicontinuous, from Corollary 3.10 we obtain sup / ( G ) =
sup
inf f{U) < sup
UeU
VeV
inf f{V) < sup / ( G ) ,
which yields (3.120) (indeed, for the last inequality observe that if g G V HG, then inf/(V)
^0.
(c) As an application to approximation, let us note that if Z is a normed Unear space, G c X, and XQ G Z , then, from Corollaries 3.10 and 3.12 applied to the finite continuous convex function f(y) = \\xo-y\\
(yeX)^
(3.123)
we obtain again formula (2.21) on the deviation of G from XQ. (3) For the polarity A = A^^ of (1.193), we obtain the following corollary of Theorem 3.11:
3.3 Constrained surrogate dual problems for quasi-convex supremization Corollary 3.13. Let X be a locally convex space, f:X coajfine function, and G c. X. Then sup / ( G ) = sup deR
sup
inf
J^^ ^ ^(y)=d
sup
125
-> R an evenly quasif{y)
inf
f(y).
(3.124)
Proof. For the polarity A = A^^ of (1.193) we have (1.194), so / is (A^^YA^^quasi-convex if and only if it is evenly quasi-coaffine. Hence, applying formula (3.111) to A = A^^, we obtain the first equality of (3.124). The proof of the second equality of (3.124) is similar to that of the second equality of (3.118), replacing everywhere X* by X*\{0} and the inequality sign > by equality. D Corollary 3.14. Let X be a locally convex space, f: X -^ R an evenly quasicoaffine function, and G C X. Then sup / ( G ) =
sup
inf/(//),
(3.125)
Hen
where H denotes the family of all (closed) hyperplanes in X. Proof The proof is similar to that of Corollary 3.10, using the fact that the hyperplanes in X are the sets of the form (1.29), where <J> G X*\{0} and deR. D Remark 3.12. Formula (3.125) is another version of the "reduction principle" (it reduces the computation of sup f(G) to the computation of'mi f (H), for all H eH with H f) G ^ &), which should be compared with the reduction principle (3.29) of Remark 3.1 (a). Note that in contrast to (3.29) (see Remarks 3.1 (c) and 2.1 (c)), formula (3.125) holds also for unbounded sets G. In the above duality results there are two parameters, O e Z* \ {0} and deR. For functions f: X ^^ R satisfying condition (1.195) of Section 1.2 we obtain duality results involving only one parameter, O e X* \ {0}. Theorem 3.12. Let X be a locally convex space, f: X -> R a lower semicontinuous quasi-convex function satisfying (1.195), and G ^ X, G ^ {0}. Then sup / ( G ) =
sup
inf
f(x).
(3.126)
sup (G)>1 ^ ^ ^ ^ > '
Proof By G 7^ {0} and (1.195) we have G\{0} / 0 and sup f(G\m
> inf /(X\{0}) = /(O),
whence, using that / is a lower semicontinuous quasi-convex function and (1.198), sup / ( G ) = sup / ( G \ { 0 } ) = sup / q ( G \ { 0 } ) = sup /q((A0iyA0i)(G\{0}).
126
3. Duality for Quasi-convex Supremization
Hence, by Theorem 3.11 applied to /q((Aoi)'AO') and G\{0}, and by (1.227) and ^L(A)L(AyL(A)^y.L(A)^^g obtain
sup / ( G ) = -inf/^^^"^(CAO^(G\{0})),
(3.127)
and thus by 0(0) = 0, (1.223) and (1.196), it follows that sup / ( G ) = -
inf
(-
OGX*\{0}\ supO(G)>l
inf
fix))=
.veC(AOi )'({(!>})
/
sup
inf
OGX*\{0} supO(G)>l
"i^,^ , '^^•'^^^
f(x).
D
Remark 3.13. The assumption G 7^ {0} cannot be omitted in Theorem 3.12. Indeed, for G = {0} we have sup / ( G ) = /(O), but {O e X*\{0}| supO(G) > 1} = 0, so the right-hand side of (3.126) is —00. Theorem 3.13. Let X be a locally convex space, f an evenly quasi-convex function satisfying (1.195), andG £X, G 7^ {0}. Then sup / ( G ) =
sup
inf
<1>€X*\{0}
^ ^ l
fix).
(3.128)
Proof The proof is similar to that of Theorem 3.12, using now (1.201) and (1.199). D Remark 3.14. (a) Similarly to Remark 3.13, the assumption G 7^ {0} cannot be omitted in Theorem 3.13. (b) As an application to approximation, let us note that Theorem 3.13 yields again Theorem 2.3. Indeed, we may assume that JCQ = 0 and G 7^ {0}. Then, by Theorem 3.13 appHed to the function f(y) = \\y\\
(yeX),
(3.129)
which satisfies (1.195), we have sup llgll = geG
sup
dist(0, {yeX\
0 ( j ) > 1}),
(3.130)
cDeX*\{0} 3geG,
whence by Corollary 1.4, we obtain (2.33) for XQ = 0. One can obtain duality theorems for sup / ( G ) for many other classes of functions f: X ^^ Rby choosing suitable polarities A such that / is A^ A-quasi-convex and applying Theorem 3.11. Indeed, let us give here an example of such a result. We recall that a set G c X is called R-evenly convex if it is the intersection of a family of open half-spaces whose closures do not contain 0, and a function / : X -> /? is called R-evenly quasi-convex if all Sdif) (d e R) are /^-evenly convex. Corollary 3.15. Let X be a locally convex space, f: X ^^ R an R-evenly quasiconvex function, and G C. X. Then sup / ( G ) =
sup
inf xeX ^(-^)>-l
fix).
(3.131)
3.4 Lagrangian duality for convex supremization
127
Proof. From the general form of open half-spaces, it follows that a set G c X is /^-evenly convex if and only if it is the intersection of a family of sets of the form [x e X\ 0(jc) < - 1 } , where O e X*. Hence, if we define a polarity A^"^: 2^ -^ 2^*\{0} b y
A^\C)
= {CD G X*\{0}| 0(c) < - 1 (c G C)}
(C c X),
(3.132)
then G is (A'^)'A^'^-convex if and only if it is /?-evenly convex, so / : X -> /? is (A^^)^A^'*-quasi-convex if and only if it is /?-evenly quasi-convex. Hence, formula (3.111) yields the result. D
3.4 Lagrangian duality for convex supremization 3.4,1 Unperturbational theory Theorem 3.14. Let X be a locally convex space, / : X ^- R a function, and G a subset of X. Then s u p / ( G ) > sup inf{/(j)-cD(y)-hsupO(G)}.
(3.133)
Moreover, if f is a proper lower semicontinuous convex function, then sup / ( G ) = sup inf [f{y) - (D(y) -h sup cD(G)}.
(3.134)
Proof Since G is nonempty, we have sup 0(G) > —oo (O e X*). Let O € X* and J G /?, J < sup 0(G). Then there exists ^' = g'^j e G such that O(g0 > ^. Consequently, sup / ( G ) > /(g^) > fig') - cD(g^) + J > inf {/(j) - 0(};) + J}, whence, since O e X* and J < sup 0(G) were arbitrary, we obtain (3.133). On the other hand, if / is a proper lower semicontinuous convex function, then by (1.99), we have fig) = sup {inf[/(j) - 0 ( j ) ] +
ig e G).
Hence by (3.133) and (3.135), we obtain (3.134).
(3.135) D
Remark 3.15. (a) If (3.16) holds, then for any O e X*\{0} we have sup/(G) >
inf
fiy)>
yeX a)(>0=sup(G)
>
inf
inf
fiy)
yeX 4>(v)>sup(G)
{/(^)-(D(^)} + supa)(G)
yeX <^(y)>sup
> inf [f(y)-^(y)} yeX
+sup ^{G),
(3.136)
128
3. Duality for Quasi-convex Supremization
whence, by Remark 3.1(b), sup/(G) > sup
inf
f(y)>
sup
(t>{y)=sup(G)
inf
f(y)
(p(y)>sup
> s u p [ i n f [ / ( ^ ) - 0 ( > ; ) ] + supO(G)j,
(3.137)
which implies some relations between Lagrangian duality (3.134) and hyperplane and half-space theorems of surrogate duality (3.6) and (3.77) (for example, in this case the Lagrangian duality equality (3.134) implies the surrogate duality equalities (3.6) and 0.11)). (b) If G is bounded, then by the "substitution method" described in Remark 1.26 (a), combining Theorem 3.1 (c) and formula (1.283), with 4>o, d^ replaced by O and sup 0(G) respectively (which is a Lagrangian duality formula for the infimum of / on a hyperplane), we obtain sup/(G) =
sup
maxinf {/(>;)-6>
(3.138)
from which one can deduce again the equality (3.134); however, the above direct method of proof is simpler. (c) In the particular case that X is a normed linear space and / is the finite continuous convex function (3.123), from (3.134) we obtain the following formula of Lagrangian duality for the deviation of a set G from JCQI sup \\g - xoW = g^G
sup
eX*\{0}
inf {||xo - y\\ - <^{y) + sup 0(G)}.
(3.139)
y^^
Let us show that (3.139) impUes Corollary 2.1, whence also Theorem 2.1 (by Remark 2.2). Indeed, by (3.139), we have sup ll^-xoll > i n f { | | x o - j | | + ^ ( x o - y ) - c D ( x o ) + s u p 0 ( G ) } geG
y^^
> -0(jco) + supO(G)
(O eZMlcDII = 1),
whence s u p | | g - x o | | > sup {-cD(xo) + supcD(G)}. geG
(3.140)
\m=\ In order to prove the opposite inequality, let g e G and e > 0. Choose O^ e X* with II ^ i = 1 such that
>
\\g-xo\\-£,
whence, since geG
and s > 0 were arbitrary, we obtain sup {-O(xo) + sup 0(G)} > sup \\g - xoll ^eX*
geG
\\n=\ which, together with (3.140), yields the equality (2.14).
3.4 Lagrangian duality for convex supremization
129
3.4.2 Perturbational theory

In this section we shall develop a perturbational theory of Lagrangian duality for convex supremization, by suitably modifying the one for quasi-convex infimization (see Chapter 1, Section 1.4.2). Assume that we are given a constrained primal supremization problem

α = sup f(G),    (3.141)

where G is a subset of a locally convex space X and f: X → R̄ is a function. Clearly,

sup f(G) = sup f̄(X),    (3.142)

where f̄: X → R̄ is the function defined by

f̄(x) := f(x) if x ∈ G,  −∞ if x ∈ ∁G.

Thus, problem (3.141) and the primal problem

(P̄)    α = sup f̄(X)    (3.143)

have the same value. Moreover, if f|_G ≢ −∞, which we shall assume in the sequel without any special mention, then problems (3.141) and (P̄) have the same optimal solutions; indeed, if g₀ ∈ G and f(g₀) = sup f(G), then f̄(g₀) = f(g₀) − χ_G(g₀) = sup f(G) = sup f̄(X) (where χ_G denotes the indicator function of G, equal to 0 on G and +∞ on ∁G), and conversely, if x₀ ∈ X and f̄(x₀) = sup f̄(X), then f̄(x₀) = sup f(G) > −∞, whence x₀ ∈ G and f(x₀) − χ_G(x₀) = f̄(x₀) = sup f̄(X), so f(x₀) = sup f(G). Therefore, we shall assume from the beginning that we are given an unconstrained primal supremization problem

(P)    α = sup φ(X),    (3.144)

and then, taking in particular φ = f − χ_G and a suitable perturbation function p, the duality theory for (P) of (3.144) will yield a duality theory for (3.141). We shall define a dual problem to the primal supremization problem (P) of (3.144) by embedding it into a family of "perturbed" supremization problems, as follows. Let Z be a locally convex space (called a space of "perturbations" or of "parameters"), and p: X × Z → R̄ a function (called a "perturbation function") such that

p(x, 0) = φ(x)    (x ∈ X),    (3.145)

so (P) of (3.144) is nothing other than

α = sup_{x∈X} p(x, 0);    (3.146)

thus, (P) is embedded into the family of supremization problems
(P_z)    v(z) := sup_{x∈X} p(x, z)    (z ∈ Z).    (3.147)

Let us define the Lagrangian dual problem associated with the perturbation function p as the unconstrained supremization problem

(D)    β := sup λ(Z*),    (3.148)

where λ: Z* → R̄ is the dual objective function defined by

λ(Ψ) := sup_{x∈X} {inf_{z∈Z} {p(x, z) − Ψ(z)}}    (Ψ ∈ Z*).    (3.149)

The function L: X × Z* → R̄ defined by

L(x, Ψ) := inf_{z∈Z} {p(x, z) − Ψ(z)}    (x ∈ X, Ψ ∈ Z*)    (3.150)

is called the Lagrangian function, or simply the Lagrangian, associated with p; note that this is the same as (1.396). Thus, considering the partial functions

p_x(z) := p(x, z)    (x ∈ X, z ∈ Z),    (3.151)

we have

L(x, Ψ) = inf_{z∈Z} {p_x(z) − Ψ(z)} = −p_x*(Ψ)    (x ∈ X, Ψ ∈ Z*).    (3.152)

By (3.145), (3.151), and (3.152),

φ(x) = p_x(0) ≥ p_x**(0) = sup_{Ψ∈Z*} {Ψ(0) + L(x, Ψ)} = sup_{Ψ∈Z*} L(x, Ψ)    (x ∈ X).    (3.153)

Furthermore, by (3.149) and (3.150),

λ(Ψ) = sup_{x∈X} L(x, Ψ)    (Ψ ∈ Z*),    (3.154)

and hence by (3.148),

β = sup_{Ψ∈Z*} sup_{x∈X} L(x, Ψ).    (3.155)

Thus, by (3.153) and (3.155),

α = sup φ(X) ≥ sup_{x∈X} sup_{Ψ∈Z*} L(x, Ψ) = β.    (3.156)

If, in addition, p_x(0) = p_x**(0) for all x ∈ X, then

φ(x) = p_x(0) = p_x**(0) = sup_{Ψ∈Z*} {Ψ(0) + L(x, Ψ)} = sup_{Ψ∈Z*} L(x, Ψ)    (x ∈ X),    (3.157)

and thus in this case

α = sup φ(X) = sup_{x∈X} sup_{Ψ∈Z*} L(x, Ψ) = β.    (3.158)
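To make the chain (3.153)–(3.158) concrete, here is a minimal 1-D numeric sketch of the perturbational scheme. Everything in it is an assumption chosen for illustration: X = Z = R, Ψ(z) = ψz, and the perturbation p(x, z) = φ(x) + z², for which each partial function p_x is convex and finite, so p_x(0) = p_x**(0) and the equality case (3.158) applies.

```python
# 1-D sketch of (3.145)-(3.158); grids stand in for X, Z, and Z*.
phi = lambda x: -(x - 1.0) ** 2                  # alpha = sup phi(X) = 0, at x = 1

xs = [k / 10 - 5 for k in range(101)]            # grid standing in for X
psis = [k / 10 - 5 for k in range(101)]          # grid standing in for Z*
zs = [k / 20 - 5 for k in range(201)]            # grid standing in for Z

def L(x, psi):
    # Lagrangian (3.150): L(x, psi) = inf_z {p(x, z) - psi*z} = phi(x) - psi^2/4.
    return min(phi(x) + z * z - psi * z for z in zs)

alpha = max(phi(x) for x in xs)                    # primal value (3.144)
beta = max(L(x, psi) for x in xs for psi in psis)  # dual value (3.155)
print(alpha, beta)   # weak duality (3.156): beta <= alpha; here both equal 0
```

Since each p_x here is convex and continuous, the biconjugacy hypothesis of (3.157) holds and α = β, which the brute-force computation reproduces.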
Remark 3.16. For the constrained primal supremization problem (3.141), let Z = X and φ = f − χ_G. Then the perturbation function p: X × X → R̄ defined by

p(x, z) := f(x + z) − χ_G(x) = { f(x + z) if x ∈ G,  −∞ if x ∉ G }    (3.159)

satisfies (3.145), and the perturbational dual (3.148) yields the unperturbational dual of Section 3.4.1. Indeed, by (3.152) and (3.159), for any x ∈ X and Ψ ∈ X* we have

L(x, Ψ) = inf_{z∈X} {p(x, z) − Ψ(z)}    (3.160)
 = inf_{z∈X} {f(x + z) − Ψ(x + z) + Ψ(x)} − χ_G(x)
 = inf_{x'∈X} {f(x') − Ψ(x')} + Ψ(x) − χ_G(x),    (3.161)

whence, by (3.155), we obtain

β = sup_{Ψ∈X*} sup_{x∈X} L(x, Ψ) = sup_{Ψ∈X*} inf_{x'∈X} {f(x') − Ψ(x') + sup Ψ(G)},

which is nothing other than the right-hand side of (3.134).
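Formula (3.134) can be sanity-checked numerically in one dimension. The sketch below uses the hypothetical choices X = R (so Ψ(x) = ψx), f(x) = x², and G = [−1, 2], discretized on grids; it is only an illustrative finite approximation of the two sides of (3.134).

```python
# Check sup f(G) = max_{Psi} inf_x { f(x) - Psi(x) + sup Psi(G) } for a 1-D example.
f = lambda x: x * x
G = [-1 + 3 * k / 400 for k in range(401)]        # grid on G = [-1, 2]
X = [-10 + 20 * k / 4000 for k in range(4001)]    # grid on a box standing in for X

lhs = max(f(g) for g in G)                        # sup f(G) = 4

def dual_value(psi):
    sup_psi_G = max(psi * g for g in G)           # sup Psi(G)
    return min(f(x) - psi * x for x in X) + sup_psi_G

rhs = max(dual_value(-6 + 12 * k / 1200) for k in range(1201))
print(lhs, rhs)   # both close to 4; the dual max is attained near psi = 4
```

The dual supremum is attained (here near ψ = 4), consistent with the "max" in (3.134).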
3.5 Duality for quasi-convex supremization over structured primal constraint sets

The primal constraint set G considered in the preceding sections of this chapter has been an arbitrary subset of a locally convex space X. Now we shall study some more structured ways of expressing the primal constraint sets G ⊂ X. In the present section we shall consider one of these ways, namely that of systems, and (surrogate and Lagrangian) duality for supremization in systems. We recall (see Chapter 1) that a system is a triple (X, Z, u), consisting of two sets X, Z and a mapping u: X → Z. Given a system (X, Z, u), a subset T of Z (called the "target set"), and a function f: X → R̄, we shall consider the primal supremization problem

α = α_{u⁻¹(T),f} = sup_{x∈X, u(x)∈T} f(x).    (3.162)

Remark 3.17. (a) If u(X) ∩ T = ∅, then u⁻¹(T) = {x ∈ X | u(x) ∈ T} = ∅, whence α = sup ∅ = −∞. Therefore, in the sequel we shall assume, without any special mention, that

u(X) ∩ T ≠ ∅.    (3.163)
3. Duality for Quasi-convex Supremization
(b) Problem (3.162) is equivalent to problem (P¹) of (3.1). Indeed, given a system (X, Z, u) and T, f as above, problem (3.162) is nothing other than (3.1) with

G = {x ∈ X | u(x) ∈ T} = u⁻¹(T) (≠ ∅).    (3.164)

Conversely, every problem (3.1) can be written in the form (3.162), by taking Z = X, u = I_X, the identity operator on X (i.e., u(x) = x for all x ∈ X), and T = G. However, in the study of the "mathematical programming problem" (3.162) one can also use the properties of T and u.

Now we shall assume that (X, Z, u) is a system in which X and Z are locally convex spaces, with conjugate spaces X* and Z*, T is a subset of Z, and f: X → R̄ is a function. There are several natural ways to introduce unconstrained dual problems to (3.162), which generalize the dual problems of the preceding sections.

(1) Let

W := u*(Z*)\{0} = {Ψu | Ψ ∈ Z*}\{0} (⊂ X*\{0}),    (3.165)

where u* is the adjoint operator of u (that is, u*(Ψ)(x) = Ψu(x) for all x ∈ X, Ψ ∈ Z*), and let Δ¹_{u,T}: 2^X → 2^{u*(Z*)\{0}} be the polarity defined by

Δ¹_{u,T}(C) := {u*(Ψ) ∈ u*(Z*) | u*(Ψ)(c) < sup u*(Ψ)(u⁻¹(T)) (c ∈ C)} ∩ (u*(Z*)\{0}) = Δ¹_{u⁻¹(T)}(C) ∩ (u*(Z*)\{0})    (C ⊂ X),    (3.166)

where Δ¹_{u⁻¹(T)}: 2^X → 2^{X*\{0}} is the polarity (1.154) (with G = u⁻¹(T)). Note that since

Ψu(u⁻¹(T)) = {Ψu(x) | u(x) ∈ T} = Ψ(u(X) ∩ T)    (Ψ ∈ Z*),    (3.167)

we have, for any set C ⊂ X,

Δ¹_{u,T}(C) = {Ψu | Ψ ∈ Z*, Ψ(u(c)) < sup Ψ(u(X) ∩ T) (c ∈ C)}.    (3.168)

Clearly, for the particular case X = Z, u = I_X (the identity operator on X), W = X*\{0}, and T = G, the polarity Δ¹_{u,T} of (3.168) reduces to Δ¹_G of (1.154). In the converse direction, given any (X, Z, u) and T as above, by (3.166) we have Δ¹_{u,T}(C) ⊂ Δ¹_{u⁻¹(T)}(C) (C ⊂ X). For the polarity Δ = Δ¹_{u,T} of (3.168), the dual objective function (3.41) and the dual value (3.42), respectively, become

λ¹_{u,T}(Ψu) = inf_{x∈X, Ψu(x)≥sup Ψ(u(X)∩T)} f(x)    (Ψ ∈ Z*, Ψu ≠ 0),    (3.169)

β¹_{u,T} = sup_{Ψ∈Z*, Ψu≠0} inf_{x∈X, Ψu(x)≥sup Ψ(u(X)∩T)} f(x).    (3.170)

Hence, by Remark 3.4 (b) (with W of (3.165)), we obtain the following generalization of Theorem 3.7 (c):
Theorem 3.15. Let (X, Z, u) be a system in which X and Z are locally convex spaces, let T be a subset of Z, and let f: X → R̄ be a function. If we have

inf_{x∈X, u(x)∈T} f(x) ≤ sup_{Ψ∈Z*, Ψu≠0} inf_{x∈X, Ψu(x)≥sup Ψ(u(X)∩T)} f(x),    (3.171)

then for each d < sup_{x∈X, u(x)∈T} f(x) there exists Ψ = Ψ_d ∈ Z* with Ψ_d u ≠ 0 such that

Ψu(y) < sup Ψ(u(X) ∩ T)    (y ∈ S_d(f))    (3.172)

if and only if

sup_{x∈X, u(x)∈T} f(x) = sup_{Ψ∈Z*, Ψu≠0} inf_{x∈X, Ψu(x)≥sup Ψ(u(X)∩T)} f(x).    (3.173)

One can define, similarly, polarities Δⁱ_{u,T}: 2^X → 2^{u*(Z*)\{0}} (i = 2, 3, 4) by

Δ²_{u,T}(C) := {Ψu ≠ 0 | Ψ ∈ Z*, sup Ψu(C) ≤ sup Ψ(u(X) ∩ T)}    (C ⊂ X),    (3.174)

Δ³_{u,T}(C) := {Ψu ≠ 0 | Ψ ∈ Z*, sup Ψ(u(X) ∩ T) ∉ Ψu(C)}    (C ⊂ X),    (3.175)

Δ⁴_{u,T}(C) := {Ψu ≠ 0 | Ψ ∈ Z*, Ψu(C) ⊂ Ψ(u(X) ∩ T)}    (C ⊂ X),    (3.176)
and one can obtain for them results corresponding to those of Section 3.2.

(2) Instead of Δ¹_{u,T}: 2^X → 2^{u*(Z*)\{0}} of (3.168), let us consider the polarity Δ'¹_{u,T}: 2^X → 2^{Z*\{0}} defined by

Δ'¹_{u,T}(C) := {Ψ ∈ Z*\{0} | Ψu(c) < sup Ψ(T) (c ∈ C)}    (C ⊂ X);    (3.177)

thus, the only difference between (3.168) and (3.177) is that sup Ψ(u(X) ∩ T) is replaced by sup Ψ(T). For Δ = Δ'¹_{u,T}, the dual objective function (3.41) and the dual value (3.42) become

λ'¹_{u,T}(Ψ) = inf_{x∈X, Ψu(x)≥sup Ψ(T)} f(x)    (Ψ ∈ Z*\{0}),    (3.178)

β'¹_{u,T} = sup_{Ψ∈Z*\{0}} inf_{x∈X, Ψu(x)≥sup Ψ(T)} f(x).    (3.179)

Again, Δ¹_G of (1.154) is the particular case X = Z, u = I_X (the identity operator on X), and T = G of the polarity Δ'¹_{u,T} of (3.177), but the converse direction no longer works, since we have only sup Ψ(u(X) ∩ T) ≤ sup Ψ(T), whence

β¹_{u,T} ≤ β'¹_{u,T},    (3.180)
with possibly strict inequality. Note that given a mapping u: X → Z, the family of polarities Δ¹_{u,T} depends on the subsets u⁻¹(T) of X, while the family of polarities Δ'¹_{u,T} depends on the subsets T of Z, so one needs some care when generalizing the expression Δ_{{g}}({g}) of (3.59) to the family Δ'¹_{u,T}. In order to obtain duality theorems using the polarities Δ'¹_{u,T} of (3.177), let us first give the following generalization of Theorem 3.4:

Theorem 3.16. Let (X, Z, u) be a system (so X and Z are two sets and u: X → Z is a mapping), T a subset of Z (satisfying (3.163)), W a set, f: X → R̄ a function, and Δ_{u,T}: 2^X → 2^W (T ⊂ Z) a family of polarities such that

Δ_{u,{u(x)}}({x}) = ∅    (x ∈ u⁻¹(T)),    (3.181)

inf f(∁Δ'_{u,T}({w})) ≤ sup_{x∈X, u(x)∈T} inf f(∁Δ'_{u,{u(x)}}({w}))    (w ∈ W).    (3.182)

Then, given T ⊂ Z, we have

sup_{x∈X, u(x)∈T} f(x) ≥ β'_{u,T} (= sup_{w∈W} inf f(∁Δ'_{u,T}({w}))).    (3.183)

Moreover, if we have (3.181), (3.182), and f is Δ'_{u,T}Δ_{u,T}-quasi-convex, then

sup_{x∈X, u(x)∈T} f(x) = β'_{u,T}.    (3.184)
Proof. By (3.181) and Lemma 3.5 applied to Δ = Δ_{u,{u(x)}}, we have

x ∈ ∁Δ'_{u,{u(x)}}({w})    (x ∈ u⁻¹(T), w ∈ W),

whence

inf f(∁Δ'_{u,{u(x)}}({w})) ≤ f(x)    (x ∈ u⁻¹(T), w ∈ W).

Therefore, by (3.182),

inf f(∁Δ'_{u,T}({w})) ≤ sup_{x∈X, u(x)∈T} inf f(∁Δ'_{u,{u(x)}}({w})) ≤ sup_{x∈X, u(x)∈T} f(x)    (w ∈ W),

and hence by (3.42) (with Δ = Δ_{u,T}), we obtain (3.183). Furthermore, if also f is Δ'_{u,T}Δ_{u,T}-quasi-convex, then by (3.111) (applied to Δ = Δ_{u,T}), we have

sup_{x∈X, u(x)∈T} f(x) = sup_{w∈∁Δ_{u,T}(u⁻¹(T))} inf f(∁Δ'_{u,T}({w})) ≤ sup_{w∈W} inf f(∁Δ'_{u,T}({w})) = β'_{u,T},

whence by (3.183), we obtain (3.184). ∎

Note that the family of polarities Δ'¹_{u,T} of (3.177) obviously satisfies (3.181). Hence, applying Theorems 3.16 (with W = Z*\{0}) and 3.5 to this family, we obtain the following generalization of Theorem 3.7:
Theorem 3.17. Let (X, Z, u) be a system in which X and Z are locally convex spaces, let T be a subset of Z, and let f: X → R̄ be a function.

(a) If

inf_{y∈X, Ψu(y)≥sup Ψ(T)} f(y) ≤ sup_{x∈X, u(x)∈T} inf_{y∈X, Ψu(y)≥Ψu(x)} f(y)    (Ψ ∈ Z*\{0}),    (3.185)

then

sup_{x∈X, u(x)∈T} f(x) ≥ sup_{Ψ∈Z*\{0}} inf_{x∈X, Ψu(x)≥sup Ψ(T)} f(x).    (3.186)

(b) The inequality

sup_{x∈X, u(x)∈T} f(x) ≤ sup_{Ψ∈Z*\{0}} inf_{x∈X, Ψu(x)≥sup Ψ(T)} f(x)    (3.187)

holds if and only if for each d < sup_{x∈X, u(x)∈T} f(x) there exists Ψ = Ψ_d ∈ Z*\{0} such that

Ψ(u(y)) < sup Ψ(T)    (y ∈ S_d(f)).    (3.188)

(c) If we have (3.185), then for each d < sup_{x∈X, u(x)∈T} f(x) there exists Ψ = Ψ_d ∈ Z*\{0} satisfying (3.188) if and only if

sup_{x∈X, u(x)∈T} f(x) = sup_{Ψ∈Z*\{0}} inf_{x∈X, Ψu(x)≥sup Ψ(T)} f(x).    (3.189)
Similarly, one can consider the polarities Δ'ⁱ_{u,T}: 2^X → 2^{Z*\{0}} (i = 2, 3, 4) defined by

Δ'²_{u,T}(C) := {Ψ ∈ Z*\{0} | sup Ψu(C) ≤ sup Ψ(T)}    (C ⊂ X),    (3.190)

Δ'³_{u,T}(C) := {Ψ ∈ Z*\{0} | sup Ψ(T) ∉ Ψu(C)}    (C ⊂ X),    (3.191)

Δ'⁴_{u,T}(C) := {Ψ ∈ Z*\{0} | Ψu(C) ⊂ Ψ(T)}    (C ⊂ X),    (3.192)

which are generalizations of the polarities (1.160), (1.166), and (1.182), respectively, and one can prove for them corresponding duality results.

Remark 3.18. Concerning Lagrangian duality for the primal supremization problem (3.162), where (X, Z, u) is a system, T is a subset of Z, and f: X → R̄ is a function, we make here only the following observation, without entering into details: similarly to the way formula (1.268) for infimization is extended to the Lagrangian duality formula (1.433), the natural extension to systems of formula (3.134) for supremization should be

sup_{x∈X, u(x)∈T} f(x) = max_{Ψ∈Z*} inf_{x∈X} {f(x) − Ψ(u(x)) + sup Ψ(T)}.    (3.193)
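The conjectured extension (3.193) can be probed numerically in one dimension. The sketch below uses hypothetical choices — X = Z = R, the linear mapping u(x) = 2x, the target set T = [0, 1], f(x) = x², and Ψ(t) = ψt — discretized on grids; it only illustrates that the two sides of (3.193) agree for this example.

```python
# 1-D check of sup_{u(x) in T} f(x) = max_{Psi} inf_x { f(x) - Psi(u(x)) + sup Psi(T) }.
f = lambda x: x * x
u = lambda x: 2.0 * x
T = [k / 200 for k in range(201)]                 # grid on T = [0, 1]
X = [k / 100 - 5 for k in range(1001)]            # grid standing in for X = R

lhs = max(f(x) for x in X if 0.0 <= u(x) <= 1.0)  # sup of f over u^{-1}(T) = [0, 1/2]

def dual(psi):
    return min(f(x) - psi * u(x) for x in X) + max(psi * t for t in T)

rhs = max(dual(k / 100 - 3) for k in range(601))
print(lhs, rhs)   # both close to 1/4; the dual max is attained near psi = 1/2
```

In this example the dual maximum is attained (near ψ = 1/2), consistent with the "max" in (3.193).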
Optimal Solutions for Quasi-convex Maximization

4.1 Maximum points of quasi-convex functions

Let X be a locally convex space, f: X → R̄ a function, G ⊂ X, and g₀ ∈ G. Clearly, if f(g₀) = +∞, then g₀ is an optimal solution of the primal supremization problem (P¹) (of (3.1)), i.e., f(g₀) = max f(G), and if f(g₀) = −∞ and f|_G ≢ −∞, then g₀ is not a maximum point of f on G. Therefore, the cases of interest are those in which

f(g₀) ∈ R.    (4.1)

Remark 4.1. From (1.22) it is obvious that g₀ ∈ G is an optimal solution of (P¹) if and only if

G ⊂ S_{f(g₀)}(f).    (4.2)
Theorem 4.1. Let X be a locally convex space, W a set, Δ: 2^X → 2^W a polarity, f a Δ'Δ-quasi-convex function, and G ⊂ X. For an element g₀ ∈ G the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have

Δ(S_{f(g₀)}(f)) ⊂ Δ(G).    (4.3)

Proof. 1° ⟹ 2°. By Remark 4.1, if f(g₀) = max f(G), then for any set W and any polarity Δ: 2^X → 2^W we have (4.3) (since Δ is antitone).
2° ⟹ 1°. Since f is Δ'Δ-quasi-convex, we have Δ'Δ(S_{f(g₀)}(f)) = S_{f(g₀)}(f). Hence if 2° holds, then by (4.3) and since Δ' is antitone, we obtain G ⊂ Δ'Δ(G) ⊂ Δ'Δ(S_{f(g₀)}(f)) = S_{f(g₀)}(f), and thus by Remark 4.1, f(g₀) = max f(G). ∎

Corollary 4.1. Let X be a locally convex space, f: X → R̄ a lower semicontinuous quasi-convex function, and G ⊂ X. For an element g₀ ∈ G, the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have (where Δ¹¹: 2^X → 2^{(X*\{0})×R} is the polarity (1.189))

Δ¹¹(S_{f(g₀)}(f)) ⊂ Δ¹¹(G).    (4.4)

Proof. By (1.190), f is lower semicontinuous quasi-convex if and only if it is (Δ¹¹)'Δ¹¹-quasi-convex, so the result follows from Theorem 4.1 applied to W = (X*\{0}) × R and Δ = Δ¹¹. ∎

Corollary 4.2. Let X be a locally convex space, f: X → R̄ an evenly quasi-convex function, and G ⊂ X. For an element g₀ ∈ G, the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have (where Δ¹²: 2^X → 2^{(X*\{0})×R} is the polarity (1.191))

Δ¹²(S_{f(g₀)}(f)) ⊂ Δ¹²(G).    (4.5)

Proof. By (1.192), f is evenly quasi-convex if and only if it is (Δ¹²)'Δ¹²-quasi-convex, so the result follows from Theorem 4.1 applied to W = (X*\{0}) × R and Δ = Δ¹². ∎

Corollary 4.3. Let X be a locally convex space, f: X → R̄ a lower semicontinuous quasi-convex function, and G ⊂ X. For an element g₀ ∈ G with f(0) < f(g₀), the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have (where Δ⁰¹: 2^X → 2^{X*\{0}} is the polarity (1.196))

Δ⁰¹(S_{f(g₀)}(f)) ⊂ Δ⁰¹(G).    (4.6)

Proof. 1° ⟹ 2°, by Remark 4.1 and since Δ⁰¹ is antitone.
2° ⟹ 1°. By f(0) < f(g₀), we have 0 ∈ S_{f(g₀)}(f). Hence if (4.6) holds, then since (Δ⁰¹)' is antitone, we obtain, by (1.197) and since f is a lower semicontinuous quasi-convex function,

G ⊂ (Δ⁰¹)'Δ⁰¹(G) ⊂ (Δ⁰¹)'Δ⁰¹(S_{f(g₀)}(f)) = c̄o S_{f(g₀)}(f) = S_{f(g₀)}(f). ∎

Remark 4.2. Condition (4.6) can also be written as S_{f(g₀)}(f)° ⊂ G°, an inclusion between the usual polar sets (1.82).
Corollary 4.4. Let X be a locally convex space, f: X → R̄ an evenly quasi-convex function, and G ⊂ X. For an element g₀ ∈ G with f(0) < f(g₀), the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have (where Δ⁰²: 2^X → 2^{X*\{0}} is the polarity (1.199))

Δ⁰²(S_{f(g₀)}(f)) ⊂ Δ⁰²(G).    (4.7)

Proof. The proof is similar to the above proof of Corollary 4.3, using now (1.200). ∎

Now we shall give some subdifferential characterizations of maximum points. To this end, let us first introduce the following class of abstract quasi-convex functions:

Definition 4.1. Let X be a locally convex space. We shall say that a function f: X → R̄ is strongly evenly quasi-convex if all the strict level sets A_d(f) (d ∈ R) of (1.23) are evenly convex.

Remark 4.3. (a) Every strongly evenly quasi-convex function f: X → R̄ is evenly quasi-convex, since S_d(f) = ∩_{d'>d} A_{d'}(f) (d ∈ R) and since the family of all evenly convex sets is closed under intersections.
(b) Every upper semicontinuous quasi-convex function f: X → R̄ is strongly evenly quasi-convex (since each A_d(f) is open and convex, and hence evenly convex).

Proposition 4.1. Let X be a locally convex space, f: X → R̄ a strongly evenly quasi-convex function, and x₀ ∈ X such that f(x₀) ∈ R. Then

∂^{L(Δ¹²)}f(x₀) ≠ ∅,    (4.8)

where Δ¹²: 2^X → 2^{(X*\{0})×R} is the polarity (1.191).

Proof. Since f is strongly evenly quasi-convex, the set A_{f(x₀)}(f) is evenly convex. Hence, since x₀ ∉ A_{f(x₀)}(f), there exists Φ₀ ∈ X*\{0} such that

Φ₀(x) < Φ₀(x₀)    (x ∈ A_{f(x₀)}(f)).    (4.9)

Therefore, we have f(x) ≥ f(x₀) for all x ∈ X with Φ₀(x) ≥ Φ₀(x₀), so

f(x₀) = min_{x∈X, Φ₀(x₀)≤Φ₀(x)} f(x) = −f^{L(Δ¹²)}(Φ₀, Φ₀(x₀)),

and thus, by (1.232), (Φ₀, Φ₀(x₀)) ∈ ∂^{L(Δ¹²)}f(x₀). ∎
Theorem 4.2. Let X be a locally convex space, f: X → R̄ an upper semicontinuous quasi-convex function, and G ⊂ X. For an element g₀ ∈ G with f(g₀) ∈ R, the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have ∅ ≠ ∂^{L(Δ¹²)}f(g₀) ⊂ [(X*\{0}) × R]\Δ¹²(G), and each (Φ, d) ∈ ∂^{L(Δ¹²)}f(g₀) is an optimal solution of the dual problem (D_{Δ¹²}) (of (3.114) for Δ = Δ¹²), i.e.,

f^{L(Δ¹²)}(Φ, d) = min f^{L(Δ¹²)}([(X*\{0}) × R]\Δ¹²(G))    ((Φ, d) ∈ ∂^{L(Δ¹²)}f(g₀)).    (4.10)

3°. There exists (Φ₀, d₀) ∈ ∂^{L(Δ¹²)}f(g₀) that is an optimal solution of the dual problem (D_{Δ¹²}), i.e., such that

f^{L(Δ¹²)}(Φ₀, d₀) = min f^{L(Δ¹²)}([(X*\{0}) × R]\Δ¹²(G)).    (4.11)

Proof. Observe that since f is strongly evenly quasi-convex (by Remark 4.3 (b)) and f(g₀) ∈ R, we have ∂^{L(Δ¹²)}f(g₀) ≠ ∅ (by Proposition 4.1).

1° ⟹ 2°. If 1° holds, then for any polarity Δ: 2^X → 2^{(X*\{0})×R} such that every upper semicontinuous quasi-convex function is Δ'Δ-quasi-convex and ∂^{L(Δ)}f(g₀) ≠ ∅ (hence, in particular, for Δ = Δ¹²) and any (Φ, d) ∈ ∂^{L(Δ)}f(g₀), we have

∂^{L(Δ)}f(g₀) ⊂ [(X*\{0}) × R]\Δ(G);    (4.12)

also, by (Φ, d) ∈ ∂^{L(Δ)}f(g₀), 1°, Theorem 3.11, and (4.12),

f^{L(Δ)}(Φ, d) = −f(g₀) = −max f(G) = min f^{L(Δ)}([(X*\{0}) × R]\Δ(G)).

The implication 2° ⟹ 3° is obvious.

3° ⟹ 1°. If 3° holds, even with Δ¹² replaced by any polarity Δ: 2^X → 2^{(X*\{0})×R} such that every upper semicontinuous quasi-convex function is Δ'Δ-quasi-convex, we have, by (Φ₀, d₀) ∈ ∂^{L(Δ)}f(g₀) ⊂ [(X*\{0}) × R]\Δ(G), (1.232), (4.11) (for Δ), Theorem 3.11, and g₀ ∈ G,

f(g₀) = −f^{L(Δ)}(Φ₀, d₀) = −min f^{L(Δ)}([(X*\{0}) × R]\Δ(G)) = max f(G). ∎
Proposition 4.2. Let X be a locally convex space, f: X → R̄ a strongly evenly quasi-convex function, and x₀ ∈ X such that

f(0) < f(x₀) < +∞.    (4.13)

Then

∂^{L(Δ⁰²)}f(x₀) ≠ ∅,    (4.14)

where Δ⁰² is the polarity (1.199).
Proof. By the above proof of Proposition 4.1, there exists Φ₀ ∈ X*\{0} satisfying (4.9), whence f(x) ≥ f(x₀) for all x ∈ X with Φ₀(x) ≥ Φ₀(x₀). But by (4.13) we have 0 ∈ A_{f(x₀)}(f), whence by (4.9), 0 = Φ₀(0) < Φ₀(x₀). Consequently,

f(x₀) = min_{x∈X, Φ₀(x)≥Φ₀(x₀)} f(x) = min_{x∈X, (1/Φ₀(x₀))Φ₀(x)≥1} f(x) = −f^{L(Δ⁰²)}((1/Φ₀(x₀))Φ₀),

and thus (1/Φ₀(x₀))Φ₀ ∈ ∂^{L(Δ⁰²)}f(x₀). ∎
Theorem 4.3. Let X be a locally convex space, f an upper semicontinuous quasi-convex function satisfying (1.195), and G ⊂ X such that f(0) < sup f(G). For an element g₀ ∈ G with f(g₀) ∈ R, the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have ∅ ≠ ∂^{L(Δ⁰²)}f(g₀) ⊂ (X*\{0})\Δ⁰²(G), and each Φ ∈ ∂^{L(Δ⁰²)}f(g₀) is an optimal solution of the dual problem (D_{Δ⁰²}) (of (3.114) for Δ = Δ⁰²), i.e.,

f^{L(Δ⁰²)}(Φ) = min f^{L(Δ⁰²)}((X*\{0})\Δ⁰²(G))    (Φ ∈ ∂^{L(Δ⁰²)}f(g₀)).    (4.15)

3°. There exists Φ₀ ∈ ∂^{L(Δ⁰²)}f(g₀) that is an optimal solution of the dual problem (D_{Δ⁰²}), i.e., such that

f^{L(Δ⁰²)}(Φ₀) = min f^{L(Δ⁰²)}((X*\{0})\Δ⁰²(G)).    (4.16)
Proof. If 1° holds, then by our assumptions, f(0) < sup f(G) = f(g₀). Hence, since f is strongly evenly quasi-convex (by Remark 4.3 (b)) and f(g₀) ∈ R, we have ∂^{L(Δ⁰²)}f(g₀) ≠ ∅ (by Proposition 4.2). The remainder of the proof is similar to that of the above proof of Theorem 4.2, replacing (Φ, d), (Φ₀, d₀) ∈ (X*\{0}) × R, and Δ¹² by Φ, Φ₀ ∈ X*\{0}, and Δ⁰², respectively, and using Theorem 3.13. ∎

Remark 4.4. (a) If X is a locally convex space, Δ⁰²: 2^X → 2^{X*\{0}} is the polarity (1.199), f ∈ R̄^X, and x₀ ∈ X with f(x₀) ∈ R, then by (1.232) applied to Δ = Δ⁰² we have

∂^{L(Δ⁰²)}f(x₀) = {Φ₀ ∈ X*\{0} | Φ₀(x₀) ≥ 1, f(x₀) = −f^{L(Δ⁰²)}(Φ₀)}.    (4.17)

(b) In the particular case X = Rⁿ, Thach ([272], Definition 2.2 and the remarks made after it) has introduced a similar subdifferential, namely

∂^H f(x₀) = {Φ₀ ∈ X*\{0} | Φ₀(x₀) = 1, f(x₀) = −f^H(Φ₀)},    (4.18)

where f^H is the "quasi-conjugate" of f defined [272] by

f^H(Φ) := −inf_{x∈X, Φ(x)≥1} f(x)  if Φ ∈ X*\{0},   f^H(0) := −sup f(X);    (4.19)

thus, in fact,
∂^H f(x₀) = {Φ₀ ∈ X*\{0} | Φ₀(x₀) = 1, f(x₀) = −f^{L(Δ⁰²)}(Φ₀)}.    (4.20)
For this subdifferential, Thach has proved some results corresponding to Theorems 4.2 and 4.3 above (see [272], Theorems 2.6, 6.1, and Corollary 6.1 (ii)). If X is a locally convex space, then clearly, for any function f: X → R̄ we have ∂^H f(x₀) ⊂ ∂^{L(Δ⁰²)}f(x₀). Let us observe that if f: X → R̄ is "strictly increasing along segments starting from 0" (i.e., for each x ∈ X\{0} and 0 < η < 1 we have f(ηx) < f(x)), then ∂^{L(Δ⁰²)}f(x₀) = ∂^H f(x₀). Indeed, if Φ₀ ∈ ∂^{L(Δ⁰²)}f(x₀) and Φ₀(x₀) > 1, then 0 < 1/Φ₀(x₀) < 1, whence by (1.232) (for Δ = Δ⁰²), we obtain

f((1/Φ₀(x₀))x₀) < f(x₀) = min_{x∈X, Φ₀(x)≥1} f(x) ≤ f((1/Φ₀(x₀))x₀),
which is impossible. Therefore, Φ₀(x₀) = 1, so Φ₀ ∈ ∂^H f(x₀) (since by Φ₀ ≠ 0 we have f^H(Φ₀) = f^{L(Δ⁰²)}(Φ₀)), which proves our assertion. For example, if f: X → R̄ is "strongly quasi-convex" (i.e., for each x, y ∈ X with x ≠ y and each 0 < η < 1 we have f(ηx + (1 − η)y) < max {f(x), f(y)}) and if f(0) = min f(X) (in particular, if f satisfies (1.195)), then f is strictly increasing along segments starting from 0; indeed, for any x ∈ X\{0} and 0 < η < 1 we have f(ηx) = f(ηx + (1 − η)0) < max {f(x), f(0)} = f(x). Also, the function f of (3.129) on a normed linear space X is strictly increasing along segments starting from 0. Hence, in these cases, ∂^{L(Δ⁰²)}f(x₀) = ∂^H f(x₀). Another such case is given in (c) below.

(c) In the particular case that X is a normed linear space, x̄₀ ∈ X, and f is the function

f(y) = ||x̄₀ − y||    (y ∈ X),    (4.21)
for each x₀ ∈ X\{x̄₀} we have

∂^{L(Δ⁰²)}f(x₀) = {Φ₀ ∈ X* | Φ₀(x₀) = 1, (1 − Φ₀(x̄₀))/||Φ₀|| = ||x̄₀ − x₀||}.    (4.22)

Indeed, if x₀ ≠ x̄₀, then by (1.232) (for Δ = Δ⁰²) and Corollary 1.4, we have

∂^{L(Δ⁰²)}f(x₀) = {Φ₀ ∈ X* | Φ₀(x₀) ≥ 1, ||x̄₀ − x₀|| = dist(x̄₀, {x ∈ X | Φ₀(x) ≥ 1})}
 = {Φ₀ ∈ X* | Φ₀(x₀) ≥ 1, ||x̄₀ − x₀|| = (1 − Φ₀(x̄₀))/||Φ₀||}.
Now let Φ₀ ∈ ∂^{L(Δ⁰²)}f(x₀). If we had Φ₀(x₀) > 1, then, since f of (4.21) is strictly increasing along segments starting from 0 (see (b) above), we would obtain, as in (b),

f((1/Φ₀(x₀))x₀) < f(x₀) = min_{x∈X, Φ₀(x)≥1} f(x) ≤ f((1/Φ₀(x₀))x₀),
which is impossible. Thus Φ₀(x₀) = 1, which proves (4.22). Note that in this case Proposition 4.2 asserts that if

||x̄₀|| < ||x̄₀ − x₀||,    (4.23)

then ∂^{L(Δ⁰²)}f(x₀) ≠ ∅. This can also be seen directly, as follows: since ||x̄₀ − x₀|| ≠ 0 (by (4.23)), there exists Φ'₀ ∈ X*\{0} such that

Φ'₀(x₀ − x̄₀) = ||Φ'₀|| ||x₀ − x̄₀||    (4.24)

(by a corollary of the Hahn–Banach theorem). We claim that Φ'₀(x₀) > 0. Indeed, if Φ'₀(x₀) ≤ 0, then by (4.24) and (4.23) we would obtain

||Φ'₀|| ||x̄₀ − x₀|| = Φ'₀(x₀ − x̄₀) ≤ Φ'₀(−x̄₀) ≤ ||Φ'₀|| ||x̄₀|| < ||Φ'₀|| ||x̄₀ − x₀||,

which is impossible. Thus Φ'₀(x₀) > 0. Hence, by (4.24), for Φ₀ := (1/Φ'₀(x₀))Φ'₀ we have Φ₀(x₀) = 1 and

(1 − Φ₀(x̄₀))/||Φ₀|| = (Φ'₀(x₀) − Φ'₀(x̄₀))/||Φ'₀|| = Φ'₀(x₀ − x̄₀)/||Φ'₀|| = ||x̄₀ − x₀||,
that is, Φ₀ ∈ ∂^{L(Δ⁰²)}f(x₀) of (4.22), which proves our assertion.

(d) As an application to approximation, let us now give another proof of Theorem 2.6, using Theorem 4.3. Let us denote x₀ of Theorem 2.6 by x̄, and assume first that x̄ = 0. Note that for x̄₀ = 0 and x₀ = g₀ ≠ 0, (4.22) becomes

∂^{L(Δ⁰²)}f(g₀) = {Φ₀ ∈ X* | Φ₀(g₀) = 1, ||Φ₀|| = 1/||g₀||}.

Then by Theorem 4.3, equivalence 1° ⟺ 3°, applied to f of (3.129) (which satisfies (1.195) and f(0) < sup f(G), since G ≠ {0}), and by (4.22) (for x̄₀ = 0, x₀ = g₀ ≠ 0), Corollary 1.4, and (1.223) (for Δ = Δ⁰²), we obtain that g₀ ∈ G satisfies ||g₀|| = max_{g∈G} ||g|| if and only if there exists Φ'₀ ∈ X*\{0} such that Φ'₀(g₀) = 1, ||g₀|| = 1/||Φ'₀||, and

−1/||Φ'₀|| = f^{L(Δ⁰²)}(Φ'₀) = −dist(0, {y ∈ X | Φ'₀(y) ≥ 1})
 = min_{Φ∈X*\{0}, ∃g∈G: Φ(g)≥1} f^{L(Δ⁰²)}(Φ) = min_{Φ∈X*\{0}, ∃g∈G: Φ(g)≥1} [−dist(0, {y ∈ X | Φ(y) ≥ 1})] = min_{Φ∈X*\{0}, ∃g∈G: Φ(g)≥1} (−1/||Φ||),

i.e., the equivalence 1° ⟺ 2° of Theorem 2.6 for x̄ = 0. If G is weakly compact and Φ' ∈ X*\{0}, then sup Φ'(G) is attained (see Lemma 1.3), so 2° ⟺ 3° of Theorem 2.6 for x̄ = 0. Finally, if x̄ ∈ X is arbitrary, then applying the above to the set G_{x̄} = G − x̄, we obtain the desired conclusion.
4.2 Maximum points of continuous convex functions

In this section we shall assume that X is a locally convex space, f: X → R̄ is a function, and G is a subset of X such that (1.73) holds. The assumption (1.73) is quite natural, since if the first inequality is not satisfied, then sup f(G) = inf f(X) ≤ f(g) for all g ∈ G, whence each g ∈ G is an optimal solution of (P¹) (i.e., a maximum point of f on G), and if the second inequality of (1.73) is not satisfied, then no g₀ ∈ G satisfying (4.1) can be a maximum point of f on G.

Theorem 4.4. Let X be a locally convex space, f: X → R̄ a function, and G a subset of X satisfying (1.73). For an element g₀ ∈ G, consider the following statements:
1°. f(g₀) = max f(G).
2°. There exists Φ₀ ∈ X*\{0} such that

Φ₀(g₀) = max_{y∈X, f(y)≤sup f(G)} Φ₀(y).    (4.25)

(a) If f is upper semicontinuous and convex, then 1° ⟹ 2°.
(b) If f is continuous and convex, then 1° ⟺ 2°.

Proof. Let

A := {y ∈ X | f(y) < sup f(G)},  S := {y ∈ X | f(y) ≤ sup f(G)}.    (4.26)

Then by (1.73), A ≠ ∅.

(a) Assume that f is upper semicontinuous and convex, so A is an open convex set. If 1° holds, then g₀ ∉ A. Hence, by the separation theorem, there exists Φ₀ ∈ X*\{0} such that

sup Φ₀(A) ≤ Φ₀(g₀),    (4.27)

whence by Lemma 1.9,

sup Φ₀(S) ≤ sup Φ₀(Ā) = sup Φ₀(A) ≤ Φ₀(g₀).    (4.28)

On the other hand, by 1° we have g₀ ∈ S, whence Φ₀(g₀) ≤ sup Φ₀(S), which, together with (4.28), yields (4.25).

(b) Let f be continuous and convex, and assume 2°. If we had g₀ ∈ A, then, since A is an open convex set, by Lemma 1.8 and Remark 1.7 (a) we would obtain Φ₀(g₀) < sup Φ₀(A) = sup Φ₀(Ā) ≤ sup Φ₀(S), in contradiction to 2°. Hence g₀ ∉ A. On the other hand, g₀ ∈ G ⊂ S, so g₀ ∈ G ∩ (S\A), that is, f(g₀) = max f(G). ∎
Remark 4.5. (a) Theorem 4.4 (b) admits the following geometric interpretation: when (1.73) holds and f is continuous and convex, an element g₀ ∈ G satisfies f(g₀) = max f(G) if and only if there exists a hyperplane H₀ that supports the set S (defined in (4.26)) at g₀. Indeed, if f(g₀) = max f(G) and if Φ₀ ∈ X*\{0} is as in Theorem 4.4 (b), then the hyperplane

H₀ = {y ∈ X | Φ₀(y) = Φ₀(g₀)}    (4.29)

has the required properties. Conversely, every hyperplane that supports the set S is of the form (4.29), for some Φ₀ ∈ X*\{0} with sup Φ₀(S) ∈ R (by Chapter 1, Corollary 1.1), and hence if g₀ ∈ H₀, then we must have (4.25).

(b) For any g₀ ∈ G satisfying f(g₀) = max f(G) and any hyperplane H₀ as in (a) above, the sets G and S lie in the same half-space

D₀ := {y ∈ X | Φ₀(y) ≤ Φ₀(g₀)},    (4.30)

and H₀ supports the sets G and S at g₀. Indeed, since Φ₀(g) ≤ sup Φ₀(S) (g ∈ G) and Φ₀(y) ≤ sup Φ₀(S) (y ∈ S), we have G ⊂ D₀ and S ⊂ D₀. Furthermore, Φ₀(g₀) ≤ sup Φ₀(G) ≤ sup Φ₀(S), and by 2°, Φ₀(g₀) = sup Φ₀(S), whence Φ₀(g₀) = sup Φ₀(G) = sup Φ₀(S).

(c) For any g₀ ∈ G satisfying f(g₀) = max f(G) and any hyperplane H₀ as in (a) above, we have g₀ ∈ H₀ and

f(g₀) = min f(H₀).    (4.31)

Indeed, by Lemma 1.9, we have

H₀ = {y ∈ X | Φ₀(y) = sup Φ₀(A)},    (4.32)

and hence, since A is open, by Lemma 1.8 it follows that y ∉ A for all y ∈ H₀. Therefore, by f(g₀) = max f(G), we obtain

f(y) ≥ sup f(G) = f(g₀)    (y ∈ H₀).

(d) Using normal cones (see Section 1.3, formula (1.123)), condition 2° of Theorem 4.4 can be written as

N(S; g₀) ≠ {0}.    (4.33)

Note also that for any Φ₀ ∈ X*\{0} satisfying (4.25) we have

Φ₀(g₀) = max Φ₀(G),    (4.34)

that is, using normal cones,

N(S; g₀) ⊂ N(G; g₀).    (4.35)

Indeed, if N(S; g₀) = {0}, this is obvious; on the other hand, if N(S; g₀) ≠ {0}, then from g₀ ∈ G ⊂ S and (4.25) it follows that Φ₀(g₀) ≤ sup Φ₀(G) ≤ sup Φ₀(S) = Φ₀(g₀).
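As a concrete finite-dimensional illustration of Theorem 4.4 and conditions (4.25), (4.34), the sketch below works in R² with f(y) = ||y||² and a small hypothetical set G; for this f the sublevel set S is a disk, so the supporting functional at a maximum point g₀ can be taken to be g₀ itself (acting by the dot product). All of these choices are assumptions of the sketch.

```python
import math

# If g0 maximizes f(y) = ||y||^2 on G, then Phi0 = g0 supports
# S = {y : f(y) <= sup f(G)} (a disk) at g0, giving (4.25) and (4.34).
G = [(1.0, 0.0), (2.0, 1.0), (0.0, -1.5), (-1.0, 2.0)]
f = lambda y: y[0] ** 2 + y[1] ** 2
dot = lambda a, b: a[0] * b[0] + a[1] * b[1]

g0 = max(G, key=f)        # a maximum point of f on G
phi0 = g0                 # supporting functional at g0

max_on_G = max(dot(phi0, g) for g in G)                         # sup Phi0(G)
max_on_S = math.hypot(*phi0) * math.sqrt(max(f(g) for g in G))  # sup Phi0(S)
print(dot(phi0, g0), max_on_G, max_on_S)   # all three coincide
```

The three printed values coincide: Φ₀ attains its supremum over both G and S at g₀, exactly the situation described in (d) above.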
Let us consider now the particular case when G = {x₀}, a singleton. In this case, Theorem 4.4 (a) yields the following corollary:

Corollary 4.5. Let X be a locally convex space and f: X → R̄ an upper semicontinuous convex function. Then for each x₀ ∈ X satisfying

inf f(X) < f(x₀) < +∞    (4.36)

there exists Φ₀ ∈ X*\{0} such that

Φ₀(x₀) = max_{y∈X, f(y)≤f(x₀)} Φ₀(y).    (4.37)

Remark 4.6. (a) Geometrically, Corollary 4.5 means that if f: X → R̄ is an upper semicontinuous convex function, then for each x₀ ∈ X satisfying (4.36) there exists a hyperplane H₀ = {y ∈ X | Φ₀(y) = Φ₀(x₀)} that supports the level set S_{f(x₀)}(f) = {y ∈ X | f(y) ≤ f(x₀)} at x₀. Moreover, from Remark 4.5 (c) above it follows that for any such hyperplane H₀ we have x₀ ∈ H₀ and f(x₀) = min f(H₀).

(b) The assumption of upper semicontinuity is not necessary in Corollary 4.5, as shown by the following example: Let X be a normed linear space endowed with the weak topology σ(X, X*) and let f be the function

f(y) = ||y||    (y ∈ X)    (4.38)

(i.e., (3.123) with x₀ = 0). Then X is a locally convex space and f is a finite lower semicontinuous function on X that is not upper semicontinuous at any y₀ ∈ X, and (4.36) is equivalent to x₀ ≠ 0. Also, by a corollary of the Hahn–Banach theorem, for each x₀ ∈ X\{0} there exists Φ₀ ∈ X*\{0} (X* is the same both for the weak and for the norm topology on X) such that

Φ₀(x₀) = ||Φ₀|| ||x₀|| = max_{y∈X, ||y||≤||x₀||} Φ₀(y) = max_{y∈X, f(y)≤f(x₀)} Φ₀(y).
min
f(y).
(4.39)
yeX
Hence we have (3.31) with the sup being attained. Proof If xo satisfies (4.36), then (4.39) follows from the last part of Remark 4.6(a) above. On the other hand, if /(JCQ) = min/(X), then for each OQ e X*\{0} we have
4.2 Maximum points of continuous convex functions /(xo) >
inf
f{y)
147
> inf/(X) = /(XQ),
yeX
whence (4.39). Hence the last statement follows since the inequahty > in (3.31) always holds. D Remark 4.7. (a) Geometrically, Corollary 4.6 means that if f: X -^ R is an upper semicontinuous convex function, then for each JCQ G X there exists a hyperplane HQ with XQ G HQ such that f(xo)=mmf(Ho).
(4.40)
(b) If X is a normed linear space and / is the function (4.38), then condition (4.39) is equivalent, by Lemma 1.5, to II
II
•
lUoll =
II II
mm
l^o(-^o)l
IIJII =
y^^
,
11^0 II
^o(>')=^oUo)
and it is a well-known corollary of the Hahn-Banach theorem that such a function Oo e X*\{0} exists. In the case that / is also continuous, we have the following theorem: Theorem 4.5. Let X be a locally convex space, f: X ^^ R a continuous convex function, and G a subset ofX satisfying (1.73). For an element go e G the following statements are equivalent: P . / ( g o ) = max/(G). 2°. There exists OQ G X * \ { 0 } satisfying (4.34) and inf
/(>;)=
yeX
max
inf
f(y).
(4.41)
Proof 1° ^ 2 M f 1° holds, then by Theorem 4.4 and Remark 4.5(d), there exists Oo G X*\{0} satisfying supOo(G) = sup 00(5). Furthermore, by Remark 4.5(c), 1°, and Theorem 3.1 we have, for the hyperplane HQ defined by (4.29), inf
/(>;) = inf/(//o) = /(go) = m a x / ( G ) =
sup
inf
f(y).
2° =^ I M f 4)0 is as in 2°, then by (4.34), inf
fiy) < /(go).
yeX Oo(>0=supo(G)
Hence by Theorem 3.1 and (4.41), we obtain sup/(G) =
sup
inf
fiy)=
(v)=supO(G)
which, together with go e G, yields 1°.
inf
/(j)
Oo(v)=supo(G)
D
148
4. Optimal Solutions for Quasi-convex Maximization
Remark 4.8. Theorem 4.5 admits the following geometric interpretation: When / : X ^- R is 3. continuous convex function satisfying (1.73), an element go e G satisfies /(go) = max / ( G ) if and only if there exists a hyperplane HQ that supports G at go and such that mff(Ho)
= max i n f / ( / / ) ,
(4.42)
HeHc
where HG is the family of hyperplanes defined in Remark 3.1(a), that is, the family of all (closed) hyperplanes that quasi-support the set G. Let us consider now the set A^G ( / ) of all maximum points of / on G (see (3.3)) and the set Odf) of all functions
inf
f(y)=
o(v)=supOo(G)
max
inf
f(y)];
(4.43)
0(j)=:supO(G)
we shall call any element of Ocif) an optimal function (with respect to the pair (G, / ) ) . The optimal functions are nothing other than the optimal solutions of the dual problem (3.42), with W = X*\{0} and C A ' ( { 0 } ) of (3.44). Corollary 4.7. Let X be a locally convex space, / : X -> R a continuous convex function, and G a subset ofX satisfying (1.73). We have Mcif) 7^ 0 if and only if there exists an optimal function OQ G Ocif) such that Oo attains its supremum on G.
(4.44)
Proof Indeed, the condition means that there exist a function ^o G X*\{0} and an element go e G satisfying (4.41) and (4.34), so the result follows from Theorem 4.5. D Corollary 4.8. Let X be a locally convex space, / : X —> R a continuous convex function, and G a weakly compact subset ofX satisfying (1.73). We have Mcif) 7^ 0 if and only if there exists an optimal function OQ € Ocif)Proof Since G is weakly compact, every OQ G X* satisfies (4.44), so the result follows from Corollary 4.7. D Remark 4.9. As shown by Example 2.3 and the function (4.38), the sufficiency part of Corollary 4.8 is no longer true without the assumption of weak compactness, even in the particular case that X is a conjugate Banach space, / : X ^- /? is a finite continuous convex function, and G is a weak* compact subset of X. Let us summarize the connections between the existence of maximum points and of optimal functions with respect to the pair (G, / ) . Theorem 4.6. Let X be a locally convex space, f: X -^ R an upper semicontinuous convex function, and G a subset ofX satisfying (1.73).
(a)>fG(/)7^0^OG(/)7^0; (b)OG(/)#0^MG(/)^0; (c) if G is weakly compact, then Mcif)
7^ 0 ^ O G ( / ) 7^ 0.
4.3 Some basic subdifferential characterizations of maximum points Proof. Part (a) follows from the necessity part of Corollary 4.7. Part (b) is shown by Remark 4.9. Part (c) is Corollary 4.8.
149
D
4.3 Some basic subdifferential characterizations of maximum points

In Section 4.1 we have given some characterizations of maximum points of lower semicontinuous convex functions using abstract subdifferentials. In the present section we shall give some basic characterizations of maximum points of continuous, respectively lower semicontinuous, convex functions, with the aid of the usual subdifferentials and ε-subdifferentials.

We recall that a subset G of a normed linear space X is called proximinal if each x₀ ∈ X admits a best approximation in G, i.e., if P_G(x₀) ≠ ∅ for all x₀ ∈ X.

Lemma 4.1. Let X be a normed linear space, C a proximinal convex subset of X, and G a subset of X. We have the inclusion

G ⊆ C  (4.45)

if and only if

Ñ(C; x) ⊆ Ñ(G; x)  (x ∈ bd C),  (4.46)

where Ñ(C; x) denotes the "extended normal cone" of C at x ∈ X (see Chapter 1, formula (1.125)).

Proof. Necessity: Assume (4.45) and let x ∈ bd C, Φ₀ ∈ Ñ(C; x). Then Φ₀(c) ≤ Φ₀(x) (c ∈ C) and G ⊆ C, whence Φ₀(g) ≤ Φ₀(x) (g ∈ G), so Φ₀ ∈ Ñ(G; x) (actually, this part holds for any sets G, C ⊆ X with G ⊆ C and any x ∈ X).

Sufficiency: Assume that (4.45) does not hold, i.e., there exists g₀ ∈ G\C. Then, since C is a proximinal subset of X, there exists an element of best approximation x̄ ∈ C of g₀, i.e., such that ‖g₀ − x̄‖ = min_{c∈C} ‖g₀ − c‖ =: r₀ (> 0, since g₀ ∉ C, x̄ ∈ C); clearly, x̄ ∈ bd C. Let us consider the open ball O(g₀, r₀) = {y ∈ X | ‖g₀ − y‖ < r₀}. Since C is convex and O(g₀, r₀) is open and convex, and since O(g₀, r₀) ∩ C = ∅, by Chapter 1, Theorem 1.1 we can separate C and O(g₀, r₀); i.e., there exists Φ₀ ∈ X*\{0} such that

γ := sup Φ₀(C) ≤ inf Φ₀(O(g₀, r₀)).  (4.47)

Hence by Lemma 1.8,

Φ₀(c) ≤ γ < Φ₀(y)  (c ∈ C, y ∈ O(g₀, r₀)).  (4.48)

Then by x̄ ∈ C we have Φ₀(x̄) ≤ γ. But since x̄ ∈ bd O(g₀, r₀), by (4.48) we also have Φ₀(x̄) ≥ γ, so Φ₀(x̄) = γ. Therefore, Φ₀(c − x̄) ≤ 0 (c ∈ C),
that is, Φ₀ ∈ Ñ(C; x̄). On the other hand, g₀ ∈ G ∩ O(g₀, r₀), so by (4.48) we have Φ₀(g₀) > γ = Φ₀(x̄), whence Φ₀ ∉ Ñ(G; x̄), which contradicts (4.46). ∎

Theorem 4.7. Let X be a normed linear space, f: X → R a continuous convex function, and G a subset of X, and let g₀ ∈ G be such that the level set S_{f(g₀)}(f) = {x ∈ X | f(x) ≤ f(g₀)} is proximinal and

inf f(X) < f(g₀) < +∞.  (4.49)

The following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have

∂f(x) ⊆ Ñ(G; x)  (x ∈ X, f(x) = f(g₀)).  (4.50)

Proof. By the first part of Remark 4.1, condition 1° is equivalent to G ⊆ S_{f(g₀)}(f), which, in turn, by Lemma 4.1 applied to G and the proximinal convex set C = S_{f(g₀)}(f), is equivalent to

Ñ(S_{f(g₀)}(f); x) ⊆ Ñ(G; x)  (x ∈ bd S_{f(g₀)}(f)).  (4.51)

But by (4.49) and since f is a continuous convex function,

bd S_{f(g₀)}(f) = S_{f(g₀)}(f)\A_{f(g₀)}(f) = {x ∈ X | f(x) = f(g₀)}  (4.52)

(see Remark 1.7). Hence by Theorem 1.6, we have

Ñ(S_{f(g₀)}(f); x) = ∪_{η>0} η ∂f(x)  (x ∈ bd S_{f(g₀)}(f)).

Consequently, (4.51) is equivalent to (4.50). ∎
Replacing in (4.49) inf f(X) by inf f(G), one can replace in (4.50) the extended normal cones Ñ, considered for all x ∈ X with f(x) = f(g₀), by usual normal cones N, considered only for elements g ∈ G with f(g) = f(g₀). Namely, we have the following theorem:

Theorem 4.8. Let X be a locally convex space, f: X → R a lower semicontinuous convex function, and G a convex subset of X such that G ⊆ int dom f. For an element g₀ ∈ G satisfying

inf f(G) < f(g₀),  (4.53)

the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have

∂f(g) ⊆ N(G; g)  (g ∈ G, f(g) = f(g₀)).  (4.54)
Proof. 1° ⇒ 2°. Assume 1° and let g ∈ G, f(g) = f(g₀), and Φ₀ ∈ ∂f(g). Then

Φ₀(y) − Φ₀(g) ≤ f(y) − f(g) = f(y) − f(g₀) ≤ 0  (y ∈ S_{f(g₀)}(f)).  (4.55)

But by 1°, G ⊆ S_{f(g₀)}(f), and hence by (4.55), Φ₀(g′) ≤ Φ₀(g) (g′ ∈ G), so Φ₀ ∈ N(G; g).

2° ⇒ 1°. Assume that 1° does not hold, so there exists g′ ∈ G such that f(g₀) < f(g′). Note that by (4.53), there exists g″ ∈ G such that f(g″) < f(g₀) < f(g′). Let

g_η := ηg′ + (1 − η)g″  (0 ≤ η ≤ 1).  (4.56)

Then, since f restricted to the segment [g″, g′] = {g_η | 0 ≤ η ≤ 1} is convex and continuous, there exists ḡ = η₀g′ + (1 − η₀)g″ ∈ G, 0 < η₀ < 1, such that f(ḡ) = f(g₀), and the directional derivative f′(ḡ; g′ − ḡ) must be > 0. Indeed, since ḡ − g′ = t(g″ − ḡ), where t = (1 − η₀)/η₀ > 0, by [184], Theorem 23.1 we have

f′(ḡ; ḡ − g′) = f′(ḡ; t(g″ − ḡ)) = t f′(ḡ; g″ − ḡ) ≤ t(f(g″) − f(ḡ)) < 0,

whence 0 ≤ f′(ḡ; g′ − ḡ) + f′(ḡ; ḡ − g′), and therefore f′(ḡ; g′ − ḡ) ≥ −f′(ḡ; ḡ − g′) > 0. Hence, since f′(ḡ; g′ − ḡ) = max_{Φ∈∂f(ḡ)} Φ(g′ − ḡ) (by (1.121)), it follows that there exists Φ₀ ∈ ∂f(ḡ) such that Φ₀(g′ − ḡ) > 0, i.e., such that Φ₀ ∉ N(G; ḡ). Thus, 2° does not hold. ∎

Remark 4.10. (a) The assumptions (4.49) and (4.53) in Theorems 4.7 and 4.8 cannot be removed. Indeed, for example, if f: X → R is differentiable and has a unique minimum on X at some g₀ ∈ int G, then conditions (4.50) and (4.54) are satisfied, since {x ∈ X | f(x) = f(g₀)} = {g₀} and ∂f(g₀) = {0} ⊆ N(G; g₀), but f(g₀) = min f(X) ≠ max f(G).

(b) In Theorem 4.8 one cannot replace (4.53) by the weaker assumption (4.49), as shown by the following example: Let X = R² with the Euclidean norm, f(x₁, x₂) = (1 − x₁) + x₂² (so f is convex and differentiable), and G = {0} × [−1, +1]. Then for g₀ = (0, 0) ∈ G we have (4.49) and (4.54), but not f(g₀) = max f(G). Indeed, if g = (0, g₂) ∈ G and f(g) = f(g₀) = 1, then 1 + g₂² = 1, whence g = 0, and ∂f(0) = {∇f(0)} = {(−1, 0)} ⊆ N(G; 0). On the other hand, f(g₀) ≠ max f(G) (since f(g₀) = f(0, 0) = 1 < g₂² + 1 = f(0, g₂) for all (0, g₂) ∈ G, g₂ ≠ 0). One can also see directly that (4.50) does not hold either: for x = (1, 1) (∉ G) we have f(x) = 1 = f(g₀) and ∂f(1, 1) = {∇f(1, 1)} = {(−1, 2)} (the gradient of f at (1, 1)), but (−1, 2) ∉ Ñ(G; (1, 1)), since for (0, 1) ∈ G we have (−1, 2)(0, 1) = 2 > (−1, 2)(1, 1) = 1.

By introducing a parameter ε, namely, by considering the ε-subdifferentials ∂_ε f(g₀) (ε ≥ 0) instead of the subdifferentials ∂f(g) (g ∈ G, f(g) = f(g₀)), and the ε-normal sets N_ε(G; g₀) instead of the normal cones N(G; g) (g ∈ G, f(g) = f(g₀)), one can transform the purely local conditions of (4.54) into global conditions. Indeed, we have the following:

Theorem 4.9. Let X be a locally convex space, f: X → R a proper lower semicontinuous convex function, and G a subset of X. For an element g₀ ∈ G, the following statements are equivalent:
1°. f(g₀) = max f(G).
2°. We have

∂_ε f(g₀) ⊆ N_ε(G; g₀)  (ε > 0).  (4.57)
Proof. 1° ⇒ 2°. Assume 1° and let ε > 0 and Φ₀ ∈ ∂_ε f(g₀). Then

0 ≥ f(g) − f(g₀) ≥ Φ₀(g − g₀) − ε  (g ∈ G),

whence Φ₀ ∈ N_ε(G; g₀).

2° ⇒ 1°. Let us first observe that if we have 2°, then by (1.131) and (1.128),

∂f(g₀) = ∩_{ε>0} ∂_ε f(g₀) ⊆ ∩_{ε>0} N_ε(G; g₀) = N(G; g₀),  (4.58)

so (4.57) holds also for ε = 0. Assume now that 1° does not hold, that is, using (3.134),

f(g₀) < sup f(G) = sup_{Φ∈X*} inf_{x∈X} {f(x) − Φ(x) + sup Φ(G)}.  (4.59)

Then there exists Φ₀ ∈ X* such that

f(g₀) < inf_{x∈X} {f(x) − Φ₀(x) + sup Φ₀(G)},  (4.60)

whence

sup_{x∈X} {Φ₀(x) − f(x)} < sup Φ₀(G) − f(g₀).

By (4.59) and (4.60), we have f(g₀) ∈ R. Let

ε := sup_{x∈X} {Φ₀(x) − f(x)} − Φ₀(g₀) + f(g₀).  (4.61)

Then by (4.61), we have ε ≥ 0 and

sup_{x∈X} {Φ₀(x) − f(x)} = Φ₀(g₀) − f(g₀) + ε,

so Φ₀ ∈ ∂_ε f(g₀). On the other hand, by the first inequality in (4.60),

sup Φ₀(G) > f(g₀) − inf_{x∈X} {f(x) − Φ₀(x)} = Φ₀(g₀) + ε,

so Φ₀ ∉ N_ε(G; g₀), which, since Φ₀ ∈ ∂_ε f(g₀) (and using (4.58) when ε = 0), shows that 2° does not hold. ∎
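The counterexample in Remark 4.10(b) above is easy to verify numerically. The following sketch is my own illustration, not part of the book; it checks that the gradient of f(x₁, x₂) = (1 − x₁) + x₂² at g₀ = (0, 0) lies in the normal cone of G = {0} × [−1, 1] at g₀, while f(g₀) is nevertheless not the maximum of f on G:

```python
# Hypothetical numeric check of the counterexample in Remark 4.10(b):
# f(x1, x2) = (1 - x1) + x2**2 on R^2, G = {0} x [-1, 1], g0 = (0, 0).
def f(x1, x2):
    return (1 - x1) + x2 ** 2

grad_f0 = (-1.0, 0.0)   # gradient of f at g0 = (0, 0): (-1, 2*x2) at x2 = 0

# (-1, 0) lies in the normal cone N(G; g0): <grad_f0, g - g0> <= 0 for g in G.
G_samples = [(0.0, t / 10) for t in range(-10, 11)]
normal_cone_ok = all(grad_f0[0] * g1 + grad_f0[1] * g2 <= 0 for g1, g2 in G_samples)

# ...yet f(g0) = 1 is not the maximum of f on G: f(0, 1) = 2.
print(normal_cone_ok, f(0, 0), max(f(g1, g2) for g1, g2 in G_samples))  # True 1 2.0
```

This confirms that without assumption (4.53), the subdifferential inclusion (4.54) can hold at g₀ even though g₀ is not a maximum point.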
5 Reverse Convex Best Approximation
The study of reverse convex best approximation, that is, of best approximation by complements of convex sets, is motivated, among others, by its connections with the famous unsolved problem whether in a Hilbert space every Chebyshev set (i.e., a set in which each x ∈ X has a unique element of best approximation) is necessarily convex. Namely, it is known (see the Notes and Remarks to Section 5.2) that if a Hilbert space X contains a Chebyshev set that is not convex, then X also contains a Chebyshev set that is the complement CG of an open bounded convex subset G (≠ ∅) of X. Geometrically, if G is a convex set with int G ≠ ∅ and x₀ ∈ int G, the problem of finding dist(x₀, CG) amounts to finding the greatest radius of an open ball with center x₀ contained in G (see Figure 5.1); clearly, when x₀ ∈ bd G, no such open ball exists, and we have dist(x₀, CG) = 0.

Figure 5.1.

We shall be concerned with the following two main problems:
(1) Find convenient formulas for dist(x₀, CG).
(2) Give characterizations of elements of (reverse convex) best approximation, i.e., necessary and sufficient conditions in order that an element z₀ ∈ X satisfy z₀ ∈ P_CG(x₀), that is, z₀ ∈ CG and ‖x₀ − z₀‖ = dist(x₀, CG).

We shall obtain duality results, using the elements Φ of the conjugate space X*.

Remark 5.1. If x₀ ∈ bd G, and hence in particular, if G is any subset of X with int G = ∅ and x₀ ∈ G, then

dist(x₀, CG) = 0.  (5.1)

Indeed, if x₀ ∈ bd G, then every ball with center x₀ intersects CG, whence dist(x₀, CG) = 0. This applies, in particular, if int G = ∅ (hence G ⊆ bd G) and x₀ ∈ G. Therefore, it is natural that in most of the subsequent results we shall assume that int G ≠ ∅.
5.1 The distance to the complement of a convex set

The following theorem gives an explicit formula for the distance to the complement of a convex set.

Theorem 5.1. Let X be a normed linear space, G a convex subset of X with int G ≠ ∅, and x₀ ∈ G. Then

dist(x₀, CG) = inf_{Φ∈X*, ‖Φ‖=1} {sup Φ(G) − Φ(x₀)}.  (5.2)

Proof. Let us first assume that x₀ = 0, so formula (5.2) becomes

dist(0, CG) = inf_{‖Φ‖=1} sup Φ(G).  (5.3)

Since int G ≠ ∅, for each z ∈ CG there exists Φ_z ∈ X* with ‖Φ_z‖ = 1 such that sup Φ_z(G) ≤ Φ_z(z) (by the separation theorem). Hence

‖z‖ ≥ Φ_z(z) ≥ sup Φ_z(G) ≥ inf_{‖Φ‖=1} sup Φ(G)  (z ∈ CG),

which yields

dist(0, CG) = inf_{z∈CG} ‖z‖ ≥ inf_{‖Φ‖=1} sup Φ(G).  (5.4)

On the other hand, since 0 ∈ G, for each Φ ∈ X* with ‖Φ‖ = 1 we have sup Φ(G) ≥ Φ(0) = 0 and

CG ⊇ {x ∈ X | Φ(x) > sup Φ(G)},  (5.5)

whence by Corollary 1.4,

dist(0, CG) ≤ dist(0, {x ∈ X | Φ(x) ≥ sup Φ(G)}) = sup Φ(G)  (‖Φ‖ = 1),

which, together with (5.4), yields (5.3).

Assume now that x₀ ∈ G is arbitrary. Then

dist(x₀, CG) = inf_{z∈CG} ‖x₀ − z‖ = dist(0, C(G − x₀)),  (5.6)

where G − x₀ is a convex set containing 0, with int(G − x₀) ≠ ∅. Hence by (5.3),

dist(0, C(G − x₀)) = inf_{‖Φ‖=1} sup Φ(G − x₀) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)},  (5.7)

which, together with (5.6), yields (5.2). ∎
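Formula (5.2) can be checked numerically in a simple case. The following sketch is a hypothetical illustration, not part of the book: take X = R² and G the open Euclidean unit ball, so that dist(x₀, CG) = 1 − ‖x₀‖ for x₀ ∈ G, while sup Φ(G) = ‖Φ‖ for every functional Φ; sampling unit functionals Φ = ⟨u, ·⟩ approximates the dual infimum:

```python
import math

# Hypothetical illustration of Theorem 5.1: X = R^2, G the open unit ball.
# Primal value: dist(x0, CG) = 1 - ||x0||.  Dual value (right-hand side of
# (5.2)): inf over unit functionals of sup Phi(G) - Phi(x0) = 1 - <u, x0>.
def dual_distance(x0, samples=100_000):
    best = float("inf")
    for k in range(samples):
        t = 2 * math.pi * k / samples
        u = (math.cos(t), math.sin(t))                 # unit-norm functional
        best = min(best, 1.0 - (u[0] * x0[0] + u[1] * x0[1]))
    return best

x0 = (0.3, 0.4)                                        # ||x0|| = 0.5 < 1
primal = 1.0 - math.hypot(*x0)                         # dist(x0, CG)
dual = dual_distance(x0)                               # sampled dual infimum
print(primal, dual)                                    # both approximately 0.5
```

The sampled infimum is attained (up to discretization) at u = x₀/‖x₀‖, the direction of the nearest boundary point, matching the geometric picture behind the theorem.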
Remark 5.2. (a) If int G = ∅, the expression inf_{‖Φ‖=1}{sup Φ(G) − Φ(x₀)} may have any value d ≥ 0. Indeed, for example, if X = C([0, 1]) and G = the (convex) set of all algebraic polynomials of norm ≤ d, then int G = ∅, so dist(0, CG) = 0, but Ḡ = {x ∈ X | ‖x‖ ≤ d} (by the classical theorem of Weierstrass on the uniform approximation of continuous functions by polynomials), and hence sup Φ(G) = sup Φ(Ḡ) = d‖Φ‖ for all Φ ∈ X*, so inf_{‖Φ‖=1}{sup Φ(G) − Φ(0)} = d.

(b) If G is open, then by Lemma 1.8,

CG ⊇ {x ∈ X | Φ(x) ≥ sup Φ(G)}  (Φ ∈ X*\{0}),  (5.8)

and hence in particular, CG ⊇ {x ∈ X | Φ(x) = sup Φ(G)}.

(c) By Lemma 1.5, Theorem 5.1 admits the following geometric interpretation: if G is a convex subset of X such that int G ≠ ∅, and if x₀ ∈ G, then

dist(x₀, CG) = inf_{‖Φ‖=1} dist(x₀, H_{Φ, sup Φ(G)}) = inf_{‖Φ‖=1} inf_{y∈X, Φ(y)=sup Φ(G)} ‖x₀ − y‖,  (5.9)

where

H_{Φ, sup Φ(G)} = {y ∈ X | Φ(y) = sup Φ(G)}.  (5.10)

Equivalently, using also Corollary 1.1, this means that

dist(x₀, CG) = inf_{H∈H_G} dist(x₀, H),  (5.11)

where H_G denotes the collection of all hyperplanes that quasi-support G (see Figure 5.2); thus, this is another instance of the reduction principle (it reduces the computation of dist(x₀, CG) to the computation of dist(x₀, H) for H ∈ H_G).

(d) Formula (5.2) remains valid for any closed convex set G (possibly with int G = ∅) and any x₀ ∈ G, in a normed linear space X. Indeed, this follows by replacing in the above proof of Theorem 5.1 the separation theorem by the strict separation theorem.
Corollary 5.1. If G is a convex subset of X such that int G ≠ ∅ and if x₀ ∈ G, then

dist(x₀, CG) = inf_{(Φ,d)∈(X*\{0})×R, sup Φ(G)≤d} (d − Φ(x₀))/‖Φ‖ = inf_{(Φ,d)∈(X*\{0})×R, Φ(g)≤d (g∈G)} (d − Φ(x₀))/‖Φ‖.  (5.12)

Proof. Clearly,

sup Φ(G) = inf{d ∈ R | sup Φ(G) ≤ d}  (Φ ∈ X*).  (5.13)

Hence by Theorem 5.1 and (5.13), we obtain

dist(x₀, CG) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} = inf_{Φ∈X*\{0}} (sup Φ(G) − Φ(x₀))/‖Φ‖
= inf_{Φ∈X*\{0}} inf_{d∈R, sup Φ(G)≤d} (d − Φ(x₀))/‖Φ‖ = inf_{(Φ,d)∈(X*\{0})×R, sup Φ(G)≤d} (d − Φ(x₀))/‖Φ‖,  (5.14)

which proves the first equality in (5.12). Finally, the equality of dist(x₀, CG) with the third term of (5.12) follows similarly to (5.14), using that sup Φ(G) = inf{d ∈ R | Φ(g) ≤ d (g ∈ G)}. ∎
Remark 5.3. (a) Conversely, Corollary 5.1 implies Theorem 5.1. Indeed, this follows by starting with the first equality of (5.12) and writing formula (5.14) in the reverse order.

(b) Corollary 5.1 admits the following geometric interpretation: if G is a convex subset of X such that int G ≠ ∅ and if x₀ ∈ G, then

dist(x₀, CG) = inf_{(Φ,d), U_{Φ,d}∩G=∅} dist(x₀, U_{Φ,d}) = inf_{(Φ,d), V_{Φ,d}∩G=∅} dist(x₀, V_{Φ,d}),  (5.15)

where U_{Φ,d} and V_{Φ,d} are the half-spaces (1.67). Equivalently, this means that

dist(x₀, CG) = inf_{U∈U, U∩G=∅} dist(x₀, U) = inf_{V∈V, V∩G=∅} dist(x₀, V),  (5.16)

where U and V denote, respectively, the collection of all open half-spaces in X and the collection of all closed half-spaces in X (see Figures 5.3(a) and (b)).

Figure 5.3.

Indeed, for the open half-space U_{Φ,d} and the closed half-space V_{Φ,d} we have

U_{Φ,d} ∩ G = ∅ ⇔ Φ(g) ≤ d  (g ∈ G),  (5.17)

and, respectively,

V_{Φ,d} ∩ G = ∅ ⇔ Φ(g) < d  (g ∈ G).  (5.18)

Hence by (5.12) and Corollary 1.4, we obtain (5.16). Note that in (5.13), and hence in (5.12) too, one can replace sup Φ(G) ≤ d by sup Φ(G) < d.

(c) The half-spaces U and V in (5.16) can be replaced by hyperplanes; that is, we have

dist(x₀, CG) = inf_{H∈H, H∩G=∅} dist(x₀, H),  (5.19)
where H denotes the collection of all hyperplanes in X.

One can also express the right-hand side of (5.2) as follows.

Proposition 5.1. If G is a convex subset of X and if x₀ ∈ G, then

inf_{Φ∈X*, ‖Φ‖=1} {sup Φ(G) − Φ(x₀)} = inf_{Φ∈X*, ‖Φ‖≥1} {sup Φ(G) − Φ(x₀)} = inf_{Φ∈X*, ‖Φ‖>1} {sup Φ(G) − Φ(x₀)}.  (5.20)

Proof. Let us first assume that x₀ = 0, so formula (5.20) becomes

inf_{‖Φ‖=1} sup Φ(G) = inf_{‖Φ‖≥1} sup Φ(G) = inf_{‖Φ‖>1} sup Φ(G).  (5.21)

If G = X, then all members of (5.21) are equal to +∞. Assume now that G ≠ X. Then, since {Φ ∈ X* | ‖Φ‖ = 1} ⊆ {Φ ∈ X* | ‖Φ‖ ≥ 1}, we have

inf_{‖Φ‖≥1} sup Φ(G) ≤ inf_{‖Φ‖=1} sup Φ(G).  (5.22)

On the other hand, let Φ₀ ∈ X*, ‖Φ₀‖ ≥ 1, be arbitrary. Then 1/‖Φ₀‖ ≤ 1, and since 0 ∈ G, we have sup Φ₀(G) ≥ 0. Hence,

inf_{‖Φ‖=1} sup Φ(G) ≤ sup (Φ₀/‖Φ₀‖)(G) = (1/‖Φ₀‖) sup Φ₀(G) ≤ sup Φ₀(G),

and therefore, since Φ₀ ∈ X* with ‖Φ₀‖ ≥ 1 was arbitrary, we obtain, using also (5.22), that

inf_{‖Φ‖=1} sup Φ(G) = inf_{‖Φ‖≥1} sup Φ(G).  (5.23)

Furthermore, since G ≠ X and G is convex, by the strict separation theorem there exists Φ₀ ∈ X*\{0} such that sup Φ₀(G) < +∞. Let Φ₀ ∈ X* with ‖Φ₀‖ = 1, sup Φ₀(G) < +∞ be arbitrary and let μ > 1. Then ‖μΦ₀‖ > 1 and

inf_{‖Φ‖>1} sup Φ(G) ≤ sup (μΦ₀)(G) = μ sup Φ₀(G),

whence, using that μ > 1 and Φ₀ ∈ X* with ‖Φ₀‖ = 1 were arbitrary, we obtain

inf_{‖Φ‖>1} sup Φ(G) ≤ inf_{‖Φ‖=1} sup Φ(G).  (5.24)

By (5.23) and (5.24), it follows that

inf_{‖Φ‖=1} sup Φ(G) = inf_{‖Φ‖≥1} sup Φ(G) ≤ inf_{‖Φ‖>1} sup Φ(G) ≤ inf_{‖Φ‖=1} sup Φ(G),

which yields (5.21).

Assume now that x₀ ∈ G is arbitrary. Then G − x₀ is a convex subset of X containing 0, and hence, applying (5.21) to G − x₀ and using that sup Φ(G − x₀) = sup Φ(G) − Φ(x₀), we obtain (5.20). ∎
Remark 5.4. Combining Theorem 5.1 and Proposition 5.1, one obtains further expressions of dist(x₀, CG).

One can replace in (5.9) hyperplanes by other sets, such as quasi-supporting closed or open half-spaces (see Figures 5.4(a) and (b)). Indeed, we have the following theorem:

Theorem 5.2. Let X be a normed linear space, G a convex subset of X such that int G ≠ ∅, and x₀ ∈ G. Then

dist(x₀, CG) = inf_{Φ∈X*, ‖Φ‖=1} inf_{y∈X, Φ(y)≥sup Φ(G)} ‖y − x₀‖ = inf_{Φ∈X*, ‖Φ‖=1} inf_{y∈X, Φ(y)>sup Φ(G)} ‖y − x₀‖.  (5.25)

Proof. It will be enough to prove that

dist(x₀, CG) = inf_{‖Φ‖=1} dist(x₀, U_{Φ, sup Φ(G)}),  (5.26)

where U_{Φ, sup Φ(G)} is the open half-space

U_{Φ, sup Φ(G)} = {y ∈ X | Φ(y) > sup Φ(G)},  (5.27)

or equivalently,

dist(x₀, CG) = inf_{U∈U_G} dist(x₀, U),  (5.28)

where U_G denotes the collection of all open half-spaces that quasi-support G and do not contain int G. Since x₀ ∈ G, we have x₀ ∉ U_{Φ, sup Φ(G)}, whence by Corollary 1.4, dist(x₀, U_{Φ, sup Φ(G)}) = sup Φ(G) − Φ(x₀) for all Φ ∈ X* with ‖Φ‖ = 1. Hence by Theorem 5.1, we obtain (5.26). ∎
Figure 5.4.

Let us show now that in the case 0 ∈ G (G ≠ {0}), it is enough to consider d = 1 in Corollary 5.1 above.

Theorem 5.3. Let G be a convex subset of X with 0 ∈ G, and let x₀ ∈ G. If int G ≠ ∅, then

dist(x₀, CG) = inf_{Φ∈X*\{0}, sup Φ(G)≤1} (1 − Φ(x₀))/‖Φ‖ = inf_{Φ∈X*\{0}, Φ(g)≤1 (g∈G)} (1 − Φ(x₀))/‖Φ‖.  (5.29)

Proof. By (5.12), we have the inequalities ≤ in (5.29). On the other hand, since 0 ∈ G, for any Φ ∈ X*\{0} we have sup Φ(G) ≥ 0, and hence for any Φ ∈ X*\{0} and d ∈ R with sup Φ(G) < d we have d > 0. Then the function Φ′ = (1/d)Φ ∈ X*\{0} satisfies sup Φ′(G) ≤ 1. Also,

(d − Φ(x₀))/‖Φ‖ = (1 − Φ′(x₀))/‖Φ′‖,  (5.30)

which, by the last part of Remark 5.3(b), yields the inequalities ≥ in (5.29), and hence the equalities. ∎

Remark 5.5. By Corollary 1.4, one can also write the first equality of (5.29) in the following geometric form:

dist(x₀, CG) = inf_{Φ∈G°\{0}} dist(x₀, {y ∈ X | Φ(y) ≥ 1}),  (5.31)

where G° is the (usual) polar (1.82) (with C = G) of G.

In the case dim X < +∞, one can obtain more complete results. Indeed, let us first prove the following proposition:

Proposition 5.2. If G is a convex subset of a finite-dimensional normed linear space X, and x₀ ∈ G, then

dist(x₀, CG) = dist(x₀, CḠ).  (5.32)

Proof. Since G ⊆ Ḡ, we have CḠ ⊆ CG, whence

dist(x₀, CG) ≤ dist(x₀, CḠ).  (5.33)

Assume now that the inequality (5.33) is strict, so there exists ε > 0 such that

dist(x₀, CG) + 2ε < dist(x₀, CḠ).  (5.34)

Choose z ∈ CG such that ‖x₀ − z‖ ≤ dist(x₀, CG) + ε. Then z ∈ Ḡ (since if z ∈ CḠ, then dist(x₀, CḠ) ≤ ‖x₀ − z‖ ≤ dist(x₀, CG) + ε, which contradicts (5.34)). Hence, z ∈ Ḡ ∩ CG ⊆ bd G = bd Ḡ, where the last equality holds by dim X < +∞ and the convexity of G. Consequently, there exists y ∈ CḠ such that ‖z − y‖ ≤ ε. Then by (5.34), we obtain

‖x₀ − y‖ ≤ ‖x₀ − z‖ + ‖z − y‖ ≤ dist(x₀, CG) + 2ε < dist(x₀, CḠ),

in contradiction to y ∈ CḠ. ∎
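Proposition 5.2 is easy to probe numerically in the plane. The following sketch is a hypothetical illustration of mine, not part of the book: for G the open unit disc and Ḡ the closed unit disc in R², a brute-force grid estimate shows that the distances from x₀ to the two complements agree (both equal 1 − ‖x₀‖ up to grid resolution):

```python
import math

# Hypothetical finite-dimensional check of Proposition 5.2: X = R^2,
# G = open unit disc, cl(G) = closed unit disc.  Both dist(x0, CG) and
# dist(x0, C cl(G)) equal 1 - ||x0||; estimated over a grid on [-2, 2]^2.
def dist_to_complement(x0, closed_ball, n=400):
    best = float("inf")
    for i in range(n + 1):
        for j in range(n + 1):
            y = (-2 + 4 * i / n, -2 + 4 * j / n)
            r = math.hypot(*y)
            inside = (r <= 1) if closed_ball else (r < 1)  # y in the ball?
            if not inside:                                  # y in the complement
                best = min(best, math.hypot(y[0] - x0[0], y[1] - x0[1]))
    return best

x0 = (0.25, 0.25)
d_open = dist_to_complement(x0, closed_ball=False)
d_closed = dist_to_complement(x0, closed_ball=True)
print(d_open, d_closed, 1 - math.hypot(*x0))  # all approximately equal
```

As Remark 5.6(a) notes, this agreement is a genuinely finite-dimensional phenomenon and fails in C([0, 1]).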
Remark 5.6. (a) The assumption dim X < +∞ cannot be omitted in Proposition 5.2, as shown by Remark 5.2(a) above.

(b) One can also give the following alternative proof of Proposition 5.2: Since int Ḡ = int G (by dim X < +∞ and the convexity of G), we have, by (1.52) and Lemma 1.2,

dist(x₀, CG) = dist(x₀, C(int G)) = dist(x₀, C(int Ḡ)) = dist(x₀, CḠ).

Proposition 5.3. Let dim X < +∞, G a convex subset of X, and x₀ ∈ G. Then we have (5.2). If, in addition, 0 ∈ G, then we also have (5.29).

Proof. For the first part, by Theorem 5.1 and Remark 5.1, we have to prove that if int G = ∅, then the right-hand side of (5.2) is 0. Since dim X < +∞ and G is a convex set with int G = ∅, G is contained in some hyperplane {x ∈ X | Φ₀(x) = d}, with Φ₀ ∈ X*, ‖Φ₀‖ = 1, d ∈ R. Hence, since x₀ ∈ G, we have

0 ≤ inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} ≤ sup Φ₀(G) − Φ₀(x₀) = d − d = 0.

If, in addition, 0 ∈ G, then d = 0, and hence, since x₀ ∈ G (so Φ₀(x₀) = 0) and sup (μΦ₀)(G) = μ sup Φ₀(G) = 0 ≤ 1 for all μ > 0, we obtain

0 ≤ inf_{Φ∈X*\{0}, sup Φ(G)≤1} (1 − Φ(x₀))/‖Φ‖ ≤ inf_{μ>0} (1 − μΦ₀(x₀))/‖μΦ₀‖ = inf_{μ>0} 1/(‖Φ₀‖μ) = 0. ∎

Remark 5.7. Alternatively, the first part of Proposition 5.3 also follows from Proposition 5.2 and Remark 5.2(d) applied to Ḡ.
5.2 Characterizations and existence of elements of best approximation in complements of convex sets

We shall first give some characterizations of elements of best approximation of x₀ in CG, where G is a convex set and x₀ ∈ G, i.e., some necessary and sufficient conditions in order that z₀ ∈ P_CG(x₀) (that is, z₀ ∈ CG and ‖x₀ − z₀‖ = dist(x₀, CG)). To this end, we shall use the distance formula (5.2) of Theorem 5.1. Note that we do not need to consider the case of G closed (see Remark 5.2(d)), since in that case CG is open, so P_CG(x₀) = ∅.

Theorem 5.4. Let X be a normed linear space, G a convex subset of X with int G ≠ ∅, and let x₀ ∈ G. For an element z₀ ∈ CG, the following statements are equivalent:
1°. ‖x₀ − z₀‖ = dist(x₀, CG).
2°. We have

z₀ ∈ bd CG = bd G,  (5.35)

and there exists Φ₀ ∈ X* such that

sup Φ₀(G) − Φ₀(x₀) = inf_{Φ∈X*, ‖Φ‖=1} {sup Φ(G) − Φ(x₀)},  (5.36)

Φ₀(z₀ − x₀) = ‖x₀ − z₀‖.  (5.37)

3°. We have (5.35), and there exists Φ₀ ∈ X* satisfying (5.36) and

sup Φ₀(G) − Φ₀(x₀) = ‖x₀ − z₀‖.  (5.38)

Moreover, in 2° and 3° we may also assume that

‖Φ₀‖ = 1.  (5.39)

Proof. Assume 1°. Then we have (5.35). Since int G ≠ ∅ and z₀ ∈ CG, by the separation theorem there exists Φ₀ ∈ X* such that sup Φ₀(G) ≤ Φ₀(z₀); clearly, we may assume that ‖Φ₀‖ = 1. Hence by int G ≠ ∅, Theorem 5.1, and 1°, we obtain

‖x₀ − z₀‖ ≥ Φ₀(z₀ − x₀) ≥ sup Φ₀(G) − Φ₀(x₀) ≥ inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} = dist(x₀, CG) = ‖x₀ − z₀‖,

whence (5.36) and (5.37). Thus, 1° ⇒ 2°.

Assume now 2°. Then by (5.35), we have z₀ ∈ bd G ⊆ Ḡ, whence sup Φ₀(G) ≥ Φ₀(z₀), and hence, by (5.37), (5.36), and Theorem 5.1,

‖x₀ − z₀‖ = Φ₀(z₀ − x₀) ≤ sup Φ₀(G) − Φ₀(x₀) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} = dist(x₀, CG) ≤ ‖x₀ − z₀‖,

which yields (5.38). Thus, 2° ⇒ 3°.

Assume, finally, 3°. Then by (5.38), (5.36), and Theorem 5.1,

‖x₀ − z₀‖ = sup Φ₀(G) − Φ₀(x₀) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} = dist(x₀, CG),

so z₀ ∈ P_CG(x₀). Thus, 3° ⇒ 1°, which proves the equivalence of 1°, 2°, and 3°.

Finally, if we have 2°, then by (5.37), ‖Φ₀‖ ≥ 1 (since otherwise Φ₀(z₀ − x₀) ≤ ‖Φ₀‖ ‖z₀ − x₀‖ < ‖z₀ − x₀‖), so 1/‖Φ₀‖ ≤ 1. Hence by (5.36),

sup (Φ₀/‖Φ₀‖)(G) − (Φ₀/‖Φ₀‖)(x₀) = (1/‖Φ₀‖){sup Φ₀(G) − Φ₀(x₀)} = (1/‖Φ₀‖) inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)} ≤ inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)},

and therefore

sup (Φ₀/‖Φ₀‖)(G) − (Φ₀/‖Φ₀‖)(x₀) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)},

which shows that (5.36) is satisfied also for Φ₀ replaced by Φ₀/‖Φ₀‖, i.e., that in 2° we may assume (5.39). Consequently, in 3°, too, we may assume (5.39) (because in the above proof of the implication 2° ⇒ 3° we have used the same Φ₀). ∎
Remark 5.8. When G is a bounded convex set, the equivalence 1° ⇔ 3° of Theorem 5.4 admits the following geometric interpretation: for an element z₀ ∈ CG we have z₀ ∈ P_CG(x₀) if and only if there exists Φ₀ ∈ X* with ‖Φ₀‖ = 1 such that the quasi-support hyperplane

H_{Φ₀, sup Φ₀(G)} = {y ∈ X | Φ₀(y) = sup Φ₀(G)}  (5.40)

satisfies

dist(x₀, H_{Φ₀, sup Φ₀(G)}) = inf_{H∈H_G} dist(x₀, H),  (5.41)

‖x₀ − z₀‖ = dist(x₀, H_{Φ₀, sup Φ₀(G)});  (5.42)

or, equivalently (by Corollary 1.1), for an element z₀ ∈ CG we have z₀ ∈ P_CG(x₀) if and only if there exists a hyperplane H₀ ∈ H_G satisfying

dist(x₀, H₀) = inf_{H∈H_G} dist(x₀, H),  (5.43)

‖x₀ − z₀‖ = dist(x₀, H₀)  (5.44)

(see Figure 5.5). Indeed, by x₀ ∈ G, ‖Φ₀‖ = 1, and Lemma 1.5,

dist(x₀, H_{Φ₀, sup Φ₀(G)}) = |Φ₀(x₀) − sup Φ₀(G)| = sup Φ₀(G) − Φ₀(x₀).  (5.45)

Figure 5.5.

Remark 5.9. (a) When G is unbounded, there exists (by the uniform boundedness principle) Φ₀ ∈ X* such that sup Φ₀(G) = +∞, so then H_{Φ₀, sup Φ₀(G)} = ∅; hence in this case, (5.41) does not hold (since its left-hand side is +∞, while its right-hand side is finite).

(b) The above proof of the implication 2° ⇒ 3° shows that for each pair z₀, Φ₀ as in 2° of Theorem 5.4, we have

Φ₀(z₀) = sup Φ₀(G).  (5.46)

(c) When G is a bounded convex set, (5.46) is equivalent to

z₀ ∈ H_{Φ₀, sup Φ₀(G)}.  (5.47)
Now we shall give some examples in the plane R² endowed with the Euclidean norm

‖x‖ := √(|x₁|² + |x₂|²)  (x = (x₁, x₂) ∈ R²),  (5.48)

showing that various parts of 2° and 3° above cannot be omitted.

Example 5.1. Let X = R² with the norm (5.48), G = {y ∈ R² | ‖y‖ < 1}, and x₀ = 0. Then for the element z₀ = (2, 0) ∈ int CG and the function Φ₀ ∈ (R²)* defined by

Φ₀(x) = x₁  (x = (x₁, x₂) ∈ R²),  (5.49)

we have ‖Φ₀‖ = 1 and sup Φ(G) = ‖Φ‖ (Φ ∈ (R²)*), whence sup Φ₀(G) − Φ₀(x₀) = 1 = inf_{Φ∈X*, ‖Φ‖=1} {sup Φ(G) − Φ(x₀)}, so Φ₀ satisfies (5.36); but ‖x₀ − z₀‖ = 2 ≠ 1 = dist(x₀, CG), so z₀ ∉ P_CG(x₀) (here (5.35) fails, since z₀ ∈ int CG).

Theorem 5.5. Let X be a normed linear space, G an open convex subset of X with 0 ∈ G, x₀ ∈ G, and z₀ ∈ CG. The following statements are equivalent:
1°. ‖x₀ − z₀‖ = dist(x₀, CG).
2°. There exists Ψ₀ ∈ X*\{0} satisfying

‖x₀ − z₀‖ = (1 − Ψ₀(x₀))/‖Ψ₀‖,  (5.50)

(1 − Ψ₀(x₀))/‖Ψ₀‖ = inf_{Φ∈X*\{0}, Φ(g)≤1 (g∈G)} (1 − Φ(x₀))/‖Φ‖.  (5.51)

3°. There exists Ψ₀ ∈ X*\{0} satisfying (5.50), (5.51), and

Ψ₀(g) < 1  (g ∈ G),  (5.52)

Ψ₀(z₀) = 1.  (5.53)
Proof. 1° ⇒ 3°. If 1° holds, then by Theorem 5.4 and Remark 5.9(b), there exists Φ₀ ∈ X*\{0} satisfying (5.39), (5.36), (5.38), and (5.46). Since G is an open set, by 0 ∈ G and (5.39) we have sup Φ₀(G) > 0. Let

Ψ₀ := (1/sup Φ₀(G)) Φ₀.  (5.54)

Then by (5.39), Ψ₀ ≠ 0. Furthermore, since G is an open set, by Lemma 1.8 we have (5.52). Also, by (5.46), we have (5.53). Hence, by 1°, Theorem 5.3, (5.52), and (5.53), we obtain

‖x₀ − z₀‖ = dist(x₀, CG) = inf_{Φ∈X*\{0}, Φ(g)≤1 (g∈G)} (1 − Φ(x₀))/‖Φ‖ = (1 − Ψ₀(x₀))/‖Ψ₀‖,

whence (5.50) and (5.51).

The implication 3° ⇒ 2° is obvious.

2° ⇒ 1°. Assume now that Ψ₀ ∈ X*\{0} satisfies (5.50) and (5.51). Then by (5.50), (5.51), and Theorem 5.3, we obtain

‖x₀ − z₀‖ = (1 − Ψ₀(x₀))/‖Ψ₀‖ = inf_{Φ∈X*\{0}, Φ(g)≤1 (g∈G)} (1 − Φ(x₀))/‖Φ‖ = dist(x₀, CG). ∎

Now we shall study the existence of elements z₀ ∈ CG for which the dist in the left-hand side of (5.2) is attained (i.e., of elements of best approximation z₀ ∈ P_CG(x₀)).

Definition 5.1. We shall call an optimal dual solution, or, briefly, optimal function (with respect to the pair (CG, x₀)), any function Φ₀ ∈ X* with ‖Φ₀‖ = 1 for which the inf in the right-hand side of (5.2) is attained (i.e., any Φ₀ ∈ X* satisfying (5.39) and (5.36)).

Theorem 5.6. Let X be a normed linear space, G a convex subset of X with int G ≠ ∅, and x₀ ∈ G. We have P_CG(x₀) ≠ ∅ if and only if there exists an optimal dual solution Φ₀ ∈ X* such that

CG ∩ {y ∈ X | ‖y − x₀‖ = sup Φ₀(G) − Φ₀(x₀)} ≠ ∅.  (5.55)

Proof. The condition means that there should exist z₀ ∈ CG and Φ₀ ∈ X* satisfying (5.39), (5.36), and (5.38), so the result follows from Theorem 5.4, implication 3° ⇒ 1°. ∎

Remark 5.10. By Corollary 1.1, Lemma 1.5, and x₀ ∈ G, when G is a bounded convex set, a function Φ₀ ∈ X* with ‖Φ₀‖ = 1 is an optimal dual solution if and only if the hyperplane H₀ = H_{Φ₀, sup Φ₀(G)} ∈ H_G defined by (5.40) satisfies (5.41); we shall call any such hyperplane an optimal hyperplane. Then Theorem 5.6 admits the following geometric interpretation: we have P_CG(x₀) ≠ ∅ if and only if there exists an optimal hyperplane H₀ ∈ H_G such that

CG ∩ H₀ ∩ bd B(x₀, dist(x₀, H₀)) ≠ ∅.  (5.56)

Since H₀ ∩ bd B(x₀, dist(x₀, H₀)) = P_{H₀}(x₀), condition (5.56) can also be written in the form

CG ∩ P_{H₀}(x₀) ≠ ∅.  (5.57)

Now we shall show that if X is reflexive and G is an open convex subset of X, then Theorem 5.6 and Remark 5.10 can be improved; namely, conditions (5.55)–(5.57) can be omitted.

Theorem 5.7. Let X be a reflexive Banach space, G an open convex subset of X, and x₀ ∈ G. We have P_CG(x₀) ≠ ∅ if and only if there exists an optimal dual solution Φ₀ ∈ X* (or, equivalently, an optimal hyperplane).

Proof. The necessity of the condition follows from Theorem 5.6. Conversely, assume now that there exists an optimal dual solution Φ₀ ∈ X* (so we have (5.39) and (5.36)), and let H₀ be the hyperplane defined by (5.40). Then, since X is reflexive, P_{H₀}(x₀) ≠ ∅. Let z₀ ∈ P_{H₀}(x₀). Then, since G is open, by Remark 5.2(b) we have z₀ ∈ H₀ ⊆ CG. Also, by Lemma 1.5 applied to the hyperplane (5.40), ‖z₀ − x₀‖ = dist(x₀, H₀) = |sup Φ₀(G) − Φ₀(x₀)|. But since x₀ ∈ G, we have sup Φ₀(G) − Φ₀(x₀) ≥ 0, so ‖z₀ − x₀‖ = sup Φ₀(G) − Φ₀(x₀). Consequently, by Theorem 5.4, implication 3° ⇒ 1°, we obtain z₀ ∈ P_CG(x₀). ∎

Finally, we shall summarize the connections between the existence of elements of best approximation and the existence of optimal dual solutions. To this end, we shall denote by A_CG(x₀) the set of all optimal dual solutions (with respect to the pair (CG, x₀)).

Theorem 5.8. Let X be a normed linear space, G a convex subset of X with int G ≠ ∅, and x₀ ∈ G. Then:
(a) P_CG(x₀) ≠ ∅ ⇒ A_CG(x₀) ≠ ∅;
(b) A_CG(x₀) ≠ ∅ ⇏ P_CG(x₀) ≠ ∅;
(c) if X is reflexive and G is open, then P_CG(x₀) ≠ ∅ ⇔ A_CG(x₀) ≠ ∅;
(d) it may happen that both P_CG(x₀) = ∅ and A_CG(x₀) = ∅, even in the Hilbert space l².

Proof. (a) is an obvious consequence of Theorem 5.4 (or of Theorem 5.6). (c) is nothing other than Theorem 5.7. Finally, (b) and (d) are proved by the following two examples, which complete the proof of Theorem 5.8:
Example 5.4. Let X = l¹, let

G = {y = (y_n) ∈ l¹ | Σ_{n=1}^∞ (n/(n+1)) |y_n| < 1},  (5.58)

and let x₀ = 0. Then clearly, G is convex, and

B(0, 1) = {y = (y_n) ∈ l¹ | Σ_{n=1}^∞ |y_n| < 1} ⊆ G ⊆ B(0, 2),  (5.59)

so CG ⊆ CB(0, 1) = {y ∈ l¹ | ‖y‖ ≥ 1}. Also, ((n+1)/n) e_n ∈ CG (n = 1, 2, ...), where e_n denotes the nth unit vector. Hence, by ‖((n+1)/n) e_n‖ = (n+1)/n → 1, we obtain

dist(x₀, CG) = 1.  (5.60)

Furthermore, by (5.58), for each y = (y_n) ∈ CG we have

‖x₀ − y‖ = ‖y‖ = Σ_{n=1}^∞ |y_n| > Σ_{n=1}^∞ (n/(n+1)) |y_n| ≥ 1,  (5.61)

whence by (5.60), it follows that P_CG(x₀) = ∅. However, the function Φ₀ ∈ X* defined by

Φ₀(y) := Σ_{n=1}^∞ (n/(n+1)) y_n  (y = (y_n) ∈ l¹)

satisfies ‖Φ₀‖ = 1, and by (5.60) and Theorem 5.1,

sup Φ₀(G) − Φ₀(x₀) = sup_{y∈G} Σ_{n=1}^∞ (n/(n+1)) y_n = 1 = dist(x₀, CG) = inf_{‖Φ‖=1} {sup Φ(G) − Φ(x₀)},

so Φ₀ ∈ A_CG(x₀).
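A small numeric sketch (my own illustration, not part of the book) makes the non-attainment in Example 5.4 tangible: the points z_n = ((n+1)/n) e_n lie in CG with l¹-norms (n+1)/n decreasing to 1, while every point of CG has norm strictly greater than 1.

```python
from fractions import Fraction

# Hypothetical check of Example 5.4 in l^1: G = {y : sum_n (n/(n+1))|y_n| < 1},
# x0 = 0.  The points z_n = ((n+1)/n) e_n lie in CG and ||z_n||_1 -> 1, so
# dist(0, CG) = 1, yet the infimum is not attained.
def z_n_norm(n):
    # ||((n+1)/n) e_n||_1: a single nonzero coordinate
    return (n + 1) / n

def z_n_in_CG(n):
    # z_n is in CG iff sum_k (k/(k+1)) |(z_n)_k| >= 1; exact rational arithmetic
    return Fraction(n, n + 1) * Fraction(n + 1, n) >= 1

norms = [z_n_norm(n) for n in (1, 10, 100, 1000)]
print(all(z_n_in_CG(n) for n in (1, 10, 100, 1000)))  # True
print(norms)  # approaching, but never reaching, the distance 1
```

Exact rational arithmetic is used for the membership test because the defining sum for z_n equals exactly 1, which floating point could misclassify.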
Example 5.5. Let X = l², let

G = {y = (y_n) ∈ l² | Σ_{n=1}^∞ (n²/(n+1)²) y_n² < 1},  (5.62)

and let x₀ = 0. Then clearly, G is convex, and we have again (5.59). Also, ((n+1)/n) e_n ∈ CG (n = 1, 2, ...), whence again, we have (5.60). Furthermore, by (5.62), for each y = (y_n) ∈ CG we have

‖x₀ − y‖² = ‖y‖² = Σ_{n=1}^∞ y_n² > Σ_{n=1}^∞ (n²/(n+1)²) y_n² ≥ 1,

whence by (5.60), it follows that P_CG(x₀) = ∅. Consequently, since X is reflexive, by Theorem 5.7 we have A_CG(x₀) = ∅. ∎
Unperturbational Duality for Reverse Convex Infimization
Given a locally convex space X, with conjugate space X*, a convex subset G of X, and a function f: X ^^ R, in this chapter we shall give some results of unperturbational duality for the primal "reverse convex infimization" problem ( n
= (PGJ)
«' = < j
= inf / ( C G ) .
(6.1)
Any zo ^ CG for which the inf in (6.1) is attained, i.e., such that /(zo) = min/(CG), is called an optimal solution of problem (P^); these will be studied in Chapter 7. Taking G' := CG, one can also write (6.1) as the infimization problem
a'-
^Mf{G').
However, now we shall obtain dual problems that are different from the "usual" dual problems to convex and quasi-convex infimization problems (see Chapter 1, Section 1.4). In contrast to the cases of convex and quasi-convex infimization, it will turn out that for reverse convex infimization the theory of surrogate duality is more developed (see Sections 6.1-6.3) than the theory of Lagrangian duality (see Section 6.4). Our starting point for the study of surrogate duality will be the observation that best approximation by reverse convex sets CG may be regarded as a particular case of reverse convex infimization, by taking X to be a normed linear space, XQ e X, and / : X -> /? the convex function (1.264); indeed, then inf/(CG) = dist(xo,CG),
(6.2)
170
6. Unperturbational Duality for Reverse Convex Infimization
and for this case, the optimal solutions zo e CG of problem (P^) are the elements of best approximation of XQ by CG. Although the extension from the particular function / of (1.264) to a function f:X -^ /? on a locally convex space X, is a rather big step, it turns out that, similarly to the case of passing from best approximation by convex sets to convex optimization, many results and methods of the theory of best approximation by reverse convex sets can be extended to results on the reverse convex infimization of functions. In analogy to the fact that formula (1.249) on the distance to a convex set extends to the surrogate duality formula (1.330) on quasi-convex infimization, it is natural to expect that formula (5.9) on the distance to a reverse convex set will extend, under certain assumptions on G and / , to a formula like inf/(CG)=
inf
OeX*\{0}
inf
f(y),
xeX
(6.3)
obtained formally by replacing in (5.9) the function f of (1.264) by a function f on a locally convex space X; this will be achieved in Section 6.1. Next, corresponding to formula (1.355) on infimization, one would like to replace the hyperplanes {y ∈ X : Φ(y) = sup Φ(G)} of (6.3) by other sets, e.g., closed half-spaces. Therefore, in Section 6.2, we shall consider "unconstrained surrogate dual problems" to problem (P′) of (6.1), defined as infimization problems of the form

β′ = inf λ′(X*\{0}),    (6.4)

where X*\{0} is the dual set (unconstrained), and λ′ = λ′_{G,f}: X*\{0} → ℝ̄ is a function (the dual objective function, depending on G and f) of the form

λ′(Φ) = inf f(Ω_{G,Φ})    (Φ ∈ X*\{0}),    (6.5)

with

{Ω_{G,Φ}}    (Φ ∈ X*\{0})    (6.6)

being a family of subsets of X, related in some way to G.
Problem (6.4), with λ′ of (6.5), is an unperturbational dual problem to (P′), since it is defined directly, without using the method of first embedding (P′) into a family of perturbed primal problems, and it is a surrogate dual problem to (P′), since it replaces the primal constraint set G of (6.1) by a family of "surrogate constraint sets" Ω_{G,Φ} ⊆ X (Φ ∈ X*\{0}) (while it keeps the primal objective function f unchanged). Next, more generally, in view of further applications, given an arbitrary set X, a subset G of X, and a function f: X → ℝ̄, for the infimization problem (P′) of (6.1) we shall consider in Section 6.3 a "surrogate dual problem" of the form

β′ = β′_{G,f} = inf λ′(W),    (6.7)
where W = W_{G,f} is a set (the dual constraint set) and λ′ = λ′_{G,f}: W → ℝ̄ is the function (the dual objective function) defined by

λ′_{G,f}(w) = inf f(Ω_{G,w})    (w ∈ W),    (6.8)

with {Ω_{G,w}}_{w∈W} being a family of subsets of X, related in some way to G. Then taking X to be a locally convex space, W = X*\{0}, and λ′ of (6.8), problem (6.7) reduces to problem (6.4). Furthermore, taking X to be a locally convex space, W ⊆ X*\{0} or W ⊆ (X*\{0}) × ℝ, and λ′ of (6.8), we shall obtain some useful unconstrained and "constrained" surrogate dual problems to problem (P′) of (6.1). Actually, as in Chapter 3, instead of {Ω_{G,w}}_{w∈W}, we shall find it more convenient to use the equivalent language of polarities Δ: 2^X → 2^W. In Section 6.4 we shall deal with unperturbational Lagrangian dual problems to problem (P′) of (6.1). Finally, the general dual problem (6.7) will permit us to study (unconstrained and constrained) surrogate duality for more structured primal reverse convex infimization problems (i.e., in which the primal constraint set G is expressed in more structured ways), by considering suitable dual constraint sets W and dual objective functions λ′ = λ′_{G,f}: W → ℝ̄ as in (6.8) (see Section 6.5).

Remark 6.1. This chapter is devoted to unperturbational duality results only, since until the present there exists no perturbational duality theory for reverse convex infimization corresponding to those for convex infimization (see Chapter 1, Section 1.4.2) and convex supremization (see Chapter 3, Section 3.4.2). Similar to (1.383), we have inf f(CG) = inf f̃(X), where f̃ = f + χ_{CG}; but the theory of Chapter 1 cannot be applied directly to this function f̃, since for a convex set G, in general χ_{CG} is not convex. Another attempt could be to note that

inf f(CG) = inf (f + χ_{CG})(X) = inf (f − (−χ_{CG}))(X),    (6.9)
and hence to develop a perturbational theory for infimization problems inf f̃(X), with f̃ of the form f̃ = f − h. In Chapter 8 we shall present a perturbational duality theory for such problems, but only when h is convex, so that theory cannot be applied to h = −χ_{CG}, where G is convex, i.e., to reverse convex infimization, since in general −χ_{CG} is not convex (however, note that it is quasi-convex when G is convex, since S_d(−χ_{CG}) is either G or X, for all d ∈ ℝ).
6.1 Some hyperplane theorems of surrogate duality

Let us start with a generalization of Chapter 5, Remark 5.1.

Remark 6.2. If G is a subset of a locally convex space X with int G = ∅, and f: X → ℝ̄ is an upper semicontinuous function, then

inf f(CG) = inf f(X).    (6.10)
Indeed, then by Lemmas 1.1 and 1.2, we have

inf f(CG) = inf f(cl(CG)) = inf f(C(int G)) = inf f(X).
We have the following hyperplane theorem of surrogate duality, generalizing the (equivalent) geometric form (5.9) of Chapter 5, Theorem 5.1.

Theorem 6.1. Let X be a locally convex space, G a convex subset of X, and f: X → ℝ̄ a function.

(a) If f is upper semicontinuous, then

inf f(CG) ≤ inf_{Φ∈X*\{0}} inf_{y∈X: Φ(y)=sup Φ(G)} f(y).    (6.11)
(b) If f is quasi-convex, int G ≠ ∅, and

inf f(G) ≤ inf f(CG),    (6.12)

then

inf f(CG) ≥ inf_{Φ∈X*\{0}} inf_{y∈X: Φ(y)=sup Φ(G)} f(y).    (6.13)
(c) If f is upper semicontinuous and quasi-convex, int G ≠ ∅, and if (6.12) holds, then (6.3) holds.

Proof. If G = X, then both sides of (6.11), (6.13), and (6.3) are +∞ (since inf ∅ = +∞). Thus, we may assume that G ≠ X.

(a) If int G = ∅, then (6.11) holds by Remark 6.2. If int G ≠ ∅, let Φ ∈ X*\{0} and

H := {y ∈ X : Φ(y) = sup Φ(G)}.    (6.14)

If sup Φ(G) = +∞, then H = ∅, whence inf f(CG) ≤ inf f(H) = +∞. If sup Φ(G) < +∞, then H ∩ int G = ∅, so H ⊆ C(int G) = cl(CG), whence, since f is upper semicontinuous, by Lemma 1.1,

inf f(CG) = inf f(cl(CG)) ≤ inf_{y∈X: Φ(y)=sup Φ(G)} f(y),

whence, since Φ ∈ X*\{0} with sup Φ(G) < +∞ was arbitrary, we obtain (6.11).

(b) Assume, a contrario, that

inf f(CG) < inf_{Φ∈X*\{0}} inf_{y∈X: Φ(y)=sup Φ(G)} f(y) =: d.    (6.15)

Then by (6.15) and (6.12), there exist x₀ ∈ CG and g₀ ∈ G such that

f(x₀) < d,    f(g₀) < d.    (6.16)

Since G is convex and int G ≠ ∅, by the separation theorem there exists Φ₀ ∈ X*\{0} such that

sup Φ₀(G) ≤ Φ₀(x₀).    (6.17)

But since the function φ: [0, 1] → ℝ defined by φ(η) := Φ₀(ηx₀ + (1 − η)g₀) is continuous, and φ(0) = Φ₀(g₀) ≤ sup Φ₀(G), φ(1) = Φ₀(x₀) ≥ sup Φ₀(G), there exists η₀ ∈ [0, 1] such that

Φ₀(η₀x₀ + (1 − η₀)g₀) = sup Φ₀(G).    (6.18)

Consequently, by the definition (6.15) of d, by (6.18), the quasi-convexity of f, and (6.16), we obtain

d ≤ inf_{y∈X: Φ₀(y)=sup Φ₀(G)} f(y) ≤ f(η₀x₀ + (1 − η₀)g₀) ≤ max{f(x₀), f(g₀)} < d,

which is impossible. This proves (6.13).

(c) This follows from (a) and (b).    □
Remark 6.3. (a) By inf ∅ = +∞, (6.3) is equivalent to

inf f(CG) = inf_{Φ∈X*\{0}: sup Φ(G)<+∞} inf_{y∈X: Φ(y)=sup Φ(G)} f(y).    (6.19)

Formula (6.19) admits the following geometric interpretation: We have

inf f(CG) = inf_{H∈ℋ_G} inf f(H),    (6.20)

where ℋ_G denotes the family of all hyperplanes in X that quasi-support the set G. This is another instance of the "reduction principle" (it permits one to reduce the computation of inf f(CG) to the computation of inf f(H), for all H ∈ ℋ_G).

(b) The first infimum in the right-hand sides of (6.3) and (6.20) need not be attained, even in the particular case that X is the Hilbert space ℓ² and f is a continuous convex function of the form

f(x) = ‖x₀ − x‖    (x ∈ X),    (6.21)

where x₀ ∈ G, as shown by Example 5.5.

(c) The condition Φ ≠ 0 in (6.3) cannot be omitted, unless inf f(CG) = inf f(X); indeed, for Φ₀ = 0 we have {y ∈ X : Φ₀(y) = sup Φ₀(G)} = X, so if we allow also Φ = 0 in the right-hand side of (6.3), then this right-hand side becomes inf f(X).

(d) The assumption (6.12) is equivalent to

inf f(G) = inf f(X).    (6.22)
Indeed, if (6.12) holds, then inf f(G) = min{inf f(G), inf f(CG)} = inf f(X), and conversely, the fact that (6.22) implies (6.12) is obvious.

(e) In the particular case of best approximation, i.e., when X is a normed linear space and f is the function (6.21), where x₀ ∈ G, we have inf f(G) = dist(x₀, G) = 0, so the assumption (6.12) is satisfied.

The assumption (6.12) in Theorem 6.1(c) cannot be omitted, as shown by the following example.

Example 6.1. Let X be a normed linear space,

G = {x ∈ X : Φ₀(x) > 1},    (6.23)

where Φ₀ ∈ X*\{0} (so G is an open half-space that does not contain 0), and f the function (6.21) with x₀ = 0, that is,

f(x) = ‖x‖    (x ∈ X).    (6.24)

Then

inf f(G) = inf_{x∈G} ‖x‖ = 1/‖Φ₀‖ > 0 = inf_{x∈CG} ‖x‖ = inf f(CG),

so (6.12) is not satisfied. Furthermore, if Φ ∈ X*\{0} with sup Φ(G) < +∞, then by (6.23), we must have Φ = ηΦ₀ for some η ∈ ℝ, η < 0, whence

sup Φ(G) = sup(ηΦ₀)(G) = −inf(−η)Φ₀(G) = η.

Consequently, for any such Φ we have

inf_{y∈X: Φ(y)=sup Φ(G)} f(y) = inf_{y∈X: ηΦ₀(y)=η} f(y) = inf_{y∈X: Φ₀(y)=1} ‖y‖ = 1/‖Φ₀‖ > 0 = inf f(CG),
so (6.3) does not hold. The same conclusions hold also for the closed half-space G = {x ∈ X : Φ₀(x) ≥ 1}.

However, in the case when X is a normed linear space and G is also bounded, with int G ≠ ∅, the assumption (6.12) and the quasi-convexity of f can be omitted. Indeed, we have

Theorem 6.2. Let X be a normed linear space, G ≠ X a bounded convex subset of X with int G ≠ ∅, and f: X → ℝ̄ an upper semicontinuous function. Then we have (6.3).

Proof. The inequality ≤ in (6.3) holds by Theorem 6.1(a). In order to prove the opposite inequality, let x ∈ CG. Then by Theorem 1.3, there exists Ψ ∈ X*\{0} satisfying Ψ(x) = sup Ψ(G). Then

inf_{Φ∈X*\{0}} inf_{y∈X: Φ(y)=sup Φ(G)} f(y) ≤ inf_{y∈X: Ψ(y)=sup Ψ(G)} f(y) ≤ f(x),

whence, since x ∈ CG was arbitrary, we obtain the inequality ≥ in (6.3) and hence the equality (6.3).    □
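The duality gap of Example 6.1 can be observed numerically. A minimal sketch of our own two-dimensional instance (names ours): X = ℝ², Φ₀ = (2, 0), G = {x : Φ₀·x > 1}, and f = ‖·‖; the primal value is 0, while every admissible hyperplane yields the dual value 1/‖Φ₀‖.

```python
import math

# Our finite-dimensional instance of Example 6.1: an unbounded (half-space)
# constraint set produces a genuine duality gap in formula (6.3).
phi0 = (2.0, 0.0)
norm_phi0 = math.hypot(*phi0)

# primal value: 0 lies in CG = {x : phi0 . x <= 1}, so inf f(CG) = 0
primal = 0.0

# dual value: sup Phi(G) is finite only for Phi = eta*phi0 with eta < 0, and
# every such Phi gives the same constraint set, the hyperplane {phi0 . y = 1};
# the infimum of ||y|| over that hyperplane is 1/||phi0||
dual = 1.0 / norm_phi0

print(primal, dual)   # 0.0 0.5 -- the gap predicted by Example 6.1
```

By contrast, Theorem 6.2 guarantees that no such gap can occur once G is a bounded convex body, as in the disk instance sketched after (6.3).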
6.2 Unconstrained surrogate dual problems for reverse convex infimization

While in the preceding section we have been concerned with "hyperplane theorems" of surrogate duality, now we want also to consider other types of surrogate dual results for reverse convex infimization, e.g., "half-space theorems." To this end, as mentioned at the beginning of this chapter, we shall consider for the infimization problem (P′) of (6.1) a "surrogate dual problem" of the form (6.7), where W = W_{G,f} is a set (the dual constraint set) and λ′ = λ′_{G,f}: W → ℝ̄ is the function (the dual objective function) defined by (6.8), with {Ω_{G,w}}_{w∈W} being a family of subsets of X related in some way to G.

Remark 6.4. In the sequel, when considering a reverse convex infimization problem (P′) = (P′_{G,f}), we shall assume, without any special mention, that G ≠ X (since for G = X we have α′ = inf f(CG) = inf f(∅) = +∞). As in Chapter 3, we shall express the surrogate duality results in the (equivalent) language of polarities Δ: 2^X → 2^W.

Remark 6.5. (a) Using (3.39) and (1.144), the dual objective function λ′ of (6.8) becomes

λ′(w) = inf f(CΔ′({w})) = inf_{x∈X: w∈CΔ({x})} f(x)    (w ∈ W),    (6.25)
where Δ = Δ_G: 2^X → 2^W is a polarity (depending on G, but not on f). Then by (6.7) and (6.25), the dual value (i.e., the value of the dual problem) becomes

β′ = inf_{w∈W} inf f(CΔ′({w})) = inf_{w∈W} inf_{x∈X: w∈CΔ({x})} f(x).    (6.26)
As has been observed in Remark 3.3(a), formulas (3.40) and (3.39) yield a one-to-one correspondence between families of subsets {Ω_w}_{w∈W} of X and polarities Δ: 2^X → 2^W, so the two languages (6.8), (6.7) and (6.25), (6.26) are equivalent ways of expressing the dual objective function λ′ and the dual value β′. In the sequel we shall choose the language (6.25), (6.26), since this will allow us, by using (1.140), to express the results, e.g., on the relations between the primal and dual problems, in a more concise way. Thus in particular, in this section we shall consider unconstrained surrogate dual problems (6.4) to (P′), with the dual objective function being of the form

λ′(Φ) = inf f(CΔ′({Φ})) = inf_{x∈X: Φ∈CΔ({x})} f(x)    (Φ ∈ X*\{0}),    (6.27)

where Δ = Δ_G: 2^X → 2^{X*\{0}} is a polarity (depending on G). Then by (6.4) and (6.27), the dual value (i.e., the value of the dual problem) will be
β′ = inf_{Φ∈X*\{0}} inf f(CΔ′({Φ})) = inf_{Φ∈X*\{0}} inf_{x∈X: Φ∈CΔ({x})} f(x).    (6.28)
(b) If there exists w₀ ∈ W such that CΔ′({w₀}) = ∅, then by (6.25), we have λ′(w₀) = inf ∅ = +∞. Consequently, by (6.26),

β′ = inf_{w∈G′} inf f(CΔ′({w})),    (6.29)

where

G′ := {w ∈ W : CΔ′({w}) ≠ ∅}.    (6.30)
(c) We have

β′ = inf_{w∈W} inf_{x∈X: w∈CΔ({x})} f(x) = inf_{w∈W} inf_{x∈dom f: w∈CΔ({x})} f(x).    (6.31)

Indeed, (6.31) follows from (6.26) and

inf_{x∈(C(dom f))∩CΔ′({w})} f(x) = +∞    (w ∈ W).    (6.32)
(d) In the particular case of Theorems 6.1 and 6.2, we have W = X*\{0}, and by (6.6) and (3.39), the surrogate constraint sets are

CΔ′({Φ}) = {y ∈ X : Φ(y) = sup Φ(G)}    (Φ ∈ X*\{0}),    (6.33)

where Δ = Δ_G: 2^X → 2^{X*\{0}} is the polarity of (1.166), and the dual objective function is

λ′(Φ) = inf_{y∈X: Φ(y)=sup Φ(G)} f(y)    (Φ ∈ X*\{0}).    (6.34)

Note that for the primal problems of (3.1) and (6.1), the surrogate constraint sets (3.44) and (6.33) are the same, so the corresponding dual objective functions coincide on X*\{0}, and the only difference between the dual values is that in (3.42) we take sup_{Φ∈X*\{0}}, while in (6.28) there occurs inf_{Φ∈X*\{0}}. Moreover, in the sequel we shall use the same special polarities Δ as those used in Section 3.2.

We shall first give some necessary and sufficient conditions on G, f, and Δ in order that α ≤ β′ or α ≥ β′ or α = β′, where α ∈ ℝ̄ is arbitrary, in terms of the level sets S_d(f) and A_d(f) of (1.22) and (1.23).

Proposition 6.1. Let X, W be two sets, f: X → ℝ̄ a function, Δ: 2^X → 2^W a polarity, and α ∈ ℝ̄. The following statements are equivalent:
1°. We have

α ≤ β′ = inf_{w∈W} inf f(CΔ′({w})).    (6.35)

2°. We have

A_d(f) ∩ CΔ′({w}) = ∅    (w ∈ W, d ∈ ℝ, d < α).    (6.36)

3°. We have

S_d(f) ∩ CΔ′({w}) = ∅    (w ∈ W, d ∈ ℝ, d < α).    (6.37)

4°. We have

A_α(f) ∩ CΔ′({w}) = ∅    (w ∈ W).    (6.38)
Proof. 2° ⇔ 1°. By Lemma 3.4, condition 2° is equivalent to

inf f(CΔ′({w})) ≥ d    (w ∈ W, d ∈ ℝ, d < α),    (6.39)

i.e., to inf f(CΔ′({w})) ≥ α (w ∈ W), which is equivalent to 1°. Finally, the equivalence 2° ⇔ 3° follows from the inclusions

A_d(f) ⊆ S_d(f) ⊆ A_{d′}(f)    (d, d′ ∈ ℝ, d < d′),    (6.40)

and the equivalence 2° ⇔ 4° follows from A_α(f) = ∪_{d∈ℝ: d<α} A_d(f).    □
Proposition 6.2. Let X, W be two sets, f: X → ℝ̄ a function, Δ: 2^X → 2^W a polarity, and α ∈ ℝ̄. The following statements are equivalent:

1°. We have

α ≥ β′ = inf_{w∈W} inf f(CΔ′({w})).    (6.41)

2°. For each d ∈ ℝ, d > α, there exists w_d ∈ W such that

A_d(f) ∩ CΔ′({w_d}) ≠ ∅.    (6.42)

3°. For each d ∈ ℝ, d > α, there exists w_d ∈ W such that

S_d(f) ∩ CΔ′({w_d}) ≠ ∅.    (6.43)
Proof. 1° ⇒ 2°. If 1° holds and d ∈ ℝ, d > α ≥ β′ = inf_{w∈W} inf f(CΔ′({w})), then there exists w_d ∈ W such that d > inf f(CΔ′({w_d})), whence by Lemma 3.4, we obtain (6.42). The implication 2° ⇒ 3° is obvious.

3° ⇒ 1°. If d ∈ ℝ, d > α, and w_d ∈ W satisfy (6.43), say, x_d ∈ S_d(f) ∩ CΔ′({w_d}), then

β′ = inf_{w∈W} inf f(CΔ′({w})) ≤ inf f(CΔ′({w_d})) ≤ f(x_d) ≤ d;

hence β′ ≤ inf_{d>α} d = α. On the other hand, if there exists no d ∈ ℝ such that d > α, then β′ ≤ +∞ = α.    □
Combining Propositions 6.1 and 6.2, we obtain the following result:
Theorem 6.3. Let X, W be two sets, f: X → ℝ̄ a function, Δ: 2^X → 2^W a polarity, and α ∈ ℝ̄. The following statements are equivalent:

1°. We have

α = β′ = inf_{w∈W} inf f(CΔ′({w})).    (6.44)

2°. We have (6.36), and for each d ∈ ℝ, d > α, there exists w_d ∈ W satisfying (6.42).

3°. We have (6.37), and for each d ∈ ℝ, d > α, there exists w_d ∈ W satisfying (6.43).

Let us give now, for the case α = α′ of (6.1), some convenient sufficient conditions in order that α′ ≤ β′ or α′ ≥ β′ or α′ = β′, involving only G and Δ, but not f. To this end, we shall need some preparation.

Lemma 6.1. Let X, W be two sets. Then for any polarity Δ: 2^X → 2^W and any set P ⊆ W,

∪_{w∈P} CΔ′({w}) = CΔ′(P).    (6.45)

Consequently, for β′ of (6.26) we have

β′ = inf f(CΔ′(W)).    (6.46)
Proof. By (1.140) (applied to Δ′),

∪_{w∈P} CΔ′({w}) = C(∩_{w∈P} Δ′({w})) = CΔ′(P),

which proves (6.45). Hence by (6.26), Lemma 3.7 of Chapter 3 applied to {A_i}_{i∈I} = {CΔ′({w})}_{w∈W}, and (6.45), we obtain

β′ = inf_{w∈W} inf f(CΔ′({w})) = inf f(∪_{w∈W} CΔ′({w})) = inf f(CΔ′(W)).    □
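Since Lemma 6.1 holds for arbitrary polarities on arbitrary sets, (6.45) and (6.46) can be verified exhaustively on a finite toy model. A sketch with our own choices of X, W, a relation ρ, and f, using the relation-induced polarity Δ(A) = {w : (x, w) ∈ ρ for all x ∈ A} and its dual Δ′:

```python
from itertools import combinations

# Toy finite model of a polarity (sets, relation rho, and f are illustrative
# choices of ours, not from the text).
X = {0, 1, 2, 3}
W = {'a', 'b', 'c'}
rho = {(0, 'a'), (0, 'b'), (1, 'a'), (2, 'b'), (2, 'c'), (3, 'c')}

def delta_prime(P):
    # Delta'(P) = {x : (x, w) in rho for all w in P}
    return {x for x in X if all((x, w) in rho for w in P)}

f = {0: 5.0, 1: 2.0, 2: 7.0, 3: 1.0}

# (6.45): the union of the sets C Delta'({w}), w in P, equals C Delta'(P)
ok = True
for r in range(len(W) + 1):
    for Pt in combinations(sorted(W), r):
        P = set(Pt)
        lhs = set().union(*(X - delta_prime({w}) for w in P)) if P else set()
        ok = ok and (lhs == X - delta_prime(P))
print(ok)   # True for every P in 2^W

# (6.46): iterating the infimum over w gives inf f over C Delta'(W)
beta_iterated = min(min((f[x] for x in X - delta_prime({w})),
                        default=float('inf')) for w in W)
beta_direct = min((f[x] for x in X - delta_prime(W)), default=float('inf'))
print(beta_iterated, beta_direct)   # 1.0 1.0
```

The empty-family case P = ∅ works out because Δ′(∅) = X, so both sides of (6.45) reduce to ∅.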
Corollary 6.1. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity. If

G = Δ′(W),    (6.47)

then we have the "weak duality equality" α′ = β′, that is,

inf f(CG) = inf_{w∈W} inf f(CΔ′({w})).    (6.48)

Proof. If (6.47) holds, then CG = CΔ′(W), whence by (6.46), we obtain (6.48).    □
Proposition 6.3. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity.

(a) If we have

G ⊆ Δ′(W),    (6.49)

then

α′ = inf f(CG) ≤ inf_{w∈W} inf f(CΔ′({w})) = β′.    (6.50)
(b) If X is a topological space, f: X → ℝ̄ is upper semicontinuous, and

int G ⊆ Δ′(W)    (6.51)

(where int G denotes the interior of G), then we have (6.50).

Proof. (a) If (6.49) holds, then CG ⊇ CΔ′(W), whence by (6.46), we obtain (6.50).

(b) If (6.51) holds, then by (1.20) and (6.51), cl(CG) = C(int G) ⊇ CΔ′(W). Hence if f is upper semicontinuous, then by (6.46) and inf f(CG) = inf f(cl(CG)) (see Lemma 1.1), we obtain (6.50).    □

Remark 6.6. (a) The following conditions are equivalent:

1°. We have (6.49).

2°. We have

W ⊆ Δ(G).    (6.52)

3°. We have

Δ′Δ(G) ⊆ Δ′(W).    (6.53)

Indeed, 1° ⇔ 2° by (1.144). Furthermore, 2° ⇒ 3°, since Δ′ is antitone. Finally, if 3° holds, then G ⊆ Δ′Δ(G) ⊆ Δ′(W), so 3° ⇒ 1°.

(b) If Δ: 2^X → 2^W is a polarity such that

Δ′(W) = ∅,    (6.54)
then (6.49) implies that G = ∅, for which (6.50) is trivial.

(c) If Δ: 2^X → 2^W is a polarity satisfying (6.54), then condition (6.51) implies that int G = ∅.

(d) It is well known and immediate (see, e.g., [254], p. 194, Remark 6.3(a)) that we have (6.54) if and only if the empty set ∅ is Δ′Δ-convex (i.e., for each x ∈ X there exists w ∈ W such that x ∉ Δ′({w})), or equivalently, Δ({x}) ≠ W (x ∈ X).

Proposition 6.4. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity. If G is Δ′Δ-convex, then

G ⊇ Δ′(W),    (6.55)

and hence

α′ = inf f(CG) ≥ inf_{w∈W} inf f(CΔ′({w})) = β′.    (6.56)
Proof. By definition, G is Δ′Δ-convex if and only if

∀x ∈ CG, ∃w ∈ W: G ⊆ Δ′({w}), x ∈ CΔ′({w}).    (6.57)

Hence in particular, in this case

∀x ∈ CG, ∃w ∈ W: x ∈ CΔ′({w});

that is, we have, using also Lemma 6.1,

CG ⊆ ∪_{w∈W} CΔ′({w}) = CΔ′(W),    (6.58)

which is equivalent to (6.55). Also, clearly (6.58) implies (6.56).    □
Remark 6.7. If Δ: 2^X → 2^W is a polarity satisfying (6.54), then we have (6.56).

Theorem 6.4. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity.

(a) If (6.49) holds and G is Δ′Δ-convex, then we have (6.48).

(b) If X is a topological space, f: X → ℝ̄ is an upper semicontinuous function, and Δ: 2^X → 2^W is a polarity such that

int G ⊆ Δ′(W) ⊆ G,    (6.59)

then we have (6.48).

Proof. (a) If (6.49) holds and G = Δ′Δ(G), then by Proposition 6.4, G = Δ′(W), and hence by Corollary 6.1, we obtain (6.48).

(b) This follows by combining Proposition 6.3(b) and the implication (6.55) ⇒ (6.56).    □

Now we shall give a sufficient condition for strong duality in terms of the limiting case d = α.

Theorem 6.5. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity satisfying (6.38), with α = α′ = inf f(CG). If there exists w₀ ∈ W such that

S_α(f) ∩ CΔ′({w₀}) ≠ ∅,    (6.60)

then

inf f(CG) = min_{w∈W} inf f(CΔ′({w})) = inf f(CΔ′({w₀})).    (6.61)

Proof. If x₀ ∈ S_α(f) ∩ CΔ′({w₀}), then by (6.38) and Proposition 6.1, we obtain

α ≤ β′ = inf_{w∈W} inf f(CΔ′({w})) ≤ inf f(CΔ′({w₀})) ≤ f(x₀) ≤ α,    (6.62)

whence (6.61) (with the min being attained for w₀ ∈ W).    □
Remark 6.8. (a) By (6.38), for any x₀ ∈ S_α(f) ∩ CΔ′({w₀}) we have f(x₀) = α, i.e.,

S_α(f) ∩ CΔ′({w₀}) ⊆ S_α(f)\A_α(f).    (6.63)

Moreover, since α = α′ = inf f(CG), formula (6.61) shows that if we have (6.60), then every x₀ ∈ S_α(f) ∩ CΔ′({w₀}) is an optimal solution both of the primal problem (P′) of (6.1) and of the "surrogate primal problem"

α′_{CΔ′({w₀}),f} := inf f(CΔ′({w₀})).    (6.64)

(b) By the above proof, in Theorem 6.5 condition (6.38) can be replaced by any other condition ensuring that α = inf f(CG) ≤ β′, e.g., condition (6.49) or, when X is a topological space and f is upper semicontinuous, condition (6.51).

The condition of Theorem 6.5 is not necessary in order to have (6.61), as shown by the following example:

Example 6.2. Let X be a normed linear space, W = X*\{0},

G = {x ∈ X : ‖x‖ < 1},    (6.65)

f the function (6.24), and Δ the polarity Δ¹ defined by (1.160), so

CΔ′({Φ}) = {x ∈ X : Φ(x) > ‖Φ‖}    (Φ ∈ X*\{0}).    (6.66)
Then α = inf_{x∈CG} ‖x‖ = 1, A_α(f) = {x ∈ X : ‖x‖ < 1}, S_α(f) = {x ∈ X : ‖x‖ ≤ 1}, and inf_{x∈CΔ′({Φ})} ‖x‖ = 1 (Φ ∈ X*\{0}), so we have (6.38) and (6.61), but not (6.60) (since Φ(x) ≤ ‖Φ‖ ‖x‖ ≤ ‖Φ‖ for all Φ ∈ X*\{0}, x ∈ S_α(f)).

Concerning simultaneous characterizations of optimal solutions of (P′) and of weak duality α′ = β′, we can prove the following theorem:

Theorem 6.6. Let X, W be two sets, G a subset of X, f: X → ℝ̄ a function, and Δ: 2^X → 2^W a polarity. For an element x₀ ∈ CG and for α = α′ = inf f(CG), the following statements are equivalent:

1°. x₀ is an optimal solution of (P′) (i.e., f(x₀) = min f(CG)) and α = β′.

2°. We have

A_d(f) ∩ CΔ′({w}) = ∅    (w ∈ W, d ∈ ℝ, d < f(x₀)),    (6.67)

and for each d ∈ ℝ, d > α, there exists w_d ∈ W satisfying (6.42).

3°. We have

S_d(f) ∩ CΔ′({w}) = ∅    (w ∈ W, d ∈ ℝ, d < f(x₀)),    (6.68)

and for each d ∈ ℝ, d > α, there exists w_d ∈ W satisfying (6.43).
Proof. 1° ⇒ 2°. If 1° holds, then f(x₀) = inf f(CG) = α, and hence by Theorem 6.3, we have 2°.

2° ⇒ 1°. Assume 2°. Then by (6.67) and Proposition 6.1 (with α = f(x₀)), we have f(x₀) ≤ β′. Furthermore, by the second condition of 2° and by Proposition 6.2, we have (6.41) with α = inf f(CG). Hence by x₀ ∈ CG, we obtain

α = inf f(CG) ≤ f(x₀) ≤ β′ ≤ α,    (6.69)

whence 1°. Finally, the proof of the equivalence 1° ⇔ 3° is similar.    □
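Example 6.2 is easy to probe numerically in the plane. A minimal sketch of our own two-dimensional instance (names ours): G the open unit disk, f = ‖·‖, and surrogate sets CΔ′({Φ}) = {x : Φ·x > ‖Φ‖}; the surrogate infimum equals α = 1 but is approached from outside the closed ball, so (6.60) fails while (6.61) holds.

```python
import math, random

def in_surrogate(phi, x):
    # x in C Delta'({phi})  iff  phi . x > ||phi||
    return phi[0] * x[0] + phi[1] * x[1] > math.hypot(*phi)

random.seed(0)
phi = (3.0, 4.0)              # a nonzero functional with ||phi|| = 5
unit = (phi[0] / 5.0, phi[1] / 5.0)

# inf ||x|| over the surrogate set is 1 but never attained: points
# r * phi/||phi|| belong to the set exactly when r > 1
print(in_surrogate(phi, (1.000001 * unit[0], 1.000001 * unit[1])))  # True
print(in_surrogate(phi, (0.9 * unit[0], 0.9 * unit[1])))            # False

# (6.60) fails: by Cauchy-Schwarz no point of the closed unit ball
# S_alpha(f) satisfies phi . x > ||phi||
hits = 0
for _ in range(5000):
    a, r = random.uniform(0, 2 * math.pi), random.uniform(0, 1)
    if in_surrogate(phi, (r * math.cos(a), r * math.sin(a))):
        hits += 1
print(hits)   # 0
```

The random sampling only illustrates the Cauchy-Schwarz bound; the exact statement is the inequality chain Φ(x) ≤ ‖Φ‖‖x‖ ≤ ‖Φ‖ used in the text.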
In the remainder of this section we shall assume, without any special mention, that X is a locally convex space with conjugate space X*, and G ⊆ X (with G ≠ X), and we shall apply the preceding results to the special polarities Δ^i = Δ^i_G: 2^X → 2^{X*\{0}} (i = 1, 2, 3, 4) of Chapter 1, Section 1.2.

(1) For the polarity Δ¹: 2^X → 2^{X*\{0}} defined by (1.160), we have (1.161), and hence the dual objective function (6.27) and the dual value (6.28) become

λ′_{Δ¹}(Φ) = inf f(C(Δ¹)′({Φ})) = inf_{x∈X: Φ(x)>sup Φ(G)} f(x)    (Φ ∈ X*\{0}),    (6.70)
β′_{Δ¹} = inf_{Φ∈X*\{0}} inf f(C(Δ¹)′({Φ})) = inf_{Φ∈X*\{0}} inf_{x∈X: Φ(x)>sup Φ(G)} f(x).    (6.71)
Remark 6.9. (a) For Φ ∈ X*\{0} such that sup Φ(G) = +∞, we have {x ∈ X : Φ(x) > sup Φ(G)} = ∅, whence inf_{x∈X: Φ(x)>sup Φ(G)} f(x) = +∞. Therefore, the inf_{Φ∈X*\{0}} in (6.71) can be replaced by inf_{Φ∈G^b}, where G^b is the "barrier cone" (1.347) of G. A similar remark is valid also for some of the subsequent results, but for simplicity, in the sequel we shall use only inf_{Φ∈X*\{0}}.

(b) By (6.31) applied to Δ = Δ¹, we have

β′_{Δ¹} = inf_{Φ∈G^b} inf_{x∈dom f: Φ(x)>sup Φ(G)} f(x).    (6.72)
(c) By Proposition 6.3(a) applied to Δ = Δ¹, we have

inf f(CG) = α′ ≤ inf_{Φ∈X*\{0}} inf_{x∈X: Φ(x)>sup Φ(G)} f(x) = β′_{Δ¹}.    (6.73)
Theorem 6.7. Let X be a locally convex space, G a closed convex subset of X (with G ≠ X), and f: X → ℝ̄ a function. Then

inf f(CG) = inf_{Φ∈X*\{0}} inf_{x∈X: Φ(x)>sup Φ(G)} f(x).    (6.74)

Proof. By (1.164) and (1.163), we have

G = c̄o G = (Δ¹)′Δ¹(G) = (Δ¹)′(X*\{0}),

and hence by Corollary 6.1 for Δ_G = Δ¹, we obtain (6.74).    □
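The half-space formula (6.74) can also be sampled numerically. A minimal sketch of our own instance (names ours): X = ℝ², G = [−1, 1]² (closed and convex), f = ‖·‖; here sup Φ(G) is the support function of the square, and the infimum of ‖x‖ over the open half-space {x : Φ·x > sup Φ(G)} is the distance from the origin to its boundary hyperplane.

```python
import math

# Our instance of Theorem 6.7: G = [-1, 1]^2, f(x) = ||x||.
def sup_over_G(phi):
    # support function of the square [-1, 1]^2
    return abs(phi[0]) + abs(phi[1])

def inf_f_on_halfspace(phi):
    # inf of ||x|| over {x : phi . x > sup phi(G)} equals the distance
    # from 0 to the boundary hyperplane {x : phi . x = sup phi(G)}
    return sup_over_G(phi) / math.hypot(*phi)

# primal value: dist(0, CG) = 1 (nearest exterior points lie just beyond
# the edge midpoints of the square)
primal = 1.0

# dual value of (6.74) over sampled directions phi != 0
dual = min(inf_f_on_halfspace((math.cos(t), math.sin(t)))
           for t in [2 * math.pi * k / 720 for k in range(720)])
print(primal, dual)   # 1.0 1.0 -- the two sides of (6.74) agree
```

The minimizing directions are the coordinate axes, whose half-spaces touch the square along whole edges.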
Remark 6.10. Theorem 6.7 is a "half-space theorem of surrogate duality," since the surrogate constraint sets Ω_{G,Φ} = {x ∈ X : Φ(x) > sup Φ(G)} (Φ ∈ X*\{0}) of (6.74) are open half-spaces. For upper semicontinuous f, one may also work with the corresponding closed half-spaces:

Theorem 6.8. Let X be a locally convex space, G a closed convex subset of X (with G ≠ X), and f: X → ℝ̄ an upper semicontinuous function. Then

inf f(CG) = inf_{Φ∈X*\{0}} inf_{x∈X: Φ(x)≥sup Φ(G)} f(x) = inf_{Φ∈X*\{0}} inf_{y∈X: Φ(y)>sup Φ(G)} f(y).    (6.75)
Proof. The first equality holds by Corollary 6.2 below, and the second equality holds by Lemma 1.1.    □

Remark 6.11. Theorem 6.8, too, is a "half-space theorem of surrogate duality," since the surrogate constraint sets {x ∈ X : Φ(x) ≥ sup Φ(G)} (Φ ∈ X*\{0}) of (6.75) are closed half-spaces.

(2) For the polarity Δ³: 2^X → 2^{X*\{0}} defined by (1.162), we have (1.163), and hence the dual objective function (6.27) becomes

λ′_{Δ³}(Φ) = inf f(C(Δ³)′({Φ})) = inf_{x∈X: Φ(x)=sup Φ(G)} f(x)    (Φ ∈ X*\{0}),    (6.76)