Stability of Nonlinear Control Systems
MATHEMATICS IN SCIENCE A N D E N G I N E E R I N G A Series o f Monographs a n d Textbooks
Edited by Richard Bellman
The RAND Corporation, Santa Monica, California 1.
2. 3. 4. 5.
6.
7. 8.
9. 10. 11.
12. 13.
TRACY Y. THOMAS. Concepts from Tensor Analysis and Differential Geometry. 1961 TRACY Y. THOMAS. Plastic Flow and Fracture in Solids. 1961 ARIS. The Optimal Design of Chemical Reactors: A Study RUTHERFORD in Dynamic Programming. 1961 J O S E P H LA SALLEand SOLOMON LEFSCHETZ.Stability by Liapunov’s Direct Method with Applications. 1961 GEORGE LEITMANN (ed.) . Optimization Techniques: with Applications to Aerospace System. 1962 RICHARDBELLMANand K E N N E T HL. COOKE.Differential-Difference Equations. 1963 FRANKA. HAIGHT.Mathematical Theories of Traffic Flow. 1963 Discrete and Continuous Boundary Problems. 1964 F. V. ATKINSON. Non-Linear Wave Propagation: with AppliA. JEFFREY and T. TANIUTI. cations to Physics and Magnetohydrodynamics. 1964 JULIUS TOU. Optimum Design of Digital Control Systems. 1963 HARLEY FLANDERS. Differential Forms: with Applications to the Physical Sciences. 1963 SANFORD M. ROBERTS.Dynamic Programming in Chemical Engineering and Process Control. 1964 SOLOMON LEFSCHETZ.Stability of Nonlinear Control Systems. 1965
In preparation DIMITRIS N. CHORAFAS. Systems and Simulation A. A. PERVOZVANSKII. Random Processes in Nonlinear Control Systems V. E. B E N E ~Mathematical . Theory of Connecting Networks and Telephone Traffic WILLIAMF. AMES.Nonlinear Partial Differential Equations in Engineering A. HALANAY. Differential Equations : Stability, Oscillations, Time Lags R. E. MURPHY.Adaptive Processes in Economic Systems DIMITRISN. CHORAFAS. Control Systems Functions and programming Approaches J. ACZEL. Functional Equations MARSHALL C. PEASE,111. Methods of Matrix Algebra
STABILITY OF NONLINEAR CONTROL SYSTEMS Solomon Lefschetz PRINCETON UNIVERSITY THE NATIONAL UNIVERSITY OF MEXICO THE RESEARCH INSTITUTE FOR ADVANCED
STUDIES ( R I A S ) , BALTIMORE, MARYLAND
1965
ACADEMIC PRESS New York
-
London
COPYRIGHT @ 1965, BY ACADEMICPRESS INC. ALL RIGHTS RESERVED. NO PART OF THIS BOOK MAY BE REPRODUCED IN ANY FORM, BY PHOTOSTAT, MICROFILM, OR ANY OTHER MEANS, WITHOUT WRITTEN PERMISSION FROM THE PUBLISHERS.
ACADEMIC PRESS INC. 1 1 1 Fifth A v e n u e , New York, New York 10003 United Kingdom Edition published by ACADEMIC PRESS INC. (LONDON) LTD. Berkeley Square House, London W.l
LIBRARYOF CONGRESS CATALOG CARD NUMBER:64-24661
PRINTED IN THE UNITED STATFS OF AMERICA
PREFACE The object of this monograph is to present a concise picture of control stability as it has developed from the direct method of Liapunov. The main impetus for this theory came some fifteen years ago from the work of the Soviet mathematician, A. Lurie, and for a decade or so remained practically a Soviet monopoly. Recently however this monopoly has been broken mainly through the work of V. M. Popov of Romania and of Kalman, LaSalle, and the author in this country. New developments in the stability of control are so rapid that interesting results are being attained even as this volume goes to press. Although it is impossible, for this reason, to be completely up to date, it is hoped that the interested scientist, mathematician, physicist, or engineer will be able through reading this work to navigate by himself the turbulent waters of nonlinear control theory. It is hoped even more that the engineering “controllist” will find here fruitful material for his operations. The outline of the monograph is as follows: The first chapter dealing exclusively with the dimensions one and two is quite elementary, and above all does not appeal to vector-matrix technique. Nevertheless, many of the important concepts already make their appearance in this early chapter. However, readers with a fair grasp of linear algebra may pass directly to the next chapters. These chapters, I1 to VI, present the theory in what may be referred to as the pre-Popov period. Here we lean heavily upon vectors and matrices, except in Chapter VI in which the emphasis is on the discontinuous characteristic. Popov’s striking contribution is dealt with in Chapter VII, which is decidedly arduous owing to Popov’s extensive use of Fourier transforms and rather advanced analysis. The last control chapter-Chapter VIII-deals primarily with a theorem weaker than Kalman’s completion of Popov’s second theorem. Our theorem rests upon an important lemma due to Yacubovich. However, in its proof we follow from afar Kalman’s noteworthy treatment. The last c h q t e r consists virtually of a few appendices with which it did not seem appropriate to interrupt the main text. The author wishes to express his thanks to Dr. Robert Gambill who read most of the manuscript and made numerous corrections and valuable V
PREFACE
suggestions. He feels that he owes a good deal to discussions with various RIAS colleagues, notably Dr. Kalman, Dr. LaSalle, and Mr. Kenneth Meyer. The fact that this is the author's second monograph to appear in the Bellman Series is a strong indication of his high regard both for the Series and for the excellent work of the Academic Press. Finally the author takes pleasure in recognizing his debt to the U.S. Air Force, Office of Scientific Research [Contract A F 49(668)-12421, thc U.S. Army Ordnance Missile Command (Contract DA-36-034-ORD35 I4 Z), and the National Aeronautics and Space Administration (Contract NASw-718) whose support of our research stimulated this monograph. Cross references are to the Bibliography at the end, or to chapters in the monograph. Thus (III,2) or (111, $2) refers to $2 of Chapter 111, (III,2.4) to a statement or a relation 2.4 of $2 in Chapter 111; LaSalle [2] refers to item 2 under LaSalle in the Bibliography. November, I964
vi
SOLOMON LEFSCHETZ
Special Abridged Notations n.a.s.c. : Necessary and sufficient conditions E : Unit matrix If A is a square matrix A , = ZE - A , so that [ A z /= 0 is the characteristic equation of A C.C. : Completely controllable, complete controllability C.O. : Completely observable, complete observability f * g : Convolution of the functionsf, g q(a) = characteristic function
vii
This page intentionally left blank
CONTENTS PREFACE.
.
.
.
.
SPECIALABRIDGEDNOTATIONS .
INTRODUCTION
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. .
. .
. .
. .
. .
. .
.
V
vii 1
Chapter One. Introductory Treatment of Dimensions One and Two The Characteristic Function . . . $2. Systems of Dimension Unity. Direct Control 53. System of Dimension Unity. Indirect Control &I System . of Order Two .
. .
. . .
. . .
. . .
. . .
. . .
.
.
.
.
.
.
17
$3. Comparison with a Recent Result of Yacubovich . . $4. On the Utilization of Certain Complex Coordinate Systems . $5. Special Cases . . . . . . . .
. .
. .
. .
18 22 23 24
.
.
.
. .
. .
$1.
Chapter Two. Indirect Controls 51.
Vectors and Matrices
.
$2. Indirect Control. General Type
.
.
.
.
.
.
.
.
5 7 8 13
Chapter Three. Indirect Controls (Continued) $1. lnvariance under Change of Coordinates . $2. Reduction of the Number of Conditions on the 53. Luric's Method and a Variant . . $4. Application to Systems of Order Two .
.
.
.
Control Parameters
. .
. .
. .
. .
.
.
27 28 31 33
Chapter Four. Direct Controls. Linearization Multiple Feedback 61. Direct Control: General Case .
.
.
.
.
.
82. Direct Control, Standard Example $3. Reduction of an Indirect Control to a Special Direct Control . . . . . $4. Linearization of Direct Controls . . . $5. Linearization of Indirect Controls . . 56. Direct Control with Matrices B or C of Rank < 11. . 87. Direct Controls with Matrices B, C , of Rank n - 1. . $8. Direct Control Whose Matrix A Has Zero as a Characteristic
.
.
. . . .
. . . .
39 41 42 43 46 48 50
Root (Kenneth Meyer) . $9. Direct Control Whose Matrix Has a Pair of Conjugate Pure Imaginary Characteristic Roots (Kenneth Meyer) . . . . . . . . . . 510. Multiple Feedbacks
51 55 56
ix
CONTENTS
Chapter Five. Systems Represented by a Set of Equations of Higher Order Generalities 92, A Digression on Linear Systems $3. Indirect Control . . . pt. Indirect Control: An Example . 95. Direct Control . . . $1.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Continuous Approximation of Discontinuous Characteristics 92. Direct Discussion of Discontinuities . . . . 93. Some Examples . . . . . . . 9. Special Switching Lines . . . . . . 95. Multiple Feedback Switching Line . 96. Complementary Remarks . . . . .
. . . .
.
. . .
. . .
.
.
.
.
.
.
.
.
.
61 62 64 67 70
Chapter Six. Discontinuous Characteristics $1.
72 13 79 84 85 86
Chapter Seven. Some Recent Results of V. M. Popov Generalities. The Theorems of Popov . Preliminary Properties . . . Proof of Popov’s First Theorem . . The Generalized Liapunov Function of Popov Proop of Popov’s Second Theorem . Comparisons . . . . . . On the Function G ( z ) as Transfer Function. . . . . Direct Control. Conclusion .
. .
. . .
. . .
. . . . .
.
.
. . . . . .
.
. . .
. . . .
.
. . .
. . . .
. . . . . . .
87 91 93 98 100 101
104 105
105
Chapter Eight. Some Further Recent Contributions Controllability and Observability . . . . . . Reduction of the System to One with a Completely Controllable Pair ( A , b) and Completely Observable Pair (c’, A ) . . 93. A Special Form for Systems with Completely Controllable . . . . Pair ( A . h ) . Main Lemma (Yacubovich and Kalman) . . . . . Liapunov-Popov Function and Popov Inequality . . . Fundamental Theorem . . . . . . . A Recent Result of Morozan . . . . . . Return to the Standard Example . . . . . Direct Control . . . . . . . . Resume (Indirect Control: y > 0) Complement on the Finiteness of the Ratio cp(a)/a X
.
.
107 109
. . . . . . .
. . . . . .
I I3 114 1 I8 1I Y
121 122 123 124 125
CONTENTS
Chapter Nine.
Miscellaneous Complements
The Jordan Normal Form for Real or Complex Matrices $2. On a Determinantal Relation . . . . . $3. On Liapunov’s Matrix Equation . . . . $4. Liapunov and Stability . . . 81.
Appendix A : An Application o f Multiple Feedback Control
. .
. .
.
.
.
.
.
.
.
Appendix B : An Example from the Theory o f Nuclear Power Reactors (Kenneth Meyer) , .
BIBLIOGRAPHY INDEX
. .
,
.
. .
. .
.
. .
. .
. .
.
. . . .
128 132 133 136 139
.
.
142 144 149
xi
This page intentionally left blank
INTRODUCTION In the present monograph our concern will be solely with socalled real dynamical systems ; that is, with systems governed by a finite number of real parameters and whose performance is described by a finite set of ordinary differential equations. In order to present a full and clear picture, we propose to describe briefly in this Introduction the various simplifications which experience or practical considerations have imposed. Let x1,x2,...,x, be the parameters or coordinates and t the time. Time derivatives are written 1,3, ... . The equations of the system are then of the form (1)
&(xl )...)x,, x1)...,XI ,..., t ) = 0
( h = 1, 2 ,...)p).
As is well known, such a system is equivalent to another system of the general type
(2)
~XY,,...,ymrj1,...,jm, t ) = 0
(s = 1, 2,..., m).
Our first assumption is that the system (2) may be solved for the derivatives
)il,...,j,,thus giving rise to a new system
(3)
y j = F j ( y ,,..., y,, t )
j = 1, 2,..., m.
If we consider the y j and the F j as components of column vectors y , F , ( 3 ) may be given the simple form (4)
3 = F(Y, t). 1
INTRODUCTION
Let now y = q(t) be a particular solution of the system (4) and suppose that for some reason (e.g. stability) we are interested in the behavior of the solutions of (4) in its vicinity. Often the simplest way is to take as a new coordinate vector the vector difference z = y - q(t) and substitute in (4). This yields
i=3
-
Q(t) = F(y, t ) - Q(t) = F(z
+ q(t),t ) - ti(t),
which is of the form (5)
i = C(Z, t),
G(0, t ) = 0,
with the origin z = 0 corresponding to our earlier special solution y = q(t).The origin is a “point-solution’’ or critical point of the new system (5). We are now merely interested in the neighborhood of the origin. Mainly for reasons of mathematical convenience we shall only consider systems “without t” at the right:
i = G(z),
(6)
G(0) = 0,
or autonomous systems. Main reason : they are so much more manageable than nonautonomous systems. Let C be an autonomous n vector system f = X(x).
(7)
One may be especially interested in what happens to the solutions in the neighborhood of the origin x = 0, which we assume to be a critical point : X ( 0 ) = 0. Suppose also that the system contains a control mechanism aiming to maintain the solutions as close as possible to the solution x = 0. Let it be assumed finally that one has been able to divide the components x l ,..., x , of x into two sets: the system components y l ,..., y , and the control components z l ,.... zq, where p q = n. The system components define the state p vector y (describing the state of the system) and the z,, define the control q vector z. The system assumes then the form
+
(a)
(8)
3 = Y(Y,Z),
(b)
Z(0,O) = 0.
Y(0,O)= 0, it, is
i = Z(Y,Z),
The system without control or the fundamental system, as we shall call
3 = Y ( y ,0).
(9) 2
INTRODUCTION
The purpose of the control mechanism may now be stated as aiming to secure or possibly to improve the asymptotic stability behavior of the origin, principally as regards the state vector y. Now in this general form the system is only rarely tractable and so one must have recourse to partial or complete linearization. Complete linearization has been applied most extensively in control theory and technique. In substance it replaces the system (8) by linear approximations. Linear equations with constant coefficients are easily dealt with, and under the linearity assumption one may proceed quite far, so this is a well-beaten path only touched upon in Chapter V. In what follows the system but not the control vector will be linearized. As the fundamental system is generally quite stable, this is reasonable enough. The control, however, may involve a servomotor operating beyond a reasonable linearization of its characteristic. Our scheme will then assume the general form
3 = Ay
+ F(z)
i = C y + G(z).
Actually we shall deal almost exclusively with the case where the control variable z is a scalar (one-dimensional) and with the numerous related problems that even this relatively simple case presents. There are two basic types: (a) direct control; (b) indirect control. They may be represented jointly by a vector-matrix system f = A X -
0
=
CIX
-p
t
where the designations are the vector designations of (11, 41). The second type is (practically) the more important and receives most of our attention later. From the formal standpoint one may reduce each of these systems to the other. The system (1 1) raises the double question of the existence and uniqueness of the paths through any point of the x [the x,(] space in direct [indirect] controls. One of the conditions imposed upon q(o)(except in VI) is continuity. A very complete existence theorem (see the beginning of Niemickii and Stepanov [11)will then guarantee existence but not uniqueness of paths. Most practical functions q(o)have continuous “broken line”
3
INTRODUCTION
graphs, that is, consist of a finite number of arcs with unique and continuously turning tangent at each point. For these one may apply the existence theorem of Lefschetz ([l], p. 31, plus the (hardly modified) complement of p. 34) and pleasantly achieve both existence and path uniqueness. We will not return to these questions in the sequel. Since our fundamental problem is a stability problem, recourse must be had to the only general stability method available: Liapunov’s direct method. A resume is given in (IX,§4). More complete information will be found in LaSalle and Lefschetz [l]. Observe that uniqueness of path is never required in the Liapunov theory. A word of caution. The linearization of both the system and the control has brought about a certain special, almost algebraic technique in control theory. This technique had best be abandoned when one comes to nonlinear controls, and the differential equations faced directly, so that one sees clearly what mathematical theories (mainly due to Liapunov) must be applied. It is no small merit indeed of the Soviet leaders to have consistently done so from the beginning. Historical remark. Indirect controls, of such importance in modern industry, were first introduced some forty years ago by N. Minorsky [l] under the name of derivative control in connection with the control of ship motions (research carried out for the U.S. Navy). The modern impetus in the study of this general type of control was given by Lurie of Leningrad.
4
Chapter
1
INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO It so happens that a number of the major ideas and problems to be faced more fully later already appear for dimensions one and two. By way of introducing the general topic it seems worthwhile to deal at first somewhat fully with these elementary cases. This is the aim of the present chapter. For simplicity all quantities considered here will be scalars. In the later chapters the vector-matrix notation will be adopted in full force.
$1. The Characteristic Function The nonlinearity to be faced throughout the sequel will be caused by the characteristicfunction q(a) of the control mechanism. Here a is the so-called feedback signal, and the nonlinearity comes, usually, from a servomechanism or more generally from the nature of the control mechanism. From the standpoint of our mathematical treatment, what matters is the nature of the function q(a),and since this will appear in every system under consideration we may as well deal with it here and now. Characteristic functions may be very varied indeed. Some frequent types are given in Fig. 1. The first two are continuous; the other two are discontinuous. The discontinuous types present serious difficulties which 5
1 . INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
will be discussed in (VI, $1). Except for these we shall limit the functions q(o)to a class said to be udmissihle governed by the following: Assumptions regarding characteristics. 1. The function q(o)is defined and continuous for all values of o. 11. q(o)= 0, a q ( a ) > 0 for all o # 0, i.e., q(o)has the sign of o. 111. The integrals
.r:
q(o)do diverge. Ix
The first property states in substance that the graph of q(o)has no jumps; the second property states that the graph is situated in the first and third quadrants ; finally according to the third property the area under the graph tends to infinity at both ends.
FIG. I
As a matter of fact it will be shown in (111, $4) that property I11 may usually be dispensed with.
The function @(o). defined as
=r
This function, of constant occurrence later, is @(a)
q ( a )do.
0
I t is continuous, zero for o = 0, positive otherwise as a consequence of 11, and + cc as a -, f 00 as follows from 111.
6
$2.
SYSTEMS OF DIMENSION UNITY. DIRECT CONTROL
$2. Systems of Dimension Unity. Direct Control Consider a system depending upon a single real variable. As described in the Introduction, one may select this variable, e.g., x , so that the desired position to be controlled corresponds to the value x = 0. Thus x represents the deviation that it is desired to minimize. Let the fundamental system be represented by a linear equation
(2.1)
X =
kx,
where k is a constant #O. The purpose of the control would be to accentuate the “return to zero” of the variable x . There are now essentially two distinct possibilities according to the manner of operating of the feedback. In a direct control it operates directly upon the system, in an indirect control it operates through one or more derivatives. In the former the operation is apt to be rather hard, through a ponderous mechanism. In an indirect control on the other hand, by making use of derivatives one may operate through a comparatively light scheme. For this reason indirect controls are often preferred to the other kind. At all events, mathematically, they are also much more interesting and for this reason they have been studied more extensively. In the present section we first deal with direct control. Physically speaking, a direct control regulating the system (2.1) is described by a system
(2.2)
i= kx
+ <,
4: = cp(a),
a = CX,
c # 0.
This system is equivalent to the single equation
X = kx + ~ ( C X ) (2.3) which is formally as simple as could be desired. It may actually be integrated as dx = t - to. (2.4) X”kX + C p W If one knows exactly the characteristic cp, one may calculate the solution. In practice this is not as interesting as it may seem. For various reasons the characteristic is often variable within a certain range, and it becomes important to estimate the effect of the control under these special circumstances, and perhaps to impose upon its parameters conditions guaranteeing suitable stability.
s
7
1. INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
Now in the present instance one may readily find suitable restrictions guaranteeing the return of the system to zero no matter what its initial position. For the velocity at any time is kx + cp(cx). It is thus sufficient that this quantity have the sign of -x, i.e. that 1 f(x) = - kx 2 - X C ~ ( C X )= - k X 2 - -(cx)c~(cx) > 0. C
There are now two possibilities; (a) k < 0. That is the basic system is already stable. To increase this feature one merely needs to increasef(x) for all x. Since ucp(u) > 0, and - kx2 > 0 it is sufficient to choose the parameter c negative. (b) k > 0. To havef(x) > 0, one must first select c negative and also such that the graph of cp( - cx) passes below the line y = kx when x < 0 and above it when x > 0. By this means the control will change the unstable system (2.1) into a stable system: all solutions + 0 as t + 00.
53. System of Dimension Unity. Indirect Control The system just treated could hardly be more elementary. Indirect controls of one dimensional systems however present many of the features and complications of higher dimensional systems. We begin by choosing as fundamental system i= -kx,
(3.1)
k > 0,
so that the initial system is already stable. The effect of the control is to replace the system of order one (3.1) by a new system of order two,
where the constants b, c, p are the control parameters. It is evident that one must have b # 0, since b = 0 means that the control has no effect upon the system. But then one may replace the coordinate x by a new coordinate x* = x/b. This is merely a change of scale for x. The new system assumes the form X* = -kx* + (
8
$3.
SYSTEM OF DIMENSION UNITY. INDIRECT CONTROL
Upon writing now x for x* and c for bc we have the system X =
e
(3.3)
(T
-kx+5
= cp(4 = cx - p5,
which is like (3.2) but with b = - 1. The control parameters are now c, p. Regarding the existence and uniqueness of solutions see the end of the Introduction. The fundamental system (3.1) is already asymptotically stable and one wishes to strengthen this properly. In fact, one would like to solve : Lurie’s problem. To find n.a.s.c. for the asymptotic stability of the system (3.3) regardless of the initial conditions and whatever the choice of admissible characteristic function cp(o). That is, one would like to have n.a.s.c. to guarantee that any solution (x(t),<(t)),of (3.3) tends to the origin (0,O) regardless of the choice of cp(o). This is known as absolute stability. Observe that technically speaking, to solve this problem is actually more than one needs, for the variables x , 5 will be limited in extent and it would be sufficient to demand that all the solutions emanating from some bounded region R of the ( x , () plane do tend to the origin regardless of the choice of cp. However, in the absence of definiteness as regards the region R one may as well, and more easily, take R as the whole plane. In order to discuss absolute stability it is convenient at this stage to choose as coordinate, instead of 5, the parameter (T which appears in cp(a). The best way to achieve this is by means of a linear transformation of coordinates from x , 5 to y , (T defined as follows: (3.4)
y = -kx
+ 5,
(T
= cx - p(
The equations for the (y, (T) system are found at once to be
I’ = - k y (3.5)
+
~ ( 0 )
6 = cy - pcp(0).
As regards stability, the system (3.5) will be entirely equivalent to the system (3.3) if, and only if, the transformation (3.4) is nonsingular, that is if (3.4) may be solved uniquely for x, 5 in terms of y , (T.For then the “y, (T picture” will be essentially like the “ x , 5 picture”-the x, 5 picture
9
1. INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
seen at “an angle” as it were. The necessary and sufficient condition (n.a.s.c.) to have this happen is that the determinant of the coefficients of x, ( in (3.4) be #O, or that k p - c # 0. (3.6) This is a first restriction on the control parameters p , c . It is assumed henceforth and we now deal solely with y, CT and the system (3.5). Since critical points are solutions, it is clear that a necessary condition for absolute stability is that in the y, CT plane the origin be the only critical point, i.e. the only solution of the system
(3.7)
-k y
+ ~ ( c T=) 0,
CY -
PV(CT)= 0.
In view of (3.6) the only solution of this system in the unknowns y, q(a) is y = 0, q(o) = 0. Since ~ ( c T #) 0 for CT # 0, the only solution of (3.7) is y = CT = 0, and so we are assured that the origin is in effect the only critical point in sight.
Y
FIG.2
Our treatment of absolute stability of the system (3.5) rests upon a fundamental result due to Liapunov. While his general theorem (see IX, $4) requires a good deal of technique (owing to its “n-dimensional” feature) it is intuitively accessible in our low dimensional case. Suppose that in some manner there has been found a continuous infinity of ovals surrounding the origin in the y, CT plane (Fig. 2). If we are in a position to show that every solution of the system (3.5) crosses every oval inward, and this regardless of the choice of the characteristic ~ ( c T ) , 10
$3.
SYSTEM OF DIMENSION UNITY. INDIRECT CONTROL
our purpose will have been achieved. This is in substance the geometric formulation of Liapunov’s so-called direct method. The ovals which we choose, following Lurie and Postnikov (see Lurie [l]), are the curves V = pyz
(3.8)
+ @(a) = h,
where p , h are positive constants. That they are ovals surrounding the origin is established readily enough. The curve C represented by (3.7) may be plotted from the relation
We must have 0 2 h, which yields - a 2 a 5 b, a and b positive. Hence, the curve has the aspect of Fig. 3. This is an oval symmetric with respect to the a axis. CT
t
FIG.3
FIG.4
The construction of the ovals is only part of the story. Let y be the path defined by a solution (y(t), a@)) of our system. Along y the function V(y(t),a(t)) = V ( t )must decrease since we desire that y tend to the origin as t --* co. Hence we desire that 0 along every y, or that
v -=
+
- P(y(t),a(t))=
-(2pyj
D
+ v ( o ) ~=) 2pkyZ + pq2(a)- 2 p + - ~ ( a )>y 0
(
for all choices of y and ~ ( a )This . will happen, since pk > 0, provided that both roots of 2pku2-2 p + - u + p = O
(
3
11
1.
INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
are complex, i.e. if
Let p l , p z be the two roots off@) 6 P1
+ Pz
= (C -
=
0 and 6 its discriminant. We have
2pk)’ - C’
= 2Pk - c,
=
4pk(pk - c),
PIP2 =
C‘
4’
Now (3.9) requires that the roots off(p) = 0 be real and distinct and that p1 < p < p 2 . The roots will be real and distinct if, and only if, 6 > 0, or pk > c. Since one must be able to choose p > 0, one must have p 2 > 0. Since p1p2 > 0, both roots have the same sign and this sign is that of p1 + p 2 . Hence we must have pk > c/2, which holds if pk > c. Thus this last condition which is just (3.6) is sufficient to guarantee absolute stability. In the case just discussed, there is a certain unreality, for owing to k > 0, the fundamental system is already largely asymptotically stable. The effect of the control is then to make it more so. A more natural situation takes place when the fundamental system is actually unstable: i= kx,
k > 0,
and the effect of the control must be to make the system stable. This will occur only with a characteristic of a certain special type. To simplify matters, assume that for 101 small cp(o) has a power series expansion
do) = go + @(a) where g is a positive constant, and @ begins with terms of degree at least two, and otherwise has the properties 11, 111 earlier imposed on cp. As cp is not precisely fixed, we assume that g may vary within certain limits: 0 < u S g 5 8. Notice that in substance, the graph of cp is now a curve with positive slope g through the origin, and that this slope may vary between u and (Fig. 4). The equations with control are thus
3 = ky
+ go + @(a)
(5 = cy - pgo - p@(o)*
For 101 sufficiently small, one may achieve asymptotic stability if the roots r l , r2 of the equation 12
$4.
SYSTEM OF ORDER TWO
=
r2
+ (gp - k)r - gc - pgk = 0
have negative real parts. (LaSalle and Lefschetz [ l , p. 481.) Now r l r 2 = -g(c
+ pk),
rl
+ r2 = k - pg.
Whether r I ,r2 are both real or complex one must have r l r 2 > 0,
i.e. c
rl
+ pk < 0,
+ r2 < 0,
k - pg < 0,
and hence finally k -C -
k
9
Since k > 0 this implies in particular that c < 0. The asymptotic stability obtained is only of local character-nly for la1 sufficiently small.
94. System of Order Two It is convenient to deal first with indirect controls. Let the fundamental system be (4.1)
i = 1,2;
-kixi;
X i =
ki>0,
and let the system with control be xi = -kixi - ( b , (4.2)
ri = d o ) a
=C]Xl
+
c2x2
i = 1,2
- p5
where q(a) is an admissible characteristic function. For existence and uniqueness of the solutions see the end of the Introduction. 13
1.
INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
Since q(o)= 0 if and only if o = 0, the origin x i the only critical point provided that the determinant
=
0,
5
=
0 will be
Upon expanding A and dividing by k l k 2 # 0 this condition becomes P
(4.3)
+ -b1c1 kl
b2c2 k2
+-fO.
Assume for the present that it is fulfilled. As in the previous case it is convenient to apply a coordinate transformation from the variables x l , x 2 , 5 to new variables y , , y,, o defined by
(4.4) a = ClXl
+ CZX2 - pt.
The determinant of the transformation is A, and so, under the assumption that (4.3)holds, the transformation is nonsingular. Thus again the stability properties will not be affected by the transformation. The differential equations in the new variables are found at once to be
3. =
- k . y1 .I - b Ida), .
(4.5) ci =
ClYl
+ CZY2
-
2-
= 132
Pd4.
In imitation of the treatment of the preceding case, consider the surfaces
V ( y ,a) = y 1 2
+ y z 2 + @(a) = u2.
They represent surfaces of revolution around the a axis generated by the curves y,Z @(a) = u2.
+
We have seen that these curves are concentric ovals around the origin. Hence our surfaces are likewise concentric ovals (of dimension two) around the origin. 14
$4.
SYSTEM OF ORDER TWO
We must find conditions under which the space ovals are all crossed inward by any path y of (4.5). Along such a path VY94
=
2Y1(-k,Y, - cp(4bl) + 2YA-kzYz
-
cp(4bZ)
+ ~ C J ) ( ~ I+Y cI z ~ z) PCP’(O) = - 2 ( k i y i 2 -
+ k,yz2) - pq2(a)
2dlY l c p ( 4 - 2dzYzcp(d
di = 6, - f c i ,
i = 1,2.
The expression V is to be negative for all y , , y , and not all zero. Since (P((T) = 0, if and only if, 0 = 0, this is the same as demanding that - V be positive for all y , , y , and cp(c) not all zero. This is the condition that all the paths penetrate every oval without exception. Now
-P= 2P1(Yl +
$y
+ k,(Y, +
$y]
A sufficient condition for absolute stability is then d12 2k1
p>-++.
(4.7)
dZ2 2k,
The right-hand side is positive unless d l = d , = 0. In any case (4.7) implies that p > 0. It is interesting to compare the inequalities (4.3) and (4.7). Identically
( b - f~)’+ 2bc = ( b + f~)’2 0. Hence
(b1 - fcl)’ 2k 1
+ ( 6 , -2kZ$c,)’
> - - -b1c1 -
kl
bzc, kz .
That is the inequality (4.7) implies (4.3). Hence (4.7) alone i s a sgficient condition for absolute stability. This result is a special case of an analogous and more general property due to LaSalle and taken up later (111, §4). 15
1. INTRODUCTORY TREATMENT OF DIMENSIONS ONE AND TWO
Direct control. The controlled system is now
x . = - k . xI ., - b .
, 4 4 4 9
(4.8)
a
= ClXl
i = 172
+ c2x2.
In the absence of ( one will not need to change coordinates. However, a difficulty arises regarding the condition that the origin be the only critical point. This requires that the relations -kixi
+ birp(clxl + c2x2)= 0,
i = 1,2
have the origin as their unique solution. In the absence of further information as to q(a)we can only assume that this is the case. Choose again V(x1,x2) = X I 2
+ x22 + @(a).
Since @(a) > 0 for all a # 0 and @(O) = 0, Vis positive for all x l , x 2 not both zero. From (4.8) there follows 6 = clf1
(4.9)
+ ~ 2 x 2= - k , c , x i
&r -
+
k 2 ~ 2 ~ 2(bici
By analogy then with the preceding case we find
-V
= 2jk1( x 1
+
+ k2(x2 +
+ b2~2)~(0).
$y]
:;fI ):;fI
+ (p - - - - V 2 ( 4 However, as shown in (IV, $1) one can only achieve here (4.10)
It will still guarantee absolute stability, but this is as far as we will proceed in this direction.
16
Chapter
2
INDIRECT CONTROLS With this chapter we initiate the general study of controls for n dimensional systems and it is continued in the next chapters. For the convenience of the reader some notions introduced in the previous chapter will be repeated. For the characteristic cp(a) however, we shall depend upon (1, $1). In dealing with a general dimension, vectors and matrices are obviously advantageous and they are discussed in $1.
$1. Vectors and Matrices By and large the notations and designations are those of LaSalle and Lefschetz [l, Chapt. I]. As a rule a, b, c,..., x, y will denote vectors and A, B, C,...,X ,Y will denote matrices, while small Greek letters will stand for scalars. In particular x denotes the column vector, one column matrix, whose components are xi, and A the matrix (a. ). !k The designation En, or E when n is obvious, represents the unitmatrix of order n. The transpose of a matrix A = (aij) is the matrix, written A’, whose elements are the symmetric of those of A relative to its principal diagonal. In particular the 1 x n matrix x’, corresponding to x above, is the one-row matrix: row vector, with the same elements as x. 17
2.
INDIRECT CONTROLS
If A = (ajk)is a complex matrix A* denotes the matrix ( a k j ) that is Abstractly ( )* = (-)’. Note that if A is square and nonsingular so that its inverse A - ’ is-defined, the operation of inversion commutes with ()’ or ( )* as the case may be (proof elementary). That is (A- I)’ = ( A ’ ) - or (A- ’)* = ( A * ) - l , for A square and nonsingular ;also for any A : (A)’ = Note also the following property. If x, y are both n vectors, then (cijk)’.
’ v).
x’y
=
cxjyj
=
y’x.
Recall that xy’ represents the n x n matrix XY’ = ( X j Y d
The designation V ( x ) stands for a Liapunov type scalar function of the vector x . The gradient operator ( d / d x l ..., , d/dx,) is thought of as a one-row matrix operator d/dx. Thus dV/dx represents the row vector with components dV/dx,. For easy reference we repeat that an n x n matrix A, whose characteristic roots all have negative real parts, is said to be stable. If the quadratic form x’Fx or hermitian form x*Hx are > 0 [ < 01 for all x # 0 we write F or H > O [ 01. For further remarks on quadratic or hermitian forms see (IX, $1).
-=
92. Indirect Control. General Type Recall that the difference between direct and indirect control is that the operation of the feedback is direct in the first and indirect-through a derivative-in the second. In our treatment indirect controls will serve as the predominant model and so they are fully discussed first. Let the fundamental (state) system be R = A x where x is an n vector and A a constant n x n matrix. Let the control depend upon a single coordinate 5. The indirect control system is (Lurie [l]) i= AX - ( b
g=
(2.1)
0
=
cp(4
c‘x - p5
where b , c are constant n vectors and cp is an admissible characteristic. This time we assume explicitly that A is nonsingular. 18
$2.
INDIRECT CONTROL. GENERAL TYPE
For the existence and uniqueness of the solutions see the end of the Introduction. Lurie's problem. To find n.a.s.c. to have (2.1) asymptotically stable in the large and this for all choices of an admissible cp(a). It implies that all solutions (x(t), 5(t)) of (2.1) -, 0: x = 0, 5 = 0, as t -, + 00 and this whatever an admissible cp(a). This is absolute stability. Owing to the special role of the parameter (T in relation to the system (2.1) it is manifestly desirable to choose coordinates with (T one of them. This is achieved by the transformation (2.2) given below. The important point is that the system in the new coordinates possess the same asymptotic stability properties as the initial system. To that end the only requirement is that the transformation be nonsingular. For it is then readily shown that if, in the initial system, the origin is asymptotically stable for all admissible cp, then the same property holds in the new system (see LaSalle and Lefschetz [l, p. 771). Take then the transformation (x, 5 ) -+ (y, (T) given by
(2.2)
y
=
AX - bl,
(T
= C'X - p<,
replacing (2.1) by j
(2.3)
(2.4)
(2.5)
=
A y - bV(a)
d- = c'y - pcp(0).
A
-b
c'
-p
# 0.
p # c'A-'b,
and this is assumed henceforth. Since a critical point is a solution, absolute stability requires that the origin y = 0, (T = 0 be the only critical point of the system (2.3). Since cp(o) = 0 if and only if (T = 0, (2.5) expresses also a n.a.s.c. for the origin to be the only critical point of (2.3). 19
2.
INDIRECT CONTROLS
Passing now to the problem of absolute stability, a first observation must be made regarding the matrix A itself. The system (2.3) must be asymptotically stable whenever one chooses the admissible function cp = pa, p > 0. Hence the matrix of this linear system
(;4
- pPP b)
must have no characteristic roots with positive real parts. Now for p small, these roots are very near those of A and the root zero. Hence A must have no root with positive real part. Let a matrix whose characteristic roots all have nonpositive real parts be referred to as critical or semistable. Thus aJirst necessary condition for absolute stability is that the matrix A be semistable. Since semistability brings about many complications (discussed in IV) we assume for the present that A is stable. Now the only general method available for absolute stability rests upon Liapunov’s asymptotic stability theorem (IX, 4.5) plus the Barbashin and Krassovskii complement (IX, 4.7). Following Lurie and Postnikov we look for a Liapunov function of type V(y,a) = y’By
(2.6)
+ @(a),
and calculate its time derivative along the paths of (2.3):
V =YBy + yBj
Hence from (2.3): - V = y’Cy
(2.7)
d
+ cp(0)6.
+ pcp2(o)+ 2cp(a)d’y =
Bb - )C
and more importantly we have the Liapunov relation A‘B
(2.9)
+ BA = -C.
Referring to Liapunov’s theorem we must first have V ( y ,a) positive defifor all values of y and a. This requires that B > 0. When nite (see IX, g), this condition is fulfilled we will have V > 0 if y # 0 and also > 0 if y = 0, a # 0 since then @ > 0 (property I1 of I, $1). This will actually hold for all admissible functions cp(a). Finally since B > 0 the expression Iy’Byl -+ 00 with IIyII, and owing to property 111 of (I, $1) @(a) -,co with 20
$2.
INDIRECT CONTROL. GENERAL TYPE
llyll + 101. Thus the requirement of the Barbashin and Krassovskii complement is actually fulfilled. That is the requirements on V(y,a) for absolute stability are satisfied through the mere fact that B > 0. There remains then to arrange matters so that - V is positive definite for all y, a. Here we have the fortunate circumstance that - contains a only through cp(a), and is a quadratic form in y and cp. (This is the great merit of the Lurie and Postnikov type of function V ) It suffices therefore to demand that - V be a positive definite quadratic form in y and cp. The n.a.s.c. due to Sylvester, is that the principal minors of the matrix
c 2)
all be positive. In particular this must hold for C and so C > 0, hence ICI # 0. Beyond this we still require that the determinant
Referring then to (IX, 52) this yields the fundamental inequality (Fi)
p
> d’C-’d.
Notice incidentally that it implies that
(2.10)
P
> 0,
an inequality obvious enough since it merely states that - V(0,a) > 0. Since C > 0 implies B > 0, the only conditions left are C > 0 and (Fi). Hence
(2.11) Theorem. N.a.s.c. in order that the Liapunovfunction V(y,a) of (2.6) be positive definite for all y, a, and that - V = - v(y(t), a(t)) along the solutions of (2.3) be a positive definite quadratic form in y and cp (hence positive definite in y, a) are C > 0 and (Fi). When these conditions are fulfilled the system (2.3) (hence also (2.1)) is absolutely stable. PROOF OF NECESSITY. Since - V is a positive definite quadratic form in (y, cp) we must have C > 0 and (Fi).
PROOFOF SUFFICIENCY. If C > 0 and (Fi) hold - is a positive definite quadratic form in (y, cp). From C > 0 follows then by Liapunov’s relation B > 0, and hence by our earlier argument that V is positive definite for all y, a and all admissible functions cp(o). 21
2.
INDIRECT CONTROLS
(2.12) Properties C > 0 and (Fi)imply (2.5). They imply absolute stability and hence that the origin is the only critical point, from which (2.5) follows. Another proof (algebraic) of (2.12) will be given in the next chapter (111, 2.2). The preceding scheme will guide us throughout the sequel and we formulate : General rule. In seeking absolutestability we will usually lookfora Liapunov function V ( y ,a) positive definite in (y, a) such that - V is a positive definite quadratic form in (y, cp).
Of course any other state variable, for example x, could replace y.
93. Comparison with a Recent Result of Yacubovich The inequality (Fi) may be phrased as follows: (3.1) A n.a.s.c. in order that the quadratic form in (y, 9 ) - V = y’Cy
+ pcpz + 240 d y
be positive definite is that C > 0 and (Fi)hold.
Now in Reference [2] Yacubovich obtained the following result : (3.2) A n.a.s.c. for the positive definiteness of the same form is that there exists a real vector g
=
df
f i such that C - gg‘ > 0.
Evidently the two statements are equivalent. To show that this is indeed the case one merely needs to “complete the squares” in two different manners. Incidentally there will result a new derivation of the inequality (Fi). The “square completing” corresponding to (3.1) is indicated by : -
V = (y‘ + cpd’C-’)C(y
+ cpC-’d) + (p-d’C-’d)cp2
and it is plain that, since C > 0, - V is positive definite if, and only if, (Fi)holds. The square completing corresponding to (3.2) is
22
$4.
ON THE UTILIZATION OF CERTAIN COMPLEX COORDINATE SYSTEMS
REMARKS.I. The only conditions imposed upon the control parameters are (2.5) and (Fi). 11. Actually a less stringent road to absolute stability is often available. Namely, according to a result of LaSalle (see IX, 4.8) it is often sufficient to have - V 2 0 provided that y = 0, a = 0 is the only solution of (2.3) in 0. However to prove this fact frequently requires considerable labor. It has therefore seemed best to adhere to the general rule.
v=
$4. On the Utilization of Certain Complex Coordinate Systems Referring to (IX, $1) one may have occasion to utilize a complex coordinate system in which a real point x is represented as follows: XI,...,X p ,
XI,..., 5,
X 2 p + I,..., X n
where the last q = n - 2 p coordinates are real. In particular the real vectors b, c have components bl,...,b,,
h,...,Lp,
b2,+ l,..., bn
Cl,..., C p ,
C1,**., C p ,
C z p t I,..., C n
with the last 4 components real. The associated matrix A is in the Jordan normal form as described in (IX, $1). This time, if the matrices B, C are hermitian (positive definite), there takes place the Liapunov complex relation. A*B
+ B A = -c,
and B is still unique. Upon introducing for convenience cp naturally V ( y , a ) = y*By
= @,
one takes
+ @(a).
Since now d- = c*y - pcp(rJ),
one finds - V = y*cy
+ pq@ + (@*y + qy*d), 23
2.
INDIRECT CONTROLS
where as before d
=
Bb
-
$c.
Since Sylvester's theorem holds for hermitian forms, or else utilizing the reduction of 93 leading to (3.1), we have this time instead of (Fi) p > d*C-'d
Fi*)
with C > 0 as n.a.s.c. to have - positive definite hermitian in (y, cp). Since real quadratic forms are merely special hermitian forms, the present development includes the earlier theory.
$5. Special Cases Since after all C is a completely arbitrary matrix > O one may specialize it and obtain a simpler form of (Fi)or (Fi*).Take in particular C = E. Then (Fi)or (Fi*)reduces to p
(5.1)
> d*d
=
lld1I2.
This may be achieved by using a special coordinate system in which C = E. Of more interest is the case where the matrix A is relatively simple. Suppose that A diag(&,..., An), so that the 1, are the characteristic roots of A. Assume the coordinates so chosen that actually A = diag(Al,...,An). This will be called the standard example. Set once and for all
-
Re,& =
(5.2)
-ph
> 0.
Referring to (IX,3.9) we have this time
Hence the components of d = Bb - 3c are
and we obtain directly d*C- ' d in terms of C. Take in particular C = diag(a,,...,an), ah > 0. As a consequence
B 24
=
(z'
diag -,...,
J;,
-
$5.
SPECIAL CASES
Thus (Fi) assumes the form
(5.3) The sum is a function of the variables P h , which are as yet purely arbitrary, except that they must be real and positive. If we obtain a minimum of this sum we will have a lower bound for p. To minimize the sum one merely has to minimize its individual terms, i.e. terms of the form
Here one must distinguish several cases.
A. 1 is real, hence b and c are also real. Then
;(
F(P) =
- ;)2.
If c = 0 the minimum of F(P) is obtained for = 0 and then F(0) = 0. If b = 0,c # 0, the lowest bound of F(P) corresponds to = co and is again zero. It is obviously zero if b = c = 0. Suppose bc # 0 and say bc < 0. One may assume that b < 0, c > 0. Now
F‘(P) = 2(;
);
+
(5 ;). -
The second factor is # 0, so that F’(B) = 0 only for P2 = - cp/b. Hence, FrniJP) = P b c / ~ l . Suppose now bc > 0. We may take b > 0, c > 0. Then in F’(P) the second factor alone may vanish and the minimum is zero. Note that in both cases
25
2.
INDIRECT CONTROLS
B. i, is complex, hence h, und ch ure both complex. This time one must deal directly with F ( P ) . Here
The cases h = 0 or c = 0 yield the same result as before. We may therefore assume that hc # 0. Now
for /I2
= CI = p
m = plc/bl (as before) and the minimum is
For the real case, this expression coincides with the one already obtained. Therefore, finally one obtains for p the lower bound (5.4)
26
Chapter
3
INDIRECT CONTROLS (continued) The general theory developed in the preceding chapter presents a number of side questions to be discussed in the present chapter.
81. Invariance under Change of Coordinates There have been obtained two inequalities involving expressions depending upon the coordinates, namely (11, 2.5) and (Fi).We shall show that these inequalities do not depend upon the coordinate system. To be precise, let y = Py0 be a transformation of coordinates and let A,, Bo,..., be the expressions corresponding to A , B,..., in the new coordinates. Let also (Fio),(11, 2.5)' correspond to (Fi)and (11, 2.5) in the new coordinates. We propose to prove: (1.1) (Fi)and (11, 2.5) are respectiuely equivalent to (Fio)and (11, 2.5)'.
This means that one is free to express these inequalities in any convenient coordinate system. The transformation y = P y , gives rise to a new system )" = P - ' A P y o - cp(a)P-'b
6
=
c'Py, - pcp(a).
27
3.
INDIRECT CONTROLS (CONTINUED)
Hence A , = P-'AP,
bo = P - ' b ,
c,'
=
c'P.
Consequently y'Py
= y,'P'BPy,
so that B, = P'BP, and also at once c,'A, b, = c'A- ' b . This proves the invariance of the inequality (11, 2.5). We also have
do
Moreover
=
B,bo - *c, = P'BPP- ' b - ~ P ' = c P'd.
c, 1 --( p ' c p ) - l
and therefore
= p-lc-lpt-1,
' '
do'C; 'do = d p p - C - p' - P'd = d C - ' d .
Since p is invariant we have p - d C - ' d = PO - do'C,'do
so that (FJ is likewise invariant.
52. Reduction of the Number of Conditions on the Control
Parameters
A variety of conditions have been imposed upon the control parameters b, c, p, cp(a). As we shall see, these conditions are far from independent and hence can be sharply reduced in number. They are: On cp(a): properties I, 11, 111 of (I, $1); notably 111 has been imposed to guarantee asymptotic stability in the large, i.e. in particular, that every solution (y(t),a(t))of (11, 2.3) -0 as t + 00. On b, c, p :
+
(2.1)
p # c'A-'b,
likewise (Fi) for some C > 0. Actually it was shown in (11, 2.12) that, indirectly C > 0 plus (Fi)imply (2.1). A more complete result is the following proposition due to LaSalle [l]. (2.2) Theorem of LaSalle. Let A be stable and C (hence B ) > 0. Then d C - ' d > c'A-'b.
(2.3) 28
52. REDUCTION OF THE NUMBER OF CONDITIONS ON THE CONTROL PARAMETERS Moreover theorem (11, 2.1 1) is still valid without imposing on q(a) property I11 of I , $1 (divergence o f t h e integral @(a) as a -, f 00).
(2.3a) Corollary. C > 0 plus (Fi)imply (2.1) in the strongerform p > c'A-'b. REMARK. A
weaker form of (2.3) was obtained earlier by Yacubovich.
PROOFOF (2.3). This is the crux of the matter and the more difficult part of the argument: we first show that if x , y are two n vectors then (Bx - y)'C- ' ( B x - y ) 2 2y'A- ' x .
(2.4)
By choosing x
=
b and y = fc, this yields
d C - ' d 2 -c'A-'b,
(2.5)
which is (2.3). Thus, we have to prove (2.4). This will be done through another relation. Incidentally the necessity of the relation (2.6)
p # c'A-'b
for absolute stability was also proved in (11, $2).
Let, then, u, v be two arbitrary n vectors, and Q an n x n matrix such
that
Q
(2.7)
+ Q' > 0.
Note that in view of u'(Qu) = (Qu)'u = u'Q'u, we have (2.8)
u'Qu = iu'(Q
+ Q')u > 0
for all u # 0.
Moreover (2.8) implies that Q is nonsingular. For otherwise there would exist a vector u # 0 such that Qu = 0 and hence u'Qu = 0. It follows that Q-' exists. Now in replacing in (2.8) u by Q - ' u # 0 if u # 0, there follows u'(Q- ' ) ' Q . ( Q - ' u ) = u'(Q- ' ) ' u if u # 0. Hence also u'*Q-'u > 0
(2.9)
if u # 0. Thus actually Q- ' likewise satisfies the relation (2.7) (Q replaced by Q- '). Let now a = (u' v'XQ Q')(u v ) - 2u'(Q Q')Q-'(Q Q')v,
+
+
+
/? = ~ ( u ' Q ' u'Q)Q-'(Qu
+
-
+
Q'u). 29
3.
INDIRECT CONTROLS (CONTINUED)
From (2.9) one infers that fl >= 0. On the other hand by expanding both a and p one verifies that a = p and so a 2 0. Upon replacing now u and u in a by (Q + Q')-'u and (Q + Q')-'u there follows (u' + u'XQ + Q')-'(u + U) - 2 u ' Q - l ~2 0. Set now Q = -BA, u = - y , u = Bx. This is legitimate since
Q There results then -(Bx
+ Q' = -(A'B + BA) = C > 0.
- y)'(A'B + BA)-'(Bx
- y ) 2 2y'A-'x
which is precisely (2.4). Thus (2.4) is proved and so is (2.3). Since p - c'A-'b > 0, the transformation (x, 5 ) + (y, a) of (11, 2.2), is justified and so we are at liberty to keep on operating with the variables y, a,that is with the system (II,2.3).
PROOFTHAT J(y(t))(+ (a(t)J+ 0 as t + tions the function
v
+ co. Under
our assump-
w ,4 = y'By + @(a)
is positive definite and is negative definite throughout the space (y, a). Hence (Liapunov, IX, 4.5) the origin is asymptotically stable. Referring to LaSalle and Lefschetz [l, p. 66, Theorem VIII], it is therefore sufficient to show that the solution (y(t),a(t)) is bounded. The first step is to show that it is defined in the future, that is, that there is no finite positive T such that (ly(t)(l + lcr(t)l + co as t + T from below. Take any 0 < to < 'I: Since 0 we have for t > t o :
v<
vo = V(Y(tO),@ON
> V(y(t),m.
There is also an a > 0 such that Y'BY L alJYJJ2
for all y. Hence for t > to
vo > allAt)112 + @(o(t)). Therefore I(y(t)(l is bounded for t = T and so for all t. Moreover @(a(t)) is also bounded. This may happen through a(t) being bounded for all t, or else through la(t)(+ co with t. All that is required is to exclude this second possibility.
30
$3.
LURIE'S METHOD A N D A VARIANT
Suppose then that lo(t)l + m. As a consequence for some T and t 2 'I:o(t) will have a fixed sign, say a(t) > 0. (The case o(t) < 0 would be treated in the same way.) Now 6 - C'JJ = C? - c'(A-'j
+ A-'bcp(o)) = -pcp(o).
Hence in view of (2.3a) d 44) - c'A-'y(t)) = - ( p - c'A-'b)cp(a) < 0. dt
Since by hypothesis o(t) + + 00 with t, and since Ily(t)II remains finite we may choose T so large that for t 2 T we have o(t) - C'A - ' y ( t ) > 0.
Upon integrating then from T t o t we obtain [o(t) - C'A - 'Y(t)]'T
< 0.
As a consequence, since Ily(t)II is bounded, so is the proof of the theorem.
o(t). This
completes
$3. Lurie's Method and a Variant By taking a more restricted type of matrix C and assuming that the characteristic roots of the matrix A are all distinct, Lurie obtained a narrower sufficiency condition for absolute stability than the inequality (Fi). In presenting these results we will operate at once with complex coordinates, that is take the situation of (11, $4). Now Lurie's choice of B and C is as follows: C = ad' diag(a, ,..., an), ah 0
+
B= -
(*) + Aj
where
lj = - p j
=-
jlk
+ diag
+ ivj,
(2,$) ...,
p j > 0.
Note that
JJ *cY = JJ * a * y
+ EphYhyh 31
3.
INDIRECT CONTROLS (CONTINUED)
Hence likewise B > 0 (mere verification). Then Lurie completes the squares in the expression of and obtains cross products of type y*y, whose coefficients, set equal to zero, give suficient conditions for absolute stability. The computations are definitely involved and we merely reproduce the final result with mild changes of notation. Observing that the only condition on the ah are that they be positive, we find for the Lurie system under the general Lurie assumption that every b j = - 1 :
-v
( j = 1,2,..., n).
(3.1)
His conclusion is that ifone canfind a real point a such that all the inequalities (3.1) are satisfied then the control is absolutely stable.
A variant. Observe that a simpler way to guarantee that positive definite is to take d = Bb - fc
(3.2)
=
v be
0.
With B, C and the general situation unchanged, the system (3.2) yields akbk
(3.3)
+
-djx-+--k l j
ajbj 2pj
-0,
2 j -
j = 1 , 2 ,..., n.
If one assumes, as done throughout in Lurie's equation that every bj = - 1 then (3.3) reduces to (3.2), except of course, for the additional term in Our system is thus mildly more complicated, but the difference is not really significant. However, its derivation is altogether simpler than that of Lurie.
fi.
REMARKS. I. As observed by the author in a recent paper (Lefschetz [3]) the relation (3.2) was utilized earlier and independently by Mufti. 11. The number of parameters in the matrix C of Lurie's special type is 2n. Since it is not certain that they are independent, one may say that his system depends on at most 2n parameters. On the other hand, the general system: C arbitrary positive, depends upon n(n + 1)/2 parameters and this number >2n for n 2 4. In point of fact for n = 2 the two systems depend upon three parameters-not the apparent four since three is the maximum possible. Thus, for n 2 4,the class of Lurie matrices C is definitely special.
32
54. APPLICATION
TO SYSTEMS OF ORDER TWO
54. Application to Systems of Order Two We shall take advantage of the low dimension to study our system ($3, variant) more fully. And specifically, we shall discuss at length the properties of the relation (3.2). First, a preliminary remark. We will assume the coordinates y , , y , so chosen that the matrix A is in one of the Jordan normal forms. Moreover, we will also suppose that under these coordinates the vector h has both components b,, h, # 0. Observe that from the control point of view, if e.g. b , = 0, the first equation of the system reads Y1
=
IlYl
which indicates that y , is not affected by the operation of the control. In other words, practically speaking the control is only partly effective. Thus, our assumption that both h,, h, # 0 means that the control is completely effective. Under our assumption then, the change of coordinates y j + hjyj, j = 1 , 2 will yield the same system but with
This scheme will somewhat simplify the calculations. We continue to assume, of course, that the matrix A is stable. There are then the following three possible normal forms for A : I. 11. 111.
(:
diag(A,,A,),
diag(1, I),
A, and I , real and A
real and
I
complex.
The following notations will be used : I. 11. 111.
A1 =
- p , < 0,
i2 = - p 2 < 0;
A = - p < 0; ;i = - p
+ iv,
p > 0.
33
3.
INDIRECT CONTROLS (CONTINUED)
Thus, the three normal forms are: diag(-p,, - p 2 ) ;
I. 11.
CASEI .
+ iv, - p - iv).
diag( - p
111.
The system to be discussed is then )'I
=
+ do)
-Ply1
= ClYl
;I.
+ C2Y2 - P 4 W
Choose now
c=(; As we assume C > 0 we must have
p >O,
(4.2)
pr - q 2 > 0.
=( 1:
As a consequence, it is known that
B (4.3)
Po=-,
P
qo=-
2P 1
Then (3.2) yields
> 0, 4
PI
+ P2
,
ro=-.
r
2P2
+ q o + +Cl = 0 qo + ro + )c2 = 0.
Po
(4.4)
ing :
The inequalities (4.2) are seen at once to be equivalent to the follow-
(4.5) 34
54.
APPLICATION TO SYSTEMS OF ORDER TWO
Upon expressing p o and ro in terms of qo from the relations (4.4) and substituting in (4.5) there follows ; g(40) = (& - M O Z - 3 C l + d q o - h 1 C Z < 0. (4.6) Suppose first E > 1. The discriminant of g(qo) is
If 6 5 0 and since E > 1 the inequality (4.6) does not hold. That is the present scheme based on (3.2) does not lead to a sufficiency condition for absolute stability. Assume 6 > 0. Then g(qo) has two distinct real roots q l , q,. One can only satisfy (4.6) by taking 40 in the interval ( q l , q,). On the other hand, p o > 0 requires qo < -f-cl. Now g(-+c,)
= &C12/4> 0.
Hence -+cl is not in the interval (ql,qz). Hence qo < -fcl can only be fulfilled if %q1 q,) < -fc,, or
+
-+cl
>
c1
+ c2
4(E - 1)
~
or finally ( 2 E - l)c, < -c*
(4.7) (4.7a)
(C1
- C2),
+ 4EClC2 > 0.
Thus, if E # 1, i.e. p, # p, or equivalently 1, # 1, both being real, the inequalities (4.7), (4.7a) together with p > 0 express sufficient conditions for absolute stability. We will then have an appropriate value of qo, hence from (4.4) appropriate values of ro, and so appropriate p , 4,r from (4.3). Suppose now that E = 1, i.e., that 1, = 1, = - p < 0. We still have, of course, A = diag(1, 1).Here (4.6) reduces to 2(c,
If c1 + c, # 0, this yields 40
+ c2)qrJ + ClC,
-c,c, a -21_ __ c1 + c,
if c 1
> 0.
+ c,
2 0,
a condition that may always be satisfied. Hence, again we will have appropriate values of qo, po, ro and so likewise of p , q, r. 35
3.
INDIRECT CONTROLS (CONTINUED)
+
Finally, if c1 c2 = 0 (4.7a) demands that (cl + c 2 ) < 0, which does not hold. Thus, also in the present instance, one cannot verify absolute stability on the strength of the system (3.2). CASE11. Normal form
.=(::I
Since p is real, we take again
(4.9)
and we have the inequalities (4.2). One must solve for B from Liapunov's relation (4.10)
A'B
+ BA = -C.
Set once more B
=
(::::).
Then from (4.10)
Upon identifying corresponding terms and noting that I = - p < 0, one finds the relations 2(PoP - 40) = P,
(24oP - 10) = 49
2rop
= 1.
Observing that in view of the second inequality (4.2) the first may be replaced by r > 0, we find from (4.2) the necessary inequalities ~ ~ O P ( P O P-
40) - (2cw - rOl2 > 0,
10
> 0.
Upon substituting for p o and ro from (4.4) there follows 4&2
+ q O M k 1 + q 0 b + qo) - (
2 w + 0 c 2 + q0)I2 > 0 +c2
36
+ 40 < 0.
$4.
APPLICATION TO SYSTEMS OF ORDER TWO
Upon setting $c2 + qo = S, this system reduces ultimately to S2 - 2&c1 (4.11)
+ c,)S +
<0
/A%,
S < 0.
Let f ( S ) denote the above quadratic in S. Since f ( S ) < 0 for some S , the two roots S , , S2 off@) = 0 must be real and there must be negative numbers in the interval (Sl,S2).Since SlS2 = p2c2’ > 0 the two roots have the same sign and so both must be negative. Hence, we must have S1 + S2 < 0. Thus, our conditions are (4.12)
To sum up in the present case if (4.12) holds and S is chosen between S , and S2 absolute stability is guaranteed.
A, 1.The system is now
CASE111. Characteristic roots (4.13) b = cy
+ cjj - pcp(a).
This time C must be a positive definite hermitian matrix e.g.
The condition C > 0 requires that p > 0,
(4.14)
p2
- qq > 0.
),
Then if B = - 1( Po 2
we have P o = -P P
9
40
qo=- 4 p
40
Po
+ iv
r , ro =-. P
37
3.
INDIRECT CONTROLS (CONTINUED)
This time (3.2) yields (4.15)
po
+ 40 + c = 0.
From (4.15) follows (4.16)
P
Hence from (4.14) p2(c
Upon setting (4.17)
I-:
=
PPO =
+ qo)(L' +
=
40)
- d c + 40).
- (p2
+ VZ)q,qo
> 0.
v2/p2 this inequality reduces to
~j(q0= ) CC;
+
+
40~;
~ O C -
We also have from (4.14) and (4.16) that c Hence we may write c =M
+ ig,
(70
~9,340> 0.
+ q,, must be real and negative.
=y -
ig,
where (4.18)
a+y
From (4.17) there follows as a condition
h(M) = M 2
+ 2yM - (1 + E ) p 2
- &y2 >
0.
Since we dispose of 4 let us choose it so that y # 0. Denote by a l , u2 the two roots of h(cc).Since a 1 a 2 < 0 both are real and of opposite sign, and M must be between them. Let u1 < 0. Since h ( - y ) < 0, -y is between the roots. Hence in order that (4.18) hold, one must take M < al. If that is done the present scheme based on (3.2) will effectively guarantee absolute stability.
38
Chapter
4
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK $1. Direct Control: General Case The basic system is (1.1)
i= AX - q(0)b 0 =
c’x
where A continues to be a stable matrix and q(a)is the usual characteristic function. The system is of the same order n as the state system i= A x . We refer again to the Introduction for existence and uniqueness of solutions. The problem continues to be to find sufficient conditions for absolute stability. At all events for absolute stability the origin must be the only critical point. That is, the only solution of
(1.2)
A X - q(0)b = AX - q(c’x)b = 0
must be x = 0. In the absence of more information about q(a),all that one can do is to assume explicitly that this condition is fulfilled. In a moment 6 will be required. Here
(1.3)
& = c ’ i = c‘Ax - q(0)c’b.
39
4. DIRECT CONTROLS, LINEARIZATION, A N D MULTIPLE FEEDBACK
Upon setting (1.4)
CO =
A'c,
po
=
c'b
the relation (1.3) assumes the more familiar form 6
= co'x -
poq(a).
In the search for absolute stability the temptation is great to follow the same path as for indirect controls. Unfortunately this turns out to be only partly possible. Take the same Liapunov function as in (11, § 2 ) : V ( X )= X'BX
(1.5) Here again
- V = X'CX
(1.6)
A'B d
+ @(a).
+ 2 d x q + c'brp2
+ BA = - C
=
Bb - ~ A ' c .
Choose once more C > 0, and as a consequence B > 0. Hence V ( x ) is positive definite for all x and +m with IIxII. Thus V still behaves satisfactorily. Not so, however, regarding F! We still have (as in 11, $3): - V = (X
(1.7)
+ C-'
drp)'C(x
+ C-'
dq)
+ (c'b - d C - '
d)rp2,
and one would like to adopt c'b > d C - ' d
(1.7a) as a condition. However
V = (2Bx - crp)'.i
(1.8)
=
(2Bx - c ~ ) ' ( Ax bq).
This shows that V cannot be formally a definite quadratic form in the independent variables x and rp since, for example, one may have Ax - brp = 0 for x , cp not both zero. The best that one may achieve is V 0, and this will follow if we impose c'b = d C - ' d.
(1.9)
since then by (1.7) (1.10)
-
40
P
= (X
+ C-'dq)'C(x + C - ' d q )
$2.
DIRECT CONTROL, STANDARD EXAMPLE
It is clear from the above that in the (n + 1)-dimensional space x, cp (cp considered as independent of x) - V is positive semidefinite and furthermore (since C > 0) - is zero only on a one-dimensional subspace of the space of x and cp. Now we shall make - V positive definite in x alone. By looking at (1.8) we can see that - is zero when Ax - bip is zero and since A is stable this system of equations has a one-dimensional subspace of solutions. Thus - V is zero if and only if A x - bcp = 0. Now we take into account that x and cp do not in fact vary independently. We have already assumed that the only critical point for (1.1) is the origin, hence A x - bcp(c'x) is not zero and - V is positive definite in x alone. Thus the system (1.1) is absolutely stable. An easy way to find that the origin is the only critical point is this: let A - pbc' be nonsingular for all p > 0, but that for some x,: Ax, - bcp(c'x,) = 0. Now cp(c'x,) = poc'xo, for some p o > 0. Hence (A - pobc')x, = 0 which implies x , = 0. We shall show later ($4) that [ A - pbc'l = IA((1- pc'A-'b) # 0. Hence [ A - pbc'l # 0 for all p > 0 if and only if c'A- ' b 5 0. (1.1) Theorem. If (1.9) holds, C > 0, and c'A-'b absolutely stable.
5 0 the system (1.1) is
A more flexible situation is discussed in $$6,7. In my paper [3] I wrongly used (1.7a). This was recently indicated in Aizerman and Gantmacher [l, p. 181.
42. Direct Control, Standard Example An interesting special case is the standard example of (11, $5). In this case A is taken as diag(Al,A2,...,A,,) where the Ah = -ph are real and negative. As in (11, $5) take C = diag(a,, ..., a,,), ah > 0 and as a consequence B
and
=
';;(
diag -,...,
-
41
4.
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
Thus (1.9) assumes the form
or
The above equality can be satisfied if and only if ah = ph2(ch/bh)if we make the physically reasonable assumption that the bh and ch are not zero. The inequality ah > 0 implies Chbh > 0. Since c'A-'b = -Z(chbh/ph) the condition that every chbh> 0 implies c'A-'b < 0 and hence the origin is the only critical point (11, 2.5 and 1.4). Thus a suficient condition for absolute stability of (1.9) when A = diag(ll, ,..., An), A h < 0, is that every Chbh > 0. One should not be deceived by the greater mathematical simplicity of the equations for direct control. If one tries to use the Lurie variant for case I of (111, $4) one finds that the conditions cannot be satisfied. In this case (1.9) becomes c1
+ c2 = 0
and the inequality (4.6)of 111 becomes (E
-MO2
- 8PlCl
+ PZCZ)40
- 4PlP2CICZ < 0.
But the discriminant of this quadratic qo is zero and hence the inequality can never be satisfied.
$3. Reduction of an Indirect Control to a Special Direct Control Take the initial basic system (II,2.1) of indirect control and introduce new ( n + 1) vectors z
=(;I,
-b,
=(o), 0 1
42
c0 =
(:j.
w. LINEARIZATION OF DIRECT CONTROLS Define also an (n x 1) x (n + 1) matrix A , to take the place of A as A,
=
(:-3,
Then (11, 2.1) is equivalent to the system z = Aoz 0
- cp(o)b,
= CO‘Z
which is of the direct control type but with the peculiarity that the matrix A, is not stable since it has one characteristic root zero. The order of the new system is n + 1.
54. Linearization of Direct Controls Since po, p > 0, is an admissible cp function, its substitution for cp in a control equation gives rise to a linear system. In order to have absolute stability this linear system must be asymptotically stable. From this there will arise certain absolute stability conditions. It may be said that this question has been fully investigated by Yacubovich notably in references [a] and [4]. The linearization problem is closely related to the well-known problem of Aizerman. The latter author inquired essentially into the extent to which one could substitute for a function cp restricted by a d
< acp(a) < Po’
a linear function p~b,a < p < /I. Ample details and references on this problem will be found in Pliss [l]. Take first the direct control (1.1). The linearized system is (4.1)
i= ( A - pbc’)x.
A n.a.s.c. to have (4.1) asymptotically stable is that the matrix A - pbc’ be stable, i.e., that its characteristic roots have negative real parts. The characteristic equation of this matrix is (zE - A + pbc‘l = 0. It is convenient to write zE - A = A, so that 1A.l = 0 is the characteristic 43
4.
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
equation of A itself. The above relation is then (A, + pbc'( = 0.
(4.2)
As in (11, $2) one shows that absolute stability does impose the condition that A possess no characteristic roots with positive real parts. This does not exclude that A possess characteristic roots with zero real parts: zero or pure imaginary. Let it be assumed then that A is semistable or stable and examine more closely the characteristic equation (4.2). We are concerned with p small and briefly assume that z is not a characteristic root of A so that A,-' exists. Then (4.2) is equivalent to IE pbc'Az-'I = 0. A closer look at this last expression will yield a simpler expression for (4.2). Consider the characteristic roots of the n x n matrix pq' where p and q are n vectors. Since each row of pq' is a multiple of the row vector q' the matrix has rank one and hence one nonzero characteristic root. Since the trace of a matrix is the sum of its characteristic roots it follows that the one nonzero characteristic root of pq' is trace pq'. But trace pq' = q'p and so IzE - pq'1 = Z " - ' ( Z - q'p). This reduces to IE - pq'1 = (1 - q'p) by setting z = 1. For q' = c'A;' and p = - p b , (4.2) becomes
+
(4.2a)
lAzl (1
+ pc'A,-'b)
= 0.
The solutions z ( p ) of (4.2a) for p small are very near to, but not identical with, the characteristic roots of A. Since A is semistable some are then very near to zero or to some points on the complex axis. We first investigate z(p) which + 0 with p. Since for such a solution IA,l # 0, it satisfies the relation (4.3) Let zero be a root of order k of [All = 0, and so a pole of order at most k of c'A; 'b. Thus near zero
The coefficients a, p,... are all real for they are merely the coefficients of the McLaurin expansion of zkc'A;'b. Moreover a # 0 since otherwise (4.2) would have the fixed solution z(p) = 0 and (4.1) would not 44
$4. LINEARIZATION OF DIRECT CONTROLS
be asymptotically stable for p > 0 and small. Thus zero is actually a pole of order k of c'A;'b. From the above series expansion there follows in succession C'A; ' b = 5 Zk
(I +
12
+ ...),
-1
Hence setting p = vk, v > 0, one finds
( ka, + -.-)
= (-a)llkv.
z 1 - -z
By the implicit function theorem this yields for v small
(4.4) This expression provides k determinations of z(p) for p small which are very near to the vertices of a regular polygon centered at the origin. Hence if k > 2 some determination will fall to the right of the imaginary axis thus contradicting the assumption that the linearized system is asymptotically stable. Therefore one requires that k 5 2, i.e. k = 0, 1, 2. The value k = 0 offers no interest; no characteristic root comes near zero for p small. There remains k = 1 or 2 and we examine these values separately. Letfirst k = 1. Then from (4.4) z = -av + ... which must be to the left of the imaginary axis. This requires that a > 0. Take now k = 2. This time z =
- -v2 P
k
+ ....
If a < 0 one of the solutions will fall to the right of the imaginary axis. 45
4. DIRECT
CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
Hence one must have u > 0. As a consequence
Re z
=
-P -v2
+ -..
k Thus to have z fall to the left of the complex axis for p > 0 and small, one must have fl 2 0. To sum up when zero is a double characteristic root of A necessary absolute stability conditions for the system (1.1) are u > 0, 1-3 2 0. Quite similarly one may obtain the following results for absolute stability and a complex characteristic root iw : (a) i o may have at most multiplicity two. (b) I f i o is simple and in its vicinity
then R e a 2 0. (c) If i o is double and in its vicinity CIAz- b
=
u
(z - io)2
+-( z -P iw) + y + d(z - i o ) + *-,
then u > 0, R e g 2 0.
95. Linearization of Indirect Controls The linearized system (11, 2.3) is (5.1)
with characteristic equation A, -c'
Pb z
=
+ pp
0.
As before it is equivalent to 1+P
46
p
+ c'A; ' b
=
1
+ pg(z) = 0.
$5.
LINEARIZATION OF INDIRECT CONTROLS
If one considers (5.1) as a direct control then g(z) plays the role of the expression c'A;'b. In terms of the present A , we have: zero is at most a simple, and iw, w # 0, at most a double characteristic root of A . The special property of the zero root is caused by the fact that z = 0 is already a pole of g(z). The detailed analysis follows. (a) Zero is not a characteristic root of A . Then near z = 0 U
g(z) = Z
where u = p at once A;' which yields
+ p + yz + ...
+ c'A, 'b. Now A , = -
= - A and the calculation of A,' yields A - ' . Hence, referring to $4 one must have u > 0
p > c'A-'b,
an inequality already obtained by another method in (111, $2). (b) Zero is a simple characteristic root of A. Let then dA,- ' b = - + U Z
a + YZ + ....
As a consequence
and from tj4 one obtains the conditions
p 2 -p.
U>O,
(c) iw(w # 0) is a simple characteristic root o f A . Here then
Hence
Hence from (tj4), U
Im- > 0. w
47
4.
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
(d) iw(o # 0) is a double characteristic root of A . This time c'AZ-'b =
+-
U
( z - io)2 ( z - io)
+ y + ...
and so from $4 Im7P 5 0.
Im-U c 0, w
1 0
$6. Direct Control with Matrices B or
C of Rank < n
Absolute stability has rested in $1 upon matrices B, C > 0. Still preserving the form of the Liapunov function V of $1, it is of interest to discuss to what extent one may be able to reach the desirable goal of absolute stability with critical matrices B or C. Recall these two properties $3):
(1x9
(6.1) If A is stable and C > 0 then B > 0. (6.2) If B and C > 0 then A is stable. Observe now that if c = 0 then the system (1.1) is uncontrolled. As this is without interest we may assume that c # 0. Hence one may select the coordinates such that c'x = x, = fs. This suggests the following convention : If F is an n x n matrix or f an n vector, denote by Fo or fo the result of deleting the last row and column from F and the last component fromf: In particular write A = l A0 g , u fl .
B = l Bo h
h'
P
1,
c = c/o k'
k
1.
Y
Heref, g , h are (n - 1) vectors and u, P, y are scalars. With these designations and coordinates the system (1.1) assumes the form
+.f - bocp(o) 6 = g'x, + ufs - pcp(a).
i o = Aoxo
(6.3)
48
56.
DIRECT CONTROL WITH MATRICES
B
OR
C
OF RANK
The case of particular interest is that of B of rank
must be positive definite and so B, > 0. Thus B is of rank n - 1 and this is assumed henceforth. Since we wish to have V > 0 for all admissible cp, this must happen for cp = 2pa, p > 0. Thus the quadratic form in xo,a x,'B,x,
(6.4)
+ 2h;x00 + (B + p)aZ
must be positive definite for all p > 0. Since B, > 0 the only additional Sylvester condition is
or, according to ( I X , 52) (6.5)
(B + p) > h'B, ' h .
Now
fl > h'B; ' h implies that B > 0 and so it is excluded. If
B = h'B, ' h thcn
+ h'B, ' ~ ) B , ( x ,+ hB; + paZ. Hence if one chooses as coordinates (x, + hB, la,a)our situation will x'Bx
=(x~'
'0)
be unchanged save that with the new coordinates /J = 0. This is assumed henceforth. As a consequence (6.5) becomes p > h'B, 'h.
Since this must hold for p arbitrarily small positive and B , > 0, necessarily h = 0. Thus under our stringent conditions we have B = diag(B,,O). Upon taking into account the properties of the admissible class cp it is easily verified that V fulfills all its expected requirements regarding absolute stability. 49
4. DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
The situation regarding - p is less simple. There are again the relations (1.5) and (1.6). One faces then two alternatives: (a) as in $1 obtaining absolute stability through A stable, C > 0, the relation (1.9) and a positive semidefinite quadratic form in x and cp; (b) accept that A may be unstable, preserve B, > 0 and look for -p(xo, a) a positive definite function of xo,a. Since then - p(xo,0) = x,'C,x, > 0 for x, # 0, one will have Co > 0. We do not return to (a) and in the next section discuss B,C both of rank n - 1. Coupled with our desire to have V ( x , , a) and -p(x,, a) positive definite this is equivalent to B,, C , > 0.
97. Direct Controls with Matrices B, C, of Rank n - 1 As just observed this is equivalent to B,,C, > 0. As in $6 let the coordinates be so chosen that B = diag(B,, 0). Now from the Liapunov relation for A , B,C there follows
(7.1) Hence (7.2)
Ao'Bo
+ BOA,
=
-Co,
k
=
-BOA
7
=
0.
Now the determinant of the quadratic form x'Cx must be zero, as rank C = n - 1. Hence
co k lkl
J= O.
Since Co > 0 and referring to (IX, $2) this yields k'C; ' k = 0. Since C'; is likewise > O necessarily k = 0, hence alsof = B; ' k = 0. Thus C = diag(Co,0), and (6.3) looks like this : (7.3)
X o = Aoxo - bocp(a)
6 = g'x, + cia - pcp(0). Note that ci is now a real characteristic root of A . Hence c1 g 0. Moreover from (6.2) and (7.2) there follows that A , is stable and so if ci < 0, A itself
$8. DIRECT CONTROL WHOSE MATRIX A
HAS ZERO AS A CHARACTERISTIC ROOT
is stable. The system (7.3) is then a standard direct control of the type of (1.1). Suppose that u = 0. Then the system (7.3) may be identified with (11, 2.3) if p # 0: an indirect control, or with (1.1) if p = 0: direct control, both of dimension n - 1 and basic matrix A,. In the first case A , b, c, y of (II,2.3) correspond here to A,, b,, g, x while in the second A , b, c of (1.1) correspond to A,, b,, A ; 'go. Both cases are covered by our earlier arguments. Suppose now u # 0. With V as before we can calculate V simply by observing that it is the sum of its value for u = 0 plus u times the coefficient of a.Hence:
-P= {x,'B,x,
+ 2d9'xocp(a) + pcp2(a)) - 2aocp(o),
do = Bobo - ig. Since a 5 0, and acp(a) > 0 for a # 0, in order to have - positive definite as a function of xo,a it is sufficient that the quadratic form in x,, cp in the bracket be positive definite if a = 0 positive semidefinite if u < 0. Hence the final condition (7.4)
Pp >=>{
do'C, 'do
for}
u=o u
< 0.
When (7.4) holds we will have - positive definite in x,, cr, that is, in x and this for all admissible cp. Since V-co with (Ix(1,absolute stability will have been achieved in both cases. The preceding discussion leads to the following result :
(7.5) Let both B, C be of rank n - 1. Then n.a.s.c. to have V and - V both positive deJinite as functions of x or x,, cp, are that in appropriate coordinates B = diag(B,, 0), C = diag(C,, 0) with B,, C,, > 0 (hence A , stable) plus the property (7.4). When this holds the system (1.1) is absolutely stable.
$8. Direct Control Whose Matrix A Has Zero as a Characteristic Root (Kenneth Meyer) In this section and the next we fully discuss several critical direct controls. In the first case the matrix of the linear part of the system has 51
4.
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
a simple zero characteristic root. By a suitable change of coordinates the system is then f = A x - bq(a)
g=
(8.1)
Bda)
a = c‘x - y(
where x , b and c are n vectors; 5, B, y are scalars and A is an n x n stable matrix. Now /3 = 0 implies that 5 = constant and therefore 5 = 0 to achieve stability. Hence the system is just (1.1). We may therefore assume /I# 0. Then replacing 5 by
x
=
A x - bq(a)
v44
g=
(8.2)
a = c’x - y(.
This is a special form of an indirect control of dimension n with control variables 5, a. Its reduction to the natural indirect control of (11, 2.1) will be established in (VII, $1) (Popov system). There is no need therefore to discuss this system any further. Let us turn our attention then to the more interesting case of a double zero root. It will be convenient to take the system as n + 2 dimensional. If A,,... denotes the usual quantities A,... in the system (1.1) then by a suitable choice of coordinates A, = diag(02, A ) where A is an n x n stable matrix and O2 has one of two forms
J
02=(0
J.
0 0
0 0
or
02=(
If O2 is of the first form the system reduces to lil
= -B
ld4
t 2
= - B2Cp(a)
f = Ax - bq(a)
a = 7151
+ Y 2 t 2 + c’x
It is clear though that there exist nonzero constants that the above system has the solution t1 = gl, t2 = 52
r2,5, xand 0,E2a such0. =
=
$8. DIRECT CONTROL WHOSE MATRIX
A
HAS ZERO AS A CHARACTERISTIC ROOT
Thus this direct control cannot be stabilized when O2 is of the first form. In the second case the system reduces to
n=
y151
+
y252
+ c’x.
Now choose as a Liapunov function for V = ut12
(8.3)
+ X’BX+ @(a)
where a will be chosen later so that u > 0. In this case Bo = diag(a, 0, B) and from the Liapunov relation Ao’Bo
+ BOA0 = - Co
one finds Co = diag(0, 0, C),
-C =
A’B
+ BA.
Also do’ = (Bobo - +AO’cO)’ = ((aB, - + y 2 ) , 0. d ) d = Bb - )A‘c
Hence
co’bo = BlYl
+ B 2 Y 2 + c‘b.
- V = xO’COXO = x’Cx
+ 2do’xOcp + co’b0cp2
+ (as1 - y2)t1(p + 2d‘xcp
+ (BlYl + P 2 Y 2 + c’b)cp2. Let us suppose that there exists an a > 0 such that uB1 - t Y 2 or what is equivalent (8.4)
B1Y2
=
0
’0.
Then we can choose C > 0 and thus B > 0. Note now that Bo is not positive definite in t l , t 2and x but only positive definite in t1 and x.
53
4.
DIRECT CONTROLS. LINEARIZATION, A N D MULTIPLE FEEDBACK
c2,
Since y2 # 0 (8.4) the term @(a) makes V positive definite in (,, x and moreover V + cc as IlxII + Itl[+ It2[-+ x . We cannot achieve as much for - v. Let us, however, endeavor to make it a positive definite quadratic form in x and cp. This merely requires the inequality (Fd)or (8.5)
/jly,
+ P 2 y 2 + c‘h > d‘C- Id.
<,,
t2,x, u but - > 0 This only guarantees that - 2 0 in the space of in the space X, u. At this point we have recourse to the theorem of LaSalle (IX, 4.8). In our case = 0 if, and only if, x = 0, u = 0. This implies that if a solution of (8.3) is to remain in the set where = 0 it must satisfy
+ Y252. But the general solution to the above equation is t l = 6,, 5, = 6,t + 6,, 0 = y16, + ~ , ( 6 ~+t 6,) where 6, and b2 are arbitrary constants. Since lil
=
0,
li2
=
51,
0
= Y151
y2 # 0 this implies that 6, = S2 = 0. Thus the LaSalle conditions are satisfied and so absolute stability has been established under the condition C > 0, (8.4) and (8.5). It will be of some interest to compare the above conditions with the necessary conditions found in tj4. The function c’A;’b found in $4 is now
PlY2 -2,
+
PlYl
+ P2Y2
+
...
2
Thus we see that the conditions found in $4 are simply /J1y2 > 0 and plyl + P 2 y 2 2 0, and are compatible with (8.4) and (8.5). Indirect control whose matrix has zero as simple characteristic root. In accordance with the scheme of (33, this case may be identified with 1 and the one just considered. The indirect control is of dimension n has for “state” matrix diag(A,O) the state variables being x and t2.The preceding treatment : case O 2 may thus be directly applied here.
+
54
$9.
CONJUGATE PURE IMAGINARY CHARACTERISTIC ROOTS
99. Direct Control Whose Matrix Has a Pair of Conjugate Pure Imaginary Characteristic Roots (Kenneth Meyer) It is again convenient to take a system of dimension n coordinates the system will be represented by
(9.1)
+ 2. In suitable
f = AX - bq(o)
where the Greek letters are scalars with w > 0; x, b and c are n vectors and A is a real stable n x n matrix. Let A,, ... be as in the previous section. Thus A,
=
diag(io, - io, A),
b,’
=
(8,p, b’),
c,’ = (y,?, c’).
Choose as a Liapunov function
v=
+ x’Bx + @(o)
2u5g
where u will be specified in a moment and u will be positive. Here then
B,
=
diag(u, u, B).
From the Liapunov relation A,*B,
+ BOA, = -c,
one finds at once
C,
= diag(0, 0, C),
-C =
A’B + BA,
55
4.
DIRECT CONTROLS, LINEARIZATION, AND MULTIPLE FEEDBACK
Hence
-V
+ 2do'xOcp + co'bocp' = X'CX + 2 ( ( 4 - 3iwy)t; + (a/?+ $ion[+ d'x)cp + (By + pp + c'b)cp2. =
xO'COXO
Let us assume that there is a positive a such that (a13 - $my) = 0 or what is equivalent
These conditions may clearly be satisfied. We also choose C > 0 hence also B > 0, and as a consequence I/ is positive definite in x, t;, (,c for every admissible cp and moreover V - r 00 with llxll + 151 + Aiming again toward the theorem of LaSalle we require - V to be positive definite in x and cp. This merely requires here
[GI.
By
(9.3)
+ bT + c'b > d'C-'d.
Under the condition (9.3) -V is a positive definite quadratic form in x and cp. Thus - V = 0 if, and only if, x = 0, c = 0. Thus if a solution of (9.1) is to remain in the set where = 0 it must satisfy the equations
[ = iot;, f
= - iot,
yt;
+ y[
=0
That is, there must exist a constant 6 such that yde'"'
+ y&-'"'
= 0.
Since the two exponentials are linearly independent y6 = 0 and since y # 0,6 = 0. Thus LaSalle's conditions are satisfied and absolute stability is assured by C > 0, (9.2) and (9.3).
$10. Multiple Feedbacks In various practical situations one may have a control depending on several parameters. This case has already been dealt with by Letov [l, Chapter IV] and Popov [4]. 56
$10.
MULTIPLE FEEDBACKS
To conform with our general notations designate the analog
by the r vector
By analogy with the earlier situation, the following conditions are imposed upon the vector function f ( v ):
I. f ( v ) is continuous;
11. fh(v)’ u h > 0 if o h # 0, andfh(o) = 0 if Uh = 0 ; 111. the integral j f ( v ) d v along any ray from the origin v
divergent.
=0
is
This last condition plays the same role as its analog: to make all solutions tend to the origin as t + + 03. An example of a function such as f ( o ) (but not the only example) is one in which every fh = fh(uh) (scalar function) where fk satisfies the conditions of I, namely,
I.
is continuous for all oh; > 0 for uh # 0 ; fh(0)= 0 ; HI. j’ mfh(vh) doh is divergent. 11.
fh(vh)
vhfh(vh)
However, for instance I(v(lf(v),( f ( v )as just defined), is likewise a function of the general class. We have now the possibility of associating this new situation with indirect and direct controls. Indirect control. The system will be X
(10.1)
= AX
ti = f ( v )
u
=
HX
+ GU + Ru,
where A is our usual stable n x n matrix and G, H , R are constant n x r, r x n and r x r matrices. We proceed in full analogy with the earlier case. In the first place the origin will be the only critical point if, and only if, the determinant ( 10.2)
:,I 57
4.
DIRECT CONTROLS, LINEARIZATION, A N D MULTIPLE FEEDBACK
This is assumed henceforth. We are then justified in applying the change of coordinates ( x ,u) -+ ( y , u ) defined by y = A x + Gu, u = H x + Ru. The new system is (10.3)
The goal is still to make the system absolutely stable: asymptotically stable in the large and regardless of the choice of the functionf(u) within its class. The same theorem of Liapunov, with Barbashin-Krassovskii complement is to be applied. First define
@(u)
=
sa
f ( 0 ) .du
where the integral is taken along the ray Lfrom the origin to the point u. That is, if, say s is a parameter along the ray, with value s at u then
Sincef,(u) always has the sign of u,, (except that f h = 0 for uh = 0) we see that @(u), for u # 0, is a sum of positive terms if u # 0, and only zero for u = 0. Thus @(u) > 0 for every u # 0. It is, of course, continuous in u, and + + 00 with IIuIJ as the latter -+ 00. Choose now (10.4)
V Y , 0) = Y’BY
+Wu)
where B is selected as before: one takes an arbitrary n x n matrix C > 0 and defines B as the unique solution of Liapunov’s equation A’B
+ B A = -C.
One finds now (10.5)
- P(Y, u) = Y’CY - .f’(u)Rf(u) - f ’ ( W Y - Y ‘ K f
K
=
G’B
+ )H.
In writing this relation we have utilized the property that since f ’ H y = y ’ H f , we have
f ’ H y = 3f ’ H y + y’H’f).
58
9 10. MULTIPLE
FEEDBACKS
Since f ' Rf = f 'R'J we observe at all events that the positive definiteness of - implies that of - f ' R f = -f'(*(R + R'))f;and hence that
v
(10.6)
+ R' < 0.
R
Basically, however, the positive definiteness of - v is equivalent to the classical Hurwitz conditions for the positive definiteness of the quadratic form in (10.5) in the variables y,f: If we set K = (kij), R = (rij) and define
K,
=
A,
=
.,"-I
(kij);
R,
=
(rij);
i,j
S
s,
-Ks
-R]'
then the Sylvester conditions not yet fulfilled are (10.7)
A,>O,
A , > O ,..., A , > O .
These are the conditions which correspond to the unique inequality (Fi)for r = 1. Of course, the inequalities (10.7) imply (10.6). Indeed if one reverses the orders of the variables from y,.f to f ; y one obtains, from Sylvester's inequalities, as first conditions for the positive definiteness of - those which express that f ' ( g R + R'))f is positive definite.
Direct control. This time the system is
+ Gu
X
=
u
=fb)
AX
v = Hx,
or with u eliminated (10.8)
X
=
AX
u
=
Hx.
+ Gf(v)
It may also be written as a single vector equation X =
AX
+ Gf(Hx).
The origin will be the only critical point provided that, as assumed henceforth, A x + G f ( H x ) = 0 has x = 0 as its sole solution. One may conveniently complete (10.8) with i, = H i = H A x
+ HG(u) 59
4.
DIRECT CONTROLS, LINEARIZATION, A N D MULTIPLE FEEDBACK
or setting H A = H,, H G
=
fi
R,, by =
H,x
+ R,f(u).
Take now as before V ( X )= X’BX
+ @(u).
With B, C related as before, one obtains - V(X)= X’CX - f ’ ( ~ ) R o f ( ~-) f ’ K 0 x -
K O = G‘B
+ iH,.
~’Kof
Set now
K O = (k;);
R,
(r;);
Kso = (kij);
Kso = (k;);
i , j 5 s;
R,’ = (r:); Aso =
=
C
-Kso
-K,O’
-Rs
The difficulty found in $1 occurs here also. It is settled again by reference to LaSalle’s theorem (IX, 4.8) and the sufficient conditions for absolute stability are A , = 0 , A 2 = 0 ,..., A.,=O.
60
Chapter
5
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER In practice one has often to deal with fundamental systems composed of several equations of any order. In theory this offers no novelty since by introducing more variables such a system may be reduced to the standard type of a number of equations of order unity. Practically however it is decidedly advantageous to be able to deal with these systems as one finds them and not after a more or less artificial reduction to another type. Our purpose in the present chapter is to examine some of the problems without undue change of type. This does not mean, of course, that we forego the theoretical convenience of the reduction to the standard set of equations of order one, but only that we will endeavor as much as possible to deal with the equations as they stand.
$1. Generalities The variety of the systems that may arise is virtually endless. For the sake of orientation we discuss rapidly a fundamental system consisting of a single equation of order n with constant coefficients: (1.1)
t+")
+ a,rfn- + ... + a,q ')
=
0. 61
5.
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER
Introduce the new variables xl, x2,..., x, components of a vector x, and defined by x 1 = rl, x2 = ti,..., x, = q'"- l'.As a consequence the unique equation (1.1) becomes equivalent to the system x, = x2,
(1.2)
x, =
-a,x,
1, = x j,..., in1 = x, -
CL~X,-,
-
... - %lX
1
with coefficient matrix 0
1
0
0
.
.
1
The characteristic equation is
IrE - A J = r"
(1.3)
+ a,r"-' + *.. + u, = 0.
As before A is assumed to be stable. Part of our problem is that we will have to deal directly with the matrix A , that is with the coordinates x, since they have a particular significance for the problem under consideration. Or more accurately, one will be free to change coordinates provided that the system (1.2) retains its form and the matrix A is unchanged. This will become clear in a moment. This is as good a place as any to make a few rapid observations regarding linear systems. After that we turn our attention to indirect, then to direct controls.
52. A Digression on Linear Systems There has been developed around linear systems a widely used technique based upon the simple and well-known device of designating time differentiation by an operator, usually D.Thus x is written Dx,X is 62
$2.
A DIGRESSION O N LINEAR SYSTEMS
written D'x, etc. As an example the standard indirect control system assumes the form ( D - A)x Da
=
-5b
= cp(o)
a = c'x - p 5 .
If one has a nonlinear characteristic cp(a), this notation offers little advantage. Suppose, however, that the characteristic cp is linear, or say that we limit cp to a sufficiently small neighborhood of the origin to make it reasonable to linearize cp, i.e., to replace it by a linear approximation. Let this also be combined with a fundamental system such as (1.1) consisting of a single nth order equation. The resulting system assumes then the form (D"
+ alDn-' + ... + a,)q
=
5
where the a,, and yk are constants. If we set g(D) = D"
+ a1Dn-' + ... + a,
then the system assumes the form
Now the operators g(D) form what is known in algebra as a ring ofpolynomials and one may apply to such a collection the usual operations of rational algebra : addition, substraction and multiplication (but not division). As a consequence one may solve the system (2.1) in the usual way and obtain the relation
{(D+ P)@)
-
W))V
=
0.
The bracket will be denoted by k(D),sothat one has to deal with the equation (2.2)
k(D)q = 0. 63
5.
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER
The characteristic equation is
k(A) = 0.
(2.3)
The function l/k(s) is the well-known transferfunction of the linear theory. Let rn be the degree of k ( l ) and let the roots IZ1,...,Am of k(A) all be distinct. The general solution of (2.2) is then
+ ... + Cme'mr.
q(t) = Cle'tr
Even if the Ah are not all distinct n.a.s.c. that q(t) and all its derivatives +O as t + + co is simply that every Re A h < 0. This is the full solution of the absolute stability problem in the present instance. A more complicated situation would correspond to e.g., r parameters ql,..., qr (fundamental parameters) and r + 1 equations k
gjk(D)qk
Let
- h,iiD)C
0' = 1, 2,...,r + 1).
=0
A(D) = JgjXD),...,gjr(D), hi(D)I*
The q's and also
< are solutions of A@)[ = 0.
The characteristic equation is again A(A)
(2.4)
=
0,
and the stability condition: the q's, 5 and all their derivatives 4 0 as t + + 00 is again : the real parts of all the roots of (2.4) must be negative.
$3. Indirect Control Taking (1.1) as the fundamental (state) system, the indirect control system has naturally the form $1
+ qq(n-1) +
ri = d o )
(3.1)
d =
64
ylq'"-"
**a
+ m,q
=
5
+ y z q ( n - 2 ) + ..* + ynq - pc.
$3.
INDIRECT CONTROL
Such a system may arise under the following conditions. Consider a system S made up of a chain of subsystems S1,..., Sn where Sh depends upon the variable q h and Sh acts upon Sh+ in accordance with a relation h
gh(D)?h = ?h+1,
1, Z Y . . ~n - 1
=
except that at the last step there appears the control variable gfl(Wfl
=
r.
Here gh(D) is a polynomial with constant coefficients in the differential parameter D. One may easily realize such a scheme in which for instance the gh(D) are linear or quadratic. By elimination and setting q 1 = q, one obtains the first equation (3.1): g1(D)gz(D)..*gn(D)q =
5.
Returning to the system (3.1) observe that here as in (11, $2) and for the same reason it is advantageous to change from the variables q,t to new variables c,a so that a becomes one of the basic variables of the system. This change is defined by tj = c and the new system is
p)+
(3.2)
=
+ ... +
ylc(n-l)
Cc r,
= Cp(0)
+ ... + Y n l - PCP(O).
We must prove however the equivalence of the (q,5 ) and in the sense that the conditions (4i,..., 4 - l), 5 )
+
(c, a) systems
0
(C, [)...)p-I), a)-+ 0 are equivalent. For that purpose it is convenient to use the equivalent vector forms e.g. x, and y, B. They are
<
f
=
AX - b5
tj = d o ) B
where
= c'x - p r
65
5.
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER
If we set Y , = C, Y , = Z,...,yn
=
C
(n-
1)
then the transformation that we are considering is given as in (11, $2) by I = y = AX - t b a
= c’x
- pt.
The determinant of the transformation is
A c’
= y,, - pa,,
-p
and so we require
+
(3.3)
Y n - Pan 0. This inequality is assumed henceforth. The y, a system is the standard
3 = A y - q(a)b
(3.4)
6 = c’y - pcp(a). Hence the
(C,a) system is C(”) + a l P 1 )+ ... + a,,[
=
q(a)
6 = y , p - l )+ ... + Y n C - P P ( ~ ) The problem of Lurie. It may now be phrased for the system (3.4). Its interpretation for the C,a system is this: to find n.a.s.c. under which if [(t),a(t) is any solution of (3.5) then C(t),[ ( t ) ,...,C(”- ‘)(t), a@) all -0 as t -+ + co and this regardless of the choice of an admissible characteristic cp(0).
This is a case where the theory of the problem is best attacked for the y, a system (3.4). It is practically a matter of repeating the argument of (II,2) with the present definitions of A , B, C . Taking then
V Y , a) = Y’BY
+ @(a)
one obtains
-v 66
= y’Cy
+ pcp2(a)+ 2q(a)d’y
$4.
INDIRECT CONTROL: AN EXAMPLE
where C is arbitrary >O, B is the unique solution of A'B
+ BA = -C
and the vector d is given explicitly in terms of the actual b, c by
One may now write the sufficiency condition (Fi) of (II,2) for absolute stability p > dC-'d.
(Fi)
One will merely recall that (3.4) is a consequence of (Fi)(see 111, 9) so that C > 0 and (Fi)remain as the only required inequalities. The remaining considerations regarding an indirect control are the same as before. One must bear in mind, in applying the optimal inequality (111, $2) that the coordinate vector y hence also the vectors b, c are not necessarily the same as the initial vectors y, b, c.
$4. Indirect Control: An Example We will borrow an example from Letov [l, Chapt. 11, 55J-which Letov refers to as the second Bulgakov problem. Using at first the same designation for the constants as Letov, the system is (a) T2ij + Utj
+ k q = T25 t = cp(4
(4.1) (b)
0
= uv
1 + Etj + G2ij - -5. 1
Here T 2 and G2 are inertial constants, U and E are dissipative constants and k is a restoring force. At all events U and k are positive.
67 \
5.
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER
One may change at once the expression of a, using (4.la) to
(4.2)
I
p = - - G2.
1 We thus see at once that absolute stability will require that p > 0 and hence that 1 > G2. 1
-
(4.3)
We may now change the system to the standard form (4.4) with the following values of the constants a1
U
=p
u2
G2U Y ~ = E - T2 '
(4.5)
=
k
T2 ~
~
kG2 = T
a
-
~
The matrix A is
and the characteristic equation is s2
+ u,s + u2 = 0.
Since ulru2 are positive, the roots have their real parts negative: A is stable. Let the matrices B, C be given by
68
$4.
INDIRECT CONTROL: AN EXAMPLE
The conditions that C > 0 are (4.6)
> 0,
p
The relation A'B
i
+ BA = - C
pr - q 2 > 0.
yields here
-2u2qo7
Po - u1qo - u 2 r 0 )
2(qo - %ro)
P o - u1qo - u2r07
or (4.7)
P
=
2u2q0,
4
=
u1qo
+ u2ro
- _ -
i"
4)
4 r
- Po
r = 2(u,r0 - qo).
We also know from general theory that with A stable, C > 0 implies B > 0. Choose as sufficient conditions for stability in the large, the vector relation d = 0 (111,3.2). Here
-d
=
+ b2
i, + 40
+y).
The determinant of the linear system (4.7) in po,qo,ro is 4u1u2 > 0. Hence (as we know already) there is a unique solution for po,qo,ro. We only need qo and ro and we find P qo = --, 2a2
r ro = 2a1
P + 2u,a, I-
Therefore
Hence d (4.81
=
0 yields the relations
+ u,y, = 0, p + ru2 + u1u2y1= 0. p
69
5.
SYSTEMS REPRESENTED BY A SET OF EQUATIONS OF HIGHER ORDER
The first relation already yields y 2 < 0. Moreover (4.6) implies r > 0 and hence the second of (4.8) yields also y1 < 0. Thus both control constants yl, y 2 must be negative. Upon eliminating p between the two relations of (4.8) there follows r = y 2 - a 1 y l . Since r > 0 one must have y 2 > a l y l . Once this is satisfied one takes p
=
-a2727
= 7 2 - a1y1,
141 =
Jpr.
Thus, both y1,y2 negative and y 2 > a1y2 are sufficient conditions for absolute stability of the system. In terms of the initial constant, the sufficient conditions are G 2u E<-
TZ ' $5, Direct Control
The natural equation of a direct control is +
q(n) + a l q ( n - l )
... + unq
< = cp(4 = y,q + + ylq'"-"
(5.1)
0
=
<
***
or in the form of a single equation (5.2)
q(")
+ a l q ( " - l )+ ... + a,,q = cp(y,q + ... + ylq("-')).
However, if one introduces the auxiliary variables x,,, with x 1 = q we obtain the equivalent system x, = X I , ..., x,x,, = -a,xl
(5.3)
0
=
ynxl
From this we draw
6- = Y , X ~
+
~
-yl(a,xl
or
-
= x,
a , - 1 x 2 - ... - u l x , - cp(a)
+ ... t ylxn.
+ ..* + + a , - 1 x 2 + ... + a l x , ~
670
1
-
1
~
3~ Z X .
= c'x - pcp(0)
~ ( 0 ) )
$5.
DIRECT CONTROL
where C'
=
(-ylctn,
yn-
y2 - ylul).
- ylan-
Also here b'
=
(0,..., 0, - l),
P =
-Y1.
Hence bln
- hJl%
+ %Yn - Y l U n - 1 ) b3n + %(Yn-l - 7 1 % - 2 1 b2n
. . .
bnn
+ !it72
-
Y l d
Here we have once more the difficulty of (IV, $1). It is settled again by reference to LaSalle's theorem (IX, 4.8) leading to the sufficient condition for absolute stability =
dC-'d.
71
Chapter
6
DISCONTINUOUS CHARACTERISTICS Discontinuous characteristics have already been mentioned in the first chapter. (See Fig. 5.) While they clearly escape our general theory, they happen to be of great importance in the applications, and this justifies our discussing them here.
$1. Continuous Approximation of Discontinuous Characteristics If one takes a strictly practical point of view, one will decide that since in nature instantaneous action is not possible, it is reasonable to replace the two characteristics (a) and (b) of Fig. 5 by those indicated in Fig. 6. For the type (a) the approximation behaves in accordance with (I, $1). However, for the type (b) property I1 of (I, $1): ocp(o) > 0 for o # 0, is violated and so further considerations are required. We discuss separately indirect and direct controls. Indirect control. Let the system be (11, 2.3), referred to the coordinates y,o. In endeavoring to apply our general method, we meet with the difficulty that V(y,a), formed as before, is not positive definite since it vanishes for y = 0, a < o < p. The best way to deal with this difficulty is by a generalization of Liapunov’s theory due to Zubov. All that is 72
$2. DIRECT DISCUSSION OF DISCONTINUITIES
FIG.5
required is the generalization of asymptotic stability. Now in Zubov’s book [l, p. 481, the appropriate result is found. Essentially Zubov shows that in the Liapunov treatment one may replace the isolated critical point (the origin) by any closed invariant set M . We recall that such a set (open or closed) is just a collection of complete paths. That is, if p is any point of M , then the complete path (forward and backward) of p lies in M. In our case the segment a 5 0 5 b, y = 0, is an invariant set, since each point is a critical point of the system. Thus one may apply directly the Liapunov results on asymptotic stability. In short, all our developments for an indirect control are valid save that absolute stability is to be interpreted in the “practical sense” that all solutions (y(t), ~ ( t )tend ) to (0, p) where p is some point of the (open) interval a, of Fig. 6b. This is as far as one may go.
Direct control. Take the equation (IV, 1.1) for the system. This time one verifies at once that the function V ( x )of (IV, $1) is still positive definite and so there is no difficulty in connection with this function.
$2. Direct Discussion of Discontinuities We shall now attack directly the problem of discontinuities. One might be inclined to the facile view that it suffices to take the
73
6.
DISCONTINUOUS CHARACTERISTICS
H
1
FIG.I
situation just considered and “pass to the limit.” Unfortunately this does not happen to be the case. As a preliminary step it is necessary to say something about discontinuities in differential equations in general. In this we follow in substance Andrk and Seibert [13. Let H be a hyperplane ((n - 1) subspace) of the n space of x. One may select the coordinate system so that H is the subspace x1 = 0. What we propose to do is to define an n vector system (2.1)
i= X ( x )
discontinuous across H. Consider a narrow layer L (region) of our space, with center H, defined by lxll < 6,6 > 0 and small (Fig. 7). Let Si,i = 1,2 be the two regions of n space separated by H , that is the regions S,: x 1 > 0 and S , : x1 < 0 and let Ri, i = 1,2, be the region consisting of Si plus L. Thus R , is the region x 1 > -6, and R, the region x 1 < 6 (Fig. 8).
FIG.8
14
$2.
DIRECT DISCUSSION OF DISCONTINUITIES
Consider now two distinct n vector systems
(2.2)
f =Xi(x),
i = 1,2
where X i is of class C' in Ri.Define X as coinciding with X iin the set Si = Si plus H. The resulting system (2.1) will (generally) have the hyper-
plane H as discontinuity. The procedure just followed offers the great advantage that the solutions of (2.1) in Riare naturally extended up to all points of H without requiring any "limit" complications which are really foreign to the main theme. Furthermore, it is well known that continuity over a closed set such as Simust actually be defined over a slightly larger open set, here the region Ri. It is pertinent to observe that our mode of definition does correspond to that occurring in the applications where an inevitable amount of inertia will carry a path reaching the hyperplane H slightly beyond H. We will assume that neither of the systems (2.2) has a critical point on H,that is that X '(x) # 0, i = 1,2 on the hyperplane. Regarding any path y of (2.1) what is of interest is its behavior on Si= Si plus H, and not actually throughout Ri. Concerning y in Si we restrict ourselves to the following possibilities as to its approach to a point xo of H: (a) y + xo with increasing t and as t tends to a certain value to, and without tangency to H at xo. In standard terminology xo = x(to -). (b) The same but xo = x(to +) (y tends to xo with decreasing t). (c) y -,xo and is tangent to H at xo in Si. (d) Finally, of practical importance is the case when one of the two systems has a path through xo running in the hyperplane H itself. While the cases just described cover all the situations envisaged below, our general scheme may readily take care of more complicated
FIG.9
75
Gyp 6.
DISCONTINUOUS CHARACTERISTICS
I
FIG.10
FIG. I I
situations. Notice at all events that one may well replace the hyperplane H by a manifold M of some sort. One refers to the hyperplane H or manifold M as switching hyperplane or manifold for the system (1.1). Still confining the discussion to H upon combining what happens on both sides of H one will find for xo the following possibilities: (1) Of the two arcs tending to xo on S , and S , the one, say in S , + xo, and the other in S , leaves xo (Fig. 10). We consider then the path and solution in S , as the extension of the solution in S , beyond xo. A point such as xo is known as a transition point. (2) The arcs on both sides of xo tend to the point xo (Fig. 11). We have then an end point. The solution from either side cannot be extended beyond xo. (3) The arcs on both sides leave the point xo (Fig. 12). We have then a starting point. Here again there is no possible extension across the point xo.
FIG.12
76
$2.
DIRECT DISCUSSION OF DISCONTINUITIES
FIG. 13
(4) On one side an arc is tangent to the switching hyperplane as indicated in Fig. 13 or else rests upon the hyperplane at both ends as in Fig. 14. In both cases we will refer to the point as mixed. (5) The system (2.1), gives rise to an arc y1 in S, tending to xo and the system (2.1)z has an arc y 2 in H leaving xo (Fig. 15), or else yi tends to xo in H and y z leaves xo in S , (arrows of Fig. 15 reversed). As a variant, not required later, both y 1 , y 2 tend to or both leave xo. These situations really correspond to the cases 1,2,3 with one of the arcs ending at xo situated in the hyperplane H. (6) Still another highly interesting situation, as yet but little considered for dimensions above two will only be described in the plane. It corresponds to several switching arcs issued from the same point, say the origin. Let there be s switching arcs I-,,..., Tsand let u h be the sector bounded by r h and r h + (rS+= r,).There are defined s systems
,,
(2.3)
ii
,
= X?(X1, xz),
h = 1,2,...) s
where for the system (2.3), the functions X?,Xzh are of class C' in an
FIG. 14
FIG.15
77
6.
DISCONTINUOUS CHARACTERISTICS
/
/
1'
I
I I
I
/--. , 0
\ '\
I
open region V, which contain the sector uh together with r hand r h + l , and has no critical point in V,. Moreover rh is a path of (2.3), tending to the origin with increasing t. A path y of (2.3), will run, e.g., for increasing t till it meets rh+l at a point xo, and will continue beyond x o with the arc of rh+l from x o to the origin. The effect of this situation is to make the origin reached by all paths in finite time (Fig. 16). The differential equation
xi = X i ( x l ,x2),
i = 1,2
for this situation is defined by X i = X p on uh plus rh and it is (generally) discontinuous along the switching arcs rh. Returning to the simpler cases 1,2,3, one may localize in H the different types of points as follows. At the point xo of H the arc y in R, will tend to or away from x o accordingly as Xli(O,xZo,..., x,') < 0 or >O. Hence, excluding the points where this quantity vanishes we will have : transition point starting or end point
i
wherever the product
X ' ( 0 , x2 ...., x,)X2(O,x* )...,x,)
>O
1 4 The inequalities characterize then the two types of sets in H : transition sets and starting or end point sets.
78
$3.
SOME EXAMPLES
93. Some Examples The examples of discontinuity to be discussed, of systems of dimension one and two, mostly with discontinuities of the type of Fig. 5, will serve to obtain some insight into the complications that may arise. In all the examples one may take as sets R , the whole plane itself, so that no further mention of these sets will be required. As our first example, take the indirect control system of (I, $3): represented by the system (I, 3.5) or
where now all letters denote scalars. As for cp we take the type of Fig. 5a, explicitly M,
do) =
-M,
a>O
a<0.
We also assume k > 0. In the present instance one may integrate. As above, let S, represent the half plane o > 0 and S, the half plane t~ < 0. The system for S, is
(34
j=-ky+M ,
c?=~y-pM.
The solutions may actually be defined for S, plus the line o axis). We may write
=
0 (the y
which integrates as y
=
M ae-kt + T .
This yields
79
6.
DISCONTINUOUS CHARACTERISTICS
* Y
FIG.17
which integrates as
The solution reaches
=
0 for t = 0 if /3 = cu/k. Thus
which must be taken only in the half plane In the lower half plane the system is )’= - k y - M ,
2 0.
b=cy+pM
FIG.18
80
B
$3.
SOME EXAMPLES
with solutions
We are interested in the behavior of the paths on both sides, but near to the y axis. Here several cases must be distinguished according to the values of certain constants. I. pk > c > 0. The general situation is described by Fig. 17. The points of the arc lyl < p(M/c) of the y axis are end points and the ends of this arc, the points y = &p(M/c),are transition points. 11. c > pk > 0. The general description is indicated in Fig. 18. The same arc as before consists of end points. 111. c < 0. Set c = -c0. Then if co < pk the situation is the symmetric of Fig. 18 with respect to the y axis while if co > pk, it is the symmetric of Fig. 17 relative to the same axis. An entirely analogous treatment may be applied to the more complicated characteristic of Fig. 5b. Let the system be (3.1) with a>a>O -a
< 0 < +a
o < -a.
I
FIG. 19
81
6.
DISCONTINUOUS CHARACTERISTICS
FIG.20
This time we have
Hence, the upper halves of our graphs will be valid for 0 > a and their lower halves for D < -u. In between the common system is The arcs represented by this system and corresponding to c > 0 are those in the region 1 ~ < 1 a of Figs. 19 and 20. They are actually parallel segments with slope da/dy = - k/c. The two figures correspond respectively to pc < k and pc > k, c > 0. For c < 0 one must take the symmetric of the two figures with respect to the y axis.
REMARK.Figures 17 and 18 show that as a control operation the scheme is satisfactory for (yl greater than the largest of M / k and p(M/c). To emphasize what has just been found, consider an example of a discontinuous direct control of order two. Let it be
82
$3.
SOME EXAMPLES
FIG.21
with dx1
+ x2) =
M for
(-M
Here the switching line is x1 the line. We have
+ x,
[
x1
+ x,
>0
x1
+ x,
< 0.
= 0. Consider first the system above
Tangency to the line will occur where -3(X, 4 x 1
M/3) = -1, -M)
-
x1 + x ,
=o
or -xl = x, = M . The behavior is shown in Fig. 21 above the line. The behavior below the switching line is just the symmetric with respect to the origin of the one just obtained. Between the points ( - M , M ) and ( M , - M ) all points are starting points. Outside they are transition points. The ends of the segment are mixed points.
83
6.
DISCONTINUOUS CHARACTERISTICS
+I
FIG.22
$4. Special Switching Lines It is evident that taking the locus CT = 0 as switching line has resulted in making the origin virtually inaccessible from nearby points. This could not be more opposed to the very purpose of a control. For some systems with discontinuous control one may construct switching lines so as to restore the stability of the origin. We borrow the example from Bushaw [l, p. 361. Let the system be (4.1)
(a) i = y ,
j = - x + M
(b) f = y ,
j =
-X
- M.
The two systems can be written respectively (X
M ) r = JJ,
(X
+ M ) y = y,
J’ = - ( x - M ) j = -(x
+ M)
whose paths are the circles of center (0, _+ M ) . Both are described clockwise. The switching line is the heavy line of Fig. 22. The system (4.la) prevails below the switching line and (4.lb) prevails above it. The immediate effect of the switching line is to make all paths tend to the origin inJinite time. A typical path is MNPQO of Fig. 22, with the end arc QO part of the switching line. 84
$5.
MULTIPLE FEEDBACK SWITCHING LINE
55. Multiple Feedback Switching Line The switching lines or spaces are equally interesting in the case of multiple feedbacks. We will merely illustrate this by an example more or less inspired by LaSalle [3]. The system is in two variables. (5.1)
The uj play the same role as thef, in (IV, lo), and just take the values f 1. Consider first u1 = u, = 1. The system is then
11
(5.2)
= -XI
+ 2,
12
=
- 2 ~ 2+ 6
which may also be written (x1
- 2 ) = -(xl
- 2),
(x, - 3) = -2( x,
-
3).
Hence the paths in the plane are those of
x, =
(5.3)
-XI,
1, = -2x,
shifted by (2,3). The paths of (4.3)are given by x 1 = me-', x, = or x 1 2= yx,, that is they are vertical parabolas. We take as one of the switching arcs (Fig. 23) the arc A 0 of the shifted parabola (vertex (2,3)) through the origin. For u 1 = - 1, u, = 1 the system is (xl
+ I ) = - ( X I + l),
(x, - 1) = -2(x,
- 1)
with parabolas of vertex ( - 1, 1) and we take as switching arc the arc OB of the parabola through the origin. The other two switching arcs
FIG.23
85
6. /
DISCONTINUOUS CHARACTERISTICS /
FIG.24
FIG.25
correspond to u 1 = -1, u2 = -1 and u 1 = 1, u2 = -1. They are the symmetrical OA‘, OB’ of the arcs OA, OB. The regions of operation of the systems corresponding to the four sets of values of u l , u2 are marked on the figure as (1, l),..., the path issued from a point M is M N O and it is clear that here again the origin is reached from every point of the plane in finite time.
§6. Complementary Remarks It is evident that the switching line scheme may be greatly expanded but we shall not attempt this here. What has been done suffices to show that the strict treatment of the discontinuous case brings in altogether new elements, for instance an entire segment of end points. Their presence contradicts entirely the notion that in spite of the discontinuity the origin remains a “nice” critical point. It is interesting to point out that various authors have endeavored to define a natural extension of the solutions about an end point beyond or along the switching line. In practice, one is apt to observe along an end point segment what is known as “chatter,” which occurs when the solutions get a little beyond the switching line and act as indicated in Fig. 24. Still another extension has been proposed recently by Filipov [l]. The application here is clearly described in Fig. 25 for the case n = 1 and a smooth switching curve. The vectors AB, AC are the velocities on both sides of the switching line L. The segment A B intersects the tangent to L at A at a point D and the vector A D represents the velocity vector in L for the extended motion. 86
Chapter
7
SOME RECENT RESULTS OF
V. M. POPOV Recently the Romanian applied mathematician, V. M. Popov, inspired perhaps by the classical Laplace transform methods of linear analysis, utilized with telling effect Fourier transforms in an attack on control problems (Popov [2]). This provided the impulsion for Yacubovich and then Kalman to return to more algebraic methods with noteworthy new results. In particular Kalman succeeded in completing Popov’s work in an important way. We shall deal partly with the work of Yacubovich and Kalman in the next chapter. The present chapter is devoted to a presentation of Popov’s very original first results.
$1. Generalities. The Theorems of Popov In his work Popov considers a system in the following form, slightly different from our standard form (11, $2): (1.1)
(a) i= A x - cp(a)b (b) rj = 440) (c) a = c’x - 75. 87
7. SOME
RECENT RESULTS OF V. M. POPOV
In this system all the letters have the same meaning as in (11, $2). In particular the matrix A is still stable. The scalar y is new. Existence and uniqueness of solutions are dealt with as they are at the end of the Introduction. The general purpose is still to find conditions for absolute stability (Lurie). A necessary condition for absolute stability is that x = 0, a = 0 be the only critical point, i.e., the only solution of AX - bq(a) = 0,
~ ( 0=)0.
Since q(a) = 0 when and only when a = 0, and since A - ’ exists this condition is actually fulfilled. Observe that if y = 0 then the system (1.1) is, for control purposes, the system i= A X - bq(c‘x), that is, it is a direct control. As we wish to discuss first indirect controls we assume for the present that y # 0. As a matter of fact: (1.2) Under the assumption that y # 0, a necessary condition for absolute stability is y > 0. Absolute stability requires that the system (1.1) be asymptotically stable for q(a) = pa, p > 0. That is, the system 2 = ( A - pbC’)X
f
= pdx
+ pbY(
- py(
must be asymptotically stable for p > 0. A paraphrase of the argument of (IV, $5) [case (a)] yields here simply that one must have y > 0. Since y # 0 the transformation (x, () -+ (x, a) defined by x = x, a = c’x - y( is nonsingular. It transforms the system (1.1) into the equivalent system (a) i= A x - q(a)b (1.3)
(b) 6 = C’AX- p ~ p ( 0 ) (c) p
=
+ c’b
which represents a standard indirect control such as (11, 2.3), and this without changing the system variable x. Thus (1.1) represents essentially a standard indirect control. 88
$1.
GENERALITIES. THE THEOREMS OF POPOV
Conversely there is a nonsingular transformation of coordinates reducing the standard indirect control system of (II,2.1) to the Popou type (1.1). Hence Popov's system represents a completely general indirect control. Write the fundamental system (IL2.1) as follows : f = AX - bo(
4 = cp(4 t s = crx - p ( .
Introduce the auxiliary variable z =x
- A-'bo(.
The transformation (x, () + ( z , () is manifestly nonsingular. Hence for all stability purposes the auxiliary (z, <) system will be fully equivalent to the fundamental system. The (z, 5 ) system is immediately found to be
i = AZ - A-'bocp(a)
4 = cp(4 c
= C'Z
- ( p - c'A - 'bo)(.
This is a Popov system with
b
=
-A-'bo,
y = p - C'A-lbO.
From this Popov system one may pass to the fundamental system (1.1) by the relations bo = -Ab, p = + c'b. REMARK.In the designations of (11, $2) we have shown (LaSalle) (111, $2) that p > c'A-'b. Since the previous c' is the present c'A-'b the same relation reads now y = p - c'b > 0. Thus one of the present conditions for absolute stability is the same as LaSalle's condition. Certain noteworthy expressions and their Fourier transforms must now be introduced. As in (IV, $4) set A , = zE - A so that IA,I = 0 is the characteristic equation of A. Since A has no pure complex characteristic roots IAi,l # 0 for all real w,hence A,' is defined for all such o. Recall that X ( t ) = exp(At) is the canonical solution of the matrix differential equation X = A X , that is, the solution such that X ( 0 ) = E . An important function occurring in the proof of Popov's first theorem is defined by v(t) = c'eA'b.
89
7.
SOME RECENT RESULTS OF V. M. POPOV
By definition the Fourier transform of a function u(t) is the integral (if it exists)
F(u)=
s,'
e-'%(t) dt
and similarly for vectors and matrices. In particular F(v)
=
lrn
exp( - A i m .t )d t . b
N(iw) = c' -c'
*
A,;'[exp(-A,,t)],'"b
=
c'A,;'b.
As this last expression will be of constant occurrence in the sequel we may note at once the general property:
(1.5) The expressions c'A,-'b, c'AG'b are invariant with respect to transformations of coordinates. In fact from (1.1) the transformation of coordinates x b + P-'b,
c ' + c'P,
A,
+ P-'A,P,
A;'
+ Px
yields
+ P-'Az-'P.
From this follows as asserted that c'A; ' b remains unchanged. In addition to N ( i o ) there will be required the function
(In $7 the function G(z) will be shown to be a "transfer" function). We may now state the two theorems of Popov: (1.6) First theorem of Popov. A suficient condition for the absolute stability of the system (1.1) is that for some nonnegative number q and all real w we have (1.7)
Re{(l
+ iwq)G(iw)}= Re{(l + iwq)(c'A,'b)} + qy 2 0.
The importance of this result is enhanced by (1.8) Popov's second theorem. I f the absolute stability of the system (1.1) may be determined by means of a Liapunov function V ( x , a ) of the form "quadratic in x, a plus /I& cp(a)do" then there exists a q 2 0 such that (1.7) holds.
To sum up, the first theorem asserts that the inequality (1.7) is a sufficient condition for absolute stability, while the second implies, in
$2.
PRELIMINARY PROPERTIES
Y
FIG.26
particular, that if our inequality (Fi) holds then (1.7) holds also. Thus (1.7) is broader. This is assuredly striking enough. The inequality (1.7) may be given, with Popov an interesting geometric interpretation. Set c'A,'~
+
Y
= S,(o)
+ iwS2(o)
1 0
i
where S1,S2 are real rational functions of o,in fact continuous for all o.Then (1.7) yields S , ( o ) - qoS,(o) 10. In the real x, y plane, consider the line L:
x-qy=o.
According to (1.7) the curve r: x = S,(o), y = oS2(o)has a tangent from the origin in quadrants 1 and 3 (Fig. 26) and is otherwise located below this tangent. The curve is of the type called unicursal in algebraic geometry.
$2. Preliminary Properties Before taking up the proofs of Popov's theorems it is convenient to discuss certain preliminary properties. Incidentally we will write systematically d t ) ,W) for rp(a(t)), @(o(t)). 91
7. SOME RECENT RESULTS OF V. M. POPOV
I. From (l.la) there follows the basic condition (Lefschetz [l, p. 721): (2.1)
~ ( t=) X ( t ) ~ o
Sb
X ( t - z)bq(z)dz,
where xo = x(O), and similarly later co = c(O), etc. From (2.1) and (1.1~)there follows (2.2)
o(t) =
c'w(t)x0-
Sbx(t
- z ) b q ( z ) d z )- y&)
~ (-t
T ) ~ ( Tdr ) -
~((t).
11. Let g(t) be a polynomial and let m be a positive constant. Let also h(t) = g(t) e-"".
Since b(t)l e-(m-k)',
0
is bounded for t 2 0 there is a positive constant C such that
Let h(t) be a finite sum of expressions g(t)e"',Re I < 0. If 0 < k < least - R e 1 then (2.3) holds also. We refer to h(t) as quasi exponential In particular since the matrix A is stable both I@) and b(t) are quasi exponential and so satisfy relations such as (2.3). We may in fact choose fixed C,k such that
111. If k > 0 andf(t) -1
For
92
+0
as t + co then likewise
$3.
PROOF OF POPOV'S FIRST THEOREM
and by the rule of 1'Hospital as t
-, + 0 0 :
lim h(t) = limlf(t)l
=
0.
IV. By convolution of two functionsf(t), g(t) is meant the operation
* defined by
f *g
=I'
f(r)g(t-
2 ) dr
0
and we note that f * g = g *f.With this notation one may write (2.1) and (2.2) as x
=
X(t)xo- X(t)*bq(t)
CT
=
c ' X ( t ) x , - v ( t ) * ~ ( t-) ~ ( ( t ) .
The basic properties are (2.5)
1'
Wfl
*fi) =
f ( t ) g ( t )dt =
-03
1'
-m
-Wfl)F(fA F(f)@(g) dw,
and if g is real (2.6)
/+O3/(t)g(t)dt = -m
-m
5 ( f ) @ ( g ) dw.
(For these properties consult Doetsch [l, pp. 157-1631 and Titchmarsh [I, P. 501.) Of course one must assume that the integrals just written do exist.
1;')
53. Proof of Popov's First Theorem Introduce the following functions (PT(t) =
for 0 5 t 5 T otherwise,
93
7.
SOME RECENT RESULTS OF V . M. POPOV
i(t)= c ' ( ~ ( t +) qX(t))x,. Now (2.2) yields
=
c'X(t)xo-
From this and (2.2) follows (3.1)
A(t) = a(t)
Sb
i(t - z)cp(z)dz
+ 4 4 t ) + yt(t) - i(t),
-
pcp(t).
0 5 t 5 T.
Another required property of A(t) is : (3.2) There exist positive constants k , C such that for t 2 T: (3.2a)
I,l(t)l < C e - k r .
Since both v(t) and i(t) are quasi exponential so is p(t). Hence for t 2 T a n d some k,C > 0
IA(t)l < C[ e-k"-r)IcpT(z)Idz
T
=
C e - k j ekrlcpT(z)ldz
0
0
which is (3.2a) with suitable C. Taking into account (3.2) or the quasi exponential property or continuity of the various functions integrated in the sequel, all our integrals will be meaningful-a property not emphasized later. Set now L(iw) = 9(/?), F(iw) = p ( ( P T ) .
By a basic property of Fourier integrals
9 ( t )= =
s,'"
e-iw'b(t)dt
=
i o N ( i o ) - c'b
ioiV(io) + y - p,
9 ( A ) = L(io)=
- {N(io)(l
+ i o q ) + qy}F(iw).
Define now the functions O(T) =
l ( t ) ( P T ( t ) dt. 0
94
$3.
PROOF OF POPOV’S FIRST THEOREM
(3.3) The inequality (1.7) (hypothesis of Theorem 1.6) implies that B(T) 5 0.
s
In fact referring to (2.6) + W
B(T) =
L(io)F(io) d o
-m
-1
+m
=
-m
+m
{N(iw)(l + iwq) + qy}F(io)F(io)d o IF(iw)12(N(iw)(1+ ioq)
+ qy) d o
- W
-1
s
=-
2n
IF(iw)12{ReN ( i o ) ( l + ioq) + q y } d o
+m -m
IF(io)12 Re(1
+ ioq)G(io)d o
As a consequence of (1.7) the last integral is 2 0 and so (3.3) follows.
Upon combining (3.1) and (3.3) there follows the important inequality
(3.4)
ST 0
a(t)cp(t) dt
+q
1
T
0
W)cp(t)dt + Y
T 0
t(t)cp(t)dt 5
1 T
0
Wcp(t)dt.
One must analyze now all these integrals. The left-hand side of (3.4):
s:
Since acp(o) > 0 for all u # 0, for all T : (3.5)
s:
a(t)cp(t)d t > 0.
Regarding the second integral (3.6)
&(t)cp(t)dt = @(T)- @(O).
Finally the third integral yields
95
7.
SOME RECENT RESULTS OF V. M. POPOV
A t the right-hand side of(3.4):
I’ 0
1
T
C(t)cp(t) dt =
0
i ( t ) r i ( t )dt
-1
‘7
= [C(t)m];
(If.
S(t)&) 0
Let t1= sup(t(t)(in 0 5 t 5 T Since [ is quasi exponential, and proportional to llxoI),if u = llxoII to,then
+
Since [ ( t )is quasi exponential of the same nature as
[(t)
Hence with C > 0 and independent of T :
Upon combining the inequalities (3.4), (3.6), (3.7), (3.8) there follows
1:
a@)&)
dt
+ q@(T)+ i t 2 ( T )5 Cut1 +,qO0+ +yu2.
Referring now to ( 1 . 1 ~the ) initial value oo of o(t) tends to zero with l(ol = u. Since @(a) is a continuous positive function of o and vanishes for CT = 0 one may take u so small that qQ0 < yu, where u is a given positive quantity. Hence the above inequality yields the following two special cases: llxoI(
+
Jl
(3.9)
a(t)cp(t) dt
< Cut,
+ $yu2 + yu.
Then if t1 = ((T,) and since we may take T @(T)2 0: (3.10)
f(ll)= t 1 2 - 2CU(,
-
y(u2
=
+24
T,, and also because 0.
+
Let l,’, tl”,be the roots o f f = 0. Since tl‘tl”= - ( u 2 20) S 0, the roots are of opposite sign and (3.10) requires that t, be between 96
93.
PROOF OF POPOV’S FIRST THEOREM
them. Since t1 > 0 it must be below the largest root or
t1 < c u
+ J(C’ + y)u2 + 2v.
Since this upper bound of It(t)l for 0 5 t 5 T is independent of T we conclude at once that for all t 2 0: (3.11)
It(t)l < c u
+ J(C’ + y)u’ + 2v.
Since t1is bounded we also have from (3.9) that a(t)cp(t)dt< co
(3.12)
that is: the integral is bounded. Note that the bounds in both cases depend solely upon the initial values x o and to. Going back to (2.1) we have X ( t - r ) b ~ (dr~ = )
Referring again to (2.1) the quasi-exponential property of X (t), X ( t ) and (3.11) we obtain (3.13)
IIx(t)ll < C(U
+ tl),
c > 0.
In view of (3.11) and (3.13) we conclude that given E > 0, one may select q > 0 such that if u < q then Ilx(t)ll 1t(t)l < E for all t 2 0. In other words the origin is stable for the system (1.1). To complete the proof of Popov’s first theorem we still need:
+
(3.14). Every solution (x(t),( ( t ) )of (1.1) tends to the origin x t + + 00, and this whatever the admissible function cp(a).
=
0, 5
=
0 as
As a preliminary step we require :
+
(3.15) Both o(t) and cp(t)+ 0 as t + co. The boundedness of IIx(t)ll and It(t)l, coupled with (1.1~)already show that a(t) is bounded. Since cp(a) is continuous and cp(0) = 0, cp(t)is 97
7.
SOME RECENT RESULTS OF V. M. POPOV
likewise bounded. From (1.3b) and the boundedness of (1x11 and q ( t ) follows that qt) is likewise bounded. Hence one may select an M > 0 such that la(t)l and Ib(f)l < M for all t 2 0. Suppose now that (3.15) does not hold for a.There exists then a S > 0, and a divergent positive sequence t , < t2 < ... such that la(tk)l > 6 for all k. We may actually assume that tk - tk-l > S/M, t , > S/2M and we will have la(t)l = la(tk)
+
s’
q?)dzl > la(t,)l
- /M(t-
fk)l
> 6 - )a
=
$6.
tk
Since q(a)is continuous and cp(0) = 0 if and only if a = 0, for M > 161 > T < tN+l, necessarily we will have Iq(a)l > R > 0. Hence if tN
with T. Since this contradicts (3.12), a ( t ) 4 0 and so does q(t).
PROOFOF (3.14). Since X ( t ) is quasi exponential in the sense that all its terms are, Ilx(t)ll -, 0 as a consequence of (2.1), of q ( t ) + 0 and of property 111 of $2. Referring now to (1.1~)since )Ix(t)I(+ 0, a(t) -, 0 and y # 0, l ( t ) also +O. This completes the proof of (3.14) and likewise of Popov’s first theorem.
$4. The Generalized Liapunov Function of Popov In his work Popov considered a more general Liapunov function than the one of the previous chapters. Before attacking the proof of Popov’s second theorem it will be profitable to discuss this generalization. Its general type is: quadratic in x, a plus j?@(a),i.e. :
+ on2 + a f ’ x + pD(0).
V(x, a) = x‘Bx
This function may immediately be put in the form
v = x’Bx + cr(a - c’x)2 + B@(a)+ o f ’ x .
Then
- V = X’CX
d + @p2 + 2d0‘xq + 2ayaq - (af’x) = dt
do = Bb - ($A’c
A’B
98
+ C~YC),
+ B A = -c.
$4. THE GENERALIZED LlAPUNOV FUNCTION OF POPOV
The E method. Upon making a substitution x + cPx,a + E%, cp + f c p , q r even (to preserve ocp > 0), V or V is turned into a polynomial P(E).Let As” be its lowest degree term. We shall then write P A AE”, and P will have the sign of AE“ for E small. Now
+
d dt
-(~f‘x)
=
+
(x’A’c - p ~ p ) f ’ ~a f ’ ( A ~ qb).
Hence the substitution x
a + a,
+ EX,
yields iff # 0 :
VA
cp
+ E’cp
-&Of‘AX.
Since A is stable it is nonsingular. Hence iff # 0 likewise o f ’ A x # 0 for x arbitrary and a # 0. Hence the sign of V for E small is then that of ~ a f ’ A xi.e. , it may be + or - according to the sign of E. Since this is ruled out necessarily f = 0. Thus finally (with repetition and to keep everything together) (4.1)
(4.2)
(4.3) (4.4)
+ a(a - c’x)’ + p@(a), + /3pcp2 + 2do‘cpx + 2ayacp,
V = x‘Bx
-V
= X‘CX
A‘B + BA =
-c, do = Bb - (;/3A’c + ayc).
(4.5) If absolute stability is to be determined by V of (4.1) positive deJinite one requires that u 2 0, p 2 0, a /3 > 0.
+
Let a # 0. The substitution x + E X , CT + a,cp + c3cp yields V A aa’, hence a > 0. Let p # 0. Then the substitution x + E ’ X , a + 8’0, cp + cp yields V A E ’ ~ @ ( oand ) , since @ > 0 one must have p > 0. If both a, p = 0, V = 0 for x = 0 and a # 0, hence a + /3 > 0. To sum up we are left with the following two types of Liapunov functions : The Lurie-Postnikov function. (4.6)
V = x’Bx
+ /3@(a)
7.
with
SOME RECENT RESULTS OF V. M. POPOV
+ ppcp2 + 2d’cpx Bb - + / ~ A ‘ c ; A‘B + BA = - C .
- V = X’CX
d
3
The Popov function. V = X‘BX + ~ ( a c’x)’ (4.7) (4.8)
+ pCD(0)
+ Bp’p2 + 2d0’xcp + 2uyocp = Bb - (#A’c + U Y C ) , A’B + BA = - C ,
- V = X’CX
do together with property (4.5).
$5. Proof of Popov’s Second Theorem Let us modify, with Kalman, Popov’s expression in (1.7) through replacing (1 ioq) for some 4 2 0 by 2uy imp for some nonnegative pair u , p such that u + p > 0 (a and fi do not vanish simultaneously). For u # 0 the two expressions are clearly equivalent as far as (1.7) goes. However the new expression for u = 0 corresponds to q very great in (1.7). The modified Popov inequality is :
+
+
(5.1)
P(u, p, w) = /3y
+ Re((2uy + iwp)c’A,’b}
20
for all real w and some pair u, p such that (5.la)
a20,
/320,
u+/l>O.
Thus Popov’s first theorem reads now: P 2 0, under (5.la) is a sufficient condition for absolute stability of the system (1.1). To prove Popov’s second theorem we must show that for the function I/ of (4.7), the double property V and - both positive definite for every admissible ‘p implies (1.7) for some q 2 0. We shall show, as does Popov, that it implies (5.1) with u, p the same constants as in (4.7). If a real bilinear form u’Fu > 0 for u # 0 we will write F > 0 as if it were a quadratic form. We will require presently the property: (5.2) Let u be allowed complex vector values. Then F > 0 and the property Re u*Fu > 0 for u # 0 are equivalent.
+
Here, if u = u1 iu, then Re u * Fu = ui‘Fu, + u,‘Fu, > 0 for u , , u2 not both zero if F > 0. Conversely Re u * Fu > 0 for u # 0 (complex) yields F > 0 for u real and #O.
$6.
COMPARISONS
Regarding the theorem itself, absolute stability has already led to the proof of (4.5) which is (5.la). Regarding (5.1) itself, begin with -V =
-2x'B(Ax - b q ) + 2aycp(uo -
Hence by (5.2):
C'X)
- P~[c'(Ax b q ) - yep] > 0.
Re{ - 2x * B(Ax - b q )
+ 2ayq(a - c'x)
This inequality is the starting point of the proof. By definition i o E = A + Aim. As already observed A,i' exists for all real w and so the preceding relation yields iwAG'b = A . A G ' b + b.
The hypothesis b = 0 is unrealistic since it means that the control is not operating. We assume then b # 0. As a consequence m(iw) = A i ' b # 0 for all w. Upon substituting in (5.3)x = - m , cp = puo, p > 0 and r~ = l / p ; the inequality must hold since it must hold for all complex x , real r~ and all admissible cp (in particular for cp = puo). Thus we find -2m*Biwm
+ B[(c'iwm) + y]
+ 2ay
Since m*Bm is real Re iwm*Bm
=
0. Hence
for all real o.Since the sum must be positive for arbitrarily large positive p, if a # 0 we must have P 2 0, while if a = 0 (and y # 0) we require P > 0, all this for all real o.Thus the Popov inequality (5.1) holds, with a, B the same constants as in V . This proves the theorem.
96. Comparisons It is interesting to compare what may be accomplished by our comparatively simple methods using the Liapunov-Popov function (4.7) and the earlier type (4.5). 101
7.
SOME RECENT RESULTS OF V . M. POPOV
From the function (4.5) one obtained the inequality of (11): p > d'C- 'd,
(Fi)
where in reference to the system (1.1) d = Bb
-
)A'c.
Passing now to the function (4.8) since GI 2 0 and acp > 0 for o # 0, we see that to have - positive definite it is sufficient that the quadratic form in x, cp: W
= X'CX
+ pcp2 + 2dO'cpx
be positive definite. The same reasoning as for the proof of (Fi)yields the inequality (6.1)
p > do'C- 'do.
Let for the present ccy
= u,
and set
$(u) = d0'C-l do do = d - U ~ C ,
2
=
=
Iu'
c'C- 'c,
+
- 2 p ~ V,
p
=
'
c'C- d,
'
v = d'C- d,
\
One must bear in mind that u must be 2 0. The right-hand side of (6.1), compared with that of (Fi)contains the additional parameter u and we may dispose of it to optimize the1 inequality (6.1). To be precise the least value of p afforded by (Fi)is ~
pm = d'C- Id,
while the least value pm* corresponding to (6.1) is to be obtained as the positive minimum of $(u) for u 2 0. One must then discuss this minimum. If one finds that pm > pm* the Liapunov-Popov function V will have been proved more advantageous, if not, the earlier function may as well be used. Observe now that c = 0 would mean that the feedback variable 0 is independent of the system variable x . Since this is entirely unrealistic and uninteresting, one may assume that the vector c # 0. Since C > 0, hence also C - ' > 0, the coefficient I > 0. It follows that the minimum of $(u) occurs for u = p / 2 . However this minimum is only admissible if p > 0. If p 5 0 the minimum of $(u), for u 2 0 occurs at u = 0. We discuss separately the two possibilities. 102
$6. COMPARISONS
(a) p > 0. The minimum $, of $(u) occurs for u value is p2
*m=v--=--
A
VA-
p2 -
= u, =
p / A and its
6
--
A
A
where 6 is the discriminant of the quadratic $(u). That is
6
=
(c'C- 1d)2- (c'C- 'c)(d'C-'4.
In order to have pm* = $,, the latter must be positive and so one must have 6 < 0. Thus if 6 > 0 one can only have pm* arbitrarily small. At all events since v = pm we have
Hence the Liapunov-Popov function V is certainly more advantageous if 6 0 and p > 0, that is c'C- 'd > 0. If 6 > 0 and d # 0 we have pm > 0 and pm* arbitrarily small, hence again the Liapunoy-Popov function is more advantageous. On the other hand whenev d = 0, one cannot claim any advantage for it. (b) p S 0. This ime u, = 0 and so pm = v = pm*. Hence no advantage is afforded by the Liapunov-Popov function.
-=
r
Application. Consider the standard example (111, $2) under the assumptions that b,c have no zero components, and also that C corresponds to optimum. In all cases for optimum
and
3.
C-'
=
diag(6, ...,
0
=
(A'c)'x - p<.
Recall that here
If one assumes that some hermitian forms and then p
=
Ah
are complex one must replace quadratic by
%c*C-'d
+ d*C-'c)
=
Rec*C-'d. 103
7. Hence since A
=
SOME RECENT RESULTS OF V. M. POPOV
diag(Al,..., A,,)
If this quantity is positive the Liapunov-Popov form will offer a definite advantage, otherwise this is doubtful. For instance if every Ah is real so that A h = - p h < 0, and if every product b,ch > 0 one finds p = 0 and one cannot affirm that the advantage rests with the Liapunov-Popov form.
$7. On the Function G(z) as Transfer Function Take the system (2.3) and apply to it the Laplace transformation. There results the relation, in which i,6,@, denote the Laplace transforms of x, cr, cp : (a) z 2
=
A 2 - b@(t)
(b) zo = C‘AX- &(t) From (7.1a) there follows i
=
- A ; ’ b @ and hence from (7.lb)
zu = -(c‘AA,-’b
(7.2)
+ p)g.
In view of A = zE - A, there follows zcr = -z(c’A;’b and finally
+ c‘b + p)@
Thus G(z)is the transfer function from -cp to cr. If we denote this transfer function by T ( z ) then Popov’s inequality (1.7) may be written Re((1
+ ioq)T(iw)}2 0.
This provides a link between this inequality and standard (linear) “transfer” technique. 104
$9.
CONCLUSION
$8. Direct Control The system to be dealt with is (1.1) but with y (8.1)
1 = A X - bq(o),
=
0, or
c = C’X.
Take a more general Vfunction than in (IV, $I), namely, V = X‘BX
(8.2)
+ PO(o)
hence with (8.3)
- P’ = X‘CX
A‘B
+ 2d’xq(a) + C’bq2(o)
+ BA = -C,
d
=
Bb - ~PPA’C.
Passing now to Popov’s theorems the proof of the first is directly applicable. Since y = 0 it reads: (8.4) Popou’s first theorem for a direct control. A suficient absolute stability condition is the inequality. (8.4a)
P(w) = Re((1
+ iwq)c‘Ai,’b} 2 0
for some q 2 0 and all real w.
As regards Popov’s second theorem we know from (IV, 91) that -Pis not positive definite in x and q.
59. Conclusion The special theoretical and even practical importance of Popov’s contribution is that in looking for absolute stability conditions it has made it possible to replace the search for a full matrix: B or C of(I1) by that of a mere number q 2 0 which may be determined from a graph (see $1). This applies also to the developments of the next chapter.
105
Chapter
8
SOME FURTHER RECENT CONTRIBUTIONS
In the present chapter we continue the treatment of absolute stability of the indirect control system of (VII, Ql), and notably still assume y > 0. Let P(a,fl,w) and V ( x , o ) be the Popov and Liapunov functions of (VII, 4.6, 4.7). By strengthening a theorem of Yacubovich [3] (our main lemma below) Kalman [2] succeeded in completing essentially Popov’s second theorem to a n.a.s.c., his “sufficiency” part however requiring a further strong restriction. Our proposed more modest task is to prove, following largely Kalman, that P > 0 plus a very simple restriction yield n.a.s.c. to have V and - 3 both positive definite and also absolute stability. While our treatment is thus relatively simple, it is only fair to say that most of the difficulties in Kalman’s treatment were caused by weakening “>” to “ 2 ” ( P 2 0, 3 5 0). However our simpler attack will suffice to give the reader an idea of that of Kalman. In the last section we discuss the effect of restricting the admissible class of characteristic functions by an inequality 0 icp(a)/o 5 K. While we shall lean almost entirely upon the work of Kalman, the notations are those of the previous chapters. For convenience the reader may use the following transfer table :
$1. CONTROLLABILITY A N D OBSERVABILITY Notations ofthetext: A
h
c
5
7
p ~
Notations oFKalman: F
g
h
-<
y
p
$1. Controllability and Observability While these concepts have appeared here and there in the literature before Kalman, he has had the great merit of giving them explicit form, content and application. See notably Kalman [2] and Kalman and associates [11. Roughly speaking a controlled system is completely controllable (c.c.) if one cannot decompose it into two systems with the control operating in only one of the systems; otherwise it is merely partially controllable or simply noncontrollable. Controllability operates through the vector b. Something similar takes place regarding observability and the vector c appearing in the expression of 0. The latter is the feedback or "visible" effect of the control, hence observable. Without going into further detail we adopt the following explicit definition (Kalman): The pair (A, b) is completely controllable (c.c.) whenever the vectors b, Ab, A'b, ..., A"- ' b are linearly independent. The pair (c', A ) is completely observable (c.o.) whenever ( A ' ;c) is C.C.As a matter of fact the important property of independence of the vectors Akb has been considered by many authors. See for example LaSalle [3, Introduction and p. 151. Our main purpose in the present chapter is the proof'of the fundamental theorem of $6. Now from a very general theory developed by Kalman [2, No. 121 one may infer: (1.1) There is no loss ofgenerality in assuming that the pair ( A , b) is completely controllable and the pair (c',A ) is completely observable.
The reduction of our system to this type and also its justification will be carried out in the next section. Recall this well-known property : if the characteristic equation of the matrix A is (14
2
+ a,l"-' + ... + u , = o
then A satisfies the matrix equation (1.3)
A"
+ a,An-' + ... + a , E = 0. 107
8.
SOME FURTHER RECENT CONTRIBUTIONS
This property is usually expressed as: the matrix satis3es its own characteristic equation. If ( A ,b) is c.c., (1.2) is the equation of lowest degree (up to a constant factor) satisfied by A : it is also its minimal equation. In fact if A satisfied an equation of degree p < n, e.g., AP + a,'AP-
' + ... + al'E = 0
then the p + 1 vectors b, Ab,..., APb, would not be independent, contradicting the complete controllability of ( A ,b). (1.4) The property of complete controllability of ( A ,b) is independent of the choice of coordinates.
For the change of coordinates x -+ Px yields A -+ P- ' A P , b -+ P - 'b. Hence the set of vectors S = {Akb}, 0 k n goes into S1 = {P-'Akb}. Since P and P - are nonsingular, linear independence of one of the sets implies that of the other, hence (1.4) follows. We shall also require this property :
-=
'
(1.5) A n.a.s.c. for complete controllability of the pair ( A ,b) is that
vector x is such that x'eA'b = 0 for all t then x
=
0.
if a
It will be convenient to utilize orthogonality. Two vectors u,u are orthogonal if u'u = u'u = 0. As is well known and readily proved, a n.a.s.c. for the existence of a vector u # 0 orthogonal to n vectors u l ,..., u, is that the uh be linearly dependent. Observe also that (1.3) yields Ak+"b+ a,Ak+"-lb
+ ..- + Akb = 0
whatever k. Hence the vectors Akb all depend linearly upon the set S defined above. We come now to the actual proof of (1.5). NECESSITY. Let ( A ,b) be C.C. and let x'eArb= 0 for all t. Upon differentiating k times and setting t = 0 there follows in particular x'Akb = 0, 0 5 k < n. Since x is orthogonal to the linearly independent vectors of S , necessarily x = 0. This proves necessity. SUFFICIENCY. Let the property of (1.5) hold. As a consequence if x'Akh = 0 whatever k then x' = 0. Since these Akb depend linearly upon
108
52.
REDUCTION OF THE SYSTEM
the vectors of S , if x is orthogonal to all the vectors of S alone then x = 0. Hence S consists of linearly independent vectors. Hence (A, b) is c.c.: sufficiency is proved and so is (13.
$2. Reduction of the System to One with a Completely Controllable Pair ( A , b) and Completely Observable Pair (c', A ) The above title states our objective. However to describe the reduction process a convenient notation is needed. Let A = diag(A,, ..., A,) and let nh be the order of Ah so that Xnh = n. Iff is any vector let f h denote the vector whose components of order n, + n2 + ... + nh-' + s, 1 5 s g nh, are the same as those off and the rest zero. Thus f = Cf h . Method of reduction. The only manner in which the pair ( A , b) will affect later arguments is through expressions of type f ' A - 'b, where under a transformation of coordinates x + Px, f behaves like c : f +,''P so that f 'Az- ' b is unchanged. Note that this implies freedom in changing coordinates. Now the following operation does not modify f 'Az- ' b : If A
then
=
diag(A,, ..., A,)
f 'Az- 'b
=
1f h'A,'bh.
Hence if bh = 0 the summand f h ' A i ' bh = 0 whatever f. Hence Ah and related coordinates will not affect any later argument and so they may be suppressed. That is, one may freely replace A, as far as the sequel goes, by diag(A, ,..., Ah- Ah+ ,..., A,). This may also be justified in the following manner. Since bh = 0 the vector xh satisfies 2 = A h X h which is a system with constant matrix and no control: control-neutral. Since A is stable so is A h . Hence the solutions xh(t)all + 0 naturally as t -+ +a and so one may as well disregard Ah and related coordinates. An analogous process may be applied in the following case. Let A h be a block matrix : Ah = c h ( n h ) , such as.occurs in the Jordan normal form (IX, $1). Let bh be such that its coordinate of order v = n , + ... + nh is zero. In view of the form of Ah the componentsf,, b, off, b enter inf 'A; ' b
,, ,
109
8.
SOME FURTHER RECENT CONTRIBUTIONS
solely through the expression
As above, then, one may suppress the coordinate x , and components without affecting anything. In particular C(A) will merely be replaced similar block but of order one unit less. This operation may of course be repeated. Our reduction consists, then, in the suppression of certain submatrices and terms and allowable coordinate transformations, i.e. in which real points are always represented by certain conjugate pairs of coordinates. These operations will not affect the nature of the Jordan normal form. A final remark referring to the submatrices Ah in the diagonal form. The statement : (A,,,bh) is C.C.merely means that the vectors A,,%', 0 =< s c nh are linearly independent. We now proceed with our process of reduction. As we may assume that A is in the Jordan normal form it is convenient to consider first a single block matrix. Matrix A = CG), 3, # 0. Since one may assume that the preceding reductions have already been applied one may assume that b # 0 and also that its component b, # 0. We prove: Property a. Under the preceding assumptions the pair (C(A),b) is completely controllable.
The proof is a consequence of the following two properties: Property fl. One may choose coordinates such that C(A) is unchanged but b' becomes the vector (0,..., 0,l). Property y. If b' = (0,..., 0, 1) the pair (C(A),b) is completely controllable.
PROOFOF /?.The transformation y,
= ClXl
y,
= ClX,
+ ... + c,x, + ... + C.-IX,
. . .
Yn = C ~ x n
is nonsingular provided that c1 # 0 and it preserves the block property 110
$2.
REDUCTION OF THE SYSTEM
of A. Choose it so that b has the prescribed form. This yields the system in the C h : C1bh -k CZbh+l -k
**'
-k Cn-h+lbn = 0,
clbn
=
h < n,
1.
The determinant is +b," # 0, hence there is a unique solution. Since c1 = l/b,,, the transformation of coordinates x + y is legitimate, and as it does not modify the form of A, property B follows. PROOFOFY. We will show that if x'eArb= 0 for all t then x Thus y will be a consequence of (1.5). Let eAr= (&). Then x'eArb=
=
0.
1
Xh/?hn.
Now an elementary calculation yields +n-h
Hence
The assumption implies that the polynomial of the sum is identically zero. Hence every xk = 0, x = 0 and y, hence also a, follows. General pair (A, b). We may take A in the Jordan normal form and without blocks whose bh = 0 or last component b, = 0. Take again a definite characteristic root A with two blocks @(A), C"(A) of orders p,q with p least for such blocks so that p S q. Let x,,', h = 1,2, ..., p and xi, k = 1, 2,..., q denote the associated coordinates. One may assume that their bJ, e.g. b' and b" are both of type (0,..., 0, 1). Apply the coordinate transformation x i - , + + xi-,+ - xl',..., xf + XI; - xp'. As a consequence the types of C'(A) and C"(A) will be unchanged but b" will be replaced by zero. Hence C"(A) may be suppressed. Upon repeating this operation as many times as necessary, A will still be in normal form but with distinct characteristic roots for distinct blocks and all the bh of type (0,...,0, 1). We must now show that when this happens (A, b) is C.C. Write simply A = diag(A,, ..., A,) where Ah = c&). Referring again to (1.5) one merely needs to show that if (2.1)
x'eArb= 0 111
8.
for all t then x
=
SOME FURTHER RECENT CONTRIBUTIONS
0. Now (2.1) is equivalent to this:
for all t implies that every X” = 0. As this property is a consequence of the complete controllability of (Ah, bh), (2.1) is proved. Hence finally ( A , b) is C.C. This completes the reduction. Complete observability of (c’, A ) . Let generally T denote the preceding operations on A which lower its order n and alone affect C.C.of ( A , b) or C.O.of (c’, A). Now starting with the initial A, to achieve C.C.of ( A , b ) may have required to apply operations T from A to A , of order n, < n and associated b’, c1 with ( A , , b’) C.C.If (c”, A , ) is not C.O.an analogous (dual) procedure will yield A , of order n, < n, and (c”, A,) c.o., etc. The process must clearly stop, e.g. with an A , = 0: final system control neutral, which is not realistic, and therefore ruled out, or with A , # 0, hence (in evident notations) with (A,, bo) C.C. and (c”, A,) C.O. Thus the reduction of (1.1) will have been achieved. We will say briefly that (A,,bo,co) is C.C.and C.O. Comparison of initial and reduced systems. Let all the designations of (VII, $1) be reserved for the reduced system. In view of the reduction process the initial system, conveniently assumed of dimension n p , has the general form
+
Here y is a p vector, the matrices A , , A , are p x p and n x p matrices and the triple (A, b, c) is C.C. and C.O. We are now faced with two distinct problems-mathematical and practical. As a mathematical problem one must deal with the system (2.2) as it stands and not suppress any coordinates : no reduction may be made. Practically however the situation is quite different. Let b, denote the vector like b corresponding to (2.2). This vector and the analog c, are design elements. If b,, c, have been chosen so that the parts b, c alone are #O, it means that one has considered the role of the vector y as immaterial as regards control. At this point one must recall that the vectors x , y merely represent deviations from certain initial system coordinate vectors 112
$3.
A SPECIAL FORM FOR SYSTEMS WITH COMPLETELY CONTROLLABLE PAIR
(see the Introduction). Thus the deviation y has been considered, by design, as immaterial. Hence it may reasonably be neglected. That is, one may replace y by zero and what is then left of the system (2.2) is really the system (VII, 1.1) but with ( A , b, c) C.C.and C.O. This assumption will be made throughout the rest of the chapter.
93. A Special Form for Systems with Completely Controllable Pair ( A , b) When the pair ( A , b) is completely controllable the following vectors en = b
en-,
+ a,E)b = ( A 2 + a,A + a,-,E)b =
(A
+ a,An-* + ... + a,E)b,
el = (An-'
where the a, are as in (1.2), are linearly independent and hence they constitute a base e. The effect of A on this base is given by Ae,
=
,
en- - anen
A e , - , = e n - 2 - a,-,e,
. . . Ae, =
-
ale,.
Hence if we adopt e as a base for coordinates, A will become
and b will be represented by (3.4)
b'
=
(0,0,..., 0,1). 113
8. SOME FURTHER
RECENT CONTRIBUTIONS
It will turn out later that this type of matrix A and vector b will alone need to concern us. Given the importance in the sequel and also, e.g., in Popov's relation of the expression c'A;'b, it is convenient to calculate it for the above pair (A, b). One must first calculate the vector A;'b = u. If A;' = (ap), the components of the vector u are aln,a2,,,...,a,,,,, that is, the last column of A2-'. This column consists of the cofactors of the last row of
divided by /A,(.The cofactors are readily found to be 1, z,...,z"-l. Hence if c,, are the components of c we have CIA; ' b =
(3.5)
C'
+ czz + -..+ c,zn-' lAzl
In particular
Since no particular properties of the vector c have been utilized in deriving (3.5) we may state: (3.7) Iff is any vector with componentsf, then (3.7a)
f ' A - ' b = f'
+ f z z + ... + f , z n lAzl
& Main I. Lemma (Yacubovich and Kalman) This lemma is at the root of the fundamental theorem to follow ($6). It comes closest to a result of Yacubovich [3, Theorem 31, and the necessity proof below differs very little from that of Yacubovich. However, the sufficiency proof, which is the more difficult part, is essentially inspired 114
§4.
MAIN LEMMA (YACUBOVICH AND KALMAN)
by the same part of the proof of Kalman's main lemma [2], perhaps the most original feature of his treatment. (4.1) Main 1emma.Given the stable matrix A, a symmetric matrix D > 0, vectors b # 0 and k, and scalars z 2 0, E > 0, then a n.a.s.c. for the existence of a solution as a matrix B (necessarily > O ) and vector q of the system
(a) A'B
(44
E
-
ED
B b - k = $9
(b)
is that
+ B A = -qq'
be small enough and that the Kalman relation z
(4.3)
+ 2Rek'AG'b
>0
be satisfied for all real o. As in (VII, $5) set m(io) = A i ' b . Thus m(io) is a complex vector function of o.With this notation one may also write (4.3) in the form z
(4.4)
+ k'rn + m*k > 0.
Notice now the identity ALB
+ BAi,
=
- (A'B
+ BA).
If one multiplies the right-hand side by AG'b and the left-hand side by b'A&-', then take account of (4.2) there follows (4.5)
rn*Bb
+ b'Bm = m*qq'rn + Ern*Dm.
This relation will be used at once. We come now to the proof of the lemma proper.
PROOFOF NECESSITY. In (4.5) replace Bb from (4.2). As a consequence (4.6)
2Re krn = Iq'rnl' - 2$
Re q'm
+ Em*Dm.
Moreover if one considers D as a hermitian matrix then D , = A~-'DA,<' is the hermitian matrix deduced from D by the change of coordinates x + A,<'x. Hence D , > 0 like D, and so since b # 0
6 = Eb'D,b
=
Em*Dm > 0.
Upon applying (4.6) there follows (4.7)
2 Re k'rn.= Iq'rnl' - 2& Re q'm
+ 6. 115
If q'm
=
8.
SOME FURTHER RECENT CONTRIBUTIONS
7
+ 2Rek'm = (1- &)' + pz + 6 > 0
1 + ip, (4.7) yields
which is (4.3). This proves necessity.
PROOFOF
SUFFICIENCY.
We first establish a preliminary result.
(4.8) If u is a real constant uector such that Re u'm (iw) = 0 whatever w then u = 0. Suppose that u # 0 and let we have I)~(Z)
=
u'm(z) =
uh
u1
be its components. Referring to (3.7)
+ u2z + ... + u,zn-l lAzl
This function has the following properties : (a) It is rational in z and not identically zero. (b) Its poles are among the characteristic roots of A and hence, since A is stable, they are all to the left of the complex axis. (c) Since the numerator of I),&) is of smaller degree than the denominator there is at least one such pole. (d) $,,(z) takes only complex values on the complex axis. It follows that +(z) = ii,b0(iz) is a rational function of z which takes only real values on the real axis and hence it is real. Moreover it has one or more poles and they are all to one side of the real axis. Now if a is such a pole so is d and the two are separated by the real axis. This contradiction shows that u = 0. Passing now to the sufficiency proof proper since both (4.9)
~ ( w=) m*k
+ km,
n(w) = m*Dm
are real rational functions of w with numerators of degree 5 n - 1 and denominator of degree n, both -,0 as w + + 00. Furthermore they are continuous for w finite. Hence they have finite upper and lower bounds. Let p be the upper bound of n(w) and v the lower bound of ~ ( w )Since . n(w) 0 for all finite w, we have p > 0. Hence
=-
+ m*k + k'm - Em*Dm 2 z + v - ep. Moreover owing to (4.3) 7 + v > 0. Hence if one chooses E < *[(z + v)/p], 7
we have (4.10)
7
116
+ m*k + k'm - Em*Dm =- 0.
$4.
MAIN LEMMA (YACUBOVICH AND KALMAN)
Let now $(z) = IAZI. Thus $(z) is a real polynomial with leading coefficient unity. Now the left hand side of (4.10)may be written z
Him) + k'm(io) + m*(io)k - Em*Dm = -
Here q(z) is a polynomial of degree 2n with leading coefficient t. Since q(io) is real and > O (4.10),q(iw) = q l ( 0 2 ) , q l a real polynomial without real roots. Hence q1(w2)= O(io)O(- io),
where O(z) is a real polynomial. Since the leading coefficient of O(z)O(-z) is t , that of O(z) is and the degree of O(z) is n. By division and since the leading coefficient of $(z) is unity
&,
where v(z) is a polynomial of degree at most n - 1. If q l , q2,..., are its (real) coefficients, define q by q' = (-q1,
..., -4").
Once q is known one obtains the matrix B from (4.2)and as we know B > 0. The above leads to the relation (4.1 1) z
+ k'm + m*k - &m*Dm=
v(io)
io)
Referring on the other hand to (2.2) and recalling the meaning of m we have
117
8.
SOME FURTHER RECENT CONTRIBUTIONS
Hence the relation (4.1 1) yields for the chosen q
+ m*k - Em*Dm = (m*q - &)(q'm
k'm
- &)
= m*qq'm - Jz(q'm = -
- T
+ m*q)
(m*Bb + b'Bm) - Em*Dm - &(q'm
+ m*q),
the last step by (4.5). Hence whatever w : m*(Bb - k - &q) =
+ (Bb - k - &q)'m
2Re(Bb - k - &q)'m
=
0.
Since the vector in parentheses is real it must vanish, showing that (4.2b) is satisfied. That is, a solution ( B , q) has been found for the system (4.2). Thus sufficiency of (4.3) is proved. This completes the proof of (4.1).
$5. Liapunov-Popov Function and Popov Inequality Their connection has already been emphasized (VII, §§4,5) and we recall that the function V ( x ,a), the related and the Popov inequality are
v
+ a(a - c ' x ) ~+ B@(o); - 3 = x'Cx + Bpcp2(a)+ 2d0'xcp(cr)+ 2ayacp(o); do = Bb - (@A'c + UYC). P(a,b,w) = f l y + Re((2ay + iwfl)c'&'b} 2 0. V ( X ,a) = X'BX
(5.1) (5.2)
(5.3) We also state the following generalization of the Lurie problem resembling one due to Kalman: Generalized Lurie problem. To3nd n.a.s.c. to assure absolute stability by means of the function V ( x ,a) of (5.1), i.e., through Vand - both positive for all x, a not both zero, and all admissible characteristic cp(a).
v
This problem is solved by the fundamental theorem ($6). Reduction of Kalman's relation (4.3) to Popov's (5.3). At first glance, although quite similar, they seem to deal with two different problems. 118
$6.
FUNDAMENTAL THEOREM
Actually by a specialization of the constant k appearing in (4.3) one obtains (5.3). Referring to the expression (5.2) of
+ UYC relation let z = /?p = /?(y + c'b). As a consequence (4.3) k
and in Kalman's yields (5.4)
let
/?p
=
Bb - do = @A'c
+ 2 Re{(&'A + uyc')A,'b}
The bracket may be written
> 0.
+ uyc'AG'b + i/?c'ioA,'b + uyc'A,'b.
@c'(ioE - Ai,)A,'b = - &'b
Since p - c'b = y, (5.4) reduces to (4.3). That is, with the substitutions indicated for the constants (5.3) reduces to (4.3).
56. Fundamental Theorem With the lemma behind us we are in position to prove : (6.1) Theorem. N.a.s.c. for both V and - V of 45 to be positive dejinite for all ( x , ~ )and choice of an admissible cp are the Popov-Kalman inequality (4.3) together with (6.2)
(a) u 2 0 , p 2 0 , u + / ? > O ; (b) z > 0 or z
=
0, do = 0, u > 0.
When these properties are satisfed the system (VII, 1.1) is absolutely stable.
(6.3) REMARK.It is quite instructive to compare the above theorem and the apparently similar theorem (11, 2.1 1). The earlier theorem refers to the Lurie-Postnikov function V and in its conditions there enter the matrix C and the control parameters b, c, p. In the present theorem the V function is the more general Popov type and in its conditions there enter merely the scalars u, /? and the control parameters. The difference is due, of course, to the appearance of the powerful Popov condition.
PROOF OF NECESSITY. The necessity of (6.2a) has already been proved. Regarding (6.2b) let 7 # 0. Then the substitution x + E X , cp -,cp, 0 + E% 119
8.
SOME FURTHER RECENT CONTRIBUTIONS
yields - V A zcp’ and so one must have z > 0. Suppose now z = 0, do # 0. Then the same substitution yields - p A 2c(p(o)d 0 ’ x : the sign of pchanges with that of E hence one must have do = 0. Then however u # 0, hence u > 0, since otherwise V = 0 for x = 0, CT 0. Thus (6.3b) must hold.
+
Consider now separately z > 0 and z
=
0.
I. z > 0. One may then write - V = X’(C - qq’)x
(6.4)
+ (&q + q’x)2 + 2uyacp,
where q is defined by (4.2b). Choose llxll large, E small, oo # 0 and fixed q’x = 0 if and o = E ’ O ~ ,q(o)= p 2 0 0 with p such that E~,uo,& q’x # 0, and any p > 0 if q’x = 0. Then - V A x ’ ( C - qq’)x, hence C - qq’ = D > 0. Thus (4.2) holds, hence by the lemma (4.3) is satisfied. Thus necessity is proved in this case. 11. z = 0, do = 0, u > 0. Taking o = 0 and any x , we have - V A x ‘ C x , hence C > 0 and so by (6.2~)B > 0. Since do = 0, (4.2) holds with q = 0. E = 1. Therefore the Popov-Kalman inequality is satisfied, and necessity is completely proved.
+
E
PROOFOF SUFFICIENCY. Since (4.3) holds given D > 0 there exists > 0 such that the system (4.2) has a solution (B, q). Note that C = ED
+ 44’ > 0,
hence also B > 0. It is convenient now to deal separately with Vand V. Take first K Ifu = 0 then B > 0 and so V > 0 for x # 0 or i f x = 0 for o # 0. Hence Vis positive definite for all X , C Tand admissible cp. On the other hand if u # 0 the sum of the first two terms in (5.1) is a positive definite quadratic form in x , CT and the conclusion is the same. Consider now V. If z # 0 (5.2) becomes
+
- V = EX’DX (&cp(a)
+ q’x)’ + 2~yocp(~).
Hence the sum of the first two terms is positive definite for all x , o and admissible cp and so - V has the same property. If z = 0 then do = 0, u > 0. Hence (5.2) reduces to - V = X’CX
+ 2uyocp(o).
Since C > 0 and u > 0 this expression is likewise positive definite for all x , o and admissible cp. 120
$7.
A RECENT RESULT OF MOROZAN
Since both V and - are, always, positive definite for all x, t~ and admissible cp, sufficiency is proved. PROOFOF ABSOLUTE STABILITY. All that is now needed is to show (Barbashin-Krassovskii complement, IX, 4.7). that V + 00 with llxll + Owing to B > 0 and property I11 of (I, $1) for cp this is true if a = 0 since then B > 0. It holds also when a > 0 since the first two terms in the expression (5.1) of Vmake up a positive definite quadratic form in x and 0.
$7. A Recent Result of Morozan It is interesting to return to the inequality (Fi)of (11, $2) for the number
p. In our present notations and since c loc. cit. is here A'c, and hence k for a = 0, /l= 1 is fA'c, we have (Fi)
p > (Bb
- k)'C-'(Bb
-
k).
At a meeting in Kiev in September 1961, the author raised the question of finding the minimum of p for all choices of the basic matrix C > 0. This question has recently been solved by Morozan [l]. We have however all that is required to obtain an answer here. Namely when a = 0, taking B = 1, (4.3)yields p
+ 2Rek'A,;'b
> 0.
Here, however, k = $4'~. Hence p
+ Re(c'A
*
Ak'b) > 0.
From Aim= i o E - A there follows p - c'b
+ Re iwc'Ai,'b
> 0.
Since this last inequality must hold for all real o we find (7.1)
p > c'b
+ sup Im oc'Ai,'b. m
Owing to the n.a.s.c. of the fundamental theorem the right-hand side represents the true least value of p. Owing no doubt to differences in notations this result does not coincide with that of Morozan. 121
8.
SOME FURTHER RECENT CONTRIBUTIONS
$8. Return to the Standard Example In the preceding chapter we have already made a comparison between the two types of function V ( x ,0 ) : (VII, 4.6), form of Lurie-Postnikov (a = 0), and Popov form (VII, 4.Q (a > 0). We return to the same question here and arrive at comparisons based upon rather simple estimates obtained from Popov's inequality. For simplicity the discussion will be restricted to the case /3 > 0. Thus Popov's inequality may be written
y
(8.1)
+ Re(6 + iw)c'A,'b
>0
where 6 = 2ay//3. In our earlier notation m(iw) = c'AL'b one may write (8.1) as
y > os,
(8.2)
-
=
Sl(o) + iS,(w)
6S1,
which is to hold for some 6 2 0 and all real o. It is evident that one may take /3 = 1. Then 6 = 0 corresponds to the Lurie-Postnikov type of function V ( x , a ) and 6 > 0 to the Popov generalization. Since (8.1) is independent of the choice of coordinates (a fact readily established) we may assume that A = diag(Al,..., An). As a consequence
and therefore
The relation (9.1) will only yield rather simple estimates when all the & are real. We confine our attention to this case. Thus = -ph < 0. Hence C'AL'b
Hence
122
=
1-ph + iw' bhch
$9.
DIRECT CONTROL
Taking into account the relation p p >
(8.3)
c
+ c‘b we obtain from (8.1):
=y
Ph(Ph Ph2
+
6)bhCh
oz
.
It is now necessary to distinguish between the signs of the products bhCh. Let bh’Ch‘ denote the positive products and Ph‘ their P h , and bit;, p i the negative products and their p,,. Suppose also that PI’
5 Pz‘
s ... 5 PP”
P;
s Pi s ... s P;.
Now it is clear that (8.3) will hold if one merely preserves the bh’Ch‘, chooses 6 = p l ’ and o = 0. Let
Similarly set
It is evident that if p p is the least of the numbers pp’, p: then p p is a suitable lower bound for the number p. Now let us see what one obtains as lower bound for p from our inequality (Fi). Referring to (II,5) its vector c, now written c,,, has for components -p,,c,,. Hence the inequality (Fi)yields here p >
1bh‘ch‘ = Pm.
It is clear that if the products bhch are not all negative p p < pm, hence the Popov type of Liapunov function, i.e. with a suitable u > 0, is then more advantageous than the Lurie-Postnikov type with u = 0 (our earlier type).
99. Direct Control The most interesting direct control of order n is the one which reduces to an indirect control of order n - 1 and is fully discussed in (IV, §§6,7). The state matrix A of the direct control has zero as simple characteristic root. All that we propose to do here is to adapt the theorem of $6 to that case.
123
8. SOME FURTHER
RECENT CONTRIBUTIONS
In the notations loc. cit. the system is Xo = A O X O
c i = g’x,
- boV(0) - pCp(0)
where A, is a stable ( n - 1) x (n - 1) matrix. One may apply directly the fundamental theorem of $6 under the following identifications : A, corresponds to A ;x, to x , bo to b, g’ = co’A, to c’A,co to c. Here also y = p - c,’A,b,. Finally p and q(a) have the same meaning as in $6.
$10. RQumC (Indirect Control: y > 0) The variety of results on the Popov expression P ( a , p , o ) and the Liapunov function V ( x , 6)of (5.1) as related to absolute stability, may be summarized as follows :
I. Popou’s first theorem. A sufficient condition for absolute stability is (10.1) P(cr,p,o) 2 0 for some a,B 2 0, a B > 0, and all real o.
+
11. Popou’s second theorem. (10.1) is a necessary condition to have absolute stability via Vand - p positive definite, with a and /?the same in P and K 111. Kalman’s theorem. (10.1) plus another (complicated) condition is a n.a.s.c. to have absolute stability through V > 0, p 4 0, with same a, fi in P and K
IV. Theorem of $6.(10.1) with P > 0 plus Bp > 0 or Bp = 0, do = 0: a > 0 are n.a.s.c. to have absolute stability secured through Vand -V both positive definite. V. However, in 111 and IV the pair (A, b) is assumed completely controllable. In both also a certain theorem of Yacubovich plays a major role. Suppose now that in the initial system (VII, 1.1) the pair ( A ,b) is not completely controllable. As we have shown in $2, one may choose coordinates, and select a reasonable vector cz such that the initial system is replaced by (2.2) together with (2.3) where now (Al, b) is completely controllable. Popov’s first theorem provides a sufficient condition for the 124
$11.
COMPLEMENT ON THE FINITENESS OF THE RATIO cp(0)/0
absolute stability of the full system (2.2, 2.3). It would evidently be most desirable to prove that P 2 0, supplemented, perhaps, by some simple inequality is also a necessary condition for absolute stability. Since (2.3) already has this property, it might suffice to obtain this result for a completely controllable pair ( A , b). Up to the present, however, this remains an open question.
$11. Complement on the Finiteness of the Ratio cp(a)/a Two recent publications led to this complement : (a) a noteworthy paper by Yacubovich [4] in which he deals not only with the restriction in the title but even with a possible isolated function cp(a); (b) an extensive monograph by Aizerman and Gantmacher [l] where the restriction in question is accepted throughout. This has induced the author to examine the possible modifications in the results of the chapter presented by the added condition (11.1)
0
# 0:
to our admissible class. As indirect and direct controls proceed along entirely distinct lines, the two cases are separated. Indirect control. Take I/=
and modify
X’BX+ U(O - c ‘ x ) ~+ /3@(~)
v by adding and subtracting A(0) = 2ay
Thus A(o)> 0 for
0
# 0, or cp(o)
- V = X’CX
A’B
(.q) -
cp(0).
-= KO and A(0)= 0 if q(a) = KO. Then + 2dO’xcp(0)+ A ( 0 )
+ z0cp2(0)
+ B A = -c,
do = Bb - ~ B A ‘ c- UYC. Replace now Popov’s initial expression by the K-Popov expression :
P(a, B, w, K)
=
P(a, p, w )
+ 2UKY ~
=
By
+ 2UY + Re((2a + iw/?)c’A,;’b}. K -
125
8.
SOME FURTHER RECENT CONTRIBUTIONS
Under the same modifications as before in $5 (expression of k ) it is identical with the K-Kalman expression
K(a, b, w,
K) =
zo
+ 2 Re k'AZ'b.
The new fundamental theorem is : (11.2) &Theorem for indirect control. N.a.s.c., in order that, with V as above, both V and - V be positive definite for all x, o and all K-admissible functions p (p restricted by 11.1) is that the K-Popov-Kalman inequality:
P(a, b, w, 4 > 0 hold for all real w together with
(1 1.3)
zo
> 0.
When these properties are satisfied the system is absolutely stable in the sense that cp is restricted by (11.1).
PROOFOF
NECESSITY. To
-V
prove (11.3) take cp(a) = K O so that
= X'CX
+ zocp2 + 2d0'px.
For x = 0 and zo # 0 then - = Z ~ K ~ hence O ~ , zo > 0. On the other hand zo = 0 is ruled out since then - V cannot be positive definite in x, O. Thus (1 1.3) holds. Write now - V = x'(C - qq')x
+ (&p + q'x)2.
Take any x # 0 and determine a by J z o ~ a = -q'x. As a consequence - V = x'(C - qq')x > 0 for all x # 0. Hence C - qq' = D > 0. Thus all the necessary conditions of the main lemma are fulfilled with z = zo. Hence the K-Popov-Kalman inequality holds and necessity is proved. PROOFOF SUFFICIENCY.It is practically the same as in (6.3) save that one need not consider zo = 0. The proof of absolute stability with the K restriction added is the same as in $6, with the modification
+
- V = EX'DX (&&a)
which does not affect the proof. 126
+ q'x)2 + 2ay
(o
- q(a)
?:p
$1 1.
COMPLEMENT O N THE FINITENESS OF THE RATIO cp(O)/U
Direct control. This time the system is (11.4) X’ = A X - bq(a), CT = C’X.
As Liapunov function take V ( X )= X’BX (1 1.5) hence
+ P@(D),
+ 2 d ’ x ~ ( o+) 7cp2(a)
- V(X)= X‘CX
(11.6)
+ B A = - C , d = Bb - &?A’c, T = fic‘b. Actually the role of v is really played by the function W ( x )= - v - s 0 ( A’B
9
- cp(0)
=
X‘CX
+ 2(d - ~SC)’X(P +
( +3 7
-
q2
where S > 0. In the presence of the restriction ( 1 1.1) the adequate theorem here is : (11.7) K-Theorem f o r a direct control. Suficient conditions for V positive definite as a function of x for all admissible functions cp satisfying (11.1) and W as a quadratic form in x and cp (unrestricted) is the K - P O ~ Oinequality. V ( 1 1.8)
S
P(S, /3, o,K) = K
+ Re{(S + io/?)c’A,’b}
>0
for some # 0, some positive 6, and all real w. When these conditions are fuljilled both V and - V are positive definite and we have absolute stability. REMARK. This theorem does not really differ in substance from a theorem of Aizerman and Gantmacher [2, p. 781. They give conditions referring to a Liapunov function V with the property that if V is positive definite under the restriction (11.1) then W is positive definite in x, cp without restriction. The proof of sufficiency can be carried out by a slight modification of the argument of the sufficiency proof of our fundamental theorem ($6). It is also obvious that when the given conditions hold
- v = w+s
(
0--
V:))
cp(0)
is positive definite in x (arbitrary) and cp (restricted by 11.1). Absolute stability is then established as in $6. 127
Chapter
9
MISCELLANEOUS COMPLEMENTS $1. The Jordan Normal Form for Real or Complex Matrices In the sections on vectors and matrices the notations are those of (11, 01). In connection with the reduction to the normal form, it is particularly convenient to have recourse to bases for vector spaces. Changing slightly our previous point of view, consider the symbol x to represent a certain vector u in the coordinate system x. Thus the transformation of coordinates y = Px, P nonsingular, does not change the vector u but merely provides a new representation of u in the y coordinate system. In other words the vector u is independent of the coordinate system but is merely endowed with various representations in various coordinate systems. Let eh denote the vector which in the x system has coordinates 6 h k (Kronecker deltas). The system {eh}is a base in the sense that the vector u may be written uniquely
u
(1.1)
=
elxl
+ ... + e,x,
(it is convenient to put the xi after the factors ej). A useful convention is to think of the eh as the components of a one-column matrix (vector) designated by e. Then in an evident sense, 128
5 1.
THE JORDAN NORMAL FORM FOR REAL OR COMPLEX MATRICES
under our matrix conventions (1.1) may be written (1.2) u = e‘x. Now if one applies the transformation x u
=
e’Py = f’y,
=
P y one finds
f ’ = e’P
or equivalently f = P‘e. Thus the coordinate transformation x = P y is associated with the “base transformation” f = P‘e. We have already observed on repeated occasions that if in the differential equation f = Ax, A constant, one applies the above transformation of coordinates, the system is replaced by A, = P--’AP. j = A,y, That is, the effect on the matrix A is to replace it by the similar also called equivalent matrix A,. The relationship is written with the standard equivalence symbol A, A. Furthermore A , is any matrix -A. In order to describe the Jordan normal form we will utilize two special designations. We will write A = diag(A,, ..., Ar), where the A,,, are, like A, square matrices and follow one another in the principal diagonal as if they were elements, the rest being zero matrices. We shall also write
-
9
. . . . I 1 I
where Cs(i) is n x n and all missing terms are zero. Actually one may also assume that the diagonal 1, 1,... is below the main diagonal. We now state without proof (for the proof see Lefschetz [1, Appendix 11):
(1.3)Theorem. Every complex matrix A is equivalent in the complex domain (that is by means of a complex transformation matrix P ) to a matrix A,
=
diag(CrI(Il),-., crs(&)) 129
9.
MISCELLANEOUS COMPLEMENTS
where the A h are the characteristic roots of A (and also of any matrix .-A), all included, some perhaps repeated. The order of the “blocks” C(A) is immaterial.
The proof loc. cit. is based on a systematic selection of base elements, one set e.g. ehlr...,ehrhcorresponding to the block Crh(&). Now when the matrix A is real, the characteristic roots appear in is a block conjugate pairs Ah, 1,. It is also shown loc. cit. that if crh(&) associated with Ah, then there appears also the block cr,,(&,). Moreover is the subbase associthe base vector f may be so chosen that if (hl,...,h*) then one may choose a subbase (fh1, . . . , f h r h ) corresponding ated with crh(Ah) as part off: As a consequence, if the vector u (the point x) is real to cr,(&) and if yj is its coordinate corresponding to f h j then its coordinate corresponding tofhj will beyj. This is expressed by the statement : real x points will be represented in the complex y coordinates by pairs of conjugate coordinates. Thus the y coordinates may be ordered like this :
91,...,Pp, Y2,+
y19-3 Y,,
l,...?
Y,
where the y2,+h are real and correspond to the blocks with real Ah. Letfj,A be associated as above to yj, pj. One may replace them in the base vector f by the real elements
which are both real. If yj, y; are the real coordinates corresponding to then identically
fi’,f
=
From this follows (since
Yj’
Yi.(T) fj
fi,&
+A
+ Y;(li). fj
-A
are independent)
y j + iy; 2
7
y.= J
y .‘ - iyl 2 . J
By examining the effect on the bases e (real) and f (complex) it is readily found that the transformation matrix P from the real coordinates
130
$1.
THE JORDAN NORMAL FORM FOR REAL OR COMPLEX MATRICES
x to the complex coordinates y has the form
where Q and Q are p x n matrices and R is a real q x n matrix, q
=
n - 2p.
Consider a general system
3
=X(x),
X(0) = 0
whereX is e.g. of class C' in a neighborhood SZ of the origin. We merely assert that a change of coordinates x = Py does not apect the stability properties ofthe origin. This is readily proved for instance by means of the inequality on p. 20 of LaSalle and Lefschetz [11. For additional information (especially proofs) of the properties just considered, see the following well-known books : Bellman [13, Gantmacher [ l ] , also in part Lefschetz [ l , Appendix I ] . We recall the following : (1.4) Theorem. Every positive definite quadratic [hermitian] form may be
reduced by a real [complex] affine transformation of coordinates to the form
The transformation, for example, for a quadratic form may be made in two steps. A first orthogonal transformation x + Px, P-' = P', reduces the form to xa&h2,
ah > 0.
Then the affine transformation xh + xh/\& brings it to the required type. We also recall the following property utilized earlier.
(1.5) If the real symmetric [hermitian] matrix C > 0 then likewise C - ' is real symmetric [hermitian] and >O. The treatment of the hermitian case is the same as the other [with )'I, so we confine our attention to the real case.
( )* instead of (
131
9.
MISCELLANEOUS COMPLEMENTS
Since C' = C we have C-' = (C')-' = (C-I)', and so C - ' is symmetric. Let now P be an orthogonal transformation reducing C to the diagonal form P-'CP
=
P'CP = D = diag(a,, ...; a,)
where every a,, > 0. Then
D-1 = P - l C - l ( P r ) - l
=
PrC-1(P-1)#= P ! c - ' ( P ' ) ' = p'C-'P,
Since D-' = diag(l/a,,..., l/a,), we have D-' > 0, and since it is the transform of C - ' by the orthogonal transformation P, likewise C - ' > 0.
92. On a Determinantal Relation There has occurred in the previous chapters a determinantal relation which we propose to derive in the present section. Let M be an (n + 1) x (n f 1) matrix and suppose that
M=(;
:j
where N is a nonsingular n x n matrix, p and 4 are n vectors and a is a scalar. The relation in question is IM( = INl(a - 4 ' W ' p ) .
(2.1)
To prove this relation multiply M (left-hand side) by diag(N-', 1). As a consequence diag(N-', 1). M = Let N-'p
132
=
r, so that the last matrix is
$3. ON
LIAPUNOV’S MATRIX EQUATION
and
From well-known rules on determinants it is found that rl and q1 appear in the expansion as the product -qlrl and nowhere else. By permuting rows and columns it is seen that similarly qh and i h appear in the product -qhrh and nowhere else. Hence IS1 = a - q‘r and from this to the desired relation (2.1) is but a step. The relation (2.1) may be generalized as follows. Let M=JQN
j:
where N is n x n, R is s x s and P , Q are n x s matrices, with N nonsingular. Then
IMI
=
JNI(IR- Q’N-IPI).
The proof is practically the same as for (2.1).
$3. On Liapunov’s Matrix Equation Take the n vector equation (3.1)
X =
AX
where A is a constant stable matrix. Let V(x)= x’Bx be a quadratic form. Its time derivative along the paths of (3.1) is (3.2)
v = av -. Ax = - W(X) = -X’CX ax
where we have the Liapunov relation
(3.3)
A’B
+ B A = -c.
This relation has frequently occurred in the previous chapters, the following property being repeatedly utilized : 133
9.
MISCELLANEOUS COMPLEMENTS
(3.4) If the matrix A is stable and C is any given matrix > 0 then (3.3) has a unique solution B and B > 0. The same property-holds for a matrix A with some complex characteristic roots, but still stable, save that (3.3) is replaced by
+ BA = -C
A*B
(3.5)
where B, C are now hermitian and >O. There is so little difference between the treatments of the two cases that it will suffice to deal with (3.4). As a matter of fact, the result has often been dealt with in the literature (see for example LaSalle and Lefschetz [ l ; 3, $171) but we believe that our present attack is particularly direct and simple. Observe first that a transformation of coordinates x = P y will replace (3.1) by
3 = Aoy,
A,
=
P-’AP
and x’Bx, x’Cx by y‘B,y, y’C,y, where B , = P’BP, Co = P’CP. Hence (3.3) yields PA’P‘-’P’BP
+ P’BPP-’AP
=
-P’CP
which is AO’BO
(3.6)
+ BOA0 = -Co.
Thus (3.3) is unaffected by a transformation of coordinates. Since this transformation may be inverted, it is sufficient to prove the asserted property for (3.6). Now one may choose the transformation of coordinates so that A , is triangular, that is, only with zeros above the main diagonal. Indeed the Jordan normal form is already of this form. The terms in the main diagonal are then the characteristic roots of A , (the same as those of A). Thus A,
=
diag(L, ,..., An)
+ D,
where D = (djk)is triangular with zeros in the main diagonal. Let bykk,cjq, be the terms of B,, C,. The equation (3.6) gives rise to a set of relations (3.7)
+
+
where djk is a linear form in the bhk such that h k > j k with coefficients in the drr Since every Re l j < 0, the system (3.7) can be solved step by step 134
$3.
ON LIAPUNOV’S MATRIX EQUATION
by an induction downward from j + k = 2n, and the solution is unique. Hence (3.6) has a unique solution B, and (3.3) has a unique solution B. If B is that solution (3.3) yields flA
+ A’#
= -C‘ = -C.
Hence, B’ is likewise a solution of (3.3)and so B’ = B. The treatment of (3.5)is the same with A’ replaced by A*. REMARK. There are two noteworthy special cases, repeatedly considered in the previous chapters as the standard example, when the solutions of (3.3) or (3.5) are especially simple. (a) The A,, are all real, Ah = - , u h < 0, and A = diag(-pl, ..., -p,,). (b) Some A,, are complex, Re Ah = - p h < 0 and A = diag(Al,...,A,,). In the case (a) the relation (3.1) yields at once h j
+ pklbjk
= cjk,
hence (3.8) In the case of (b) (3.2) yields hence now (3.9) We shall now prove: (3.10) T h e unique solution B of (3.3)just obtained is >O. Notice that with the earlier result this property is equivalent to this proposition : (3.1 1) Theorem. (Liapunov). Given the positive definite quadraticform W(x), the partial dgerential equation (3.2) for V ( x ) has a unique solution a s a positive definite quadratic form. At all events there is a solution V ( x ) = x‘Bx as a quadratic form. Suppose that V(x) is not positive definite. If it can take negative values, it is known from Liapunov’s instability theorem that the origin must be 135
9.
MISCELLANEOUS COMPLEMENTS
unstable for the system (3.1). However we know from the form of the solutions that the origin is asymptotically stable. Hence V(x)2 0. Suppose that V ( x o )= 0 for some x o # 0. Let y be the path issued from xo at time t = 0. Along y we have = - W ( x ) < 0, hence V ( x ) will become and remain < O along y. Since this contradicts the result already obtained V ( x ) 0 for x # 0, hence B 0. Thus both (3.10)and (3.11) are proved. We have actually all the elements for the proof of the following noteworthy proposition.
=-
=-
(3.12) Theorem. A necessary and suficient condition for the stability of the real matrix A is that there exist two real matrices B, C > 0 which satisfy the relation (3.3). Necessity has already been proved. To prove sufficiency let 1 be a characteristic root of A . There exists then a vector u # 0 such that Au = lu. Since A is real A’ = A*. Thus u*A* = lu*. Now from (3.6) follows u*(A*B + BA)u = 2(1 + X)U*BU< 0.
Since B > 0 this implies 2
+ 1= 2 Re 1 -= 0 and so A is stable.
$4. Liapunov and Stability As Liapunov’s theory has been referred to many times in the previous chapters, we shall give a rapid resume of it insofar as it applies to an autonomous system i= X ( x ) ,
(4.1)
X ( 0 ) = 0.
Let S R denote the spherical region llxll < R and H R the boundary sphere of the region. The system (4.1) is supposed to be of class C’ in a certain region S,. Liapunov defines the origin as: stable for (4.1) whenever given any 0 < E < A there corresponds to it a 0 < V ( E ) 5 E such that if x(t) is a solution whose initial position x o = x(0) lies in S, then x(t) lies in S, ever after ; Asymptotically stable whenever the origin is stable, and furthermore for some E every solution x(t) as above +O as t + + co ; Unstable whenever given any 0 < E < A and no matter what 0 < q < E there is always an x ( t ) as above reaching H,at some time t > 0. 136
@. LIAPUNOV AND STABILITY
The stability theorems of Liapunov given below rest upon this concept : A scalar function V ( x )is positive [negative] definite in the region S , whenever V ( x )is of class C' in S,, V(0)= 0, and V ( x )> O [
along a solution x(t) of (4.1) in S , is given by (4.3)
(4.4) Stability theorem. Whenever for some S , there exists a positive definite function V ( x )whose derivative V along the paths of (4.1) is 5 0 in S , then the origin is stable. (4.5) Asymptotic stability theorem. Whenever V is actually negative
definite in S , the origin is asymptotically stable. (4.6) Instability theorem. If there exists a positive definite function V(x)in S,, of class C' there, whose derivative V along the paths is likewise positive and of class C' outside the origin in S , then the origin is unstable. Moreover (not explicit in the proof but obvious) every H,, E < A, may be reached by any path initiated from a point of S , (origin excepted). The instability theorem, while not utilized in the previous chapters, will be required in a moment. For the proofs of the above theorems see LaSalle and Lefschetz [l, p. 571. (The instability theorem has been stated in less general form than loc. cit., but in a form sufficient for our present purpose.) (4.7) The Barbashin-Krassouskii complement to the asymptotic stability theorem [ l ] . In the latter theorem let: (a) SA = S , ; (b) V + co with IIxII. Then all solutions tend to the origin. A direct proof of this proposition offers no difficulty. However it is interesting to observe that (4.7) is a consequence of the instability theorem applied tv infinity. This requires a word of explanation. Let the space of x be closed at infinity by a single point. It becomes topologically an n sphere S" in which one may think of the point at infinity as the north pole N and the origin as the south pole S . The conditions of (4.7) applied to N state that upon defining W ( N )= 0, then the
137
9.
MISCELLANEOUS COMPLEMENTS
function W = 1/V is positive definite in any spherical region of center N with a time derivative W along the paths which is > 0 in any such region, N excepted. For W = - V / V z = - W 2 .V > 0 as stated. This is sufficient for the application of the instability theorem : N is unstable and every H,, whatever E, is reachable by any solution outside He. This is precisely the statement of (4.7). There are closely related investigations by Yoshizawa [l], and also, regarding the “closing at infinite by a point,” related contributions of Auslander and Seibert. See notably their paper [l]. In conclusion we recall the following noteworthy (4.8) Theorem of LaSalle. If the set of points where V < 1 is bounded for all finite positive 1 (obvious here) and the locus of points such that V = 0 contains no other solutions than the origin then the system under consideration for which V is positive dejinite and - V positive semidejinite is asymptotically stable in the large. And, in particular, ifthis holds for all admissible functions cp we have absolute stability.
138
Appendix A
AN APPLICATION OF MULTIPLE FEEDBACK CONTROL Throughout the present monograph it has been systematically assumed that the system matrix A had no characteristic roots with positive real parts. This is equivalent to assuming at the outset that the initial n vector system (1)
1 = AX
was stable. We shall say that the matrix A is irregular whenever it does possess characteristic roots with nonnegative real parts. This includes in particular the case when the system (1) is actually unstable: the precise situation when the action of a control seems most imperative! We shall now prove: Theorem. When tlie matrix A is irregular one may dejine a multiple feedback ..., or such that the matrix of tlie enlarged control with vnriables ol, 02, system ( x , a1,..., or)is stable.
Upon using, for example the reduction to the Jordan normal form, one may assume that A
=
diag(A,, Cl(A1),..-,Cr(5)) 139
APPENDIX A
where A , is stable and the blocks c h ( A h ) are irregular: the A h have nonnegative real parts. Let q h be the order of ch(&). Let also the coordinates be so designated that the system (1) assumes the form
A’ = A ] x ’ ,
(2)
h
j h = Ch(Ah)yh,
=
1, 2)...)r.
Our theorem will be proved if we can show that one may replace the yh system by an enlarged system with stable matrix. For convenience drop the index h and write the yh system as
3 = C(A)y
(3)
where y is a q vector. Consider the new system
b = q’y
+ ao,
f’= (0,O)...)0, Here ct and the g h are constants which are to be so determined that the matrix of (4) be stable. By referring to (IX, $2) and (VIII, $3), we find at once that the characteristic equation of (4)is (5)
(Z
- a)(z -
A)q
- g4(z
-
- ... - g1
=
0.
Take any ,u > 0. Our purpose will be fulfilled if we can select a and the g h , so that the above polynomial is (z - P ) ~ +’ . Upon setting z - A = u, A - a = 8,A - ,u = p , this reduces to selecting and the gk so that u4+ 1
+ pu4 - g p 4 -
1
- ... - g]
= (u + p ) q + l
Since this choice is always possible and even uniquely so, the theorem is proved. In the last analysis one will face the following situation: A system in the following variables : (1) the initial phase n vector x ; (2) the additional variables ol,c ~ , . . o ., ,, one for each block 140
ch(&);
AN APPLICATION OF MULTIPLE FEEDBACK CONTROL
(3) the standard old type feedback variable
0
with characteristic
q(a) such as studied for example in Chapters 11,111, and IV. The constant matrix A, corresponding to x and the CJ,, (order n
+ r)
is stable and we have a multiple feedback vector with components ( 0 1 , 0 2 9 * * * 9c r ,
6).
crare introduced we are back to the situations of the Once oI7CT~,..., text (a single feedback variable 0).
141
Appendix B
AN EXAMPLE FROM THE THEORY OF NUCLEAR POWER REACTORS (Kenneth Meyer) Our theory can be applied to problems outside of control theory. The following problem was studied by Smets [l]. Let the scalar q be the mean neutron density of a nuclear reactor. The neutron density q satisfies an equation tj = kq
where k, the reactivity, is a function of the state of the reactor. The reactivity k can be assumed to be a linear function of q and the temperatures yl,...,y, of various components of the reactor. For example y l , y 2 ,... may be the temperatures of the fuel, coolant etc. Specify k = k, c’y - p q where k , is a scalar constant and y’ = (jl,...,yn).If one assumes that the heat transfer arises from conduction one may apply Newton’s law of cooling to find that y satisfies the equation
+
j
=
A y - bq.
In the above equations A, b, c and p are as in the text. Let A be nonsingular. 142
AN EXAMPLE FROM THE THEORY OF NUCLEAR POWER REACTORS
Critical points. If y , q is a critical point for the above system of equations then A y - bq
=
0,
(ko
or y
=
{ko
A-'bq,
+ C'Y
- pq)q = 0
+ (c'A-'b
- p)q}q = 0.
Thus y , = 0, ql = 0 and y,
=
A-'b(p - c'A-'b)-'ko,
~2
= (p -
c'A-'b)-'ko
are the only critical points provided p - c'A-'b # 0. Since q 2 0, one must have q, 2 0. The point y , , q , corresponds to the reactor when shut down and y,, q, corresponds to the steady state operating point when the reactor is producing power. Clearly one would want the critical point y,, q, to be asymptotically stable for all y and all q > 0. Let q, # 0 and let us change the origin to the point y,, q, by the following change of coordinates : x = y - y, = y
8 = q - ~2
=
- A-'b(p
-
c'A-'b)-'ko
q - ( p - c'A-'b)-'kO.
Thus the kinetic equations are X
= AX - b8
e = k(8 + q,) k = C'X - p8
where 8 + q, = q > 0. Now o = log[(B + q,)/q,] is well defined since the argument of log is always positive and 8,o vanish together. Thus the above equations can be written X =
Ax - bq,(e" - 1)
r5 = c'x - pq,(e" - 1)
which is the standard form for indirect control with +(a) = q,(e" - 1). Hence all the theory developed in the text may be applied to this important physical problem. 143
BIBLIOGRAPHY
Andre, J., and Seibert, P. [ 1J ifber stiickweise lineare Differentialgleichungen die bei Regelungsproblemen auftreten. 1. Arch. Math. 7 , pp. 148-156 (1956). [2] Automatic and remote control. Proc. 1st Internat. Congr. Internat. Federation Automat. Control, MCJSCOW. U S S R , 1960 2. pp. 919-922 (1961). Butterworths, London. Aizerman. M. A. [ I ] On a problem on the stability of dynamical systems in the large. Uspehi M a t . Nauk 4 (1949).
Aizerman, M. A,, and Gantmacher, F. R. [ I ] Absolute stability of control systems. Russian (1963). English transl. Holden-Day, San Francisco, 1963. Auslander, J., and Seibert, P. [ 1J Prolongations and generalized Liapunov functions. In “International Symposium on Nonlinear Differential Equations and Nonlinear Mechanics” (J. F. LaSalle and S. Lefschetz, eds.), pp. 454462. Academic Press, New York, 1963. Barbashin, E. A., and Krassovskii, N. N. [ I ] On the existence of a Liapunov function in the case of asymptotic stability in the large. Prikl. M a t . Meh. 18, pp. 345-350 (1954). Bellman, R. [ I ] “Introduction to Matrix Analysis.” McGraw-Hill, New York, 1960. A very interesting and lively presentation of the subject. Contains many interesting problems. Bushaw, D. W. [ I ] Optimal discontinuous forcing terms (Princeton thesis, 1953). In “Contributions to the Theory of Nonlinear Oscillations” (S. Lefschetz, ed.), 4 (Ann. of Math. Studies No. 41). pp. 29-52. Princeton Univ. Press, Princeton, New Jersey, 1958.
144
BIBLIOGRAPHY Doetsch, G. [I] “Theorie und Anwendung der Laplace-Transformation.” Dover, New York, 1943. A standard work on Laplace transformations. Filippov, A. F. [I] Application of the theory of differential equations with discontinuous right hand sides to nonlinear problems in automatic control. Proc. / s t Internat. Congr. f n t u w / r . Federation Automat. Control, Moscow, U S S R , /Y60 1. pp. 1098-1 100 (1960). Butterworths, London. Gantmacher, F. R. [I] “The Theory of Matrices,” 2 vols. Chelsea, New York, 1959 (Russian, English transl.). Classical and very complete book on matrices, also underscoring relations with vectors and applications to differential equations. Kalman, R. E. [I] Canonical structure of linear dynamical systems. Proc. Nut. Acad. Sci. U.S.A. 48, pp. 596600 (1962). [2] Liapunov functions for the problem of Lurie in automatic controls. Pro(,. Nar. A w d . Sci. U.S.A. 49, pp. 201-205 (1963). A highly important and striking paper. Kalman, R. E.. Ho, Y. C.. and Narendra, K. S. [I] Controllability of linear dynamical systems. Contrih. Difl>runrial Eyutrtioru 1. pp. 189-213 (1963). A noteworthy paper. LaSalle. J. P. [I] Complete stability of a nonlinear control system. Proc. Nar. Acud. Sci. U.S.A. 48. pp. 600-603 (1962). Noteworthy contribution to our topic. [2] Stability and control. SIAM J . Control. 1, pp. 3-15 (1963). Excellent introduction and interesting applications. [3] The time optimal control problem. In “Contributions to the Nonlinear Theory of Oscillations” (S. Lefschetz, ed.). 5, (Ann. of Math. Studies No. 45). pp. 1-24, Princeton Univ. Press, Princeton, New Jersey, 1960. LaSalle, J. F., and Lefschetz. S. [I] “Stability by Liapunov’s Direct Method with Applications.” Academic Press, New York, 1961. Elementary introduction to Liapunov’s stability theory with many applications among others to control theory. Lefschetz, S. [I] “Differential Equations: Geometric Theory,” 2nd ed. Wiley (Interscience), New York, 1963. Contains all the general differential equation theory required for reading the present monograph. [2] Controls: an application of the direct method of Liapunov. Bol. Soc. Mat. Mexicuna, 5, pp. 139-143 (1960). [3] Some mathematical considerations of nonlinear automatic controls. Contrib. Diflerenrial Equations 1, pp. 1-28 (1963).
145
BIBLIOGRAPHY Letov, A . M. [I] “Stability of Nonlinear Controls,” 1st ed. Princeton Univ. Press, Princeton, New Jersey, 1961; 2nd ed., 1963 (Russian, English trans].). One of the early books on nonlinear control theory with many practical examples. Liapunov, A. M. [I] “Probleme general de la stabilite du mouvement” (Ann. of Math. Studies No. 17). Princeton Univ. Press, Princeton, New Jersey, 1947. A classical paper. Very rich and varied material. Required reading for anyone who wishes to study differential equations in depth. [2] “Complete Works,” I1 (Russian). Lurie, A. I. [I] “On Some Nonlinear Problems in the Theory of Automatic Control.” H. M. Stationery Office, London, 1951 (Russian, English trans].). The most important early paper on nonlinear control theory. Malkin, 1. G. [ I ] “Theory of Stability of Motion,” 1952 (Russian. English trans]. by Atomic Energy Commission, Washington, D.C.). Very rich material and well written; rather poor translation. Minorsky, N. [ I ] Directional stability of automatically steering bodies. J . SOC.of Naval Engrs. (May 1922). Morozan, T. [I] Remarques sur une note de Yacubovich. Compt. Rend. 254, pp. 1127-1129 (1962). Niemickii, V. V., and Stepanov, V. V. [I] “Qualitative Theory of Differential Equations” (Math. Ser. Princeton, No. 22). English version. Princeton Univ. Press, Princeton, New Jersey, 1960. Perhaps the best treatise on modern differential equations. Exceptionally rich content. Pliss, V. A. [ I ] Certain problems in the theory of stability of motion in the large. Izdatelstvo Leningradskooo Unio. (1958) (Russian). Popov, V. M. [ I ] Relaxing the sufficiency conditions for absolute stability. Auromar. i Telemeh. 19, pp. 1-7 (1958) (English transl.). [2] Absolute stability of nonlinear systems of automatic control. Automar. i Telemeh. 22, pp. 961-979 (1961). This outstanding and most original paper has marked a date in the development of our subject. Chapt. VII deals exclusively with it. [3] On a certain critical case in absolute stability. Automat. i Telemeh. 23, pp. 4-24 (1962). [4] Nouvedux criteriums de stabilite pour les systemes automatiques non-lineaires. Rev. Electrotech. Energet. Acad. R. P . Romine 5, pp. 73-88 (1960). Smets. H. B. [I] Stability in the large of heterogeneous power reactors. Bull. Acad. Roy. Belgique CI. Sci. 47. pp. 382405 (1961).
146
BIBLIOGRAPHY
Titchmarsh, E. C. [l] “Introduction to the Theory of Fourier Integrals.” Oxford Univ. Press (Clarendon), London and New York, 1937. A standard work on Fourier integrals. Yacubovich, V. A. [l] On the stability in the large of the undisturbed motion for the equations of indirect automatic control (Russian). Vestnik Leningrad. Uniu. p. 19 (1957). [2] On nonlinear differential equations of systems of automatic control with a single control element. Vestnik Leningrad. Uniu. pp. 120-153 (1960). [3] Solution of certain matrix inequalities occurring in the theory of automatic controls. Dokl. Akad. Nauk SSSR 143, pp. 1304-1307 (1962). A very important paper with many original results. [4] Absolute stability of nonlinear control systems in the critical cases. Automat. i Telemeh. 24(I),pp. 293-303 and (II), pp. 717-731 (1963). Yoshizawa, T. [I] Liapunov’s function and boundedness of solutions. Funkcial. Ekoac. 2, pp. 95-142 (1959). Zubov, V. I. [I] “The Methods of A. M. Liapunov and their Applications.” Leningrad, 1957.
147
INDEX
Aizerman’s problem, 43 absolute stability, 9, 19 asymptotic stability, 136
end point, 76 existence and uniqueness of solutions of differential equations, 3
Barbashin-Krassovskii complement to Liapunov’s asymptotic stability theorem, 137
feedback signal, 5 ( F , ) inequality for indirect control, 21 Fourier transform. 90 function @((a) = jg &(a) d“, 6
characteristic function Ha), 5 admissible, 6 assumptions on, 6 class C’, function of, 131 A scalar or vector function f ( x ) of the vector x is of class C’ in a region of the space of x whenever the partial derivatives df;laxj exist and are continuous in R. completely controllable pair (A, b), 107 observable pair (c’,A), 107 control, direct, indirect, 3 components of, 2 convolution of two functions, 93 critical point, 2 dynamical system, 1
hermitian form, 18 Expression x ’ H x where x is a vector and H is a matrix such that H = H*. instability, 136 Kalman’s inequality, 115 theorem, 124 LaSalle’s theorems, 28, 138 Liapunov function, 18 relation, 133 complex, 134 theorems on stability, 137
149
INDEX
matrix negative, 18 positive, 18 stable, 18 notations for, 17 Popov’s inequality, 90 theorems, 90 quadratic form, 18 Expression x’Fx where x is a vector and F is a symmetric matrix: F = F . quasi-exponential function, 92 stability, 136 absolute, 9, 19 asymptotic, 136
150
standard example, 21 starting point, 16 switching hyperplane, 76 line, 77, 84 system autonomous, 2 dynamical, 1 fundamental, 2 nonautonomous, 2 components of, 2 transfer function, 104 transition point, 76 Yacubovich’s lemma, 114